BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 044947
         (404 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|118481049|gb|ABK92478.1| unknown [Populus trichocarpa]
          Length = 456

 Score =  469 bits (1207), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 223/389 (57%), Positives = 289/389 (74%), Gaps = 6/389 (1%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           ITCD SH  YD+  ING  L DP TSTFF++G T+STP   +  +K  PYPRK+DK+A S
Sbjct: 70  ITCDCSHGYYDVWSINGPTLLDPITSTFFAIGATNSTPLDST--VKFHPYPRKTDKNAKS 127

Query: 74  KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
           KV ELT+T SAPP  SCG+ H+SPA+VFS GGY GNF+H+F D  +PL+ITINS   DQD
Sbjct: 128 KVNELTLT-SAPPKSSCGIRHSSPAIVFSTGGYTGNFYHQFNDGLLPLYITINSLTLDQD 186

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           VIL + + +D WA+KYA+LL +FTRHPI+N++NQT THCF S  +GL++HG +V++P LL
Sbjct: 187 VILVVTNWSDWWAKKYADLLHQFTRHPIVNMDNQTRTHCFPSAIVGLMTHGPLVVDPRLL 246

Query: 194 PKPKTLVDFQSFLANAYNENTN---TSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
           P+ KTL+DF + L N Y    N   TS      +P+LVLVNR   VGR ILNL+EV KA 
Sbjct: 247 PRNKTLLDFHALLQNTYGPRGNYLSTSGKSKGARPQLVLVNRKNGVGREILNLKEVLKAI 306

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
           +E+GF   +FEP+ + S+ +++R +H  HAM+ VHGA +TH LFLR G+V+ ++VPIGT 
Sbjct: 307 KEVGFKAIVFEPKRNASVRETYRLLHGSHAMLAVHGAAMTHLLFLRVGTVVGEIVPIGTD 366

Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLK 370
           W +  ++EKPARVLGLEY++YKI+  ESSL EKY  NDLVLKNP AF   N++   VY+K
Sbjct: 367 WPAKTFYEKPARVLGLEYMKYKIEVNESSLAEKYRVNDLVLKNPSAFVNGNYTKAMVYMK 426

Query: 371 TQNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
           TQNVKLDI RFR YLK+A+ KA++ MDKE
Sbjct: 427 TQNVKLDIVRFRAYLKEAFVKAQRFMDKE 455


>gi|224101421|ref|XP_002312272.1| predicted protein [Populus trichocarpa]
 gi|222852092|gb|EEE89639.1| predicted protein [Populus trichocarpa]
          Length = 424

 Score =  468 bits (1203), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 223/389 (57%), Positives = 289/389 (74%), Gaps = 6/389 (1%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           ITCD SH  YD+  ING  L DP TSTFF++G T+STP   +  +K  PYPRK+DK+A S
Sbjct: 38  ITCDCSHGYYDVWSINGPTLLDPITSTFFAIGATNSTPLDST--VKFHPYPRKTDKNAKS 95

Query: 74  KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
           KV ELT+T SAPP  SCG+ H+SPA+VFS GGY GNF+H+F D  +PL+ITINS   DQD
Sbjct: 96  KVNELTLT-SAPPKSSCGIRHSSPAIVFSTGGYTGNFYHQFNDGLLPLYITINSLTLDQD 154

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           VIL + + +D WA+KYA+LL +FTRHPI+N++NQT THCF S  +GL++HG +V++P LL
Sbjct: 155 VILVVTNWSDWWAKKYADLLHQFTRHPIVNMDNQTRTHCFPSAIVGLMTHGPLVVDPRLL 214

Query: 194 PKPKTLVDFQSFLANAYNENTN---TSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
           P+ KTL+DF + L N Y    N   TS      +P+LVLVNR   VGR ILNL+EV KA 
Sbjct: 215 PRNKTLLDFHALLQNTYGPRGNYLSTSGKSKGARPQLVLVNRKNGVGREILNLKEVLKAI 274

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
           +E+GF   +FEP+ + S+ +++R +H  HAM+ VHGA +TH LFLR G+V+ ++VPIGT 
Sbjct: 275 KEVGFKAIVFEPKRNASVRETYRLLHGSHAMLAVHGAAMTHLLFLRVGTVVGEIVPIGTD 334

Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLK 370
           W +  ++EKPARVLGLEY++YKI+  ESSL EKY  NDLVLKNP AF   N++   VY+K
Sbjct: 335 WPAKTFYEKPARVLGLEYMKYKIEVNESSLAEKYRVNDLVLKNPSAFVNGNYTKAMVYMK 394

Query: 371 TQNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
           TQNVKLDI RFR YLK+A+ KA++ MDKE
Sbjct: 395 TQNVKLDIVRFRAYLKEAFVKAQRFMDKE 423


>gi|224108924|ref|XP_002315017.1| predicted protein [Populus trichocarpa]
 gi|222864057|gb|EEF01188.1| predicted protein [Populus trichocarpa]
          Length = 385

 Score =  460 bits (1184), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 221/388 (56%), Positives = 285/388 (73%), Gaps = 6/388 (1%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           ITCD SH  YDL  ING  L DP TSTFF+ G T ST   P   +K +PYPRK+D+ A S
Sbjct: 1   ITCDCSHNDYDLWFINGPTLLDPSTSTFFTTGPTIST--SPDFAVKFRPYPRKTDERARS 58

Query: 74  KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
           KV ELT+T SAPP  SCG+TH+SPA+VFS GGY GNF+H+F D  + L+ITINS   ++D
Sbjct: 59  KVNELTLT-SAPPRSSCGITHSSPAIVFSTGGYTGNFYHQFNDGLLALYITINSLSLNRD 117

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           VIL + + +D WA+KYA+LL RFT+HPIIN++NQT THCF S  +GL++HG + ++PTL 
Sbjct: 118 VILTVTNWSDWWAQKYADLLHRFTKHPIINMDNQTRTHCFPSAIVGLMTHGPLAVDPTLT 177

Query: 194 PKPKTLVDFQSFLANAYNENTNTSSSFHH--TKPKLVLVNRNARVGRTILNLREVKKAAE 251
            + KTL+DF + L + Y+      S+      +P+LVLVNR   VGR ILNL+E  KA E
Sbjct: 178 -QHKTLLDFHALLESTYSPRGKHVSTLKSKGARPQLVLVNRKNGVGREILNLKEALKAIE 236

Query: 252 ELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQW 311
           E+GF   +FEP+ + ++ D++R +H  HAM+ VHGA +TH LFLR G VL ++VPIGT W
Sbjct: 237 EVGFKAIVFEPKRNGTVGDTYRLLHGSHAMLAVHGAAMTHLLFLRVGMVLGEIVPIGTDW 296

Query: 312 LSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKT 371
           L+  ++EKPARVLGLEY++YKI+  ESSL EKYGANDLVLKNPQAF   +W   +VY+KT
Sbjct: 297 LAKTFYEKPARVLGLEYMKYKIEVNESSLAEKYGANDLVLKNPQAFVNGDWPKAKVYMKT 356

Query: 372 QNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
           QNVKLD+ RFR YLK+ + KAK+ MDKE
Sbjct: 357 QNVKLDMVRFRKYLKEVFVKAKRFMDKE 384


>gi|356502586|ref|XP_003520099.1| PREDICTED: uncharacterized glycosyltransferase AER61-like [Glycine
           max]
          Length = 406

 Score =  426 bits (1096), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 212/389 (54%), Positives = 274/389 (70%), Gaps = 8/389 (2%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVG-HTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           ITCD SH  YD+C INGS L D  +ST F++G HT     +P +  K  PY  K DK+AM
Sbjct: 20  ITCDYSHNGYDICTINGSTLLDQASSTLFALGPHTQQ--DKPHIPFKILPYTLKGDKTAM 77

Query: 73  SKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
           S VKE+T+T  APP LSCGVTH +PALVFS GGY GNF+HE  + F+PLFITINS FP+Q
Sbjct: 78  SNVKEVTLTL-APPKLSCGVTHHTPALVFSVGGYTGNFYHEINENFIPLFITINSLFPNQ 136

Query: 133 DVILAIADCNDQWARKYAELLPRFT-RHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
           +VIL + +    W +KYAELL  F+  H IIN NN +  HCF S T+GLI HG M I+P 
Sbjct: 137 NVILVVLEGKSWWFKKYAELLSAFSPNHMIINTNNISTVHCFPSATIGLIKHGDMTIDPK 196

Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSSSF--HHTKPKLVLVNRNARVGRTILNLREVKKA 249
           LLP PKTL+DF++FL   Y ++ +T   +   + KP+L L++R   V R +LN  +V K 
Sbjct: 197 LLPNPKTLLDFRAFLDKVYTKDDDTPFVYPNENGKPRLTLISRRGNVSRLLLNENDVIKV 256

Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
           AEE+GF+V +FEP+ +T +A  +R IH+   ++GVHGAGLT+ LFLRPGSVL+QVVPI  
Sbjct: 257 AEEIGFNVHVFEPK-NTPMAKVYRLIHASDVLLGVHGAGLTNFLFLRPGSVLVQVVPIEL 315

Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYL 369
            W S  Y+EKP + LG++Y+EYKI+  ESSL+E++GAN LV K+P AF   NWS  RVYL
Sbjct: 316 YWASRTYYEKPPKFLGVDYIEYKIEPNESSLLERFGANSLVFKDPPAFHKGNWSKQRVYL 375

Query: 370 KTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
           K QNVK+++ RFR YL  AY+KAK  + K
Sbjct: 376 KEQNVKINVVRFRKYLTKAYEKAKIFISK 404


>gi|224108926|ref|XP_002315018.1| predicted protein [Populus trichocarpa]
 gi|222864058|gb|EEF01189.1| predicted protein [Populus trichocarpa]
          Length = 386

 Score =  424 bits (1090), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 214/391 (54%), Positives = 264/391 (67%), Gaps = 11/391 (2%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           I CDRSHR YDLC ING  + DP  ST +    T+ST  +     K +PYPRK +K  M+
Sbjct: 1   INCDRSHRFYDLCTINGPTVLDPVNSTLYLSVPTNSTTVE-----KIRPYPRKWEKPIMA 55

Query: 74  KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD-Q 132
            ++E T+ +++   L C   H  PA+VFSA GY GNFFH+F D F+PLFIT+NS FPD Q
Sbjct: 56  GIQEFTLISNSKSPL-CQAQHKLPAIVFSAAGYTGNFFHDFNDGFIPLFITVNSVFPDNQ 114

Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
           D IL I+   + W  KY +LL  +++HP+I   N+T THCF S TLGLISHG M I+P L
Sbjct: 115 DFILVISQAQNWWISKYGDLLHTYSKHPVIIPENETSTHCFPSATLGLISHGFMTIDPKL 174

Query: 193 LPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPK--LVLVNRNARVGRTILNLREVKKAA 250
           +P  + L  F +FL  AYN   N     +  KP+  LVL  RN  VGR I N  EVK  A
Sbjct: 175 MPNSQALTHFHAFLDKAYNHGQNHPWKSNPPKPRARLVLATRNGGVGRVISNQNEVKHLA 234

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
           EE+GFDV IFEP   T L  ++  I+S HAMVGVHGAGLTHSLFLRPG V MQVVPIG  
Sbjct: 235 EEIGFDVIIFEPIPQTPLQQAYALINSSHAMVGVHGAGLTHSLFLRPGVVFMQVVPIGAD 294

Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN--MRVY 368
           WL+ V F   AR +GLEYLEY+I  EESSL++KYG N L++K+P  F G NWS+  M +Y
Sbjct: 295 WLAEVCFANSARAMGLEYLEYRIGAEESSLIDKYGKNSLLIKDPATFRGQNWSSAIMDIY 354

Query: 369 LKTQNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
           LK QNVK+D+ RFR YLK+AYKKAK+ M+KE
Sbjct: 355 LKEQNVKIDLIRFREYLKEAYKKAKEFMEKE 385


>gi|255542124|ref|XP_002512126.1| glycosyltransferase, putative [Ricinus communis]
 gi|223549306|gb|EEF50795.1| glycosyltransferase, putative [Ricinus communis]
          Length = 394

 Score =  413 bits (1062), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 204/390 (52%), Positives = 267/390 (68%), Gaps = 20/390 (5%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           I+CDR+   YDLC ING  + DP TS FF V  T+S P  P L    +PYPRK +   M+
Sbjct: 20  ISCDRTQLRYDLCWINGQTVLDPTTSAFFVVRSTNSAP--PYLVETIKPYPRKFEAFIMA 77

Query: 74  KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
           ++KELTIT S P   SC V HT+PALVFSAGGY GNFFH+F D F+PL+IT+N+ +PDQD
Sbjct: 78  QIKELTIT-SGPFAPSCQVQHTAPALVFSAGGYTGNFFHDFNDGFIPLYITVNTIYPDQD 136

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
            ++ +++  D W  KY +LL  FT HPI+ +N+ T THCF SVT GLISHG M +N  L+
Sbjct: 137 FVMVVSEAPDWWISKYVDLLSAFTAHPIVTLND-TSTHCFPSVTFGLISHGFMTMNQRLM 195

Query: 194 PKPKTLVDFQSFLANAYNE----NTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA 249
           P  KT+  F+  L  AY++    N N + S   ++P+L++ +RN   GR ILN  E+ + 
Sbjct: 196 PNSKTITQFRGLLDKAYSQSLTSNVNNNLSAPKSRPRLIIASRNGSAGRVILNQDELIEM 255

Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
           ++ELGFDV IFEP+ +TSL +S+  ++S HAM+GVHGA LTHSLFLRPGSVL+QVVPIG 
Sbjct: 256 SKELGFDVIIFEPKANTSLQESYVLVNSSHAMIGVHGAALTHSLFLRPGSVLVQVVPIGL 315

Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYL 369
           +W S  +F +  R L LEY+EYKI  EESSLV  YG++ L+L +P              +
Sbjct: 316 EWASDAFFGRVGRGLKLEYVEYKIGVEESSLVGTYGSDSLLLTDPHG------------I 363

Query: 370 KTQNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
           + QNVKLD+ RFR YLK AYKKAK+ MD+E
Sbjct: 364 QEQNVKLDMKRFREYLKQAYKKAKQFMDRE 393


>gi|186499036|ref|NP_001118256.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
 gi|330250598|gb|AEC05692.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
          Length = 455

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 199/402 (49%), Positives = 277/402 (68%), Gaps = 12/402 (2%)

Query: 2   KLRPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQ 61
           K  P  ++ +  ITCDRSH SYDLC INGS + +PKT T   +  T +T S P +  K +
Sbjct: 57  KQNPKVMAASAKITCDRSHTSYDLCSINGSCILNPKTGTLTLMDRTLTT-SAPLVE-KIR 114

Query: 62  PYPRKSDKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
           PYPRKS+   M +++EL +T S P +L  SC +TH SPA+VFSAGGY G+ +H+F+D F+
Sbjct: 115 PYPRKSENWIMPRIRELKLT-SGPSDLTRSCDITHDSPAIVFSAGGYTGSIYHDFIDGFI 173

Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ--NQTITHCFQSVT 177
           PLFIT NS +PD+D IL + +  + W  KY ++L  F++H  I ++  N +ITHCF S T
Sbjct: 174 PLFITANSVYPDRDFILVVVNPKEWWMPKYIDILGTFSKHKTILLDKENASITHCFTSAT 233

Query: 178 LGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVG 237
           +GLISHG M I+PT +P  K+LVDF + L  A N N    S     KP+L+LV R   +G
Sbjct: 234 VGLISHGPMTIDPTQIPNSKSLVDFHNLLDKALNPNL---SIIKINKPRLILVRRYGNIG 290

Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
           R ILN  E+++  E++GF+V  F P ++TSL ++++ I S H M+GVHGA LT  LFLRP
Sbjct: 291 RVILNEEEIREMLEDVGFEVITFRPSKTTSLREAYKLIKSSHGMIGVHGAALTQLLFLRP 350

Query: 298 GSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
           GSVL+Q+VP+G  W+S   FE PA+ + L+Y EY++  EESSL+EKY  +DLVLK+P A+
Sbjct: 351 GSVLVQIVPVGLGWVSKTCFETPAKAMKLDYTEYRVNVEESSLIEKYSRDDLVLKDPIAY 410

Query: 358 AGANW--SNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMD 397
            G +W  + M+VYLK Q+V+LD++RFR ++ +AYKKAK  MD
Sbjct: 411 RGMDWNVTKMKVYLKDQDVRLDVNRFRKHMNEAYKKAKSFMD 452


>gi|186499027|ref|NP_178435.2| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
 gi|330250597|gb|AEC05691.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
          Length = 451

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 199/402 (49%), Positives = 277/402 (68%), Gaps = 12/402 (2%)

Query: 2   KLRPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQ 61
           K  P  ++ +  ITCDRSH SYDLC INGS + +PKT T   +  T +T S P +  K +
Sbjct: 53  KQNPKVMAASAKITCDRSHTSYDLCSINGSCILNPKTGTLTLMDRTLTT-SAPLVE-KIR 110

Query: 62  PYPRKSDKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
           PYPRKS+   M +++EL +T S P +L  SC +TH SPA+VFSAGGY G+ +H+F+D F+
Sbjct: 111 PYPRKSENWIMPRIRELKLT-SGPSDLTRSCDITHDSPAIVFSAGGYTGSIYHDFIDGFI 169

Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ--NQTITHCFQSVT 177
           PLFIT NS +PD+D IL + +  + W  KY ++L  F++H  I ++  N +ITHCF S T
Sbjct: 170 PLFITANSVYPDRDFILVVVNPKEWWMPKYIDILGTFSKHKTILLDKENASITHCFTSAT 229

Query: 178 LGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVG 237
           +GLISHG M I+PT +P  K+LVDF + L  A N N    S     KP+L+LV R   +G
Sbjct: 230 VGLISHGPMTIDPTQIPNSKSLVDFHNLLDKALNPNL---SIIKINKPRLILVRRYGNIG 286

Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
           R ILN  E+++  E++GF+V  F P ++TSL ++++ I S H M+GVHGA LT  LFLRP
Sbjct: 287 RVILNEEEIREMLEDVGFEVITFRPSKTTSLREAYKLIKSSHGMIGVHGAALTQLLFLRP 346

Query: 298 GSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
           GSVL+Q+VP+G  W+S   FE PA+ + L+Y EY++  EESSL+EKY  +DLVLK+P A+
Sbjct: 347 GSVLVQIVPVGLGWVSKTCFETPAKAMKLDYTEYRVNVEESSLIEKYSRDDLVLKDPIAY 406

Query: 358 AGANW--SNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMD 397
            G +W  + M+VYLK Q+V+LD++RFR ++ +AYKKAK  MD
Sbjct: 407 RGMDWNVTKMKVYLKDQDVRLDVNRFRKHMNEAYKKAKSFMD 448


>gi|4335762|gb|AAD17439.1| unknown protein [Arabidopsis thaliana]
          Length = 393

 Score =  406 bits (1044), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 197/396 (49%), Positives = 275/396 (69%), Gaps = 12/396 (3%)

Query: 8   LSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKS 67
           ++ +  ITCDRSH SYDLC INGS + +PKT T   +  T +T S P +  K +PYPRKS
Sbjct: 1   MAASAKITCDRSHTSYDLCSINGSCILNPKTGTLTLMDRTLTT-SAPLVE-KIRPYPRKS 58

Query: 68  DKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
           +   M +++EL +T S P +L  SC +TH SPA+VFSAGGY G+ +H+F+D F+PLFIT 
Sbjct: 59  ENWIMPRIRELKLT-SGPSDLTRSCDITHDSPAIVFSAGGYTGSIYHDFIDGFIPLFITA 117

Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ--NQTITHCFQSVTLGLISH 183
           NS +PD+D IL + +  + W  KY ++L  F++H  I ++  N +ITHCF S T+GLISH
Sbjct: 118 NSVYPDRDFILVVVNPKEWWMPKYIDILGTFSKHKTILLDKENASITHCFTSATVGLISH 177

Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNL 243
           G M I+PT +P  K+LVDF + L  A N N    S     KP+L+LV R   +GR ILN 
Sbjct: 178 GPMTIDPTQIPNSKSLVDFHNLLDKALNPNL---SIIKINKPRLILVRRYGNIGRVILNE 234

Query: 244 REVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
            E+++  E++GF+V  F P ++TSL ++++ I S H M+GVHGA LT  LFLRPGSVL+Q
Sbjct: 235 EEIREMLEDVGFEVITFRPSKTTSLREAYKLIKSSHGMIGVHGAALTQLLFLRPGSVLVQ 294

Query: 304 VVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW- 362
           +VP+G  W+S   FE PA+ + L+Y EY++  EESSL+EKY  +DLVLK+P A+ G +W 
Sbjct: 295 IVPVGLGWVSKTCFETPAKAMKLDYTEYRVNVEESSLIEKYSRDDLVLKDPIAYRGMDWN 354

Query: 363 -SNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMD 397
            + M+VYLK Q+V+LD++RFR ++ +AYKKAK  MD
Sbjct: 355 VTKMKVYLKDQDVRLDVNRFRKHMNEAYKKAKSFMD 390


>gi|357487363|ref|XP_003613969.1| Glycosyltransferase, putative [Medicago truncatula]
 gi|355515304|gb|AES96927.1| Glycosyltransferase, putative [Medicago truncatula]
          Length = 451

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 196/392 (50%), Positives = 267/392 (68%), Gaps = 4/392 (1%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           ITCDRS++ +D+C ++   L DP + T F++     T  QP + +K QPYP KSDK+AMS
Sbjct: 61  ITCDRSNQRFDICSMSNPTLLDPTSLTLFTLN--PHTRIQPYINVKIQPYPLKSDKNAMS 118

Query: 74  KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            V+E+T+T++ P +  CGVTH  PALVF+A GYNGNF+HE  D F+PLFITINS F DQD
Sbjct: 119 SVREVTLTSAPPKSSQCGVTHNIPALVFNARGYNGNFYHEINDIFIPLFITINSLFHDQD 178

Query: 134 VILAIADCNDQWARKYAELLPRFT-RHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
           VIL I D    W +KY +LL  F+  H IIN NN T  HCF S  +GLI HG + INP L
Sbjct: 179 VILVIVDGMTWWYQKYVDLLNAFSPNHKIINTNNLTTAHCFPSAVVGLIKHGPVTINPKL 238

Query: 193 LPKPKTLVDFQSFLANAY-NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAE 251
           LP PKTL+DF +FL NAY  E+T      +++KP L LV+R     R ILN  EV K A+
Sbjct: 239 LPNPKTLLDFSTFLKNAYIKEDTPLLFPSNNSKPLLTLVDRKGSSSRVILNQEEVVKLAK 298

Query: 252 ELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQW 311
           E+GF+V + +  + +++A+ +R +HS H ++GVHGAGLT+  FLR GSV++QVVPIG +W
Sbjct: 299 EVGFNVHVLDHSKDSTMANVYRLVHSSHVLLGVHGAGLTNLFFLRQGSVVVQVVPIGLEW 358

Query: 312 LSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKT 371
            S   ++ P+  LGLEY+EYK++  ESSL   YG + L++K+P+A+    W    VYLK 
Sbjct: 359 ASDTCYKNPSPFLGLEYVEYKVEANESSLSWDYGVDSLMVKDPKAYTEGKWEKSIVYLKN 418

Query: 372 QNVKLDIDRFRIYLKDAYKKAKKLMDKEIKLA 403
           QNVK+D+ RF+ +L  AY+KAK  M+   ++A
Sbjct: 419 QNVKIDLVRFKKWLMKAYEKAKMFMNSTSQVA 450


>gi|359473475|ref|XP_002266097.2| PREDICTED: uncharacterized glycosyltransferase AER61-like [Vitis
           vinifera]
          Length = 389

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 197/394 (50%), Positives = 264/394 (67%), Gaps = 12/394 (3%)

Query: 11  AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
           AK I+CDRSH+ YD+C +N + + +P TSTFF V      P+Q +L  K +PYPRK + S
Sbjct: 2   AKQISCDRSHQRYDMCTVNATTVLEPTTSTFFLV-----EPTQ-ALMEKVRPYPRKWETS 55

Query: 71  AMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
            M ++KE+ +T S PP+ SC V H++PALVFSAGGY GN FH+F D F+PL+IT+NS FP
Sbjct: 56  VMGRIKEVRLT-SGPPSPSCQVHHSAPALVFSAGGYTGNVFHDFNDGFIPLYITVNSIFP 114

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
           D + +L I      W  KYA+LL   ++HPI+N+     THCF    +GLISHG M I+P
Sbjct: 115 DGNYVLVIHSSRRWWESKYADLLHTLSKHPIVNLEKANATHCFPYAHVGLISHGFMTIDP 174

Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVLVNRNARVGRTILNLREVK 247
           TL+P    L  F+ FL  AY +N    S   S    +P+LVLV R+   GR ILN  ++ 
Sbjct: 175 TLMPSSINLTHFRGFLDAAYAQNHPFPSPNISKQKARPRLVLVTRSGGAGRHILNQGDLN 234

Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
            AAEE+GF V +F P  +TSL +++  I+S HAM+GVHGA LTHSLFLRPGSVLMQVVP+
Sbjct: 235 NAAEEVGFHVILFHPTPTTSLREAYALINSSHAMMGVHGAALTHSLFLRPGSVLMQVVPL 294

Query: 308 GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN--M 365
           G  W +   F   +R LGLEY+EYKI ++ES+L +KYG +D+++K+P    G  WS   M
Sbjct: 295 GLAWAAETCFGNSSRELGLEYMEYKIGEKESTLADKYGNDDIMVKDPVRAQGKGWSTKIM 354

Query: 366 RVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
            VYLK QN+ LD+ RFR +L++AY KA K +  +
Sbjct: 355 DVYLKEQNITLDLVRFRRHLEEAYNKASKFLQNK 388


>gi|147781784|emb|CAN72290.1| hypothetical protein VITISV_007350 [Vitis vinifera]
          Length = 515

 Score =  396 bits (1018), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 197/387 (50%), Positives = 261/387 (67%), Gaps = 12/387 (3%)

Query: 11  AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
           AK I+CDRS + YD C +N + + +P T TFF V      P+Q +L  K +PYPRK + S
Sbjct: 38  AKQISCDRSXQRYDXCTVNATTVLEPTTXTFFLV-----EPTQ-ALVEKVRPYPRKWETS 91

Query: 71  AMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
            M ++KE+++T S PP+ SC V H++PALVFSAGGY GN FH+F D F+PLFIT+NS FP
Sbjct: 92  VMGRIKEVSLT-SGPPSPSCQVHHSAPALVFSAGGYTGNVFHDFNDGFIPLFITVNSIFP 150

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
           D + +L I +C   W  KYA+LL   ++HPI+N+     THCF    +GLISHG M I+P
Sbjct: 151 DGNYVLVIHNCRRWWESKYADLLHTLSKHPIVNLEKANATHCFPYAHVGLISHGFMTIDP 210

Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVLVNRNARVGRTILNLREVK 247
           TL+P    L  F+ FL  AY +N    S   S    +P+LVLV R+   GR ILN  ++ 
Sbjct: 211 TLMPSSINLTHFRGFLDAAYAQNHPFPSPNISKQKARPRLVLVTRSGGAGRHILNQGDLH 270

Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
            AAEE+GF V +F P  +TSL +++  I+S HAM+GVHGA LTHSLFLRPGSVLMQVVP+
Sbjct: 271 NAAEEVGFHVILFHPTPTTSLREAYALINSSHAMMGVHGAALTHSLFLRPGSVLMQVVPL 330

Query: 308 GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN--M 365
           G  W +   F   +R LGLEY+EYKI ++ESSL +KYG +D+++K+P    G  WS   M
Sbjct: 331 GLAWAAETCFGNSSRELGLEYMEYKIGEKESSLADKYGNDDIMVKDPVRAQGKGWSTKIM 390

Query: 366 RVYLKTQNVKLDIDRFRIYLKDAYKKA 392
            VYLK QN+ LD+ RFR +L++AY KA
Sbjct: 391 DVYLKEQNITLDLVRFRRHLEEAYNKA 417


>gi|79551059|ref|NP_178436.2| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
 gi|49660111|gb|AAT68346.1| hypothetical protein At2g03370 [Arabidopsis thaliana]
 gi|60547697|gb|AAX23812.1| hypothetical protein At2g03370 [Arabidopsis thaliana]
 gi|330250599|gb|AEC05693.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
          Length = 452

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 192/394 (48%), Positives = 270/394 (68%), Gaps = 16/394 (4%)

Query: 11  AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
           + +ITCDRSH +YDLC INGS   D KT T      T   P+   L  K +PYP+K+D  
Sbjct: 65  SATITCDRSHTNYDLCSINGSCNLDLKTGTL-----TLMDPTSAPLVEKIRPYPKKADNW 119

Query: 71  AMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
            M +++ELT+T S P  L  SC +TH  PA+VFSAGGY G+ +H+ MD F+PLFIT NS 
Sbjct: 120 IMPRIRELTLT-SGPLGLPRSCDITHDLPAIVFSAGGYTGSIYHDLMDGFIPLFITANSV 178

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRH-PIININNQTI--THCFQSVTLGLISHGR 185
           +PD+D I  + +  + W  KY ++L  F++H PI+ ++ +++  THCF S  +GLI+H  
Sbjct: 179 YPDRDFIPVVVNAKEWWMPKYIDILGTFSKHKPILLLDKESVATTHCFTSAIVGLITHWP 238

Query: 186 MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE 245
           M I+PT +P  K+LVDF + L  A+  N +T  +    KP+L+LV+R   +GR ILN +E
Sbjct: 239 MTIDPTQIPNSKSLVDFHNLLEKAFTTNISTPKTH---KPRLMLVSRYGNIGRVILNEQE 295

Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           +K+  E++GF+V IF P ++T+L ++++ I S H MVGVHGA LTH LFLRPGS+ +QVV
Sbjct: 296 IKEMLEDVGFEVIIFRPSKTTNLKEAYKLIKSSHGMVGVHGAALTHLLFLRPGSIFVQVV 355

Query: 306 PIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW--S 363
           P+G  W S   +E PA+ + LEYLEYK+  EESSL+EKY  +DLVLK+P A+ G +W  +
Sbjct: 356 PLGLGWASKPCYESPAKTMKLEYLEYKVNVEESSLIEKYNRDDLVLKDPIAYRGMDWNAT 415

Query: 364 NMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMD 397
            M+VYLK Q+V LD++RFR ++ +AYKKAK  MD
Sbjct: 416 KMKVYLKEQDVSLDVNRFRKHMNEAYKKAKIFMD 449


>gi|4335761|gb|AAD17438.1| unknown protein [Arabidopsis thaliana]
          Length = 458

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 192/394 (48%), Positives = 270/394 (68%), Gaps = 16/394 (4%)

Query: 11  AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
           + +ITCDRSH +YDLC INGS   D KT T      T   P+   L  K +PYP+K+D  
Sbjct: 71  SATITCDRSHTNYDLCSINGSCNLDLKTGTL-----TLMDPTSAPLVEKIRPYPKKADNW 125

Query: 71  AMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
            M +++ELT+T S P  L  SC +TH  PA+VFSAGGY G+ +H+ MD F+PLFIT NS 
Sbjct: 126 IMPRIRELTLT-SGPLGLPRSCDITHDLPAIVFSAGGYTGSIYHDLMDGFIPLFITANSV 184

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRH-PIININNQTI--THCFQSVTLGLISHGR 185
           +PD+D I  + +  + W  KY ++L  F++H PI+ ++ +++  THCF S  +GLI+H  
Sbjct: 185 YPDRDFIPVVVNAKEWWMPKYIDILGTFSKHKPILLLDKESVATTHCFTSAIVGLITHWP 244

Query: 186 MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE 245
           M I+PT +P  K+LVDF + L  A+  N +T  +    KP+L+LV+R   +GR ILN +E
Sbjct: 245 MTIDPTQIPNSKSLVDFHNLLEKAFTTNISTPKTH---KPRLMLVSRYGNIGRVILNEQE 301

Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           +K+  E++GF+V IF P ++T+L ++++ I S H MVGVHGA LTH LFLRPGS+ +QVV
Sbjct: 302 IKEMLEDVGFEVIIFRPSKTTNLKEAYKLIKSSHGMVGVHGAALTHLLFLRPGSIFVQVV 361

Query: 306 PIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW--S 363
           P+G  W S   +E PA+ + LEYLEYK+  EESSL+EKY  +DLVLK+P A+ G +W  +
Sbjct: 362 PLGLGWASKPCYESPAKTMKLEYLEYKVNVEESSLIEKYNRDDLVLKDPIAYRGMDWNAT 421

Query: 364 NMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMD 397
            M+VYLK Q+V LD++RFR ++ +AYKKAK  MD
Sbjct: 422 KMKVYLKEQDVSLDVNRFRKHMNEAYKKAKIFMD 455


>gi|357487365|ref|XP_003613970.1| Glycosyltransferase, putative [Medicago truncatula]
 gi|355515305|gb|AES96928.1| Glycosyltransferase, putative [Medicago truncatula]
          Length = 543

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 197/384 (51%), Positives = 256/384 (66%), Gaps = 9/384 (2%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           ITCDRS++ +DLC +N   L DP + T F++G    T  Q  + +K +P+P K+D +AMS
Sbjct: 66  ITCDRSNKDFDLCTMNSPTLLDPTSLTLFALG--PHTRIQHHIHMKIKPFPLKNDTNAMS 123

Query: 74  KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN---SHFP 130
            + ELT+T++   +  CGVTH SPALVFS GGY GNF+H+  + F+PLFITIN   SH  
Sbjct: 124 PISELTLTSAPLKSSQCGVTHHSPALVFSVGGYTGNFYHDMNEIFIPLFITINYSLSH-- 181

Query: 131 DQDVILAIADCNDQWARKYAELLPRFT-RHPIININNQTITHCFQSVTLGLISHGRMVIN 189
           DQDVIL I D    W  KY +LL  F+  H IIN NN T THCF S  +GLI HG+M+I+
Sbjct: 182 DQDVILVIIDVKPWWFEKYVDLLSAFSPNHKIINTNNLTTTHCFPSAIVGLIKHGQMIID 241

Query: 190 PTLLPKPKTLVDFQSFLANAY-NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKK 248
           P LLP PKTL+DF SFL  AY  E+          KP L LV+R     R ILN  EV K
Sbjct: 242 PKLLPNPKTLLDFHSFLKRAYVKEDIPFVYLNSKGKPILTLVSRKGSSSRDILNEEEVIK 301

Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
            AE++GF+V + +P    S+AD+F+ IHS H ++GVHGAGLT+ LFLR GSV +QVVPIG
Sbjct: 302 LAEDVGFNVRVLKPSRDFSVADAFKLIHSSHVLLGVHGAGLTNLLFLRQGSVSVQVVPIG 361

Query: 309 TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVY 368
            +W S  Y+ KP ++LGLEY+EYK++  ESSL  +YGA+ LV+K+P+A+    W    VY
Sbjct: 362 LEWASETYYNKPTKILGLEYVEYKVEANESSLSWEYGADSLVIKDPKAYTEGKWDKQLVY 421

Query: 369 LKTQNVKLDIDRFRIYLKDAYKKA 392
           LK QNVK+D+ RFR  L   Y+K 
Sbjct: 422 LKKQNVKIDLIRFRNCLTKVYEKV 445


>gi|297817994|ref|XP_002876880.1| serine carboxypeptidase [Arabidopsis lyrata subsp. lyrata]
 gi|297322718|gb|EFH53139.1| serine carboxypeptidase [Arabidopsis lyrata subsp. lyrata]
          Length = 428

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 187/393 (47%), Positives = 265/393 (67%), Gaps = 36/393 (9%)

Query: 11  AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
           + +ITCDRSH  YDLC INGS + DPKT TF  +  T +T    SL  K +PYPRK++  
Sbjct: 63  SATITCDRSHSDYDLCSINGSCILDPKTGTFTLMDPTFAT----SLVEKIRPYPRKAENW 118

Query: 71  AMSKVKELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
            M ++KELT++ S P  L+  C +TH SPA+VFSAGGY G+ +H+F+D F+PLFIT NS 
Sbjct: 119 TMPRIKELTLS-SGPLGLTRACDITHDSPAIVFSAGGYTGSIYHDFLDGFIPLFITANSV 177

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ--NQTITHCFQSVTLGLISHGRM 186
           +PD+D IL + +  + W  KY ++L  F++H  I ++  N + THCF SVT+GL+SHG  
Sbjct: 178 YPDRDFILVVVNSKEWWMPKYIDILGAFSKHKTILLDKENASFTHCFPSVTVGLVSHGPK 237

Query: 187 VINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
           +I+PT +P  K+L                         P+L+LV+R   +GR ILN +E+
Sbjct: 238 IIDPTQIPNSKSL-------------------------PRLILVSRYGNIGRVILNEKEI 272

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
           K+  E++GF+V  F   ++TS+ ++++ I S H MVGVHGA LTH LFLRPGS+L+QVVP
Sbjct: 273 KEMLEDVGFEVVTFRSSKTTSVREAYKLIKSSHVMVGVHGAALTHLLFLRPGSMLVQVVP 332

Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW--SN 364
           +G  W S   +E PA+ + LEY+EY++  EESSLVEKY  +DLVLK+P A+ G +W  + 
Sbjct: 333 LGLGWASKTCYESPAKAMKLEYIEYRVNVEESSLVEKYNRDDLVLKDPIAYRGMDWNVTK 392

Query: 365 MRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMD 397
           M+VYLK Q+V+LD++RFR ++ +AY+KAK  MD
Sbjct: 393 MKVYLKEQDVRLDVNRFRKHMNEAYEKAKLFMD 425


>gi|297738189|emb|CBI27390.3| unnamed protein product [Vitis vinifera]
          Length = 354

 Score =  363 bits (932), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 184/391 (47%), Positives = 246/391 (62%), Gaps = 41/391 (10%)

Query: 11  AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
           AK I+CDRSH+ YD+C +N + + +P TSTFF V      P+Q +L  K +PYPRK + S
Sbjct: 2   AKQISCDRSHQRYDMCTVNATTVLEPTTSTFFLV-----EPTQ-ALMEKVRPYPRKWETS 55

Query: 71  AMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
            M ++KE+ +T S PP+ SC V H++PALVFSAGGY GN FH+F D F+PL+IT+NS FP
Sbjct: 56  VMGRIKEVRLT-SGPPSPSCQVHHSAPALVFSAGGYTGNVFHDFNDGFIPLYITVNSIFP 114

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
           D + +L I      W  KYA+LL   ++HPI+N+     THCF    +GLISHG M I+P
Sbjct: 115 DGNYVLVIHSSRRWWESKYADLLHTLSKHPIVNLEKANATHCFPYAHVGLISHGFMTIDP 174

Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
           TL+P    L  F+ FL  AY +N        H  P                         
Sbjct: 175 TLMPSSINLTHFRGFLDAAYAQN--------HPFP------------------------T 202

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
           EE+GF V +F P  +TSL +++  I+S HAM+GVHGA LTHSLFLRPGSVLMQVVP+G  
Sbjct: 203 EEVGFHVILFHPTPTTSLREAYALINSSHAMMGVHGAALTHSLFLRPGSVLMQVVPLGLA 262

Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN--MRVY 368
           W +   F   +R LGLEY+EYKI ++ES+L +KYG +D+++K+P    G  WS   M VY
Sbjct: 263 WAAETCFGNSSRELGLEYMEYKIGEKESTLADKYGNDDIMVKDPVRAQGKGWSTKIMDVY 322

Query: 369 LKTQNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
           LK QN+ LD+ RFR +L++AY KA K +  +
Sbjct: 323 LKEQNITLDLVRFRRHLEEAYNKASKFLQNK 353


>gi|255542122|ref|XP_002512125.1| glycosyltransferase, putative [Ricinus communis]
 gi|223549305|gb|EEF50794.1| glycosyltransferase, putative [Ricinus communis]
          Length = 390

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 154/303 (50%), Positives = 204/303 (67%), Gaps = 5/303 (1%)

Query: 12  KSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRI-KTQPYPRKSDKS 70
           + I CDRS+++YDLC I+G    DP  STF+ V      PS P   + K +PYPRK +  
Sbjct: 74  RKINCDRSNKAYDLCTIHGPTFLDPTVSTFYLVDPQSQGPSPPLHSMEKIRPYPRKWETV 133

Query: 71  AMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
            M+++KELT+T S P +  C V H  PALVFSAGGY GNFFH+F D  +PLFIT+ + F 
Sbjct: 134 TMNRIKELTLT-SGPSSPPCQVHHNVPALVFSAGGYTGNFFHDFNDGLIPLFITVKTVFS 192

Query: 131 D-QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
           D QD +L I+   D W  KYA+LL  F+++PIIN++N + THCF S  +GL+SHG M IN
Sbjct: 193 DDQDFVLVISKARDWWVSKYADLLRAFSKYPIINLDNDSSTHCFPSANIGLVSHGFMTIN 252

Query: 190 PTLLPKPKTLVDFQSFLANAYNENTNTSSSFH--HTKPKLVLVNRNARVGRTILNLREVK 247
           P LLP  ++   F + L  AY  + N  S F+    +P+LV+ +R+  VGR ILN  EVK
Sbjct: 253 PKLLPNSQSFTHFHALLDKAYGHHQNQPSEFNSARKRPRLVITSRSGSVGRLILNQNEVK 312

Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
           K A+ +GFDVT+FEP   T L +++  I+S HAM+GVHGA LTHSLFLRPGSV +QVVP+
Sbjct: 313 KIAQNIGFDVTVFEPTPHTPLREAYALINSSHAMIGVHGAALTHSLFLRPGSVFLQVVPL 372

Query: 308 GTQ 310
           G +
Sbjct: 373 GNE 375


>gi|148910102|gb|ABR18133.1| unknown [Picea sitchensis]
          Length = 456

 Score =  310 bits (795), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 152/396 (38%), Positives = 244/396 (61%), Gaps = 12/396 (3%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
           P + +  + I CDRSH   DLC   G        S+F  +   D   S   +  K +PY 
Sbjct: 67  PGSYNEGEGIVCDRSHFRTDLCTAFGHVQMLANLSSFL-LHAQDKINS--GIEEKVRPYT 123

Query: 65  RKSDKSAMSKVKELTITT---SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
           RK +K  M+ V E+T+ +   ++  N++C V H  PA+V+S  GY GN +HEF D  +PL
Sbjct: 124 RKWEKDVMAIVHEVTLKSVMLTSSSNVNCDVVHDVPAIVYSTSGYTGNLYHEFNDGIIPL 183

Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
           +IT  +   +++V+  I DC++ W  KY E+L + T++ +IN  N+T+ HCF  VT GL 
Sbjct: 184 YIT--TQHLEKEVVFVIVDCHNWWLTKYDEILKQLTKYRVINFENETMVHCFPEVTAGLF 241

Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTIL 241
            HG ++I+P+L+   K+++DF++ +  AY  +       +  +P+L ++ R     R IL
Sbjct: 242 IHGDLMIDPSLMFHNKSILDFRALINRAYTPHWFIPEP-NSDQPRLTILVREG--NRVIL 298

Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
           NL+EV   AE+LGF+VT+++P  +T L  ++  ++S H ++GVHGA LTH LF+RPGSV 
Sbjct: 299 NLKEVVGLAEQLGFNVTVWKPLRTTELKTTYALLNSSHVLLGVHGAALTHFLFMRPGSVF 358

Query: 302 MQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGAN 361
           +QV+P+GT+W +  YF +PA  +G +Y+ YKI+ EES+L  KY  ND++L NP+A     
Sbjct: 359 IQVIPLGTEWAAHTYFGEPAERMGFQYIGYKIRLEESTLSHKYSKNDIILTNPRAVVQQG 418

Query: 362 WS-NMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
           W+   ++YL++Q+V +++ R +  L +A +KA K M
Sbjct: 419 WAVTKQIYLESQDVIINLSRMKRVLINAKRKANKFM 454


>gi|302816141|ref|XP_002989750.1| glycosyltransferase belonging to CAZy family GT61 [Selaginella
           moellendorffii]
 gi|300142527|gb|EFJ09227.1| glycosyltransferase belonging to CAZy family GT61 [Selaginella
           moellendorffii]
          Length = 460

 Score =  290 bits (743), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 150/385 (38%), Positives = 228/385 (59%), Gaps = 17/385 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CDRSH   D+C + G    D ++S+F  V    ST      RIK  PY RK ++S M  V
Sbjct: 76  CDRSHPRSDVCYLKGDVRMDSRSSSFVLVAKNASTRLGEE-RIK--PYTRKWEQSCMDIV 132

Query: 76  KELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVI 135
            E+ +   A     C V H+ PA+VF+ GGY GN +HEF D  +PL+IT  S   +++V+
Sbjct: 133 HEVRVRAGA--ERRCDVYHSVPAVVFTTGGYTGNVYHEFHDGLIPLYIT--SQHLNREVV 188

Query: 136 LAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPK 195
               + ++ W  KY +++ + + HP+I+ +     HCF  VT+GL  H  M I P+L+P 
Sbjct: 189 FVGVELHNWWLTKYGDVIAQMSNHPVIDFDRDERIHCFPEVTVGLHIHDEMAIEPSLMPG 248

Query: 196 PKTLVDFQSFLANAYNENTNTSSSFHHT-------KPKLVLVNRNARVGRTILNLREVKK 248
            +T+VDF++ L  AY E    +             +P+L ++ RN    R ILNL E+  
Sbjct: 249 NQTIVDFRNLLDAAYQEELAQAPEPPPPSPASSIGQPRLTIIARND--TRVILNLDEIVG 306

Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
            A ELGF V I +P+ ++ L   +R ++S   ++GVHGA +TH LF+RPGSV +QVVP+G
Sbjct: 307 MARELGFWVEIRKPDRTSELKRIYRALNSSDVLLGVHGAAMTHFLFMRPGSVFIQVVPLG 366

Query: 309 TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-V 367
           T+W +  Y+ +PA+ LGL+Y+ Y+I+  ESSL ++Y  ND VL +P   +   W+ ++ +
Sbjct: 367 TKWAAAAYYGQPAQKLGLDYIGYEIEASESSLSDRYDENDTVLTDPAKISTQGWAVVKEI 426

Query: 368 YLKTQNVKLDIDRFRIYLKDAYKKA 392
           YL+ QNV+L + RF+  L DA +KA
Sbjct: 427 YLEGQNVRLSLPRFKRTLLDARRKA 451


>gi|18405766|ref|NP_565952.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
 gi|13877689|gb|AAK43922.1|AF370603_1 Unknown protein [Arabidopsis thaliana]
 gi|16930451|gb|AAL31911.1|AF419579_1 At2g41640/T32G6.16 [Arabidopsis thaliana]
 gi|2618699|gb|AAB84346.1| expressed protein [Arabidopsis thaliana]
 gi|27764926|gb|AAO23584.1| At2g41640/T32G6.16 [Arabidopsis thaliana]
 gi|330254916|gb|AEC10010.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
          Length = 500

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 142/391 (36%), Positives = 230/391 (58%), Gaps = 20/391 (5%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           +I CDR+    D+C++ G    +  +S+ F    + +  ++P    K +PY RK + S M
Sbjct: 98  TICCDRTGLRSDICVMKGDVRTNSASSSIFLFTSSTNNNTKPE---KIKPYTRKWETSVM 154

Query: 73  SKVKELTITTSAPPNLS---CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
             V+EL + T      S   C V H  PA+ FS GGY GN +HEF D  +PLFIT  S  
Sbjct: 155 DTVQELNLITKDSNKSSDRVCDVYHDVPAVFFSTGGYTGNVYHEFNDGIIPLFIT--SQH 212

Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
            ++ V+  I + +D W  KY +++ + + +P+++ N  T THCF+  T+GL  H  + +N
Sbjct: 213 YNKKVVFVIVEYHDWWEMKYGDVVSQLSDYPLVDFNGDTRTHCFKEATVGLRIHDELTVN 272

Query: 190 PTLLPKPKTLVDFQSFLANAYNE----------NTNTSSSFHHTKPKLVLVNRNARVGRT 239
            +L+   +T+VDF++ L   Y+             N ++     KPKLV+++RN    R 
Sbjct: 273 SSLVIGNQTIVDFRNVLDRGYSHRIQSLTQEETEANVTALDFKKKPKLVILSRNGS-SRA 331

Query: 240 ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGS 299
           ILN   + + AE+ GF+V +  P+++T +A  +R +++   M+GVHGA +TH LFL+P +
Sbjct: 332 ILNENLLVELAEKTGFNVEVLRPQKTTEMAKIYRSLNTSDVMIGVHGAAMTHFLFLKPKT 391

Query: 300 VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
           V +Q++P+GT W +  Y+ +PA+ LGL+Y+ YKI  +ESSL E+YG +D V+++P +   
Sbjct: 392 VFIQIIPLGTDWAAETYYGEPAKKLGLKYVGYKIAPKESSLYEEYGKDDPVIRDPDSLND 451

Query: 360 ANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
             W    ++YL+ QNVKLD+ RFR  L  +Y
Sbjct: 452 KGWEYTKKIYLQGQNVKLDLRRFRETLTRSY 482


>gi|334184861|ref|NP_001189728.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
 gi|330254917|gb|AEC10011.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
          Length = 492

 Score =  280 bits (717), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 140/384 (36%), Positives = 227/384 (59%), Gaps = 20/384 (5%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           +I CDR+    D+C++ G    +  +S+ F    + +  ++P    K +PY RK + S M
Sbjct: 98  TICCDRTGLRSDICVMKGDVRTNSASSSIFLFTSSTNNNTKPE---KIKPYTRKWETSVM 154

Query: 73  SKVKELTITTSAPPNLS---CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
             V+EL + T      S   C V H  PA+ FS GGY GN +HEF D  +PLFIT  S  
Sbjct: 155 DTVQELNLITKDSNKSSDRVCDVYHDVPAVFFSTGGYTGNVYHEFNDGIIPLFIT--SQH 212

Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
            ++ V+  I + +D W  KY +++ + + +P+++ N  T THCF+  T+GL  H  + +N
Sbjct: 213 YNKKVVFVIVEYHDWWEMKYGDVVSQLSDYPLVDFNGDTRTHCFKEATVGLRIHDELTVN 272

Query: 190 PTLLPKPKTLVDFQSFLANAYNE----------NTNTSSSFHHTKPKLVLVNRNARVGRT 239
            +L+   +T+VDF++ L   Y+             N ++     KPKLV+++RN    R 
Sbjct: 273 SSLVIGNQTIVDFRNVLDRGYSHRIQSLTQEETEANVTALDFKKKPKLVILSRNGS-SRA 331

Query: 240 ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGS 299
           ILN   + + AE+ GF+V +  P+++T +A  +R +++   M+GVHGA +TH LFL+P +
Sbjct: 332 ILNENLLVELAEKTGFNVEVLRPQKTTEMAKIYRSLNTSDVMIGVHGAAMTHFLFLKPKT 391

Query: 300 VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
           V +Q++P+GT W +  Y+ +PA+ LGL+Y+ YKI  +ESSL E+YG +D V+++P +   
Sbjct: 392 VFIQIIPLGTDWAAETYYGEPAKKLGLKYVGYKIAPKESSLYEEYGKDDPVIRDPDSLND 451

Query: 360 ANWS-NMRVYLKTQNVKLDIDRFR 382
             W    ++YL+ QNVKLD+ RFR
Sbjct: 452 KGWEYTKKIYLQGQNVKLDLRRFR 475


>gi|297824111|ref|XP_002879938.1| hypothetical protein ARALYDRAFT_483244 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297325777|gb|EFH56197.1| hypothetical protein ARALYDRAFT_483244 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 498

 Score =  279 bits (714), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 139/391 (35%), Positives = 229/391 (58%), Gaps = 20/391 (5%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           +I CDR+    D+C + G    +  +S+ F    +    ++P    K +PY RK + S M
Sbjct: 96  TICCDRTGLRSDICEMKGDIRTNSASSSIFLFTSSTKNNTKPE---KIKPYTRKWETSVM 152

Query: 73  SKVKELTITT---SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
             V+EL + T   ++  +  C V H  PA+ FS GGY GN +HEF D  +PLFIT  S  
Sbjct: 153 DTVQELNLITKDSNSSSDRVCDVYHDVPAVFFSTGGYTGNVYHEFNDGIIPLFIT--SQH 210

Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
            ++ V+  I + +D W  KY +++ + + +P+++ +    THCF+  T+GL  H  + +N
Sbjct: 211 YNKKVVFVIVEYHDWWEMKYGDIVSQLSDYPLVDFSGDARTHCFKEATVGLRIHDELTVN 270

Query: 190 PTLLPKPKTLVDFQSFLANAYNE----------NTNTSSSFHHTKPKLVLVNRNARVGRT 239
            +L+   +T+VDF++ L   Y+             N ++     KPKLV+++RN    R 
Sbjct: 271 SSLVIGNQTIVDFRNVLDRGYSHRIQSLIQEETEANVTALDFKKKPKLVILSRNGS-SRA 329

Query: 240 ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGS 299
           ILN   + + AEE GF+V +  P+++T +A  +R +++   M+GVHGA +TH LFL+P +
Sbjct: 330 ILNENLLVELAEETGFNVEVLRPQKTTEMAKIYRSLNTSDVMIGVHGAAMTHFLFLKPKT 389

Query: 300 VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
           V +Q++P+GT W +  Y+ +PA+ LGL+Y+ YKI  +ESSL E+YG +D ++++P +   
Sbjct: 390 VFIQIIPLGTDWAAETYYGEPAKKLGLKYIGYKIAPKESSLYEEYGKDDPIIRDPDSLND 449

Query: 360 ANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
             W    ++YL+ QNVKLD+ RFR  L  +Y
Sbjct: 450 KGWEYTKKIYLQGQNVKLDLRRFRETLTRSY 480


>gi|115436744|ref|NP_001043125.1| Os01g0498300 [Oryza sativa Japonica Group]
 gi|55297629|dbj|BAD69004.1| HGA1-like protein [Oryza sativa Japonica Group]
 gi|56202275|dbj|BAD73716.1| HGA1-like protein [Oryza sativa Japonica Group]
 gi|113532656|dbj|BAF05039.1| Os01g0498300 [Oryza sativa Japonica Group]
 gi|125526083|gb|EAY74197.1| hypothetical protein OsI_02077 [Oryza sativa Indica Group]
 gi|125570517|gb|EAZ12032.1| hypothetical protein OsJ_01912 [Oryza sativa Japonica Group]
 gi|215697344|dbj|BAG91338.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 536

 Score =  276 bits (707), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 151/421 (35%), Positives = 222/421 (52%), Gaps = 36/421 (8%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKT-STFFSVGHTDSTPSQPSLRIKTQPY 63
           P A     +I CDR+    D+C++ G    +  + S F  V   D++ +      + +PY
Sbjct: 105 PCAAVSNGTICCDRTAMRTDVCIMRGDVRTEAASNSLFLLVPPPDNSTAAAGRDERIRPY 164

Query: 64  PRKSDKSAMSKVKELTITT-----SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCF 118
            RK + S MS + EL +       +AP   SC V H  PA+VFS GGY GN +HEF D  
Sbjct: 165 TRKWESSIMSTIDELRLRAVPEGGAAPA--SCDVRHDVPAVVFSTGGYTGNVYHEFNDGI 222

Query: 119 VPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTL 178
           +PL+IT   +  ++ V+  + + +D W  KY  ++ + + +  I+  N   THCF    +
Sbjct: 223 IPLYITARQY--NKKVVFVMLEYHDWWMTKYGHIVEQLSDYAPIDFTNDRRTHCFPEAVV 280

Query: 179 GLISHGRMVINPTLLPKPKTLVDFQSFLANAY-----------------------NENTN 215
           GL  H  + I+   +P  +T+ DF+  L +AY                         +  
Sbjct: 281 GLRIHDELAIDAARMPGNRTIQDFRRMLDDAYRGRIQMIIEEEEKAAAVALGTPTQGSIR 340

Query: 216 TSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFI 275
             S+    KP+LV+V+RN    R I N  E+ +AA   GF V + +P + T LA  +R +
Sbjct: 341 KKSALKDDKPRLVIVSRNG--SRAIENEAELVRAAAGAGFRVAVLQPRQDTELAKMYRAL 398

Query: 276 HSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQ 335
           ++   MVGVHGA +TH LF+RPGSV +QVVP+GT W +  Y+ +PAR LGL Y+ YKIK 
Sbjct: 399 NASDVMVGVHGAAMTHFLFMRPGSVFIQVVPLGTDWAAETYYGEPARRLGLRYMPYKIKP 458

Query: 336 EESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
            ESSL  +Y  +D VL +P       W    +VYL  QNV+LD+ RFR  L+DAY    +
Sbjct: 459 AESSLYRQYAKDDAVLTDPDTVNAKGWQVTKKVYLDGQNVRLDMVRFRRRLRDAYDHWAE 518

Query: 395 L 395
           L
Sbjct: 519 L 519


>gi|449518915|ref|XP_004166481.1| PREDICTED: uncharacterized LOC101213872 [Cucumis sativus]
          Length = 518

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 144/401 (35%), Positives = 226/401 (56%), Gaps = 24/401 (5%)

Query: 10  PAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDS----TPSQPSLRI-KTQPYP 64
           P  +I CDRS    D+C++ G    D  +S+ F     DS          +++ K +PY 
Sbjct: 108 PNGTICCDRSSIRSDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYT 167

Query: 65  RKSDKSAMSKVKELTITTSAPPN-----LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
           RK +K+ M  + EL +      N       C V H  PA+ FS GGY GN +HEF D  +
Sbjct: 168 RKWEKNTMDTIDELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGIL 227

Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLG 179
           PL+IT  SH  +++V+  I + +  W  KYA++L + + +P+I++     THCF  V  G
Sbjct: 228 PLYIT--SHSMNKEVVFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAG 285

Query: 180 LISHGRMVINPTLLPKPKTLVDFQSFLANAYN---------ENTNTSSSFHHTK-PKLVL 229
           L  H  + ++P+L+   K++VDF++ L  AY          E      S H +K PKLV+
Sbjct: 286 LRIHDELTVDPSLMEGGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPKLVV 345

Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGL 289
           ++R     R I N + + K AE +GF+V +  P+++T LA  +R ++  + +VGVHGA +
Sbjct: 346 LSRKGS-SRVITNEKLMVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAM 404

Query: 290 THSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDL 349
           THSLF+RP +V +Q++P+GT W +  Y+ +PA+ LGL+Y+ Y+I  +ESSL   +  +D 
Sbjct: 405 THSLFMRPNAVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDP 464

Query: 350 VLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
           VL NP +     W    ++YL  QNV+L++ RF   L+ AY
Sbjct: 465 VLVNPDSITKKGWEYTKKIYLDGQNVRLNLGRFEKRLERAY 505


>gi|449468035|ref|XP_004151727.1| PREDICTED: uncharacterized protein LOC101213872 [Cucumis sativus]
          Length = 518

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 144/401 (35%), Positives = 226/401 (56%), Gaps = 24/401 (5%)

Query: 10  PAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDS----TPSQPSLRI-KTQPYP 64
           P  +I CDRS    D+C++ G    D  +S+ F     DS          +++ K +PY 
Sbjct: 108 PNGTICCDRSSIRSDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYT 167

Query: 65  RKSDKSAMSKVKELTITTSAPPN-----LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
           RK +K+ M  + EL +      N       C V H  PA+ FS GGY GN +HEF D  +
Sbjct: 168 RKWEKNTMDTIDELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGIL 227

Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLG 179
           PL+IT  SH  +++V+  I + +  W  KYA++L + + +P+I++     THCF  V  G
Sbjct: 228 PLYIT--SHNMNKEVVFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAG 285

Query: 180 LISHGRMVINPTLLPKPKTLVDFQSFLANAYN---------ENTNTSSSFHHTK-PKLVL 229
           L  H  + ++P+L+   K++VDF++ L  AY          E      S H +K PKLV+
Sbjct: 286 LRIHDELTVDPSLMEGGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPKLVV 345

Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGL 289
           ++R     R I N + + K AE +GF+V +  P+++T LA  +R ++  + +VGVHGA +
Sbjct: 346 LSRKGS-SRVITNEKLMVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAM 404

Query: 290 THSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDL 349
           THSLF+RP +V +Q++P+GT W +  Y+ +PA+ LGL+Y+ Y+I  +ESSL   +  +D 
Sbjct: 405 THSLFMRPNAVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDP 464

Query: 350 VLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
           VL NP +     W    ++YL  QNV+L++ RF   L+ AY
Sbjct: 465 VLVNPDSITKKGWEYTKKIYLDGQNVRLNLGRFEKRLERAY 505


>gi|15228253|ref|NP_187643.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
 gi|6056194|gb|AAF02811.1|AC009400_7 unknown protein [Arabidopsis thaliana]
 gi|28973746|gb|AAO64189.1| unknown protein [Arabidopsis thaliana]
 gi|29824255|gb|AAP04088.1| unknown protein [Arabidopsis thaliana]
 gi|110736729|dbj|BAF00327.1| hypothetical protein [Arabidopsis thaliana]
 gi|332641370|gb|AEE74891.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
          Length = 494

 Score =  270 bits (690), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 145/407 (35%), Positives = 220/407 (54%), Gaps = 24/407 (5%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
           P      +SI+CDR+    D+C + G       +S+ F     D T  Q  L+ K +PY 
Sbjct: 91  PKTSQKEESISCDRTGYRSDICFMKGDIRTHSPSSSIFLYTSNDLTTDQ-VLQEKIKPYT 149

Query: 65  RKSDKSAMSKVKELTITTSA----PPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
           RK + S M  + EL + T           C V H  PA++FS GGY GN +HEF D  +P
Sbjct: 150 RKWETSIMETIPELKLVTKDMKLFGDKRKCEVIHEVPAVLFSTGGYTGNLYHEFNDGLIP 209

Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
           L+IT  S   ++ V+  IA+ +  W  KY ++L + + + +I+ N    THCF+   +GL
Sbjct: 210 LYIT--SKRFNKKVVFVIAEYHKWWEMKYGDVLSQLSDYSLIDFNKDKRTHCFKEAIVGL 267

Query: 181 ISHGRMVINPTLLPKPKTLV-DFQSFLANAYNENTN-----TSSSFHH--------TKPK 226
             HG + ++P+ +    T + +F++ L  AY    N         FH          +PK
Sbjct: 268 RIHGELTVDPSQMQDDGTTINEFRNVLDRAYRPRINRLDRLEEQRFHARLAQRRKAKRPK 327

Query: 227 LVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHG 286
           L L +R     R I N   + K A+ +GFD+ +  P+ +T LA  +R ++S   MVGVHG
Sbjct: 328 LALFSRTG--SRGITNEDLMVKMAQRIGFDIEVLRPDRTTELAKIYRVLNSSKVMVGVHG 385

Query: 287 AGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGA 346
           A +TH LF++PGS+ +Q++P+GT W +  Y+ +PA+ LGL+Y  YKI   ESSL EKY  
Sbjct: 386 AAMTHFLFMKPGSIFIQIIPLGTDWAAETYYGEPAKKLGLDYNGYKILPRESSLYEKYDK 445

Query: 347 NDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKA 392
           +D +LK+P +     W   + +YL  Q V+LD+ RF+  L DAY K+
Sbjct: 446 DDPILKDPNSITKKGWQFTKGIYLNDQKVRLDLHRFKKLLIDAYAKS 492


>gi|224139884|ref|XP_002323324.1| predicted protein [Populus trichocarpa]
 gi|222867954|gb|EEF05085.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 149/421 (35%), Positives = 230/421 (54%), Gaps = 45/421 (10%)

Query: 11  AKSITCDRSHRSYDLCLINGSALFDPKTST--FFSVGHTDSTPSQPSLRI---------K 59
            ++I CDRS    D+C++ G       +S+   F+  + +S  ++ S  +         K
Sbjct: 13  VRTICCDRSSMRSDVCVMKGDVRTHSASSSIFLFTSRNNNSVMNKVSSLVDKDEELHHEK 72

Query: 60  TQPYPRKSDKSAMSKVKELTI---TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMD 116
            +PY RK + S M  + EL +   T ++  N  C V H  PA+ FS GGY GN +HEF D
Sbjct: 73  IKPYTRKWETSVMDSIDELGLIAKTENSRINHHCDVMHDVPAVFFSTGGYTGNVYHEFND 132

Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
             +PL+IT + HF  + V+  I D ++ W  KY  +L   + +P I+ +    THCF   
Sbjct: 133 GILPLYIT-SQHFKKK-VVFVILDYHNWWIMKYGNILSLLSDYPAIDFSGDKKTHCFPEA 190

Query: 177 TLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY-------------------------- 210
             GL  H  + ++P+L+ + K++VDF++FL  AY                          
Sbjct: 191 IAGLRIHDELTVDPSLMQENKSIVDFRNFLDRAYWPRIKSMIKGEERGAQKKLELKAHSS 250

Query: 211 NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLAD 270
            +N          KPKLV+++RN    R I N   + K AEE+GF V +  PE +T LA 
Sbjct: 251 KKNLKQVHEATLKKPKLVILSRNG--SRAITNENLLVKMAEEIGFRVEVMRPEPTTELAR 308

Query: 271 SFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLE 330
            +R ++S   M+GVHGA +TH LF++PGSV +QV+P+GT+W +  Y+ +PAR LGL+Y+ 
Sbjct: 309 IYRALNSSEVMIGVHGAAMTHFLFMKPGSVFIQVIPLGTEWAADAYYGEPARKLGLKYIG 368

Query: 331 YKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAY 389
           Y+I   ESSL +KY  ND VL++P++ +   W   + +YL  QNV+L++ RF+  L  AY
Sbjct: 369 YQILPRESSLYDKYDKNDPVLRDPRSVSDKGWQYTKSIYLDNQNVRLNLGRFQRRLLRAY 428

Query: 390 K 390
           +
Sbjct: 429 R 429


>gi|195615916|gb|ACG29788.1| glycosyltransferase [Zea mays]
          Length = 494

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 148/414 (35%), Positives = 220/414 (53%), Gaps = 36/414 (8%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
           P A     ++ CDR+    D+C++ G       +++ F +   +S  S+P+   + +PY 
Sbjct: 69  PCADVANDTVCCDRTALRTDVCVMRGDVRTQAASNSLFLLLPPNS--SRPATDERIRPYT 126

Query: 65  RKSDKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF 122
           RK + S MS + EL +  SAP      C V H  PA+VFS GGY GN +HEF D  +PL+
Sbjct: 127 RKWESSIMSTIDELRLR-SAPEGAPSRCDVRHDVPAVVFSTGGYTGNVYHEFNDGIIPLY 185

Query: 123 ITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS 182
           IT   H+ ++ V+  + + +D W  KY  ++ + + +P I+  N   THCF    +GL  
Sbjct: 186 ITAR-HY-NKKVVFVMLEYHDWWMTKYGHIVEQLSDYPPIDFTNDRRTHCFPEAVVGLRI 243

Query: 183 HGRMVINPTLLPKPKTLVDFQSFLANAYNENTNT-------------------------- 216
           H  + I+   +P  +T+ DF+  L +A+     T                          
Sbjct: 244 HDELAIDAARMPANQTIQDFRLMLDDAHRGRIQTIIEEEKKEAAAARRQEHRGSKRSRQR 303

Query: 217 SSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIH 276
           S +    KP+L +++RN    R I N  E+ +AA   GF V + +P + T LA  +R ++
Sbjct: 304 SKTALAEKPRLTIISRNG--SRAIENEAELVRAAAGAGFRVAVLQPRQDTELARMYRALN 361

Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
           +   MVGVHGA +TH LF+RPGSV +QVVP+GT W +  Y+ +PAR LGL YL YKI   
Sbjct: 362 ASDVMVGVHGAAMTHFLFMRPGSVFIQVVPLGTDWAAETYYGEPARRLGLRYLPYKILPS 421

Query: 337 ESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
           ESS+  KY  +D VL +P       W    +VYL  QNV+LD+ RFR  L+ AY
Sbjct: 422 ESSIYRKYANDDPVLTDPDTVNAKGWQVTKQVYLDGQNVRLDMARFRRRLRQAY 475


>gi|195611628|gb|ACG27644.1| glycosyltransferase [Zea mays]
          Length = 528

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 148/414 (35%), Positives = 220/414 (53%), Gaps = 36/414 (8%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
           P A     ++ CDR+    D+C++ G       +++ F +   +S  S+P+   + +PY 
Sbjct: 103 PCADVANDTVCCDRTALRTDVCVMRGDVRTQAASNSLFLLLPPNS--SRPATDERIRPYT 160

Query: 65  RKSDKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF 122
           RK + S MS + EL +  SAP      C V H  PA+VFS GGY GN +HEF D  +PL+
Sbjct: 161 RKWESSIMSTIDELRLR-SAPEGAPSRCDVRHDVPAVVFSTGGYTGNVYHEFNDGIIPLY 219

Query: 123 ITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS 182
           IT   H+ ++ V+  + + +D W  KY  ++ + + +P I+  N   THCF    +GL  
Sbjct: 220 ITAR-HY-NKKVVFVMLEYHDWWMTKYGHIVEQLSDYPPIDFTNDRRTHCFPEAVVGLRI 277

Query: 183 HGRMVINPTLLPKPKTLVDFQSFLANAYNENTNT-------------------------- 216
           H  + I+   +P  +T+ DF+  L +A+     T                          
Sbjct: 278 HDELAIDAARMPANQTIQDFRLMLDDAHRGRIQTIIEEEKKEAAAARRQEHRGSKRSRQR 337

Query: 217 SSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIH 276
           S +    KP+L +++RN    R I N  E+ +AA   GF V + +P + T LA  +R ++
Sbjct: 338 SKTALAEKPRLTIISRNG--SRAIENEAELVRAAAGAGFRVAVLQPRQDTELARMYRALN 395

Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
           +   MVGVHGA +TH LF+RPGSV +QVVP+GT W +  Y+ +PAR LGL YL YKI   
Sbjct: 396 ASDVMVGVHGAAMTHFLFMRPGSVFIQVVPLGTDWAAETYYGEPARRLGLRYLPYKILPS 455

Query: 337 ESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
           ESS+  KY  +D VL +P       W    +VYL  QNV+LD+ RFR  L+ AY
Sbjct: 456 ESSIYRKYANDDPVLTDPDTVNAKGWQVTKQVYLDGQNVRLDMARFRRRLRQAY 509


>gi|226492060|ref|NP_001145867.1| uncharacterized protein LOC100279381 [Zea mays]
 gi|219884767|gb|ACL52758.1| unknown [Zea mays]
          Length = 527

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 148/413 (35%), Positives = 219/413 (53%), Gaps = 35/413 (8%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
           P A     ++ CDR+    D+C++ G       +++ F +   +S  S+P+     +PY 
Sbjct: 103 PCADVANDTVCCDRTALRTDVCVMRGHVRTQAASNSLFLLLPPNS--SRPATDEHIRPYT 160

Query: 65  RKSDKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF 122
           RK + S MS + EL +  SAP      C V H  PA+VFS GGY GN +HEF D  +PL+
Sbjct: 161 RKWESSIMSTIDELRLR-SAPEGAPSRCDVRHDVPAVVFSTGGYTGNVYHEFNDGIIPLY 219

Query: 123 ITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS 182
           IT   H+ ++ V+  + + +D W  KY  ++ + + +P I+  N   THCF    +GL  
Sbjct: 220 ITAR-HY-NKKVVFVMLEYHDWWMTKYGHIVEQLSDYPPIDFTNDRRTHCFPEAVVGLRI 277

Query: 183 HGRMVINPTLLPKPKTLVDFQSFLANAYNENTNT-------------------------S 217
           H  + I+   +P  +T+ DF+  L +A+     T                         S
Sbjct: 278 HDELAIDAARMPANQTIQDFRLMLDDAHRGRIQTIIEEKKEAAAARRQEHRGSKRSRQRS 337

Query: 218 SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHS 277
            +    KP+L +++RN    R I N  E+ +AA   GF V + +P + T LA  +R +++
Sbjct: 338 KTALAEKPRLTIISRNG--SRAIENEAELVRAAAGAGFRVAVLQPRQDTELARMYRALNA 395

Query: 278 CHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEE 337
              MVGVHGA +TH LF+RPGSV +QVVP+GT W +  Y+ +PAR LGL YL YKI   E
Sbjct: 396 SDVMVGVHGAAMTHFLFMRPGSVFIQVVPLGTDWAAETYYGEPARRLGLRYLPYKILPSE 455

Query: 338 SSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
           SS+  KY  +D VL +P       W    +VYL  QNV+LD+ RFR  L+ AY
Sbjct: 456 SSIYRKYANDDPVLTDPDTVNAKGWQVTKQVYLDGQNVRLDMARFRRRLRQAY 508


>gi|413948188|gb|AFW80837.1| glycosyltransferase [Zea mays]
          Length = 527

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 148/413 (35%), Positives = 219/413 (53%), Gaps = 35/413 (8%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
           P A     ++ CDR+    D+C++ G       +++ F +   +S  S+P+     +PY 
Sbjct: 103 PCADVANDTVCCDRTALRTDVCVMRGHVRTQAASNSLFLLLPPNS--SRPATDEHIRPYT 160

Query: 65  RKSDKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF 122
           RK + S MS + EL +  SAP      C V H  PA+VFS GGY GN +HEF D  +PL+
Sbjct: 161 RKWESSIMSTIDELRLR-SAPEGAPSRCDVRHDVPAVVFSTGGYTGNVYHEFNDGIIPLY 219

Query: 123 ITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS 182
           IT   H+ ++ V+  + + +D W  KY  ++ + + +P I+  N   THCF    +GL  
Sbjct: 220 ITAR-HY-NKKVVFVMLEYHDWWMTKYGHIVEQLSDYPPIDFTNDRRTHCFPEAVVGLRI 277

Query: 183 HGRMVINPTLLPKPKTLVDFQSFLANAYNENTNT-------------------------S 217
           H  + I+   +P  +T+ DF+  L +A+     T                         S
Sbjct: 278 HDELAIDAARMPANQTIQDFRLMLDDAHRGRIQTIIEEKKEAAAARRQEHRGSKRSRQRS 337

Query: 218 SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHS 277
            +    KP+L +++RN    R I N  E+ +AA   GF V + +P + T LA  +R +++
Sbjct: 338 KTALAEKPRLTIISRNG--SRAIENEAELVRAAAGAGFRVAVLQPRQDTELARMYRALNA 395

Query: 278 CHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEE 337
              MVGVHGA +TH LF+RPGSV +QVVP+GT W +  Y+ +PAR LGL YL YKI   E
Sbjct: 396 SDVMVGVHGAAMTHFLFMRPGSVFIQVVPLGTDWAAETYYGEPARRLGLRYLPYKILPSE 455

Query: 338 SSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
           SS+  KY  +D VL +P       W    +VYL  QNV+LD+ RFR  L+ AY
Sbjct: 456 SSIYRKYANDDPVLTDPDTVNAKGWQVTKQVYLDGQNVRLDMARFRRRLRQAY 508


>gi|297833814|ref|XP_002884789.1| hypothetical protein ARALYDRAFT_478362 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330629|gb|EFH61048.1| hypothetical protein ARALYDRAFT_478362 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 493

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 141/407 (34%), Positives = 224/407 (55%), Gaps = 25/407 (6%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
           P      +SI+CDR+    D+C + G       +S+   + +T +  +   L  K +PY 
Sbjct: 91  PKTSQNEESISCDRTGYRSDICFMKGDIRTHSPSSSI--ILYTSNDLTDNVLPEKIKPYT 148

Query: 65  RKSDKSAMSKVKELTITTSAPPNLS----CGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
           RK + S M  + EL + T           C V H  PA++FS GGY GN +HEF D  +P
Sbjct: 149 RKWETSIMETIHELKLVTKDMKRFGDRCKCEVIHEVPAVLFSTGGYTGNLYHEFNDGLIP 208

Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
           L+IT  S   ++ V+L IA+ +  W  KY ++L + + +P+I+ +    THCF+   +GL
Sbjct: 209 LYIT--SKRFNKKVLLVIAEYHKWWEMKYGDVLSQLSDYPLIDFSKDKRTHCFKEAIVGL 266

Query: 181 ISHGRMVINPTLLPKPKTLV-DFQSFLANAYNENTN-----TSSSFHH--------TKPK 226
             HG + ++P+ +   +T + +F++ L  AY    N         FH          +PK
Sbjct: 267 RIHGELTVDPSQMQDGRTTINEFRNVLDRAYGPRINRLDRLEEQRFHARVAKRRKAQRPK 326

Query: 227 LVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHG 286
           L L +R     R I N   + + A+ +GF+V +  P+ +T LA  +R ++S   MVGVHG
Sbjct: 327 LALFSRTG--SRGITNEDLMVQLAQRIGFEVEVLRPDRTTELAKIYRVLNSSKVMVGVHG 384

Query: 287 AGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGA 346
           A +TH LF++PGS+ +Q++P+GT W +  Y+ +PA+ LGL+Y+ YKI   ESSL EKY  
Sbjct: 385 AAMTHFLFMQPGSIFIQIIPLGTDWAAETYYGEPAKKLGLDYIGYKILPRESSLYEKYDK 444

Query: 347 NDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKA 392
           +D +L++P +     W   + +YL  Q V+LD+ RF+  L DAY K+
Sbjct: 445 DDPILRDPNSITKKGWQFTKGIYLNDQKVRLDLHRFKKVLVDAYAKS 491


>gi|42565997|ref|NP_191297.2| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
 gi|332646127|gb|AEE79648.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
          Length = 504

 Score =  264 bits (674), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 143/400 (35%), Positives = 225/400 (56%), Gaps = 32/400 (8%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           +I CDR+    D+C++ G       +S+ F      S  ++  +  K +PY RK + S M
Sbjct: 94  TICCDRTGSRSDVCIMKGDVRTHSASSSVFLF---TSLKNKTKITKKIKPYTRKWETSVM 150

Query: 73  SKVKELTITTSAPPNLS---------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
             V+EL +      N S         C V +  PA+ FS GGY GN +HEF D  +PLFI
Sbjct: 151 QTVQELNLVYRDEENNSLVVSSVNDICDVFYNVPAVFFSTGGYTGNVYHEFNDGIIPLFI 210

Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
           T  SH  ++ V+  I + +  W  KY +++ + + +P ++ N    THCF+   +GL  H
Sbjct: 211 T--SHHFNKKVVFVIVEYHSWWIMKYGDIVSQLSDYPPVDFNGDKRTHCFKEAIVGLKIH 268

Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAY-------------NENTNTSSSFHHTKPKLVLV 230
             + +  +L+   KT++DF++ L  AY                  T   F   KP LV++
Sbjct: 269 DELTVESSLMLGNKTILDFRNVLDQAYWPRIHGLIQEEELKAANKTEDGFK--KPILVIL 326

Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
           +RN    R ILN   + + AEE+GF V +  P+++T LA  +R ++S   M+GVHGA +T
Sbjct: 327 SRNG--SREILNESLLVELAEEIGFIVHVLRPDKTTELAKIYRCLNSSDVMIGVHGAAMT 384

Query: 291 HSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
           H LFL+P +V +Q++PIGT+W +  Y+ KPA+ + L+Y+ YKIK +ESSL ++YG +D +
Sbjct: 385 HLLFLKPKTVFIQIIPIGTEWAAETYYGKPAKKMRLKYIGYKIKPKESSLYDEYGIDDPI 444

Query: 351 LKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
           +++P++F    W    ++YL+ QNVKLD+ RFR  L  AY
Sbjct: 445 IRDPKSFTQKGWDYTKKIYLERQNVKLDLKRFRKPLSRAY 484


>gi|6735317|emb|CAB68144.1| putative protein [Arabidopsis thaliana]
          Length = 470

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 143/400 (35%), Positives = 225/400 (56%), Gaps = 32/400 (8%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           +I CDR+    D+C++ G       +S+ F      S  ++  +  K +PY RK + S M
Sbjct: 60  TICCDRTGSRSDVCIMKGDVRTHSASSSVFLF---TSLKNKTKITKKIKPYTRKWETSVM 116

Query: 73  SKVKELTITTSAPPNLS---------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
             V+EL +      N S         C V +  PA+ FS GGY GN +HEF D  +PLFI
Sbjct: 117 QTVQELNLVYRDEENNSLVVSSVNDICDVFYNVPAVFFSTGGYTGNVYHEFNDGIIPLFI 176

Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
           T  SH  ++ V+  I + +  W  KY +++ + + +P ++ N    THCF+   +GL  H
Sbjct: 177 T--SHHFNKKVVFVIVEYHSWWIMKYGDIVSQLSDYPPVDFNGDKRTHCFKEAIVGLKIH 234

Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAY-------------NENTNTSSSFHHTKPKLVLV 230
             + +  +L+   KT++DF++ L  AY                  T   F   KP LV++
Sbjct: 235 DELTVESSLMLGNKTILDFRNVLDQAYWPRIHGLIQEEELKAANKTEDGF--KKPILVIL 292

Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
           +RN    R ILN   + + AEE+GF V +  P+++T LA  +R ++S   M+GVHGA +T
Sbjct: 293 SRNG--SREILNESLLVELAEEIGFIVHVLRPDKTTELAKIYRCLNSSDVMIGVHGAAMT 350

Query: 291 HSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
           H LFL+P +V +Q++PIGT+W +  Y+ KPA+ + L+Y+ YKIK +ESSL ++YG +D +
Sbjct: 351 HLLFLKPKTVFIQIIPIGTEWAAETYYGKPAKKMRLKYIGYKIKPKESSLYDEYGIDDPI 410

Query: 351 LKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
           +++P++F    W    ++YL+ QNVKLD+ RFR  L  AY
Sbjct: 411 IRDPKSFTQKGWDYTKKIYLERQNVKLDLKRFRKPLSRAY 450


>gi|357509679|ref|XP_003625128.1| Glycosyltransferase [Medicago truncatula]
 gi|355500143|gb|AES81346.1| Glycosyltransferase [Medicago truncatula]
          Length = 541

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 147/431 (34%), Positives = 229/431 (53%), Gaps = 57/431 (13%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFF---SVGHTDSTPSQPSLRI----------- 58
           +I CDRS    D+C++ G       +S+ F   S+ H ++       R            
Sbjct: 100 TICCDRSGYRSDICVMKGDIRTHSSSSSIFLYNSISHGNNVSRTIEARKGEDEEDQVLQH 159

Query: 59  -KTQPYPRKSDKSAMSKVKELTITT---SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEF 114
            K +PY RK + S M  + EL + +   ++P    C V H  PA+ FS GGY GN +HEF
Sbjct: 160 EKIKPYTRKWETSVMDTIDELNLISKKVNSPSVRGCDVQHDVPAVFFSNGGYTGNVYHEF 219

Query: 115 MDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQ 174
            D  +PL+IT + HF ++ V+  I + ++ W  KY ++L   +  P IN +N   THCF 
Sbjct: 220 NDGIIPLYIT-SQHF-NKKVVFVILEYHEWWITKYGDILSHLSDFPPINFSNDNRTHCFP 277

Query: 175 SVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN----------------------- 211
              +GL  H  + ++  L+   K++V F++ L  AY+                       
Sbjct: 278 EAIVGLKIHDELAVDSALMEGNKSIVYFRNLLDEAYSPRIKGLIQDEEREAQEKLRQQQQ 337

Query: 212 --------ENTNTSSSFHH---TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIF 260
                    ++ TS        TKPKLV+V+R+    R I N   + K AEE+GF V + 
Sbjct: 338 QQISLSPSSDSETSQGLQEIARTKPKLVIVSRSG--SRAITNENLLVKMAEEIGFKVNVL 395

Query: 261 EPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKP 320
           +P+++T LA  +R ++    M+GVHGA +TH +F++P SV +QVVP+GT W +  Y+ +P
Sbjct: 396 KPQKTTELAKIYRVLNESDVMIGVHGAAMTHFMFMKPKSVFIQVVPLGTNWAADTYYGEP 455

Query: 321 ARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDID 379
           AR LGL+Y+ Y+I  +ESSL E+Y  +D +L++P++     W    ++YL +QNVKLD+ 
Sbjct: 456 ARKLGLKYIGYEIHPKESSLYERYDKSDPILRDPESINKKGWEYTKKIYLDSQNVKLDLR 515

Query: 380 RFRIYLKDAYK 390
           RFR  L  AY+
Sbjct: 516 RFRKRLHRAYE 526


>gi|317106706|dbj|BAJ53206.1| JHL06B08.8 [Jatropha curcas]
          Length = 530

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 144/421 (34%), Positives = 224/421 (53%), Gaps = 47/421 (11%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTP------SQPSLRIKTQPYPRK 66
           +I CDRS    D C++ G       +S+ F     ++T        +     K +PY RK
Sbjct: 99  TICCDRSSFRTDFCIMKGDVRTQYASSSIFLYTSRNTTSLIREDEEEEFQHEKIKPYTRK 158

Query: 67  SDKSAMSKVKELTITTSAPP---NLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
            + S M  +++L + +       +  C V H+ PA+ FS GGY GN +HEF D  +PL+I
Sbjct: 159 WETSVMDTIEQLHLISKQEKFAIDHQCDVKHSVPAVFFSTGGYTGNVYHEFNDGILPLYI 218

Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
           T + HF  Q V+  I + +D W  KY ++L   + +P I+ +    THCF    +GL  H
Sbjct: 219 T-SQHFNKQ-VVFVILEYHDWWITKYGDILSHLSDYPAIDFSGDKRTHCFPEAIVGLKIH 276

Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYN-------------------------------- 211
             + ++P+L+   K++VDF + L  AY                                 
Sbjct: 277 NELTVDPSLMQGNKSIVDFHNLLGKAYKPRINGLIRDEKREAEEKLKQKVLSLSPSSGTL 336

Query: 212 -ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLAD 270
            E  N        +PKLV+++RNA   R I N   + K AE +GF V +  P+ +T LA 
Sbjct: 337 LELKNDVQEAKLKRPKLVILSRNA--SRAITNEDLLVKMAEGIGFHVEVLRPDRTTELAK 394

Query: 271 SFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLE 330
            +R ++S   M+GVHGA +TH LF+RPGSV +QV+P+GT+W +  Y+ +PA+ LGL+Y+ 
Sbjct: 395 IYRALNSSEVMIGVHGAAMTHFLFMRPGSVFIQVIPLGTEWAAETYYGEPAKKLGLKYIG 454

Query: 331 YKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAY 389
           YKI   ESSL +KY  ND VL++P++ +   W   + +YL +QN++L++ RF+  L  AY
Sbjct: 455 YKILPRESSLYDKYDKNDPVLQDPESISKKGWQYTKTIYLDSQNLRLNLGRFQKRLVRAY 514

Query: 390 K 390
           +
Sbjct: 515 Q 515


>gi|297817062|ref|XP_002876414.1| hypothetical protein ARALYDRAFT_486181 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322252|gb|EFH52673.1| hypothetical protein ARALYDRAFT_486181 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 512

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 138/405 (34%), Positives = 226/405 (55%), Gaps = 35/405 (8%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           +I CDR+    D+C++ G       +S+ F      S  ++ ++  K +PY RK + S M
Sbjct: 95  TICCDRTGFRSDVCIMKGDVRTHSASSSVFLF---TSLKNKITITGKIKPYTRKWETSVM 151

Query: 73  SKVKELT-----------ITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
             V++L            ++     N  C V +  PA+ FS GGY GN +HEF D  +PL
Sbjct: 152 QTVQQLNLVYRDEKNNYLVSVDEHNNNICDVFYNVPAVFFSTGGYTGNVYHEFNDGIIPL 211

Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
           FIT  SH  ++ V+  I + +  W  KY +++ + + +P ++ N    T CF+   +GL 
Sbjct: 212 FIT--SHHFNKKVVFVIVEYHSWWVMKYGDIVSQLSDYPPVDFNGDKRTQCFKEAIVGLK 269

Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAY----------------NENTNTSSSFHHTKP 225
            H  + ++ +L+   KT++DF++ L  AY                N+           KP
Sbjct: 270 IHDELTVDSSLMLGNKTILDFRNVLNQAYWPRIRGLSQEEELEAANKTGKRVQEDGFKKP 329

Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           KLV+++RN    R ILN   +   AEE+GF V +  P+++T LA  ++ ++S   M+GVH
Sbjct: 330 KLVILSRNG--SREILNDGLLVALAEEIGFIVYVLRPDKTTELAKIYKCLNSSDVMIGVH 387

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYG 345
           GA +TH LF++P +V +Q++PIGT+W +  Y+ KPA+ + L+Y+ YKIK +ESSL ++YG
Sbjct: 388 GAAMTHFLFMKPKTVFIQIIPIGTEWAAETYYGKPAKKMRLKYIGYKIKPKESSLYDEYG 447

Query: 346 ANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
            +D ++++P++F    W    ++YL+ QNVKLD+ RFR  L  AY
Sbjct: 448 KDDPIIRDPKSFTQKGWDYTKKIYLERQNVKLDLKRFRKPLSRAY 492


>gi|56409840|emb|CAI30145.1| glycosyltransferase [Medicago truncatula]
          Length = 541

 Score =  261 bits (667), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 146/431 (33%), Positives = 228/431 (52%), Gaps = 57/431 (13%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFF---SVGHTDSTPSQPSLRI----------- 58
           +I CDRS    D+C++ G       +S+ F   S+ H ++       R            
Sbjct: 100 TICCDRSGYRSDICVMKGDIRTHSSSSSIFLYNSISHGNNVSRTIEARKGEDEEDQVLQH 159

Query: 59  -KTQPYPRKSDKSAMSKVKELTITT---SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEF 114
            K +PY RK + S M  + EL + +   ++P    C V H  PA+ FS GGY GN +HEF
Sbjct: 160 EKIKPYTRKWETSVMDTIDELNLISKKVNSPSVRGCDVQHDVPAVFFSNGGYTGNVYHEF 219

Query: 115 MDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQ 174
            D  +PL+IT + HF ++ V+  I + ++ W  KY ++L   +  P IN +N   THCF 
Sbjct: 220 NDGIIPLYIT-SQHF-NKKVVFVILEYHEWWITKYGDILSHLSDFPPINFSNDNRTHCFP 277

Query: 175 SVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN----------------------- 211
              +GL  H  + ++  L+   K++V F++ L  AY+                       
Sbjct: 278 EAIVGLKIHDELAVDSALMEGNKSIVYFRNLLDEAYSPRIKGLIQDEEREAQEKLRQQQQ 337

Query: 212 --------ENTNTSSSFHH---TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIF 260
                    ++ TS        TKPKLV+V+R+    R I N   + K AEE+G  V + 
Sbjct: 338 QQISLSPSSDSETSQGLQEIARTKPKLVIVSRSG--SRAITNENLLVKMAEEIGLKVNVL 395

Query: 261 EPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKP 320
           +P+++T LA  +R ++    M+GVHGA +TH +F++P SV +QVVP+GT W +  Y+ +P
Sbjct: 396 KPQKTTELAKIYRVLNESDVMIGVHGAAMTHFMFMKPKSVFIQVVPLGTNWAADTYYGEP 455

Query: 321 ARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDID 379
           AR LGL+Y+ Y+I  +ESSL E+Y  +D +L++P++     W    ++YL +QNVKLD+ 
Sbjct: 456 ARKLGLKYIGYEIHPKESSLYERYDKSDPILRDPESINKKGWEYTKKIYLDSQNVKLDLR 515

Query: 380 RFRIYLKDAYK 390
           RFR  L  AY+
Sbjct: 516 RFRKRLHRAYE 526


>gi|255585488|ref|XP_002533436.1| glycosyltransferase, putative [Ricinus communis]
 gi|223526710|gb|EEF28943.1| glycosyltransferase, putative [Ricinus communis]
          Length = 533

 Score =  261 bits (667), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 145/426 (34%), Positives = 225/426 (52%), Gaps = 45/426 (10%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTST--FFSVGHTDS--TPSQPSLRIKTQPYPRKSD 68
           ++ CDR     D+C++ G       +S+   ++  +T S    ++     K +PY RK +
Sbjct: 104 TMCCDRRSFRSDICIMKGDIRTHSASSSVLLYTSRNTSSLIKDNEEIQHEKIKPYTRKWE 163

Query: 69  KSAMSKVKELTI---TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
            S M  + +L +      +  N  C V H  PA+ FS GGY GN +HEF D  VPL+IT 
Sbjct: 164 TSVMGTIDQLDLILKQEKSSVNHRCDVKHDVPAVFFSTGGYTGNVYHEFNDGIVPLYIT- 222

Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR 185
            S    + V+  I + +  W  KY ++L R + +P I+ +    THCF    +GL  H  
Sbjct: 223 -SQHLKRKVVFVILEYHTWWMMKYGDILSRLSDYPAIDYSGDKRTHCFPEAIVGLRIHDE 281

Query: 186 MVINPTLLPKPKTLVDFQSFLANAYN---------------------------------E 212
           + ++ +L+   K++VDF + L  AY                                  E
Sbjct: 282 LTVDSSLMKGNKSIVDFHNLLDKAYRPRIKGLIREEEHEALKKLKQKVLPLSPSSETLLE 341

Query: 213 NTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSF 272
                    H +PKLV+++RNA   R I N   + K AE +GF V +  PE +T LA  +
Sbjct: 342 FRKDVQESKHKRPKLVILSRNA--SRAITNEDLLVKMAEGIGFRVEVLRPERTTELAKIY 399

Query: 273 RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
           R ++S   M+GVHGA +TH LF++PGSV +QV+P+GT+W +  Y+ +PAR LGL+Y+ Y+
Sbjct: 400 RALNSSEVMIGVHGAAMTHFLFMKPGSVFIQVIPLGTEWAAETYYGEPARKLGLKYIGYQ 459

Query: 333 IKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKK 391
           I   ESSL +KY  ND VL++P + +   W   + +YL +QNV+LD++RF+  L  AY+ 
Sbjct: 460 ILPRESSLYDKYDKNDPVLQDPASISNKGWQYTKTIYLDSQNVRLDLERFQKQLVLAYEH 519

Query: 392 AKKLMD 397
           + K M+
Sbjct: 520 SMKKMN 525


>gi|302812177|ref|XP_002987776.1| glycosyltransferase belonging to CAZy family GT61 [Selaginella
           moellendorffii]
 gi|300144395|gb|EFJ11079.1| glycosyltransferase belonging to CAZy family GT61 [Selaginella
           moellendorffii]
          Length = 445

 Score =  260 bits (664), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 138/392 (35%), Positives = 228/392 (58%), Gaps = 19/392 (4%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + CD S+    +C + G       T T FS  +TD    +  L  K +PYPRK  K  + 
Sbjct: 63  VACDLSNYRTAVCRLAGDLRISGSTVTLFSPRNTD----EEILVQKIKPYPRKWQKQLLE 118

Query: 74  KVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
           K+ E+TI    ++S+ P   C V HT  A++FS GGY G+ +H+F D  +P++IT  SH 
Sbjct: 119 KISEVTIKVRRSSSSTPQHQCDVNHTQAAMIFSTGGYTGSVYHDFNDGLIPIYIT--SHG 176

Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININN---QTITHCFQSVTLGLISHGRM 186
            + +V+  +++    W +KY  ++ + T++P+ + ++   Q   HCF  V +GL  H  +
Sbjct: 177 FEGEVVFLVSEFQPWWMKKYGSIVKQMTKYPVQDFSSNPAQHRVHCFPKVVVGLDIHDEL 236

Query: 187 VINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
            IN   +   K++ DFQS L+ A++  +++ +    T+PKLV + R  R  R + N  EV
Sbjct: 237 AINAAKMSHGKSIRDFQSILSAAFSA-SDSRTKVPSTRPKLVFITR--RRTRVVTNEEEV 293

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            + AE  GFDV   EP  +  +A+ +  I S   ++GVHGA +TH LF+RPGS+L+Q+VP
Sbjct: 294 VQLAERAGFDVEALEPGFNHEMANLYGIIQSADVLLGVHGAAMTHLLFMRPGSLLLQIVP 353

Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGAN--WSN 364
           +GT+  S  ++  PA   G++Y+EY ++  ESSL++++G N  V+ +P    G++  WS+
Sbjct: 354 LGTKSPSRSFYGNPAMKAGMQYMEYIVEASESSLLKRFGQNHSVIVSPPENPGSSSGWSD 413

Query: 365 M-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
           M ++YL  QNV + + RF   L++A++K   L
Sbjct: 414 MKKIYLDKQNVTISLSRFEPVLREAFEKISSL 445


>gi|224086964|ref|XP_002308019.1| predicted protein [Populus trichocarpa]
 gi|222853995|gb|EEE91542.1| predicted protein [Populus trichocarpa]
          Length = 413

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 137/362 (37%), Positives = 207/362 (57%), Gaps = 35/362 (9%)

Query: 59  KTQPYPRKSDKSAMSKVKELTI---TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFM 115
           K +PY RK + S M  + EL +   T +   N  C V H  PA+ FS GGY GN +HEF 
Sbjct: 43  KVKPYTRKWETSVMDTIDELDLIVKTENFRNNHHCDVKHDVPAVFFSTGGYTGNVYHEFN 102

Query: 116 DCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQS 175
           D  +PL+IT + HF ++ V+  I + +D W  KY ++L   + +P I+ +    THCF  
Sbjct: 103 DGLLPLYIT-SQHF-NKKVVFVILEYHDWWIMKYGDILSHLSDYPAIDFSGDKRTHCFPE 160

Query: 176 VTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY-------------------NENTNT 216
             +GL  H  + ++P+L+   K++VDF++ L  AY                    +  ++
Sbjct: 161 AIVGLRIHDELTVDPSLMQGNKSVVDFRNVLDRAYLPRVQSLLKEEERLAQEKLKQKVHS 220

Query: 217 SSSFHHT--------KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSL 268
           SSS            +PKLV+++RN    R I N   + K AEE+GF V +  PE +T L
Sbjct: 221 SSSEIRKEVQDATLKRPKLVILSRNG--SRAITNEDLLVKMAEEIGFRVEVVRPERTTEL 278

Query: 269 ADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           A  +  ++S   M+GVHGA +TH LF+RPGSV +QV+P+GT+W +  Y+ +PAR LGL+Y
Sbjct: 279 ARIYGALNSSEVMIGVHGAAMTHFLFMRPGSVFIQVIPLGTEWAADAYYGEPARKLGLKY 338

Query: 329 LEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKD 387
           + Y+I   ESSL +KY  ND VL++P++ +   W   + +YL  QNV+L++ RF+  L  
Sbjct: 339 IGYQILPRESSLYDKYDRNDPVLRDPESVSEKGWQYTKSIYLDNQNVRLNLGRFQKRLLR 398

Query: 388 AY 389
           AY
Sbjct: 399 AY 400


>gi|326491913|dbj|BAJ98181.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326506118|dbj|BAJ91298.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 535

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 151/425 (35%), Positives = 217/425 (51%), Gaps = 51/425 (12%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
           P A     ++ CDR+    DLC++ G    D +T    +     +  +    RI+  PY 
Sbjct: 106 PCAGVADGTVCCDRTSPRADLCVMRG----DVRTHAASNSLFLLAAAAPADERIR--PYT 159

Query: 65  RKSDKSAMSKVKELTITTSAPPNLS---------CGVTHTSPALVFSAGGYNGNFFHEFM 115
           RK + + MS + EL +  + PP  S         C V H  PA+VFS GGY GN +HEF 
Sbjct: 160 RKWESTVMSTIDELRLR-AVPPEGSDAAAAGPAGCDVRHDVPAVVFSTGGYTGNVYHEFN 218

Query: 116 DCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQS 175
           D  +PL+IT   +  D+ V+  + + +D W  KY  ++ + +  P ++ +N T THCF  
Sbjct: 219 DGIIPLYITARRY--DRKVVFVMLEYHDWWITKYGHIVDQLSDFPPVDFSNDTRTHCFPE 276

Query: 176 VTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNT------------------- 216
             +GL  H  + I+ + +P  + ++DF+  L +A+    N                    
Sbjct: 277 AVVGLRIHDELAIDASRMPGNQGILDFRQILDDAHRGRINAIIDEEKASPLAAPAAAQAK 336

Query: 217 ---SSSFHHT--------KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEES 265
                S   T        KP+LV+V+RN    R I N  E+ +AA   GF VT+  P   
Sbjct: 337 KQQPRSISGTEELLEDEYKPRLVIVSRNG--SRAIENEDELARAAARAGFRVTVLRPRPD 394

Query: 266 TSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLG 325
           T LA  +R ++    MVGVHGA +TH LF+RPGS  +QVVP+GT W +  Y+ +PAR LG
Sbjct: 395 TELAQMYRVLNGSDVMVGVHGAAMTHFLFMRPGSAFIQVVPLGTDWAAENYYGEPARRLG 454

Query: 326 LEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIY 384
           L Y+ YKI   ESSL  +Y  +D VL +P A     W    +VYL  QNV+LD+ RFR  
Sbjct: 455 LRYIPYKILPSESSLFRRYARDDPVLTDPVAVNAKGWQVTKKVYLDGQNVRLDMARFRRR 514

Query: 385 LKDAY 389
           L DAY
Sbjct: 515 LSDAY 519


>gi|357116180|ref|XP_003559861.1| PREDICTED: uncharacterized protein LOC100829143 [Brachypodium
           distachyon]
          Length = 536

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 142/416 (34%), Positives = 217/416 (52%), Gaps = 35/416 (8%)

Query: 6   AALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSV-GHTDSTPS---QPSLRIKTQ 61
           AA+    ++ CDR+    D+C   G       +S+F  V G+ +S+P+   +     + +
Sbjct: 115 AAVRGNDTLCCDRTSERADMCFARGDVRMHSASSSFLLVSGNKESSPAPGKKEEQEERIR 174

Query: 62  PYPRKSDKSAMSKVKELTITT-----SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMD 116
           PY RK + + M+ + E+ I         P    C V H  PA++ S GG+ GN +HEF D
Sbjct: 175 PYTRKWEANVMATIDEVRIRRVHPAHGGPSAPRCDVVHDVPAVLLSTGGFTGNVYHEFND 234

Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
             +P+F+T  +H   + V+  I + +D W  KY +++ R +  P I+ +     HCF  +
Sbjct: 235 GLIPMFVTA-AHL-RRRVVFVILEYHDWWITKYGDVVSRLSAFPPIDFSADRRVHCFPEL 292

Query: 177 TLGLISHGRMVINPTLLPKPKT-LVDFQSFLANAYNEN--------------------TN 215
             GL  HG + ++P   P   T + DF++ L +AY                       + 
Sbjct: 293 IAGLRIHGELTVDPARTPDGATSIADFRALLDDAYRGRLLYLDRLAAARKHRRRPRRRSA 352

Query: 216 TSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFI 275
            +S     +P+L +V+R     R I N   V   A E+GF+V +  PE ST +   +R +
Sbjct: 353 INSVEIEKRPRLTIVSRTG--SRVIENEEAVVSLASEIGFEVRVIRPERSTEMCKIYREL 410

Query: 276 HSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQ 335
           +   AMVGVHGA +TH LF+RPG V +QVVP+GT W +  Y+ +PA  LGL Y+ YKI+ 
Sbjct: 411 NGSDAMVGVHGAAMTHFLFMRPGKVFIQVVPLGTDWAAGAYYGEPAARLGLRYVGYKIRP 470

Query: 336 EESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYK 390
           +ESSL  +Y A D VL +P A A   W    +VYL  QNV+LD+ RFR  L  A++
Sbjct: 471 DESSLAREYPAGDPVLVDPAAVAKRGWDVTKKVYLDRQNVRLDLARFREELVKAHR 526


>gi|414887942|tpg|DAA63956.1| TPA: hypothetical protein ZEAMMB73_235532 [Zea mays]
          Length = 555

 Score =  253 bits (647), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 143/419 (34%), Positives = 214/419 (51%), Gaps = 42/419 (10%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
           P +     S+ CDR+    D+C   G       +++F  V   +STP +     + +PY 
Sbjct: 112 PCSAVGNNSLCCDRTSERADICFARGDLRMHSASASFQLVSTGNSTPGE-----RIRPYT 166

Query: 65  RKSDKSAMSKVKELTITTSAPPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
           RK + + M+ + E+ +   AP + + C V H  PA+VFS GGY GN +HEF D  +PLF+
Sbjct: 167 RKWEANVMATIDEVRLRRVAPGDAARCDVRHDVPAVVFSTGGYTGNVYHEFNDGILPLFV 226

Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
           T   H   + V L I + +D W  KY +++ + +  P I+ +     HCF  V  GL  H
Sbjct: 227 TAR-HL-RRRVALVILEYHDWWMTKYGDVVSQLSEFPPIDFSADRRVHCFPEVIAGLRIH 284

Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYNEN------------------------------ 213
           G + ++P   P+ + + DF+ FL +AY                                 
Sbjct: 285 GELTVDPARTPERRGIGDFRRFLDDAYRGRIEFLERLERRAARRRPHRHHRGGALVPRAP 344

Query: 214 -TNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSF 272
                 +    +P+LV+V+R     R I N  +V   A ++GFDV +  P+ +T L   +
Sbjct: 345 PAGPREAEAERRPRLVIVSRTG--SRVIENEADVAALAADVGFDVRVIRPDRTTELCKIY 402

Query: 273 RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
           R +++  AMVGVHGA +TH LF+RPG V +QVVP+GT W +  Y+  PA  +GL Y+ YK
Sbjct: 403 RELNASDAMVGVHGAAMTHFLFMRPGKVFIQVVPLGTDWAAGAYYGDPAARMGLRYVGYK 462

Query: 333 IKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYK 390
           I  EESSL  +Y   D VL +P   A   W    +VYL  QNV+LD+ RFR  L  A++
Sbjct: 463 ILPEESSLSREYPTGDPVLTDPAGVAQRGWDVTKKVYLDRQNVRLDLARFREELVAAHR 521


>gi|242046804|ref|XP_002461148.1| hypothetical protein SORBIDRAFT_02g041650 [Sorghum bicolor]
 gi|241924525|gb|EER97669.1| hypothetical protein SORBIDRAFT_02g041650 [Sorghum bicolor]
          Length = 554

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 137/413 (33%), Positives = 212/413 (51%), Gaps = 39/413 (9%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           S  CDR+    D+C   G       +++F  V   +ST +      + +PY RK + + M
Sbjct: 128 SFCCDRTSERADICFAKGDLRMHSASASFQLVSAGNSTAAAEEEEERIRPYTRKWEANVM 187

Query: 73  SKVKELTITTSAPPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
           + + E+ +    P + + C V H  PA++FS GGY GN +HEF D  +PLF+T  +H   
Sbjct: 188 ATIDEVRLRRVVPGDAARCDVRHDVPAVLFSTGGYTGNVYHEFNDGILPLFVT--AHHLR 245

Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
           + V+L I + +D W  KY +++ + +  P I+       HCF  V  GL  HG + ++P 
Sbjct: 246 RRVVLVILEYHDWWMTKYGDVVSQLSAFPPIDFTADRRVHCFPEVIAGLRIHGELSVDPA 305

Query: 192 LLPKPKTLVDFQSFLANAYNENT---------------------------------NTSS 218
             P+ K++ DF+  L +AY                                     + + 
Sbjct: 306 RTPEGKSIGDFRRLLDDAYRGRIEFLERLERRTARKRLRLHRHRRGAVVPRAPPGPSHAD 365

Query: 219 SFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSC 278
                +P+LV+V+R     R I N  ++   A ++GFDV +  P+ +T L   +R +++ 
Sbjct: 366 DDRRPRPRLVIVSRTG--SRVIENEADLAALAADVGFDVRVIRPDRTTELCKIYRELNAS 423

Query: 279 HAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEES 338
            AM+GVHGA +TH LF+RPG V +QVVP+GT W +  Y+ +PA  +GL Y+ YKI  EES
Sbjct: 424 DAMIGVHGAAMTHFLFMRPGKVFIQVVPLGTDWAAGAYYGEPAARMGLRYVGYKISPEES 483

Query: 339 SLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYK 390
           SL  +Y   D VL +P   A   W    +VYL  QNV+LD+ RFR  L  A++
Sbjct: 484 SLSREYPTGDPVLTDPAGVAQRGWDVTKKVYLDRQNVRLDLARFREELVGAHR 536


>gi|356504769|ref|XP_003521167.1| PREDICTED: uncharacterized glycosyltransferase AGO61-like [Glycine
           max]
          Length = 537

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 145/431 (33%), Positives = 220/431 (51%), Gaps = 49/431 (11%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFF-----SVGHTDSTPSQPSLRIK 59
           P +     +I CDRS    D+C++ G       +S+ F     S  +      +     K
Sbjct: 96  PCSSVSTGTICCDRSGYRSDVCVMKGDIRTHSASSSVFLYNSRSNNNVSRNFEEELQHEK 155

Query: 60  TQPYPRKSDKSAMSKVKELTITTSAPPNLS---CGVTHTSPALVFSAGGYNGNFFHEFMD 116
            +PY RK + S M  + EL + +    + S   C V H  PA+ FS GGY GN +HEF D
Sbjct: 156 IKPYTRKWETSVMDTIDELNLISKKVNSGSVGGCDVQHDVPAVFFSNGGYTGNVYHEFND 215

Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
             VPL+IT + HF  + V+  I + ++ W  KY ++L R +  P I+      THCF   
Sbjct: 216 GIVPLYIT-SQHFKKK-VVFVILEYHNWWIMKYGDILSRLSDFPPIDFQGDNRTHCFTEA 273

Query: 177 TLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY---------NENTNTSSSFHHT---- 223
            +GL  H  + ++ +L+   K++ DF++ L  AY         +E               
Sbjct: 274 IVGLRIHDELTVDSSLMRGNKSIADFRNLLDKAYWPRIKGLIRDEERKAQEKLREQVSSS 333

Query: 224 -----------------------KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIF 260
                                  KPKLV+++R+    R I N   + K A+E+GF V + 
Sbjct: 334 ESSEASQQQYITIRQQVQENPMKKPKLVILSRSG--SRAITNENLLVKMAKEIGFLVQVL 391

Query: 261 EPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKP 320
           +P+ +T +A  +R +++   M+GVHGA +TH LFLRPGSV +QVVP+GT W +  Y+ +P
Sbjct: 392 KPDRTTEMAKIYRTLNASDVMIGVHGAAMTHFLFLRPGSVFIQVVPLGTTWAAETYYGEP 451

Query: 321 ARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDID 379
           AR LGL+Y+ Y+I   ES+L EKY  ND +L++P +     W    ++YL +QNV LD+ 
Sbjct: 452 ARKLGLKYIGYQILPRESTLYEKYDKNDPILRDPTSINKKGWEYTKKIYLDSQNVMLDLR 511

Query: 380 RFRIYLKDAYK 390
           RFR  L  AY+
Sbjct: 512 RFRKRLHRAYE 522


>gi|242090431|ref|XP_002441048.1| hypothetical protein SORBIDRAFT_09g019360 [Sorghum bicolor]
 gi|241946333|gb|EES19478.1| hypothetical protein SORBIDRAFT_09g019360 [Sorghum bicolor]
          Length = 546

 Score =  252 bits (643), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 143/404 (35%), Positives = 208/404 (51%), Gaps = 38/404 (9%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + CDRSH   D+C + G    DP TS+         + ++     K +PY RK + S MS
Sbjct: 129 LCCDRSHYRSDVCYLRGDVRTDPSTSSVLLYNAPRGSAAE-----KVRPYTRKFEDSIMS 183

Query: 74  KVKELTITTSAPP-NLS------------CGVTHTS--PALVFSAGGYNGNFFHEFMDCF 118
            + E+TI   A   N S            C V H    PA+VFS GGY GN +HEF D  
Sbjct: 184 TIDEVTIQPVAGAYNASASASDGGTLRRRCDVRHPRGVPAVVFSTGGYTGNVYHEFSDGL 243

Query: 119 VPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTL 178
           +PLF+T        +V+  + + +  W  +Y  +L + T + +++       HCF  + +
Sbjct: 244 IPLFVTAQRF--AGEVVFVVLEYHYWWLGRYGAVLEQLTNYKVVDFRYDRRVHCFDEMIV 301

Query: 179 GLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH-------------TKP 225
           GL  HG +V++P L+P  K++ DFQ+ L   Y+  T+++S                  KP
Sbjct: 302 GLRIHGELVVDPKLMPNGKSIKDFQALLHQGYSRTTSSASPPVPLPLAPPSRPCPRPAKP 361

Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           KL++  R  +  R +LNL  V  A    GF   +      T L      + S  AMV VH
Sbjct: 362 KLLIFIR--KQNRVLLNLPHVVTACRRAGFAPHVMNLRRQTPLPVIHAALASADAMVAVH 419

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYG 345
           GA +TH LF+RPGSVL+Q+VP+G  W +  ++ KPA+ LGLEYLEYK+  EESSL  +YG
Sbjct: 420 GAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLEYLEYKVAPEESSLAAEYG 479

Query: 346 ANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
            N  V++ P   +   W  M +VY+  QNV ++I RF   L+ A
Sbjct: 480 LNSTVVRTPWVISSRGWWEMKKVYMDRQNVTVNIKRFGELLRAA 523


>gi|168060869|ref|XP_001782415.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666086|gb|EDQ52750.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 142/389 (36%), Positives = 211/389 (54%), Gaps = 18/389 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFS---VGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           CDRS    D+C + G    D +  TF     +      P+  S+    +PY RK +KS M
Sbjct: 1   CDRSQFRTDICNMKG----DIRMLTFNGNKPIVLYAKDPATSSVTEIVKPYTRKWEKSCM 56

Query: 73  SKVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
             V E+T+      S      C V H  P +VFS  GY GN FHEF D  +PLFIT  S 
Sbjct: 57  DTVHEVTLRIVPANSQTDKTPCDVHHKVPGVVFSTSGYTGNLFHEFNDGLIPLFIT--SQ 114

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
               +V+  I + ++ W  KY E+L + +++ II+  N T  HCF  + +GL  H  + +
Sbjct: 115 HLKGEVVFIITEFHNWWLTKYFEVLQQLSQYEIISFENDTRVHCFPELEVGLHIHDDLTV 174

Query: 189 NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH-TKPKLVLVNRNARVGRTILNLREVK 247
           +P  +P  +++ DF+  L   Y       S     +KPKL ++ RN    R  LNL ++ 
Sbjct: 175 DPNRMPNHESIRDFRKLLDRGYENALRFDSPIPDVSKPKLSIIVRNGT--RKFLNLGDIV 232

Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
             AEELGF+V++  P+ +  L   F+ ++S   ++GVHGA +TH LF++PG VL+QV+P+
Sbjct: 233 TTAEELGFNVSLLSPDPTMELKRLFQLLNSTDVLMGVHGAAMTHFLFMKPGKVLIQVIPL 292

Query: 308 GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP-QAFAGANWSNM- 365
           G  W ST Y+ KP + +GL YL YKI   ESSL  +Y A+D +L NP + F    W  M 
Sbjct: 293 GIDWASTTYYGKPTKKMGLHYLPYKILPSESSLSRQYNASDPILVNPDEIFNQQGWWTMK 352

Query: 366 RVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
           +++L+ Q+V+  + R R     A KK K+
Sbjct: 353 KIFLEGQDVRPSLTRMRKIFMRALKKLKR 381


>gi|302765455|ref|XP_002966148.1| glycosyltransferase in CAZy family GT61 [Selaginella
           moellendorffii]
 gi|300165568|gb|EFJ32175.1| glycosyltransferase in CAZy family GT61 [Selaginella
           moellendorffii]
          Length = 486

 Score =  251 bits (642), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 134/395 (33%), Positives = 211/395 (53%), Gaps = 21/395 (5%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           +ITCD SH   D+C + G        +   ++   +S      +  K +PY RK +KS M
Sbjct: 96  NITCDFSHFRTDVCELRGDVRVSGGRAGKVALLDAESPQDSHQVVAKIKPYTRKWEKSCM 155

Query: 73  SKVKELTITT---SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
           + + E+++     S+ P++ C   H+ PA++ S GGY GN +HEF D  +PLFIT  SH 
Sbjct: 156 ATIGEVSLEILPLSSSPSMPCDTNHSVPAVILSTGGYTGNVYHEFNDGLIPLFIT--SHK 213

Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
              +V+  + + +  W  KY  ++ + + +P+ + +     HCF    LG+  H  + I 
Sbjct: 214 FHGEVVFLVLELHKWWMMKYGSIVSKLSNYPVQDFDRSKRIHCFPQAILGMQIHDELAI- 272

Query: 190 PTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH-----------TKPKLVLVNRNARVGR 238
            +  P+  ++ DFQ  L  + N   +                  + PKLVL+ R     R
Sbjct: 273 ASQAPEA-SMRDFQQLLKASLNSQKSPLKPMSRVAKVGNKVGGSSSPKLVLLARKG--SR 329

Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
            +LN   + + A+++GF V +  P   +SL +    +HS H MVGVHGA LTH LF+RP 
Sbjct: 330 VLLNQNALVRLAKKIGFRVVVLAPNSHSSLFELHEELHSAHVMVGVHGAALTHFLFMRPA 389

Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFA 358
           SV +Q+VP+GT+W +  Y+ +PA   GL YLEYKI  EESSLV+K G     + +P+   
Sbjct: 390 SVFIQIVPLGTEWAAQTYYGQPAMKAGLRYLEYKIVAEESSLVKKLGRESAAVAHPEEIT 449

Query: 359 GANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKA 392
              W  M ++YL+ Q+V L + RFR  L+ A+K++
Sbjct: 450 SKGWWEMKKIYLQNQDVMLSLHRFRPLLEQAFKES 484


>gi|413949498|gb|AFW82147.1| glycosyltransferase [Zea mays]
          Length = 543

 Score =  251 bits (641), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 145/408 (35%), Positives = 211/408 (51%), Gaps = 42/408 (10%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + CDRSH   D+C + G    DP TS+     +  +  S P    K +PY RK + S MS
Sbjct: 128 LCCDRSHYRSDVCYLRGDVRTDPSTSSVLL--YNAARGSGPE---KVRPYTRKFEDSVMS 182

Query: 74  KVKELTIT--------TSAPPNLSCGVTHTS--PALVFSAGGYNGNFFHEFMDCFVPLFI 123
            + E+TI         +++     C V H    PA+V S GGY GN +HEF D  +PLF+
Sbjct: 183 SIDEVTIVPVPGGHNASASALRRRCDVRHPRGVPAVVLSTGGYTGNVYHEFSDGLIPLFV 242

Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
           T        +V+L + + +  W  +Y  +L + T + +++  +    HCF  + +GL  H
Sbjct: 243 TAQRF--GGEVVLVVLEYHYWWLGRYGAVLEQLTNYKVVDFRSDRRVHCFDEMIVGLRIH 300

Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH--------------------- 222
           G +V++P L+P  K++ DFQ+ L  +Y+  T T S+                        
Sbjct: 301 GELVVDPKLMPNGKSIKDFQALLHRSYSRTTTTPSASPPVPVPLPLAPPSRSRSRACPRP 360

Query: 223 -TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAM 281
             KPKL++  R     R +LNL  V  A    GF   +      T LA     + S  AM
Sbjct: 361 AAKPKLLIFIRKQ--NRVLLNLPHVVTACRRAGFAPHVMNLRRQTPLAAIHAALASADAM 418

Query: 282 VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLV 341
           V VHGA +TH LF+RPGSVL+Q+VP+G  W +  ++ KPA+ LGLEYLEYK+  EESSL 
Sbjct: 419 VAVHGAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLEYLEYKVAPEESSLA 478

Query: 342 EKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
            +YG +  VL+NP   +   W  M +VY+  QNV ++I RF   L+ A
Sbjct: 479 AEYGLDSTVLRNPWVISSRGWWEMKKVYMDRQNVTVNIKRFGELLRTA 526


>gi|356572020|ref|XP_003554168.1| PREDICTED: uncharacterized glycosyltransferase AGO61-like [Glycine
           max]
          Length = 530

 Score =  250 bits (638), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 146/433 (33%), Positives = 218/433 (50%), Gaps = 54/433 (12%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFF-----SVGHTDSTPSQPSLRIK 59
           P +     +I CDRS    D+C++ G       +S+ F     S  +      +     K
Sbjct: 90  PCSSVSTGTICCDRSGYRSDVCVMKGDIRTHSASSSVFLYNSRSNNNVSRNFEEELQHEK 149

Query: 60  TQPYPRKSDKSAMSKVKELTITTSAPPNLS------CGVTHTSPALVFSAGGYNGNFFHE 113
            +PY RK + S M  + EL +    P NL+      C V H  PA+ FS GGY GN +HE
Sbjct: 150 IKPYTRKWETSVMDTIDELNL---VPKNLNLGGVGGCDVQHDVPAVFFSNGGYTGNVYHE 206

Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
           F D  +PL+IT + HF  + V+  I + +  W  KY ++L R +  P I+      THCF
Sbjct: 207 FNDGIIPLYIT-SQHF-KKKVVFVILEYHSWWIMKYGDILSRLSDFPPIDFRGDNRTHCF 264

Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY---------NENTNTSSSFHHT- 223
               +GL  H  + ++  L+   K++ DF++ L  AY         +E            
Sbjct: 265 PEAIVGLRIHDELTVDSALMRGNKSIADFRNLLDKAYWPRIKGLIRDEERKAQEKLREQV 324

Query: 224 -------------------------KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVT 258
                                    KP LV+++R+    R I N   + K A+E+GF V 
Sbjct: 325 SSSESSEASQQQYIIRQQVQENPTKKPTLVILSRSG--SRAITNENLLVKMAKEIGFLVQ 382

Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
           + +P+ +T LA  +R +++   M+GVHGA +TH LFLRPGSV +QVVP+GT W +  Y+ 
Sbjct: 383 VLKPDRTTELAKVYRSLNASDVMIGVHGAAMTHFLFLRPGSVFIQVVPLGTTWAAETYYG 442

Query: 319 KPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLD 377
           +PAR LGL+Y+ Y+I   ES+L EKY  ND +L++P +     W    ++YL +QNV LD
Sbjct: 443 EPARKLGLKYIGYQILPRESTLYEKYDKNDPILRDPTSINKKGWEYTKKIYLDSQNVMLD 502

Query: 378 IDRFRIYLKDAYK 390
           + RFR  L  AY+
Sbjct: 503 LRRFRKRLHRAYE 515


>gi|414591074|tpg|DAA41645.1| TPA: hypothetical protein ZEAMMB73_888474 [Zea mays]
          Length = 547

 Score =  250 bits (638), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 138/419 (32%), Positives = 214/419 (51%), Gaps = 40/419 (9%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTF--FSVGHTDSTPSQPSLRIKTQP 62
           P +     S+ CDR+    D+C   G       +++F   S G++ +   +     + +P
Sbjct: 119 PCSAVGNDSLCCDRTSERADICFAKGDVRMHSASASFQLVSAGNSSAAAEEEE---RIRP 175

Query: 63  YPRKSDKSAMSKVKELTITTSAPPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
           Y RK + + M+ + E+ +    P + + C V H +PA++FS GGY GN +HEF D  +PL
Sbjct: 176 YTRKWEANVMATIDEVRLRRVLPGDAARCDVRHDAPAVLFSTGGYTGNVYHEFNDGILPL 235

Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
           F+T  +H   + V+L I + +D W  KY +++ + +  P I+       HCF  +  GL 
Sbjct: 236 FVT--AHHLRRRVVLVILEYHDWWMTKYGDVVSQLSAFPPIDFAADRRVHCFPELIAGLR 293

Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAYNENTN-------------------------- 215
            HG + ++P   P+ K++ DF+  L +AY                               
Sbjct: 294 IHGELTVDPARTPEGKSIGDFRRLLDDAYRGRIGFLERLERRAARKRLRQRRHRRGGGGG 353

Query: 216 ---TSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSF 272
                      +P+LV+V+R     R I N  +V   A ++GFDV +  PE +T L   +
Sbjct: 354 APPAPGGPREDRPRLVIVSRTG--SRVIENEADVAALAADVGFDVRVIRPERTTELCKIY 411

Query: 273 RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
           R ++   AMVGVHGA +TH LF+RPG V +QVVP+GT W +  Y+ +PA  +GL Y+ Y+
Sbjct: 412 RELNGSDAMVGVHGAAMTHFLFMRPGKVFIQVVPLGTHWAAGAYYGEPAARMGLRYVGYR 471

Query: 333 IKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYK 390
           I  EESSL  +Y   D VL +P   A   W    +VYL  QNV+LD+ RFR  L  A++
Sbjct: 472 ILPEESSLSREYPTGDPVLTDPAGVAQRGWDVTKKVYLDRQNVRLDLARFREELVRAHR 530


>gi|219884207|gb|ACL52478.1| unknown [Zea mays]
          Length = 543

 Score =  250 bits (638), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 145/408 (35%), Positives = 210/408 (51%), Gaps = 42/408 (10%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + CDRSH   D+C + G    DP TS+     +  +  S P    K +PY RK + S MS
Sbjct: 128 LCCDRSHYRSDVCYLRGDVRTDPSTSSVLL--YNAARGSGPE---KVRPYTRKFEDSVMS 182

Query: 74  KVKELTIT--------TSAPPNLSCGVTHTS--PALVFSAGGYNGNFFHEFMDCFVPLFI 123
            + E+TI         +++     C V H    PA+V S GGY GN +HEF D  +PLF+
Sbjct: 183 SIDEVTIVPVPGGHNASASALRRRCDVRHPRGVPAVVLSTGGYTGNVYHEFSDGLIPLFV 242

Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
           T        +V+L +   +  W  +Y  +L + T + +++  +    HCF  + +GL  H
Sbjct: 243 TAQRF--GGEVVLVVLGYHYWWLGRYGAVLEQLTNYKVVDFRSDRRVHCFDEMIVGLRIH 300

Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH--------------------- 222
           G +V++P L+P  K++ DFQ+ L  +Y+  T T S+                        
Sbjct: 301 GELVVDPKLMPNGKSIKDFQALLHRSYSRTTTTPSASPPVPVPLPLAPPSRSRSRACPRP 360

Query: 223 -TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAM 281
             KPKL++  R     R +LNL  V  A    GF   +      T LA     + S  AM
Sbjct: 361 AAKPKLLIFIRKQ--NRVLLNLPHVVTACRRAGFAPHVMNLRRQTPLAAIHAALASADAM 418

Query: 282 VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLV 341
           V VHGA +TH LF+RPGSVL+Q+VP+G  W +  ++ KPA+ LGLEYLEYK+  EESSL 
Sbjct: 419 VAVHGAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLEYLEYKVAPEESSLA 478

Query: 342 EKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
            +YG +  VL+NP   +   W  M +VY+  QNV ++I RF   L+ A
Sbjct: 479 AEYGLDSTVLRNPWVISSRGWWEMKKVYMDRQNVTVNIKRFGELLRTA 526


>gi|326512050|dbj|BAJ96006.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 553

 Score =  248 bits (632), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 142/416 (34%), Positives = 207/416 (49%), Gaps = 44/416 (10%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + CDRSH   D+C + G A  DP TS+            +     K +PY RK ++S M 
Sbjct: 134 LCCDRSHYRSDVCYLRGDARTDPSTSSVLLYNAPRGAAPE-----KVRPYTRKFEESIMR 188

Query: 74  KVKELTIT------------TSAPPNLSCGVTHTS--PALVFSAGGYNGNFFHEFMDCFV 119
            + E+TI             T       C V H +  PA+VFS GGY GN +HEF D  +
Sbjct: 189 SIDEVTIVPVDSTGNGTDKDTRDSLRRRCDVRHPAGVPAVVFSTGGYTGNVYHEFSDGLI 248

Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLG 179
           PLF+T        +V+  + + +  W  +Y  +L R T + II+       HCF  + +G
Sbjct: 249 PLFVTAERF--GGEVVFVVLEYHYWWLGRYGAILERLTNYKIIDFRYDRRVHCFSEMIVG 306

Query: 180 LISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSF------------------- 220
           L  HG +V++P L+P  K++ DFQ+ L   Y+   + +S+                    
Sbjct: 307 LRIHGELVVDPKLMPNGKSIKDFQALLHQGYSGKPSATSAAPLPLPLATPSRPCVRPDDH 366

Query: 221 -HHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCH 279
               KPK+++  R  +  R ILNL  V  A    GF   +      T L      + S  
Sbjct: 367 AKAAKPKMLIFIR--KQNRVILNLPHVVTACRRAGFAPHVMNLRRQTPLPVIHAALASAD 424

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
           AMV VHGA +TH LF+RPG+VL+Q+VP+G  W +  ++ KPA+ LGLEYLEY++  EESS
Sbjct: 425 AMVAVHGAAVTHFLFMRPGTVLLQIVPVGLDWAAESFYGKPAQQLGLEYLEYRVAPEESS 484

Query: 340 LVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
           L  +YG N  V+++P   +   W  M +VY+  QNV + + RF   L+ A    KK
Sbjct: 485 LAAEYGLNSTVVQDPSVISSQGWWEMKKVYMDRQNVTVSVKRFGELLRAAKLHLKK 540


>gi|326496216|dbj|BAJ94570.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 553

 Score =  248 bits (632), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 142/416 (34%), Positives = 207/416 (49%), Gaps = 44/416 (10%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + CDRSH   D+C + G A  DP TS+            +     K +PY RK ++S M 
Sbjct: 134 LCCDRSHYRSDVCYLRGDARTDPSTSSVLLYNAPRGAAPE-----KVRPYTRKFEESIMR 188

Query: 74  KVKELTIT------------TSAPPNLSCGVTHTS--PALVFSAGGYNGNFFHEFMDCFV 119
            + E+TI             T       C V H +  PA+VFS GGY GN +HEF D  +
Sbjct: 189 SIDEVTIVPVDSTGNGTDKDTRDSLRRRCDVRHPAGVPAVVFSTGGYTGNVYHEFSDGLI 248

Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLG 179
           PLF+T        +V+  + + +  W  +Y  +L R T + II+       HCF  + +G
Sbjct: 249 PLFVTAERF--GGEVVFVVLEYHYWWLGRYGAILERLTNYKIIDFRYDRRVHCFSEMIVG 306

Query: 180 LISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSF------------------- 220
           L  HG +V++P L+P  K++ DFQ+ L   Y+   + +S+                    
Sbjct: 307 LRIHGELVVDPKLMPNGKSIKDFQALLHQGYSGKPSATSAAPLPLPLATPSRPCVRPDDH 366

Query: 221 -HHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCH 279
               KPK+++  R  +  R ILNL  V  A    GF   +      T L      + S  
Sbjct: 367 AKAAKPKMLIFIR--KQNRVILNLPHVVTACRRAGFAPHVMNLRRQTPLPVIHAALASAD 424

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
           AMV VHGA +TH LF+RPG+VL+Q+VP+G  W +  ++ KPA+ LGLEYLEY++  EESS
Sbjct: 425 AMVAVHGAAVTHFLFMRPGTVLLQIVPVGLDWAAESFYGKPAQQLGLEYLEYRVAPEESS 484

Query: 340 LVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
           L  +YG N  V+++P   +   W  M +VY+  QNV + + RF   L+ A    KK
Sbjct: 485 LAAEYGLNSTVVQDPSVISSQGWWEMKKVYMDRQNVTVSVKRFGELLRAAKLHLKK 540


>gi|297739632|emb|CBI29814.3| unnamed protein product [Vitis vinifera]
          Length = 486

 Score =  247 bits (630), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 142/388 (36%), Positives = 210/388 (54%), Gaps = 17/388 (4%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           +I CDRS    D C + G       +S+ F  G           +IK  PY RK + S M
Sbjct: 89  TICCDRSAFRTDTCFMKGDVRTHSSSSSIFLYGKEFEEEQLQHEKIK--PYTRKWEASVM 146

Query: 73  SKVKELTITTSAP---PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
             + EL + +       +  C V H  PA+ FS GGY GN +HEF D  +PL+IT  S  
Sbjct: 147 DTIDELHLISKKEGYGTHHHCDVHHDVPAVFFSTGGYTGNIYHEFNDGILPLYIT--SQH 204

Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
            ++ V+  I + +D W  KY +++ + + +P I+ +    THCF    +GL  H  + ++
Sbjct: 205 LNKRVVFVILEYHDWWITKYGDVISQLSDYPPIDFSGDNRTHCFPEAIVGLRIHDELTVD 264

Query: 190 PTLLPKPKTLVDFQSFLANAYNENTNT------SSSFHH-TKPKLVLVNRNARVGRTILN 242
            +L+   +++ DF++ L  AY     +          H   KPKLV+++R     R I N
Sbjct: 265 SSLVEGNESIRDFRNLLDQAYLPRIRSLIQAKEQKQVHQLKKPKLVVLSRTG--ARAITN 322

Query: 243 LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
              + + A+E+GF V +  P  +T LA  +R ++S  AMVGVHGA +TH LFL+PGSV +
Sbjct: 323 EDLMVQMAKEIGFQVKVLRPNRATELAKIYRVLNSSDAMVGVHGAAMTHILFLQPGSVFI 382

Query: 303 QVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW 362
           QV+P+GT+W +  Y+ +PA  LGL+Y+ YKI   ESSL  +Y  +D VL++P + A   W
Sbjct: 383 QVIPLGTEWPAETYYGEPAEKLGLKYMGYKILPRESSLYNEYAKDDPVLRDPDSLAKKGW 442

Query: 363 S-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
               RVYL  Q V LD+ RFR  L  AY
Sbjct: 443 EFTKRVYLDHQTVTLDLRRFRKQLVCAY 470


>gi|195616038|gb|ACG29849.1| glycosyltransferase [Zea mays]
          Length = 545

 Score =  246 bits (628), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 143/410 (34%), Positives = 209/410 (50%), Gaps = 44/410 (10%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + CDRSH   D+C + G    DP TS+     +  +  S P    K +PY RK + S MS
Sbjct: 128 LCCDRSHYRSDVCYLRGDVRTDPSTSSVLL--YNAARGSGPE---KVRPYTRKFEDSVMS 182

Query: 74  KVKELTIT--------TSAPPNLSCGVTHTS--PALVFSAGGYNGNFFHEFMDCFVPLFI 123
            + E+TI         +++     C V H    PA+V S GGY GN +HEF D  +PLF+
Sbjct: 183 SIDEVTIVPVPGGHNASASALRRRCDVRHPRGVPAVVLSTGGYTGNVYHEFSDGLIPLFV 242

Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
           T        +V+L + + +  W  +Y  +L + T + +++  +    HCF  + +GL  H
Sbjct: 243 TAQRF--GGEVVLVVLEYHYWWLGRYGAVLEQLTNYKVVDFRSDRRVHCFDEMIVGLRIH 300

Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAY------------------------NENTNTSSS 219
           G +V++P L+P  K++ DFQ+ L   Y                        + + + +  
Sbjct: 301 GELVVDPKLMPNGKSIKDFQALLHRGYSRTTTTTTPSASPPVPLPLPLAPLSRSRSRACP 360

Query: 220 FHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCH 279
               KPKL++  R     R +LNL  V  A    GF   +      T LA     + S  
Sbjct: 361 RPAAKPKLLIFIRKQ--NRVLLNLPHVVTACRRAGFAPHVMNLRRQTPLAAIHAALASAD 418

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
           AMV VHGA +TH LF+RPGSVL+QVVP+G  W +  ++ KPA+ LGLEYLEYK+  EESS
Sbjct: 419 AMVAVHGAAVTHFLFMRPGSVLLQVVPVGLDWAADAFYGKPAQQLGLEYLEYKVAPEESS 478

Query: 340 LVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
           L  +YG +  VL+NP   +   W  M +VY+  QNV ++I RF   L+ A
Sbjct: 479 LAAEYGLDSTVLRNPWVISSRGWWEMKKVYMDRQNVTVNIKRFGELLRTA 528


>gi|357129279|ref|XP_003566292.1| PREDICTED: uncharacterized protein LOC100836135 [Brachypodium
           distachyon]
          Length = 555

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 143/411 (34%), Positives = 207/411 (50%), Gaps = 45/411 (10%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + CDRSH   D+C + G A  DP TS+            +     K +PY RK + S MS
Sbjct: 131 LCCDRSHYRSDVCYLRGDARTDPSTSSVLLYNAPRGAAPE-----KVRPYTRKFEGSIMS 185

Query: 74  KVKELTITTSAPPN--------------LSCGVTHTS--PALVFSAGGYNGNFFHEFMDC 117
            + E+TI      N                C V H +  PA+VFS GGY GN +HEF D 
Sbjct: 186 TIDEVTILPVPDGNGTSDAASRDRDSLRRRCDVRHPAGVPAVVFSTGGYTGNVYHEFSDG 245

Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
            +PLFIT  S     +V+  + + +  W  +Y  +L R T + I++       HCF  + 
Sbjct: 246 LIPLFIT--SQRFAGEVVFVVLEYHYWWLGRYGAILERLTNYKIVDFRYDRRVHCFSEMI 303

Query: 178 LGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTS-------------------S 218
           +GL  HG +V++P L+P  K++ DFQ+ L   Y++   +S                   +
Sbjct: 304 VGLRIHGELVVDPKLMPNGKSIKDFQALLHQGYSKAPASSSSAPVPLPLVPLSRPCLDHA 363

Query: 219 SFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSC 278
           +    KPK+++  R  +  R ILNL  V  A    GF   +      T L      + S 
Sbjct: 364 TTKAAKPKMLIFIR--KQNRVILNLPHVVTACRRAGFAPHVMNLRRQTPLPVIHAALASA 421

Query: 279 HAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEES 338
            AMV VHGA +TH LF+RPG+VL+Q+VP+G  W +  ++ KPA+ LGLEYLEYK+  EES
Sbjct: 422 DAMVAVHGAAVTHFLFMRPGTVLLQIVPVGLDWAAESFYGKPAQQLGLEYLEYKVAPEES 481

Query: 339 SLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
           SL  +YG +  V+++P   +   W  M +VY+  QNV + I RF   L+ A
Sbjct: 482 SLAAEYGLDSTVVRDPSVISSRGWWEMKKVYMDRQNVTVGIKRFGELLRAA 532


>gi|125559459|gb|EAZ04995.1| hypothetical protein OsI_27175 [Oryza sativa Indica Group]
          Length = 433

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 143/418 (34%), Positives = 211/418 (50%), Gaps = 44/418 (10%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRI-KTQPYPRKSDKSA 71
           S+ CDR+    D+C   G       +++F  V  +    +  ++   + +PY RK + + 
Sbjct: 5   SLCCDRTSTRADICFARGDVRMHSASASFQLVSSSSGNATAAAVEEERIRPYTRKWEANV 64

Query: 72  MSKVKELTIT-TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
           M+ + E+ +    A     C V H  PA+VFS GGY GN +HEF D  +PLF+T N H  
Sbjct: 65  MATIDEVRLRRVPAGGAARCDVVHDVPAVVFSTGGYTGNVYHEFNDGILPLFVTSN-HL- 122

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
            + V+  I + +D W  KY +++ R +  P I+       HCF  V  GL  HG + ++P
Sbjct: 123 RRRVVFVILEYHDWWMTKYGDVVSRLSAFPPIDFTADRRVHCFPEVIAGLRIHGELTVDP 182

Query: 191 TLLPKPKTLVDFQSFLANAYNEN------------------------TNTS--------- 217
              P+ K++  F++ L +AY                           T TS         
Sbjct: 183 EKTPEGKSIRHFRTLLDDAYRGRIQYLERLERRAARSRKRRAAAAKPTTTSIALPIMAPP 242

Query: 218 ----SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
               S     +P+LV+V+R     R I N  +V   A ++GFDV +  PE +T L   +R
Sbjct: 243 AKQASPSPPDRPRLVIVSRTG--SRVIENEADVAALAADVGFDVRVVRPERTTELCKIYR 300

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKI 333
            +++  AMVGVHGA +TH LF+RPG V +QVVP+GT W +  Y+ +PA  LGL Y+ YKI
Sbjct: 301 ELNASDAMVGVHGAAMTHFLFMRPGKVFVQVVPLGTDWAAGAYYGEPAARLGLRYVGYKI 360

Query: 334 KQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYK 390
             EESSL  +Y   D VL +P       W    +VYL  QNV+LD+ RFR  L  A++
Sbjct: 361 LPEESSLSREYPTGDPVLTDPAGVGKRGWDVTKKVYLDRQNVRLDLPRFREVLVGAHR 418


>gi|125601368|gb|EAZ40944.1| hypothetical protein OsJ_25426 [Oryza sativa Japonica Group]
          Length = 555

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 143/418 (34%), Positives = 211/418 (50%), Gaps = 44/418 (10%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRI-KTQPYPRKSDKSA 71
           S+ CDR+    D+C   G       +++F  V  +    +  ++   + +PY RK + + 
Sbjct: 127 SLCCDRTSTRADICFARGDVRMHSASASFQLVSSSSGNATAAAVEEERIRPYTRKWEANV 186

Query: 72  MSKVKELTIT-TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
           M+ + E+ +    A     C V H  PA+VFS GGY GN +HEF D  +PLF+T N H  
Sbjct: 187 MATIDEVRLRRVPAGGAARCDVVHDVPAVVFSTGGYTGNVYHEFNDGILPLFVTSN-HL- 244

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
            + V+  I + +D W  KY +++ R +  P I+       HCF  V  GL  HG + ++P
Sbjct: 245 RRRVVFVILEYHDWWMTKYGDVVSRLSAFPPIDFTADRRVHCFPEVIAGLRIHGELTVDP 304

Query: 191 TLLPKPKTLVDFQSFLANAYNEN------------------------TNTS--------- 217
              P+ K++  F++ L +AY                           T TS         
Sbjct: 305 EKTPEGKSIRHFRTLLDDAYRGRIQYLERLERRAARSRKRRAAAAKPTTTSIALPIMAPP 364

Query: 218 ----SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
               S     +P+LV+V+R     R I N  +V   A ++GFDV +  PE +T L   +R
Sbjct: 365 AKQASPSPPDRPRLVIVSRTG--SRVIENEADVAALAADVGFDVRVVRPERTTELCKIYR 422

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKI 333
            +++  AMVGVHGA +TH LF+RPG V +QVVP+GT W +  Y+ +PA  LGL Y+ YKI
Sbjct: 423 ELNASDAMVGVHGAAMTHFLFMRPGKVFVQVVPLGTDWAAGAYYGEPAARLGLRYVGYKI 482

Query: 334 KQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYK 390
             EESSL  +Y   D VL +P       W    +VYL  QNV+LD+ RFR  L  A++
Sbjct: 483 LPEESSLSREYPTGDPVLTDPAGVGKRGWDVTKKVYLDRQNVRLDLPRFREVLVGAHR 540


>gi|115473825|ref|NP_001060511.1| Os07g0657400 [Oryza sativa Japonica Group]
 gi|22775615|dbj|BAC15469.1| symbiosis-related protein-like protein [Oryza sativa Japonica
           Group]
 gi|50510132|dbj|BAD31097.1| symbiosis-related protein-like protein [Oryza sativa Japonica
           Group]
 gi|113612047|dbj|BAF22425.1| Os07g0657400 [Oryza sativa Japonica Group]
          Length = 555

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 143/418 (34%), Positives = 211/418 (50%), Gaps = 44/418 (10%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRI-KTQPYPRKSDKSA 71
           S+ CDR+    D+C   G       +++F  V  +    +  ++   + +PY RK + + 
Sbjct: 127 SLCCDRTSTRADICFARGDVRMHSASASFQLVSSSSGNATAAAVEEERIRPYTRKWEANV 186

Query: 72  MSKVKELTIT-TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
           M+ + E+ +    A     C V H  PA+VFS GGY GN +HEF D  +PLF+T N H  
Sbjct: 187 MATIDEVRLRRVPAGGAARCDVVHDVPAVVFSTGGYTGNVYHEFNDGILPLFVTSN-HL- 244

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
            + V+  I + +D W  KY +++ R +  P I+       HCF  V  GL  HG + ++P
Sbjct: 245 RRRVVFVILEYHDWWMTKYGDVVSRLSAFPPIDFTADRRVHCFPEVIAGLRIHGELTVDP 304

Query: 191 TLLPKPKTLVDFQSFLANAYNEN------------------------TNTS--------- 217
              P+ K++  F++ L +AY                           T TS         
Sbjct: 305 EKTPEGKSIRHFRTLLDDAYRGRIQYLERLERRAARSRKRRAAAAKPTTTSIALPIMAPP 364

Query: 218 ----SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
               S     +P+LV+V+R     R I N  +V   A ++GFDV +  PE +T L   +R
Sbjct: 365 AKQASPSPPDRPRLVIVSRTG--SRVIENEADVAALAADVGFDVRVVRPERTTELCKIYR 422

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKI 333
            +++  AMVGVHGA +TH LF+RPG V +QVVP+GT W +  Y+ +PA  LGL Y+ YKI
Sbjct: 423 ELNASDAMVGVHGAAMTHFLFMRPGKVFVQVVPLGTDWAAGAYYGEPAARLGLRYVGYKI 482

Query: 334 KQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYK 390
             EESSL  +Y   D VL +P       W    +VYL  QNV+LD+ RFR  L  A++
Sbjct: 483 LPEESSLSREYPTGDPVLTDPAGVGKRGWDVTKKVYLDRQNVRLDLPRFREVLVGAHR 540


>gi|168015237|ref|XP_001760157.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162688537|gb|EDQ74913.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 463

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 131/399 (32%), Positives = 211/399 (52%), Gaps = 19/399 (4%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFD--PKTSTFFSVGHTDSTPSQPSLRIKTQP 62
           P  +   +   CDRS    D+C + G    +    T  F    +     +Q   +  T+P
Sbjct: 69  PGTVRDDQIFQCDRSQWRTDICNLRGDVQLEICNGTKAFVLYANQAVETTQLVRKETTKP 128

Query: 63  YPRKSDKSAMSKVKELTITTSAPPNLS-------CGVTHTSPALVFSAGGYNGNFFHEFM 115
           Y RK ++ +MS V E+T+      +L+       C V H  P +VFS  GY GN +HEF 
Sbjct: 129 YSRKWEEDSMSSVNEVTLLRMPALSLAAQATRRPCDVRHKVPGIVFSTAGYTGNLYHEFN 188

Query: 116 DCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQS 175
           D  +PLFIT  S    ++ +L I+  ++ W  KY E++ + T++ II++      HCF  
Sbjct: 189 DGIIPLFIT--SQHLRREAVLIISSFHNWWYSKYREVIEQITKYEIIDLERDERVHCFPE 246

Query: 176 VTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY-----NENTNTSSSFHHTKPKLVLV 230
           +  GL  HG + I+   +P  + + +F+  L  AY     +E+   +   +   P+L ++
Sbjct: 247 IETGLHIHGELSIDANRMPLKEGIQEFRDMLNRAYKPGPEDEHKIRARLKNKINPRLTII 306

Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
            R     R +LNL +V   AE +GF V +  P+ +  L   F  +++   ++GVHGA +T
Sbjct: 307 VRQG--TRKLLNLDDVIHLAERIGFKVHLLTPDPTMELKKIFWLLNNTDVLLGVHGAAMT 364

Query: 291 HSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
           H LF+RPGSV +Q+VP+GT W +  YF +P   LGL+Y+ YKI+ +ESSL + Y A D V
Sbjct: 365 HFLFMRPGSVFIQIVPLGTDWAANEYFGEPVSKLGLKYMPYKIQPDESSLSDIYNATDPV 424

Query: 351 LKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
           L +P       W ++ ++YL+ Q+V+  + R R  L+ A
Sbjct: 425 LVDPDRITQRGWGDLKKIYLEAQDVRPSLHRLRQVLQQA 463


>gi|224135457|ref|XP_002322078.1| predicted protein [Populus trichocarpa]
 gi|222869074|gb|EEF06205.1| predicted protein [Populus trichocarpa]
          Length = 378

 Score =  242 bits (617), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 129/368 (35%), Positives = 214/368 (58%), Gaps = 15/368 (4%)

Query: 24  DLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTI--T 81
           D C I G    D K+ T F V       +  +     +PY RK D++AM  V+E T+   
Sbjct: 5   DFCEIKGDIRIDGKSYTVFIVSSETDILTAENTSWSIRPYARKGDQAAMGAVREWTVKLV 64

Query: 82  TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADC 141
           T A     C   H+ PA++FSAGGY GN FH F D  +PLF+T   +  + ++   I + 
Sbjct: 65  TVASDIPQCTQNHSVPAILFSAGGYAGNHFHAFTDVILPLFLTSRPY--NGEIQFLITNG 122

Query: 142 NDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI--SHGRMVINPTLLPKPKTL 199
              W  K+  ++   +R+ +I+I+N    HC+ S+T+GL   ++  + I+P     P ++
Sbjct: 123 RPAWISKFETIMKALSRYQLISIDNSQDIHCYDSMTVGLKRRTNKELSIDPDPSSSPYSM 182

Query: 200 VDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNRNARVGRTILNLREVKKAAEELGFD 256
            DF+ FL ++Y+     ++   +    +P+L++++R  +  R   N+ E+   A+ LG+ 
Sbjct: 183 KDFRKFLRSSYSLKKAMATKIRNGSKKRPRLLIISR--KRSRAFTNVGEIVTMAKRLGYR 240

Query: 257 VTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTV 315
           V + EP+   S     + I+SC  ++GVHGAGLT+ +FL   +VL+QV+P G T+WLS  
Sbjct: 241 VVVAEPDADVS--GFAQIINSCDVVMGVHGAGLTNIVFLPENAVLVQVIPFGGTEWLSRT 298

Query: 316 YFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNV 374
           YFE+PA+ + + YL+YKI+ EES+L+++Y A+ +VL++P A     WS +  +YL+ QNV
Sbjct: 299 YFEEPAKGMNIRYLDYKIRLEESTLIQQYPADHVVLRDPSAIWKQGWSAVESIYLRQQNV 358

Query: 375 KLDIDRFR 382
            L+++RFR
Sbjct: 359 TLNVNRFR 366


>gi|326520289|dbj|BAK07403.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525054|dbj|BAK07797.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 516

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 141/405 (34%), Positives = 209/405 (51%), Gaps = 18/405 (4%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSV------GHTDSTPSQPSLRI 58
           P A      I CDRS  + D+C + G    D  + +F         G  D+  +      
Sbjct: 111 PCASMGGHGICCDRSDFNTDVCFMAGDVRTDAASLSFLLFPPQPPNGTADANATVAEKEE 170

Query: 59  KTQPYPRKSDKSAMSKVKELTITTSAPPNLS---CGVTHTSPALVFSAGGYNGNFFHEFM 115
           + QPY RK +K  M+K+ E+ +  + P       C V H +PALV +AGGY GN FH F 
Sbjct: 171 RVQPYTRKWEKHLMAKIHEVRLRAARPDEADAHRCDVRHDAPALVMTAGGYTGNLFHAFN 230

Query: 116 DCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQS 175
           D F+P ++T+  H   + V+LA+ + N  WA  + EL+   +   +I++     THCF  
Sbjct: 231 DGFLPAWLTVQ-HL-RRRVVLAVLEYNPWWAGTFRELVSGLSDRLVIDLLRDNRTHCFPG 288

Query: 176 VTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT--KPKLVLVNRN 233
             +G   HG + + P      +TLVDF  FLA AY ++ +T         +P+L L +R 
Sbjct: 289 AIVGTRFHGILSVEPARTRDNRTLVDFHDFLAGAYKDDKSTPEPEKQQPRRPRLGLYSRK 348

Query: 234 ARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSL 293
               R I N   V + AE +GFDV+I        L+  +  + +C  +VGVHGA LT  L
Sbjct: 349 GN--RMIENEAAVARLAESVGFDVSILATANGAPLSSEYAAVSACDVLVGVHGADLTKLL 406

Query: 294 FLRPG-SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLK 352
           FLRPG + L+QV P+G   ++   +EK   ++G+ Y +Y     ESSLV KY A+D+VL+
Sbjct: 407 FLRPGHAALLQVAPLGVPHVARGCYEKATNMMGIHYEQYDAAANESSLVRKYAADDVVLR 466

Query: 353 NPQAFAGA-NWS-NMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
           +P+A      W     VYL  QNV LD+ RF   L+  + +A +L
Sbjct: 467 DPEAATRERGWDLTAHVYLGGQNVTLDLGRFGDVLRKLHSRALRL 511


>gi|115463691|ref|NP_001055445.1| Os05g0391600 [Oryza sativa Japonica Group]
 gi|54287512|gb|AAV31256.1| unknown protein [Oryza sativa Japonica Group]
 gi|113578996|dbj|BAF17359.1| Os05g0391600 [Oryza sativa Japonica Group]
 gi|215766743|dbj|BAG98971.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 581

 Score =  241 bits (615), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 143/429 (33%), Positives = 210/429 (48%), Gaps = 62/429 (14%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           ++ CDRSH   D+C + G    DP TS+         +  +     K +PY RK + S M
Sbjct: 141 ALCCDRSHYRSDVCYLRGDVRTDPSTSSVLLYNAPRGSAPE-----KVRPYTRKFEGSIM 195

Query: 73  SKVKELTI------------TTSAPPN-----LSCGVTHTS--PALVFSAGGYNGNFFHE 113
           S + E+TI            TT+           C V H    PA+VFS GGY GN +HE
Sbjct: 196 STIDEVTIVPVVDAGSGSNGTTAGDTGKDSLRRRCDVRHPPGVPAVVFSTGGYTGNVYHE 255

Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
           F D  +PLFIT        +V+  + + +  W  +Y  +L R T + +++       HCF
Sbjct: 256 FSDGLIPLFITAQRF--AGEVVFVVLEYHYWWLGRYGAVLERLTNYKVVDFRYDRRVHCF 313

Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH----------- 222
             + +GL  HG +V++P L+P  K + DFQ+ L   Y+   + +++              
Sbjct: 314 SEMIVGLRIHGELVVDPKLMPNGKGIQDFQALLHQGYSRTPSATAAAAAAQPPVPLALAA 373

Query: 223 ---------------TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTS 267
                           KPKLV+  R  +  R +LNL  +  A    GF   +      T 
Sbjct: 374 PPSRPCLRPDDHAKVAKPKLVIFIR--KQNRVLLNLPHIVTACRRAGFAPHVMNLRRQTP 431

Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE 327
           L      + S  AMV VHGA +TH LF+RPGSVL+Q+VP+G  W +  ++ KPA+ LGL 
Sbjct: 432 LPAIHAALSSADAMVAVHGAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLG 491

Query: 328 YLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRF----- 381
           YLEYK+  EESSL  +YG N  V+++P   +   W  M +VY+  QNV ++I RF     
Sbjct: 492 YLEYKVAPEESSLAAEYGVNSTVVRDPSVISSRGWWEMKKVYMDRQNVTVNIKRFGELLR 551

Query: 382 --RIYLKDA 388
             R++LK+A
Sbjct: 552 SARLHLKNA 560


>gi|125580725|gb|EAZ21656.1| hypothetical protein OsJ_05289 [Oryza sativa Japonica Group]
          Length = 510

 Score =  237 bits (605), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 130/382 (34%), Positives = 214/382 (56%), Gaps = 17/382 (4%)

Query: 15  TCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
           TC  + +  + C   G    +   ST + +G  D          KT+PY R  D  AM+ 
Sbjct: 137 TCRMTSKRSERCEARGDIRVEGNASTIY-IGGIDK-------EWKTKPYARYHDPVAMAV 188

Query: 75  VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
           V+E T+      + +C   H+ PA VFS GG++GN +H++ D  VPLF++ +  F  Q V
Sbjct: 189 VREFTLKPVTESSPACTRNHSVPAFVFSNGGFSGNLYHDYTDVLVPLFLSTH-QFKGQ-V 246

Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLP 194
              ++     W  K+     + T++ I++I+N    HCF  + +G   H  M ++P   P
Sbjct: 247 QFLLSGLKPWWVNKFNLFFRQLTKYDILDIDNDKDVHCFPRIVVGATFHKDMGVDPKRSP 306

Query: 195 KPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAE 251
              ++VDF+  L  A+      +S    T   KP+L++++R  +  R  LN RE+ +AA 
Sbjct: 307 GHVSVVDFKRALRRAFGLERVAASRGGATGNGKPRLLIISR--KNSRRFLNEREMAQAAA 364

Query: 252 ELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-Q 310
            +GF+V I EP++ T ++   + ++S   M+GVHGAGLT+ +FL  G+VL+QVVP G  +
Sbjct: 365 AVGFEVRIAEPDQHTDMSTFAQLVNSADVMIGVHGAGLTNMVFLPRGAVLIQVVPFGGLE 424

Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
           WL+TV F+ PA+ + + Y++Y ++ EESSL+++Y  N  VL +P A     W  ++  YL
Sbjct: 425 WLTTVTFKNPAKDMEVTYMDYNVQLEESSLIDQYPRNHQVLTDPYAVHKQGWDALKTAYL 484

Query: 370 KTQNVKLDIDRFRIYLKDAYKK 391
             QN+K+D+DRF+  L++A  +
Sbjct: 485 DKQNIKMDMDRFKKTLQEALDR 506


>gi|115444075|ref|NP_001045817.1| Os02g0135500 [Oryza sativa Japonica Group]
 gi|42408837|dbj|BAD10097.1| putative HGA1 [Oryza sativa Japonica Group]
 gi|42409107|dbj|BAD10357.1| putative HGA1 [Oryza sativa Japonica Group]
 gi|113535348|dbj|BAF07731.1| Os02g0135500 [Oryza sativa Japonica Group]
 gi|125537988|gb|EAY84383.1| hypothetical protein OsI_05759 [Oryza sativa Indica Group]
 gi|215686733|dbj|BAG89583.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 485

 Score =  237 bits (604), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 130/382 (34%), Positives = 214/382 (56%), Gaps = 17/382 (4%)

Query: 15  TCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
           TC  + +  + C   G    +   ST + +G  D          KT+PY R  D  AM+ 
Sbjct: 112 TCRMTSKRSERCEARGDIRVEGNASTIY-IGGIDK-------EWKTKPYARYHDPVAMAV 163

Query: 75  VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
           V+E T+      + +C   H+ PA VFS GG++GN +H++ D  VPLF++ +  F  Q V
Sbjct: 164 VREFTLKPVTESSPACTRNHSVPAFVFSNGGFSGNLYHDYTDVLVPLFLSTH-QFKGQ-V 221

Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLP 194
              ++     W  K+     + T++ I++I+N    HCF  + +G   H  M ++P   P
Sbjct: 222 QFLLSGLKPWWVNKFNLFFRQLTKYDILDIDNDKDVHCFPRIVVGATFHKDMGVDPKRSP 281

Query: 195 KPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAE 251
              ++VDF+  L  A+      +S    T   KP+L++++R  +  R  LN RE+ +AA 
Sbjct: 282 GHVSVVDFKRALRRAFGLERVAASRGGATGNGKPRLLIISR--KNSRRFLNEREMAQAAA 339

Query: 252 ELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-Q 310
            +GF+V I EP++ T ++   + ++S   M+GVHGAGLT+ +FL  G+VL+QVVP G  +
Sbjct: 340 AVGFEVRIAEPDQHTDMSTFAQLVNSADVMIGVHGAGLTNMVFLPRGAVLIQVVPFGGLE 399

Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
           WL+TV F+ PA+ + + Y++Y ++ EESSL+++Y  N  VL +P A     W  ++  YL
Sbjct: 400 WLTTVTFKNPAKDMEVTYMDYNVQLEESSLIDQYPRNHQVLTDPYAVHKQGWDALKTAYL 459

Query: 370 KTQNVKLDIDRFRIYLKDAYKK 391
             QN+K+D+DRF+  L++A  +
Sbjct: 460 DKQNIKMDMDRFKKTLQEALDR 481


>gi|226532070|ref|NP_001147864.1| glycosyltransferase [Zea mays]
 gi|195614204|gb|ACG28932.1| glycosyltransferase [Zea mays]
          Length = 521

 Score =  236 bits (603), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 140/416 (33%), Positives = 214/416 (51%), Gaps = 30/416 (7%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTST---FFSVGHTDSTPSQPSLRIKTQPYPRKSDK 69
           +I CDRS    D+C + G    D  + +   F       +  S+     + +PYPRK ++
Sbjct: 110 TICCDRSSVHTDVCFMAGDVRTDAASLSLLLFPPHQQAQNGTSEEDKEERVRPYPRKWER 169

Query: 70  SAMSKVKELTITTSAP--PNLS-------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
             M KV E+ +  +AP  P+         C V H +P LV SAGGY GN FH F D F+P
Sbjct: 170 FIMDKVPEVRLRVAAPRRPDGEREEVEHRCDVRHDAPLLVMSAGGYTGNLFHAFNDGFLP 229

Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
            ++T+  H   + V+L +   N  WA  + E++   + HP++++ + T THCF    +G 
Sbjct: 230 SWLTVQ-HL-RRRVVLGVVSYNPWWAGMFVEVISGLSDHPVVDLLHDTRTHCFPGAIVGT 287

Query: 181 ISHGRMVINPTLLPKPKTLVDFQSFLANAYN----------ENTNTSSSFHHTKPKLVLV 230
             HG ++++P  L   KT+VDF   LA+AY           +  +   +    +P+L +V
Sbjct: 288 RYHGILIVDPARLRDNKTIVDFHQMLADAYEKPPRETTRPAQQQDLRDAEQRRRPRLGIV 347

Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
           +R     R I N   V + A  +GFDV I E      L+  +  + +C A+VGVHGA LT
Sbjct: 348 SRKGT--RVIENQAAVARLASSVGFDVDILETANGLPLSAWYASLRACDALVGVHGADLT 405

Query: 291 HSLFLRPG-SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDL 349
             LFLRPG + L Q+ P+G   ++   F +PA  +GL Y +Y+++  ESSL  +Y   D 
Sbjct: 406 KFLFLRPGRASLTQIAPLGVSPIAREDFGEPAARMGLAYEQYEVRAGESSLARRYAPGDA 465

Query: 350 VLKNPQAFA--GANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDKEIKL 402
           VL +P+A       W  + RVYL  QNV LD+ RFR  L   +  A +   ++ +L
Sbjct: 466 VLTDPEAAKRDKGGWDLVARVYLGGQNVTLDLARFRQTLARMHAHAMRQRRRQERL 521


>gi|413945240|gb|AFW77889.1| hypothetical protein ZEAMMB73_009888 [Zea mays]
          Length = 555

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 142/415 (34%), Positives = 210/415 (50%), Gaps = 48/415 (11%)

Query: 6   AALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPR 65
           AA      + CDRSH   D+C + G    DP T++     +     S P    K +PY R
Sbjct: 124 AAADAEGVLCCDRSHYRSDVCYLRGDVRTDPSTTSVLL--YNAPRGSAPE---KVRPYTR 178

Query: 66  KSDKSAMSKVKELTITTS--------APPNLS---------CGVTHTS--PALVFSAGGY 106
           K + S MS + E+TI           A  N+S         C V H     A+VFS GGY
Sbjct: 179 KFEGSVMSSIDEVTIVPVPPGGGGGGAHSNMSVGVGGGGRRCDVRHPRGVAAVVFSTGGY 238

Query: 107 NGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININN 166
            GN +HEF D  VPLF+T        +V+  + + +  W  +Y  +L + T + +++  +
Sbjct: 239 TGNVYHEFSDGLVPLFVTAQRF--AGEVVFVVLEYHYWWLGRYGAVLEQLTNYRVVDFRH 296

Query: 167 QTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH---- 222
               HCF  + +GL  HG +V++P L+P  K++ DFQ+ L   Y+  T T+S+       
Sbjct: 297 DRRVHCFDEMVVGLRIHGELVVDPKLMPDGKSIKDFQALLHQGYSRTTTTASASSSSPTP 356

Query: 223 ---------------TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTS 267
                           KPK+++  R  +  R +LNL +V  A    GF   +      T 
Sbjct: 357 LLPLAPPTRPCPRSANKPKMLIFIR--KQNRVLLNLPQVVTACRRAGFAPHVMNLRRQTP 414

Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE 327
           LA     + S  AMV VHGA +TH LF+RPGSVL+Q+VP+G  W +  ++ KPA+ LGL+
Sbjct: 415 LAAIHAALASADAMVAVHGAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLD 474

Query: 328 YLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRF 381
           YLEY++  EESSL  +YG N  V++ P   +   W  M +VY+  QNV +++ RF
Sbjct: 475 YLEYRVAPEESSLAAEYGLNSTVVRTPWVISSRGWWEMKKVYMDRQNVTVNVKRF 529


>gi|225441880|ref|XP_002284307.1| PREDICTED: uncharacterized protein LOC100245312 isoform 1 [Vitis
           vinifera]
          Length = 529

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 144/429 (33%), Positives = 212/429 (49%), Gaps = 56/429 (13%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVG---------HTDSTPSQPS-------L 56
           +I CDRS    D C + G       +S+ F  G         H  S   +          
Sbjct: 89  TICCDRSAFRTDTCFMKGDVRTHSSSSSIFLYGSRNPNAFTDHVSSISGEGKEFEEEQLQ 148

Query: 57  RIKTQPYPRKSDKSAMSKVKELTITTSAP---PNLSCGVTHTSPALVFSAGGYNGNFFHE 113
             K +PY RK + S M  + EL + +       +  C V H  PA+ FS GGY GN +HE
Sbjct: 149 HEKIKPYTRKWEASVMDTIDELHLISKKEGYGTHHHCDVHHDVPAVFFSTGGYTGNIYHE 208

Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
           F D  +PL+IT  S   ++ V+  I + +D W  KY +++ + + +P I+ +    THCF
Sbjct: 209 FNDGILPLYIT--SQHLNKRVVFVILEYHDWWITKYGDVISQLSDYPPIDFSGDNRTHCF 266

Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY----------------------- 210
               +GL  H  + ++ +L+   +++ DF++ L  AY                       
Sbjct: 267 PEAIVGLRIHDELTVDSSLVEGNESIRDFRNLLDQAYLPRIRSLIQAKEQKVQSKMKEDP 326

Query: 211 --------NENTNTSSSFHH-TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFE 261
                      T      H   KPKLV+++R     R I N   + + A+E+GF V +  
Sbjct: 327 SLPPSLKPPLETGKEEQVHQLKKPKLVVLSRTG--ARAITNEDLMVQMAKEIGFQVKVLR 384

Query: 262 PEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPA 321
           P  +T LA  +R ++S  AMVGVHGA +TH LFL+PGSV +QV+P+GT+W +  Y+ +PA
Sbjct: 385 PNRATELAKIYRVLNSSDAMVGVHGAAMTHILFLQPGSVFIQVIPLGTEWPAETYYGEPA 444

Query: 322 RVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDR 380
             LGL+Y+ YKI   ESSL  +Y  +D VL++P + A   W    RVYL  Q V LD+ R
Sbjct: 445 EKLGLKYMGYKILPRESSLYNEYAKDDPVLRDPDSLAKKGWEFTKRVYLDHQTVTLDLRR 504

Query: 381 FRIYLKDAY 389
           FR  L  AY
Sbjct: 505 FRKQLVCAY 513


>gi|242060410|ref|XP_002451494.1| hypothetical protein SORBIDRAFT_04g002850 [Sorghum bicolor]
 gi|241931325|gb|EES04470.1| hypothetical protein SORBIDRAFT_04g002850 [Sorghum bicolor]
          Length = 499

 Score =  235 bits (600), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 130/386 (33%), Positives = 209/386 (54%), Gaps = 24/386 (6%)

Query: 15  TCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
           TC  + +  + C  +G    D   S  +  G             KT+PY R  D  AM+ 
Sbjct: 125 TCYMTSKRSERCDASGDIRVDGNRSAIYVSGIDKE--------WKTKPYARYHDPVAMAH 176

Query: 75  VKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
           V+E T+       AP   +C   H+ P  +FS GG++GN +H++ D  VPLFI+  +H  
Sbjct: 177 VREYTLKPLPAAEAP---ACTRNHSVPGFLFSNGGFSGNLYHDYTDVLVPLFIS--THQF 231

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
              V   ++     W  K+     + T++ +I+++N    HCF  +  G   H  M ++P
Sbjct: 232 RGRVQFLLSGMKPWWVAKFTPFFRQLTKYDVIDVDNDQEVHCFPRIVAGATFHKDMGVDP 291

Query: 191 TLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
              P   ++VDF+  L  A+    E  +   +  H KP+L++++R  R  R  LN RE+ 
Sbjct: 292 RRSPGHVSVVDFKRALRRAFGLEREAASRGGATGHGKPRLLIISR--RGSRRFLNEREMA 349

Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
           +AA + GF+V + EP++ T +A     ++S   MVGVHGAGLT+ +FL  G+VL+QVVP 
Sbjct: 350 RAAADAGFEVRVAEPDQHTDMATFAALVNSADVMVGVHGAGLTNMVFLPRGAVLIQVVPF 409

Query: 308 GT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
           G  +WL++V F+ PA  + + Y++Y +K EESSL+++Y  N  VL +P A     W  ++
Sbjct: 410 GGLEWLTSVTFKDPAADMEVNYMDYNVKLEESSLLDQYPRNHQVLTDPYAVHKQGWDALK 469

Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKK 391
             YL  QN+++D+DRFR  L++A  +
Sbjct: 470 TAYLDKQNIRMDLDRFRATLREAMSR 495


>gi|42408838|dbj|BAD10098.1| putative HGA1 [Oryza sativa Japonica Group]
 gi|42409108|dbj|BAD10358.1| putative HGA1 [Oryza sativa Japonica Group]
          Length = 371

 Score =  235 bits (599), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 132/383 (34%), Positives = 214/383 (55%), Gaps = 21/383 (5%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           +T  RS R    C   G    +   ST + +G  D          KT+PY R  D  AM+
Sbjct: 1   MTSKRSER----CEARGDIRVEGNASTIY-IGGIDK-------EWKTKPYARYHDPVAMA 48

Query: 74  KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            V+E T+      + +C   H+ PA VFS GG++GN +H++ D  VPLF++ +  F  Q 
Sbjct: 49  VVREFTLKPVTESSPACTRNHSVPAFVFSNGGFSGNLYHDYTDVLVPLFLSTH-QFKGQ- 106

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           V   ++     W  K+     + T++ I++I+N    HCF  + +G   H  M ++P   
Sbjct: 107 VQFLLSGLKPWWVNKFNLFFRQLTKYDILDIDNDKDVHCFPRIVVGATFHKDMGVDPKRS 166

Query: 194 PKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAA 250
           P   ++VDF+  L  A+      +S    T   KP+L++++R  +  R  LN RE+ +AA
Sbjct: 167 PGHVSVVDFKRALRRAFGLERVAASRGGATGNGKPRLLIISR--KNSRRFLNEREMAQAA 224

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT- 309
             +GF+V I EP++ T ++   + ++S   M+GVHGAGLT+ +FL  G+VL+QVVP G  
Sbjct: 225 AAVGFEVRIAEPDQHTDMSTFAQLVNSADVMIGVHGAGLTNMVFLPRGAVLIQVVPFGGL 284

Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VY 368
           +WL+TV F+ PA+ + + Y++Y ++ EESSL+++Y  N  VL +P A     W  ++  Y
Sbjct: 285 EWLTTVTFKNPAKDMEVTYMDYNVQLEESSLIDQYPRNHQVLTDPYAVHKQGWDALKTAY 344

Query: 369 LKTQNVKLDIDRFRIYLKDAYKK 391
           L  QN+K+D+DRF+  L++A  +
Sbjct: 345 LDKQNIKMDMDRFKKTLQEALDR 367


>gi|224118736|ref|XP_002317893.1| predicted protein [Populus trichocarpa]
 gi|222858566|gb|EEE96113.1| predicted protein [Populus trichocarpa]
          Length = 383

 Score =  234 bits (597), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 132/376 (35%), Positives = 211/376 (56%), Gaps = 18/376 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           C    RS D C I G    D  + T F V       +  +   + +PY RK D++AM  V
Sbjct: 5   CTIMGRS-DFCEIKGDIRIDGSSYTVFIVSSETDILAAENTSWRIRPYARKGDQTAMGAV 63

Query: 76  KELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
           +E T+   A  +    C   H+ P ++FSAGGY GN FH F D  VPLF T   +  + +
Sbjct: 64  REWTLKLVAGGSDIPQCTQNHSVPGILFSAGGYAGNHFHAFTDIIVPLFSTARPY--NGE 121

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI--SHGRMVINPT 191
           V   I +    W  K+  +L   +R+ +INI+N+   HCF S+T+GL   S+  + I+P+
Sbjct: 122 VQFIITNGWSAWIAKFKTILKALSRYELINIDNRKDIHCFGSMTVGLKRPSYKELSIDPS 181

Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSSSFH---HTKPKLVLVNRNARVGRTILNLREVKK 248
               P ++ DF+ FL ++Y+     +         +P+L++++R  +  R   N+ E+  
Sbjct: 182 --KSPYSIKDFRQFLRSSYSLKKTRAIKIRDGMKKRPRLLIISR--KRSRAFTNVGEIVN 237

Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
            AE LGF V + EP     ++   + I+SC  ++GVHGAGLT+ +FL   +VL+QV+P G
Sbjct: 238 MAERLGFRVVVAEP--GMDVSGFSQIINSCDVVMGVHGAGLTNIVFLPEKAVLIQVIPFG 295

Query: 309 -TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
             +WLS  +FE+PA+ + + YL+YKI+ EES+L+++Y A+  VL++P       W   + 
Sbjct: 296 GAEWLSRTFFEEPAKDMNIRYLDYKIRVEESTLIQQYPADHAVLRDPSVIGKQGWLAFQS 355

Query: 367 VYLKTQNVKLDIDRFR 382
           +YL+ QNV +D++RFR
Sbjct: 356 IYLQKQNVTIDVNRFR 371


>gi|242059953|ref|XP_002459122.1| hypothetical protein SORBIDRAFT_03g046290 [Sorghum bicolor]
 gi|241931097|gb|EES04242.1| hypothetical protein SORBIDRAFT_03g046290 [Sorghum bicolor]
          Length = 527

 Score =  234 bits (596), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 142/395 (35%), Positives = 209/395 (52%), Gaps = 19/395 (4%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFF----SVGHTDSTPSQPSLRIKTQPYPRKSD 68
           +I CDRS    D+C ++G    D  + +           D   +  S     +PYPRK +
Sbjct: 126 TICCDRSSVHTDVCFMSGDVRTDAASLSLLLFPPQQHRHDQALNGTSEEETVRPYPRKWE 185

Query: 69  KSAMSKVKELTITTSAP----PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT 124
              M KV E+ +  +AP     +  C V H +P LV SAGGY GN FH F D F+P ++T
Sbjct: 186 SFIMDKVPEVRLRVAAPRGAEEDHRCDVQHDAPLLVMSAGGYTGNLFHAFNDGFLPSWVT 245

Query: 125 INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHG 184
           +  H   + V+L +   N  WA  ++E++   + H ++++ + T THCF    +G   HG
Sbjct: 246 VQ-HL-RRRVVLGVLSYNPWWAGMFSEVISGLSDHHVVDLLHDTRTHCFPGAIVGTRYHG 303

Query: 185 RMVINPTLLPKPKTLVDFQSFLANAYN----ENTNTSSSFHHTKPKLVLVNRNARVGRTI 240
            +V++   L   KT+VDF   LA+AY     E T T++     +P+L +V+R     R I
Sbjct: 304 ILVVDSARLRDNKTIVDFHQMLADAYERPPRETTTTTTVEQRRRPRLGIVSRKGT--RVI 361

Query: 241 LNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG-S 299
            N   V + A  +GFDV I E  +   L+  +  + +C A+VGVHGA LT  LFLRPG +
Sbjct: 362 ENQAAVARLASSVGFDVEILETADGRPLSTWYASLRACDALVGVHGADLTKFLFLRPGHA 421

Query: 300 VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ-AFA 358
            L Q+ P+G   ++   F  PA  +GLEY +Y+++  ESSL   Y A+D VL +P+ A  
Sbjct: 422 SLTQIAPLGVSPIAQEDFGVPAARMGLEYEQYEVRAGESSLARLYAADDAVLADPEKAMR 481

Query: 359 GANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKA 392
              W  + RVYL  QNV LD+ RFR  L   +  A
Sbjct: 482 EQGWDLVARVYLGGQNVTLDLARFRRTLARMHAHA 516


>gi|302766161|ref|XP_002966501.1| glycosyltransferase, CAZy family GT61 [Selaginella moellendorffii]
 gi|300165921|gb|EFJ32528.1| glycosyltransferase, CAZy family GT61 [Selaginella moellendorffii]
          Length = 441

 Score =  233 bits (595), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 132/387 (34%), Positives = 203/387 (52%), Gaps = 14/387 (3%)

Query: 7   ALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRK 66
            L+  +S++CD SH S  +C + G      K S       + +  +  ++  K +PY RK
Sbjct: 65  GLASRESVSCDFSHPSTSICSLQGDVRV--KGSNLTVALTSANQSAHSNVLAKIRPYTRK 122

Query: 67  SDKSAMSKVKELT-ITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
            +K  M  + E+  ++      ++C V H+ PA++FS GGY G+ FH+F D  VPLFIT 
Sbjct: 123 WEK-MMKTIGEVNMVSLPKSKQMACDVRHSVPAVIFSTGGYTGSIFHDFNDGLVPLFITA 181

Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR 185
                  +V+  +      W  KYA +L   T +P+++ + + + HCF  V +GL  HG 
Sbjct: 182 QRF--KGEVVFMVLQFKHWWPGKYAPILKHLTHYPVVDFDREQLVHCFPKVIVGLRIHGD 239

Query: 186 MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE 245
           ++I   L     ++  FQ+ L  A N           TKP LVLVNR     R I+N  E
Sbjct: 240 LLIEEGL--AGTSMRSFQNLLDIALNPG---QVVLPKTKPMLVLVNRET--SRVIVNRNE 292

Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
               AE+LG++V  F P  +T L++ +  +HS   ++GVHGA LTH LF+RPGS L+Q++
Sbjct: 293 TIALAEKLGYEVHTFAPNFNTRLSEIYSLLHSADVLIGVHGAALTHFLFMRPGSTLIQII 352

Query: 306 PIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
           P G    +   F +PA   GL Y+ Y+I   ES+L +++G N  V+ NP       W+  
Sbjct: 353 PFGLNGPAETCFGRPAEKAGLNYVGYQILPSESTLSDEFGLNHTVIVNPDEVTAKGWTER 412

Query: 366 -RVYLKTQNVKLDIDRFRIYLKDAYKK 391
            R+YL  QN++L + R    LKD   +
Sbjct: 413 KRIYLDGQNIRLHLPRLEQILKDVLTR 439


>gi|242093072|ref|XP_002437026.1| hypothetical protein SORBIDRAFT_10g018300 [Sorghum bicolor]
 gi|241915249|gb|EER88393.1| hypothetical protein SORBIDRAFT_10g018300 [Sorghum bicolor]
          Length = 572

 Score =  233 bits (595), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 139/382 (36%), Positives = 209/382 (54%), Gaps = 21/382 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTST--FFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           CD S+   ++C + G     P  +   F   GH     SQ     K +PYPRK D+  +S
Sbjct: 197 CDFSNFRANVCEMRGDVRVHPNATEIMFMEPGH-----SQRDELWKIKPYPRKGDEFCLS 251

Query: 74  KVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
            + ELT+ +S  AP    C   H  PA++FS  GY GN FH+F D  VPLF T  S F +
Sbjct: 252 HITELTVKSSKVAP---ECTKYHDMPAVIFSLTGYTGNLFHDFTDVMVPLFTTA-SEF-N 306

Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
            +V   I D    W  KY  +L + +++P+I+ +     HCF+ V +GL ++    I+ +
Sbjct: 307 GEVQFLITDMALWWTIKYHTVLQKLSKYPVIDFSKDDQVHCFKHVIVGLHAYMEFTIDSS 366

Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVLVNRNARVGRTILNLREVKK 248
             P   ++VDF  F+  AY+   +T +    +   KP+L+++ R+    R  LNL E+  
Sbjct: 367 KAPHNYSMVDFNRFMRGAYSLGRDTVTVLGEYPKVKPRLLIIKRHRT--RMFLNLDEIVA 424

Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
            AEELGF+V I E   S+ ++   R +++   M+GVHGAGLT+ +FL   + L+Q+VP G
Sbjct: 425 MAEELGFEVVIDEANVSSDISKFARLVNTVDVMMGVHGAGLTNCVFLPQNATLIQIVPWG 484

Query: 309 T-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
             +W+S   F  PA ++GL Y +Y I  +ESSL E+Y  +  + KNP AF    +  +R 
Sbjct: 485 GLEWVSRTDFGNPAELMGLHYKQYSIGVDESSLTEQYPRDHEIFKNPIAFHKNGFDFIRQ 544

Query: 367 VYLKTQNVKLDIDRFRIYLKDA 388
            ++  QNVKLD  RFR  L +A
Sbjct: 545 TFMDKQNVKLDCKRFRPILLEA 566


>gi|115442317|ref|NP_001045438.1| Os01g0956200 [Oryza sativa Japonica Group]
 gi|20161877|dbj|BAB90790.1| putative glycosyltransferase [Oryza sativa Japonica Group]
 gi|113534969|dbj|BAF07352.1| Os01g0956200 [Oryza sativa Japonica Group]
 gi|215741036|dbj|BAG97531.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 491

 Score =  233 bits (595), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 138/406 (33%), Positives = 210/406 (51%), Gaps = 25/406 (6%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQP-----SLRIK 59
           P +     +I CDRS  + D+C + G    DP + +          P QP     ++  +
Sbjct: 89  PCSSLAGDTICCDRSDFNTDVCFMAGDVRTDPSSLSLLLF------PKQPPAANATVEER 142

Query: 60  TQPYPRKSDKSAMSKVKELTITTSAP---PNLSCGVTHTSPALVFSAGGYNGNFFHEFMD 116
            +PY RK +   MS+V+E+ +  + P   P   C V H +P LV +AGGY GN FH F D
Sbjct: 143 IRPYTRKWEALIMSRVEEVRLRMAPPEEEPGHRCDVRHDAPLLVMTAGGYTGNLFHAFSD 202

Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
            FVP ++T+  H   + V+L +   N  WA  Y E++   + + ++++ +   THCF   
Sbjct: 203 GFVPAWLTVQ-HL-RRRVVLGVLSYNPWWAGTYGEIISGLSDYHVVDLLHDKRTHCFPGA 260

Query: 177 TLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT-----KPKLVLVN 231
            +G   HG + ++P  L   KT+VDF   LA  Y    +T      T     +P+L +V+
Sbjct: 261 IVGTRFHGILSVDPARLRDNKTIVDFHDLLAGVYETAGDTVVVDDVTQPAPRRPRLGIVS 320

Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
           R  R  R I N   V + A  +GFDV I E      L  S+  + +C  +VGVHGA LT 
Sbjct: 321 R--RGTRVIENQAAVARLARSVGFDVDILETANGLPLPASYASVSACDVLVGVHGADLTK 378

Query: 292 SLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
            LFLRPG+ L+Q+ P+G   ++   + + +  +GL Y +Y  +  ESSL  KYG  D+V+
Sbjct: 379 LLFLRPGAALVQIAPLGVAPIARGCYAEASARMGLHYEQYDAEGHESSLSRKYGLRDVVV 438

Query: 352 KNPQAFA-GANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
            +P+A      W  + RVYL  QNV LD+ RFR  L   + +A ++
Sbjct: 439 SDPEAAKRDKGWGFVARVYLGGQNVTLDLSRFRHTLTRLHARALRV 484


>gi|56409842|emb|CAI30069.1| glycosyltransferase [Triticum aestivum]
          Length = 503

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 139/406 (34%), Positives = 211/406 (51%), Gaps = 21/406 (5%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFF--------SVGHTDSTPSQPSL 56
           P A      I CDRS  + D+C + G    D  + +F              ++T ++   
Sbjct: 99  PCASMGGHGICCDRSDFNTDVCFMAGDVRTDAASLSFLLFAPQAPNGTASANATVAEKEE 158

Query: 57  RIKTQPYPRKSDKSAMSKVKELTITTSAPPNLS---CGVTHTSPALVFSAGGYNGNFFHE 113
           R+  QPY RK +K  M+ ++E+ +  + P       C V H +PAL  +AGGY GN FH 
Sbjct: 159 RV--QPYTRKWEKHLMANIQEVRLRPARPDEADAQPCQVRHDAPALGMTAGGYTGNLFHA 216

Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
           F D F+P ++T+  H   + V+LA+   N  WA  + EL+   + H +I++ +   THCF
Sbjct: 217 FNDGFLPAWLTVQ-HL-RRRVVLAVLAYNPWWAGTFRELVSGLSDHHVIDLLHDKRTHCF 274

Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY-NENTNTSSSFHHTKPKLVLVNR 232
               +G   HG + + P      +TLVDF  FLA AY +++T         +P+L L +R
Sbjct: 275 PGAIVGTRFHGILAVEPARTKDNRTLVDFHDFLAGAYKDDSTPQQQQKPRRRPRLGLYSR 334

Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHS 292
                R I N   V + AE +GFDV+I E      L+  +  + +C  +VGVHGA LT  
Sbjct: 335 KGT--RMIENEAAVARLAESVGFDVSILETANGAPLSSEYAAVSACDVLVGVHGADLTKL 392

Query: 293 LFLRP-GSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
           LFLRP  + L+Q+ P+G   ++   +EK   ++ + Y +Y     ESSLV KY A+D+VL
Sbjct: 393 LFLRPRRAALLQIAPMGVPHVARGCYEKATAMMEMHYEQYDAAANESSLVRKYPADDVVL 452

Query: 352 KNPQAFAGA-NWS-NMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
           ++P+A      W    RVYL  QNV LD+ RF   L+  + +A +L
Sbjct: 453 RDPEAATRERGWDLTARVYLGGQNVSLDLGRFGDTLRKLHSRALRL 498


>gi|56409848|emb|CAI30072.1| glycosyltransferase [Sorghum bicolor]
          Length = 572

 Score =  233 bits (593), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 139/382 (36%), Positives = 209/382 (54%), Gaps = 21/382 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTST--FFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           CD S+   ++C + G     P  +   F   GH     SQ     K +PYPRK D+  +S
Sbjct: 197 CDFSNFRANVCEMRGDVRVHPNATEIMFMEPGH-----SQRDELWKIKPYPRKGDEFCLS 251

Query: 74  KVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
            + ELT+ +S  AP    C   H  PA++FS  GY GN FH+F D  VPLF T  S F +
Sbjct: 252 HITELTVKSSKVAP---ECTKYHDMPAVIFSLTGYTGNLFHDFTDVMVPLFTTA-SEF-N 306

Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
            +V   I D    W  KY  +L + +++P+I+ +     HCF+ V +GL ++    I+ +
Sbjct: 307 GEVQFLITDMALWWTIKYHTVLQKLSKYPVIDFSKDDQVHCFKHVIVGLHAYMEFTIDSS 366

Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVLVNRNARVGRTILNLREVKK 248
             P   ++VDF  F+  AY+   +T +    +   KP+L+++ R+    R  LNL E+  
Sbjct: 367 KAPHNYSMVDFNRFMRGAYSLGRDTVTVLGEYPKVKPRLLIIKRHRT--RMFLNLDEIVV 424

Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
            AEELGF+V I E   S+ ++   R +++   M+GVHGAGLT+ +FL   + L+Q+VP G
Sbjct: 425 MAEELGFEVVIDEANVSSDISKFARLVNTVDVMMGVHGAGLTNCVFLPQNATLIQIVPWG 484

Query: 309 T-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
             +W+S   F  PA ++GL Y +Y I  +ESSL E+Y  +  + KNP AF    +  +R 
Sbjct: 485 GLEWVSRTDFGNPAELMGLHYKQYSIGVDESSLTEQYPRDHEIFKNPIAFHKNGFDFIRQ 544

Query: 367 VYLKTQNVKLDIDRFRIYLKDA 388
            ++  QNVKLD  RFR  L +A
Sbjct: 545 TFMDKQNVKLDCKRFRPILLEA 566


>gi|226507655|ref|NP_001142280.1| uncharacterized protein LOC100274449 [Zea mays]
 gi|194707986|gb|ACF88077.1| unknown [Zea mays]
 gi|413951273|gb|AFW83922.1| glycosyltransferase [Zea mays]
          Length = 529

 Score =  232 bits (591), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 140/419 (33%), Positives = 216/419 (51%), Gaps = 33/419 (7%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFF----SVGHTDSTPSQPSLRIKTQPYPRKSD 68
           +I CDRS    D+C + G    D  + +           + T S+     + +PYPRK +
Sbjct: 115 TICCDRSSVHTDVCFMAGDVRTDAASLSLLLFPPHQQAQNGTSSEEDKEERVRPYPRKWE 174

Query: 69  KSAMSKVKELTITTSAP--PNLS-------CGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
           +  M KV E+ +  +AP  P+         C V H +P LV SAGGY GN FH F D F+
Sbjct: 175 RFIMDKVPEVRLRVAAPRRPDGEREEEEHRCDVRHDAPLLVMSAGGYTGNLFHAFNDGFL 234

Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLG 179
           P ++T+  H   + V+L +   N  WA  + E++   + HP++++ + T THCF    +G
Sbjct: 235 PSWLTVQ-HL-RRRVVLGVVSYNPWWAGMFGEVISGLSDHPVVDLLHGTRTHCFPGAIVG 292

Query: 180 LISHGRMVINPTLLPKPKTLVDFQSFLANAY------------NENTNTSSSFHHTKPKL 227
              HG ++++P  L   KT+VDF   LA+AY             ++   +      +P+L
Sbjct: 293 TRYHGILIVDPARLRDNKTIVDFHQMLADAYEKPPRETARPAQQQDLRDAEQRRRRRPRL 352

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
            +V+R     R I N   V + A  +GFDV I E      L+  +  + +C A+VGVHGA
Sbjct: 353 GIVSRKGT--RVIENQAAVARLASSVGFDVDILETANGLPLSAWYASLRACDALVGVHGA 410

Query: 288 GLTHSLFLRPG-SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGA 346
            LT  LFLRPG + L Q+ P+G   ++   F +PA  +GL Y +Y+++  ESSL  +Y  
Sbjct: 411 DLTKFLFLRPGRASLTQIAPLGVSPIAREDFGEPAARMGLAYEQYEVRAGESSLARRYAP 470

Query: 347 NDLVLKNPQAFA--GANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDKEIKL 402
            D+VL +P+A       W+ + RVYL  QNV LD+ RFR  L   +  A +   ++ +L
Sbjct: 471 GDVVLTDPEAAKRDKGGWNLVARVYLGGQNVTLDLARFRQTLARMHAHAMRQRRRQERL 529


>gi|225456057|ref|XP_002277670.1| PREDICTED: uncharacterized glycosyltransferase AGO61-like [Vitis
           vinifera]
          Length = 462

 Score =  231 bits (590), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 139/389 (35%), Positives = 215/389 (55%), Gaps = 20/389 (5%)

Query: 12  KSITCDRSHRSYDLCLINGSALFDPKTSTFF--SVGHTDSTPSQPSLRIKTQPYPRKSDK 69
           +++ C+   RS D C+I+G       +ST F  S    D  P   S  I+  PY RK D 
Sbjct: 81  EALVCNIEPRS-DFCVISGDVRVHGNSSTVFIASSAPVDILPENGSWSIR--PYARKGDA 137

Query: 70  SAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
            AM  +K  T+  TT       C   HT PA++FS GGY+GN FH F D  +PL++T  S
Sbjct: 138 RAMKHIKNFTVKMTTGRQHLPHCTQNHTVPAILFSLGGYSGNHFHAFSDVLIPLYLT--S 195

Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
              + +V   +   +  W  K+  LL   +R+PII+I+ +   HCF S  +GL  H  + 
Sbjct: 196 RQFNGEVQFLVTSKSLWWIAKFRILLQELSRYPIIDIDREEGIHCFSSAIIGLKCHKELD 255

Query: 188 INPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH----TKPKLVLVNRNARVGRTILNL 243
           I+P+    P ++ DF+ FL ++Y+    T+          KP+L+++ R  +  R+  N 
Sbjct: 256 IDPS--KSPYSMKDFREFLRSSYSLKRATAIKVRDGTDTKKPRLLIIAR--KKSRSFTND 311

Query: 244 REVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
            ++ + A  LG++V + EP   T ++     ++SC  ++GVHGAGLT+ +FL   +VL+Q
Sbjct: 312 GKIAEMARSLGYEVIVAEPN-GTEISRFAELVNSCDVLMGVHGAGLTNIVFLPENAVLIQ 370

Query: 304 VVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW 362
           VVP+G  +W++   F  PA  + + Y+EY+IK+EESSL+EKY     VL+ P +     W
Sbjct: 371 VVPLGGLEWVARYDFGLPAVDMKIRYIEYQIKEEESSLIEKYPHEHAVLREPHSITKLGW 430

Query: 363 SNMR-VYLKTQNVKLDIDRFRIYLKDAYK 390
             ++ VYL  QNVKLD++RFR  L  A +
Sbjct: 431 LELKAVYLDKQNVKLDLNRFRNTLLQALQ 459


>gi|56409850|emb|CAI30073.1| glycosyltransferase [Saccharum officinarum]
          Length = 573

 Score =  231 bits (590), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 138/382 (36%), Positives = 209/382 (54%), Gaps = 21/382 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTST--FFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           CD S+   ++C + G     P  ++  F   GH     SQ     K +PYPRK D+  +S
Sbjct: 198 CDFSNFRANVCEMRGDVRVHPNATSIMFMEPGH-----SQRDELWKIKPYPRKGDEFCLS 252

Query: 74  KVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
            + ELT+ +S  AP    C   H  PA++FS  GY GN FH+F D  VPLF T  S F +
Sbjct: 253 HITELTVKSSKVAP---ECTKYHDVPAVIFSLTGYTGNLFHDFTDVMVPLFTTA-SEF-N 307

Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
            +V   I D    W  KY  +L + +++P+I+ +     HCF+ V +GL ++    I+ +
Sbjct: 308 GEVQFLITDMALWWTIKYHTVLQKLSKYPVIDFSKDDQVHCFKHVIVGLHAYMEFTIDSS 367

Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVLVNRNARVGRTILNLREVKK 248
             P   ++VDF  F+  AY+   +T +    +   KP+L+++ R+    R  LNL E+  
Sbjct: 368 KAPHNYSMVDFNRFMRGAYSLGRDTVTVLGEYPKVKPRLLIIKRHRT--RMFLNLDEIIA 425

Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
            AEELGF+V I E   S+ ++     +++   M+GVHGAGLT+ +FL   + L+Q+VP G
Sbjct: 426 MAEELGFEVVIDEANVSSDISKFASLVNTVDVMMGVHGAGLTNCVFLPQNATLIQIVPWG 485

Query: 309 T-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
             +W+S   F  PA ++GL Y +Y I  +ESSL E+Y  +  + KNP AF    +  +R 
Sbjct: 486 GLEWVSRTDFGNPAELMGLHYKQYSIGVDESSLTEQYPRDHEIFKNPIAFHKKGFDFIRQ 545

Query: 367 VYLKTQNVKLDIDRFRIYLKDA 388
            ++  QNVKLD  RFR  L +A
Sbjct: 546 TFMDKQNVKLDCKRFRPILLEA 567


>gi|125529168|gb|EAY77282.1| hypothetical protein OsI_05257 [Oryza sativa Indica Group]
          Length = 491

 Score =  231 bits (589), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 137/406 (33%), Positives = 210/406 (51%), Gaps = 25/406 (6%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQP-----SLRIK 59
           P +     +I CDRS  + D+C + G    DP + +          P QP     ++  +
Sbjct: 89  PCSSLAGDTICCDRSDFNTDVCFMAGDVRTDPSSLSLLLF------PKQPPAANATVEER 142

Query: 60  TQPYPRKSDKSAMSKVKELTITTSAP---PNLSCGVTHTSPALVFSAGGYNGNFFHEFMD 116
            +PY RK +   MS+V+E+ +  + P   P   C V H +P LV +AGGY GN FH F D
Sbjct: 143 IRPYTRKWEALIMSRVEEVRLRMAPPEEEPGHRCDVRHDAPLLVMTAGGYTGNLFHAFSD 202

Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
            FVP ++T+  H   + V+L +   N  WA  Y E++   + + ++++ +   THCF   
Sbjct: 203 GFVPAWLTVQ-HL-RRRVVLGVLSYNPWWAGTYGEIISGLSDYHVVDLLHDKRTHCFPGA 260

Query: 177 TLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT-----KPKLVLVN 231
            +G   HG + ++P  L   KT+VDF   LA  Y    +T      T     +P+L +V+
Sbjct: 261 IVGTRFHGILSVDPARLRDNKTIVDFHDLLAGVYETAGDTVVVDDVTQPAPRRPRLGIVS 320

Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
           R  R  R I N   V + A  +GF+V I E      L  S+  + +C  +VGVHGA LT 
Sbjct: 321 R--RGTRVIENQAAVARLARSVGFNVDILETANGLPLPASYASVSACDVLVGVHGADLTK 378

Query: 292 SLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
            LFLRPG+ L+Q+ P+G   ++   + + +  +GL Y +Y  +  ESSL  KYG  D+V+
Sbjct: 379 LLFLRPGAALVQIAPLGVAPIARGCYAEASARMGLHYEQYDAEGHESSLSRKYGLRDVVV 438

Query: 352 KNPQAFA-GANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
            +P+A      W  + RVYL  QNV LD+ RFR  L   + +A ++
Sbjct: 439 SDPEAAKRDKGWGFVARVYLGGQNVTLDLSRFRHTLTRLHARALRV 484


>gi|53791308|dbj|BAD52573.1| putative HGA6 [Oryza sativa Japonica Group]
 gi|53791442|dbj|BAD52494.1| putative HGA6 [Oryza sativa Japonica Group]
          Length = 603

 Score =  230 bits (587), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 136/391 (34%), Positives = 209/391 (53%), Gaps = 26/391 (6%)

Query: 10  PAKSITCDRSHRSYD-------LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQP 62
           P   I+CD   +          +C ++G     P TS   SV  T       +   + +P
Sbjct: 209 PKSKISCDDKSKDEGFPYARPIVCHLSGDVRVSPATS---SVTLTMPLQQGEAAARRIRP 265

Query: 63  YPRKSDKSAMSKVKELTITTSAPPN--LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
           Y R+ D   +  V+E+ IT++A      SC V+H  PA++FS GGY GNFFH+  D  VP
Sbjct: 266 YARRDD-FLLPLVREVAITSAASEGDAPSCNVSHGVPAVIFSIGGYTGNFFHDMADVLVP 324

Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
           L++T   HF  + V L +A+    W +KY  +L R +   +++ ++    HCF  V +GL
Sbjct: 325 LYLT-TFHFKGK-VQLFVANYKQWWIQKYKPVLRRLSHRAVVDFDSDGDVHCFDHVIVGL 382

Query: 181 ISHGRMVI--NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNAR 235
           +    +++  +PT  PK  T+VDF  FL +AY    +       T   KP++++++R  R
Sbjct: 383 VRDRDLILGQHPTRNPKGYTMVDFTRFLRHAYGLRRDKPMVLGETSGKKPRMLIISR--R 440

Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHS 292
             R +LNLR+V   A ELGF+V + E           RF   ++SC  +VGVHGAGLT+ 
Sbjct: 441 RTRKLLNLRQVAAMARELGFEVVVSEAGVGGGSGGVKRFASAVNSCDVLVGVHGAGLTNQ 500

Query: 293 LFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
            FL  G V++Q+VP G  +W++T ++  PA  + L Y+EY +  EESSL  +Y     V 
Sbjct: 501 AFLPRGGVVVQIVPWGRMEWMATNFYGAPAAAMELRYVEYHVAAEESSLARRYPREHAVF 560

Query: 352 KNPQAFAGANWSNMRVYLKTQNVKLDIDRFR 382
           ++P A  G  W  +   + TQ+VKL++ RFR
Sbjct: 561 RDPMAIHGQGWKALADIVMTQDVKLNLRRFR 591


>gi|357132085|ref|XP_003567663.1| PREDICTED: uncharacterized protein LOC100828889 [Brachypodium
           distachyon]
          Length = 657

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 134/382 (35%), Positives = 205/382 (53%), Gaps = 20/382 (5%)

Query: 16  CD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
           CD  S+R  D C ++G         T   V   D+  +    R+K  PYPRK+D SAM  
Sbjct: 278 CDFASNRRIDWCELDGDVRVHGANGT---VTLVDAAMAAEEWRVK--PYPRKADASAMRF 332

Query: 75  VKELTITTSAPPNLS-----CGVTHTS-PALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
           V+E+T+  S PPN +     C   H   PALVFS  GY GN+FH + D  +PLF+T   +
Sbjct: 333 VREITVR-STPPNSANAAPACTERHEGVPALVFSDRGYTGNYFHAYTDVILPLFLTARQY 391

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
               +V   ++D    W  K+  +    + +P+I++   +  HCF+ V +GL  H    I
Sbjct: 392 --SGEVQFMVSDFQMWWIGKFMPVFKSLSNYPLIDLAADSRVHCFKHVQVGLTCHADFSI 449

Query: 189 NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKK 248
           +P+  P   ++VDF  F+   Y    + ++  +  +P+L+++ R AR  R   NL E+ +
Sbjct: 450 DPSRSPNGYSMVDFTKFMRQTYKLPRDLAAPINGARPRLLIIAR-ART-RRFDNLAEIVR 507

Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
            AE++GF+  + E +    +A      ++C  M+GVHGAGLT+ +FL  G  ++QVVP+G
Sbjct: 508 GAEKVGFEAVVSEGDHE--VAPFAELSNTCDVMLGVHGAGLTNMIFLPTGGAVIQVVPLG 565

Query: 309 TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-V 367
                  YF  PA  +GL YLEY+I   ESSL E+Y  +  V  +P+      W +++  
Sbjct: 566 GLEFVAGYFRGPAADMGLRYLEYRIAPAESSLSEQYPPDHPVFTDPEGVKSKGWDSLKEA 625

Query: 368 YLKTQNVKLDIDRFRIYLKDAY 389
           YL  Q+VKLD+ RFR  LK A+
Sbjct: 626 YLDKQDVKLDMRRFRPLLKKAF 647


>gi|297595986|ref|NP_001041861.2| Os01g0118600 [Oryza sativa Japonica Group]
 gi|255672811|dbj|BAF03775.2| Os01g0118600 [Oryza sativa Japonica Group]
          Length = 560

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 136/391 (34%), Positives = 209/391 (53%), Gaps = 26/391 (6%)

Query: 10  PAKSITCDRSHRSYD-------LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQP 62
           P   I+CD   +          +C ++G     P TS   SV  T       +   + +P
Sbjct: 166 PKSKISCDDKSKDEGFPYARPIVCHLSGDVRVSPATS---SVTLTMPLQQGEAAARRIRP 222

Query: 63  YPRKSDKSAMSKVKELTITTSAPPN--LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
           Y R+ D   +  V+E+ IT++A      SC V+H  PA++FS GGY GNFFH+  D  VP
Sbjct: 223 YARRDD-FLLPLVREVAITSAASEGDAPSCNVSHGVPAVIFSIGGYTGNFFHDMADVLVP 281

Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
           L++T   HF  + V L +A+    W +KY  +L R +   +++ ++    HCF  V +GL
Sbjct: 282 LYLT-TFHFKGK-VQLFVANYKQWWIQKYKPVLRRLSHRAVVDFDSDGDVHCFDHVIVGL 339

Query: 181 ISHGRMVI--NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNAR 235
           +    +++  +PT  PK  T+VDF  FL +AY    +       T   KP++++++R  R
Sbjct: 340 VRDRDLILGQHPTRNPKGYTMVDFTRFLRHAYGLRRDKPMVLGETSGKKPRMLIISR--R 397

Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHS 292
             R +LNLR+V   A ELGF+V + E           RF   ++SC  +VGVHGAGLT+ 
Sbjct: 398 RTRKLLNLRQVAAMARELGFEVVVSEAGVGGGSGGVKRFASAVNSCDVLVGVHGAGLTNQ 457

Query: 293 LFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
            FL  G V++Q+VP G  +W++T ++  PA  + L Y+EY +  EESSL  +Y     V 
Sbjct: 458 AFLPRGGVVVQIVPWGRMEWMATNFYGAPAAAMELRYVEYHVAAEESSLARRYPREHAVF 517

Query: 352 KNPQAFAGANWSNMRVYLKTQNVKLDIDRFR 382
           ++P A  G  W  +   + TQ+VKL++ RFR
Sbjct: 518 RDPMAIHGQGWKALADIVMTQDVKLNLRRFR 548


>gi|449459640|ref|XP_004147554.1| PREDICTED: glycosyltransferase-like domain-containing protein
           2-like [Cucumis sativus]
          Length = 407

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 132/378 (34%), Positives = 205/378 (54%), Gaps = 12/378 (3%)

Query: 25  LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITTSA 84
           +CL N     +     F+   + DS  +  S  I   PY R+ DK  +  V  L I    
Sbjct: 35  VCLTNNPTRINNTNLEFYISTNNDSQQNNFS-PILIHPYARQEDKITLRDVTPLQIIFQP 93

Query: 85  PPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCN 142
              L   C   H  P L+FS GG+ GN FHEF +  +PLFIT + HF  + V   I D  
Sbjct: 94  NKTLLPLCQFIHNVPVLIFSTGGFTGNLFHEFDETIIPLFIT-SYHFQTR-VRFLITDHK 151

Query: 143 DQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDF 202
             W +KY  +L   +R  ++N       HCF    +GL  H  + +N T +P   ++ DF
Sbjct: 152 TWWVQKYNRILSGLSRFNVVNPAEDGSVHCFNGGVIGLKFHNILSLNNTDIPGGYSMSDF 211

Query: 203 QSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEP 262
           +SFL   YN   N  S     KP ++L++R  +  R  +N  E+ +  +E+GF+V    P
Sbjct: 212 RSFLRQTYNLKVNNVSELSGKKPMVMLISR--QTSRRFMNEGEMVEMMKEVGFEVMTTTP 269

Query: 263 EESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPAR 322
           +  ++L      ++ C  ++G HGAGLT+ +FL  G+V++QVVP G  W ST +F KPA 
Sbjct: 270 QRMSNLDKFSSVVNLCSVIIGAHGAGLTNEVFLANGAVVVQVVPFGLDWPSTYFFGKPAA 329

Query: 323 VLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQA-FAGANWSNMRVYLKTQNVKLDIDRF 381
            + L+YLEYKI+ +ESSL +KYG N  V+++P++ FA   +++  +Y+  QN+K+++ RF
Sbjct: 330 EMELQYLEYKIEAKESSLWDKYGENHPVIRDPESIFAQGYFASRAIYIDEQNLKINLTRF 389

Query: 382 RIYLKDAYKKAKKLMDKE 399
           R    D   + KKL++++
Sbjct: 390 R----DTMIQVKKLIEEK 403


>gi|449503225|ref|XP_004161896.1| PREDICTED: glycosyltransferase-like domain-containing protein
           2-like [Cucumis sativus]
          Length = 407

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 132/378 (34%), Positives = 205/378 (54%), Gaps = 12/378 (3%)

Query: 25  LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITTSA 84
           +CL N     +     F+   + DS  +  S  I   PY R+ DK  +  V  L I    
Sbjct: 35  VCLTNNPTRINNTNLEFYISTNNDSQQNNFS-PILIHPYARQEDKITLRDVTPLQIIFQP 93

Query: 85  PPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCN 142
              L   C   H  P L+FS GG+ GN FHEF +  +PLFIT + HF  + V   I D  
Sbjct: 94  NKTLLPLCQFIHNVPVLIFSTGGFTGNLFHEFDETIIPLFIT-SYHFQTR-VRFLITDHK 151

Query: 143 DQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDF 202
             W +KY  +L   +R  ++N       HCF    +GL  H  + +N T +P   ++ DF
Sbjct: 152 TWWVQKYNRILSGLSRFNVVNPAEDGSVHCFNGGVIGLKFHNILSLNNTDIPGGYSMSDF 211

Query: 203 QSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEP 262
           +SFL   YN   N  S     KP ++L++R  +  R  +N  E+ +  +E+GF+V    P
Sbjct: 212 RSFLRQTYNLKVNNVSELSGKKPMVMLISR--QTSRRFMNEGEMVEMMKEVGFEVMTTTP 269

Query: 263 EESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPAR 322
           +  ++L      ++ C  ++G HGAGLT+ +FL  G+V++QVVP G  W ST +F KPA 
Sbjct: 270 QRMSNLDKFSSVVNLCSVIIGAHGAGLTNEVFLANGAVVVQVVPFGLDWPSTYFFGKPAA 329

Query: 323 VLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQA-FAGANWSNMRVYLKTQNVKLDIDRF 381
            + L+YLEYKI+ +ESSL +KYG N  V+++P++ FA   +++  +Y+  QN+K+++ RF
Sbjct: 330 EMELQYLEYKIEAKESSLWDKYGENHPVIRDPESIFAQGYFASRAIYIDEQNLKINLTRF 389

Query: 382 RIYLKDAYKKAKKLMDKE 399
           R    D   + KKL++++
Sbjct: 390 R----DTMIQVKKLIEEK 403


>gi|223974685|gb|ACN31530.1| unknown [Zea mays]
          Length = 572

 Score =  229 bits (585), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 136/382 (35%), Positives = 208/382 (54%), Gaps = 21/382 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTST--FFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           CD S+   ++C + G     P  ++  F   GH     SQ     K +PYPRK D+  +S
Sbjct: 197 CDFSNFRANVCEMRGDVRVHPNAASIMFMEPGH-----SQRDELWKIKPYPRKGDEFCLS 251

Query: 74  KVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
            + ELT+ +S  AP    C   H  PA++FS  GY GN FH+F D  VPLF T  S F +
Sbjct: 252 HITELTVKSSKVAP---ECSKYHDVPAVIFSLTGYTGNLFHDFTDVMVPLFTTA-SEF-N 306

Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
            +V   I D    W  KY  +L + +++P+I+ +     HCF+ V +GL ++    I+ +
Sbjct: 307 GEVQFLITDMALWWTIKYHTVLEKLSKYPVIDFSKDGEVHCFKHVIVGLHAYMEFTIDSS 366

Query: 192 LLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKK 248
             P   ++VDF  F+  AY+   ++      +   KP+L+++ R+    R  LNL E+  
Sbjct: 367 KAPHNYSMVDFNRFMRGAYSLGRDSVTVLGEYPKVKPRLLIIKRHRT--RMFLNLDEIIA 424

Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
            AEELGF+V I E   S+ ++     +++   M+GVHGAGLT+ +FL   + L+Q+VP G
Sbjct: 425 MAEELGFEVVIDEANVSSDISKFAALVNTVDVMMGVHGAGLTNCVFLPQNATLIQIVPWG 484

Query: 309 T-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
             +W+S   F  PA ++GL Y +Y I   ESSL E+Y ++  + KNP +F    +  +R 
Sbjct: 485 GLEWVSRTDFGNPAELMGLHYKQYSIGVHESSLTEQYPSDHEIFKNPISFHKKGFDFIRQ 544

Query: 367 VYLKTQNVKLDIDRFRIYLKDA 388
            ++  QNVKLD  RFR  L +A
Sbjct: 545 TFMDKQNVKLDCKRFRPILLEA 566


>gi|255558677|ref|XP_002520363.1| glycosyltransferase, putative [Ricinus communis]
 gi|223540461|gb|EEF42029.1| glycosyltransferase, putative [Ricinus communis]
          Length = 515

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 128/384 (33%), Positives = 207/384 (53%), Gaps = 18/384 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           C+   RS D C + G    D  +ST F V   +   +  +     +PY RK D +AM   
Sbjct: 133 CNLMERS-DFCELKGDIRIDANSSTIFIVSSGNDNLAATNTSWSIRPYARKGDAAAMRHT 191

Query: 76  KELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
           +E ++   +       C   H +  ++FS GGY+GN FH F D  VPLF T      + D
Sbjct: 192 REWSVKQVSNHRKIPECTQNHNALGIIFSLGGYSGNHFHAFTDIIVPLFSTARPF--NGD 249

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           V   + D    W  K+  LL   +R+ +I+I+ +   HCF S+T+GL       +N    
Sbjct: 250 VQFLVTDRQPWWIAKFRILLKALSRYEVIDIDKREEIHCFTSITIGLKRQSNKELNIDQS 309

Query: 194 PKPKTLVDFQSFLANAYNENTNTSSSF-------HHTKPKLVLVNRNARVGRTILNLREV 246
               ++ DF+ FL ++Y+    T+  F        + +P+L++++R  +  R   N+ E+
Sbjct: 310 KFRYSMKDFRQFLRSSYSLRKTTAIKFMKGTGREKNRRPRLLIISR--KRSRAFTNVGEI 367

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            K A+ LG+ V + EP+   S   S + ++SC  ++GVHGAGLT+ +FL   ++L+QVVP
Sbjct: 368 AKMAKGLGYKVVVDEPDADVS--RSAQVMNSCDVVLGVHGAGLTNMVFLPDNAILIQVVP 425

Query: 307 IG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
            G  +W+S ++FE+P++ + + YLEYKI  EESSLV +Y ++ +VL++P       W   
Sbjct: 426 FGGAEWVSKIFFEEPSKDMNIRYLEYKISIEESSLVHQYPSDHVVLRDPSVIQNQGWEAF 485

Query: 366 R-VYLKTQNVKLDIDRFRIYLKDA 388
           + +Y   QNVK+D++RFR  L  A
Sbjct: 486 KSIYFDKQNVKIDLNRFRPTLSKA 509


>gi|414876592|tpg|DAA53723.1| TPA: hypothetical protein ZEAMMB73_458729 [Zea mays]
          Length = 412

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 138/396 (34%), Positives = 206/396 (52%), Gaps = 26/396 (6%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S    D+C + G    +   S F  V    +    P+   + +PY RK D ++M +V
Sbjct: 18  CDLSGSRADVCDLTGDIRLNATASEFIVVDPA-AADDGPTTAYEIRPYARKGDATSMGRV 76

Query: 76  KELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
             +T+   A  + +  C VTH  PA+ FS GGY GN FH+F D  VPL+ T   +    D
Sbjct: 77  TAVTVRVRATADAAPRCTVTHAEPAVAFSVGGYTGNLFHDFTDVIVPLYGTAQRY--RGD 134

Query: 134 VILAIADCNDQ---WARKYAELLPRFTRHPIININ---NQTITHCFQSVTLGLISHGRMV 187
           V L + D       W  KY  +L   +RHP ++++        HCF  V LGL +   ++
Sbjct: 135 VRLVVTDAGSSSSWWLAKYDAVLRGLSRHPPLHLHLAKAAGEVHCFGHVVLGLRARRELM 194

Query: 188 INPTLLP---KPKT--LVDFQSFLANAYN------ENTNTSSSFHHTKPKLVLVNRNARV 236
           + P   P   +P    + DF  FL  A +           +SS    KP+L++V+R  R 
Sbjct: 195 VEPDPDPDRGRPDGVGMADFARFLRRALSLPRDAPTRPAAASSDDRRKPRLLVVSR--RG 252

Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
            R +LN   V + AEE+GF+  + E E S   A   R I+S  A+VGVHGAGLT+ +FL 
Sbjct: 253 TRRLLNADAVARVAEEVGFEAVVSELEVSRDAAGVGRLINSFDALVGVHGAGLTNMVFLP 312

Query: 297 PGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
           PG+ ++Q+VP G  QW++ + F  PA  +GL Y++Y+I   ESSL +KY  +  +  NP 
Sbjct: 313 PGATVVQIVPWGGLQWIARMDFGDPADAMGLRYIQYEIAVHESSLADKYPRDHEIFTNPT 372

Query: 356 AFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYK 390
           A     +  +R  +L  Q++ LD+DRFR+ L  A +
Sbjct: 373 ALHKKGFKFLRHTFLIGQDITLDVDRFRVVLLQALR 408


>gi|162460428|ref|NP_001105751.1| glycosyltransferase [Zea mays]
 gi|56409852|emb|CAI30074.1| glycosyltransferase [Zea mays]
          Length = 572

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 136/382 (35%), Positives = 209/382 (54%), Gaps = 21/382 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTST--FFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           CD S+   ++C + G     P  ++  F   GH     SQ     K +PYPRK D+  +S
Sbjct: 197 CDFSNFRANVCEMRGDVRVHPNAASIMFMEPGH-----SQRDELWKIKPYPRKGDEFCLS 251

Query: 74  KVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
            + ELT+ +S  AP    C   H  PA++FS  GY GN FH+F D  VPLF T  S F +
Sbjct: 252 HITELTVKSSKVAP---ECSKYHDVPAVIFSLTGYTGNLFHDFTDVMVPLFTTA-SEF-N 306

Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
            +V   I D    W  KY  +L + +++P+I+ +     HCF+ V +GL ++    I+ +
Sbjct: 307 GEVQFLITDMALWWTIKYHTVLEKLSKYPVIDFSKDGEVHCFKHVIVGLHAYMEFTIDSS 366

Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVLVNRNARVGRTILNLREVKK 248
             P   ++VDF  F+  AY+   ++ +    +   KP+L+++ R+    R  LNL E+  
Sbjct: 367 KAPHNYSMVDFNRFMRGAYSLGRDSVTLLGEYPKVKPRLLIIKRHRT--RMFLNLDEIIA 424

Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
            AEELGF+V I E   S+ ++     +++   M+GVHGAGLT+ +FL   + L+Q+VP G
Sbjct: 425 MAEELGFEVVIDEANVSSDISKFAALVNTVDVMMGVHGAGLTNCVFLPQNATLIQIVPWG 484

Query: 309 T-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
             +W+S   F  PA ++GL Y +Y I   ESSL E+Y ++  + KNP +F    +  +R 
Sbjct: 485 GLEWVSRTDFGNPAELMGLHYKQYSIGVHESSLTEQYPSDHEIFKNPISFHKKGFDFIRQ 544

Query: 367 VYLKTQNVKLDIDRFRIYLKDA 388
            ++  QNVKLD  RFR  L +A
Sbjct: 545 TFMDKQNVKLDCKRFRPILLEA 566


>gi|223945917|gb|ACN27042.1| unknown [Zea mays]
 gi|414876578|tpg|DAA53709.1| TPA: hypothetical protein ZEAMMB73_101479 [Zea mays]
          Length = 681

 Score =  228 bits (581), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 135/393 (34%), Positives = 206/393 (52%), Gaps = 18/393 (4%)

Query: 16  CD-RSHRSYDLCLINGSALFDPKTSTFFSV----GHTDSTPSQPSLRIKTQPYPRKSDKS 70
           CD  S+R  D C ++G        ++   V    G    T    S RIK  PYPRK+D +
Sbjct: 288 CDITSNRRIDWCELDGDVRVLGANASVTLVAPPGGADGRTFRAESWRIK--PYPRKADPN 345

Query: 71  AMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
           AM  V+ LT+ +      +C   H  PALVFS  GY GN+FH F D  +PLF+T   +  
Sbjct: 346 AMRVVRVLTVRSVPGEAPACTDRHGVPALVFSDRGYTGNYFHAFTDVILPLFLTARQYA- 404

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
             +V L + D    W  K+A +    + + +++++     HCF+ V +GL SH    I+P
Sbjct: 405 -GEVRLLVTDLQAWWVGKFAPVFRSISNYELVDLDRDPRVHCFRHVQVGLTSHDDFSIDP 463

Query: 191 TLLPKPKTLVDFQSFLANAYN----ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
              P   +++DF  F+  AY     +    +      +P+L+L+ R AR  R  +N  E+
Sbjct: 464 RRAPNGYSMLDFTGFMRAAYGLPRGDVAAAAGPSSKRRPRLLLIAR-ART-RRFVNAEEI 521

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            + AE+LGF+V + E   +  +A      +SC A++GVHGAGLT+ +F+  G V++QVVP
Sbjct: 522 VRGAEKLGFEVVVSE--GTHEVAPFAELANSCDAIMGVHGAGLTNMVFVPTGGVVIQVVP 579

Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
           +G       YF  P+R +GL YLEY+I  EES+L+ +Y  +  +  +P       W +++
Sbjct: 580 LGGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLINQYPRDHPIFTDPNGIKSKGWESLK 639

Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
             YL  Q+V LD+ RFR  LK A    +K   K
Sbjct: 640 DAYLDKQDVSLDMKRFRPTLKKAIAHIRKARAK 672


>gi|212275650|ref|NP_001130385.1| uncharacterized protein LOC100191481 [Zea mays]
 gi|194688994|gb|ACF78581.1| unknown [Zea mays]
 gi|414876577|tpg|DAA53708.1| TPA: glycosyltransferase [Zea mays]
          Length = 682

 Score =  228 bits (580), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 135/393 (34%), Positives = 206/393 (52%), Gaps = 18/393 (4%)

Query: 16  CD-RSHRSYDLCLINGSALFDPKTSTFFSV----GHTDSTPSQPSLRIKTQPYPRKSDKS 70
           CD  S+R  D C ++G        ++   V    G    T    S RIK  PYPRK+D +
Sbjct: 289 CDITSNRRIDWCELDGDVRVLGANASVTLVAPPGGADGRTFRAESWRIK--PYPRKADPN 346

Query: 71  AMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
           AM  V+ LT+ +      +C   H  PALVFS  GY GN+FH F D  +PLF+T   +  
Sbjct: 347 AMRVVRVLTVRSVPGEAPACTDRHGVPALVFSDRGYTGNYFHAFTDVILPLFLTARQYA- 405

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
             +V L + D    W  K+A +    + + +++++     HCF+ V +GL SH    I+P
Sbjct: 406 -GEVRLLVTDLQAWWVGKFAPVFRSISNYELVDLDRDPRVHCFRHVQVGLTSHDDFSIDP 464

Query: 191 TLLPKPKTLVDFQSFLANAYN----ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
              P   +++DF  F+  AY     +    +      +P+L+L+ R AR  R  +N  E+
Sbjct: 465 RRAPNGYSMLDFTGFMRAAYGLPRGDVAAAAGPSSKRRPRLLLIAR-ART-RRFVNAEEI 522

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            + AE+LGF+V + E   +  +A      +SC A++GVHGAGLT+ +F+  G V++QVVP
Sbjct: 523 VRGAEKLGFEVVVSE--GTHEVAPFAELANSCDAIMGVHGAGLTNMVFVPTGGVVIQVVP 580

Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
           +G       YF  P+R +GL YLEY+I  EES+L+ +Y  +  +  +P       W +++
Sbjct: 581 LGGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLINQYPRDHPIFTDPNGIKSKGWESLK 640

Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
             YL  Q+V LD+ RFR  LK A    +K   K
Sbjct: 641 DAYLDKQDVSLDMKRFRPTLKKAIAHIRKARAK 673


>gi|413935496|gb|AFW70047.1| hypothetical protein ZEAMMB73_657787 [Zea mays]
          Length = 436

 Score =  228 bits (580), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 130/384 (33%), Positives = 205/384 (53%), Gaps = 19/384 (4%)

Query: 15  TCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
           TC  + +  + C  +G    D   ST +  G             KT+PY R  D  AM+ 
Sbjct: 60  TCYMTSKRSERCDASGDIRVDGNRSTIYVSGI--------DREWKTKPYARYHDPVAMAH 111

Query: 75  VKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
           V+E T+     A P  +C   H+ P  +FS GG++GN +H++ D  VPLFI+  +H    
Sbjct: 112 VREYTLKPLPEAAPAPACTRNHSVPGFLFSNGGFSGNLYHDYTDVLVPLFIS--THQFRG 169

Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
            V   ++     W  K+     + TR+ +I+++N    HCF  + +G   H  M ++P  
Sbjct: 170 RVQFLLSGMKPWWVAKFTPFFRQLTRYDVIDVDNDQEVHCFPRIVVGATFHKDMGVDPRR 229

Query: 193 LPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA 249
            P   ++VDF+  L  A+    E  +   +    KP+L++++R  R  R  LN RE+ +A
Sbjct: 230 SPGHVSVVDFKRALRRAFGLPREAASRGGATGRGKPRLLIISR--RGSRRFLNEREMARA 287

Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
           A   GF+V + EP++ T  A     ++S   MVGVHGAGLT+ +FL  G+VL+QVVP G 
Sbjct: 288 AAGAGFEVRVAEPDQHTDTAAFAALVNSADVMVGVHGAGLTNMVFLPRGAVLVQVVPFGG 347

Query: 310 -QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-V 367
            +WL+ V F+ PA  + + Y+ Y +  EESSL+++Y  N  VL +P A     W  ++  
Sbjct: 348 LEWLTGVTFKDPAADMEVSYMGYDVTLEESSLIDQYPRNHQVLTDPYAVHKQGWDALKAA 407

Query: 368 YLKTQNVKLDIDRFRIYLKDAYKK 391
           YL  QN+++D+DRFR  L++A  +
Sbjct: 408 YLDKQNIRMDLDRFRATLREAMSR 431


>gi|226492922|ref|NP_001147068.1| glycosyltransferase [Zea mays]
 gi|224028515|gb|ACN33333.1| unknown [Zea mays]
 gi|413935495|gb|AFW70046.1| glycosyltransferase [Zea mays]
          Length = 488

 Score =  228 bits (580), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 130/384 (33%), Positives = 205/384 (53%), Gaps = 19/384 (4%)

Query: 15  TCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
           TC  + +  + C  +G    D   ST +  G             KT+PY R  D  AM+ 
Sbjct: 112 TCYMTSKRSERCDASGDIRVDGNRSTIYVSGI--------DREWKTKPYARYHDPVAMAH 163

Query: 75  VKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
           V+E T+     A P  +C   H+ P  +FS GG++GN +H++ D  VPLFI+  +H    
Sbjct: 164 VREYTLKPLPEAAPAPACTRNHSVPGFLFSNGGFSGNLYHDYTDVLVPLFIS--THQFRG 221

Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
            V   ++     W  K+     + TR+ +I+++N    HCF  + +G   H  M ++P  
Sbjct: 222 RVQFLLSGMKPWWVAKFTPFFRQLTRYDVIDVDNDQEVHCFPRIVVGATFHKDMGVDPRR 281

Query: 193 LPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA 249
            P   ++VDF+  L  A+    E  +   +    KP+L++++R  R  R  LN RE+ +A
Sbjct: 282 SPGHVSVVDFKRALRRAFGLPREAASRGGATGRGKPRLLIISR--RGSRRFLNEREMARA 339

Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
           A   GF+V + EP++ T  A     ++S   MVGVHGAGLT+ +FL  G+VL+QVVP G 
Sbjct: 340 AAGAGFEVRVAEPDQHTDTAAFAALVNSADVMVGVHGAGLTNMVFLPRGAVLVQVVPFGG 399

Query: 310 -QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-V 367
            +WL+ V F+ PA  + + Y+ Y +  EESSL+++Y  N  VL +P A     W  ++  
Sbjct: 400 LEWLTGVTFKDPAADMEVSYMGYDVTLEESSLIDQYPRNHQVLTDPYAVHKQGWDALKAA 459

Query: 368 YLKTQNVKLDIDRFRIYLKDAYKK 391
           YL  QN+++D+DRFR  L++A  +
Sbjct: 460 YLDKQNIRMDLDRFRATLREAMSR 483


>gi|242095990|ref|XP_002438485.1| hypothetical protein SORBIDRAFT_10g020470 [Sorghum bicolor]
 gi|241916708|gb|EER89852.1| hypothetical protein SORBIDRAFT_10g020470 [Sorghum bicolor]
          Length = 498

 Score =  227 bits (579), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 137/399 (34%), Positives = 211/399 (52%), Gaps = 27/399 (6%)

Query: 9   SPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSD 68
           +P   +TC+      D C + G      K++T + V  +   P   S+ I+  PY RK +
Sbjct: 105 APISKLTCNFDTPHSDTCTMEGDLRIHGKSATVYVVAASTHRPENSSITIR--PYTRKWE 162

Query: 69  KSAMSKVKELTITTSAPPNLS------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF 122
           +  MS+++E+T+  S PP  S      C V+H  PA+VFS GG   NFFH   D  VPL+
Sbjct: 163 QETMSRIREITMR-SMPPAFSFIIPPKCTVSHGVPAVVFSTGGCGTNFFHAMSDLIVPLY 221

Query: 123 ITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS 182
           IT   +  +  V L IAD   +W  K+  +L   + +P+I+ +  T   CF S  +GL S
Sbjct: 222 ITSREY--NGRVQLLIADYEPKWVAKFRPILAALSMYPVIDFDADTAVRCFPSAHVGLES 279

Query: 183 HGRMV-INPTLLPKPKTLVDFQSFLANAYNEN-------TNTSSSFHHTKPKLVLVNRNA 234
           H +M+ I+P+L     T++ F+ FL + ++         +  SSS    KP+LV+V R  
Sbjct: 280 HNKMLAIDPSLSRNGYTMMGFRDFLRSVFSLQRPWSKPVSRRSSSGRQQKPRLVMVLR-- 337

Query: 235 RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
           R  R + N  +   A E+LGF+V    PE+   +      ++SC  MVGVHGAGLT+ +F
Sbjct: 338 RHSRELTNEADAISAMEDLGFEVVAALPEDVRDMGHFAGVVNSCDVMVGVHGAGLTNMVF 397

Query: 295 LRPGSVLMQVVPI-GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKN 353
           L     ++Q+VP  G +W     F +P   +GL Y+EY+   EE++L EKY  +  V  +
Sbjct: 398 LPHNGTVVQIVPWGGMKWACWYAFGEPVPAMGLRYVEYEATAEETTLKEKYPRDHPVFAD 457

Query: 354 PQAF--AGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
           PQ+    G  W+    +L  QNV LDIDRF+  ++  ++
Sbjct: 458 PQSIHRQGKTWAT---FLNGQNVTLDIDRFKGVMQQVFQ 493


>gi|357133695|ref|XP_003568459.1| PREDICTED: uncharacterized protein LOC100837177 [Brachypodium
           distachyon]
          Length = 574

 Score =  227 bits (578), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 133/378 (35%), Positives = 205/378 (54%), Gaps = 13/378 (3%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S+   ++C + G     P+ +   S+   +   SQ     K +PYPRK D+  +S +
Sbjct: 199 CDFSNFRANVCEMRGDVRVHPRAT---SILFMEPDGSQRDEVWKIKPYPRKGDEFCLSHI 255

Query: 76  KELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVI 135
            ELT+ +S      C   H  PA++FS  GY GN FH+F D  VPLF T  S F + +V 
Sbjct: 256 TELTVKSSKVA-AECTKYHDVPAVIFSLTGYTGNLFHDFTDVLVPLFTTA-SEF-NGEVQ 312

Query: 136 LAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPK 195
             I D    W  KY  +L + +++P+I+ +     HCF+   +G  ++    I+ +  P 
Sbjct: 313 FLITDMALWWTIKYQTVLQKLSKYPLIDFSKDDQVHCFKHAIVGTHAYMEFTIDSSKAPH 372

Query: 196 PKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAEE 252
             T+VDF  F+  AY+   +T+     T   KP+L+++ R+    R  LNL E+   AEE
Sbjct: 373 GLTMVDFNRFMRGAYSLGKDTAVVLGETPKVKPRLLIIKRHRT--RMFLNLEEIITMAEE 430

Query: 253 LGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QW 311
           LGF+V I E   S+ ++   + ++S   M+GVHGAGLT+ +FL   + L+Q+VP G   W
Sbjct: 431 LGFEVVIDEANVSSDISGFAKLVNSVDVMMGVHGAGLTNCVFLPHNATLIQIVPWGGLDW 490

Query: 312 LSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLK 370
           +S   F  PA ++GL Y +Y I  +ESSL ++Y  +  + KNP +F    +  + R ++ 
Sbjct: 491 ISRTDFGNPAEMMGLRYKQYAISVDESSLTDQYPKDHEIFKNPISFHKRGFEFIRRTFMD 550

Query: 371 TQNVKLDIDRFRIYLKDA 388
            QNVKLD  RFR  L +A
Sbjct: 551 KQNVKLDCKRFRPILLEA 568


>gi|326497163|dbj|BAK02166.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 548

 Score =  226 bits (577), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 131/392 (33%), Positives = 205/392 (52%), Gaps = 10/392 (2%)

Query: 7   ALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRK 66
           A +P     CD S    D+C + G      K++T + V  +D +    +  +K +PYPRK
Sbjct: 153 ATAPRSKSICDFSSERTDICAMQGDVRMHGKSATVYIVLASDDSYRPENGTVKIRPYPRK 212

Query: 67  SDKSAMSKVKELTITTSAPPNL-SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
           S++  M  ++E+TI  S   ++  C VTH  PA+VFS G Y  NFFH   D  +PLF T+
Sbjct: 213 SEEGTMHSIREVTIRWSGLEDVPRCTVTHDVPAVVFSTGAYLDNFFHAMTDGIIPLFNTV 272

Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR 185
             +  +  V L + D N +W  K+  +L   + +P+I+ +      CF SV +G   H  
Sbjct: 273 REY--EGRVQLVVTDYNRKWVDKFQGILGALSIYPVIDFDADDKVRCFPSVQVGTEGHKE 330

Query: 186 MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILN 242
           M I P L  K  T+ DF++FL + Y+     S   + T   +P+L++V R  R  R   N
Sbjct: 331 MGIIPALSRKGYTMTDFRAFLRSVYSLKREWSVPVNRTSSDRPRLLMVLR--RNSRAFAN 388

Query: 243 LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
             E   AA E+GF+V +  PE  + +A     ++SC  MVGVHGAGLT+ +FL   + L+
Sbjct: 389 EAEAVSAATEVGFEVVLGAPEALSDMARFAEVVNSCDVMVGVHGAGLTNLVFLPRNATLV 448

Query: 303 QVVPI-GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGAN 361
           Q+VP  G  W S   F  P+  +GL Y++Y+   EE++L  KY     +  +  +     
Sbjct: 449 QIVPWGGMSWGSNAAFGAPSADMGLRYVQYETTAEETTLKYKYPKEHAIFTDVASIKRQG 508

Query: 362 WS-NMRVYLKTQNVKLDIDRFRIYLKDAYKKA 392
           +     ++L  QN+ LDIDR++  L+  Y+ +
Sbjct: 509 YHVTWELFLNGQNITLDIDRYKGVLQKIYQDS 540


>gi|326520259|dbj|BAK07388.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 625

 Score =  226 bits (577), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 133/384 (34%), Positives = 205/384 (53%), Gaps = 21/384 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S R  D+C   G    D   STFF V   D+  +  S +++  PYPRK D + M +V
Sbjct: 245 CDFSDRRSDVCDFTGDIRMDANASTFFLV--VDAATAAQSHKVR--PYPRKGDPTCMGRV 300

Query: 76  KELTITT---SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
            E+T+ T   S+PP   C  TH  PA+ FS GGY GN FH+F D  VPL+ T++ +    
Sbjct: 301 PEITVRTTSSSSPPE--CTRTHGVPAVTFSIGGYTGNIFHDFSDVLVPLYNTVHRY--RG 356

Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTI---THCFQSVTLGLISHGRMVIN 189
           DV L +A+    W  KY +LL   +RH  +++        THCF+   + L +H  ++I 
Sbjct: 357 DVQLVMANVAPWWLVKYDKLLRELSRHAPLDLAAAAAKGETHCFRHAVVSLRAHRELIIE 416

Query: 190 PTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREV 246
               P      DF  F+  A +   +  +        KP+L+++ R+    R +LNL ++
Sbjct: 417 RDRSPDGLATPDFTRFIRRAISLPRDAPTRLADGTGRKPRLLIIARHRT--RILLNLGDM 474

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            + AEE GF+  + E +    ++     I+S   +VGVHGAGLT+ +FL PG+ ++QVVP
Sbjct: 475 MRVAEEAGFEAAVSESDVGDPISRVGAEINSADVLVGVHGAGLTNMMFLAPGATMVQVVP 534

Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
            G  QW++ + +  PA  +GL Y++Y+I  +ESSL +KY     +  +P +     +  M
Sbjct: 535 WGGLQWIARMDYGDPAEAMGLRYVQYEIGVDESSLKDKYPRGHKIFTDPTSLHKKGFGFM 594

Query: 366 -RVYLKTQNVKLDIDRFRIYLKDA 388
            R  +  QN+ LD+ RFR  L+ A
Sbjct: 595 RRTLMDGQNITLDLGRFRGVLQQA 618


>gi|326487780|dbj|BAK05562.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 551

 Score =  226 bits (576), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 125/382 (32%), Positives = 211/382 (55%), Gaps = 21/382 (5%)

Query: 10  PAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDK 69
           P  S  CD S+  +D+C + G A    ++ST   V  T ++ S+    I+ Q       +
Sbjct: 169 PLNSTVCDLSNPRFDICELCGDARTIGQSSTVMYVPRTQTSDSE-EWSIRAQ------SR 221

Query: 70  SAMSKVKELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
             +  +KE+T+ +  ++ P   C   H  PA+VF+ GG   N +H+F D  VPLF+T   
Sbjct: 222 KNLPWIKEVTVKSLNTSQPAPRCTSKHAMPAIVFALGGLTANVWHDFSDVLVPLFLTARQ 281

Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
              D+DV L + +    +++KY  +L + TR+ II+ ++     C+  V +GL SHG + 
Sbjct: 282 F--DRDVQLLVTNNQPWFSKKYMTILSKLTRYDIIDFDSDDQVRCYPYVIVGLRSHGDLG 339

Query: 188 INPTLLPKPKTLVDFQSFLANAYNENT-------NTSSSFHHTKPKLVLVNRNARVGRTI 240
           I P L P+  T++DF+ F+  AY                    KP+++L++R     R  
Sbjct: 340 IYPNLSPQNYTMMDFRLFVREAYGLPAAKVAIPYKADRDDPDKKPRIMLIDRGKT--RRF 397

Query: 241 LNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSV 300
           +N   + +  E  GF+V   +P+  +SL +  R + SC A++G HGAGLT+ +FLR G V
Sbjct: 398 INAPYIVQGLEWFGFEVVKVDPKMDSSLDEFARLVDSCDAIMGAHGAGLTNMVFLRSGGV 457

Query: 301 LMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGA 360
           ++ +VP G ++++  ++ KPAR +GL +++Y I  EES+L+EKYG N  V+K+P+A   +
Sbjct: 458 VVHIVPYGIEFMADGFYGKPARDMGLGHVKYGISPEESTLLEKYGWNHTVIKDPEAIRSS 517

Query: 361 NWSNM-RVYLKTQNVKLDIDRF 381
            W  +  VY+  Q++ L++ RF
Sbjct: 518 GWDKVGEVYMSKQDIVLNMTRF 539


>gi|242061414|ref|XP_002451996.1| hypothetical protein SORBIDRAFT_04g013150 [Sorghum bicolor]
 gi|241931827|gb|EES04972.1| hypothetical protein SORBIDRAFT_04g013150 [Sorghum bicolor]
          Length = 589

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 130/383 (33%), Positives = 211/383 (55%), Gaps = 15/383 (3%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S+   + C + G+    P  S+   V  T S+P +   + K +PYPRK D+  ++ +
Sbjct: 212 CDFSNFRANTCEMRGNIRIHPNGSSVIYVEPTSSSPKRNE-QWKIKPYPRKGDELCLNHI 270

Query: 76  KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            E+T+ +S  AP    C   H  PA++F+  GY GN FH+F D  VPLF T  S F + +
Sbjct: 271 TEVTVKSSKVAP---ECSKYHHVPAVIFALTGYTGNLFHDFTDVLVPLFTTA-SEF-NGE 325

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           V   I D    W RKYA +  + T++P+I+ N     HCF+   +GL ++    I+P   
Sbjct: 326 VQFLITDMAIWWTRKYAVVFEKLTKYPLIDFNKDNEVHCFKHAIVGLHAYMEFTIDPLKA 385

Query: 194 PKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNRNARVGRTILNLREVKKAA 250
           P   ++VDF  F+   Y+   +  ++      T+P+L++++R     R  LNL+E+   A
Sbjct: 386 PHNYSMVDFNRFMRRTYSLPRDAVTALGEIPKTRPRLLIISRQR--TRMFLNLKEIVAMA 443

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT- 309
           +E+G+DV + E   ++++    + ++S   M+GVHGAGLT+ +FL   ++L+Q+VP G  
Sbjct: 444 DEIGYDVVVEEANVNSNVTHFAKVVNSVDVMMGVHGAGLTNCVFLPHDAILIQIVPWGAL 503

Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VY 368
             +  + F  PA  +GL Y  Y I   ESSL E+Y  +  + +NP AF    +  +R  +
Sbjct: 504 DGICRIDFGYPAEQMGLRYKHYSIGVHESSLTEQYPLDHEIFRNPLAFHKNGFEFVRQTF 563

Query: 369 LKTQNVKLDIDRFRIYLKDAYKK 391
           + TQNV+LD +RFR  L +A  +
Sbjct: 564 MDTQNVRLDCNRFRPILLEALDQ 586


>gi|195607024|gb|ACG25342.1| glycosyltransferase [Zea mays]
          Length = 488

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 132/389 (33%), Positives = 206/389 (52%), Gaps = 23/389 (5%)

Query: 10  PAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDK 69
           P   +T  RS R    C  +G    D   ST +  G             KT+PY R  D 
Sbjct: 111 PTSYMTSKRSER----CDASGDIRVDGNRSTIYVSGI--------DREWKTKPYARYHDP 158

Query: 70  SAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
            AM+ V+E T+     A P  +C   H+ P  +FS GG++GN +H++ D  VPLFI+  +
Sbjct: 159 VAMAHVREYTLKPLPEAAPAPACTRNHSVPGFLFSNGGFSGNLYHDYTDVLVPLFIS--T 216

Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
           H     V   ++     W  K+     + TR+ +I+++N    HCF  + +G   H  M 
Sbjct: 217 HQFRGRVQFLLSGMKPWWVAKFTPFFRQLTRYDVIDVDNDQEVHCFPRIVVGATFHKDMG 276

Query: 188 INPTLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLR 244
           ++P   P   ++VDF+  L  A+    E  +   +    KP+L++++R  R  R  LN R
Sbjct: 277 VDPRRSPGHVSVVDFKRALRRAFGLPREAASRGGATGRGKPRLLIISR--RGSRRFLNER 334

Query: 245 EVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
           E+ +AA   GF+V + EP++ T  A     ++S   MVGVHGAGLT+ +FL  G+VL+QV
Sbjct: 335 EMARAAAGAGFEVRVAEPDQHTDTAAFAALVNSADVMVGVHGAGLTNMVFLPRGAVLVQV 394

Query: 305 VPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS 363
           VP G  +WL+ V F+ PA  + + ++ Y +  EESSL+++Y  N  VL +P A     W 
Sbjct: 395 VPFGGLEWLTGVTFKDPAADMEVSHMGYDVTLEESSLIDQYPRNHQVLTDPYAVHKQGWD 454

Query: 364 NMR-VYLKTQNVKLDIDRFRIYLKDAYKK 391
            ++  YL  QN+++D+DRFR  L++A  +
Sbjct: 455 ALKAAYLDKQNIRMDLDRFRATLREAMSR 483


>gi|115453815|ref|NP_001050508.1| Os03g0567600 [Oryza sativa Japonica Group]
 gi|13957626|gb|AAK50581.1|AC084404_6 unknown protein [Oryza sativa Japonica Group]
 gi|108709375|gb|ABF97170.1| glycosyltransferase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113548979|dbj|BAF12422.1| Os03g0567600 [Oryza sativa Japonica Group]
 gi|125586859|gb|EAZ27523.1| hypothetical protein OsJ_11476 [Oryza sativa Japonica Group]
 gi|215704366|dbj|BAG93800.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 576

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 134/380 (35%), Positives = 205/380 (53%), Gaps = 17/380 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S+   ++C + G     P  +   SV   +   SQ     K +PYPRK D+  +S +
Sbjct: 201 CDFSNFRANVCEMRGDVRIHPTAT---SVLFMEPEGSQRDEVWKIKPYPRKGDEFCLSHI 257

Query: 76  KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            E+T+ +S  AP    C   H  PA++FS  GY GN FH+F D  VPLF T  S F + +
Sbjct: 258 TEVTVKSSKVAP---ECTKYHDVPAVIFSLTGYTGNLFHDFTDVLVPLFTTA-SEF-NGE 312

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           V   I D    W  KY  +L + +++P+I+ +     HCF+   +GL ++    I+ T  
Sbjct: 313 VQFLITDMALWWTIKYQTVLQKLSKYPVIDFSKDDQVHCFKHAIVGLHAYMEFTIDSTKA 372

Query: 194 PKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
           P   ++ DF  F+  AY+   ++      +   KP+L+++ R+    R  LNL E+   A
Sbjct: 373 PHNYSMADFNRFMRGAYSLGRDSVTVLGEYPKIKPRLLIIKRHRT--RMFLNLDEIISMA 430

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT- 309
           EELGF+V I E   S+ ++   R ++S   M+GVHGAGLT+ +FL   + L+Q+VP G  
Sbjct: 431 EELGFEVVIDEANVSSDISRFARLVNSVDVMMGVHGAGLTNCVFLPQHATLIQIVPWGGL 490

Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VY 368
            W+S   F  PA ++GL Y +Y I  +ESSL ++Y  +  + KNP +F    +  +R  +
Sbjct: 491 DWISRTDFGNPAELMGLRYKQYSIGVDESSLTDQYPRDHEIFKNPISFHQRGFDFIRQTF 550

Query: 369 LKTQNVKLDIDRFRIYLKDA 388
           +  QNVKLD  RFR  L +A
Sbjct: 551 MDKQNVKLDCKRFRPILLEA 570


>gi|125544555|gb|EAY90694.1| hypothetical protein OsI_12297 [Oryza sativa Indica Group]
          Length = 576

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 134/380 (35%), Positives = 205/380 (53%), Gaps = 17/380 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S+   ++C + G     P  +   SV   +   SQ     K +PYPRK D+  +S +
Sbjct: 201 CDFSNFRANVCEMRGDVRIHPTAT---SVLFMEPEGSQRDEVWKIKPYPRKGDEFCLSHI 257

Query: 76  KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            E+T+ +S  AP    C   H  PA++FS  GY GN FH+F D  VPLF T  S F + +
Sbjct: 258 TEVTVKSSKVAP---ECTKYHDVPAVIFSLTGYTGNLFHDFTDVLVPLFTTA-SEF-NGE 312

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           V   I D    W  KY  +L + +++P+I+ +     HCF+   +GL ++    I+ T  
Sbjct: 313 VQFLITDMALWWTIKYQTVLQKLSKYPVIDFSKDDQVHCFKHAIVGLHAYMEFTIDSTKA 372

Query: 194 PKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
           P   ++ DF  F+  AY+   ++      +   KP+L+++ R+    R  LNL E+   A
Sbjct: 373 PHNYSMADFNRFMRGAYSLGRDSVTVLGEYPKIKPRLLIIKRHRT--RMFLNLDEIISMA 430

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT- 309
           EELGF+V I E   S+ ++   R ++S   M+GVHGAGLT+ +FL   + L+Q+VP G  
Sbjct: 431 EELGFEVVIDEANVSSDISRFARLVNSVDVMMGVHGAGLTNCVFLPQHATLIQIVPWGGL 490

Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VY 368
            W+S   F  PA ++GL Y +Y I  +ESSL ++Y  +  + KNP +F    +  +R  +
Sbjct: 491 DWISRTDFGNPAELMGLRYKQYSIGVDESSLTDQYPRDHEIFKNPISFHQRGFDFIRQTF 550

Query: 369 LKTQNVKLDIDRFRIYLKDA 388
           +  QNVKLD  RFR  L +A
Sbjct: 551 MDKQNVKLDCKRFRPILLEA 570


>gi|242095992|ref|XP_002438486.1| hypothetical protein SORBIDRAFT_10g020480 [Sorghum bicolor]
 gi|241916709|gb|EER89853.1| hypothetical protein SORBIDRAFT_10g020480 [Sorghum bicolor]
          Length = 478

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 135/396 (34%), Positives = 209/396 (52%), Gaps = 24/396 (6%)

Query: 9   SPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSD 68
           +P   +TC+ S+   D C + G      K++T + V  +   P   +  ++  PY RK +
Sbjct: 88  APVSKLTCNFSNPHSDTCTMEGDLRIHGKSATVYVVAASTHRPENSTFIVR--PYTRKWE 145

Query: 69  KSAMSKVKELTITTSAPPNLS------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF 122
           +  MS+++E+T+  S PP  S      C V H  PA+VFS GG  GNFFH   D  VPL+
Sbjct: 146 QETMSRIREVTMR-SMPPAFSFIIPPKCTVRHDVPAVVFSTGGCGGNFFHAMSDLIVPLY 204

Query: 123 ITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS 182
           IT   +  +  V   I D   +W  K+  +L   + +P+I+ +  T   CF S  +GL S
Sbjct: 205 ITSREY--NGHVQFLITDYRPEWVAKFRPILAALSMYPVIDFDADTAVRCFPSAHVGLQS 262

Query: 183 HGRMV-INPTLLPKPKTLVDFQSFLANAYNEN----TNTSSSFHHTKPKLVLVNRNARVG 237
           H +M+ I+P+L     T++ F+ FL + ++         S S    KP+LV+V R  R  
Sbjct: 263 HNKMLAIDPSLSRNGYTMMGFRDFLRSVFSLQRPWVEPVSRSSGRQKPRLVMVLR--RHS 320

Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
           R + N  E   A E+LGF+V    PE+   +      ++SC  MVGVHGAGLT+ +FL  
Sbjct: 321 RELTNEAETITAMEDLGFEVVAALPEDVRDMGHFAGVVNSCDVMVGVHGAGLTNMVFLPH 380

Query: 298 GSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQA 356
              ++Q+VP G  +W     F +P   +GL Y+EY++  EE++L EKY  +  V  +P++
Sbjct: 381 NGTVVQIVPWGDMKWACWYDFGEPVPGMGLRYVEYEVTAEETTLKEKYPRDHPVFTDPRS 440

Query: 357 F--AGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
               G  WS    +L  QN+ LDIDRFR  ++  ++
Sbjct: 441 IHRQGKAWST---FLNGQNITLDIDRFRAVMQQVFQ 473


>gi|326532652|dbj|BAJ89171.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 577

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 133/378 (35%), Positives = 203/378 (53%), Gaps = 13/378 (3%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S+   ++C + G     PK +   SV   +   SQ     K +PYPRK D+  +S +
Sbjct: 202 CDFSNFRANVCEMRGDVRVHPKAT---SVLFMEPEGSQRDEVWKIKPYPRKGDEFCLSHI 258

Query: 76  KELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVI 135
            ELT+ +S      C   H  P ++FS  GY GN FH+F D  VPLF T  S F D +V 
Sbjct: 259 TELTVKSSKVA-AECTRYHDVPVVIFSLTGYTGNLFHDFTDVIVPLFTTA-SQF-DGEVQ 315

Query: 136 LAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPK 195
             I D    W  KY  LL + +++P+I+       HCF+   +G  ++    I+ +  P 
Sbjct: 316 FLITDMALWWTIKYHTLLQKLSKYPLIDFGKDDQVHCFKHAIVGTHAYMEFTIDASKSPH 375

Query: 196 PKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAEE 252
             T+VDF  F+ +AY+    T+++   +   KP+L+++ R+    R  LNL E+   AEE
Sbjct: 376 GVTMVDFNRFMRDAYSLPKETAAALGESPKVKPRLLIIKRHRT--RMFLNLEEIIAMAEE 433

Query: 253 LGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QW 311
           LGF+V I E   S+ +    + ++S   M+GVHGAGLT+ +FL   + L+Q+VP G   W
Sbjct: 434 LGFEVVIDEANVSSDINGFAKLVNSVDVMMGVHGAGLTNCVFLPQNATLIQIVPFGGLDW 493

Query: 312 LSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLK 370
           +S   F  P+ ++GL Y +Y I  +ESSL + Y  +  + K+P +F    +  + R ++ 
Sbjct: 494 ISRTDFGNPSEMMGLRYKQYAITVDESSLTDHYPRDHKIFKDPISFHKRGFEFIRRTFMD 553

Query: 371 TQNVKLDIDRFRIYLKDA 388
            QNVKLD  RFR  L +A
Sbjct: 554 KQNVKLDCKRFRPVLLEA 571


>gi|413954173|gb|AFW86822.1| hypothetical protein ZEAMMB73_198004 [Zea mays]
          Length = 486

 Score =  225 bits (573), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 135/390 (34%), Positives = 209/390 (53%), Gaps = 23/390 (5%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           +TCD  +   D C + G      K++T + V  +   P   ++ ++  PY RK ++  MS
Sbjct: 102 LTCDFGNPHSDTCTMEGDLRIHGKSATVYVVAASTHRPENSTITVR--PYTRKWEQETMS 159

Query: 74  KVKELTITTSAPPNLS------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
           +++E+T+  S PP  S      C V H  PA+VFS GG   NFFH   D  VPL+IT   
Sbjct: 160 RIREVTMR-SMPPAFSFVIPPKCTVRHDVPAVVFSTGGCGTNFFHGMSDLVVPLYITSRE 218

Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
           +  D  V L IAD N +W  K+  +L   +  P+I+++      CF SV +GL SH +M+
Sbjct: 219 Y--DGRVQLLIADYNAEWVAKFRPVLAALSTFPVIDLDADAAVRCFPSVHVGLESHKKML 276

Query: 188 -INPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH---HTKPKLVLVNRNARVGRTILNL 243
            I+P+L     T++ F+ FL + ++     ++        KP+L++V R  R  R + N 
Sbjct: 277 AIDPSLSRNGYTMMGFRDFLRSVFSLQRPWATPVRLGSGQKPRLLMVLR--RRSRELSNE 334

Query: 244 REVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
            E   A EE+GF+V    PE+ + ++     ++SC  MVGVHGAGLT+ +FL     ++Q
Sbjct: 335 AETIAAMEEIGFEVVAARPEDVSDMSRFAGVVNSCDVMVGVHGAGLTNMVFLPHNGTVVQ 394

Query: 304 VVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF--AGA 360
           +VP G  +W     F +P   +GL Y+EY+   EE++L EKY  +  V  +PQ+    G 
Sbjct: 395 IVPWGNMKWACWYAFGEPVPGMGLRYVEYEATAEETTLKEKYPRDHPVFTDPQSIHRQGK 454

Query: 361 NWSNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
            WS    +L  QNV LDIDRF+  +   ++
Sbjct: 455 AWST---FLNGQNVTLDIDRFKAVMLQVFQ 481


>gi|238011450|gb|ACR36760.1| unknown [Zea mays]
 gi|413926794|gb|AFW66726.1| hypothetical protein ZEAMMB73_527920 [Zea mays]
          Length = 491

 Score =  225 bits (573), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 129/389 (33%), Positives = 207/389 (53%), Gaps = 24/389 (6%)

Query: 15  TCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
           TC  + +  + C  +G    D   ST + VG  +          +T+PY R  D  AM+ 
Sbjct: 111 TCYMTSKRSERCDASGDIRVDGNRSTIY-VGGIERE-------WRTKPYARYHDPVAMAH 162

Query: 75  VKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
           V+E T+       A    +C   H+ P  +FS GG++GN +H++ D  VPLFI+  +H  
Sbjct: 163 VREYTLKALPEPGAAAAPACTRNHSVPGFLFSNGGFSGNLYHDYTDVLVPLFIS--THQF 220

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
              V   ++     W  K+     + TRH +I+++     HCF  + +G   H  M ++P
Sbjct: 221 RGRVQFLVSGMKPWWVGKFTPFFRQLTRHDVIDVDKDGEVHCFPRIVVGATFHRDMGVDP 280

Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSS------SFHHTKPKLVLVNRNARVGRTILNLR 244
              P   + VDF+  L  A+      +S      +    KP+L++++R  R  R  LN R
Sbjct: 281 RRAPGHVSAVDFKRALRAAFGLKREAASRGGGGGATGDGKPRLLIISR--RGSRRFLNSR 338

Query: 245 EVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
           E+  AA + GF+V + EP++ T +A     ++S  AMVGVHGAGLT+ +FL  G+VL+QV
Sbjct: 339 EMAVAAGDAGFEVRVAEPDQRTDMAAFAALVNSADAMVGVHGAGLTNMVFLPRGAVLVQV 398

Query: 305 VPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS 363
           VP G  +WL+ V F++PA  + + Y++Y ++ EESSLV++Y     VL +P A     W 
Sbjct: 399 VPFGGLEWLTGVTFKEPAADMEVSYMDYHVRLEESSLVDQYPRGHQVLTDPYAVHRQGWD 458

Query: 364 NMR-VYLKTQNVKLDIDRFRIYLKDAYKK 391
            ++  YL  QN+++D+DRFR  L++   +
Sbjct: 459 ALKTAYLDKQNIRMDLDRFRATLREVMAR 487


>gi|242052307|ref|XP_002455299.1| hypothetical protein SORBIDRAFT_03g008000 [Sorghum bicolor]
 gi|241927274|gb|EES00419.1| hypothetical protein SORBIDRAFT_03g008000 [Sorghum bicolor]
          Length = 568

 Score =  225 bits (573), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 134/389 (34%), Positives = 209/389 (53%), Gaps = 28/389 (7%)

Query: 10  PAKSITCDRSHRSYD-------LCLINGSALFDPKTSTFFSVGHTDSTPS-QPSLRIKTQ 61
           P   I CD   +          +C ++G     P++S+        +TP  Q     +  
Sbjct: 180 PKSKIYCDDKSKDDGFPYARPIICQMSGDVRVSPESSSV-----ALNTPMLQGEEERRIT 234

Query: 62  PYPRKSDKSAMSKVKELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
           PY RK D S +S V+E+ I   A  N +  C ++H  PA++FS GGY GNFFH+  D  +
Sbjct: 235 PYARKDD-SLLSLVREVVIRAVANENDAPKCSISHDVPAVIFSVGGYTGNFFHDMSDVLI 293

Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLG 179
           PL++T   +     V   I +    W +KY  +L R + H II+ ++    HCFQ V LG
Sbjct: 294 PLYLTSFQY--KGRVKFFITNYKQWWIQKYKPVLRRLSHHDIIDFDSNKDVHCFQHVILG 351

Query: 180 LISHGRMVI--NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNA 234
           L     +++  +PT  PK  +++DF  FL ++Y    N           KP++++++R  
Sbjct: 352 LTRDRDLILRPHPTRNPKGYSMLDFTRFLRHSYGLKRNRPLVLGEQPGKKPRMLIISR-- 409

Query: 235 RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
           R  R +LNLR+V   +  LGFDV I   E   +L      ++SC  ++ VHGAGLT+ +F
Sbjct: 410 RGTRKLLNLRQVAATSRALGFDVII--SEARGNLKRFATMVNSCDVLLAVHGAGLTNQVF 467

Query: 295 LRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKN 353
           L P +V++Q+VP G   W++T ++ +PAR + L YLEY + ++ESSL ++Y  + LV K+
Sbjct: 468 LPPQAVVVQIVPWGKMDWMATNFYGQPARGMNLRYLEYYVSEKESSLAQRYPRDHLVFKD 527

Query: 354 PQAFAGANWSNMRVYLKTQNVKLDIDRFR 382
           P A  G  W+ +      Q+VKL+I RFR
Sbjct: 528 PMAIHGQGWNALADIFMAQDVKLNIRRFR 556


>gi|224135465|ref|XP_002322080.1| predicted protein [Populus trichocarpa]
 gi|222869076|gb|EEF06207.1| predicted protein [Populus trichocarpa]
          Length = 387

 Score =  224 bits (571), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 128/376 (34%), Positives = 203/376 (53%), Gaps = 14/376 (3%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKS--DKSA 71
           + C+   RS + C I G    D  +ST F V       +  +     +PY RK    +  
Sbjct: 7   LRCNIMGRS-EFCEIKGDIRIDGNSSTAFIVSSETDILTAENTSWSIRPYARKEALGEKD 65

Query: 72  MSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
            ++   + + T  P    C   H+ PA++FS GGY+GNFFH F D  VPL+ T      +
Sbjct: 66  FARKWSVKLVTDRPDIPRCTRNHSVPAILFSVGGYSGNFFHAFTDIIVPLYSTAQPF--N 123

Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
           ++V   I +    W  K+  LL   +R+ II+I+++   HCFQS+T+GL       ++  
Sbjct: 124 REVQFLITNRKSSWIAKFKTLLEALSRYEIIDIDDRHDMHCFQSLTIGLKGRNNKELSID 183

Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSSSFH---HTKPKLVLVNRNARVGRTILNLREVKK 248
               P ++ DF  FL   Y+    T++        KP+L++++R  +  R   N+ E+ +
Sbjct: 184 SSTSPYSMKDFTQFLRRWYSLKKITAAKIRDGDKRKPRLLIISR--KRSRAFTNVGEIAE 241

Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
            AE L + V + EP     ++   + I+SC  ++GVHGAGLT+ +FL   ++L+QVVP G
Sbjct: 242 LAESLSYQVIVAEP--GPDVSGFAKIINSCDVVMGVHGAGLTNIVFLPENAILIQVVPFG 299

Query: 309 -TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
             +W S V FE PA+ + + YL+YKIK EES+L+++Y A+ +VL++P       W   R 
Sbjct: 300 RVEWASRVSFEDPAKDMNIRYLDYKIKVEESTLIQQYPADHVVLRDPSVIGKQGWLAFRS 359

Query: 367 VYLKTQNVKLDIDRFR 382
           +YL  QNV LD++RFR
Sbjct: 360 IYLDKQNVTLDVNRFR 375


>gi|195657307|gb|ACG48121.1| glycosyltransferase [Zea mays]
          Length = 681

 Score =  224 bits (571), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 133/392 (33%), Positives = 205/392 (52%), Gaps = 17/392 (4%)

Query: 16  CD-RSHRSYDLCLINGSALFDPKTSTFFSV----GHTDSTPSQPSLRIKTQPYPRKSDKS 70
           CD  S+R  D C ++G        ++   V    G    T    S RIK  PYPRK+D +
Sbjct: 289 CDITSNRRIDWCELDGDVRVLGANASVTLVAPPGGADGRTFRAESWRIK--PYPRKADPN 346

Query: 71  AMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
           AM  V+ LT+ +      +C   H  PALVFS  GY GN+FH F D  +PLF+T   +  
Sbjct: 347 AMRVVRVLTVRSVPGEAPACTDRHGVPALVFSDRGYTGNYFHAFTDVILPLFLTARQYA- 405

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
             +V L + D    W  K+A +    + + +++++     HCF+ V +GL SH    I+P
Sbjct: 406 -GEVRLLVTDLQAWWVGKFAPVFRSISNYELVDLDRDPRVHCFRHVQVGLTSHDDFSIDP 464

Query: 191 TLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
              P   +++DF  F+  AY     +   +      +P+L+++ R AR  R  +N  E+ 
Sbjct: 465 LRAPNGYSMLDFTGFMRAAYGLPRGDVAAAGPSSKRRPRLLVIAR-ART-RRFVNAEEIV 522

Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
           + A +LGF+V + E   +  +A      +SC A++GVHGAGLT+ +F+  G V++QVVP+
Sbjct: 523 RGAVKLGFEVVVSE--GTHEVAPFAELANSCDAIMGVHGAGLTNMVFVPTGGVVIQVVPL 580

Query: 308 GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
           G       YF  P+R +GL YLEY+I  EES+L+ +Y  +  +  +P       W +++ 
Sbjct: 581 GGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLINQYPRDHPIFTDPNGIKSKGWESLKD 640

Query: 367 VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
            YL  Q+V LD+ RFR  LK A    +K   K
Sbjct: 641 AYLDKQDVSLDMKRFRPTLKKAIAHIRKARAK 672


>gi|383100766|emb|CCG47997.1| glycosyltransferase, HGA-like,putative,expressed [Triticum
           aestivum]
          Length = 633

 Score =  223 bits (569), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 130/389 (33%), Positives = 203/389 (52%), Gaps = 21/389 (5%)

Query: 16  CDR-SHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
           CD  S+R  D C ++G         T   VG     P     R++  PYPRK D +AM  
Sbjct: 256 CDMTSNRRIDWCELDGDVRVHGARGTVTLVG----APRAEEWRVR--PYPRKVDPNAMRH 309

Query: 75  VKELTI--TTSAP---PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
           V  +T+  TT+ P       C + H+ PAL+FS  GY GN+FH + D  +PLF+T   + 
Sbjct: 310 VTNITVRSTTTLPGAAEEEECAIKHSVPALLFSDRGYTGNYFHAYTDVILPLFLTAKRY- 368

Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
              +V   ++D    W  K+  +    + + ++++     T CFQ V +GL  H    I+
Sbjct: 369 -GGEVQFLVSDMQMWWVGKFLPVFKSLSNYDLVDLAADNRTRCFQHVQVGLTCHADFSID 427

Query: 190 PTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA 249
           P   P   ++VDF   +   Y      +      +P+L+L+ R +   R  +N  E+ +A
Sbjct: 428 PLRAPNGYSMVDFTRHMRGVYGLPRGLAVPAAGARPRLLLIARAST--RRFVNADEIVRA 485

Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
           A+++GF+V +   E +  +A      ++C AM+GVHGAGLT+ +FL  G V++QVVP+G 
Sbjct: 486 AQKVGFEVVV--SEGTHEVAPFAELANTCDAMLGVHGAGLTNMVFLPTGGVVIQVVPLGG 543

Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VY 368
                 YF  P+R +GL YLEY+I   ES+L E+Y  +  +  +P       W +++ VY
Sbjct: 544 LEFVAGYFRTPSRDMGLRYLEYRIAPAESTLTEQYPPDHPIFTDPDGVKSKGWDSLKQVY 603

Query: 369 LKTQNVKLDIDRFRIYLKD--AYKKAKKL 395
           L  Q+V+LD+ RFR  LK   A+ +A KL
Sbjct: 604 LDKQDVRLDLKRFRPLLKKAIAHIRANKL 632


>gi|308080322|ref|NP_001183140.1| uncharacterized protein LOC100501509 [Zea mays]
 gi|238009592|gb|ACR35831.1| unknown [Zea mays]
          Length = 378

 Score =  223 bits (569), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 131/390 (33%), Positives = 207/390 (53%), Gaps = 28/390 (7%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           +T  RS R    C  +G    D   ST + VG  +          +T+PY R  D  AM+
Sbjct: 1   MTSKRSER----CDASGDIRVDGNRSTIY-VGGIERE-------WRTKPYARYHDPVAMA 48

Query: 74  KVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
            V+E T+       A    +C   H+ P  +FS GG++GN +H++ D  VPLFI+  +H 
Sbjct: 49  HVREYTLKALPEPGAAAAPACTRNHSVPGFLFSNGGFSGNLYHDYTDVLVPLFIS--THQ 106

Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
               V   ++     W  K+     + TRH +I+++     HCF  + +G   H  M ++
Sbjct: 107 FRGRVQFLVSGMKPWWVGKFTPFFRQLTRHDVIDVDKDGEVHCFPRIVVGATFHRDMGVD 166

Query: 190 PTLLPKPKTLVDFQSFLANAYNENTNTSS------SFHHTKPKLVLVNRNARVGRTILNL 243
           P   P   + VDF+  L  A+      +S      +    KP+L++++R  R  R  LN 
Sbjct: 167 PRRAPGHVSAVDFKRALRAAFGLKREAASRGGGGGATGDGKPRLLIISR--RGSRRFLNS 224

Query: 244 REVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
           RE+  AA + GF+V + EP++ T +A     ++S  AMVGVHGAGLT+ +FL  G+VL+Q
Sbjct: 225 REMAVAAGDAGFEVRVAEPDQRTDMAAFAALVNSADAMVGVHGAGLTNMVFLPRGAVLVQ 284

Query: 304 VVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW 362
           VVP G  +WL+ V F++PA  + + Y++Y ++ EESSLV++Y     VL +P A     W
Sbjct: 285 VVPFGGLEWLTGVTFKEPAADMEVSYMDYHVRLEESSLVDQYPRGHQVLTDPYAVHRQGW 344

Query: 363 SNMR-VYLKTQNVKLDIDRFRIYLKDAYKK 391
             ++  YL  QN+++D+DRFR  L++   +
Sbjct: 345 DALKTAYLDKQNIRMDLDRFRATLREVMAR 374


>gi|413925803|gb|AFW65735.1| hypothetical protein ZEAMMB73_558289 [Zea mays]
          Length = 596

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 131/384 (34%), Positives = 211/384 (54%), Gaps = 16/384 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S+   ++C + G+    P  S+   +  T S+ S+ + + K +PYPRK D+  ++ +
Sbjct: 218 CDFSNFRANVCEMRGNIRIHPNGSSVIYMEPT-SSGSKRNEQWKIKPYPRKGDELCLNHI 276

Query: 76  KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            E+T+ +S  AP    C   H  PA++F+  GY GN FH+F D  VPLF T  S F + +
Sbjct: 277 TEVTVKSSIVAP---ECSKYHDVPAVIFALTGYTGNLFHDFTDVLVPLFTTA-SEF-NGE 331

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           V   I D    W RKYA +  + T++P+I+ N     HCF    +GL ++    I+P   
Sbjct: 332 VQFLITDMAIWWTRKYAVVFEKLTKYPLIDFNKDNEVHCFNHAIVGLHAYMEFTIDPLKA 391

Query: 194 PKPKTLVDFQSFLANAYNENTNTSSSFHHT----KPKLVLVNRNARVGRTILNLREVKKA 249
           P   ++VDF  F+   Y+   +   +   T    KP+L++++R     R  LNL+E+   
Sbjct: 392 PHNYSMVDFNQFMRRTYSLPRDAVRAPGETPQTRKPRLLIISRQR--TRMFLNLKEIVAM 449

Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
           AEE+G++V + E   ++++    + ++S   M+GVHGAGLT+ +FL  G+ L+Q+VP G 
Sbjct: 450 AEEIGYEVVVEEANVNSNVGHFAKVVNSVDVMMGVHGAGLTNCVFLPHGATLIQIVPWGA 509

Query: 310 -QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-V 367
              +  + F  PA  +GL Y  Y I   ESSL E+Y  +  + +NP AF    +  +R  
Sbjct: 510 LDGICRIDFGYPAEQMGLRYKHYSIGVHESSLTEQYPLDHEIFRNPLAFHKNGFEFVRQT 569

Query: 368 YLKTQNVKLDIDRFRIYLKDAYKK 391
           ++ TQNV+LD +RFR  L +A  +
Sbjct: 570 FMDTQNVRLDCNRFRPVLLEALDQ 593


>gi|413943929|gb|AFW76578.1| hypothetical protein ZEAMMB73_859887 [Zea mays]
          Length = 478

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 128/397 (32%), Positives = 209/397 (52%), Gaps = 24/397 (6%)

Query: 11  AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
           A  +TC+ S+R  D C + G      +++T + V    +     +  +  +PY RK ++ 
Sbjct: 84  ASRLTCNLSNRHSDTCTMEGDVRIHGRSATVYVVAAASNHRRPENSTVTVRPYTRKWEQE 143

Query: 71  AMSKVKELTITTSAPPNL--------SCGVTHT-SPALVFSAGGYNGNFFHEFMDCFVPL 121
            M++++E+T+  + PP           C   H  +PA+VFS GG   NFFH   D  VPL
Sbjct: 144 TMARIREVTVRYTPPPAPFGSGVIPPRCTAVHDGAPAVVFSTGGCGTNFFHAMSDLVVPL 203

Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
           +IT   +     V L + D   +W  K+  +L   + HP++++++     CF +  +GL 
Sbjct: 204 YITAREY--GGRVHLLVTDYRPEWVAKFRPVLDALSAHPVVDLDSDAAVRCFPAARVGLE 261

Query: 182 SH-GRMVINPTLLPKPKTLVDFQSFLANAYN----ENTNTSSSFHHTKPKLVLVNRNARV 236
           SH G + I+PTL     T+V F+ FL + ++       ++S       P+LV+V R  R 
Sbjct: 262 SHNGMLAIDPTLSRHGYTMVGFRDFLRSVFSLPRPWAWSSSRPVISRPPRLVMVLR--RH 319

Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
            R + N  +   A E+LGFDV    PE+   +    R ++SC  MVGVHGAGLT+ +FL 
Sbjct: 320 SRALTNEADTVAAMEDLGFDVVPARPEDVADMGRFARVVNSCDVMVGVHGAGLTNMVFLP 379

Query: 297 PGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
             + ++Q+VP G  +W     F +P + +GL Y+EY++  EE++L EKY  +  V  +PQ
Sbjct: 380 HNATVVQIVPWGDMKWACWYDFGEPVQGMGLRYVEYEVTAEETTLKEKYARDHPVFTDPQ 439

Query: 356 AF--AGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
           +    G  W+    +L  QNV LDIDRF+  ++  ++
Sbjct: 440 SIHRQGKAWAT---FLDGQNVTLDIDRFKAAMQQVFQ 473


>gi|326514418|dbj|BAJ96196.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326529397|dbj|BAK04645.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 503

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 122/340 (35%), Positives = 193/340 (56%), Gaps = 11/340 (3%)

Query: 59  KTQPYPRKSDKSAMSKVKE--LTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMD 116
           +T+PY R  D  AM+ V+E  L    +  P  +C   H+ P  VFS  G++GN +H++ D
Sbjct: 163 RTKPYARYHDPVAMTHVREFVLKPFPADAPPPACTKNHSVPGFVFSNRGFSGNLYHDYTD 222

Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
             VPLF++        +V   ++D    W  K+  L  + + + ++++NN    HC   +
Sbjct: 223 VLVPLFLSTRKF--KGEVQFLLSDLKPWWVAKFRPLFRQLSNYEVVDVNNDLEVHCVPRI 280

Query: 177 TLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRN 233
            +G   H  M I P+      ++VDF+  L +A+      +S    T   KP+L++++R 
Sbjct: 281 VVGSDFHKDMGIIPSKAAGHVSIVDFKRTLRDAFGLERAAASRGGATGAGKPRLLIISR- 339

Query: 234 ARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSL 293
            +  R  LN RE+  AA  +GFDV I EP++ T ++   R ++S   MVGVHGAGLT+ +
Sbjct: 340 -KNSRRFLNEREMAAAATAMGFDVRIAEPDQHTDMSTFARLVNSADVMVGVHGAGLTNMV 398

Query: 294 FLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLK 352
           FL  G+VL+QVVP G  +WL+ V F+ PA  + + Y++Y ++ EESSL+E+Y  N  VL 
Sbjct: 399 FLPAGAVLIQVVPFGGLEWLTGVTFKNPAADMEVTYMDYNVQLEESSLLEQYPRNHQVLT 458

Query: 353 NPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKK 391
           +P A     W  ++  YL  QNV+LD+D+FR  L+DA  +
Sbjct: 459 DPYAVHKQGWDALKAAYLDKQNVRLDLDKFRATLRDALSR 498


>gi|388512767|gb|AFK44445.1| unknown [Lotus japonicus]
          Length = 365

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 118/319 (36%), Positives = 175/319 (54%), Gaps = 36/319 (11%)

Query: 89  SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARK 148
           +C V H  PA+ FS GGY GN +HEF D  +PL+IT  S   ++ V+  I + +D W  K
Sbjct: 21  TCDVQHDVPAVFFSNGGYTGNVYHEFNDGIIPLYIT--SQRFNKKVVFVILEYHDWWMMK 78

Query: 149 YAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLAN 208
           Y ++L R +  P I+      THCF    +GL  H  + ++  L+   K++VDF++ +  
Sbjct: 79  YGDILSRLSDFPAIDFRGDNRTHCFPEAVVGLKIHDELTVDSKLMEGNKSIVDFRNLMDK 138

Query: 209 AY---------NENTN---------TSSSFHHT-------------KPKLVLVNRNARVG 237
           AY         +E            +SS    T             KPKLV+V+R+    
Sbjct: 139 AYWPRIKGLIQDEEREAQGKLREQISSSPLSETPLIKQEVQENPKKKPKLVIVSRSG--S 196

Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
           R I N   + K AEE+GF V + +P ++T LA  +R ++    M+GVHGA +TH +FL+ 
Sbjct: 197 RAITNENLLVKMAEEIGFFVEVLKPAKTTELAKIYRVLNGSDVMIGVHGAAMTHFMFLKS 256

Query: 298 GSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
           GSV +Q VP+GT W +  Y+ +PAR LGL+Y+ Y+I   ESSL EKY  ND +L++P + 
Sbjct: 257 GSVFIQAVPLGTNWAAETYYGEPARKLGLKYIGYEIHPRESSLYEKYDKNDPILRDPASI 316

Query: 358 AGANWS-NMRVYLKTQNVK 375
               W    ++YL +QNVK
Sbjct: 317 NEKGWEYTKKIYLDSQNVK 335


>gi|357127186|ref|XP_003565265.1| PREDICTED: uncharacterized glycosyltransferase AER61-like
           [Brachypodium distachyon]
          Length = 498

 Score =  222 bits (565), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 138/408 (33%), Positives = 222/408 (54%), Gaps = 55/408 (13%)

Query: 14  ITCDRSHRSYDLC------LINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKS 67
           + C+R     D C       INGSAL    + T       ++T  +   +I+  PY R++
Sbjct: 110 VVCNRDGPFSDTCEVTGDVRINGSAL----SVTLLPSSRQETTSRRQQWKIR--PYSRRT 163

Query: 68  DKSAMSKVKELTIT-------TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
               M+ ++E+T+T        SA P  +C VTH  PA+VF+ GG  GN+FH+F D  VP
Sbjct: 164 ----MTDIREVTVTQLASADEASAAP--ACTVTHEVPAIVFALGGLTGNYFHDFSDALVP 217

Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
           LF+    +    +V+  +++    W  KY  ++ R +++  ++++    T CF+ V++GL
Sbjct: 218 LFVASRRY--GGEVLFLVSNIQPWWLAKYGAVVRRLSKYDAVDLDKDNQTRCFRHVSVGL 275

Query: 181 -------ISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTN---------TSSSFHHTK 224
                  I+ G+   NP  +P      DF +FL   Y+   N         T S+  + K
Sbjct: 276 RLTKEFDIAAGKN--NPLSMP------DFTAFLRETYSLPRNSPTKISLGATGSNDDNQK 327

Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
           P+L+L++R+    R +LN+ E+  AAE  GF VTI +P     ++D  + ++S   ++GV
Sbjct: 328 PRLMLIHRSHY--RKLLNVPEIVAAAESAGFAVTISDPRFDVRISDLAKSVNSFDVLMGV 385

Query: 285 HGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEK 343
           HGAGLT++ FL PG V++QVVP G  + L+   F +P + +GLEYLEY +  +ESSL+E 
Sbjct: 386 HGAGLTNAAFLPPGGVVIQVVPYGKMEGLARTDFGEPVKDMGLEYLEYSVSAQESSLLEM 445

Query: 344 YGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYK 390
            G   LV+K+P+A   + W  +   YL  Q+V+LD++RFR  L  A +
Sbjct: 446 LGPEHLVIKDPEAVHRSGWDKVAEYYLGKQDVRLDVERFRPTLDKAME 493


>gi|242052309|ref|XP_002455300.1| hypothetical protein SORBIDRAFT_03g008020 [Sorghum bicolor]
 gi|241927275|gb|EES00420.1| hypothetical protein SORBIDRAFT_03g008020 [Sorghum bicolor]
          Length = 626

 Score =  222 bits (565), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 129/386 (33%), Positives = 200/386 (51%), Gaps = 20/386 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSV-GHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
           CD S    D+C I G    D   S+F  V     ++  Q     K +PYPRK D++ M +
Sbjct: 236 CDFSDFRSDICDIAGDVRLDANVSSFVVVVDPASASGQQEEEEHKVRPYPRKGDETCMGR 295

Query: 75  VKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
           + E+T+  T  A     C  TH +PA+VFS GGY GN FH+F D  VPL+ T+  +    
Sbjct: 296 ITEITVRATRGAAGAPRCTRTHAAPAVVFSIGGYTGNIFHDFSDVLVPLYNTVRRY--GG 353

Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTIT---HCFQSVTLGLISHGRMVIN 189
           DV L +A+    W  KY  LL   +RH  +++         HCF+   + L +H  ++I 
Sbjct: 354 DVQLVMANSASWWLVKYDRLLRELSRHAPLDLAGAGAAREVHCFRHAVVSLRAHKELIIE 413

Query: 190 PTLLPKPKTLVDFQSFLANAYNENTNTSSSF--------HHTKPKLVLVNRNARVGRTIL 241
                      DF  FL  A     +  +             KP+L++++R+    R +L
Sbjct: 414 RDRSLDGLATPDFTRFLRRALGLPRDAPTRLVVGGGDGTGRKKPRLLIISRHRT--RLLL 471

Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
           NL  V +AAEE+GF+  + E + +  +A     I+SC AMVGVHGAGLT+ +FL PG+ L
Sbjct: 472 NLDAVVRAAEEVGFEAIVNESDVANDIAQVGGLINSCDAMVGVHGAGLTNMMFLPPGAAL 531

Query: 302 MQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGA 360
           +Q+VP G  QW++   +  PA  +GL+Y++Y+I  +ES+L +K+ +   +  NP      
Sbjct: 532 VQIVPWGGLQWMARADYGDPAEAMGLKYIQYEIGVDESTLKDKFPSGHKIFTNPTELHKK 591

Query: 361 NWSNMR-VYLKTQNVKLDIDRFRIYL 385
            +  +R   +  Q++ +D+ RFR  L
Sbjct: 592 GFMFIRQTLMDGQDITVDVARFREVL 617


>gi|326487412|dbj|BAJ89690.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 644

 Score =  221 bits (564), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 128/386 (33%), Positives = 203/386 (52%), Gaps = 18/386 (4%)

Query: 16  CD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
           CD  S+R  D C ++G         T   VG    T      R+K  PYPRK D +AM  
Sbjct: 270 CDLTSNRRIDWCELDGDVRVHGAQGTVTLVG----TAKAEEWRVK--PYPRKVDPNAMRH 323

Query: 75  VKELTITTSAPP--NLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
           V+E+ + ++  P  +  C V H+ PAL+FS  GY GN+FH + D  +PLF+T   +    
Sbjct: 324 VREIAVRSTTLPGGDEECAVKHSVPALLFSDRGYTGNYFHAYTDVILPLFLTAKRY--GG 381

Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
           +V   ++D    W  K+  +    + + ++++     T CF  V +GL  H    I+P  
Sbjct: 382 EVQFLVSDLQMWWIGKFLPVFKSLSNYDLVDLAADNRTRCFAHVQVGLTCHADFSIDPLR 441

Query: 193 LPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEE 252
            P   ++VDF   +   Y      +      +P+L+L+ R +   R  +N  E+ +AA++
Sbjct: 442 APNGYSMVDFTRHMRGTYGLPRGLAVPAAGARPRLLLIARAST--RRFVNADEIVRAAQK 499

Query: 253 LGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWL 312
           +GF+V + E   +  +A      ++C AM+GVHGAGLT+ +FL    V++QVVP+G    
Sbjct: 500 VGFEVVVSE--GTHEVAPFAELANTCDAMLGVHGAGLTNMVFLPTRGVVIQVVPLGGLEF 557

Query: 313 STVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKT 371
              YF  P+R +GL+YLEY+I   ES+L E+Y  +  +  +P       W +++ VYL  
Sbjct: 558 VAGYFRTPSRDMGLKYLEYRISPAESTLTEQYPPDHPIFTDPDGVKSKGWESLKQVYLDK 617

Query: 372 QNVKLDIDRFRIYLKD--AYKKAKKL 395
           Q+V+LD+ RFR  LK   A+ +A KL
Sbjct: 618 QDVRLDLKRFRPLLKKAIAHIRANKL 643


>gi|413936952|gb|AFW71503.1| hypothetical protein ZEAMMB73_965201 [Zea mays]
          Length = 541

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 129/383 (33%), Positives = 204/383 (53%), Gaps = 27/383 (7%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQ----PSLRIKTQPYPRKSD 68
           +  CD S++ YD+C + G A    ++ST   V    ++  +    P+   K+ P+ +K  
Sbjct: 160 ATVCDLSNQRYDICELCGDARAIGRSSTVMYVPQPLTSNGEEWNIPAQSRKSLPWIKKV- 218

Query: 69  KSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
                 VK L  +  AP   SC   HT PA+VF+ GG+ GN +H+  D  VPLF+T    
Sbjct: 219 -----TVKTLKASQQAP---SCTSRHTMPAIVFALGGFTGNVWHDVSDVLVPLFLTARQF 270

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
             DQDV L + D    + +KY+ +L R TRH +I+ +      C+  V +GL SH  + I
Sbjct: 271 --DQDVQLLVTDNRPWFIKKYSAILRRLTRHSVIDFDADDEVRCYPHVIVGLRSHRDLGI 328

Query: 189 NPTLLP-KPKTLVDFQSFLANAYN--------ENTNTSSSFHHTKPKLVLVNRNARVGRT 239
           +P   P K  T+VDF+ F+  AY                     KP+++L+ R     R 
Sbjct: 329 DPDSTPQKNYTMVDFRLFVREAYGLPAPGVDIPYRADKDEPEKKKPRMMLIERGKT--RR 386

Query: 240 ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGS 299
           ++N  +V       GF+V   +P   + L +  R + SC AM+GVHGAGLT+ +FLR G+
Sbjct: 387 LVNTPDVLLGLGWFGFEVVRADPRTHSDLEEFARLVDSCDAMMGVHGAGLTNMVFLRSGA 446

Query: 300 VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
            L+ VVP G ++++  ++  PAR +GL ++ Y +  +ES+L+EKYG N  V+++P+A   
Sbjct: 447 ALVHVVPYGVEFMADGFYGAPARDMGLRHVRYSVGPDESTLLEKYGRNHAVVRDPEAVRS 506

Query: 360 ANWSNM-RVYLKTQNVKLDIDRF 381
             W  +  VY+  Q+V L++ RF
Sbjct: 507 GGWEKVGEVYMTEQDVVLNVTRF 529


>gi|357144077|ref|XP_003573162.1| PREDICTED: uncharacterized protein LOC100845100 [Brachypodium
           distachyon]
          Length = 575

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 131/373 (35%), Positives = 203/373 (54%), Gaps = 16/373 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S+   ++C + G+    P  S   SV + +   S+   + K +PYPRK D+  +S +
Sbjct: 201 CDFSNFRANVCEMRGNIKIHPNAS---SVMYMEPAGSKRDEQWKIKPYPRKGDELCLSHI 257

Query: 76  KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            ELT+ +S  AP    C   H  PA+VF+  GY GN FH+F D  VPLF T  S F + +
Sbjct: 258 TELTVKSSKVAP---ECTKYHNVPAVVFALTGYTGNLFHDFTDVLVPLFTTA-SEF-NGE 312

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           V   I D    W RKY  +  + +++P+I+ N     HCF    +GL ++    I+ +  
Sbjct: 313 VQFLITDMAIWWTRKYHVVFEKLSKYPLIDFNKDDQVHCFNHAIVGLHAYMEFTIDSSKA 372

Query: 194 PKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNRNARVGRTILNLREVKKAA 250
           P   ++VDF  F+   Y+   +  S+      TKP+L++++R     R  LNL+EV   A
Sbjct: 373 PHNYSMVDFNRFMRQTYSLPRDAVSALGEIPKTKPRLLIISRQRT--RMFLNLQEVVAMA 430

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT- 309
           EELGF+V + E   S+ L+   + ++S   M+GVHGAGLT+ +FL   + L+Q+VP G  
Sbjct: 431 EELGFEVVVEEANVSSDLSHFSKAVNSVDVMMGVHGAGLTNCVFLPHNATLIQIVPWGGL 490

Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYL 369
           + +  + F  P+  +GL Y +Y I   ESSL ++Y  +  + KNP AF        + ++
Sbjct: 491 EGVCRIDFGDPSEQMGLRYKQYSISVHESSLTDQYPLDHEIFKNPLAFHKGFEFIRQTFM 550

Query: 370 KTQNVKLDIDRFR 382
             QNV+LD +RFR
Sbjct: 551 DKQNVRLDCNRFR 563


>gi|242052291|ref|XP_002455291.1| hypothetical protein SORBIDRAFT_03g007910 [Sorghum bicolor]
 gi|241927266|gb|EES00411.1| hypothetical protein SORBIDRAFT_03g007910 [Sorghum bicolor]
          Length = 631

 Score =  221 bits (563), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 131/389 (33%), Positives = 204/389 (52%), Gaps = 14/389 (3%)

Query: 16  CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDSTPSQPSLRIKTQPYPRKSDKSA 71
           CD  S+R  D C ++G        ++   V   G  D T    S RIK  PYPRK+D +A
Sbjct: 244 CDLTSNRRIDWCELDGDVRVLGANASVTLVAPPGADDRTFRAESWRIK--PYPRKADPNA 301

Query: 72  M-SKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
           M         + S     +C   +  PALVFS  GY GN+FH F D  +PLF+T   +  
Sbjct: 302 MHVVRVVTVQSVSGGGAPACTDRYDVPALVFSDRGYTGNYFHAFTDVILPLFLTARQYA- 360

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
             +V L + D    W  K++ +    + + +++++      CF+ V +GL SH    I+P
Sbjct: 361 -GEVRLLVTDLQAWWVGKFSPVFKAISNYELVDLDKDPRVQCFRHVQVGLTSHDDFSIDP 419

Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
              P   ++VDF +F+   Y      +++    +P+L+L+ R AR  R  +N  E+ + A
Sbjct: 420 RRAPNGYSMVDFTAFMRATYGLPRGVAAADATKRPRLLLIAR-ART-RRFVNTEEIVRGA 477

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
           E LGF+V +   E +  +A      +SC A++GVHGAGLT+ +F+  G V++QVVP+G  
Sbjct: 478 ERLGFEVVV--SEGTHEVAPFAELANSCDAIMGVHGAGLTNMVFVPTGGVVIQVVPLGGL 535

Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
                YF  P+R +GL YLEY+I  EES+L+++Y  +  +  +P       W +++  YL
Sbjct: 536 EFVAGYFRGPSRDMGLRYLEYRITPEESTLIDQYPRDHPIFTDPNGIKSKGWESLKDAYL 595

Query: 370 KTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
             Q+V+LD+ RFR  LK A    +K   K
Sbjct: 596 DKQDVRLDMKRFRPTLKKAIAHLRKAKAK 624


>gi|342186837|emb|CCC14964.1| putative glycosyl transferase [Triticum aestivum]
          Length = 578

 Score =  221 bits (562), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 133/376 (35%), Positives = 199/376 (52%), Gaps = 16/376 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S+   ++C + G+    P  S   SV + +   S+ +   K +PYPRK D+  +SKV
Sbjct: 204 CDFSNFRANVCEMRGNIRIHPNAS---SVMYMEPASSKRNELWKLKPYPRKGDELCLSKV 260

Query: 76  KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            ELT+ +S  AP    C   H  P +VF+  GY GN FH+F D  VPLF T  S F + +
Sbjct: 261 TELTVKSSKVAP---ECTKYHNVPVVVFALTGYTGNLFHDFTDALVPLFTTA-SEF-NGE 315

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           V   I D    W RKY  +  + + +P+I+ N  T  HC +   +GL ++    I+P+  
Sbjct: 316 VQFLITDMAIWWTRKYHVVFKKLSNYPLIDFNKDTDVHCVKHAIVGLHAYMEFTIDPSKA 375

Query: 194 PKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
           P   T+VDF  F+   Y    E  +        KP+L++++R     R  LNL E+   A
Sbjct: 376 PHNYTMVDFNRFMRRTYELPREAVSALGEIPKAKPRLLIISRQR--TRMFLNLPEIIAMA 433

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI-GT 309
           E LGF+V + E   S+ L+   + ++S   M+GVHGAGLT+ +FL   + L+Q+VP  G 
Sbjct: 434 EGLGFEVVVEEANVSSDLSQFSKVVNSVDVMMGVHGAGLTNCVFLPHNATLIQIVPWGGI 493

Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYL 369
           + +  + F  PA  +GL Y +Y I   ESSL ++Y  +  + KNP AF          ++
Sbjct: 494 EGVCRIDFGDPAEQMGLRYKQYSIAVHESSLTDQYPLDHEIFKNPLAFHKGFEFIKETFM 553

Query: 370 KTQNVKLDIDRFRIYL 385
             QNV+LD +RFR  L
Sbjct: 554 DKQNVRLDCNRFRPVL 569


>gi|326524055|dbj|BAJ97038.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326528547|dbj|BAJ93455.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326533864|dbj|BAJ93705.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 582

 Score =  220 bits (561), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 131/373 (35%), Positives = 199/373 (53%), Gaps = 16/373 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S+   ++C + G+    P  S   SV + +   S+ +   K +PYPRK D+  +SKV
Sbjct: 208 CDFSNFRANVCEMRGNIRIHPNAS---SVMYMEPASSKRNELWKLKPYPRKGDELCLSKV 264

Query: 76  KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            ELT+ +S  AP    C   H  P +VF+  GY GN FH+F D  VPLF T  S F + +
Sbjct: 265 TELTVKSSKVAP---ECTKYHNVPVVVFALTGYTGNLFHDFTDALVPLFTTA-SEF-NGE 319

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           V   I D    W RKY  +  + +++P+I+ N  T  HC +   +GL ++    I+P+  
Sbjct: 320 VQFLITDMAIWWTRKYHVVFKKLSKYPLIDFNKDTDVHCVKHAIVGLHAYMEFTIDPSKA 379

Query: 194 PKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
           P   T+VDF  F+   Y    E  +        KP+L++++R     R  LNL E+   A
Sbjct: 380 PHNYTMVDFNRFMRRTYELPREAVSALGEIPKAKPRLLIISRQR--TRMFLNLPEIIAMA 437

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI-GT 309
           E LGF+V + E   S+ L+   + ++S   M+GVHGAGLT+ +FL   + L+Q+VP  G 
Sbjct: 438 EGLGFEVVVEEANVSSDLSQFSKVVNSVDVMMGVHGAGLTNCVFLPHNATLIQIVPWGGI 497

Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYL 369
           + +  + F  PA  +GL Y +Y I   ESSL ++Y  +  + KNP AF          ++
Sbjct: 498 EGVCRIDFGDPAEQMGLRYKQYSIAVHESSLTDQYPLDHEIFKNPLAFHKGFEFIKETFM 557

Query: 370 KTQNVKLDIDRFR 382
             QNV+LD +RF+
Sbjct: 558 DKQNVRLDCNRFK 570


>gi|115445881|ref|NP_001046720.1| Os02g0330200 [Oryza sativa Japonica Group]
 gi|46390159|dbj|BAD15592.1| putative HGA6 [Oryza sativa Japonica Group]
 gi|113536251|dbj|BAF08634.1| Os02g0330200 [Oryza sativa Japonica Group]
 gi|218190637|gb|EEC73064.1| hypothetical protein OsI_07022 [Oryza sativa Indica Group]
 gi|222622748|gb|EEE56880.1| hypothetical protein OsJ_06523 [Oryza sativa Japonica Group]
          Length = 583

 Score =  220 bits (561), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 128/377 (33%), Positives = 202/377 (53%), Gaps = 17/377 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S+   ++C + G+    P  S   SV + +   S+     K +PYPRK D+  +  +
Sbjct: 208 CDFSNFRANVCEMRGNIRIHPNAS---SVMYMEPASSKREEIWKVKPYPRKGDELCLGHI 264

Query: 76  KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            E+T+ +S  AP    C   H  PA+VF+  GY GN FH+F D  VPLF T  S F + +
Sbjct: 265 TEITVKSSKVAP---ECSKYHNVPAVVFALTGYTGNLFHDFTDVLVPLFTTA-SEF-NGE 319

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           V   I D    W RKY  +  + +++P+I+ NN    HCF+   +GL ++    I+ +  
Sbjct: 320 VQFLITDMAIWWTRKYKVVFDKLSKYPLIDFNNDDQVHCFKHAIVGLHAYMEFTIDSSKA 379

Query: 194 PKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
           P   ++VDF  F+   Y+   +           KP+L++++R     R  LNL E+   A
Sbjct: 380 PHNYSMVDFNRFMRRTYSLPRDFVTALGEIPKAKPRLLIISRQRT--RMFLNLNEIVAMA 437

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT- 309
           EE+G++V + E   S+ L+   + ++S   M+GVHGAGLT+ +FL   + L+Q+VP G  
Sbjct: 438 EEIGYEVVVEEANVSSDLSHFGKVVNSVDVMMGVHGAGLTNCVFLPQNATLIQIVPWGGL 497

Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VY 368
            W+S + F  PA  +GL Y +Y I   ESSL ++Y  +  +  NP +F    +  +R  +
Sbjct: 498 DWISRIDFGNPAEQMGLRYKQYSIGVHESSLTDQYPLDHEIFTNPLSFHKHGFEFIRQTF 557

Query: 369 LKTQNVKLDIDRFRIYL 385
           +  QNVKLD +RF+  L
Sbjct: 558 MDKQNVKLDCNRFKPVL 574


>gi|300681544|emb|CBH32641.1| glycosyltransferase, HGA-like, putative,expressed [Triticum
           aestivum]
          Length = 638

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 127/388 (32%), Positives = 203/388 (52%), Gaps = 20/388 (5%)

Query: 16  CD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
           CD  S+R  D C ++G        +T   VG   +       RI+  PYPRK D +AM  
Sbjct: 261 CDMTSNRRIDWCELDGDVRVHGAKATVTMVGAARAE----EWRIR--PYPRKVDPNAMRH 314

Query: 75  VKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
           V  +T+    T        C + H+ PAL+FS  GY GN+FH + D  +PLF+T   +  
Sbjct: 315 VTNITVRSTMTLPGAGEGECAIKHSVPALLFSDRGYTGNYFHAYTDVILPLFLTAKQY-- 372

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
             +V L ++D    W  K+  +    + + ++++     T CF+ V +GL  H    I+P
Sbjct: 373 GGEVQLLVSDMQMWWIGKFLPVFKSLSNYDLVDLAADNRTRCFRHVQVGLTCHADFSIDP 432

Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
              P   ++VDF   +   Y      +      +P+L+L+ R +   R  +N  ++ +AA
Sbjct: 433 LRAPNGYSMVDFTKHMRGVYGLPRGLAVPAAGARPRLLLIARAST--RRFVNADDIVRAA 490

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
           +++GF+V + E   +  +A      ++C  M+GVHGAGLT+ +FL  G V++QVVP+G  
Sbjct: 491 QKVGFEVVVSE--GTHEVAPFAELANTCDVMLGVHGAGLTNMVFLPTGGVVIQVVPLGGL 548

Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
                YF  P+R +GL+YLEY+I   ES+L E+Y A+  +  +P       W +++ VYL
Sbjct: 549 EFVAGYFRAPSRDMGLKYLEYRIAPAESTLTEQYPADHPIFTDPDGVKSKGWESLKQVYL 608

Query: 370 KTQNVKLDIDRFRIYLKD--AYKKAKKL 395
             Q+V+LD+ RFR  LK   A+ +A KL
Sbjct: 609 DKQDVRLDLKRFRPLLKKAIAHIRANKL 636


>gi|326500646|dbj|BAJ94989.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 503

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 121/340 (35%), Positives = 192/340 (56%), Gaps = 11/340 (3%)

Query: 59  KTQPYPRKSDKSAMSKVKE--LTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMD 116
           +T+PY R  D  AM+ V+   L    +  P  +C   H+ P  VFS  G++GN +H++ D
Sbjct: 163 RTKPYARYHDPVAMTHVRGFVLKPFPADAPPPACTKNHSVPGFVFSNRGFSGNLYHDYTD 222

Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
             VPLF++        +V   ++D    W  K+  L  + + + ++++NN    HC   +
Sbjct: 223 VLVPLFLSTRKF--KGEVQFLLSDLKPWWVAKFRPLFRQLSNYEVVDVNNDLEVHCVPRI 280

Query: 177 TLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRN 233
            +G   H  M I P+      ++VDF+  L +A+      +S    T   KP+L++++R 
Sbjct: 281 VVGSDFHKDMGIIPSKAAGHVSIVDFKRTLRDAFGLERAAASRGGATGAGKPRLLIISR- 339

Query: 234 ARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSL 293
            +  R  LN RE+  AA  +GFDV I EP++ T ++   R ++S   MVGVHGAGLT+ +
Sbjct: 340 -KNSRRFLNEREMAAAATAMGFDVRIAEPDQHTDMSTFARLVNSADVMVGVHGAGLTNMV 398

Query: 294 FLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLK 352
           FL  G+VL+QVVP G  +WL+ V F+ PA  + + Y++Y ++ EESSL+E+Y  N  VL 
Sbjct: 399 FLPAGAVLIQVVPFGGLEWLTGVTFKNPAADMEVTYMDYNVQLEESSLLEQYPRNHQVLT 458

Query: 353 NPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKK 391
           +P A     W  ++  YL  QNV+LD+D+FR  L+DA  +
Sbjct: 459 DPYAVHKQGWDALKAAYLDKQNVRLDLDKFRATLRDALSR 498


>gi|77554062|gb|ABA96858.1| HGA6, putative, expressed [Oryza sativa Japonica Group]
 gi|218186635|gb|EEC69062.1| hypothetical protein OsI_37919 [Oryza sativa Indica Group]
          Length = 584

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 132/395 (33%), Positives = 206/395 (52%), Gaps = 31/395 (7%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S    D+C + G    D   S F  V         P+ +++  PYPRK D ++M +V
Sbjct: 187 CDLSDHRTDVCDLAGDIRMDANASAFVVVVDPAVGADGPTYKVR--PYPRKGDATSMGRV 244

Query: 76  KELTITTSAP--PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            E+T+ T+A   P   C  TH +PA+VFS  GY GN FH+F D  VPL+ T   +    D
Sbjct: 245 TEITVRTTAAGAPPPRCTTTHAAPAVVFSISGYTGNLFHDFTDVIVPLYNTAARYC--GD 302

Query: 134 VILAIADCN---DQWARKYAELLPRFTRHPIINI-----NNQTITHCFQSVTLGLISHGR 185
           V L + D N    +W  +Y  +L   +RH  +++           HCF    +GL +HG 
Sbjct: 303 VQLVVTDGNAATRRWLARYGAVLRGLSRHVPLDLAAEAAAGGGEVHCFGHTVVGLRAHGE 362

Query: 186 MVINPTLLPKPKTLVDFQSFLANAYN--ENTNTSSSFHHT-------KPKLVLVNRNARV 236
           ++I+    P    + DF  FL  A +   +  T     H        +P+L++++R  R 
Sbjct: 363 LIIDRERSPDGLGMPDFTRFLRRALSLPRDAPTRPGGGHGDATKPQPRPRLLIISR--RG 420

Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSF----RFIHSCHAMVGVHGAGLTHS 292
            R +LN   V +AAE++GF+    E + + +  D      R ++S  A+VGVHGAGLT+ 
Sbjct: 421 TRLLLNTDAVARAAEQVGFEAVASELDMAGADHDDVARVARLVNSFDAVVGVHGAGLTNM 480

Query: 293 LFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
           +FL PG+  +Q+VP G  +WL+   F +PA  +GL Y++Y++   ES+L +KY  +  + 
Sbjct: 481 VFLPPGAAAVQIVPWGGLRWLARADFGEPAVAMGLRYIQYEVAAGESTLKDKYPRDHEIF 540

Query: 352 KNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYL 385
            NP A     ++ MR  +L  Q++ +DIDRF+  L
Sbjct: 541 TNPTALHKKGFTFMRHTFLNGQDIIVDIDRFKPVL 575


>gi|219362665|ref|NP_001137085.1| uncharacterized protein LOC100217259 [Zea mays]
 gi|194698284|gb|ACF83226.1| unknown [Zea mays]
 gi|414876764|tpg|DAA53895.1| TPA: hypothetical protein ZEAMMB73_695029 [Zea mays]
          Length = 547

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 138/418 (33%), Positives = 206/418 (49%), Gaps = 32/418 (7%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTF--FSVGHTDSTPSQPSLRIKTQP 62
           P +  P  +I CDRS    D+C + G    D  +     F      S P  P+   + +P
Sbjct: 117 PCSSLPNHTICCDRSDYHSDVCFMAGDVRTDAASLALLLFPPRAASSAPEPPAAEERIRP 176

Query: 63  YPRKSDKSAMSKVKELTITTSAPPNLS-----CGVTHTSPALVFSAGGYNGNFFHEFMDC 117
           Y RK D      + E+T+  + P   +     C V H +P LV +AGGY+ N FH F D 
Sbjct: 177 YTRKWDAYITKTIHEVTLRVARPEEAAAAAHRCDVRHDAPVLVVTAGGYSHNMFHVFNDG 236

Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
           F+PL++T   H   +  +LA+   + +WA  Y E+L   +R+  I++     THCF    
Sbjct: 237 FLPLWLTAQ-HL-RRRAVLAVLSYSPRWAGTYGEILAGLSRYHAIDLLRDKRTHCFPGAV 294

Query: 178 LGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNEN--------TNTSSS-------FHH 222
           +G   H  + +N T L   KT+ DF  FLA  Y+++          +SSS       +  
Sbjct: 295 VGTRYHDYLAVNSTRLRDNKTIADFHDFLAGVYSDDDVRNDKAAGGSSSSRRPEMAWYER 354

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
            +P+L +V+R  R  R + N   V + A  +GFDV I E      L+  +  + S  A+V
Sbjct: 355 RRPRLGIVSRKGR--RVVENQAAVAQLAASVGFDVDIMETANGAPLSAVYASVSSYDALV 412

Query: 283 GVHGAGLTHSLFLRPG-SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLV 341
           GVHGA LT  LFLRPG + L Q+ P+G   LS   F  PA  +GL Y++Y +   ESSL 
Sbjct: 413 GVHGADLTTFLFLRPGRAALAQIAPLGITMLSRNLFGVPAARMGLHYVQYDVSARESSLS 472

Query: 342 EKYGANDLVLKNP----QAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
            +Y  + +V+ +P    +      W  +  VYL+ QNV LD+ RFR  L   + + K+
Sbjct: 473 RRYPLDHVVVADPARARREQGKQEWELVEHVYLRGQNVSLDLGRFRETLARIHSRLKE 530


>gi|223942755|gb|ACN25461.1| unknown [Zea mays]
          Length = 529

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 128/382 (33%), Positives = 203/382 (53%), Gaps = 27/382 (7%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQ----PSLRIKTQPYPRKSD 68
           +  CD S++ YD+C + G A    ++ST   V    ++  +    P+   K+ P+ +K  
Sbjct: 160 ATVCDLSNQRYDICELCGDARAIGRSSTVMYVPQPLTSNGEEWNIPAQSRKSLPWIKKV- 218

Query: 69  KSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
                 VK L  +  AP   SC   HT PA+VF+ GG+ GN +H+  D  VPLF+T    
Sbjct: 219 -----TVKTLKASQQAP---SCTSRHTMPAIVFALGGFTGNVWHDVSDVLVPLFLTARQF 270

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
             DQDV L + D    + +KY+ +L R TRH +I+ +      C+  V +GL SH  + I
Sbjct: 271 --DQDVQLLVTDNRPWFIKKYSAILRRLTRHSVIDFDADDEVRCYPHVIVGLRSHRDLGI 328

Query: 189 NPTLLP-KPKTLVDFQSFLANAYN--------ENTNTSSSFHHTKPKLVLVNRNARVGRT 239
           +P   P K  T+VDF+ F+  AY                     KP+++L+ R     R 
Sbjct: 329 DPDSTPQKNYTMVDFRLFVREAYGLPAPGVDIPYRADKDEPEKKKPRMMLIERGKT--RR 386

Query: 240 ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGS 299
           ++N  +V       GF+V   +P   + L +  R + SC AM+GVHGAGLT+ +FLR G+
Sbjct: 387 LVNTPDVLLGLGWFGFEVVRADPRTHSDLEEFARLVDSCDAMMGVHGAGLTNMVFLRSGA 446

Query: 300 VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
            L+ VVP G ++++  ++  PAR +GL ++ Y +  +ES+L+EKYG N  V+++P+A   
Sbjct: 447 ALVHVVPYGVEFMADGFYGAPARDMGLRHVRYSVGPDESTLLEKYGRNHAVVRDPEAVRS 506

Query: 360 ANWSNM-RVYLKTQNVKLDIDR 380
             W  +  VY+  Q+V L++ R
Sbjct: 507 GGWEKVGEVYMTEQDVVLNVTR 528


>gi|413947215|gb|AFW79864.1| hypothetical protein ZEAMMB73_561794 [Zea mays]
          Length = 628

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 132/393 (33%), Positives = 210/393 (53%), Gaps = 18/393 (4%)

Query: 16  CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDSTPSQPSLRIKTQPYPRKSDKSA 71
           CD  S+R  D C ++G        ++   V   G  + T    S RIK  PYPRK+D +A
Sbjct: 237 CDVTSNRRIDWCELDGDVRVVGANASVTLVAPPGADNRTFRAESWRIK--PYPRKADPNA 294

Query: 72  MSKVKELTITT--SAPPNLSC--GVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
           M  V+ LT+ +  S     +C  G     PALVFS  GY GN+FH F D  +PLF+T   
Sbjct: 295 MRFVRVLTVRSVASGSGEAACTDGGDDDVPALVFSDRGYTGNYFHAFTDVILPLFLTARR 354

Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
           +    +V L +AD    W  K+  +    +++ +++++      CF+ V +GL SH    
Sbjct: 355 YA--GEVRLLVADLQPWWVGKFLPVFRSISKYELVDLDRDPRVRCFRHVQVGLTSHADFS 412

Query: 188 INPTLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
           I+P   P   +++DF  F+  AY     +  ++    +P+L++V R AR  R  +N  E+
Sbjct: 413 IDPRRAPNGYSMLDFTRFMRAAYGLPRGDVVAAAPARRPRLLVVAR-ART-RRFVNTEEI 470

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            + AE +GF+  +   E +  +A      + C A++GVHGAGLT+ +FL  G V++QVVP
Sbjct: 471 VRGAEAVGFEAVV--SEGTHEVAPFAELANGCDAIMGVHGAGLTNMVFLPTGGVVIQVVP 528

Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
           +G       YF  P+  +GL YLEY+I+ EES+LV++Y  +  +  +P       W +++
Sbjct: 529 LGGLEFVAGYFRGPSVDMGLRYLEYRIEPEESTLVDQYPRDHPIFTDPNGIKSKGWESLK 588

Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
             YL  Q+V+LD++RFR  L++A    +K   K
Sbjct: 589 DAYLDKQDVRLDMERFRPTLQEAIAHLRKAKAK 621


>gi|212275101|ref|NP_001130987.1| hypothetical protein [Zea mays]
 gi|194690638|gb|ACF79403.1| unknown [Zea mays]
 gi|224030713|gb|ACN34432.1| unknown [Zea mays]
 gi|413947219|gb|AFW79868.1| hypothetical protein ZEAMMB73_561794 [Zea mays]
          Length = 625

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 132/393 (33%), Positives = 210/393 (53%), Gaps = 18/393 (4%)

Query: 16  CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDSTPSQPSLRIKTQPYPRKSDKSA 71
           CD  S+R  D C ++G        ++   V   G  + T    S RIK  PYPRK+D +A
Sbjct: 234 CDVTSNRRIDWCELDGDVRVVGANASVTLVAPPGADNRTFRAESWRIK--PYPRKADPNA 291

Query: 72  MSKVKELTITT--SAPPNLSC--GVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
           M  V+ LT+ +  S     +C  G     PALVFS  GY GN+FH F D  +PLF+T   
Sbjct: 292 MRFVRVLTVRSVASGSGEAACTDGGDDDVPALVFSDRGYTGNYFHAFTDVILPLFLTARR 351

Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
           +    +V L +AD    W  K+  +    +++ +++++      CF+ V +GL SH    
Sbjct: 352 YA--GEVRLLVADLQPWWVGKFLPVFRSISKYELVDLDRDPRVRCFRHVQVGLTSHADFS 409

Query: 188 INPTLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
           I+P   P   +++DF  F+  AY     +  ++    +P+L++V R AR  R  +N  E+
Sbjct: 410 IDPRRAPNGYSMLDFTRFMRAAYGLPRGDVVAAAPARRPRLLVVAR-ART-RRFVNTEEI 467

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            + AE +GF+  +   E +  +A      + C A++GVHGAGLT+ +FL  G V++QVVP
Sbjct: 468 VRGAEAVGFEAVV--SEGTHEVAPFAELANGCDAIMGVHGAGLTNMVFLPTGGVVIQVVP 525

Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
           +G       YF  P+  +GL YLEY+I+ EES+LV++Y  +  +  +P       W +++
Sbjct: 526 LGGLEFVAGYFRGPSVDMGLRYLEYRIEPEESTLVDQYPRDHPIFTDPNGIKSKGWESLK 585

Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
             YL  Q+V+LD++RFR  L++A    +K   K
Sbjct: 586 DAYLDKQDVRLDMERFRPTLQEAIAHLRKAKAK 618


>gi|413947218|gb|AFW79867.1| hypothetical protein ZEAMMB73_561794 [Zea mays]
          Length = 641

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 132/393 (33%), Positives = 210/393 (53%), Gaps = 18/393 (4%)

Query: 16  CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDSTPSQPSLRIKTQPYPRKSDKSA 71
           CD  S+R  D C ++G        ++   V   G  + T    S RIK  PYPRK+D +A
Sbjct: 250 CDVTSNRRIDWCELDGDVRVVGANASVTLVAPPGADNRTFRAESWRIK--PYPRKADPNA 307

Query: 72  MSKVKELTITT--SAPPNLSC--GVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
           M  V+ LT+ +  S     +C  G     PALVFS  GY GN+FH F D  +PLF+T   
Sbjct: 308 MRFVRVLTVRSVASGSGEAACTDGGDDDVPALVFSDRGYTGNYFHAFTDVILPLFLTARR 367

Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
           +    +V L +AD    W  K+  +    +++ +++++      CF+ V +GL SH    
Sbjct: 368 YA--GEVRLLVADLQPWWVGKFLPVFRSISKYELVDLDRDPRVRCFRHVQVGLTSHADFS 425

Query: 188 INPTLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
           I+P   P   +++DF  F+  AY     +  ++    +P+L++V R AR  R  +N  E+
Sbjct: 426 IDPRRAPNGYSMLDFTRFMRAAYGLPRGDVVAAAPARRPRLLVVAR-ART-RRFVNTEEI 483

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            + AE +GF+  +   E +  +A      + C A++GVHGAGLT+ +FL  G V++QVVP
Sbjct: 484 VRGAEAVGFEAVV--SEGTHEVAPFAELANGCDAIMGVHGAGLTNMVFLPTGGVVIQVVP 541

Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
           +G       YF  P+  +GL YLEY+I+ EES+LV++Y  +  +  +P       W +++
Sbjct: 542 LGGLEFVAGYFRGPSVDMGLRYLEYRIEPEESTLVDQYPRDHPIFTDPNGIKSKGWESLK 601

Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
             YL  Q+V+LD++RFR  L++A    +K   K
Sbjct: 602 DAYLDKQDVRLDMERFRPTLQEAIAHLRKAKAK 634


>gi|242061682|ref|XP_002452130.1| hypothetical protein SORBIDRAFT_04g020210 [Sorghum bicolor]
 gi|241931961|gb|EES05106.1| hypothetical protein SORBIDRAFT_04g020210 [Sorghum bicolor]
          Length = 542

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 121/378 (32%), Positives = 208/378 (55%), Gaps = 25/378 (6%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQ----PSLRIKTQPYPRKSDKSA 71
           CD S++ +D+C + G+A    ++ST   V  + ++  +    P+   K+ P+ +K     
Sbjct: 166 CDLSNQRFDICELCGNARTIGRSSTVMYVPQSLTSNGEEWNIPAQSRKSLPWIKKV---- 221

Query: 72  MSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
              VK L  +   P    C   H  PA+VF+ GG+  N +H+  D  VPLF+T      D
Sbjct: 222 --TVKTLKASQQVP---RCTSRHAIPAIVFALGGFTANVWHDVSDVLVPLFLTAQQF--D 274

Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
           +DV L I +    + +KY+ +  R TRH II+ +      C+  V +GL SH  + I+P 
Sbjct: 275 RDVQLLITNNQPWFIKKYSAIFHRLTRHNIIDFDADDEVRCYPHVIVGLRSHRDLGIDPN 334

Query: 192 LLPKPKTLVDFQSFLANAYNENT-------NTSSSFHHTKPKLVLVNRNARVGRTILNLR 244
             P+  T++DF+ F+  AY                    KP+++L++R     R  +N+ 
Sbjct: 335 STPQNYTMMDFRLFVREAYGLPAPEVDIPYRVDKDDPEKKPRIMLIDRGKT--RRFMNMP 392

Query: 245 EVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
           +V +  +  GF+V   +P   ++L +  R + SC AM+GVHGAGLT+ +FLR G+V++ +
Sbjct: 393 DVLRGLDWFGFEVVRADPRIDSTLDEFVRLVDSCDAMMGVHGAGLTNMVFLRSGAVVVHI 452

Query: 305 VPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
           VP G ++++  ++  PAR +GL +++Y I  +ES+L+EKYG N +V+K+P+A   + W  
Sbjct: 453 VPYGVEFMANGFYGAPARDMGLRHVQYSISPDESTLLEKYGENHMVIKDPEAIRNSGWEK 512

Query: 365 M-RVYLKTQNVKLDIDRF 381
           +  +Y+  Q+V L++ RF
Sbjct: 513 VGELYMTKQDVVLNMTRF 530


>gi|413947216|gb|AFW79865.1| hypothetical protein ZEAMMB73_561794 [Zea mays]
          Length = 532

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 132/393 (33%), Positives = 210/393 (53%), Gaps = 18/393 (4%)

Query: 16  CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDSTPSQPSLRIKTQPYPRKSDKSA 71
           CD  S+R  D C ++G        ++   V   G  + T    S RIK  PYPRK+D +A
Sbjct: 141 CDVTSNRRIDWCELDGDVRVVGANASVTLVAPPGADNRTFRAESWRIK--PYPRKADPNA 198

Query: 72  MSKVKELTITT--SAPPNLSC--GVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
           M  V+ LT+ +  S     +C  G     PALVFS  GY GN+FH F D  +PLF+T   
Sbjct: 199 MRFVRVLTVRSVASGSGEAACTDGGDDDVPALVFSDRGYTGNYFHAFTDVILPLFLTARR 258

Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
           +    +V L +AD    W  K+  +    +++ +++++      CF+ V +GL SH    
Sbjct: 259 YA--GEVRLLVADLQPWWVGKFLPVFRSISKYELVDLDRDPRVRCFRHVQVGLTSHADFS 316

Query: 188 INPTLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
           I+P   P   +++DF  F+  AY     +  ++    +P+L++V R AR  R  +N  E+
Sbjct: 317 IDPRRAPNGYSMLDFTRFMRAAYGLPRGDVVAAAPARRPRLLVVAR-ART-RRFVNTEEI 374

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            + AE +GF+  +   E +  +A      + C A++GVHGAGLT+ +FL  G V++QVVP
Sbjct: 375 VRGAEAVGFEAVV--SEGTHEVAPFAELANGCDAIMGVHGAGLTNMVFLPTGGVVIQVVP 432

Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
           +G       YF  P+  +GL YLEY+I+ EES+LV++Y  +  +  +P       W +++
Sbjct: 433 LGGLEFVAGYFRGPSVDMGLRYLEYRIEPEESTLVDQYPRDHPIFTDPNGIKSKGWESLK 492

Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
             YL  Q+V+LD++RFR  L++A    +K   K
Sbjct: 493 DAYLDKQDVRLDMERFRPTLQEAIAHLRKAKAK 525


>gi|357127184|ref|XP_003565264.1| PREDICTED: uncharacterized protein LOC100843951 [Brachypodium
           distachyon]
          Length = 621

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 136/386 (35%), Positives = 210/386 (54%), Gaps = 23/386 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S +  D+C   G    D  TS+F  V    ++P    +R    PYPRK D++ M +V
Sbjct: 238 CDFSDQRSDVCDFTGDIRMDANTSSFVLVVGQGTSPQSHKVR----PYPRKGDETCMGRV 293

Query: 76  KEL----TITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
            E+    T T S+PP L C  TH +PA+ FS GGY GN FH+F D  VPLF T++ +   
Sbjct: 294 TEINVRTTTTASSPPPL-CTQTHDAPAVTFSIGGYTGNIFHDFSDVIVPLFNTVHKYA-- 350

Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIINI---NNQTITHCFQSVTLGLISHGRMVI 188
            DV L +A+    W  KY +LL   +R+  +++     +   HCF+   + L +H  ++I
Sbjct: 351 GDVQLVMANVAPWWLIKYDKLLRSISRYAPLDLAAAGTRGEVHCFRHAVVSLRAHRELII 410

Query: 189 NPTLLPKPKTLVDFQSFLANAY----NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLR 244
                       DF  FL +A     +  TN +      KP+L++++R+    R +LNL 
Sbjct: 411 EKDRSLDGLATPDFTRFLCSALSLPRDAPTNIADGSGR-KPRLLIISRHRT--RILLNLA 467

Query: 245 EVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
            V  AAEE GF+  + E + +  +++  R I+S   +VGVHGAGLT+ +FL PG+ ++QV
Sbjct: 468 AVVLAAEEAGFEAVVNESDVANDISEVGRLINSADVLVGVHGAGLTNMMFLPPGATMVQV 527

Query: 305 VPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS 363
           VP G  QW++ + +  PA  +GL Y++Y+I  +ESSL EKY  +  +  NP       + 
Sbjct: 528 VPWGGLQWIARMDYGDPAEAMGLRYVQYEITVDESSLKEKYPRDHEIFTNPTGLHKKGFG 587

Query: 364 NMR-VYLKTQNVKLDIDRFRIYLKDA 388
            MR   +  QN+ +D+ RF+  L++A
Sbjct: 588 FMRQTLMDGQNITVDVARFKGVLQEA 613


>gi|31339690|gb|AAP49177.1| HGA6 [Hordeum vulgare subsp. vulgare]
          Length = 637

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 130/378 (34%), Positives = 198/378 (52%), Gaps = 22/378 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S R  D+C   G    D   STFF V    +  S      K +PYPRK D + M +V
Sbjct: 265 CDFSDRRSDVCDFTGDIRMDANASTFFLVVDAATAASH-----KVRPYPRKGDPTCMGRV 319

Query: 76  KELTITT---SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
            E+T+ T   S+PP   C  TH  PA+ FS GGY GN FH+F D  VPL+ T++ +    
Sbjct: 320 PEITMRTTSSSSPPE--CTRTHGVPAVTFSIGGYTGNIFHDFSDVLVPLYNTVHRY--RG 375

Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIIN---INNQTITHCFQSVTLGLISHGRMVIN 189
           DV L +A+    W  KY +LL   +RH  ++   +  +  THCF+   + L +H  ++I 
Sbjct: 376 DVQLVMANVAPWWLVKYDKLLRELSRHAPLDLAAVAAKGETHCFRHAVVSLRAHRELIIE 435

Query: 190 PTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREV 246
               P      DF  F+  A +   +  +        KP+L+++ R+    R +LNL ++
Sbjct: 436 RDRSPDGLATPDFTRFIRRALSLPRDAPTRLADGTGRKPRLLIIARHRT--RILLNLGDM 493

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            + AEE GF+  + E +    ++     I+S   +VGVHGAGLT+ + L PG+ ++QVVP
Sbjct: 494 MRVAEEAGFEAAVSELDVGDPISRVGAEINSADVLVGVHGAGLTNMMSLAPGATMVQVVP 553

Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
            G  QW + + +  PA  LGL Y++Y+I  +ESSL +KY     +  +P +     +  M
Sbjct: 554 WGGLQWFARMDYGDPAEALGLRYVQYEIGVDESSLKDKYPRGHKIFTDPTSLHKKGFGFM 613

Query: 366 -RVYLKTQNVKLDIDRFR 382
            R  +  QN+ +D+ RFR
Sbjct: 614 RRTLMDGQNITIDLGRFR 631


>gi|56409846|emb|CAI30071.1| glycosyltransferase [Triticum aestivum]
          Length = 468

 Score =  218 bits (555), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 132/376 (35%), Positives = 199/376 (52%), Gaps = 16/376 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S+   ++C + G+    P  S   SV + +   S+ +   K +PYPRK D+  +SKV
Sbjct: 94  CDFSNFRANVCEMRGNIRIHPNAS---SVMYMEPASSKRNELWKLKPYPRKGDELCLSKV 150

Query: 76  KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            ELT+ +S  AP    C   H  P +VF+  GY GN FH+F D  VPLF T  S F + +
Sbjct: 151 TELTVKSSKVAP---ECTKYHNVPVVVFALTGYTGNLFHDFTDALVPLFTTA-SEF-NGE 205

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           V   I D    W RKY  +  + + +P+I+ N  T  HC +   +GL ++    I+P+  
Sbjct: 206 VQFLITDMAIWWTRKYHVVFKKLSNYPLIDFNKDTDVHCAKHAIVGLHAYMEFTIDPSKA 265

Query: 194 PKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
           P   T+VDF  F+   Y    E  +        KP+L++++R     R  LNL E+   A
Sbjct: 266 PHNYTMVDFNRFMRRTYELPREAVSALGEIPKAKPRLLIISRQRT--RMFLNLPEIIAMA 323

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI-GT 309
           E LGF+V + E   S+ L+   + ++S   M+GVHGAGLT+ +FL   + L+Q+VP  G 
Sbjct: 324 EGLGFEVVVEEANVSSDLSQFSKVVNSVDVMMGVHGAGLTNCVFLPHNATLIQIVPWGGI 383

Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYL 369
           + +  + F  PA  +GL Y +Y I   ESSL ++Y  +  + KNP AF          ++
Sbjct: 384 EGVCRIDFGDPAEQMGLRYKQYSIAVHESSLTDQYPLDHEIFKNPLAFHKGFEFIKETFM 443

Query: 370 KTQNVKLDIDRFRIYL 385
             QNV+LD +RF+  L
Sbjct: 444 DKQNVRLDCNRFKPVL 459


>gi|357147066|ref|XP_003574209.1| PREDICTED: uncharacterized protein LOC100835033 [Brachypodium
           distachyon]
          Length = 501

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 120/338 (35%), Positives = 200/338 (59%), Gaps = 13/338 (3%)

Query: 59  KTQPYPRKSDKSAMSKVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEF 114
           KT+PY R  D  AM+ V+E T+    ++S+PP  +C   H+ P  +FS GG++GN +H++
Sbjct: 154 KTKPYARYHDPVAMAHVREFTLKPFSSSSSPPPPACTKNHSVPGFLFSNGGFSGNLYHDY 213

Query: 115 MDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQ 174
            D  +PLF+T  S     +V   ++     W  K+  L  + T + +++++N    HCF 
Sbjct: 214 TDVLIPLFLTTRSF--RGEVRFLLSGLKPWWVTKFTPLFRQLTNYDVLDVDNDGEIHCFP 271

Query: 175 SVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVN 231
            + +G   H  M ++P+  P   ++VDF+  L  A++    + + + +    KP+L++++
Sbjct: 272 RIVVGSTFHKDMGVDPSKSPGGVSVVDFKRTLRAAFDLPRASASRAGARGDGKPRLLIIS 331

Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
           R +   R  LN +E+  A   +GF V I EP++ T +A   R ++S   MVGVHGAGLT+
Sbjct: 332 RKS--SRRFLNEKEMAAAGAAMGFQVRIAEPDQHTDMATFARLVNSADVMVGVHGAGLTN 389

Query: 292 SLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
            +FL  G+VL+QVVP G  +WL+ V F++PA  + + Y++Y ++ EESSL+++Y  +  V
Sbjct: 390 MVFLPAGAVLVQVVPFGGLEWLTRVTFKEPAADMEVRYMDYNVQLEESSLLDQYPRSHQV 449

Query: 351 LKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKD 387
           L +P A     W  ++  YL  QNV+LD+DRFR  L+D
Sbjct: 450 LADPYAVHKQGWDALKTAYLDKQNVRLDLDRFRATLRD 487


>gi|326501348|dbj|BAJ98905.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 548

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 135/396 (34%), Positives = 203/396 (51%), Gaps = 22/396 (5%)

Query: 9   SPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSD 68
           +P   ++C+ S    ++C++ G      K  T + V  +D +    +  +  +PYPRK +
Sbjct: 156 APRSKLSCNFSFYRMNICVMEGDVRMHGKAGTVYVVSASDDSYRPENGTVTIRPYPRKWE 215

Query: 69  KSAMSKVKELTITTSAPPNLS-----CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
           K  M   +E+TI +S P         C  TH  PA+VFS GGY+ NFFH   D  +PL+ 
Sbjct: 216 KPTMQMAREVTIRSSGPGATDMAPPPCTATHDVPAVVFSTGGYSSNFFHAVTDIVIPLYN 275

Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
           T   +  D  V L + D + +W  KY  +L   + +P I+ +      CF  V +G+ SH
Sbjct: 276 TAREY--DGRVQLVVTDYSRKWIAKYRHVLAALSDYPAIDFDADDTVRCFPKVHVGIESH 333

Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTI 240
             + I P L  K  TL+DF+ FL +AY+     S+  + T   +P+LV++ R  R  R  
Sbjct: 334 KELGIIPVLSHKDYTLMDFRDFLRSAYSLKRAWSTPVNRTSGGRPRLVMLLR--RHSRAF 391

Query: 241 LNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSV 300
            N  E   AA E+GF+V    PE    +A     ++SC  MVGVHGAGLT+ +FL     
Sbjct: 392 TNEAEAVAAAAEVGFEVVAAGPEAVRDMAQFAEVVNSCDVMVGVHGAGLTNMVFLPHNGT 451

Query: 301 LMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP----- 354
            MQ++P G  +W     F +    +GL Y+EY+   EE++L + Y  +  V  NP     
Sbjct: 452 AMQIIPWGEMKWACWSIFGETVPDMGLRYVEYEATAEETTLKDVYPRDHPVFTNPISIHK 511

Query: 355 QAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
           Q F G  W   +++L  QNV LDI+RFR  ++  YK
Sbjct: 512 QGF-GQLW---KIFLDGQNVTLDINRFRGVMQQIYK 543


>gi|222619895|gb|EEE56027.1| hypothetical protein OsJ_04811 [Oryza sativa Japonica Group]
          Length = 471

 Score =  218 bits (554), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 133/403 (33%), Positives = 199/403 (49%), Gaps = 39/403 (9%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQP-----SLRIK 59
           P +     +I CDRS  + D+C + G    DP + +          P QP     ++  +
Sbjct: 89  PCSSLAGDTICCDRSDFNTDVCFMAGDVRTDPSSLSLLLF------PKQPPAANATVEER 142

Query: 60  TQPYPRKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
            +PY RK                   P   C V H +P LV +AGGY GN FH F D FV
Sbjct: 143 IRPYTRKRS-----------------PGHRCDVRHDAPLLVMTAGGYTGNLFHAFSDGFV 185

Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLG 179
           P ++T+  H   + V+L +   N  WA  Y E++   + + ++++ +   THCF    +G
Sbjct: 186 PAWLTVQ-HL-RRRVVLGVLSYNPWWAGTYGEIISGLSDYHVVDLLHDKRTHCFPGAIVG 243

Query: 180 LISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT-----KPKLVLVNRNA 234
              HG + ++P  L   KT+VDF   LA  Y    +T      T     +P+L +V+R  
Sbjct: 244 TRFHGILSVDPARLRDNKTIVDFHDLLAGVYETAGDTVVVDDVTQPAPRRPRLGIVSR-- 301

Query: 235 RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
           R  R I N   V + A  +GFDV I E      L  S+  + +C  +VGVHGA LT  LF
Sbjct: 302 RGTRVIENQAAVARLARSVGFDVDILETANGLPLPASYASVSACDVLVGVHGADLTKLLF 361

Query: 295 LRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
           LRPG+ L+Q+ P+G   ++   + + +  +GL Y +Y  +  ESSL  KYG  D+V+ +P
Sbjct: 362 LRPGAALVQIAPLGVAPIARGCYAEASARMGLHYEQYDAEGHESSLSRKYGLRDVVVSDP 421

Query: 355 QAFA-GANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
           +A      W  + RVYL  QNV LD+ RFR  L   + +A ++
Sbjct: 422 EAAKRDKGWGFVARVYLGGQNVTLDLSRFRHTLTRLHARALRV 464


>gi|218187403|gb|EEC69830.1| hypothetical protein OsI_00151 [Oryza sativa Indica Group]
          Length = 562

 Score =  218 bits (554), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 130/385 (33%), Positives = 201/385 (52%), Gaps = 22/385 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S    D+C  +G       TS F  V    +      +R    PYPRK D + M +V
Sbjct: 181 CDFSDFRTDICDFSGDVRMAANTSEFVVVVDPAAAAVSHKVR----PYPRKGDATCMGRV 236

Query: 76  KELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            E+T+  +    ++  C   H  PA+ FS GGY GN FH+F D  VPL+ T+  +     
Sbjct: 237 TEITVRATGDAAVAPRCTRAHDVPAVAFSIGGYTGNIFHDFSDVIVPLYNTVQRY--RGG 294

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTIT---HCFQSVTLGLISHGRMVINP 190
           V L +A+    W  KY +LL   +RH  I++         HCF S  + L +H  ++I  
Sbjct: 295 VQLVMANVASWWLVKYDKLLRELSRHAPIDLAKAGAAGDVHCFPSAVVSLRAHRELIIER 354

Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRT--ILNLRE 245
                     DF  FL  A +   +  +        +P+L++++R    GRT  +LNL  
Sbjct: 355 ERSLDGLATPDFTRFLRRALSLPRDAPTRLGDGTGRRPRLLVISR----GRTRLLLNLDA 410

Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           V +AAEE+GF+V + E +   S+    + I+SC A++GVHGAGLT+ +FL PG+ ++QVV
Sbjct: 411 VVRAAEEVGFEVVVNESDVGNSIEQVGKLINSCDALLGVHGAGLTNMMFLPPGATMVQVV 470

Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
           P G  QW++ + + +PA  +GL Y++Y+I   ESSL +KY A D +  NP       ++ 
Sbjct: 471 PWGGLQWIARMDYGEPAAAMGLNYIQYEIAVAESSLKDKYPAGDEIFTNPTGLHKRGFAF 530

Query: 365 MR-VYLKTQNVKLDIDRFRIYLKDA 388
           M+   +  Q++ +D+ RFR  L+ A
Sbjct: 531 MKQTLMDGQDITIDVTRFRPVLQQA 555


>gi|297830470|ref|XP_002883117.1| hypothetical protein ARALYDRAFT_479312 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328957|gb|EFH59376.1| hypothetical protein ARALYDRAFT_479312 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 464

 Score =  217 bits (552), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 120/334 (35%), Positives = 199/334 (59%), Gaps = 17/334 (5%)

Query: 62  PYPRKSDKSAMSKVKELTITTSAPPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
           PY RK   +AM +V+E T+      +LS C   H+ PA++FS GG++ N FH+F D  +P
Sbjct: 132 PYARKDQGAAMKRVREWTVKLIQNASLSRCVKNHSVPAILFSLGGFSLNNFHDFTDIVIP 191

Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
           L+ T      D +V   + + N  W  K+  L+ +F+ + +I I+ +  THCF SV +GL
Sbjct: 192 LYTTARRF--DGEVQFLVTNKNPWWISKFKGLVRKFSNYEVIYIDEEDETHCFGSVIVGL 249

Query: 181 ISH----GRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARV 236
             H      + I+  L     ++ DF+ +L +AY+      S++   +P+L++++R+  +
Sbjct: 250 NRHRDYDKELTID--LSNSECSMSDFRKYLRDAYSLRNAAVSTWR--RPRLLILSRS--I 303

Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
            R  +N  E+ +AA ++GF+V + E   +T +A   + ++SC  M+GVHGAGLT+ +FL 
Sbjct: 304 SRAFVNADEIARAARQMGFNVVVVEA--NTGIASFAQTVNSCDVMLGVHGAGLTNMVFLP 361

Query: 297 PGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
             +V++Q++PIG  +WL+   FE P++ + L YLEYKI  EES+L ++YG +  V+++P 
Sbjct: 362 ENAVVIQILPIGGFEWLANTDFEDPSKGMNLRYLEYKIAAEESTLAKQYGRDHEVVRDPS 421

Query: 356 AFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
           A A   W   + VYL  QNV +DI+RF+  L  A
Sbjct: 422 AVAKRGWGTFKSVYLVQQNVTVDINRFKPVLVKA 455


>gi|56409844|emb|CAI30070.1| glycosyltransferase [Triticum aestivum]
          Length = 580

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 129/378 (34%), Positives = 200/378 (52%), Gaps = 13/378 (3%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S+   ++C + G     PK +   SV   +   SQ     K +PYPRK D+  +S +
Sbjct: 205 CDFSNFRANVCEMRGDVRVHPKAT---SVLFMEPEGSQRDEVWKIKPYPRKGDEFCLSHI 261

Query: 76  KELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVI 135
            ELT+ +S      C   H  P ++FS  GY GN FH+F D  VPLF T  + F D +V 
Sbjct: 262 TELTVKSSKVA-AECTRYHDVPVVIFSLTGYTGNLFHDFTDVIVPLFTTA-AQF-DGEVQ 318

Query: 136 LAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPK 195
             I D    W  KY  LL + +++P+I+ +      CF+   +G  ++    I+    P 
Sbjct: 319 FLITDMALWWTVKYHTLLQKLSKYPLIDFSKDDQVRCFKHAIVGTHAYMEFTIDAAKSPN 378

Query: 196 PKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAEE 252
             T+VDF  F+  AY+     +++   +   KP+L+++ R+    R  LNL E+   AEE
Sbjct: 379 GVTMVDFNRFMRAAYSLPKAAAAALGESPRVKPRLLIIKRHRT--RMFLNLEEIIGMAEE 436

Query: 253 LGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QW 311
           LGF+V I E   S+ +    + ++S   M+GVHGAGLT+ +FL   + L+Q+VP G   W
Sbjct: 437 LGFEVVIDEANVSSDINGFAKLVNSVDVMMGVHGAGLTNCVFLPQNATLIQIVPFGGLDW 496

Query: 312 LSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLK 370
           +S   F  P+ ++GL Y +Y I  +ESSL + Y  +  + K+P +F    +  + R ++ 
Sbjct: 497 ISRTDFGNPSEMMGLRYKQYAITVDESSLTDHYPRDHKIFKDPISFHKRGFEFIRRTFMD 556

Query: 371 TQNVKLDIDRFRIYLKDA 388
            QNV+LD  RFR  L +A
Sbjct: 557 KQNVRLDCKRFRPVLLEA 574


>gi|53791309|dbj|BAD52574.1| putative HGA6 [Oryza sativa Japonica Group]
 gi|53791443|dbj|BAD52495.1| putative HGA6 [Oryza sativa Japonica Group]
          Length = 700

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 134/397 (33%), Positives = 205/397 (51%), Gaps = 26/397 (6%)

Query: 16  CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDS-TPSQPSLRIKTQPYPRKSDKS 70
           CD  S+R  D C ++G         T   V   G  D  T    S  IK  PYPRK+D +
Sbjct: 302 CDLWSNRRIDWCELDGDVRVAGANGTVSLVAPPGPADERTFRAESWHIK--PYPRKADPN 359

Query: 71  AMSKVKELTITT-----SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
           AM  V+ LT+ +     ++    +C   H  P LVFS  GY GN+FH + D  +PLF+T 
Sbjct: 360 AMRHVRVLTVQSLPAPAASAAAPACTERHDVPGLVFSDRGYTGNYFHAYTDVILPLFLTA 419

Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR 185
             +    +V L ++D    W  K+  +    + + +IN+++    HCF+ V +GL  H  
Sbjct: 420 RQY--SGEVKLLVSDFQMWWLGKFLPVFKAVSNYDLINLDDDRRVHCFRHVQVGLTCHAD 477

Query: 186 MVINPTLLPKPKTLVDFQSFLANAYN-------ENTNTSSSFHHTKPKLVLVNRNARVGR 238
             I+P+  P   ++VDF  F+   Y          +         +P+L+++ R AR  R
Sbjct: 478 FSIDPSRAPNGYSMVDFTRFMRATYRLPRDAPFPASGEQQPRRPWRPRLLVIAR-ART-R 535

Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
             +N  E+ + AE  GF+V + E E    +A      ++C AMVGVHGAGLT+ +FL  G
Sbjct: 536 RFVNADEIVRGAERAGFEVVVSEGEHE--VAPFAELANTCDAMVGVHGAGLTNMVFLPTG 593

Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFA 358
            V++QVVP+G       YF  P+R +GL YLEY+I  EES+L+++Y  +  +  +P    
Sbjct: 594 GVVIQVVPLGGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLIDQYPRDHPIFTDPDGVK 653

Query: 359 GANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
              W++++  YL  Q+V+LD+ RFR  LK A    +K
Sbjct: 654 SKGWNSLKEAYLDKQDVRLDMKRFRPILKKAIAHLRK 690


>gi|414876595|tpg|DAA53726.1| TPA: hypothetical protein ZEAMMB73_648118 [Zea mays]
          Length = 624

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 128/380 (33%), Positives = 198/380 (52%), Gaps = 15/380 (3%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S    D+C + G    D   S F  V    ++ S      K +PYPRK D++ M ++
Sbjct: 241 CDFSDFRSDICDLAGDVRMDANVSAFVVVVDPATSGSDQEEH-KVRPYPRKGDETCMGRI 299

Query: 76  KELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            E+T+  T  A     C  TH +PA+VFS GGY GN FH+F D  VPL+ T   +    D
Sbjct: 300 TEVTVRATGDAAGAPRCTETHGAPAVVFSIGGYTGNIFHDFSDVLVPLYNTARRYR--GD 357

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTIT---HCFQSVTLGLISHGRMVINP 190
           V L +A+    W  KY  LL   +RH  +++         HCF    + L +H  ++I  
Sbjct: 358 VQLVMANAAPWWLVKYDRLLRALSRHAPLDLARAGAAREVHCFPRAVVSLRAHKELIIER 417

Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVK 247
                     DF  FL  A +   +  +        KP+L++++R+    R +LNL  V 
Sbjct: 418 DRSLDGLATPDFTRFLRRALSLPRDAPTRLGDGTGRKPRLLVISRHRT--RLLLNLDAVV 475

Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
           +AAEE+GF+  + E + +  ++     I+SC AMVGVHGAGLT+ +FL PG+ L+Q+VP 
Sbjct: 476 RAAEEVGFEAVVNESDVANDISQVGGLINSCDAMVGVHGAGLTNMMFLPPGAALVQIVPW 535

Query: 308 GT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
           G  QW++   +  PA  +GL+Y++Y+I   ES+L +K+ +   +  NP A     +  +R
Sbjct: 536 GGLQWMARADYGDPAEAMGLKYIQYEIGVAESTLKDKFPSGHKIFTNPTALHKKGFMFIR 595

Query: 367 -VYLKTQNVKLDIDRFRIYL 385
              +  Q++ +D+ RFR  L
Sbjct: 596 QTLMDGQDITVDVGRFREVL 615


>gi|218187405|gb|EEC69832.1| hypothetical protein OsI_00156 [Oryza sativa Indica Group]
          Length = 671

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 134/397 (33%), Positives = 205/397 (51%), Gaps = 26/397 (6%)

Query: 16  CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDS-TPSQPSLRIKTQPYPRKSDKS 70
           CD  S+R  D C ++G         T   V   G  D  T    S  IK  PYPRK+D +
Sbjct: 273 CDLWSNRRIDWCELDGDVRVAGANGTVSLVAPPGPADERTFRAESWHIK--PYPRKADPN 330

Query: 71  AMSKVKELTITT-----SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
           AM  V+ LT+ +     ++    +C   H  P LVFS  GY GN+FH + D  +PLF+T 
Sbjct: 331 AMRHVRVLTVQSLPAPAASAAAPACTERHDVPGLVFSDRGYTGNYFHAYTDVILPLFLTA 390

Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR 185
             +    +V L ++D    W  K+  +    + + +IN+++    HCF+ V +GL  H  
Sbjct: 391 RQY--SGEVKLLVSDFQMWWLGKFLPVFKAVSNYDLINLDDDRRVHCFRHVQVGLTCHAD 448

Query: 186 MVINPTLLPKPKTLVDFQSFLANAYN-------ENTNTSSSFHHTKPKLVLVNRNARVGR 238
             I+P+  P   ++VDF  F+   Y          +         +P+L+++ R AR  R
Sbjct: 449 FSIDPSRAPNGYSMVDFTRFMRATYRLPRDAPFPASGEQQPRRPWRPRLLVIAR-ART-R 506

Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
             +N  E+ + AE  GF+V + E E    +A      ++C AMVGVHGAGLT+ +FL  G
Sbjct: 507 RFVNADEIVRGAERAGFEVVVSEGEHE--VAPFAELANTCDAMVGVHGAGLTNMVFLPTG 564

Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFA 358
            V++QVVP+G       YF  P+R +GL YLEY+I  EES+L+++Y  +  +  +P    
Sbjct: 565 GVVIQVVPLGGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLIDQYPRDHPIFTDPDGVK 624

Query: 359 GANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
              W++++  YL  Q+V+LD+ RFR  LK A    +K
Sbjct: 625 SKGWNSLKEAYLDKQDVRLDMKRFRPILKKAIAHLRK 661


>gi|297595988|ref|NP_001041864.2| Os01g0119100 [Oryza sativa Japonica Group]
 gi|255672812|dbj|BAF03778.2| Os01g0119100, partial [Oryza sativa Japonica Group]
          Length = 570

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 134/397 (33%), Positives = 205/397 (51%), Gaps = 26/397 (6%)

Query: 16  CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDS-TPSQPSLRIKTQPYPRKSDKS 70
           CD  S+R  D C ++G         T   V   G  D  T    S  IK  PYPRK+D +
Sbjct: 172 CDLWSNRRIDWCELDGDVRVAGANGTVSLVAPPGPADERTFRAESWHIK--PYPRKADPN 229

Query: 71  AMSKVKELTITT-----SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
           AM  V+ LT+ +     ++    +C   H  P LVFS  GY GN+FH + D  +PLF+T 
Sbjct: 230 AMRHVRVLTVQSLPAPAASAAAPACTERHDVPGLVFSDRGYTGNYFHAYTDVILPLFLTA 289

Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR 185
             +    +V L ++D    W  K+  +    + + +IN+++    HCF+ V +GL  H  
Sbjct: 290 RQY--SGEVKLLVSDFQMWWLGKFLPVFKAVSNYDLINLDDDRRVHCFRHVQVGLTCHAD 347

Query: 186 MVINPTLLPKPKTLVDFQSFLANAYN-------ENTNTSSSFHHTKPKLVLVNRNARVGR 238
             I+P+  P   ++VDF  F+   Y          +         +P+L+++ R AR  R
Sbjct: 348 FSIDPSRAPNGYSMVDFTRFMRATYRLPRDAPFPASGEQQPRRPWRPRLLVIAR-ART-R 405

Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
             +N  E+ + AE  GF+V + E E    +A      ++C AMVGVHGAGLT+ +FL  G
Sbjct: 406 RFVNADEIVRGAERAGFEVVVSEGEHE--VAPFAELANTCDAMVGVHGAGLTNMVFLPTG 463

Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFA 358
            V++QVVP+G       YF  P+R +GL YLEY+I  EES+L+++Y  +  +  +P    
Sbjct: 464 GVVIQVVPLGGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLIDQYPRDHPIFTDPDGVK 523

Query: 359 GANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
              W++++  YL  Q+V+LD+ RFR  LK A    +K
Sbjct: 524 SKGWNSLKEAYLDKQDVRLDMKRFRPILKKAIAHLRK 560


>gi|326495222|dbj|BAJ85707.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 369

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 128/365 (35%), Positives = 195/365 (53%), Gaps = 16/365 (4%)

Query: 24  DLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITTS 83
           ++C + G+    P  S   SV + +   S+ +   K +PYPRK D+  +SKV ELT+ +S
Sbjct: 3   NVCEMRGNIRIHPNAS---SVMYMEPASSKRNELWKLKPYPRKGDELCLSKVTELTVKSS 59

Query: 84  --APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADC 141
             AP    C   H  P +VF+  GY GN FH+F D  VPLF T  S F + +V   I D 
Sbjct: 60  KVAP---ECTKYHNVPVVVFALTGYTGNLFHDFTDALVPLFTTA-SEF-NGEVQFLITDM 114

Query: 142 NDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVD 201
              W RKY  +  + +++P+I+ N  T  HC +   +GL ++    I+P+  P   T+VD
Sbjct: 115 AIWWTRKYHVVFKKLSKYPLIDFNKDTDVHCVKHAIVGLHAYMEFTIDPSKAPHNYTMVD 174

Query: 202 FQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVT 258
           F  F+   Y    E  +        KP+L++++R     R  LNL E+   AE LGF+V 
Sbjct: 175 FNRFMRRTYELPREAVSALGEIPKAKPRLLIISRQRT--RMFLNLPEIIAMAEGLGFEVV 232

Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI-GTQWLSTVYF 317
           + E   S+ L+   + ++S   M+GVHGAGLT+ +FL   + L+Q+VP  G + +  + F
Sbjct: 233 VEEANVSSDLSQFSKVVNSVDVMMGVHGAGLTNCVFLPHNATLIQIVPWGGIEGVCRIDF 292

Query: 318 EKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLD 377
             PA  +GL Y +Y I   ESSL ++Y  +  + KNP AF          ++  QNV+LD
Sbjct: 293 GDPAEQMGLRYKQYSIAVHESSLTDQYPLDHEIFKNPLAFHKGFEFIKETFMDKQNVRLD 352

Query: 378 IDRFR 382
            +RF+
Sbjct: 353 CNRFK 357


>gi|413947274|gb|AFW79923.1| hypothetical protein ZEAMMB73_646561 [Zea mays]
          Length = 592

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 133/390 (34%), Positives = 207/390 (53%), Gaps = 21/390 (5%)

Query: 16  CD-RSHRSYDLCLINGSALF----DPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
           CD  S+R  D C + G        +   +   + G  + +  + S  IK  PYPRK+D +
Sbjct: 203 CDLTSNRRIDWCELEGDVRVVGGSNGSVTLVAAPGADERSFHEESWSIK--PYPRKADPN 260

Query: 71  AMSKVKELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
           AM  V+ L + +  +APP ++C   H  PALVFSA GY GN+FH + D  +PLF+T   +
Sbjct: 261 AMRSVRALAVRSVATAPP-VACTDWHGVPALVFSARGYTGNYFHAYTDVILPLFLTARQY 319

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTI--THCFQSVTLGLISHGRM 186
               +V+L +      W  KY  +    + +  ++++        CF+ V +GL SH   
Sbjct: 320 A--GEVLLLVTGFQAWWVGKYLPVFRSLSNYEPVDLDRDRDPRVRCFRRVQVGLTSHDDF 377

Query: 187 VINPTLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE 245
            I+P   P   +++DF  F+   Y                  +LV   AR  R  LN  +
Sbjct: 378 SIDPRRAPNGYSMLDFTRFMRATYGLPRDAVPRRGRGRPRPRLLVIARART-RRFLNAED 436

Query: 246 VKKAAEELGFDVTIFE-PEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
           + + A ++GF+V + E  +E  + A+    +++C A+VGVHGAGLT+ +FL PG V++QV
Sbjct: 437 IVRGARKVGFEVVVSEVAQEVAAFAE---LVNTCDAVVGVHGAGLTNMVFLPPGGVVIQV 493

Query: 305 VPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW-S 363
           +P+G       YF  PAR +GL YLEY+I  EES+L+++Y  +  VL +P +    +W S
Sbjct: 494 LPLGPLEFVASYFRGPARDMGLSYLEYRISPEESTLLDQYPRDHPVLTDPMSVQAKDWVS 553

Query: 364 NMRVYLKTQNVKLDIDRFRIYLKDAYKKAK 393
            M VYL  Q+V+LD+ RFR  LK A  + +
Sbjct: 554 FMGVYLFKQDVRLDMKRFRPVLKKALARLR 583


>gi|297830468|ref|XP_002883116.1| hypothetical protein ARALYDRAFT_341951 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328956|gb|EFH59375.1| hypothetical protein ARALYDRAFT_341951 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 538

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 133/380 (35%), Positives = 212/380 (55%), Gaps = 23/380 (6%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFS-VGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
           C +  R+ + C ++G      K++T  + +    S  S   +R    PY RK D  AM++
Sbjct: 159 CTKLART-EFCELSGDVRIHGKSATVLAAITFAFSGNSTWHMR----PYARKGDLVAMNR 213

Query: 75  VKELTITTS------APPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
           V++ T+            N S C   H+ PA++FS GGY  N FH+F D  +PL+ T   
Sbjct: 214 VRKWTVKLEQNADQLENANFSRCVRNHSVPAMIFSLGGYTMNNFHDFTDVVIPLYTTARR 273

Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
              + +V   + + N  W  K+ EL+ + + + +I I+ +  THCF SVT+GLI H    
Sbjct: 274 F--NGEVQFLVTNRNPWWINKFKELVKKLSNYEVIYIDEEDETHCFSSVTVGLIRHREYF 331

Query: 188 INPTLLPKPK--TLVDFQSFLANAYN-ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLR 244
              T+ P     ++ DF+SFL + Y+  N    +S    +   +L+   +R  R   N  
Sbjct: 332 TELTIDPSNSEYSMSDFRSFLRDTYSLRNAAVITSQIRRRRPRILILSRSR-SRAFENTG 390

Query: 245 EVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
           E+ +AA ++GF+V +   E +T +A+  + ++SC  M+GVHGAGLT+ +FL   +V++Q+
Sbjct: 391 EIARAARQIGFEVVV--AEANTGVANFAQTVNSCDVMLGVHGAGLTNMVFLPDNAVVIQI 448

Query: 305 VPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS 363
           +PIG  +WL+   FE+P+  + L YLEYKI  EESSLV+KYG +  V+++P A A   W 
Sbjct: 449 LPIGGFEWLAKTDFERPSEGMNLRYLEYKIAAEESSLVKKYGRDHEVVRDPSAVAKHGWD 508

Query: 364 NMR-VYLKTQNVKLDIDRFR 382
             + VYL  QNV +DI+RF+
Sbjct: 509 MFKSVYLVHQNVSIDINRFK 528


>gi|30684813|ref|NP_188445.2| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
 gi|27808590|gb|AAO24575.1| At3g18170 [Arabidopsis thaliana]
 gi|110736165|dbj|BAF00054.1| hypothetical protein [Arabidopsis thaliana]
 gi|332642538|gb|AEE76059.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
          Length = 384

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 130/384 (33%), Positives = 209/384 (54%), Gaps = 19/384 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           C +  R+ + C +NG      K++T  S   T +     +  I+  PY RK D  AM +V
Sbjct: 5   CTKLART-EFCELNGDVRVHGKSATV-SAAITFAFSGNSTWHIR--PYARKGDTVAMKRV 60

Query: 76  KELTITTS------APPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
           +E T+            N S C   H+ PA++FS GGY+ N FH+F D  +PL+ T    
Sbjct: 61  REWTVKLEQNADQLENANFSRCVRNHSVPAMIFSLGGYSMNNFHDFTDIVIPLYTTARRF 120

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
             + +V   + + +  W  K+ EL+ + + + +I I+ +  THCF SVT+GL  H     
Sbjct: 121 --NGEVQFLVTNKSPSWINKFKELVRKLSNYEVIYIDEEDETHCFSSVTVGLTRHREYFK 178

Query: 189 NPTLLPKPK--TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
             T+ P     ++ DF+SFL + Y+   +  ++    + +  ++       R  +N  E+
Sbjct: 179 ELTIDPSNSEYSMSDFRSFLRDTYSLRNDAVATRQIRRRRPRILILARGRSRAFVNTGEI 238

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            +AA ++GF V +   E +  +A   + ++SC  M+GVHGAGLT+ +FL   +V++QV+P
Sbjct: 239 ARAARQIGFKVVV--AEANIGIAKFAQTVNSCDVMLGVHGAGLTNMVFLPENAVVIQVLP 296

Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
           IG  +WL+   FEKP+  + L YLEYKI  EES+LV+KYG +  ++++P A A   W   
Sbjct: 297 IGGFEWLAKTDFEKPSEGMNLRYLEYKIAVEESTLVKKYGRDHEIVRDPSAVAKHGWEMF 356

Query: 366 R-VYLKTQNVKLDIDRFRIYLKDA 388
           + VYL  QNV +DI+RF+  L  A
Sbjct: 357 KSVYLVQQNVSIDINRFKPVLVKA 380


>gi|297595984|ref|NP_001041860.2| Os01g0118400 [Oryza sativa Japonica Group]
 gi|255672810|dbj|BAF03774.2| Os01g0118400 [Oryza sativa Japonica Group]
          Length = 393

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 131/385 (34%), Positives = 199/385 (51%), Gaps = 22/385 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S    D+C  +G       TS F  V    +         K +PYPRK D + M +V
Sbjct: 12  CDFSDFRTDICDFSGDVRMAANTSEFVVVVDPAAAAVSH----KVRPYPRKGDATCMGRV 67

Query: 76  KELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            E+T+  T  A     C   H  PA+ FS GGY GN FH+F D  VPL+ T+  +     
Sbjct: 68  TEITVRATGDAAGAPRCTRAHDVPAVAFSIGGYTGNIFHDFSDVIVPLYNTVQRY--RGG 125

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTI---THCFQSVTLGLISHGRMVINP 190
           V L + +    W  KY +LL   +RH  I++         HCF S  + L +H  ++I  
Sbjct: 126 VQLVMVNVASWWLVKYDKLLRELSRHAPIDLAKAGAAGEVHCFPSAVVSLRAHRELIIER 185

Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRT--ILNLRE 245
                     DF  FL  A +   +  +        +P+L++++R    GRT  +LNL  
Sbjct: 186 ERSLDGLATPDFTRFLRRALSLPRDAPTRLGDGTGRRPRLLVISR----GRTRLLLNLDA 241

Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           V +AAEE+GF+V + E +   S+    + I+SC A++GVHGAGLT+ +FL PG+ ++QVV
Sbjct: 242 VVRAAEEVGFEVVVNESDVGNSIEQVGKLINSCDALLGVHGAGLTNMMFLPPGATMVQVV 301

Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
           P G  QW++ + + +PA  +GL Y++Y+I   ESSL +KY A D +  NP       ++ 
Sbjct: 302 PWGGLQWIARMDYGEPAAAMGLNYIQYEIAVAESSLKDKYPAGDEIFTNPTGLHKRGFAF 361

Query: 365 MR-VYLKTQNVKLDIDRFRIYLKDA 388
           M+   +  Q++ +D+ RFR  L+ A
Sbjct: 362 MKQTLMDGQDITIDVTRFRPVLQQA 386


>gi|53791307|dbj|BAD52572.1| putative HGA6 [Oryza sativa Japonica Group]
 gi|53791441|dbj|BAD52493.1| putative HGA6 [Oryza sativa Japonica Group]
          Length = 522

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 131/385 (34%), Positives = 199/385 (51%), Gaps = 22/385 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S    D+C  +G       TS F  V    +      +R    PYPRK D + M +V
Sbjct: 141 CDFSDFRTDICDFSGDVRMAANTSEFVVVVDPAAAAVSHKVR----PYPRKGDATCMGRV 196

Query: 76  KELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            E+T+  T  A     C   H  PA+ FS GGY GN FH+F D  VPL+ T+  +     
Sbjct: 197 TEITVRATGDAAGAPRCTRAHDVPAVAFSIGGYTGNIFHDFSDVIVPLYNTVQRY--RGG 254

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTI---THCFQSVTLGLISHGRMVINP 190
           V L + +    W  KY +LL   +RH  I++         HCF S  + L +H  ++I  
Sbjct: 255 VQLVMVNVASWWLVKYDKLLRELSRHAPIDLAKAGAAGEVHCFPSAVVSLRAHRELIIER 314

Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRT--ILNLRE 245
                     DF  FL  A +   +  +        +P+L++++R    GRT  +LNL  
Sbjct: 315 ERSLDGLATPDFTRFLRRALSLPRDAPTRLGDGTGRRPRLLVISR----GRTRLLLNLDA 370

Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           V +AAEE+GF+V + E +   S+    + I+SC A++GVHGAGLT+ +FL PG+ ++QVV
Sbjct: 371 VVRAAEEVGFEVVVNESDVGNSIEQVGKLINSCDALLGVHGAGLTNMMFLPPGATMVQVV 430

Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
           P G  QW++ + + +PA  +GL Y++Y+I   ESSL +KY A D +  NP       ++ 
Sbjct: 431 PWGGLQWIARMDYGEPAAAMGLNYIQYEIAVAESSLKDKYPAGDEIFTNPTGLHKRGFAF 490

Query: 365 MR-VYLKTQNVKLDIDRFRIYLKDA 388
           M+   +  Q++ +D+ RFR  L+ A
Sbjct: 491 MKQTLMDGQDITIDVTRFRPVLQQA 515


>gi|9294073|dbj|BAB02030.1| unnamed protein product [Arabidopsis thaliana]
          Length = 535

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 130/384 (33%), Positives = 209/384 (54%), Gaps = 19/384 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           C +  R+ + C +NG      K++T  S   T +     +  I+  PY RK D  AM +V
Sbjct: 156 CTKLART-EFCELNGDVRVHGKSATV-SAAITFAFSGNSTWHIR--PYARKGDTVAMKRV 211

Query: 76  KELTITTS------APPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
           +E T+            N S C   H+ PA++FS GGY+ N FH+F D  +PL+ T    
Sbjct: 212 REWTVKLEQNADQLENANFSRCVRNHSVPAMIFSLGGYSMNNFHDFTDIVIPLYTTARRF 271

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
             + +V   + + +  W  K+ EL+ + + + +I I+ +  THCF SVT+GL  H     
Sbjct: 272 --NGEVQFLVTNKSPSWINKFKELVRKLSNYEVIYIDEEDETHCFSSVTVGLTRHREYFK 329

Query: 189 NPTLLPKPK--TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
             T+ P     ++ DF+SFL + Y+   +  ++    + +  ++       R  +N  E+
Sbjct: 330 ELTIDPSNSEYSMSDFRSFLRDTYSLRNDAVATRQIRRRRPRILILARGRSRAFVNTGEI 389

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            +AA ++GF V +   E +  +A   + ++SC  M+GVHGAGLT+ +FL   +V++QV+P
Sbjct: 390 ARAARQIGFKVVV--AEANIGIAKFAQTVNSCDVMLGVHGAGLTNMVFLPENAVVIQVLP 447

Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
           IG  +WL+   FEKP+  + L YLEYKI  EES+LV+KYG +  ++++P A A   W   
Sbjct: 448 IGGFEWLAKTDFEKPSEGMNLRYLEYKIAVEESTLVKKYGRDHEIVRDPSAVAKHGWEMF 507

Query: 366 R-VYLKTQNVKLDIDRFRIYLKDA 388
           + VYL  QNV +DI+RF+  L  A
Sbjct: 508 KSVYLVQQNVSIDINRFKPVLVKA 531


>gi|300681531|emb|CBH32628.1| hga5 [Triticum aestivum]
          Length = 412

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 134/389 (34%), Positives = 206/389 (52%), Gaps = 28/389 (7%)

Query: 10  PAKSITCDRSHRSYD-------LCLINGSALFDPKTSTFFSVGHTDSTP-SQPSLRIKTQ 61
           P  SITCD   +  D       +C ++G     P +S+        S P  Q +   + +
Sbjct: 22  PKGSITCDDKSKDNDFPYARPVVCKMSGDVRIAPGSSSVIL-----SMPLYQSAEGRRVR 76

Query: 62  PYPRKSDKSAMSKVKELTITTSAPPNLS--CGVTHTS-PALVFSAGGYNGNFFHEFMDCF 118
           PY R  D S    V+E+ I T A  + +  C V H   PA+VFS GGY  NFFH+  D  
Sbjct: 77  PYARHDD-SLPPLVREVAIKTVANGSDAPECSVGHGDIPAVVFSVGGYTRNFFHDMSDVL 135

Query: 119 VPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTL 178
           +PL++T    F  + V   + D    W +KY  +L R +R+ I++ ++    HCF  V L
Sbjct: 136 IPLYLTA-FQFKGR-VQFFVTDYKQWWLKKYKPILRRLSRYDIVDFDSNNDVHCFHHVIL 193

Query: 179 GLISHGRMVI--NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRN 233
           GL+    +++  +PT  PK  ++V F  FL +AY    N           KP++++++R 
Sbjct: 194 GLVRDRDLILRRHPTRNPKGYSMVGFTRFLRHAYGLRRNRPFVLGENPGKKPRMLIISR- 252

Query: 234 ARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSL 293
            R  R +LNL  V+  A  LGFDVT+ E     S+      ++SC  +V VHG GLT+ +
Sbjct: 253 -RGTRRLLNLHRVEAMATALGFDVTVSE-AGGNSVKRFAETVNSCDVLVAVHGGGLTNQM 310

Query: 294 FLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLK 352
           FL   +V++Q+VP G  +W++T ++ +PAR +GL YLEY +  EESSL  +Y  +  V +
Sbjct: 311 FLPAKAVVVQIVPWGGMEWMATNFYGEPARGMGLRYLEYHVAGEESSLARRYPRDHAVFR 370

Query: 353 NPQAFAGANWSNMRVYLKTQNVKLDIDRF 381
           +P A     W  +   + TQ+V+LD+DRF
Sbjct: 371 DPMAIHAQGWKALAEVVMTQDVRLDLDRF 399


>gi|357131745|ref|XP_003567495.1| PREDICTED: uncharacterized protein LOC100826127 [Brachypodium
           distachyon]
          Length = 508

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 129/391 (32%), Positives = 205/391 (52%), Gaps = 26/391 (6%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           I CDRS  + D+C ++G       + +F       +   Q     + +PY RK ++  M+
Sbjct: 110 ICCDRSDFNTDVCFMSGDVRTHAASLSFLLFNANATVEEQ-----RIRPYTRKWERHLMA 164

Query: 74  KVKELTI---TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
            + E+ +   T S      C V H +PALV +AGGY GN FH F D F+P ++T +SH  
Sbjct: 165 SIHEVRLRAPTASESETSKCDVVHEAPALVMTAGGYTGNLFHAFNDGFLPAWLT-SSHL- 222

Query: 131 DQDVILAIADCNDQWARKYAELLPRFT-RHPIININNQTITHCF-QSVTLGLISHGRMVI 188
              V+LA+   N  WA  + EL+   + R  ++++ + T THCF     +G   HG + +
Sbjct: 223 RHGVVLAVLAYNPWWAGTFRELISELSGRRGVVDLVHDTRTHCFPAGAIVGSRFHGVLSV 282

Query: 189 NPTLLPKPKTLVDFQSFLANAYNENTNT--SSSFHHTKPKLVLVNRNARVGRTILNLREV 246
           +P      K+L+DF +FLA AY  +            +P+L ++ R     R I N   V
Sbjct: 283 DPARTRDHKSLLDFHTFLARAYEADNAALKQEEQQGRRPRLGILARKGN--RVIENQGAV 340

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG-------- 298
            + AE +GF+V+I E      L+ S+  + +C  ++GVHGA LT  LFLRP         
Sbjct: 341 ARLAESIGFEVSILETANGAPLSASYAAVSACDVLLGVHGADLTKLLFLRPSNNTNSNST 400

Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ-AF 357
           + ++Q+ P+G   ++   + +    +GL Y +Y +   ESSL  KY A+D+++ +P+ A 
Sbjct: 401 AAVLQIAPLGVGPIARGCYAEATVSMGLRYEQYDVVAGESSLRLKYAADDVIVADPETAK 460

Query: 358 AGANWSNM-RVYLKTQNVKLDIDRFRIYLKD 387
            GA W  + +VYL +QNV LD+DRF   L+D
Sbjct: 461 KGAGWELVAKVYLGSQNVTLDLDRFGDTLRD 491


>gi|31296712|gb|AAP46641.1| HGA1 [Hordeum vulgare]
          Length = 505

 Score =  214 bits (546), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 127/387 (32%), Positives = 208/387 (53%), Gaps = 31/387 (8%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + C       D C + G    +    T   V  T++     S   K QPY R+     MS
Sbjct: 117 VVCTTEGPFSDTCDVFGDVRTNGTAHTVTLVPATETE----SQEWKIQPYARR----GMS 168

Query: 74  KVKELTI-----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
            + E T+     T++A P  +C VTH  PA+VF+ GG  GN+FH+F D  VPLF+    +
Sbjct: 169 GISEFTVMQLDSTSAASPAPACTVTHRVPAIVFALGGLTGNYFHDFSDALVPLFVASRRY 228

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
               +V L  ++    W  KY  ++ + +++ ++++++     CF SVT+GL  H    I
Sbjct: 229 --GGEVQLLASNIQPWWLGKYEAVVRKLSKYDVVDLDHDDQIRCFPSVTVGLRMHKEFDI 286

Query: 189 NPTLLPK--PKTLVDFQSFLANAYN----------ENTNTSSSFHHTKPKLVLVNRNARV 236
            P L+P   P ++VDF +FL   Y+          ++ +        KP+L+L++R    
Sbjct: 287 TPELVPGGVPLSMVDFTAFLRETYSLPRAGPISLMKDISPPEDQEKRKPRLMLLHRGHY- 345

Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
            R ++N+ E+ KAAE+ GF+V I +P     + +  R ++S   ++GVHGAGLT+++FL 
Sbjct: 346 -RKLVNVPEIVKAAEKAGFEVAIADPRFDVRVEELARSVNSFDVLLGVHGAGLTNAVFLP 404

Query: 297 PGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
             +V++QVVP G  + ++ V F  P   +GL YLEY I  EES+L+E  G +  V+K+P+
Sbjct: 405 TAAVVIQVVPYGNLERMAKVDFGDPVADMGLRYLEYSITAEESTLLEMLGPDHPVIKDPE 464

Query: 356 AFAGANWSNM-RVYLKTQNVKLDIDRF 381
           +   + W  +   YL  Q+V++D+DRF
Sbjct: 465 SVHRSGWDKVAEYYLGKQDVRVDVDRF 491


>gi|300681543|emb|CBH32640.1| glycosyltransferase, HGA-like, putative,expressed [Triticum
           aestivum]
          Length = 526

 Score =  214 bits (546), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 127/387 (32%), Positives = 209/387 (54%), Gaps = 31/387 (8%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + C       D C + G    +    T   V  T++     S   K QPY R+     MS
Sbjct: 138 VVCTTEGPFSDTCDVFGDVRTNGTAHTVTLVPATETE----SREWKIQPYVRR----GMS 189

Query: 74  KVKELTIT-----TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
            + E+T+T     ++A P  +C VTH  PA+VF+ GG  GN+FH+F D  VPLF+    +
Sbjct: 190 GISEVTVTQLDSTSAASPAPACTVTHRVPAIVFALGGLTGNYFHDFSDALVPLFVASRRY 249

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
               +V L  ++    W  KY  ++ R +++ ++++++     CF SVT+GL  H    I
Sbjct: 250 --GGEVQLLASNIQPWWLGKYEAVVRRLSKYDVVDLDHDDQIRCFPSVTVGLRMHKEFDI 307

Query: 189 NPTLLP--KPKTLVDFQSFLANAYN----------ENTNTSSSFHHTKPKLVLVNRNARV 236
            P L+P   P ++VDF +FL   Y           ++ +        KP+L+L++R    
Sbjct: 308 VPELVPGGAPLSMVDFTAFLRETYTLPRAAPISLMKDISPPEDQEKRKPRLMLLHRGHY- 366

Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
            R  +N+ E+ KAAE+ GF+V+I +P     + +  R ++S   ++GVHGAGLT+++F+ 
Sbjct: 367 -RKFVNVPEIVKAAEKAGFEVSIADPRFDVRVEELARSVNSFDVLLGVHGAGLTNAVFMP 425

Query: 297 PGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
            G+V++QVVP G  + ++ V F  P   +GL YLEY I  EES+L+E  G +  V+K+P+
Sbjct: 426 TGAVVIQVVPYGNLEHMAKVDFGDPVADMGLRYLEYSITAEESTLLEMLGPDHPVIKDPE 485

Query: 356 AFAGANWSNM-RVYLKTQNVKLDIDRF 381
           +   + W  +   YL  Q+V++D++RF
Sbjct: 486 SVHRSGWDKVAEYYLGKQDVRVDVERF 512


>gi|300681530|emb|CBH32627.1| glycosyltransferase, HGA-like, putative,expressed [Triticum
           aestivum]
          Length = 629

 Score =  213 bits (543), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 129/385 (33%), Positives = 201/385 (52%), Gaps = 20/385 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S R  D+C   G    +  TS+F  V    +      +R    PYPRK D++ M +V
Sbjct: 246 CDFSDRRSDVCDFTGDIRMEANTSSFVVVVDAATAAQSHKVR----PYPRKGDQTCMGRV 301

Query: 76  KELTITTSAPPNL----SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
            E+T+ T++  +      C  TH+ PA+ FS GGY GN FH+F D  VPL+ T+  H   
Sbjct: 302 PEITVRTASSSSTPPPPQCTRTHSVPAVTFSIGGYTGNIFHDFSDVLVPLYNTV--HRYR 359

Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININ---NQTITHCFQSVTLGLISHGRMVI 188
            DV L +A+    W  KY +LL   +RH  +++     +  THCF+   + L +H  ++I
Sbjct: 360 GDVQLVMANVVPWWLVKYDKLLRELSRHAPLDLAVAAAKGETHCFRHAVVSLRAHRELII 419

Query: 189 NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLRE 245
                P      DF  F+  A +   +  +        KP+L+++ R+    R +LNL +
Sbjct: 420 ERDRSPDGLATPDFTRFIRRALSLPRDAPTRLADGMGRKPRLLIIARHRT--RILLNLGD 477

Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           + + AEE GF+  + E +   S++     I+S   ++GVHGAGLT+ +FL PG+ L+QVV
Sbjct: 478 MLRVAEEAGFEAAVSESDVGDSISRVGAEINSADVLLGVHGAGLTNMMFLAPGATLVQVV 537

Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
           P G  QW++ + +  PA  +GL Y++Y+I  EESSL + Y     +  +P +     +  
Sbjct: 538 PWGGLQWIARMDYGDPAEAMGLRYVQYEIGVEESSLKDTYPRGHKIFTDPTSLHKKGFGF 597

Query: 365 M-RVYLKTQNVKLDIDRFRIYLKDA 388
           M R  +  QN+ LD+ RFR  L  A
Sbjct: 598 MRRTLMDGQNITLDLGRFRGVLHQA 622


>gi|55792425|gb|AAV65333.1| HGA1 [Hordeum vulgare]
          Length = 505

 Score =  213 bits (543), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 126/387 (32%), Positives = 208/387 (53%), Gaps = 31/387 (8%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + C       D C + G    +    T   V  T++     S   K QPY R+     MS
Sbjct: 117 VVCTTEGPFSDTCDVFGDVRTNGTAHTVTLVPATETE----SQEWKIQPYARR----GMS 168

Query: 74  KVKELTI-----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
            + E T+     T++A P  +C VTH  PA+VF+ GG  GN+FH+F D  VPLF+    +
Sbjct: 169 GISEFTVMQLDSTSAASPAPACTVTHRVPAIVFALGGLTGNYFHDFSDALVPLFVASRRY 228

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
               +V L  ++    W  KY  ++ + +++ ++++++     CF +VT+GL  H    I
Sbjct: 229 --GGEVQLLASNIQPWWLGKYEAVVRKLSKYDVVDLDHDDQIRCFPNVTVGLRMHKEFDI 286

Query: 189 NPTLLPK--PKTLVDFQSFLANAYN----------ENTNTSSSFHHTKPKLVLVNRNARV 236
            P L+P   P ++VDF +FL   Y+          ++ +        KP+L+L++R    
Sbjct: 287 TPELVPGGVPLSMVDFTAFLRETYSLPRAAPISLMKDISPPEDQEKRKPRLMLLHRGHY- 345

Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
            R ++N+ E+ KAAE+ GF+V I +P     + +  R ++S   ++GVHGAGLT+++FL 
Sbjct: 346 -RKLVNVPEIVKAAEKAGFEVAIADPRFDVRVEELARSVNSFDVLLGVHGAGLTNAVFLP 404

Query: 297 PGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
             +V++QVVP G  + ++ V F  P   +GL YLEY I  EES+L+E  G +  V+K+P+
Sbjct: 405 TAAVVIQVVPYGNLERMAKVDFGDPVADMGLRYLEYSITAEESTLLEMLGPDHPVIKDPE 464

Query: 356 AFAGANWSNM-RVYLKTQNVKLDIDRF 381
           +   + W  +   YL  Q+V++D+DRF
Sbjct: 465 SVHRSGWDKVAEYYLGKQDVRVDVDRF 491


>gi|334353407|emb|CCA61105.1| xylan arabinosyl transferase [Triticum aestivum]
          Length = 506

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 122/383 (31%), Positives = 202/383 (52%), Gaps = 21/383 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPYPRKSDKSAMSK 74
           C  S R  D C   G              G + +   +P  R  K +PY RK D  A+S 
Sbjct: 125 CYESGRRSDTCEAAGDVRVQ---------GRSQTIQVRPLDREWKVKPYCRKQDAYALSH 175

Query: 75  VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
           VKE T+   +     C V  ++ A V S GG+ GN FH++ D  VP FIT  +H    +V
Sbjct: 176 VKEWTLRPLSGGGPHCTVNSSATAFVISTGGFTGNLFHDYTDVLVPAFIT--AHRFGGEV 233

Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLP 194
              ++     W  +Y E+  + +RH +I+I+N     C+ SV +G   H  + ++ +  P
Sbjct: 234 QFLVSSFKSWWTNRYLEIFQQLSRHEVIDIDNDDEVRCYGSVVVGPTFHKELGVDASKTP 293

Query: 195 KPKTLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLVNRNARVGRTILNLREVKKA 249
              ++VDF++ L  A+  +   +           +P+L++++R  +  R  LN R +   
Sbjct: 294 TGASMVDFRAMLRGAFGLSRAAAEPSGDRWDIRRRPRLLIISR--KNSRAFLNERAMADM 351

Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
           A  LG+DV + EP+ +T ++   R ++S   MVGVHGAGLT+ +FL  G+VL+QV+P G 
Sbjct: 352 AMSLGYDVRVGEPDTNTEVSRFARLVNSADVMVGVHGAGLTNMVFLPAGAVLIQVIPYGG 411

Query: 310 -QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-V 367
            +WL+   FE+P++ + L Y+ YKI+ +E++L E+Y  +  VL +P +     W  ++ V
Sbjct: 412 LEWLARGTFEEPSKDMQLHYIGYKIQLDETTLSEQYPKDHPVLTDPLSIHKQGWEALKTV 471

Query: 368 YLKTQNVKLDIDRFRIYLKDAYK 390
           YL+ QNV+  + R ++   +A K
Sbjct: 472 YLEKQNVRPHLGRLKLTFLEALK 494


>gi|413954174|gb|AFW86823.1| hypothetical protein ZEAMMB73_859783 [Zea mays]
          Length = 388

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 129/388 (33%), Positives = 203/388 (52%), Gaps = 21/388 (5%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           +TC+ S+R  D C + G         T + +  ++  P+  ++ I+  PY RK ++  M 
Sbjct: 4   VTCNWSNRHSDTCKMEGDLRIHGMAGTVYVLSSSNFRPNNSTITIR--PYTRKWEQETML 61

Query: 74  KVKELTITTSAPPNLS------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
           +++E++I ++AP   S      C V H  PA+VFS GG   NFFH   D  VPL+IT   
Sbjct: 62  RIREVSIRSTAPEPFSFVIPPKCTVRHDVPAVVFSTGGCGTNFFHAMTDLIVPLYITARE 121

Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
           H  +  V L +AD   +W  K+  +L   + +P+I+ +  T   CF S  +GL SH  + 
Sbjct: 122 H--NGHVQLLVADYQPEWVAKFRPILTALSIYPVIDFDADTAVRCFPSAHVGLESHRILG 179

Query: 188 INPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLR 244
           INP L     T++ F+ FL + ++     ++    +   KP+LV V R  R  R + N  
Sbjct: 180 INPALSRNSYTMMGFRDFLRDVFSLRRPWATPVSRSSGQKPRLVFVLR--RHSREVTNEV 237

Query: 245 EVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
           +   A   LGF+V +  PE+   +A     ++SC  MVGVHGAGLT+ +FL     ++Q+
Sbjct: 238 DAIAALAGLGFEVVVAGPEDVRDMAKIAGVVNSCDVMVGVHGAGLTNMVFLPHNGTIVQI 297

Query: 305 VPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF--AGAN 361
           +P G  ++     F  PA  +GL Y+EY+   EE++L  KY  +  V  +P +   +G  
Sbjct: 298 IPWGNLKYPCRFDFGDPAPDMGLRYVEYEANAEETTLKYKYPRDHPVFTDPISIERSGKL 357

Query: 362 WSNMRVYLKTQNVKLDIDRFRIYLKDAY 389
           W     +L+ QNV LDIDRFR  ++  Y
Sbjct: 358 WET---FLEGQNVTLDIDRFREAMQQVY 382


>gi|242095988|ref|XP_002438484.1| hypothetical protein SORBIDRAFT_10g020460 [Sorghum bicolor]
 gi|241916707|gb|EER89851.1| hypothetical protein SORBIDRAFT_10g020460 [Sorghum bicolor]
          Length = 495

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 128/386 (33%), Positives = 205/386 (53%), Gaps = 19/386 (4%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + C+ S+R  D C + G      K++T + +  +   P+  ++ I+  PY RK ++  M+
Sbjct: 115 LACNWSNRHSDTCRMEGDLRIHGKSATVYVLSSSTFNPNNSTITIR--PYTRKWEQETMA 172

Query: 74  KVKELTITTSAPPNLS------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
           +++E+TI ++AP   S      C V H  PA+VFS GG   NFFH   D  +PL+IT   
Sbjct: 173 RIREVTIRSTAPEPYSFVIPPKCTVRHDVPAVVFSTGGCGTNFFHAMTDLIIPLYITARE 232

Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
           +  +  V L +AD   ++  K+  +L   + +PII+ +  T   CF S  +GL SH  + 
Sbjct: 233 Y--NGHVQLLVADYQPEFVAKFRPILAALSIYPIIDFDADTAVRCFPSAHVGLESHRILG 290

Query: 188 INPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
           INP L     T++ F+ FL + ++     ++     KP+LV V R  R  R + N  +  
Sbjct: 291 INPGLSRNGYTMMGFRDFLRDVFSLPRPWATPVSR-KPRLVFVVR--RHSRAVTNEADAI 347

Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
            A  +LGF+V    PE+   +A     ++SC  MVGVHGAGLT+ +FL     ++Q++P 
Sbjct: 348 AAVADLGFEVVAAGPEDVGDMAKIAAVVNSCDVMVGVHGAGLTNMVFLPHNGTIVQIIPW 407

Query: 308 GT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF--AGANWSN 364
           G  ++     F  P   +GL Y+EY++  EE++L  KY  +  V  +P +   +G  W  
Sbjct: 408 GNLKYPCRFDFGDPVPDMGLHYVEYEVNAEETTLKYKYPRDHPVFTDPLSIERSGKLWE- 466

Query: 365 MRVYLKTQNVKLDIDRFRIYLKDAYK 390
              +L+ QNV LDIDRFR  ++  YK
Sbjct: 467 --TFLEGQNVTLDIDRFREAMQQVYK 490


>gi|357521533|ref|XP_003631055.1| Glycosyltransferase, putative [Medicago truncatula]
 gi|355525077|gb|AET05531.1| Glycosyltransferase, putative [Medicago truncatula]
          Length = 420

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 129/379 (34%), Positives = 206/379 (54%), Gaps = 25/379 (6%)

Query: 24  DLCLINGSALFDPKTSTFFSVGH-TDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITT 82
           + C   G      K+S+ + V   T+S     S  I+  PY RKSD   MS V + ++ T
Sbjct: 47  EYCQTQGDIRVHGKSSSVYIVSRKTNSLAENVSWIIR--PYARKSDAYTMSSVTKWSVKT 104

Query: 83  SAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIAD 140
             P +    C   H+ PA++FS  GY GN FHEF D  +PLF+T    F  Q V L I D
Sbjct: 105 VKPTHQVSQCTKYHSIPAVIFSTAGYTGNHFHEFSDIVIPLFLTCR-QFNGQ-VQLIITD 162

Query: 141 CNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS-HGRMVINPTLLPKPKTL 199
               W  K+   L + + + II+I+     HCF  V +GL   H  + I+P       ++
Sbjct: 163 KKSWWISKHQAFLKKLSNYEIIDIDRDDELHCFPKVIIGLKRYHKELSIDPQ--KYSYSI 220

Query: 200 VDFQSFLANAYNENTNTSSSF-----HHTKPKLVLVNRNARVGRTILNLREVKKAAEELG 254
            DF+ FL ++Y+    ++           KP+L++++R  +  R+  N  ++ K A+ LG
Sbjct: 221 KDFRDFLRSSYSLKRVSAIKIRDIGNQSKKPRLLILSR--KTSRSFTNTNQIAKMAKGLG 278

Query: 255 FDVTIFEPEEST-SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG---TQ 310
           F V + E   +  S+A+    ++SC  ++GVHGAGLT+ LFL   ++ +QVVP G    +
Sbjct: 279 FRVIVMEAGRNMRSIAN---VVNSCDVLMGVHGAGLTNILFLPQNAIFIQVVPFGGMQVE 335

Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
           WL+T  F +P+  + ++YLEYKI+ +ES+L+++Y  + +++K+P +     W   R VY 
Sbjct: 336 WLATNDFARPSEDMNIKYLEYKIRLDESTLIQQYPLDHMIIKDPSSIEKQGWEAFRSVYF 395

Query: 370 KTQNVKLDIDRFRIYLKDA 388
             QNV+LD++RFR  L+ A
Sbjct: 396 DKQNVRLDVNRFRPTLQKA 414


>gi|383100765|emb|CCG47996.1| glycosyltransferase, HGA-like,putative,expressed [Triticum
           aestivum]
          Length = 527

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 132/388 (34%), Positives = 210/388 (54%), Gaps = 41/388 (10%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           S TCD     +     NG+A     T T   V  T+S         K QPY R+     M
Sbjct: 148 SDTCD----VFGDVRTNGTA----HTVTLVPVTQTESR------EWKIQPYTRR----GM 189

Query: 73  SKVKELTIT----TSA-PPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
           S + E+T+T    TSA  P  +C VTH  P +VF+ GG  GN+FH+F D  VPLF+    
Sbjct: 190 SGISEVTVTQLDSTSADSPAPACTVTHRVPGIVFALGGLTGNYFHDFSDALVPLFVASRR 249

Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
           +    +V L  ++    W  KY  ++ R +++ ++++++     CF SVT+GL  H    
Sbjct: 250 Y--GGEVQLLASNIQPWWLGKYEAVVRRLSKYDVVDLDHDDQIRCFPSVTVGLRMHKEFD 307

Query: 188 INPTLLP--KPKTLVDFQSFLANAYN----------ENTNTSSSFHHTKPKLVLVNRNAR 235
           I P L+P   P ++VDF +FL   Y           ++ +        KP+L+L++R   
Sbjct: 308 IVPELVPGGAPLSMVDFTAFLRETYTLPRAAPISLMKDISPPEDQEKRKPRLMLLHRGHY 367

Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
             R  +N+ E+ KAAE+ GF+V+I +P     + +  R ++S   ++GVHGAGLT+++F+
Sbjct: 368 --RKFVNVPEIVKAAEKAGFEVSIADPRFDVKVEELARSVNSFDVLLGVHGAGLTNAVFM 425

Query: 296 RPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
             G+V++QVVP G  + ++ V F  P   +GL YLEY I  EES+L+E  G +  V+K+P
Sbjct: 426 PTGAVVIQVVPYGNLEHMAKVDFGDPVADMGLRYLEYSITAEESTLLEMLGPDHPVIKDP 485

Query: 355 QAFAGANWSNM-RVYLKTQNVKLDIDRF 381
           ++   + W  +   YL  Q+V++D++RF
Sbjct: 486 ESVHRSGWDKVAEYYLGKQDVRVDVERF 513


>gi|297734272|emb|CBI15519.3| unnamed protein product [Vitis vinifera]
          Length = 446

 Score =  212 bits (539), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 133/389 (34%), Positives = 204/389 (52%), Gaps = 40/389 (10%)

Query: 12  KSITCDRSHRSYDLCLINGSALFDPKTSTFF--SVGHTDSTPSQPSLRIKTQPYPRKSDK 69
           +++ C+   RS D C+I+G       +ST F  S    D  P   S  I+  PY RK D 
Sbjct: 85  EALVCNIEPRS-DFCVISGDVRVHGNSSTVFIASSAPVDILPENGSWSIR--PYARKGDA 141

Query: 70  SAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
            AM  +K  T+  TT       C   HT PA++FS GGY+GN FH F D  +PL++T  S
Sbjct: 142 RAMKHIKNFTVKMTTGRQHLPHCTQNHTVPAILFSLGGYSGNHFHAFSDVLIPLYLT--S 199

Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
              + +V   +   +  W  K+  LL   +R+PII+I+ +   HCF S  +GL       
Sbjct: 200 RQFNGEVQFLVTSKSLWWIAKFRILLQELSRYPIIDIDREEGIHCFSSAIIGL------- 252

Query: 188 INPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH----TKPKLVLVNRNARVGRTILNL 243
                          + FL ++Y+    T+          KP+L+++ R  +  R+  N 
Sbjct: 253 ---------------KEFLRSSYSLKRATAIKVRDGTDTKKPRLLIIAR--KKSRSFTND 295

Query: 244 REVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
            ++ + A  LG++V + EP   T ++     ++SC  ++GVHGAGLT+ +FL   +VL+Q
Sbjct: 296 GKIAEMARSLGYEVIVAEPN-GTEISRFAELVNSCDVLMGVHGAGLTNIVFLPENAVLIQ 354

Query: 304 VVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW 362
           VVP+G  +W++   F  PA  + + Y+EY+IK+EESSL+EKY     VL+ P +     W
Sbjct: 355 VVPLGGLEWVARYDFGLPAVDMKIRYIEYQIKEEESSLIEKYPHEHAVLREPHSITKLGW 414

Query: 363 SNMR-VYLKTQNVKLDIDRFRIYLKDAYK 390
             ++ VYL  QNVKLD++RFR  L  A +
Sbjct: 415 LELKAVYLDKQNVKLDLNRFRNTLLQALQ 443


>gi|356511269|ref|XP_003524349.1| PREDICTED: uncharacterized glycosyltransferase AGO61-like [Glycine
           max]
          Length = 462

 Score =  212 bits (539), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 134/391 (34%), Positives = 205/391 (52%), Gaps = 28/391 (7%)

Query: 12  KSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSA 71
           + + C    R+   C   G      K+ST + V  + +T S+ ++    +PY R+ D  A
Sbjct: 80  EELLCTSEERT-KFCQARGDIRVHGKSSTVYIVS-SKTTMSEKNMSWNLKPYARRDDVDA 137

Query: 72  MSKVKE-----LTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN 126
           M +V+E     + ++  AP    C   H  PA++FS GGY GN FHEF D  +PLF+T  
Sbjct: 138 MIRVREWSVKAVNVSQKAP---QCTQYHNIPAVLFSTGGYAGNHFHEFTDIVIPLFLTAR 194

Query: 127 SHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR- 185
               + +V   I D    W  K+  LL + + +  ++I+     HCF  VT+GL  + + 
Sbjct: 195 QF--NGEVQFIITDKRPWWISKHKPLLKKLSNYETMDIDGDDEVHCFPRVTVGLKRYQKE 252

Query: 186 MVINPTLLPKPKTLVDFQSFLANAYN----ENTNTSSSFHHTKPKLVLVNRNARVGRTIL 241
           + I P       ++ DF+  L ++Y     E   T       KP+L++++R  +  R   
Sbjct: 253 LSIEPQ--KYSYSMKDFRDLLRSSYALKRVEAIKTRDGL-RGKPRLMILSR--KRSRFFT 307

Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
           N  E+ K AE LGFDV I   E   S+      ++SC  ++GVHGAGLT+ LFL   +V 
Sbjct: 308 NTDEIAKMAESLGFDVII--KEAGWSMWGFANVVNSCDVLLGVHGAGLTNILFLPENAVF 365

Query: 302 MQVVPIG---TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFA 358
           +QVVP G     WL+T  F  P++ + ++YLEYKI  EES+L+++Y  + + +K+P    
Sbjct: 366 VQVVPYGGVTLDWLATNDFGNPSKDMNIKYLEYKISLEESTLIQQYPLDHMFIKDPPLIE 425

Query: 359 GANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
              W   + VYL  QNVKLD+DRFR  L+ A
Sbjct: 426 KIGWEEFKSVYLDKQNVKLDVDRFRPTLQKA 456


>gi|357521531|ref|XP_003631054.1| Glycosyltransferase, putative [Medicago truncatula]
 gi|355525076|gb|AET05530.1| Glycosyltransferase, putative [Medicago truncatula]
          Length = 530

 Score =  211 bits (538), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 129/379 (34%), Positives = 206/379 (54%), Gaps = 25/379 (6%)

Query: 24  DLCLINGSALFDPKTSTFFSVGH-TDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITT 82
           + C   G      K+S+ + V   T+S     S  I+  PY RKSD   MS V + ++ T
Sbjct: 157 EYCQTQGDIRVHGKSSSVYIVSRKTNSLAENVSWIIR--PYARKSDAYTMSSVTKWSVKT 214

Query: 83  SAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIAD 140
             P +    C   H+ PA++FS  GY GN FHEF D  +PLF+T    F  Q V L I D
Sbjct: 215 VKPTHQVSQCTKYHSIPAVIFSTAGYTGNHFHEFSDIVIPLFLTCR-QFNGQ-VQLIITD 272

Query: 141 CNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS-HGRMVINPTLLPKPKTL 199
               W  K+   L + + + II+I+     HCF  V +GL   H  + I+P       ++
Sbjct: 273 KKSWWISKHQAFLKKLSNYEIIDIDRDDELHCFPKVIIGLKRYHKELSIDPQ--KYSYSI 330

Query: 200 VDFQSFLANAYNENTNTSSSF-----HHTKPKLVLVNRNARVGRTILNLREVKKAAEELG 254
            DF+ FL ++Y+    ++           KP+L++++R  +  R+  N  ++ K A+ LG
Sbjct: 331 KDFRDFLRSSYSLKRVSAIKIRDIGNQSKKPRLLILSR--KTSRSFTNTNQIAKMAKGLG 388

Query: 255 FDVTIFEPEEST-SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG---TQ 310
           F V + E   +  S+A+    ++SC  ++GVHGAGLT+ LFL   ++ +QVVP G    +
Sbjct: 389 FRVIVMEAGRNMRSIAN---VVNSCDVLMGVHGAGLTNILFLPQNAIFIQVVPFGGMQVE 445

Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
           WL+T  F +P+  + ++YLEYKI+ +ES+L+++Y  + +++K+P +     W   R VY 
Sbjct: 446 WLATNDFARPSEDMNIKYLEYKIRLDESTLIQQYPLDHMIIKDPSSIEKQGWEAFRSVYF 505

Query: 370 KTQNVKLDIDRFRIYLKDA 388
             QNV+LD++RFR  L+ A
Sbjct: 506 DKQNVRLDVNRFRPTLQKA 524


>gi|56409866|emb|CAI30081.1| glycosyltransferase [Triticum aestivum]
          Length = 439

 Score =  211 bits (538), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 122/383 (31%), Positives = 203/383 (53%), Gaps = 21/383 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPYPRKSDKSAMSK 74
           C  S R  D C   G              G + +   +P  R  K +PY RK D  A+S 
Sbjct: 58  CYESGRRSDTCEAAGDVRVQ---------GRSQTIQVRPLDREWKVKPYCRKQDAYALSH 108

Query: 75  VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
           VKE T+   +     C V  ++ A V S GG+ GN FH++ D  VP FIT  +H    +V
Sbjct: 109 VKEWTLRPLSGGGPHCTVNSSATAFVISTGGFTGNPFHDYTDVLVPAFIT--AHRFGGEV 166

Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLP 194
              ++     W  +Y E+  + +R+ +I+I+N     C+ SV +G   H  + ++P+  P
Sbjct: 167 QFLVSSFKSWWTNRYLEIFQQLSRYEVIDIDNDDEVRCYGSVVVGPTFHKELGVDPSKTP 226

Query: 195 KPKTLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLVNRNARVGRTILNLREVKKA 249
              ++VDF++ L  A+  +   +           +P+L++++R  +  R  LN R +   
Sbjct: 227 TGASMVDFRAMLRGAFGLSRAAAEPSGDRWDIRRRPRLLIISR--KNSRAFLNERAMADM 284

Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
           A  LG+DV + EP+ +T ++   R ++S   MVGVHGAGLT+ +FL  G+VL+QV+P G 
Sbjct: 285 AMSLGYDVRVGEPDTNTEVSRFARLVNSADVMVGVHGAGLTNMVFLPAGAVLIQVIPYGG 344

Query: 310 -QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-V 367
            +WL+   FE+P++ + L Y+ YKI+ +E++L E+Y  +  VL +P +     W  ++ V
Sbjct: 345 LEWLARGTFEEPSKDMQLHYIGYKIQLDETTLSEQYPKDHPVLTDPLSIHKQGWEALKTV 404

Query: 368 YLKTQNVKLDIDRFRIYLKDAYK 390
           YL+ QNV+  + R ++   +A K
Sbjct: 405 YLEKQNVRPHLGRLKLTFLEALK 427


>gi|242056669|ref|XP_002457480.1| hypothetical protein SORBIDRAFT_03g008010 [Sorghum bicolor]
 gi|241929455|gb|EES02600.1| hypothetical protein SORBIDRAFT_03g008010 [Sorghum bicolor]
          Length = 558

 Score =  211 bits (538), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 144/403 (35%), Positives = 210/403 (52%), Gaps = 35/403 (8%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S    D+C   G    D   S F  V   D T    +   K +PY RK D ++MS+V
Sbjct: 158 CDLSGSRSDVCDFTGDIRLDANASAFIVV---DPTGDANAPTYKVRPYARKGDATSMSRV 214

Query: 76  KELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVI 135
            E+T+ T+      C  TH  PA+VFS GGY GN FH+F D  VPL+ T   +     V 
Sbjct: 215 TEVTVRTTTADAPRCTATHAEPAVVFSIGGYAGNLFHDFTDVIVPLYGTAQRY--GGVVR 272

Query: 136 LAIADCN---DQWARKYAELLPRFTRHPIININNQT---ITHCFQSVTLGL-ISHGRMVI 188
           L +AD      +W  KY  +L   +RHP +++        THCF  V +GL  +H  ++I
Sbjct: 273 LVVADAGAGPSRWLAKYDAVLRGLSRHPPLDLAATAPGEETHCFGHVVVGLRAAHRELMI 332

Query: 189 NP--TLLPKPKT----LVDFQSFLANAYN----------ENTNTSSSFHHTKPKLVLVNR 232
           +        P      +VDF  FL  A +           +   ++     KP+L++V R
Sbjct: 333 DERDERSSGPDAVGVGMVDFARFLRRALSLPRDAVTTRPSSDAVATGTKKPKPRLLIVAR 392

Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTS---LADSFRFIHSCHAMVGVHGAGL 289
             R  R +LN   V + AEE+GF+  + E E S S   +A+  R I+S  A+VGVHGAGL
Sbjct: 393 --RGTRRLLNADAVARVAEEVGFEAVVSELEVSKSDDGIAEVGRRINSFDAVVGVHGAGL 450

Query: 290 THSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
           T+ +FL  G+ ++QVVP G  QW++ + F  PA  +GL Y++Y+I   ESSL +KY ++ 
Sbjct: 451 TNMVFLPRGATVVQVVPWGGLQWIARMDFGDPAEAMGLRYVQYEIAVHESSLRDKYPSDH 510

Query: 349 LVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYK 390
            +  NP A     +  +R  +L  Q+V LD+DRFR+ L  A++
Sbjct: 511 EIFTNPTALHRKGFKFLRHTFLIGQDVTLDVDRFRVVLLQAFQ 553


>gi|449484981|ref|XP_004157036.1| PREDICTED: glycosyltransferase-like domain-containing protein
           2-like [Cucumis sativus]
          Length = 372

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 119/334 (35%), Positives = 179/334 (53%), Gaps = 7/334 (2%)

Query: 25  LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITTSA 84
           +CL N     +     F+   + DS  +  S  I   PY R+ DK  +  V  L I    
Sbjct: 35  VCLTNNPTRINNTNLEFYISTNNDSQQNNFS-PILIHPYARQEDKITLRDVTPLQIIFQP 93

Query: 85  PPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCN 142
              L   C   H  P L+FS GG+ GN FHEF +  +PLFIT + HF  + V   I D  
Sbjct: 94  NKTLLPLCQFIHNVPVLIFSTGGFTGNLFHEFDETIIPLFIT-SYHFQTR-VRFLITDHK 151

Query: 143 DQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDF 202
             W +KY  +L   +R  ++N+      HCF    +GL  H  + +N T +P   ++ DF
Sbjct: 152 TWWVQKYNRILSGLSRFNVVNLAEDGSVHCFNGGVIGLKFHNILSLNNTDIPGGYSMSDF 211

Query: 203 QSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEP 262
           +SFL   YN   N  S     KP ++L++R  +  R  +N  E+ +  +E+GF+V    P
Sbjct: 212 RSFLRQTYNLKVNNVSELSGKKPMVMLISR--QTSRRFMNEGEMVEMMKEVGFEVMTTTP 269

Query: 263 EESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPAR 322
           +  ++L      ++ C  ++G HGAGLT+ +FL  G+V++QVVP G  W ST +F KPA 
Sbjct: 270 QRMSNLDKFSSVVNLCSVIIGAHGAGLTNEVFLANGAVVVQVVPFGLDWPSTYFFGKPAA 329

Query: 323 VLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQA 356
            + L+YLEYKI+ +ESSL +KYG N  V+++P++
Sbjct: 330 EMELQYLEYKIEAKESSLWDKYGENHPVIRDPES 363


>gi|326527195|dbj|BAK04539.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 519

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 124/391 (31%), Positives = 205/391 (52%), Gaps = 24/391 (6%)

Query: 10  PAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPYPRKSD 68
           P K + C  S R  D C   G              G + +   +P  R  K +PY RK D
Sbjct: 131 PGKPV-CYESGRRSDTCEAAGDVRVQ---------GRSQTIQVRPLDREWKVKPYCRKQD 180

Query: 69  KSAMSKVKELTIT--TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN 126
             A+S VKE T+   +S+ P   C V  ++ A V S GG+ GN FH++ D  VP FIT  
Sbjct: 181 AYALSHVKEWTLRPLSSSGPVPHCTVNSSATAFVLSTGGFTGNLFHDYTDVLVPAFITAR 240

Query: 127 SHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRM 186
                 +V   ++     W  +Y E+  + +++ +I+I+N     C+  V +G   H  +
Sbjct: 241 RF--GGEVQFLVSSFKSWWTNRYLEIFQQLSKYEVIDIDNDDEVRCYGGVVVGPTFHKEL 298

Query: 187 VINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLVNRNARVGRTIL 241
            ++ +  P   ++VDF++ L  A+  +   +           +P+L++++R  +  R  L
Sbjct: 299 GVDASKTPAGYSMVDFRAMLRGAFGLSRAAAEPSGDRWDIRRRPRLLIISR--KNSRAFL 356

Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
           N R +   A  LG+DV + EP+ +T ++   R ++S   MVGVHGAGLT+ LFL  G+VL
Sbjct: 357 NERAMADMAMSLGYDVRVGEPDSNTDVSKFARLVNSADVMVGVHGAGLTNMLFLPAGAVL 416

Query: 302 MQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGA 360
           +QVVP G  +WL+   FE+P++ + L Y+ YKI+ +E++L E+Y  +  VL +P +    
Sbjct: 417 IQVVPYGGLEWLARGTFEEPSKDMQLHYIGYKIQLDETTLSEQYPKDHPVLTDPLSIHKQ 476

Query: 361 NWSNMR-VYLKTQNVKLDIDRFRIYLKDAYK 390
            W  ++ VYL+ QNV+  + R ++   +A K
Sbjct: 477 GWEALKTVYLEKQNVRPHLGRLKLTFMEALK 507


>gi|357521535|ref|XP_003631056.1| Glycosyltransferase, putative [Medicago truncatula]
 gi|355525078|gb|AET05532.1| Glycosyltransferase, putative [Medicago truncatula]
          Length = 568

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 129/379 (34%), Positives = 206/379 (54%), Gaps = 25/379 (6%)

Query: 24  DLCLINGSALFDPKTSTFFSVGH-TDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITT 82
           + C   G      K+S+ + V   T+S     S  I+  PY RKSD   MS V + ++ T
Sbjct: 195 EYCQTQGDIRVHGKSSSVYIVSRKTNSLAENVSWIIR--PYARKSDAYTMSSVTKWSVKT 252

Query: 83  SAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIAD 140
             P +    C   H+ PA++FS  GY GN FHEF D  +PLF+T    F  Q V L I D
Sbjct: 253 VKPTHQVSQCTKYHSIPAVIFSTAGYTGNHFHEFSDIVIPLFLTCR-QFNGQ-VQLIITD 310

Query: 141 CNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS-HGRMVINPTLLPKPKTL 199
               W  K+   L + + + II+I+     HCF  V +GL   H  + I+P       ++
Sbjct: 311 KKSWWISKHQAFLKKLSNYEIIDIDRDDELHCFPKVIIGLKRYHKELSIDPQ--KYSYSI 368

Query: 200 VDFQSFLANAYNENTNTSSSF-----HHTKPKLVLVNRNARVGRTILNLREVKKAAEELG 254
            DF+ FL ++Y+    ++           KP+L++++R  +  R+  N  ++ K A+ LG
Sbjct: 369 KDFRDFLRSSYSLKRVSAIKIRDIGNQSKKPRLLILSR--KTSRSFTNTNQIAKMAKGLG 426

Query: 255 FDVTIFEPEEST-SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG---TQ 310
           F V + E   +  S+A+    ++SC  ++GVHGAGLT+ LFL   ++ +QVVP G    +
Sbjct: 427 FRVIVMEAGRNMRSIAN---VVNSCDVLMGVHGAGLTNILFLPQNAIFIQVVPFGGMQVE 483

Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
           WL+T  F +P+  + ++YLEYKI+ +ES+L+++Y  + +++K+P +     W   R VY 
Sbjct: 484 WLATNDFARPSEDMNIKYLEYKIRLDESTLIQQYPLDHMIIKDPSSIEKQGWEAFRSVYF 543

Query: 370 KTQNVKLDIDRFRIYLKDA 388
             QNV+LD++RFR  L+ A
Sbjct: 544 DKQNVRLDVNRFRPTLQKA 562


>gi|357521529|ref|XP_003631053.1| Glycosyltransferase, putative [Medicago truncatula]
 gi|355525075|gb|AET05529.1| Glycosyltransferase, putative [Medicago truncatula]
          Length = 566

 Score =  211 bits (537), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 129/379 (34%), Positives = 206/379 (54%), Gaps = 25/379 (6%)

Query: 24  DLCLINGSALFDPKTSTFFSVGH-TDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITT 82
           + C   G      K+S+ + V   T+S     S  I+  PY RKSD   MS V + ++ T
Sbjct: 193 EYCQTQGDIRVHGKSSSVYIVSRKTNSLAENVSWIIR--PYARKSDAYTMSSVTKWSVKT 250

Query: 83  SAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIAD 140
             P +    C   H+ PA++FS  GY GN FHEF D  +PLF+T    F  Q V L I D
Sbjct: 251 VKPTHQVSQCTKYHSIPAVIFSTAGYTGNHFHEFSDIVIPLFLTCR-QFNGQ-VQLIITD 308

Query: 141 CNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS-HGRMVINPTLLPKPKTL 199
               W  K+   L + + + II+I+     HCF  V +GL   H  + I+P       ++
Sbjct: 309 KKSWWISKHQAFLKKLSNYEIIDIDRDDELHCFPKVIIGLKRYHKELSIDPQ--KYSYSI 366

Query: 200 VDFQSFLANAYNENTNTSSSF-----HHTKPKLVLVNRNARVGRTILNLREVKKAAEELG 254
            DF+ FL ++Y+    ++           KP+L++++R  +  R+  N  ++ K A+ LG
Sbjct: 367 KDFRDFLRSSYSLKRVSAIKIRDIGNQSKKPRLLILSR--KTSRSFTNTNQIAKMAKGLG 424

Query: 255 FDVTIFEPEEST-SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG---TQ 310
           F V + E   +  S+A+    ++SC  ++GVHGAGLT+ LFL   ++ +QVVP G    +
Sbjct: 425 FRVIVMEAGRNMRSIAN---VVNSCDVLMGVHGAGLTNILFLPQNAIFIQVVPFGGMQVE 481

Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
           WL+T  F +P+  + ++YLEYKI+ +ES+L+++Y  + +++K+P +     W   R VY 
Sbjct: 482 WLATNDFARPSEDMNIKYLEYKIRLDESTLIQQYPLDHMIIKDPSSIEKQGWEAFRSVYF 541

Query: 370 KTQNVKLDIDRFRIYLKDA 388
             QNV+LD++RFR  L+ A
Sbjct: 542 DKQNVRLDVNRFRPTLQKA 560


>gi|242094138|ref|XP_002437559.1| hypothetical protein SORBIDRAFT_10g029380 [Sorghum bicolor]
 gi|241915782|gb|EER88926.1| hypothetical protein SORBIDRAFT_10g029380 [Sorghum bicolor]
          Length = 462

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 122/343 (35%), Positives = 191/343 (55%), Gaps = 17/343 (4%)

Query: 59  KTQPYPRKSDKSAMSKVKELTITTSAP--PNLS-----CGVTHTSPALVFSAGGYNGNFF 111
           KT+PY R  D  AM  V+E T+    P  PN +     C   H+ P  +FS+GG+ GN +
Sbjct: 117 KTKPYARLHDPVAMDDVREFTLVPFGPGSPNGTVVPPLCTRNHSVPGFLFSSGGFAGNLY 176

Query: 112 HEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITH 171
           H++ D  VPLF + + HF   +V   +AD  D WA K+  L  + +R+ +I++NN    H
Sbjct: 177 HDYADVLVPLFASTH-HF-GGEVQFLLADIKDWWADKFKPLFRQLSRYDVIDVNNDREVH 234

Query: 172 CFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNEN----TNTSSSFHHTKPKL 227
           CF  + +G   H  M I+ +  P  +T+ DF+  L  A+       + + +     +P+L
Sbjct: 235 CFPRIVIGSTFHRAMGIDASRSPGGETVADFKRVLRRAFKLERAVASRSGAPRRKDRPRL 294

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           ++++R +   R  +N R + +AA    FDV I EP+  T + +  R ++S   M+GVHGA
Sbjct: 295 LIISRKS--SRRFVNERAMARAAAAAKFDVRIAEPDNHTDMPNFARLVNSADVMMGVHGA 352

Query: 288 GLTHSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGA 346
           GLT+ +FL   +VL+QVVP G  +WL+ V F+ PAR + + Y+EY +  EESSL + Y  
Sbjct: 353 GLTNMVFLPSRAVLVQVVPFGGLEWLTRVTFKDPARDMDVTYMEYNVSLEESSLRDLYPE 412

Query: 347 NDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
           +   LK+P       W  ++ VYL  QNV+L++ RF   L+ A
Sbjct: 413 DHFYLKHPYDVHKKGWDAIKTVYLDKQNVRLNLTRFTRTLEQA 455


>gi|212274959|ref|NP_001130300.1| uncharacterized protein LOC100191394 [Zea mays]
 gi|194688782|gb|ACF78475.1| unknown [Zea mays]
 gi|224030831|gb|ACN34491.1| unknown [Zea mays]
 gi|414876588|tpg|DAA53719.1| TPA: hypothetical protein ZEAMMB73_549999 [Zea mays]
          Length = 506

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 122/371 (32%), Positives = 200/371 (53%), Gaps = 31/371 (8%)

Query: 29  NGSALFDPKTSTFFSVGHTDSTPSQPSLR--IKTQPYPRKSDKSAMSKVKELTITTSAPP 86
           NGSAL    + TF         P+ PS R   K +PY R++    M  V  +T+T    P
Sbjct: 147 NGSAL----SVTF--------VPASPSERREWKVRPYSRRT----MVGVDRVTVTRLGSP 190

Query: 87  N---LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCND 143
           +     C V H+ P ++F+ GG  GN++H+F D  +PL+     +    +V+  +++   
Sbjct: 191 DDPAAPCAVAHSVPGVLFALGGLTGNYWHDFSDVLMPLYAASRRY--GGEVLFLVSNMQP 248

Query: 144 QWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL-LPKPKTLVDF 202
            W  KY  ++ R +R+  ++++      CF+ +T+GL  H  + + P L  P   T  DF
Sbjct: 249 WWLGKYEAVVRRLSRYDAVDLDRDARVRCFRRLTVGLRLHKELGVAPELTAPDRLTTADF 308

Query: 203 QSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTI 259
            +FL   Y       ++   T   KP+L+L++R     R  +N+ E+ +AAE  GF+V +
Sbjct: 309 TAFLRETYALQRGAPAAVPTTEGRKPRLMLIHRAHY--RRFVNVPEITRAAEAAGFEVAV 366

Query: 260 FEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QWLSTVYFE 318
             P     + ++ R ++SC A++GVHGAGLT+++FL PG+VL+QVVP G  + ++   F 
Sbjct: 367 ASPRGDAPVEETARAVNSCDALLGVHGAGLTNAVFLPPGAVLIQVVPYGRLERMARRDFG 426

Query: 319 KPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLD 377
            PA  +GL YLEY +  EES+L+E  G    V+++P +   + W  +   YL  Q+V++D
Sbjct: 427 DPAEDMGLRYLEYSVSAEESTLLEMLGPEHPVIRDPDSVHRSGWDKVAEYYLGKQDVRID 486

Query: 378 IDRFRIYLKDA 388
           +DRF   L  A
Sbjct: 487 VDRFAPTLAQA 497


>gi|162460449|ref|NP_001105165.1| glycosyltransferase [Zea mays]
 gi|56409864|emb|CAI30080.1| glycosyltransferase [Zea mays]
 gi|414879190|tpg|DAA56321.1| TPA: glycosyltransferase [Zea mays]
          Length = 492

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 123/391 (31%), Positives = 207/391 (52%), Gaps = 26/391 (6%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           C  + R  D C   G       T T +     DS   +     KT+PY RK D  A++ V
Sbjct: 112 CYETSRRSDTCEAAGDVRVVGSTQTVY----VDSLDRE----WKTKPYCRKHDNFALAHV 163

Query: 76  KELTIT---TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
           KE ++    + A P   C V  ++ A V S GG+ GN FH++ D  +P FIT  +H    
Sbjct: 164 KEWSLKPLPSGAAPQ--CTVNSSATAFVLSTGGFTGNPFHDYTDVLIPAFIT--AHPLRG 219

Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
           +V   ++     W  +Y ++  + +RH +++I+      C++SV +G   H  + ++ + 
Sbjct: 220 EVQFLVSSYKSWWMNRYIQIFQQMSRHEVVDIDADDEVRCYRSVVVGATFHRELGVDASK 279

Query: 193 LPKPKTLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLVNRNARVGRTILNLREVK 247
            P   +  DF+  L +A+     T++          +P+L++++R    GR  +N R + 
Sbjct: 280 SPPGYSTADFRKMLRDAFGLERATATPSGDRWDIRRRPRLLIISRRPSRGRAFMNERAMA 339

Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
             A  LGFDV I EP+ ST  +   R ++SC  MVGVHGAGLT+ +FL  G+V++QVVP 
Sbjct: 340 DMAASLGFDVRIGEPDTSTDTSKFARLVNSCDVMVGVHGAGLTNMVFLPAGAVVVQVVPY 399

Query: 308 GT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
           G  +WL+   F +P+  + + YLEY ++ +E++L E+Y ++  VL++P A     W+ ++
Sbjct: 400 GRLEWLARNTFAEPSAGMEVHYLEYVVQLDETTLSEQYPSDHPVLRDPMAIHKQGWNALK 459

Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
             YL  QNV+  + R    LK+ + +A K++
Sbjct: 460 TTYLDKQNVRPHLGR----LKNTFLQALKML 486


>gi|414879189|tpg|DAA56320.1| TPA: hypothetical protein ZEAMMB73_962821 [Zea mays]
          Length = 488

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 123/391 (31%), Positives = 207/391 (52%), Gaps = 26/391 (6%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           C  + R  D C   G       T T +     DS   +     KT+PY RK D  A++ V
Sbjct: 108 CYETSRRSDTCEAAGDVRVVGSTQTVY----VDSLDRE----WKTKPYCRKHDNFALAHV 159

Query: 76  KELTIT---TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
           KE ++    + A P   C V  ++ A V S GG+ GN FH++ D  +P FIT  +H    
Sbjct: 160 KEWSLKPLPSGAAPQ--CTVNSSATAFVLSTGGFTGNPFHDYTDVLIPAFIT--AHPLRG 215

Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
           +V   ++     W  +Y ++  + +RH +++I+      C++SV +G   H  + ++ + 
Sbjct: 216 EVQFLVSSYKSWWMNRYIQIFQQMSRHEVVDIDADDEVRCYRSVVVGATFHRELGVDASK 275

Query: 193 LPKPKTLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLVNRNARVGRTILNLREVK 247
            P   +  DF+  L +A+     T++          +P+L++++R    GR  +N R + 
Sbjct: 276 SPPGYSTADFRKMLRDAFGLERATATPSGDRWDIRRRPRLLIISRRPSRGRAFMNERAMA 335

Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
             A  LGFDV I EP+ ST  +   R ++SC  MVGVHGAGLT+ +FL  G+V++QVVP 
Sbjct: 336 DMAASLGFDVRIGEPDTSTDTSKFARLVNSCDVMVGVHGAGLTNMVFLPAGAVVVQVVPY 395

Query: 308 GT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
           G  +WL+   F +P+  + + YLEY ++ +E++L E+Y ++  VL++P A     W+ ++
Sbjct: 396 GRLEWLARNTFAEPSAGMEVHYLEYVVQLDETTLSEQYPSDHPVLRDPMAIHKQGWNALK 455

Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
             YL  QNV+  + R    LK+ + +A K++
Sbjct: 456 TTYLDKQNVRPHLGR----LKNTFLQALKML 482


>gi|356527872|ref|XP_003532530.1| PREDICTED: uncharacterized glycosyltransferase AGO61-like [Glycine
           max]
          Length = 462

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 130/392 (33%), Positives = 211/392 (53%), Gaps = 26/392 (6%)

Query: 12  KSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSA 71
           + + C    R+ + C   G      K+ST  S+  + +T  + ++    +PY R+ D  A
Sbjct: 80  EELLCTSEERT-EFCQARGDIRVHGKSSTV-SIVSSKTTMLEKTMSRSLKPYARRGDIDA 137

Query: 72  MSKVKELTI-----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN 126
           M++V+E ++     +  AP    C  +H   A++FS GGY+GN FHEF D  +PLF+T  
Sbjct: 138 MNRVREWSVKAVNASQKAP---QCTQSHNITAVLFSTGGYSGNHFHEFTDIVIPLFLTAR 194

Query: 127 SHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR- 185
               + +V   I D    W  K+  LL + + +  ++I+     HCF SVT+GL  + + 
Sbjct: 195 QF--NGEVQFIITDKRPWWISKHKPLLKKLSNYETMDIDGDDQVHCFPSVTVGLKRYQKE 252

Query: 186 MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH---HTKPKLVLVNRNARVGRTILN 242
           + I+P       ++ DF+  L ++Y      +         KP+L++++R  +  R+  N
Sbjct: 253 LSIDPQ--KYSYSMKDFRDLLRSSYALKRVEAMKIRDGLRGKPRLMILSR--KRSRSFTN 308

Query: 243 LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
             E+ K A  LGFDV +   E   S+      ++SC  ++GVHGAGLT+ LFL   +V +
Sbjct: 309 TDEIAKMAASLGFDVIV--KEAGWSMWGFANVVNSCDVLLGVHGAGLTNILFLPENAVFI 366

Query: 303 QVVPIG---TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
           QVVP G     WL+T  F KP++ + L+YLEYKI  +ES+L+++Y  + + +K+P     
Sbjct: 367 QVVPYGGFTLDWLATNDFGKPSKDMNLKYLEYKIGLKESTLIQQYPLDHIFIKDPPLVEK 426

Query: 360 ANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYK 390
             W   + VYL  QNVKLD+DRFR  L+ A++
Sbjct: 427 IGWEEFKSVYLDKQNVKLDVDRFRPTLQKAFE 458


>gi|195649525|gb|ACG44230.1| glycosyltransferase [Zea mays]
 gi|414876589|tpg|DAA53720.1| TPA: glycosyltransferase [Zea mays]
          Length = 505

 Score =  210 bits (535), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 122/371 (32%), Positives = 200/371 (53%), Gaps = 31/371 (8%)

Query: 29  NGSALFDPKTSTFFSVGHTDSTPSQPSLR--IKTQPYPRKSDKSAMSKVKELTITTSAPP 86
           NGSAL    + TF         P+ PS R   K +PY R++    M  V  +T+T    P
Sbjct: 146 NGSAL----SVTF--------VPASPSERREWKVRPYSRRT----MVGVDRVTVTRLGSP 189

Query: 87  N---LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCND 143
           +     C V H+ P ++F+ GG  GN++H+F D  +PL+     +    +V+  +++   
Sbjct: 190 DDPAAPCAVAHSVPGVLFALGGLTGNYWHDFSDVLMPLYAASRRY--GGEVLFLVSNMQP 247

Query: 144 QWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL-LPKPKTLVDF 202
            W  KY  ++ R +R+  ++++      CF+ +T+GL  H  + + P L  P   T  DF
Sbjct: 248 WWLGKYEAVVRRLSRYDAVDLDRDARVRCFRRLTVGLRLHKELGVAPELTAPDRLTTADF 307

Query: 203 QSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTI 259
            +FL   Y       ++   T   KP+L+L++R     R  +N+ E+ +AAE  GF+V +
Sbjct: 308 TAFLRETYALQRGAPAAVPTTEGRKPRLMLIHRAHY--RRFVNVPEITRAAEAAGFEVAV 365

Query: 260 FEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QWLSTVYFE 318
             P     + ++ R ++SC A++GVHGAGLT+++FL PG+VL+QVVP G  + ++   F 
Sbjct: 366 ASPRGDAPVEETARAVNSCDALLGVHGAGLTNAVFLPPGAVLIQVVPYGRLERMARRDFG 425

Query: 319 KPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLD 377
            PA  +GL YLEY +  EES+L+E  G    V+++P +   + W  +   YL  Q+V++D
Sbjct: 426 DPAEDMGLRYLEYSVSAEESTLLEMLGPEHPVIRDPDSVHRSGWDKVAEYYLGKQDVRID 485

Query: 378 IDRFRIYLKDA 388
           +DRF   L  A
Sbjct: 486 VDRFAPTLAQA 496


>gi|326512144|dbj|BAJ96053.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 498

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 123/406 (30%), Positives = 210/406 (51%), Gaps = 30/406 (7%)

Query: 4   RPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPY 63
           + A +  A    C  + R  D C   G      ++ T +     D+   +     KT+PY
Sbjct: 104 KAAVVVDASKPVCYETSRRSDTCEAAGDVRLVGRSQTIY----IDTLERE----WKTRPY 155

Query: 64  PRKSDKSAMSKVKELTIT------TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDC 117
            RK D  A+S VKE ++        +AP    C    ++ A V S GG+ GN FH++ D 
Sbjct: 156 CRKHDTYALSHVKEWSLKPFPSGDDAAP---KCTSNSSATAFVISTGGFTGNPFHDYTDV 212

Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
            +P FIT  +H    +V   ++     W  +Y ++  + +R+ +++I+      C+ S  
Sbjct: 213 LIPAFIT--AHRFAGEVQFLVSSYKSWWMSRYIQIFQQMSRYEVVDIDADDEVRCYPSAV 270

Query: 178 LGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLVNR 232
           +G   H  + ++P+  P   ++ DF+  L NA+     T++          +P+L++++R
Sbjct: 271 VGPTFHKELGVDPSKAPSGASMADFRKMLRNAFGLERATATPSGDRWDIRRRPRLLIISR 330

Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHS 292
               GR  +N R +   A  LGFDV I +P+ ++  +   R ++SC  MVGVHGAGLT+ 
Sbjct: 331 RTSRGRAFMNERAMADMAGSLGFDVRIGDPDTTSDTSKFARLVNSCDVMVGVHGAGLTNM 390

Query: 293 LFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
           +FL  G+VL+QVVP G  +WL+   F +P+  + + YLEY ++ +E++L E+Y ++ LVL
Sbjct: 391 VFLPAGAVLVQVVPYGRLEWLARNTFAEPSAGMEVHYLEYVVQLDETTLSEQYPSDHLVL 450

Query: 352 KNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
           K+P A     W  ++  YL  QNV+  + R    LK  + +A K++
Sbjct: 451 KDPMAIHKQGWDALKTTYLDKQNVRPHLGR----LKKTFLQALKML 492


>gi|15229594|ref|NP_188446.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
 gi|9294074|dbj|BAB02031.1| unnamed protein product [Arabidopsis thaliana]
 gi|332642539|gb|AEE76060.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
          Length = 470

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 125/373 (33%), Positives = 209/373 (56%), Gaps = 18/373 (4%)

Query: 22  SYDLCLINGSALFDPKTSTFFS-VGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTI 80
           S + C ++G      K++T  + V    S  S   +R    PY RK    AM +V+E T+
Sbjct: 101 SKEFCDVSGDVRIHGKSATVLAAVTFAFSGNSTWYMR----PYARKDQVPAMKRVREWTV 156

Query: 81  TTSAPPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIA 139
                 +LS C   H+ PA++FS GG++ N FH+F D  +PL+ T        +V   + 
Sbjct: 157 KLVQNASLSRCVRNHSVPAILFSLGGFSLNNFHDFTDIVIPLYTTARRF--SGEVQFLVT 214

Query: 140 DCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPK-- 197
           + N  W  K+ EL+ + + + +I I+ +  THCF SV +GL  H       T  P     
Sbjct: 215 NKNLLWINKFKELVRKLSNYEVIYIDEEDETHCFSSVIVGLNRHRDYDKELTTDPSNSEY 274

Query: 198 TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDV 257
           ++ DF+ FL + Y+   +  ++    KP++++++R+    R  +N  E+ +AA ++GF V
Sbjct: 275 SMSDFRKFLRDTYSLRNSAVTT--RRKPRILILSRSR--SRAFVNAGEIARAARQIGFKV 330

Query: 258 TIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QWLSTVY 316
            +   E +T +A     ++SC  M+GVHGAG+T+ +FL   ++++Q++PIG  +WL+ + 
Sbjct: 331 VV--AEANTEIASFAITVNSCDVMLGVHGAGMTNMVFLPDNAIVIQILPIGGFEWLAKMD 388

Query: 317 FEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVK 375
           FE P++ + L YLEYKI  EES+LV++YG +   +++P A A   W   + VYL  QNV 
Sbjct: 389 FEYPSKGMNLRYLEYKITAEESTLVKQYGRDHEFVRDPLAVAKRGWGTFKSVYLVQQNVS 448

Query: 376 LDIDRFRIYLKDA 388
           +DI+RF++ L  A
Sbjct: 449 VDINRFKLVLVKA 461


>gi|357161022|ref|XP_003578952.1| PREDICTED: uncharacterized protein LOC100833330 [Brachypodium
           distachyon]
          Length = 484

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 127/394 (32%), Positives = 210/394 (53%), Gaps = 31/394 (7%)

Query: 15  TCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
            C  + R  D C + G      ++ T +     D    +     KT+PY RK D  A+S 
Sbjct: 104 VCYETSRRSDTCEVAGDVRLVGRSQTIY----VDVLKQE----WKTKPYCRKHDTFALSH 155

Query: 75  VKELTIT----TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
           VKE ++      SA P   C    ++ A V S GG+ GN FH++ D  +P FI+  +H  
Sbjct: 156 VKEWSLKPAGDGSAVPE--CTSNSSATAFVLSTGGFTGNPFHDYTDVLIPAFIS--AHRF 211

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
             +V   ++     W  KY ++  + +R+ +I+I+      C++SV +G   H  + ++P
Sbjct: 212 AGEVQFLVSSYKPWWMNKYIQIFQQMSRYEVIDIDADDEVRCYRSVVVGPTFHKELGVDP 271

Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLVNRNARVGRTILNLRE 245
           +      ++VDF+  L NA+     T++          +P+L++++R A  GR  +N R 
Sbjct: 272 S---SGISVVDFRKMLRNAFGLERATATPSGDRWDIRRRPRLLIISRRASRGRAFMNERA 328

Query: 246 VKKAAEELGFDVTIFEPEESTSLADSF-RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
           +   A  LGFDV I +P+  ++    F R ++SC  MVGVHGAGLT+ +FL  G+VL+QV
Sbjct: 329 MADMAGSLGFDVRIGDPDTGSTDTSKFARLVNSCDVMVGVHGAGLTNMVFLPAGAVLVQV 388

Query: 305 VPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS 363
           VP G  +WL+   F +P+  + + YLEY ++ +E++L E+Y +N LVLK+P A     W 
Sbjct: 389 VPYGRLEWLARNTFAEPSAGMEILYLEYVVQLDETTLSEQYPSNHLVLKDPMAIHKQGWD 448

Query: 364 NMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
            ++  YL  QNV+  + R    LK+ + +A K++
Sbjct: 449 ALKTTYLDKQNVRPHLGR----LKNTFLQALKML 478


>gi|242052233|ref|XP_002455262.1| hypothetical protein SORBIDRAFT_03g007370 [Sorghum bicolor]
 gi|241927237|gb|EES00382.1| hypothetical protein SORBIDRAFT_03g007370 [Sorghum bicolor]
          Length = 664

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 123/349 (35%), Positives = 189/349 (54%), Gaps = 21/349 (6%)

Query: 61  QPYPRKSDKSAMSKVKELTI--------TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFH 112
           +PYPRK D +AM  V+ LT+         T A P  +C   H  PALVFS  GY GN+FH
Sbjct: 313 KPYPRKVDPNAMHSVRALTVRSVVAMATATDAAPPPACMDWHDVPALVFSVRGYTGNYFH 372

Query: 113 EFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHC 172
            + D  +PLF+T   +    +V+L +      W  KY  +    +++  +++++    HC
Sbjct: 373 AYTDVILPLFLTARQY--SGEVLLLVTGFQAWWVGKYLPVFRSLSKYAAVDLDSDPRVHC 430

Query: 173 FQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTS-----SSFHHTKPKL 227
           F+ V +GL SH    I+P   P   +++DF  F+   Y    + +     S     +   
Sbjct: 431 FRRVQVGLTSHHDFSIDPRRAPNGYSMLDFAQFMRATYALPRDVALSPSPSPERRRRRPR 490

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           +LV   AR  R  LN  E+ + A ++GF+V + E   +  +A     +++C A+VGVHGA
Sbjct: 491 LLVIARART-RRFLNTEEIVRGARKVGFEVVVSE--GTHEVAPFAELVNTCDAVVGVHGA 547

Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
           GLT+ +FL  G V++QV+P+G       YF  PA  +GL YLEY+I  EES+LV++Y  +
Sbjct: 548 GLTNMVFLPRGGVVVQVLPLGPLEFVASYFRGPAGDMGLTYLEYRISPEESTLVDQYPRD 607

Query: 348 DLVLKNPQAFA--GANW-SNMRVYLKTQNVKLDIDRFRIYLKDAYKKAK 393
             VL +P + +    +W S M VYL  Q+V+LD+ RFR  LK A  + +
Sbjct: 608 HPVLTDPMSLSSKAKDWVSFMGVYLFKQDVRLDMKRFRPVLKKALARLR 656


>gi|413934852|gb|AFW69403.1| glycosyltransferase [Zea mays]
          Length = 455

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 122/341 (35%), Positives = 191/341 (56%), Gaps = 15/341 (4%)

Query: 59  KTQPYPRKSDKSAMSKVKELTITTSAPPNLS-----CGVTHTSPALVFSAGGYNGNFFHE 113
           +T+PY R+ D  AM  V+E T+     PN +     C  TH+ P  +FS+GG+ GN +H+
Sbjct: 112 RTKPYARRHDAVAMDDVREFTLVPFGGPNDTAVPPLCTRTHSVPGFLFSSGGFAGNLYHD 171

Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
           + D  VPLF + N H    +V   +AD  D WA K+  L  + +R+ +I++NN    HCF
Sbjct: 172 YADVLVPLFASTN-HL-GGEVQFLLADIKDWWADKFRPLFRQLSRYDVIDVNNDREVHCF 229

Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNEN----TNTSSSFHHTKPKLVL 229
             + +G   H  M I+P+  P   T+ DF+  L  A+       + + +     +P+L++
Sbjct: 230 PRIIIGSTFHRAMGIDPSRSPGGVTVADFKRLLRRAFRLERAVASRSGAPRRRDRPRLLI 289

Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGL 289
           ++R +   R  +N R + +AA    FDV I EP+  T + +  R ++S   M+GVHGAGL
Sbjct: 290 ISRKS--SRRFVNERAMARAAAAARFDVRIAEPDNHTDMPNFARLVNSADVMMGVHGAGL 347

Query: 290 THSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
           T+ +FL   +VL+QVVP G  +WL+ V F+ PAR + + Y+EY +  EESSL + Y  + 
Sbjct: 348 TNMVFLPSRAVLVQVVPFGGLEWLTRVTFKDPARDMDVTYMEYNVSLEESSLRDLYPEDH 407

Query: 349 LVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
             LK+P       W  ++ VYL  QNV+L++ RF   L+ A
Sbjct: 408 FYLKHPYDVHKKGWDAIKTVYLDKQNVRLNLTRFTRTLEQA 448


>gi|115482674|ref|NP_001064930.1| Os10g0492200 [Oryza sativa Japonica Group]
 gi|10140717|gb|AAG13551.1|AC023240_24 unknown protein [Oryza sativa Japonica Group]
 gi|31432816|gb|AAP54403.1| glycosyltransferase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113639539|dbj|BAF26844.1| Os10g0492200 [Oryza sativa Japonica Group]
 gi|215766456|dbj|BAG98764.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222613059|gb|EEE51191.1| hypothetical protein OsJ_31998 [Oryza sativa Japonica Group]
          Length = 500

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 122/360 (33%), Positives = 197/360 (54%), Gaps = 25/360 (6%)

Query: 59  KTQPYPRKSDKSAMSKVKELTI---------TTSAPPNLSCGVTHTSPALVFSAGGYNGN 109
           KT+PY RK D  A+S VKE T+            A     C    T+ A V S GG+ GN
Sbjct: 144 KTKPYCRKHDAFALSHVKEWTLRPLPSGDGGGGGAAVAPRCTTNSTATAFVLSTGGFTGN 203

Query: 110 FFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTI 169
            FH++ D  +P FIT  +H    +V   ++     W  KY ++  + +RH +++++    
Sbjct: 204 PFHDYTDVLIPAFIT--AHRFGGEVQFLVSSYKSWWMNKYIQIFQQMSRHDVVDVDADGD 261

Query: 170 T-HCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH-----HT 223
              C++S  +G   H  + ++PT  P   +++DF+  L  A+  +  T++          
Sbjct: 262 EVRCYRSAVVGPEFHRELGVDPTKTPSGYSVLDFRKMLRGAFGLDRATATPSGDRWDIRR 321

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
           +P+L++++R A  GR  +N R +   A  LGFDV + EP+ ST  +   R ++SC  MVG
Sbjct: 322 RPRLLIISRRAARGRAFMNERAMADMAASLGFDVRVGEPDASTDTSKFARLVNSCDVMVG 381

Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVE 342
           VHGAGLT+ +FL  G+VL+QVVP G  +WL+   F +P+  + + YLEY ++ +E++L E
Sbjct: 382 VHGAGLTNMVFLPAGAVLVQVVPYGKLEWLARNTFAEPSSAMEIHYLEYAVQLDETTLSE 441

Query: 343 KYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM--DKE 399
           +Y A+  VL++P A     W  ++  YL  QNV+  + R    LK+ + +A KL+  DKE
Sbjct: 442 QYPADHPVLRDPMAIHKQGWEALKTTYLDKQNVRPHLGR----LKNTFLQALKLLPHDKE 497


>gi|414868242|tpg|DAA46799.1| TPA: hypothetical protein ZEAMMB73_066234 [Zea mays]
          Length = 468

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 126/393 (32%), Positives = 203/393 (51%), Gaps = 22/393 (5%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + C+ S+   + C + G      K++T + V  +   P   +  IK +PY RK ++  M 
Sbjct: 77  LICNLSNHQSNTCTMKGDVRIHGKSATVYVVSASTYCPENST--IKLRPYARKWEEQVML 134

Query: 74  KVKELTITTSAPPNLS----------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
            V+E+T+ +S+PP             C V H  PA+VFS GGYN NFFH   D  +PL++
Sbjct: 135 LVREVTVRSSSPPAGGGSAHDPPPPQCSVRHDMPAVVFSTGGYNRNFFHVMTDVIIPLYL 194

Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
           T   +  D  V L   D   +W  KY  +L   + +P+I+++ +    CFQS  +GL SH
Sbjct: 195 TAREY--DGHVQLLATDYEPKWIAKYKAILAALSSYPVIDMDTEDTVRCFQSAHVGLESH 252

Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYN----ENTNTSSSFHHTKPKLVLVNRNARVGRT 239
             + I P L     T+V F+ F+ +AY+      T  S S    KP+LV+V R  R  R 
Sbjct: 253 KELGIVPALSRNGYTMVSFRDFIRSAYSLQRARVTPVSRSTTGRKPRLVMVLR--RNSRQ 310

Query: 240 ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGS 299
           + N  +   AA  +GF+V    P++ + L      ++SC  ++GVHGAGL + LFL   +
Sbjct: 311 LKNEADAIAAAAGVGFEVVAAGPDDVSDLERFPGVVNSCDVLMGVHGAGLANMLFLPHNA 370

Query: 300 VLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFA 358
            ++Q++P G  +W     +  P   +GL YL+Y+   EE++L E Y  +  V  +P +  
Sbjct: 371 TVVQIIPWGELRWACRHSYGDPVPDMGLRYLDYEASAEETTLKETYPRDHAVFTDPLSIH 430

Query: 359 GANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYK 390
              +  M  +++  Q+V +DIDRF  ++K  Y+
Sbjct: 431 HQGFDKMWNIFINGQHVIVDIDRFTGFMKQLYQ 463


>gi|357123674|ref|XP_003563533.1| PREDICTED: uncharacterized protein LOC100826633 [Brachypodium
           distachyon]
          Length = 455

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 128/378 (33%), Positives = 195/378 (51%), Gaps = 28/378 (7%)

Query: 15  TCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
           TC  + +  + C   G   FD   S  +        P     R  T+PY R+ D  AM  
Sbjct: 82  TCYNTSKRSERCAAVGDIRFDGNHSKIYI------NPLDKEWR--TKPYARRHDAVAMDD 133

Query: 75  VKELTI---------TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
           V+E T+         TT  PP   C   H+ PA +FS+GG+ GN +H++ D  VPLF T 
Sbjct: 134 VREFTLLPFDTESSNTTVVPP--LCTRNHSVPAFLFSSGGFAGNLYHDYTDVLVPLF-TS 190

Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR 185
             HF  + V   + D  D W  K+  L  + + + +I+ +N    HCF+ + +G   H  
Sbjct: 191 THHFRGE-VQFLLTDIKDWWLDKFTPLFRQLSNYDVIDADNDQQVHCFRRIIIGATFHRA 249

Query: 186 MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE 245
           M I+P   P  +T+ DF+  L +A++     +S      P+L++++R +   R  LN R 
Sbjct: 250 MGIDPKRSPGGETVADFKRLLRHAFHLTRPVAS---RDNPRLLIISRKS--SRRFLNERA 304

Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           +  AA    FDV I EP+  T + +  R ++S   M+GVHGAGLT+ +FL   +VL+QVV
Sbjct: 305 MAHAAALAKFDVRIAEPDNHTDMPNFARLVNSADIMMGVHGAGLTNMVFLPSRAVLLQVV 364

Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
           P G  +WLS V F+ PA+ + + Y+EY +  EESSL   Y      LK+P       W  
Sbjct: 365 PFGGLEWLSRVTFKDPAKDMDVNYMEYNVSLEESSLRNLYPEGHFYLKHPYDVHKKGWDA 424

Query: 365 MR-VYLKTQNVKLDIDRF 381
           ++ VYL  Q+V+L++ +F
Sbjct: 425 IKTVYLDKQSVRLNLTKF 442


>gi|388494572|gb|AFK35352.1| unknown [Lotus japonicus]
          Length = 197

 Score =  207 bits (527), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 98/189 (51%), Positives = 133/189 (70%)

Query: 208 NAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTS 267
           N+   NT   +S  H KP+L+L++R+    R ILN  EV K AEE+GF+V + EP   +S
Sbjct: 5   NSSRLNTTDQNSDEHPKPRLILLSRSGNASRVILNQDEVIKLAEEVGFNVHVLEPSRKSS 64

Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE 327
           +A+ +  IH+ H ++GVHGAGLT+SLFLRPGSVL+QVVPIGT W S  Y+EKP  +LGL+
Sbjct: 65  MANIYNMIHTSHVLLGVHGAGLTNSLFLRPGSVLVQVVPIGTDWASKTYYEKPTEILGLQ 124

Query: 328 YLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKD 387
           Y+EYKI+  ESSL   YGA+ LV+K+P  +     +N R+YLK QN++++I RFR  L  
Sbjct: 125 YIEYKIEANESSLSLSYGADSLVIKDPATYLKEKGANKRIYLKKQNLEINIFRFRKCLAK 184

Query: 388 AYKKAKKLM 396
           AY+KAK  M
Sbjct: 185 AYEKAKIFM 193


>gi|56409862|emb|CAI30079.1| glycosyltransferase [Saccharum officinarum]
          Length = 484

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 114/348 (32%), Positives = 194/348 (55%), Gaps = 18/348 (5%)

Query: 59  KTQPYPRKSDKSAMSKVKELTIT---TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFM 115
           KT+PY RK D  A++ VKE ++    + A P   C V  ++ A V S GG+ GN FH++ 
Sbjct: 139 KTKPYCRKHDNFALAHVKEWSLKPLPSGAAPQ--CTVNSSATAFVLSTGGFTGNPFHDYT 196

Query: 116 DCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQS 175
           D  +P FIT  +H    +V   ++     W  +Y ++  + +RH +++I+      C++S
Sbjct: 197 DVLIPAFIT--AHPLRGEVQFLVSSYKSWWMNRYIQIFQQMSRHEVVDIDADDEVRCYRS 254

Query: 176 VTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLV 230
           V +G   H  + ++ +  P   +  DF+  L +A+     T++          +P+L+++
Sbjct: 255 VVVGPTFHRELGVDASKSPSGYSTADFRKMLRDAFGLERATATPSGDRWDIRRRPRLLII 314

Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
           +R    GR  +N R +   A  LGFDV I EP+ ST  +   R ++SC  MVGVHGAGLT
Sbjct: 315 SRRPSRGRAFMNERAMADMAASLGFDVRIGEPDSSTDTSKFARLVNSCDVMVGVHGAGLT 374

Query: 291 HSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDL 349
           + +FL  G+V++QVVP G  +WL+   F +P+  + + YLEY ++ +E++L E+Y ++  
Sbjct: 375 NMVFLPAGAVVVQVVPYGRLEWLARNTFAEPSAGMEVHYLEYVVQMDETTLSEQYPSDHP 434

Query: 350 VLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
           VL++P A     +  + R YL  QNV+  + R    LK+ + +A K++
Sbjct: 435 VLRDPMAIHKQGYRALNRTYLDKQNVRPHLGR----LKNTFLQALKML 478


>gi|147840299|emb|CAN66138.1| hypothetical protein VITISV_006754 [Vitis vinifera]
          Length = 395

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 112/316 (35%), Positives = 169/316 (53%), Gaps = 37/316 (11%)

Query: 107 NGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININN 166
           NGN +HEF D  +PL+IT  S   ++ V+  I + +D W  KY +++ + + +P I+ + 
Sbjct: 69  NGNIYHEFNDGILPLYIT--SQHLNKRVVFVILEYHDWWITKYGDVISQLSDYPPIDFSG 126

Query: 167 QTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY---------------- 210
              THCF    +GL  H  + ++ +L+   +++ DF++ L  AY                
Sbjct: 127 DNRTHCFPEAIVGLRIHDELTVDSSLVEGNESIRDFRNLLDQAYLPRIRSLIQAKEQKVQ 186

Query: 211 ----------------NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELG 254
                            E           KPKLV+++R     R I N   + + A+E+G
Sbjct: 187 SKMKEEPSLPPSLKPPLETGKEEQVDQLKKPKLVVLSRTG--ARAITNEDLMVQMAKEIG 244

Query: 255 FDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLST 314
           F V +  P  +T LA  +R ++S  AMVGVHGA +TH LFL+PGSV +QV+P+GT+W + 
Sbjct: 245 FQVKVLRPNRATELAKIYRVLNSSDAMVGVHGAAMTHILFLQPGSVFIQVIPLGTEWPAE 304

Query: 315 VYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQN 373
            Y+ +PA  LGL+Y+ YKI   ESSL  +Y  +D VL++P++ A   W    RVYL  Q 
Sbjct: 305 TYYGEPAEKLGLKYMGYKILPRESSLYNEYAKDDPVLRDPESLAKKGWEFTKRVYLDRQT 364

Query: 374 VKLDIDRFRIYLKDAY 389
           V LD+ RFR  L  AY
Sbjct: 365 VTLDLRRFRKQLVCAY 380


>gi|125581917|gb|EAZ22848.1| hypothetical protein OsJ_06527 [Oryza sativa Japonica Group]
          Length = 481

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 128/393 (32%), Positives = 208/393 (52%), Gaps = 28/393 (7%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           C  S R  D C   G      ++ T         +P +   ++K  PY RK D  A+S V
Sbjct: 99  CYESSRRSDTCEATGDVRVHGRSQTIHI------SPLEQEWKVK--PYCRKHDAFALSHV 150

Query: 76  KE-----LTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
           KE     L+   + P    C V  ++ A V S GG+ GN FH++ D  +P FIT  +H  
Sbjct: 151 KEWALRPLSTADTYPAVPHCTVNSSATAFVLSTGGFTGNPFHDYTDVLIPAFIT--AHRF 208

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
             +V   ++     W  +Y ++  R +++ +++I+N     C++SV +G   H  + ++ 
Sbjct: 209 AGEVQFLVSSFKSWWTNRYMQIFQRLSKYELVDIDNDDEVRCYRSVVVGPTFHKELGVDA 268

Query: 191 TLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLRE 245
           +  P   ++VDF++ L  A   E      S        +P+L++++R  R  R  LN R 
Sbjct: 269 SRTPSGYSMVDFRTMLRGALGLERAVAEPSGDRWDIRRRPRLLIISR--RNSRAFLNERA 326

Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           +   A  LGF V + EP+ ST ++   R ++S   MVGVHGAGLT+ +FL  G+VL+QVV
Sbjct: 327 MADMAMSLGFYVRLGEPDISTDVSKFARLVNSADVMVGVHGAGLTNMVFLPAGAVLIQVV 386

Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
           P G  +WL+   F++PA  + + YLEY I+ +E++L E+Y  +D VLK+P +     W+ 
Sbjct: 387 PYGGLEWLARGTFKEPAADMQIHYLEYMIQLDETTLSEQYPKDDPVLKDPYSIHKQGWNA 446

Query: 365 MR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
           ++ VYL  QNV+  + R    LK+ + +A KL+
Sbjct: 447 LKMVYLDKQNVRPHLGR----LKNTFMEALKLL 475


>gi|414876581|tpg|DAA53712.1| TPA: hypothetical protein ZEAMMB73_101479 [Zea mays]
          Length = 335

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 114/332 (34%), Positives = 178/332 (53%), Gaps = 11/332 (3%)

Query: 72  MSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
           M  V+ LT+ +      +C   H  PALVFS  GY GN+FH F D  +PLF+T   +   
Sbjct: 1   MRVVRVLTVRSVPGEAPACTDRHGVPALVFSDRGYTGNYFHAFTDVILPLFLTARQY--A 58

Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
            +V L + D    W  K+A +    + + +++++     HCF+ V +GL SH    I+P 
Sbjct: 59  GEVRLLVTDLQAWWVGKFAPVFRSISNYELVDLDRDPRVHCFRHVQVGLTSHDDFSIDPR 118

Query: 192 LLPKPKTLVDFQSFLANAYN----ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
             P   +++DF  F+  AY     +    +      +P+L+L+ R AR  R  +N  E+ 
Sbjct: 119 RAPNGYSMLDFTGFMRAAYGLPRGDVAAAAGPSSKRRPRLLLIAR-ART-RRFVNAEEIV 176

Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
           + AE+LGF+V +   E +  +A      +SC A++GVHGAGLT+ +F+  G V++QVVP+
Sbjct: 177 RGAEKLGFEVVV--SEGTHEVAPFAELANSCDAIMGVHGAGLTNMVFVPTGGVVIQVVPL 234

Query: 308 GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
           G       YF  P+R +GL YLEY+I  EES+L+ +Y  +  +  +P       W +++ 
Sbjct: 235 GGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLINQYPRDHPIFTDPNGIKSKGWESLKD 294

Query: 367 VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
            YL  Q+V LD+ RFR  LK A    +K   K
Sbjct: 295 AYLDKQDVSLDMKRFRPTLKKAIAHIRKARAK 326


>gi|46390169|dbj|BAD15602.1| putative HGA1 [Oryza sativa Japonica Group]
 gi|46390975|dbj|BAD16510.1| putative HGA1 [Oryza sativa Japonica Group]
 gi|215686668|dbj|BAG88921.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 503

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 128/393 (32%), Positives = 208/393 (52%), Gaps = 28/393 (7%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           C  S R  D C   G      ++ T         +P +   ++K  PY RK D  A+S V
Sbjct: 121 CYESSRRSDTCEATGDVRVHGRSQTIHI------SPLEQEWKVK--PYCRKHDAFALSHV 172

Query: 76  KE-----LTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
           KE     L+   + P    C V  ++ A V S GG+ GN FH++ D  +P FIT  +H  
Sbjct: 173 KEWALRPLSTADTYPAVPHCTVNSSATAFVLSTGGFTGNPFHDYTDVLIPAFIT--AHRF 230

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
             +V   ++     W  +Y ++  R +++ +++I+N     C++SV +G   H  + ++ 
Sbjct: 231 AGEVQFLVSSFKSWWTNRYMQIFQRLSKYELVDIDNDDEVRCYRSVVVGPTFHKELGVDA 290

Query: 191 TLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLRE 245
           +  P   ++VDF++ L  A   E      S        +P+L++++R  R  R  LN R 
Sbjct: 291 SRTPSGYSMVDFRTMLRGALGLERAVAEPSGDRWDIRRRPRLLIISR--RNSRAFLNERA 348

Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           +   A  LGF V + EP+ ST ++   R ++S   MVGVHGAGLT+ +FL  G+VL+QVV
Sbjct: 349 MADMAMSLGFYVRLGEPDISTDVSKFARLVNSADVMVGVHGAGLTNMVFLPAGAVLIQVV 408

Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
           P G  +WL+   F++PA  + + YLEY I+ +E++L E+Y  +D VLK+P +     W+ 
Sbjct: 409 PYGGLEWLARGTFKEPAADMQIHYLEYMIQLDETTLSEQYPKDDPVLKDPYSIHKQGWNA 468

Query: 365 MR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
           ++ VYL  QNV+  + R    LK+ + +A KL+
Sbjct: 469 LKMVYLDKQNVRPHLGR----LKNTFMEALKLL 497


>gi|115445889|ref|NP_001046724.1| Os02g0331200 [Oryza sativa Japonica Group]
 gi|46390170|dbj|BAD15603.1| putative HGA1 [Oryza sativa Japonica Group]
 gi|46390974|dbj|BAD16509.1| putative HGA1 [Oryza sativa Japonica Group]
 gi|113536255|dbj|BAF08638.1| Os02g0331200 [Oryza sativa Japonica Group]
 gi|215712340|dbj|BAG94467.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737036|dbj|BAG95965.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 128/393 (32%), Positives = 208/393 (52%), Gaps = 28/393 (7%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           C  S R  D C   G      ++ T         +P +   ++K  PY RK D  A+S V
Sbjct: 122 CYESSRRSDTCEATGDVRVHGRSQTIHI------SPLEQEWKVK--PYCRKHDAFALSHV 173

Query: 76  KE-----LTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
           KE     L+   + P    C V  ++ A V S GG+ GN FH++ D  +P FIT  +H  
Sbjct: 174 KEWALRPLSTADTYPAVPHCTVNSSATAFVLSTGGFTGNPFHDYTDVLIPAFIT--AHRF 231

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
             +V   ++     W  +Y ++  R +++ +++I+N     C++SV +G   H  + ++ 
Sbjct: 232 AGEVQFLVSSFKSWWTNRYMQIFQRLSKYELVDIDNDDEVRCYRSVVVGPTFHKELGVDA 291

Query: 191 TLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLRE 245
           +  P   ++VDF++ L  A   E      S        +P+L++++R  R  R  LN R 
Sbjct: 292 SRTPSGYSMVDFRTMLRGALGLERAVAEPSGDRWDIRRRPRLLIISR--RNSRAFLNERA 349

Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           +   A  LGF V + EP+ ST ++   R ++S   MVGVHGAGLT+ +FL  G+VL+QVV
Sbjct: 350 MADMAMSLGFYVRLGEPDISTDVSKFARLVNSADVMVGVHGAGLTNMVFLPAGAVLIQVV 409

Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
           P G  +WL+   F++PA  + + YLEY I+ +E++L E+Y  +D VLK+P +     W+ 
Sbjct: 410 PYGGLEWLARGTFKEPAADMQIHYLEYMIQLDETTLSEQYPKDDPVLKDPYSIHKQGWNA 469

Query: 365 MR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
           ++ VYL  QNV+  + R    LK+ + +A KL+
Sbjct: 470 LKMVYLDKQNVRPHLGR----LKNTFMEALKLL 498


>gi|226531412|ref|NP_001151219.1| glycosyltransferase [Zea mays]
 gi|195645114|gb|ACG42025.1| glycosyltransferase [Zea mays]
          Length = 455

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 121/341 (35%), Positives = 190/341 (55%), Gaps = 15/341 (4%)

Query: 59  KTQPYPRKSDKSAMSKVKELTITTSAPPNLS-----CGVTHTSPALVFSAGGYNGNFFHE 113
           +T+PY R+ D  AM  V+E T+     PN +     C  TH+ P  +FS+GG+ GN +H+
Sbjct: 112 RTKPYARRHDAVAMDDVREFTLVPFGGPNDTAVPPLCTRTHSVPGFLFSSGGFAGNLYHD 171

Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
           + D  VPLF + N H    +V   +AD  D WA K+  +  + +R+ +I++NN    HCF
Sbjct: 172 YADVLVPLFASTN-HL-GGEVQFLLADIKDWWADKFRPVFRQLSRYDVIDVNNDREVHCF 229

Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNEN----TNTSSSFHHTKPKLVL 229
               +G   H  M I+P+  P   T+ DF+  L  A+       + + +     +P+L++
Sbjct: 230 PRTIIGSTFHRAMGIDPSRSPGGVTVADFKRLLRRAFRLERAVASRSGAPRRRDRPRLLI 289

Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGL 289
           ++R +   R  +N R + +AA    FDV I EP+  T + +  R ++S   M+GVHGAGL
Sbjct: 290 ISRKS--SRRFVNERAMARAAAAARFDVRIAEPDNHTDMPNFARLVNSADVMMGVHGAGL 347

Query: 290 THSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
           T+ +FL   +VL+QVVP G  +WL+ V F+ PAR + + Y+EY +  EESSL + Y  + 
Sbjct: 348 TNMVFLPSRAVLVQVVPFGGLEWLTRVTFKDPARDMDVTYMEYNVSLEESSLRDLYPEDH 407

Query: 349 LVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
             LK+P       W  ++ VYL  QNV+L++ RF   L+ A
Sbjct: 408 FYLKHPYDVHKKGWDAIKTVYLDKQNVRLNLTRFTRTLEQA 448


>gi|218184799|gb|EEC67226.1| hypothetical protein OsI_34143 [Oryza sativa Indica Group]
          Length = 504

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 123/363 (33%), Positives = 198/363 (54%), Gaps = 31/363 (8%)

Query: 59  KTQPYPRKSDKSAMSKVKELTI------------TTSAPPNLSCGVTHTSPALVFSAGGY 106
           KT+PY RK D  A+S VKE T+               AP    C    T+ A V S GG+
Sbjct: 148 KTKPYCRKHDAFALSHVKEWTLRPLPSGDGDGGGAAVAP---RCTTNSTATAFVLSTGGF 204

Query: 107 NGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININN 166
            GN FH++ D  +P FIT  +H    +V   ++     W  KY ++  + +RH +++++ 
Sbjct: 205 TGNPFHDYTDVLIPAFIT--AHRFGGEVQFLVSSYKSWWMNKYIQIFQQMSRHDVVDVDA 262

Query: 167 QTIT-HCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH---- 221
                 C++S  +G   H  + ++PT  P   +++DF+  L  A+  +  T++       
Sbjct: 263 DGDEVRCYRSAVVGPEFHRELGVDPTKTPSGYSVLDFRKMLRGAFGLDRATATPSGDRWD 322

Query: 222 -HTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHA 280
              +P+L++++R A  GR  L+ R +   A  LGFDV + EP+ ST  +   R ++SC  
Sbjct: 323 IRRRPRLLIISRRAARGRAFLHERAMADMAASLGFDVRVGEPDASTDTSKFARLVNSCDV 382

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
           MVGVHGAGLT+ +FL  G+VL+QVVP G  +WL+   F +P+  + + YLEY ++ +E++
Sbjct: 383 MVGVHGAGLTNMVFLPAGAVLVQVVPYGKLEWLARNTFAEPSSAMEIHYLEYAVQLDETT 442

Query: 340 LVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM-- 396
           L E+Y A+  VL++P A     W  ++  YL  QNV+  + R    LK+ + +A KL+  
Sbjct: 443 LSEQYPADHPVLRDPMAIHKQGWEALKTTYLDKQNVRPHLGR----LKNTFLQALKLLPH 498

Query: 397 DKE 399
           DKE
Sbjct: 499 DKE 501


>gi|218190631|gb|EEC73058.1| hypothetical protein OsI_07015 [Oryza sativa Indica Group]
          Length = 534

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 118/376 (31%), Positives = 202/376 (53%), Gaps = 21/376 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S+  +D+C + G A    ++ST   V    ++  +  +        R   +  +  +
Sbjct: 158 CDLSNPRFDICELCGDARTIGQSSTVVYVPQNRASNGEEWII-------RAQSRKHLPWI 210

Query: 76  KELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
           K++TI +  S+ P   C   H  PA+VF+ GG   N +H+F D  VPLF+T      ++D
Sbjct: 211 KKVTIKSVNSSEPEPICTSKHHIPAIVFALGGLTANVWHDFSDVLVPLFLTARQF--NRD 268

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           V L I +    + +KY+ +  R TRH II+ ++     C+  V +GL SH  + I+P+  
Sbjct: 269 VQLIITNNQPWFIKKYSAIFSRLTRHEIIDFDSDGQIRCYPHVIVGLRSHRDLGIDPSSS 328

Query: 194 PKPKTLVDFQSFLANAYNENT-------NTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
           P+  T+VDF+ F+  AY                    KP+++L++R     R  +N+  V
Sbjct: 329 PQNYTMVDFRLFVREAYGLPAAEVDIPYKADKDDPDKKPRIMLIDRGK--SRRFVNVAHV 386

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            +  +  GF+V   +P+  ++L +  R + SC A++GVHGAGLT+ +FLR G V++ +VP
Sbjct: 387 VQGLDWFGFEVVKADPKIDSNLDEFVRLVDSCDAIMGVHGAGLTNMVFLRSGGVVVHIVP 446

Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM- 365
            G ++++  ++  PAR +GL ++EY I  EES+L+EKYG N  V+ +P+      W  + 
Sbjct: 447 YGIKFMADGFYGAPARDMGLRHVEYSISPEESTLLEKYGWNHTVINDPETIRKGGWEKVA 506

Query: 366 RVYLKTQNVKLDIDRF 381
             Y+  Q++ L++ RF
Sbjct: 507 EFYMSKQDIVLNMTRF 522


>gi|115487966|ref|NP_001066470.1| Os12g0238900 [Oryza sativa Japonica Group]
 gi|113648977|dbj|BAF29489.1| Os12g0238900, partial [Oryza sativa Japonica Group]
          Length = 520

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 127/393 (32%), Positives = 201/393 (51%), Gaps = 42/393 (10%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S    D+C + G    D   S F  V         P+ +++  PYPRK D ++M +V
Sbjct: 138 CDLSDHRTDVCDLAGDIRMDANASAFVVVVDPAVGADGPTYKVR--PYPRKGDATSMGRV 195

Query: 76  KELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVI 135
            E+T+ T+A             A+VFS  GY GN FH+F D  VPL+ T   +    DV 
Sbjct: 196 TEITVRTTA-------------AVVFSISGYTGNLFHDFTDVIVPLYNTAARYC--GDVQ 240

Query: 136 LAIADCN---DQWARKYAELLPRFTRHPIININNQTI-----THCFQSVTLGLISHGRMV 187
           L + D N    +W  +Y  +L   +RH  +++  +        HCF    +GL +HG ++
Sbjct: 241 LVVTDGNAATRRWLARYGAVLRGLSRHVPLDLAAEAAAGGGEVHCFGHTVVGLRAHGELI 300

Query: 188 INPTLLPKPKTLVDFQSFLANAYN--ENTNTSSSFHHT-------KPKLVLVNRNARVGR 238
           I+    P    + DF  FL  A +   +  T     H        +P+L++++R  R  R
Sbjct: 301 IDRERSPDGLGMPDFTRFLRRALSLPRDAPTRPGGGHGDATKPQPRPRLLIISR--RGTR 358

Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSF----RFIHSCHAMVGVHGAGLTHSLF 294
            +LN   V +AAE++GF+    E + + +  D      R ++S  A+VGVHGAGLT+ +F
Sbjct: 359 LLLNTDAVARAAEQVGFEAVASELDMAGADHDDVARVARLVNSFDAVVGVHGAGLTNMVF 418

Query: 295 LRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKN 353
           L PG+  +Q+VP G  +WL+   F +PA  +GL Y++Y++   ES+L +KY  +  +  N
Sbjct: 419 LPPGAAAVQIVPWGGLRWLARADFGEPAVAMGLRYIQYEVAAGESTLKDKYPRDHEIFTN 478

Query: 354 PQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYL 385
           P A     ++ MR  +L  Q++ +DIDRF+  L
Sbjct: 479 PTALHKKGFTFMRHTFLNGQDIIVDIDRFKPVL 511


>gi|242094622|ref|XP_002437801.1| hypothetical protein SORBIDRAFT_10g002800 [Sorghum bicolor]
 gi|241916024|gb|EER89168.1| hypothetical protein SORBIDRAFT_10g002800 [Sorghum bicolor]
          Length = 499

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 128/393 (32%), Positives = 204/393 (51%), Gaps = 34/393 (8%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPYPRKSDKSAMSK 74
           C  S R  D C   G             VG   +    P  R  K +PY RK D  A+S 
Sbjct: 120 CYESSRRSDTCEAAGDVRV---------VGRAQTVLVSPLEREWKVKPYCRKHDAFALSH 170

Query: 75  VKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
           VKE T+    +  AP    C V  ++ A V S GG+ GN FH++ D  +P FIT   +  
Sbjct: 171 VKEWTLRPVGSDDAP---RCTVNSSATAFVLSTGGFTGNLFHDYTDVLIPAFITARRYA- 226

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
             DV L ++     W  KY ++L + +RH +++ +      C+  V +G   H  + ++ 
Sbjct: 227 -GDVQLLVSSYKPWWTTKYLQVLQQLSRHEVVDADADAEVRCYPRVVVGPTFHRELGVDA 285

Query: 191 TLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLRE 245
           +      ++ +F++ L +A+  E    + S        +P+L++++R  R  R +LN R 
Sbjct: 286 S--SSSPSMPEFRAMLRDAFGLERAAAAPSGDRWDIRRRPRLLIISR--RTSRRLLNERA 341

Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           +   A  LGFDV   +PE ST +    R ++S   MVGVHG GLT+ +FL  G+VL+QVV
Sbjct: 342 MADMATSLGFDVRTGDPEVSTDVGRFARLVNSADVMVGVHGDGLTNMVFLPAGAVLVQVV 401

Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
           P G  +WL+   F +PA  + + YLEY ++++E++L E+YG +D V+++P A     W  
Sbjct: 402 PYGGLEWLARGTFREPAEGMEVHYLEYVVQKDETTLSEEYGEDDPVIRDPAAIHKQGWDA 461

Query: 365 MR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
           ++ VYL  QNV+  + R    LK+ + +A KL+
Sbjct: 462 LKAVYLDKQNVRPHLGR----LKNTFVQALKLL 490


>gi|222622742|gb|EEE56874.1| hypothetical protein OsJ_06512 [Oryza sativa Japonica Group]
          Length = 549

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 118/376 (31%), Positives = 202/376 (53%), Gaps = 21/376 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S+  +D+C + G A    ++ST   V    ++  +  +        R   +  +  +
Sbjct: 173 CDLSNPRFDICELCGDARTIGQSSTVVYVPQNRASNGEEWII-------RAQSRKHLPWI 225

Query: 76  KELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
           K++TI +  S+ P   C   H  PA+VF+ GG   N +H+F D  VPLF+T      ++D
Sbjct: 226 KKVTIKSVNSSEPEPICTSKHHIPAIVFALGGLTANVWHDFSDVLVPLFLTARQF--NRD 283

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           V L I +    + +KY+ +  R TRH II+ ++     C+  V +GL SH  + I+P+  
Sbjct: 284 VQLIITNNQPWFIKKYSAIFSRLTRHEIIDFDSDGQIRCYPHVIVGLRSHRDLGIDPSSS 343

Query: 194 PKPKTLVDFQSFLANAYNENT-------NTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
           P+  T+VDF+ F+  AY                    KP+++L++R     R  +N+  V
Sbjct: 344 PQNYTMVDFRLFVREAYGLPAAEVDIPYKADKDDPDKKPRIMLIDRGK--SRRFVNVAHV 401

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            +  +  GF+V   +P+  ++L +  R + SC A++GVHGAGLT+ +FLR G V++ +VP
Sbjct: 402 VQGLDWFGFEVVKADPKIDSNLDEFVRLVDSCDAIMGVHGAGLTNMVFLRSGGVVVHIVP 461

Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM- 365
            G ++++  ++  PAR +GL ++EY I  EES+L+EKYG N  V+ +P+      W  + 
Sbjct: 462 YGIKFMADGFYGAPARDMGLRHVEYSISPEESTLLEKYGWNHTVINDPETIRKGGWEKVA 521

Query: 366 RVYLKTQNVKLDIDRF 381
             Y+  Q++ L++ RF
Sbjct: 522 EFYMSKQDIVLNMTRF 537


>gi|54291090|dbj|BAD61765.1| putative HGA6 [Oryza sativa Japonica Group]
 gi|125596993|gb|EAZ36773.1| hypothetical protein OsJ_21109 [Oryza sativa Japonica Group]
          Length = 526

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 126/403 (31%), Positives = 207/403 (51%), Gaps = 29/403 (7%)

Query: 7   ALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRK 66
           A +    ++C+ S    D C ++G      ++   + V  +D  P   +  I+  PYPRK
Sbjct: 125 AAAKTSHLSCNFSSAHMDTCAMDGDIRIHGRSGVVYVVASSDYRPENATAVIR--PYPRK 182

Query: 67  SDKSAMSKVKELTITTSAPPN--------------LSCGVTHTSPALVFSAGGYNGNFFH 112
            +++ M +V+++TI ++APP               L C V    PA+VFS GGY+ NFFH
Sbjct: 183 WEQATMERVRQITIRSTAPPGAAVADTDGGGAIIPLRCTVARDMPAVVFSTGGYSVNFFH 242

Query: 113 EFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHC 172
              D  +PL+IT   H     V L  A+ + +W  KY   L   + +P+++++      C
Sbjct: 243 TMNDILLPLYITAREH--GGRVQLLAANYDRRWTAKYQHALAALSMYPVVDLDADAAVRC 300

Query: 173 FQSVTLGLISHGRMVIN-PTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVN 231
           F S  +G+ SH  + I+ P       T+V F +FL +AY+   +  +     +P++V+V 
Sbjct: 301 FPSARVGVESHRVLGIDTPLTGSNGYTMVGFLAFLRSAYSLPRHAVTRTTPRRPRVVMVL 360

Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
           R  R  R + N  EV  A  E GF+V    PEE+  +A     ++SC  MVGVHGAGLT+
Sbjct: 361 R--RKSRALTNEAEVVAAVAEAGFEVVAAGPEEAGDVAGFAATVNSCDVMVGVHGAGLTN 418

Query: 292 SLFLRPGSVLMQVVPI-GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
            +FL     ++Q++P  G +W     + +P   +GL Y+EY++   E++L E+Y  +  V
Sbjct: 419 MVFLPRNGTVVQIIPWGGMKWPCWYDYGEPVPAMGLRYVEYEVAANETTLRERYPMDHPV 478

Query: 351 LKNPQAF--AGAN--WSNMRVYLKTQNVKLDIDRFRIYLKDAY 389
             +P +    G N  WS    +L  QN+ LD++RF+  + + Y
Sbjct: 479 FADPVSIHRKGFNHLWST---FLNGQNLTLDVNRFKAVMAEVY 518


>gi|125555079|gb|EAZ00685.1| hypothetical protein OsI_22705 [Oryza sativa Indica Group]
          Length = 526

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 127/402 (31%), Positives = 209/402 (51%), Gaps = 28/402 (6%)

Query: 7   ALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRK 66
           A +    ++C+ S    D C ++G      ++   + V  +D  P   +  I+  PYPRK
Sbjct: 126 AAAKTSHLSCNFSSAHMDTCAMDGDIRIHGRSGVVYVVASSDYRPENATAVIR--PYPRK 183

Query: 67  SDKSAMSKVKELTITTSAPPN-----------LSCGVTHTSPALVFSAGGYNGNFFHEFM 115
            +++ M +V+++TI ++APP            L C V    PA+VFS GGY+ NFFH   
Sbjct: 184 WEQATMERVRQITIRSTAPPGADTDGGGAIIPLRCTVARDMPAVVFSTGGYSVNFFHTMN 243

Query: 116 DCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQS 175
           D  +PL+IT   H     V L  A+ + +W  KY   L   + +P+++++      CF S
Sbjct: 244 DILLPLYITAREH--GGRVQLLAANYDRRWTAKYQHALAALSMYPVVDLDADAAVRCFPS 301

Query: 176 VTLGLISHGRMVIN-PTLLPKPKTLVDFQSFLANAYN--ENTNTSSSFHHTKPKLVLVNR 232
             +G+ SH  + I+ P       T+V F +FL +AY+   +  T+ +    +P++V+V R
Sbjct: 302 ARVGVESHRVLGIDTPLTGSNGYTMVGFLAFLRSAYSLPRHAVTTHTPSPRRPRVVMVLR 361

Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHS 292
             R  R + N  EV  A  E GF+V    PEE+  +A     ++SC  MVGVHGAGLT+ 
Sbjct: 362 --RKSRALTNEAEVVAAVAEAGFEVVAAGPEEAGDVAGFAATVNSCDVMVGVHGAGLTNM 419

Query: 293 LFLRPGSVLMQVVPI-GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
           +FL     ++Q++P  G +W     + +P   +GL Y+EY++   E++L E+Y  +  V 
Sbjct: 420 VFLPRNGTVVQIIPWGGMKWPCWYDYGEPVPAMGLRYVEYEVAANETTLRERYPMDHPVF 479

Query: 352 KNPQAF--AGAN--WSNMRVYLKTQNVKLDIDRFRIYLKDAY 389
            +P +    G N  WS    +L  QN+ LD++RF+  + + Y
Sbjct: 480 ADPVSIHRKGFNHLWST---FLNGQNLTLDVNRFKAVMAEVY 518


>gi|56409854|emb|CAI30075.1| glycosyltransferase [Triticum aestivum]
          Length = 516

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 129/388 (33%), Positives = 206/388 (53%), Gaps = 32/388 (8%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + C       D C + G    +    T   V  T +     S   K QPY R++    MS
Sbjct: 127 VVCTTEGPFSDTCDVFGDVRTNGTAHTVTLVPATQTE----SREWKIQPYARRT----MS 178

Query: 74  KVKELTIT----TSAP-PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
            + E+T+T    TSA  P  +C VTH  P +VF+ GG  GN+FH+F D  VPL I    +
Sbjct: 179 GISEVTVTQLDSTSAEYPAPACTVTHNIPGIVFALGGLTGNYFHDFSDALVPLVIASRGY 238

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
               +V L  ++    W  KY  L+ R +++ ++++++     CF SVT+GL  H    I
Sbjct: 239 --GGEVQLLASNIQPWWLGKYEALVQRLSKYDVVDLDHDDQIRCFPSVTVGLNMHKEFNI 296

Query: 189 NPTLLPK--PKTLVDFQSFLANAY-----------NENTNTSSSFHHTKPKLVLVNRNAR 235
            P L+P   P ++++F +FL   Y           N+ ++          +L+L++R   
Sbjct: 297 VPELVPGGVPLSMLNFTAFLRETYSLPRAAPIRLTNKKSSPPVDGKKRSRRLMLLDRGHY 356

Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
             R ++N+ E+ KAAE+ GF+VTI +P  +  + +    ++S   ++GVHGAGLT+S FL
Sbjct: 357 --RKLVNVPEIVKAAEKAGFEVTIADPRFNVRVKELALSVNSFDVLLGVHGAGLTNSAFL 414

Query: 296 RPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
            PG+V++QVVP G  + ++   F  PA  +GL YLEY I  EES+L+E  G +  V+K+P
Sbjct: 415 PPGAVVIQVVPYGNLEHMAKREFGDPAANMGLRYLEYSITAEESTLLEMLGPDHPVIKDP 474

Query: 355 QAFAGANWSNM-RVYLKTQNVKLDIDRF 381
           ++   + W  +   YL  Q+V++D+ RF
Sbjct: 475 ESVHRSGWDKVAEYYLGKQDVRVDVQRF 502


>gi|115445865|ref|NP_001046712.1| Os02g0327700 [Oryza sativa Japonica Group]
 gi|46390538|dbj|BAD16025.1| putative HGA1 [Oryza sativa Japonica Group]
 gi|46390878|dbj|BAD16395.1| putative HGA1 [Oryza sativa Japonica Group]
 gi|113536243|dbj|BAF08626.1| Os02g0327700 [Oryza sativa Japonica Group]
 gi|215701502|dbj|BAG92926.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 554

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 119/376 (31%), Positives = 204/376 (54%), Gaps = 21/376 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S+  +D+C + G A    ++ST   V    ++  +  + I+ Q       +  +  +
Sbjct: 178 CDLSNPRFDICELCGDARTIGQSSTVVYVPQNRASNGEEWI-IRAQ------SRKHLPWI 230

Query: 76  KELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
           K++TI +  S+ P   C   H  PA+VF+ GG   N +H+F D  VPLF+T      ++D
Sbjct: 231 KKVTIKSVNSSEPEPICTSKHHIPAIVFALGGLTANVWHDFSDVLVPLFLTARQF--NRD 288

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
           V L I +    + +KY+ +  R TRH II+ ++     C+  V +GL SH  + I+P+  
Sbjct: 289 VQLIITNNQPWFIKKYSAIFSRLTRHEIIDFDSDGQIRCYPHVIVGLRSHRDLGIDPSSS 348

Query: 194 PKPKTLVDFQSFLANAYNENT-------NTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
           P+  T+VDF+ F+  AY                    KP+++L++R     R  +N+  V
Sbjct: 349 PQNYTMVDFRLFVREAYGLPAAEVDIPYKADKDDPDKKPRIMLIDRGK--SRRFVNVAHV 406

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            +  +  GF+V   +P+  ++L +  R + SC A++GVHGAGLT+ +FLR G V++ +VP
Sbjct: 407 VQGLDWFGFEVVKADPKIDSNLDEFVRLVDSCDAIMGVHGAGLTNMVFLRSGGVVVHIVP 466

Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM- 365
            G ++++  ++  PAR +GL ++EY I  EES+L+EKYG N  V+ +P+      W  + 
Sbjct: 467 YGIKFMADGFYGAPARDMGLRHVEYSISPEESTLLEKYGWNHTVINDPETIRKGGWEKVA 526

Query: 366 RVYLKTQNVKLDIDRF 381
             Y+  Q++ L++ RF
Sbjct: 527 EFYMSKQDIVLNMTRF 542


>gi|125555301|gb|EAZ00907.1| hypothetical protein OsI_22937 [Oryza sativa Indica Group]
          Length = 524

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 126/403 (31%), Positives = 209/403 (51%), Gaps = 28/403 (6%)

Query: 4   RPAALSPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSL 56
           +P+      +I+CD +       +    +C + G     PK  T F V  + +     + 
Sbjct: 126 KPSGDEAEATISCDENGVDEGFPYARPPVCELTGDIRISPKEKTMFFVNPSSAGAFDGNG 185

Query: 57  RIKTQPYPRKSDKSAMSKVKELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEF 114
             K +PY RK D   +  V E+ I + + P ++  C  TH  PA+VFS  GY  NFFH+ 
Sbjct: 186 EKKIRPYARKDD-FLLPGVVEVIIKSVSSPAIAPACTRTHNVPAVVFSVAGYTDNFFHDN 244

Query: 115 MDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQ 174
            D  +PLF+T  SH   + V   I +    W  K+  LL + + + +IN +     HCF+
Sbjct: 245 TDVMIPLFLT-TSHLAGE-VQFLITNFKPWWVHKFTPLLKKLSNYGVINFDKDDEVHCFR 302

Query: 175 SVTLGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVL 229
              LGL     ++I+P  T  P+  ++VD+  FL  A+    ++ +        KPK+++
Sbjct: 303 RGHLGLYRDRDLIISPHPTRNPRNYSMVDYNRFLRRAFGLPRDSPAVLGDKTGAKPKMLM 362

Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHG 286
           + R     R +LNLR+V    E+LGF VT+ E     + AD   F   +++   ++ VHG
Sbjct: 363 IERKGT--RKLLNLRDVAALCEDLGFAVTVAE-----AGADVRGFAEKVNAADVLLAVHG 415

Query: 287 AGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYG 345
           AGLT+ +FL  G+VL+Q+VP G   W++T ++ +PAR + L Y+EY + +EE++L +KY 
Sbjct: 416 AGLTNQIFLPTGAVLVQIVPWGKMDWMATNFYGQPARDMRLRYVEYYVSEEETTLKDKYP 475

Query: 346 ANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
            +  V K+P A     W  +   +  Q+V +++ RF+ +L  A
Sbjct: 476 RDHYVFKDPMAIHAQGWPALAEIVMKQDVTVNVTRFKPFLLKA 518


>gi|297724843|ref|NP_001174785.1| Os06g0470150 [Oryza sativa Japonica Group]
 gi|51090436|dbj|BAD35388.1| putative HGA4 [Oryza sativa Japonica Group]
 gi|125597196|gb|EAZ36976.1| hypothetical protein OsJ_21315 [Oryza sativa Japonica Group]
 gi|215769459|dbj|BAH01688.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255677040|dbj|BAH93513.1| Os06g0470150 [Oryza sativa Japonica Group]
          Length = 524

 Score =  204 bits (520), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 126/403 (31%), Positives = 209/403 (51%), Gaps = 28/403 (6%)

Query: 4   RPAALSPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSL 56
           +P+      +I+CD +       +    +C + G     PK  T F V  + +     + 
Sbjct: 126 KPSGDEAEATISCDENGVDEGFPYARPPVCELTGDIRISPKEKTMFFVNPSSAGAFDGNG 185

Query: 57  RIKTQPYPRKSDKSAMSKVKELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEF 114
             K +PY RK D   +  V E+ I + + P ++  C  TH  PA+VFS  GY  NFFH+ 
Sbjct: 186 EKKIRPYARKDD-FLLPGVVEVIIKSVSSPAIAPACTRTHNVPAVVFSVAGYTDNFFHDN 244

Query: 115 MDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQ 174
            D  +PLF+T  SH   + V   I +    W  K+  LL + + + +IN +     HCF+
Sbjct: 245 TDVMIPLFLT-TSHLAGE-VQFLITNFKPWWVHKFTPLLKKLSNYGVINFDKDDEVHCFR 302

Query: 175 SVTLGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVL 229
              LGL     ++I+P  T  P+  ++VD+  FL  A+    ++ +        KPK+++
Sbjct: 303 RGHLGLYRDRDLIISPHPTRNPRNYSMVDYNRFLRRAFGLPRDSPAVLGDKTGAKPKMLM 362

Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHG 286
           + R     R +LNLR+V    E+LGF VT+ E     + AD   F   +++   ++ VHG
Sbjct: 363 IERKGT--RKLLNLRDVAALCEDLGFAVTVAE-----AGADVRGFAEKVNAADVLLAVHG 415

Query: 287 AGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYG 345
           AGLT+ +FL  G+VL+Q+VP G   W++T ++ +PAR + L Y+EY + +EE++L +KY 
Sbjct: 416 AGLTNQIFLPTGAVLVQIVPWGKMDWMATNFYGQPARDMRLRYVEYYVSEEETTLKDKYP 475

Query: 346 ANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
            +  V K+P A     W  +   +  Q+V +++ RF+ +L  A
Sbjct: 476 RDHYVFKDPMAIHAQGWPALAEIVMKQDVTVNVTRFKPFLLKA 518


>gi|242052289|ref|XP_002455290.1| hypothetical protein SORBIDRAFT_03g007900 [Sorghum bicolor]
 gi|241927265|gb|EES00410.1| hypothetical protein SORBIDRAFT_03g007900 [Sorghum bicolor]
          Length = 465

 Score =  204 bits (520), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 131/394 (33%), Positives = 197/394 (50%), Gaps = 27/394 (6%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + CD S    D+C + G     P  +             + S R+K  P+ RK+D+ A++
Sbjct: 75  VLCDFSSSRSDMCELKGDVRVLPNATIVLR----HPWARRQSWRMK--PHGRKNDRHALA 128

Query: 74  KVKELTITTS-----APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
           +V E+T+ +S     A     C  ++T+PA+VFS GGY GN FH+  D  VPLFIT    
Sbjct: 129 RVTEVTVASSHSHHTAGAAPRCTASYTAPAVVFSVGGYAGNMFHDLTDVLVPLFITTRRF 188

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ--NQTITHCFQSVTLGLISHGRM 186
               DV L ++D    W  K+  LL   +RH ++++N        C+  V LGL  H  M
Sbjct: 189 --GGDVHLLVSDAQPWWLDKFRPLLGALSRHAVVDMNRGGSRGVLCYPHVILGLKFHKEM 246

Query: 187 VINPTLLPKPKTLVDFQSFLANAYNENTNTS--------SSFHHTKPKLVLVNRNARVGR 238
            ++        ++ DF      +Y    +T+        SS    +P+L+L++R +   R
Sbjct: 247 SVDAARTAGEYSMADFTLLARRSYGLTRDTAIRLGDGNRSSSAAVRPRLLLISRKST--R 304

Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
              N   V +AA  LGF+V + EP     L    R ++SC  +VGVHGAGL + +FL  G
Sbjct: 305 AFTNAGAVARAAAALGFEVVVGEPARHADLPSFARVVNSCDVLVGVHGAGLANLVFLPAG 364

Query: 299 SVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
           +V++QVVP+G    ++   F  PA   GL Y+ Y I  EESSL  +Y  +  VL++P A 
Sbjct: 365 AVVVQVVPLGGLDAMAAEDFGAPATDAGLRYVHYGIAVEESSLARRYPRDHRVLRDPAAV 424

Query: 358 AGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYK 390
               W  +R  YL  QNV +D+ RF   L+ A +
Sbjct: 425 RREGWMALRAAYLVGQNVTIDVRRFGGALRRAME 458


>gi|357123670|ref|XP_003563531.1| PREDICTED: uncharacterized protein LOC100826010 [Brachypodium
           distachyon]
          Length = 476

 Score =  204 bits (519), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 133/419 (31%), Positives = 217/419 (51%), Gaps = 46/419 (10%)

Query: 10  PAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQ--------------PS 55
           P   ++CD S    D C + G+      TS  F V    +T  +               +
Sbjct: 59  PEPHVSCDFSGNRSDYCELTGAIRIRGSTSEVFLVTPRRATAGEFLGPNATGWGMIAANA 118

Query: 56  LRIKTQPYPRKSDKSAMSKVKELTITTSA-----PPNLSCGVTHTS-PALVFSAGGYNGN 109
              + QPY RK +   M+ + +LT+  S+     PP  +C V H   PA+V+S GGY GN
Sbjct: 119 TSWEMQPYTRKGESRIMTGITKLTVRLSSTAEDSPP--ACDVMHDDVPAVVYSNGGYCGN 176

Query: 110 FFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTI 169
           ++H+F D  +PLFIT   H   +   L +      W  KYAE++   ++H  ++++    
Sbjct: 177 YYHDFNDNIIPLFITAR-HLAGE-AQLVVTQKQAWWFGKYAEIVSGLSKHEPVDMDADAR 234

Query: 170 THCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPK 226
            HC++   +GL SH  + I+P   P   ++VDF+ FL   Y+   E+   +   H  KP+
Sbjct: 235 VHCYRRAFVGLRSHKDLSIDPRRAPNNVSMVDFKRFLMWRYSLPREHAIRTDDEHQRKPR 294

Query: 227 LVLVNRNARVGRTILNLREVKKAAEELGFDVTIFE-------PEESTS--------LADS 271
           L++V R +R  R  +NL E+  AAEE+GF+VT  +         ES++        +A++
Sbjct: 295 LLIVTRRSR--RRFVNLEEIVAAAEEVGFEVTASDLITSSKKKGESSADDSKMQARMAEA 352

Query: 272 FRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLE 330
              +++  AM+ VHG+GLT+ LFL   +V++QVVP+G  + L+   +  P R + + Y++
Sbjct: 353 SATVNAYDAMLAVHGSGLTNLLFLPMNAVVVQVVPLGRMEGLAMDEYGVPPRDMNMRYIQ 412

Query: 331 YKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
           Y I  EES+L E Y     V  +P      +WS ++ +YL  Q+V+LD++RFR  L+ A
Sbjct: 413 YNITAEESTLSEMYPRGHPVFLDPGPIHKQSWSLVKDIYLGKQDVRLDLNRFRPVLQKA 471


>gi|357112811|ref|XP_003558200.1| PREDICTED: uncharacterized protein LOC100844505 isoform 2
           [Brachypodium distachyon]
          Length = 543

 Score =  204 bits (519), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 125/404 (30%), Positives = 212/404 (52%), Gaps = 23/404 (5%)

Query: 9   SPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPS-QPSLRIKT 60
           +P   +TCD +       +    +C ++G     PK  T + V  + +      +   + 
Sbjct: 145 TPPAKMTCDENGVDEGFPYARPAVCELSGDIRVSPKDKTMYLVNPSGAAAGFDENGEKRL 204

Query: 61  QPYPRKSDKSAMSKVKELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCF 118
           +PY RK D+  +  V E+T+ +  SA     C   H  PA+VFS  GY  NFFH+  D  
Sbjct: 205 RPYARK-DEFLLPAVVEVTVKSVPSASGAPRCTKRHRVPAVVFSVAGYTDNFFHDNTDAL 263

Query: 119 VPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQT--ITHCFQSV 176
           +PLF+T  +H    +V L I +    W +KY  +L + + + +IN +++     HCF   
Sbjct: 264 IPLFLT-TAHL-KGEVQLLITNYKPWWVQKYTPVLRKLSNYDVINFDDEDGGAVHCFPDG 321

Query: 177 TLGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVN 231
            LGL     ++I+P  T  P+  T+VD+  FL  A     E           +P++++++
Sbjct: 322 YLGLYRDRDLIISPHPTRNPRNYTMVDYNKFLRGALELPREKPAVLGEEPGMRPRMLIIS 381

Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
           R+    R +LNL EV  AA ELGF+VT+ E      +      ++S   ++ VHGAGLT+
Sbjct: 382 RSGT--RRLLNLDEVSAAASELGFNVTVAEAGGEADVPAFAAMVNSADVLLAVHGAGLTN 439

Query: 292 SLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
            +FL   +V++Q+VP G   W++T ++ +PAR + L Y+EY + +EE+SL +KY  + +V
Sbjct: 440 QIFLPTNAVVLQIVPWGNMDWMATNFYGQPAREMQLRYVEYYVGEEETSLKDKYPRDHVV 499

Query: 351 LKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
            ++P+A     W  +   +  Q+V++D+ RFR +L  A  K ++
Sbjct: 500 FRDPKALHTQGWETLADTIMKQDVQVDLSRFRPFLLQAIDKLQE 543


>gi|242056793|ref|XP_002457542.1| hypothetical protein SORBIDRAFT_03g009110 [Sorghum bicolor]
 gi|241929517|gb|EES02662.1| hypothetical protein SORBIDRAFT_03g009110 [Sorghum bicolor]
          Length = 574

 Score =  204 bits (519), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 136/420 (32%), Positives = 203/420 (48%), Gaps = 36/420 (8%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFF----SVGHTDSTPSQPSLRIKT 60
           P +  P+ +I CDRS    D+C ++G    D  + +        G   S+ +    RI+ 
Sbjct: 145 PCSSLPSHTICCDRSDYHSDVCFMSGDVRTDAASLSLLLFPPPQGQGASSTAVTEERIR- 203

Query: 61  QPYPRKSDKSAMSKVKELTI--------TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFH 112
            PY RK D      + E+ +          +A     C V H +P  V +AGGY+ N FH
Sbjct: 204 -PYTRKWDAYITKTIHEVRLRRVVRTEAEAAAGAAHRCDVRHDAPVFVVTAGGYSHNMFH 262

Query: 113 EFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHC 172
            F D F+PL++T  +    + V+LA+   + +WA  Y E+L   +R+ +I++   T THC
Sbjct: 263 VFNDGFLPLWLT--AQHLRRRVVLAVLSYSPRWAGTYGEILSGLSRYRVIDLLRDTQTHC 320

Query: 173 FQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSF----------HH 222
           F    +G   H  + +N T L   +T+VDF  FLA  Y++      S             
Sbjct: 321 FPGAVVGTRYHDYLAVNSTRLRDNRTIVDFHDFLAGVYDDGGGGGGSSSTTEETTPAPRD 380

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
            +P+L +V+R  R  R I N   V   A  +GFDV + E      L+  +  + S  A+V
Sbjct: 381 RRPRLGIVSRKGR--RVIENQAAVAALAASVGFDVDVMETATGVPLSAVYASVSSYDALV 438

Query: 283 GVHGAGLTHSLFLRP---GSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
           GVHGA LT  LFLRP   G+ L+Q+ P+G   LS   F  PA  +GL Y +Y +   ESS
Sbjct: 439 GVHGADLTAFLFLRPGGGGAALVQIAPLGIAMLSRNLFGVPAARMGLRYEQYDVSARESS 498

Query: 340 LVEKYGANDLVLKNP----QAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
           L  +Y A  +V+ +P    +      W  +  VYL+ QNV LD+ RFR  L   + +  K
Sbjct: 499 LSRRYPAGHVVVADPARARREQGKQEWELVEHVYLRGQNVSLDLGRFRETLARIHSRLLK 558


>gi|357112809|ref|XP_003558199.1| PREDICTED: uncharacterized protein LOC100844505 isoform 1
           [Brachypodium distachyon]
          Length = 565

 Score =  204 bits (518), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 125/404 (30%), Positives = 212/404 (52%), Gaps = 23/404 (5%)

Query: 9   SPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPS-QPSLRIKT 60
           +P   +TCD +       +    +C ++G     PK  T + V  + +      +   + 
Sbjct: 167 TPPAKMTCDENGVDEGFPYARPAVCELSGDIRVSPKDKTMYLVNPSGAAAGFDENGEKRL 226

Query: 61  QPYPRKSDKSAMSKVKELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCF 118
           +PY RK D+  +  V E+T+ +  SA     C   H  PA+VFS  GY  NFFH+  D  
Sbjct: 227 RPYARK-DEFLLPAVVEVTVKSVPSASGAPRCTKRHRVPAVVFSVAGYTDNFFHDNTDAL 285

Query: 119 VPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQT--ITHCFQSV 176
           +PLF+T  +H    +V L I +    W +KY  +L + + + +IN +++     HCF   
Sbjct: 286 IPLFLT-TAHL-KGEVQLLITNYKPWWVQKYTPVLRKLSNYDVINFDDEDGGAVHCFPDG 343

Query: 177 TLGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVN 231
            LGL     ++I+P  T  P+  T+VD+  FL  A     E           +P++++++
Sbjct: 344 YLGLYRDRDLIISPHPTRNPRNYTMVDYNKFLRGALELPREKPAVLGEEPGMRPRMLIIS 403

Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
           R+    R +LNL EV  AA ELGF+VT+ E      +      ++S   ++ VHGAGLT+
Sbjct: 404 RSGT--RRLLNLDEVSAAASELGFNVTVAEAGGEADVPAFAAMVNSADVLLAVHGAGLTN 461

Query: 292 SLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
            +FL   +V++Q+VP G   W++T ++ +PAR + L Y+EY + +EE+SL +KY  + +V
Sbjct: 462 QIFLPTNAVVLQIVPWGNMDWMATNFYGQPAREMQLRYVEYYVGEEETSLKDKYPRDHVV 521

Query: 351 LKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
            ++P+A     W  +   +  Q+V++D+ RFR +L  A  K ++
Sbjct: 522 FRDPKALHTQGWETLADTIMKQDVQVDLSRFRPFLLQAIDKLQE 565


>gi|308080932|ref|NP_001183369.1| uncharacterized protein LOC100501781 [Zea mays]
 gi|238011052|gb|ACR36561.1| unknown [Zea mays]
 gi|413934851|gb|AFW69402.1| hypothetical protein ZEAMMB73_940636 [Zea mays]
          Length = 488

 Score =  204 bits (518), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 137/424 (32%), Positives = 211/424 (49%), Gaps = 52/424 (12%)

Query: 10  PAKSITCDRSHRSYDLCLINGSALFDPKTSTFF----------SVGHTDSTPSQPSLRIK 59
           P   + CD +    D+C + G+      TS  F          +   T   P   S  + 
Sbjct: 71  PEPRVRCDFADPRSDVCELEGAIRVRGSTSEVFVLAPARADGLAANVTGLAPGTTSWTV- 129

Query: 60  TQPYPRKSDKSAMSKVKELTITT----SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFM 115
            QPY RK +   M  +  LT+      +APP   C V H  PA+V+S GGY GN++H+F 
Sbjct: 130 -QPYTRKGEARVMRGISTLTVRVVPAGAAPP---CTVRHDVPAVVYSNGGYCGNYYHDFN 185

Query: 116 DCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIINI--NNQTITHCF 173
           D  +PLF+T   H    +V L +A     W  KY E++   T +  +++  + +    CF
Sbjct: 186 DNIIPLFVTAR-HL-GGEVQLLVAQKQAWWFGKYREIVDGLTNYEAVDLGADAEGEVRCF 243

Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY-----------NENTNTSSSFHH 222
           ++ TLGL SH  + I+P   P+  ++VDF+ FL   Y            E          
Sbjct: 244 RAATLGLRSHKDLSIDPRRAPRNLSMVDFKRFLMWRYALPREHAIRTDEEEAGPGPGRAR 303

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFE--------------PEESTSL 268
           T+P+L++V R +R  R  +NL E+   AEE+GFDVT  +               E  + +
Sbjct: 304 TRPRLLVVARRSR--RRFVNLPEIVALAEEVGFDVTASDLMSGTASKSKSGAGDEGHSRM 361

Query: 269 ADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLE 327
           AD+ + ++S  AMV VHG+GLT+ +FL   +V++QVVP+G  + L+   +  P R + + 
Sbjct: 362 ADASKLVNSFDAMVAVHGSGLTNLVFLPMNAVVVQVVPLGRMEGLAMDEYGVPPRDMNMR 421

Query: 328 YLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLK 386
           YL+Y I  EES+L E Y     VL +P      +W+ ++ VYL  Q+V+LD+ RFR  L 
Sbjct: 422 YLQYNITAEESTLSEVYPRAHPVLMDPMPIHEQSWTLVKDVYLGKQDVRLDVRRFRPVLL 481

Query: 387 DAYK 390
            A +
Sbjct: 482 KAIQ 485


>gi|56409860|emb|CAI30078.1| glycosyltransferase [Sorghum bicolor]
          Length = 491

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 121/392 (30%), Positives = 207/392 (52%), Gaps = 27/392 (6%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           C  + R  D C   G       + T +     DS   +     KT+PY RK D  A++ V
Sbjct: 110 CYETSRRSDTCEAAGDVRVLGSSQTVY----VDSLDRE----WKTKPYCRKHDNFALAHV 161

Query: 76  KELTIT---TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
           KE ++    + A P   C V  ++ A V S GG+ GN FH++ D  +P FIT  +H    
Sbjct: 162 KEWSLKPLPSGAAPQ--CTVNSSATAFVLSTGGFTGNPFHDYTDVLIPAFIT--AHPLRG 217

Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
           +V   ++     W  +Y ++  + +RH +++I+      C+++V +G   H  + ++ + 
Sbjct: 218 EVQFLVSSYKSWWMNRYIQIFQQMSRHEVVDIDADDEVRCYRNVVVGPTFHRELGVDASK 277

Query: 193 LPKPK-TLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLVNRNARVGRTILNLREV 246
            P P  +  DF+  L +A+     T++          +P+L++++R    GR  +N R +
Sbjct: 278 SPSPGYSTADFRKMLRDAFGLERATATPSGDRWDIRRRPRLLIISRRPSRGRAFMNERAM 337

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
              A  LGFDV I EP+ ST  +   R ++S   MVGVHGAGLT+ +FL  G+V++QVVP
Sbjct: 338 ADMAASLGFDVRIGEPDSSTDTSKFARLVNSADVMVGVHGAGLTNMVFLPAGAVVVQVVP 397

Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
            G  +WL+   F +P+  + + YLEY ++ +E++L E+Y ++  VL++P A     W+ +
Sbjct: 398 YGRLEWLARNTFAEPSAGMEVHYLEYVVQLDETTLSEQYPSDHPVLRDPMAIHKQGWNAL 457

Query: 366 R-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
           +  YL  QNV+  + R    LK+ + +A K++
Sbjct: 458 KTTYLDKQNVRPHLGR----LKNTFLQALKML 485


>gi|383100768|emb|CCG47999.1| glycosyltransferase, HGA-like,putative,expressed [Triticum
           aestivum]
          Length = 450

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 130/388 (33%), Positives = 204/388 (52%), Gaps = 27/388 (6%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S    D+C + G     P   T  ++ H   + S+ S R+K  P+ RK+D  A++ V
Sbjct: 73  CDMSSLRSDVCELKGDVRVIPSNITIIALLHPSVSESRRSWRMK--PHARKNDGHALASV 130

Query: 76  KEL--TITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            E+  ++T S+P    C     +PA+VFS G Y GN FH+F D  +PLFIT  S F   D
Sbjct: 131 TEVLVSVTPSSPHVPECKAESAAPAVVFSVGAYAGNMFHDFTDVLIPLFIT-ASRF-RSD 188

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTI-THCFQSVTLGLISHGRMVINPT- 191
           V L ++D    W  KY  LL   + H +I+++ Q+    C+  V +GL  H  M I+   
Sbjct: 189 VHLLVSDAPPWWLDKYRPLLRGLSHHAVIDMDRQSAEVLCYPHVVVGLSFHKEMSIDTAK 248

Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSSSFHHT--------KPKLVLVNRNARVGRTILNL 243
            +    ++ DF      +Y    +T+    H         +P+L++++R  +  R   N+
Sbjct: 249 TVGGHYSMADFARLARRSYGLERDTAIRLLHGSDNIKSPRRPRLLIISR--KTTRAFTNM 306

Query: 244 REVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
             V +AA  LG++V + E E+ + L    R ++SC  +VGVHGAGLT+ +FL PG+V++Q
Sbjct: 307 GAVAQAAAMLGYEVIVGEAEQHSDLPAFARLVNSCDVLVGVHGAGLTNLVFLPPGAVVVQ 366

Query: 304 VVPI-GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW 362
           VVP+ G + ++   F +PA  +GL Y++Y I   ES+L E Y  +  VL+        + 
Sbjct: 367 VVPLGGLEAMARDDFGEPAGDMGLGYVQYGISVGESTLAELYPRDRRVLR--------DL 418

Query: 363 SNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
           +    YL +QNV LD+ RF   L  A +
Sbjct: 419 ALRSEYLVSQNVTLDVARFSGALSRALE 446


>gi|326508758|dbj|BAJ95901.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516460|dbj|BAJ92385.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326523597|dbj|BAJ92969.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 462

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 121/341 (35%), Positives = 186/341 (54%), Gaps = 16/341 (4%)

Query: 59  KTQPYPRKSDKSAMSKVKELTI------TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFH 112
           +T+PY R+ D  AM  V+E  +        SA P L C   H+ PA +FS+GG+ GN +H
Sbjct: 117 RTKPYARRHDAVAMDDVREFALLPFGGGNDSAVPPL-CTRNHSVPAFLFSSGGFAGNLYH 175

Query: 113 EFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHC 172
           ++ D  VPLF T   HF   +V   + D  D W  K+  L  + + + +I+++N    HC
Sbjct: 176 DYTDVLVPLF-TSTHHF-GGEVQFLLTDIKDWWLDKFTPLFRQLSNYDVIDVDNDQEVHC 233

Query: 173 FQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVL 229
           F  + +G   H  M I+ T  P  +T+ DF+  L  A+  +   +S   S    KP+L++
Sbjct: 234 FPRIVIGSTFHRPMGIDGTRSPGGETVADFKRLLRRAFRLDRVVASHDGSASLGKPRLLI 293

Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGL 289
           ++R +   R  LN R +  AA    FDV I EP+  T + +  R ++S   M+GVHGAGL
Sbjct: 294 ISRKS--SRRFLNERAMAHAAALAQFDVRIAEPDNHTDMPNFARLVNSADVMMGVHGAGL 351

Query: 290 THSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
           T+ +FL   +VL+QVVP G  +WLS V F+ PA+   + Y+EY +  EESSL   Y  + 
Sbjct: 352 TNMVFLPSRAVLLQVVPFGGLEWLSRVTFKDPAKDFDVTYMEYNVSLEESSLKNLYPKDH 411

Query: 349 LVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
             L++P       W+ ++ VYL  Q+V+LD+ +    L+ A
Sbjct: 412 FYLQHPYDVHKKGWNAIKTVYLDKQSVRLDLAKLTRTLEHA 452


>gi|357153314|ref|XP_003576411.1| PREDICTED: uncharacterized protein LOC100832680 [Brachypodium
           distachyon]
          Length = 525

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 127/413 (30%), Positives = 205/413 (49%), Gaps = 34/413 (8%)

Query: 6   AALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPR 65
           AA +   +++C+ S    ++C + G      K ST + V  +D  P   ++ I+  PYPR
Sbjct: 114 AAPAMRSNLSCNFSSVRMNICGMEGDVRIQGKASTVYVVSVSDYRPENSTVIIR--PYPR 171

Query: 66  KSDKSAMSKVKELTITTSAPPNL------------------SCGVTHTSPALVFSAGGYN 107
           K +   M  V+E+T+ ++APP                     C VTH  PA+VFS GGY+
Sbjct: 172 KWEIPTMELVREITVRSTAPPREPGSSATNAMEDDTAPPAPRCTVTHDVPAVVFSTGGYS 231

Query: 108 GNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQ 167
            NFFH   D  VPL+ T   +  +  V L   + + +W  K+  +L   + +P+I+++  
Sbjct: 232 MNFFHAMTDVVVPLYNTARRY--EGRVQLLATNYDRKWIAKFRHVLGALSSYPVIDLDAD 289

Query: 168 TITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---- 223
               CF S  +G+ SH  + I+P L     T++DF+ FL +AY+   + ++         
Sbjct: 290 AAVRCFPSAHVGIESHMELSIDPALSFHGNTMMDFRDFLRSAYSLKRSWTTPVSRNNSSS 349

Query: 224 ----KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCH 279
               KP+LV++ R  R  R + N  +   AA E GF+V    PE    +      ++SC 
Sbjct: 350 SSSRKPRLVILLR--RHSRAMTNEGDAMAAATEAGFEVVPAGPEVFGDMGKFAEVVNSCD 407

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPI-GTQWLSTVYFEKPARVLGLEYLEYKIKQEES 338
            MVGVHGAGLT+ +FL     ++Q++P  G +W       +P   +GL Y+EY+   EE+
Sbjct: 408 VMVGVHGAGLTNMVFLPHNGTVVQIIPWGGMKWPCFHALGRPVPDMGLRYVEYEASAEET 467

Query: 339 SLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYK 390
           +L + Y  +  V  +P +     +  M   +L  QNV LDI RFR  ++  Y+
Sbjct: 468 TLKDVYPRDHAVFTDPLSIHRQGYGMMWATFLDGQNVTLDIGRFRGVMEQLYQ 520


>gi|115445877|ref|NP_001046718.1| Os02g0329800 [Oryza sativa Japonica Group]
 gi|46389948|dbj|BAD15800.1| putative HGA1 [Oryza sativa Japonica Group]
 gi|113536249|dbj|BAF08632.1| Os02g0329800 [Oryza sativa Japonica Group]
 gi|125539253|gb|EAY85648.1| hypothetical protein OsI_07021 [Oryza sativa Indica Group]
 gi|125581910|gb|EAZ22841.1| hypothetical protein OsJ_06519 [Oryza sativa Japonica Group]
          Length = 566

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 124/383 (32%), Positives = 202/383 (52%), Gaps = 25/383 (6%)

Query: 25  LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTI---- 80
           +C + G     PK  T + V  + +     +   + +PY RK D   +  V E+TI    
Sbjct: 196 VCELYGDVRVSPKQKTIYVVNPSGAGGFDENGEKRLRPYARKDD-FLLPGVVEVTIKSVP 254

Query: 81  TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIAD 140
           + +A P   C   H  PA+VFS  GY  NFFH+  D  +PLF+T  +H   +  IL I +
Sbjct: 255 SEAAAPK--CTKQHAVPAVVFSVAGYTDNFFHDMTDAMIPLFLT-TAHLKGEVQIL-ITN 310

Query: 141 CNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP--TLLPKPKT 198
               W +KY  LL + + + +IN +     HCF    LGL     ++I+P  T  P+  T
Sbjct: 311 YKPWWVQKYTPLLRKLSNYDVINFDEDAGVHCFPQGYLGLYRDRDLIISPHPTRNPRNYT 370

Query: 199 LVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAEELGF 255
           +VD+  FL +A     +  S        +P++++++R     R +LNL EV  AA ELGF
Sbjct: 371 MVDYNRFLRDALELRRDRPSVLGEEPGMRPRMLIISRAGT--RKLLNLEEVAAAATELGF 428

Query: 256 DVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QW 311
           +VT+ E     + AD   F   ++S   ++ VHGAGLT+ +FL   +V++Q+VP G   W
Sbjct: 429 NVTVAE-----AGADVPAFAALVNSADVLLAVHGAGLTNQIFLPAEAVVVQIVPWGNMDW 483

Query: 312 LSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKT 371
           ++T ++ +PAR + L Y+EY + +EE+SL   Y  + +V K+P+A     W  +   +  
Sbjct: 484 MATNFYGQPARDMQLRYVEYYVGEEETSLKHNYSRDHMVFKDPKALHAQGWQTLAATIMK 543

Query: 372 QNVKLDIDRFRIYLKDAYKKAKK 394
           Q+V++++ RFR  L  A  + ++
Sbjct: 544 QDVEVNLTRFRPILLQALDRLQQ 566


>gi|242052293|ref|XP_002455292.1| hypothetical protein SORBIDRAFT_03g007920 [Sorghum bicolor]
 gi|241927267|gb|EES00412.1| hypothetical protein SORBIDRAFT_03g007920 [Sorghum bicolor]
          Length = 468

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 123/392 (31%), Positives = 204/392 (52%), Gaps = 15/392 (3%)

Query: 11  AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
           +  + C    R  D C ++G    + K  +   V    S+  +   +I+  PY R+S  +
Sbjct: 84  SNKVVCSTEERFSDYCELDGDVRINGKAWSVDIVPPGWSSEQRREWKIR--PYSRRSASN 141

Query: 71  AMS-KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
             +  V +L    SAP   +C VTH +P +VF+ GGY+GN FH+  D  +PL++T +  +
Sbjct: 142 VDTLNVTQLQDPASAP---ACTVTHHAPGVVFALGGYSGNAFHDHADVLLPLYLT-SLRY 197

Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
             +  +L I      W  KY   L R +++ ++N++      CF  +T+GL  H    + 
Sbjct: 198 DGEVQLLVINRVQPWWLGKYRLALRRMSKYDVVNLDGDAHVRCFPHLTVGLRLHMDFGVV 257

Query: 190 PTLLP----KPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE 245
           P ++P    +  ++ DF  FL  AY             +P+L+L+ R     R  LN  E
Sbjct: 258 PEMVPGQGHRRVSMPDFTRFLREAYALPRGAPVKPGKNRPRLMLIQRQRT--RRFLNEAE 315

Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           + +AAE  GF+V + +     ++ +  R ++S   MVG+HGAG+T+ +FL PG VL+QVV
Sbjct: 316 MVRAAEAAGFEVAVTDLLIDAAVDEQARVVNSFDVMVGIHGAGMTNEVFLPPGGVLVQVV 375

Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
           P G    ++ V + +PA  +GL+YL Y +  EESSL E  G +   +K+P +     W+ 
Sbjct: 376 PWGKLDLMARVEYGEPAADMGLKYLCYNVTLEESSLPELLGRDHPAIKDPDSIHRKGWAA 435

Query: 365 M-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
           M  +Y+  Q+V+LDI+RF + L +A    + L
Sbjct: 436 MFDIYMTKQDVRLDIERFALTLAEAMDHLRSL 467


>gi|115469852|ref|NP_001058525.1| Os06g0707200 [Oryza sativa Japonica Group]
 gi|53792589|dbj|BAD53604.1| putative HGA1 [Oryza sativa Japonica Group]
 gi|53792782|dbj|BAD53817.1| putative HGA1 [Oryza sativa Japonica Group]
 gi|113596565|dbj|BAF20439.1| Os06g0707200 [Oryza sativa Japonica Group]
 gi|125598448|gb|EAZ38228.1| hypothetical protein OsJ_22603 [Oryza sativa Japonica Group]
 gi|215765287|dbj|BAG86984.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 460

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 117/342 (34%), Positives = 186/342 (54%), Gaps = 18/342 (5%)

Query: 59  KTQPYPRKSDKSAMSKVKELTIT-------TSAPPNLSCGVTHTSPALVFSAGGYNGNFF 111
           +T+PY R  D  AM  V+E T+        T+ PP   C   H+ PA +FS+GG+ GN +
Sbjct: 118 RTKPYARLHDAVAMDDVREFTLVPFGGANHTAVPP--LCTRNHSVPAFLFSSGGFAGNLY 175

Query: 112 HEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITH 171
           H++ D  VPLF + N HF  + V   ++   D W  K+  L  + +R+ +I+++N    H
Sbjct: 176 HDYTDVLVPLFTSTN-HFGGE-VQFLLSGIKDWWLDKFTPLFRQLSRYDVIDVDNDQEVH 233

Query: 172 CFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLV 228
           CF  + +G   H  M I+P   P   T+ DF+  L   +      +S   +    KP+L+
Sbjct: 234 CFPRIFIGATFHRAMGIDPARSPGGVTVADFKRLLRRTFRLERAVASRTGAPRRDKPRLL 293

Query: 229 LVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAG 288
           +++R +   R  LN R +  AA    FDV I EP+  T + +  R ++S   M+GVHGAG
Sbjct: 294 IISRKS--SRRFLNERAMAHAAALARFDVRIAEPDNHTDMPNFARLVNSADVMMGVHGAG 351

Query: 289 LTHSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
           LT+ +FL   +VL+QVVP G  +WL+ V F+ PA+ + + Y+EY +  +ESSL E Y  +
Sbjct: 352 LTNMVFLPSRAVLIQVVPFGGLEWLTRVTFKDPAKDMDVNYMEYNVSFDESSLRELYPRD 411

Query: 348 DLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
              +++P       W  ++ VYL  QNV+L++ +    L+ A
Sbjct: 412 HFYIQHPYDVHKKGWDAIKTVYLDKQNVELNLTKLTNTLERA 453


>gi|326488805|dbj|BAJ98014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 462

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 121/341 (35%), Positives = 186/341 (54%), Gaps = 16/341 (4%)

Query: 59  KTQPYPRKSDKSAMSKVKELTI------TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFH 112
           +T+PY R+ D  AM  V+E  +        SA P L C   H+ PA +FS+GG+ GN +H
Sbjct: 117 RTKPYARRHDAVAMDDVREFALLPFGGGNDSAVPPL-CTRNHSVPAFLFSSGGFAGNLYH 175

Query: 113 EFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHC 172
           ++ D  VPLF T   HF   +V   + D  D W  K+  L  + + + +I+++N    HC
Sbjct: 176 DYTDVLVPLF-TSTHHF-GGEVQFLLTDIKDWWLDKFTPLFRQLSNYDVIDVDNDQEVHC 233

Query: 173 FQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVL 229
           F  + +G   H  M I+ T  P  +T+ DF+  L  A+  +   +S   S    KP+L++
Sbjct: 234 FPRIVIGSTFHRPMGIDGTRSPGGETVADFKRLLRRAFRLDRVVASHDGSASLGKPRLLI 293

Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGL 289
           ++R +   R  LN R +  AA    FDV I EP+  T + +  R ++S   M+GVHGAGL
Sbjct: 294 ISRKS--SRRFLNERAMAHAAALAQFDVRIAEPDNHTDMPNFARLVNSADVMMGVHGAGL 351

Query: 290 THSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
           T+ +FL   +VL+QVVP G  +WLS V F+ PA+   + Y+EY +  EESSL   Y  + 
Sbjct: 352 TNMVFLPSRAVLLQVVPFGGLEWLSRVTFKDPAKDFYVTYMEYNVSLEESSLKNLYPKDH 411

Query: 349 LVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
             L++P       W+ ++ VYL  Q+V+LD+ +    L+ A
Sbjct: 412 FYLQHPYDVHKKGWNAIKTVYLDKQSVRLDLAKLTRTLEHA 452


>gi|242096984|ref|XP_002438982.1| hypothetical protein SORBIDRAFT_10g029360 [Sorghum bicolor]
 gi|241917205|gb|EER90349.1| hypothetical protein SORBIDRAFT_10g029360 [Sorghum bicolor]
          Length = 496

 Score =  201 bits (512), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 133/422 (31%), Positives = 206/422 (48%), Gaps = 45/422 (10%)

Query: 10  PAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGH-----------TDSTPSQPSLRI 58
           P   + CD +    D+C + G+      TS  F V             T   P   +   
Sbjct: 76  PESRVRCDFADPRSDVCELEGAIRIRGSTSEVFVVAGAGADGLLAANVTGLAPGMNATSW 135

Query: 59  KTQPYPRKSDKSAMSKVKELTITTSAPPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDC 117
             QPY RK +   M  +  LT+   +P +   C V H  PA+V+S GGY GN++H+F D 
Sbjct: 136 TIQPYTRKGEVRVMRGITTLTVRVVSPGDAPPCTVRHDVPAVVYSNGGYCGNYYHDFNDN 195

Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
            +PLF+T   H    +V L +A     W  KY E++   T +  ++++      CF+  T
Sbjct: 196 IIPLFVT-TRHL-GGEVQLLVAQKQAWWFHKYREIVDGLTNYDAVDLDGAGEVRCFRKAT 253

Query: 178 LGLISHGRMVINPTLLPKPKTLVDFQSFLANAY----------NENTNTSSSFHHTKPKL 227
           LGL S   + I+P   P+  ++VDF+ FL   Y          +E     +   H +P+L
Sbjct: 254 LGLRSLKDLSIDPRRAPRNLSMVDFKRFLMWRYALPREHAIRTDEEEAAGAGGGHRRPRL 313

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFE-----------------PEESTSLAD 270
           +LV R +R  R  +N+ E+   AEE+GFDVT  +                  E  + +AD
Sbjct: 314 LLVTRRSR--RRFVNVPEIVALAEEVGFDVTTSDLMSASAKNNNKAGAGVGDEGHSRMAD 371

Query: 271 SFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYL 329
           + + ++S   MV VHG+GLT+ +FL   +V++QVVP+G  + L+   +  P R + + Y+
Sbjct: 372 ASKLVNSFDVMVAVHGSGLTNLVFLPMNAVVVQVVPLGRMESLAMDEYGVPPRDMNMRYI 431

Query: 330 EYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
           +Y I  EES+L E Y     VL +P      +WS +  VYL  Q+V+LD+ RFR  L  A
Sbjct: 432 QYNITAEESTLSEVYPRAHPVLLDPMPIHEQSWSLVNDVYLGKQDVRLDVRRFRPVLLKA 491

Query: 389 YK 390
            +
Sbjct: 492 IQ 493


>gi|242052299|ref|XP_002455295.1| hypothetical protein SORBIDRAFT_03g007960 [Sorghum bicolor]
 gi|56409858|emb|CAI30077.1| glycosyltransferase [Sorghum bicolor]
 gi|241927270|gb|EES00415.1| hypothetical protein SORBIDRAFT_03g007960 [Sorghum bicolor]
          Length = 513

 Score =  201 bits (512), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 122/386 (31%), Positives = 207/386 (53%), Gaps = 25/386 (6%)

Query: 16  CDRSHRSYDLCLINGSALFDPKT-STFFSVGHTDSTPSQPSLR-IKTQPYPRKSDKSAMS 73
           C+      + C ++G    D +T  +  SV    ++PS    R  K +PY R++    MS
Sbjct: 127 CNTDGPVSETCELDG----DVRTNGSARSVTFVPASPSSSERREWKVRPYSRRT----MS 178

Query: 74  KVKELTIT------TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
            V  +T+T       ++PP  +C VTH  P ++F+ GG  GN++H+F D  VPLF     
Sbjct: 179 GVDRVTVTQLESPAAASPPPAACAVTHDVPGVLFALGGLTGNYWHDFSDVLVPLFAASRR 238

Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
           +    +V+  +++    W  KY  ++   +R+  ++++      CF+ +T+GL  H  + 
Sbjct: 239 Y--KGEVLFLVSNIQPWWLGKYEAVVRALSRYDAVDLDRDARVRCFRHLTVGLRLHKELT 296

Query: 188 INPTLLPKPKTLVDFQSFLANAY---NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLR 244
           I P L P   T+ DF +FL   Y          ++    KP+L+L++R     R  +N+ 
Sbjct: 297 IVPDLAPDRLTMADFTAFLRETYALPRGAPAIPTTEEGRKPRLLLIHRAHY--RRFVNVP 354

Query: 245 EVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
           E+++AAE  GF+VT+  P     + ++ R ++S   ++GVHGAGLT+++FL  G V++QV
Sbjct: 355 EIRRAAESAGFEVTVASPRGDAPVEETARTVNSHDVLLGVHGAGLTNAVFLPAGGVVIQV 414

Query: 305 VPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS 363
           VP G  + ++   F +P   +GL YLEY +  EES+L+E  G    V+K+P+A   + W 
Sbjct: 415 VPYGRLERMARTDFGEPVADMGLRYLEYGVAAEESTLLEMLGPEHPVIKDPEAIHRSGWD 474

Query: 364 NM-RVYLKTQNVKLDIDRFRIYLKDA 388
            +   YL  Q+V++D++RF   L  A
Sbjct: 475 KVAEYYLGKQDVRIDVNRFAPTLAQA 500


>gi|242034639|ref|XP_002464714.1| hypothetical protein SORBIDRAFT_01g025290 [Sorghum bicolor]
 gi|241918568|gb|EER91712.1| hypothetical protein SORBIDRAFT_01g025290 [Sorghum bicolor]
          Length = 466

 Score =  201 bits (511), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 123/391 (31%), Positives = 201/391 (51%), Gaps = 21/391 (5%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           +TC+ S+   + C + G      K++T + V  +   P   +  IK +PY RK +   M 
Sbjct: 78  LTCNFSNHQSNTCSMEGDLRIHGKSATVYVVSASTYRPENAT--IKLRPYARKWEDQVML 135

Query: 74  KVKELTITTSAPPN----------LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
            V+E+T+ +S P              C V H  PA+VFS GGYN NFFH   D  +PL++
Sbjct: 136 LVREVTMRSSPPAGSAAADDPPPPPQCSVRHDVPAVVFSTGGYNRNFFHVMTDVIIPLYL 195

Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININN--QTITHCFQSVTLGLI 181
           T   +  +  V L   D   +W  KY  +L   + +P+I++++  +    CF S  +GL 
Sbjct: 196 TAREY--NGHVQLLATDYEPKWIAKYKAILAALSSYPVIDLDSEPEDTVRCFPSAHVGLE 253

Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTIL 241
           SH  + I P L  K  T+V F+ F+ +AY+      S+    KP+LV++ R  R  R + 
Sbjct: 254 SHKELGIVPGLSHKGYTMVSFRDFIRSAYSLQRPRVSA-GRKKPRLVMILR--RNSRQLK 310

Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
           N  +   AA  +GF+V    P++ + L      ++SC  ++GVHGAGL + LFL   + +
Sbjct: 311 NEDDAIAAAANVGFEVVAAGPDDVSDLERFPGVVNSCDVLMGVHGAGLANMLFLPHNATV 370

Query: 302 MQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGA 360
           +Q++P G  +W     +  P   +GL YLEY+   EE+SL + Y  +  V  +P +    
Sbjct: 371 VQIIPWGELKWACRHSYGDPVPDMGLRYLEYEATAEETSLKDSYPRDHAVFTDPLSIHRQ 430

Query: 361 NWSNM-RVYLKTQNVKLDIDRFRIYLKDAYK 390
            +  M  +++  Q+V +DIDRF  ++K  Y+
Sbjct: 431 GFDKMWNIFINGQHVIVDIDRFTGFMKQLYQ 461


>gi|326497271|dbj|BAK02220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 480

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 111/335 (33%), Positives = 183/335 (54%), Gaps = 7/335 (2%)

Query: 62  PYPRKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
           PY R         V +L    +APP   C VTHT PA++F  GGY GN++H++ D  VPL
Sbjct: 150 PYTRSGQSLRAVTVTQLQDRAAAPP---CTVTHTMPAILFGIGGYVGNYWHDYADILVPL 206

Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
           F+    +  +   +++      +W  KY  LL   +++ ++++++     CF  VT+GL 
Sbjct: 207 FVASRQYHGEVTFLVSNIKHLPRWLVKYKTLLQGLSKYGVVDMDHDAYVRCFPRVTVGLR 266

Query: 182 SHGRMVINPTLLPKPK-TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTI 240
                 I P L+P  + T+ DF  F+   Y           + KP+L+L++R     R  
Sbjct: 267 LDKDFSIVPELVPGGRLTMADFTQFVRETYALPRGAVIREPYKKPRLLLIHRGTF--RRF 324

Query: 241 LNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSV 300
           LN  E+ +AAE +GF+  + E   + S  +  R ++S   ++GVHGAGLT+++ L PG V
Sbjct: 325 LNEPEIVQAAEAVGFEAVVTELRLNGSEVEQARVVNSFDVVLGVHGAGLTNAVHLPPGGV 384

Query: 301 LMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
           L+QVVP G  + ++ + F +PA  +GL+YL+Y +  EESSL+EK G +   +K+P +   
Sbjct: 385 LIQVVPFGKIEVMARLDFSEPATDMGLKYLDYSVSAEESSLLEKLGPDHPAIKDPDSIHR 444

Query: 360 ANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
           + W+ M  +   QNV+++  RF   L+ A+   +K
Sbjct: 445 SGWTTMYDFYLMQNVRINTTRFAPTLEQAFNHLRK 479


>gi|218198859|gb|EEC81286.1| hypothetical protein OsI_24404 [Oryza sativa Indica Group]
          Length = 460

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 116/340 (34%), Positives = 184/340 (54%), Gaps = 14/340 (4%)

Query: 59  KTQPYPRKSDKSAMSKVKELTITTSAPPNLS-----CGVTHTSPALVFSAGGYNGNFFHE 113
           +T+PY R  D  AM  V+E T+      N S     C   H+ PA +FS+GG+ GN +H+
Sbjct: 118 RTKPYARLHDAVAMDDVREFTLVPFGGANHSAVPPLCTRNHSVPAFLFSSGGFAGNLYHD 177

Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
           + D  VPLF + N HF  + V   ++   D W  K+  L  + +R+ +I+++N    HCF
Sbjct: 178 YTDVLVPLFTSTN-HFGGE-VQFLLSGIKDWWLDKFTPLFRQLSRYDVIDVDNDQEVHCF 235

Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVLV 230
             + +G   H  M I+P   P   T+ DF+  L   +      +S   +    KP+L+++
Sbjct: 236 PRIFIGATFHRAMGIDPARSPGGVTVADFKRLLRRTFRLERAVASRTGAPRRDKPRLLII 295

Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
           +R +   R  LN R +  AA    FDV I EP+  T + +  R ++S   M+GVHGAGLT
Sbjct: 296 SRKS--SRRFLNERAMAHAAALARFDVRIAEPDNHTDMPNFARLVNSADVMMGVHGAGLT 353

Query: 291 HSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDL 349
           + +FL   +VL+QVVP G  +WL+ V F+ PA+ + + Y+EY +  +ESSL E Y  +  
Sbjct: 354 NMVFLPSRAVLIQVVPFGGLEWLTRVTFKDPAKDMDVNYMEYNVSFDESSLRELYPRDHF 413

Query: 350 VLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
            +++P       W  ++ VYL  QNV+L++ +    L+ A
Sbjct: 414 YIQHPYDVHKKGWDAIKTVYLDKQNVELNLTKLTNTLERA 453


>gi|125556695|gb|EAZ02301.1| hypothetical protein OsI_24402 [Oryza sativa Indica Group]
          Length = 465

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 134/402 (33%), Positives = 208/402 (51%), Gaps = 33/402 (8%)

Query: 10  PAKSITCD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQP------------SL 56
           P   + CD R +RS D+C + G+     +TS  F V  + ++ S              + 
Sbjct: 69  PEPHLRCDFRDNRS-DVCEMEGAIRILGRTSEVFLVAPSLASISGGGGGVNATGVDANAT 127

Query: 57  RIKTQPYPRKSDKSAMSKVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFH 112
           R K QPY RK +   M  + E+T+       APP   C   H  PA+V+S GGY GN++H
Sbjct: 128 RWKIQPYTRKGESRVMPGITEVTVRLVTADEAPP---CDEWHDVPAIVYSNGGYCGNYYH 184

Query: 113 EFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHC 172
           +F D  +PLFIT  S     +V L +      W  KY E++   T++  ++++ +    C
Sbjct: 185 DFNDNIIPLFIT--SRHLAGEVQLLVTQKQRWWFGKYREIVEGLTKYEPVDLDAEQRVRC 242

Query: 173 FQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY----NENTNTSSSFHHTKPKLV 228
           ++  T+GL SH  + I+P   P   ++VDF+ FL   Y              +  KP+L+
Sbjct: 243 YRRATVGLHSHKDLSIDPRRAPNNYSMVDFKRFLMWRYALPREHAIRMEEEDNSKKPRLL 302

Query: 229 LVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAG 288
           ++NR +R  R  +NL E+  AAE +GF+V   E +     A S   ++S  AMV VHG+G
Sbjct: 303 VINRRSR--RRFVNLDEIVAAAEGVGFEVAAAELDAHIPAAAS--AVNSYDAMVAVHGSG 358

Query: 289 LTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
           LT+ +FL   +V++QVVP+G  + L+   +  P R + + YL+Y I  EES+L E Y   
Sbjct: 359 LTNLVFLPMNAVVIQVVPLGRMEGLAMDEYGVPPRDMNMRYLQYNITAEESTLSEVYPRA 418

Query: 348 DLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
             V  +P      +WS ++ +YL  Q+V+LD+ RFR  L  A
Sbjct: 419 HPVFLDPLPIHKQSWSLVKDIYLGQQDVRLDVRRFRPVLLKA 460


>gi|413942817|gb|AFW75466.1| hypothetical protein ZEAMMB73_930495 [Zea mays]
          Length = 496

 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 128/397 (32%), Positives = 196/397 (49%), Gaps = 29/397 (7%)

Query: 11  AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
           A+   C  S R  D C   G      +  T   VG  D          K +PY RK D  
Sbjct: 102 ARRPVCYESSRRSDTCEAAGDVRVVGRAQTVL-VGALDRE-------WKVKPYCRKHDAF 153

Query: 71  AMSKVKELTI-----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
           A+S VKE T+      +       C V  ++ A V S GG+ GN FH++ D  +P FIT 
Sbjct: 154 ALSHVKEWTLRPLVGGSDGDEAPRCTVNSSATAFVLSTGGFTGNLFHDYTDVLIPAFITA 213

Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR 185
             +    DV L ++     WA KY  +L + +RH +++ +      C+  V +G   H  
Sbjct: 214 RRY--GGDVQLLVSSHKPWWAAKYMPVLQQLSRHELVDADADGEVRCYPRVVVGPTFHRE 271

Query: 186 MVIN-----PTLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNAR 235
           + +      P    +  ++ +F++ L  A   E    + S        +P+L++++R  R
Sbjct: 272 LGVGAETKAPGGEEEGVSMPEFRAMLRRALGLERAAAAPSGDRWDVRRRPRLLIISR--R 329

Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
             R +LN R +   A  LGFDV + +PE ST +A   R ++S   MVGVHG GLT+ +FL
Sbjct: 330 QSRRLLNERAMADMATSLGFDVRVGDPEASTDVARFARLVNSADVMVGVHGDGLTNMVFL 389

Query: 296 RPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
             G+VL+QVVP G  +WL+   F  PA  + + YLEY ++  E++L E+YG +D V+++P
Sbjct: 390 PAGAVLVQVVPYGGLEWLARGMFRDPAAGMQVHYLEYVVRAGETTLSEEYGEDDPVIRDP 449

Query: 355 QAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYK 390
            A     W  ++ VYL  Q+V+  + R R  L  A K
Sbjct: 450 AAVHRKGWDALKAVYLDKQDVRPHLGRLRNTLLQALK 486


>gi|115469848|ref|NP_001058523.1| Os06g0707000 [Oryza sativa Japonica Group]
 gi|113596563|dbj|BAF20437.1| Os06g0707000 [Oryza sativa Japonica Group]
          Length = 446

 Score =  201 bits (510), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 134/402 (33%), Positives = 208/402 (51%), Gaps = 33/402 (8%)

Query: 10  PAKSITCD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQP------------SL 56
           P   + CD R +RS D+C + G+     +TS  F V  + ++ S              + 
Sbjct: 50  PEPHLRCDFRDNRS-DVCEMEGAIRILGRTSEVFLVAPSLASISGGGGGVNATGVDANAT 108

Query: 57  RIKTQPYPRKSDKSAMSKVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFH 112
           R K QPY RK +   M  + E+T+       APP   C   H  PA+V+S GGY GN++H
Sbjct: 109 RWKIQPYTRKGESRVMPGITEVTVRLVTADEAPP---CDEWHDVPAIVYSNGGYCGNYYH 165

Query: 113 EFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHC 172
           +F D  +PLFIT + H   + V L +      W  KY E++   T++  ++++ +    C
Sbjct: 166 DFNDNIIPLFIT-SRHLAGE-VQLLVTQKQRWWFGKYREIVEGLTKYEPVDLDAEQRVRC 223

Query: 173 FQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY----NENTNTSSSFHHTKPKLV 228
           ++  T+GL SH  + I+P   P   ++VDF+ FL   Y                 KP+L+
Sbjct: 224 YRRATVGLHSHKDLSIDPRRAPNNYSMVDFKRFLMWRYALPREHAIRMEEEDKSKKPRLL 283

Query: 229 LVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAG 288
           ++NR +R  R  +NL E+  AAE +GF+V   E +     A S   ++S  AMV VHG+G
Sbjct: 284 VINRRSR--RRFVNLDEIVAAAEGVGFEVAAAELDAHIPAAAS--AVNSYDAMVAVHGSG 339

Query: 289 LTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
           LT+ +FL   +V++QVVP+G  + L+   +  P R + + YL+Y I  EES+L E Y   
Sbjct: 340 LTNLVFLPMNAVVIQVVPLGRMEGLAMDEYGVPPRDMNMRYLQYNITAEESTLSEVYPRA 399

Query: 348 DLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
             V  +P      +WS ++ +YL  Q+V+LD+ RFR  L  A
Sbjct: 400 HPVFLDPLPIHKQSWSLVKDIYLGQQDVRLDVRRFRPVLLKA 441


>gi|300681542|emb|CBH32639.1| glycosyltransferase, HGA-like, putative,expressed [Triticum
           aestivum]
          Length = 524

 Score =  201 bits (510), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 116/323 (35%), Positives = 186/323 (57%), Gaps = 20/323 (6%)

Query: 75  VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
           V +L  T+ A P  +C VTH  PA+V + GG+ GN+FH+F D  VPLF+    +    +V
Sbjct: 192 VTQLDSTSVASPAPACTVTHRVPAIVLALGGHLGNYFHDFSDALVPLFVASRRY--GGEV 249

Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLP 194
            L   +    W  KY  ++ R T++ ++++++     CF+ VT+GL  H    I P L+P
Sbjct: 250 QLLAGNIQPWWLGKYEAVVRRLTKYEVLDLDHDDQIRCFRHVTVGLNMHKEFNIVPELVP 309

Query: 195 K--PKTLVDFQSFLANAYN-------ENTNTSSS-----FHHTKPKLVLVNRNARVGRTI 240
              P ++++F +FL   Y+         TN  SS       + KP+L+L++R     R +
Sbjct: 310 GGVPLSMLNFTAFLRETYSLPRAAPISLTNNKSSPPVDDNKNKKPRLMLLDRGHY--RKL 367

Query: 241 LNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSV 300
           +N+ E+ KAAE+ GF+VTI +P  +  + +    ++S   ++GVHGAGLT+S FL PG+V
Sbjct: 368 VNVPEIVKAAEKAGFEVTIADPRFNVRVKELALSVNSFDVLLGVHGAGLTNSAFLPPGAV 427

Query: 301 LMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
           ++QVVP G  + ++   F  PA  +GL YLEY I  EES+L+E  G +   +K+P +   
Sbjct: 428 VIQVVPYGKLEPMAQREFGDPAANMGLRYLEYSISVEESTLLETLGPDHPAIKDPDSVHR 487

Query: 360 ANWSNM-RVYLKTQNVKLDIDRF 381
           + W  +   YL  QNV++D++RF
Sbjct: 488 SGWDKVAEYYLGKQNVRVDVERF 510


>gi|53792588|dbj|BAD53603.1| putative HGA1 [Oryza sativa Japonica Group]
 gi|53792781|dbj|BAD53816.1| putative HGA1 [Oryza sativa Japonica Group]
 gi|125598446|gb|EAZ38226.1| hypothetical protein OsJ_22601 [Oryza sativa Japonica Group]
          Length = 465

 Score =  201 bits (510), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 134/402 (33%), Positives = 208/402 (51%), Gaps = 33/402 (8%)

Query: 10  PAKSITCD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQP------------SL 56
           P   + CD R +RS D+C + G+     +TS  F V  + ++ S              + 
Sbjct: 69  PEPHLRCDFRDNRS-DVCEMEGAIRILGRTSEVFLVAPSLASISGGGGGVNATGVDANAT 127

Query: 57  RIKTQPYPRKSDKSAMSKVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFH 112
           R K QPY RK +   M  + E+T+       APP   C   H  PA+V+S GGY GN++H
Sbjct: 128 RWKIQPYTRKGESRVMPGITEVTVRLVTADEAPP---CDEWHDVPAIVYSNGGYCGNYYH 184

Query: 113 EFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHC 172
           +F D  +PLFIT + H   + V L +      W  KY E++   T++  ++++ +    C
Sbjct: 185 DFNDNIIPLFIT-SRHLAGE-VQLLVTQKQRWWFGKYREIVEGLTKYEPVDLDAEQRVRC 242

Query: 173 FQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY----NENTNTSSSFHHTKPKLV 228
           ++  T+GL SH  + I+P   P   ++VDF+ FL   Y                 KP+L+
Sbjct: 243 YRRATVGLHSHKDLSIDPRRAPNNYSMVDFKRFLMWRYALPREHAIRMEEEDKSKKPRLL 302

Query: 229 LVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAG 288
           ++NR +R  R  +NL E+  AAE +GF+V   E +     A S   ++S  AMV VHG+G
Sbjct: 303 VINRRSR--RRFVNLDEIVAAAEGVGFEVAAAELDAHIPAAAS--AVNSYDAMVAVHGSG 358

Query: 289 LTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
           LT+ +FL   +V++QVVP+G  + L+   +  P R + + YL+Y I  EES+L E Y   
Sbjct: 359 LTNLVFLPMNAVVIQVVPLGRMEGLAMDEYGVPPRDMNMRYLQYNITAEESTLSEVYPRA 418

Query: 348 DLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
             V  +P      +WS ++ +YL  Q+V+LD+ RFR  L  A
Sbjct: 419 HPVFLDPLPIHKQSWSLVKDIYLGQQDVRLDVRRFRPVLLKA 460


>gi|383100764|emb|CCG47995.1| glycosyltransferase, HGA-like,putative,expressed [Triticum
           aestivum]
          Length = 518

 Score =  200 bits (509), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 114/321 (35%), Positives = 184/321 (57%), Gaps = 18/321 (5%)

Query: 75  VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
           V +L  T++  P  +C VTH  PA+V + GG+ GN+FH+F D  VPLF+    +    +V
Sbjct: 188 VTQLDSTSAGSPAPACTVTHRVPAIVLALGGHLGNYFHDFSDALVPLFVASRRY--GGEV 245

Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLP 194
            L   +    W  KY  ++ R T++ ++++++     CF+ VT+GL  H    I P L+P
Sbjct: 246 QLLAGNIQPWWLGKYEAVVRRLTKYEVLDLDHDDQIRCFRHVTVGLNMHKEFNIVPELVP 305

Query: 195 K--PKTLVDFQSFLANAYN-------ENTNTSS---SFHHTKPKLVLVNRNARVGRTILN 242
              P ++ +F +FL   Y+         TN SS        KP+L+L++R     R ++N
Sbjct: 306 GGVPLSMANFTAFLRETYSLPRAAPISLTNDSSPPVDAKKRKPRLMLLDRGHY--RKLVN 363

Query: 243 LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
           + E+ KAAE+ GF+VTI +P  +  + +    ++S   ++GVHGAGLT+S FL PG+V++
Sbjct: 364 VPEIVKAAEKAGFEVTIADPRFNVRVKELAMSVNSFDVLLGVHGAGLTNSAFLPPGAVVI 423

Query: 303 QVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGAN 361
           QVVP G  + ++   F  PA  +GL YLEY I  +ES+L+E  G +   +K+P +   + 
Sbjct: 424 QVVPYGKLEPMAQREFGDPAANMGLRYLEYSISVDESTLLETLGPDHPAIKDPDSVHRSG 483

Query: 362 WSNM-RVYLKTQNVKLDIDRF 381
           W  +   YL  QNV++D++RF
Sbjct: 484 WDKVAEYYLGKQNVRVDVERF 504


>gi|115434208|ref|NP_001041862.1| Os01g0118700 [Oryza sativa Japonica Group]
 gi|11034624|dbj|BAB17148.1| putative HGA6 [Oryza sativa Japonica Group]
 gi|14090224|dbj|BAB55485.1| putative HGA6 [Oryza sativa Japonica Group]
 gi|113531393|dbj|BAF03776.1| Os01g0118700 [Oryza sativa Japonica Group]
 gi|125524185|gb|EAY72299.1| hypothetical protein OsI_00154 [Oryza sativa Indica Group]
 gi|125568814|gb|EAZ10329.1| hypothetical protein OsJ_00164 [Oryza sativa Japonica Group]
 gi|215687353|dbj|BAG91918.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 518

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 119/390 (30%), Positives = 202/390 (51%), Gaps = 18/390 (4%)

Query: 9   SPAKSITCD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPYPRK 66
           +PAK   CD +   + D+C I+G                 +   S P+ R  +  PY RK
Sbjct: 130 APAKKPACDIQGPWASDVCSIDGDVRIHGAAHDVVIPPPIEGGGSNPNPREWRVVPYSRK 189

Query: 67  SDKSAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT 124
                M  +KE+ +    SA    +C V    PALVF+ GG  GN++H+F D  +PL++ 
Sbjct: 190 H----MGGLKEVAVREVASAAEAPACDVRSPVPALVFAMGGLTGNYWHDFSDVLIPLYLQ 245

Query: 125 INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHG 184
                 D +V L + +    +  KY  +L R +RH I++++      CF    +G+  H 
Sbjct: 246 ARRF--DGEVQLVVENIQMWYVGKYKRVLDRLSRHDIVDMDRDDKVRCFPGAVVGIRMHK 303

Query: 185 RMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSF----HHTKPKLVLVNRNARVGRTI 240
              I+P   P   ++ +F  FL + ++   +   S        +P+L++++R  R  R +
Sbjct: 304 EFSIDPARDPTGHSMPEFTKFLRDTFSLPRDAPVSLVDNAAAVRPRLMIISR--RHPRKL 361

Query: 241 LNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSV 300
           +N+ EV +AAE +GF+V I +P  +  + +  + ++    ++GVHGAGLT+S+FL  G+V
Sbjct: 362 MNVEEVVRAAERIGFEVVIGDPPFNVDVGEFAKEVNRADVLMGVHGAGLTNSVFLPTGAV 421

Query: 301 LMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
           L+QVVP G  + +  V F  PA  + L+Y+ Y    EES+LVE  G +   +++P++   
Sbjct: 422 LIQVVPYGKMEHIGKVDFGDPAEDMRLKYMAYSAGVEESTLVETLGRDHPAVRDPESVHR 481

Query: 360 ANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
           + W  +   YL  Q+++LD+ RF   L+DA
Sbjct: 482 SGWGKVAEYYLGKQDIRLDLARFEPLLRDA 511


>gi|383100757|emb|CCG47989.1| glycosyltransferase, HGA-like, expressed [Triticum aestivum]
          Length = 523

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 125/389 (32%), Positives = 207/389 (53%), Gaps = 20/389 (5%)

Query: 9   SPAKSITCD-RSHRSYDLCLI---NGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPY 63
           +PAK   CD +   + D+C +   +G       T T       +S  S P+ +  +  PY
Sbjct: 134 APAKKPACDIQGPWASDVCDLAGGSGGVRIHGSTHTVLVPPTIESGGSNPNPQEWRVLPY 193

Query: 64  PRKSDKSAMSKVKELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
            RK     MS +KE+T+    +A     C VT   PALVF+ GG  GN++H+F D  +PL
Sbjct: 194 SRKH----MSGIKEITVRELPTAADAPRCAVTSQVPALVFAMGGLTGNYWHDFSDVMIPL 249

Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
           ++   S F D +V L + +    +A KY  ++ R +R+ +++++      CF S  +G+ 
Sbjct: 250 YLQA-SRF-DGEVQLVVTNIQPWYAGKYRHIIARLSRYDVVDMDKDDQVRCFPSAVVGIR 307

Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAY---NENTNTSSSFHHTKPKLVLVNRNARVGR 238
            H    I+PT  P   ++ +F  FL N +          ++    KP++++++R  R  R
Sbjct: 308 MHKEFSIDPTKEPTGHSMPEFTKFLRNVFALPRAAPMRVTAGSDKKPRMMIISR--RHPR 365

Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
            ++N+ EV   A+ +GF+V I +P  +  + D  R +++   ++GVHGAGLT+SLFL  G
Sbjct: 366 KLVNVAEVVALAKRIGFEVVIGDPPFNVDVGDFAREVNAADVLMGVHGAGLTNSLFLPTG 425

Query: 299 SVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
           +V +QV P G  + +  V F  PA  +GL+Y+ Y    EES+LV+  G +   +K+P++ 
Sbjct: 426 AVFIQVNPFGKMEHIGEVDFGTPAVDMGLKYMAYSCGMEESTLVDTLGRDHPAVKDPESI 485

Query: 358 AGANWSNM-RVYLKTQNVKLDIDRFRIYL 385
             + WS +   YL  Q+VKLD+ RF   L
Sbjct: 486 HRSGWSKVAEYYLGKQDVKLDLQRFEPVL 514


>gi|383100754|emb|CCG47984.1| glycosyltransferase, HGA-like,putative,expressed [Triticum
           aestivum]
          Length = 618

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 124/382 (32%), Positives = 187/382 (48%), Gaps = 18/382 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S R  D+C   G    D   S+F  V    +      +R    PYPRK D++ M +V
Sbjct: 239 CDFSDRRSDVCDFTGDIRMDANASSFVVVVDAATAAQSHKVR----PYPRKGDQTCMGRV 294

Query: 76  KELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
            E+T+    ++S PP   C  TH+ PA+ FS GGY GN FH+F D  VPL+ T+  H   
Sbjct: 295 PEITVRTTSSSSTPPPPQCTRTHSVPAVTFSIGGYTGNIFHDFSDVLVPLYNTV--HRYR 352

Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTI---THCFQSVTLGLISHGRMVI 188
            DV L +A+    W  KY +LL   +RH  +++        THCF+   + L +H  ++I
Sbjct: 353 GDVQLVMANVAPWWLVKYDKLLRELSRHAPLDLAAAAAKGETHCFRHAVVSLRAHRELII 412

Query: 189 NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKK 248
                P      D                                 R+   +LNL ++ +
Sbjct: 413 ERNRSPDGLATPDXXXXXXXXXXXXXXXXXXXXXXXXXXXXARHRTRI---LLNLGDMMR 469

Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
            AEE GF+  + E +   S++     I+S   ++GVHGAGLT+ +FL PG+ ++QVVP G
Sbjct: 470 VAEEAGFEAAVSESDVGDSISRVGAEINSADVLLGVHGAGLTNMMFLAPGATMVQVVPWG 529

Query: 309 T-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-R 366
             QW++ + +  PA  +GL Y++Y+I  EESSL +KY     +  +P +     +  M R
Sbjct: 530 GLQWIARMDYGDPAEAMGLRYVQYEIGVEESSLKDKYPRGHKIFTDPTSLHKKGFGFMRR 589

Query: 367 VYLKTQNVKLDIDRFRIYLKDA 388
             +  QN+ LD+ RFR  L+ A
Sbjct: 590 TLMDGQNITLDLGRFRGVLQQA 611


>gi|226500048|ref|NP_001149318.1| HGA4 [Zea mays]
 gi|195626358|gb|ACG35009.1| HGA4 [Zea mays]
          Length = 536

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 127/385 (32%), Positives = 202/385 (52%), Gaps = 25/385 (6%)

Query: 25  LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITT-- 82
           +C ++G     PK  T + V  +           + +PY RK D   +  V E+T+ +  
Sbjct: 162 VCELSGDVRVSPKQRTVYLVNPSGGGGFDERGEKRVRPYARKDD-FLLPGVTEVTVKSVP 220

Query: 83  SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCN 142
           SA     C   HT+PA+VFS  GY  NFFH+  D  +PLF+T  +H  D DV L + +  
Sbjct: 221 SAAVAPKCTRQHTAPAVVFSVAGYTDNFFHDMADALIPLFLT-TAHL-DGDVQLLVTNYK 278

Query: 143 DQWARKYAELLPRFTRHPIININ---NQTITHCFQSVT-LGLISHGRMVINP--TLLPKP 196
             W +KY  +L + +RH +++ +        HCF +   LGL     ++++P  T  P+ 
Sbjct: 279 PWWVQKYTPVLRKLSRHDVMSFDAADEDDAVHCFPAGAFLGLYRDRDLILSPHPTRNPRN 338

Query: 197 KTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNRNARVGRTILNLREVKKAAEEL 253
            T+VDF  FL  A     +  ++       +P++++V+R     R +LNL EV  AA+EL
Sbjct: 339 LTMVDFARFLRGALALPRDRPAALGEQPGARPRMLIVSRAGT--RRLLNLDEVAAAADEL 396

Query: 254 GFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-T 309
           GF+VT  E     + AD   F   +++   +VGVHGAGLT+ +FL   +V++Q+VP G  
Sbjct: 397 GFNVTAAE-----AGADVPAFAAQVNAADVLVGVHGAGLTNQVFLPTEAVVVQIVPWGKM 451

Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYL 369
            W++T ++ +PA  LGL YLEY +  EE+SL +KY    +V  +P A     W  +   +
Sbjct: 452 DWMATNFYARPAAGLGLRYLEYYVGAEETSLKDKYPREHVVFSDPMALHKQGWQALAQTV 511

Query: 370 KTQNVKLDIDRFRIYLKDAYKKAKK 394
             Q+V +++  FR  L  A  K ++
Sbjct: 512 MKQDVTVNLTMFRPVLLQALDKLQQ 536


>gi|226494337|ref|NP_001149180.1| HGA4 [Zea mays]
 gi|195625278|gb|ACG34469.1| HGA4 [Zea mays]
 gi|413953957|gb|AFW86606.1| HGA4 [Zea mays]
          Length = 521

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 124/400 (31%), Positives = 204/400 (51%), Gaps = 28/400 (7%)

Query: 7   ALSPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIK 59
           A  P   ITCD +       +    +C + G     PK  T + V  + + P   +   K
Sbjct: 126 ATKPEAKITCDENGVDEGFPYARPPVCELAGDIRVSPKEKTMYFVNPSGAGPFDSNGEKK 185

Query: 60  TQPYPRKSDKSAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDC 117
            +P+ RK D   +  V E+TI   +SA     C   H  P +VFS  GY  NFFH+  D 
Sbjct: 186 IRPFARKDD-FLLPGVVEVTIKSVSSAAAAPRCTRRHDVPVVVFSVAGYTDNFFHDNTDV 244

Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
            +PLF+T  +H    +V L + +    W RK+  LL + + + +I+       HCF++  
Sbjct: 245 LIPLFLT-TAHL-QGEVQLLVTNYKPWWVRKFTPLLRKLSNYDVIDFGKDDEVHCFRAGH 302

Query: 178 LGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNR 232
           LG+     ++I+P  T  P+  ++VD+  FL  A+N   +  +        KP+++++ R
Sbjct: 303 LGMYRDRDLIISPHPTRNPRNYSMVDYNRFLRRAFNLPRDAPAVLGEETSAKPQMLIIER 362

Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGL 289
                R +LNL EV    E LGF VT+ E +     AD   F   +++   ++ VHGAGL
Sbjct: 363 KGT--RKLLNLPEVSALCEALGFAVTVAEAD-----ADVRVFAEKVNAADVLLAVHGAGL 415

Query: 290 THSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
           T+ +FL  G+VL+Q+VP G   W++T ++ +PAR + L Y+EY + +EE++L +KY  + 
Sbjct: 416 TNQIFLPTGAVLVQIVPWGKMDWMATNFYGQPARDMRLRYVEYYVSEEETTLKDKYPRDH 475

Query: 349 LVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
            V K+P       W  +   +  Q+V +++ RF+ +L  A
Sbjct: 476 YVFKDPMHIHAQGWPAIAEIIMKQDVMVNMTRFKPFLLQA 515


>gi|357144041|ref|XP_003573147.1| PREDICTED: uncharacterized protein LOC100838010 isoform 3
           [Brachypodium distachyon]
          Length = 507

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 120/386 (31%), Positives = 197/386 (51%), Gaps = 23/386 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           C  + R  D C  +G      +T T          P +   ++K  PY RK D  A S V
Sbjct: 124 CYETSRRSDTCEASGDVRVQGRTQTVHI------RPLEKEWKVK--PYCRKHDAFAQSHV 175

Query: 76  KELTITTSAPPNLSCGVTHTSPAL---VFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
           KE T+   +  +  C V  +   L   V S GG+ GN FH++ D  +P FIT  S     
Sbjct: 176 KEWTLRPLSSSSPHCTVNSSGSGLTAFVLSTGGFTGNLFHDYTDVLIPAFIT--SRRFHG 233

Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
           +V   ++     W  +Y ++  + + + +++I+      C+    +G   H  + ++ + 
Sbjct: 234 EVQFLVSSYKSWWTSRYLQIFQQLSNYEVLDIDADDEVRCYDKAVVGPTFHKELGVDASK 293

Query: 193 LPKPK-TLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLREV 246
            P P  ++VDF++ L  A+  E      S        +P+L++++R  +  R  LN R +
Sbjct: 294 TPSPGYSMVDFRAMLRQAFGLERAAAEPSGDRWDIRRRPRLLIISR--KHTRAFLNERAM 351

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
              A  LGFDV + EP+ ST +    R ++S   MVGVHGAGLT+ LFL  G+VL+Q+VP
Sbjct: 352 ADMAMSLGFDVRVGEPDVSTDVGKFARLVNSADVMVGVHGAGLTNMLFLPAGAVLVQIVP 411

Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
            G  +WL+   F++P+  + + YLEY I+ +E++L E+Y  +D VL++P +     W  +
Sbjct: 412 YGGLEWLARGTFKEPSPDMRIHYLEYMIQLDETTLSEQYAKDDPVLRDPASIHKQGWDKL 471

Query: 366 R-VYLKTQNVKLDIDRFRIYLKDAYK 390
           + VYL  QNV+  + R +I   +A K
Sbjct: 472 KTVYLDKQNVRPHLGRLKITFMEALK 497


>gi|357144037|ref|XP_003573145.1| PREDICTED: uncharacterized protein LOC100838010 isoform 1
           [Brachypodium distachyon]
          Length = 499

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 120/386 (31%), Positives = 197/386 (51%), Gaps = 23/386 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           C  + R  D C  +G      +T T          P +   ++K  PY RK D  A S V
Sbjct: 116 CYETSRRSDTCEASGDVRVQGRTQTVHI------RPLEKEWKVK--PYCRKHDAFAQSHV 167

Query: 76  KELTITTSAPPNLSCGVTHTSPAL---VFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
           KE T+   +  +  C V  +   L   V S GG+ GN FH++ D  +P FIT  S     
Sbjct: 168 KEWTLRPLSSSSPHCTVNSSGSGLTAFVLSTGGFTGNLFHDYTDVLIPAFIT--SRRFHG 225

Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
           +V   ++     W  +Y ++  + + + +++I+      C+    +G   H  + ++ + 
Sbjct: 226 EVQFLVSSYKSWWTSRYLQIFQQLSNYEVLDIDADDEVRCYDKAVVGPTFHKELGVDASK 285

Query: 193 LPKPK-TLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLREV 246
            P P  ++VDF++ L  A+  E      S        +P+L++++R  +  R  LN R +
Sbjct: 286 TPSPGYSMVDFRAMLRQAFGLERAAAEPSGDRWDIRRRPRLLIISR--KHTRAFLNERAM 343

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
              A  LGFDV + EP+ ST +    R ++S   MVGVHGAGLT+ LFL  G+VL+Q+VP
Sbjct: 344 ADMAMSLGFDVRVGEPDVSTDVGKFARLVNSADVMVGVHGAGLTNMLFLPAGAVLVQIVP 403

Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
            G  +WL+   F++P+  + + YLEY I+ +E++L E+Y  +D VL++P +     W  +
Sbjct: 404 YGGLEWLARGTFKEPSPDMRIHYLEYMIQLDETTLSEQYAKDDPVLRDPASIHKQGWDKL 463

Query: 366 R-VYLKTQNVKLDIDRFRIYLKDAYK 390
           + VYL  QNV+  + R +I   +A K
Sbjct: 464 KTVYLDKQNVRPHLGRLKITFMEALK 489


>gi|300681532|emb|CBH32629.1| glycosyltransferase, HGA-like, putative,expressed [Triticum
           aestivum]
          Length = 526

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 125/391 (31%), Positives = 206/391 (52%), Gaps = 22/391 (5%)

Query: 9   SPAKSITCD-RSHRSYDLCLI---NGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPY 63
           +PAK   CD +   + D+C +   +G       T T       +S  S P+ +  +  PY
Sbjct: 135 APAKKPACDIQGPWASDVCDLAGGSGGVRIHGSTHTVLVPPTIESGGSNPNPQEWRVLPY 194

Query: 64  PRKSDKSAMSKVKELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
            RK     MS +KE+T+    +A     C VT   PALVF+ GG  GN++H+F D  +PL
Sbjct: 195 SRKH----MSGIKEITVRELPTAADAPRCAVTSQVPALVFAMGGLTGNYWHDFSDVMIPL 250

Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
           ++   S F D +V L + +    +A KY  ++ R +R+ +++++      CF S  +G+ 
Sbjct: 251 YLQA-SRF-DGEVQLVVTNIQPWYAGKYRHIIARLSRYDVVDMDKDDQVRCFPSAIVGIR 308

Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAYN-----ENTNTSSSFHHTKPKLVLVNRNARV 236
            H    I+P   P   ++ +F  FL N +          T+      KP++++++R  R 
Sbjct: 309 MHKEFSIDPAKEPTGHSMPEFTKFLRNVFALPRAAPMRVTAGLISDKKPRMMIISR--RH 366

Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
            R ++N+ EV   A+ +GF+V I +P  +  + D  R +++   ++GVHGAGLT+SLFL 
Sbjct: 367 PRKLVNVAEVVALAKRIGFEVVIGDPPFNVDVGDFAREVNAADVLMGVHGAGLTNSLFLP 426

Query: 297 PGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
            G+V +QV P G  + +  V F  PA  +GL+Y+ Y    EES+LV+  G +   +K+P+
Sbjct: 427 TGAVFIQVNPFGKMEHIGEVDFGTPAVDMGLKYMAYSCGMEESTLVDTLGRDHPAVKDPE 486

Query: 356 AFAGANWSNM-RVYLKTQNVKLDIDRFRIYL 385
           +   + WS +   YL  Q+VKLD+ RF   L
Sbjct: 487 SIHRSGWSKVAEYYLGKQDVKLDLQRFEPVL 517


>gi|357144039|ref|XP_003573146.1| PREDICTED: uncharacterized protein LOC100838010 isoform 2
           [Brachypodium distachyon]
          Length = 506

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 120/386 (31%), Positives = 197/386 (51%), Gaps = 23/386 (5%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           C  + R  D C  +G      +T T          P +   ++K  PY RK D  A S V
Sbjct: 123 CYETSRRSDTCEASGDVRVQGRTQTVHI------RPLEKEWKVK--PYCRKHDAFAQSHV 174

Query: 76  KELTITTSAPPNLSCGVTHTSPAL---VFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
           KE T+   +  +  C V  +   L   V S GG+ GN FH++ D  +P FIT  S     
Sbjct: 175 KEWTLRPLSSSSPHCTVNSSGSGLTAFVLSTGGFTGNLFHDYTDVLIPAFIT--SRRFHG 232

Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
           +V   ++     W  +Y ++  + + + +++I+      C+    +G   H  + ++ + 
Sbjct: 233 EVQFLVSSYKSWWTSRYLQIFQQLSNYEVLDIDADDEVRCYDKAVVGPTFHKELGVDASK 292

Query: 193 LPKPK-TLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLREV 246
            P P  ++VDF++ L  A+  E      S        +P+L++++R  +  R  LN R +
Sbjct: 293 TPSPGYSMVDFRAMLRQAFGLERAAAEPSGDRWDIRRRPRLLIISR--KHTRAFLNERAM 350

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
              A  LGFDV + EP+ ST +    R ++S   MVGVHGAGLT+ LFL  G+VL+Q+VP
Sbjct: 351 ADMAMSLGFDVRVGEPDVSTDVGKFARLVNSADVMVGVHGAGLTNMLFLPAGAVLVQIVP 410

Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
            G  +WL+   F++P+  + + YLEY I+ +E++L E+Y  +D VL++P +     W  +
Sbjct: 411 YGGLEWLARGTFKEPSPDMRIHYLEYMIQLDETTLSEQYAKDDPVLRDPASIHKQGWDKL 470

Query: 366 R-VYLKTQNVKLDIDRFRIYLKDAYK 390
           + VYL  QNV+  + R +I   +A K
Sbjct: 471 KTVYLDKQNVRPHLGRLKITFMEALK 496


>gi|238009766|gb|ACR35918.1| unknown [Zea mays]
          Length = 548

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 127/385 (32%), Positives = 202/385 (52%), Gaps = 25/385 (6%)

Query: 25  LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITT-- 82
           +C ++G     PK  T + V  +           + +PY RK D   +  V E+T+ +  
Sbjct: 174 VCELSGDVRVSPKQRTVYLVNPSGGGGFDERGEKRVRPYARKDD-FLLPGVTEVTVKSVP 232

Query: 83  SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCN 142
           SA     C   HT+PA+VFS  GY  NFFH+  D  +PLF+T  +H  D DV L + +  
Sbjct: 233 SAAVAPKCTRQHTAPAVVFSVAGYTDNFFHDMADALIPLFLT-TAHL-DGDVQLLVTNYK 290

Query: 143 DQWARKYAELLPRFTRHPIININ---NQTITHCFQSVT-LGLISHGRMVINP--TLLPKP 196
             W +KY  +L + +RH +++ +        HCF +   LGL     ++++P  T  P+ 
Sbjct: 291 PWWVQKYTPVLRKLSRHDVMSFDAADEDDAVHCFPAGAFLGLYRDRDLILSPHPTRNPRN 350

Query: 197 KTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNRNARVGRTILNLREVKKAAEEL 253
            T+VDF  FL  A     +  ++       +P++++V+R     R +LNL EV  AA+EL
Sbjct: 351 LTMVDFARFLRGALALPRDRPAALGEQPGARPRMLIVSRAGT--RRLLNLDEVAAAADEL 408

Query: 254 GFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-T 309
           GF+VT  E     + AD   F   +++   +VGVHGAGLT+ +FL   +V++Q+VP G  
Sbjct: 409 GFNVTSAE-----AGADVPAFAAQVNAADVLVGVHGAGLTNQVFLPTEAVVVQIVPWGKM 463

Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYL 369
            W++T ++ +PA  LGL YLEY +  EE+SL +KY    +V  +P A     W  +   +
Sbjct: 464 DWMATNFYARPAAGLGLRYLEYYVGAEETSLKDKYPREHVVFSDPMALHKQGWQALAQTV 523

Query: 370 KTQNVKLDIDRFRIYLKDAYKKAKK 394
             Q+V +++  FR  L  A  K ++
Sbjct: 524 MKQDVTVNLTMFRPVLLQALDKLQQ 548


>gi|413953958|gb|AFW86607.1| hypothetical protein ZEAMMB73_638297 [Zea mays]
          Length = 460

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 124/400 (31%), Positives = 204/400 (51%), Gaps = 28/400 (7%)

Query: 7   ALSPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIK 59
           A  P   ITCD +       +    +C + G     PK  T + V  + + P   +   K
Sbjct: 65  ATKPEAKITCDENGVDEGFPYARPPVCELAGDIRVSPKEKTMYFVNPSGAGPFDSNGEKK 124

Query: 60  TQPYPRKSDKSAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDC 117
            +P+ RK D   +  V E+TI   +SA     C   H  P +VFS  GY  NFFH+  D 
Sbjct: 125 IRPFARKDD-FLLPGVVEVTIKSVSSAAAAPRCTRRHDVPVVVFSVAGYTDNFFHDNTDV 183

Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
            +PLF+T  +H    +V L + +    W RK+  LL + + + +I+       HCF++  
Sbjct: 184 LIPLFLT-TAHL-QGEVQLLVTNYKPWWVRKFTPLLRKLSNYDVIDFGKDDEVHCFRAGH 241

Query: 178 LGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNR 232
           LG+     ++I+P  T  P+  ++VD+  FL  A+N   +  +        KP+++++ R
Sbjct: 242 LGMYRDRDLIISPHPTRNPRNYSMVDYNRFLRRAFNLPRDAPAVLGEETSAKPQMLIIER 301

Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGL 289
                R +LNL EV    E LGF VT+ E +     AD   F   +++   ++ VHGAGL
Sbjct: 302 KGT--RKLLNLPEVSALCEALGFAVTVAEAD-----ADVRVFAEKVNAADVLLAVHGAGL 354

Query: 290 THSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
           T+ +FL  G+VL+Q+VP G   W++T ++ +PAR + L Y+EY + +EE++L +KY  + 
Sbjct: 355 TNQIFLPTGAVLVQIVPWGKMDWMATNFYGQPARDMRLRYVEYYVSEEETTLKDKYPRDH 414

Query: 349 LVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
            V K+P       W  +   +  Q+V +++ RF+ +L  A
Sbjct: 415 YVFKDPMHIHAQGWPAIAEIIMKQDVMVNMTRFKPFLLQA 454


>gi|300681547|emb|CBH32644.1| glycosyltransferase, HGA-like, putative,expressed [Triticum
           aestivum]
          Length = 450

 Score =  197 bits (502), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 133/396 (33%), Positives = 208/396 (52%), Gaps = 31/396 (7%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S    D+C + G         T  ++ H + +  + S R+K  P+ RK+D  A++ V
Sbjct: 73  CDMSSLRSDVCELRGDVRVILSNITIIALLHPNVSVRRRSWRMK--PHARKNDGHALANV 130

Query: 76  KEL--TITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            E+  ++T S+P    C     +PA+VFS GGY GN FH+F D  +PLFIT  S F   D
Sbjct: 131 TEVLVSVTPSSPYAPGCTAESAAPAVVFSVGGYAGNMFHDFTDVLIPLFIT-ASRF-RSD 188

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTI-THCFQSVTLGLISHGRMVINPT- 191
           V L +++    W  KY  LL   +RH +I+++ Q+    C+  V +GL  H  M I+   
Sbjct: 189 VHLLVSNAPPWWLDKYGPLLRGLSRHAVIDMDRQSEEVLCYPHVVVGLSFHKEMSIDTVK 248

Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSSSFHHT--------KPKLVLVNRNARVGRTILNL 243
            +    ++ DF      +Y    +T+    H         +P+L++++R  +  R   N+
Sbjct: 249 TVGGHYSMADFARLARRSYGLERDTAIRLLHGGDNIKSPRRPRLLIISR--KTTRAFTNM 306

Query: 244 REVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
             V +AA  LG++V + E E+ + L    R ++SC  +VGVHGAGLT+ +FL PG+V++Q
Sbjct: 307 GAVAQAAAMLGYEVIVGEAEQHSDLPAFARLVNSCGVLVGVHGAGLTNLVFLPPGAVVVQ 366

Query: 304 VVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW 362
           VVP+G  + ++   F  PA  +GL Y++Y I   ES+L E Y  +  VL+     A A  
Sbjct: 367 VVPLGGLEAMAGDDFGVPAGDMGLGYVQYGIAVGESTLAELYPRDHRVLR-----ALALR 421

Query: 363 SNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
           S    YL  QNV LD+ RF      A  +A +L+ +
Sbjct: 422 SE---YLVGQNVTLDVARF----SGALSRALELLHR 450


>gi|413947214|gb|AFW79863.1| hypothetical protein ZEAMMB73_417176 [Zea mays]
          Length = 451

 Score =  197 bits (502), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 123/384 (32%), Positives = 199/384 (51%), Gaps = 20/384 (5%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKT-STFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           + C+   RS D C ++G    D +T  T  SV    ++ S+       +PY R+      
Sbjct: 72  VVCNMEGRS-DTCEVDG----DVRTNGTALSVTLVPASRSERH-EWMIRPYSRRFASVRK 125

Query: 73  SKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
             V +L     A P   C VTH  PA++F+ GGY GN++H++ D  VPLF+    +  + 
Sbjct: 126 VTVTQLQDRADAAP---CAVTHDVPAVLFAIGGYAGNYWHDYADILVPLFVASRRYNGEV 182

Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
             +++      +W  KY   L   + +  ++++      CF  VT+GL       I P L
Sbjct: 183 KFLISNIRFQPRWLAKYKAFLQGLSLYDAVDMDGDAQVRCFPHVTVGLRLDKEFSIVPEL 242

Query: 193 LPKPK--TLVDFQSFLANAYNENTNTSSSFH----HTKPKLVLVNRNARVGRTILNLREV 246
           +P  +  ++ DF  FL   Y     +++S      H KP+L+L++R     R I N  EV
Sbjct: 243 VPGGRRLSMADFTRFLRETYALPRGSAASRDREQPHKKPRLLLIHRGHY--RRITNEPEV 300

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            +AAE  GF+  + E     + A+  R ++S   ++GVHGAGLT+++FL PG VL+QVVP
Sbjct: 301 ARAAEAAGFEAVVAELRGDATEAEQARVVNSFDVVLGVHGAGLTNAVFLPPGGVLIQVVP 360

Query: 307 IG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
            G  ++++   F +PA  +GL+YL+Y +  EESSL+E  G     +K+P +   + W  +
Sbjct: 361 YGKMEYIARAEFSEPATDMGLKYLDYSVSAEESSLMETLGPEHPAVKDPDSVHRSGWDQV 420

Query: 366 -RVYLKTQNVKLDIDRFRIYLKDA 388
             +YL  QNV++++ RF   L  A
Sbjct: 421 FELYLAKQNVRINVTRFAPTLAQA 444


>gi|357144127|ref|XP_003573181.1| PREDICTED: uncharacterized protein LOC100825423 isoform 1
           [Brachypodium distachyon]
          Length = 565

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 126/400 (31%), Positives = 209/400 (52%), Gaps = 33/400 (8%)

Query: 13  SITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDS-TPSQPSLRIKTQPYP 64
           ++TCD +       +    +C ++G     PK  T + V  + + T        + +PY 
Sbjct: 175 NMTCDENGVDEGFPYARPAVCELSGDIRVSPKQKTMYLVNPSGAATGFDEKGEKRLRPYA 234

Query: 65  RKSDKSAMSKVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
           R +D   +  V E+T+    +T+A P   C   H  PA+VFS  GY  NFFH+  D  +P
Sbjct: 235 R-NDDFLLPGVVEVTVKSVPSTAAAPQ--CTKQHRVPAVVFSVAGYTDNFFHDNTDALIP 291

Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
           L++T  +H    +V L I +    W +KY  +L + + + +IN +     HCF    LGL
Sbjct: 292 LYVT-TAHL-KGEVQLLITNYKPWWVQKYTPVLRKLSSYDVINFDEDAGVHCFHEGYLGL 349

Query: 181 ISHGRMVINP--TLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNAR 235
                ++I+P  T  P+  T+VD+  FL   +    E           +P++++++R+  
Sbjct: 350 YRDRDLIISPHPTRNPRNYTMVDYNRFLRGVFELRRERPAVLGEEPGMRPRMLIISRSGT 409

Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHS 292
             R +LNL EV   A ELGF+VT+ E     + AD   F   ++S   ++ VHGAGLT+ 
Sbjct: 410 --RKLLNLDEVAAEASELGFNVTVAE-----AGADVPAFAALVNSADVLLAVHGAGLTNQ 462

Query: 293 LFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
           +FL   +V++Q+VP G   W +T ++ +PAR + L Y+EY + +EE+SL +KY  + +V 
Sbjct: 463 IFLPTDAVVLQIVPWGNMDWQATNFYGQPAREMQLRYVEYYVGEEETSLKDKYPRDHMVF 522

Query: 352 KNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKK 391
           K+P+A     W  +   +  Q+V+++I RFR +L  A  K
Sbjct: 523 KDPKALHKQGWQTLANTIMKQDVQVNITRFRPFLLQAIDK 562


>gi|55792424|gb|AAV65332.1| HGA2 [Hordeum vulgare]
          Length = 529

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 113/340 (33%), Positives = 191/340 (56%), Gaps = 27/340 (7%)

Query: 61  QPYPRKSDKSAMSKVKELTIT-----TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFM 115
           QPY R +    M+ +  +T+T     ++A P  +C VTH  PA+V   GG+ GN+FH+F 
Sbjct: 184 QPYARYN----MTGIPNITVTQLDSTSAASPAPACTVTHRVPAIVIGLGGHLGNYFHDFS 239

Query: 116 DCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQS 175
           D  VPLF+    +    +V L  ++    W  KY  ++ R T++ ++++++     CF+ 
Sbjct: 240 DALVPLFVASRRY--GGEVQLLASNIQPWWLGKYEAVVRRLTKYDVVDLDHDDQIRCFRR 297

Query: 176 VTLGLISHGRMVINPTLLPK--PKTLVDFQSFLANAYN----------ENTNTSSSFHHT 223
           VT+GL  H    I P L+P   P ++ +F +FL   Y+          + ++        
Sbjct: 298 VTVGLNMHREFDIVPELVPGDVPLSMANFTAFLRETYSLPRAAPIRLTKGSSPPVDKKKK 357

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
           KP+L+L++R     R ++N+ E+ KAAE+ GF+V I +P  +  + +    ++S   ++G
Sbjct: 358 KPRLMLLDRGHY--RKLVNVPEIVKAAEKAGFEVVIADPRFNVRVKELAMSVNSFDVLLG 415

Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVE 342
           VHGAGLT++ FL PG+V++QVVP G  + ++   F  PA  +GL YL+Y I  EES+L+E
Sbjct: 416 VHGAGLTNAAFLPPGAVVIQVVPYGKLEPMAQREFGDPAADMGLRYLQYSITAEESTLLE 475

Query: 343 KYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRF 381
             G +   +K+P +   + W  +   YL  QNV++D++RF
Sbjct: 476 TLGPDHPAIKDPDSVHRSGWDKVAEFYLGKQNVRVDVERF 515


>gi|357144129|ref|XP_003573182.1| PREDICTED: uncharacterized protein LOC100825423 isoform 2
           [Brachypodium distachyon]
          Length = 542

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 123/397 (30%), Positives = 206/397 (51%), Gaps = 27/397 (6%)

Query: 13  SITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDS-TPSQPSLRIKTQPYP 64
           ++TCD +       +    +C ++G     PK  T + V  + + T        + +PY 
Sbjct: 152 NMTCDENGVDEGFPYARPAVCELSGDIRVSPKQKTMYLVNPSGAATGFDEKGEKRLRPYA 211

Query: 65  RKSDKSAMSKVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
           R +D   +  V E+T+    +T+A P   C   H  PA+VFS  GY  NFFH+  D  +P
Sbjct: 212 R-NDDFLLPGVVEVTVKSVPSTAAAPQ--CTKQHRVPAVVFSVAGYTDNFFHDNTDALIP 268

Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
           L++T  +H    +V L I +    W +KY  +L + + + +IN +     HCF    LGL
Sbjct: 269 LYVT-TAHL-KGEVQLLITNYKPWWVQKYTPVLRKLSSYDVINFDEDAGVHCFHEGYLGL 326

Query: 181 ISHGRMVINP--TLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNAR 235
                ++I+P  T  P+  T+VD+  FL   +    E           +P++++++R+  
Sbjct: 327 YRDRDLIISPHPTRNPRNYTMVDYNRFLRGVFELRRERPAVLGEEPGMRPRMLIISRSGT 386

Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
             R +LNL EV   A ELGF+VT+ E      +      ++S   ++ VHGAGLT+ +FL
Sbjct: 387 --RKLLNLDEVAAEASELGFNVTVAE--AGADVPAFAALVNSADVLLAVHGAGLTNQIFL 442

Query: 296 RPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
              +V++Q+VP G   W +T ++ +PAR + L Y+EY + +EE+SL +KY  + +V K+P
Sbjct: 443 PTDAVVLQIVPWGNMDWQATNFYGQPAREMQLRYVEYYVGEEETSLKDKYPRDHMVFKDP 502

Query: 355 QAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKK 391
           +A     W  +   +  Q+V+++I RFR +L  A  K
Sbjct: 503 KALHKQGWQTLANTIMKQDVQVNITRFRPFLLQAIDK 539


>gi|326504684|dbj|BAK06633.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 522

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 126/390 (32%), Positives = 202/390 (51%), Gaps = 14/390 (3%)

Query: 9   SPAKS-ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDST-PSQPSLRIKTQPYPRK 66
           +P +S ++C+ S    ++C + G      K +T + V  +D   P   ++ I+  PYPRK
Sbjct: 132 APTRSKLSCNLSSYRTNMCAMQGDVRVHGKAATVYVVSASDDNRPDNGTITIR--PYPRK 189

Query: 67  SDKSAMSKVKELTITTSAPPNLS---CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
            +   M  V+E+TI   APP      C VT+  PA+VFS GGY  N FH   D  +PL+ 
Sbjct: 190 WETPTMQLVREVTIRWRAPPGPGAPRCTVTYDVPAVVFSTGGYGVNIFHAITDIIIPLYN 249

Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
           T   +  D  V L   + + +W  KY   L   + +PII+++      CF S  +G  SH
Sbjct: 250 TAREY--DGRVRLIATNYDRKWIAKYRHALSLISIYPIIDLDADNEVRCFPSAHVGTESH 307

Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH-TKPKLVLVNRNARVGRTILN 242
             + I+  L  K  T++ F+  + +AY+      +  +H +KP+LV+V R  R  R + N
Sbjct: 308 KELGIDSALSGKGYTMMGFRGLIRSAYSLKREWVTPINHGSKPRLVMVLR--RNSRALTN 365

Query: 243 LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
             +V  AA E+GF+V    PE    L      ++SC  +VGVHGAGLT+ +FL     ++
Sbjct: 366 EAQVVAAAAEVGFEVVAAGPEVVRDLGKFAETVNSCDVLVGVHGAGLTNMVFLPRNGTVL 425

Query: 303 QVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGAN 361
           Q+VP G  +W +   + +P   +GL Y EY++  EE++L + Y  N  V  +P +     
Sbjct: 426 QIVPWGEMKWPAWTSYGEPVAPMGLRYAEYEVTAEETTLKDVYPRNHTVFTDPVSIHKQG 485

Query: 362 WSNM-RVYLKTQNVKLDIDRFRIYLKDAYK 390
           ++ +   +L  QN+ LD+ RF   L+  Y+
Sbjct: 486 FNMLWETFLNGQNLTLDVHRFTGVLQHIYQ 515


>gi|326489348|dbj|BAK01657.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326527847|dbj|BAK08167.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 527

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 125/395 (31%), Positives = 199/395 (50%), Gaps = 32/395 (8%)

Query: 9   SPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQ-PSLRIKTQPYPRKS 67
           +P+K I CD S   YD+C I+G A       T   V   D   +  P   IK Q   RK 
Sbjct: 130 APSKPI-CDLSDPRYDICEISGDARAIGTNRTVLYVPPVDERGADGPEWAIKDQS--RKY 186

Query: 68  DKSAMS-KVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT 124
            +  M  KVK L+   S  AP    C   H  PA+VF+  G  GN +H+  D  +PLFIT
Sbjct: 187 LEDIMEVKVKTLSAAQSLVAP---ECTSKHDVPAIVFAMNGIIGNPWHDLSDVLIPLFIT 243

Query: 125 INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHG 184
             ++  D +V   + +    +  KY  +L   +R+ I++ N  +   C+  + +GL SHG
Sbjct: 244 ARAY--DGEVQFLVTELQPWFVEKYRLILTNLSRYDIVDFNKDSGVRCYPHIVVGLHSHG 301

Query: 185 RMVINPTLLPKPKTLVDFQSFL-----------------ANAYNENTNTSSSFHHTKPKL 227
            + I+P   P+  TL+DF+ ++                 AN  N   + ++     KP+L
Sbjct: 302 DLDIDPARTPRNYTLLDFRLYIRDIFSLPSKGIGIPYKEANKKNSTDDNTTVTEKQKPRL 361

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           +L+NR   + R  +NL E+  A +  GF+V + EP    SL +  R + S   ++GVHGA
Sbjct: 362 LLINRG--MSRKFVNLPEITAAVQAAGFEVLVVEPHRDMSLEEFARMLDSSDVLMGVHGA 419

Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
            LT+  FLR  +V++QVV +G +  +  Y+   A  + L+++EY I  EES+L EKYG +
Sbjct: 420 ALTNFFFLRTNAVVLQVVGLGLEREAMHYYGAQAMKVMLQHVEYFIAAEESTLYEKYGKD 479

Query: 348 DLVLKNPQAFAGANWSNMRVYL-KTQNVKLDIDRF 381
              +++P +     W   + Y    Q+++L++ RF
Sbjct: 480 HPAVRDPDSIHKQGWQGAKQYFWAEQDIRLNVTRF 514


>gi|326531726|dbj|BAJ97867.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 521

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 125/395 (31%), Positives = 199/395 (50%), Gaps = 32/395 (8%)

Query: 9   SPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQ-PSLRIKTQPYPRKS 67
           +P+K I CD S   YD+C I+G A       T   V   D   +  P   IK Q   RK 
Sbjct: 124 APSKPI-CDLSDPRYDICEISGDARAIGTNRTVLYVPPVDERGADGPEWAIKDQS--RKY 180

Query: 68  DKSAMS-KVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT 124
            +  M  KVK L+   S  AP    C   H  PA+VF+  G  GN +H+  D  +PLFIT
Sbjct: 181 LEDIMEVKVKTLSAAQSLVAP---ECTSKHDVPAIVFAMNGIIGNPWHDLSDVLIPLFIT 237

Query: 125 INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHG 184
             ++  D +V   + +    +  KY  +L   +R+ I++ N  +   C+  + +GL SHG
Sbjct: 238 ARAY--DGEVQFLVTELQPWFVEKYRLILTNLSRYDIVDFNKDSGVRCYPHIVVGLHSHG 295

Query: 185 RMVINPTLLPKPKTLVDFQSFL-----------------ANAYNENTNTSSSFHHTKPKL 227
            + I+P   P+  TL+DF+ ++                 AN  N   + ++     KP+L
Sbjct: 296 DLDIDPARTPRNYTLLDFRLYIRDIFSLPSKGIGIPYKEANKKNSTDDNTTVTEKQKPRL 355

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           +L+NR   + R  +NL E+  A +  GF+V + EP    SL +  R + S   ++GVHGA
Sbjct: 356 LLINRG--MSRKFVNLPEITAAVQAAGFEVLVVEPHRDMSLEEFARMLDSSDVLMGVHGA 413

Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
            LT+  FLR  +V++QVV +G +  +  Y+   A  + L+++EY I  EES+L EKYG +
Sbjct: 414 ALTNFFFLRTNAVVLQVVGLGLEREAMHYYGAQAMKVMLQHVEYFIAAEESTLYEKYGKD 473

Query: 348 DLVLKNPQAFAGANWSNMRVYL-KTQNVKLDIDRF 381
              +++P +     W   + Y    Q+++L++ RF
Sbjct: 474 HPAVRDPDSIHKQGWQGAKQYFWAEQDIRLNVTRF 508


>gi|326490115|dbj|BAJ94131.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 394

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 123/387 (31%), Positives = 206/387 (53%), Gaps = 31/387 (8%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + C  S  SY  C ++G    +    T   V  T  T  +       QPY R +    M+
Sbjct: 6   VVCRASPFSY-TCDVSGDVRTNGTAHTVTLVPATSLTERR---EWSIQPYARYN----MT 57

Query: 74  KVKELTIT-----TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
            +  +T+T     ++A P  +C VTH  PA+V   GG+ GN+FH+F D  VPLF+    +
Sbjct: 58  GIPNITVTQLDSTSAASPAPACTVTHRVPAIVIGLGGHLGNYFHDFSDALVPLFVASRRY 117

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
               +V L  ++    W  KY  ++ R T++ ++++++     CF+ VT+GL  H    I
Sbjct: 118 --GGEVQLLASNIQPWWLGKYEAVVRRLTKYDVLDLDHDDQIRCFRRVTVGLNMHREFDI 175

Query: 189 NPTLLPK--PKTLVDFQSFLANAYN----------ENTNTSSSFHHTKPKLVLVNRNARV 236
            P L+P   P ++ +F +FL   Y+          + ++        KP+L+L++R    
Sbjct: 176 VPELVPGDVPLSMANFTAFLRETYSLPRAAPIRLTKGSSPPVDKKKKKPRLMLLDRGHY- 234

Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
            R ++N+ E+ KAAE+ GF+V I +P  +  + +    ++S   ++GVHGAGLT++ FL 
Sbjct: 235 -RKLVNVPEIVKAAEKAGFEVVIADPRFNVRVKELAMSVNSFDVLLGVHGAGLTNAAFLP 293

Query: 297 PGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
           PG+V++QVVP G  + ++   F  PA  +GL YL+Y I  EES+L+E  G +   +K+P 
Sbjct: 294 PGAVVIQVVPYGKLEPMAQREFGDPAADMGLRYLQYSITAEESTLLETLGPDHPAIKDPD 353

Query: 356 AFAGANWSNM-RVYLKTQNVKLDIDRF 381
           +   + W  +   YL  QNV++D++RF
Sbjct: 354 SVHRSGWDKVAEFYLGKQNVRVDVERF 380


>gi|56409856|emb|CAI30076.1| glycosyltransferase [Triticum aestivum]
          Length = 525

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 125/394 (31%), Positives = 207/394 (52%), Gaps = 22/394 (5%)

Query: 9   SPAKSITCD-RSHRSYDLCLI---NGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPY 63
           +PAK   CD +   + D+C +   +G       T T       +S  S P+ +  +  PY
Sbjct: 134 APAKKPACDIQGPWASDVCDLAGGSGGVRIHGSTHTVLVPPTIESGGSNPNPQEWRVLPY 193

Query: 64  PRKSDKSAMSKVKELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
            RK     MS +KE+T+    +A     C VT   PALVF+ GG  GN++H+F D  +PL
Sbjct: 194 SRKH----MSGIKEITVRELPTAADAPRCAVTSQVPALVFAMGGLTGNYWHDFSDVMIPL 249

Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
           ++   + F D +V L + +    +A KY  ++ R +R+ +++++      CF S  +G+ 
Sbjct: 250 YLQA-ARF-DGEVQLVVTNIQPWYAGKYRHIIARLSRYDVVDMDKGRQVRCFPSAVVGIR 307

Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAYN-----ENTNTSSSFHHTKPKLVLVNRNARV 236
            H    I+PT  P   ++ +F  FL N +          T+      KP++++++R  R 
Sbjct: 308 MHKEFSIDPTKEPTGHSMPEFTKFLRNVFALPRAAPMRVTAGLISDKKPRMMIISR--RH 365

Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
            R ++N+ EV   A+ +GF+V I +P  +  + D  R +++   ++GVHGAGLT+SLFL 
Sbjct: 366 PRKLVNVAEVVALAKRIGFEVVIGDPPFNVDVGDFAREVNAADVLMGVHGAGLTNSLFLP 425

Query: 297 PGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
            G+V +QV P G  + +  V F  PA  +GL+Y+ Y    EES+LV+  G +   +K+P+
Sbjct: 426 TGAVFIQVNPFGKMEHIGEVDFGTPAVDMGLKYMAYSCGMEESTLVDTLGRDHPAVKDPE 485

Query: 356 AFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
           +   + WS +   YL  Q+VKLD+ R    L  A
Sbjct: 486 SIHRSGWSKVAEYYLGKQDVKLDLQRVEPVLIKA 519


>gi|413953955|gb|AFW86604.1| hypothetical protein ZEAMMB73_638297 [Zea mays]
          Length = 528

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 125/401 (31%), Positives = 207/401 (51%), Gaps = 29/401 (7%)

Query: 6   AALSPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRI 58
           A ++ AK ITCD +       +    +C + G     PK  T + V  + + P   +   
Sbjct: 133 AIITEAK-ITCDENGVDEGFPYARPPVCELAGDIRVSPKEKTMYFVNPSGAGPFDSNGEK 191

Query: 59  KTQPYPRKSDKSAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMD 116
           K +P+ RK D   +  V E+TI   +SA     C   H  P +VFS  GY  NFFH+  D
Sbjct: 192 KIRPFARKDD-FLLPGVVEVTIKSVSSAAAAPRCTRRHDVPVVVFSVAGYTDNFFHDNTD 250

Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
             +PLF+T  +H    +V L + +    W RK+  LL + + + +I+       HCF++ 
Sbjct: 251 VLIPLFLT-TAHL-QGEVQLLVTNYKPWWVRKFTPLLRKLSNYDVIDFGKDDEVHCFRAG 308

Query: 177 TLGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVN 231
            LG+     ++I+P  T  P+  ++VD+  FL  A+N   +  +        KP+++++ 
Sbjct: 309 HLGMYRDRDLIISPHPTRNPRNYSMVDYNRFLRRAFNLPRDAPAVLGEETSAKPQMLIIE 368

Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAG 288
           R     R +LNL EV    E LGF VT+ E +     AD   F   +++   ++ VHGAG
Sbjct: 369 RKGT--RKLLNLPEVSALCEALGFAVTVAEAD-----ADVRVFAEKVNAADVLLAVHGAG 421

Query: 289 LTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
           LT+ +FL  G+VL+Q+VP G   W++T ++ +PAR + L Y+EY + +EE++L +KY  +
Sbjct: 422 LTNQIFLPTGAVLVQIVPWGKMDWMATNFYGQPARDMRLRYVEYYVSEEETTLKDKYPRD 481

Query: 348 DLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
             V K+P       W  +   +  Q+V +++ RF+ +L  A
Sbjct: 482 HYVFKDPMHIHAQGWPAIAEIIMKQDVMVNMTRFKPFLLQA 522


>gi|242060158|ref|XP_002451368.1| hypothetical protein SORBIDRAFT_04g000840 [Sorghum bicolor]
 gi|241931199|gb|EES04344.1| hypothetical protein SORBIDRAFT_04g000840 [Sorghum bicolor]
          Length = 571

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 128/402 (31%), Positives = 211/402 (52%), Gaps = 31/402 (7%)

Query: 14  ITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRK 66
           +TCD +       +    +C ++G     PK  T + V  + +     S   + +PY RK
Sbjct: 180 MTCDENGVDEGFPYARPTVCELSGDVRVSPKQKTVYLVNPSGAGGFDESGEKRLRPYARK 239

Query: 67  SDKSAMSKVKELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT 124
            D   M  V E+T+ +  SA     C   HT PA++FS  GY  NFFH+ +D  VPLF+T
Sbjct: 240 DD-FLMPGVTEVTVKSVPSAAVAPKCTKHHTVPAVLFSIAGYTDNFFHDMVDAMVPLFLT 298

Query: 125 INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTI--THCFQSVT-LGLI 181
             SH    +V L I +    W +KY  LL + + H +IN + +     HCF +   +GL 
Sbjct: 299 -TSHL-KGEVQLLITNYKPWWVQKYTPLLRKMSLHDVINFDAEDADDVHCFPAGAFVGLY 356

Query: 182 SHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARV 236
               ++++P  T  P+  T+VDF  F+  A     +  +        +P++++++R    
Sbjct: 357 RDRDLILSPHPTRNPRNLTMVDFSRFMRGALALPRDRPAVLGEAPGMRPRMLIISRAGT- 415

Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHSL 293
            R +LNL EV K A+ELGF+VTI E     + AD   F   +++   +VGVHGAGL + +
Sbjct: 416 -RRLLNLDEVAKVADELGFNVTIAE-----AGADVPAFAAQVNAADVLVGVHGAGLANVV 469

Query: 294 FLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLK 352
           FL   +V++Q+VP G   W++T ++ +PA  + L YLEY + +EE+SL +KY  + +V +
Sbjct: 470 FLPTEAVVVQIVPWGKMDWMATNFYARPAAGMALRYLEYYVGEEETSLKDKYPRDHVVFR 529

Query: 353 NPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
           +P +     W  +   +  Q+V +++ +FR  L  A  K ++
Sbjct: 530 DPMSLHTQGWQALAQTIMKQDVAVNLTKFRPVLLQALDKLQQ 571


>gi|31296713|gb|AAP46642.1| HGA4 [Hordeum vulgare]
          Length = 525

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 123/392 (31%), Positives = 206/392 (52%), Gaps = 23/392 (5%)

Query: 9   SPAKSITCD-RSHRSYDLCLI---NGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPY 63
           +PAK   CD +   + D+C +   +G         T       +S  S P+ +  +  PY
Sbjct: 133 APAKKPACDIQGPWASDVCDLAGGSGGVRIHGSAHTVLVPPTIESGGSNPNPQEWRVLPY 192

Query: 64  PRKSDKSAMSKVKELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
            RK     MS +KE+T+   A P  +  C VT   PALVF+ GG  GN++H+F D  +PL
Sbjct: 193 SRKH----MSGIKEITVRELATPADAPRCAVTSQVPALVFAMGGLTGNYWHDFSDVMIPL 248

Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
           ++   S F D +V L + +    +A KY +++ R +R+ +++++      CF S  +G+ 
Sbjct: 249 YLQA-SRF-DGEVQLVVTNYQRWYAGKYRDIIARLSRYDVVDMDKDDQVRCFPSAVVGIR 306

Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAYN------ENTNTSSSFHHTKPKLVLVNRNAR 235
                 I+P   P   ++ +F  FL N +              S    KP++++++R  R
Sbjct: 307 MPKEFSIDPAKEPTGHSMPEFTKFLRNVFALPRAGPMGVTAGPSDGKKKPRMMIISR--R 364

Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
             R ++N+ EV   A+ +GF+V I +P  +  +AD  R +++   ++GVHGAG+T+SLFL
Sbjct: 365 HPRKLVNVDEVVALAKRIGFEVVIGDPPFNVDVADFAREVNAADVLMGVHGAGMTNSLFL 424

Query: 296 RPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
             G+V +QV P G  + +  V F  PA  +GL+Y+ Y    EES+LV+  G +   +K+P
Sbjct: 425 PTGAVFIQVNPFGKMEHIGEVDFGTPAVDMGLKYMSYSCGMEESTLVDTLGRDHPAVKDP 484

Query: 355 QAFAGANWSNM-RVYLKTQNVKLDIDRFRIYL 385
           ++   + W+ +   YL  Q+VKLD+ RF   L
Sbjct: 485 ESIHRSGWNKVAEFYLGKQDVKLDLQRFEPVL 516


>gi|357127198|ref|XP_003565271.1| PREDICTED: uncharacterized protein LOC100846082 [Brachypodium
           distachyon]
          Length = 456

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 136/403 (33%), Positives = 206/403 (51%), Gaps = 43/403 (10%)

Query: 13  SITCDRSHRSYDLCLINGSA-LFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSA 71
           ++ CD S    D+C + G   +F P T+  F       T  + S R+K  P+ RK+D+ A
Sbjct: 68  ALFCDMSSPRSDVCELKGDVRVFLPNTTIVF----LHPTIRRRSWRMK--PHARKNDRHA 121

Query: 72  MSKVKELTITTS-------APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT 124
           +S V E++++         AP    C     +PA++FSAGGY GN FH+F D  VPLFIT
Sbjct: 122 LSSVTEVSLSVVASSSLRHAPSG--CTAESAAPAVIFSAGGYAGNMFHDFTDVLVPLFIT 179

Query: 125 INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTI-THCFQSVTLGLISH 183
             S F   +V   ++D    W  KY  LL   +RH II++N ++    C++ V +GL  H
Sbjct: 180 -ASRF-HGEVHFLVSDAPSWWLDKYQPLLRMLSRHAIIDMNRRSSEVLCYRHVIVGLRFH 237

Query: 184 GRMVINPTLLPKPK-TLVDFQSFLANAY------------NENTNTSSSFH-HTKPKLVL 229
             M I+       + ++ DF      +Y            N+N N  S    H +P+L++
Sbjct: 238 KEMSIDAAKTVGGRYSMADFARLARTSYGLERDRAIQLPRNDNNNGGSGVESHHRPRLLI 297

Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGL 289
           ++R A   R   N+  + + A  LG++V + E ++ + LA   R ++SC  +V +HGA L
Sbjct: 298 ISRKAT--RAFTNVDAIARTASILGYNVVVGEADQQSDLAALARLVNSCDVLVCLHGAVL 355

Query: 290 THSLFLRPGSVLMQVVPIGTQWLSTV-YFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
           T+ +FL  G+V++QVVP+G    + V  F  PAR +GL Y++Y I  EESS   +     
Sbjct: 356 TNLVFLPAGAVVVQVVPLGGLEAAAVEAFGAPARDMGLGYVQYNIAVEESSQAAR----- 410

Query: 349 LVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYK 390
            VL  P A     W  +   YL  QNV LD+ RFR  L  A +
Sbjct: 411 -VLAEPPAVRKEGWLALWSAYLVGQNVTLDVARFRGALSRALE 452


>gi|413934853|gb|AFW69404.1| hypothetical protein ZEAMMB73_073843 [Zea mays]
          Length = 331

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 116/328 (35%), Positives = 182/328 (55%), Gaps = 15/328 (4%)

Query: 72  MSKVKELTITTSAPPNLS-----CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN 126
           M  V+E T+     PN +     C  TH+ P  +FS+GG+ GN +H++ D  VPLF + N
Sbjct: 1   MDDVREFTLVPFGGPNDTAVPPLCTRTHSVPGFLFSSGGFAGNLYHDYADVLVPLFASTN 60

Query: 127 SHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRM 186
            H   + V   +AD  D WA K+  L  + +R+ +I++NN    HCF  + +G   H  M
Sbjct: 61  -HLGGE-VQFLLADIKDWWADKFRPLFRQLSRYDVIDVNNDREVHCFPRIIIGSTFHRAM 118

Query: 187 VINPTLLPKPKTLVDFQSFLANAYNEN----TNTSSSFHHTKPKLVLVNRNARVGRTILN 242
            I+P+  P   T+ DF+  L  A+       + + +     +P+L++++R +   R  +N
Sbjct: 119 GIDPSRSPGGVTVADFKRLLRRAFRLERAVASRSGAPRRRDRPRLLIISRKS--SRRFVN 176

Query: 243 LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
            R + +AA    FDV I EP+  T + +  R ++S   M+GVHGAGLT+ +FL   +VL+
Sbjct: 177 ERAMARAAAAARFDVRIAEPDNHTDMPNFARLVNSADVMMGVHGAGLTNMVFLPSRAVLV 236

Query: 303 QVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGAN 361
           QVVP G  +WL+ V F+ PAR + + Y+EY +  EESSL + Y  +   LK+P       
Sbjct: 237 QVVPFGGLEWLTRVTFKDPARDMDVTYMEYNVSLEESSLRDLYPEDHFYLKHPYDVHKKG 296

Query: 362 WSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
           W  ++ VYL  QNV+L++ RF   L+ A
Sbjct: 297 WDAIKTVYLDKQNVRLNLTRFTRTLEQA 324


>gi|326515946|dbj|BAJ87996.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516144|dbj|BAJ88095.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 525

 Score =  194 bits (493), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 126/397 (31%), Positives = 205/397 (51%), Gaps = 33/397 (8%)

Query: 9   SPAKSITCD-RSHRSYDLC---------LINGSALFDPKTSTFFSVGHTDSTPSQPSLRI 58
           +PAK   CD +   + D+C          I+GSA       T  S G   S P+    R+
Sbjct: 133 APAKKPACDIQGPWASDVCDLAGGSGGVRIHGSAHTVLVPPTIESGG---SNPNPQEWRV 189

Query: 59  KTQPYPRKSDKSAMSKVKELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMD 116
              PY RK     MS +KE+T+     P  +  C VT   PALVF+ GG  GN++H+F D
Sbjct: 190 --LPYSRKH----MSGIKEITVRELPTPADAPRCAVTSQVPALVFAMGGLTGNYWHDFSD 243

Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
             +PL++  +    D +V L + +    +A KY  ++ R +R+ +++++      CF S 
Sbjct: 244 VMIPLYLQASCF--DGEVQLVVTNYQRWYAGKYRHIIARLSRYDVVDMDKDDQVRCFPSA 301

Query: 177 TLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN------ENTNTSSSFHHTKPKLVLV 230
            +G+  H    I+P   P   ++ +F  FL N +              S    KP+++++
Sbjct: 302 VVGIRMHKEFSIDPAKEPTGHSMPEFTKFLRNVFALPRAAPMRVTAGPSDGKKKPRMMII 361

Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
           +R  R  R ++N+ EV   A+ +GF+V I +P  +  + D  R +++   ++GVHGAGLT
Sbjct: 362 SR--RHPRKLVNVDEVVALAKRIGFEVVIGDPPFNVDVGDFAREVNAADVLMGVHGAGLT 419

Query: 291 HSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDL 349
           +SLFL  G+V +QV P G  + +  V F  PA  +GL+Y+ Y    EES+LV+  G +  
Sbjct: 420 NSLFLPTGAVFIQVNPFGKMEHIGEVDFGTPAVDMGLKYMAYSCGTEESTLVDTLGRDHP 479

Query: 350 VLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYL 385
            +K+P++   + W+ +   YL  Q+VKLD+ RF   L
Sbjct: 480 AVKDPESIHRSGWNKVAEFYLGKQDVKLDLQRFEPVL 516


>gi|242093064|ref|XP_002437022.1| hypothetical protein SORBIDRAFT_10g018270 [Sorghum bicolor]
 gi|241915245|gb|EER88389.1| hypothetical protein SORBIDRAFT_10g018270 [Sorghum bicolor]
          Length = 529

 Score =  194 bits (493), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 123/403 (30%), Positives = 203/403 (50%), Gaps = 28/403 (6%)

Query: 4   RPAALSPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSL 56
           +PA       ITCD +       +    +C + G     PK    + V  + + P   + 
Sbjct: 131 KPAGAVTEAKITCDENGVDEGFPYARPPVCELAGDIRISPKEKAMYLVNPSGAGPFDSNG 190

Query: 57  RIKTQPYPRKSDKSAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEF 114
             K +P+ R +D   +  V E+TI   +SA     C   H  P +VFS  GY  NFFH+ 
Sbjct: 191 EKKIRPFAR-NDGFLLPGVVEVTIKSVSSAAAAPQCTRRHDVPVVVFSVAGYTDNFFHDN 249

Query: 115 MDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQ 174
            D  +PLF+T  +H    +V   I +    W  K+  LL + + + +IN +     HCF+
Sbjct: 250 TDVLIPLFLT-TAHL-KGEVQFLITNFKPWWVNKFTPLLKKLSNYDVINFDEDKEVHCFR 307

Query: 175 SVTLGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTSSSFH---HTKPKLVL 229
           +  LG+     ++I+P  T  P   ++VD+  FL  A++   +  +        KPK+++
Sbjct: 308 AGHLGMYRDRDLIISPHPTRNPHNYSMVDYNRFLRRAFSLPRDAPAVLGAETSAKPKMLI 367

Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHG 286
           + R     R +LNLREV    E LGF VT+ E     + AD   F   +++   ++ VHG
Sbjct: 368 IERKGT--RKLLNLREVAAMCEALGFAVTVAE-----AGADVRGFAERVNAADVLLAVHG 420

Query: 287 AGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYG 345
           AGLT+ +FL  G+VL+Q+VP G   W++T ++ +PAR + L Y+EY + +EE++L +KY 
Sbjct: 421 AGLTNQIFLPTGAVLVQIVPWGKMDWMATNFYGQPARDMRLRYVEYYVSEEETTLKDKYP 480

Query: 346 ANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
            +  V K+P       W  +   +  Q+V +++ RF+ +L  A
Sbjct: 481 RDHYVFKDPMRIHAQGWPAIAEIIMKQDVMVNMTRFKPFLLKA 523


>gi|326532542|dbj|BAK05200.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 559

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 122/395 (30%), Positives = 204/395 (51%), Gaps = 23/395 (5%)

Query: 9   SPAKSITCD-RSHRSYDLCLI---NGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPY 63
           +PAK   CD +   + D+C +   +G         T       +S  S P+ +  +  PY
Sbjct: 167 APAKKPACDIQGPWASDVCDLAGGSGGVRIHGSAHTVLVPPTIESGGSNPNPQEWRVLPY 226

Query: 64  PRKSDKSAMSKVKELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
            RK     MS +KE+T+     P  +  C VT   PALVF+ GG  GN++H+F D  +PL
Sbjct: 227 SRKH----MSGIKEITVRELPTPADAPRCAVTSQVPALVFAMGGLTGNYWHDFSDVMIPL 282

Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
           ++  +    D +V L + +    +A KY  ++ R +R+ +++++      CF S  +G+ 
Sbjct: 283 YLQASCF--DGEVQLVVTNYQRWYAGKYRHIIARLSRYDVVDMDKDDQVRCFPSAVVGIR 340

Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAYN------ENTNTSSSFHHTKPKLVLVNRNAR 235
            H    I+P   P   ++ +F  FL N +              S    KP++++++R  R
Sbjct: 341 MHKEFSIDPAKEPTGHSMPEFTKFLRNVFALPRAAPMRVTAGPSDGKKKPRMMIISR--R 398

Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
             R ++N+ EV   A+ +GF+V I +P  +  + D  R +++   ++GVHGAGLT+SLFL
Sbjct: 399 HPRKLVNVDEVVALAKRIGFEVVIGDPPFNVDVGDFAREVNAADVLMGVHGAGLTNSLFL 458

Query: 296 RPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
             G+V +QV P G  + +  V F  PA  +GL+Y+ Y    EES+LV+  G +   +K+P
Sbjct: 459 PTGAVFIQVNPFGKMEHIGEVDFGTPAVDMGLKYMAYSCGTEESTLVDTLGRDHPAVKDP 518

Query: 355 QAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
           ++   + W+ +   YL  Q+VKLD+ RF   L  A
Sbjct: 519 ESIHRSGWNKVAEFYLGKQDVKLDLQRFEPVLLKA 553


>gi|326517042|dbj|BAJ96513.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 123/396 (31%), Positives = 203/396 (51%), Gaps = 33/396 (8%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + C+   RS D C ++G    +  T+   ++    S P +    I   PY R+      +
Sbjct: 107 VVCNMEGRS-DTCEVDGDVRTN-GTALSVTLVPAASWPERHEWMIT--PYSRR-----FA 157

Query: 74  KVKELTIT-----TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
            V+++T+T      +APP   C VTH  PA++F+ GGY  N++H + D  VPLF+    +
Sbjct: 158 SVRKVTVTQVQDRAAAPP---CTVTHGMPAVLFAVGGYAANYWHAYADILVPLFVASRRY 214

Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
             +   +++      +W  +Y   L   +++ +++++      CF  VT+GL     + I
Sbjct: 215 HGEVTFLISNIQFRPRWPVQYRAFLQGLSKYELVDMDGDAEVRCFPRVTVGLRLDKELSI 274

Query: 189 NPTLLPKPK-TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
            P L+P  + T+ DF  FL   Y      +      KP+L+L++R     R ILN  EV 
Sbjct: 275 VPELVPGGRLTMADFTGFLRETYALPRGAARE-PEKKPRLLLIHRGHY--RRILNEPEVA 331

Query: 248 KAAEELGFDVTIFE-------PEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSV 300
           +AAEE GF+  + E       PE     A   R ++S   ++G+HGAGLT+++FL PG V
Sbjct: 332 RAAEEAGFEAVVTELRGGGDTPEAEVEQA---RVVNSFDVVLGLHGAGLTNAMFLPPGGV 388

Query: 301 LMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
           L+QVVP G  + ++   F +PA  +GL YL+Y +  EESSL+E  G     +K+P +   
Sbjct: 389 LIQVVPYGNMEDIARAEFSEPATDMGLRYLDYSVGAEESSLMETLGPEHPAIKDPASVHR 448

Query: 360 ANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
           + W  +  +YL  QNV++++ RF   L  A    ++
Sbjct: 449 SGWDKVFELYLAKQNVRINVTRFAPTLAQALHHLRR 484


>gi|414876585|tpg|DAA53716.1| TPA: hypothetical protein ZEAMMB73_742502 [Zea mays]
          Length = 341

 Score =  191 bits (484), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 112/342 (32%), Positives = 180/342 (52%), Gaps = 26/342 (7%)

Query: 72  MSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
           M  V +L   +SAPP   C VT+  PA++F+ GG  GNF+H+F D  VPLFI    +  D
Sbjct: 6   MVTVTQLEDKSSAPP---CTVTYNIPAVLFALGGLTGNFWHDFGDVLVPLFIASRRY--D 60

Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQT-----ITHCFQSVTLGLISHGRM 186
            +V   I++    W   Y  +L R +++  ++++        +  CF  VT+G+  H  +
Sbjct: 61  GEVQFLISNMKPWWPAAYKTILQRLSKYDAVDLDGDGDGDAHVVRCFPHVTVGIHMHNGL 120

Query: 187 VINPTLLPKPK-----TLVDFQSFLANAYNENTNTSSSF-------HHTKPKLVLVNRNA 234
            I P   P P      T+ DF  F+   Y    +  +S            P+L+L++R  
Sbjct: 121 SIVPEWAPGPPGGRGLTMADFTRFMREVYALPRDAPASLVREEPGKQSPPPRLLLIHREH 180

Query: 235 RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
              R  +N RE+ +AAE  GF+    +     ++    R ++S   ++GVHGAGLT+S+F
Sbjct: 181 --SRRFMNEREILQAAEAAGFEAVALDLRRDVTVDAQARVVNSFDVLLGVHGAGLTNSVF 238

Query: 295 LRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKN 353
           L PG+VL+QVVP G    ++T+ F  PA+ +GL+YL+Y +  EES+L+E  G     +K+
Sbjct: 239 LPPGAVLVQVVPYGKMDVIATLEFGLPAKEMGLKYLDYVVSAEESTLLEMLGPEHPAIKD 298

Query: 354 PQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
           P +   + W  M   YL  Q+V++D+ RF   L  A+   ++
Sbjct: 299 PDSIHRSGWDKMTEFYLNMQDVRIDVARFAPVLTQAFDHLRQ 340


>gi|357157870|ref|XP_003577941.1| PREDICTED: uncharacterized protein LOC100833625 [Brachypodium
           distachyon]
          Length = 532

 Score =  191 bits (484), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 122/388 (31%), Positives = 199/388 (51%), Gaps = 33/388 (8%)

Query: 25  LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRI-------KTQPYPRKSDKSAMSKVKE 77
           +C + G     P+  T F V     TPS             K +PY RK D   +  V E
Sbjct: 160 VCELTGDIRISPREKTMFFV-----TPSAAGAAALDANGEKKIRPYARK-DTFLLPGVVE 213

Query: 78  LTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVI 135
           +TI +  SA    +C   H  PA+VFS  GY  NFFH+  D  +PLF+T  +H    +V 
Sbjct: 214 VTIKSVPSAEAAPACTRQHDVPAVVFSVAGYTDNFFHDNTDVMIPLFLT-TAHL-RGEVQ 271

Query: 136 LAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP--TLL 193
           L I +    W +K+  LL + + + +IN +      CF+   LGL     ++++P  T  
Sbjct: 272 LLITNFKPWWVKKFTPLLKKLSNYEVINFDKDEEVRCFRQGNLGLYRDRDLILSPHPTRN 331

Query: 194 PKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNRNARVGRTILNLREVKKAA 250
           P+  T+VD+  FL  A+    +  +        +PK++++ R     R +LNL  V    
Sbjct: 332 PRNYTMVDYNRFLRGAFGLPRDAPAVLGEKTSARPKMLMIERKGT--RKLLNLAAVVAMC 389

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHA---MVGVHGAGLTHSLFLRPGSVLMQVVPI 307
           EELGF VT+ E     + AD   F  + +A   ++ VHGAGLT+ +FL  G+V++Q+VP 
Sbjct: 390 EELGFAVTVAE-----AGADVRGFAETVNAADVLLAVHGAGLTNQIFLPTGAVMVQIVPW 444

Query: 308 G-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
           G   W++T ++ +PAR + L Y+EY + +EE++L +++  +  V K+P A     W  + 
Sbjct: 445 GKMDWMATNFYGQPARDMQLRYVEYYVSEEETTLKDRFPRDHYVFKDPMAIHAQGWPALA 504

Query: 367 VYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
             +  Q+V +++ RF+ +L  A  K ++
Sbjct: 505 DIVMKQDVMVNVTRFKPFLLSALDKLQE 532


>gi|125563028|gb|EAZ08408.1| hypothetical protein OsI_30669 [Oryza sativa Indica Group]
          Length = 410

 Score =  191 bits (484), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 118/386 (30%), Positives = 196/386 (50%), Gaps = 26/386 (6%)

Query: 9   SPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSV------GHTDSTPSQPSLRIKTQP 62
           +P K I CD     YD+C I+G A       T   V      G  D +  + S+R +++ 
Sbjct: 25  APRKPI-CDLLDPRYDICEISGDARTMGTNRTILYVPPVGERGLADDS-HEWSIRDQSRK 82

Query: 63  YPRKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF 122
           Y        ++KV   ++   A P   C   H  PA+VF+  G   N +H+F D  +PLF
Sbjct: 83  Y-----LEYINKVTVRSLDAQAAP--GCTSRHAVPAVVFAMNGLTSNPWHDFSDVLIPLF 135

Query: 123 ITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS 182
           IT   +  + +V   ++D    +  KY  +L   +R+ I++ N  +   C+  +T+GL S
Sbjct: 136 ITTRVY--EGEVQFLVSDLQPWFVDKYRLILTNLSRYDIVDFNQDSGVRCYPKITVGLRS 193

Query: 183 HGRMVINPTLLPKPKTLVDFQSFLANAYN-----ENTNTSSSFHHTKPKLVLVNRNARVG 237
           H  + I+P   P+  T++DF+ ++   Y+      +     S    +P+ +L+NR     
Sbjct: 194 HRDLGIDPARTPRNYTMLDFRLYIREVYSLPPAGVDIPFKESSMQRRPRAMLINRGR--T 251

Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
           R  +N +E+  A    GF+V   EP    S+ +  R + SC  ++G HGAGLT+  FLR 
Sbjct: 252 RKFVNFQEIAAAVVAAGFEVVPVEPRRDLSIEEFSRVVDSCDVLMGAHGAGLTNFFFLRT 311

Query: 298 GSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQA 356
            +V++QVVP G  +  S V++  PAR + L  +EY I  EES+L +KYG +   +++P++
Sbjct: 312 NAVMLQVVPWGHMEHPSMVFYGGPAREMRLRDVEYSIAAEESTLYDKYGKDHPAIRDPES 371

Query: 357 FAGANWS-NMRVYLKTQNVKLDIDRF 381
                W   M+ Y   Q++KL++ RF
Sbjct: 372 IHKQGWQFGMKYYWIEQDIKLNVTRF 397


>gi|212275672|ref|NP_001130824.1| uncharacterized protein LOC100191928 [Zea mays]
 gi|194690210|gb|ACF79189.1| unknown [Zea mays]
 gi|413947220|gb|AFW79869.1| hypothetical protein ZEAMMB73_561794 [Zea mays]
          Length = 586

 Score =  191 bits (484), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 118/389 (30%), Positives = 192/389 (49%), Gaps = 49/389 (12%)

Query: 16  CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDSTPSQPSLRIKTQPYPRKSDKSA 71
           CD  S+R  D C ++G        ++   V   G  + T    S RIK  PYPRK+D+  
Sbjct: 234 CDVTSNRRIDWCELDGDVRVVGANASVTLVAPPGADNRTFRAESWRIK--PYPRKADR-- 289

Query: 72  MSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
                                            GY GN+FH F D  +PLF+T   +   
Sbjct: 290 ---------------------------------GYTGNYFHAFTDVILPLFLTARRYA-- 314

Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
            +V L +AD    W  K+  +    +++ +++++      CF+ V +GL SH    I+P 
Sbjct: 315 GEVRLLVADLQPWWVGKFLPVFRSISKYELVDLDRDPRVRCFRHVQVGLTSHADFSIDPR 374

Query: 192 LLPKPKTLVDFQSFLANAYN-ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
             P   +++DF  F+  AY     +  ++    +P+L++V R AR  R  +N  E+ + A
Sbjct: 375 RAPNGYSMLDFTRFMRAAYGLPRGDVVAAAPARRPRLLVVAR-ART-RRFVNTEEIVRGA 432

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
           E +GF+  +   E +  +A      + C A++GVHGAGLT+ +FL  G V++QVVP+G  
Sbjct: 433 EAVGFEAVV--SEGTHEVAPFAELANGCDAIMGVHGAGLTNMVFLPTGGVVIQVVPLGGL 490

Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
                YF  P+  +GL YLEY+I+ EES+LV++Y  +  +  +P       W +++  YL
Sbjct: 491 EFVAGYFRGPSVDMGLRYLEYRIEPEESTLVDQYPRDHPIFTDPNGIKSKGWESLKDAYL 550

Query: 370 KTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
             Q+V+LD++RFR  L++A    +K   K
Sbjct: 551 DKQDVRLDMERFRPTLQEAIAHLRKAKAK 579


>gi|115457236|ref|NP_001052218.1| Os04g0196600 [Oryza sativa Japonica Group]
 gi|38344223|emb|CAE03693.2| OSJNBb0026E15.11 [Oryza sativa Japonica Group]
 gi|113563789|dbj|BAF14132.1| Os04g0196600 [Oryza sativa Japonica Group]
 gi|215768520|dbj|BAH00749.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 550

 Score =  190 bits (483), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 122/384 (31%), Positives = 201/384 (52%), Gaps = 19/384 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKS-DKSAMSK 74
           CD S   YD+C+++G            +V  T  +P       K +PY RK  D      
Sbjct: 170 CDTSFGKYDICVLDGDTRAQGGGGAGAAV-VTLVSPRAAPREWKIKPYSRKYLDGLKPVT 228

Query: 75  VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
           V+ +     APP   C      PA+V   GG  GN++H+F D  VPLFI         +V
Sbjct: 229 VRSVPNPEDAPP---CTTRLNVPAMVIELGGLTGNYWHDFTDVLVPLFIGARRF--GGEV 283

Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIINI---NNQTITHCFQSVTLGLISHGRMVINPT 191
            L + +    W  KY  +  + +RH I+++   +++ +  C+  V +G  S     I+P+
Sbjct: 284 QLLVVNLLPFWVDKYRRIFSQISRHDIVDLEKDDDRGVVRCYPHVVVGYGSRKEFTIDPS 343

Query: 192 L--LPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
           L       T+V+F  FL  +Y+   +      + H  +P+++++ R     R ++NL EV
Sbjct: 344 LDDTGGGYTMVNFTEFLRQSYSLPRDRPIKLGTNHGARPRMMILERTN--SRKLMNLPEV 401

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
             AA   GF+VT+     +++  +  R ++S   MVGVHGAGLT+ +FL  G+VL+Q+VP
Sbjct: 402 AAAARAAGFEVTVAGGRPTSTYDEFAREVNSYDVMVGVHGAGLTNCVFLPTGAVLLQIVP 461

Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
            G  + ++   F +PAR +GL Y+EY I  +ESSL++ +G +  ++K+P A   + W N+
Sbjct: 462 YGRLESIAQTDFGEPARDMGLRYIEYDIAADESSLMDVFGKDHPMIKDPVAVHLSGWGNV 521

Query: 366 -RVYLKTQNVKLDIDRFRIYLKDA 388
              YL  Q+V+++I+RFR +L  A
Sbjct: 522 AEWYLGKQDVRVNIERFRPFLTQA 545


>gi|226498526|ref|NP_001142120.1| uncharacterized protein LOC100274284 [Zea mays]
 gi|194707210|gb|ACF87689.1| unknown [Zea mays]
 gi|413947221|gb|AFW79870.1| hypothetical protein ZEAMMB73_137872 [Zea mays]
          Length = 476

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 131/411 (31%), Positives = 204/411 (49%), Gaps = 35/411 (8%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           + CD S    D+C + G     P  +             + S R+K  P+ RK+D+ A++
Sbjct: 76  VLCDFSSSRSDVCELRGDVRVLPNATIVLH----HPLARRQSWRMK--PHGRKNDRHALA 129

Query: 74  KVKELTITT---SAPPNLS----------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
           +V E+T+T    SA P+ +          C   HT+PA+VFS GGY GN FH+  D  VP
Sbjct: 130 RVTEVTVTVATASASPHHTSGAAAAAAPRCTANHTAPAVVFSVGGYAGNMFHDLTDVLVP 189

Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITH---CFQSVT 177
           LFIT        DV L + D    W  K+  LL   +RH +++++  + +    C+  V 
Sbjct: 190 LFITARRF--GGDVHLLVGDAQPWWLDKFRPLLGGLSRHAVVDMSRGSSSGGVLCYPHVI 247

Query: 178 LGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTS-----SSFHHTKPKLVLV 230
           LGL  H  M ++   T      ++ DF      +Y    + +           +P+L+L+
Sbjct: 248 LGLEFHKEMSVDAARTAGGGEYSMADFTLLARRSYGLPRDKAIRVHGRGGGGVRPRLLLI 307

Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
           +R +   R   N   + +AA  LG++V + EP     LA   R ++SC  +VGVHGAGL 
Sbjct: 308 SRKST--RAFTNAGSIARAAASLGYEVVVGEPARHADLASFARVVNSCDVLVGVHGAGLA 365

Query: 291 HSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDL 349
           + +FL  G+V++QVVP+G    ++   F  PAR  GL Y+ Y I + ES+L  +Y  +  
Sbjct: 366 NLVFLPAGAVVVQVVPLGGLDAMAADDFGAPARDAGLRYVHYGIAEAESTLATRYPRDHR 425

Query: 350 VLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
           VL++P A     W  +R  YL  QNV +D+ RF   L+ A +  ++  D E
Sbjct: 426 VLRDPAAVRSEGWMALRAAYLVGQNVTIDVRRFSGALRRAMELLRQTHDHE 476


>gi|226503944|ref|NP_001140978.1| uncharacterized protein LOC100273057 [Zea mays]
 gi|194702014|gb|ACF85091.1| unknown [Zea mays]
 gi|413953934|gb|AFW86583.1| hypothetical protein ZEAMMB73_356444 [Zea mays]
          Length = 567

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 127/404 (31%), Positives = 200/404 (49%), Gaps = 35/404 (8%)

Query: 3   LRPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFF---SVGHTDSTPSQPSLRIK 59
           + PAA +P K I CD S   YD+C I+G A       T      VG   S   + S+R +
Sbjct: 161 MEPAATAPRKPI-CDLSDPRYDICEISGDARTMGANRTVLYVPPVGELGSDGQEWSIRDQ 219

Query: 60  TQPYPRKSDKSAMSKVKELTITTS-APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCF 118
           ++ +    +K     VK L  + S A P   C   H   ALVF+  G   N +H+F D  
Sbjct: 220 SRKHLEFVNKVT---VKSLNASQSLAAP--ECTSRHAVRALVFAMNGLTSNPWHDFSDVL 274

Query: 119 VPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTL 178
           +PLFIT  +   + +V   ++D    +  KY  +L   +R+ II+ N      C+  VT+
Sbjct: 275 IPLFITTRAL--EGEVQFLVSDLQPWFVDKYRLVLKNLSRYDIIDFNQDGGVRCYPHVTV 332

Query: 179 GLISHGRMVINPTLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHHT-------------- 223
           GL SH  + I+P   P+  T++DF+ ++   Y   +   S  +                 
Sbjct: 333 GLRSHRDLGIDPARAPRNYTMLDFRLYIREIYRLPSAGVSIPYKEANSNAAAAAPGAPAE 392

Query: 224 ----KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCH 279
               KP+L+L+NR     R  +N  E+  A ++ GF+V   EP    S+ D  R + SC 
Sbjct: 393 QLQRKPRLMLINRGRT--RKFVNFPEIVGAVQKAGFEVIPIEPRRDLSVEDFARTVDSCD 450

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEES 338
            ++G HGAGLT+  FLR  +V++QVVP G  +  S  ++  PA+ + L  +EY I   ES
Sbjct: 451 VLMGAHGAGLTNFFFLRTNAVMLQVVPWGHMEHPSMAFYGGPAKEMRLRDVEYSIADVES 510

Query: 339 SLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRF 381
           +L +KYG ++ V+ +P++     W   MR Y   Q+++L++ RF
Sbjct: 511 TLYDKYGKDNPVVSDPESIHKQGWQLGMRYYWIEQDIRLNVTRF 554


>gi|413953933|gb|AFW86582.1| glycosyltransferase [Zea mays]
          Length = 576

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 129/406 (31%), Positives = 202/406 (49%), Gaps = 39/406 (9%)

Query: 3   LRPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFF---SVGHTDSTPSQPSLRIK 59
           + PAA +P K I CD S   YD+C I+G A       T      VG   S   + S+R +
Sbjct: 170 MEPAATAPRKPI-CDLSDPRYDICEISGDARTMGANRTVLYVPPVGELGSDGQEWSIRDQ 228

Query: 60  TQPYPRKSDKSAMSKVKELTITTS-APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCF 118
           ++ +    +K     VK L  + S A P   C   H   ALVF+  G   N +H+F D  
Sbjct: 229 SRKHLEFVNKVT---VKSLNASQSLAAP--ECTSRHAVRALVFAMNGLTSNPWHDFSDVL 283

Query: 119 VPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTL 178
           +PLFIT  +   + +V   ++D    +  KY  +L   +R+ II+ N      C+  VT+
Sbjct: 284 IPLFITTRAL--EGEVQFLVSDLQPWFVDKYRLVLKNLSRYDIIDFNQDGGVRCYPHVTV 341

Query: 179 GLISHGRMVINPTLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHHT-------------- 223
           GL SH  + I+P   P+  T++DF+ ++   Y   +   S  +                 
Sbjct: 342 GLRSHRDLGIDPARAPRNYTMLDFRLYIREIYRLPSAGVSIPYKEANSNAAAAAPGAPAE 401

Query: 224 ----KPKLVLVNRNARVGRT--ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHS 277
               KP+L+L+NR    GRT   +N  E+  A ++ GF+V   EP    S+ D  R + S
Sbjct: 402 QLQRKPRLMLINR----GRTRKFVNFPEIVGAVQKAGFEVIPIEPRRDLSVEDFARTVDS 457

Query: 278 CHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQE 336
           C  ++G HGAGLT+  FLR  +V++QVVP G  +  S  ++  PA+ + L  +EY I   
Sbjct: 458 CDVLMGAHGAGLTNFFFLRTNAVMLQVVPWGHMEHPSMAFYGGPAKEMRLRDVEYSIADV 517

Query: 337 ESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRF 381
           ES+L +KYG ++ V+ +P++     W   MR Y   Q+++L++ RF
Sbjct: 518 ESTLYDKYGKDNPVVSDPESIHKQGWQLGMRYYWIEQDIRLNVTRF 563


>gi|218194401|gb|EEC76828.1| hypothetical protein OsI_14971 [Oryza sativa Indica Group]
          Length = 548

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 121/384 (31%), Positives = 200/384 (52%), Gaps = 19/384 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKS-DKSAMSK 74
           CD S   YD+C+++G            +   T  +P       K +PY RK  D      
Sbjct: 168 CDTSFGKYDICVLDGDTRAQGGGGAGAAA-VTLVSPRAAPREWKIKPYSRKYLDGLKPVT 226

Query: 75  VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
           V+ +     APP   C      PA+V   GG  GN++H+F D  VPLFI         +V
Sbjct: 227 VRSVPNPEDAPP---CTTRLNVPAMVIELGGLTGNYWHDFTDVLVPLFIGARRF--GGEV 281

Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIINI---NNQTITHCFQSVTLGLISHGRMVINPT 191
            L + +    W  KY  +  + +RH I+++   +++ +  C+  V +G  S     I+P+
Sbjct: 282 QLLVVNLLPFWVDKYRRIFSQISRHDIVDLEKDDDRGVVRCYPHVVVGYGSRKEFTIDPS 341

Query: 192 L--LPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
           L       T+V+F  FL  +Y+   +      + H  +P+++++ R     R ++NL EV
Sbjct: 342 LDDTGGGYTMVNFTEFLRQSYSLPRDRPIKLGTNHGARPRMMILERTN--SRKLMNLPEV 399

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
             AA   GF+VT+     +++  +  R ++S   MVGVHGAGLT+ +FL  G+VL+Q+VP
Sbjct: 400 AAAARAAGFEVTVAGGRPTSTYDEFAREVNSYDVMVGVHGAGLTNCVFLPTGAVLLQIVP 459

Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
            G  + ++   F +PAR +GL Y+EY I  +ESSL++ +G +  ++K+P A   + W N+
Sbjct: 460 YGRLESIAQTDFGEPARDMGLRYIEYDIAADESSLMDVFGKDHPMIKDPVAVHLSGWGNV 519

Query: 366 -RVYLKTQNVKLDIDRFRIYLKDA 388
              YL  Q+V+++I+RFR +L  A
Sbjct: 520 AEWYLGKQDVRVNIERFRPFLTQA 543


>gi|116317832|emb|CAH65867.1| OSIGBa0132I10.3 [Oryza sativa Indica Group]
          Length = 550

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 121/384 (31%), Positives = 200/384 (52%), Gaps = 19/384 (4%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKS-DKSAMSK 74
           CD S   YD+C+++G            +   T  +P       K +PY RK  D      
Sbjct: 170 CDTSFGKYDICVLDGDTRAQGGGGAGAAA-VTLVSPRAAPREWKIKPYSRKYLDGLKPVT 228

Query: 75  VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
           V+ +     APP   C      PA+V   GG  GN++H+F D  VPLFI         +V
Sbjct: 229 VRSVPNPEDAPP---CTTRLNVPAMVIELGGLTGNYWHDFTDVLVPLFIGARRF--GGEV 283

Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIINI---NNQTITHCFQSVTLGLISHGRMVINPT 191
            L + +    W  KY  +  + +RH I+++   +++ +  C+  V +G  S     I+P+
Sbjct: 284 QLLVVNLLPFWVDKYRRIFSQISRHDIVDLEKDDDRGVVRCYPHVVVGYGSRKEFTIDPS 343

Query: 192 L--LPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
           L       T+V+F  FL  +Y+   +      + H  +P+++++ R     R ++NL EV
Sbjct: 344 LDDTGGGYTMVNFTEFLRQSYSLPRDRPIKLGTNHGARPRMMILERTN--SRKLMNLPEV 401

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
             AA   GF+VT+     +++  +  R ++S   MVGVHGAGLT+ +FL  G+VL+Q+VP
Sbjct: 402 AAAARAAGFEVTVAGGRPTSTYDEFAREVNSYDVMVGVHGAGLTNCVFLPTGAVLLQIVP 461

Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
            G  + ++   F +PAR +GL Y+EY I  +ESSL++ +G +  ++K+P A   + W N+
Sbjct: 462 YGRLESIAQTDFGEPARDMGLRYIEYDIAADESSLMDVFGKDHPMIKDPVAVHLSGWGNV 521

Query: 366 -RVYLKTQNVKLDIDRFRIYLKDA 388
              YL  Q+V+++I+RFR +L  A
Sbjct: 522 AEWYLGKQDVRVNIERFRPFLTQA 545


>gi|242052297|ref|XP_002455294.1| hypothetical protein SORBIDRAFT_03g007950 [Sorghum bicolor]
 gi|241927269|gb|EES00414.1| hypothetical protein SORBIDRAFT_03g007950 [Sorghum bicolor]
          Length = 469

 Score =  188 bits (478), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 126/384 (32%), Positives = 199/384 (51%), Gaps = 20/384 (5%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTS-TFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           + C+   RS D C ++G    D +T+ T  SV    ++ S+ S     +PY R+      
Sbjct: 90  VVCNMEGRS-DTCEVDG----DVRTNGTALSVTLVPASRSEHS-EWMIRPYSRRFASLRK 143

Query: 73  SKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
             V +L    +A P   C  TH  PA++F+ GGY GN++H++ D  VPLF+    +  + 
Sbjct: 144 VTVTQLQDRAAAAP---CTATHDMPAVLFAIGGYAGNYWHDYADILVPLFVASRRYNGEV 200

Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
             +++ A    QW  KY   L   +RH +++++      CF  VT+GL       I P L
Sbjct: 201 KFLISNAQFQPQWLVKYRAFLRGLSRHDVVDMDGDAEVRCFPHVTVGLRLDKEFSIVPEL 260

Query: 193 LPKPK--TLVDFQSFLANAYN--ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKK 248
           +P  +  T+ DF  FL   Y        +S     KP+L+L++R     R I N  EV +
Sbjct: 261 VPGDRRLTMADFTRFLRETYALPRGAVAASRSRGQKPRLLLIHRGHY--RRITNEAEVAR 318

Query: 249 AAEELGFDVTIFE--PEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
           AAE  GF+  + E         A+  R +++   ++G+HGAGLT+++FL PG VL+QVVP
Sbjct: 319 AAEAAGFEAVVAELGGGGGGDEAEQARVVNAFDVVLGMHGAGLTNAVFLPPGGVLIQVVP 378

Query: 307 IG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
            G  + ++   F +PA  +GLEYL+Y +  EESSL+E  G     +K+P +   + W  +
Sbjct: 379 YGKMEHIARAEFGEPAADMGLEYLDYSVSAEESSLMETLGPEHPAVKDPDSVHRSGWGQV 438

Query: 366 -RVYLKTQNVKLDIDRFRIYLKDA 388
             +YL  QNV++++ RF   L  A
Sbjct: 439 FELYLAKQNVRVNVTRFAPTLAQA 462


>gi|326503016|dbj|BAJ99133.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 574

 Score =  188 bits (477), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 119/408 (29%), Positives = 201/408 (49%), Gaps = 33/408 (8%)

Query: 9   SPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSV---GHTDSTPSQPSLRIKTQPYPR 65
           +P K + CD S   YD+C I+G A       T   V   G   +   + +++ +++ Y  
Sbjct: 178 APPKPV-CDLSDPRYDICEISGDARAIGGNRTVLYVPPAGERRADGQEWAIKDQSRKYLE 236

Query: 66  KSDKSAMSKVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
             DK     VK L+   S  AP    C   H  PA+VF+  G   N +H+F D  VPLFI
Sbjct: 237 YIDKVT---VKTLSAAQSLVAP---ECTSRHAVPAVVFAMNGLTSNPWHDFSDVIVPLFI 290

Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
           T  ++  D +V   + D    +  KY  +L   +R+ I++ N      C   + +GL SH
Sbjct: 291 TARAY--DGEVQFLVTDLQPWFVDKYRLILANLSRYDIVDFNKDAGVRCHPRIVVGLRSH 348

Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYN---------------ENTNTSSSFHHTKPKLV 228
             + I+P   P+  TL+DF+ ++ + ++                N N ++     KP+L+
Sbjct: 349 RDLGIDPARTPRNYTLLDFRMYIRDIFSLPPDGLGIPYKQKQEANRNATAGTEKRKPRLM 408

Query: 229 LVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAG 288
           L+NR     R  +N+ E+  A E  GF+V + EP     L +  + + SC  ++G HGAG
Sbjct: 409 LINRGR--NRKFVNIPEISAAVEAAGFEVVVVEPRRDLRLEEFSKAVDSCDVLMGAHGAG 466

Query: 289 LTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
           LT+  FLR  + ++QVVP G  +  + +++   A+ + L  +EY I  EES+L +KYG +
Sbjct: 467 LTNFFFLRTNATMLQVVPWGHMEHSAMIFYGVQAKEMRLRDVEYSITAEESTLYDKYGKD 526

Query: 348 DLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
              + +P++     W   M+ Y   Q+++L++ RF   L    + A++
Sbjct: 527 HPAVSDPESIHKQGWQLGMKYYWLEQDIRLNVTRFAPTLHRVLRTAEE 574


>gi|115468072|ref|NP_001057635.1| Os06g0475400 [Oryza sativa Japonica Group]
 gi|51090816|dbj|BAD35293.1| putative HGA1 [Oryza sativa Japonica Group]
 gi|113595675|dbj|BAF19549.1| Os06g0475400 [Oryza sativa Japonica Group]
 gi|215734833|dbj|BAG95555.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222635589|gb|EEE65721.1| hypothetical protein OsJ_21358 [Oryza sativa Japonica Group]
          Length = 534

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 119/388 (30%), Positives = 197/388 (50%), Gaps = 30/388 (7%)

Query: 9   SPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSV------GHTDSTPSQPSLRIKTQP 62
           +P K I CD     YD+C I+G A       T   V      G  D +  + S+R +++ 
Sbjct: 149 APRKPI-CDLLDPRYDICEISGDARTMGTNRTILYVPPVGERGLADDS-HEWSIRDQSRK 206

Query: 63  YPRKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF 122
           Y        ++KV   ++   A P   C   H  PA+VF+  G   N +H+F D  +PLF
Sbjct: 207 Y-----LEYINKVTVRSLDAQAAPG--CTSRHAVPAVVFAMNGLTSNPWHDFSDVLIPLF 259

Query: 123 ITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS 182
           IT   +  + +V   ++D    +  KY  +L   +R+ I++ N  +   C+  +T+GL S
Sbjct: 260 ITTRVY--EGEVQFLVSDLQPWFVDKYRLILTNLSRYDIVDFNQDSDVRCYPKITVGLRS 317

Query: 183 HGRMVINPTLLPKPKTLVDFQSFLANAYN-----ENTNTSSSFHHTKPKLVLVNRNARVG 237
           H  + I+P    +  T++DF+ ++   Y+      +     S    +P+ +L+NR    G
Sbjct: 318 HRDLGIDPARTQRNYTMLDFRLYIREVYSLPPAGVDIPFKESSMQRRPRAMLINR----G 373

Query: 238 RT--ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
           RT   +N +E+  A    GF+V   EP    S+ +  R + SC  ++G HGAGLT+  FL
Sbjct: 374 RTRKFVNFQEIAAAVVAAGFEVVPVEPRRDLSIEEFSRVVDSCDVLMGAHGAGLTNFFFL 433

Query: 296 RPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
           R  +V++QVVP G  +  S V++  PAR + L  +EY I  EES+L +KYG +   +++P
Sbjct: 434 RTNAVMLQVVPWGHMEHPSMVFYGGPAREMRLRDVEYSIAAEESTLYDKYGKDHPAIRDP 493

Query: 355 QAFAGANWS-NMRVYLKTQNVKLDIDRF 381
           ++     W   M+ Y   Q++KL++ RF
Sbjct: 494 ESIHKQGWQFGMKYYWIEQDIKLNVTRF 521


>gi|219884553|gb|ACL52651.1| unknown [Zea mays]
          Length = 531

 Score =  185 bits (469), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 112/394 (28%), Positives = 197/394 (50%), Gaps = 19/394 (4%)

Query: 9   SPAKSITCD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKS 67
           +PAK   CD +   + D+C I+G         T       +S  +      + +PY RK 
Sbjct: 141 APAKKPACDIQGPWASDVCDIDGDVRIHGSAGTVLIAPSIESGGNPNPQEWRVRPYSRKH 200

Query: 68  DKSAMS-KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN 126
                   V+EL  +  AP   +C V    PA+V + GG  GN++H+F D  +PL++   
Sbjct: 201 QGGIKEVTVRELASSADAP---ACDVRSPVPAMVLAMGGLTGNYWHDFSDIMIPLYLQA- 256

Query: 127 SHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRM 186
           + F + +V L + +    +  KY  +L R +RH I++++      CF    +G+  H   
Sbjct: 257 ARF-EGEVQLVVENIQPWYVGKYRAILRRLSRHDIVDMDRDDRVRCFPGAVVGIRMHKEF 315

Query: 187 VINPTLLPKPKTLVDFQSFLANAYNENTNTSSSF--------HHTKPKLVLVNRNARVGR 238
            I+P   P   ++ +F +FL   Y+      +             +P++++++R  R  R
Sbjct: 316 SIDPAREPLGHSMPEFTAFLRETYSLPRAAPARLAGADGEEDERARPRMMVISR--RHPR 373

Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
            ++NL  V   A  +GF+V I +P  +  + +  + +++   +VGVHGAGLT+SLFL  G
Sbjct: 374 KLVNLDAVVALARRVGFEVVIGDPPFNVDVGEFAKEVNAVDVLVGVHGAGLTNSLFLPTG 433

Query: 299 SVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
           +V +Q+ P G  + +  V F  PA  +GL+Y+ Y    EE++LV+  G +   +K+P++ 
Sbjct: 434 AVFIQINPYGKMEHIGEVDFGIPAVDMGLKYIAYSAGVEETTLVDTLGRDHPAVKDPESI 493

Query: 358 AGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYK 390
             + W  +   YL  Q+V+LD+ RF   L+ A +
Sbjct: 494 HRSGWGKVAEYYLGKQDVRLDLQRFEPVLRKAMQ 527


>gi|226529433|ref|NP_001140834.1| hypothetical protein [Zea mays]
 gi|194690360|gb|ACF79264.1| unknown [Zea mays]
 gi|194701354|gb|ACF84761.1| unknown [Zea mays]
 gi|224031103|gb|ACN34627.1| unknown [Zea mays]
 gi|414876590|tpg|DAA53721.1| TPA: hypothetical protein ZEAMMB73_455614 [Zea mays]
          Length = 531

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 112/394 (28%), Positives = 197/394 (50%), Gaps = 19/394 (4%)

Query: 9   SPAKSITCD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKS 67
           +PAK   CD +   + D+C I+G         T       +S  +      + +PY RK 
Sbjct: 141 APAKKPACDIQGPWASDVCDIDGDVRIHGSAGTVLIAPSIESGGNPNPQEWRVRPYSRKH 200

Query: 68  DKSAMS-KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN 126
                   V+EL  +  AP   +C V    PA+V + GG  GN++H+F D  +PL++   
Sbjct: 201 QGGIKEVTVRELASSADAP---ACDVRSPVPAMVLAMGGLTGNYWHDFSDIMIPLYLQA- 256

Query: 127 SHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRM 186
           + F + +V L + +    +  KY  +L R +RH I++++      CF    +G+  H   
Sbjct: 257 ARF-EGEVQLVVENIQPWYVGKYRAILRRLSRHDIVDMDRDDRVRCFPGAVVGIRMHKEF 315

Query: 187 VINPTLLPKPKTLVDFQSFLANAYNENTNTSSSF--------HHTKPKLVLVNRNARVGR 238
            I+P   P   ++ +F +FL   Y+      +             +P++++++R  R  R
Sbjct: 316 SIDPAREPLGHSMPEFTAFLRETYSLPRAAPARLAGADGEEDERARPRMMVISR--RHPR 373

Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
            ++NL  V   A  +GF+V I +P  +  + +  + +++   +VGVHGAGLT+SLFL  G
Sbjct: 374 KLVNLDAVVALARRVGFEVVIGDPPFNVDVGEFAKEVNAVDVLVGVHGAGLTNSLFLPTG 433

Query: 299 SVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
           +V +Q+ P G  + +  V F  PA  +GL+Y+ Y    EE++LV+  G +   +K+P++ 
Sbjct: 434 AVFIQINPYGKMEHIGEVDFGIPAVDMGLKYIAYSAGVEETTLVDTLGRDHPAVKDPESI 493

Query: 358 AGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYK 390
             + W  +   YL  Q+V+LD+ RF   L+ A +
Sbjct: 494 HRSGWGKVAEYYLGKQDVRLDLQRFEPVLRKAMQ 527


>gi|414876582|tpg|DAA53713.1| TPA: hypothetical protein ZEAMMB73_516543 [Zea mays]
          Length = 465

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 128/397 (32%), Positives = 202/397 (50%), Gaps = 15/397 (3%)

Query: 5   PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
           P+    A  + C    R  D C ++G    + K    +SV     +        K +PY 
Sbjct: 77  PSKTEKANKVVCSTEERLSDYCEVDGDVRVNGKA---WSVDIVPPSGWSERREWKIRPYS 133

Query: 65  RKSDKSAMS-KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
           R+S  +     V +L    +APP   C VTH  P +VF+ GGY+GN FH+  D  +PLF+
Sbjct: 134 RRSAANVDRLNVTQLQDPAAAPP---CTVTHHVPGVVFALGGYSGNAFHDHADVLLPLFL 190

Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
             +  +  +   L I      W  KY   L R +R+ ++N++      CF  +T+GL  H
Sbjct: 191 A-SLRYGREVQFLVINRVQPWWLGKYRLALRRLSRYDVVNLDGDAHVRCFPHLTVGLRLH 249

Query: 184 GRMVINPTLLPKPK--TLVDFQSFLANAYNENTNTSSSFHHTK-PKLVLVNRNARVGRTI 240
               + P  +P  +  ++ DF  FL  AY        S    K P+L+L+ R     R  
Sbjct: 250 KDFGVVPEWVPGKRRVSMPDFTRFLREAYALPRGAPVSREPGKRPRLMLIQRQR--SRRF 307

Query: 241 LNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSV 300
           LN  E+ +AAE  GF+V + E     ++ +  R ++S   MVG+HGAG+T+ +FL PG V
Sbjct: 308 LNGEEMARAAEAAGFEVVVTELMLDAAVDEQARVVNSFDVMVGIHGAGMTNEVFLPPGGV 367

Query: 301 LMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
           L+QVVP G    ++ + + +PA  +GL+YL Y +  EESSL+E  G +   +K+P +   
Sbjct: 368 LIQVVPWGKLDLMARIEYGEPATDMGLKYLCYNVTLEESSLLELLGRDHPAIKDPDSVHR 427

Query: 360 ANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
           + W+ M  +Y+  Q+V+LDI RF + L +A    ++L
Sbjct: 428 SGWAAMYDIYMTKQDVRLDITRFALTLAEAMDHLRRL 464


>gi|125539258|gb|EAY85653.1| hypothetical protein OsI_07023 [Oryza sativa Indica Group]
          Length = 481

 Score =  184 bits (467), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 125/393 (31%), Positives = 199/393 (50%), Gaps = 51/393 (12%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           C  S R  D C   G      ++ T         +P +   ++K  PY RK D  A+S V
Sbjct: 122 CYESSRRSDTCEATGDVRVHGRSQTIHI------SPLEQEWKVK--PYCRKHDAFALSHV 173

Query: 76  KE-----LTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
           KE     L+   + P    C V  ++ A V S GG+ GN FH++ D  +P FIT + HF 
Sbjct: 174 KEWALRPLSTADTYPAVPHCTVNSSATAFVLSTGGFTGNPFHDYTDVLIPAFITAH-HFA 232

Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
            + V   ++     W  +Y ++  R +++ +++I+N           +G           
Sbjct: 233 GE-VQFLVSSFKSWWTNRYMQIFQRLSKYELVDIDND------DEDAVGY---------- 275

Query: 191 TLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLRE 245
                  ++VDF++ L  A   E      S        +P+L++++R  R  R  LN R 
Sbjct: 276 -------SMVDFRTMLRGALGLERAVAEPSGDRWDIRRRPRLLIISR--RNSRAFLNERA 326

Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           +   A  LGFDV + EP+ ST ++   R ++S   MVGVHGAGLT+ +FL  G+VL+QVV
Sbjct: 327 MADMAMSLGFDVRLGEPDISTDVSKFARLVNSADVMVGVHGAGLTNMVFLPAGAVLIQVV 386

Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
           P G  +WL+   F++PA  + + YLEY I+ +E++L E+Y  +D VLK+P +     W+ 
Sbjct: 387 PYGGLEWLARGTFKEPAADMQIHYLEYMIQLDETTLSEQYPKDDPVLKDPYSIHKQGWNA 446

Query: 365 MR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
           ++ VYL  QNV+  + R    LK+ + +A KL+
Sbjct: 447 LKMVYLDKQNVRPHLGR----LKNTFMEALKLL 475


>gi|363543453|ref|NP_001241737.1| glycosyltransferase [Zea mays]
 gi|195620094|gb|ACG31877.1| glycosyltransferase [Zea mays]
          Length = 577

 Score =  184 bits (467), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 127/407 (31%), Positives = 198/407 (48%), Gaps = 41/407 (10%)

Query: 3   LRPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFF---SVGHTDSTPSQPSLRIK 59
           + PAA +P K I CD S   YD+C I+G A       T      VG   S   + S+R +
Sbjct: 171 MEPAATAPRKPI-CDLSDPRYDICEISGDARTMGANRTVLYVPPVGELGSDGQEWSIRDQ 229

Query: 60  TQPYPRKSDKSAMSKVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDC 117
           ++ +    +K     VK L  + S  AP    C   H   ALVF+  G   N +H+F D 
Sbjct: 230 SRKHLEFVNKVT---VKSLNASQSLTAP---ECTSRHAVRALVFAMNGLTSNPWHDFSDV 283

Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
            +PLFIT  +   + +V   ++D    +  KY  +L   +R+ II+ N      C+  VT
Sbjct: 284 LIPLFITTRAL--EGEVQFLVSDLQPWFVDKYRLVLKNLSRYNIIDFNQDGGVRCYPHVT 341

Query: 178 LGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN-------------------ENTNTSS 218
           +GL SH  + I+P    +  T++DF+ ++   Y                          +
Sbjct: 342 VGLRSHRDLGIDPARTARNYTMLDFRLYIREIYRLPPAGVSIPYKEANSNAAAAAPGAPA 401

Query: 219 SFHHTKPKLVLVNRNARVGRT--ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIH 276
                KP+L+L+NR    GRT   +N  E+  A +  GF+V   EP    S+ D  R + 
Sbjct: 402 EQLQRKPRLMLINR----GRTRKFVNFPEIVGAVQNAGFEVIPIEPRRDLSVEDFARTVD 457

Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQ 335
           SC  ++G HGAGLT+  FLR  +V++QVVP G  +  S  ++  PA+ + L  +EY I  
Sbjct: 458 SCDVLMGAHGAGLTNFFFLRTNAVMLQVVPWGHMEHPSMAFYGGPAKEMRLRDVEYSIAD 517

Query: 336 EESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRF 381
            ES+L +KYG ++ V+ +P++     W   MR Y   Q+++L++ RF
Sbjct: 518 VESTLYDKYGKDNPVVSDPESIHKQGWQLGMRYYWIEQDIRLNVTRF 564


>gi|357131990|ref|XP_003567616.1| PREDICTED: uncharacterized glycosyltransferase AGO61-like
           [Brachypodium distachyon]
          Length = 518

 Score =  184 bits (467), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 119/395 (30%), Positives = 201/395 (50%), Gaps = 18/395 (4%)

Query: 5   PAALSPAKSITCD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQP 62
           P   +PAK   CD +   + D+C I+          T       +S  + P+ +  +   
Sbjct: 125 PDESAPAKKPACDIQGPWASDVCAIDAGVRIQGSARTVLITPPIESGGANPNPQSWQIVA 184

Query: 63  YPRKSDKSAMS-KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
           Y RK     +   V+EL     AP   +C VT   PA+VF+ GG  GN++H+F D  +PL
Sbjct: 185 YSRKHQAGMIPITVRELATAAEAP---ACDVTSEVPAMVFAMGGLTGNYWHDFSDVMIPL 241

Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF-QSVTLGL 180
           ++   S F   +V L + +    +A KY ++L + +++ II+++N     C+ +   +G+
Sbjct: 242 YLQ-ASKF-QGEVQLVVTNLQPWYAGKYRQILGKLSKYQIIDMDNDKQVRCYPRGSVVGI 299

Query: 181 ISHGRMVINPTLLPKPKTLVDFQSFLANAYN-----ENTNTSSSFHHTKPKLVLVNRNAR 235
             H    I+P   P   ++ +F +FL + ++          +      KP++++++R  R
Sbjct: 300 RMHKEFSIDPEKAPTGHSMPEFTAFLRDVFSLPRAKPTPPAAIVSGEKKPRMMIISR--R 357

Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
             R ++N+  VK  AE +GF+V I +P  S  +      +++   ++GVHGAGLT+SLFL
Sbjct: 358 HPRALVNVAAVKAMAERVGFEVVIGDPPFSQDVGAFAAEVNTADVLLGVHGAGLTNSLFL 417

Query: 296 RPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
             G+V +Q+VP G  + ++   F  PA  +GL Y+ Y    EESSLVE  G   + + +P
Sbjct: 418 PTGAVFIQIVPYGKMEHIAETDFGIPAFDMGLHYVAYSAGVEESSLVETLGRGHVAVADP 477

Query: 355 QAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
           +A   + W  +   YL  Q+VKLD+ RF   L  A
Sbjct: 478 EAVHRSGWDKVAEYYLGRQDVKLDLARFEPVLLKA 512


>gi|218187404|gb|EEC69831.1| hypothetical protein OsI_00153 [Oryza sativa Indica Group]
          Length = 534

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 115/338 (34%), Positives = 177/338 (52%), Gaps = 26/338 (7%)

Query: 7   ALSPAKSITCDRSHRSYD-------LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIK 59
           A  P   I+CD   +          +C ++G     P TS   SV  T       +   +
Sbjct: 163 APKPKSKISCDDKSKDEGFPYARPIVCHLSGDVRVSPATS---SVTLTMPLQQGEAAARR 219

Query: 60  TQPYPRKSDKSAMSKVKELTITTSAPPN--LSCGVTHTSPALVFSAGGYNGNFFHEFMDC 117
            +PY R+ D   +  V+E+ IT++A      SC V+H  PA++FS GGY GNFFH+  D 
Sbjct: 220 IRPYARRDD-FLLPLVREVAITSAASEGDAPSCNVSHGVPAVIFSIGGYTGNFFHDMADV 278

Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
            VPL++T   HF  + V L +A+    W +KY  +L R +   +++ ++    HCF  V 
Sbjct: 279 LVPLYLT-TFHFKGK-VQLFVANYKQWWIQKYKPVLRRLSHRAVVDFDSDGDVHCFDHVI 336

Query: 178 LGLISHGRMVIN--PTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNR 232
           +GL+    +++   PT  PK  T+VDF  FL +AY    +       T   KP++++++R
Sbjct: 337 VGLVRDRDLILGQHPTRNPKGYTMVDFTRFLRHAYGLRRDKPMVLGETSGKKPRMLIISR 396

Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGL 289
             R  R +LNLR+V   A ELGF+V + E           RF   ++SC  +VGVHGAGL
Sbjct: 397 --RRTRKLLNLRQVAAMARELGFEVVVSEAGVGGGSGGVKRFASAVNSCDVLVGVHGAGL 454

Query: 290 THSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGL 326
           T+  FL  G V++Q+VP G  +W++T ++  PA  + L
Sbjct: 455 TNQAFLPRGGVVVQIVPWGRMEWMATNFYGAPAAAMEL 492


>gi|222617641|gb|EEE53773.1| hypothetical protein OsJ_00163 [Oryza sativa Japonica Group]
          Length = 514

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 125/392 (31%), Positives = 187/392 (47%), Gaps = 68/392 (17%)

Query: 10  PAKSITCDRSHRSYD-------LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQP 62
           P   I+CD   +          +C ++G     P TS   SV  T       +   + +P
Sbjct: 166 PKSKISCDDKSKDEGFPYARPIVCHLSGDVRVSPATS---SVTLTMPLQQGEAAARRIRP 222

Query: 63  YPRKSDKSAMSKVKELTITTSAPPN--LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
           Y R+ D   +  V+E+ IT++A      SC V+H  PA++FS GGY GNFFH+  D  VP
Sbjct: 223 YARRDD-FLLPLVREVAITSAASEGDAPSCNVSHGVPAVIFSIGGYTGNFFHDMADVLVP 281

Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
           L++T   HF  + V L +A+        Y + + R T                       
Sbjct: 282 LYLT-TFHFKGK-VQLFVAN--------YKQCIRRGT----------------------- 308

Query: 181 ISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVG 237
                        P+   +VDF  FL +AY    +       T   KP++++++R  R  
Sbjct: 309 -------------PRGTPMVDFTRFLRHAYGLRRDKPMVLGETSGKKPRMLIISR--RRT 353

Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHSLF 294
           R +LNLR+V   A ELGF+V + E           RF   ++SC  +VGVHGAGLT+  F
Sbjct: 354 RKLLNLRQVAAMARELGFEVVVSEAGVGGGSGGVKRFASAVNSCDVLVGVHGAGLTNQAF 413

Query: 295 LRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKN 353
           L  G V++Q+VP G  +W++T ++  PA  + L Y+EY +  EESSL  +Y     V ++
Sbjct: 414 LPRGGVVVQIVPWGRMEWMATNFYGAPAAAMELRYVEYHVAAEESSLARRYPREHAVFRD 473

Query: 354 PQAFAGANWSNMRVYLKTQNVKLDIDRFRIYL 385
           P A  G  W  +   + TQ+VKL++ RFR  L
Sbjct: 474 PMAIHGQGWKALADIVMTQDVKLNLRRFRPTL 505


>gi|115434210|ref|NP_001041863.1| Os01g0119000 [Oryza sativa Japonica Group]
 gi|11034626|dbj|BAB17150.1| putative HGA6 [Oryza sativa Japonica Group]
 gi|14090226|dbj|BAB55487.1| putative HGA6 [Oryza sativa Japonica Group]
 gi|113531394|dbj|BAF03777.1| Os01g0119000 [Oryza sativa Japonica Group]
 gi|215740963|dbj|BAG97458.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 482

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 108/329 (32%), Positives = 183/329 (55%), Gaps = 22/329 (6%)

Query: 59  KTQPYPRKSDKSAMSKVKELTIT-----TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHE 113
           K QPYPR++    +S + E+T+T      +AP   +C VTH  P +VF+ GG  GN++H+
Sbjct: 148 KIQPYPRRT----VSGIAEVTVTRQQDRAAAP---ACTVTHGVPGVVFALGGLTGNYWHD 200

Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
           F D  VPLF+    +    +V   +++    W  KY  ++ R +R+  ++++  T   CF
Sbjct: 201 FSDVLVPLFVASRRY--GGEVQFLVSNIQPWWLGKYEAVVRRLSRYDAVDLDRDTEVRCF 258

Query: 174 QSVTLGLISHGRMVINPTLLPKPK--TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVN 231
           + V +GL  H    + P L P  +  T+ DF +FL + Y      ++     +   ++V 
Sbjct: 259 RRVAVGLRMHKEFSVKPELAPGGQRLTMADFAAFLRDTYALPRAAAAG---ARRPRLVVI 315

Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
           R A   R I+N+ EV +AAE  GF+  +  P     + +  R +++  AMVGVHGAGLT+
Sbjct: 316 RRAHY-RKIVNMDEVVRAAEAAGFEAAVMSPRFDEPVEEVARKVNAFDAMVGVHGAGLTN 374

Query: 292 SLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
           ++FL  G+V++QVVP G  + ++   F +P   +GL Y+EY +  +ES+L+E  G    V
Sbjct: 375 AVFLPAGAVVIQVVPYGRLERMARADFGEPVADMGLRYMEYSVAADESTLLEMLGPEHQV 434

Query: 351 LKNPQAFAGANWSNM-RVYLKTQNVKLDI 378
           +K+P+A   + W  +   YL  Q+V++++
Sbjct: 435 VKDPEAVHRSGWDKVAEYYLGKQDVRINV 463


>gi|125568815|gb|EAZ10330.1| hypothetical protein OsJ_00165 [Oryza sativa Japonica Group]
          Length = 482

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 108/329 (32%), Positives = 183/329 (55%), Gaps = 22/329 (6%)

Query: 59  KTQPYPRKSDKSAMSKVKELTIT-----TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHE 113
           K QPYPR++    +S + E+T+T      +AP   +C VTH  P +VF+ GG  GN++H+
Sbjct: 148 KIQPYPRRT----VSGIAEVTVTRQQDRAAAP---ACTVTHGVPGVVFALGGLTGNYWHD 200

Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
           F D  VPLF+    +    +V   +++    W  KY  ++ R +R+  ++++  T   CF
Sbjct: 201 FSDVLVPLFVASRRY--GGEVQFLVSNIQPWWLGKYEAVVRRLSRYDAVDLDRDTEVRCF 258

Query: 174 QSVTLGLISHGRMVINPTLLPKPK--TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVN 231
           + V +GL  H    + P L P  +  T+ DF +FL + Y      ++     +   ++V 
Sbjct: 259 RRVAVGLRMHKEFSVKPELAPGGQRLTMADFAAFLRDTYALPRAAAAG---ARRPRLVVI 315

Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
           R A   R I+N+ EV +AAE  GF+  +  P     + +  R +++  AMVGVHGAGLT+
Sbjct: 316 RRAHY-RKIVNMDEVVRAAEAAGFEAAVMSPRFDEPVEEVARKVNAFDAMVGVHGAGLTN 374

Query: 292 SLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
           ++FL  G+V++QVVP G  + ++   F +P   +GL Y+EY +  +ES+L+E  G    V
Sbjct: 375 AVFLPAGAVVIQVVPYGRLERMARADFGEPVADMGLRYMEYSVAADESTLLEMLGPEHQV 434

Query: 351 LKNPQAFAGANWSNM-RVYLKTQNVKLDI 378
           +K+P+A   + W  +   YL  Q+V++++
Sbjct: 435 VKDPEAVHRSGWDKVAEYYLGKQDVRINV 463


>gi|125524186|gb|EAY72300.1| hypothetical protein OsI_00155 [Oryza sativa Indica Group]
          Length = 443

 Score =  178 bits (452), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 107/329 (32%), Positives = 183/329 (55%), Gaps = 22/329 (6%)

Query: 59  KTQPYPRKSDKSAMSKVKELTIT-----TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHE 113
           K QPYPR++    +S + E+T+T      +AP   +C VTH  P +VF+ GG  GN++H+
Sbjct: 109 KIQPYPRRT----VSGIAEVTVTRQQDRAAAP---ACTVTHGVPGVVFALGGLTGNYWHD 161

Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
           F D  VPLF+    +    +V   +++    W  KY  ++ R +R+  ++++  T   C 
Sbjct: 162 FSDVLVPLFVASRRY--GGEVQFLVSNIQPWWLGKYEAVVRRLSRYDAVDLDRDTEVRCL 219

Query: 174 QSVTLGLISHGRMVINPTLLPKPK--TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVN 231
           + V +GL  H    + P L P  +  T+ DF +FL + Y      ++     +   ++V 
Sbjct: 220 RRVAVGLRMHKEFSVKPELAPGGQRLTMADFAAFLRDTYALPRAAAAGARRPR---LVVI 276

Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
           R A   R I+N+ EV +AAE  GF+  +  P    ++ +  R +++  AMVGVHGAGLT+
Sbjct: 277 RRAHY-RKIVNMDEVVRAAEAAGFEAAVMSPRFDEAVEEVARKVNAFDAMVGVHGAGLTN 335

Query: 292 SLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
           ++FL  G+V++QVVP G  + ++   F +P   +GL Y+EY +  +ES+L+E  G    V
Sbjct: 336 AVFLPAGAVVIQVVPYGRLERMARADFGEPVADMGLRYMEYSVAADESTLLEMLGPEHQV 395

Query: 351 LKNPQAFAGANWSNM-RVYLKTQNVKLDI 378
           +K+P+A   + W  +   YL  Q+V++++
Sbjct: 396 VKDPEAVHRSGWDKVAEYYLGKQDVRINV 424


>gi|326504780|dbj|BAK06681.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 457

 Score =  177 bits (449), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 123/382 (32%), Positives = 194/382 (50%), Gaps = 27/382 (7%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S+   D+C + G         T  ++ H  ++  + S R+K  P+ RK D   ++ V
Sbjct: 74  CDMSNLRSDVCELKGDVRVILSNITIIALVHPSASLRRRSRRMK--PHARKKDGHVLASV 131

Query: 76  KEL--TITTSAPPNLSCGVTHTSPALVFS-AGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
            ++  ++T S+P    C     +PA+VFS  GGY GN FH+F D  +PLFIT  S F   
Sbjct: 132 TDVLVSVTPSSPHVPGCMAESAAPAVVFSVGGGYEGNMFHDFTDVLIPLFIT-ASRF-RS 189

Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQ-TITHCFQSVTLGLISHGRMVINPT 191
           DV L  +D    W  KY  LL   + H +I+++ Q T   C+  V +GL  H  M IN  
Sbjct: 190 DVHLLASDAPSWWLDKYRPLLRGLSGHAVIDMDRQSTEVLCYPHVVVGLSFHKEMSINDA 249

Query: 192 LLPKPK-TLVDFQSFLANAYNENTNTSSSFHHT---------KPKLVLVNRNARVGRTIL 241
                  ++  F      +Y    +T+    H          +P+L++++R  +  R   
Sbjct: 250 KTAGGHYSMAAFARLARRSYGLERDTAIRLLHGSSDNVKSPRRPRLLIISR--KTTRAFT 307

Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
           N+  V +AA  LG++V + E E+ + L+   R ++SC  +VGVHG GL + +FL PG+V+
Sbjct: 308 NMGTVAQAAAMLGYEVIVGEAEQRSDLSALARLVNSCDVLVGVHGTGLANLVFLPPGAVV 367

Query: 302 MQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGA 360
           +QVVP+G  + ++   F  PA  +GL Y+ Y +   ES+L E + ++     NP A    
Sbjct: 368 VQVVPLGGLEAMAGEDFGVPAGDMGLGYVRYTVAIGESTLAELHPSD-----NPAAVRSQ 422

Query: 361 NWSNMR-VYLKTQNVKLDIDRF 381
               +R  YL  QNV L++ RF
Sbjct: 423 ASLALRPAYLAGQNVTLNVTRF 444


>gi|357139224|ref|XP_003571184.1| PREDICTED: uncharacterized protein LOC100840829 [Brachypodium
           distachyon]
          Length = 546

 Score =  177 bits (449), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 116/395 (29%), Positives = 197/395 (49%), Gaps = 29/395 (7%)

Query: 9   SPAKSITCDRSHRSYDLCLINGSA--LFDPKTSTFFSVGHTDSTPS--QPSLRIKTQPYP 64
           +P K I CD S   YD+C ++G A  +  P  +  +  G         + ++R +++ + 
Sbjct: 146 APRKPI-CDLSDPRYDVCDMSGDARAIGGPNRTVLYISGDAAGAEEGHEWAIRDQSRKHL 204

Query: 65  RKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT 124
              +  A+  +        + P   C   H  PA+VF+  G   N +H+F D  +PLFIT
Sbjct: 205 EYINTVAVKSLSAAQAQAQSAP--ECTSRHAVPAVVFAMNGLTSNPWHDFSDVLIPLFIT 262

Query: 125 INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTIT-HCFQSVTLGLISH 183
                 D  V   + D    +  KY  +L   +RH I++++ ++ +  C   V +GL SH
Sbjct: 263 ARGF--DGGVQFLVTDIQPWFLDKYRLILANLSRHDIVDLDKESGSVRCHPRVIVGLRSH 320

Query: 184 GRMVINPTLLP---KPKTLVDFQSFLANAYN------------ENTNTSSSFHHTKPKLV 228
             + I+P   P   K  T++DF+ ++   ++            ++   ++     KP+L+
Sbjct: 321 RDLGIDPARFPAGNKNYTMLDFRMYIRELFSLPPASVDIPYKEQSAAAAAEKQRKKPRLM 380

Query: 229 LVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAG 288
           L+NR     R  +NL E+  AAE  GF+  + EP     L +  R + SC  ++G HGAG
Sbjct: 381 LINRGR--NRKFVNLPEIAAAAEAAGFETVVVEPRRDLKLEEFSRVVDSCDVLMGAHGAG 438

Query: 289 LTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
           LT+  FLR G+V++QVVP G  +  S  ++  PA+ + L  +EY I  EES+L EKYG +
Sbjct: 439 LTNFFFLRTGAVMLQVVPWGHMERPSMEFYGVPAKEMRLRDVEYSITAEESTLYEKYGKD 498

Query: 348 DLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRF 381
              +++P++     W   MR Y   Q+++L++ RF
Sbjct: 499 HPAVRDPESIHRQGWQLGMRYYWLEQDIRLNVTRF 533


>gi|238007990|gb|ACR35030.1| unknown [Zea mays]
          Length = 260

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 93/254 (36%), Positives = 147/254 (57%), Gaps = 9/254 (3%)

Query: 145 WARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQS 204
           W  K+     + TRH +I+++     HCF  + +G   H  M ++P   P   + VDF+ 
Sbjct: 5   WVGKFTPFFRQLTRHDVIDVDKDGEVHCFPRIVVGATFHRDMGVDPRRAPGHVSAVDFKR 64

Query: 205 FLANAYNENTNTSS-----SFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTI 259
            L  A+      +S     +  H KP+L++++R  R  R  LN RE+  AA + GF+V +
Sbjct: 65  ALRAAFGLKREAASRGGGGATGHGKPRLLIISR--RGSRRFLNSREMAVAAGDAGFEVRV 122

Query: 260 FEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QWLSTVYFE 318
            EPE+ T +A     ++S  AMVGVHGAGLT+ +FL  G+VL+QVVP G  +WL+ V F+
Sbjct: 123 AEPEQRTDMAAFAALVNSADAMVGVHGAGLTNMVFLPRGAVLVQVVPFGGLEWLTGVTFK 182

Query: 319 KPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLD 377
           +PA  + + Y++Y ++ EESSLV++Y     VL +P A     W  ++  YL  QN+++D
Sbjct: 183 EPAADMEVSYMDYHVRLEESSLVDQYLRGHQVLTDPYAVHRQGWDALKTAYLDKQNIRMD 242

Query: 378 IDRFRIYLKDAYKK 391
           +DRFR  L++   +
Sbjct: 243 LDRFRATLREVMAR 256


>gi|222628421|gb|EEE60553.1| hypothetical protein OsJ_13906 [Oryza sativa Japonica Group]
          Length = 527

 Score =  175 bits (443), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 114/383 (29%), Positives = 197/383 (51%), Gaps = 40/383 (10%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S   YD+C+++G            +V  T  +P       K +PY RK     +  +
Sbjct: 170 CDTSFGKYDICVLDGDTRAQGGCGAGAAV-VTLVSPRAAPREWKIKPYSRK----YLDGL 224

Query: 76  KELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVI 135
           K +T++++  P                     GN++H+F D  VPLFI         +V 
Sbjct: 225 KPVTVSSAGSP---------------------GNYWHDFTDVLVPLFIGARRF--GGEVQ 261

Query: 136 LAIADCNDQWARKYAELLPRFTRHPIINI---NNQTITHCFQSVTLGLISHGRMVINPTL 192
           L + +    W  KY  +  + +RH I+++   +++ +  C+  V +G  S     I+P+L
Sbjct: 262 LLVVNLLPFWVDKYRRIFSQISRHDIVDLEKDDDRGVVRCYPHVVVGYGSRKEFTIDPSL 321

Query: 193 --LPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
                  T+V+F  FL  +Y+   +      + H  +P+++++ R     R ++NL EV 
Sbjct: 322 DDTGGGYTMVNFTEFLRQSYSLPRDRPIKLGTNHGARPRMMILERTN--SRKLMNLPEVA 379

Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
            AA   GF+VT+     +++  +  R ++S   MVGVHGAGLT+ +FL  G+VL+Q+VP 
Sbjct: 380 AAARAAGFEVTVAGGRPTSTYDEFAREVNSYDVMVGVHGAGLTNCVFLPTGAVLLQIVPY 439

Query: 308 GT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM- 365
           G  + ++   F +PAR +GL Y+EY I  +ESSL++ +G +  ++K+P A   + W N+ 
Sbjct: 440 GRLESIAQTDFGEPARDMGLRYIEYDIAADESSLMDVFGKDHPMIKDPVAVHLSGWGNVA 499

Query: 366 RVYLKTQNVKLDIDRFRIYLKDA 388
             YL  Q+V+++I+RFR +L  A
Sbjct: 500 EWYLGKQDVRVNIERFRPFLTQA 522


>gi|219888573|gb|ACL54661.1| unknown [Zea mays]
 gi|413953956|gb|AFW86605.1| hypothetical protein ZEAMMB73_638297 [Zea mays]
          Length = 501

 Score =  170 bits (431), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 115/397 (28%), Positives = 188/397 (47%), Gaps = 42/397 (10%)

Query: 7   ALSPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIK 59
           A  P   ITCD +       +    +C + G     PK  T + V  + + P   +   K
Sbjct: 126 ATKPEAKITCDENGVDEGFPYARPPVCELAGDIRVSPKEKTMYFVNPSGAGPFDSNGEKK 185

Query: 60  TQPYPRKSDKSAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDC 117
            +P+ RK D   +  V E+TI   +SA     C   H  P +VFS  GY  NFFH+  D 
Sbjct: 186 IRPFARKDD-FLLPGVVEVTIKSVSSAAAAPRCTRRHDVPVVVFSVAGYTDNFFHDNTDV 244

Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
            +PLF+T  +H    +V L + +    W RK+  LL + + + +I+       HCF++  
Sbjct: 245 LIPLFLT-TAHL-QGEVQLLVTNYKPWWVRKFTPLLRKLSNYDVIDFGKDDEVHCFRAGH 302

Query: 178 LGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNR 232
           LG+     ++I+P  T  P+  ++VD+  FL  A+N   +  +        KP+++++ R
Sbjct: 303 LGMYRDRDLIISPHPTRNPRNYSMVDYNRFLRRAFNLPRDAPAVLGEETSAKPQMLIIER 362

Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHS 292
                R +LNL EV    E LGF VT+ E +     AD   F    +A            
Sbjct: 363 KGT--RKLLNLPEVSALCEALGFAVTVAEAD-----ADVRVFAEKVNA------------ 403

Query: 293 LFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
                  VL+Q+VP G   W++T ++ +PAR + L Y+EY + +EE++L +KY  +  V 
Sbjct: 404 -----ADVLVQIVPWGKMDWMATNFYGQPARDMRLRYVEYYVSEEETTLKDKYPRDHYVF 458

Query: 352 KNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
           K+P       W  +   +  Q+V +++ RF+ +L  A
Sbjct: 459 KDPMHIHAQGWPAIAEIIMKQDVMVNMTRFKPFLLQA 495


>gi|222617640|gb|EEE53772.1| hypothetical protein OsJ_00161 [Oryza sativa Japonica Group]
          Length = 546

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 117/380 (30%), Positives = 173/380 (45%), Gaps = 28/380 (7%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S    D+C  +G       TS F  V    +      +R    PYPRK D + M +V
Sbjct: 181 CDFSDFRTDICDFSGDVRMAANTSEFVVVVDPAAAAVSHKVR----PYPRKGDATCMGRV 236

Query: 76  KELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
            E+T+  T  A     C   H  PA+ FS GGY GN FH+F D  VPL+ T+  +     
Sbjct: 237 TEITVRATGDAAGAPRCTRAHDVPAVAFSIGGYTGNIFHDFSDVIVPLYNTVQRY--RGG 294

Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTIT---HCFQSVTLGLISHGRMVINP 190
           V L + +    W  KY +LL   +RH  I++         HCF S  + L +H  ++I  
Sbjct: 295 VQLVMVNVASWWLVKYDKLLRELSRHAPIDLAKAGAAGEVHCFPSAVVSLRAHRELIIER 354

Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
                     DF  FL  A +     + +                      +L    +AA
Sbjct: 355 ERSLDGLATPDFTRFLRRALSLPRERADAARRRHRAAAAPPG---------HLPGAHQAA 405

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT- 309
            E               L      I+SC A++GVHGAGLT+ +FL PG+ ++QVVP G  
Sbjct: 406 PEPRRRGPRGGGGRVGKL------INSCDALLGVHGAGLTNMMFLPPGATMVQVVPWGGL 459

Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VY 368
           QW++ + + +PA  +GL Y++Y+I   ESSL +KY A D +  NP       ++ M+   
Sbjct: 460 QWIARMDYGEPAAAMGLNYIQYEIAVAESSLKDKYPAGDEIFTNPTGLHKRGFAFMKQTL 519

Query: 369 LKTQNVKLDIDRFRIYLKDA 388
           +  Q++ +D+ RFR  L+ A
Sbjct: 520 MDGQDITIDVTRFRPVLQQA 539


>gi|413950202|gb|AFW82851.1| hypothetical protein ZEAMMB73_004474 [Zea mays]
          Length = 329

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 103/338 (30%), Positives = 165/338 (48%), Gaps = 37/338 (10%)

Query: 72  MSKVKELTITTSAPPNLSCGVT----HTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
           M+ V+E T+     P           HT+P  +FS GG++GN +H++ D  VPLFI+ + 
Sbjct: 1   MAHVREYTLKALPEPGAGAAPASTRNHTNPGFLFSNGGFSGNMYHDYTDVLVPLFISTHQ 60

Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
            F ++     ++     W  K+     + TRH +I+++N    HCF  + +G   H  M 
Sbjct: 61  -FRER-----VSGMKPWWMGKFTPFFRQLTRHDVIDVDNDREVHCFPRIVVGATFHRDMG 114

Query: 188 INPTLLPKPKTLVDFQSFLANAYN---------------ENTNTSSSFHHTKPKLVLVNR 232
           ++P   P   ++VD +  L  A+                     +++ HH    L LV  
Sbjct: 115 VDPRRSPGHISVVDLKRALRAAFRLKREAASRGGVPQRRHGARQTAAAHHLPVWLTLVPE 174

Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHS 292
            A  G      R                    +T +A     ++S  +MVGVHGAGLT+ 
Sbjct: 175 RAGDGARRGRRRVRGAGGGA----------RPATDMATFVALVNSADSMVGVHGAGLTNM 224

Query: 293 LFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
           +FL  G VL+QVVP G  +WL+ V F++PA  + + Y++Y ++ EESSLV++Y     VL
Sbjct: 225 VFLPCGVVLVQVVPFGGLEWLTGVTFKEPASDMEVSYMDYNVRLEESSLVDQYPRGHQVL 284

Query: 352 KNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
            +P A     W  ++  YL  QN+++D+DRFR  L++A
Sbjct: 285 TDPYAVHRQGWDALKTAYLDKQNIQMDLDRFRATLREA 322


>gi|449526607|ref|XP_004170305.1| PREDICTED: glycosyltransferase-like domain-containing protein
           2-like [Cucumis sativus]
          Length = 335

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 101/301 (33%), Positives = 150/301 (49%), Gaps = 8/301 (2%)

Query: 25  LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITTSA 84
           +CL N     +     F+   + DS  +  S  I   PY R+ DK  +  V  L I    
Sbjct: 35  VCLTNNPTRINNTNLEFYISTNNDSQQNNFS-PILIHPYARQEDKITLRDVTPLQIIFQP 93

Query: 85  PPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCN 142
              L   C   H  P L+FS GG+ GN FHEF +  +PLFIT + HF  + V   I D  
Sbjct: 94  NKTLLPLCQFIHNVPVLIFSTGGFTGNLFHEFDETIIPLFIT-SYHFQTR-VRFLITDHK 151

Query: 143 DQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDF 202
             W +KY  +L   +R  ++N       HCF    +GL  H  + +N T +P   ++ DF
Sbjct: 152 TWWVQKYNRILSGLSRFNVVNPAEDGSVHCFNGGVIGLKFHNILSLNNTDIPGGYSMSDF 211

Query: 203 QSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEP 262
           +SFL   YN   N  S     KP ++L++R  +  R  +N  E+ +  +E+GF+V    P
Sbjct: 212 RSFLRQTYNLKVNNVSELSGKKPMVMLISR--QTSRRFMNEGEMVEMMKEVGFEVMTTTP 269

Query: 263 EESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPAR 322
           +  ++L      ++ C  ++G HGAGLT+ +FL  G+V++QVVP G  W ST YF    R
Sbjct: 270 QRMSNLDKFSSVVNLCSVIIGAHGAGLTNEVFLANGAVVVQVVPFGLDWPST-YFLGSQR 328

Query: 323 V 323
           +
Sbjct: 329 L 329


>gi|297737475|emb|CBI26676.3| unnamed protein product [Vitis vinifera]
          Length = 420

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 91/256 (35%), Positives = 137/256 (53%), Gaps = 3/256 (1%)

Query: 136 LAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPK 195
             I D       +Y  +L + + +  IN       HCF    +GL  H  + +N + +P 
Sbjct: 158 FVITDDKPWMVERYKRILKQLSAYNAINPAEDGSVHCFPGAVIGLKYHDNLALNTSDIPG 217

Query: 196 PKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGF 255
             ++ DF+ FL  +Y+    T+S   H KP L+L++R  R  R  LN  E     E+LGF
Sbjct: 218 GYSMSDFKHFLRKSYSLKITTASEIEHPKPVLILISR--RTTRKFLNEDETVGLMEDLGF 275

Query: 256 DVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTV 315
            V I  P   +SL    + ++SC  MVG HGAGL +S+ L  G+V +QVVP+G    S  
Sbjct: 276 QVIITPPYNMSSLDKFAQVVNSCSVMVGAHGAGLANSVLLPAGAVTVQVVPLGLDRASAA 335

Query: 316 YFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNV 374
            +  PAR +G++YLEYKI+ EESSL + YG +  ++ +PQ+     +   R VYL  Q++
Sbjct: 336 NYGDPARKIGVQYLEYKIEAEESSLFDLYGRDQPIIADPQSIHLKGYDVARAVYLDGQDM 395

Query: 375 KLDIDRFRIYLKDAYK 390
           K+++ R R  L  A K
Sbjct: 396 KINLVRLREILVQARK 411


>gi|242066748|ref|XP_002454663.1| hypothetical protein SORBIDRAFT_04g035100 [Sorghum bicolor]
 gi|241934494|gb|EES07639.1| hypothetical protein SORBIDRAFT_04g035100 [Sorghum bicolor]
          Length = 457

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 113/394 (28%), Positives = 183/394 (46%), Gaps = 29/394 (7%)

Query: 6   AALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDST-----PSQPSLRIKT 60
           A + P     CD S R YD C + G    D +T++    G   S      P Q       
Sbjct: 62  APVHPPLEPICDLSDRRYDGCEMWG----DARTAS----GADKSVVYFIPPPQQLATAAA 113

Query: 61  QPYPRKSDKSAMSKVKELTITTSAPPNL----SCGVTHTSPALVFSAGGYNGNFFHEFMD 116
             +  +S    +  V+E+ + +    +L     C V    PA+VF+ GG   N++H F D
Sbjct: 114 ATWSIRSQSRKIVGVREVIVRSLDASSLHEAPGCTVRRDVPAVVFALGGLTSNYWHAFSD 173

Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ-NQTITHCFQS 175
             VPLF T  +   D D++   A     +  KY  +L   +R+ +++++ +  +  C+  
Sbjct: 174 VLVPLFTTARAFGGDVDLLATGAGGQAWFLGKYDRVLRALSRYDVVDLDADGDVVRCYHH 233

Query: 176 VTLGLISHGRMVINPTLLPKPKTLVDFQSFL------ANAYNENTNTSSSFHHTKPKLVL 229
           V +GL  H    I+    P    ++ F+ F+                 S    T+P+L+L
Sbjct: 234 VVVGLRGHRDFDIDAARAPNGYDMLAFREFVRAAYSLPPPPAAALPCKSGGGGTRPRLML 293

Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEES-TSLADSFRFIHSCHAMVGVHGAG 288
           V R     R  +N   +  A E  GF+V   +   S  S+    R + +C  +VG HGAG
Sbjct: 294 VLRGRT--RRFVNEGAIVDAIERAGFEVARMDETASWGSVGAVAREVDACDVLVGAHGAG 351

Query: 289 LTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
           LT+ +FLR G+V++QV+P G  +     +F  PA  +GL ++ Y I  EES+L E+YG +
Sbjct: 352 LTNMVFLRAGAVVVQVIPWGKMEPYGEGFFGAPAAHMGLRHVAYSIAAEESTLYERYGKD 411

Query: 348 DLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRF 381
             V+ +P  F   N SN + Y   Q+++L+  RF
Sbjct: 412 HPVMADPDVFY-RNGSNAKFYWWEQSIRLNTTRF 444


>gi|226503289|ref|NP_001147801.1| glycosyltransferase [Zea mays]
 gi|195613814|gb|ACG28737.1| glycosyltransferase [Zea mays]
          Length = 551

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 122/395 (30%), Positives = 189/395 (47%), Gaps = 56/395 (14%)

Query: 25  LCLINGSALFDPKTSTFFSVGHTDSTPS-QPSLRIKTQPYPRKSDKSAMS-KVKELTITT 82
           +C ++G     PK  T + V  +        S+  + +PY RK D S     VK +    
Sbjct: 188 VCELSGDVRVSPKQRTVYLVNPSGGGGGFDESVEKRLRPYARKDDSSMPGVTVKSVASGA 247

Query: 83  SAPPNLSCGVTHTSPALVFSAGGYN--GNFFHEFM-DCFVPLFITINSHFPDQDVILAIA 139
           +AP    C   H  PA+VFS GGYN   N F + M D  VPLF+T  +H  D +V L +A
Sbjct: 248 AAP---ECTKRHAVPAVVFSVGGYNTDNNLFDDDMTDALVPLFLT-TAHL-DGEVQLVVA 302

Query: 140 DCNDQWARKYAELLPRFTRHPIININNQT---------ITHCF--QSVTLGLISHGRMVI 188
           D   +W RKYA LL + +RH +++++              HCF   +  +GL       +
Sbjct: 303 DYKPRWVRKYAPLLRKLSRHGVVSLDGDAEGEEEGPLDGVHCFPAGAFVVGLYRDRDRDL 362

Query: 189 N----PTLLPKPKTLVDFQSFLANAY---NENTNTSSSFHHTKPKLVLVNRNARVGRTIL 241
           +    P   P+  T+ DF  FL  A     +           +P+L++V+      R +L
Sbjct: 363 DLSPHPARNPRNVTMADFARFLRGALALPRDRPAVLGGAPGMRPRLLVVSP-----RRLL 417

Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
           NL EV  AA+ LGFDV   E   +  +A+    +++   +VGV GAGLT+ +FL   +VL
Sbjct: 418 NLEEVAAAADALGFDVASAE-AGADDVAEFAARVNAADVLVGVRGAGLTNQVFLPTEAVL 476

Query: 302 MQVVPIGT--QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
           +Q+VP G   +W++T  + + A  +GL YLEY + +EE+ L +KY               
Sbjct: 477 VQIVPWGKKIEWVTTSSYGRAAAGMGLRYLEYYVGEEETRLRDKY--------------- 521

Query: 360 ANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
                 R  +  Q+V +++ RFR  L  A  K ++
Sbjct: 522 -----PRETVMEQDVVVNLTRFRPVLLQALDKLQQ 551


>gi|413935206|gb|AFW69757.1| hypothetical protein ZEAMMB73_948511 [Zea mays]
          Length = 479

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 123/399 (30%), Positives = 192/399 (48%), Gaps = 60/399 (15%)

Query: 25  LCLINGSALFDPKTSTFFSVGHTDSTPS--QPSLRIKTQPYPRKSDKSAMS-KVKELTIT 81
           +C ++G     PK  T + V  +         S+  + +PY RK D S     VK +   
Sbjct: 112 VCELSGDVRVSPKQRTVYLVNPSGGGGGGFDESVEKRLRPYARKDDSSMPHITVKSVASG 171

Query: 82  TSAPPNLSCGVTHTSPALVFSAGGYN--GNFFHEFM-DCFVPLFITINSHFPDQDVILAI 138
            +AP    C   H  PA+VFS GGYN   N F + M D  VPLF+T  +H  D +V L +
Sbjct: 172 AAAP---ECTKRHAVPAVVFSVGGYNTDNNLFDDDMTDALVPLFLT-TAHL-DGEVQLVV 226

Query: 139 ADCNDQWARKYAELLPRFTRHPIININNQTI---------THCF--QSVTLGLISHGR-- 185
           AD   +W RKYA LL + +RH +++++              HCF   +  +GL+   R  
Sbjct: 227 ADYKPRWVRKYAPLLRKLSRHGVVSLDGDAEGEEEGPLDGVHCFPAGAFVVGLLYRDRDR 286

Query: 186 -MVINP----TLLPKPKTLVDFQSFLANAY---NENTNTSSSFHHTKPKLVLVNRNARVG 237
            + ++P       P+  T+ DF  FL  A     +           +P+L++V+      
Sbjct: 287 DLDLSPHPARNRNPRNVTMADFARFLRGALALPRDRPAVLGGAPGMRPRLLVVSP----- 341

Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
           R +LNL EV  AA+ LGFDV   E   +  +A+    +++   +VGV GAGLT+ +FL  
Sbjct: 342 RRLLNLEEVAAAADALGFDVASAE-AGADDVAEFAARVNAADVLVGVRGAGLTNQVFLPT 400

Query: 298 GSVLMQVVPIGT--QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
            +VL+Q+VP G   +W++T  + + A  +GL YLEY + +EE+ L +KY           
Sbjct: 401 EAVLVQIVPWGKKIEWVTTSSYGRAAAGMGLRYLEYYVGEEETRLRDKY----------- 449

Query: 356 AFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
                     R  +  Q+V +++ RFR  L  A  K ++
Sbjct: 450 ---------PRETVMEQDVVVNLTRFRPVLLQALDKLQQ 479


>gi|413935205|gb|AFW69756.1| glycosyltransferase [Zea mays]
          Length = 555

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 123/399 (30%), Positives = 192/399 (48%), Gaps = 60/399 (15%)

Query: 25  LCLINGSALFDPKTSTFFSVGHTDSTPS--QPSLRIKTQPYPRKSDKSAMS-KVKELTIT 81
           +C ++G     PK  T + V  +         S+  + +PY RK D S     VK +   
Sbjct: 188 VCELSGDVRVSPKQRTVYLVNPSGGGGGGFDESVEKRLRPYARKDDSSMPHITVKSVASG 247

Query: 82  TSAPPNLSCGVTHTSPALVFSAGGYN--GNFFHEFM-DCFVPLFITINSHFPDQDVILAI 138
            +AP    C   H  PA+VFS GGYN   N F + M D  VPLF+T  +H  D +V L +
Sbjct: 248 AAAP---ECTKRHAVPAVVFSVGGYNTDNNLFDDDMTDALVPLFLT-TAHL-DGEVQLVV 302

Query: 139 ADCNDQWARKYAELLPRFTRHPIININNQT---------ITHCF--QSVTLGLISHGR-- 185
           AD   +W RKYA LL + +RH +++++              HCF   +  +GL+   R  
Sbjct: 303 ADYKPRWVRKYAPLLRKLSRHGVVSLDGDAEGEEEGPLDGVHCFPAGAFVVGLLYRDRDR 362

Query: 186 -MVINP----TLLPKPKTLVDFQSFLANAY---NENTNTSSSFHHTKPKLVLVNRNARVG 237
            + ++P       P+  T+ DF  FL  A     +           +P+L++V+      
Sbjct: 363 DLDLSPHPARNRNPRNVTMADFARFLRGALALPRDRPAVLGGAPGMRPRLLVVSP----- 417

Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
           R +LNL EV  AA+ LGFDV   E   +  +A+    +++   +VGV GAGLT+ +FL  
Sbjct: 418 RRLLNLEEVAAAADALGFDVASAE-AGADDVAEFAARVNAADVLVGVRGAGLTNQVFLPT 476

Query: 298 GSVLMQVVPIG--TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
            +VL+Q+VP G   +W++T  + + A  +GL YLEY + +EE+ L +KY           
Sbjct: 477 EAVLVQIVPWGKKIEWVTTSSYGRAAAGMGLRYLEYYVGEEETRLRDKY----------- 525

Query: 356 AFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
                     R  +  Q+V +++ RFR  L  A  K ++
Sbjct: 526 ---------PRETVMEQDVVVNLTRFRPVLLQALDKLQQ 555


>gi|413935207|gb|AFW69758.1| hypothetical protein ZEAMMB73_948511 [Zea mays]
          Length = 386

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 123/399 (30%), Positives = 192/399 (48%), Gaps = 60/399 (15%)

Query: 25  LCLINGSALFDPKTSTFFSVGHTDSTPS--QPSLRIKTQPYPRKSDKSAMS-KVKELTIT 81
           +C ++G     PK  T + V  +         S+  + +PY RK D S     VK +   
Sbjct: 19  VCELSGDVRVSPKQRTVYLVNPSGGGGGGFDESVEKRLRPYARKDDSSMPHITVKSVASG 78

Query: 82  TSAPPNLSCGVTHTSPALVFSAGGYN--GNFFHEFM-DCFVPLFITINSHFPDQDVILAI 138
            +AP    C   H  PA+VFS GGYN   N F + M D  VPLF+T  +H  D +V L +
Sbjct: 79  AAAP---ECTKRHAVPAVVFSVGGYNTDNNLFDDDMTDALVPLFLT-TAHL-DGEVQLVV 133

Query: 139 ADCNDQWARKYAELLPRFTRHPIININNQTI---------THCF--QSVTLGLISHGR-- 185
           AD   +W RKYA LL + +RH +++++              HCF   +  +GL+   R  
Sbjct: 134 ADYKPRWVRKYAPLLRKLSRHGVVSLDGDAEGEEEGPLDGVHCFPAGAFVVGLLYRDRDR 193

Query: 186 -MVINP----TLLPKPKTLVDFQSFLANAY---NENTNTSSSFHHTKPKLVLVNRNARVG 237
            + ++P       P+  T+ DF  FL  A     +           +P+L++V+      
Sbjct: 194 DLDLSPHPARNRNPRNVTMADFARFLRGALALPRDRPAVLGGAPGMRPRLLVVS-----P 248

Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
           R +LNL EV  AA+ LGFDV   E   +  +A+    +++   +VGV GAGLT+ +FL  
Sbjct: 249 RRLLNLEEVAAAADALGFDVASAE-AGADDVAEFAARVNAADVLVGVRGAGLTNQVFLPT 307

Query: 298 GSVLMQVVPIG--TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
            +VL+Q+VP G   +W++T  + + A  +GL YLEY + +EE+ L +KY           
Sbjct: 308 EAVLVQIVPWGKKIEWVTTSSYGRAAAGMGLRYLEYYVGEEETRLRDKY----------- 356

Query: 356 AFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
                     R  +  Q+V +++ RFR  L  A  K ++
Sbjct: 357 ---------PRETVMEQDVVVNLTRFRPVLLQALDKLQQ 386


>gi|414876591|tpg|DAA53722.1| TPA: hypothetical protein ZEAMMB73_455614 [Zea mays]
          Length = 302

 Score =  157 bits (398), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 88/302 (29%), Positives = 161/302 (53%), Gaps = 14/302 (4%)

Query: 99  LVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTR 158
           +V + GG  GN++H+F D  +PL++   + F + +V L + +    +  KY  +L R +R
Sbjct: 1   MVLAMGGLTGNYWHDFSDIMIPLYLQ-AARF-EGEVQLVVENIQPWYVGKYRAILRRLSR 58

Query: 159 HPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSS 218
           H I++++      CF    +G+  H    I+P   P   ++ +F +FL   Y+      +
Sbjct: 59  HDIVDMDRDDRVRCFPGAVVGIRMHKEFSIDPAREPLGHSMPEFTAFLRETYSLPRAAPA 118

Query: 219 SF--------HHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLAD 270
                        +P++++++R  R  R ++NL  V   A  +GF+V I +P  +  + +
Sbjct: 119 RLAGADGEEDERARPRMMVISR--RHPRKLVNLDAVVALARRVGFEVVIGDPPFNVDVGE 176

Query: 271 SFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYL 329
             + +++   +VGVHGAGLT+SLFL  G+V +Q+ P G  + +  V F  PA  +GL+Y+
Sbjct: 177 FAKEVNAVDVLVGVHGAGLTNSLFLPTGAVFIQINPYGKMEHIGEVDFGIPAVDMGLKYI 236

Query: 330 EYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
            Y    EE++LV+  G +   +K+P++   + W  +   YL  Q+V+LD+ RF   L+ A
Sbjct: 237 AYSAGVEETTLVDTLGRDHPAVKDPESIHRSGWGKVAEYYLGKQDVRLDLQRFEPVLRKA 296

Query: 389 YK 390
            +
Sbjct: 297 MQ 298


>gi|49660109|gb|AAT68345.1| hypothetical protein At2g03360 [Arabidopsis thaliana]
          Length = 214

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 76/158 (48%), Positives = 105/158 (66%), Gaps = 5/158 (3%)

Query: 2   KLRPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQ 61
           K  P  ++ +  ITCDRSH SYDLC INGS + +PKT T   +  T +T S P +  K +
Sbjct: 57  KQNPKVMAASAKITCDRSHTSYDLCSINGSCILNPKTGTLTLMDRTLTT-SAPLVE-KIR 114

Query: 62  PYPRKSDKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
           PYPRKS+   M +++EL +T S P +L  SC +TH SPA+VFSAGGY G+ +H+F+D F+
Sbjct: 115 PYPRKSENWIMPRIRELKLT-SGPSDLTRSCDITHDSPAIVFSAGGYTGSIYHDFIDGFI 173

Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFT 157
           PLFIT NS +PD+D IL + +  + W  KY ++L   T
Sbjct: 174 PLFITANSVYPDRDFILVVVNPKEWWMPKYIDILDMVT 211


>gi|55740543|gb|AAV63864.1| hypothetical protein At2g03360 [Arabidopsis thaliana]
          Length = 210

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 76/158 (48%), Positives = 105/158 (66%), Gaps = 5/158 (3%)

Query: 2   KLRPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQ 61
           K  P  ++ +  ITCDRSH SYDLC INGS + +PKT T   +  T +T S P +  K +
Sbjct: 53  KQNPKVMAASAKITCDRSHTSYDLCSINGSCILNPKTGTLTLMDRTLTT-SAPLVE-KIR 110

Query: 62  PYPRKSDKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
           PYPRKS+   M +++EL +T S P +L  SC +TH SPA+VFSAGGY G+ +H+F+D F+
Sbjct: 111 PYPRKSENWIMPRIRELKLT-SGPSDLTRSCDITHDSPAIVFSAGGYTGSIYHDFIDGFI 169

Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFT 157
           PLFIT NS +PD+D IL + +  + W  KY ++L   T
Sbjct: 170 PLFITANSVYPDRDFILVVVNPKEWWMPKYIDILDMVT 207


>gi|224035277|gb|ACN36714.1| unknown [Zea mays]
          Length = 250

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 84/226 (37%), Positives = 125/226 (55%), Gaps = 22/226 (9%)

Query: 176 VTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH------------- 222
           + +GL  HG +V++P L+P  K++ DFQ+ L   Y+  T T+S+                
Sbjct: 1   MVVGLRIHGELVVDPKLMPDGKSIKDFQALLHQGYSRTTTTASASSSSPTPLLPLAPPTR 60

Query: 223 ------TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIH 276
                  KPK+++  R  +  R +LNL +V  A    GF   +      T LA     + 
Sbjct: 61  PCPRSANKPKMLIFIR--KQNRVLLNLPQVVTACRRAGFAPHVMNLRRQTPLAAIHAALA 118

Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
           S  AMV VHGA +TH LF+RPGSVL+Q+VP+G  W +  ++ KPA+ LGL+YLEY++  E
Sbjct: 119 SADAMVAVHGAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLDYLEYRVAPE 178

Query: 337 ESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRF 381
           ESSL  +YG N  V++ P   +   W  M +VY+  QNV +++ RF
Sbjct: 179 ESSLAAEYGLNSTVVRTPWVISSRGWWEMKKVYMDRQNVTVNVKRF 224


>gi|357481795|ref|XP_003611183.1| Glycosyltransferase, putative [Medicago truncatula]
 gi|355512518|gb|AES94141.1| Glycosyltransferase, putative [Medicago truncatula]
          Length = 341

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 105/354 (29%), Positives = 165/354 (46%), Gaps = 58/354 (16%)

Query: 1   MKLRPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGH-TDSTPSQPSLRIK 59
           ++ R   L   + +    +   Y  C   G      K+S+ + V H T+S     S  I+
Sbjct: 28  LETRKVELEQEQPLCVSEARTEY--CQPQGDIRVHGKSSSVYIVSHKTNSLAENVSWIIR 85

Query: 60  TQPYPRKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
             PY  K+ K +  KV +            C   H+ PA++FS  GY GN FHEF D  +
Sbjct: 86  --PYSVKTVKPS-HKVSQ------------CTKYHSIPAVIFSTAGYTGNHFHEFSDIVI 130

Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLG 179
           PLF+T      +  V L I D    W  K+   L + + + II+I+     HCF  V +G
Sbjct: 131 PLFLT--CRLFNGQVQLIITDKKSWWISKHQAFLKKLSNYEIIDIDRDDEVHCFSKVIIG 188

Query: 180 LIS-HGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT-----KPKLVLVNRN 233
           L   H  + I+P       ++ DF  FL ++Y+     +           KP+L++++R 
Sbjct: 189 LKRYHKELSIDPQ--KYSYSIKDFMEFLRSSYSLKRVGAIKIRDIGNKSKKPRLLILSR- 245

Query: 234 ARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSL 293
            +  R+ +N  ++ K A+ LGF V                          VHGAGLT+ L
Sbjct: 246 -KTSRSFINTNQITKMAKGLGFRVI-------------------------VHGAGLTNIL 279

Query: 294 FLRPGSVLMQVVPIG---TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKY 344
           FL   ++ +QVVP G    +WL+T  F +P   + ++YLEYKI+ EES+L+++Y
Sbjct: 280 FLPQNAIFIQVVPFGGMQVEWLATNDFARPLENMNIKYLEYKIRLEESTLIQQY 333


>gi|226528202|ref|NP_001145173.1| uncharacterized protein LOC100278409 [Zea mays]
 gi|195652185|gb|ACG45560.1| hypothetical protein [Zea mays]
          Length = 248

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 79/240 (32%), Positives = 130/240 (54%), Gaps = 8/240 (3%)

Query: 159 HPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN---ENTN 215
           + +++++     HCF+ V +GL SH    I+P   P   +++DF  F+  AY     +  
Sbjct: 4   YELVDLDRDPRVHCFRHVQVGLTSHDDFSIDPLRAPNGYSMLDFTGFMRAAYGLPRGDVA 63

Query: 216 TSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFI 275
            +      +P+L+++ R AR  R  +N  E+ + AE+LGF+V +   E +  +A      
Sbjct: 64  AAGPSSKRRPRLLVIAR-ART-RRFVNAEEIVRGAEKLGFEVVV--SEGTHEVAPFAELA 119

Query: 276 HSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQ 335
           +SC A++GV GAGLT+   +  G V++QVVP+G       YF  P+R +GL YLEY+I  
Sbjct: 120 NSCDAIMGVXGAGLTNMXXVPTGGVVIQVVPLGGLEFVAGYFRGPSRDMGLRYLEYRITP 179

Query: 336 EESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
           EES+L+++      +  +P       W +++  YL  Q+V LD+ RFR  LK A    +K
Sbjct: 180 EESTLIDQXPRXHPIFTDPNGIKSKGWXSLKDAYLDKQDVSLDMKRFRPTLKKAIAHIRK 239


>gi|125552212|gb|EAY97921.1| hypothetical protein OsI_19837 [Oryza sativa Indica Group]
          Length = 265

 Score =  137 bits (346), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 73/174 (41%), Positives = 101/174 (58%), Gaps = 10/174 (5%)

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
            KPKLV+  R  +  R +LNL  +  A    GF   +      T L      + S  AMV
Sbjct: 78  AKPKLVIFIR--KQNRVLLNLPHIVTACRRAGFAPHVMNLRRQTPLPAIHAALSSADAMV 135

Query: 283 GVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVE 342
            VHGA +TH LF+RPGSVL+Q+VP+G  W +  ++ KPA+ LGL YLEYK+  EESSL  
Sbjct: 136 AVHGAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLGYLEYKVAPEESSLAA 195

Query: 343 KYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRF-------RIYLKDA 388
           +YG N  V+++P   +   W  M +VY+  QNV ++I RF       R++LK+A
Sbjct: 196 EYGVNSTVVRDPSVISSRGWWEMKKVYMDRQNVTVNIKRFGELLRSARLHLKNA 249


>gi|77551625|gb|ABA94422.1| glycosyltransferase, putative [Oryza sativa Japonica Group]
          Length = 295

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 76/213 (35%), Positives = 111/213 (52%), Gaps = 8/213 (3%)

Query: 94  HTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELL 153
           H +P LV +AGGY GN FH F D FVP ++T+  H   + V+L +   N  WA  Y E++
Sbjct: 87  HDAPLLVMTAGGYTGNLFHAFSDGFVPAWLTVQ-HL-RRRVVLGVLLYNPWWAGTYGEII 144

Query: 154 PRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNEN 213
                + ++++ +    HCF    +G   HG + +NP  L   KT+VDF   LA+ Y   
Sbjct: 145 SGLLDYHVVDLLHDKRKHCFPGAIIGTRFHGILSVNPARLRDNKTIVDFHDLLADVYETA 204

Query: 214 TNTS----SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLA 269
            +T           +P+L +V+   R  R I N   V + A  +GFDV I E  +   L 
Sbjct: 205 GDTVVVDVPQPAPRRPRLGIVS--CRGKRVIENQAAVARLARTVGFDVDILETADGLQLP 262

Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
            S+  + +C  +VGVH A LT  LFLRPG+ L+
Sbjct: 263 ASYASVSACDVLVGVHSADLTKLLFLRPGAALV 295


>gi|125539259|gb|EAY85654.1| hypothetical protein OsI_07024 [Oryza sativa Indica Group]
          Length = 205

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 79/205 (38%), Positives = 122/205 (59%), Gaps = 13/205 (6%)

Query: 199 LVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLREVKKAAEEL 253
           +VDF++ L  A   E      S        +P+L++++R  R  R  LN R +   A  L
Sbjct: 1   MVDFRTMLRGALGLERAVAEPSGDRWDIRRRPRLLIISR--RNSRAFLNERAMADMAMSL 58

Query: 254 GFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QWL 312
           GFDV + EP+ ST ++   R ++S   MVGVHGAGLT+ +FL  G+VL+QVVP G  +WL
Sbjct: 59  GFDVRLGEPDISTDVSKFARLVNSADVMVGVHGAGLTNMVFLPAGAVLIQVVPYGGLEWL 118

Query: 313 STVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKT 371
           +   F++PA  + + YLEY I+ +E++L E+Y  +D VLK+P +     W+ ++ VYL  
Sbjct: 119 ARGTFKEPAADMQIHYLEYMIQLDETTLSEQYPKDDPVLKDPYSIHKQGWNALKMVYLDK 178

Query: 372 QNVKLDIDRFRIYLKDAYKKAKKLM 396
           QNV+  + R    LK+ + +A KL+
Sbjct: 179 QNVRPHLGR----LKNTFMEALKLL 199


>gi|222617642|gb|EEE53774.1| hypothetical protein OsJ_00166 [Oryza sativa Japonica Group]
          Length = 203

 Score =  127 bits (319), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 68/172 (39%), Positives = 104/172 (60%), Gaps = 5/172 (2%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
           +P+L+++ R AR  R  +N  E+ + AE  GF+V + E E    +A      ++C AMVG
Sbjct: 26  RPRLLVIAR-ART-RRFVNADEIVRGAERAGFEVVVSEGEHE--VAPFAELANTCDAMVG 81

Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEK 343
           VHGAGLT+ +FL  G V++QVVP+G       YF  P+R +GL YLEY+I  EES+L+++
Sbjct: 82  VHGAGLTNMVFLPTGGVVIQVVPLGGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLIDQ 141

Query: 344 YGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
           Y  +  +  +P       W++++  YL  Q+V+LD+ RFR  LK A    +K
Sbjct: 142 YPRDHPIFTDPDGVKSKGWNSLKEAYLDKQDVRLDMKRFRPILKKAIAHLRK 193


>gi|297612065|ref|NP_001068134.2| Os11g0575500 [Oryza sativa Japonica Group]
 gi|125577583|gb|EAZ18805.1| hypothetical protein OsJ_34334 [Oryza sativa Japonica Group]
 gi|255680206|dbj|BAF28497.2| Os11g0575500 [Oryza sativa Japonica Group]
          Length = 202

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 72/206 (34%), Positives = 106/206 (51%), Gaps = 8/206 (3%)

Query: 101 FSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHP 160
            +AGGY GN FH F D FVP ++T+  H   + V+L +   N  WA  Y E++     + 
Sbjct: 1   MTAGGYTGNLFHAFSDGFVPAWLTVQ-HL-RRRVVLGVLLYNPWWAGTYGEIISGLLDYH 58

Query: 161 IININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTS--- 217
           ++++ +    HCF    +G   HG + +NP  L   KT+VDF   LA+ Y    +T    
Sbjct: 59  VVDLLHDKRKHCFPGAIIGTRFHGILSVNPARLRDNKTIVDFHDLLADVYETAGDTVVVD 118

Query: 218 -SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIH 276
                  +P+L +V+   R  R I N   V + A  +GFDV I E  +   L  S+  + 
Sbjct: 119 VPQPAPRRPRLGIVS--CRGKRVIENQAAVARLARTVGFDVDILETADGLQLPASYASVS 176

Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLM 302
           +C  +VGVH A LT  LFLRPG+ L+
Sbjct: 177 ACDVLVGVHSADLTKLLFLRPGAALV 202


>gi|21393001|gb|AAL47575.2| symbiosis-related protein [Daucus carota]
          Length = 249

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 66/169 (39%), Positives = 101/169 (59%), Gaps = 5/169 (2%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
           KPK+V++ RN    R ILN   + K A+ + F V +  P+++T     +R ++S   M+G
Sbjct: 69  KPKVVIIARND--SRAILNEASLVKMAQGIKFQVEVLIPQKTTEAIRIYRALNSSDVMIG 126

Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEK 343
           VHGA +T   F+RP S  +Q++P+GT W +  Y+  PAR LG  +L YKI  +ESSL  +
Sbjct: 127 VHGAAMTRFAFMRPDSACIQIIPLGTDWAADTYYGLPARKLGSRWLIYKILPQESSLYNE 186

Query: 344 YGANDLVLKNPQAF--AGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
           Y   D VL +P +    G  ++  ++YL  QNVKL++ RF   L+ AY+
Sbjct: 187 YEKVDPVLTDPDSVNNRGCEFTK-KIYLDRQNVKLNLRRFLKRLQRAYR 234


>gi|222631478|gb|EEE63610.1| hypothetical protein OsJ_18427 [Oryza sativa Japonica Group]
          Length = 265

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 58/122 (47%), Positives = 81/122 (66%), Gaps = 8/122 (6%)

Query: 275 IHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIK 334
           + S  AMV VHGA +TH LF+RPGSVL+Q+VP+G  W +  ++ KPA+ LGL YLEYK+ 
Sbjct: 128 LSSADAMVAVHGAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLGYLEYKVA 187

Query: 335 QEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRF-------RIYLK 386
            EESSL  +YG N  V+++P   +   W  M +VY+  QNV ++I RF       R++LK
Sbjct: 188 PEESSLAAEYGVNSTVVRDPSVISSRGWWEMKKVYMDRQNVTVNIKRFGELLRSARLHLK 247

Query: 387 DA 388
           +A
Sbjct: 248 NA 249


>gi|413949496|gb|AFW82145.1| hypothetical protein ZEAMMB73_426058 [Zea mays]
 gi|413949497|gb|AFW82146.1| hypothetical protein ZEAMMB73_426058 [Zea mays]
          Length = 147

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 57/110 (51%), Positives = 76/110 (69%), Gaps = 1/110 (0%)

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
           AMV VHGA +TH LF+RPGSVL+Q+VP+G  W +  ++ KPA+ LGLEYLEYK+  EESS
Sbjct: 21  AMVAVHGAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLEYLEYKVAPEESS 80

Query: 340 LVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
           L  +YG +  VL+NP   +   W  M +VY+  QNV ++I RF   L+ A
Sbjct: 81  LAAEYGLDSTVLRNPWVISSRGWWEMKKVYMDRQNVTVNIKRFGELLRTA 130


>gi|449463274|ref|XP_004149359.1| PREDICTED: glycosyltransferase-like domain-containing protein
           2-like [Cucumis sativus]
          Length = 155

 Score =  120 bits (302), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 60/150 (40%), Positives = 100/150 (66%), Gaps = 5/150 (3%)

Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
           +E+GF+V    P+  ++L      ++ C  ++G HGAGLT+ +FL  G+V++QVVP G  
Sbjct: 6   KEVGFEVMTTTPQRMSNLDKFSSVVNLCSVIIGAHGAGLTNEVFLANGAVVVQVVPFGLD 65

Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQA-FAGANWSNMRVYL 369
           W ST +F KPA  + L+YLEYKI+ +ESSL +KYG N  V+++P++ FA   +++  +Y+
Sbjct: 66  WPSTYFFGKPAAEMELQYLEYKIEAKESSLWDKYGENHPVIRDPESIFAQGYFASRAIYI 125

Query: 370 KTQNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
             QN+K+++ RFR    D   + KKL++++
Sbjct: 126 DEQNLKINLTRFR----DTMIQVKKLIEEK 151


>gi|414876580|tpg|DAA53711.1| TPA: hypothetical protein ZEAMMB73_101479 [Zea mays]
          Length = 239

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 62/162 (38%), Positives = 94/162 (58%), Gaps = 3/162 (1%)

Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
           R  +N  E+ + AE+LGF+V +   E +  +A      +SC A++GVHGAGLT+ +F+  
Sbjct: 71  RRFVNAEEIVRGAEKLGFEVVV--SEGTHEVAPFAELANSCDAIMGVHGAGLTNMVFVPT 128

Query: 298 GSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
           G V++QVVP+G       YF  P+R +GL YLEY+I  EES+L+ +Y  +  +  +P   
Sbjct: 129 GGVVIQVVPLGGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLINQYPRDHPIFTDPNGI 188

Query: 358 AGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
               W +++  YL  Q+V LD+ RFR  LK A    +K   K
Sbjct: 189 KSKGWESLKDAYLDKQDVSLDMKRFRPTLKKAIAHIRKARAK 230



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 38/68 (55%)

Query: 72  MSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
           M  V+ LT+ +      +C   H  PALVFS  GY GN+FH F D  +PLF+T   +  +
Sbjct: 1   MRVVRVLTVRSVPGEAPACTDRHGVPALVFSDRGYTGNYFHAFTDVILPLFLTARQYAGE 60

Query: 132 QDVILAIA 139
             +++A A
Sbjct: 61  VRLLIARA 68


>gi|218196736|gb|EEC79163.1| hypothetical protein OsI_19838 [Oryza sativa Indica Group]
          Length = 348

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 65/211 (30%), Positives = 101/211 (47%), Gaps = 26/211 (12%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           ++ CDRSH   D+C + G    DP TS+         +  +     K +PY RK + S M
Sbjct: 142 ALCCDRSHYRSDVCYLRGDVRTDPSTSSVLLYNAPRGSAPE-----KVRPYTRKFEGSIM 196

Query: 73  SKVKELTI------------TTSAPPN-----LSCGVTHTS--PALVFSAGGYNGNFFHE 113
           S + E+TI            TT+           C V H    PA+VFS GGY GN +HE
Sbjct: 197 STIDEVTIVPVVDAGSGSNGTTAGDTGKDSLRRRCDVRHPPGVPAVVFSTGGYTGNVYHE 256

Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
           F D  +PLFIT        +V+  + + +  W  +Y  +L R T + +++       HCF
Sbjct: 257 FSDGLIPLFITAQRFA--GEVVFVVLEYHYWWLGRYGAVLERLTNYKVVDFRYDRRVHCF 314

Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQS 204
             + +GL  HG +V++P L+P  ++  D ++
Sbjct: 315 SEMIVGLRIHGELVVDPKLMPNGESCGDHEA 345


>gi|222631479|gb|EEE63611.1| hypothetical protein OsJ_18428 [Oryza sativa Japonica Group]
          Length = 368

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 96/201 (47%), Gaps = 26/201 (12%)

Query: 13  SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
           ++ CDRSH   D+C + G    DP TS+         +  +     K +PY RK + S M
Sbjct: 141 ALCCDRSHYRSDVCYLRGDVRTDPSTSSVLLYNAPRGSAPE-----KVRPYTRKFEGSIM 195

Query: 73  SKVKELTI------------TTSAPPN-----LSCGVTHTS--PALVFSAGGYNGNFFHE 113
           S + E+TI            TT+           C V H    PA+VFS GGY GN +HE
Sbjct: 196 STIDEVTIVPVVDAGSGSNGTTAGDTGKDSLRRRCDVRHPPGVPAVVFSTGGYTGNVYHE 255

Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
           F D  +PLFIT        +V+  + + +  W  +Y  +L R T + +++       HCF
Sbjct: 256 FSDGLIPLFITAQRF--AGEVVFVVLEYHYWWLGRYGAVLERLTNYKVVDFRYDRRVHCF 313

Query: 174 QSVTLGLISHGRMVINPTLLP 194
             + +GL  HG +V++P L+P
Sbjct: 314 SEMIVGLRIHGELVVDPKLMP 334


>gi|222616867|gb|EEE52999.1| hypothetical protein OsJ_35683 [Oryza sativa Japonica Group]
          Length = 424

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 51/141 (36%), Positives = 84/141 (59%), Gaps = 6/141 (4%)

Query: 251 EELGFDVTIFEPEESTSLADSF----RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
           + +GF+    E + + +  D      R ++S  A+VGVHGAGLT+ +FL PG+  +Q+VP
Sbjct: 275 QRVGFEAVASELDMAGADHDDVARVARLVNSFDAVVGVHGAGLTNMVFLPPGAAAVQIVP 334

Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
            G  +WL+   F +PA  +GL Y++Y++   ES+L +KY  +  +  NP A     ++ M
Sbjct: 335 WGGLRWLARADFGEPAVAMGLRYIQYEVAAGESTLKDKYPRDHEIFTNPTALHKKGFTFM 394

Query: 366 R-VYLKTQNVKLDIDRFRIYL 385
           R  +L  Q++ +DIDRF+  L
Sbjct: 395 RHTFLNGQDIIVDIDRFKPVL 415



 Score = 43.9 bits (102), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 21/65 (32%), Positives = 32/65 (49%), Gaps = 2/65 (3%)

Query: 16  CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
           CD S    D+C + G    D   S F  V         P+ +++  PYPRK D ++M +V
Sbjct: 187 CDLSDHRTDVCDLAGDIRMDANASAFVVVVDPAVGADGPTYKVR--PYPRKGDATSMGRV 244

Query: 76  KELTI 80
            E+T+
Sbjct: 245 TEITV 249


>gi|125568812|gb|EAZ10327.1| hypothetical protein OsJ_00162 [Oryza sativa Japonica Group]
          Length = 168

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 52/139 (37%), Positives = 79/139 (56%), Gaps = 9/139 (6%)

Query: 199 LVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAEELGF 255
           +VDF  FL +AY    +       T   KP++++++R  R  R +LNLR+V   A ELGF
Sbjct: 1   MVDFTRFLRHAYGLRRDKPMVLGETSGKKPRMLIISR--RRTRKLLNLRQVAAMARELGF 58

Query: 256 DVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQW 311
           +V + E           RF   ++SC  +VGVHGAGLT+  FL  G V++Q+VP G  +W
Sbjct: 59  EVVVSEAGVGGGSGGVKRFASAVNSCDVLVGVHGAGLTNQAFLPRGGVVVQIVPWGRMEW 118

Query: 312 LSTVYFEKPARVLGLEYLE 330
           ++T ++  PA   G + L+
Sbjct: 119 MATNFYGAPAAAHGAQGLK 137


>gi|194703492|gb|ACF85830.1| unknown [Zea mays]
          Length = 175

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 53/151 (35%), Positives = 84/151 (55%), Gaps = 11/151 (7%)

Query: 199 LVDFQSFLANAY---NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGF 255
           + DF  FL  A     +           +P+L++V+      R +LNL EV  AA+ LGF
Sbjct: 1   MADFARFLRGALALPRDRPAVLGGAPGMRPRLLVVS-----PRRLLNLEEVAAAADALGF 55

Query: 256 DVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG--TQWLS 313
           DV   E   +  +A+    +++   +VGV GAGLT+ +FL   +VL+Q+VP G   +W++
Sbjct: 56  DVASAEAG-ADDVAEFAARVNAADVLVGVRGAGLTNQVFLPTEAVLVQIVPWGKKIEWVT 114

Query: 314 TVYFEKPARVLGLEYLEYKIKQEESSLVEKY 344
           T  + + A  +GL YLEY + +EE+ L +KY
Sbjct: 115 TSSYGRAAAGMGLRYLEYYVGEEETRLRDKY 145


>gi|218198152|gb|EEC80579.1| hypothetical protein OsI_22912 [Oryza sativa Indica Group]
          Length = 301

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 50/146 (34%), Positives = 74/146 (50%), Gaps = 6/146 (4%)

Query: 161 IININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTS--- 217
           ++++ +    HCF    +G   HG + +NP  L   KT+VDF   LA+ Y    +T    
Sbjct: 158 VVDLLHDKRKHCFPGAIIGTRFHGILSVNPARLRDNKTIVDFHDLLADVYETAGDTVVVD 217

Query: 218 -SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIH 276
                  +P+L +V+   R  R I N   V + A  +GFDV I E  +   L  S+  + 
Sbjct: 218 VPQPAPRRPRLGIVS--CRGKRVIENQAAVARLARTVGFDVDILETADGLPLPASYASVS 275

Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLM 302
           +C  +VGVH A LT  LFLRPG+ L+
Sbjct: 276 ACDVLVGVHSADLTKLLFLRPGAALV 301


>gi|335929271|gb|AEH75985.1| putative glycosyltransferase [Wolffia australiana]
          Length = 254

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 40/114 (35%), Positives = 59/114 (51%), Gaps = 11/114 (9%)

Query: 14  ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
           I C+ S+   ++C I         + + F+ G+            K +PYPRK +   M 
Sbjct: 133 ILCNNSNYRTNVCEIFTEVKIQGSSFSVFAAGNNS--------LWKIRPYPRKWEPGLME 184

Query: 74  KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNG-NFFHEFMDCFVPLFITIN 126
           ++KE T+   A P   C V H+ PA+VFS GG  G NFFH+  D  +PLF+T+N
Sbjct: 185 QIKEYTVKAEAGP--PCSVIHSVPAVVFSTGGLLGKNFFHDLSDVLIPLFLTVN 236


>gi|156408145|ref|XP_001641717.1| predicted protein [Nematostella vectensis]
 gi|156228857|gb|EDO49654.1| predicted protein [Nematostella vectensis]
          Length = 451

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 68/278 (24%), Positives = 124/278 (44%), Gaps = 20/278 (7%)

Query: 69  KSAMSKVKELTITTSAPPN-LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--I 125
           +S  ++++  T     P N  +C V    P +V        N +H F D FV L+ +  I
Sbjct: 131 QSWFAELQHFTSLPYHPMNDGACDVVIDKPTMVIKLDA-GVNMYHHFCD-FVNLYASQHI 188

Query: 126 NSHFPDQDVILAIADCND-QWARKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISH 183
           N  F   DV + + D +D  +A  ++E+   F++HP++ +N+ Q    C +     L++ 
Sbjct: 189 NGSFS-TDVNILVWDTSDMHYADFFSEMWMVFSKHPLLRLNSFQGKRVCLKDAVFSLLA- 246

Query: 184 GRMVI----NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRT 239
            RM+     N  L+P       F SF  +           +   + ++ L+ R  +  R 
Sbjct: 247 -RMIFGLYYNMPLVPNCHGSGLFHSFTHHTLERLGIIQDEYEEDEFRITLIERKTQY-RN 304

Query: 240 ILNLREVKKAAEEL-GFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
           ILN  E+  A + +  F V +          D  +  H+   M+G+HGAGLTH LFL   
Sbjct: 305 ILNQDELIGAMKSVPNFKVKVVHYNREIPFLDQLKVTHNTDIMIGMHGAGLTHLLFLPDW 364

Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
           +V+ ++            +   AR+ G+ Y+ ++ +++
Sbjct: 365 AVIFEIYNTE----DPDCYGDLARLRGVTYMTWEKREK 398


>gi|198412913|ref|XP_002126731.1| PREDICTED: similar to AER61 glycosyltransferase [Ciona
           intestinalis]
          Length = 507

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 78/297 (26%), Positives = 131/297 (44%), Gaps = 57/297 (19%)

Query: 90  CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--INSHFPDQDVILAIAD-CNDQWA 146
           C +    P ++F    + GN FH F D F  LF++  +N    ++DV + + D  +  + 
Sbjct: 182 CDIVINHP-VIFMKMDFGGNMFHHFCD-FFNLFVSLHVNGSSFNKDVQIVMWDTASSNYY 239

Query: 147 RKYAELLPRFTRHPI---ININNQTITHCFQSVTLGLI--SHGRMVINPTLLPKPKTLVD 201
             ++     FT  P+   ++ + + +  CF+     L+    G +  N  +   P+  V 
Sbjct: 240 DPFSSSWKAFTSRPVTPLVDWDKKKV--CFREAYFSLLPRMRGGLYYNTYV---PQNCVG 294

Query: 202 ---FQSF------------LANAYNENTNTSSSFHHTKPKL--VLVNR----NARVGRTI 240
              F+SF            L   + +  N        KPKL   L+ R    N RV R I
Sbjct: 295 SNLFRSFSKFFLQQMKVRQLGPVFVQGRNP-------KPKLRVTLLQRGTPDNDRVYRKI 347

Query: 241 LNLREVKKAAEEL-GFDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
            N RE++K   E    ++T+ E +    S  D     H+   M+G+HGAGLTH LFL P 
Sbjct: 348 KNQRELEKVFGEFEDLELTVVEYDWRKMSFKDQLLMTHNSDIMIGMHGAGLTHFLFLPPW 407

Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYL--------EYKIKQEESSLVEKYGAN 347
           +V  ++   G +     Y++  AR+ G++Y+        +++ K  E     KYG+N
Sbjct: 408 AVAFELYNCGDK---NCYYD-LARLGGIKYMTWSDGGNPKFEPKPSEKGKHHKYGSN 460


>gi|242060484|ref|XP_002451531.1| hypothetical protein SORBIDRAFT_04g003343 [Sorghum bicolor]
 gi|241931362|gb|EES04507.1| hypothetical protein SORBIDRAFT_04g003343 [Sorghum bicolor]
          Length = 152

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 42/101 (41%), Positives = 60/101 (59%), Gaps = 6/101 (5%)

Query: 244 REVKKAAEELGFDVTIFEPEESTSLADSFR-FIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
           RE+ +AA + G +V  +    ST    +F   ++S   MVGVHGAGLT+ +FL  G VL+
Sbjct: 56  REMARAATDAGLEVCGWRSPTSTRTWATFAALVNSADVMVGVHGAGLTNMVFLPRGGVLI 115

Query: 303 QVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVE 342
           QVVP G  +WL+ V F+ PA  + + Y       EESSL++
Sbjct: 116 QVVPFGGLKWLTGVTFKDPAVDMEVTYGLL----EESSLID 152


>gi|414876596|tpg|DAA53727.1| TPA: hypothetical protein ZEAMMB73_648118 [Zea mays]
          Length = 105

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 31/92 (33%), Positives = 55/92 (59%), Gaps = 2/92 (2%)

Query: 293 LFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
           +FL PG+ L+Q+VP G  QW++   +  PA  +GL+Y++Y+I   ES+L +K+ +   + 
Sbjct: 2   MFLPPGAALVQIVPWGGLQWMARADYGDPAEAMGLKYIQYEIGVAESTLKDKFPSGHKIF 61

Query: 352 KNPQAFAGANWSNMR-VYLKTQNVKLDIDRFR 382
            NP A     +  +R   +  Q++ +D+ RFR
Sbjct: 62  TNPTALHKKGFMFIRQTLMDGQDITVDVGRFR 93


>gi|222617643|gb|EEE53775.1| hypothetical protein OsJ_00167 [Oryza sativa Japonica Group]
          Length = 443

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 49/144 (34%), Positives = 69/144 (47%), Gaps = 14/144 (9%)

Query: 16  CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDS-TPSQPSLRIKTQPYPRKSDKS 70
           CD  S+R  D C ++G         T   V   G  D  T    S  IK  PYPRK+D +
Sbjct: 273 CDLWSNRRIDWCELDGDVRVAGANGTVSLVAPPGPADERTFRAESWHIK--PYPRKADPN 330

Query: 71  AMSKVKELTITT-----SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
           AM  V+ LT+ +     ++    +C   H  P LVFS  GY GN+FH + D  +PLF+T 
Sbjct: 331 AMRHVRVLTVQSLPAPAASAAAPACTERHDVPGLVFSDRGYTGNYFHAYTDVILPLFLTA 390

Query: 126 NSHFPDQDVILAIADCNDQWARKY 149
             +    +V L ++D    W  K+
Sbjct: 391 RQY--SAEVKLLVSDFQMWWLGKF 412


>gi|242070281|ref|XP_002450417.1| hypothetical protein SORBIDRAFT_05g005021 [Sorghum bicolor]
 gi|241936260|gb|EES09405.1| hypothetical protein SORBIDRAFT_05g005021 [Sorghum bicolor]
          Length = 279

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 50/91 (54%), Gaps = 1/91 (1%)

Query: 255 FDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QWLS 313
            +V + EP     L    R ++SC  +VG+HG  L + +FL  G+V++QV P+G    ++
Sbjct: 122 IEVVVGEPARHADLPSFARVVNSCDVLVGMHGVRLANLVFLPAGAVVVQVAPLGGLDAMA 181

Query: 314 TVYFEKPARVLGLEYLEYKIKQEESSLVEKY 344
              F  PAR   + Y+ Y I  EES+L  +Y
Sbjct: 182 AEDFGAPARDAWIRYVHYGIAVEESTLARRY 212


>gi|357521537|ref|XP_003631057.1| Glycosyltransferase [Medicago truncatula]
 gi|355525079|gb|AET05533.1| Glycosyltransferase [Medicago truncatula]
          Length = 145

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 11/105 (10%)

Query: 20  HRSYDLCL-INGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKEL 78
           H S  LC  ++G      K+S  + V H  +     ++R    PY R+ D  AM  V++ 
Sbjct: 46  HYSTTLCFRVHG------KSSNVYIVSHKTTENMSWTIR----PYARREDAYAMRHVRKW 95

Query: 79  TITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
           ++  S      C   H+ PA++FS  GY GN FHEF D  +PLF+
Sbjct: 96  SVKASHHQVPQCTENHSIPAVIFSTAGYTGNHFHEFSDIIIPLFL 140


>gi|149412730|ref|XP_001510490.1| PREDICTED: uncharacterized glycosyltransferase AER61-like isoform 1
           [Ornithorhynchus anatinus]
          Length = 527

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 111/236 (47%), Gaps = 15/236 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   ++E    FT + II++ 
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVNIVMWDTSSYGYGDLFSETWKAFTDYDIIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+ V   L+   R  +  N  L+   +    F++F  +  +    T +    
Sbjct: 303 TYDSKRVCFKEVVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQAGPKE 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+ KA + +  FDV + + + +     D  R  H+   
Sbjct: 363 GKIRITILARSTEY-RKILNQDELVKALKTVSAFDVQVVDYKYKKLGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
            VG+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++ K +
Sbjct: 422 FVGMHGAGLTHLLFLPDWATVFELYNCEDE---RCYLDL-ARLRGIHYVTWQKKNK 473


>gi|194466153|gb|ACF74307.1| glycosyltransferase [Arachis hypogaea]
          Length = 217

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 53/120 (44%), Gaps = 16/120 (13%)

Query: 11  AKSITCDRSHRSYDLCLINGSALFDPKTSTFF------------SVGHTDSTPSQPSLRI 58
           + +I CDR+    D+CL+ G       +S+ F            +VG       Q     
Sbjct: 97  SGTICCDRTGYRSDICLMKGDIRTHSPSSSIFLYNSGIINNVSRNVGAEKGKGDQILQHE 156

Query: 59  KTQPYPRKSDKSAMSKVKELTITTS----APPNLSCGVTHTSPALVFSAGGYNGNFFHEF 114
           + +PY RK +KS M  + EL + +           C V H  PA+ FS GGY GN +HEF
Sbjct: 157 RIRPYTRKWEKSVMETIDELNLISKRVNFGNSGHGCDVKHEVPAVFFSNGGYTGNVYHEF 216


>gi|123496899|ref|XP_001327065.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121909989|gb|EAY14842.1| hypothetical protein TVAG_411080 [Trichomonas vaginalis G3]
          Length = 445

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 68/293 (23%), Positives = 129/293 (44%), Gaps = 41/293 (13%)

Query: 110 FFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTI 169
            +H  +D  +P++  +++     D  +   D + ++   YA  L +   H   NI     
Sbjct: 159 LWHNLVDFVIPIYRAMHTTNISSDCTIHAFDNDGKYGLFYANALCQNIIHENSNI----- 213

Query: 170 THCFQSVTLGLISHG------RMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT 223
            +C + + +G+   G      ++ +N  + P+ + L+  +  +    N      S  H  
Sbjct: 214 -YCHKRMIIGVPKTGGSQSERKLFLNYDI-PRNE-LIGLRELMLKEANSTGCMPSREH-- 268

Query: 224 KPKLVLVNRNARV-GRTILNLREVKKAAEELGFDVTIFEPEESTSLAD-----SFRFIHS 277
            PK++L+ R  +   R ++N  EV KA  E    V  F    +T L D        F  S
Sbjct: 269 -PKVLLIKRRTKEEKRRLINSDEVSKAIHE----VCPFCEVLNTDLQDFNKMQQVSFTCS 323

Query: 278 CHAMVGVHGAGLTHSLFLRPGS-----VLMQVVPIGTQWLSTV--YFEKPARVLGLEYLE 330
              ++G+HG+GLTH ++  P S      +++++P    +L T   ++ K A + G+EY  
Sbjct: 324 VSLLIGLHGSGLTHLMWQYPSSKEQKTAVIEILP----YLYTCRDWYSKLASMAGVEYFS 379

Query: 331 YKIKQEESSLVEKYG-ANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFR 382
            K  ++  S  EK     +    +     G  W +   +L+ Q+V +DID+F+
Sbjct: 380 LKTLRKNQSRWEKVSDERERSCHSGSEMCGKGWCHD--FLRDQSVIVDIDQFK 430


>gi|407196281|gb|AFT64228.1| secreted O-linked N-acetylglucosamine transferase [Sus scrofa]
          Length = 527

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 58/232 (25%), Positives = 109/232 (46%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D FV L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 245 NMYHHFCD-FVNLYITQHVNNSFS-TDVYVVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +A +    T      
Sbjct: 303 TYDSKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHALHRLNITQQGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + + +  +  D  R  H+   
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVRIVDYKYKELAFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++   G +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCGDE---RCYLDL-ARLRGVHYITWR 469


>gi|170047460|ref|XP_001851238.1| glycosyltransferase [Culex quinquefasciatus]
 gi|167869905|gb|EDS33288.1| glycosyltransferase [Culex quinquefasciatus]
          Length = 551

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 58/255 (22%), Positives = 114/255 (44%), Gaps = 23/255 (9%)

Query: 90  CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD-----QDVILAIADCNDQ 144
           C VT   PA +        N +H F D F+ L+++ +++  D      DV + + + +  
Sbjct: 227 CDVTIERPAFIMKIDA-TINMYHHFCD-FINLYMSQHANLSDPDGFSTDVEMLVWE-SYT 283

Query: 145 WARKYAELLPRFTRHPIININ---NQTITHCFQSVTLGLISHGRMVI----NPTLLPKPK 197
           ++  +AE    FTRHPI ++     Q +  CF+++ L L+   RM+     N  ++   +
Sbjct: 284 YSSPFAEAFKVFTRHPIADLKTYAGQVV--CFKNLVLPLLP--RMIFGLYYNTPIISGCE 339

Query: 198 TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDV 257
               FQ+F  +  +       S    K ++  ++R+ +  R +     +++ +E+  + V
Sbjct: 340 NSGLFQAFSEHVLHRLRIPLKSHTDRKLRITFLSRDTKFRRVLNEHDLLEEISEDERYLV 399

Query: 258 TIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYF 317
                       +  R   +    +G+HGAGLTH LFL   + L ++         T  +
Sbjct: 400 NRVSYSYKMDFREQLRITRNTDIFIGMHGAGLTHLLFLPKWAALFELYHCE----DTNCY 455

Query: 318 EKPARVLGLEYLEYK 332
           +  AR+ G+ YL ++
Sbjct: 456 KDLARLKGVRYLTWE 470


>gi|170055913|ref|XP_001863794.1| glycosyltransferase [Culex quinquefasciatus]
 gi|167875762|gb|EDS39145.1| glycosyltransferase [Culex quinquefasciatus]
          Length = 495

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 57/254 (22%), Positives = 111/254 (43%), Gaps = 21/254 (8%)

Query: 90  CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADC----NDQW 145
           C VT   PA +        N +H F D F+ L+++ +++  D D      +     +  +
Sbjct: 178 CDVTIERPAFIMKIDA-TINMYHHFCD-FINLYMSQHANLSDPDGFSTDVEVLVWESYTY 235

Query: 146 ARKYAELLPRFTRHPIININ---NQTITHCFQSVTLGLISHGRMVI----NPTLLPKPKT 198
           +  +AE    FTRHPI ++     Q +  CF+++ L L+   RM+     N  ++   + 
Sbjct: 236 SSPFAEAFKVFTRHPIADLKTYAGQVV--CFKNLVLPLLP--RMIFGLYYNTPIISGCEN 291

Query: 199 LVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVT 258
              FQ+F  +  +       S    K ++  ++R+ +  R +     +++ +E+  + V 
Sbjct: 292 SGLFQAFSEHVLHRLRIPLKSHTDRKLRITFLSRDTKFRRVLNEHDLLEEISEDERYLVN 351

Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
                      +  R   +    +G+HGAGLTH LFL   + L ++         T  ++
Sbjct: 352 RVSYSYKMDFREQLRITRNTDIFIGMHGAGLTHLLFLPKWAALFELYHCE----DTNCYK 407

Query: 319 KPARVLGLEYLEYK 332
             AR+ G+ YL ++
Sbjct: 408 DLARLKGVRYLTWE 421


>gi|395516462|ref|XP_003762407.1| PREDICTED: uncharacterized glycosyltransferase AER61 [Sarcophilus
           harrisii]
          Length = 539

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 60/253 (23%), Positives = 115/253 (45%), Gaps = 16/253 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ- 165
           N +H F D F+ L+IT  +N+ F     I+     +  +   ++E    FT + +I++  
Sbjct: 244 NMYHHFCD-FINLYITQHVNNSFSTDVNIIMWDTSSYGYGDLFSETWKAFTDYDVIHLKV 302

Query: 166 NQTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT 223
             + T CF+     L+   R  +  N  L+   +    F++F  +       T       
Sbjct: 303 YDSKTVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLYRLNITRDGPQDG 362

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAM 281
           K ++ ++ R+    R ILN  E+  A + +  F+V I + + +     +  R  H+    
Sbjct: 363 KIRVTILARSTEY-RKILNQNELVNALKTVSSFEVRIVDYKYKEIGFLEQLRITHNTDIF 421

Query: 282 VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE---ES 338
           +G+HGAGLTH LFL   +V+ ++     +     Y +  AR+ G+ Y+ +K K +   + 
Sbjct: 422 IGMHGAGLTHLLFLPDWAVVFELYNCEDE---HCYLDL-ARLRGIHYVTWKKKNKVFPQD 477

Query: 339 SLVEKYGANDLVL 351
             +E++   D V+
Sbjct: 478 KRMEEFNFRDRVV 490


>gi|423062505|ref|ZP_17051295.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
 gi|406716413|gb|EKD11564.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
          Length = 805

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 66/288 (22%), Positives = 117/288 (40%), Gaps = 44/288 (15%)

Query: 36  PKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK--------VKELTITTS---- 83
           P T +  S+G T     QPS  +K  P  R  +   +          VK+L+   S    
Sbjct: 484 PTTVSKSSLGFTSHRTQQPSGAVKIFPQARGLNTGFLHTIITANNQVVKDLSSDNSGIAL 543

Query: 84  APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCND 143
           A  NL   + H    + F +     N+FH  ++    L + + S  P   +++      +
Sbjct: 544 AAHNLP-EIIHLEGNIAFVSASCGQNYFHWMVEVIPRLHLVLASGVPVDKIVV------N 596

Query: 144 QWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQ 203
           ++  KY +        P      Q +  CF+ V   +     +++    LP PK   DF 
Sbjct: 597 KFGHKYEDETLAMFDIP----EYQKMFGCFRHVQAEV-----LIVQSRTLPTPKWACDFV 647

Query: 204 SFLANAY-----NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVT 258
             L   +     +   N SS       K+ +   NA + R ++N +E+    + LGF+V 
Sbjct: 648 KDLVLKHPLLEEDNQCNYSS-------KIYISRANAYI-RKVINEQELIDILKPLGFEVV 699

Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
             E   + S+      +H    ++  HGAGLT+ +F  PG+ ++++ P
Sbjct: 700 YLE---NMSVKQQALCLHHAEVVISPHGAGLTNLVFCEPGTKVIELFP 744


>gi|118150874|ref|NP_001071350.1| EGF domain-specific O-linked N-acetylglucosamine transferase
           precursor [Bos taurus]
 gi|158512478|sp|A0JND3.1|EOGT_BOVIN RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
           transferase; AltName: Full=Extracellular O-linked
           N-acetylglucosamine transferase; Flags: Precursor
 gi|117306286|gb|AAI26626.1| Chromosome 3 open reading frame 64 ortholog [Bos taurus]
 gi|296474967|tpg|DAA17082.1| TPA: AER61 glycosyltransferase precursor [Bos taurus]
          Length = 527

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 56/232 (24%), Positives = 107/232 (46%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYVVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 303 TYDAKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKG 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + + +     D  R  H+   
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++   G +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCGDE---RCYLDL-ARLRGVHYITWR 469


>gi|355667377|gb|AER93846.1| glycosyltransferase AER61 [Mustela putorius furo]
          Length = 527

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 68/286 (23%), Positives = 125/286 (43%), Gaps = 18/286 (6%)

Query: 57  RIKTQPYPRKSD-KSAMSKVKELTITTSAP-PNLSCGVTHTSPALVFSAGGYNGNFFHEF 114
           R+ T    RKS  +S  ++++  T     P  +  C V    P           N +H F
Sbjct: 192 RMLTSQGQRKSPLQSWFAELQSYTQLNFRPLEDAKCDVVIEKPTYFMKLDA-GVNMYHHF 250

Query: 115 MDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININNQTITH 171
            D F+ L+IT  IN+ F   DV + + D +   +   +++    FT + +I++       
Sbjct: 251 CD-FINLYITQHINNSFS-TDVYIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLKTYDSKR 308

Query: 172 -CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLV 228
            CF+     L+   R  +  N  L+   +    F++F  +  +    T       K ++ 
Sbjct: 309 VCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNVTQEGPKDGKIRVT 368

Query: 229 LVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHG 286
           ++ R+    R ILN  E+ KA + +  F+V I + + +     D  R  H+    +G+HG
Sbjct: 369 ILARSTEY-RKILNQNELVKALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDIFIGMHG 427

Query: 287 AGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
           AGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 428 AGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIHYVTWR 469


>gi|440910004|gb|ELR59843.1| Putative glycosyltransferase AER61, partial [Bos grunniens mutus]
          Length = 530

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 56/232 (24%), Positives = 107/232 (46%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 248 NMYHHFCD-FINLYITQHVNNSF-STDVYVVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 305

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 306 TYDAKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKG 365

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + + +     D  R  H+   
Sbjct: 366 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDI 424

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++   G +     Y +  AR+ G+ Y+ ++
Sbjct: 425 FIGMHGAGLTHLLFLPDWAAVFELYNCGDE---RCYLDL-ARLRGVHYITWR 472


>gi|86606503|ref|YP_475266.1| hypothetical protein CYA_1852 [Synechococcus sp. JA-3-3Ab]
 gi|86555045|gb|ABD00003.1| TPR repeat protein [Synechococcus sp. JA-3-3Ab]
          Length = 641

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 54/104 (51%), Gaps = 6/104 (5%)

Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
           P+ V ++R A   R ++N  EV +A    GF     E     SLA+    +    A++G+
Sbjct: 501 PRRVYISRRAARWRRVINEAEVLEALRPWGFVPVQLE---VLSLAEQIALMQKAEAVMGI 557

Query: 285 HGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           HGAGLT+  F +PG+V +++ P         YF   A+V+GL Y
Sbjct: 558 HGAGLTNLAFCQPGTVAIEIFPSNA---VLPYFWSLAQVVGLNY 598


>gi|404255179|ref|ZP_10959147.1| capsular polysaccharide biosynthesis protein-like protein
           [Sphingomonas sp. PAMC 26621]
          Length = 398

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 53/96 (55%), Gaps = 5/96 (5%)

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
           T P L+ V+R A+ GRT+ N  +V +  E+ GF  T+ +P  S S  +  R       ++
Sbjct: 266 TGPTLLFVDRAAQHGRTLSNHADVLRLVEQRGF--TVVDPA-SLSFVEQVRLFSGAQVVI 322

Query: 283 GVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
           G  GA +T+++F RP + ++ + P G  W+   Y++
Sbjct: 323 GQMGAAMTNTMFCRPSTTVIYLAPSG--WIEPFYWD 356


>gi|426249299|ref|XP_004018387.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
           transferase [Ovis aries]
          Length = 527

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/232 (24%), Positives = 107/232 (46%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYVVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 303 TYDSKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKG 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + + +     D  R  H+   
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++   G +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCGDE---RCYLDL-ARLRGVHYITWR 469


>gi|156399367|ref|XP_001638473.1| predicted protein [Nematostella vectensis]
 gi|156225594|gb|EDO46410.1| predicted protein [Nematostella vectensis]
          Length = 620

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 78/327 (23%), Positives = 130/327 (39%), Gaps = 37/327 (11%)

Query: 109 NFFHEFMDCFVPLFITI----NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIINI 164
           N  H   D  +PLF T+    NS     D+   +          Y+EL   F+R  ++  
Sbjct: 195 NIMHAIHDDLLPLFHTMKQYSNSGSSQIDLNSRLVFMEGYELGPYSELYQLFSRLQLVIK 254

Query: 165 NNQTITH---CFQSVTLGL------ISHGRMVIN---PTLLPKPKTLVDFQSFLANAYNE 212
           +N T+ +   CFQ+  +GL        +G   I    P +    K + +F  F+      
Sbjct: 255 DNLTVNNTLKCFQNAVVGLSKFTTWYQYGFDQIQGPLPEIQITAKQIYEFTRFVRERLGI 314

Query: 213 NTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSF 272
           N + S   H   P++VL  R+    R ILN +E+  A              E+ SL    
Sbjct: 315 NESVS---HTQSPQVVLCTRHHN--RLILNDQEISNAIITKMNKRVAKVSFETHSLERMI 369

Query: 273 RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLE 330
           R I     ++G+HG+ L  ++FL  GS LM++ P G    +   ++  A + G  L Y +
Sbjct: 370 RIIGRSSGLIGMHGSILVMAMFLPQGSFLMELFPYGVVPWNYRPYKTLAGLPGMNLVYQD 429

Query: 331 YKIKQEESSL--------------VEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKL 376
           +    EE+++              + K    D++                +Y   Q+ K+
Sbjct: 430 WINTNEENTVTHPDRLPAFGGIAHLSKDEQEDIINTKEVPLHYCCNDPYWLYRIYQDTKI 489

Query: 377 DIDRFRIYLKDAYKKAKKLMDKEIKLA 403
           DID F   L  A + + KL  +  + A
Sbjct: 490 DIDSFSASLNTAIEGSHKLFTQGFQAA 516


>gi|123380767|ref|XP_001298478.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121879056|gb|EAX85548.1| hypothetical protein TVAG_527360 [Trichomonas vaginalis G3]
          Length = 560

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 52/244 (21%), Positives = 111/244 (45%), Gaps = 20/244 (8%)

Query: 111 FHEFMDCFVPLFITINSHFP--DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQT 168
           +H   D  +P++ +    +   D+D ++ I +  +    K  + +   T   +INI    
Sbjct: 268 WHSLFDFTLPIWWSKQMFWGHNDEDRVMVIDENKN--TEKGYQFIDILTHKNVINIKLDE 325

Query: 169 ITH---CFQSVTLGLISHGRMVINPTLLPKPKTL------VDFQSFLANAYNE-NTNTSS 218
             H   CF S+ LG +    + + P+  P    L      + F  F  ++ +  N N S 
Sbjct: 326 KYHNKTCFSSIILG-VPKTELEVTPSKWPNGYQLPYEFADIAFHQFREHSISTYNVNNSL 384

Query: 219 SFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTI-FEPEESTSLADSFRFIHS 277
               TKP+++ +NR+    R I+N +++    +E   DV I +    + ++ +      +
Sbjct: 385 CGKTTKPRVIFINRDTN-KRYIINSQDLINKMKEWAPDVDIDYVVYTNQTIGEQIAQFCN 443

Query: 278 CHAMVGVHGAGLTHSLFLRPG-SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
              ++ +HG+ L+H L++R   S ++++ P         ++E+ A+ +G++Y  +  K  
Sbjct: 444 ASLIISIHGSALSHMLWMRRNRSAIIEIFPYNYD--CRDWYEQVAKGMGIKYFFWINKIP 501

Query: 337 ESSL 340
           E+S 
Sbjct: 502 ENSF 505


>gi|432093450|gb|ELK25518.1| hypothetical protein MDA_GLEAN10017367 [Myotis davidii]
          Length = 523

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 56/232 (24%), Positives = 107/232 (46%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L++T  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 247 NMYHHFCD-FLNLYLTQHVNNSF-STDVYVVMWDTSTYGYGDLFSDTWKAFTDYDVIHLK 304

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+    +   F++F  +  +    T      
Sbjct: 305 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCHSTGLFRAFSQHVLHRLNVTQEGPKD 364

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + + +     D  R  H+   
Sbjct: 365 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDI 423

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++   G Q     Y +  AR+ G+ Y+ ++
Sbjct: 424 FIGMHGAGLTHLLFLPDWAAVFELYNCGDQ---RCYLDL-ARLRGVHYVTWR 471


>gi|74096367|ref|NP_001027841.1| glycosyltransferase aer61 precursor [Ciona intestinalis]
 gi|56550354|emb|CAI30563.1| glycosyltransferase [Ciona intestinalis]
          Length = 512

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 76/318 (23%), Positives = 132/318 (41%), Gaps = 42/318 (13%)

Query: 90  CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQWAR 147
           C V    P L F    + GN +H F D F  +++T   N+ +   DV +   D + ++  
Sbjct: 203 CDVIVDKPML-FVQLDFGGNMYHHFCD-FFNIYLTQMANNSWFGTDVQIVRWDLSYRYGE 260

Query: 148 KYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLA 207
            + E    FT    +++           +   + S     I       P  +    S L 
Sbjct: 261 VFRESWDAFTNRDHVSLREYMGKRV--CIADAMFSFLPRTILGLFYNTPVEVNCRGSSLF 318

Query: 208 NAYNEN----------TNTSSSFHHTKPKLVLVNRNA----RVGRTILNLREVKKAAEEL 253
            A++E+            TS      K ++ L+ R +    ++ R ILN+ E+  A  ++
Sbjct: 319 KAFSEHFLHRMGITSHPPTSLPNQPNKIRVTLLERGSNPRYKIYRRILNVDELGNAIRKI 378

Query: 254 -GFDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQW 311
            G +V + E +    S  D     H+   M+G+HGAGLTH LFL P +V  ++   G   
Sbjct: 379 PGLEVNVVEYDWRKMSFKDQLSMTHNSDIMIGMHGAGLTHFLFLPPWAVAFELYNCGD-- 436

Query: 312 LSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN------- 364
               Y + P R+ G+ Y+ +    E+ S +E +  N+    +P       + N       
Sbjct: 437 -IRCYRDLP-RLRGVRYITW----EDDSKLEAFDQNE----HPHYGNQPKFWNYKFDVPE 486

Query: 365 -MRVYLKTQNVKLDIDRF 381
            +R+ LK ++  L+ DRF
Sbjct: 487 FVRLVLKARSWVLEHDRF 504


>gi|404448828|ref|ZP_11013820.1| capsular polysaccharide biosynthesis protein [Indibacter
           alkaliphilus LW1]
 gi|403765552|gb|EJZ26430.1| capsular polysaccharide biosynthesis protein [Indibacter
           alkaliphilus LW1]
          Length = 337

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 50/101 (49%), Gaps = 3/101 (2%)

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           + ++R     R +LN  EV+    + G+++ + E     SL D    I  C  + G+HGA
Sbjct: 206 IYISRKDAQYRKVLNEPEVESVFSDFGYEIQVME---KFSLKDQVNMIRQCSHLAGLHGA 262

Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           GL++ +F+  G  +++   +G  W  +  F   A  LG EY
Sbjct: 263 GLSNMIFMPEGGKVLEFRNMGDSWSLSQSFFAMASDLGHEY 303


>gi|348575414|ref|XP_003473484.1| PREDICTED: uncharacterized glycosyltransferase AER61-like isoform 1
           [Cavia porcellus]
          Length = 527

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 56/232 (24%), Positives = 108/232 (46%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVHIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T +    
Sbjct: 303 TYDSKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQAGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + + +     D  R  H+   
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   +V+ ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAVVFELYNCEDE---HCYLDL-ARLRGIHYITWR 469


>gi|157117440|ref|XP_001658768.1| glycosyltransferase [Aedes aegypti]
 gi|108876053|gb|EAT40278.1| AAEL007982-PA [Aedes aegypti]
          Length = 533

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 56/254 (22%), Positives = 111/254 (43%), Gaps = 19/254 (7%)

Query: 89  SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD-----QDVILAIADCND 143
           +C VT   PA +        N +H F D F+ L+ +++++  D      DV + + + + 
Sbjct: 219 TCDVTIDRPAFIMKIDA-TINMYHHFCD-FINLYGSLHANLSDPYGFSTDVQIMVWE-SY 275

Query: 144 QWARKYAELLPRFTRHPIININNQT-ITHCFQSVTLGLISHGRMVI----NPTLLPKPKT 198
            +   +AE    FT+HPI ++        CF+++ L L+   RM+     N  ++   + 
Sbjct: 276 TYDSPFAETFKVFTKHPIADLKTYAGKVVCFKNLVLPLLP--RMIFGLYYNTPIITGCEN 333

Query: 199 LVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVT 258
              FQ+F  +  +       S    K ++  ++R  +  R +     +++ +E   + V 
Sbjct: 334 SGFFQAFSEHVLHRLKVPQRSRSDRKLRITFLSRQTKFRRVLNENALLEEISENEDYLVN 393

Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
                  T   +  +   +    +G+HGAGLTH LFL   +VL ++            ++
Sbjct: 394 QASFTYKTDFREQLKITRNTDIFIGMHGAGLTHLLFLPKWAVLFELYHCE----DPNCYK 449

Query: 319 KPARVLGLEYLEYK 332
             AR+ G+ YL ++
Sbjct: 450 DLARLKGVRYLSWE 463


>gi|354465590|ref|XP_003495262.1| PREDICTED: uncharacterized glycosyltransferase AER61-like
           [Cricetulus griseus]
 gi|344248916|gb|EGW05020.1| Uncharacterized glycosyltransferase AER61 [Cricetulus griseus]
 gi|443496408|gb|AGC92969.1| EGF-O-GlcNAc transferase [Cricetulus griseus]
          Length = 527

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 245 NMYHHFCD-FLNLYITQHVNNSFS-TDVYIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 303 TYDFKKVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + +       D  R  H+   
Sbjct: 363 GKVRVTILARSTEY-RKILNQNELVNALKTVSTFEVRIVDYKYRELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 469


>gi|158261735|dbj|BAF83045.1| unnamed protein product [Homo sapiens]
          Length = 426

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 144 NMYHHFCD-FINLYITQHVNNSFS-TDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLK 201

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 202 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGPKD 261

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + +       D  R  H+   
Sbjct: 262 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 320

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 321 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLD-LARLRGVHYITWR 368


>gi|345569145|gb|EGX52013.1| hypothetical protein AOL_s00043g403 [Arthrobotrys oligospora ATCC
           24927]
          Length = 637

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 44/153 (28%), Positives = 76/153 (49%), Gaps = 16/153 (10%)

Query: 213 NTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSF 272
           ++N  +     K K++   +N ++ R    ++ ++K   ++  ++T  E  + T   +  
Sbjct: 474 DSNPKTVEEKIKIKIIGRQKNRKILRLFDYVKILQKTYPDVDIEITKLEKLDVT---EQL 530

Query: 273 RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVY--FEKPARVLGLEYLE 330
           R I     ++GV GAGLTH+LFLR G+ L+++    TQ    VY  F   AR++GL Y +
Sbjct: 531 RMIRKTDILIGVTGAGLTHTLFLRKGAALIEL----TQPEPFVYFGFGNLARMIGLGYFQ 586

Query: 331 YKIKQEESSLVEKYGAN---DLVLKNPQAFAGA 360
            + ++ E     K G N   D VL +   F  A
Sbjct: 587 IQAEKREG----KSGINWQEDNVLVDEHTFIDA 615


>gi|222619896|gb|EEE56028.1| hypothetical protein OsJ_04812 [Oryza sativa Japonica Group]
          Length = 153

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 30/71 (42%), Positives = 41/71 (57%), Gaps = 2/71 (2%)

Query: 324 LGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFA-GANWSNM-RVYLKTQNVKLDIDRF 381
           +GL Y +Y  +  ESSL  KYG  D+V+ +P+A      W  + RVYL  QNV LD+ RF
Sbjct: 1   MGLHYEQYDAEGHESSLSRKYGLRDVVVSDPEAAKRDKGWGFVARVYLGGQNVTLDLSRF 60

Query: 382 RIYLKDAYKKA 392
           R  L   + +A
Sbjct: 61  RHTLTRLHARA 71


>gi|296225632|ref|XP_002758583.1| PREDICTED: uncharacterized glycosyltransferase AER61 isoform 1
           [Callithrix jacchus]
          Length = 527

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 56/232 (24%), Positives = 106/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + II++ 
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTDYDIIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + + +     D  R  H+   
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWQ 469


>gi|86606467|ref|YP_475230.1| hypothetical protein CYA_1816 [Synechococcus sp. JA-3-3Ab]
 gi|86555009|gb|ABC99967.1| tetratricopeptide repeat protein [Synechococcus sp. JA-3-3Ab]
          Length = 720

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 62/136 (45%), Gaps = 6/136 (4%)

Query: 214 TNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
              S S    +P+ + ++R +   R ++N  EV       GF     E   + SL +   
Sbjct: 568 AQPSFSIDPLQPRRIYISRRSARWRRVINEAEVLACLHPWGFVPVQME---TLSLQEQIA 624

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKI 333
            +    A++G+HGAGLT+  F +PG+ +++++P         YF   A+V GL Y     
Sbjct: 625 LMQGAEAVIGIHGAGLTNLAFCQPGTTVIEILPSNA---VLPYFWSIAQVAGLRYFPLVA 681

Query: 334 KQEESSLVEKYGANDL 349
              + +LV    + DL
Sbjct: 682 PVCDPALVALLSSPDL 697


>gi|57114206|ref|NP_001009171.1| EGF domain-specific O-linked N-acetylglucosamine transferase
           precursor [Pan troglodytes]
 gi|75040721|sp|Q5NDL1.1|EOGT_PANTR RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
           transferase; AltName: Full=Extracellular O-linked
           N-acetylglucosamine transferase; Flags: Precursor
 gi|56550368|emb|CAI30570.1| glycosyltransferase [Pan troglodytes]
          Length = 527

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + +       D  R  H+   
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 469


>gi|74708096|sp|Q5NDL2.1|EOGT_HUMAN RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
           transferase; AltName: Full=Extracellular O-linked
           N-acetylglucosamine transferase; Flags: Precursor
 gi|56550366|emb|CAI30569.1| glycosyltransferase [Homo sapiens]
 gi|443496410|gb|AGC92970.1| EGF-O-GlcNAc transferase [Homo sapiens]
          Length = 527

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + +       D  R  H+   
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 469


>gi|397480750|ref|XP_003811634.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
           transferase isoform 1 [Pan paniscus]
          Length = 527

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTNYDVIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + +       D  R  H+   
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWQ 469


>gi|409992949|ref|ZP_11276112.1| hypothetical protein APPUASWS_17665 [Arthrospira platensis str.
           Paraca]
 gi|291567436|dbj|BAI89708.1| TPR domain protein [Arthrospira platensis NIES-39]
 gi|409936195|gb|EKN77696.1| hypothetical protein APPUASWS_17665 [Arthrospira platensis str.
           Paraca]
          Length = 883

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 68/291 (23%), Positives = 119/291 (40%), Gaps = 44/291 (15%)

Query: 36  PKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK--------VKELTITTS---- 83
           P T +   +G T     QP+  +K  P  R  +   +          VK+L+   S    
Sbjct: 562 PTTVSNNLLGFTSHRTQQPASSVKILPQARGLNTGFLHTIITANNQVVKDLSSDNSGIAL 621

Query: 84  APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCND 143
           A  NL   + H    + F +     N+FH  ++    L + + S  P   +++      +
Sbjct: 622 AAHNLPETI-HLKGNIAFVSAYCGQNYFHWMVEVIPRLHLVLASGLPIDKIVV------N 674

Query: 144 QWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQ 203
           ++  KY +        P     NQ +  CF+ V   ++    MV + TL   PK   DF 
Sbjct: 675 KFGHKYEDETLAMFDIP----ENQKMFGCFRHVEADIL----MVPSRTL-STPKWACDFL 725

Query: 204 SFLANAY-----NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVT 258
             L   +     +   N SS       K+ +   NA + R ++N +E+    + LGF V 
Sbjct: 726 KDLVLKHPLLEEDNRCNYSS-------KIYISRANAYI-RQVINEQELMDILKPLGFQVV 777

Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
             E   + S+ +    +H    ++  HGAGLT+ +F  PG+ ++++ P G 
Sbjct: 778 YLE---NMSVKEQALCLHHAEVVISPHGAGLTNLVFCEPGTKVIELFPPGA 825


>gi|338714518|ref|XP_003363099.1| PREDICTED: uncharacterized glycosyltransferase AER61-like [Equus
           caballus]
          Length = 527

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 106/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVHIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + + +     D  R  H+   
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKQLGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 469


>gi|351698200|gb|EHB01119.1| hypothetical protein GW7_02761 [Heterocephalus glaber]
          Length = 527

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +       T      
Sbjct: 303 TYDSKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLYRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + + +     D  R  H+   
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIHYITWR 469


>gi|410951550|ref|XP_003982458.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
           transferase [Felis catus]
          Length = 527

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 54/232 (23%), Positives = 106/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSFS-TDVYIVMWDTSSYGYGDLFSDTWRAFTDYDVIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 303 TYDSKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            + ++ ++ R+    R ILN  E+  A + +   +V I + + +     D  R  H+   
Sbjct: 363 GRIRVTILARSTEY-RKILNQNELVNALKTVSTLEVRIVDYKYKELGFLDQLRISHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++   G +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCGDE---RCYLDL-ARLRGIHYITWR 469


>gi|322711774|gb|EFZ03347.1| DUF563 domain protein [Metarhizium anisopliae ARSEF 23]
          Length = 477

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 54/101 (53%), Gaps = 10/101 (9%)

Query: 238 RTILNLREV----KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSL 293
           R ++N  E+    K+A   L  ++  F      S A+  R +     ++GVHGAGLTH++
Sbjct: 338 RKLINETELIESAKRAVPHLNIEIVDFA---GFSFAEQLRIVRETDLLIGVHGAGLTHAM 394

Query: 294 FLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIK 334
           FL PGS +++++P      + + F   A++LG +Y    +K
Sbjct: 395 FLPPGSAVVEILPRD---FAHMGFRNLAQLLGHQYHRTHVK 432


>gi|403297322|ref|XP_003939521.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
           transferase [Saimiri boliviensis boliviensis]
          Length = 527

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + +       D  R  H+   
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWQ 469


>gi|449474080|ref|XP_002192616.2| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
           transferase [Taeniopygia guttata]
          Length = 527

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 57/232 (24%), Positives = 107/232 (46%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D FV L+IT  IN+ F   DV + + D +   +   ++E    FT + II++ 
Sbjct: 245 NMYHHFCD-FVNLYITQHINNSFS-TDVNIVMWDTSSYGYGDLFSETWKAFTDYEIIHLK 302

Query: 166 N-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
              +   CF+     L+   R  +  N  L+        F++F  +  +    T      
Sbjct: 303 TFDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCHGTGLFRAFSQHVLHRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +   +V + + + +    ++  R  H+   
Sbjct: 363 GKIRVTILARSTDY-RKILNQNELVNALKTVSTLEVRVVDYKYKELEFSEQLRITHNSDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   +V+ ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGIHGAGLTHLLFLPDWAVVFELYNCEDE---RCYLD-LARLRGIHYITWR 469


>gi|119585869|gb|EAW65465.1| hCG1777401 [Homo sapiens]
          Length = 290

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 8   NMYHHFCD-FINLYITQHVNNSFS-TDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLK 65

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 66  TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGPKD 125

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + +       D  R  H+   
Sbjct: 126 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 184

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 185 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLD-LARLRGVHYITWR 232


>gi|86607926|ref|YP_476688.1| hypothetical protein CYB_0430 [Synechococcus sp. JA-2-3B'a(2-13)]
 gi|86556468|gb|ABD01425.1| tetratricopeptide repeat protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
          Length = 718

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 61/136 (44%), Gaps = 6/136 (4%)

Query: 214 TNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
              S S    +P+ + ++R +   R ++N  EV       GF     E   + SL +   
Sbjct: 566 AQPSLSIDPLQPRRIYISRRSARWRRVINEAEVLACLRPWGFVPVQME---TLSLPEQIA 622

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKI 333
            +    A++G+HGAGLT+  F  PG+ +++++P         YF   A+V GL Y     
Sbjct: 623 LMQGAEAVIGIHGAGLTNLAFCPPGTTVIEILPSNA---VLPYFWSIAQVAGLRYFPLVA 679

Query: 334 KQEESSLVEKYGANDL 349
              + +LV    + DL
Sbjct: 680 PVCDPALVALLSSPDL 695


>gi|291393983|ref|XP_002713346.1| PREDICTED: AER61 glycosyltransferase isoform 1 [Oryctolagus
           cuniculus]
          Length = 524

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 104/232 (44%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L++T  +N+ F   DV + + D +   +   ++E    FT + +I++ 
Sbjct: 242 NMYHHFCD-FLNLYLTQHVNNSFS-TDVYIVMWDTSTYGYGDLFSETWKAFTDYEVIHLK 299

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+        F++F  +  +           
Sbjct: 300 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCHGTGLFRAFSQHVLHRLNIAQEGPQD 359

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  FDV I + + +     D  R  H+   
Sbjct: 360 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFDVQIVDYKYKELGFLDQLRITHNTDI 418

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 419 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 466


>gi|426341148|ref|XP_004035914.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
           transferase [Gorilla gorilla gorilla]
          Length = 527

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 100/232 (43%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDV-TIFEPEESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  EV      +  F V  I          D  R  H+   
Sbjct: 363 GKIRVTILARSTEY-RKILNQNEVMNCLGTDFWFKVHVILYVHRELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 469


>gi|407426125|gb|EKF39586.1| hypothetical protein MOQ_000183 [Trypanosoma cruzi marinkellei]
          Length = 578

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 67/131 (51%), Gaps = 14/131 (10%)

Query: 209 AYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEES 265
           AYN N  T +    T   +P+L+L+NRN R    I + + V   AE +GF+V +   E+ 
Sbjct: 382 AYN-NMETGNKMQETFVYRPRLLLINRNYR---EIHDYQAVAALAERIGFNVQVVYFEK- 436

Query: 266 TSLADSFRFIHSCHAMVGVHGAGLTHSLFL----RPGS-VLMQVVPIGTQWLSTVYFEKP 320
            SL +          M+G+HG GLTH L++    RP    L++++P G       +++  
Sbjct: 437 MSLEEQVHVSRHADVMMGMHGMGLTHVLWMDGRRRPRCRALLELMPFGCPQKLIHFYKTF 496

Query: 321 ARVLGLEYLEY 331
           +  +G+ Y EY
Sbjct: 497 SDAIGIHY-EY 506


>gi|209523074|ref|ZP_03271631.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
 gi|209496661|gb|EDZ96959.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
          Length = 891

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 44/213 (20%), Positives = 91/213 (42%), Gaps = 20/213 (9%)

Query: 94  HTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELL 153
           H    + F +  Y  N+FH  ++    L + + S  P   +++      +++  KY +  
Sbjct: 637 HLEGNIAFVSALYGHNYFHWMVEVIPRLHLVLASGVPVDKIVV------NKFGHKYEDET 690

Query: 154 PRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNEN 213
                 P     +Q +  CF+ V   +     +++    LP PK   DF   L   +   
Sbjct: 691 LAMFDIP----EHQKMFGCFRHVQAEV-----LIVPCRTLPTPKWACDFVKDLVLKHPSL 741

Query: 214 TNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
               +  +++    + ++R     R ++N +E+    + LGF+V   E   + S+     
Sbjct: 742 LEEDNRCNYSSK--IYISRANSYYRKVINEQELMDILKPLGFEVVYLE---NMSVKQQAL 796

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            +H    ++  HGAGLT+ +F  PG+ ++++ P
Sbjct: 797 CLHHAEVVISPHGAGLTNLVFCEPGTKVIELFP 829


>gi|109036798|ref|XP_001088832.1| PREDICTED: uncharacterized glycosyltransferase AER61-like [Macaca
           mulatta]
 gi|355559515|gb|EHH16243.1| hypothetical protein EGK_11501 [Macaca mulatta]
          Length = 527

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L++T  +N+ F   DV + + D +   +   +++    FT + II++ 
Sbjct: 245 NMYHHFCD-FINLYLTQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTDYDIIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + +       D  R  H+   
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVPTFEVQIVDYKYRELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIHYITWQ 469


>gi|58036800|emb|CAE45897.2| hypothetical protein [Homo sapiens]
          Length = 459

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 104/232 (44%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N  H F D F+ L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 177 NMCHHFCD-FINLYITQHVNNSFS-TDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLK 234

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 235 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGPKD 294

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + +       D  R  H+   
Sbjct: 295 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 353

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 354 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLD-LARLRGVHYITWR 401


>gi|57163723|ref|NP_001009187.1| EGF domain-specific O-linked N-acetylglucosamine transferase
           precursor [Canis lupus familiaris]
 gi|75040722|sp|Q5NDL9.1|EOGT_CANFA RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
           transferase; AltName: Full=Extracellular O-linked
           N-acetylglucosamine transferase; Flags: Precursor
 gi|56550352|emb|CAI30562.1| glycosyltransferase [Canis lupus familiaris]
          Length = 527

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D FV L+IT  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 245 NMYHHFCD-FVNLYITQHVNNSFS-TDVYIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +   +V I + + +     D  R  H+   
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTLEVQIVDYKYKELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 469


>gi|355746583|gb|EHH51197.1| hypothetical protein EGM_10532 [Macaca fascicularis]
          Length = 527

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L++T  +N+ F   DV + + D +   +   +++    FT + II++ 
Sbjct: 245 NMYHHFCD-FINLYLTQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTDYDIIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + +       D  R  H+   
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVPTFEVQIVDYKYRELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIHYITWQ 469


>gi|395824615|ref|XP_003785558.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
           transferase [Otolemur garnettii]
          Length = 523

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 54/235 (22%), Positives = 105/235 (44%), Gaps = 13/235 (5%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININN 166
           N +H F D F+ L+IT  +N+ F  +  I+     +  +   +++    FT + +I++  
Sbjct: 241 NMYHHFCD-FINLYITQHLNNSFSTEVHIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLKT 299

Query: 167 QTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT 223
                 CF+     L+   R  +  N  L+   +    F++F  +  +    T       
Sbjct: 300 YDSKRVCFREAIFSLLPRMRYGLFYNTPLISGCQKTGLFRAFSQHVLHRLNITQEGPKDG 359

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAM 281
           K ++ ++ R+    R ILN  E+  A + +  F+V I + + +     D  R  H+    
Sbjct: 360 KIRVTILARSTEY-RKILNQNELVNALKTVSMFEVRIVDYKYKELGFLDQLRITHNTDIF 418

Query: 282 VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
           +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y  ++ K +
Sbjct: 419 IGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIHYTTWRRKNK 469


>gi|449278758|gb|EMC86527.1| Putative glycosyltransferase AER61 [Columba livia]
          Length = 527

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 56/232 (24%), Positives = 108/232 (46%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D FV L+IT  IN+ F   DV + + D +   +   ++E    FT + I+++ 
Sbjct: 245 NMYHHFCD-FVNLYITQHINNSFS-TDVNIVMWDTSSYGYGDLFSETWKAFTDYEIMHLK 302

Query: 166 N-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
              +   CF+     L+   R  +  N  L+    +   F++F  +  +    T      
Sbjct: 303 TFDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCHSTGLFRAFSQHVLHRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +   +V + + + +    ++  R  H+   
Sbjct: 363 GKIRVTILARSTDY-RKILNQNELVNALKTVSTLEVKVVDYKYKELEFSEQLRITHNSDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   +V+ ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAVVFELYNCEDE---RCYLD-LARLRGIHYITWR 469


>gi|326928104|ref|XP_003210223.1| PREDICTED: uncharacterized glycosyltransferase AER61-like
           [Meleagris gallopavo]
          Length = 530

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 57/232 (24%), Positives = 107/232 (46%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D FV L+IT  IN+ F   DV + + D +   +   ++E    FT + II++ 
Sbjct: 248 NMYHHFCD-FVNLYITQHINNSFS-TDVNIVMWDTSSYGYGDLFSETWKAFTDYDIIHLK 305

Query: 166 N-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
              +   CF+     L+   R  +  N  L+        F++F  +  +    T      
Sbjct: 306 TFDSKRVCFREAVFSLLPRMRYGLFYNTPLISGCHGTGLFRAFSQHVLHRLNITQEGPKD 365

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +   +V + + + +    ++  R  H+   
Sbjct: 366 GKIRVTILARSTDY-RKILNQNELVNALKTVSTLEVKVVDYKYKELEFSEQLRITHNSDI 424

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   +V+ ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 425 FIGMHGAGLTHLLFLPDWAVVFELYNCEDE---RCYLD-LARLRGIHYITWR 472


>gi|218189757|gb|EEC72184.1| hypothetical protein OsI_05258 [Oryza sativa Indica Group]
          Length = 81

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 30/74 (40%), Positives = 43/74 (58%), Gaps = 2/74 (2%)

Query: 324 LGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFA-GANWSNM-RVYLKTQNVKLDIDRF 381
           +GL Y +Y  +  ESSL  KYG  D+V+ +P+A      W  + RVYL  QNV LD+ RF
Sbjct: 1   MGLHYEQYDAEGHESSLSRKYGLRDVVVSDPEAAKRDKGWGFVARVYLGGQNVTLDLSRF 60

Query: 382 RIYLKDAYKKAKKL 395
           R  L   + +A ++
Sbjct: 61  RHTLTRLHARALRV 74


>gi|453086759|gb|EMF14801.1| glycosyltransferase family 61 protein [Mycosphaerella populorum
           SO2202]
          Length = 417

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 46/78 (58%), Gaps = 4/78 (5%)

Query: 275 IHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLE-YKI 333
           + S   ++GVHGAGLTH+LFL P S ++++ P G ++     F   A+ LG  YL+ Y  
Sbjct: 325 LRSTDILLGVHGAGLTHTLFLPPKSTVVEIQPPGLRYFG---FAALAKFLGHRYLQVYGE 381

Query: 334 KQEESSLVEKYGANDLVL 351
           ++E   +   + A+D+ L
Sbjct: 382 EREYEGMTHNWQADDVFL 399


>gi|322692964|gb|EFY84844.1| DUF563 domain protein [Metarhizium acridum CQMa 102]
          Length = 477

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 54/115 (46%), Gaps = 6/115 (5%)

Query: 243 LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
           +  V+KA   L  ++  F      S A+  + +     ++GVHGAGLTH++FL PGS ++
Sbjct: 347 IESVQKAIPHLDLEIVDFA---GFSFAEQLKIVRETDLLIGVHGAGLTHTMFLPPGSAVV 403

Query: 303 QVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
           +++P      + + F   A++LG  Y     +    +  E     D V    Q F
Sbjct: 404 EILPGD---FAHMGFRNLAQILGHRYYRTHAEMHGDASGESQWQFDAVEMEEQQF 455


>gi|71897159|ref|NP_001026580.1| EGF domain-specific O-linked N-acetylglucosamine transferase
           [Gallus gallus]
 gi|56550364|emb|CAI30568.1| glycosyltransferase [Gallus gallus]
          Length = 530

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 57/232 (24%), Positives = 106/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D FV L+IT  IN+ F   DV + + D +   +   ++E    FT + II + 
Sbjct: 248 NMYHHFCD-FVNLYITQHINNSFS-TDVNIVMWDTSSYGYGDLFSETWKAFTDYDIIYLK 305

Query: 166 N-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
              +   CF+     L+   R  +  N  L+        F++F  +  +    T      
Sbjct: 306 TFDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCHGTGLFRAFSQHVLHRLNITQEGPKD 365

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +   +V + + + +    ++  R  H+   
Sbjct: 366 GKIRVTILARSTDY-RKILNQNELVNALKTVSTLEVKVVDYKYKELEFSEQLRITHNSDI 424

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   +V+ ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 425 FIGMHGAGLTHLLFLPDWAVVFELYNCEDE---RCYLD-LARLRGIHYITWR 472


>gi|148666934|gb|EDK99350.1| RIKEN cDNA A130022J15, isoform CRA_b [Mus musculus]
          Length = 533

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 53/232 (22%), Positives = 106/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L++T  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 251 NMYHHFCD-FLNLYLTQHVNNSFS-TDVYIVMWDTSTYGYGDLFSDTWKAFTDYDVIHLK 308

Query: 166 N-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
              +   CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 309 TYDSKKVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 368

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V + + +       D  R  H+   
Sbjct: 369 GKVRVTILARSTEY-RKILNQDELVNALKTVSTFEVRVVDYKYRELGFLDQLRITHNTDI 427

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 428 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIHYITWR 475


>gi|194376580|dbj|BAG57436.1| unnamed protein product [Homo sapiens]
          Length = 290

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 104/232 (44%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   + +    FT + +I++ 
Sbjct: 8   NMYHHFCD-FINLYITQHVNNSFS-TDVYIVMWDTSSYGYGDLFFDTWNAFTDYDVIHLK 65

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 66  TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGPKD 125

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + +       D  R  H+   
Sbjct: 126 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 184

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 185 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLD-LARLRGVHYITWR 232


>gi|158563925|sp|Q5NDL3.2|EOGT_CHICK RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
           transferase; AltName: Full=Extracellular O-linked
           N-acetylglucosamine transferase; Flags: Precursor
          Length = 535

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 57/232 (24%), Positives = 106/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D FV L+IT  IN+ F   DV + + D +   +   ++E    FT + II + 
Sbjct: 253 NMYHHFCD-FVNLYITQHINNSFS-TDVNIVMWDTSSYGYGDLFSETWKAFTDYDIIYLK 310

Query: 166 N-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
              +   CF+     L+   R  +  N  L+        F++F  +  +    T      
Sbjct: 311 TFDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCHGTGLFRAFSQHVLHRLNITQEGPKD 370

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +   +V + + + +    ++  R  H+   
Sbjct: 371 GKIRVTILARSTDY-RKILNQNELVNALKTVSTLEVKVVDYKYKELEFSEQLRITHNSDI 429

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   +V+ ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 430 FIGMHGAGLTHLLFLPDWAVVFELYNCEDE---RCYLD-LARLRGIHYITWR 477


>gi|30424992|ref|NP_780522.1| EGF domain-specific O-linked N-acetylglucosamine transferase
           precursor [Mus musculus]
 gi|81898120|sp|Q8BYW9.1|EOGT_MOUSE RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
           transferase; AltName: Full=Extracellular O-linked
           N-acetylglucosamine transferase; Flags: Precursor
 gi|26332182|dbj|BAC29821.1| unnamed protein product [Mus musculus]
 gi|29294677|gb|AAH48939.1| RIKEN cDNA A130022J15 gene [Mus musculus]
 gi|148666933|gb|EDK99349.1| RIKEN cDNA A130022J15, isoform CRA_a [Mus musculus]
          Length = 527

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 53/232 (22%), Positives = 106/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L++T  +N+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 245 NMYHHFCD-FLNLYLTQHVNNSFS-TDVYIVMWDTSTYGYGDLFSDTWKAFTDYDVIHLK 302

Query: 166 N-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
              +   CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 303 TYDSKKVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V + + +       D  R  H+   
Sbjct: 363 GKVRVTILARSTEY-RKILNQDELVNALKTVSTFEVRVVDYKYRELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIHYITWR 469


>gi|196015587|ref|XP_002117650.1| hypothetical protein TRIADDRAFT_32818 [Trichoplax adhaerens]
 gi|190579819|gb|EDV19908.1| hypothetical protein TRIADDRAFT_32818 [Trichoplax adhaerens]
          Length = 484

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 57/244 (23%), Positives = 109/244 (44%), Gaps = 23/244 (9%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIINI-- 164
           N FH F D FV L++T  +N  F     I+     +  +   +      F+RHP+  +  
Sbjct: 196 NMFHHFCD-FVNLYVTQHVNGSFTTDINIVMWDTSSMNYGDFFTLTWKAFSRHPVKRLSD 254

Query: 165 -NNQTITHCFQSVTLGL---ISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSF 220
            +N+ +  CF+     L   + +G +  N  L+P        ++F  +  +    T+  +
Sbjct: 255 YDNKRV--CFKDAIFTLPPRMFYG-LYYNMPLIPGCYNSGLMKAFSEHVVHRLNITTKPY 311

Query: 221 HHTKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPEESTSLADSFRFIHSCH 279
              + ++ L++R+ +  R ILN  ++  A + +  +DVT  +              H+  
Sbjct: 312 DPNRYRITLLSRSTKY-RRILNEDKLIGALKTIWNYDVTRVDYTYQMPFEKQLEITHNSD 370

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYL-----EYKIK 334
             +G+HG+GLTH LF     V  ++     +     Y++  AR+ G++YL     +Y I+
Sbjct: 371 IFIGMHGSGLTHMLFQPHWGVAFELYNCEDE---GCYYD-LARLRGVKYLTWEKKKYLIQ 426

Query: 335 QEES 338
           Q+E 
Sbjct: 427 QDEG 430


>gi|441665604|ref|XP_004091823.1| PREDICTED: LOW QUALITY PROTEIN: EGF domain-specific O-linked
           N-acetylglucosamine transferase [Nomascus leucogenys]
          Length = 527

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 104/232 (44%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +++     T + II++ 
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAXTDYDIIHLK 302

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  CF+     L+   R  +  N  L+   +    F++F  +  +    T      
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V I + +       D  R  H+   
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 469


>gi|327266162|ref|XP_003217875.1| PREDICTED: uncharacterized glycosyltransferase AER61-like [Anolis
           carolinensis]
          Length = 626

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 58/240 (24%), Positives = 105/240 (43%), Gaps = 23/240 (9%)

Query: 109 NFFHEFMDCFVPLFIT---INSHFPDQDVILAIADCNDQWARKYAELLPR----FTRHPI 161
           N +H F D F  L+IT   INS   D ++++      D  A  Y +L       FT + I
Sbjct: 344 NMYHHFCD-FANLYITQHMINSFSTDVNIVMW-----DTSAYGYGDLFSETWKAFTDYEI 397

Query: 162 ININNQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSS 218
           +++ +      CF+     L+   R  +  N  L+        F++F  +  +    T  
Sbjct: 398 VHLKSYDSKRVCFKEAVFTLLPRMRYGLFYNTPLISGCHGTGLFRAFSQHVLHRLNVTQE 457

Query: 219 SFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIH 276
                K ++ ++ R+    R ILN  E+  A + L  F+V I   + +     +  +   
Sbjct: 458 GPKDGKIRVTILARSTEY-RKILNQNELANALKTLSLFEVQIVNYKYKELDFKEQLKITQ 516

Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
           +    +G+HGAGLTH LFL   +V+ ++     +     Y +  AR+ G+ Y+ ++ K +
Sbjct: 517 NSDIFIGMHGAGLTHLLFLPDWAVIFELYNCEDE---RCYLDL-ARLRGVHYITWEKKDK 572


>gi|300175586|emb|CBK20897.2| unnamed protein product [Blastocystis hominis]
          Length = 606

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 45/82 (54%), Gaps = 4/82 (4%)

Query: 237 GRTILN----LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHS 292
           GR ++N    L+E+++ A E GF++ +F   +  +L D F  +     ++G HG    + 
Sbjct: 478 GRDVVNENALLKELEQWASEKGFELVVFTASDYKTLDDLFMLLADVDVVLGPHGGAFYNM 537

Query: 293 LFLRPGSVLMQVVPIGTQWLST 314
           LF+R G  +++ VP    +LST
Sbjct: 538 LFMRRGITVIEFVPDSRMFLST 559


>gi|402859641|ref|XP_003894256.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
           transferase [Papio anubis]
          Length = 535

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 57/240 (23%), Positives = 107/240 (44%), Gaps = 23/240 (9%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIINI- 164
           N +H F D FV L++T  +N+ F   DV + + D +   +   +++    FT + II++ 
Sbjct: 245 NMYHHFCD-FVNLYLTQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTDYDIIHLK 302

Query: 165 --------NNQTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENT 214
                   N   +  CF+     L+   R  +  N  L+   +    F++F  +  +   
Sbjct: 303 TYDSKGYWNFPEMFVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLN 362

Query: 215 NTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPE-ESTSLADSF 272
            T       K ++ ++ R+    R ILN  E+  A + +  F+V I + +       D  
Sbjct: 363 ITQEGPKDGKIRVTILARSTEY-RKILNQNELVNALKTVPTFEVQIVDYKYRELGFLDQL 421

Query: 273 RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
           R  H+    +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 RITHNTDIFIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIHYITWQ 477


>gi|434392082|ref|YP_007127029.1| capsular polysaccharide biosynthesis protein-like protein
           [Gloeocapsa sp. PCC 7428]
 gi|428263923|gb|AFZ29869.1| capsular polysaccharide biosynthesis protein-like protein
           [Gloeocapsa sp. PCC 7428]
          Length = 422

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 37/135 (27%), Positives = 67/135 (49%), Gaps = 11/135 (8%)

Query: 222 HTKPKL-----VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIH 276
           HT PK+     + ++R     R + N  +V    E+ GF +   E   S S+ +  +   
Sbjct: 272 HTSPKVASEEYIYISRGDARKRRVANEGDVINYLEKFGFKIVTLE---SMSVLEQAQLFS 328

Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
           S  A+V  HGAGLT+++F +PG+ L++       ++  +Y++    V GLEY  +  +Q+
Sbjct: 329 SAKAIVAPHGAGLTNAVFCQPGTKLVEF--FSPNYVHPLYWDLSNHV-GLEYYYFLGEQQ 385

Query: 337 ESSLVEKYGANDLVL 351
                    A+D+V+
Sbjct: 386 HLPSYHNPNADDIVI 400


>gi|148235443|ref|NP_001085557.1| EGF domain-specific O-linked N-acetylglucosamine transferase
           precursor [Xenopus laevis]
 gi|82184549|sp|Q6GQ23.1|EOGT_XENLA RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
           transferase; AltName: Full=Extracellular O-linked
           N-acetylglucosamine transferase; Flags: Precursor
 gi|49117992|gb|AAH72925.1| MGC80419 protein [Xenopus laevis]
          Length = 525

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 55/233 (23%), Positives = 105/233 (45%), Gaps = 17/233 (7%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIINI-- 164
           N +H F D FV L+IT  +N+ F     I+        +   +++    FT + I ++  
Sbjct: 243 NMYHHFCD-FVNLYITQHVNNSFSTDINIVMWTTSVYGYGDLFSDTWKAFTDYEITHLKA 301

Query: 165 -NNQTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH 221
            +N+ +  CF+     L+   R  +  N  L+        F++F  +  +    T     
Sbjct: 302 YDNKRV--CFKDAVFALLPRMRYGLFYNTPLISHCHGSGLFRAFSQHVLHRLNITQHPAT 359

Query: 222 HTKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPEEST-SLADSFRFIHSCH 279
             K ++ ++ R+    R ILNL E+ +A E +  F V + + +       +     H+  
Sbjct: 360 EAKIRVTILVRSTEF-RKILNLDELVQALEAVPTFQVKVVDYKYRVLGFLEQLSITHNSD 418

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
             +G+HGAGLTH LFL   +V+ ++        +  Y +  AR+ G++Y+ ++
Sbjct: 419 IFIGMHGAGLTHLLFLPDWAVVFELYNCED---ARCYLD-LARLRGIQYMTWE 467


>gi|57222245|ref|NP_001009502.1| EGF domain-specific O-linked N-acetylglucosamine transferase
           precursor [Rattus norvegicus]
 gi|81862277|sp|Q5NDL0.1|EOGT_RAT RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
           transferase; AltName: Full=Extracellular O-linked
           N-acetylglucosamine transferase; Flags: Precursor
 gi|56550370|emb|CAI30571.1| glycosyltransferase [Rattus norvegicus]
 gi|149036803|gb|EDL91421.1| glycosyltransferase Aer61, isoform CRA_b [Rattus norvegicus]
          Length = 527

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 53/232 (22%), Positives = 106/232 (45%), Gaps = 15/232 (6%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L++T  IN+ F   DV + + D +   +   +++    FT + +I++ 
Sbjct: 245 NMYHHFCD-FLNLYLTQHINNSFS-TDVYIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302

Query: 166 N-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
              +   CF+     L+   R  +  N  L+   +    F++F  +  +    +      
Sbjct: 303 TYDSKKVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNISQEGPKD 362

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
            K ++ ++ R+    R ILN  E+  A + +  F+V + + +       D  R  H+   
Sbjct: 363 GKLRVTILARSTEY-RKILNQNELVNALKTVSTFEVRVVDYKYRELGFLDQLRITHNTDI 421

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIYYITWQ 469


>gi|224006746|ref|XP_002292333.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220971975|gb|EED90308.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 848

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 81/185 (43%), Gaps = 23/185 (12%)

Query: 229 LVNRNARVGRTILNLREVKKA------AEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
           L  R  R  R+ LNL EV KA      A+E      +   E + S  +      S  A++
Sbjct: 588 LAQRTYR--RSWLNLAEVMKACDTAEYAKEAKVLCVVVNVEHTASPYEQLLLHRSVDALI 645

Query: 283 GVHGAGLTHSLFLRPGSVLMQVVP-----IGTQWLSTVYFEKPARVL----GLEYLEYKI 333
           GVHGA LT ++ L P   +++++P     I   W+ T +   P  ++     L +L Y +
Sbjct: 646 GVHGAQLTQAVLLPPHGHILELLPWVPNYIRGAWVQTRHTPTPLGIIYHNTDLNHLGYSL 705

Query: 334 KQEESSLVEKYGANDL---VLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
            ++   L E    + L    LKN + F    W      ++   V   I+RF +Y+K   +
Sbjct: 706 GRDSVPLCEGVDDDQLEHCFLKNKKNFM---WDVRDFNVRPDIVIQYIERFVLYMKGEQR 762

Query: 391 KAKKL 395
           + K +
Sbjct: 763 QCKDV 767


>gi|346319407|gb|EGX89009.1| DUF563 domain protein [Cordyceps militaris CM01]
          Length = 477

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 46/88 (52%), Gaps = 6/88 (6%)

Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
           +L  ++ A   +   V  F    S  L    +  H    ++GVHGAGLTHS+F++PG+VL
Sbjct: 347 HLERLRAAVPHMKLTVVDFA---SMPLHQQVQVAHQTDVLLGVHGAGLTHSMFMKPGAVL 403

Query: 302 MQVVPIGTQWLSTVYFEKPARVLGLEYL 329
           ++++P      +   F   A++LG  Y 
Sbjct: 404 IEILPAD---FTHKGFRNLAQMLGHRYF 428


>gi|118404476|ref|NP_001072691.1| EGF domain-specific O-linked N-acetylglucosamine transferase
           precursor [Xenopus (Silurana) tropicalis]
 gi|123884480|sp|Q08CY9.1|EOGT_XENTR RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
           transferase; AltName: Full=Extracellular O-linked
           N-acetylglucosamine transferase; Flags: Precursor
 gi|115313758|gb|AAI24029.1| glycosyltransferase AER61 [Xenopus (Silurana) tropicalis]
          Length = 525

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 56/233 (24%), Positives = 103/233 (44%), Gaps = 17/233 (7%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIINI-- 164
           N +H F D FV L+IT  +N+ F     I+        +   +++    FT + I ++  
Sbjct: 243 NMYHHFCD-FVNLYITQHVNNSFSTDINIVMWTTSVYGYGDLFSDTWKAFTDYDITHLKA 301

Query: 165 -NNQTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH 221
            +N+ +  CF+     L+   R  +  N  L+        F++F  +  +    T     
Sbjct: 302 YDNKRV--CFKDAVFALLPRMRYGLFYNTPLISNCHGSGLFRAFSQHVLHRLNITQQLPK 359

Query: 222 HTKPKLVLVNRNARVGRTILNLREVKKAAE-ELGFDVTIFEPEEST-SLADSFRFIHSCH 279
             K ++ ++ R+    R ILNL E+  A E E  F V + + +       +     H+  
Sbjct: 360 EAKIRITILVRSTEF-RKILNLDELVHALEAEPTFQVKVVDYKYRVLGFLEQLEITHNSD 418

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
             +G+HGAGLTH LFL   +V+ ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 419 IFIGMHGAGLTHLLFLPDWAVVFELYNCEDE---RCYLDL-ARLRGIRYMTWE 467


>gi|226482514|emb|CAX73856.1| glycoprotein 2-beta-D-xylosyltransferase [Schistosoma japonicum]
          Length = 453

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 60/128 (46%), Gaps = 14/128 (10%)

Query: 193 LPKPKTLVDFQSFLANAYNENTNTSSSFHHTK----PKLVLVNR----------NARVGR 238
           L K   + +F+ F+  +YN N +  +    T     PK+V+V+R          N  + R
Sbjct: 259 LIKNMFIEEFRQFIFQSYNINNDEDTCQKRTSIRFLPKIVIVSRRDYIAHPRNINGTIHR 318

Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
            I N  E+     +LGF  +        ++ +  + I S   ++G+HGA LT+SL L   
Sbjct: 319 KITNELELLNKLNQLGFQNSKVVCFTDLTMQEQLKLIMSTDILIGMHGAALTYSLLLSNT 378

Query: 299 SVLMQVVP 306
           S ++++ P
Sbjct: 379 SCVIELFP 386


>gi|432858928|ref|XP_004069008.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
           transferase-like [Oryzias latipes]
          Length = 528

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 62/256 (24%), Positives = 114/256 (44%), Gaps = 26/256 (10%)

Query: 90  CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCN-DQWA 146
           C +    P  VF       N +H F D FV L+I+  IN+ F + D+++ + D +   + 
Sbjct: 228 CDIIIEKPT-VFMKLDAGVNMYHHFCD-FVNLYISQHINNSF-NSDILIVMWDTSFYGYG 284

Query: 147 RKYAELLPRFTRHPIININNQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQ 203
             ++E    FT + I+++        CF+     L+   R  +  N  L+    T   F+
Sbjct: 285 DLFSETWRAFTDYDIVHLKTYDSKRVCFKDSFFSLLPRMRYGLFYNTPLVSNCYTEGMFR 344

Query: 204 SFLANAYNENTNTSSSFHHTKPK-----LVLVNRNARVGRTILNLREVKKAAEELGF-DV 257
           +F     +++       H  KPK     L L+ R+    R ILN  ++  A + +   +V
Sbjct: 345 AF-----SQHVLHRLHIHQNKPKERHVRLTLLARSTEY-RRILNQEKLVNALKTVPLLEV 398

Query: 258 TIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVY 316
           T+ + + +  S        H+    +G+HGAGLTH LFL   +V+ ++     +      
Sbjct: 399 TVVDYKYKDVSFLKQLEISHNSDIFIGMHGAGLTHLLFLPDWAVIFELYNCQDE----SC 454

Query: 317 FEKPARVLGLEYLEYK 332
           +   AR+ G+ Y+ ++
Sbjct: 455 YRDLARLRGIRYVTWQ 470


>gi|427703345|ref|YP_007046567.1| hypothetical protein Cyagr_2107 [Cyanobium gracile PCC 6307]
 gi|427346513|gb|AFY29226.1| hypothetical protein Cyagr_2107 [Cyanobium gracile PCC 6307]
          Length = 435

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 57/235 (24%), Positives = 95/235 (40%), Gaps = 38/235 (16%)

Query: 103 AGGYNGNFFHEFMDCFVPLFITINSHFPDQ--DVILAIADCNDQWARKYAELLPRFT--R 158
           AG ++ N+FH  M  ++P+    +   P    D  L      D       E L RF    
Sbjct: 192 AGPWSDNYFHWLMQ-YLPILKIASQWQPLSTIDHFLVRGPVKD----FQREALARFGIPA 246

Query: 159 HPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSS 218
           H +I IN   + +  + + L  I        P ++   + L                T S
Sbjct: 247 HKVIGINKNQV-YIIEDLLLTSIPCDNFTYEPWVIRMLREL----------------TES 289

Query: 219 SFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGF---DVTIFEPEESTSLADSFRFI 275
                +P  +  +R A   R I+N+ E+    E  G    D +    +E  +LA S    
Sbjct: 290 GSRKGRPTSLFFDRRAPAPRRIVNMAEISDLLENYGIQSIDCSTISFQEQINLASSSSL- 348

Query: 276 HSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLE 330
                ++GVHGA L +S+F  PG+VL++++P   +     YF + A    L++L+
Sbjct: 349 -----LIGVHGASLANSVFSAPGTVLIELLPRNYR---PRYFSELASACQLQHLK 395


>gi|332709295|ref|ZP_08429257.1| hypothetical protein LYNGBM3L_39730 [Moorea producens 3L]
 gi|332351841|gb|EGJ31419.1| hypothetical protein LYNGBM3L_39730 [Moorea producens 3L]
          Length = 959

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 47/81 (58%), Gaps = 3/81 (3%)

Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
           P+ + + R     R +LN  EVK+   + GF +TI    ES S+ +      S  A+V  
Sbjct: 810 PERIYITRAQARHRQVLNETEVKELLSQFGF-ITI--ALESLSVREQVALFASVKAIVAP 866

Query: 285 HGAGLTHSLFLRPGSVLMQVV 305
           HGAGL++++F RPG+ ++++V
Sbjct: 867 HGAGLSNTVFCRPGTQIIELV 887


>gi|407860860|gb|EKG07547.1| hypothetical protein TCSYLVIO_001321 [Trypanosoma cruzi]
          Length = 579

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/127 (31%), Positives = 65/127 (51%), Gaps = 11/127 (8%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
           +P+L+L+NRN R    I + + V   AE +GF+V +   E+   L D          M+G
Sbjct: 400 RPRLLLINRNFR---EIHDYQAVAALAERVGFNVQVVYFEK-MPLEDQVHVSRHADVMMG 455

Query: 284 VHGAGLTHSLFL----RPGS-VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEES 338
           +HG GLTH L++    RP    L++++P G       +++  A  +G+ Y EY I+  + 
Sbjct: 456 MHGMGLTHVLWMDGRRRPRCRALLELMPFGCPGTLIHFYKLFADTVGIHY-EY-IEAADM 513

Query: 339 SLVEKYG 345
            L E  G
Sbjct: 514 YLKETGG 520


>gi|449133953|ref|ZP_21769462.1| capsular polysaccharide biosynthesis protein [Rhodopirellula
           europaea 6C]
 gi|448887370|gb|EMB17750.1| capsular polysaccharide biosynthesis protein [Rhodopirellula
           europaea 6C]
          Length = 313

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 56/108 (51%), Gaps = 8/108 (7%)

Query: 196 PKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGF 255
           P+T+   ++++ N     ++    F+    + V ++R     R + N REV+    + GF
Sbjct: 152 PQTMESLRNWVWN-----SSQLDRFNVVPHRRVYISRQFAQMRRVENEREVQGFMRDEGF 206

Query: 256 DVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
           ++      E+ S  D  R      A+VG+HGAGLT+ LF+ PG+ +++
Sbjct: 207 EIV---HTENLSFDDQVRMFFETKALVGIHGAGLTNLLFMHPGTSVLE 251


>gi|326432569|gb|EGD78139.1| hypothetical protein PTSG_12822 [Salpingoeca sp. ATCC 50818]
          Length = 623

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/109 (29%), Positives = 56/109 (51%), Gaps = 9/109 (8%)

Query: 226 KLVLVNRNARVGRTILNLREVKKA------AEELGFDVTIFEPEESTSLADSFRFIHSCH 279
           ++V+VNR A  GR+IL +  +  A      A  +  +V + E  E+ SL+      ++  
Sbjct: 426 RVVIVNRPASSGRSILGVDAIAGAVRATLAALNISHEVIVEEHLETKSLSAQIALFNAAT 485

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           A++  HGA  T+ LFLRP ++ ++V P          +   AR++G +Y
Sbjct: 486 AVISAHGAANTNWLFLRPNALAIEVFPFAYH---PRQYSTIARLIGAKY 531


>gi|71664664|ref|XP_819310.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70884606|gb|EAN97459.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 578

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 59/113 (52%), Gaps = 10/113 (8%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
           +P+L+L+NRN R    I + + V   AE +GF+V +   E+   L D          M+G
Sbjct: 399 RPRLLLINRNFR---EIHDYQAVAALAERVGFNVQVVYFEK-MPLEDQVHVSRHADVMMG 454

Query: 284 VHGAGLTHSLFL----RPGS-VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY 331
           +HG GLTH L++    RP    L++++P G       +++  A  +G+ Y EY
Sbjct: 455 MHGMGLTHVLWMDGRRRPRCRALLELMPFGCPRTLIHFYKLFADTVGIHY-EY 506


>gi|71419571|ref|XP_811209.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70875847|gb|EAN89358.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 579

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 59/113 (52%), Gaps = 10/113 (8%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
           +P+L+L+NRN R    I + + V   AE +GF+V +   E+   L D          M+G
Sbjct: 400 RPRLLLINRNFR---EIHDYQAVAALAERVGFNVQVVYFEK-MPLEDQVHVSRHADVMMG 455

Query: 284 VHGAGLTHSLFL----RPGS-VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY 331
           +HG GLTH L++    RP    L++++P G       +++  A  +G+ Y EY
Sbjct: 456 MHGMGLTHVLWMDGRRRPRCRALLELMPFGCPRTLIHFYKLFADTVGIHY-EY 507


>gi|322709092|gb|EFZ00668.1| DUF563 domain protein [Metarhizium anisopliae ARSEF 23]
          Length = 362

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 10/95 (10%)

Query: 238 RTILNLREV----KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSL 293
           R ++N  E+    K+A   L  ++  F      S A+  + +     ++GVHGAGLTH++
Sbjct: 223 RKLINETELIASAKRAVPHLNIEIVDFA---EFSFAEQLKIVRETDLLIGVHGAGLTHTM 279

Query: 294 FLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           FL PGS +++++P      + + F   A++LG  Y
Sbjct: 280 FLPPGSAVVEILPGD---FAHMGFRNLAQLLGHRY 311


>gi|301778056|ref|XP_002924441.1| PREDICTED: uncharacterized glycosyltransferase AER61-like
           [Ailuropoda melanoleuca]
 gi|281342540|gb|EFB18124.1| hypothetical protein PANDA_013786 [Ailuropoda melanoleuca]
          Length = 527

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/273 (21%), Positives = 118/273 (43%), Gaps = 17/273 (6%)

Query: 69  KSAMSKVKELTITTSAP-PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--I 125
           +S  ++++  T  +  P  +  C +    PA          N +H F D F+ L+IT  +
Sbjct: 205 QSWFAELQSYTQLSFRPLEDAKCDIVIEKPAYFMKLDA-GVNMYHHFCD-FINLYITQHV 262

Query: 126 NSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININNQTITH-CFQSVTLGLISH 183
           N+ F   DV + + D +   +   + +    FT + +I++        CF+     L+  
Sbjct: 263 NNSF-GTDVYIVMWDTSSYGYGDLFLDTWKAFTDYDVIHLKTYDSKRVCFKEAVFSLLPR 321

Query: 184 GR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTIL 241
            R  +  N  L+   +    F++F  +  +    T       K ++ ++ R+    R IL
Sbjct: 322 MRHGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKDGKIRVTILARSTEY-RKIL 380

Query: 242 NLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGS 299
           N  E+  A + +   +V I + + +     D  +  H+    +G+HGAGLTH LFL   +
Sbjct: 381 NQNELVNALKTVSTLEVQIVDYKYKELGFLDQLQITHNTDIFIGMHGAGLTHLLFLPDWA 440

Query: 300 VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 441 AVFELYNCEDE---RCYLDL-ARLRGVHYVTWR 469


>gi|291239169|ref|XP_002739505.1| PREDICTED: Uncharacterized glycosyltransferase AER61-like
           [Saccoglossus kowalevskii]
          Length = 588

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 51/203 (25%), Positives = 90/203 (44%), Gaps = 10/203 (4%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCND-QWARKYAELLPRFTRHPIININ 165
           N +H F D FV ++I+  IN+ F   DV + + D +   +   ++     F+ +PII I 
Sbjct: 303 NMYHHFCD-FVNIYISQHINNSFS-SDVNIVMWDTSGLSYGDFFSATWQAFSDYPIIPIK 360

Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
                  C +     L+   +     N  L+P        ++F  +  +           
Sbjct: 361 RWDGKKVCMKEAVFSLLPRMQRGFYYNMPLVPSCHGSGIIKAFSQHLMHRLKIPQEGPLK 420

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAM 281
            K ++ L+ RN +  R I+N  E+ KA  +E    V + E   +       ++ H+    
Sbjct: 421 NKVRVTLLARNTK-HRNIINQNELVKAMKKEKDLTVKVVEYNRNMPFLKQLKYTHNSDIF 479

Query: 282 VGVHGAGLTHSLFLRPGSVLMQV 304
           +G+HGAGLTHSLFL   +V+ ++
Sbjct: 480 IGMHGAGLTHSLFLPDWAVVFEL 502


>gi|358385599|gb|EHK23196.1| glycosyltransferase family 61 protein [Trichoderma virens Gv29-8]
          Length = 454

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 39/67 (58%), Gaps = 3/67 (4%)

Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE 327
            ++  + +     +VGVHGAGLTH +FL+PGS +++++P G Q      F   A++LG+ 
Sbjct: 346 FSEQIQIVRETDLLVGVHGAGLTHLMFLQPGSAVLEILPEGLQHKG---FRNLAQMLGIG 402

Query: 328 YLEYKIK 334
           +     K
Sbjct: 403 FFRAHAK 409


>gi|56550356|emb|CAI30564.1| glycosyltransferase [Ciona intestinalis]
          Length = 503

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 55/111 (49%), Gaps = 12/111 (10%)

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPEEST-SLADSFRFIHSCHA 280
           TKP  +      +V R I+N +++ K    + GF V + E  + T S  D     H+   
Sbjct: 348 TKPDYL-----GKVYRQIVNEKDLIKVLNNIPGFQVKVVEYHQDTMSFKDQISMSHNSDI 402

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY 331
           M+G+HGAGLTH LFL P +V  ++     +      +   AR+ G+ ++ +
Sbjct: 403 MIGMHGAGLTHFLFLPPWAVAFELYNCQAK-----CYRDLARLRGVRHMTW 448


>gi|160358325|ref|NP_001027840.1| glycosyltransferase aer61b precursor [Ciona intestinalis]
          Length = 498

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 55/111 (49%), Gaps = 12/111 (10%)

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPEEST-SLADSFRFIHSCHA 280
           TKP  +      +V R I+N +++ K    + GF V + E  + T S  D     H+   
Sbjct: 343 TKPDYL-----GKVYRQIVNEKDLIKVLNNIPGFQVKVVEYHQDTMSFKDQISMSHNSDI 397

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY 331
           M+G+HGAGLTH LFL P +V  ++     +      +   AR+ G+ ++ +
Sbjct: 398 MIGMHGAGLTHFLFLPPWAVAFELYNCQAK-----CYRDLARLRGVRHMTW 443


>gi|384488169|gb|EIE80349.1| hypothetical protein RO3G_05054 [Rhizopus delemar RA 99-880]
          Length = 198

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 46/187 (24%), Positives = 81/187 (43%), Gaps = 19/187 (10%)

Query: 215 NTSSSFH---HTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTI--FEPEESTSLA 269
           N+   +H    T  K  ++NR  +  R I N+ +V +A  +   DV+I     +E  ++ 
Sbjct: 11  NSVREYHSGSKTNIKFAILNR--KQSRHITNIPDVIEAMLKEFKDVSIKLINYDEGCNIR 68

Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYL 329
            + + +      +  HG GL   LF++ GS    V+ I ++W S  +F  P + + +  L
Sbjct: 69  STAQLVEDIDVFISPHGNGLGSGLFMKKGST---VISIDSRWYSEDWFYWPMKAVNVRIL 125

Query: 330 EYKIK----QEESSLVEKYGANDLVLKNP-----QAFAGANWSNMRVYLKTQNVKLDIDR 380
            Y       QE    +    A DL  +       Q     +   +  Y K    ++DI+R
Sbjct: 126 YYDCNNPSCQEFDEQLLMKLAPDLTEEQKKEIMTQEVTSVDRLLVEQYRKDSKRRIDIER 185

Query: 381 FRIYLKD 387
           F ++LKD
Sbjct: 186 FIVFLKD 192


>gi|347759425|ref|YP_004866986.1| hypothetical protein GLX_02040 [Gluconacetobacter xylinus NBRC
           3288]
 gi|347578395|dbj|BAK82616.1| hypothetical protein GLX_02040 [Gluconacetobacter xylinus NBRC
           3288]
          Length = 579

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/127 (26%), Positives = 61/127 (48%), Gaps = 20/127 (15%)

Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           +LV ++R     R ILN   +      LGF++ +      TSLAD  R       +V  H
Sbjct: 454 RLVYISRQDTKARRILNEDALIDRLRSLGFEILV---ATGTSLADQIRIFREARLVVAGH 510

Query: 286 GAGLTHSLFLRPGSVLMQVV--------PI------GTQWLSTVYFEKP---ARVLGLEY 328
           GAG+++ LF R G+ L++++        P+      G ++ S ++FE        + ++ 
Sbjct: 511 GAGVSNMLFAREGTALLELIQASYLNVGPMRLAQIAGCRYYSMLFFEDGPHNGWYVDIDR 570

Query: 329 LEYKIKQ 335
           +E+ I+Q
Sbjct: 571 VEWAIRQ 577


>gi|440795174|gb|ELR16310.1| glycosyltransferase AER61, putative [Acanthamoeba castellanii str.
           Neff]
          Length = 524

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 61/266 (22%), Positives = 117/266 (43%), Gaps = 45/266 (16%)

Query: 84  APPNLSCGVTHTSPALVFSAGGYNG-NFFH---EFMDCFVPLFITINSHFPDQDVILAIA 139
           A  +  CGV + +P +V     Y   N +H   E+++ F  L +       D++  + + 
Sbjct: 234 ADQSTECGVVYENPVVVIMR--YEAWNMYHQLGEWINAFTTLEVVDKL---DKNTQVLLL 288

Query: 140 DCNDQWARKYAELLPRFT-RHPIININNQTITH---CFQSVTLGLISHGRMVIN------ 189
           D +++    + ++L  F+  HP++ +  + +     CF+   +    +G  + N      
Sbjct: 289 DMHEK-TEPFTDMLKVFSPDHPLV-LGKELVGKGKVCFKDAIMPWEGYGTFIHNNVWRAS 346

Query: 190 ---PTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRT---- 239
              P L      L  F  F+ N             H    +P++ L+ R   +GR     
Sbjct: 347 HGEPCL--DSDILEAFSHFVLNKL-------GMLKHNIPNEPRITLILRKDYMGRKLDRK 397

Query: 240 ILNLREVKKAAEELGFDVTIFEPE--ESTSLADSFRFIHS-CHAMVGVHGAGLTHSLFLR 296
           I N  +V KA EE+      F     E+ +  +    ++S  + ++GVHGAGL+H++FL 
Sbjct: 398 ISNEDQVVKALEEVSRGRASFSSVQLETMTFKEQVELMYSKTNILIGVHGAGLSHTVFLP 457

Query: 297 PGSVLMQVVPIGTQWLSTVYFEKPAR 322
           P ++L++++P   +  S  YF   A+
Sbjct: 458 PEAILIELLPDSVK--SFTYFRNLAK 481


>gi|348510387|ref|XP_003442727.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized glycosyltransferase
           AER61-like [Oreochromis niloticus]
          Length = 528

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 56/251 (22%), Positives = 110/251 (43%), Gaps = 16/251 (6%)

Query: 90  CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCN-DQWA 146
           C +    P  VF       N +H F D F+ L+I+  IN+ F   D+ + + D +   + 
Sbjct: 228 CDIIVEKPT-VFMKLDAGVNMYHHFCD-FINLYISQHINNSF-SSDINIVMWDTSFYGYG 284

Query: 147 RKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQ 203
             ++E    F+ + II++    +   CF+     L+   R  +  N  L+    +   F+
Sbjct: 285 DLFSETWRAFSEYDIIHLKTFDSKRVCFKDAFFSLLPRMRYGLFYNTPLISDCYSEGMFR 344

Query: 204 SFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTI--LNLREVKKAAEELGFDVTIFE 261
           +F  +  +    T       + ++ L+ R+    R +  + L    K A  L  +V  ++
Sbjct: 345 AFSQHILHRLNITRDKPQEGRVRVTLLARSTEYRRILNHMELVNALKTAPLLEVNVVDYK 404

Query: 262 PEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPA 321
            ++   L +  R  H+    +G+HGAGLTH LFL   +V+ ++     +      +   A
Sbjct: 405 YKDVPFL-EQLRITHNSDIFIGMHGAGLTHLLFLPDWAVIFELYNCQDE----SCYRDLA 459

Query: 322 RVLGLEYLEYK 332
           R+ G+ Y+ ++
Sbjct: 460 RLRGIRYVTWQ 470


>gi|240141135|ref|YP_002965615.1| hypothetical protein MexAM1_META1p4710 [Methylobacterium extorquens
           AM1]
 gi|240011112|gb|ACS42338.1| hypothetical protein MexAM1_META1p4710 [Methylobacterium extorquens
           AM1]
          Length = 414

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 50/93 (53%), Gaps = 5/93 (5%)

Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           + + V+R+   GR + N   V      LGF+   F+PE ++      RF  +   +VG+ 
Sbjct: 282 RRLFVDRHPGRGRGLANRDAVLGLVRGLGFEA--FDPELTSVRQQVVRF-SAAEIVVGIA 338

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
           GAG+T+++F RPG+ ++ +VP G  W    Y E
Sbjct: 339 GAGMTNTVFCRPGTPVIHLVPEG--WEDRFYGE 369


>gi|91083375|ref|XP_966996.1| PREDICTED: similar to glycosyltransferase [Tribolium castaneum]
 gi|270006906|gb|EFA03354.1| hypothetical protein TcasGA2_TC013339 [Tribolium castaneum]
          Length = 518

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 64/266 (24%), Positives = 116/266 (43%), Gaps = 25/266 (9%)

Query: 89  SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF--ITINSHFPDQ---DVILAIADCND 143
            C V    P  +        N +H F D F+ L+  I +NS   D    DV + I +   
Sbjct: 210 DCDVVIEKPTFIMKIDA-TVNMYHHFCD-FLNLYASIHLNSTQWDAFSTDVHVLIWETY- 266

Query: 144 QWARKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGRMVI----NPTLLPKPKT 198
            +   + +    FT HP+ ++   +  T CF++V   L+   RM+     N  ++   + 
Sbjct: 267 TYRSAFGDTWEAFTDHPVWDLKTFRGETVCFKNVVFPLLP--RMIFGLYYNTPIIYGCEN 324

Query: 199 LVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDV 257
              FQ+F  +  +         ++ K ++ L+ R+ +  R ILN  E+ +A A+   ++V
Sbjct: 325 SGLFQAFSQHILHRLKIPFHPRNNRKIRITLLARDTKY-RRILNEDELVEALAKNHDYEV 383

Query: 258 TIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYF 317
                 +            +   ++GVHGAGLTH LFL   + + ++        +  YF
Sbjct: 384 QKVVYNKDVPFKKQLEITRNSDVLIGVHGAGLTHLLFLPDWAAVFELYNCED---ANCYF 440

Query: 318 EKPARVLGLEYLEY----KIKQEESS 339
           +  AR+ GL+Y+ +    K+K +E  
Sbjct: 441 DL-ARLRGLKYVTWEKLDKLKSQEDG 465


>gi|418064455|ref|ZP_12701932.1| hypothetical protein MetexDRAFT_6668, partial [Methylobacterium
           extorquens DSM 13060]
 gi|373546436|gb|EHP73201.1| hypothetical protein MetexDRAFT_6668, partial [Methylobacterium
           extorquens DSM 13060]
          Length = 328

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 50/93 (53%), Gaps = 5/93 (5%)

Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           + + V+R+   GR + N   V      LGF+   F+PE ++      RF  +   +VG+ 
Sbjct: 196 RRLFVDRHPGRGRGLANRDAVLGLVRGLGFEA--FDPELTSVRQQVVRF-SAAEIVVGIA 252

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
           GAG+T+++F RPG+ ++ +VP G  W    Y E
Sbjct: 253 GAGMTNTVFCRPGTPVIHLVPEG--WEDRFYGE 283


>gi|70994078|ref|XP_751886.1| DUF563 domain protein [Aspergillus fumigatus Af293]
 gi|66849520|gb|EAL89848.1| DUF563 domain protein [Aspergillus fumigatus Af293]
          Length = 488

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 53/216 (24%), Positives = 95/216 (43%), Gaps = 23/216 (10%)

Query: 149 YAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLAN 208
           Y +L   F + P++ IN+   T+ F+++ + L        NP      +      S L  
Sbjct: 260 YFDLWKIFAQRPLLRINDLHPTNEFENLIVPLAGGS----NPLWQGDWEIHSCEDSALVR 315

Query: 209 AYNENTNTSSSFH------HTKPKLVLVNRNARVGRTILNLRE----VKKAAEELGFDVT 258
            ++      S FH         P++V+   +    R ++N ++    VKK    +   + 
Sbjct: 316 TFSRRI--LSHFHVEFRRPRQGPQIVVTFIDRTGSRKLINQKDYFNTVKKQFPHITVQMI 373

Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
            F    S    +  R       +VGVHGAGLTH +FL  GSV+++++P G   L+   F 
Sbjct: 374 DFA---SIPFQEQLRIAQGSDILVGVHGAGLTHGIFLPSGSVMVEILPPG---LNHKGFR 427

Query: 319 KPARVLG-LEYLEYKIKQEESSLVEKYGANDLVLKN 353
             A +LG L +  +  K  ++   + +  ND+ L++
Sbjct: 428 NLASLLGHLYFSAHATKPAKTVKRDDWHNNDVYLEH 463


>gi|433590319|ref|YP_007279815.1| hypothetical protein Natpe_0999 [Natrinema pellirubrum DSM 15624]
 gi|433305099|gb|AGB30911.1| hypothetical protein Natpe_0999 [Natrinema pellirubrum DSM 15624]
          Length = 406

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 93/206 (45%), Gaps = 17/206 (8%)

Query: 104 GGYNGNFFHEFMDCFVPLFITINSHFPDQD-VILAIADCNDQWARKYAELLPRFTRHPII 162
           G Y+ N++H F + ++P    +  H+P  D V L I     +W R   ELL  F  + I+
Sbjct: 168 GPYDKNYYHWFGN-YLPRLEGLQ-HYPGADEVTLMIPADPPEWLRDSLELL-GFGDYNIV 224

Query: 163 NINNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLV--DFQSFLANAYNENTNTSSSF 220
               + + H  + +    +S  R   +     +P  +   +F+ ++ +   ENT  ++  
Sbjct: 225 EWPGKPV-HADRYI----LSRYRSEASSETGNRPPIMSKKNFK-WILDRIGENTPNNTR- 277

Query: 221 HHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHA 280
              +P++ +   +A   RT LN +EV     E  F+  I       SLA+  R   +   
Sbjct: 278 EKDRPRIYISREDATARRT-LNEQEVMATLSEYDFEKYILS---EYSLAEQIRLFSNAEI 333

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVP 306
           ++G HGAGL + LF    +V+  + P
Sbjct: 334 IIGAHGAGLINMLFASDATVIELLGP 359


>gi|118344554|ref|NP_001072048.1| glycosyltransferase precursor [Takifugu rubripes]
 gi|56550374|emb|CAI30573.1| glycosyltransferase [Takifugu rubripes]
          Length = 525

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 63/275 (22%), Positives = 123/275 (44%), Gaps = 21/275 (7%)

Query: 69  KSAMSKVKELTITTSAP-PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--I 125
           +S  ++++  T   S P  N  C +T   P  VF       N +H F D FV L+I+  I
Sbjct: 203 QSWYAELQTYTELDSDPIVNGQCDLTVDKPT-VFMKLDAGVNMYHHFCD-FVNLYISQHI 260

Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIINI---NNQTITHCFQSVTLGLIS 182
           N+ F     I+     + ++   ++E    F+++ II++   +N+ +  CF+     L+ 
Sbjct: 261 NNSFSSDINIVMWDTSSYEYGDLFSETWRAFSQNDIIHLKVYDNKRV--CFRDALFSLLP 318

Query: 183 HGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTI 240
             R  +  N  L+    +   F++F  +  +            + ++ L+ R+    R I
Sbjct: 319 RMRYGLFYNTPLISDCYSEGMFRAFSQHILHRLHVPQDGPKDGRVRVTLLARSTEY-RKI 377

Query: 241 LNLREVKKAAE---ELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
           LN  E+  A +    L  +V  ++ ++   L    +  H+    +G+HGAGLTH LFL  
Sbjct: 378 LNQVELVNALKTVPNLEVNVVDYKYKDVPFLV-QLKTTHNSDIFIGMHGAGLTHLLFLPD 436

Query: 298 GSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +V+ ++     +      +   AR+ G+ Y+ ++
Sbjct: 437 WAVIFELYNCQDE----SCYRDLARLRGVRYVTWQ 467


>gi|254563646|ref|YP_003070741.1| hypothetical protein METDI5316 [Methylobacterium extorquens DM4]
 gi|254270924|emb|CAX26929.1| hypothetical protein METDI5316 [Methylobacterium extorquens DM4]
          Length = 414

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 51/95 (53%), Gaps = 5/95 (5%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
           + + + V+R+   GR + N   V      LGF+   F+PE ++      RF  +   +VG
Sbjct: 280 RGRRLFVDRHPGRGRGLANRDAVLGLLRGLGFEA--FDPELTSVRQQVVRF-SAAEIVVG 336

Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
           + GAG+T+++F RPG+ ++ +VP G  W    Y E
Sbjct: 337 IAGAGMTNTVFCRPGTPVIHLVPEG--WEDRFYGE 369


>gi|443724229|gb|ELU12341.1| hypothetical protein CAPTEDRAFT_168806 [Capitella teleta]
          Length = 519

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 54/244 (22%), Positives = 110/244 (45%), Gaps = 12/244 (4%)

Query: 69  KSAMSKVKELTITTSAP-PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--I 125
           +S  +++++ T  +S P    +C V    P  +F       N +H F D ++ L+++  +
Sbjct: 192 QSWYAELEKYTQLSSDPFEEKTCEVIIDKPT-IFIKLDAGVNMYHHFCD-YINLYLSQHM 249

Query: 126 NSHFPDQDVILAIADCND-QWARKYAELLPRFTRHPIININNQTITH-CFQSVTLGLISH 183
           N+ F   DV + + D +   +   +      F+ H I+ +        CF+ V    ++ 
Sbjct: 250 NNSFS-TDVYIVMWDTSPMHYGDFFHVTWKAFSDHEIVPLKEYDGKKVCFKDVVFSFLAR 308

Query: 184 GR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTIL 241
            R  +  N  L+P  +    F++F  +  +    T       K ++ L++R+ +  R IL
Sbjct: 309 MRYGLYYNMPLIPGCQGSSFFRAFNQHILHRLNITQDGPLLDKVRITLLSRSTKF-RKIL 367

Query: 242 NLREVKKAAEELG-FDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSV 300
           N  E+  A + +  + V + +    T   +  +  ++    +G+HGAGLTH LF    +V
Sbjct: 368 NEDELVTALKSVDDYKVQVVDFNYKTPFLEQLQVTYNSDFFIGMHGAGLTHVLFQPDWAV 427

Query: 301 LMQV 304
           L ++
Sbjct: 428 LFEI 431


>gi|56550372|emb|CAI30572.1| glycosyltransferase [Tetraodon nigroviridis]
          Length = 525

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 121/274 (44%), Gaps = 19/274 (6%)

Query: 69  KSAMSKVKELTITTSAP-PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--I 125
           +S  ++++  T   S P  N  C +T   P  VF       N +H F D FV L+I+  I
Sbjct: 203 QSWYAELQTYTELDSDPIVNGQCDLTVDKPT-VFMKLDAGVNMYHHFCD-FVNLYISQHI 260

Query: 126 NSHFPDQDVILAIADCN-DQWARKYAELLPRFTRHPIININNQTITH-CFQSVTLGLISH 183
           N+ F   D+ + + D +  ++   ++E    F+ + II++        CF+     L+  
Sbjct: 261 NNSF-SSDINIVMWDTSFYEYGDLFSETWRAFSENDIIHLKTYDSKRVCFRDAFFSLLPR 319

Query: 184 GR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTIL 241
            R  +  N  L+    +   F++F  +  +            + ++ L+ R+    R IL
Sbjct: 320 MRYGLFYNTPLISDCYSEGMFRAFSQHVLHRLNIPQDGPKDGRVRVTLLARSTEY-RKIL 378

Query: 242 NLREVKKAAE---ELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
           N  E+  A +    L  +V  F+ ++   L    +  H+    +G+HGAGLTH LFL   
Sbjct: 379 NQVELVNALKTVPHLKVNVVDFKYKDVPFLV-QLKITHNSDIFIGMHGAGLTHLLFLPDW 437

Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
           +V+ ++     +      +   AR+ G+ Y+ ++
Sbjct: 438 AVIFELYNCQDE----SCYRDLARLRGVRYVTWQ 467


>gi|312377513|gb|EFR24326.1| hypothetical protein AND_11153 [Anopheles darlingi]
          Length = 533

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 58/254 (22%), Positives = 109/254 (42%), Gaps = 21/254 (8%)

Query: 90  CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF--PD---QDVILAIADCNDQ 144
           C V    PA +        N +H F D F+ L+ +++++   PD    D  + I + + +
Sbjct: 223 CDVVIEKPAFIMKIDA-AINMYHHFCD-FLNLYGSLHANLSHPDGFTTDTQVLIWE-SFR 279

Query: 145 WARKYAELLPRFTRHPIININNQT-ITHCFQSVTLGLISHGRMVI----NPTLLPKPKTL 199
           +   +A+    FTRHPI ++        CF++V L L+   RM+     N  ++   +  
Sbjct: 280 YISPFADTFKVFTRHPIADLKTYAGKVVCFRNVVLPLLP--RMIFGLYYNTPIIHGCENS 337

Query: 200 VDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE-VKKAAEELGFDVT 258
             F +F  +  +            + ++  ++R  R  R +LN  E V   ++   + V 
Sbjct: 338 GLFHAFSEHVLHRMRIRLVKRPDRRLRITFLSRQTRY-RRVLNENELVASISDNPDYSVQ 396

Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
           +    +  S A+  +   +    +G+HGAGLTH LFL     L ++            + 
Sbjct: 397 LVTYGQEMSFAEQLKITRNTDIFIGMHGAGLTHLLFLPKWGTLFELYHCE----DPNCYR 452

Query: 319 KPARVLGLEYLEYK 332
             AR+ G+ YL ++
Sbjct: 453 DLARLRGVHYLSWE 466


>gi|188584016|ref|YP_001927461.1| capsular polysaccharide biosynthesis protein-like protein
           [Methylobacterium populi BJ001]
 gi|179347514|gb|ACB82926.1| capsular polysaccharide biosynthesis protein-like protein
           [Methylobacterium populi BJ001]
          Length = 426

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 5/95 (5%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
           + + + V+R+   GR + N   V     +LGF+   F+PE ++      RF      +VG
Sbjct: 292 RGRRLFVDRHPGRGRGLANREAVLARLRDLGFEA--FDPELTSVRQQVVRFA-GAEIVVG 348

Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
           + GAG+ +++F RPG+ ++ +VP G  W    Y E
Sbjct: 349 IAGAGMANTVFCRPGTPVIHLVPEG--WEDLFYGE 381


>gi|334335498|ref|XP_003341780.1| PREDICTED: uncharacterized glycosyltransferase AER61-like
           [Monodelphis domestica]
          Length = 319

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 46/193 (23%), Positives = 89/193 (46%), Gaps = 10/193 (5%)

Query: 149 YAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSF 205
           ++E    FT + +I++    + T CF+     L+   R  +  N  L+   +    F++F
Sbjct: 78  FSETWKAFTDYDVIHLKAYDSKTVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAF 137

Query: 206 LANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPE- 263
             +  +    T       K ++ ++ R+    R ILN  E+  A + +  F+V I + + 
Sbjct: 138 SQHVLHRLNITWDGPKDGKIRVTILARSTEY-RKILNQNELVNALKTVSAFEVQIVDYKY 196

Query: 264 ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARV 323
           +     +  R  H+    +G+HGAGLTH LFL   +V+ ++     +     Y +  AR+
Sbjct: 197 KEMGFLEQLRITHNTDIFIGMHGAGLTHLLFLPDWAVIFELYNCEDE---RCYLDL-ARL 252

Query: 324 LGLEYLEYKIKQE 336
            G+ Y+ +K K +
Sbjct: 253 RGIHYVTWKKKSK 265


>gi|397614564|gb|EJK62878.1| hypothetical protein THAOC_16495 [Thalassiosira oceanica]
          Length = 719

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 57/122 (46%), Gaps = 17/122 (13%)

Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVP-----IGTQWLSTVYFEKPARVL----GLE 327
           S  A+VGVHGA LT S+ L PG  +++++P     I  +W+ T +   P  ++     L 
Sbjct: 491 SLDALVGVHGAQLTQSVLLPPGGHVLELLPWVPQYIRGRWVQTTHAPTPLGIIFHNSDLN 550

Query: 328 YLEYKIKQEESSLVEKYGAND-----LVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFR 382
           +L Y + ++   L EK+G          L N + F    W N    +    V   ++RF 
Sbjct: 551 HLGYSLGRDSIPLCEKFGEGSAEEEACFLGNRKRFI---WENRDFAVDPGAVVYYVERFV 607

Query: 383 IY 384
           +Y
Sbjct: 608 LY 609


>gi|428315211|ref|YP_007113093.1| Tetratricopeptide TPR_2 repeat-containing protein [Oscillatoria
           nigro-viridis PCC 7112]
 gi|428238891|gb|AFZ04677.1| Tetratricopeptide TPR_2 repeat-containing protein [Oscillatoria
           nigro-viridis PCC 7112]
          Length = 949

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 47/81 (58%), Gaps = 3/81 (3%)

Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
           PK + ++R+    R +LN  +  +  E+ GF V+I    ES SLA+     +    +V  
Sbjct: 809 PKRIYISRSKARYRRVLNEEDAIEVLEKFGF-VSILP--ESMSLAEQIAHFYHAEVIVAA 865

Query: 285 HGAGLTHSLFLRPGSVLMQVV 305
           HG+GLT+++F R G+ ++++V
Sbjct: 866 HGSGLTNTIFCRQGTKVIELV 886


>gi|427407648|ref|ZP_18897850.1| hypothetical protein HMPREF9718_00324 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425714152|gb|EKU77163.1| hypothetical protein HMPREF9718_00324 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 828

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 57/109 (52%), Gaps = 9/109 (8%)

Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
           P+ V ++R A   R +LN   ++  A   GF++  F    +  L       H+   ++  
Sbjct: 693 PRRVYISRRAVPRRPMLNESHIEDHARSAGFEILDFA---TLPLWHQIAISHNAETIMSP 749

Query: 285 HGAGLTHSLFLRPGSVLMQVVPI--GTQWLSTVYFEKPARVLGLEYLEY 331
           HGAGL+H +F +PG+ +++++PI  GT  L   Y    AR+  L+ L+Y
Sbjct: 750 HGAGLSHLIFAKPGTQVIELLPIQDGTYQLRFNY----ARLSILKGLDY 794


>gi|334117762|ref|ZP_08491853.1| Tetratricopeptide TPR_1 repeat-containing protein [Microcoleus
           vaginatus FGP-2]
 gi|333460871|gb|EGK89479.1| Tetratricopeptide TPR_1 repeat-containing protein [Microcoleus
           vaginatus FGP-2]
          Length = 971

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 55/101 (54%), Gaps = 7/101 (6%)

Query: 205 FLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEE 264
           FL   + +    SSS+    PK + ++R+    R +LN  +V +  E+ GF V+I    E
Sbjct: 815 FLRREFLKGILPSSSY----PKRIYISRSKARYRRVLNEEDVVEVLEKFGF-VSILP--E 867

Query: 265 STSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           S SLA+     +    +V  HG+GLT+++F   G+ ++++V
Sbjct: 868 SLSLAEQIAHFYHAEVIVAAHGSGLTNTIFCSQGTKVIELV 908


>gi|154421576|ref|XP_001583801.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121918045|gb|EAY22815.1| hypothetical protein TVAG_075620 [Trichomonas vaginalis G3]
          Length = 454

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 29/128 (22%), Positives = 67/128 (52%), Gaps = 13/128 (10%)

Query: 264 ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTV--YFEKPA 321
           E+  +    + +      V +HG+GL+H L+++PG+ ++++      WL T   +++K A
Sbjct: 329 ENVPIEKQMKIMSKASLFVSIHGSGLSHILWMKPGTCVIEL----KTWLHTCNDWYQKAA 384

Query: 322 RVLGLEYLEYKIKQ--EESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDID 379
           R  G+ Y+ Y   +  ++ S +  Y  + +   + + F G+   + +  L+ QN+ ++ +
Sbjct: 385 RATGIHYMAYYPHETLDKPSYISPYLQHCI---DNRIFCGS--KHCKDALRDQNITVNAE 439

Query: 380 RFRIYLKD 387
           RF   +K+
Sbjct: 440 RFENGIKE 447


>gi|428212874|ref|YP_007086018.1| capsular polysaccharide biosynthesis protein [Oscillatoria
           acuminata PCC 6304]
 gi|428001255|gb|AFY82098.1| capsular polysaccharide biosynthesis protein [Oscillatoria
           acuminata PCC 6304]
          Length = 791

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 33/118 (27%), Positives = 57/118 (48%), Gaps = 13/118 (11%)

Query: 189 NPTLLPKPKTLVDF--QSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
            P +   PK + DF  + F + A ++ + T         KL +  R A+V R  +N  E+
Sbjct: 615 GPFIEGPPKWVCDFLRERFASLADSQGSETPE-------KLYISRRKAKV-RRFINEDEI 666

Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
               E  GF   I E   S S+ +    +     ++  HGAGLT+++F +PG+ L+++
Sbjct: 667 STLLEFYGFKTVILE---SLSVQEQITLLAGAKTIIAPHGAGLTNTIFCQPGTQLLEI 721


>gi|428184519|gb|EKX53374.1| hypothetical protein GUITHDRAFT_101078 [Guillardia theta CCMP2712]
          Length = 1432

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 61/245 (24%), Positives = 97/245 (39%), Gaps = 29/245 (11%)

Query: 108  GNFFHEFMDCFVPLFITI------------NSHFPDQDVILAIADCNDQWARKYAELLPR 155
            G+  HE    F  LF TI             S F D  +I  + D  D+  +K++ LL  
Sbjct: 1129 GHLLHE---TFFALFHTILTFSGSDQQNLEESVFDDVILISDVIDA-DKSLKKFSFLLST 1184

Query: 156  FTRHPIININNQTITH---CFQSVTLGLISHGRMVINPTLLPKPKT------LVDFQSFL 206
             + HP    +         CF+ + +GL     M      + + KT        D Q   
Sbjct: 1185 LSNHPWFTFSTLRTAGRRVCFRQLVVGLSDGMNMFATEETVKEDKTSSPIPNYRDVQMLR 1244

Query: 207  ANAYNENTNTSSSFHHTKPKLV-LVNRNARVG--RTILNLREVKKAAEELGFDVTIFEPE 263
            +                  ++V  V+R+  +   R I NL ++ ++ E L   V I E E
Sbjct: 1245 SRVLKFVETAGRRRKSGGERVVAFVHRSEELTSRRGIFNLDQLVRSVESLNSRVMIIEFE 1304

Query: 264  ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARV 323
            +  SL +    +    A+VGV G GL ++L++R G+  +Q+ P G  +     FE   R 
Sbjct: 1305 K-MSLVEQVEVVQDVDALVGVTGTGLWNALWMRRGAAGIQIFPFGVGYKGGREFENAIRY 1363

Query: 324  LGLEY 328
               EY
Sbjct: 1364 GPGEY 1368


>gi|148224592|ref|NP_001086091.1| glycosyltransferase-like domain-containing protein 2 precursor
           [Xenopus laevis]
 gi|82076799|sp|Q5NDE7.1|GTDC2_XENLA RecName: Full=Glycosyltransferase-like domain-containing protein 2;
           AltName: Full=Extracellular O-linked N-acetylglucosamine
           transferase-like; Flags: Precursor
 gi|56605348|emb|CAI30871.1| glycosyltransferase [Xenopus laevis]
          Length = 578

 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 63/249 (25%), Positives = 111/249 (44%), Gaps = 27/249 (10%)

Query: 109 NFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRF--TRHPIININN 166
           N  H F D  +P+F TI   FPD D    +    + W       L +F   + P++    
Sbjct: 162 NLMHVFHDDLLPIFYTIQ-QFPDLDFESRLF-FMEGWNEGLHFELYKFMSNKQPLLKEQL 219

Query: 167 QTITH--CFQSVTLGL----ISHGRMVINPTLLPKPKTLVD------FQSFLANAYNENT 214
           +T+    CF    +GL      +    + P   PK   LV       F  F+    N   
Sbjct: 220 KTLGRLLCFTKSYVGLSKITTWYQYGFVQPQG-PKANILVSGNEIRHFAKFMMGKLNITL 278

Query: 215 NTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFR 273
           + +++    +  +VL +R+  + R I+N  E+  A A+E     TI    E  S +D  R
Sbjct: 279 DQNAA----EAYIVLFSRS--MNRLIVNEAELLLALAQEFQMK-TITVSLEDHSFSDIVR 331

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEY 331
            + +   +V +HGA L  SLFL  G+V++++ P G        ++  + + G+E  Y+ +
Sbjct: 332 LLSNATMLVSMHGAQLVTSLFLPKGAVVVELFPYGINPEHYTPYKTLSTLPGMELQYVAW 391

Query: 332 KIKQEESSL 340
           +  +EE+++
Sbjct: 392 QNTEEENTI 400


>gi|213625326|gb|AAI70355.1| Glycosyltransferase [Xenopus laevis]
 gi|213626901|gb|AAI70357.1| Ago61 protein [Xenopus laevis]
          Length = 578

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 63/249 (25%), Positives = 111/249 (44%), Gaps = 27/249 (10%)

Query: 109 NFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRF--TRHPIININN 166
           N  H F D  +P+F TI   FPD D    +    + W       L +F   + P++    
Sbjct: 162 NLMHVFHDDLLPIFYTIQ-QFPDLDFESRLF-FMEGWNEGLHFELYKFMSNKQPLLKEQL 219

Query: 167 QTITH--CFQSVTLGL----ISHGRMVINPTLLPKPKTLVD------FQSFLANAYNENT 214
           +T+    CF    +GL      +    + P   PK   LV       F  F+    N   
Sbjct: 220 KTLGRLLCFTKSYVGLSKITTWYQYGFVQPQG-PKANILVSGNEIRHFAKFMMGKLNITL 278

Query: 215 NTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFR 273
           + +++    +  +VL +R+  + R I+N  E+  A A+E     TI    E  S +D  R
Sbjct: 279 DQNAA----EAYIVLFSRS--MNRLIVNEAELLLALAQEFQMK-TITVSLEDHSFSDIVR 331

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEY 331
            + +   +V +HGA L  SLFL  G+V++++ P G        ++  + + G+E  Y+ +
Sbjct: 332 LLSNATMLVSMHGAQLVTSLFLPKGAVVVELFPYGINPEHYTPYKTLSTLPGMELQYVAW 391

Query: 332 KIKQEESSL 340
           +  +EE+++
Sbjct: 392 QNTEEENTI 400


>gi|428212875|ref|YP_007086019.1| hypothetical protein Oscil6304_2480 [Oscillatoria acuminata PCC
           6304]
 gi|428001256|gb|AFY82099.1| tetratricopeptide repeat protein [Oscillatoria acuminata PCC 6304]
          Length = 692

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 44/77 (57%), Gaps = 3/77 (3%)

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           + ++R     R  +N  E+ +   + GF + I E   S S+++    + S  A+V  HGA
Sbjct: 547 IYISRQRSQFRRFVNEEEILECLGKFGFKMVILE---SFSVSEQIAIMASAKAIVAPHGA 603

Query: 288 GLTHSLFLRPGSVLMQV 304
           GLT+++F +PG+ L+++
Sbjct: 604 GLTNAVFCQPGTKLIEI 620


>gi|152967606|ref|YP_001363390.1| capsular polysaccharide biosynthesis protein-like protein
           [Kineococcus radiotolerans SRS30216]
 gi|151362123|gb|ABS05126.1| Capsular polysaccharide biosynthesis protein-like [Kineococcus
           radiotolerans SRS30216]
          Length = 398

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 43/81 (53%), Gaps = 3/81 (3%)

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           V V+R     R +LN  +V  A    GF +       S ++ +      S   +VGVHGA
Sbjct: 262 VFVDRPETDRRRMLNREDVVAALRGRGFRIVDLA---SLTVQEEAVLFGSAAVVVGVHGA 318

Query: 288 GLTHSLFLRPGSVLMQVVPIG 308
           GLT+ +F RPG+V++++ P G
Sbjct: 319 GLTNLVFCRPGTVVLELAPRG 339


>gi|452821526|gb|EME28555.1| transferase, transferring glycosyl groups [Galdieria sulphuraria]
          Length = 433

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 32/116 (27%), Positives = 60/116 (51%), Gaps = 8/116 (6%)

Query: 226 KLVLVNRNA--RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
           K++++ RN   R+ R  +  +E++K     G  V+I   E  ++L  + + +H    ++ 
Sbjct: 280 KIIMIQRNTTRRIRRAEIMTKELEKV---FGVKVSIILFEYLSALQQA-QLMHRVSIVIA 335

Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
            HGA L++ +F++ GSVL+++ P         YF   A+ LGL Y       +ES+
Sbjct: 336 AHGASLSNIIFMKRGSVLIELSPSLCS--GEGYFGALAKFLGLTYFHISGMHQESN 389


>gi|365898907|ref|ZP_09436839.1| hypothetical protein BRAO3843_3160069 [Bradyrhizobium sp. STM 3843]
 gi|365420397|emb|CCE09381.1| hypothetical protein BRAS3843_3160069 [Bradyrhizobium sp. STM 3843]
          Length = 386

 Score = 47.4 bits (111), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 31/115 (26%), Positives = 57/115 (49%), Gaps = 6/115 (5%)

Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGL 289
           ++R     R +LN   V +  ++ GF+V   E   + S+ +    + SC  ++G HG G+
Sbjct: 252 ISRGLAGYRRVLNEDIVIEMLKKRGFEVAAME---TMSVPEQAAVMASCEVVIGPHGGGM 308

Query: 290 THSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKY 344
           ++ +F  PG+   +++ I +  L   YF K +  LGL+Y     K   ++L   Y
Sbjct: 309 SNVIFCSPGT---KIIEIYSPELVATYFWKLSNQLGLQYYYMLGKGHPTTLGTDY 360


>gi|58332110|ref|NP_001011203.1| glycosyltransferase-like domain-containing protein 2 precursor
           [Xenopus (Silurana) tropicalis]
 gi|82076796|sp|Q5NDE6.1|GTDC2_XENTR RecName: Full=Glycosyltransferase-like domain-containing protein 2;
           AltName: Full=Extracellular O-linked N-acetylglucosamine
           transferase-like; Flags: Precursor
 gi|56605350|emb|CAI30872.1| glycosyltransferase [Xenopus (Silurana) tropicalis]
 gi|134024387|gb|AAI35938.1| ago61 protein [Xenopus (Silurana) tropicalis]
          Length = 576

 Score = 47.4 bits (111), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 63/249 (25%), Positives = 110/249 (44%), Gaps = 27/249 (10%)

Query: 109 NFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRF--TRHPIININN 166
           N  H F D  +P+F TI   F D D    +    + W       L +F   + P++    
Sbjct: 162 NLMHVFHDDLIPIFYTIQ-QFADLDFESRLFFM-EGWNEGLHFELYKFMSNKQPLLKEQL 219

Query: 167 QTITH--CFQSVTLGL----ISHGRMVINPTLLPKPKTLVD------FQSFLANAYNENT 214
           +T+    CF    +GL      +    + P   PK   LV       F  F+    N   
Sbjct: 220 KTLGRLLCFTKSYVGLSKITTWYQYGFVQPQG-PKANILVSGNEIRHFAKFMMGKLNITK 278

Query: 215 NTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFR 273
           + +++    +  +VL +R+  + R I+N  E+  A A+E     TI    E  S AD  R
Sbjct: 279 DQNAA----EAYIVLFSRS--MNRLIVNEAELLLALAQEFQMK-TITVSLEDHSFADIVR 331

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEY 331
            I +   +V +HGA L  SLFL  G++++++ P G        ++  + + G+E  Y+ +
Sbjct: 332 LISNATMLVSMHGAQLITSLFLPKGAIVVELFPYGVNPEHYTPYKTLSTLPGMELQYVAW 391

Query: 332 KIKQEESSL 340
           +  +EE+++
Sbjct: 392 QNTEEENTI 400


>gi|89272827|emb|CAJ82077.1| novel protein [Xenopus (Silurana) tropicalis]
          Length = 576

 Score = 47.4 bits (111), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 63/249 (25%), Positives = 110/249 (44%), Gaps = 27/249 (10%)

Query: 109 NFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRF--TRHPIININN 166
           N  H F D  +P+F TI   F D D    +    + W       L +F   + P++    
Sbjct: 162 NLMHVFHDDLIPIFYTIQ-QFADLDFESRLF-FMEGWNEGLHFELYKFMSNKQPLLKEQL 219

Query: 167 QTITH--CFQSVTLGL----ISHGRMVINPTLLPKPKTLVD------FQSFLANAYNENT 214
           +T+    CF    +GL      +    + P   PK   LV       F  F+    N   
Sbjct: 220 KTLGRLLCFTKSYVGLSKITTWYQYGFVQPQG-PKANILVSGNEIRHFAKFMMGKLNITK 278

Query: 215 NTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFR 273
           + +++    +  +VL +R+  + R I+N  E+  A A+E     TI    E  S AD  R
Sbjct: 279 DQNAA----EAYIVLFSRS--MNRLIVNEAELLLALAQEFQMK-TITVSLEDHSFADIVR 331

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEY 331
            I +   +V +HGA L  SLFL  G++++++ P G        ++  + + G+E  Y+ +
Sbjct: 332 LISNATMLVSMHGAQLITSLFLPKGAIVVELFPYGVNPEHYTPYKTLSTLPGMELQYVAW 391

Query: 332 KIKQEESSL 340
           +  +EE+++
Sbjct: 392 QNTEEENTI 400


>gi|193632082|ref|XP_001951734.1| PREDICTED: uncharacterized glycosyltransferase AER61-like
           [Acyrthosiphon pisum]
          Length = 499

 Score = 47.4 bits (111), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 54/259 (20%), Positives = 115/259 (44%), Gaps = 30/259 (11%)

Query: 90  CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN-----SHFPDQDVILAIADCNDQ 144
           C +  T P ++        N +H F D F+ L+ + +     +    +D+ + + + +  
Sbjct: 197 CDLIITEPTVIMKIDA-TVNMYHHFCD-FLNLYASQHVNGSGASMFSKDIHILVWE-SFA 253

Query: 145 WARKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGRMVI----NPTLLPKPKTL 199
           +   +++    FT+HP+ N+N  +    CF  + L L+   RM+     N  L+   +  
Sbjct: 254 YESAFSDTFQAFTKHPVWNLNTFRGKVVCFNDIVLPLLP--RMIFGLYYNTPLIDGCENS 311

Query: 200 VDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTI 259
             F++F  +  +          + K ++  ++RN +  R +LN  E+  A +    + + 
Sbjct: 312 GLFKAFSQHVLHRLNIDQKPNANEKIRITFLSRNTKY-RNVLNENELITALK----NYSQ 366

Query: 260 FEPEESTSLADSFRFIHSCHA------MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLS 313
           +E ++     +   F    H        +G+HGAGLTH LFL   +VL ++     +   
Sbjct: 367 YEVKKVVYSGNFLTFKEQVHITYNTDIFIGMHGAGLTHLLFLPEWAVLFELYNCEDEHC- 425

Query: 314 TVYFEKPARVLGLEYLEYK 332
              ++  AR+ G++Y+ ++
Sbjct: 426 ---YKDLARLRGVKYITWR 441


>gi|242049810|ref|XP_002462649.1| hypothetical protein SORBIDRAFT_02g029517 [Sorghum bicolor]
 gi|241926026|gb|EER99170.1| hypothetical protein SORBIDRAFT_02g029517 [Sorghum bicolor]
          Length = 114

 Score = 47.4 bits (111), Expect = 0.014,   Method: Composition-based stats.
 Identities = 31/83 (37%), Positives = 45/83 (54%), Gaps = 7/83 (8%)

Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGAN-WS-NMRV 367
           ++L + +   PA+V  +E     I  +ESSL  +Y   D VL +P   A A  W    +V
Sbjct: 17  EYLLSGWCSSPAKVASVE-----IAPKESSLSREYPTGDPVLTDPAGVAHARGWDVTKKV 71

Query: 368 YLKTQNVKLDIDRFRIYLKDAYK 390
           YL  QNV+LD+ RFR  L  A++
Sbjct: 72  YLDRQNVRLDLARFREELVGAHR 94


>gi|72057504|ref|XP_791565.1| PREDICTED: uncharacterized glycosyltransferase AGO61-like
           [Strongylocentrotus purpuratus]
          Length = 610

 Score = 47.4 bits (111), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 55/220 (25%), Positives = 94/220 (42%), Gaps = 28/220 (12%)

Query: 108 GNFFHEFMDCFVPLFIT------INSHFPDQDVILAIADCNDQWARKYAELLPRFTRH-P 160
           GN  H F D  +P+F        +N      D  L  +D  DQ    + +L    ++  P
Sbjct: 161 GNLMHVFHDDLLPVFYMQLQLGLLNLRSGVNDATLLTSD--DQEEGPFFQLYKHISKKTP 218

Query: 161 II----NINNQTITHCFQSVTLGLISHGRMVINPTLLPK---------PKTLVDFQSFLA 207
           I+    + +  +   CF+ V +GL             P+          K +  F SF  
Sbjct: 219 ILTHQLSQDGSSRIACFEEVHVGLSKFTTWYQYGFDKPQGPLDDIAVTSKEITLFTSFYK 278

Query: 208 NAYN-ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEEST 266
           +  N +  ++++   +T    V+++R  R  R ILN  E+  A  +    + +    E  
Sbjct: 279 SKLNIDGCDSNAEIENT---FVILSR--RTNRLILNEVELSLALAQHFDALVVVASLEMY 333

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
           SL++    I     +V VHG+ L+ S+FL PGSVL+++ P
Sbjct: 334 SLSELIGLISCSKGLVAVHGSLLSLSIFLPPGSVLVEIFP 373


>gi|159464961|ref|XP_001690707.1| glycosyltransferase-like protein [Chlamydomonas reinhardtii]
 gi|158270384|gb|EDO96234.1| glycosyltransferase-like protein [Chlamydomonas reinhardtii]
          Length = 579

 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 53/104 (50%), Gaps = 26/104 (25%)

Query: 226 KLVLVNRN---ARVGRTILNLREVKKAA------------------EELGFDVTIFEPEE 264
           KLV ++RN   + VGRTILN +E+ +                    + L F  T  E   
Sbjct: 328 KLVFISRNDDQSAVGRTILNEQELVQLCNAAPAGDLPPAAWGSPYKKYLCFAHTFGE--- 384

Query: 265 STSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
            + L D +  +    A++G+HGAGLT+ L+++PG  +++V P G
Sbjct: 385 -SQLMDVW-VMRKVDAILGMHGAGLTNGLYMKPGGAIIEVRPFG 426


>gi|339247853|ref|XP_003375560.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316971062|gb|EFV54900.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 554

 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 68/295 (23%), Positives = 123/295 (41%), Gaps = 32/295 (10%)

Query: 90  CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--INSHF-PDQDVILAIADCNDQWA 146
           C V    P +V        N +H F D F+ L+++  +N  F  D D+IL   D +  + 
Sbjct: 254 CDVIFDQPTIVIKLDA-GINMYHHFCD-FINLYLSQHLNGSFHQDVDIILWDTDAS-PYF 310

Query: 147 RKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGRMVINPTL-LPKPKTLVDFQS 204
             + E    FT  P+I++ +      CF+ V   +++  R V      +P P       S
Sbjct: 311 DIFRETWLAFTTKPLIDLQDFDGKRVCFREVMFPVLA--RKVFGLYYNMPMPDYCR--AS 366

Query: 205 FLANAYNENTNTSSSFHHTKP-----KLVLVNRNARVGRTILNLRE-VKKAAEELGFDVT 258
            L  A++ +           P     ++  + R+++  R I+N  E VK+   +  F VT
Sbjct: 367 GLVQAFSHHLIHRLQLKQNGPLREKLRVTYLVRSSQY-RIIMNTNEIVKRLKADPQFSVT 425

Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
           + +      + + ++  H+    + +HGAGLTH  FL   +V+ ++   G        + 
Sbjct: 426 VAKYTLDIPVLEQYQMSHNTDIFMSIHGAGLTHMFFLPDWAVVFELYHCG----DPECYR 481

Query: 319 KPARVLGLEYL----EYKIKQEESSL-----VEKYGANDLVLKNPQAFAGANWSN 364
             A + GL++     E K++ +E  +        Y   D V++     A   WSN
Sbjct: 482 DLATLRGLKFFGWEDETKVQYQEKDVKNPKFTNFYLDADEVMRIVHLAAEYVWSN 536


>gi|400530656|gb|AFP86479.1| glycosyltransferase, partial [Halosauropsis macrochir]
          Length = 274

 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 70/156 (44%), Gaps = 14/156 (8%)

Query: 194 PKPKTLV------DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
           PK   LV       F +FLA   N           T   +V+ +R+  V R ILN  E+ 
Sbjct: 2   PKANILVSGNEVRQFSAFLAEKLN--VTVGEQVEKTDEYIVVFSRS--VNRLILNEAELI 57

Query: 248 KA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            A A+E    V     EE  S AD  R I     +V +HGA L  SLFL  G+ ++++ P
Sbjct: 58  LALAQEFQLKVVTVSLEEQ-SFADIIRVISRASMLVSMHGAQLVTSLFLPRGAAVVELFP 116

Query: 307 IGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
                     ++    + G  L+Y+ ++  +EE+S+
Sbjct: 117 YAVNPEHYAPYKTLTSLPGMDLQYVAWRNTKEENSV 152


>gi|56756779|gb|AAW26561.1| SJCHGC00943 protein [Schistosoma japonicum]
          Length = 369

 Score = 47.0 bits (110), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 60/128 (46%), Gaps = 14/128 (10%)

Query: 193 LPKPKTLVDFQSFLANAYNENTNTSSSFHHTK----PKLVLVNR----------NARVGR 238
           L K   + +F+ F+  +YN N +  +    T     PK+V+V+R          N  + R
Sbjct: 175 LIKNMFIEEFRQFIFQSYNINNDEDTCQKRTSIRFLPKIVIVSRRDYIAHPRNINGTIHR 234

Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
            I N  E+     +LGF  +        ++ +  + I S   ++G+HGA LT+SL L   
Sbjct: 235 KITNELELLNELNQLGFQNSKVVCFTDLTMQEQLKLIMSTDILIGMHGAALTYSLLLSNT 294

Query: 299 SVLMQVVP 306
           S ++++ P
Sbjct: 295 SCVIELFP 302


>gi|163853710|ref|YP_001641753.1| capsular polysaccharide biosynthesis protein-like protein
           [Methylobacterium extorquens PA1]
 gi|163665315|gb|ABY32682.1| Capsular polysaccharide biosynthesis protein-like protein
           [Methylobacterium extorquens PA1]
          Length = 414

 Score = 47.0 bits (110), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 5/95 (5%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
           + + + V+R+   GR + N   V      LGF+   F+PE ++      RF  +   +VG
Sbjct: 280 RGRRLFVDRHPGRGRGLANRDAVLGLLRGLGFEA--FDPELTSVRQQVVRF-SAAEIVVG 336

Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
           + GAG+T+++F +PG+ ++ +VP G  W    Y E
Sbjct: 337 IAGAGMTNTVFCQPGTPVIHLVPEG--WEDRFYGE 369


>gi|218532568|ref|YP_002423384.1| capsular polysaccharide biosynthesis protein-like protein
           [Methylobacterium extorquens CM4]
 gi|218524871|gb|ACK85456.1| capsular polysaccharide biosynthesis protein-like protein
           [Methylobacterium extorquens CM4]
          Length = 414

 Score = 47.0 bits (110), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 5/95 (5%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
           + + + V+R+   GR + N   V      LGF+   F+PE ++      RF  +   +VG
Sbjct: 280 RGRRLFVDRHPGRGRGLANRDAVLGLLHGLGFEA--FDPELTSVRQQVVRF-SAAEIVVG 336

Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
           + GAG+T+++F +PG+ ++ +VP G  W    Y E
Sbjct: 337 IAGAGMTNTVFCQPGTPVIHLVPEG--WEDRFYGE 369


>gi|384261549|ref|YP_005416735.1| Capsular polysaccharide biosynthesis protein-like protein
           [Rhodospirillum photometricum DSM 122]
 gi|378402649|emb|CCG07765.1| Capsular polysaccharide biosynthesis protein-like protein
           [Rhodospirillum photometricum DSM 122]
          Length = 384

 Score = 46.6 bits (109), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 47/173 (27%), Positives = 73/173 (42%), Gaps = 23/173 (13%)

Query: 169 ITHCFQSVTLGL-----------ISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTS 217
           IT  FQ  TL L           I  GR V+   L+  P    +   F  N  +EN +  
Sbjct: 176 ITLPFQQETLALAAGARPLRFHEIDRGRTVVVDDLVFVP----NLNDFGWNLRHENLDLY 231

Query: 218 SSFHHTKP--KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFI 275
                  P  + + + R     R ++N  E+       GFD+    P E  S+ +  R +
Sbjct: 232 PGLRPRGPGERKLYLARGPSAKRPLVNEPEIIARLAAAGFDII---PLEGVSVHEQARLM 288

Query: 276 HSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
                +V  HGAGLT+ +F  PG+ + ++      +L+ V F + A +LGL Y
Sbjct: 289 AEARVIVAPHGAGLTNLVFCGPGTTVCEL--HMDSYLNWV-FRRMANLLGLRY 338


>gi|357509677|ref|XP_003625127.1| Glycosyltransferase [Medicago truncatula]
 gi|355500142|gb|AES81345.1| Glycosyltransferase [Medicago truncatula]
          Length = 87

 Score = 46.6 bits (109), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 16/45 (35%), Positives = 27/45 (60%)

Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWL 312
           +      ++ C  + GVHGA LTH +F++P  V +Q+V +G  W+
Sbjct: 39  IGKDLELLNECDVVFGVHGAALTHFMFMKPSFVFIQIVSLGIHWI 83


>gi|339021440|ref|ZP_08645518.1| hypothetical protein ATPR_1826 [Acetobacter tropicalis NBRC 101654]
 gi|338751504|dbj|GAA08822.1| hypothetical protein ATPR_1826 [Acetobacter tropicalis NBRC 101654]
          Length = 393

 Score = 46.6 bits (109), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 42/79 (53%), Gaps = 3/79 (3%)

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           + + R     R + N  ++ +A  E GF  T+  PE + S+AD  R       +VG  GA
Sbjct: 256 LFIERGGASNRLMPNEADLAQALAEAGF--TVVRPE-TLSVADQMRLFAKARLVVGALGA 312

Query: 288 GLTHSLFLRPGSVLMQVVP 306
           G+ +  + RPG+V+ ++VP
Sbjct: 313 GMANLAWCRPGTVICELVP 331


>gi|119500630|ref|XP_001267072.1| hypothetical protein NFIA_106630 [Neosartorya fischeri NRRL 181]
 gi|119415237|gb|EAW25175.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 488

 Score = 46.6 bits (109), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 37/137 (27%), Positives = 60/137 (43%), Gaps = 10/137 (7%)

Query: 225 PKLVLVNRNARVGRTILN----LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHA 280
           P++V+   +    R ++N     + VKK    +   +  F    S    +  R       
Sbjct: 336 PQIVVTFIDRTGSRKLINQEDYFKTVKKQFPHITVQMIDFA---SIPFREQLRIAQESDI 392

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSL 340
           +VGVHGAGLTH +FL  GSV+++++P G   L+   F   A +LG  Y      +   ++
Sbjct: 393 LVGVHGAGLTHGIFLPSGSVMVEILPPG---LNHKGFRNLASLLGHLYFSAHATKPAKTV 449

Query: 341 VEKYGANDLVLKNPQAF 357
                 ND V    + F
Sbjct: 450 KRDDWHNDDVYLEHEKF 466


>gi|242056795|ref|XP_002457543.1| hypothetical protein SORBIDRAFT_03g009115 [Sorghum bicolor]
 gi|241929518|gb|EES02663.1| hypothetical protein SORBIDRAFT_03g009115 [Sorghum bicolor]
          Length = 107

 Score = 46.6 bits (109), Expect = 0.022,   Method: Composition-based stats.
 Identities = 18/34 (52%), Positives = 24/34 (70%)

Query: 90  CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
           C V H +P  V +AGGYNGN+FH F D F+P ++
Sbjct: 74  CDVRHDAPVHVVTAGGYNGNYFHAFNDGFLPSWV 107


>gi|386397552|ref|ZP_10082330.1| hypothetical protein Bra1253DRAFT_03074 [Bradyrhizobium sp.
           WSM1253]
 gi|385738178|gb|EIG58374.1| hypothetical protein Bra1253DRAFT_03074 [Bradyrhizobium sp.
           WSM1253]
          Length = 366

 Score = 46.6 bits (109), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 43/81 (53%), Gaps = 7/81 (8%)

Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           + V V R  +  R + N +E+  A  + GF +  F+  E     D   F   C  +VG H
Sbjct: 235 RRVYVPRKGQ--RKVANEQELMPALRKFGFQIYDFDDVE-----DEAAFFSECAIVVGPH 287

Query: 286 GAGLTHSLFLRPGSVLMQVVP 306
           GAGLT+ +F  PG+ +++++P
Sbjct: 288 GAGLTNLVFCSPGTKVLELIP 308


>gi|300865615|ref|ZP_07110389.1| Tetratricopeptide TPR_2 repeat protein (fragment) [Oscillatoria sp.
           PCC 6506]
 gi|300336370|emb|CBN55539.1| Tetratricopeptide TPR_2 repeat protein (fragment) [Oscillatoria sp.
           PCC 6506]
          Length = 438

 Score = 46.6 bits (109), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 46/163 (28%), Positives = 73/163 (44%), Gaps = 14/163 (8%)

Query: 204 SFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE 263
           +FL   +     T SS+    P  + ++RN    R + N  +V +   + GF V IF   
Sbjct: 282 NFLRRVFLAGITTLSSY----PDRIYISRNKARYRRVFNEEDVMEELSKFGF-VRIFL-- 334

Query: 264 ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP---IG-TQWLSTVYFEK 319
           ES  L +          +V  HG+GLT+++F +PG+ ++++V    +G   W S+ Y   
Sbjct: 335 ESMPLQEQIALFFHAKMIVTPHGSGLTNTIFCQPGTKVIELVSPHYVGHYYWPSSHYLHL 394

Query: 320 PARVLGLEYLE-YKIKQ--EESSLVEKYGANDLVLKNPQAFAG 359
               L  E  E Y I+Q   ++SL E    N   LK      G
Sbjct: 395 DHYYLTGEVFECYPIRQLMYQNSLTEDIMINLSSLKKMMELIG 437


>gi|357606973|gb|EHJ65302.1| hypothetical protein KGM_14784 [Danaus plexippus]
          Length = 272

 Score = 46.2 bits (108), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 62/266 (23%), Positives = 112/266 (42%), Gaps = 26/266 (9%)

Query: 55  SLRIKTQPYPRKSDKSAMSKVKELTITTSAP-PNLSCGVTHTSPALVFSAGGYNGNFFHE 113
           S R+K +     + +S   +      T   P P+  C +T   P  +        N +H 
Sbjct: 17  STRLKNEAEHMSALQSWAPEFVNFVKTPGRPIPDGMCDITIDKPTYIMKLDA-TVNMYHH 75

Query: 114 FMDCF-VPLFITINSHFP-----DQDVILAIADCNDQWARKYAELLPRFTRHPIININN- 166
           F D F +   + +NS  P     D  +++      D     + +    FT +PI ++   
Sbjct: 76  FCDFFNLYASLHVNSTHPSTFSRDNHILVWETFTYDS---AFKDAFKAFTSNPIWDLKEF 132

Query: 167 QTITHCFQSVTLGLISHGRMVI----NPTLLPKPKTLVDFQSF---LANAYNENTNTSSS 219
           +  T CF++    L+   RM+     N  L+   +T   F SF   + ++ N   +  + 
Sbjct: 133 RGKTVCFKNAVFPLLP--RMIFGLYYNTPLIYGCETSGLFHSFSKHILHSLNVKLHLRTD 190

Query: 220 FHHTKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPEESTSLADSFRFIHSC 278
               + ++ L++R     RTILN +E+ +A  ++ G+ V     + +          H+ 
Sbjct: 191 ---DRVRITLLSRGTTY-RTILNEQEIVEALLKVKGYYVQRVVYDRTVPFTKQLDITHNT 246

Query: 279 HAMVGVHGAGLTHSLFLRPGSVLMQV 304
              +G+HGAGLTH LFL   + L +V
Sbjct: 247 DVFIGMHGAGLTHLLFLPDWAALFEV 272


>gi|376004134|ref|ZP_09781892.1| putative tetratricopeptide repeats containing glycosyl transferase,
           N-acetylglucosaminyl-transferase P110 family
           [Arthrospira sp. PCC 8005]
 gi|375327468|emb|CCE17645.1| putative tetratricopeptide repeats containing glycosyl transferase,
           N-acetylglucosaminyl-transferase P110 family
           [Arthrospira sp. PCC 8005]
          Length = 1679

 Score = 46.2 bits (108), Expect = 0.027,   Method: Composition-based stats.
 Identities = 24/81 (29%), Positives = 46/81 (56%), Gaps = 3/81 (3%)

Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
           PKL+ ++R++   R +LN   +    ++ GF  T   PE + S+A   +       +V  
Sbjct: 638 PKLIYISRHSAKYRRLLNEEAIASLLQQRGF--TTINPE-TLSVAQQAQVFSQAKVIVAS 694

Query: 285 HGAGLTHSLFLRPGSVLMQVV 305
           HG+GLT+ +F +PGS +++++
Sbjct: 695 HGSGLTNLIFCQPGSTIIELM 715


>gi|209524105|ref|ZP_03272656.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
 gi|209495480|gb|EDZ95784.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
          Length = 1676

 Score = 46.2 bits (108), Expect = 0.027,   Method: Composition-based stats.
 Identities = 24/81 (29%), Positives = 46/81 (56%), Gaps = 3/81 (3%)

Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
           PKL+ ++R++   R +LN   +    ++ GF  T   PE + S+A   +       +V  
Sbjct: 638 PKLIYISRHSAKYRRLLNEEAIASLLQQRGF--TTINPE-TLSVAQQAQVFSQAKVIVAS 694

Query: 285 HGAGLTHSLFLRPGSVLMQVV 305
           HG+GLT+ +F +PGS +++++
Sbjct: 695 HGSGLTNLIFCQPGSTIIELM 715


>gi|407844226|gb|EKG01856.1| hypothetical protein TCSYLVIO_007137 [Trypanosoma cruzi]
          Length = 591

 Score = 46.2 bits (108), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 74/356 (20%), Positives = 131/356 (36%), Gaps = 69/356 (19%)

Query: 90  CGVTHTSPAL-VFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVIL-AIADCN----- 142
           C    T+PA+ +F   G++   +H + +   P F T+   F  +D +   I D       
Sbjct: 233 CKYVVTAPAVFIFRMSGHST--YHLWRNNLGPFFDTLRDDFGIKDRLRKGIFDGESPVVI 290

Query: 143 --DQWARK-------YAELLPRFTRHPIININNQTITHCFQSVTLGLI--SHGRMVINPT 191
             D+  R          ELL  F   P++  +  T   CF    LG+   S  +  +   
Sbjct: 291 TVDKKPRSGPKAPHLLDELLHYFIDLPVLEASEFTEPTCFTRAILGISTNSFSQAALRHW 350

Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSSSFHHT----------KPKLVLVNR--------- 232
           LLP+      F  FL      +  T S  H T          K  L+L N+         
Sbjct: 351 LLPRVALQHRFLQFLRKRNKRDVATISGVHATTLRCLMPQYDKRSLILFNQWFAWKLRQW 410

Query: 233 -----------------NARVGRTILNLREVKKAAEELGFDVT------IFEPEESTSLA 269
                            N   GR ++N  +V  A E     +T      +F   E  +  
Sbjct: 411 QGLPQRPNVLYLSRNHPNITRGRRVVNEEDVMPALEAAVLAMTGGSLRRVF--LEEMAYV 468

Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE-----KPARVL 324
           D    +   + ++  HG G+ + +++ PGSV+++ VP     L  +Y +         VL
Sbjct: 469 DQIAAMLETNILIAPHGGGIANCVWMPPGSVVVEFVPPAGATLPEMYHKMCRDAAGGGVL 528

Query: 325 GLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDR 380
            ++++ +  +Q+ + L   + A +   K  +      W + R  ++     LD+ R
Sbjct: 529 PIQHISFVAEQDPAELEPDFAATNPAWKRNKRLFSNIWVSKRQLIEYFAKALDLYR 584


>gi|434398683|ref|YP_007132687.1| Glycosyltransferase AER61, uncharacterized [Stanieria cyanosphaera
           PCC 7437]
 gi|428269780|gb|AFZ35721.1| Glycosyltransferase AER61, uncharacterized [Stanieria cyanosphaera
           PCC 7437]
          Length = 900

 Score = 46.2 bits (108), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 39/84 (46%), Gaps = 3/84 (3%)

Query: 221 HHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHA 280
           HH  P LV ++R    GR ++N  EV     +LGF        E  S+ +          
Sbjct: 755 HHRYPDLVYISRAQARGRQVINETEVTDLLNQLGFKTVFL---EKMSVLEQVALFAHAKV 811

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQV 304
           +V  HG+ LT+ +F  PG  ++++
Sbjct: 812 IVSPHGSSLTNLVFCNPGVTVVEL 835


>gi|110637270|ref|YP_677477.1| capsular polysaccharide biosynthesis protein [Cytophaga
           hutchinsonii ATCC 33406]
 gi|110279951|gb|ABG58137.1| capsular polysaccharide biosynthesis protein [Cytophaga
           hutchinsonii ATCC 33406]
          Length = 366

 Score = 46.2 bits (108), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 47/215 (21%), Positives = 98/215 (45%), Gaps = 25/215 (11%)

Query: 100 VFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRH 159
           V++   ++ N+FH F D    LF+ ++    D   +L +      +  +  ELL     H
Sbjct: 125 VWAHDSWSNNYFHWFNDTLPRLFL-LSKQIEDSVAVLPVELSKITFIVESLELLK--IEH 181

Query: 160 PIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSS 219
             I+   Q  +H F+S+++   +  +  INP L  + +  V    F A   +        
Sbjct: 182 QWID---QKKSHRFESLSVLHTATLQPDINPLLQKQMRDAV----FSAMKIDP------- 227

Query: 220 FHHTKP-KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSC 278
               +P + + ++R     R I+N +E+    ++ G+D+      E+ S  +  +     
Sbjct: 228 --QERPFRKIYISRAHARYRKIINEQELLPVLKKYGYDIIY---PETYSFKEQVKLFAES 282

Query: 279 HAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLS 313
           +A++ +HGAG T+ +F++  + +M++    T+W S
Sbjct: 283 NALISIHGAGHTNCMFMKQDAKVMEIR--NTEWES 315


>gi|167522571|ref|XP_001745623.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775972|gb|EDQ89594.1| predicted protein [Monosiga brevicollis MX1]
          Length = 526

 Score = 45.8 bits (107), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 64/125 (51%), Gaps = 11/125 (8%)

Query: 229 LVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           + N+NA++ +     +   K A+  G  D   ++  + T L D+     S   ++GVHG 
Sbjct: 385 MCNQNAKMRQLRDESQLWAKLAQRYGHVDFVTYKSRDYT-LTDTIELFKSAEMVLGVHGG 443

Query: 288 GLTHSLFLRPGSVLMQV-VPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGA 346
            L + LF RPG+ L+++ +P  T    T ++E  A  L L+Y+   ++ ++  +    GA
Sbjct: 444 ALANVLFSRPGATLVELTIPRNT----TRHYEHMAAALDLQYVAVDVQPDQRGV----GA 495

Query: 347 NDLVL 351
            ++VL
Sbjct: 496 PEVVL 500


>gi|444705665|gb|ELW47065.1| hypothetical protein TREES_T100015279 [Tupaia chinensis]
          Length = 440

 Score = 45.8 bits (107), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 101/230 (43%), Gaps = 40/230 (17%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
           N +H F D F+ L+IT  +N+ F   DV + + D +   +   +AE    FT + +I++ 
Sbjct: 187 NMYHHFCD-FINLYITQHVNNSFS-TDVYVVMWDTSSYGYGDLFAETWKAFTDYDVIHLK 244

Query: 166 NQTITH-CFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTK 224
                  CF+                +LLP+ +  + + + L                 K
Sbjct: 245 TYDSKRVCFKEAVF------------SLLPRMRYGLFYNTPLDG---------------K 277

Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMV 282
            ++ ++ R+    R ILN  E+  A + +  F+V I + + +     D  +  H+    +
Sbjct: 278 IRVTILARSTEY-RKILNQNELVNALKTVSTFEVRIVDYKYKQLGFLDQLKITHNTDIFI 336

Query: 283 GVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
           G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ +K
Sbjct: 337 GMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIYYITWK 382


>gi|423066047|ref|ZP_17054837.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
 gi|406712546|gb|EKD07731.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
          Length = 985

 Score = 45.8 bits (107), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 46/81 (56%), Gaps = 3/81 (3%)

Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
           PKL+ ++R++   R +LN   +    ++ GF  T   PE + S+A   +       +V  
Sbjct: 93  PKLIYISRHSAKYRRLLNEEAIASLLQQRGF--TTINPE-TLSVAQQAQVFSQAKVIVAS 149

Query: 285 HGAGLTHSLFLRPGSVLMQVV 305
           HG+GLT+ +F +PGS +++++
Sbjct: 150 HGSGLTNLIFCQPGSTIIELM 170


>gi|317419036|emb|CBN81074.1| Uncharacterized glycosyltransferase [Dicentrarchus labrax]
          Length = 525

 Score = 45.8 bits (107), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 58/253 (22%), Positives = 109/253 (43%), Gaps = 18/253 (7%)

Query: 89  SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCN-DQW 145
            C +    P  VF       N +H F D FV L+I+  IN+ F   D+ + + D +   +
Sbjct: 224 QCDIIIEKPT-VFMKLDAGVNMYHHFCD-FVNLYISQHINNSF-SSDINIVMWDTSFYGY 280

Query: 146 ARKYAELLPRFTRHPIININNQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDF 202
              ++E    F+ + II++        CF+     L+   R  +  N  L+    +   F
Sbjct: 281 GDLFSETWRAFSEYDIIHLKTYDSKRVCFKDAFFSLLPRMRYGLFYNTPLILDCYSEGMF 340

Query: 203 QSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEE---LGFDVTI 259
           ++F  +  +            + ++ L+ R+    R ILN  E+  A +    L  +V  
Sbjct: 341 RAFSQHVLHRLNIPQDGPKEGRVRVTLLARSTEY-RRILNQVELVNALKTVPLLEVNVVD 399

Query: 260 FEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEK 319
           ++ ++   L    R  H+    +G+HGAGLTH LFL   +V+ ++     +      +  
Sbjct: 400 YKYKDVPFLV-QLRITHNSDIFIGMHGAGLTHLLFLPDWAVIFELYNCQDE----SCYRD 454

Query: 320 PARVLGLEYLEYK 332
            AR+ G+ Y+ ++
Sbjct: 455 LARLRGIRYVTWQ 467


>gi|376001680|ref|ZP_09779540.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375329948|emb|CCE15293.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 883

 Score = 45.4 bits (106), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 48/219 (21%), Positives = 91/219 (41%), Gaps = 33/219 (15%)

Query: 94  HTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELL 153
           H    + F +     N+FH  ++    L + + S  P   +++      +++  KY +  
Sbjct: 631 HLKGNIAFVSAYCGQNYFHWMVEVIPRLHLVLASGVPIDKIVV------NKFGHKYEDET 684

Query: 154 PRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT-LLPKPKTLVDFQSFLA----- 207
                 P     +Q +  CF+ +         ++I P+  L  PK   +F   L      
Sbjct: 685 LAMFDIP----EHQKMFGCFRHI------QAEVLIVPSRTLSTPKWACNFVKDLVLKHPF 734

Query: 208 NAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTS 267
            A +   N SS       K+ +   NA + R ++N +E+    + LGF V   E   + S
Sbjct: 735 MAEDNRCNYSS-------KIYISRANAYI-RKVINEQELMDILKPLGFQVVYLE---NMS 783

Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
           +      +H    ++  HGAGLT+ +F  PG+ ++++ P
Sbjct: 784 VKQQALCLHHAEVVISPHGAGLTNLVFCEPGTKVIELFP 822


>gi|357607839|gb|EHJ65710.1| hypothetical protein KGM_02173 [Danaus plexippus]
          Length = 289

 Score = 45.4 bits (106), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 52/210 (24%), Positives = 95/210 (45%), Gaps = 36/210 (17%)

Query: 149 YAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGRMVI----NPTLLPKPKTLVDFQ 203
           + +    FT +PI ++   +  T CF++    L+   RM+     N  L+   +T   F 
Sbjct: 43  FKDAFKAFTSNPIWDLKEFRGKTVCFKNAVFPLLP--RMIFGLYYNTPLIYGCETSGLFH 100

Query: 204 SF---LANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTI 259
           SF   + ++ N   +  +     + ++ L++R     RTILN +E+ +A  ++ G+ V  
Sbjct: 101 SFSKHILHSLNVKLHLRTD---DRVRITLLSRGTTY-RTILNEQEIVEALLKVKGYYVQR 156

Query: 260 FEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEK 319
              + +          H+    +G+HGAGLTH LFL   + L +V             E 
Sbjct: 157 VVYDRTVPFTKQLDITHNTDVFIGMHGAGLTHLLFLPDWAALFEVYNC----------ED 206

Query: 320 P------ARVLGLEYLEYK-----IKQEES 338
           P      AR+ GL+Y+ ++     ++QE+S
Sbjct: 207 PNCYADLARLRGLKYVTWEDKSKLVQQEQS 236


>gi|443691665|gb|ELT93454.1| hypothetical protein CAPTEDRAFT_191413 [Capitella teleta]
          Length = 501

 Score = 45.4 bits (106), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 52/223 (23%), Positives = 93/223 (41%), Gaps = 30/223 (13%)

Query: 130 PDQDVILAIADCNDQWARKY----AELLPRF-------TRHPIININNQTITH------- 171
           P+  +I  I   +++W   +     E LPR          +P I I+ +++ H       
Sbjct: 253 PEWTLIDKIFVISERWGEAFFHSLVENLPRLLPWLKYLQENPTIKIHIKSMYHNIIQKVF 312

Query: 172 --CFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVL 229
              +  VT G I    ++I P  +P  K  V     L   Y  +    +     +  LVL
Sbjct: 313 NISYSRVTAGNI-RANLIILPAGIPCGKPAVYPTQLLW--YQIHKQLPAIPMEERMNLVL 369

Query: 230 VNRNARVGRTILNLREVKK----AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           + R  R  R     +++ +     AE+    V +F+ +   S  D+    +    +V  H
Sbjct: 370 IKRLPRKNRYFAFHKDIHRLLVIEAEKHNMTVVVFDDQHMPSFEDTMLMFNQAFMVVSPH 429

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           GAGLT+ LF +PG++L++ +    +    + F   ++VLG  Y
Sbjct: 430 GAGLTNMLFSQPGTILIEGL---CRAALNLCFGAMSQVLGFRY 469


>gi|326502512|dbj|BAJ95319.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score = 45.4 bits (106), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 40/72 (55%), Gaps = 8/72 (11%)

Query: 235 RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
           +V   +L  + V+ + E + F        E   +  + R++ + H +V VHGAG+T+  F
Sbjct: 323 QVQALLLKTQHVRSSMENVTF--------EHLDIPSTIRYMSNVHILVSVHGAGMTNMFF 374

Query: 295 LRPGSVLMQVVP 306
           + PGS +++++P
Sbjct: 375 MNPGSAVVEIIP 386


>gi|384247851|gb|EIE21336.1| hypothetical protein COCSUDRAFT_43090 [Coccomyxa subellipsoidea
           C-169]
          Length = 436

 Score = 45.4 bits (106), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 52/111 (46%), Gaps = 11/111 (9%)

Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELG------FDVTIFEPEESTSLADSFRFIHSCH 279
           ++ L+ R  +  R ILNL EV +    L          T+   ++        R + +  
Sbjct: 244 QVALIERLTQ--RRILNLEEVTQRCRGLAPADGSALTCTVISFDDIIDYPALLRELQTID 301

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVP--IGTQWLSTVYFEKPARVLGLEY 328
            +VG+HGAGL +S F+RPG+  +++ P   GT+     Y+  P+ V    Y
Sbjct: 302 ILVGMHGAGLVNSYFMRPGTAFVEIFPCDFGTE-HHRYYYWHPSHVEAQTY 351


>gi|156353079|ref|XP_001622904.1| predicted protein [Nematostella vectensis]
 gi|156209537|gb|EDO30804.1| predicted protein [Nematostella vectensis]
          Length = 375

 Score = 45.1 bits (105), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 52/208 (25%), Positives = 85/208 (40%), Gaps = 16/208 (7%)

Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININN 166
           N +H F D F  L+ T  +N  F     I+            ++     FTRHP++ + +
Sbjct: 172 NMYHHFCD-FFNLYATQHVNGSFSTDVNIVLWEAYKRGGLGNFSPTWRVFTRHPLLYLGH 230

Query: 167 QTITH--CFQSVTLGLISHGRMVI----NPTLLPKPKTLVDFQSF---LANAYNENTNTS 217
                  CF+     L+   RMV     N  L P       F++F   L          +
Sbjct: 231 DFAGKRVCFKRAIFSLLP--RMVFGLFYNTPLTPGCSGSGLFKAFSNHLVKRLGIVQERN 288

Query: 218 SSFHHTKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIH 276
            S      ++ L++R  +  R ILN  E+ +A +      + I +        D  +  H
Sbjct: 289 ESDVDAPVRVTLLSRGTKY-RDILNENELVEALSSHPAISLKIAKFSWDVPFLDQIKVTH 347

Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
           +    +G+HGAGLTH+LFL   +VL ++
Sbjct: 348 NTDVFLGMHGAGLTHALFLPDWAVLFEL 375


>gi|443696729|gb|ELT97366.1| hypothetical protein CAPTEDRAFT_185193 [Capitella teleta]
          Length = 562

 Score = 45.1 bits (105), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 32/119 (26%), Positives = 59/119 (49%), Gaps = 5/119 (4%)

Query: 215 NTSSSFHHTKPK-LVLVNRNARVGRTILN----LREVKKAAEELGFDVTIFEPEESTSLA 269
           N  S F+  K   +VL+ R+ + GR   +    LR ++     LG  + ++      SLA
Sbjct: 412 NLRSRFNTNKQDTVVLIRRSKQKGRYFRHHGGILRMLQADVAPLGMKIDVYGDNPLPSLA 471

Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           ++ R  H    ++  HGAG ++ LF +PG++L++ +    +    + F   ++ LGL Y
Sbjct: 472 ETRRVFHQAAVVISPHGAGASNLLFSQPGTLLIEGLHSPFRTRPNLCFLVMSQALGLRY 530


>gi|158300077|ref|XP_320074.4| AGAP009279-PA [Anopheles gambiae str. PEST]
 gi|157013827|gb|EAA14911.4| AGAP009279-PA [Anopheles gambiae str. PEST]
          Length = 523

 Score = 45.1 bits (105), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 51/253 (20%), Positives = 104/253 (41%), Gaps = 19/253 (7%)

Query: 90  CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP-----DQDVILAIADCNDQ 144
           C V    PA +        N +H F D F+ L+ +++ +       D D  + + + +  
Sbjct: 212 CDVVIERPAFIMKIDA-AINMYHHFCD-FLNLYASLHVNLSHAGGFDTDTQVLVWE-SFT 268

Query: 145 WARKYAELLPRFTRHPIININNQT-ITHCFQSVTLGLISHGRMVI----NPTLLPKPKTL 199
           +A  +A+    F++HPI ++        CF++V L L+   RM+     N  ++   +  
Sbjct: 269 YASPFADTFKVFSKHPIADLKTYAGKVVCFKNVVLPLLP--RMIFGLYYNTPIIYGCENS 326

Query: 200 VDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTI 259
             F +F  +  +      ++    + ++  ++R  R  R +     + + A+   + V  
Sbjct: 327 GLFHAFSEHVLHRLKVRMTTRPDERVRITFLSRQTRYRRVLNEDELMGRIAKNPNYAVQR 386

Query: 260 FEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEK 319
                     +  R   +    +G+HGAGLTH LFL   + L ++            +  
Sbjct: 387 VSYGHDLPFVEQLRITRNTDIFIGMHGAGLTHLLFLPKWATLFELY----HCEDPNCYRD 442

Query: 320 PARVLGLEYLEYK 332
            AR+ G+ YL ++
Sbjct: 443 LARLRGVHYLTWE 455


>gi|428210488|ref|YP_007100701.1| capsular polysaccharide biosynthesis protein [Oscillatoria
           acuminata PCC 6304]
 gi|428004938|gb|AFY85468.1| capsular polysaccharide biosynthesis protein [Oscillatoria
           acuminata PCC 6304]
          Length = 1588

 Score = 45.1 bits (105), Expect = 0.067,   Method: Composition-based stats.
 Identities = 29/102 (28%), Positives = 50/102 (49%), Gaps = 6/102 (5%)

Query: 204 SFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE 263
           +FL  A+  +   S   H   P+ + + R+    R ++N  EV +   + GF+    EPE
Sbjct: 615 TFLRQAFLNSVPASDCEH---PERIYIRRSTARNRRLINEEEVIEFLAQFGFEP--IEPE 669

Query: 264 ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
            S S  +          +V  HG+GLT+ +F RPG+  +++V
Sbjct: 670 -SLSFVEQIAVFSQAKIIVAPHGSGLTNLVFCRPGTQAIELV 710


>gi|428214106|ref|YP_007087250.1| hypothetical protein Oscil6304_3772 [Oscillatoria acuminata PCC
           6304]
 gi|428002487|gb|AFY83330.1| tetratricopeptide repeat protein [Oscillatoria acuminata PCC 6304]
          Length = 589

 Score = 45.1 bits (105), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 29/114 (25%), Positives = 54/114 (47%), Gaps = 6/114 (5%)

Query: 215 NTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF 274
           + +++     P  + ++R     R ILN  E        GF   + E   S S  +    
Sbjct: 435 DKAATLRRQTPSRIYISRRHASYRRILNEEETIARLSPYGFVPIVLE---SLSFLEQVAL 491

Query: 275 IHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
             +  A++  HGAGLT++LF  PG+   Q++ I +  + +V +   + ++GL+Y
Sbjct: 492 FANAKAIIAPHGAGLTNTLFCNPGT---QLIEIFSPDMVSVNYWVVSNIIGLDY 542


>gi|407404675|gb|EKF30041.1| hypothetical protein MOQ_006156 [Trypanosoma cruzi marinkellei]
          Length = 590

 Score = 45.1 bits (105), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 57/254 (22%), Positives = 98/254 (38%), Gaps = 58/254 (22%)

Query: 151 ELLPRFTRHPIININNQTITHCFQSVTLGLI--SHGRMVINPTLLPKPKTLVDFQSFLAN 208
           ELL  FT  P++  +  T   CF    LG+   S  +  +   LLP+      F  FL+ 
Sbjct: 307 ELLHYFTDLPVLEASEFTEPTCFTRAILGISASSFSQAALRHWLLPRIALRHRFLQFLSK 366

Query: 209 AYNENTNTSSSFHHT---------------------------------KPKLVLVNRN-- 233
                 +T S  H T                                 +P ++ ++RN  
Sbjct: 367 RNKREVSTVSRAHATTLRCLIPQYDRRSRIIFNQWFEWKLRQWQELPQRPNVLYLSRNHP 426

Query: 234 -ARVGRTILNLREVKKAAEELGFDVT------IFEPEESTSLADSFRFIHSCHAMVGVHG 286
               GR ++N  EV  A E     +T      +F   E  +  D    +   + ++  HG
Sbjct: 427 NITRGRRVVNEEEVIPALEAAVLAMTGGSLRRVFL--EEMAYVDQIATVLETNILIAPHG 484

Query: 287 AGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPAR------VLGLEYLEYKIKQEESSL 340
            G+ + +++ PGSV+++ VP     L  +Y +K  R      VL ++++ +  +Q+ + L
Sbjct: 485 GGIANCVWMPPGSVVVEFVPPAGATLPEMY-DKMCRDAAGGGVLPIQHISFVAEQDPAEL 543

Query: 341 VEKYGANDLVLKNP 354
                  D V  NP
Sbjct: 544 -----EPDFVATNP 552


>gi|311748554|ref|ZP_07722339.1| hypothetical protein ALPR1_19833 [Algoriphagus sp. PR1]
 gi|126577073|gb|EAZ81321.1| hypothetical protein ALPR1_19833 [Algoriphagus sp. PR1]
          Length = 324

 Score = 45.1 bits (105), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 50/213 (23%), Positives = 86/213 (40%), Gaps = 31/213 (14%)

Query: 95  TSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELL- 153
           T P  ++    ++ N+FH   DC   L++ +N+   D+ VIL  +  +  +  +  ELL 
Sbjct: 85  TIPHGIWVKDEWSANYFHWMTDCLPRLWLGLNTGLSDR-VILHDSYRHLPYVSQSLELLG 143

Query: 154 --PRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN 211
             P +                +QS           V N  L P+  +  +F   L     
Sbjct: 144 IQPTY----------------YQS------QENIWVKNLVLTPRTSSFPNFHEDLTKMTR 181

Query: 212 ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADS 271
           E  + S     T  +L+ ++R     R   N  +V+      GF+V      E  SL + 
Sbjct: 182 ERLSVSPK--STPSRLIYISRKYANKRKTHNEIDVELLMIRHGFEVIY---TEKMSLKEQ 236

Query: 272 FRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
              +     +V +HGA LT+ LFL  GS ++++
Sbjct: 237 IDLMSETKILVSLHGAALTNMLFLPEGSKVVEL 269


>gi|224013993|ref|XP_002296660.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968710|gb|EED87055.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 408

 Score = 45.1 bits (105), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 35/140 (25%), Positives = 63/140 (45%), Gaps = 21/140 (15%)

Query: 180 LISHGRMVINP---TLLPKPKTLVDFQSFLANAY----NENTNTSSSFHHTKPKLVLVNR 232
           LI    ++I P    LL      ++F+ + ++ +    ++  + SS      P    ++R
Sbjct: 200 LIEQADVLIEPPTNGLLWDLAWDMNFECYSSDMFKTFVSQFVDKSSKLDPPDPLACWISR 259

Query: 233 NARVGRTILNL-------REVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
             R  R+  NL       REV   A EL F        +  S+ D    ++ C  + G+H
Sbjct: 260 QGRDVRSASNLNDTLGVMREVFTNARELRF-------TKEMSIDDVANALNDCRVIFGIH 312

Query: 286 GAGLTHSLFLRPGSVLMQVV 305
           GAG  ++LF RPG  +++++
Sbjct: 313 GAGHMNALFARPGVAVIEII 332


>gi|82232390|sp|Q5NDE3.1|GTDC2_TETNG RecName: Full=Glycosyltransferase-like domain-containing protein 2;
           AltName: Full=Extracellular O-linked N-acetylglucosamine
           transferase-like; Flags: Precursor
 gi|47213764|emb|CAF95593.1| unnamed protein product [Tetraodon nigroviridis]
 gi|56605356|emb|CAI30875.1| glycosyltransferase [Tetraodon nigroviridis]
          Length = 579

 Score = 44.7 bits (104), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 58/244 (23%), Positives = 101/244 (41%), Gaps = 16/244 (6%)

Query: 109 NFFHEFMDCFVPLFITINSHFP-DQDVILAIADCNDQWARKYAELLPRFTRHPII--NIN 165
           N  H F D  +P F T+      D+D  L   +  D+    +   L    + P++   + 
Sbjct: 162 NLMHVFHDDLLPAFYTMKQFLDLDEDARLVFMEGWDEGPHFHLYRLLS-DKQPLLKEQLR 220

Query: 166 NQTITHCFQSVTLGL----ISHGRMVINPTLLPKPKTLVDFQSF--LANAYNENTNTSSS 219
           N     CF    +GL      +    + P   PK   LV        A    E  N + +
Sbjct: 221 NFGKLMCFTKSYIGLSKMTTWYQYGFVQPQG-PKANILVSGNEIRHFAKVLMEKMNVTRA 279

Query: 220 FHHTKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSC 278
               + + ++V   +   R ILN  E+  A A+E    V     EE  S A   + I + 
Sbjct: 280 EGGQEDEYIVVFSRSST-RLILNQAELVMALAQEFQMRVVTVSLEEQ-SFASIVQVIGAA 337

Query: 279 HAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQE 336
             +V +HGA L  +LFL PG+V++++ P          +   A + G++  Y+ ++  +E
Sbjct: 338 SMLVSMHGAQLITALFLPPGAVVVELFPFAVNPDQYTPYRTLAALPGMDLHYISWRNTEE 397

Query: 337 ESSL 340
           E+++
Sbjct: 398 ENTI 401


>gi|71418840|ref|XP_810985.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70875598|gb|EAN89134.1| hypothetical protein Tc00.1047053506627.90 [Trypanosoma cruzi]
          Length = 591

 Score = 44.7 bits (104), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 56/279 (20%), Positives = 104/279 (37%), Gaps = 51/279 (18%)

Query: 151 ELLPRFTRHPIININNQTITHCFQSVTLGLI--SHGRMVINPTLLPKPKTLVDFQSFLAN 208
           ELL  F   P++  +  T   CF    LG+   S  +  +   LLP+      F  FL  
Sbjct: 308 ELLHYFIDLPVLEASEFTEPTCFTRAILGISANSFSQAALRHWLLPRIALQHRFLQFLRK 367

Query: 209 AYNENTNTSSSFHHT---------------------------------KPKLVLVNRN-- 233
               +  T S  H T                                 +P ++ ++RN  
Sbjct: 368 RNKRDVATVSGVHATTLRCLMPQYDRRSPILFNQWFAWKLRQWQWLPRRPNVLYLSRNHP 427

Query: 234 -ARVGRTILNLREVKKAAEELGFDVT------IFEPEESTSLADSFRFIHSCHAMVGVHG 286
               GR ++N  +V  A E     +T      +F   E  +  D    +   + ++  HG
Sbjct: 428 NITRGRRVVNEEDVIPALEAAVLAMTGGSLRRVFL--EEMAYVDQIAAMLETNILIAPHG 485

Query: 287 AGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE-----KPARVLGLEYLEYKIKQEESSLV 341
            G+ + +++ PGSV+++ VP     L  +Y +         VL ++++ +  +Q+ + L 
Sbjct: 486 GGIANCVWMPPGSVVVEFVPPAGATLPELYHKMCRDAAGGGVLPIQHISFVAEQDPAELE 545

Query: 342 EKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDR 380
             + A +   K  +      W + R  ++     LD+ R
Sbjct: 546 PDFAATNPAWKRNKRLFSNIWVSKRQLIEYFAKALDLYR 584


>gi|302835343|ref|XP_002949233.1| hypothetical protein VOLCADRAFT_104177 [Volvox carteri f.
           nagariensis]
 gi|300265535|gb|EFJ49726.1| hypothetical protein VOLCADRAFT_104177 [Volvox carteri f.
           nagariensis]
          Length = 569

 Score = 44.7 bits (104), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 50/101 (49%), Gaps = 20/101 (19%)

Query: 226 KLVLVNR---NARVGRTILNLREV-KKAAEELGFDVTIFEPEESTSLADSFRFIH----- 276
           K+V ++R    + VGRTILN  E+ ++     G DV   + + +      F F H     
Sbjct: 314 KIVFISRTPHQSAVGRTILNEAEIIQRCNNAEGADVP--QADWNVPYQKYFCFAHVFGDN 371

Query: 277 ---------SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
                        ++G+HGAGLT+S +++PG  +++V P G
Sbjct: 372 QLMDVWLMRQVDVILGMHGAGLTNSFYMKPGGSVIEVRPFG 412


>gi|194041279|ref|XP_001925360.1| PREDICTED: uncharacterized glycosyltransferase AER61-like [Sus
           scrofa]
          Length = 443

 Score = 44.7 bits (104), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 51/97 (52%), Gaps = 6/97 (6%)

Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
           R ILN  E+  A + +  F+V I + + +  +  D  R  H+    +G+HGAGLTH LFL
Sbjct: 293 RKILNQNELVNALKTVSTFEVRIVDYKYKELAFLDQLRITHNTDIFIGMHGAGLTHLLFL 352

Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
              + + ++   G +     Y +  AR+ G+ Y+ ++
Sbjct: 353 PDWAAVFELYNCGDE---RCYLDL-ARLRGVHYITWR 385


>gi|400530660|gb|AFP86481.1| glycosyltransferase, partial [Anguilla rostrata]
          Length = 277

 Score = 44.7 bits (104), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 47/160 (29%), Positives = 73/160 (45%), Gaps = 22/160 (13%)

Query: 194 PKPKTLV------DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
           PK   LV       F SFLA   N           T   +V+ +R+  + R ILN  E+ 
Sbjct: 2   PKANILVSGNEIRQFSSFLAEKLN--VTVGEQTEKTDEYIVVFSRS--LNRLILNEAELI 57

Query: 248 KA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            A A+E    V     EE  S AD  R +     +V +HGA L  SLFL  G+ ++++ P
Sbjct: 58  LALAQEFQMKVVTVSLEEQ-SFADIVRVLSRASMLVSMHGAQLVTSLFLPRGAAVVELYP 116

Query: 307 IGTQWLSTVYFEKPARVL------GLEYLEYKIKQEESSL 340
                ++  ++  P R L       L+Y+ ++  +EE+S+
Sbjct: 117 YA---VNPEHY-APYRTLTSLPGMDLQYVAWRNTKEENSV 152


>gi|242008735|ref|XP_002425156.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212508850|gb|EEB12418.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 508

 Score = 44.7 bits (104), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 57/246 (23%), Positives = 101/246 (41%), Gaps = 52/246 (21%)

Query: 109 NFFHEFMDCFVPLFITINSHFPDQDVILA--------------IADCNDQWARKYAELLP 154
           N  H   D   PLF+T+       DV +                 D    +++    LLP
Sbjct: 200 NIMHVIHDDLFPLFLTLEFLCMKNDVCMQSFKLIFHDNFPTGPFFDLYKIFSKGNPILLP 259

Query: 155 RFTRHPIININNQTITHCFQSVTLGLI------------SHGRMVINPTLLPKPKTLVDF 202
           +   H     NNQ +  C + +  GLI             HG   +N   L     +V F
Sbjct: 260 QLLLHN----NNQIL--CIEEMHAGLILDSIWYQYGFNEPHGP--VNNFFLNN-HDIVRF 310

Query: 203 QSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV--------KKAAEELG 254
            S++    N ++N++ +     P +V+++R     R ILN+ EV        KK   +  
Sbjct: 311 TSYIKTKLNVHSNSTKN-----PDIVIISREK--TRKILNVNEVTEKVKNIMKKLLRKNE 363

Query: 255 FDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLST 314
            +V   +   S +     + + +C  ++G+HGA +  ++F++P S+++++ P   Q    
Sbjct: 364 INVMCIDLLNS-NFTFFIKILSNCDLVIGMHGAEMIFTIFMKPHSLIIELFPFAIQ-SDI 421

Query: 315 VYFEKP 320
           V F KP
Sbjct: 422 VSFIKP 427


>gi|449671770|ref|XP_004207561.1| PREDICTED: glycosyltransferase-like domain-containing protein
           2-like isoform 3 [Hydra magnipapillata]
          Length = 516

 Score = 44.7 bits (104), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 98/236 (41%), Gaps = 24/236 (10%)

Query: 109 NFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ-NQ 167
           N  H F D  + +F T +  +P  +V L + D  D+    Y +L   ++     +IN   
Sbjct: 111 NLMHVFHDDLMTIFYTKSLFYPHSEVNLVMTDRRDE--GPYFDLYKLYSNSIYTSINFTD 168

Query: 168 TITHCFQSVTLGL------ISHGRMV----INPTLLPKPKTLVDFQSFLANAYNENTNTS 217
           T   CFQS  +GL        +G  +    +N T  PK   L     FL           
Sbjct: 169 TDFVCFQSALVGLSKQLTWYQYGFKIPQGPLNNTHYPKNDLLYFKNDFLTRM------NL 222

Query: 218 SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE-ESTSLADSFRFIH 276
           S    TK  L+L   +    R ILN  ++        F + I+    E+ +L +    I 
Sbjct: 223 SDIPDTKCILLL---SRTTSRKILNEAQLLFKLSTF-FRLPIYSVSLETDALNNIISLIL 278

Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
               ++ +HGA L   +F++PG+VL ++ P      +   ++  A ++ + Y+ +K
Sbjct: 279 RASLVISMHGAQLILGIFMKPGAVLAELFPYAVPPENYTPYKTLADLISVRYVAWK 334


>gi|123481170|ref|XP_001323514.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121906380|gb|EAY11291.1| hypothetical protein TVAG_061950 [Trichomonas vaginalis G3]
          Length = 535

 Score = 44.7 bits (104), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 44/188 (23%), Positives = 86/188 (45%), Gaps = 34/188 (18%)

Query: 225 PKLVLVNRNARVGRTILNLREVKK-AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
           P +V++ R     R I N+ +++    +E  +   +    +S ++        S  A++G
Sbjct: 352 PTIVIIERKGS-SRDITNIDDIEDFVKKECDYCKVVRVDLKSLAINAQISLFTSASAIIG 410

Query: 284 VHGAGLTHSLFLRPGS-----VLMQVVPIGTQWLSTVYFEKPARVLGLEYL-----EYKI 333
           +HG+GL + L+++P S      + +V+P    W    ++E  + V  +EY      EY I
Sbjct: 411 LHGSGLANVLWMKPTSPEFPTAMFEVMPYN-YWCRD-WYETASNVADVEYFSIMNSEYSI 468

Query: 334 KQEESSL--VEKYGANDLVLKNPQAFAGANWSNMRV--YLKTQNVKLDIDRFRIYLKDAY 389
            ++E S+   E Y +  +             +N+    +L+ + VKL+ID     LK A+
Sbjct: 469 LKDEDSIRAFECYSSRSMC------------TNIVCHDFLRDKKVKLNID----LLKKAW 512

Query: 390 KKAKKLMD 397
              +K +D
Sbjct: 513 SPFQKKLD 520


>gi|194385256|dbj|BAG65005.1| unnamed protein product [Homo sapiens]
          Length = 251

 Score = 44.3 bits (103), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 42/182 (23%), Positives = 80/182 (43%), Gaps = 10/182 (5%)

Query: 156 FTRHPIININNQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNE 212
           FT + +I++        CF+     L+   R  +  N  L+   +    F++F  +  + 
Sbjct: 17  FTDYDVIHLKTYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHR 76

Query: 213 NTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLAD 270
              T       K ++ ++ R+    R ILN  E+  A + +  F+V I + +       D
Sbjct: 77  LNITQEGPKDGKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLD 135

Query: 271 SFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLE 330
             R  H+    +G+HGAGLTH LFL   + + ++     +     Y +  AR+ G+ Y+ 
Sbjct: 136 QLRITHNTDIFIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLD-LARLRGVHYIT 191

Query: 331 YK 332
           ++
Sbjct: 192 WR 193


>gi|113476439|ref|YP_722500.1| hypothetical protein Tery_2853 [Trichodesmium erythraeum IMS101]
 gi|110167487|gb|ABG52027.1| TPR repeat [Trichodesmium erythraeum IMS101]
          Length = 3172

 Score = 44.3 bits (103), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 53/105 (50%), Gaps = 8/105 (7%)

Query: 203 QSFLANAYNE--NTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIF 260
           Q FL+    E    N +SSF+   P+ + ++RN    R + N  EV     E+GF     
Sbjct: 625 QVFLSRIKQEIFTENKTSSFY---PERIYISRNNSRYRRVFNEEEVLLKLSEIGF--VCI 679

Query: 261 EPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           +P+ S +L +          ++G HG+GLT+ +F   G+ ++++V
Sbjct: 680 QPD-SMNLKEQIAIFSHAKVIIGAHGSGLTNIIFSPRGTKVIELV 723


>gi|448459513|ref|ZP_21596671.1| capsular polysaccharide biosynthesis protein-like protein
           [Halorubrum lipolyticum DSM 21995]
 gi|445808307|gb|EMA58378.1| capsular polysaccharide biosynthesis protein-like protein
           [Halorubrum lipolyticum DSM 21995]
          Length = 414

 Score = 44.3 bits (103), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 39/120 (32%), Positives = 53/120 (44%), Gaps = 12/120 (10%)

Query: 211 NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEE--STSL 268
           NE T+       T P  V V+R     R + N  E++   EE G      EP E  + SL
Sbjct: 270 NEATDGLDLSDDTGPDWVYVSRQNADTRRVANYAEIQPVLEEYGI-----EPVECETLSL 324

Query: 269 ADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
            +  R   S   +VG HGAGLT  ++    S  + VV I    +   Y+   A VLG +Y
Sbjct: 325 DEQIRLFSSVEGIVGPHGAGLTGLVW----SSDLDVVEIFNNVVKGPYYVL-AHVLGHDY 379


>gi|113474031|ref|YP_720092.1| group 1 glycosyl transferase [Trichodesmium erythraeum IMS101]
 gi|110165079|gb|ABG49619.1| glycosyl transferase, group 1 [Trichodesmium erythraeum IMS101]
          Length = 3301

 Score = 44.3 bits (103), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 53/105 (50%), Gaps = 8/105 (7%)

Query: 203 QSFLANAYNE--NTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIF 260
           Q FL+    E    N +SSF+   P+ + ++RN    R + N  EV     E+GF     
Sbjct: 625 QVFLSRIKQEIFTENKTSSFY---PERIYISRNNSRYRRVFNEEEVLLKLSEIGF--VCI 679

Query: 261 EPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           +P+ S +L +          ++G HG+GLT+ +F   G+ ++++V
Sbjct: 680 QPD-SMNLKEQIAIFSHAKVIIGAHGSGLTNIIFSPRGTKVIELV 723


>gi|221129933|ref|XP_002163521.1| PREDICTED: glycosyltransferase-like domain-containing protein
           2-like isoform 1 [Hydra magnipapillata]
 gi|449671768|ref|XP_004207560.1| PREDICTED: glycosyltransferase-like domain-containing protein
           2-like isoform 2 [Hydra magnipapillata]
          Length = 551

 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 98/236 (41%), Gaps = 24/236 (10%)

Query: 109 NFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ-NQ 167
           N  H F D  + +F T +  +P  +V L + D  D+    Y +L   ++     +IN   
Sbjct: 146 NLMHVFHDDLMTIFYTKSLFYPHSEVNLVMTDRRDE--GPYFDLYKLYSNSIYTSINFTD 203

Query: 168 TITHCFQSVTLGL------ISHGRMV----INPTLLPKPKTLVDFQSFLANAYNENTNTS 217
           T   CFQS  +GL        +G  +    +N T  PK   L     FL           
Sbjct: 204 TDFVCFQSALVGLSKQLTWYQYGFKIPQGPLNNTHYPKNDLLYFKNDFLTRM------NL 257

Query: 218 SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE-ESTSLADSFRFIH 276
           S    TK  L+L   +    R ILN  ++        F + I+    E+ +L +    I 
Sbjct: 258 SDIPDTKCILLL---SRTTSRKILNEAQLLFKLSTF-FRLPIYSVSLETDALNNIISLIL 313

Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
               ++ +HGA L   +F++PG+VL ++ P      +   ++  A ++ + Y+ +K
Sbjct: 314 RASLVISMHGAQLILGIFMKPGAVLAELFPYAVPPENYTPYKTLADLISVRYVAWK 369


>gi|291566530|dbj|BAI88802.1| TPR domain protein [Arthrospira platensis NIES-39]
          Length = 1687

 Score = 44.3 bits (103), Expect = 0.11,   Method: Composition-based stats.
 Identities = 41/162 (25%), Positives = 76/162 (46%), Gaps = 13/162 (8%)

Query: 218 SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHS 277
           S      PKL+ ++R++   R +LN   +    ++ GF +TI    E+ S+A+  +    
Sbjct: 631 SKLGQKTPKLIYISRHSAKYRRLLNEEAIAALLQQRGF-ITI--SPETLSVAEQAQVFSQ 687

Query: 278 CHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEE 337
              +V  HG+GLT+ +F +P + ++++  +   ++   YF   +R LGL +  Y I  E 
Sbjct: 688 AEVIVASHGSGLTNLIFCQPETTIIEL--MSPHYIRP-YFWGLSRQLGLHH--YYITGEA 742

Query: 338 SSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDID 379
              ++ Y    L+   P A     W N +  L+  +  L  D
Sbjct: 743 ---LDCYPLRQLMY--PTALTEDIWINPQTLLQLLDSVLTTD 779


>gi|321252819|ref|XP_003192530.1| hypothetical protein CGB_C0550W [Cryptococcus gattii WM276]
 gi|317458999|gb|ADV20743.1| Hypothetical Protein CGB_C0550W [Cryptococcus gattii WM276]
          Length = 462

 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 23/91 (25%), Positives = 50/91 (54%), Gaps = 12/91 (13%)

Query: 224 KPKLVLVNR-------NARVGRTILN-LREVKKAAEELGFDVTIFEPEESTSLADSFRFI 275
           KPK+V V+R       +  V + +L  L++++KA + +  D  + + E+     + F   
Sbjct: 302 KPKIVYVDRQRTERKFDMEVHKQLLKRLKKIEKAKKAVVVDAVLDDLEKK----EQFEMF 357

Query: 276 HSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
                ++G+HG GL+H L++  G ++++++P
Sbjct: 358 SDADIILGIHGNGLSHELWMPEGGIIIEILP 388


>gi|400530662|gb|AFP86482.1| glycosyltransferase, partial [Conger oceanicus]
          Length = 277

 Score = 44.3 bits (103), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 46/160 (28%), Positives = 73/160 (45%), Gaps = 22/160 (13%)

Query: 194 PKPKTLV------DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
           PK   LV       F +FLA   N               +V+ +R+  + R ILN  E+ 
Sbjct: 2   PKANILVSGNEIRQFSAFLAEKLN--VTVGEPIEKMDEYIVVFSRS--LNRLILNEAELI 57

Query: 248 KA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            A A+E    V     EE  S AD  R +     +V +HGA L  SLFL  G+ ++++ P
Sbjct: 58  LALAQEFQMKVVTISLEEQ-SFADIIRALSRASMLVSMHGAQLVCSLFLPRGAAVVELFP 116

Query: 307 IGTQWLSTVYFEKPARVL------GLEYLEYKIKQEESSL 340
            G   ++  ++  P R L       L+Y+ ++  +EE+S+
Sbjct: 117 YG---VNPEHY-APYRTLTSLPGMDLQYVAWRNTKEENSV 152


>gi|400530668|gb|AFP86485.1| glycosyltransferase, partial [Myrichthys maculosus]
          Length = 277

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 44/156 (28%), Positives = 73/156 (46%), Gaps = 14/156 (8%)

Query: 194 PKPKTLV------DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
           PK   LV       F +FLA   N +    +        +V+ +R+  + R ILN  E+ 
Sbjct: 2   PKANILVSGNEIRQFSAFLAEKLNVSVGEPAEM--IDEYIVVFSRS--LNRLILNEAELI 57

Query: 248 KA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            A A+E    V     EE  S AD  R I     +V +HGA L  +LFL  G+ ++++ P
Sbjct: 58  LALAQEYQMKVVTVSLEEQ-SFADIVRVISRASMLVSMHGAQLVTALFLPRGAAVVELFP 116

Query: 307 IGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
            G        ++  A + G++  Y+ ++  +EE+S+
Sbjct: 117 YGVNPEHYAPYKTLASLPGMDLHYVAWRNTKEENSV 152


>gi|123432397|ref|XP_001308412.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121890091|gb|EAX95482.1| hypothetical protein TVAG_222780 [Trichomonas vaginalis G3]
          Length = 295

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 33/165 (20%), Positives = 80/165 (48%), Gaps = 16/165 (9%)

Query: 171 HCFQSVTLGL--ISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLV 228
           HCF+S + G+  I+ G          KP   ++ ++ L +  +           +  K V
Sbjct: 85  HCFESFSFGIDKIAIGSESYYYNYDIKPNQTINLRNILLSQLD--------IKKSNEKKV 136

Query: 229 LVNRNARVGRTILNLREVKKAAEEL--GFDVTIFEPEESTSLADSFRFIHSCHAMVGVHG 286
           ++++N   G  I N+ +V    ++L   ++++I E  + + + +  R +     ++G   
Sbjct: 137 IIHQNTGTG-VIRNINDVVDVIKKLFPDYNISIIELNKLSKI-EQIREVSQADVLIGPTS 194

Query: 287 AGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY 331
             L+  +F++P S+L+++ P     L+  +++  A+  GL++++Y
Sbjct: 195 PALSSLVFMKPSSILIEINPYKYSCLN--WYQTAAKGAGLKFIKY 237


>gi|334120934|ref|ZP_08495010.1| hypothetical protein MicvaDRAFT_2953 [Microcoleus vaginatus FGP-2]
 gi|333455653|gb|EGK84296.1| hypothetical protein MicvaDRAFT_2953 [Microcoleus vaginatus FGP-2]
          Length = 375

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 50/103 (48%), Gaps = 9/103 (8%)

Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           KL L     R  +  + +REV K      FD  I E + S ++ +  R       +VG H
Sbjct: 246 KLYLSRSGRRKVKNEVQVREVLKE-----FDFEILE-DISRTVDEQIRLFAEAAVVVGPH 299

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           GAG T+ L+ +PG+ +++    G    +  YF    ++LGLEY
Sbjct: 300 GAGFTNLLWCQPGTKVLEFFYGG---YTPPYFYYICQLLGLEY 339


>gi|242806215|ref|XP_002484699.1| DUF563 domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218715324|gb|EED14746.1| DUF563 domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 459

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 40/81 (49%), Gaps = 6/81 (7%)

Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE 327
           L +  + I     +VGVHGAGLTH L+L   S +++++P G Q      F   A  LG +
Sbjct: 347 LQEQVQIIRGTDVLVGVHGAGLTHGLWLPQRSAMVEILPEGFQHKG---FRNLAGALGHD 403

Query: 328 YLEYKIKQEESSLVEKYGAND 348
           Y        ++S     GAND
Sbjct: 404 YFSTHASTLQTS---SRGAND 421


>gi|443704319|gb|ELU01420.1| hypothetical protein CAPTEDRAFT_225629 [Capitella teleta]
          Length = 515

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 40/141 (28%), Positives = 65/141 (46%), Gaps = 23/141 (16%)

Query: 233 NARVGRTILNLREV----KKAAEEL---GFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           +  V R I N RE+    +   ++L   G  +  FE      + D  R I S   +VG+H
Sbjct: 379 DGHVVRKIHNEREILNHLQSKYQQLRVAGVQIDKFE------MVDQLRQIASTDILVGMH 432

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLS-TVYFEKPARVLGLEYLEYKIKQEESSLVEKY 344
           GAGLTH+LFL   + L+++ P+   W S   +F+  A    L Y  ++ + +       +
Sbjct: 433 GAGLTHALFLPRHAALVELFPL--YWSSINAHFKSIAAWRNLTYTSWENRDQSLEKANHF 490

Query: 345 -----GANDLVLKNPQAFAGA 360
                G  D ++KN  A+ G 
Sbjct: 491 THVPVGIADTLIKN--AYKGV 509


>gi|404328964|ref|ZP_10969412.1| Capsular polysaccharide biosynthesis protein-like protein
           [Sporolactobacillus vineae DSM 21990 = SL153]
          Length = 354

 Score = 43.9 bits (102), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 39/152 (25%), Positives = 74/152 (48%), Gaps = 16/152 (10%)

Query: 205 FLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEE 264
           FL +++ +++N+         K + ++R  +  R I N   + +   + GF+    E   
Sbjct: 208 FLRDSFLKDSNSIGR------KRIYISR--KWSRKITNEDLLMEIIYKYGFEKVELE--- 256

Query: 265 STSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVL 324
           S S+ +      S  A++GVHGA LT+  F RPG+ ++++           +F   + + 
Sbjct: 257 SLSVEEQVHLFSSAEAIIGVHGAALTNLTFCRPGTKVLEIFAPN---YIIAHFYGISSMG 313

Query: 325 GLEYLEYKIKQEESSLVEKY-GANDLVLKNPQ 355
            L+Y  Y I ++ S L + + GA D+VL  P+
Sbjct: 314 DLDYY-YYIGEKSSRLSKGWPGAEDIVLNIPR 344


>gi|405962463|gb|EKC28135.1| Beta-(1,2)-xylosyltransferase [Crassostrea gigas]
          Length = 504

 Score = 43.9 bits (102), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 31/116 (26%), Positives = 57/116 (49%), Gaps = 4/116 (3%)

Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGL 289
           N    V R + N +E++ +  +L  +  I   + E+ S+ D    I +   ++G+HGAGL
Sbjct: 364 NPKGTVSRKVKNEKEIEDSVAKLYPNHLIESFQTENMSMTDQLSAIVNTDILIGMHGAGL 423

Query: 290 THSLFLRPGSVLMQVVPIGTQWLS-TVYFEKPARVLGLEYLEYKIKQEESSLVEKY 344
           T +LFL   + L+++ P    W S   +F   AR   L Y+ ++  + ++     Y
Sbjct: 424 TLALFLPKHAGLIEIYP--KYWSSDNAHFRAIARWRNLHYIHWQNVESKNEFTNHY 477


>gi|400530658|gb|AFP86480.1| glycosyltransferase, partial [Elops saurus]
          Length = 278

 Score = 43.9 bits (102), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 45/156 (28%), Positives = 70/156 (44%), Gaps = 13/156 (8%)

Query: 194 PKPKTLV------DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
           PK   LV       F +FLA   N      +        +V+ +R+  + R ILN  E+ 
Sbjct: 2   PKANILVSGNEIRQFAAFLAEKLNVTVEERAE-EQDDAYIVVFSRS--INRLILNEAELI 58

Query: 248 KA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            A A+E    V     EE  SLAD  R +     +V +HGA L  SLFL  G+ ++++ P
Sbjct: 59  LALAQEFQMRVVTVSLEEH-SLADIVRVVGGASMLVSMHGAQLITSLFLPRGAAVVELFP 117

Query: 307 IGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
                     +   A + G  L+Y+ ++   EE+S+
Sbjct: 118 YAVNPEHYAPYRTLASLPGMDLQYVVWRNTMEENSV 153


>gi|409076566|gb|EKM76937.1| hypothetical protein AGABI1DRAFT_78054 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 530

 Score = 43.9 bits (102), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 52/105 (49%), Gaps = 11/105 (10%)

Query: 211 NENTNTSSSFHHTKPKLVLVNRNARVGRTI-----LNLREVKKAAEELGF-DVTIFEPEE 264
           N    + S    +KP ++ +NR  + GR +      +L E  +A EE G  ++ I   EE
Sbjct: 350 NRTVISPSHVKSSKPVVIYLNRQ-KTGRRLDEESHESLVEALEALEEGGVCELQIVRMEE 408

Query: 265 STSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP---GSVLMQVVP 306
              L +  R +     MVGVHG GLTH L++ P    SV+  V+P
Sbjct: 409 -IGLREQIRLVSQSTIMVGVHGNGLTHQLWMPPSKRSSVIEIVMP 452


>gi|389631144|ref|XP_003713225.1| hypothetical protein MGG_07979 [Magnaporthe oryzae 70-15]
 gi|351645557|gb|EHA53418.1| hypothetical protein MGG_07979 [Magnaporthe oryzae 70-15]
          Length = 517

 Score = 43.5 bits (101), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 34/119 (28%), Positives = 56/119 (47%), Gaps = 13/119 (10%)

Query: 222 HTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEEST-SLADSFRFI-HSCH 279
             K ++  ++R  R  R +L    +  AA     DV +   + +T S  +  R + H   
Sbjct: 377 QKKLRVTFIDR--RGSRKLLGQERLLDAARRAYPDVQVRSVDFATLSFVEQIRLVRHETD 434

Query: 280 AMVGVHGAGLTHSLFLRP------GSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
            +VGVHGAGLTH +FLR       G  +++++P     ++   F   A +LG +Y   K
Sbjct: 435 VLVGVHGAGLTHVMFLRAASEAVGGGAIVEILP---DVMNYKGFRNLAYMLGHDYFSVK 490


>gi|78183683|ref|YP_376117.1| capsular polysaccharide biosynthesis protein-like [Synechococcus
           sp. CC9902]
 gi|78167977|gb|ABB25074.1| Capsular polysaccharide biosynthesis protein-like [Synechococcus
           sp. CC9902]
          Length = 351

 Score = 43.5 bits (101), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 41/77 (53%), Gaps = 3/77 (3%)

Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
            KL +   NA+  RTI N+ EVK    + G+  ++ +P E  SL +          +VGV
Sbjct: 235 EKLFIKRGNAKNNRTIDNIEEVKLLCIDKGY--SVIDPGE-MSLLEQIEVFRKSTKVVGV 291

Query: 285 HGAGLTHSLFLRPGSVL 301
           HG+   + L+++ GSV+
Sbjct: 292 HGSAFVNMLYMKSGSVI 308


>gi|220910531|ref|YP_002485842.1| Capsular polysaccharide biosynthesis protein-like protein
           [Cyanothece sp. PCC 7425]
 gi|219867142|gb|ACL47481.1| Capsular polysaccharide biosynthesis protein-like protein
           [Cyanothece sp. PCC 7425]
          Length = 423

 Score = 43.5 bits (101), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 22/81 (27%), Positives = 44/81 (54%), Gaps = 3/81 (3%)

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
           T+ +L+ ++R     R +LN  E+ +  E  GF+  + E     S+ +  +   +   +V
Sbjct: 275 TQRRLIYISRARATHRRVLNEAELLQFLEGFGFEPVVLE---QLSVVEQAQLFSAADIIV 331

Query: 283 GVHGAGLTHSLFLRPGSVLMQ 303
             HGAGLT+ +F +PG+ +++
Sbjct: 332 APHGAGLTNLVFCQPGTKVIE 352


>gi|242057407|ref|XP_002457849.1| hypothetical protein SORBIDRAFT_03g015771 [Sorghum bicolor]
 gi|241929824|gb|EES02969.1| hypothetical protein SORBIDRAFT_03g015771 [Sorghum bicolor]
          Length = 49

 Score = 43.5 bits (101), Expect = 0.18,   Method: Composition-based stats.
 Identities = 19/43 (44%), Positives = 28/43 (65%)

Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
           LA  +R +++   MVGVH   +TH LF+RPGS+ +Q + I  Q
Sbjct: 4   LAKMYRALNASDIMVGVHDMAMTHFLFMRPGSLFIQALGIWHQ 46


>gi|356558433|ref|XP_003547511.1| PREDICTED: V-type proton ATPase 116 kDa subunit a isoform 1-like
           [Glycine max]
          Length = 822

 Score = 43.5 bits (101), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 62/148 (41%), Gaps = 9/148 (6%)

Query: 58  IKTQ-PYPRKSDKSAMSKVKELTITTSAPPNLSC--GVTHTSPALVFSAGGYNGNFFHEF 114
           IK Q PYP   D S      EL+   S    +S   GV H +  ++ S   +N +FF   
Sbjct: 520 IKYQDPYPFGVDPSWRGSRSELSFLNSLKMKMSILFGVAHMNLGIILSY--FNAHFFQNS 577

Query: 115 MDC---FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITH 171
           +D    FVP  I +NS F    V++ I  C    A  Y  ++  F   P  N+    +  
Sbjct: 578 LDIRYQFVPQMIFLNSLFGYLSVLIVIKWCTGSQADLYHVMIYMFLS-PTDNLGENQLFW 636

Query: 172 CFQSVTLGLISHGRMVINPTLLPKPKTL 199
             + + + L+    + +   L PKP  L
Sbjct: 637 GQRPLQIVLLLLAVIAVPWMLFPKPFIL 664


>gi|443728436|gb|ELU14792.1| hypothetical protein CAPTEDRAFT_132480 [Capitella teleta]
          Length = 548

 Score = 43.5 bits (101), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 62/260 (23%), Positives = 106/260 (40%), Gaps = 26/260 (10%)

Query: 108 GNFFHEFMDCFVPLFITI------NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPI 161
            N  H F D  +P+F T+      +S     DV +   D  D  +  +   L   +R PI
Sbjct: 137 SNIMHAFHDDLLPMFHTLRLITLHDSTTMPFDVQIVFMDNADHGSFSHLYGLYSSSR-PI 195

Query: 162 I--NINNQTITHCFQSVTLGL------ISHGRMVIN---PTLLPKPKTLVDFQSFLANAY 210
              +I +   T CF+   +GL        +G        P +      +  F +F  +  
Sbjct: 196 TKRDIQDAGETICFREAHVGLDKSMTWYQYGFFEPQGPLPDIHVTSSHIAHFANFTRHRL 255

Query: 211 NENTNTSSSFHHTKPKLVLVNRNARVGRTILN-LREVKKAAEELGFDVTIFEPEESTSLA 269
           N     S S   TK  ++L  ++ R+   ILN L      A++    V +   E  T+ A
Sbjct: 256 NLTNQASPS---TKIAVILSRKHNRL---ILNELSLSSNLAQQFNLKVVLVSLETHTA-A 308

Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYL 329
           +    I     +VG+HG+    SLFL PGSVL+++ P G        ++  A +  + Y 
Sbjct: 309 EIIETIGQADLLVGMHGSLFIMSLFLPPGSVLLELFPYGVNPKHYTPYKTLANIQRITYH 368

Query: 330 EYKIKQEESSLVEKYGANDL 349
            ++   ++ ++   Y   +L
Sbjct: 369 SWRNLNKDHAVPHPYRPREL 388


>gi|409991065|ref|ZP_11274360.1| hypothetical protein APPUASWS_08635, partial [Arthrospira platensis
           str. Paraca]
 gi|409938067|gb|EKN79436.1| hypothetical protein APPUASWS_08635, partial [Arthrospira platensis
           str. Paraca]
          Length = 965

 Score = 43.5 bits (101), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 29/111 (26%), Positives = 58/111 (52%), Gaps = 6/111 (5%)

Query: 218 SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHS 277
           S      PKL+ ++R++   R +LN   +    ++ GF +TI    E+ S+A+  +    
Sbjct: 629 SKLGQKTPKLIYISRHSAKYRRLLNEEAIAALLQQRGF-ITI--SPETLSVAEQAQVFSQ 685

Query: 278 CHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
              +V  HG+GLT+ +F +P + ++++  +   ++   YF   +R LGL +
Sbjct: 686 AEVIVASHGSGLTNLIFCQPETTIIEL--MSPHYIRP-YFWGLSRQLGLHH 733


>gi|400530670|gb|AFP86486.1| glycosyltransferase, partial [Saccopharynx ampullaceus]
          Length = 277

 Score = 43.5 bits (101), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 45/156 (28%), Positives = 68/156 (43%), Gaps = 14/156 (8%)

Query: 194 PKPKTLV------DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
           PK   LV       F SFLA   N           T   +V+ +R+  + R ILN  E+ 
Sbjct: 2   PKANILVSGNEIRQFSSFLAEKLN--VTVGEPVEKTNEYIVVFSRS--LNRLILNEAELI 57

Query: 248 KA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            A A+E    V     EE  S  D  R +     +V +HGA L  SLFL  G+ ++++ P
Sbjct: 58  LALAQEFQMKVVTVSLEEQ-SFTDIVRVLGRASMLVSMHGAQLVTSLFLPRGAAVVELYP 116

Query: 307 IGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
                     +   A + G  L+Y+ ++   EE+S+
Sbjct: 117 YAVNPEHYAPYRTLASLPGMDLQYVAWRNTMEENSV 152


>gi|428205792|ref|YP_007090145.1| capsular polysaccharide biosynthesis protein-like protein
           [Chroococcidiopsis thermalis PCC 7203]
 gi|428007713|gb|AFY86276.1| capsular polysaccharide biosynthesis protein-like protein
           [Chroococcidiopsis thermalis PCC 7203]
          Length = 423

 Score = 43.5 bits (101), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 47/101 (46%), Gaps = 8/101 (7%)

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           + + R  + GRTI N  EV     +LGF   +    E  S AD  R       +V  HGA
Sbjct: 272 IYIVRTKKTGRTITNEEEVLSILTQLGF---VGYTLEELSFADQVRLFSQAEMVVAAHGA 328

Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           GL + +F       ++V+ +   + +  YF   +++LG +Y
Sbjct: 329 GLANIMFAEN----LKVIELFGSYGTAAYF-VLSKMLGFDY 364


>gi|119485065|ref|ZP_01619450.1| TPR repeat protein [Lyngbya sp. PCC 8106]
 gi|119457293|gb|EAW38418.1| TPR repeat protein [Lyngbya sp. PCC 8106]
          Length = 815

 Score = 43.5 bits (101), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 22/83 (26%), Positives = 43/83 (51%), Gaps = 3/83 (3%)

Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           K + ++RN    R  +N + V +  E+ GF     E   + S+ +   ++++   +V  H
Sbjct: 680 KRIYIDRNKAGYRKTINNQAVVQLLEKYGFQRIYLE---TLSIVEQAHYLNAAEVVVAPH 736

Query: 286 GAGLTHSLFLRPGSVLMQVVPIG 308
           GAGLT+  F  PG+ ++++   G
Sbjct: 737 GAGLTNLTFCNPGTKVIEIFSPG 759


>gi|374329929|ref|YP_005080113.1| Capsular polysaccharide biosynthesis protein-like protein
           [Pseudovibrio sp. FO-BEG1]
 gi|359342717|gb|AEV36091.1| Capsular polysaccharide biosynthesis protein-like protein
           [Pseudovibrio sp. FO-BEG1]
          Length = 399

 Score = 43.5 bits (101), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 51/111 (45%), Gaps = 8/111 (7%)

Query: 220 FHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCH 279
            H T  + + + RN    R ILN   + +  E  GF   +  P + T   +  +   S  
Sbjct: 263 MHKTPTQKLYIARNDAKTRRILNETTLIEQLEARGFQSVV--PGKLTH-REQVKLFSSAK 319

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPI-----GTQWLSTVYFEKPARVLG 325
            +VG HGAGLT+ LF + G  L+++ P         WLS V   + A V+G
Sbjct: 320 IIVGTHGAGLTNLLFTQAGGKLVEIFPADYIQSAYAWLSHVRGLEYAPVIG 370


>gi|402813093|ref|ZP_10862688.1| capsular polysaccharide biosynthesis-like protein [Paenibacillus
           alvei DSM 29]
 gi|402509036|gb|EJW19556.1| capsular polysaccharide biosynthesis-like protein [Paenibacillus
           alvei DSM 29]
          Length = 290

 Score = 43.1 bits (100), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 46/103 (44%), Gaps = 6/103 (5%)

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           + V+R     R ++N  EV +   E GF   +  P    S A+      S   +V   G+
Sbjct: 160 IYVSRQDAFARHVINEEEVMQVLAEKGFVRIVLTP---LSTAEKIAIYSSAQVIVSPFGS 216

Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLE 330
           G  +++F  PGS L+++ PI        YF K +   G+ Y E
Sbjct: 217 GSINTVFCNPGSTLIELTPITVM---DGYFWKISNHAGMNYYE 256


>gi|427717364|ref|YP_007065358.1| hypothetical protein Cal7507_2081 [Calothrix sp. PCC 7507]
 gi|427349800|gb|AFY32524.1| hypothetical protein Cal7507_2081 [Calothrix sp. PCC 7507]
          Length = 394

 Score = 43.1 bits (100), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 65/245 (26%), Positives = 101/245 (41%), Gaps = 38/245 (15%)

Query: 95  TSPALVFS---AGGYNGNFFHEFMDCFVPL-FITINSHFPDQDVILAIADCNDQWARKYA 150
           T  A+VFS    G    N+FH   DC   L  + ++  + D D+ L  +        KY+
Sbjct: 137 TFDAVVFSLLTGGAGQYNYFHWLFDCLPRLHLLKLSGLYDDIDLFLVPS-------LKYS 189

Query: 151 ELLPRFTRHPI--ININ-NQTITHCFQ-SVTLGLISHGRMVINPTLLPKPKTLVDF--QS 204
                F R  +  +NI+  +TI   +  ++T+  +      I  +  P P  + DF  QS
Sbjct: 190 -----FQRETLSYLNIDLARTIDSIYNPNITVKKLVVSDHPIEASSEPIPAWICDFLRQS 244

Query: 205 FLANAYNENTNTSSSFHHTKPKLVLVNR-NARVGRTILNLREVKKAAEELGFDVTIFEPE 263
           FL  A              K   + +NR +A   R ILN  E+ +A  +LGF   I    
Sbjct: 245 FLHKAI---------IPQLKSNKIYINRKDASNNRKILNEDELVEALTKLGF---ISISA 292

Query: 264 ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARV 323
              SL +      +   +V  HGAGL + +F +  +   +VV I         +E  +  
Sbjct: 293 TDYSLCEQIGIFANAEVIVAAHGAGLGNLVFCQAET---KVVEIFNYEFQPKMYEHLSHR 349

Query: 324 LGLEY 328
           LGL Y
Sbjct: 350 LGLPY 354


>gi|443702643|gb|ELU00569.1| hypothetical protein CAPTEDRAFT_211593 [Capitella teleta]
          Length = 481

 Score = 43.1 bits (100), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 62/281 (22%), Positives = 108/281 (38%), Gaps = 49/281 (17%)

Query: 108 GNFFHEFMDCFVPLFITINS-------------HFPDQDVILAIADCNDQWARKYAELLP 154
           GNF H + D FV L+  +N              +    D+  +  DC+D    KY + LP
Sbjct: 201 GNFHHFWNDEFVMLYSLMNISNRLRPEARNHVLYRSPTDLSPSEVDCDD--IHKYQDFLP 258

Query: 155 RF-TRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNEN 213
                HP           CF++   G+ +          L     LV  Q  L+ A    
Sbjct: 259 YINVVHPHEVFYKMPSKTCFRNGVFGMAN----------LNYDSKLV-VQHVLSKA---- 303

Query: 214 TNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
             + S   H K  L++  R     R ++N  E+ +AA + GF  ++       S+ +  R
Sbjct: 304 GISPSECAHLKRVLIVQRRQ----RRLMNANELLQAAIDEGFPDSVVIDFAEYSVEEQIR 359

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE------ 327
                  M G+ GAGL  ++F+  GS L+++      W    YF       G++      
Sbjct: 360 LSACSTIMAGIQGAGLQWAIFMPVGSTLIEIAWPQKHW--GFYFNSFVTGYGIDHQTLAT 417

Query: 328 ---YLEYKIKQE---ESSLVEKYGANDLVLKNPQAFAGANW 362
              ++ +++ QE      +V +    D++  +P++     W
Sbjct: 418 DRVHVNWRVYQERVRRGQVVPEEERLDMLHSSPKSIIDNVW 458


>gi|426194527|gb|EKV44458.1| hypothetical protein AGABI2DRAFT_208690 [Agaricus bisporus var.
           bisporus H97]
          Length = 538

 Score = 43.1 bits (100), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 52/105 (49%), Gaps = 11/105 (10%)

Query: 211 NENTNTSSSFHHTKPKLVLVNRNARVGRTI-----LNLREVKKAAEELGF-DVTIFEPEE 264
           N    + S    +KP ++ +NR  + GR +      +L E  +A EE G  ++ I   EE
Sbjct: 359 NRTVISPSHVKSSKPVVIYLNRQ-KTGRRLDEESHESLVEALEALEEGGVCELQIVRMEE 417

Query: 265 STSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP---GSVLMQVVP 306
              L +  R +     MVGVHG GLTH L++ P    SV+  V+P
Sbjct: 418 -IGLREQIRLVSRSTIMVGVHGNGLTHQLWMPPSKRSSVIEIVMP 461


>gi|348575416|ref|XP_003473485.1| PREDICTED: uncharacterized glycosyltransferase AER61-like isoform 2
           [Cavia porcellus]
          Length = 443

 Score = 43.1 bits (100), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 57/111 (51%), Gaps = 7/111 (6%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAM 281
           K ++ ++ R+    R ILN  E+  A + +  F+V I + + +     D  R  H+    
Sbjct: 280 KIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDIF 338

Query: 282 VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
           +G+HGAGLTH LFL   +V+ ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 339 IGMHGAGLTHLLFLPDWAVVFELYNCEDE---HCYLDL-ARLRGIHYITWR 385


>gi|393243584|gb|EJD51098.1| hypothetical protein AURDEDRAFT_143041 [Auricularia delicata
           TFB-10046 SS5]
          Length = 469

 Score = 43.1 bits (100), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 11/91 (12%)

Query: 246 VKKAAEELGFDVTIFEPEEST--SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
           ++K A+  G ++ I +P ++   S  +    I     ++G   A L  ++FLRPG+ LM+
Sbjct: 336 LEKLAQATGAELHILDPTDTRTRSFKNRAELIARSRILIGSPSAALDDAVFLRPGATLME 395

Query: 304 VVPIGTQWLSTVYFEK---PARVLGLEYLEY 331
             P G      V       PAR++G+EY+ +
Sbjct: 396 FFPPG------VMLHDHGLPARMMGVEYIAW 420


>gi|374601970|ref|ZP_09674966.1| Capsular polysaccharide biosynthesis protein-like protein
           [Paenibacillus dendritiformis C454]
 gi|374392412|gb|EHQ63738.1| Capsular polysaccharide biosynthesis protein-like protein
           [Paenibacillus dendritiformis C454]
          Length = 361

 Score = 43.1 bits (100), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 35/126 (27%), Positives = 57/126 (45%), Gaps = 11/126 (8%)

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           + V+R     R ++N  EV +   E GF   +  P    S+AD      S   +V   G+
Sbjct: 231 IYVSRQDAHVRHVINEEEVMQVLAEKGFVRIVLTP---LSMADKIAIYSSAQVIVSPFGS 287

Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY--KIKQEESSLVEKYG 345
           G  ++ F  PGS L+++ P+        YF K +   G+ Y E    I+Q    L    G
Sbjct: 288 GSINTAFCNPGSTLIELTPVTVM---DGYFWKISNHAGMNYYEVVCDIEQPPKPLG---G 341

Query: 346 ANDLVL 351
           A++L++
Sbjct: 342 ADNLIV 347


>gi|291393985|ref|XP_002713347.1| PREDICTED: AER61 glycosyltransferase isoform 2 [Oryctolagus
           cuniculus]
          Length = 440

 Score = 43.1 bits (100), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 49/97 (50%), Gaps = 6/97 (6%)

Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
           R ILN  E+  A + +  FDV I + + +     D  R  H+    +G+HGAGLTH LFL
Sbjct: 290 RKILNQNELVNALKTVSTFDVQIVDYKYKELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 349

Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
              + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 350 PDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 382


>gi|242007553|ref|XP_002424604.1| glycosyltransferase, putative [Pediculus humanus corporis]
 gi|212508047|gb|EEB11866.1| glycosyltransferase, putative [Pediculus humanus corporis]
          Length = 509

 Score = 43.1 bits (100), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 58/260 (22%), Positives = 115/260 (44%), Gaps = 25/260 (9%)

Query: 90  CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF--ITINSHFP---DQDVILAIADCNDQ 144
           C +    P  +        N +H F D F  L+  + +N   P     D+ + I +    
Sbjct: 204 CDIIIMKPTFIMKIDA-TVNMYHHFCD-FFNLYASLHVNMSHPLTFSTDINIIIWE-TFP 260

Query: 145 WARKYAELLPRFTRHPIININNQT-ITHCFQSVTLGLISHGRMVI----NPTLLPKPKTL 199
           +   + E+   F+ +PI+ + N    T CF++V   L+   RM+     N  L+   K  
Sbjct: 261 YHSNFDEMWRVFSNNPILTLRNFIGKTVCFKNVVFPLLP--RMIFGLYYNTPLISGCKKS 318

Query: 200 VDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE-VKKAAEELGFDV- 257
             F++F     +          +++ K+ L++R     R ILN ++ +   ++   ++V 
Sbjct: 319 GLFKAFSEFVLHRLKIKEHERENSQIKITLLSRETSF-RNILNEKDLINSLSQNKSYNVK 377

Query: 258 -TIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVY 316
            T+F   ++   +     I +   ++G+HGAGLTH LFL   + + ++     +     Y
Sbjct: 378 KTVFN--KNMRFSSQLEIIRNTDILIGMHGAGLTHLLFLPDWAGVFELYNCEDE---NCY 432

Query: 317 FEKPARVLGLEYLEYKIKQE 336
            +  AR+ G+EY+ ++ K +
Sbjct: 433 MD-LARLRGVEYITWEDKNK 451


>gi|392965786|ref|ZP_10331205.1| hypothetical protein BN8_02330 [Fibrisoma limi BUZ 3]
 gi|387844850|emb|CCH53251.1| hypothetical protein BN8_02330 [Fibrisoma limi BUZ 3]
          Length = 395

 Score = 42.7 bits (99), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 59/121 (48%), Gaps = 12/121 (9%)

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           + ++R  R  R I N  E+  A + LGF +   +P    S+A+  R   S   ++G HGA
Sbjct: 263 IYISRAGR--RRIRNEAELIVALKSLGFQIIEDKPR---SIAEQVRLYKSASFIIGPHGA 317

Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
             T+ ++  PG+ L ++    +   +  YF   A+V+ L Y  Y   + +    +++G +
Sbjct: 318 SFTNIIWCEPGTYLFELF---SPTYTPDYFLYLAQVMNLRYTAYSHGEAD----DRFGIS 370

Query: 348 D 348
           D
Sbjct: 371 D 371


>gi|428320698|ref|YP_007118580.1| hypothetical protein Osc7112_5970 [Oscillatoria nigro-viridis PCC
           7112]
 gi|428244378|gb|AFZ10164.1| hypothetical protein Osc7112_5970 [Oscillatoria nigro-viridis PCC
           7112]
          Length = 376

 Score = 42.7 bits (99), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 50/103 (48%), Gaps = 9/103 (8%)

Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           KL L     R  +  + +REV K      FD  I E + S ++ +  R       +VG H
Sbjct: 247 KLYLSRSGRRKVKNEVQVREVLKE-----FDFEILE-DISRTVDEQIRLFAEAAVVVGPH 300

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           GAG T+ L+ +PG+ +++    G    +  YF    ++LGLEY
Sbjct: 301 GAGFTNLLWSQPGTKVLEFFYGG---YTPPYFYYICQLLGLEY 340


>gi|443705176|gb|ELU01832.1| hypothetical protein CAPTEDRAFT_196589 [Capitella teleta]
          Length = 478

 Score = 42.7 bits (99), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 31/134 (23%), Positives = 67/134 (50%), Gaps = 15/134 (11%)

Query: 224 KPKLVLVNRNA----RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCH 279
           + K+VL+ R+     R  ++I  + E +  A E G ++ IF  +   S+  + R  +   
Sbjct: 338 RNKIVLIKRSRKRFFRQHKSIAAMLEAQ--ASEHGLELFIFRDDPVPSINLTRRMFNEAI 395

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
            ++  HGAG ++ ++ +PG+V+++ +   ++    + ++  A++LG+ Y           
Sbjct: 396 MIIAPHGAGESNMIYAQPGTVIIEGMCFESRIKVNMAYKLTAQLLGMRYY---------G 446

Query: 340 LVEKYGANDLVLKN 353
           L+ KYG  D+  K 
Sbjct: 447 LLFKYGCFDITAKQ 460


>gi|297793023|ref|XP_002864396.1| AtXylT [Arabidopsis lyrata subsp. lyrata]
 gi|297310231|gb|EFH40655.1| AtXylT [Arabidopsis lyrata subsp. lyrata]
          Length = 534

 Score = 42.7 bits (99), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 42/80 (52%), Gaps = 6/80 (7%)

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
           S+ D  R I     ++G HGAGLTH +   P + + ++V +  Q     +FE  A+  GL
Sbjct: 448 SMKDQVRAIQEASVIIGAHGAGLTHIVSATPNTTIFEIVSVEFQ---RPHFELIAKWKGL 504

Query: 327 EYLEYKI---KQEESSLVEK 343
           EY    +   + E ++++EK
Sbjct: 505 EYHAMHLANSRAEPTAVIEK 524


>gi|241043295|ref|XP_002407103.1| glycosyltransferase, putative [Ixodes scapularis]
 gi|215492101|gb|EEC01742.1| glycosyltransferase, putative [Ixodes scapularis]
          Length = 482

 Score = 42.7 bits (99), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 54/108 (50%), Gaps = 6/108 (5%)

Query: 226 KLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
           ++ L++RN +  R I+N +E+  AA  L G  V + +   ST          +   ++G+
Sbjct: 316 RVTLLSRNTK-HRRIVNEQELVAAARSLPGVRVRLVDFGHSTDFMHQLEVTANTDVLIGM 374

Query: 285 HGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
           HGAGLTH LF     VL ++           Y +  AR+ G+EY+ ++
Sbjct: 375 HGAGLTHVLFQPDWGVLFEIFNCED---PGCYLDL-ARLRGVEYVTWE 418


>gi|392576589|gb|EIW69719.1| hypothetical protein TREMEDRAFT_43468 [Tremella mesenterica DSM
           1558]
          Length = 445

 Score = 42.7 bits (99), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 42/167 (25%), Positives = 66/167 (39%), Gaps = 26/167 (15%)

Query: 162 ININNQTITHCFQSVTLGLISHG---------RMVINPTLLPKPKT-LVDFQSFLANAYN 211
           I+  N   T  F  V     +H          +M ++   LP P T +V  ++ L     
Sbjct: 208 IDKKNSGTTFVFDRVRFRFAAHNHGNKVRVWNKMTMDLPDLPVPSTWMVPLRNALKAYAL 267

Query: 212 ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE-------- 263
            N       H  K  + +V     V R + + R V   AE+L   +   E          
Sbjct: 268 AN---GCDLHRHKAGIPIVTY---VNRQMTSRRLVASDAEDLVMSLQDLEDRGEIEFNNA 321

Query: 264 --ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
             ES   A+ F        +VGVHG GL+H L++RPG  +++++  G
Sbjct: 322 FMESIPRAEQFCLALKTDILVGVHGNGLSHQLWMRPGGAVLELMDTG 368


>gi|400530672|gb|AFP86487.1| glycosyltransferase, partial [Eurypharynx pelecanoides]
          Length = 277

 Score = 42.7 bits (99), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 46/160 (28%), Positives = 72/160 (45%), Gaps = 22/160 (13%)

Query: 194 PKPKTLV------DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
           PK   LV       F SFLA   N           T   +V+ +R+  + R ILN  E+ 
Sbjct: 2   PKANILVSGNEIRQFSSFLAEKLN--VTVGEQAEKTGEYIVVFSRS--LNRLILNEAELI 57

Query: 248 KA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            A A+E          EE  S AD  R +     +V +HGA L  SLFL  G+ ++++ P
Sbjct: 58  LALAQEFQMKAVTVSLEEQ-SFADIVRVLSRASMLVSMHGAQLVTSLFLPRGAAVVELYP 116

Query: 307 IGTQWLSTVYFEKPARVL------GLEYLEYKIKQEESSL 340
                ++  ++  P R L       L+Y+ ++  +EE+S+
Sbjct: 117 YA---VNPEHY-APYRTLTSLPGMDLQYVAWRNTREENSV 152


>gi|443727464|gb|ELU14205.1| hypothetical protein CAPTEDRAFT_191673 [Capitella teleta]
          Length = 481

 Score = 42.4 bits (98), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 62/281 (22%), Positives = 108/281 (38%), Gaps = 49/281 (17%)

Query: 108 GNFFHEFMDCFVPLFITINS-------------HFPDQDVILAIADCNDQWARKYAELLP 154
           GNF H + D FV L+  +N              +    D+  +  DC+D    KY + LP
Sbjct: 201 GNFHHFWNDEFVMLYSLMNISNRLRPEARNHVLYRSPTDLSPSEVDCDD--IHKYQDFLP 258

Query: 155 RF-TRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNEN 213
                HP           CF++   G+ +          L     LV  Q  L+ A    
Sbjct: 259 YINVVHPHEVFYKMPSKTCFRNGVFGMAN----------LNYDSKLV-VQHVLSKA---- 303

Query: 214 TNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
             + S   H K  L++  R     R ++N  E+ +AA + GF  ++       S+ +  R
Sbjct: 304 GISPSECAHLKRVLIVQRRQ----RRLMNANELLQAAIDEGFPDSVVIDFAVYSVEEQIR 359

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE------ 327
                  M G+ GAGL  ++F+  GS L+++      W    YF       G++      
Sbjct: 360 LSACSTIMAGIQGAGLQWAIFMPVGSTLIEIAWPQKHW--GFYFNSFVTGYGIDHQTLTT 417

Query: 328 ---YLEYKIKQE---ESSLVEKYGANDLVLKNPQAFAGANW 362
              ++ +++ QE      +V +    D++  +P++     W
Sbjct: 418 DRVHVNWRVYQERVRRGQVVPEEERLDMLHSSPKSIIDNVW 458


>gi|56550358|emb|CAI30565.1| glycosyltransferase [Caenorhabditis briggsae]
          Length = 541

 Score = 42.4 bits (98), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 59/265 (22%), Positives = 105/265 (39%), Gaps = 40/265 (15%)

Query: 90  CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF--ITINSHFPDQDVILAIADC-----N 142
           C V    P +V        N +H F D FV L+  + IN  F +QD+ + + D      N
Sbjct: 208 CDVIFEKPTVVMKLDA-AVNLYHHFCD-FVNLYASLHINQTF-NQDIDIILWDTHPGGYN 264

Query: 143 DQW---------------ARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR-- 185
           D +                +++ + L         NI +     CF++V + L++  R  
Sbjct: 265 DHYYGITWKAFSRNEPFELKEFGKSLEFLNYEITENILSDQKRVCFKNVMMPLLARQRSG 324

Query: 186 MVINPTLL---PKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILN 242
           +  N  L+        L  F  F+ +          +    K ++V+++R+    R ILN
Sbjct: 325 LFYNSPLVYGCSGSTLLKTFSQFILHRLRLGIKPQKA-ELEKVRIVILSRST-AYRRILN 382

Query: 243 LREVKKAAEEL-GFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
           ++E+ K+   L      + +  E                 +G+HGAGLTH LFL   + +
Sbjct: 383 IKELLKSLGHLPNVTTRVVDYNERIPFLRQLNTTSQTDIFIGMHGAGLTHLLFLPDWAAI 442

Query: 302 MQVVPIGTQW-------LSTVYFEK 319
            +++ +G +         S +YF K
Sbjct: 443 FEIISLGLKRKLLLYSSYSDIYFSK 467


>gi|392562853|gb|EIW56033.1| hypothetical protein TRAVEDRAFT_152588 [Trametes versicolor
           FP-101664 SS1]
          Length = 486

 Score = 42.4 bits (98), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 23/81 (28%), Positives = 44/81 (54%), Gaps = 7/81 (8%)

Query: 223 TKPKLVLVNRNARVGRTILN------LREVKKAAEELGFDVTIFEPEESTSLADSFRFIH 276
           T+P +  ++R     R ++       ++E+ +  +E G++V + E ++ T + + FR   
Sbjct: 326 TRPVITYISRQGWNRRMLVQADHDKLVQELYRLRDEHGYEVNVVEMDKLTRI-EQFRLAA 384

Query: 277 SCHAMVGVHGAGLTHSLFLRP 297
               M+GVHG GLT  L++RP
Sbjct: 385 RTTIMMGVHGNGLTALLWMRP 405


>gi|242826030|ref|XP_002488559.1| glycosyltransferase, putative [Talaromyces stipitatus ATCC 10500]
 gi|218712377|gb|EED11803.1| glycosyltransferase, putative [Talaromyces stipitatus ATCC 10500]
          Length = 275

 Score = 42.4 bits (98), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 15/26 (57%), Positives = 24/26 (92%)

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVP 306
           +VGVHGAGL H++FL+P SV+++++P
Sbjct: 180 LVGVHGAGLPHAMFLKPKSVVVEILP 205


>gi|58265524|ref|XP_569918.1| hypothetical protein [Cryptococcus neoformans var. neoformans
           JEC21]
 gi|134108887|ref|XP_776558.1| hypothetical protein CNBC0520 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50259238|gb|EAL21911.1| hypothetical protein CNBC0520 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57226150|gb|AAW42611.1| expressed protein [Cryptococcus neoformans var. neoformans JEC21]
          Length = 461

 Score = 42.4 bits (98), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 23/91 (25%), Positives = 45/91 (49%), Gaps = 12/91 (13%)

Query: 224 KPKLVLVNR-------NARVGRTIL-NLREVKKAAEELGFDVTIFEPEESTSLADSFRFI 275
           KPK+V V+R       +  V   +L  L+ ++KA + +  D  +    E     + F   
Sbjct: 302 KPKIVYVDRQRTQRKFDVEVHTQLLKQLKNIEKAKKAVVVDAVL----EDLEKKEQFEMF 357

Query: 276 HSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
                ++G+HG GL H L++  G ++++++P
Sbjct: 358 SDADIILGIHGNGLAHELWMPEGGIIIEILP 388


>gi|296533968|ref|ZP_06896486.1| TPR repeat protein, partial [Roseomonas cervicalis ATCC 49957]
 gi|296265699|gb|EFH11806.1| TPR repeat protein [Roseomonas cervicalis ATCC 49957]
          Length = 313

 Score = 42.4 bits (98), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 48/96 (50%), Gaps = 6/96 (6%)

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
           T P+L L  R+    R I N  E+++A   LGF V  F+      LA   R       +V
Sbjct: 169 TGPRLYLSRRDV-PRRGIENEAELEEALRRLGFTVMTFQ---GVPLARQIRAFQEAEVVV 224

Query: 283 GVHGAGLTHSLFLRPGSVLMQVVP--IGTQWLSTVY 316
             HGAGL H +  RPG  + +++P  +GT+ L + Y
Sbjct: 225 SPHGAGLAHLVAARPGLKVFELMPAWLGTEKLRSCY 260


>gi|440796868|gb|ELR17969.1| hypothetical protein ACA1_208470 [Acanthamoeba castellanii str.
           Neff]
          Length = 574

 Score = 42.4 bits (98), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 53/100 (53%), Gaps = 15/100 (15%)

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAM- 281
           T+  + +++R+ +  R +LN +E+  A  +L     +        L D+  F     A+ 
Sbjct: 394 TESHVGIISRSFK--RFLLNEQELLHATLQLNVSAELL-------LFDTLPFYQQVQALR 444

Query: 282 -----VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVY 316
                VG+HG+GLT++L+L+ G+VL+Q++P  T   +  Y
Sbjct: 445 RTTVLVGMHGSGLTNALYLQRGAVLLQIMPFKTGGGAAAY 484


>gi|254471926|ref|ZP_05085327.1| hypothetical protein PJE062_312 [Pseudovibrio sp. JE062]
 gi|211959128|gb|EEA94327.1| hypothetical protein PJE062_312 [Pseudovibrio sp. JE062]
          Length = 399

 Score = 42.4 bits (98), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 52/111 (46%), Gaps = 8/111 (7%)

Query: 220 FHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCH 279
            H T  + + + R+    R ILN   + +  E  GF   +  P + T   +  +  +S  
Sbjct: 263 MHKTPTQKLYIARDDAKTRRILNETTLIEQLEARGFQSVV--PGKLTH-REQVKLFNSAK 319

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPI-----GTQWLSTVYFEKPARVLG 325
            +VG HGAGLT+ LF + G  L+++ P         WLS V   + A V+G
Sbjct: 320 IIVGTHGAGLTNLLFTQAGGKLVEIFPADYIQSAYAWLSHVRGLEYAPVIG 370


>gi|400530678|gb|AFP86490.1| glycosyltransferase, partial [Hiodon tergisus]
          Length = 276

 Score = 42.4 bits (98), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 52/109 (47%), Gaps = 7/109 (6%)

Query: 201 DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE-VKKAAEELGFDVTI 259
            F +FL    N     SS     +  +V+ +R+  V R ILN  E +   A+E    V  
Sbjct: 15  QFATFLMEKLNITRQGSS---RDEEYIVVFSRS--VNRLILNEAELILTLAQEFQMKVIT 69

Query: 260 FEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
              EE  S +D  + I     +V +HGA L  SLFL  G+V++++ P G
Sbjct: 70  VSLEEH-SFSDIIQAISGASILVSMHGAQLVTSLFLPRGAVVVELFPYG 117


>gi|376005002|ref|ZP_09782572.1| hypothetical protein ARTHRO_370013 [Arthrospira sp. PCC 8005]
 gi|375326596|emb|CCE18325.1| hypothetical protein ARTHRO_370013 [Arthrospira sp. PCC 8005]
          Length = 915

 Score = 42.4 bits (98), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 29/108 (26%), Positives = 50/108 (46%), Gaps = 6/108 (5%)

Query: 221 HHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHA 280
           +H  P  + V+R     R +LN  +V      LGF   +    E  S +           
Sbjct: 771 NHDYPNRIYVSRAQARHRRVLNEEQVMAQLSRLGFVRVL---PEKLSFSQQVALFSRAEI 827

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           ++G HG+GLT+ +F +PG   ++VV + +      Y+   ++ LGLE+
Sbjct: 828 VIGPHGSGLTNLVFCQPG---VRVVELMSPHYDRHYYWVISQALGLEH 872


>gi|427702431|ref|YP_007045653.1| hypothetical protein Cyagr_1133 [Cyanobium gracile PCC 6307]
 gi|427345599|gb|AFY28312.1| hypothetical protein Cyagr_1133 [Cyanobium gracile PCC 6307]
          Length = 337

 Score = 42.4 bits (98), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 43/81 (53%), Gaps = 3/81 (3%)

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           + ++R  +  R I N  E+ +  E  GF  T+ + E+ + L   + F  +   +VG HGA
Sbjct: 202 IFLSRGVKGARAIHNEWELAEQMESAGF--TVIDAEKYSVLQQIYIFSRA-RTIVGFHGA 258

Query: 288 GLTHSLFLRPGSVLMQVVPIG 308
           GLT+ +F  P   L+++  +G
Sbjct: 259 GLTNLIFADPSCELIELAIVG 279


>gi|241111502|ref|XP_002399294.1| glycosyltransferase, putative [Ixodes scapularis]
 gi|215492952|gb|EEC02593.1| glycosyltransferase, putative [Ixodes scapularis]
          Length = 521

 Score = 42.4 bits (98), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 43/216 (19%), Positives = 81/216 (37%), Gaps = 19/216 (8%)

Query: 109 NFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIINI---- 164
           N  H F D  +P+F T   H         +++C D     + +  P+     +  +    
Sbjct: 104 NLMHVFHDDLIPIFATAREH-RGCSTGEEVSNCLDNLTLFFTDNRPKGPYWYLYQVLTKD 162

Query: 165 ------NNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQS------FLANAYNE 212
                 +  T  +CF    +GL            +P+     + QS           + +
Sbjct: 163 LLLVPPSETTQLYCFNKAIVGLQKQSTWYQYGFRMPQGPLERNLQSAGKEIKLFTKEFLK 222

Query: 213 NTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSF 272
             N   S        V+V+R+    R ILN  E+    +     V +    E  +L+   
Sbjct: 223 MLNIQPSLSVEAGYAVIVSRSR--NRLILNEEELLDMVKTHAALVPVVVDLEREALSKVL 280

Query: 273 RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
           + +     +V +HG+ L  S+F++PG V++++ P G
Sbjct: 281 QLLVGAKLLVAMHGSALILSMFMKPGGVVLEMFPYG 316


>gi|443685144|gb|ELT88859.1| hypothetical protein CAPTEDRAFT_190671 [Capitella teleta]
          Length = 416

 Score = 42.0 bits (97), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 5/106 (4%)

Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
           GR I+N RE     +E  F    F  + ST L            M+GVHGAGL + L   
Sbjct: 288 GRRIINKRETVHFLKE-RFPKNFFVFKGSTDLDKHASLFARTKLMIGVHGAGLYNMLLCN 346

Query: 297 PGSVLMQVVPIGTQW----LSTVYFEKPARVLGLEYLEYKIKQEES 338
             + +++++PI        LST  F K A  +G  Y    +++ +S
Sbjct: 347 SKTTILEIMPIDKGSLLPNLSTDIFWKMASSIGQNYWRIYVERIDS 392


>gi|440464716|gb|ELQ34090.1| DUF563 domain-containing protein [Magnaporthe oryzae Y34]
 gi|440480711|gb|ELQ61362.1| DUF563 domain-containing protein [Magnaporthe oryzae P131]
          Length = 506

 Score = 42.0 bits (97), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 10/91 (10%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEEST-SLADSFRFI-HSCHAM 281
           K ++  ++R  R  R +L    +  AA     DV +   + +T S  +  R + H    +
Sbjct: 327 KLRVTFIDR--RGSRKLLGQERLLDAARRAYPDVQVRSVDFATLSFVEQIRLVRHETDVL 384

Query: 282 VGVHGAGLTHSLFLRP------GSVLMQVVP 306
           VGVHGAGLTH +FLR       G  +++++P
Sbjct: 385 VGVHGAGLTHVMFLRAASEAVGGGAIVEILP 415


>gi|428180864|gb|EKX49730.1| hypothetical protein GUITHDRAFT_135893 [Guillardia theta CCMP2712]
          Length = 1399

 Score = 42.0 bits (97), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 71/310 (22%), Positives = 127/310 (40%), Gaps = 46/310 (14%)

Query: 20   HRSYDLCLINGSALFDP-KTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKE- 77
            +R  ++C+ N S LF P  +S +F + H       P L +    + + S K     +K  
Sbjct: 975  YRLRNVCIANRSILFFPIPSSPWFQLDH------DPVLLLGRHWWGQWSSKYCSVTLKRG 1028

Query: 78   ---LTITTSAPPNLSCGVTHTSPA--LVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
               +        NL  G    + A  +  +  G + +F H F D  +PLF TI+  F +Q
Sbjct: 1029 KSLVDFMKQNGINLDDGDAGEAGAGPVYVTLEGPSSHFGHTFTDSMLPLFATIS--FVEQ 1086

Query: 133  DV------ILAIADC-------NDQWAR-KYAELLPRFTRHPIININNQTI------THC 172
            +       I+    C        D+ ++ ++ +L    + HP +++ + +         C
Sbjct: 1087 ERRRERRKIILWPHCANFCPIKGDRRSQGRFRDLFYVASSHPQVDLLSLSSFSLSLRCRC 1146

Query: 173  FQSVTLGLISHGRMVI---NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPK-LV 228
            F+ V LG +   R+     N   +P  + L D    L      N   S++F     +   
Sbjct: 1147 FKEVVLGELIDTRVGYQRDNRRTVPFLEDLADAVRML----RRNILGSTAFRKVDQRERR 1202

Query: 229  LVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGA 287
             +    R  R ++N+  +++  +ELGF    F+ + E   L    + +     +VGV+GA
Sbjct: 1203 RMTLVRRRDRLVVNIHGLERLGQELGF--LAFQVDWEHLRLFQQIQVMEQTDVLVGVYGA 1260

Query: 288  GLTHSLFLRP 297
            GL    F RP
Sbjct: 1261 GLVKIAFARP 1270


>gi|345569002|gb|EGX51871.1| hypothetical protein AOL_s00043g605 [Arthrobotrys oligospora ATCC
           24927]
          Length = 531

 Score = 42.0 bits (97), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 52/110 (47%), Gaps = 8/110 (7%)

Query: 222 HTKPKLVLVNRNARVGRTILNLREVKKA--AEELGFDVTIFEPEESTSLADSFRFIHSCH 279
           H+   L  ++R     R + N +E+  A  A     +V + +  + T L +    + +  
Sbjct: 376 HSPINLTFIDRKG--SRKLTNSKELTDALIAAYPKVNVKVVDMADLT-LKEQISLVVNTD 432

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYL 329
            +VGVHGAG TH+ FL P S L++++P     L    F   A + G+ Y 
Sbjct: 433 VLVGVHGAGHTHAFFLPPQSSLVEILPAD---LKHKGFRNLAGLRGIRYF 479


>gi|113476309|ref|YP_722370.1| hypothetical protein Tery_2708 [Trichodesmium erythraeum IMS101]
 gi|110167357|gb|ABG51897.1| Tetratricopeptide TPR_2 [Trichodesmium erythraeum IMS101]
          Length = 979

 Score = 42.0 bits (97), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 54/115 (46%), Gaps = 10/115 (8%)

Query: 214 TNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
           TN  S F    P+ + + R     R ++N  EV     + GF  T   PE S SL +   
Sbjct: 832 TNNKSYF----PEKIYIGRGNAKYRRVMNEAEVVDILRQFGF--TYVTPE-SISLENQIS 884

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
                  +V  HG+GLT+ +F  PG+   +V+ + +      Y+   +++LGLE+
Sbjct: 885 TFAHAKIIVAPHGSGLTNIVFCNPGT---KVIELFSPHYLRYYYWHISQLLGLEH 936


>gi|428205793|ref|YP_007090146.1| capsular polysaccharide biosynthesis protein-like protein
           [Chroococcidiopsis thermalis PCC 7203]
 gi|428007714|gb|AFY86277.1| capsular polysaccharide biosynthesis protein-like protein
           [Chroococcidiopsis thermalis PCC 7203]
          Length = 412

 Score = 42.0 bits (97), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 36/74 (48%), Gaps = 3/74 (4%)

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           + ++R    GR ++N  EV  A   LGF   +    E  S AD  R       +V  HGA
Sbjct: 272 IYISRTKTTGRQVINEEEVLAALAPLGF---VSYTMERMSFADQVRLFSQAEIVVSPHGA 328

Query: 288 GLTHSLFLRPGSVL 301
            LT++LF +  SV+
Sbjct: 329 ALTNTLFAQNLSVI 342


>gi|18423754|ref|NP_568825.1| beta-(1,2)-xylosyltransferase [Arabidopsis thaliana]
 gi|17369167|sp|Q9LDH0.1|XYLT_ARATH RecName: Full=Beta-(1,2)-xylosyltransferase
 gi|15724166|gb|AAL06475.1|AF411785_1 AT5g55500/MTE17_21 [Arabidopsis thaliana]
 gi|7671513|emb|CAB89489.1| beta1,2-Xylosyltransferase [Arabidopsis thaliana]
 gi|7768105|emb|CAB90610.1| beta1,2-xylosyltransferase [Arabidopsis thaliana]
 gi|9758182|dbj|BAB08567.1| unnamed protein product [Arabidopsis thaliana]
 gi|23505973|gb|AAN28846.1| At5g55500/MTE17_21 [Arabidopsis thaliana]
 gi|332009252|gb|AED96635.1| beta-(1,2)-xylosyltransferase [Arabidopsis thaliana]
          Length = 534

 Score = 42.0 bits (97), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 42/80 (52%), Gaps = 6/80 (7%)

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
           S+ D  R I     ++G HGAGLTH +   P + + +++ +  Q     +FE  A+  GL
Sbjct: 448 SMKDQVRAIQDASVIIGAHGAGLTHIVSATPNTTIFEIISVEFQ---RPHFELIAKWKGL 504

Query: 327 EYLEYKI---KQEESSLVEK 343
           EY    +   + E ++++EK
Sbjct: 505 EYHAMHLANSRAEPTAVIEK 524


>gi|423065545|ref|ZP_17054335.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
 gi|406712988|gb|EKD08163.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
          Length = 917

 Score = 42.0 bits (97), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 49/107 (45%), Gaps = 6/107 (5%)

Query: 222 HTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAM 281
           H  P  + V+R     R +LN  +V      LGF   +    E  S +           +
Sbjct: 774 HDYPNRIYVSRAQARHRRVLNEEQVMAQLSRLGFVRVL---PEKLSFSQQVALFSRAEIV 830

Query: 282 VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           +G HG+GLT+ +F +PG   ++VV + +      Y+   ++ LGLE+
Sbjct: 831 IGPHGSGLTNLVFCQPG---VRVVELMSPHYDRHYYWVISQALGLEH 874


>gi|209528249|ref|ZP_03276714.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
 gi|209491321|gb|EDZ91711.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
          Length = 915

 Score = 42.0 bits (97), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 49/107 (45%), Gaps = 6/107 (5%)

Query: 222 HTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAM 281
           H  P  + V+R     R +LN  +V      LGF   +    E  S +           +
Sbjct: 772 HDYPNRIYVSRAQARHRRVLNEEQVMAQLSRLGFVRVL---PEKLSFSQQVALFSRAEIV 828

Query: 282 VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           +G HG+GLT+ +F +PG   ++VV + +      Y+   ++ LGLE+
Sbjct: 829 IGPHGSGLTNLVFCQPG---VRVVELMSPHYDRHYYWVISQALGLEH 872


>gi|8572250|gb|AAF77064.1| beta 1,2-xylosyltransferase [Arabidopsis thaliana]
          Length = 533

 Score = 42.0 bits (97), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 42/80 (52%), Gaps = 6/80 (7%)

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
           S+ D  R I     ++G HGAGLTH +   P + + +++ +  Q     +FE  A+  GL
Sbjct: 447 SMKDQVRAIQDASVIIGAHGAGLTHIVSATPNTTIFEIISVEFQ---RPHFELIAKWKGL 503

Query: 327 EYLEYKI---KQEESSLVEK 343
           EY    +   + E ++++EK
Sbjct: 504 EYHAMHLANSRAEPTAVIEK 523


>gi|344276098|ref|XP_003409846.1| PREDICTED: uncharacterized glycosyltransferase AER61-like
           [Loxodonta africana]
          Length = 443

 Score = 42.0 bits (97), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 50/99 (50%), Gaps = 6/99 (6%)

Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
           R ILN  E+  A + +  F+V I + + +     D  R  H+    +G+HGAGLTH LFL
Sbjct: 293 RKILNQNELVNALKTVSTFEVRIVDYKYKELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 352

Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIK 334
              + + ++     +     Y +  AR+ G+ Y+ ++ K
Sbjct: 353 PDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWRKK 387


>gi|393213882|gb|EJC99377.1| hypothetical protein FOMMEDRAFT_160990 [Fomitiporia mediterranea
           MF3/22]
          Length = 537

 Score = 42.0 bits (97), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 42/94 (44%), Gaps = 4/94 (4%)

Query: 243 LREVKKAAEELGFDVTIFE----PEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
           +RE+KK      +DV I E      E T   +  R       M+GV+G  LT+S+ LRPG
Sbjct: 402 VRELKKLGSSQKWDVHIIEIGGDGGEPTPWVEHIRAAAQSSIMLGVYGDSLTNSILLRPG 461

Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
                V+        T   E   R LG+ Y+ ++
Sbjct: 462 PPTPAVIEFFPDGRYTNEHEFVTRALGIHYVAWR 495


>gi|323451963|gb|EGB07838.1| hypothetical protein AURANDRAFT_71684 [Aureococcus anophagefferens]
          Length = 980

 Score = 42.0 bits (97), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 37/142 (26%), Positives = 63/142 (44%), Gaps = 20/142 (14%)

Query: 187 VINPTLLPKPKTLVD---FQSFL--ANAYNENTNTSSSFHHTKPKLVL--------VNRN 233
           V++ T +P+   L D    ++FL  A+  +   + + +   T+P   +        +NR 
Sbjct: 170 VVHATYVPRATYLRDPDATRAFLEAADGLHGGGSVADAAPSTRPPRTVTLLLRSANLNRP 229

Query: 234 A-RVGRTILNLREVKKAAEEL------GFDVTIFEPEESTSLADSFRFIHSCHAMVGVHG 286
           A R G    N RE+  + E        G+ +       + S A  FR       +VG HG
Sbjct: 230 AKRAGPAWANSRELAASLERWLSSEHPGWVLQTVALAHAFSFASQFRTAQHSGVLVGAHG 289

Query: 287 AGLTHSLFLRPGSVLMQVVPIG 308
           A LT+ +F R G+ L++V+  G
Sbjct: 290 ASLTNGVFQRDGAALLEVLNCG 311


>gi|220926108|ref|YP_002501410.1| capsular polysaccharide biosynthesis protein-like protein
           [Methylobacterium nodulans ORS 2060]
 gi|219950715|gb|ACL61107.1| Capsular polysaccharide biosynthesis protein-like protein
           [Methylobacterium nodulans ORS 2060]
          Length = 350

 Score = 42.0 bits (97), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 29/105 (27%), Positives = 48/105 (45%), Gaps = 6/105 (5%)

Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
           +R     R  +N   V     +LG+ +  F  E   S  D     H C ++VGV GA LT
Sbjct: 225 SRGRGASRDFVNEPAVADHLSKLGY-INKFAGE--LSFTDQISVFHGCSSVVGVMGAALT 281

Query: 291 HSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQ 335
           + +F +PG+ +   +P        V F + A    L+Y+E + ++
Sbjct: 282 NIVFCKPGTTVYCFMPSSA---CEVLFWRIAEARRLKYIEIRSRE 323


>gi|403235324|ref|ZP_10913910.1| hypothetical protein B1040_06025 [Bacillus sp. 10403023]
          Length = 386

 Score = 42.0 bits (97), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 36/69 (52%), Gaps = 3/69 (4%)

Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
           R+I+N  E+ +A ++  F     E      ++      HS   ++  HGAGLT+  F RP
Sbjct: 261 RSIVNEDEILEALKDYKFKTIRLE---EFRVSKQIEIFHSASVIISPHGAGLTNLAFCRP 317

Query: 298 GSVLMQVVP 306
           G+ ++++ P
Sbjct: 318 GTKVLEIYP 326


>gi|397641682|gb|EJK74793.1| hypothetical protein THAOC_03511 [Thalassiosira oceanica]
          Length = 945

 Score = 42.0 bits (97), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 44/82 (53%), Gaps = 6/82 (7%)

Query: 229 LVNRNARVGRTILNLREVKKAAEE--LGFDVTIFEP--EESTSLADSFRFIHSCHAMVGV 284
           L  R++R  R  +N+  V  A  E   G  V I E   E +++ ++ F    S  AMVGV
Sbjct: 415 LAQRSSR--RRWINMPSVMNACREEFAGGGVVIVEVNVEHTSTPSEQFLLHRSLDAMVGV 472

Query: 285 HGAGLTHSLFLRPGSVLMQVVP 306
           HGA LT  + L P S +++++P
Sbjct: 473 HGAQLTQGVLLPPYSHILELLP 494


>gi|334117761|ref|ZP_08491852.1| Tetratricopeptide TPR_1 repeat-containing protein [Microcoleus
            vaginatus FGP-2]
 gi|333460870|gb|EGK89478.1| Tetratricopeptide TPR_1 repeat-containing protein [Microcoleus
            vaginatus FGP-2]
          Length = 1363

 Score = 42.0 bits (97), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 3/77 (3%)

Query: 228  VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
            + ++R     R I+N  EV K  E+LGF     E   + S+A+    + +   +V  HG 
Sbjct: 1229 IYISRQQASYRRIVNDEEVIKYLEKLGFRSVKLE---TMSIAEQASCLAAAKVVVAPHGG 1285

Query: 288  GLTHSLFLRPGSVLMQV 304
            GLT+ +F  PG+ ++++
Sbjct: 1286 GLTNLVFCSPGTKVIEI 1302


>gi|330792027|ref|XP_003284092.1| hypothetical protein DICPUDRAFT_147835 [Dictyostelium purpureum]
 gi|325086021|gb|EGC39418.1| hypothetical protein DICPUDRAFT_147835 [Dictyostelium purpureum]
          Length = 515

 Score = 42.0 bits (97), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 38/67 (56%), Gaps = 6/67 (8%)

Query: 233 NARVGRTILNLRE----VKKAAEELGFDVTI-FEPEESTSLADSFRFIHSCHAMVGVHGA 287
           N  +G+T++N+R+    +++   E+G    +  EP+E T LADS   I  C A  GV GA
Sbjct: 405 NYNIGKTLINIRQSFYRIRELMREMGAIADVPLEPKEQTELADSTMNIKGCVA-AGVPGA 463

Query: 288 GLTHSLF 294
           G   +LF
Sbjct: 464 GGFDALF 470


>gi|86564130|ref|NP_506677.3| Protein H12D21.10, isoform a [Caenorhabditis elegans]
 gi|72058551|emb|CAB07427.3| Protein H12D21.10, isoform a [Caenorhabditis elegans]
          Length = 523

 Score = 42.0 bits (97), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 53/253 (20%), Positives = 101/253 (39%), Gaps = 22/253 (8%)

Query: 90  CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF--ITINSHFPDQDVILAIADC-----N 142
           C +    P ++        N +H F D F  L+  + +N  F DQDV + + D      N
Sbjct: 206 CDIIFEKPTIIMKLDA-AVNLYHHFCD-FFNLYASLHLNQTF-DQDVDIILWDTHPGGYN 262

Query: 143 DQWARKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTL 199
           D +   Y      F+++    +        CF+ V + L++  R  +  N  ++      
Sbjct: 263 DHY---YGVTWKAFSKNQPFELKEFDQKKVCFKRVMMPLLARQRTGLFYNSPVVEGCSGS 319

Query: 200 VDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVT 258
             F++F     +            K ++V+++R+    R ILN++E+ ++   L      
Sbjct: 320 KMFRTFSQFILHRLGIRQPKADLEKARIVILSRSTAF-RKILNIKEILRSLGHLPNVSTR 378

Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
           + +  E                 +G+HGAGLTH LFL   + + ++   G        + 
Sbjct: 379 VVDYNERIPFEKQLNITSKTDIFIGMHGAGLTHLLFLPDWAAVFEIYNCGDPGC----YS 434

Query: 319 KPARVLGLEYLEY 331
             AR+ G++Y  +
Sbjct: 435 DLARLRGVKYYTW 447


>gi|384247850|gb|EIE21335.1| hypothetical protein COCSUDRAFT_56558 [Coccomyxa subellipsoidea
           C-169]
          Length = 394

 Score = 42.0 bits (97), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 25/93 (26%), Positives = 43/93 (46%), Gaps = 11/93 (11%)

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEEL---------GFDVTIFEPEESTSLADSFR 273
           T  K+  + R     R ILNL+++ +   EL             T    ++  +      
Sbjct: 219 TAIKVAFIERTK--SRHILNLKDLIRRCNELQVGEAPRNRSVQCTAVSFDDVGNFTGLLA 276

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            + +   +VGVHGAGL ++ F+RPGS  +++ P
Sbjct: 277 ELQTIDILVGVHGAGLVNTYFMRPGSAFLEIFP 309


>gi|395326538|gb|EJF58946.1| hypothetical protein DICSQDRAFT_65917 [Dichomitus squalens LYAD-421
           SS1]
          Length = 411

 Score = 41.6 bits (96), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 23/82 (28%), Positives = 42/82 (51%), Gaps = 7/82 (8%)

Query: 222 HTKPKLVLVNRNARVGRTILN------LREVKKAAEELGFDVTIFEPEESTSLADSFRFI 275
            T+P +  ++R     R +        + E+ +  +E G++V + E ++ T + + FR  
Sbjct: 250 ETRPVISYISRQGWNRRKLRQEDHERLVEELYRLRDEYGYEVNVVEMDKLTRM-EQFRLA 308

Query: 276 HSCHAMVGVHGAGLTHSLFLRP 297
                M+GVHG GLT  L++RP
Sbjct: 309 GRTTIMMGVHGNGLTALLWMRP 330


>gi|242825040|ref|XP_002488357.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
 gi|218712175|gb|EED11601.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
          Length = 461

 Score = 41.6 bits (96), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 48/101 (47%), Gaps = 6/101 (5%)

Query: 243 LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
           LREV++    +   +  F    S    +  R       +VGVHG  L+H++FL+P SV++
Sbjct: 338 LREVQERFPYVKVQIVDFS---SIPFKEQIRIAQRTDILVGVHGVELSHAIFLKPQSVVV 394

Query: 303 QVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEK 343
           +++P      +   F+  +R+ G  Y      + E  LV +
Sbjct: 395 EILPSSC---NLTMFQHLSRLRGHTYYSTHGAEVEFDLVRR 432


>gi|71663377|ref|XP_818682.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70883947|gb|EAN96831.1| hypothetical protein Tc00.1047053509733.95 [Trypanosoma cruzi]
          Length = 362

 Score = 41.6 bits (96), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 18/59 (30%), Positives = 37/59 (62%), Gaps = 3/59 (5%)

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
           F+H+    +  HGAG+ + + + PGSV++++ P G ++     +++ A +LGL Y+ Y+
Sbjct: 234 FMHNSSIFIAAHGAGMVNIMSMSPGSVVVELFPHGFRY---AMYQELAELLGLHYIAYE 289


>gi|392397499|ref|YP_006434100.1| hypothetical protein Fleli_1917 [Flexibacter litoralis DSM 6794]
 gi|390528577|gb|AFM04307.1| hypothetical protein Fleli_1917 [Flexibacter litoralis DSM 6794]
          Length = 337

 Score = 41.6 bits (96), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 19/79 (24%), Positives = 41/79 (51%), Gaps = 3/79 (3%)

Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           K + ++R     R I N  EV++  ++  ++   FE     +       +    +++G+H
Sbjct: 207 KKIYISRQKSRFRKINNEDEVQQLLKKYNYETHYFE---DYNFEKQIELMQQTTSLIGLH 263

Query: 286 GAGLTHSLFLRPGSVLMQV 304
           GAGLT+ LF+ P + ++++
Sbjct: 264 GAGLTNMLFMNPNTKILEI 282


>gi|302765551|ref|XP_002966196.1| glycoprotein beta-1,2-xylosyltransferase [Selaginella
           moellendorffii]
 gi|300165616|gb|EFJ32223.1| glycoprotein beta-1,2-xylosyltransferase [Selaginella
           moellendorffii]
          Length = 495

 Score = 41.6 bits (96), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 28/38 (73%)

Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
           LA+  + + S   +VGVHGAGLTH +F RPG+ +++++
Sbjct: 411 LAEQIKAVKSSSIIVGVHGAGLTHIVFARPGTSVVEML 448


>gi|307109772|gb|EFN58009.1| hypothetical protein CHLNCDRAFT_57085 [Chlorella variabilis]
          Length = 1084

 Score = 41.6 bits (96), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 37/138 (26%), Positives = 59/138 (42%), Gaps = 7/138 (5%)

Query: 198 TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEEL--GF 255
            L  ++  L   Y  + +T +        +++V+R    GR +LN+R V+K   E   G 
Sbjct: 291 ALQSWRERLGACYGLDVHTPARVQGKT--VLIVDRPYEAGRHMLNVRSVEKFVRERFPGD 348

Query: 256 DVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP--IGTQWLS 313
           DV + +  E     D  R I     +V VHG+ +    FL P +V + + P       L 
Sbjct: 349 DVRL-QYFEDVPFPDQLRIISGVSILVMVHGSAIALWPFLPPKAVAIHISPDIESAHSLQ 407

Query: 314 TVYFEKPARVLGLEYLEY 331
            V+ E  AR  G   L +
Sbjct: 408 RVWAEAYARDWGFTGLTF 425


>gi|428777835|ref|YP_007169622.1| hypothetical protein PCC7418_3293 [Halothece sp. PCC 7418]
 gi|428692114|gb|AFZ45408.1| hypothetical protein PCC7418_3293 [Halothece sp. PCC 7418]
          Length = 467

 Score = 41.6 bits (96), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 23/83 (27%), Positives = 44/83 (53%), Gaps = 9/83 (10%)

Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGF---DVTIFEPEESTSLADSFRFIHSCHAMV 282
           K + ++R     R I+N +E+    E +GF   ++T    EE  S+      +     +V
Sbjct: 333 KYIYISRGDAKRRPIINQKEIISVLEAMGFSSVEMTYRSFEEQISI------VRDASIIV 386

Query: 283 GVHGAGLTHSLFLRPGSVLMQVV 305
             HGAGLT+ +F +PG+ +++++
Sbjct: 387 APHGAGLTNMVFCKPGTKIVELI 409


>gi|428165975|gb|EKX34960.1| hypothetical protein GUITHDRAFT_118891 [Guillardia theta CCMP2712]
          Length = 337

 Score = 41.6 bits (96), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 12/121 (9%)

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVT---IFEP-----EESTSLADSFRFIHSCH 279
           VLV R+A  GR ILN  EV  A +    DV    + +P     +E   LA +  F H+  
Sbjct: 200 VLVQRSANNGRNILNHDEVLLALKRTFKDVCDWRVMKPLWGGDKELGMLAAASLFRHA-S 258

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
            +V  HG  L + +F  P + L++++P    +   + +E+ +R LGL Y    +    S 
Sbjct: 259 LIVAPHGGSLANLMFASPHACLVEILP---AFRPNLCYERLSRALGLRYFGLVVPGSFSQ 315

Query: 340 L 340
           L
Sbjct: 316 L 316


>gi|431899810|gb|ELK07757.1| hypothetical protein PAL_GLEAN10022130 [Pteropus alecto]
          Length = 474

 Score = 41.6 bits (96), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 6/97 (6%)

Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
           R ILN  E+  A + +  F+V I + + +     D  R  H+    +G+HGAGLTH LFL
Sbjct: 324 RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 383

Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
              + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 384 PDWAAVFELYNCEDE---RCYLD-LARLRGVHYITWR 416


>gi|400530666|gb|AFP86484.1| glycosyltransferase, partial [Echidna rhodochilus]
          Length = 277

 Score = 41.6 bits (96), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 54/123 (43%), Gaps = 12/123 (9%)

Query: 194 PKPKTLV------DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
           PK   LV       F +F+A   N    T       +  +V+ +R+  + R ILN  E+ 
Sbjct: 2   PKANILVSGNEIRQFSAFVAEKLN--VTTEEQAEKMEEYIVVFSRS--LNRLILNEAELI 57

Query: 248 KA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
            A A E    V     EE  S +D  R I     +V +HGA L  SLFL  G+ ++++ P
Sbjct: 58  LALAREYQMKVVTVSLEEQ-SFSDIVRIISRASMLVSMHGAQLVTSLFLPRGAAVVELFP 116

Query: 307 IGT 309
              
Sbjct: 117 FAV 119


>gi|428315210|ref|YP_007113092.1| Tetratricopeptide TPR_2 repeat-containing protein [Oscillatoria
            nigro-viridis PCC 7112]
 gi|428238890|gb|AFZ04676.1| Tetratricopeptide TPR_2 repeat-containing protein [Oscillatoria
            nigro-viridis PCC 7112]
          Length = 1363

 Score = 41.6 bits (96), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 40/77 (51%), Gaps = 3/77 (3%)

Query: 228  VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
            + + R     R ILN  EV K  E++GF     E   + S+A+    + +   +V  HG 
Sbjct: 1229 IYITRQQASYRRILNDEEVIKYLEKIGFRSVKLE---TMSVAEQASCLAAAKVVVAPHGG 1285

Query: 288  GLTHSLFLRPGSVLMQV 304
            GLT+ +F  PG+ ++++
Sbjct: 1286 GLTNLVFCSPGTKVIEI 1302


>gi|301097784|ref|XP_002897986.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262106431|gb|EEY64483.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 477

 Score = 41.6 bits (96), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 51/103 (49%), Gaps = 16/103 (15%)

Query: 237 GRTILNLREVKKA----------AEELGFDVTIFEPEESTSLADSFRFIHS-CHAMVGVH 285
           GR ILN  EV +A           E+L +    F  +E   L D   F+      M+G H
Sbjct: 327 GRQILNEPEVFEAISAMLNATGRPEQLQY----FRHDEFNGLEDVASFMRDRVKMMIGPH 382

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           GA   ++ F +P + L++++P   ++    ++E+ AR+LG +Y
Sbjct: 383 GAAFYNARFAQPRTALIEIIPDPGKFFVPCFWEQ-ARLLGQDY 424


>gi|23006630|ref|ZP_00048862.1| hypothetical protein Magn03002098 [Magnetospirillum magnetotacticum
           MS-1]
          Length = 113

 Score = 41.6 bits (96), Expect = 0.80,   Method: Composition-based stats.
 Identities = 27/76 (35%), Positives = 41/76 (53%), Gaps = 6/76 (7%)

Query: 253 LGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWL 312
           LGF+   F+PE +     + RF     A+VG+ GAG+ +++F  PG+ ++ +VP G  W 
Sbjct: 8   LGFEA--FDPELADMRQQAVRFA-GAEAVVGLAGAGMANTVFCAPGTPILHLVPEG--WE 62

Query: 313 STVYFEKPARVLGLEY 328
              Y E  A   G EY
Sbjct: 63  DPFYAEL-AMACGQEY 77


>gi|450221959|ref|ZP_21896674.1| region 2 capsular polysaccharide biosynthesis protein [Escherichia
           coli O08]
 gi|449315599|gb|EMD05740.1| region 2 capsular polysaccharide biosynthesis protein [Escherichia
           coli O08]
          Length = 378

 Score = 41.2 bits (95), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 33/118 (27%), Positives = 60/118 (50%), Gaps = 13/118 (11%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGF---DVTIFEPEESTSLADSFRFIHSCHA 280
           K K+ +  RNA   R I N  ++ K  EE G+   D   + P E  +L     F H+   
Sbjct: 246 KHKIFISRRNA-TQRKIHNEIDLYKYFEEQGYYICDPGDYTPSEQIAL-----FSHA-EI 298

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEES 338
           +VG HG G+ +++F +   +L++++P  T W    Y+ +  +++  +Y  Y I++ E 
Sbjct: 299 IVGSHGMGIANAVFSKNLKILIEIMP--TDWNRVSYY-RTTQLMNCKYGCYWIEKNED 353


>gi|302846443|ref|XP_002954758.1| hypothetical protein VOLCADRAFT_118856 [Volvox carteri f.
           nagariensis]
 gi|300259941|gb|EFJ44164.1| hypothetical protein VOLCADRAFT_118856 [Volvox carteri f.
           nagariensis]
          Length = 888

 Score = 41.2 bits (95), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 37/64 (57%), Gaps = 7/64 (10%)

Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIG----TQWLSTVYFEKPARVLGLEYLEYKIKQE 336
           +VGVHG+ LT++LF+RPGS L+++ P G      W   +Y +   R  G+E   + I   
Sbjct: 604 LVGVHGSALTNALFMRPGSSLIELRPYGFSGRESW-PNIYMKSQTR--GMEVFWFGIDVM 660

Query: 337 ESSL 340
            ++L
Sbjct: 661 SANL 664


>gi|452824307|gb|EME31311.1| transferase, transferring glycosyl groups [Galdieria sulphuraria]
          Length = 637

 Score = 41.2 bits (95), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 45/182 (24%), Positives = 76/182 (41%), Gaps = 31/182 (17%)

Query: 162 ININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH 221
           ININ      CF+SV  G  S  + ++N           +   F  N   +   T+S + 
Sbjct: 327 ININASM--KCFRSVVTGKGSDVKYLLNS----------ENSFFKVNNLRKEARTASYWC 374

Query: 222 HTKPKLVLVNRNARVGRTIL-------NLREVKKAAE--------ELGFDVTIFEPEEST 266
               ++ +++R     RT++       N+R+++   E         + F   I   EE T
Sbjct: 375 EKPIQVTILSRKTNNARTLVGADNFAENIRKLQVTKESTQDKKTCHMTFHCQIVYFEEMT 434

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
            L +    +     ++ VHGAG T+ +FL   SVL+++ P   +      FE+ AR   L
Sbjct: 435 FL-EQVSIMQKTDILIAVHGAGNTNIVFLPENSVLIEIYPFAYK---ANIFEELARKYLL 490

Query: 327 EY 328
            Y
Sbjct: 491 RY 492


>gi|348665751|gb|EGZ05580.1| hypothetical protein PHYSODRAFT_261939 [Phytophthora sojae]
          Length = 463

 Score = 41.2 bits (95), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 38/133 (28%), Positives = 65/133 (48%), Gaps = 12/133 (9%)

Query: 231 NRNARVGRTILNLREVKKA-AEELGF-----DVTIFEPEESTSLADSFRFIHS-CHAMVG 283
            R A  GR ILN  ++  A A  L        +  F+ +E   L +   F+      M+G
Sbjct: 302 GRAANGGRQILNEPKLFDAIAARLNATGRPEQLMYFKHDEFNGLEEVATFMRDRVKMMIG 361

Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY--KIKQEESSLV 341
            HGA   ++ F +P + L+++VP   Q+    ++E+ AR+LG +Y  +    + E++ ++
Sbjct: 362 RHGAAFYNARFAQPRTALIEIVPDPEQFFVPCFWEQ-ARLLGQDYSAHVGNTQNEQNDMI 420

Query: 342 --EKYGANDLVLK 352
             E     DLVLK
Sbjct: 421 VDEVQDVVDLVLK 433


>gi|291230250|ref|XP_002735084.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 649

 Score = 41.2 bits (95), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 49/224 (21%), Positives = 92/224 (41%), Gaps = 38/224 (16%)

Query: 109 NFFHEFMDCFVPLFITI------NSHFPDQDVILAIADCNDQWA-RKYAELLPRFTRHPI 161
           N  H   D  +PLF T+      + H  + D  + + +    W   +Y +L   F+  P+
Sbjct: 215 NLMHVLHDDLLPLFTTLSQITSSDEHLFNLDTRIILME---GWKPGEYKDLYQLFSDVPV 271

Query: 162 ININNQTITH-------CFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENT 214
           +  +N   T        CF++  LGL      V       KP      Q  +AN     T
Sbjct: 272 VLKSNLKSTESPGNNLICFRNAFLGL--SKSTVWYDYGFTKP------QGRVANVKTTAT 323

Query: 215 NTSSSFHHTKPKLVLVNRNAR-----------VGRTILNLREVKKA-AEELGFDVTIFEP 262
                    + +L +V+ ++R           + R +LN  E+  A  ++ G  V     
Sbjct: 324 EIRQFTKFVEKRLGIVSDDSRAEEFIVILSRKINRLLLNEGELMFALIQQFGLKVMSLSV 383

Query: 263 EESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
           E + ++ +    +     ++GVHG+ ++ S+FLR  +V++++ P
Sbjct: 384 E-THAIREQIELVSKASVLIGVHGSLMSLSMFLRESAVVIEIFP 426


>gi|296225634|ref|XP_002758584.1| PREDICTED: uncharacterized glycosyltransferase AER61 isoform 2
           [Callithrix jacchus]
          Length = 443

 Score = 41.2 bits (95), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 6/97 (6%)

Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
           R ILN  E+  A + +  F+V I + + +     D  R  H+    +G+HGAGLTH LFL
Sbjct: 293 RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 352

Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
              + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 353 PDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWQ 385


>gi|356547204|ref|XP_003542006.1| PREDICTED: V-type proton ATPase 116 kDa subunit a isoform 1-like
           [Glycine max]
          Length = 853

 Score = 41.2 bits (95), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 61/148 (41%), Gaps = 9/148 (6%)

Query: 58  IKTQ-PYPRKSDKSAMSKVKELTITTSAPPNLSC--GVTHTSPALVFSAGGYNGNFFHEF 114
           IK Q PYP   D S      EL    S    +S   GV H +  +V S   +N +FF   
Sbjct: 550 IKYQDPYPFGVDPSWRGSRSELPFLNSLKMKMSILFGVAHMNLGIVLSY--FNAHFFRNS 607

Query: 115 MDC---FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITH 171
           +D    FVP  I +NS F    +++ I  C    A  Y  ++  F   P  N+    +  
Sbjct: 608 LDIRYQFVPQMIFLNSLFGYLSLLIVIKWCTGSQADLYHVMIYMFLS-PTDNLGENQLFW 666

Query: 172 CFQSVTLGLISHGRMVINPTLLPKPKTL 199
             + + + L+    + +   L PKP  L
Sbjct: 667 GQRPLQIVLLLLAVIAVPWMLFPKPFIL 694


>gi|328766557|gb|EGF76611.1| hypothetical protein BATDEDRAFT_92595 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 778

 Score = 41.2 bits (95), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 28/114 (24%), Positives = 57/114 (50%), Gaps = 4/114 (3%)

Query: 227 LVLVNRNARVGRTILNLREVKKAAEE-LGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           +V+++R     R ILN  ++ K  E+  G+ V IF   E  +  +   F+     ++G+H
Sbjct: 491 IVVLSRKG--NRLILNEEQMAKDLEKAFGYKV-IFVSNEQHTFEEQIVFLRKARIVIGMH 547

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
           G+ L   +F R G+V++++ P          ++  A + G++ +  K + + SS
Sbjct: 548 GSILVMVMFCRRGTVVIEMYPFAVPGDHYTPYKTLASLNGMDLVYRKWENKHSS 601


>gi|421861291|ref|ZP_16293335.1| capsular polysaccharide biosynthesis protein [Paenibacillus
           popilliae ATCC 14706]
 gi|410829184|dbj|GAC43772.1| capsular polysaccharide biosynthesis protein [Paenibacillus
           popilliae ATCC 14706]
          Length = 361

 Score = 41.2 bits (95), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 45/103 (43%), Gaps = 6/103 (5%)

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           + V+R     R ++N  EV +   E GF   +  P    S  D      S   +V   G+
Sbjct: 231 IYVSRQDAHARYVINEEEVMQVLAEKGFVRIVLTP---LSTVDKISIYSSAQVIVSPFGS 287

Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLE 330
           G  ++ F  PGS L+++ P+    +   YF K +   G+ Y E
Sbjct: 288 GSINAAFCNPGSTLIELTPVT---VVDGYFWKISNHAGMNYFE 327


>gi|427721067|ref|YP_007069061.1| hypothetical protein Cal7507_5914 [Calothrix sp. PCC 7507]
 gi|427353503|gb|AFY36227.1| hypothetical protein Cal7507_5914 [Calothrix sp. PCC 7507]
          Length = 451

 Score = 41.2 bits (95), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 40/163 (24%), Positives = 78/163 (47%), Gaps = 15/163 (9%)

Query: 183 HGRMVINPTLLPKPKTLVDFQ-SFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTIL 241
               +I P+L+  P  +  +   FL + + +N    +     +P  + +NR+  V R + 
Sbjct: 275 QAEQIIAPSLVGVPVQVPGWACDFLRHNFIDNKVAKNR----QPLRIYLNRSQVVHRKVE 330

Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
           N  E+ +     GF        E+ S+ +    + S   ++  HGAGLT+ +F +PG+ +
Sbjct: 331 NEGEIIEFLSRFGFCNI---ATENLSIWEQAELLSSAEIVIAPHGAGLTNLVFCKPGTKV 387

Query: 302 MQVV-PIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEK 343
           ++++ P G   +ST+++     V GL+Y  Y +  E   L E+
Sbjct: 388 IELLSPKG---VSTLFWALGNHV-GLDY--YYLLGEGEGLPEE 424


>gi|356566724|ref|XP_003551579.1| PREDICTED: beta-(1,2)-xylosyltransferase-like [Glycine max]
          Length = 503

 Score = 41.2 bits (95), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 35/62 (56%), Gaps = 3/62 (4%)

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
           S+ D  + IH    ++G HGAGLTH +   P +V+++++   + +    +F   +R  GL
Sbjct: 417 SMKDQVQAIHDASVIIGAHGAGLTHIVSALPKTVILEII---SSYYRRPHFAYISRWRGL 473

Query: 327 EY 328
           EY
Sbjct: 474 EY 475


>gi|154284636|ref|XP_001543113.1| predicted protein [Ajellomyces capsulatus NAm1]
 gi|150406754|gb|EDN02295.1| predicted protein [Ajellomyces capsulatus NAm1]
          Length = 495

 Score = 41.2 bits (95), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 21/75 (28%), Positives = 36/75 (48%), Gaps = 3/75 (4%)

Query: 265 STSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVL 324
           S    +  + +     + GVHGAGLTH +FL P S + +++P     L    F   A+ +
Sbjct: 384 SLPFREQIKLVRHTDILAGVHGAGLTHGMFLPPHSTIAEILPPK---LENKGFRNLAKKM 440

Query: 325 GLEYLEYKIKQEESS 339
           G +Y      + ++S
Sbjct: 441 GHKYFSSHAAEHQTS 455


>gi|149728372|ref|XP_001498814.1| PREDICTED: uncharacterized glycosyltransferase AER61-like isoform 2
           [Equus caballus]
          Length = 443

 Score = 41.2 bits (95), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 6/97 (6%)

Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
           R ILN  E+  A + +  F+V I + + +     D  R  H+    +G+HGAGLTH LFL
Sbjct: 293 RKILNQNELVNALKTVSTFEVQIVDYKYKQLGFLDQLRITHNTDIFIGMHGAGLTHLLFL 352

Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
              + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 353 PDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 385


>gi|300865616|ref|ZP_07110390.1| hypothetical protein OSCI_2080002 [Oscillatoria sp. PCC 6506]
 gi|300336371|emb|CBN55540.1| hypothetical protein OSCI_2080002 [Oscillatoria sp. PCC 6506]
          Length = 1330

 Score = 41.2 bits (95), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 24/91 (26%), Positives = 47/91 (51%), Gaps = 8/91 (8%)

Query: 219  SFHHTKPKL-----VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
            S   +KPK+     + ++R     R  +N  EV    ++ GF+    E   + S+A+   
Sbjct: 1184 SREKSKPKIDKFRRIYISRELASYRRFINEEEVLNFLDKWGFESVKLE---TMSVAEQAS 1240

Query: 274  FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
             + +   +V  HGAGLT+ +F +PG+ ++++
Sbjct: 1241 CLANAEVVVAPHGAGLTNIVFCKPGTKIIEI 1271


>gi|428777356|ref|YP_007169143.1| hypothetical protein PCC7418_2797 [Halothece sp. PCC 7418]
 gi|428691635|gb|AFZ44929.1| hypothetical protein PCC7418_2797 [Halothece sp. PCC 7418]
          Length = 397

 Score = 41.2 bits (95), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 59/139 (42%), Gaps = 15/139 (10%)

Query: 170 THCFQSVTLGLISHGRMVINPTLLPK--PKTLVDFQSFLANAYNENTNTSSSFHHTKPKL 227
           + CF S      + GR++   TLL    P T  ++  +L          S       P  
Sbjct: 205 SKCFWSYQ----NKGRVLYTDTLLAPTFPGTWRNYPQWLPKFLQREITPSPPL----PTR 256

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
            L        R ++N   +K    + GF+  I+ PEE  +    F        +VG HGA
Sbjct: 257 RLYISRTGYRRNVVNEEAIKPILIKYGFE--IYNPEEHENQPYDFA---EATIIVGPHGA 311

Query: 288 GLTHSLFLRPGSVLMQVVP 306
           GLT+  F +PG+ +++++P
Sbjct: 312 GLTNLAFCQPGTKVLELMP 330


>gi|39930531|ref|NP_775925.1| EGF domain-specific O-linked N-acetylglucosamine transferase
           precursor [Homo sapiens]
 gi|38173838|gb|AAH60887.1| Chromosome 3 open reading frame 64 [Homo sapiens]
 gi|312153034|gb|ADQ33029.1| chromosome 3 open reading frame 64 [synthetic construct]
          Length = 443

 Score = 41.2 bits (95), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 48/97 (49%), Gaps = 6/97 (6%)

Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
           R ILN  E+  A + +  F+V I + +       D  R  H+    +G+HGAGLTH LFL
Sbjct: 293 RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 352

Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
              + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 353 PDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 385


>gi|440790833|gb|ELR12100.1| hypothetical protein ACA1_077520, partial [Acanthamoeba castellanii
           str. Neff]
          Length = 484

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 51/93 (54%), Gaps = 15/93 (16%)

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAM- 281
           T+  + +++R+ +  R +LN +E+  A  ++     +        L D+  F     A+ 
Sbjct: 161 TESHVGIISRSFK--RFLLNEQELLHATLQMNVSAELL-------LFDTLPFYQQVQALR 211

Query: 282 -----VGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
                VG+HG+GLT++L+L+ G+VL+Q++P  T
Sbjct: 212 RTTVLVGMHGSGLTNALYLQRGAVLLQIMPFKT 244


>gi|224008206|ref|XP_002293062.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220971188|gb|EED89523.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 428

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 35/127 (27%), Positives = 56/127 (44%), Gaps = 11/127 (8%)

Query: 213 NTNTSSSFHHTKPKLVLVNRNAR--VGRTILNLREVKKAAEELGFDVTIFEPEESTSLAD 270
           N++  S    T   L++  R  R  VG    +  EV  A   LG  V + E E+  S   
Sbjct: 267 NSSPLSGVEDTGHVLIVQRRGDRRLVGSRNGSFVEVVDAICNLGIPVKVIEFED-LSAEQ 325

Query: 271 SFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV------VPIGTQWLSTVYFEKPARVL 324
             +   S + +V  HGA L+H+ +++PG  +++V      V  G        +   AR  
Sbjct: 326 QIQSAQSANVLVAAHGAALSHAAWMKPGGAIVEVLMRQGFVEFGD--YHKADYANLARFF 383

Query: 325 GLEYLEY 331
           GL+Y+ Y
Sbjct: 384 GLKYVYY 390


>gi|297670905|ref|XP_002813591.1| PREDICTED: uncharacterized glycosyltransferase AER61 isoform 2
           [Pongo abelii]
          Length = 444

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 48/97 (49%), Gaps = 6/97 (6%)

Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
           R ILN  E+  A + +  F+V I + +       D  R  H+    +G+HGAGLTH LFL
Sbjct: 294 RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 353

Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
              + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 354 PDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 386


>gi|225556661|gb|EEH04949.1| DUF563 domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 501

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 20/70 (28%), Positives = 35/70 (50%), Gaps = 3/70 (4%)

Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYL 329
           +  + +     + GVHGAGLTH +FL P S + +++P     L    F   A+ +G +Y 
Sbjct: 395 EQIKLVRHTDILTGVHGAGLTHGMFLPPHSTIAEILPPK---LENKGFRNLAKKMGHKYF 451

Query: 330 EYKIKQEESS 339
                + ++S
Sbjct: 452 SSHAAEHQTS 461


>gi|240281524|gb|EER45027.1| DUF563 domain-containing protein [Ajellomyces capsulatus H143]
          Length = 501

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 20/70 (28%), Positives = 35/70 (50%), Gaps = 3/70 (4%)

Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYL 329
           +  + +     + GVHGAGLTH +FL P S + +++P     L    F   A+ +G +Y 
Sbjct: 395 EQIKLVRHTDILTGVHGAGLTHGMFLPPHSTIAEILPPK---LENKGFRNLAKKMGHKYF 451

Query: 330 EYKIKQEESS 339
                + ++S
Sbjct: 452 SSHAAEHQTS 461


>gi|407409787|gb|EKF32485.1| hypothetical protein MOQ_003667 [Trypanosoma cruzi marinkellei]
          Length = 479

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 17/59 (28%), Positives = 36/59 (61%), Gaps = 3/59 (5%)

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
           F+H+    +  HGAG+ + + + PG V++++ P G ++     +++ A +LGL Y+ Y+
Sbjct: 351 FMHNSSIFIAAHGAGMVNIMSMSPGGVVVELFPYGFRY---AMYQELAELLGLHYIAYE 406


>gi|219123206|ref|XP_002181920.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217406521|gb|EEC46460.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 467

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 23/82 (28%), Positives = 42/82 (51%), Gaps = 6/82 (7%)

Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTS------LADSFRFIHSCHAMVGVH 285
           + AR GR   N  EV +A +  G +   F+  ++++      + D    + +  A+VGVH
Sbjct: 312 KEARQGRRCSNEEEVVEALKRNGVNTIFFDLAKASTSSGSDVIEDLMALLDNACAIVGVH 371

Query: 286 GAGLTHSLFLRPGSVLMQVVPI 307
           G GL +  F    + L++++PI
Sbjct: 372 GGGLYNQYFAPATTALVELIPI 393


>gi|324510177|gb|ADY44259.1| Glycosyltransferase AER61 [Ascaris suum]
          Length = 522

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 57/261 (21%), Positives = 114/261 (43%), Gaps = 36/261 (13%)

Query: 90  CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--INSHFPDQDVILAIADC------ 141
           C V    P +V      + N +H F D F+ L+++  +N  F D D+ +   D       
Sbjct: 214 CDVIFERPTVVMKLDA-SINMYHHFCD-FINLYLSQHLNGSF-DSDIDILWWDTFPGGFI 270

Query: 142 ----NDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPK 197
                  W R ++   P    + +I+++ + +  CF+ V L L++  R+ +   +   P 
Sbjct: 271 DASFGATW-RAFSLRQP----YELISLDQKRV--CFRQVMLPLLARQRLGLYYNM---PV 320

Query: 198 TLVDFQSFLANAYNENTNTSSSFHHTKP-----KLVLVNRNARVGRTILNLREVKKAAEE 252
                 S L +A++++       + T P     ++ L++R+    R I+N  E+      
Sbjct: 321 IDGCEGSGLFHAFSKHILHRLRVNQTGPLLNSVRVTLLSRSTNF-RRIVNENELLDVLNG 379

Query: 253 L-GFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQW 311
           + G      +   + +  D     H+    +G+HG+GLTH LFL   +V+ ++       
Sbjct: 380 ISGVVAQKVDYNSNVAFFDQLSITHNTDIFIGMHGSGLTHLLFLPDWAVIFEIYNCED-- 437

Query: 312 LSTVYFEKPARVLGLEYLEYK 332
             + Y++  AR+ G++Y  +K
Sbjct: 438 -VSCYYD-LARLRGVKYFTWK 456


>gi|325087670|gb|EGC40980.1| DUF563 domain-containing protein [Ajellomyces capsulatus H88]
          Length = 501

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 20/70 (28%), Positives = 35/70 (50%), Gaps = 3/70 (4%)

Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYL 329
           +  + +     + GVHGAGLTH +FL P S + +++P     L    F   A+ +G +Y 
Sbjct: 395 EQIKLVRHTDILTGVHGAGLTHGMFLPPHSTIAEILPPK---LENKGFRNLAKKMGHKYF 451

Query: 330 EYKIKQEESS 339
                + ++S
Sbjct: 452 SSHAAEHQTS 461


>gi|307104008|gb|EFN52264.1| expressed protein [Chlorella variabilis]
          Length = 534

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 37/135 (27%), Positives = 61/135 (45%), Gaps = 9/135 (6%)

Query: 202 FQSFLANAYNENTNTSSSFHHTKPKLVLV-NRNARVGRTILNLREVKKAAEEL--GFDVT 258
           ++  L   Y  + +T +   +   K+VL+ +R    GR +LN+R V+K   +   G DV 
Sbjct: 313 WRERLGACYGLDVHTPAPVQN---KIVLILDRPYEAGRHMLNVRSVEKFVRQRFPGDDVR 369

Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG--TQWLSTVY 316
           + +  E     D  R + S   +V VHG+ +    FL P +V + + P    T  L  V+
Sbjct: 370 L-QYFEDVPFPDQLRIMGSVSILVMVHGSAIALWPFLPPKAVAIHISPDAESTDSLQRVW 428

Query: 317 FEKPARVLGLEYLEY 331
            E   R  G   L +
Sbjct: 429 AEAYVRYWGFTGLSF 443


>gi|397480752|ref|XP_003811635.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
           transferase isoform 2 [Pan paniscus]
          Length = 443

 Score = 40.8 bits (94), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 48/97 (49%), Gaps = 6/97 (6%)

Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
           R ILN  E+  A + +  F+V I + +       D  R  H+    +G+HGAGLTH LFL
Sbjct: 293 RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 352

Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
              + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 353 PDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWQ 385


>gi|159473064|ref|XP_001694659.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158276471|gb|EDP02243.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 911

 Score = 40.8 bits (94), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 15/35 (42%), Positives = 25/35 (71%)

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
            +     +VGVHG+ LT+++F+RPGS L+++ P G
Sbjct: 752 LVRQMDVLVGVHGSALTNAMFMRPGSSLIELRPFG 786


>gi|405122953|gb|AFR97718.1| hypothetical protein CNAG_01513 [Cryptococcus neoformans var.
           grubii H99]
          Length = 461

 Score = 40.8 bits (94), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 22/91 (24%), Positives = 47/91 (51%), Gaps = 12/91 (13%)

Query: 224 KPKLVLVNR-------NARVGRTIL-NLREVKKAAEELGFDVTIFEPEESTSLADSFRFI 275
           KPK+V V+R       +  V   +L  L+++++A + +  D  + + E+       F   
Sbjct: 302 KPKIVYVDRQRTTRKFDVEVHTQLLKQLKKIERAKKAVVVDAVLEDLEKKKQ----FEMF 357

Query: 276 HSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
                ++G+HG GL H L++  G ++++++P
Sbjct: 358 TDADIILGIHGNGLAHELWMPEGGIVIEILP 388


>gi|158335347|ref|YP_001516519.1| hypothetical protein AM1_2192 [Acaryochloris marina MBIC11017]
 gi|158305588|gb|ABW27205.1| hypothetical protein AM1_2192 [Acaryochloris marina MBIC11017]
          Length = 394

 Score = 40.8 bits (94), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 45/91 (49%), Gaps = 7/91 (7%)

Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
           R I+N  E+  A + LGF+  I+E E+  SL +     +    +VG +G G+TH LF   
Sbjct: 282 RHIVNEAELIAALKPLGFE--IYELED-YSLEEKVELFYDAEIVVGAYGGGITHVLFSES 338

Query: 298 GSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
             +L   V   TQ     Y+   A+ LG +Y
Sbjct: 339 VKILELQVMAKTQ----TYYYYLAKALGHDY 365


>gi|340723323|ref|XP_003400040.1| PREDICTED: uncharacterized glycosyltransferase AER61-like [Bombus
           terrestris]
          Length = 557

 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 55/262 (20%), Positives = 112/262 (42%), Gaps = 25/262 (9%)

Query: 85  PP--NLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN---SH---FPDQDVIL 136
           PP  N  C +    P  +        N +H F D F  L+ +++   SH   F   + I+
Sbjct: 211 PPIVNHDCDIVIEKPTYIMKIDAI-VNMYHHFCD-FFNLYASLHVNLSHPAAFSTDNHIM 268

Query: 137 AIADCNDQWARKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGRMVI----NPT 191
                +  +   + +    FTR+P+ +++  +  T CF+++   L+   RM+     N  
Sbjct: 269 IWESYS--YRSAFQDAFQAFTRNPLWDLHTFRGETVCFKNLVFPLLP--RMIFGLYYNTP 324

Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAE 251
           L+   +    F++F  +  +          + + ++ L++R+ +  R ILN  E+ KA +
Sbjct: 325 LIYGCEKSGLFKAFGDHVLHRLRTPHHERKNQRIRVTLLSRDTQY-RRILNEDELTKALK 383

Query: 252 E-LGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
           E   + V      +  S         +    +G+HGAGLTH +FL   + + ++      
Sbjct: 384 ENPEYKVRKVIYNKKISFKKQLEITRNSDIFIGIHGAGLTHLMFLPDWAAVFEIYNCEDP 443

Query: 311 WLSTVYFEKPARVLGLEYLEYK 332
                 ++  AR+ G++Y  ++
Sbjct: 444 GC----YKDLARLRGVKYFTWE 461


>gi|407849199|gb|EKG04022.1| hypothetical protein TCSYLVIO_004922 [Trypanosoma cruzi]
          Length = 479

 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 17/59 (28%), Positives = 37/59 (62%), Gaps = 3/59 (5%)

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
           F+H+    +  HGAG+ + + + PG+V++++ P G ++     +++ A +LGL Y+ Y+
Sbjct: 351 FMHNSSIFIAAHGAGMVNIMSMSPGTVVVELFPHGFRY---AMYQELAELLGLHYIAYE 406


>gi|356566720|ref|XP_003551577.1| PREDICTED: beta-(1,2)-xylosyltransferase-like [Glycine max]
          Length = 503

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%), Gaps = 13/67 (19%)

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKP-----A 321
           S+ D  R I     ++G HGAGLTH +   P +V+++++        + YF +P     +
Sbjct: 417 SMKDQVRAIQDASVIIGAHGAGLTHIVSALPKTVILEII--------SSYFRRPHFAYIS 468

Query: 322 RVLGLEY 328
           R  GLEY
Sbjct: 469 RWKGLEY 475


>gi|87309616|ref|ZP_01091750.1| hypothetical protein DSM3645_02353 [Blastopirellula marina DSM
           3645]
 gi|87287380|gb|EAQ79280.1| hypothetical protein DSM3645_02353 [Blastopirellula marina DSM
           3645]
          Length = 397

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 44/173 (25%), Positives = 68/173 (39%), Gaps = 18/173 (10%)

Query: 171 HCFQSVTLGLISHGRMVINPT---------LLPKPKTLVDFQSFLANAY-NENTNTSSSF 220
           H F S +L L+   R  I P           L  P     F      AY  E   T +  
Sbjct: 190 HAFASESLQLLGIERSQILPMGHYTHIQAKQLIIPSVACYFPQPAGVAYLREKMQTQAWS 249

Query: 221 HHTKP-KLVL-VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSC 278
           H+ +  +L L + R     R I+N  E+  A + LGF     E      L    +     
Sbjct: 250 HYERDDRLKLYIARRRFTSRYIVNEAELFAALKPLGFQQVYLE---DLPLKKQIQLFQQA 306

Query: 279 HAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY 331
             +VG HGAG ++ ++ RPG+ + ++ P        ++F   A +  L+Y  Y
Sbjct: 307 GVIVGPHGAGFSNLVYSRPGTAVFEITPTCR---PPLFFHYLAEINALQYAVY 356


>gi|400530674|gb|AFP86488.1| glycosyltransferase, partial [Arapaima gigas]
          Length = 276

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 59/117 (50%), Gaps = 6/117 (5%)

Query: 227 LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           LV+ +R+  + R ILN  E+  A A+E          EE  SLA+  R I     +V +H
Sbjct: 38  LVVFSRS--INRLILNEAELILALAQEFQMKAITVSLEEH-SLAEIIRVISRASVLVSMH 94

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
           GA L  SLFL  G+ ++++ P          ++  A + G  L+Y+ +K   EE+S+
Sbjct: 95  GAQLITSLFLPRGAAVVELFPYAVNPDHYTPYKTLASLPGMDLQYVAWKNTIEENSV 151


>gi|414341636|ref|YP_006983157.1| hypothetical protein B932_0621 [Gluconobacter oxydans H24]
 gi|411026971|gb|AFW00226.1| hypothetical protein B932_0621 [Gluconobacter oxydans H24]
          Length = 363

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 52/102 (50%), Gaps = 7/102 (6%)

Query: 202 FQSFLANAYNENTNTSSSFHHTKPKLVLVNRN--ARVGRTILNLREVKKAAEELGFDVTI 259
           F SF+ + YN      +     K + +LV+R+  +   R   N    +  A E G+++T 
Sbjct: 208 FHSFVRDFYNSLKPDIAP--EKKSRKILVSRSDFSHSNRAFPNRAAFETMAAERGYEIT- 264

Query: 260 FEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
            +PE+  SL +  +      A++G HG+G+  +LF  PG+V+
Sbjct: 265 -QPEK-LSLEEQIQIFSEAAAVIGEHGSGMHSTLFSSPGTVV 304


>gi|219117675|ref|XP_002179628.1| betaxylosyltransferase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217408681|gb|EEC48614.1| betaxylosyltransferase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 483

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 38/137 (27%), Positives = 62/137 (45%), Gaps = 31/137 (22%)

Query: 193 LPKPK-----TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNA----------RVG 237
           LP+P+     + V   SF+   Y  NT   S  H    ++V++ R            +V 
Sbjct: 294 LPRPRESNCRSPVQMSSFVD--YVLNTYNLSHIHRIPGRVVILERIPYLDHPRRDLNKVE 351

Query: 238 RTILNLREVKKAAEE-----LGFDVTIFEPEESTSLAD-----SFRFIHSCHAMVGVHGA 287
           R + NLR + ++  +      G +VT+    E  +L D       R I   H +V  HGA
Sbjct: 352 RLLTNLRPLAESLPQRLTPYYGVNVTV----EVAALVDKPMNEQIRTIREAHVLVANHGA 407

Query: 288 GLTHSLFLRPGSVLMQV 304
           GLTHS+FL   + ++++
Sbjct: 408 GLTHSIFLDDATHVVEM 424


>gi|394337573|gb|AFN27774.1| glycosyltransferase, partial [Ostracion cubicus]
          Length = 274

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 49/190 (25%), Positives = 78/190 (41%), Gaps = 43/190 (22%)

Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
           R ILN  E+  A A+EL   V     EE  S     + I     +V +HGA L  SLFL 
Sbjct: 52  RLILNEAELIMALAQELQMRVLTVSLEEQ-SFPSIVQVIGGASMLVSMHGAQLITSLFLP 110

Query: 297 PGSVLMQVVPIGTQ----------------------WLST------VYFEKPARVLGLEY 328
           PG+V++++ P                          W +T       + ++P    G+ +
Sbjct: 111 PGAVVVELFPFAVNPDQYTPYRTLASLPGMDLHYIPWRNTEEENTVTHPDRPWEQGGIAH 170

Query: 329 LEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
           LE   +++E  +  K     L  +NP+          R+Y   Q+  +DI  F   L++ 
Sbjct: 171 LEK--EEQERIMASKEVPRHLCCRNPEWL-------FRIY---QDTLVDIPSFLEVLQEG 218

Query: 389 YKKAKKLMDK 398
             KAK ++ K
Sbjct: 219 L-KAKPILKK 227


>gi|359459096|ref|ZP_09247659.1| hypothetical protein ACCM5_10223 [Acaryochloris sp. CCMEE 5410]
          Length = 385

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 45/91 (49%), Gaps = 7/91 (7%)

Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
           R I+N  E+  A + LGF+  I+E E+  SL +     +    +VG +G G+TH LF   
Sbjct: 273 RHIVNEAELIAALKPLGFE--IYELED-FSLEEKVELFYDAEIVVGAYGGGITHVLFSES 329

Query: 298 GSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
             +L   V   TQ     Y+   A+ LG +Y
Sbjct: 330 VKILELQVMAKTQ----TYYYYLAKALGHDY 356


>gi|400530664|gb|AFP86483.1| glycosyltransferase, partial [Heteroconger hassi]
          Length = 245

 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 38/140 (27%), Positives = 65/140 (46%), Gaps = 8/140 (5%)

Query: 204 SFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE-VKKAAEELGFDVTIFEP 262
           +FLA   N           +   +V+ +R+  + R ILN  E + + A+E    V     
Sbjct: 1   AFLAEMLN--VTVEEPAEKSDEYIVVFSRS--LNRLILNEAELILELAQEFQMKVLTVSL 56

Query: 263 EESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPAR 322
           EE    AD  R I     +V +HGA L  SLFL  G+ ++++ P G        ++    
Sbjct: 57  EEQ-PFADIVRVISKASMLVSMHGAQLVTSLFLPRGAAVVELFPYGVNPEHYAPYKTLTS 115

Query: 323 VLG--LEYLEYKIKQEESSL 340
           + G  L+Y+ ++  +EE+S+
Sbjct: 116 LPGMDLQYVAWRNTKEENSV 135


>gi|389634703|ref|XP_003715004.1| hypothetical protein MGG_08052 [Magnaporthe oryzae 70-15]
 gi|351647337|gb|EHA55197.1| hypothetical protein MGG_08052 [Magnaporthe oryzae 70-15]
          Length = 623

 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 55/116 (47%), Gaps = 12/116 (10%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE-ESTSLADSFRFIHS-CHAM 281
           K KL  ++R  R  R +L   ++ +A      DV I   +  S S  +  R + S    +
Sbjct: 398 KIKLTFIDR--RGSRKLLGQEKLLEATRRAHPDVEIRSIDFASLSFVEQIRLVRSETDIL 455

Query: 282 VGVHGAGLTHSLFLR-----PGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
           VG HGAGLTH +FL+      G  ++++ P   + +S   F   A +LG EY   K
Sbjct: 456 VGAHGAGLTHIMFLQGEWEGGGKAVVEIQP---EKMSYKGFRNLAYMLGHEYFVAK 508


>gi|451818782|ref|YP_007454983.1| hypothetical protein DUF563 [Clostridium saccharoperbutylacetonicum
           N1-4(HMT)]
 gi|451784761|gb|AGF55729.1| hypothetical protein DUF563 [Clostridium saccharoperbutylacetonicum
           N1-4(HMT)]
          Length = 375

 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 40/167 (23%), Positives = 78/167 (46%), Gaps = 13/167 (7%)

Query: 171 HCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENT-NTSSSFHHTKPKLVL 229
           +C+    L  IS   ++I P  L K  TL+ ++ ++ N    N  + + S + T  + + 
Sbjct: 191 YCYDIEKLIYISD--LLIGPIDLKK-NTLLRYEDYIMNNLAANLLHDNLSINGTASRKLF 247

Query: 230 VNRNARVG-RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAG 288
           ++R   +G R +LN  ++++  +E G+++      E  S  D  +       + G  GAG
Sbjct: 248 MSRKNCLGTRLLLNQDKIEQIFKEYGYEIIY---PEIMSFQDQLKIFSEAEFIAGASGAG 304

Query: 289 LTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKI 333
            T+ LF      ++ ++P   + +    +   A VLG +  YL+ KI
Sbjct: 305 FTNILFANKSVKIICILP---KEIKLSCYSNIAGVLGQQCYYLDAKI 348


>gi|356524323|ref|XP_003530779.1| PREDICTED: beta-(1,2)-xylosyltransferase-like [Glycine max]
          Length = 503

 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 35/62 (56%), Gaps = 3/62 (4%)

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
           S+ D  + +H    ++G HGAGLTH +   P +V+++++   + +    +F   +R  GL
Sbjct: 417 SMKDQVQAVHDASVIIGAHGAGLTHIVSALPKTVILEII---SSYYRRPHFAYISRWRGL 473

Query: 327 EY 328
           EY
Sbjct: 474 EY 475


>gi|394337315|gb|AFN27645.1| glycosyltransferase, partial [Apogon lateralis]
          Length = 278

 Score = 40.4 bits (93), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 6/117 (5%)

Query: 227 LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           +V+++R+    R ILN  E+  A A+E    V     EE  S     + I  C  +V +H
Sbjct: 52  IVVLSRST--TRLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSIVQVISGCSMLVSMH 108

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
           GA L  SLFL  G+V++++ P          ++  A + G  L Y+ ++  +EE ++
Sbjct: 109 GAQLITSLFLPRGAVVVELFPFAVNPERYTPYKTLASLPGIDLHYISWRNTEEEDTV 165


>gi|440470541|gb|ELQ39608.1| DUF563 domain-containing protein [Magnaporthe oryzae Y34]
 gi|440488522|gb|ELQ68247.1| DUF563 domain-containing protein [Magnaporthe oryzae P131]
          Length = 614

 Score = 40.4 bits (93), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 55/116 (47%), Gaps = 12/116 (10%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE-ESTSLADSFRFIHS-CHAM 281
           K KL  ++R  R  R +L   ++ +A      DV I   +  S S  +  R + S    +
Sbjct: 389 KIKLTFIDR--RGSRKLLGQEKLLEATRRAHPDVEIRSIDFASLSFVEQIRLVRSETDIL 446

Query: 282 VGVHGAGLTHSLFLR-----PGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
           VG HGAGLTH +FL+      G  ++++ P   + +S   F   A +LG EY   K
Sbjct: 447 VGAHGAGLTHIMFLQGEWEGGGKAVVEIQP---EKMSYKGFRNLAYMLGHEYFVAK 499


>gi|300121625|emb|CBK22143.2| unnamed protein product [Blastocystis hominis]
          Length = 549

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 41/84 (48%), Gaps = 1/84 (1%)

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
           +K  L  V+RN    R I N+ E+ +  +         +  +  S A+    + S   + 
Sbjct: 388 SKLALRFVDRNT-ANRGIKNVEEIVELLKNQNGTEFTRQVSQQMSFAEQVISMFSIDILF 446

Query: 283 GVHGAGLTHSLFLRPGSVLMQVVP 306
            VHGAGLT  +F+ PGS ++++ P
Sbjct: 447 SVHGAGLTSVVFMLPGSAVLEIFP 470


>gi|152967605|ref|YP_001363389.1| capsular polysaccharide biosynthesis protein-like protein
           [Kineococcus radiotolerans SRS30216]
 gi|151362122|gb|ABS05125.1| Capsular polysaccharide biosynthesis protein-like [Kineococcus
           radiotolerans SRS30216]
          Length = 381

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 3/79 (3%)

Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
           + ++R A   R +LN   V  A    GF+V       + S+AD      S   + GV GA
Sbjct: 248 IFLDRPATDKRRLLNRDRVLTAVRSGGFEVV---EAGALSVADQAALFASADVVAGVLGA 304

Query: 288 GLTHSLFLRPGSVLMQVVP 306
           GLT+ ++  PG+ +++++P
Sbjct: 305 GLTNLVYCYPGTTVVEILP 323


>gi|307111760|gb|EFN59994.1| hypothetical protein CHLNCDRAFT_49458 [Chlorella variabilis]
          Length = 656

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 24/116 (20%), Positives = 56/116 (48%), Gaps = 9/116 (7%)

Query: 199 LVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDV- 257
           +V  +   A      ++ +++    + ++V + R+   GR +LN  E+ +      ++  
Sbjct: 399 VVGLEPRPAGGGEGGSSDAAAGGTQRLRVVFLQRDGE-GRQLLNAEELLERCNAWRYEPA 457

Query: 258 ---TIFEPE----ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
              T F  E     + ++A+          +VG+HGA + ++ F+RPGS +++++P
Sbjct: 458 GSGTTFAAECRQARTPTVAEGVAVAQEADVLVGMHGANMANAFFMRPGSSVVELMP 513


>gi|11191827|emb|CAC16419.1| hypothetical protein [Verticillium dahliae]
          Length = 291

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 36/66 (54%), Gaps = 4/66 (6%)

Query: 265 STSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG-SVLMQVVPIGTQWLSTVYFEKPARV 323
           + + A+  R +     +VGVHGAGLTH +F+R     +++V P GT  ++   F   A +
Sbjct: 158 ALTFAEQMRTVRETDVLVGVHGAGLTHVMFMRENVGAIVEVQP-GT--MTHAGFRNVAAM 214

Query: 324 LGLEYL 329
            GL Y 
Sbjct: 215 RGLGYF 220


>gi|193786960|dbj|BAG52283.1| unnamed protein product [Homo sapiens]
          Length = 206

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 48/97 (49%), Gaps = 6/97 (6%)

Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
           R ILN  E+  A + +  F+V I + +       D  R  H+    +G+HGAGLTH LFL
Sbjct: 56  RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 115

Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
              + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 116 PDWAAVFELYNCEDE---RCYLD-LARLRGVHYITWR 148


>gi|118344568|ref|NP_001072049.1| glycosyltransferase-like domain-containing protein 2 precursor
           [Takifugu rubripes]
 gi|82076790|sp|Q5NDE4.1|GTDC2_TAKRU RecName: Full=Glycosyltransferase-like domain-containing protein 2;
           AltName: Full=Extracellular O-linked N-acetylglucosamine
           transferase-like; Flags: Precursor
 gi|56605354|emb|CAI30874.1| glycosyltransferase [Takifugu rubripes]
          Length = 590

 Score = 40.0 bits (92), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 61/123 (49%), Gaps = 9/123 (7%)

Query: 224 KPK---LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCH 279
           KPK   +V+ +R+    R ILN  E+  A A+E    V     EE  S     + I    
Sbjct: 293 KPKDEYIVVFSRST--TRLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSIVQVISGAS 349

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEE 337
            +V +HGA L  SLFL PG+V++++ P          +   A + G++  Y+ ++  +EE
Sbjct: 350 MLVSMHGAQLITSLFLPPGAVVVELYPFAVNPDQYTPYRTLASLPGMDLHYIPWRNTEEE 409

Query: 338 SSL 340
           +++
Sbjct: 410 NTV 412


>gi|119493195|ref|ZP_01624070.1| TPR repeat protein [Lyngbya sp. PCC 8106]
 gi|119452760|gb|EAW33937.1| TPR repeat protein [Lyngbya sp. PCC 8106]
          Length = 899

 Score = 40.0 bits (92), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 28/104 (26%), Positives = 52/104 (50%), Gaps = 6/104 (5%)

Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
           P+ + ++R     R ++N  EV +  + LGF V +F   ES S ++  +       +V  
Sbjct: 758 PERIYISRAKARHRQVMNETEVMEVLKPLGF-VEVFL--ESMSFSEQVQLFSQAKVIVAA 814

Query: 285 HGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           HG+GLT+ +F    S  ++V+   +      Y+   ++ LGLE+
Sbjct: 815 HGSGLTNIVFC---SKEVKVIEFISPHYERYYYRVISQYLGLEH 855


>gi|400530920|gb|AFP86611.1| glycosyltransferase, partial [Tetraodon miurus]
          Length = 284

 Score = 40.0 bits (92), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 59/117 (50%), Gaps = 6/117 (5%)

Query: 227 LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           +V+ +R+A   R ILN  E+  A A+E    V     EE  S     + I     +V +H
Sbjct: 48  IVVFSRSA--TRLILNEAELIMALAQEFQMRVLTVSLEEQ-SFPSIIQVISGASMLVSMH 104

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
           GA L  SLFL PG+V++++ P          +   A + G++  Y+ ++  +E+++L
Sbjct: 105 GAQLITSLFLPPGAVVVELFPFAVNPDQYTPYRTLASLPGMDLHYIPWRNTEEQNTL 161


>gi|322786227|gb|EFZ12831.1| hypothetical protein SINV_09199 [Solenopsis invicta]
          Length = 579

 Score = 40.0 bits (92), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 63/301 (20%), Positives = 130/301 (43%), Gaps = 28/301 (9%)

Query: 109 NFFHEFMDCFVPLFITINSHFP-----DQDVILAIADCNDQWARKYAELLPRFTRHPIIN 163
           N +H F D F  L+ +++ +       D D  + I + +  +   + +    FTR+P+ +
Sbjct: 260 NMYHHFCD-FFNLYASLHVNLSHPSAFDTDNHVMIWE-SYSYRSAFQDTFEAFTRNPLWD 317

Query: 164 INN-QTITHCFQSVTLGLISHGRMVI----NPTLLPKPKTLVDFQSFLANAYNENTNTSS 218
           +   +  T CF+++   L+   RM+     N  L+   +    F++F  +  +       
Sbjct: 318 LKTFRGETVCFRNLVFPLLP--RMIFGLFYNTPLIYGCEKSGLFKAFSEHLLHRLQIPLH 375

Query: 219 SFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPEESTSLADSFRFIHS 277
                K ++ L++R+ +  R ILN  E+ KA +E   + V      ++           +
Sbjct: 376 ERKDRKIRVTLLSRDTQY-RKILNENELVKALKENPVYKVRKVVYNKNIPFKKQLEITRN 434

Query: 278 CHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEE 337
               +G+HGAGLTH +FL   + + ++            ++  AR+ G++Y  +K   + 
Sbjct: 435 SDIFIGIHGAGLTHLMFLPDWAAVFEIYNCE----DPNCYQDLARLRGVKYFTWK---DT 487

Query: 338 SSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMD 397
           S LV++         +P   A A ++N    +K     + + R  +   +A+KK   + +
Sbjct: 488 SKLVQQDPGT-----HPDGGAHAKFTNYSFDVKEFLRIVSLARNHVKHHNAFKKFNDIHN 542

Query: 398 K 398
           K
Sbjct: 543 K 543


>gi|394337317|gb|AFN27646.1| glycosyltransferase, partial [Cheilodipterus quinquelineatus]
          Length = 288

 Score = 40.0 bits (92), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 50/190 (26%), Positives = 75/190 (39%), Gaps = 43/190 (22%)

Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
           R ILN  E+  A A+E    V     EE  S     + I  C  +V +HGA L  SLFL 
Sbjct: 59  RLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSIVQVISGCSMLVSMHGAQLITSLFLP 117

Query: 297 PGSVLMQVVPIGTQ----------------------WLST------VYFEKPARVLGLEY 328
            G+V++++ P                          W +T       + ++P    G+ +
Sbjct: 118 RGAVVVELFPFAVNPEQYTPYKTLASLPGIDLHYISWRNTDEENTVTHPDRPWEQGGISH 177

Query: 329 LEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
           LE    ++E  L  K     L  +NP+          R+Y   Q+  +DI  F   LK+ 
Sbjct: 178 LEK--DEQERILASKDVPRHLCCRNPEWL-------FRIY---QDTLVDIPSFLDALKEG 225

Query: 389 YKKAKKLMDK 398
             K K +M K
Sbjct: 226 M-KTKPVMKK 234


>gi|453329814|dbj|GAC88062.1| hypothetical protein NBRC3255_1723 [Gluconobacter thailandicus NBRC
           3255]
          Length = 363

 Score = 40.0 bits (92), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 52/103 (50%), Gaps = 9/103 (8%)

Query: 202 FQSFLANAYNE-NTNTSSSFHHTKPKLVLVNRN--ARVGRTILNLREVKKAAEELGFDVT 258
           F SF+ + YN    N S      K + +LV+R+  +   R   N    +  A E G++  
Sbjct: 208 FHSFVRDFYNSLKPNISI---EKKSRKILVSRSDFSHSNRAFPNRAAFETMAAERGYE-- 262

Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
           I +PE+  SL +  +      A++G HG+G+  +LF  PG+V+
Sbjct: 263 IIQPEK-LSLEEQIQIFSEAAAVIGEHGSGMHSTLFSSPGTVV 304


>gi|354594765|ref|ZP_09012802.1| hypothetical protein CIN_14980 [Commensalibacter intestini A911]
 gi|353671604|gb|EHD13306.1| hypothetical protein CIN_14980 [Commensalibacter intestini A911]
          Length = 372

 Score = 40.0 bits (92), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 20/82 (24%), Positives = 41/82 (50%), Gaps = 3/82 (3%)

Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
           PK   ++R     R + N  E+ +  E+ GF +   E     +L +      +   ++G 
Sbjct: 241 PKRFYIDRRTASNRVLHNEDELVEVLEQRGFAIVQLE---YLTLDEQAGLFSNAEYIIGA 297

Query: 285 HGAGLTHSLFLRPGSVLMQVVP 306
           HGAGL + +F +P + +++++P
Sbjct: 298 HGAGLANMVFCQPETKIIELMP 319


>gi|448310078|ref|ZP_21499930.1| capsular polysaccharide biosynthesis protein-like protein
           [Natronorubrum bangense JCM 10635]
 gi|445588408|gb|ELY42651.1| capsular polysaccharide biosynthesis protein-like protein
           [Natronorubrum bangense JCM 10635]
          Length = 225

 Score = 40.0 bits (92), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 18/70 (25%), Positives = 35/70 (50%), Gaps = 3/70 (4%)

Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
           P  + ++R     R + N  E +K   + G    + E +   SL + F+  +S   ++G+
Sbjct: 92  PDWIYISRQKANRRKVKNYEEFEKTLGKFGIKPVVLEDQ---SLEEQFKLFNSADGVIGL 148

Query: 285 HGAGLTHSLF 294
           HGAG T+ ++
Sbjct: 149 HGAGFTNLIW 158


>gi|380796235|gb|AFE69993.1| putative glycosyltransferase AER61 precursor, partial [Macaca
           mulatta]
          Length = 153

 Score = 40.0 bits (92), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 48/97 (49%), Gaps = 6/97 (6%)

Query: 238 RTILNLREVKKAAEEL-GFDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
           R ILN  E+  A + +  F+V I + +       D  R  H+    +G+HGAGLTH LFL
Sbjct: 3   RKILNQNELVNALKTVPTFEVQIVDYKYRELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 62

Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
              + + ++     +     Y +  AR+ G+ Y+ ++
Sbjct: 63  PDWAAVFELYNCEDE---RCYLDL-ARLRGIHYITWQ 95


>gi|428205791|ref|YP_007090144.1| capsular polysaccharide biosynthesis protein-like protein
           [Chroococcidiopsis thermalis PCC 7203]
 gi|428007712|gb|AFY86275.1| capsular polysaccharide biosynthesis protein-like protein
           [Chroococcidiopsis thermalis PCC 7203]
          Length = 339

 Score = 40.0 bits (92), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 42/87 (48%), Gaps = 4/87 (4%)

Query: 215 NTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF 274
           N  SS     P+ + ++R    GR ++N  +V +A    GF    + PE+  S AD  R 
Sbjct: 185 NIESSQRSFSPR-IYISRAKTTGRKVINEDDVLEALSPFGF--VAYAPEK-MSFADEVRL 240

Query: 275 IHSCHAMVGVHGAGLTHSLFLRPGSVL 301
                 +V  HG+GL + +F +  SV+
Sbjct: 241 FSQAEMVVAPHGSGLVNIIFAQNLSVI 267


>gi|400530654|gb|AFP86478.1| glycosyltransferase, partial [Amia calva]
          Length = 276

 Score = 40.0 bits (92), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 33/120 (27%), Positives = 60/120 (50%), Gaps = 4/120 (3%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
           +P+  +V  +  + R ILN  E+  A A+E          +E  S AD  + I     +V
Sbjct: 33  EPEEYIVVFSRSINRLILNEAELILALAQEFQMKTITVSLDEH-SFADIVQIISKASMLV 91

Query: 283 GVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
            +HGA L  SLFL  G++++++ P          ++  A + G  L+Y+ ++  +EE+S+
Sbjct: 92  SMHGAQLVTSLFLPRGAIVVELFPYAVNPEHYTPYKTIASLPGMDLQYVAWQNTKEENSI 151


>gi|400530688|gb|AFP86495.1| glycosyltransferase, partial [Heterotis niloticus]
          Length = 276

 Score = 39.7 bits (91), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 55/112 (49%), Gaps = 12/112 (10%)

Query: 236 VGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
           + R ILN  E+  A A+E          EE  SLA+  R I     +V +HGA L  SLF
Sbjct: 45  INRLILNEAELILALAQEFQMKAITVSLEEH-SLAEIIRVISGASMLVSMHGAQLITSLF 103

Query: 295 LRPGSVLMQVVPIGTQWLSTVYFEKPARVLG------LEYLEYKIKQEESSL 340
           L  G+ ++++ P     +S  ++  P R L       L+Y+ +K   EE+S+
Sbjct: 104 LPRGAAVVELFPYA---VSPDHY-TPYRTLASLPGMDLQYVAWKNTIEENSV 151


>gi|307180926|gb|EFN68718.1| Uncharacterized glycosyltransferase AER61 [Camponotus floridanus]
          Length = 554

 Score = 39.7 bits (91), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 52/246 (21%), Positives = 110/246 (44%), Gaps = 23/246 (9%)

Query: 109 NFFHEFMDCFVPLFITINSHFP-----DQDVILAIADCNDQWARKYAELLPRFTRHPIIN 163
           N +H F D F  L+ +++ +       D D  + I + +  +   + +    FT++P+ +
Sbjct: 234 NMYHHFCD-FFNLYASLHVNLSHPSVFDTDNHIMIWE-SYSYRSAFQDTFEAFTKNPLWD 291

Query: 164 INN-QTITHCFQSVTLGLISHGRMVI----NPTLLPKPKTLVDFQSFLANAYNENTNTSS 218
           +   +  T CF+++   L+   RM+     N  L+   +    F++F  +  +       
Sbjct: 292 LKTFKGKTVCFKNLVFPLLP--RMIFGLYYNTPLIYGCEKSGLFKAFSDHVLHRLRIPLH 349

Query: 219 SFHHTKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPEESTSLADSFRFIHS 277
                K ++ L++R+ +  R ILN  E+ KA +E   + V      ++ S         +
Sbjct: 350 QRKDRKIRITLLSRDTQY-RKILNENELLKALKENPEYKVKKVVYNKNLSFRKQLEITRN 408

Query: 278 CHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEE 337
               +G+HGAGLTH +FL   + L ++            ++  AR+ G++Y  +   ++ 
Sbjct: 409 SDIFIGIHGAGLTHLMFLPDWAALFEIYNCE----DPNCYKDLARLRGVKYFTW---EDT 461

Query: 338 SSLVEK 343
           S L+++
Sbjct: 462 SKLIQQ 467


>gi|350403304|ref|XP_003486762.1| PREDICTED: uncharacterized glycosyltransferase AER61-like [Bombus
           impatiens]
          Length = 557

 Score = 39.7 bits (91), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 55/262 (20%), Positives = 112/262 (42%), Gaps = 25/262 (9%)

Query: 85  PP--NLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN---SH---FPDQDVIL 136
           PP  N  C +    P  +        N +H F D F  L+ +++   SH   F   + I+
Sbjct: 211 PPIVNHDCDIVIEKPTYIMKIDAI-VNMYHHFCD-FFNLYASLHVNLSHPATFSTDNHIM 268

Query: 137 AIADCNDQWARKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGRMVI----NPT 191
                +  +   + +    FTR+P+ +++  +  T CF+++   L+   RM+     N  
Sbjct: 269 IWESYS--YRSAFQDAFQAFTRNPLWDLHTFRGETVCFKNLVFPLLP--RMIFGLYYNTP 324

Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAE 251
           L+   +    F++F  +  +          + + ++ L++R+ +  R ILN  E+ KA +
Sbjct: 325 LIYGCEKSGLFKAFGDHVLHRLRIPHHERKNQRIRVTLLSRDTQY-RRILNEDELTKALK 383

Query: 252 E-LGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
           E   + V      +  S         +    +G+HGAGLTH +FL   + + ++      
Sbjct: 384 ENPEYKVRKVVYNKKISFKKQLEITRNSDIFIGIHGAGLTHLMFLPDWAAVFEIYNCEDP 443

Query: 311 WLSTVYFEKPARVLGLEYLEYK 332
                 ++  AR+ G++Y  ++
Sbjct: 444 GC----YKDLARLRGVKYFTWE 461


>gi|127459561|gb|ABO28372.1| glycosyltransferase [Amia calva]
          Length = 276

 Score = 39.7 bits (91), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 33/120 (27%), Positives = 60/120 (50%), Gaps = 4/120 (3%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
           +P+  +V  +  + R ILN  E+  A A+E          +E  S AD  + I     +V
Sbjct: 33  EPEEYIVVFSRSINRLILNEAELILALAQEFQMKTITVSLDEH-SFADIVQIISKASMLV 91

Query: 283 GVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
            +HGA L  SLFL  G++++++ P          ++  A + G  L+Y+ ++  +EE+S+
Sbjct: 92  SMHGAQLVTSLFLPRGAIVVELFPYAVNPEHYTPYKTIASLPGMDLQYVAWQNTKEENSI 151


>gi|443692860|gb|ELT94365.1| hypothetical protein CAPTEDRAFT_192992 [Capitella teleta]
          Length = 459

 Score = 39.7 bits (91), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 26/107 (24%), Positives = 54/107 (50%), Gaps = 6/107 (5%)

Query: 226 KLVLVNRNA----RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAM 281
           K+VL+ R+     R  ++I  + E    A E G ++ +F  +   S+  + R  +    +
Sbjct: 321 KIVLIKRSLKRFFRQHKSIAAMLEA--LASEHGLELFVFRDDPVPSIDLTRRMFNEAIMV 378

Query: 282 VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           +  HGAG ++ L+ +PG+V+++ +   +       F+  A +LG+ Y
Sbjct: 379 IAPHGAGESNMLYAQPGTVILEGMCFTSNVRVNTLFKVVANLLGMRY 425


>gi|443728034|gb|ELU14515.1| hypothetical protein CAPTEDRAFT_192300 [Capitella teleta]
          Length = 534

 Score = 39.7 bits (91), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 4/109 (3%)

Query: 224 KPKLVLVNRNARVGRTILNLREVKKA----AEELGFDVTIFEPEESTSLADSFRFIHSCH 279
           +  ++L+ R+A   R      E+  A    A  +G  + +F  +   SL ++        
Sbjct: 363 RDTMLLIQRSAAKTRKFKYHDEIVAALRNEARLVGLRMAVFSDDPLPSLTETVDLFSRAV 422

Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
            ++  HGAG  + LF  PG+VL++ +   +     + F   A+ LGL Y
Sbjct: 423 IVISPHGAGAANLLFAHPGTVLIEALHPSSSHYPNLCFASMAQALGLRY 471


>gi|412991363|emb|CCO16208.1| predicted protein [Bathycoccus prasinos]
          Length = 782

 Score = 39.7 bits (91), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 22/87 (25%), Positives = 43/87 (49%), Gaps = 6/87 (6%)

Query: 256 DVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTV 315
           +V +   + S  +  +   +  C   +GVHGAG+ + ++    SV+ ++VP         
Sbjct: 679 EVELLNIQASDGVEQTRASMQKCGIFIGVHGAGMINEIYTLKDSVIFEMVPDN----RPA 734

Query: 316 YFEKPARVLGLEYLEYKIK--QEESSL 340
           Y+   A + GL+Y+++ I   QE  S+
Sbjct: 735 YYRNCAVLRGLKYVDFGISGTQETKSV 761


>gi|260809811|ref|XP_002599698.1| hypothetical protein BRAFLDRAFT_119381 [Branchiostoma floridae]
 gi|229284979|gb|EEN55710.1| hypothetical protein BRAFLDRAFT_119381 [Branchiostoma floridae]
          Length = 309

 Score = 39.7 bits (91), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 41/77 (53%), Gaps = 5/77 (6%)

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
           S  +  +   +   ++G+HGAGLTHSLFL    VL ++            ++  AR+ G+
Sbjct: 182 SFPEQLKVTQNSDLLIGMHGAGLTHSLFLPDWGVLFELYNCE----DPRCYKDLARLRGV 237

Query: 327 EYLEYKIKQEESSLVEK 343
           +Y  ++ KQE+ +  EK
Sbjct: 238 KYYTWE-KQEKITQTEK 253


>gi|58039897|ref|YP_191861.1| hypothetical protein GOX1456 [Gluconobacter oxydans 621H]
 gi|58002311|gb|AAW61205.1| Hypothetical protein GOX1456 [Gluconobacter oxydans 621H]
          Length = 367

 Score = 39.7 bits (91), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 21/81 (25%), Positives = 44/81 (54%), Gaps = 3/81 (3%)

Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           + + + R +   R + N  E+    E+LGF+     PE + SL++  R       ++G+ 
Sbjct: 230 RRLYIERGSSANRHVPNEAELAAGLEKLGFERV--HPE-TLSLSEQIRLFSDAGMVMGML 286

Query: 286 GAGLTHSLFLRPGSVLMQVVP 306
           GAG+ +  + +PG+++ ++VP
Sbjct: 287 GAGMANIAWCQPGTLVYELVP 307


>gi|22797413|emb|CAD22108.1| beta 1,2-xylosyltransferase [Physcomitrella patens]
          Length = 595

 Score = 39.7 bits (91), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 20/76 (26%), Positives = 40/76 (52%), Gaps = 2/76 (2%)

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
           +L +  + +     ++G HGAGL+H LF  P   +  +V + + +L   +FE  ++ +G+
Sbjct: 499 ALHEQLKIVRESSIIIGAHGAGLSHLLFAMPRETV--IVELSSPFLVRPHFEAMSQWMGM 556

Query: 327 EYLEYKIKQEESSLVE 342
           EY +  +   E+   E
Sbjct: 557 EYHKTDMAISEADCSE 572


>gi|428205790|ref|YP_007090143.1| hypothetical protein Chro_0728 [Chroococcidiopsis thermalis PCC
           7203]
 gi|428007711|gb|AFY86274.1| hypothetical protein Chro_0728 [Chroococcidiopsis thermalis PCC
           7203]
          Length = 424

 Score = 39.7 bits (91), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 79/192 (41%), Gaps = 23/192 (11%)

Query: 106 YNGNFFHEFMDCFVPLFITINSHFPDQDVI---LAIADCNDQWARKYAELLPRFTRHPII 162
           +N  +FH  +DC   L      ++  Q  I   L I     +W ++   LL  +     I
Sbjct: 165 WNSGYFHWLIDCLTRLEGI--EYYQAQTGIKPTLIIHANPQEWQKESLRLLG-YNPDDCI 221

Query: 163 NINNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
             N+  +    + + +        +I+PT     +     Q  L+N  ++NT+ S S   
Sbjct: 222 EWNDSGVK--VKKLVVPSFRREHNIISPTACQWLR-----QRLLSN-LSDNTSISLS--- 270

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
             P+ + ++R    GR I+N  +V      LGF   +    E  S AD  R       +V
Sbjct: 271 --PR-IYISRPKSAGRNIINEADVMAILAPLGF---VAYTMEELSFADEIRLFSQAEIVV 324

Query: 283 GVHGAGLTHSLF 294
             HGAGLT+ +F
Sbjct: 325 APHGAGLTNIIF 336


>gi|356524325|ref|XP_003530780.1| PREDICTED: beta-(1,2)-xylosyltransferase-like [Glycine max]
          Length = 501

 Score = 39.7 bits (91), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%), Gaps = 13/67 (19%)

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKP-----A 321
           SL D  + I     ++G HGAGLTH +   P +V+++++        + YF +P     +
Sbjct: 415 SLKDQVQAIQDASVIIGAHGAGLTHIVSALPKTVILEII--------SSYFRRPHFAYIS 466

Query: 322 RVLGLEY 328
           R  GLEY
Sbjct: 467 RWKGLEY 473


>gi|434392192|ref|YP_007127139.1| capsular polysaccharide biosynthesis protein-like protein
           [Gloeocapsa sp. PCC 7428]
 gi|428264033|gb|AFZ29979.1| capsular polysaccharide biosynthesis protein-like protein
           [Gloeocapsa sp. PCC 7428]
          Length = 405

 Score = 39.7 bits (91), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 49/192 (25%), Positives = 75/192 (39%), Gaps = 22/192 (11%)

Query: 106 YNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCND-QWARKYAELLPRFTRHPIINI 164
           YN N+FH  +DC + L      H       L I D N   W  +  +LL  +     I  
Sbjct: 163 YNRNYFHWIVDCLLRLEGIEYYHAQTGRKPLLIIDSNPPAWKIESLKLLG-YKPEDYIYW 221

Query: 165 NNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLAN--AYNENTNTSSSFHH 222
           N   I    + + +      + +I+P      +     Q  L+N     +  + SS  + 
Sbjct: 222 NGFGIN--IKRLVVSSFRREKSLISPVACHWLR-----QRVLSNLPKSQDKKSYSSRIYI 274

Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
           ++PK          GR ++N  EV KA   LGF   +    E+ S  D          +V
Sbjct: 275 SRPK--------SAGRQVINEEEVLKALTPLGF---VSYTLENMSFTDQVALFSQAEIVV 323

Query: 283 GVHGAGLTHSLF 294
             HGAGLT+ +F
Sbjct: 324 AAHGAGLTNIVF 335


>gi|394337421|gb|AFN27698.1| glycosyltransferase, partial [Hypoplectrus puella]
          Length = 288

 Score = 39.3 bits (90), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 48/182 (26%), Positives = 72/182 (39%), Gaps = 42/182 (23%)

Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
           R ILN  E+  A A+E    V     EE  S     + I     +V +HGA L  SLFL 
Sbjct: 59  RLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSIVQVISGASMLVSMHGAQLITSLFLP 117

Query: 297 PGSVLMQVVPIGTQ----------------------WLST------VYFEKPARVLGLEY 328
            G+V++++ P G                        W +T       Y ++P    G+ +
Sbjct: 118 RGAVVVELFPFGVNPEQYTPYKTLATLPGMDLHYVSWRNTKEENTITYPDRPWEQGGIVH 177

Query: 329 LEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
           LE   +++E  L  K     L  +NP+          R+Y   Q+  +DI  F   LK+ 
Sbjct: 178 LEK--EEKERILASKEVPRHLCCRNPEWL-------FRIY---QDTLVDIPSFLEVLKEG 225

Query: 389 YK 390
            K
Sbjct: 226 MK 227


>gi|168055999|ref|XP_001780010.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668615|gb|EDQ55219.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 595

 Score = 39.3 bits (90), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 19/76 (25%), Positives = 39/76 (51%), Gaps = 2/76 (2%)

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
           +L +  + +     ++G HGAGL+H LF  P   +  +V + + +    +FE  ++ +G+
Sbjct: 499 ALHEQLKIVRESSIIIGAHGAGLSHLLFAMPRETV--IVELSSPFFVRPHFEAMSQWMGM 556

Query: 327 EYLEYKIKQEESSLVE 342
           EY +  +   E+   E
Sbjct: 557 EYHKIDMASSEADCSE 572


>gi|223996669|ref|XP_002288008.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977124|gb|EED95451.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 458

 Score = 39.3 bits (90), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 24/96 (25%), Positives = 44/96 (45%), Gaps = 13/96 (13%)

Query: 258 TIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP-----IGTQWL 312
           T  + EE+TS  +      S H +VG+HGA LT  + L   + +++++P     +   W+
Sbjct: 267 TKVDVEEATSPEEQLLLHRSLHGLVGIHGASLTQGVLLPSHASILELLPWIPNYLQGSWV 326

Query: 313 ST--------VYFEKPARVLGLEYLEYKIKQEESSL 340
            T        + F K   +  L ++ Y + +E   L
Sbjct: 327 QTKHRPTPLGIIFHKTGELSNLNHVGYGLDRESVPL 362


>gi|394337531|gb|AFN27753.1| glycosyltransferase, partial [Diademichthys lineatus]
          Length = 285

 Score = 39.3 bits (90), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 52/106 (49%), Gaps = 4/106 (3%)

Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
           R I+N  E+  A  +EL   V     EE  S +   + I     +V +HGA L  SLFL 
Sbjct: 56  RLIVNEAELILALVQELQLKVVTVSLEEQ-SFSSIIQVISGASILVSMHGAQLITSLFLP 114

Query: 297 PGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
            G+V++++ P G        +   A + G++  YL ++   EE+++
Sbjct: 115 RGAVVVELFPFGVNPEQYTPYRTLATLPGMDLHYLSWRNTNEENTI 160


>gi|53748600|emb|CAF02095.2| beta 1,2-xylosyltransferase [Physcomitrella patens]
          Length = 595

 Score = 39.3 bits (90), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 19/76 (25%), Positives = 39/76 (51%), Gaps = 2/76 (2%)

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
           +L +  + +     ++G HGAGL+H LF  P   +  +V + + +    +FE  ++ +G+
Sbjct: 499 ALHEQLKIVRESSIIIGAHGAGLSHLLFAMPRETV--IVELSSPFFVRPHFEAMSQWMGM 556

Query: 327 EYLEYKIKQEESSLVE 342
           EY +  +   E+   E
Sbjct: 557 EYHKIDMASSEADCSE 572


>gi|123420687|ref|XP_001305813.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121887353|gb|EAX92883.1| hypothetical protein TVAG_467300 [Trichomonas vaginalis G3]
          Length = 530

 Score = 39.3 bits (90), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 55/291 (18%), Positives = 112/291 (38%), Gaps = 36/291 (12%)

Query: 59  KTQPYPRKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCF 118
           +  P+ + +D+ A   +         P N    +  TS  L++         +H   D F
Sbjct: 190 RAPPFDKVTDRFAKEPIVTNITFDEIPRNGKQVIEETS--LIYGVFYNFYMLWHMIYDFF 247

Query: 119 VPL--FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
           +P   FI +   F  +D        +  W  ++  L    +R+  I + N+   + F+  
Sbjct: 248 IPYYSFINVRHQFATRDTRRLFVRSDGFW--RFQALANWISRYEPIVLENRYKPYFFKYA 305

Query: 177 TLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH-------------T 223
            +G          P   P P    +  S L   YN N++T+  F               T
Sbjct: 306 IIGCEK-------PEANPDPYRNNEINSILF-KYNLNSSTAPGFREDVIKYNNIHIENST 357

Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE-ESTSLADSFRFIHSCHAMV 282
           KP ++ + R     R + NL ++ + ++ +     I   E ++  + +    I     ++
Sbjct: 358 KPIVLFIKRGG-AKRDVKNLDKIVELSKTICTFCDIRTVELQNLDIREQIELIAPAQVLI 416

Query: 283 GVHGAGLTHSLFLRPG-----SVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           G HG+GL H +++        + L++++P   ++    ++E  A V  + Y
Sbjct: 417 GFHGSGLAHVIWMEESRKQRQTHLIEIMP--HEYNCRDWYETAASVARVNY 465


>gi|410503822|ref|YP_006941227.1| hypothetical protein BN8_p06856 [Fibrisoma limi BUZ 3]
 gi|387510265|emb|CCH57653.1| hypothetical protein BN8_p06856 [Fibrisoma limi BUZ 3]
          Length = 400

 Score = 39.3 bits (90), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 33/138 (23%), Positives = 67/138 (48%), Gaps = 12/138 (8%)

Query: 209 AYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSL 268
           A  ++     S   TK   + ++R    GR I+N  E+ +  ++ GF VTI   ++  S+
Sbjct: 245 ALKKHIQGRVSIGATKQSRIYISRQG--GRRIVNEDELIQMLQQFGF-VTI--EDKPRSI 299

Query: 269 ADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
            +      +   ++G HGA  T+ ++ +PG+ L+++      ++   YF     +LGL Y
Sbjct: 300 VEQVSIYANASFIIGPHGASFTNVIWCQPGTHLVEL--FSANYVPD-YFLYLCTLLGLRY 356

Query: 329 LEY----KIKQEESSLVE 342
             Y     +K++  +++E
Sbjct: 357 SAYCQGTDVKEDRRAILE 374


>gi|384253738|gb|EIE27212.1| hypothetical protein COCSUDRAFT_26591 [Coccomyxa subellipsoidea
           C-169]
          Length = 493

 Score = 38.9 bits (89), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 30/116 (25%), Positives = 59/116 (50%), Gaps = 11/116 (9%)

Query: 240 ILNLREVKKAAEELGFDVTI-FEPEESTSLADSFRFIHSCHAMVGV----HGAGLTHSLF 294
           ++N  E+  A +++G  + +   P  +T+ A  F    S  A  GV    HG  L +S+F
Sbjct: 309 VVNREELMGALQDIGHTLGLAVRPYTATARA-PFESYLSVMARTGVLVSRHGPLLANSIF 367

Query: 295 LRPGSVLMQVVPIGTQW--LSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
           L PG+V+M+++P   +W  +S +Y      +  + ++ ++    + +L   YG+ D
Sbjct: 368 LPPGAVVMELLPYNWEWKGISKIYVNLTRSLGDIHHIAWRATHPKWAL---YGSAD 420


>gi|400530896|gb|AFP86599.1| glycosyltransferase, partial [Antennarius striatus]
          Length = 287

 Score = 38.9 bits (89), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 53/106 (50%), Gaps = 4/106 (3%)

Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
           R ILN  E+  A A+E    V     EE  S +   + I S   +V +HGA L  SLFL 
Sbjct: 61  RLILNEAELIMALAQEFQMRVVTLSLEEQ-SFSSIVQMISSATILVSMHGAQLITSLFLP 119

Query: 297 PGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
            GSV++++ P          ++  A + G++  Y+ ++   E++S+
Sbjct: 120 RGSVVVELFPYAVNPEQYTPYKTLASLPGMDLHYISWRNNNEDNSI 165


>gi|440803308|gb|ELR24216.1| hypothetical protein ACA1_377010 [Acanthamoeba castellanii str.
           Neff]
          Length = 469

 Score = 38.9 bits (89), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 47/185 (25%), Positives = 79/185 (42%), Gaps = 23/185 (12%)

Query: 223 TKPKLVLVNRNARVGRTILN----LREVKKA----AEELGFDVTIFEPEESTSLADSFRF 274
           T+P++VL+ R     +TILN    L+ ++ A    A+     + +  P E  SL      
Sbjct: 296 TRPRVVLIRRTK--DKTILNSDTLLQNLRSALRNDADVFQVSMELLTPREVVSL------ 347

Query: 275 IHSCHAMVGVHGAGLTHSLFL-RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKI 333
           +     ++G HG+GL H  FL R  S+++   P     LS   F   A  L ++Y     
Sbjct: 348 MQGTTVLIGTHGSGLAHLWFLPRTSSMIVIHAPGADTLLSPKGFSDGAPWL-IDYQTIAS 406

Query: 334 KQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAK 393
           K + ++ V K       +++  A     W     Y+K  NV+++ D      K+A   A 
Sbjct: 407 KLKMNTFVHKVEKQAESIEDCGA---GRWGLYCAYVK--NVEVEADALATLTKEAINAAL 461

Query: 394 KLMDK 398
              D+
Sbjct: 462 ARKDE 466


>gi|384247587|gb|EIE21073.1| hypothetical protein COCSUDRAFT_56993 [Coccomyxa subellipsoidea
           C-169]
          Length = 515

 Score = 38.9 bits (89), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 38/131 (29%), Positives = 59/131 (45%), Gaps = 12/131 (9%)

Query: 228 VLVNRNARVGRTILNLREVKKAAEE----LGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
           VLV +   V R  LN+ ++ K   E     G  V       +  LA +   + S   +V 
Sbjct: 297 VLVEQRHGVVRNFLNMDQLLKDCGESGKLQGTSVACRSATFAGDLAWNIAAVSSADVLVT 356

Query: 284 VHGAGLTHSLFLRPGSVLMQVVP--IGT---QWLSTVYFEKPARVLG--LEYLEYKIKQE 336
           +HG+G  + LF+  GS L++V P   GT   +W +  +  K A   G  ++Y+   I  E
Sbjct: 357 IHGSGSNNVLFMHEGSTLIEVRPYKFGTETPEW-ANFFIPKVAEANGYRVQYIGLNILDE 415

Query: 337 ESSLVEKYGAN 347
           E S    + AN
Sbjct: 416 ELSRPGGFEAN 426


>gi|449273371|gb|EMC82866.1| Putative glycosyltransferase AGO61 [Columba livia]
          Length = 578

 Score = 38.9 bits (89), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 57/226 (25%), Positives = 94/226 (41%), Gaps = 26/226 (11%)

Query: 97  PALVFSAGGYN-GNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPR 155
           P +   A  +N  N  H F D  +P++ T+   F D D + A     + W+      L +
Sbjct: 149 PDVALIANRFNPDNLMHVFHDDLLPIYYTMQ-QFSDLD-LEARLFFMEGWSEGVHFDLYK 206

Query: 156 F--TRHPIININNQTITH--CFQSVTLGL----ISHGRMVINPTLLPKPKTLV------D 201
               + P++    +T+    CF    +GL      +    + P   PK   LV       
Sbjct: 207 LLSNKQPLLREQLKTLGRLLCFTKSYVGLSKITTWYQYGFVQPQG-PKANILVSGNEIRQ 265

Query: 202 FQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIF 260
           F  F+     E  N S     ++  +V+ +R   + R ILN  E+  A A+E        
Sbjct: 266 FTKFMM----EKLNVSLEESSSEEYIVVFSRT--INRLILNEAELILALAQEFQMKTITV 319

Query: 261 EPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
             EE  S +D  R + +   +V +HGA L  SLFL  G+ ++++ P
Sbjct: 320 SLEEH-SFSDIVRLLSNASMLVSMHGAQLVMSLFLPRGATVVELFP 364


>gi|400530878|gb|AFP86590.1| glycosyltransferase, partial [Siganus spinus]
          Length = 267

 Score = 38.9 bits (89), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 54/106 (50%), Gaps = 4/106 (3%)

Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
           R ILN  E+  A A+E    V     E+ T L    + I     +V +HGA L  SLFL 
Sbjct: 51  RLILNEPELIMALAQEFQMKVITVSLEDQT-LPSIVQVISGATMLVSMHGAQLITSLFLP 109

Query: 297 PGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
            G+V++++ P          ++  A + G++  Y+ ++ K+EE+++
Sbjct: 110 RGAVVVELFPFAVNPEQYTPYKTLASLPGMDLHYISWRNKKEENTI 155


>gi|312113531|ref|YP_004011127.1| hypothetical protein Rvan_0752 [Rhodomicrobium vannielii ATCC
           17100]
 gi|311218660|gb|ADP70028.1| hypothetical protein Rvan_0752 [Rhodomicrobium vannielii ATCC
           17100]
          Length = 404

 Score = 38.9 bits (89), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 30/117 (25%), Positives = 49/117 (41%), Gaps = 6/117 (5%)

Query: 193 LPKPKTLVDFQSFLANAYNENTN---TSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA 249
           +P P +    QS L  A  ++        S     PK + ++RN    R +LN  E+   
Sbjct: 221 VPVPNSPRGLQSPLGLARLDDLGRFVAGGSERPDDPKRLYISRNDARLRRVLNEHELMPI 280

Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
               GF+  +            FR   +  A+V  HGAGL H  + +PG+ +++  P
Sbjct: 281 LRSFGFERVVLGKLPIERQVALFR---NAEAVVAPHGAGLAHIAWAKPGTKVVEFFP 334


>gi|400530814|gb|AFP86558.1| glycosyltransferase, partial [Melamphaes polylepis]
          Length = 290

 Score = 38.9 bits (89), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 53/108 (49%), Gaps = 4/108 (3%)

Query: 236 VGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
           V R ILN  E+  A A+E    V     EE T +A   + I     +V +HGA L  SLF
Sbjct: 59  VTRLILNEAELIMALAQEFQMRVVTVSLEEQT-VASIVQLISGASMLVSMHGAQLVTSLF 117

Query: 295 LRPGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
           L  G+ ++++ P          ++  A + G  L+Y+ ++   EE+++
Sbjct: 118 LPRGAAVVELFPYAVNPEQYTPYKTLASLPGMDLQYVAWRNTMEENTI 165


>gi|345483345|ref|XP_001599568.2| PREDICTED: uncharacterized glycosyltransferase AER61-like [Nasonia
           vitripennis]
          Length = 537

 Score = 38.9 bits (89), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 43/183 (23%), Positives = 84/183 (45%), Gaps = 13/183 (7%)

Query: 156 FTRHPIININN-QTITHCFQSVTLGLISHGRMVI----NPTLLPKPKTLVDFQSFLANAY 210
           FT++P+ ++   +  T CF++V   L+   RM+     N  L+   +    F +F  +  
Sbjct: 292 FTKNPLWDLKTFRGETVCFKNVVFPLLP--RMIFGLFYNTPLIYGCEKSGLFTAFSDHVL 349

Query: 211 NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEE-LGFDVTIFEPEESTSLA 269
           +          +TK ++ L++R+ +  R ILN  E+ KA E+   ++V      +     
Sbjct: 350 HRLRIPLHVRKNTKIRVTLLSRDTQY-RRILNEDELVKALEKNPRYEVKKVVYNKHMPFK 408

Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYL 329
                  +    +G+HGAGLTH +FL   +V  ++            ++  AR+ G++YL
Sbjct: 409 KQLEITRNSDIFIGIHGAGLTHFMFLPEWAVGFELYNCE----DASCYKDLARLKGIKYL 464

Query: 330 EYK 332
            ++
Sbjct: 465 TWE 467


>gi|400530724|gb|AFP86513.1| glycosyltransferase, partial [Bathylagus euryops]
          Length = 277

 Score = 38.9 bits (89), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 41/153 (26%), Positives = 70/153 (45%), Gaps = 8/153 (5%)

Query: 194 PKPKTLV---DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA- 249
           PK   LV   + + F A++  E  N +      + +  +V  +  + R ILN  E+  A 
Sbjct: 2   PKANILVSGNEIRQF-ASSLMEKLNITREESAEEKEEYIVVFSRSINRLILNEAELILAL 60

Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
           A+E    V     EE  S A   + +     +V +HGA L  SLFL  G+ ++++ P   
Sbjct: 61  AQEFQMKVVTVSLEEQ-SFAAIVKVVSEATMLVSMHGAQLVSSLFLPRGAAVVELFPYAV 119

Query: 310 QWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
                  ++  A + G  L+Y+ ++   EE+SL
Sbjct: 120 NPEQYAPYKTLASLPGMDLQYVSWRNTIEENSL 152


>gi|397572376|gb|EJK48230.1| hypothetical protein THAOC_32991 [Thalassiosira oceanica]
          Length = 696

 Score = 38.9 bits (89), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 48/89 (53%), Gaps = 18/89 (20%)

Query: 271 SFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV----VPIGTQW--LSTVYFEKPARVL 324
           + +  HS   +VG HG  L +S+F RPG+ ++++    VP  +Q+  LST         L
Sbjct: 564 TVKLFHSATVVVGAHGEALVNSIFSRPGTTVIEMGFESVPQASQYRHLST--------SL 615

Query: 325 GLEYLEYKIKQEESSLVEKYGANDLVLKN 353
           GL++++  + ++  S    +GA + +L+ 
Sbjct: 616 GLKHVDINLAKDSRS----FGAAEYMLRE 640


>gi|307106684|gb|EFN54929.1| expressed protein [Chlorella variabilis]
          Length = 627

 Score = 38.9 bits (89), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 35/161 (21%), Positives = 74/161 (45%), Gaps = 17/161 (10%)

Query: 235 RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
           R  R ++N   + K   E G +V + E   ST        + S   +V VH + L ++ F
Sbjct: 436 RANRRVVNEEALLKMLAEFG-EVRVVEFNASTPFRQQLETMASTSVLVSVHTSNLANAQF 494

Query: 295 LRPGSVLMQVVPIGTQW--LSTVYFEKPARVLGLEYLEYKIK-QEESSLVE-----KYGA 346
           ++PGS + +++     W  L   +  + A +  + +  ++ + + E+  ++     ++G 
Sbjct: 495 MQPGSAVFEIIQRNWFWHGLDKSFQVQTAMMGDIHHYAWRARLRNETEYIQERDAYRFGE 554

Query: 347 NDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKD 387
            + +  N +    A+ +N+       +V++DID FR  L D
Sbjct: 555 WEPLQCNTEECVEAH-TNV-------DVRVDIDAFRALLAD 587


>gi|400530722|gb|AFP86512.1| glycosyltransferase, partial [Argentina silus]
          Length = 277

 Score = 38.5 bits (88), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 40/153 (26%), Positives = 70/153 (45%), Gaps = 8/153 (5%)

Query: 194 PKPKTLV---DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA- 249
           PK   LV   + + F A++  E  N +      + +  +V  +  + R ILN  E+  A 
Sbjct: 2   PKANILVSGNEIRQF-ASSLMEKLNITREESVEEKEEYIVVFSRSINRLILNEAELILAL 60

Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
           A+E    V     EE  S A   + +     +V +HGA L  SLFL  G+ ++++ P   
Sbjct: 61  AQEFQMKVVTVSLEEQ-SFASIVKMVSEASMLVSMHGAQLVSSLFLPRGAAVVELFPYAV 119

Query: 310 QWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
                  ++  A + G  L+Y+ ++   EE+S+
Sbjct: 120 NPEQYAPYKTLASLPGMDLQYVAWRNMIEENSV 152


>gi|326922047|ref|XP_003207263.1| PREDICTED: uncharacterized glycosyltransferase AGO61-like
           [Meleagris gallopavo]
          Length = 577

 Score = 38.5 bits (88), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 2/72 (2%)

Query: 236 VGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
           + R ILN  E+  A A+E          EE  S +D  R I +   +V +HGA L  SLF
Sbjct: 294 INRLILNEAELILALAQEFQMKTITVSLEEH-SFSDIVRLISNASMLVSMHGAQLVMSLF 352

Query: 295 LRPGSVLMQVVP 306
           L  G+ ++++ P
Sbjct: 353 LPRGATVVELFP 364


>gi|400530916|gb|AFP86609.1| glycosyltransferase, partial [Ranzania laevis]
          Length = 267

 Score = 38.5 bits (88), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 53/106 (50%), Gaps = 4/106 (3%)

Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
           R ILN  E+  A A+E    V     EE  S     R I S   ++ +HGA L  SLFL 
Sbjct: 45  RLILNEAELIMALAQEFQMRVVTVSLEEH-SFPSIVRVISSATMLISMHGAQLITSLFLP 103

Query: 297 PGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
            G+V++++ P          ++  A + G++  Y+ ++  +EE+++
Sbjct: 104 RGAVVVELFPFAVNPEQYTPYKTLAFLPGMDLHYISWRNNKEENTI 149


>gi|394337263|gb|AFN27619.1| glycosyltransferase, partial [Lophius americanus]
          Length = 289

 Score = 38.5 bits (88), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 53/106 (50%), Gaps = 4/106 (3%)

Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
           R ILN  E+  A A+E    V     EE  S +   + I     +V +HGA L  SLFL 
Sbjct: 60  RLILNEAELIMALAQEFEMRVVTVSLEEQ-SFSSIIQVISRASMLVSMHGAQLITSLFLP 118

Query: 297 PGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
            G+V++++ P          ++  A + G++  Y+ +K  +EE+++
Sbjct: 119 RGAVVVELFPFAVNPEQYTPYKTLASLPGMDLHYISWKNTKEENTI 164


>gi|307208002|gb|EFN85561.1| Uncharacterized glycosyltransferase AER61 [Harpegnathos saltator]
          Length = 556

 Score = 38.5 bits (88), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 66/315 (20%), Positives = 130/315 (41%), Gaps = 31/315 (9%)

Query: 89  SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP-----DQDVILAIADCND 143
            C V    P  +        N +H F D F  L+ +++ +       D D  + I + + 
Sbjct: 215 DCDVVIEKPTFIMKIDAI-VNMYHHFCD-FFNLYASLHVNLSHPSAFDTDNHIMIWE-SY 271

Query: 144 QWARKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDF 202
            +   + +    FTR+P+ ++   +    CF+++   L+   RM+        P      
Sbjct: 272 SYRSAFQDTFDVFTRNPLWDLKTFRGKIVCFRNLVFPLLP--RMIFG-LYYNTPLIYGCE 328

Query: 203 QSFLANAYNENT--NTSSSFHHTKPKLV---LVNRNARVGRTILNLREVKKAAEEL-GFD 256
           +S L  A+ ++         H  K + +   L++R+ R  R ILN  E+ KA  E   ++
Sbjct: 329 KSGLFKAFGDHVLHRLRIPLHQRKDRRIHVTLLSRDTRY-RKILNEDELVKALRENPEYE 387

Query: 257 VTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVY 316
           V      ++ S  +      +    +G+HGAGLTH +FL   + + ++            
Sbjct: 388 VRKVVYNKNVSFKEQLEITKNSDIYIGMHGAGLTHLMFLPDWAAVFEIYNCE----DPNC 443

Query: 317 FEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKL 376
           ++  AR+ G++Y  +   +  S LV++         +P   A A ++N    +K     +
Sbjct: 444 YKDLARLRGVKYFTW---ENSSKLVQQDPGT-----HPDGGAHAKFTNYSFDVKEFLRIV 495

Query: 377 DIDRFRIYLKDAYKK 391
            + R  +   +A+KK
Sbjct: 496 SLARNHVKHHNAFKK 510


>gi|59709491|ref|NP_001012294.1| glycosyltransferase-like domain-containing protein 2 precursor
           [Gallus gallus]
 gi|82076801|sp|Q5NDE8.1|GTDC2_CHICK RecName: Full=Glycosyltransferase-like domain-containing protein 2;
           AltName: Full=Extracellular O-linked N-acetylglucosamine
           transferase-like; Flags: Precursor
 gi|56605346|emb|CAI30870.1| glycosyltransferase [Gallus gallus]
          Length = 577

 Score = 38.5 bits (88), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 2/72 (2%)

Query: 236 VGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
           + R ILN  E+  A A+E          EE  S +D  R I +   +V +HGA L  SLF
Sbjct: 294 INRLILNEAELILALAQEFQMKTITVSLEEH-SFSDIVRLISNASMLVSMHGAQLVMSLF 352

Query: 295 LRPGSVLMQVVP 306
           L  G+ ++++ P
Sbjct: 353 LPRGATVVELFP 364


>gi|157688958|gb|ABV65039.1| glycosyltransferase [Argentina sialis]
          Length = 277

 Score = 38.5 bits (88), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 40/153 (26%), Positives = 70/153 (45%), Gaps = 8/153 (5%)

Query: 194 PKPKTLV---DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA- 249
           PK   LV   + + F A++  E  N +      + +  +V  +  + R ILN  E+  A 
Sbjct: 2   PKANILVSGNEIRQF-ASSLMEKLNITREESVEEKEEYIVVFSRSINRLILNEAELILAL 60

Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
           A+E    V     EE  S A   + +     +V +HGA L  SLFL  G+ ++++ P   
Sbjct: 61  AQEFQMKVVTVSLEEQ-SFASIVKMVSEASMLVSMHGAQLVSSLFLPRGAAVVELFPYAV 119

Query: 310 QWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
                  ++  A + G  L+Y+ ++   EE+S+
Sbjct: 120 NPEQYAPYKTLASLPGMDLQYVAWRNMIEENSV 152


>gi|86147246|ref|ZP_01065561.1| Capsular polysaccharide biosynthesis protein-like [Vibrio sp.
           MED222]
 gi|85834961|gb|EAQ53104.1| Capsular polysaccharide biosynthesis protein-like [Vibrio sp.
           MED222]
          Length = 432

 Score = 38.5 bits (88), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 24/92 (26%), Positives = 42/92 (45%), Gaps = 3/92 (3%)

Query: 215 NTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF 274
           NT S+ +  K K + + R     R + N+ ++++     GFD   F    S    + +  
Sbjct: 289 NTLSNNYILKNKKIYLQRENTKLRKLSNVLDLERLLYRKGFD---FVDPGSLDFFEQYNL 345

Query: 275 IHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
                 +VG  GA  T+ LF++PGS  + + P
Sbjct: 346 FSQAEVIVGASGAAFTNLLFMKPGSTAISLYP 377


>gi|34499919|gb|AAQ73546.1| beta 1,2 xylosyltransferase [Medicago sativa]
          Length = 511

 Score = 38.5 bits (88), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 33/62 (53%), Gaps = 3/62 (4%)

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
           S+ +  R I     ++G HGAGLTH +   P +V+++++   +      +F   AR  GL
Sbjct: 425 SMKEQVRAIQDASVIIGAHGAGLTHIVSALPKTVILEII---SSQFRRPHFAYIARWKGL 481

Query: 327 EY 328
           EY
Sbjct: 482 EY 483


>gi|410942805|ref|ZP_11374546.1| hypothetical protein GfraN1_00115 [Gluconobacter frateurii NBRC
           101659]
          Length = 363

 Score = 38.5 bits (88), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 48/212 (22%), Positives = 84/212 (39%), Gaps = 34/212 (16%)

Query: 100 VFSAGGYNGNFFHEFMDCFVPLFITINS----HFPDQDVILAIADCNDQWARKYAELLPR 155
           V  A G +   +  ++  ++P F  + +    HF D  ++L  +  + +W  ++  L   
Sbjct: 117 VIVADGISFEIWGHWLIDYLPRFGVVKAVKGEHFADLKILLPHS--SPEWIYRFLALACG 174

Query: 156 FTRHPIININ-NQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNE-- 212
             R  II  N    I  C +++       G                 F SF+ + YN   
Sbjct: 175 VNRDTIIGYNPGSDIVLCREAILPSYCYTGEF--------------SFHSFVRDFYNSLK 220

Query: 213 -NTNTSSSFHHTKPKLVLVNRN--ARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLA 269
            N  T       K + +LV+R   +   R   N    +  A E G++  I  PE+  SL 
Sbjct: 221 PNIGTEK-----KSRKILVSRADFSHGNRPFPNRAAFESMAAERGYE--IIHPEK-LSLE 272

Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
           +         A++G HG+G+  +LF  P +V+
Sbjct: 273 EQIEIFSDAAAVIGEHGSGMHSTLFSGPDTVV 304


>gi|400530880|gb|AFP86591.1| glycosyltransferase, partial [Siganus vulpinus]
          Length = 289

 Score = 38.5 bits (88), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 54/106 (50%), Gaps = 4/106 (3%)

Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
           R ILN  E+  A A+E    V     E+  SL    + I     +V +HGA L  SLFL 
Sbjct: 60  RLILNEPELIMALAQEFQMKVITVSLEDQ-SLPSIVQVISGATMLVSMHGAQLITSLFLP 118

Query: 297 PGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
            G+V++++ P          ++  A + G++  Y+ ++ K+EE+++
Sbjct: 119 RGAVVVELFPFAVNPEQYTPYKTLASLPGMDLHYISWRNKKEENTI 164


>gi|224045437|ref|XP_002198311.1| PREDICTED: glycosyltransferase-like domain-containing protein 2
           isoform 2 [Taeniopygia guttata]
 gi|224045439|ref|XP_002198300.1| PREDICTED: glycosyltransferase-like domain-containing protein 2
           isoform 1 [Taeniopygia guttata]
 gi|224045441|ref|XP_002198318.1| PREDICTED: glycosyltransferase-like domain-containing protein 2
           isoform 3 [Taeniopygia guttata]
 gi|449492799|ref|XP_004175421.1| PREDICTED: glycosyltransferase-like domain-containing protein 2
           [Taeniopygia guttata]
 gi|449492803|ref|XP_004175422.1| PREDICTED: glycosyltransferase-like domain-containing protein 2
           [Taeniopygia guttata]
          Length = 578

 Score = 38.5 bits (88), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 2/72 (2%)

Query: 236 VGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
           + R ILN  E+  A A+E     TI    E  S +D  R I +   +V +HGA L  SLF
Sbjct: 294 INRLILNEAELILALAQEFQMK-TISVSLEEHSFSDIVRLISNASMLVSMHGAQLVMSLF 352

Query: 295 LRPGSVLMQVVP 306
           L  G+ ++++ P
Sbjct: 353 LPRGATVVELFP 364


>gi|222147143|ref|YP_002548100.1| hypothetical protein Avi_0161 [Agrobacterium vitis S4]
 gi|221734133|gb|ACM35096.1| conserved hypothetical protein [Agrobacterium vitis S4]
          Length = 349

 Score = 38.5 bits (88), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 51/237 (21%), Positives = 95/237 (40%), Gaps = 23/237 (9%)

Query: 92  VTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAE 151
           ++  SP ++    G+  N++H  +     L I    H  D +++LA+   N  W R  A 
Sbjct: 95  ISDISPLII--GNGWYKNYYHWTLQAIGCLLIY-KKHRDDGNILLAMPRLNG-W-RNEAL 149

Query: 152 LLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN 211
            L  F    ++ ++     H    +T  L   G        LP P  + +F++  A    
Sbjct: 150 SLTGFNGQ-LLEMDYHHFLHVDDCITSNLTGGGF-----AHLPHPAVMAEFEALAARVTI 203

Query: 212 ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADS 271
           E   +         K+ +   +A   R ++N + V    E  GF+  I  P E  S+ + 
Sbjct: 204 ERRFSR--------KIYVSRLDAGDVRRVVNEKAVCTLLESRGFE--IITPGE-LSVKEQ 252

Query: 272 FRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
                    +V  HGAGL + ++ +PG +  +V+ +         + +  +  GL+Y
Sbjct: 253 VVAFRDAEVIVAPHGAGLANLVYCQPG-IKTRVIELFQASCINACYARVCQAKGLDY 308


>gi|385234654|ref|YP_005795996.1| hypothetical protein KVU_2161 [Ketogulonicigenium vulgare WSH-001]
 gi|343463565|gb|AEM42000.1| hypothetical protein KVU_2161 [Ketogulonicigenium vulgare WSH-001]
          Length = 565

 Score = 38.5 bits (88), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 28/106 (26%), Positives = 48/106 (45%), Gaps = 3/106 (2%)

Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           K + V R +   R I+   E+    E LGF    FE  + T L  + + +     M+  H
Sbjct: 298 KRIYVGRKSGKSRNIVGEDELIARLETLGFKTLYFE--DYTPLEQA-QIMQGVDIMISGH 354

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY 331
           GAG T+ L+ + G+ L+++  + T       F   A V G  ++ +
Sbjct: 355 GAGFTNMLYAKKGATLIEISHLQTARHRFGDFHMHAGVSGARHIHF 400


>gi|310817073|ref|YP_003965037.1| hypothetical protein EIO_2660 [Ketogulonicigenium vulgare Y25]
 gi|308755808|gb|ADO43737.1| conserved hypothetical protein [Ketogulonicigenium vulgare Y25]
          Length = 566

 Score = 38.1 bits (87), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 28/106 (26%), Positives = 48/106 (45%), Gaps = 3/106 (2%)

Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           K + V R +   R I+   E+    E LGF    FE  + T L  + + +     M+  H
Sbjct: 299 KRIYVGRKSGKSRNIVGEDELIARLETLGFKTLYFE--DYTPLEQA-QIMQGVDIMISGH 355

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY 331
           GAG T+ L+ + G+ L+++  + T       F   A V G  ++ +
Sbjct: 356 GAGFTNMLYAKKGATLIEISHLQTARHRFGDFHMHAGVSGARHIHF 401


>gi|345653323|gb|AEO15506.1| glycosyltransferase [Chionobathyscus dewitti]
          Length = 286

 Score = 38.1 bits (87), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 32/117 (27%), Positives = 59/117 (50%), Gaps = 6/117 (5%)

Query: 227 LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           +VL +R+    R ILN  E+  A A+E    V     EE  S   + + I     ++ +H
Sbjct: 48  IVLFSRST--TRLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSTIQVISGASMLISMH 104

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
           GA L  SLFL  G+V++++ P          ++  A + G++  Y+ ++  +EE+++
Sbjct: 105 GAQLITSLFLPRGAVVVELFPFAVNPEQYTPYKTLATLPGMDLHYVSWRNTKEENTI 161


>gi|357459889|ref|XP_003600225.1| Beta 1 2 xylosyltransferase [Medicago truncatula]
 gi|355489273|gb|AES70476.1| Beta 1 2 xylosyltransferase [Medicago truncatula]
          Length = 509

 Score = 38.1 bits (87), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 33/62 (53%), Gaps = 3/62 (4%)

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
           S+ +  R I     ++G HGAGLTH +   P +V+++++   +      +F   AR  GL
Sbjct: 423 SMKEQVRAIQDASVIIGAHGAGLTHIVSALPKTVILEII---SSQFRRPHFAYIARWKGL 479

Query: 327 EY 328
           EY
Sbjct: 480 EY 481


>gi|345653375|gb|AEO15532.1| glycosyltransferase [Bathydraco marri]
          Length = 284

 Score = 38.1 bits (87), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 36/129 (27%), Positives = 63/129 (48%), Gaps = 10/129 (7%)

Query: 219 SFHHTKPK----LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFR 273
           SF   K K    +VL +R+    R ILN  E+  A A+E    V     EE  S   + +
Sbjct: 34  SFEDEKGKRDEYIVLFSRST--TRLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSTVQ 90

Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEY 331
            I     ++ +HGA L  SLFL  G+V++++ P          ++  A + G++  Y+ +
Sbjct: 91  VISGASMLISMHGAQLITSLFLPRGAVVVELFPFAVNPEQYTPYKTLATLPGMDLHYVSW 150

Query: 332 KIKQEESSL 340
           +  +EE+++
Sbjct: 151 RNTKEENTI 159


>gi|345653309|gb|AEO15499.1| glycosyltransferase [Chaenodraco wilsoni]
          Length = 286

 Score = 38.1 bits (87), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 36/131 (27%), Positives = 63/131 (48%), Gaps = 10/131 (7%)

Query: 217 SSSFHHTKPK----LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADS 271
             SF   K K    +VL +R+    R ILN  E+  A A+E    V     EE  S   +
Sbjct: 34  GGSFEDEKGKRDEYIVLFSRST--TRLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPST 90

Query: 272 FRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YL 329
            + I     ++ +HGA L  SLFL  G+V++++ P          ++  A + G++  Y+
Sbjct: 91  VQVISGASMLISMHGAQLITSLFLPRGAVVVELFPFAVNPEQYTPYKTLATLPGMDLHYV 150

Query: 330 EYKIKQEESSL 340
            ++  +EE+++
Sbjct: 151 SWRNTKEENTI 161


>gi|123421413|ref|XP_001305986.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121887536|gb|EAX93056.1| hypothetical protein TVAG_220310 [Trichomonas vaginalis G3]
          Length = 553

 Score = 38.1 bits (87), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 57/269 (21%), Positives = 94/269 (34%), Gaps = 49/269 (18%)

Query: 54  PSLRIKTQPYPRKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYN-GNFFH 112
           P L + +QP P     + +    +  +  + P  L  G         F   G N G  +H
Sbjct: 211 PFLSLSSQPIPNDVVDTRLDN--QPVVVKTNP--LGKGGNKIQGISYFYCRGVNLGMLWH 266

Query: 113 EFMDCFVPLFITI----------NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPII 162
              D   PL+ TI          + HF   D+ +            +   L  FT++PI 
Sbjct: 267 NIFDNVAPLYHTIQEVEGSVTGKDRHFIVTDIYVV---------EVFVLFLKMFTKYPIH 317

Query: 163 NINNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
           NI  + +   F    LGL    +    P       T  DF      +Y+    +   F  
Sbjct: 318 NIQLEKMDIQFDICVLGL---RKFNTRPLPFRNEFTTFDF------SYDPKDISIPGFRE 368

Query: 223 T---------------KPKLVLVNRNARVGRTILNLREVKKAAEE-LGFDVTIFEPEEST 266
           T               KP +++  R     R I+N+  V K  E    F   ++   +S 
Sbjct: 369 TILKELRFTIPIPDPKKPLVIIPLRKNNNTRFIVNMNSVIKEMERRCKFCEFLYLNLDSL 428

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
           ++      I     + G+HG+GL H +++
Sbjct: 429 TIEWQIELISHATVLAGIHGSGLAHQIWM 457


>gi|400530870|gb|AFP86586.1| glycosyltransferase, partial [Acanthurus nigricans]
          Length = 290

 Score = 38.1 bits (87), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 47/179 (26%), Positives = 75/179 (41%), Gaps = 26/179 (14%)

Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
           R ILN  E+  A A+E    V     EE  S     + I S   +V +HGA L  SLFL 
Sbjct: 61  RLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSIVQVISSATMLVSMHGAQLITSLFLP 119

Query: 297 PGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSLV------EKYGAND 348
            G+V++++ P          ++  A + G  L Y+ ++  +EE+++       E+ G   
Sbjct: 120 RGAVVVELFPFAVNPEQYTPYKTLASLPGIDLHYISWRNTKEENTVTHPDRPWEQGGITH 179

Query: 349 LVLKNPQAFAGA------------NWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
           L  +  +   G+             W   R+Y   Q+  +DI  F   LK+  K    L
Sbjct: 180 LEKEEQERILGSKDVPRHLCCRNPEWL-FRIY---QDTLVDIPSFLEVLKEGMKTKPSL 234


>gi|345653315|gb|AEO15502.1| glycosyltransferase [Champsocephalus gunnari]
          Length = 286

 Score = 38.1 bits (87), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 32/117 (27%), Positives = 59/117 (50%), Gaps = 6/117 (5%)

Query: 227 LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           +VL +R+    R ILN  E+  A A+E    V     EE  S   + + I     ++ +H
Sbjct: 48  IVLFSRST--TRLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSTIQVISGASMLISMH 104

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
           GA L  SLFL  G+V++++ P          ++  A + G++  Y+ ++  +EE+++
Sbjct: 105 GAQLITSLFLPRGAVVVELFPFAVNPEQYTPYKTLATLPGMDLHYVSWRNTKEENTI 161


>gi|400530902|gb|AFP86602.1| glycosyltransferase, partial [Himantolophus sagamius]
          Length = 276

 Score = 38.1 bits (87), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 54/120 (45%), Gaps = 7/120 (5%)

Query: 194 PKPKTLV---DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA- 249
           PK   LV   + + F A A  E  N +      K + ++V   +   R ILN  E+  A 
Sbjct: 2   PKANMLVSGNEIRQF-AKALMEKMNITGVEEVEKDEYIVVFSRS-TTRLILNEAELIMAL 59

Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
           A+E    V     EE  S     + I S   +V +HGA L  SLFL  G+V++++ P   
Sbjct: 60  AQEFQMRVVTVSLEEQ-SFPSIIQVISSATMLVSMHGAQLITSLFLPRGAVVVELFPFAV 118


>gi|61845107|emb|CAI70375.1| beta 1,2 xylosyltransferase [Populus tremula x Populus alba]
          Length = 468

 Score = 38.1 bits (87), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 17/62 (27%), Positives = 34/62 (54%), Gaps = 3/62 (4%)

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
           S+ +  R +     ++G HGAGLTH +   P +++++++   + +    +F+  A   GL
Sbjct: 382 SMKEQVRAVQDASVIIGAHGAGLTHIVSATPKTIILEII---SSYFRRPHFQLIAEWKGL 438

Query: 327 EY 328
           EY
Sbjct: 439 EY 440


>gi|345653327|gb|AEO15508.1| glycosyltransferase [Chionodraco hamatus]
          Length = 286

 Score = 38.1 bits (87), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 32/117 (27%), Positives = 59/117 (50%), Gaps = 6/117 (5%)

Query: 227 LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           +VL +R+    R ILN  E+  A A+E    V     EE  S   + + I     ++ +H
Sbjct: 48  IVLFSRST--TRLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSTIQVISGASMLISMH 104

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
           GA L  SLFL  G+V++++ P          ++  A + G++  Y+ ++  +EE+++
Sbjct: 105 GAQLITSLFLPRGAVVVELFPFAVNPEQYTPYKTLATLPGMDLHYVSWRNTKEENTI 161


>gi|296533944|ref|ZP_06896465.1| tetratricopeptide repeat protein [Roseomonas cervicalis ATCC 49957]
 gi|296265725|gb|EFH11829.1| tetratricopeptide repeat protein [Roseomonas cervicalis ATCC 49957]
          Length = 380

 Score = 38.1 bits (87), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 34/67 (50%), Gaps = 3/67 (4%)

Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
           R +LN  EV    + LGF+    E     SL++      S   + GV+G GL ++LF R 
Sbjct: 261 RNLLNEAEVAALLDRLGFESVALE---DFSLSEQVALFRSAECVAGVYGGGLFNTLFCRA 317

Query: 298 GSVLMQV 304
           G+ ++ +
Sbjct: 318 GTRVLSL 324


>gi|392576608|gb|EIW69738.1| hypothetical protein TREMEDRAFT_73612 [Tremella mesenterica DSM
           1558]
          Length = 465

 Score = 38.1 bits (87), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 47/95 (49%), Gaps = 8/95 (8%)

Query: 223 TKPKLVLVNRNARVGRTILN-----LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHS 277
           ++P +  ++R +  GR + N     L E  +   + G     +E  E TS+ + +  I  
Sbjct: 305 SRPVITYLSRQS-TGRRLANASHDALVEELETLSKSGRAEVRYEVFEDTSIPEQWARIAR 363

Query: 278 CHAMVGVHGAGLTHSLFL--RPGSVLMQVVPIGTQ 310
              ++GVHG GLTH +FL   PGS + ++ P   Q
Sbjct: 364 TTILLGVHGNGLTHIIFLPINPGSAVYELQPENCQ 398


>gi|224061108|ref|XP_002300350.1| predicted protein [Populus trichocarpa]
 gi|222847608|gb|EEE85155.1| predicted protein [Populus trichocarpa]
          Length = 508

 Score = 38.1 bits (87), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 17/62 (27%), Positives = 34/62 (54%), Gaps = 3/62 (4%)

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
           S+ +  R +     ++G HGAGLTH +   P +++++++   + +    +F+  A   GL
Sbjct: 422 SMKEQVRAVQDASVIIGAHGAGLTHIVSATPKTIILEII---SSYFRRPHFQLIAEWKGL 478

Query: 327 EY 328
           EY
Sbjct: 479 EY 480


>gi|400530728|gb|AFP86515.1| glycosyltransferase, partial [Macropinna microstoma]
          Length = 271

 Score = 38.1 bits (87), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 31/108 (28%), Positives = 52/108 (48%), Gaps = 4/108 (3%)

Query: 236 VGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
           + R ILN  E+  A A+E    V     EE  S A   + +     +V +HGA L  SLF
Sbjct: 40  ISRLILNEAELILALAQEFQMKVVTVSLEEQ-SFASIVKMVSEASMLVSMHGAQLVSSLF 98

Query: 295 LRPGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
           L  G+ ++++ P          ++  A + G  L+Y+ ++   EE+S+
Sbjct: 99  LPRGAAVVELFPYAVNPEQYAPYKTLASLPGMDLQYVAWRNMIEENSV 146


>gi|400530758|gb|AFP86530.1| glycosyltransferase, partial [Coregonus clupeaformis]
          Length = 282

 Score = 37.7 bits (86), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 58/235 (24%), Positives = 89/235 (37%), Gaps = 53/235 (22%)

Query: 194 PKPKTLV------DFQSFLANAYNENTN---TSSSFHHTKPKLVLVNRNARVGRTILNLR 244
           PK   LV       F S L    N NT      S+       +V+ +R+  + R ILN  
Sbjct: 2   PKANILVSGNEIRQFASSLMEKLNINTQGGGEESAAEEKDEYIVVFSRS--INRLILNEA 59

Query: 245 EVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
           E+  A A+E          EE T   +  + I     +V +HGA L  SLFL  G+V+++
Sbjct: 60  ELILALAQEFQMRAVTVSLEEQT-FPNIVKVISGASMLVSMHGAQLVSSLFLPRGAVVVE 118

Query: 304 VVPIGTQ----------------------WLSTV------YFEKPARVLGLEYLEYKIKQ 335
           + P                          W + V      Y E+P    G+ +LE   ++
Sbjct: 119 LFPYAVNPEQYTPYKTLASLPGMDLQYVAWXNMVEENSVAYPERPWEQGGIAHLEK--EE 176

Query: 336 EESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
           +E  L  K     L  +NP+          R+Y   Q+  +DI      L++  K
Sbjct: 177 QEHILASKEVPRHLCCRNPEWL-------YRIY---QDTIVDIPSLLEALRETVK 221


>gi|321478050|gb|EFX89008.1| hypothetical protein DAPPUDRAFT_304713 [Daphnia pulex]
          Length = 537

 Score = 37.7 bits (86), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 47/105 (44%), Gaps = 7/105 (6%)

Query: 226 KLVLVNRNARVGRTILNLREV--KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
           K+ L++RN +  R ILN + +  K  +    + +   E    T        I      +G
Sbjct: 367 KITLLSRNTQY-RRILNEKALLDKLRSSPRQYTIQRVEFTHETDFRQQLNIIQDTDIFIG 425

Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
           +HGAGLTH LFL   + + ++   G +      +   AR+ G+ Y
Sbjct: 426 MHGAGLTHLLFLPDWASVFELYNCGDEHC----YADLARLRGVHY 466


>gi|154413981|ref|XP_001580019.1| TKL family protein kinase [Trichomonas vaginalis G3]
 gi|121914232|gb|EAY19033.1| TKL family protein kinase [Trichomonas vaginalis G3]
          Length = 881

 Score = 37.7 bits (86), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 43/160 (26%), Positives = 73/160 (45%), Gaps = 22/160 (13%)

Query: 61  QPYPRKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFV- 119
           +P P+ S       VK   +  S P N +   T T P  V     Y+   F+ F++ +  
Sbjct: 368 KPSPKPSQAPQTHAVKPSVL--SGPQNTADNTT-TDPVSVLK--NYHNVNFNAFVEGYAK 422

Query: 120 -------PLFITINSHFPDQ---DVILAIA-DCNDQWARKYAELLPRFTRHP---IININ 165
                  P F T+  +F D    +VI  IA +CN    R   + + +F   P    I IN
Sbjct: 423 TIGIEHFPFFSTVFQNFNDSTPPEVIRFIAKNCNSVMERN-PDFIYQFGGSPYYTTIPIN 481

Query: 166 -NQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQS 204
            N+++    +S  +  +S  ++ I+P L+P+ K L+DF++
Sbjct: 482 SNESVDEIIESFRILFVSAPQIAIHPNLIPQIKKLLDFRA 521


>gi|83715787|emb|CAJ47424.1| Beta-1,2-xylosyltransferase [Vitis vinifera]
          Length = 470

 Score = 37.7 bits (86), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 3/62 (4%)

Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
           S+ +  R I     ++G HGAGLTH +  RP +V+++++   +      +F   ++  GL
Sbjct: 384 SMKEQVRAIQDASVIIGAHGAGLTHIISARPKTVILEII---SSLYRRPHFALISQWKGL 440

Query: 327 EY 328
           EY
Sbjct: 441 EY 442


>gi|400530816|gb|AFP86559.1| glycosyltransferase, partial [Poromitra crassiceps]
          Length = 290

 Score = 37.7 bits (86), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 6/117 (5%)

Query: 227 LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
           +V+ +R+A   R ILN  E+  A A+E    V     EE T  A   + I     +V +H
Sbjct: 52  IVVFSRSA--TRLILNEAELIMALAQEFQMRVVTVSLEEQT-FASIVQVISGASMLVSMH 108

Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
           GA L  SLFL  G+ ++++ P          ++  A + G  L+Y+ ++   EE+++
Sbjct: 109 GAQLVTSLFLPRGAAVVELFPYAVNPEQYTPYKTLASLPGMDLQYVSWRNTMEENTI 165


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.134    0.400 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,420,428,658
Number of Sequences: 23463169
Number of extensions: 270739144
Number of successful extensions: 638064
Number of sequences better than 100.0: 747
Number of HSP's better than 100.0 without gapping: 393
Number of HSP's successfully gapped in prelim test: 354
Number of HSP's that attempted gapping in prelim test: 636119
Number of HSP's gapped (non-prelim): 823
length of query: 404
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 259
effective length of database: 8,957,035,862
effective search space: 2319872288258
effective search space used: 2319872288258
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)