BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 044947
(404 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|118481049|gb|ABK92478.1| unknown [Populus trichocarpa]
Length = 456
Score = 469 bits (1207), Expect = e-129, Method: Compositional matrix adjust.
Identities = 223/389 (57%), Positives = 289/389 (74%), Gaps = 6/389 (1%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
ITCD SH YD+ ING L DP TSTFF++G T+STP + +K PYPRK+DK+A S
Sbjct: 70 ITCDCSHGYYDVWSINGPTLLDPITSTFFAIGATNSTPLDST--VKFHPYPRKTDKNAKS 127
Query: 74 KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
KV ELT+T SAPP SCG+ H+SPA+VFS GGY GNF+H+F D +PL+ITINS DQD
Sbjct: 128 KVNELTLT-SAPPKSSCGIRHSSPAIVFSTGGYTGNFYHQFNDGLLPLYITINSLTLDQD 186
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
VIL + + +D WA+KYA+LL +FTRHPI+N++NQT THCF S +GL++HG +V++P LL
Sbjct: 187 VILVVTNWSDWWAKKYADLLHQFTRHPIVNMDNQTRTHCFPSAIVGLMTHGPLVVDPRLL 246
Query: 194 PKPKTLVDFQSFLANAYNENTN---TSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
P+ KTL+DF + L N Y N TS +P+LVLVNR VGR ILNL+EV KA
Sbjct: 247 PRNKTLLDFHALLQNTYGPRGNYLSTSGKSKGARPQLVLVNRKNGVGREILNLKEVLKAI 306
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
+E+GF +FEP+ + S+ +++R +H HAM+ VHGA +TH LFLR G+V+ ++VPIGT
Sbjct: 307 KEVGFKAIVFEPKRNASVRETYRLLHGSHAMLAVHGAAMTHLLFLRVGTVVGEIVPIGTD 366
Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLK 370
W + ++EKPARVLGLEY++YKI+ ESSL EKY NDLVLKNP AF N++ VY+K
Sbjct: 367 WPAKTFYEKPARVLGLEYMKYKIEVNESSLAEKYRVNDLVLKNPSAFVNGNYTKAMVYMK 426
Query: 371 TQNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
TQNVKLDI RFR YLK+A+ KA++ MDKE
Sbjct: 427 TQNVKLDIVRFRAYLKEAFVKAQRFMDKE 455
>gi|224101421|ref|XP_002312272.1| predicted protein [Populus trichocarpa]
gi|222852092|gb|EEE89639.1| predicted protein [Populus trichocarpa]
Length = 424
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 223/389 (57%), Positives = 289/389 (74%), Gaps = 6/389 (1%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
ITCD SH YD+ ING L DP TSTFF++G T+STP + +K PYPRK+DK+A S
Sbjct: 38 ITCDCSHGYYDVWSINGPTLLDPITSTFFAIGATNSTPLDST--VKFHPYPRKTDKNAKS 95
Query: 74 KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
KV ELT+T SAPP SCG+ H+SPA+VFS GGY GNF+H+F D +PL+ITINS DQD
Sbjct: 96 KVNELTLT-SAPPKSSCGIRHSSPAIVFSTGGYTGNFYHQFNDGLLPLYITINSLTLDQD 154
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
VIL + + +D WA+KYA+LL +FTRHPI+N++NQT THCF S +GL++HG +V++P LL
Sbjct: 155 VILVVTNWSDWWAKKYADLLHQFTRHPIVNMDNQTRTHCFPSAIVGLMTHGPLVVDPRLL 214
Query: 194 PKPKTLVDFQSFLANAYNENTN---TSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
P+ KTL+DF + L N Y N TS +P+LVLVNR VGR ILNL+EV KA
Sbjct: 215 PRNKTLLDFHALLQNTYGPRGNYLSTSGKSKGARPQLVLVNRKNGVGREILNLKEVLKAI 274
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
+E+GF +FEP+ + S+ +++R +H HAM+ VHGA +TH LFLR G+V+ ++VPIGT
Sbjct: 275 KEVGFKAIVFEPKRNASVRETYRLLHGSHAMLAVHGAAMTHLLFLRVGTVVGEIVPIGTD 334
Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLK 370
W + ++EKPARVLGLEY++YKI+ ESSL EKY NDLVLKNP AF N++ VY+K
Sbjct: 335 WPAKTFYEKPARVLGLEYMKYKIEVNESSLAEKYRVNDLVLKNPSAFVNGNYTKAMVYMK 394
Query: 371 TQNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
TQNVKLDI RFR YLK+A+ KA++ MDKE
Sbjct: 395 TQNVKLDIVRFRAYLKEAFVKAQRFMDKE 423
>gi|224108924|ref|XP_002315017.1| predicted protein [Populus trichocarpa]
gi|222864057|gb|EEF01188.1| predicted protein [Populus trichocarpa]
Length = 385
Score = 460 bits (1184), Expect = e-127, Method: Compositional matrix adjust.
Identities = 221/388 (56%), Positives = 285/388 (73%), Gaps = 6/388 (1%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
ITCD SH YDL ING L DP TSTFF+ G T ST P +K +PYPRK+D+ A S
Sbjct: 1 ITCDCSHNDYDLWFINGPTLLDPSTSTFFTTGPTIST--SPDFAVKFRPYPRKTDERARS 58
Query: 74 KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
KV ELT+T SAPP SCG+TH+SPA+VFS GGY GNF+H+F D + L+ITINS ++D
Sbjct: 59 KVNELTLT-SAPPRSSCGITHSSPAIVFSTGGYTGNFYHQFNDGLLALYITINSLSLNRD 117
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
VIL + + +D WA+KYA+LL RFT+HPIIN++NQT THCF S +GL++HG + ++PTL
Sbjct: 118 VILTVTNWSDWWAQKYADLLHRFTKHPIINMDNQTRTHCFPSAIVGLMTHGPLAVDPTLT 177
Query: 194 PKPKTLVDFQSFLANAYNENTNTSSSFHH--TKPKLVLVNRNARVGRTILNLREVKKAAE 251
+ KTL+DF + L + Y+ S+ +P+LVLVNR VGR ILNL+E KA E
Sbjct: 178 -QHKTLLDFHALLESTYSPRGKHVSTLKSKGARPQLVLVNRKNGVGREILNLKEALKAIE 236
Query: 252 ELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQW 311
E+GF +FEP+ + ++ D++R +H HAM+ VHGA +TH LFLR G VL ++VPIGT W
Sbjct: 237 EVGFKAIVFEPKRNGTVGDTYRLLHGSHAMLAVHGAAMTHLLFLRVGMVLGEIVPIGTDW 296
Query: 312 LSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKT 371
L+ ++EKPARVLGLEY++YKI+ ESSL EKYGANDLVLKNPQAF +W +VY+KT
Sbjct: 297 LAKTFYEKPARVLGLEYMKYKIEVNESSLAEKYGANDLVLKNPQAFVNGDWPKAKVYMKT 356
Query: 372 QNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
QNVKLD+ RFR YLK+ + KAK+ MDKE
Sbjct: 357 QNVKLDMVRFRKYLKEVFVKAKRFMDKE 384
>gi|356502586|ref|XP_003520099.1| PREDICTED: uncharacterized glycosyltransferase AER61-like [Glycine
max]
Length = 406
Score = 426 bits (1096), Expect = e-117, Method: Compositional matrix adjust.
Identities = 212/389 (54%), Positives = 274/389 (70%), Gaps = 8/389 (2%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVG-HTDSTPSQPSLRIKTQPYPRKSDKSAM 72
ITCD SH YD+C INGS L D +ST F++G HT +P + K PY K DK+AM
Sbjct: 20 ITCDYSHNGYDICTINGSTLLDQASSTLFALGPHTQQ--DKPHIPFKILPYTLKGDKTAM 77
Query: 73 SKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
S VKE+T+T APP LSCGVTH +PALVFS GGY GNF+HE + F+PLFITINS FP+Q
Sbjct: 78 SNVKEVTLTL-APPKLSCGVTHHTPALVFSVGGYTGNFYHEINENFIPLFITINSLFPNQ 136
Query: 133 DVILAIADCNDQWARKYAELLPRFT-RHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
+VIL + + W +KYAELL F+ H IIN NN + HCF S T+GLI HG M I+P
Sbjct: 137 NVILVVLEGKSWWFKKYAELLSAFSPNHMIINTNNISTVHCFPSATIGLIKHGDMTIDPK 196
Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSSSF--HHTKPKLVLVNRNARVGRTILNLREVKKA 249
LLP PKTL+DF++FL Y ++ +T + + KP+L L++R V R +LN +V K
Sbjct: 197 LLPNPKTLLDFRAFLDKVYTKDDDTPFVYPNENGKPRLTLISRRGNVSRLLLNENDVIKV 256
Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
AEE+GF+V +FEP+ +T +A +R IH+ ++GVHGAGLT+ LFLRPGSVL+QVVPI
Sbjct: 257 AEEIGFNVHVFEPK-NTPMAKVYRLIHASDVLLGVHGAGLTNFLFLRPGSVLVQVVPIEL 315
Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYL 369
W S Y+EKP + LG++Y+EYKI+ ESSL+E++GAN LV K+P AF NWS RVYL
Sbjct: 316 YWASRTYYEKPPKFLGVDYIEYKIEPNESSLLERFGANSLVFKDPPAFHKGNWSKQRVYL 375
Query: 370 KTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
K QNVK+++ RFR YL AY+KAK + K
Sbjct: 376 KEQNVKINVVRFRKYLTKAYEKAKIFISK 404
>gi|224108926|ref|XP_002315018.1| predicted protein [Populus trichocarpa]
gi|222864058|gb|EEF01189.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 214/391 (54%), Positives = 264/391 (67%), Gaps = 11/391 (2%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
I CDRSHR YDLC ING + DP ST + T+ST + K +PYPRK +K M+
Sbjct: 1 INCDRSHRFYDLCTINGPTVLDPVNSTLYLSVPTNSTTVE-----KIRPYPRKWEKPIMA 55
Query: 74 KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD-Q 132
++E T+ +++ L C H PA+VFSA GY GNFFH+F D F+PLFIT+NS FPD Q
Sbjct: 56 GIQEFTLISNSKSPL-CQAQHKLPAIVFSAAGYTGNFFHDFNDGFIPLFITVNSVFPDNQ 114
Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
D IL I+ + W KY +LL +++HP+I N+T THCF S TLGLISHG M I+P L
Sbjct: 115 DFILVISQAQNWWISKYGDLLHTYSKHPVIIPENETSTHCFPSATLGLISHGFMTIDPKL 174
Query: 193 LPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPK--LVLVNRNARVGRTILNLREVKKAA 250
+P + L F +FL AYN N + KP+ LVL RN VGR I N EVK A
Sbjct: 175 MPNSQALTHFHAFLDKAYNHGQNHPWKSNPPKPRARLVLATRNGGVGRVISNQNEVKHLA 234
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
EE+GFDV IFEP T L ++ I+S HAMVGVHGAGLTHSLFLRPG V MQVVPIG
Sbjct: 235 EEIGFDVIIFEPIPQTPLQQAYALINSSHAMVGVHGAGLTHSLFLRPGVVFMQVVPIGAD 294
Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN--MRVY 368
WL+ V F AR +GLEYLEY+I EESSL++KYG N L++K+P F G NWS+ M +Y
Sbjct: 295 WLAEVCFANSARAMGLEYLEYRIGAEESSLIDKYGKNSLLIKDPATFRGQNWSSAIMDIY 354
Query: 369 LKTQNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
LK QNVK+D+ RFR YLK+AYKKAK+ M+KE
Sbjct: 355 LKEQNVKIDLIRFREYLKEAYKKAKEFMEKE 385
>gi|255542124|ref|XP_002512126.1| glycosyltransferase, putative [Ricinus communis]
gi|223549306|gb|EEF50795.1| glycosyltransferase, putative [Ricinus communis]
Length = 394
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 204/390 (52%), Positives = 267/390 (68%), Gaps = 20/390 (5%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
I+CDR+ YDLC ING + DP TS FF V T+S P P L +PYPRK + M+
Sbjct: 20 ISCDRTQLRYDLCWINGQTVLDPTTSAFFVVRSTNSAP--PYLVETIKPYPRKFEAFIMA 77
Query: 74 KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
++KELTIT S P SC V HT+PALVFSAGGY GNFFH+F D F+PL+IT+N+ +PDQD
Sbjct: 78 QIKELTIT-SGPFAPSCQVQHTAPALVFSAGGYTGNFFHDFNDGFIPLYITVNTIYPDQD 136
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
++ +++ D W KY +LL FT HPI+ +N+ T THCF SVT GLISHG M +N L+
Sbjct: 137 FVMVVSEAPDWWISKYVDLLSAFTAHPIVTLND-TSTHCFPSVTFGLISHGFMTMNQRLM 195
Query: 194 PKPKTLVDFQSFLANAYNE----NTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA 249
P KT+ F+ L AY++ N N + S ++P+L++ +RN GR ILN E+ +
Sbjct: 196 PNSKTITQFRGLLDKAYSQSLTSNVNNNLSAPKSRPRLIIASRNGSAGRVILNQDELIEM 255
Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
++ELGFDV IFEP+ +TSL +S+ ++S HAM+GVHGA LTHSLFLRPGSVL+QVVPIG
Sbjct: 256 SKELGFDVIIFEPKANTSLQESYVLVNSSHAMIGVHGAALTHSLFLRPGSVLVQVVPIGL 315
Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYL 369
+W S +F + R L LEY+EYKI EESSLV YG++ L+L +P +
Sbjct: 316 EWASDAFFGRVGRGLKLEYVEYKIGVEESSLVGTYGSDSLLLTDPHG------------I 363
Query: 370 KTQNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
+ QNVKLD+ RFR YLK AYKKAK+ MD+E
Sbjct: 364 QEQNVKLDMKRFREYLKQAYKKAKQFMDRE 393
>gi|186499036|ref|NP_001118256.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
gi|330250598|gb|AEC05692.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
Length = 455
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 199/402 (49%), Positives = 277/402 (68%), Gaps = 12/402 (2%)
Query: 2 KLRPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQ 61
K P ++ + ITCDRSH SYDLC INGS + +PKT T + T +T S P + K +
Sbjct: 57 KQNPKVMAASAKITCDRSHTSYDLCSINGSCILNPKTGTLTLMDRTLTT-SAPLVE-KIR 114
Query: 62 PYPRKSDKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
PYPRKS+ M +++EL +T S P +L SC +TH SPA+VFSAGGY G+ +H+F+D F+
Sbjct: 115 PYPRKSENWIMPRIRELKLT-SGPSDLTRSCDITHDSPAIVFSAGGYTGSIYHDFIDGFI 173
Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ--NQTITHCFQSVT 177
PLFIT NS +PD+D IL + + + W KY ++L F++H I ++ N +ITHCF S T
Sbjct: 174 PLFITANSVYPDRDFILVVVNPKEWWMPKYIDILGTFSKHKTILLDKENASITHCFTSAT 233
Query: 178 LGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVG 237
+GLISHG M I+PT +P K+LVDF + L A N N S KP+L+LV R +G
Sbjct: 234 VGLISHGPMTIDPTQIPNSKSLVDFHNLLDKALNPNL---SIIKINKPRLILVRRYGNIG 290
Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
R ILN E+++ E++GF+V F P ++TSL ++++ I S H M+GVHGA LT LFLRP
Sbjct: 291 RVILNEEEIREMLEDVGFEVITFRPSKTTSLREAYKLIKSSHGMIGVHGAALTQLLFLRP 350
Query: 298 GSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
GSVL+Q+VP+G W+S FE PA+ + L+Y EY++ EESSL+EKY +DLVLK+P A+
Sbjct: 351 GSVLVQIVPVGLGWVSKTCFETPAKAMKLDYTEYRVNVEESSLIEKYSRDDLVLKDPIAY 410
Query: 358 AGANW--SNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMD 397
G +W + M+VYLK Q+V+LD++RFR ++ +AYKKAK MD
Sbjct: 411 RGMDWNVTKMKVYLKDQDVRLDVNRFRKHMNEAYKKAKSFMD 452
>gi|186499027|ref|NP_178435.2| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
gi|330250597|gb|AEC05691.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
Length = 451
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 199/402 (49%), Positives = 277/402 (68%), Gaps = 12/402 (2%)
Query: 2 KLRPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQ 61
K P ++ + ITCDRSH SYDLC INGS + +PKT T + T +T S P + K +
Sbjct: 53 KQNPKVMAASAKITCDRSHTSYDLCSINGSCILNPKTGTLTLMDRTLTT-SAPLVE-KIR 110
Query: 62 PYPRKSDKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
PYPRKS+ M +++EL +T S P +L SC +TH SPA+VFSAGGY G+ +H+F+D F+
Sbjct: 111 PYPRKSENWIMPRIRELKLT-SGPSDLTRSCDITHDSPAIVFSAGGYTGSIYHDFIDGFI 169
Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ--NQTITHCFQSVT 177
PLFIT NS +PD+D IL + + + W KY ++L F++H I ++ N +ITHCF S T
Sbjct: 170 PLFITANSVYPDRDFILVVVNPKEWWMPKYIDILGTFSKHKTILLDKENASITHCFTSAT 229
Query: 178 LGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVG 237
+GLISHG M I+PT +P K+LVDF + L A N N S KP+L+LV R +G
Sbjct: 230 VGLISHGPMTIDPTQIPNSKSLVDFHNLLDKALNPNL---SIIKINKPRLILVRRYGNIG 286
Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
R ILN E+++ E++GF+V F P ++TSL ++++ I S H M+GVHGA LT LFLRP
Sbjct: 287 RVILNEEEIREMLEDVGFEVITFRPSKTTSLREAYKLIKSSHGMIGVHGAALTQLLFLRP 346
Query: 298 GSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
GSVL+Q+VP+G W+S FE PA+ + L+Y EY++ EESSL+EKY +DLVLK+P A+
Sbjct: 347 GSVLVQIVPVGLGWVSKTCFETPAKAMKLDYTEYRVNVEESSLIEKYSRDDLVLKDPIAY 406
Query: 358 AGANW--SNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMD 397
G +W + M+VYLK Q+V+LD++RFR ++ +AYKKAK MD
Sbjct: 407 RGMDWNVTKMKVYLKDQDVRLDVNRFRKHMNEAYKKAKSFMD 448
>gi|4335762|gb|AAD17439.1| unknown protein [Arabidopsis thaliana]
Length = 393
Score = 406 bits (1044), Expect = e-111, Method: Compositional matrix adjust.
Identities = 197/396 (49%), Positives = 275/396 (69%), Gaps = 12/396 (3%)
Query: 8 LSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKS 67
++ + ITCDRSH SYDLC INGS + +PKT T + T +T S P + K +PYPRKS
Sbjct: 1 MAASAKITCDRSHTSYDLCSINGSCILNPKTGTLTLMDRTLTT-SAPLVE-KIRPYPRKS 58
Query: 68 DKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
+ M +++EL +T S P +L SC +TH SPA+VFSAGGY G+ +H+F+D F+PLFIT
Sbjct: 59 ENWIMPRIRELKLT-SGPSDLTRSCDITHDSPAIVFSAGGYTGSIYHDFIDGFIPLFITA 117
Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ--NQTITHCFQSVTLGLISH 183
NS +PD+D IL + + + W KY ++L F++H I ++ N +ITHCF S T+GLISH
Sbjct: 118 NSVYPDRDFILVVVNPKEWWMPKYIDILGTFSKHKTILLDKENASITHCFTSATVGLISH 177
Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNL 243
G M I+PT +P K+LVDF + L A N N S KP+L+LV R +GR ILN
Sbjct: 178 GPMTIDPTQIPNSKSLVDFHNLLDKALNPNL---SIIKINKPRLILVRRYGNIGRVILNE 234
Query: 244 REVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
E+++ E++GF+V F P ++TSL ++++ I S H M+GVHGA LT LFLRPGSVL+Q
Sbjct: 235 EEIREMLEDVGFEVITFRPSKTTSLREAYKLIKSSHGMIGVHGAALTQLLFLRPGSVLVQ 294
Query: 304 VVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW- 362
+VP+G W+S FE PA+ + L+Y EY++ EESSL+EKY +DLVLK+P A+ G +W
Sbjct: 295 IVPVGLGWVSKTCFETPAKAMKLDYTEYRVNVEESSLIEKYSRDDLVLKDPIAYRGMDWN 354
Query: 363 -SNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMD 397
+ M+VYLK Q+V+LD++RFR ++ +AYKKAK MD
Sbjct: 355 VTKMKVYLKDQDVRLDVNRFRKHMNEAYKKAKSFMD 390
>gi|357487363|ref|XP_003613969.1| Glycosyltransferase, putative [Medicago truncatula]
gi|355515304|gb|AES96927.1| Glycosyltransferase, putative [Medicago truncatula]
Length = 451
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 196/392 (50%), Positives = 267/392 (68%), Gaps = 4/392 (1%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
ITCDRS++ +D+C ++ L DP + T F++ T QP + +K QPYP KSDK+AMS
Sbjct: 61 ITCDRSNQRFDICSMSNPTLLDPTSLTLFTLN--PHTRIQPYINVKIQPYPLKSDKNAMS 118
Query: 74 KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
V+E+T+T++ P + CGVTH PALVF+A GYNGNF+HE D F+PLFITINS F DQD
Sbjct: 119 SVREVTLTSAPPKSSQCGVTHNIPALVFNARGYNGNFYHEINDIFIPLFITINSLFHDQD 178
Query: 134 VILAIADCNDQWARKYAELLPRFT-RHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
VIL I D W +KY +LL F+ H IIN NN T HCF S +GLI HG + INP L
Sbjct: 179 VILVIVDGMTWWYQKYVDLLNAFSPNHKIINTNNLTTAHCFPSAVVGLIKHGPVTINPKL 238
Query: 193 LPKPKTLVDFQSFLANAY-NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAE 251
LP PKTL+DF +FL NAY E+T +++KP L LV+R R ILN EV K A+
Sbjct: 239 LPNPKTLLDFSTFLKNAYIKEDTPLLFPSNNSKPLLTLVDRKGSSSRVILNQEEVVKLAK 298
Query: 252 ELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQW 311
E+GF+V + + + +++A+ +R +HS H ++GVHGAGLT+ FLR GSV++QVVPIG +W
Sbjct: 299 EVGFNVHVLDHSKDSTMANVYRLVHSSHVLLGVHGAGLTNLFFLRQGSVVVQVVPIGLEW 358
Query: 312 LSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKT 371
S ++ P+ LGLEY+EYK++ ESSL YG + L++K+P+A+ W VYLK
Sbjct: 359 ASDTCYKNPSPFLGLEYVEYKVEANESSLSWDYGVDSLMVKDPKAYTEGKWEKSIVYLKN 418
Query: 372 QNVKLDIDRFRIYLKDAYKKAKKLMDKEIKLA 403
QNVK+D+ RF+ +L AY+KAK M+ ++A
Sbjct: 419 QNVKIDLVRFKKWLMKAYEKAKMFMNSTSQVA 450
>gi|359473475|ref|XP_002266097.2| PREDICTED: uncharacterized glycosyltransferase AER61-like [Vitis
vinifera]
Length = 389
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 197/394 (50%), Positives = 264/394 (67%), Gaps = 12/394 (3%)
Query: 11 AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
AK I+CDRSH+ YD+C +N + + +P TSTFF V P+Q +L K +PYPRK + S
Sbjct: 2 AKQISCDRSHQRYDMCTVNATTVLEPTTSTFFLV-----EPTQ-ALMEKVRPYPRKWETS 55
Query: 71 AMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
M ++KE+ +T S PP+ SC V H++PALVFSAGGY GN FH+F D F+PL+IT+NS FP
Sbjct: 56 VMGRIKEVRLT-SGPPSPSCQVHHSAPALVFSAGGYTGNVFHDFNDGFIPLYITVNSIFP 114
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
D + +L I W KYA+LL ++HPI+N+ THCF +GLISHG M I+P
Sbjct: 115 DGNYVLVIHSSRRWWESKYADLLHTLSKHPIVNLEKANATHCFPYAHVGLISHGFMTIDP 174
Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVLVNRNARVGRTILNLREVK 247
TL+P L F+ FL AY +N S S +P+LVLV R+ GR ILN ++
Sbjct: 175 TLMPSSINLTHFRGFLDAAYAQNHPFPSPNISKQKARPRLVLVTRSGGAGRHILNQGDLN 234
Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
AAEE+GF V +F P +TSL +++ I+S HAM+GVHGA LTHSLFLRPGSVLMQVVP+
Sbjct: 235 NAAEEVGFHVILFHPTPTTSLREAYALINSSHAMMGVHGAALTHSLFLRPGSVLMQVVPL 294
Query: 308 GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN--M 365
G W + F +R LGLEY+EYKI ++ES+L +KYG +D+++K+P G WS M
Sbjct: 295 GLAWAAETCFGNSSRELGLEYMEYKIGEKESTLADKYGNDDIMVKDPVRAQGKGWSTKIM 354
Query: 366 RVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
VYLK QN+ LD+ RFR +L++AY KA K + +
Sbjct: 355 DVYLKEQNITLDLVRFRRHLEEAYNKASKFLQNK 388
>gi|147781784|emb|CAN72290.1| hypothetical protein VITISV_007350 [Vitis vinifera]
Length = 515
Score = 396 bits (1018), Expect = e-108, Method: Compositional matrix adjust.
Identities = 197/387 (50%), Positives = 261/387 (67%), Gaps = 12/387 (3%)
Query: 11 AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
AK I+CDRS + YD C +N + + +P T TFF V P+Q +L K +PYPRK + S
Sbjct: 38 AKQISCDRSXQRYDXCTVNATTVLEPTTXTFFLV-----EPTQ-ALVEKVRPYPRKWETS 91
Query: 71 AMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
M ++KE+++T S PP+ SC V H++PALVFSAGGY GN FH+F D F+PLFIT+NS FP
Sbjct: 92 VMGRIKEVSLT-SGPPSPSCQVHHSAPALVFSAGGYTGNVFHDFNDGFIPLFITVNSIFP 150
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
D + +L I +C W KYA+LL ++HPI+N+ THCF +GLISHG M I+P
Sbjct: 151 DGNYVLVIHNCRRWWESKYADLLHTLSKHPIVNLEKANATHCFPYAHVGLISHGFMTIDP 210
Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVLVNRNARVGRTILNLREVK 247
TL+P L F+ FL AY +N S S +P+LVLV R+ GR ILN ++
Sbjct: 211 TLMPSSINLTHFRGFLDAAYAQNHPFPSPNISKQKARPRLVLVTRSGGAGRHILNQGDLH 270
Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
AAEE+GF V +F P +TSL +++ I+S HAM+GVHGA LTHSLFLRPGSVLMQVVP+
Sbjct: 271 NAAEEVGFHVILFHPTPTTSLREAYALINSSHAMMGVHGAALTHSLFLRPGSVLMQVVPL 330
Query: 308 GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN--M 365
G W + F +R LGLEY+EYKI ++ESSL +KYG +D+++K+P G WS M
Sbjct: 331 GLAWAAETCFGNSSRELGLEYMEYKIGEKESSLADKYGNDDIMVKDPVRAQGKGWSTKIM 390
Query: 366 RVYLKTQNVKLDIDRFRIYLKDAYKKA 392
VYLK QN+ LD+ RFR +L++AY KA
Sbjct: 391 DVYLKEQNITLDLVRFRRHLEEAYNKA 417
>gi|79551059|ref|NP_178436.2| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
gi|49660111|gb|AAT68346.1| hypothetical protein At2g03370 [Arabidopsis thaliana]
gi|60547697|gb|AAX23812.1| hypothetical protein At2g03370 [Arabidopsis thaliana]
gi|330250599|gb|AEC05693.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
Length = 452
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 192/394 (48%), Positives = 270/394 (68%), Gaps = 16/394 (4%)
Query: 11 AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
+ +ITCDRSH +YDLC INGS D KT T T P+ L K +PYP+K+D
Sbjct: 65 SATITCDRSHTNYDLCSINGSCNLDLKTGTL-----TLMDPTSAPLVEKIRPYPKKADNW 119
Query: 71 AMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
M +++ELT+T S P L SC +TH PA+VFSAGGY G+ +H+ MD F+PLFIT NS
Sbjct: 120 IMPRIRELTLT-SGPLGLPRSCDITHDLPAIVFSAGGYTGSIYHDLMDGFIPLFITANSV 178
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRH-PIININNQTI--THCFQSVTLGLISHGR 185
+PD+D I + + + W KY ++L F++H PI+ ++ +++ THCF S +GLI+H
Sbjct: 179 YPDRDFIPVVVNAKEWWMPKYIDILGTFSKHKPILLLDKESVATTHCFTSAIVGLITHWP 238
Query: 186 MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE 245
M I+PT +P K+LVDF + L A+ N +T + KP+L+LV+R +GR ILN +E
Sbjct: 239 MTIDPTQIPNSKSLVDFHNLLEKAFTTNISTPKTH---KPRLMLVSRYGNIGRVILNEQE 295
Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
+K+ E++GF+V IF P ++T+L ++++ I S H MVGVHGA LTH LFLRPGS+ +QVV
Sbjct: 296 IKEMLEDVGFEVIIFRPSKTTNLKEAYKLIKSSHGMVGVHGAALTHLLFLRPGSIFVQVV 355
Query: 306 PIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW--S 363
P+G W S +E PA+ + LEYLEYK+ EESSL+EKY +DLVLK+P A+ G +W +
Sbjct: 356 PLGLGWASKPCYESPAKTMKLEYLEYKVNVEESSLIEKYNRDDLVLKDPIAYRGMDWNAT 415
Query: 364 NMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMD 397
M+VYLK Q+V LD++RFR ++ +AYKKAK MD
Sbjct: 416 KMKVYLKEQDVSLDVNRFRKHMNEAYKKAKIFMD 449
>gi|4335761|gb|AAD17438.1| unknown protein [Arabidopsis thaliana]
Length = 458
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 192/394 (48%), Positives = 270/394 (68%), Gaps = 16/394 (4%)
Query: 11 AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
+ +ITCDRSH +YDLC INGS D KT T T P+ L K +PYP+K+D
Sbjct: 71 SATITCDRSHTNYDLCSINGSCNLDLKTGTL-----TLMDPTSAPLVEKIRPYPKKADNW 125
Query: 71 AMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
M +++ELT+T S P L SC +TH PA+VFSAGGY G+ +H+ MD F+PLFIT NS
Sbjct: 126 IMPRIRELTLT-SGPLGLPRSCDITHDLPAIVFSAGGYTGSIYHDLMDGFIPLFITANSV 184
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRH-PIININNQTI--THCFQSVTLGLISHGR 185
+PD+D I + + + W KY ++L F++H PI+ ++ +++ THCF S +GLI+H
Sbjct: 185 YPDRDFIPVVVNAKEWWMPKYIDILGTFSKHKPILLLDKESVATTHCFTSAIVGLITHWP 244
Query: 186 MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE 245
M I+PT +P K+LVDF + L A+ N +T + KP+L+LV+R +GR ILN +E
Sbjct: 245 MTIDPTQIPNSKSLVDFHNLLEKAFTTNISTPKTH---KPRLMLVSRYGNIGRVILNEQE 301
Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
+K+ E++GF+V IF P ++T+L ++++ I S H MVGVHGA LTH LFLRPGS+ +QVV
Sbjct: 302 IKEMLEDVGFEVIIFRPSKTTNLKEAYKLIKSSHGMVGVHGAALTHLLFLRPGSIFVQVV 361
Query: 306 PIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW--S 363
P+G W S +E PA+ + LEYLEYK+ EESSL+EKY +DLVLK+P A+ G +W +
Sbjct: 362 PLGLGWASKPCYESPAKTMKLEYLEYKVNVEESSLIEKYNRDDLVLKDPIAYRGMDWNAT 421
Query: 364 NMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMD 397
M+VYLK Q+V LD++RFR ++ +AYKKAK MD
Sbjct: 422 KMKVYLKEQDVSLDVNRFRKHMNEAYKKAKIFMD 455
>gi|357487365|ref|XP_003613970.1| Glycosyltransferase, putative [Medicago truncatula]
gi|355515305|gb|AES96928.1| Glycosyltransferase, putative [Medicago truncatula]
Length = 543
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 197/384 (51%), Positives = 256/384 (66%), Gaps = 9/384 (2%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
ITCDRS++ +DLC +N L DP + T F++G T Q + +K +P+P K+D +AMS
Sbjct: 66 ITCDRSNKDFDLCTMNSPTLLDPTSLTLFALG--PHTRIQHHIHMKIKPFPLKNDTNAMS 123
Query: 74 KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN---SHFP 130
+ ELT+T++ + CGVTH SPALVFS GGY GNF+H+ + F+PLFITIN SH
Sbjct: 124 PISELTLTSAPLKSSQCGVTHHSPALVFSVGGYTGNFYHDMNEIFIPLFITINYSLSH-- 181
Query: 131 DQDVILAIADCNDQWARKYAELLPRFT-RHPIININNQTITHCFQSVTLGLISHGRMVIN 189
DQDVIL I D W KY +LL F+ H IIN NN T THCF S +GLI HG+M+I+
Sbjct: 182 DQDVILVIIDVKPWWFEKYVDLLSAFSPNHKIINTNNLTTTHCFPSAIVGLIKHGQMIID 241
Query: 190 PTLLPKPKTLVDFQSFLANAY-NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKK 248
P LLP PKTL+DF SFL AY E+ KP L LV+R R ILN EV K
Sbjct: 242 PKLLPNPKTLLDFHSFLKRAYVKEDIPFVYLNSKGKPILTLVSRKGSSSRDILNEEEVIK 301
Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
AE++GF+V + +P S+AD+F+ IHS H ++GVHGAGLT+ LFLR GSV +QVVPIG
Sbjct: 302 LAEDVGFNVRVLKPSRDFSVADAFKLIHSSHVLLGVHGAGLTNLLFLRQGSVSVQVVPIG 361
Query: 309 TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVY 368
+W S Y+ KP ++LGLEY+EYK++ ESSL +YGA+ LV+K+P+A+ W VY
Sbjct: 362 LEWASETYYNKPTKILGLEYVEYKVEANESSLSWEYGADSLVIKDPKAYTEGKWDKQLVY 421
Query: 369 LKTQNVKLDIDRFRIYLKDAYKKA 392
LK QNVK+D+ RFR L Y+K
Sbjct: 422 LKKQNVKIDLIRFRNCLTKVYEKV 445
>gi|297817994|ref|XP_002876880.1| serine carboxypeptidase [Arabidopsis lyrata subsp. lyrata]
gi|297322718|gb|EFH53139.1| serine carboxypeptidase [Arabidopsis lyrata subsp. lyrata]
Length = 428
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 187/393 (47%), Positives = 265/393 (67%), Gaps = 36/393 (9%)
Query: 11 AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
+ +ITCDRSH YDLC INGS + DPKT TF + T +T SL K +PYPRK++
Sbjct: 63 SATITCDRSHSDYDLCSINGSCILDPKTGTFTLMDPTFAT----SLVEKIRPYPRKAENW 118
Query: 71 AMSKVKELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
M ++KELT++ S P L+ C +TH SPA+VFSAGGY G+ +H+F+D F+PLFIT NS
Sbjct: 119 TMPRIKELTLS-SGPLGLTRACDITHDSPAIVFSAGGYTGSIYHDFLDGFIPLFITANSV 177
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ--NQTITHCFQSVTLGLISHGRM 186
+PD+D IL + + + W KY ++L F++H I ++ N + THCF SVT+GL+SHG
Sbjct: 178 YPDRDFILVVVNSKEWWMPKYIDILGAFSKHKTILLDKENASFTHCFPSVTVGLVSHGPK 237
Query: 187 VINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
+I+PT +P K+L P+L+LV+R +GR ILN +E+
Sbjct: 238 IIDPTQIPNSKSL-------------------------PRLILVSRYGNIGRVILNEKEI 272
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
K+ E++GF+V F ++TS+ ++++ I S H MVGVHGA LTH LFLRPGS+L+QVVP
Sbjct: 273 KEMLEDVGFEVVTFRSSKTTSVREAYKLIKSSHVMVGVHGAALTHLLFLRPGSMLVQVVP 332
Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW--SN 364
+G W S +E PA+ + LEY+EY++ EESSLVEKY +DLVLK+P A+ G +W +
Sbjct: 333 LGLGWASKTCYESPAKAMKLEYIEYRVNVEESSLVEKYNRDDLVLKDPIAYRGMDWNVTK 392
Query: 365 MRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMD 397
M+VYLK Q+V+LD++RFR ++ +AY+KAK MD
Sbjct: 393 MKVYLKEQDVRLDVNRFRKHMNEAYEKAKLFMD 425
>gi|297738189|emb|CBI27390.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 363 bits (932), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 184/391 (47%), Positives = 246/391 (62%), Gaps = 41/391 (10%)
Query: 11 AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
AK I+CDRSH+ YD+C +N + + +P TSTFF V P+Q +L K +PYPRK + S
Sbjct: 2 AKQISCDRSHQRYDMCTVNATTVLEPTTSTFFLV-----EPTQ-ALMEKVRPYPRKWETS 55
Query: 71 AMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
M ++KE+ +T S PP+ SC V H++PALVFSAGGY GN FH+F D F+PL+IT+NS FP
Sbjct: 56 VMGRIKEVRLT-SGPPSPSCQVHHSAPALVFSAGGYTGNVFHDFNDGFIPLYITVNSIFP 114
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
D + +L I W KYA+LL ++HPI+N+ THCF +GLISHG M I+P
Sbjct: 115 DGNYVLVIHSSRRWWESKYADLLHTLSKHPIVNLEKANATHCFPYAHVGLISHGFMTIDP 174
Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
TL+P L F+ FL AY +N H P
Sbjct: 175 TLMPSSINLTHFRGFLDAAYAQN--------HPFP------------------------T 202
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
EE+GF V +F P +TSL +++ I+S HAM+GVHGA LTHSLFLRPGSVLMQVVP+G
Sbjct: 203 EEVGFHVILFHPTPTTSLREAYALINSSHAMMGVHGAALTHSLFLRPGSVLMQVVPLGLA 262
Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN--MRVY 368
W + F +R LGLEY+EYKI ++ES+L +KYG +D+++K+P G WS M VY
Sbjct: 263 WAAETCFGNSSRELGLEYMEYKIGEKESTLADKYGNDDIMVKDPVRAQGKGWSTKIMDVY 322
Query: 369 LKTQNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
LK QN+ LD+ RFR +L++AY KA K + +
Sbjct: 323 LKEQNITLDLVRFRRHLEEAYNKASKFLQNK 353
>gi|255542122|ref|XP_002512125.1| glycosyltransferase, putative [Ricinus communis]
gi|223549305|gb|EEF50794.1| glycosyltransferase, putative [Ricinus communis]
Length = 390
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 154/303 (50%), Positives = 204/303 (67%), Gaps = 5/303 (1%)
Query: 12 KSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRI-KTQPYPRKSDKS 70
+ I CDRS+++YDLC I+G DP STF+ V PS P + K +PYPRK +
Sbjct: 74 RKINCDRSNKAYDLCTIHGPTFLDPTVSTFYLVDPQSQGPSPPLHSMEKIRPYPRKWETV 133
Query: 71 AMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
M+++KELT+T S P + C V H PALVFSAGGY GNFFH+F D +PLFIT+ + F
Sbjct: 134 TMNRIKELTLT-SGPSSPPCQVHHNVPALVFSAGGYTGNFFHDFNDGLIPLFITVKTVFS 192
Query: 131 D-QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
D QD +L I+ D W KYA+LL F+++PIIN++N + THCF S +GL+SHG M IN
Sbjct: 193 DDQDFVLVISKARDWWVSKYADLLRAFSKYPIINLDNDSSTHCFPSANIGLVSHGFMTIN 252
Query: 190 PTLLPKPKTLVDFQSFLANAYNENTNTSSSFH--HTKPKLVLVNRNARVGRTILNLREVK 247
P LLP ++ F + L AY + N S F+ +P+LV+ +R+ VGR ILN EVK
Sbjct: 253 PKLLPNSQSFTHFHALLDKAYGHHQNQPSEFNSARKRPRLVITSRSGSVGRLILNQNEVK 312
Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
K A+ +GFDVT+FEP T L +++ I+S HAM+GVHGA LTHSLFLRPGSV +QVVP+
Sbjct: 313 KIAQNIGFDVTVFEPTPHTPLREAYALINSSHAMIGVHGAALTHSLFLRPGSVFLQVVPL 372
Query: 308 GTQ 310
G +
Sbjct: 373 GNE 375
>gi|148910102|gb|ABR18133.1| unknown [Picea sitchensis]
Length = 456
Score = 310 bits (795), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 152/396 (38%), Positives = 244/396 (61%), Gaps = 12/396 (3%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
P + + + I CDRSH DLC G S+F + D S + K +PY
Sbjct: 67 PGSYNEGEGIVCDRSHFRTDLCTAFGHVQMLANLSSFL-LHAQDKINS--GIEEKVRPYT 123
Query: 65 RKSDKSAMSKVKELTITT---SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
RK +K M+ V E+T+ + ++ N++C V H PA+V+S GY GN +HEF D +PL
Sbjct: 124 RKWEKDVMAIVHEVTLKSVMLTSSSNVNCDVVHDVPAIVYSTSGYTGNLYHEFNDGIIPL 183
Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
+IT + +++V+ I DC++ W KY E+L + T++ +IN N+T+ HCF VT GL
Sbjct: 184 YIT--TQHLEKEVVFVIVDCHNWWLTKYDEILKQLTKYRVINFENETMVHCFPEVTAGLF 241
Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTIL 241
HG ++I+P+L+ K+++DF++ + AY + + +P+L ++ R R IL
Sbjct: 242 IHGDLMIDPSLMFHNKSILDFRALINRAYTPHWFIPEP-NSDQPRLTILVREG--NRVIL 298
Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
NL+EV AE+LGF+VT+++P +T L ++ ++S H ++GVHGA LTH LF+RPGSV
Sbjct: 299 NLKEVVGLAEQLGFNVTVWKPLRTTELKTTYALLNSSHVLLGVHGAALTHFLFMRPGSVF 358
Query: 302 MQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGAN 361
+QV+P+GT+W + YF +PA +G +Y+ YKI+ EES+L KY ND++L NP+A
Sbjct: 359 IQVIPLGTEWAAHTYFGEPAERMGFQYIGYKIRLEESTLSHKYSKNDIILTNPRAVVQQG 418
Query: 362 WS-NMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
W+ ++YL++Q+V +++ R + L +A +KA K M
Sbjct: 419 WAVTKQIYLESQDVIINLSRMKRVLINAKRKANKFM 454
>gi|302816141|ref|XP_002989750.1| glycosyltransferase belonging to CAZy family GT61 [Selaginella
moellendorffii]
gi|300142527|gb|EFJ09227.1| glycosyltransferase belonging to CAZy family GT61 [Selaginella
moellendorffii]
Length = 460
Score = 290 bits (743), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 150/385 (38%), Positives = 228/385 (59%), Gaps = 17/385 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CDRSH D+C + G D ++S+F V ST RIK PY RK ++S M V
Sbjct: 76 CDRSHPRSDVCYLKGDVRMDSRSSSFVLVAKNASTRLGEE-RIK--PYTRKWEQSCMDIV 132
Query: 76 KELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVI 135
E+ + A C V H+ PA+VF+ GGY GN +HEF D +PL+IT S +++V+
Sbjct: 133 HEVRVRAGA--ERRCDVYHSVPAVVFTTGGYTGNVYHEFHDGLIPLYIT--SQHLNREVV 188
Query: 136 LAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPK 195
+ ++ W KY +++ + + HP+I+ + HCF VT+GL H M I P+L+P
Sbjct: 189 FVGVELHNWWLTKYGDVIAQMSNHPVIDFDRDERIHCFPEVTVGLHIHDEMAIEPSLMPG 248
Query: 196 PKTLVDFQSFLANAYNENTNTSSSFHHT-------KPKLVLVNRNARVGRTILNLREVKK 248
+T+VDF++ L AY E + +P+L ++ RN R ILNL E+
Sbjct: 249 NQTIVDFRNLLDAAYQEELAQAPEPPPPSPASSIGQPRLTIIARND--TRVILNLDEIVG 306
Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
A ELGF V I +P+ ++ L +R ++S ++GVHGA +TH LF+RPGSV +QVVP+G
Sbjct: 307 MARELGFWVEIRKPDRTSELKRIYRALNSSDVLLGVHGAAMTHFLFMRPGSVFIQVVPLG 366
Query: 309 TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-V 367
T+W + Y+ +PA+ LGL+Y+ Y+I+ ESSL ++Y ND VL +P + W+ ++ +
Sbjct: 367 TKWAAAAYYGQPAQKLGLDYIGYEIEASESSLSDRYDENDTVLTDPAKISTQGWAVVKEI 426
Query: 368 YLKTQNVKLDIDRFRIYLKDAYKKA 392
YL+ QNV+L + RF+ L DA +KA
Sbjct: 427 YLEGQNVRLSLPRFKRTLLDARRKA 451
>gi|18405766|ref|NP_565952.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
gi|13877689|gb|AAK43922.1|AF370603_1 Unknown protein [Arabidopsis thaliana]
gi|16930451|gb|AAL31911.1|AF419579_1 At2g41640/T32G6.16 [Arabidopsis thaliana]
gi|2618699|gb|AAB84346.1| expressed protein [Arabidopsis thaliana]
gi|27764926|gb|AAO23584.1| At2g41640/T32G6.16 [Arabidopsis thaliana]
gi|330254916|gb|AEC10010.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
Length = 500
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 142/391 (36%), Positives = 230/391 (58%), Gaps = 20/391 (5%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
+I CDR+ D+C++ G + +S+ F + + ++P K +PY RK + S M
Sbjct: 98 TICCDRTGLRSDICVMKGDVRTNSASSSIFLFTSSTNNNTKPE---KIKPYTRKWETSVM 154
Query: 73 SKVKELTITTSAPPNLS---CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
V+EL + T S C V H PA+ FS GGY GN +HEF D +PLFIT S
Sbjct: 155 DTVQELNLITKDSNKSSDRVCDVYHDVPAVFFSTGGYTGNVYHEFNDGIIPLFIT--SQH 212
Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
++ V+ I + +D W KY +++ + + +P+++ N T THCF+ T+GL H + +N
Sbjct: 213 YNKKVVFVIVEYHDWWEMKYGDVVSQLSDYPLVDFNGDTRTHCFKEATVGLRIHDELTVN 272
Query: 190 PTLLPKPKTLVDFQSFLANAYNE----------NTNTSSSFHHTKPKLVLVNRNARVGRT 239
+L+ +T+VDF++ L Y+ N ++ KPKLV+++RN R
Sbjct: 273 SSLVIGNQTIVDFRNVLDRGYSHRIQSLTQEETEANVTALDFKKKPKLVILSRNGS-SRA 331
Query: 240 ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGS 299
ILN + + AE+ GF+V + P+++T +A +R +++ M+GVHGA +TH LFL+P +
Sbjct: 332 ILNENLLVELAEKTGFNVEVLRPQKTTEMAKIYRSLNTSDVMIGVHGAAMTHFLFLKPKT 391
Query: 300 VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
V +Q++P+GT W + Y+ +PA+ LGL+Y+ YKI +ESSL E+YG +D V+++P +
Sbjct: 392 VFIQIIPLGTDWAAETYYGEPAKKLGLKYVGYKIAPKESSLYEEYGKDDPVIRDPDSLND 451
Query: 360 ANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
W ++YL+ QNVKLD+ RFR L +Y
Sbjct: 452 KGWEYTKKIYLQGQNVKLDLRRFRETLTRSY 482
>gi|334184861|ref|NP_001189728.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
gi|330254917|gb|AEC10011.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
Length = 492
Score = 280 bits (717), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 140/384 (36%), Positives = 227/384 (59%), Gaps = 20/384 (5%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
+I CDR+ D+C++ G + +S+ F + + ++P K +PY RK + S M
Sbjct: 98 TICCDRTGLRSDICVMKGDVRTNSASSSIFLFTSSTNNNTKPE---KIKPYTRKWETSVM 154
Query: 73 SKVKELTITTSAPPNLS---CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
V+EL + T S C V H PA+ FS GGY GN +HEF D +PLFIT S
Sbjct: 155 DTVQELNLITKDSNKSSDRVCDVYHDVPAVFFSTGGYTGNVYHEFNDGIIPLFIT--SQH 212
Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
++ V+ I + +D W KY +++ + + +P+++ N T THCF+ T+GL H + +N
Sbjct: 213 YNKKVVFVIVEYHDWWEMKYGDVVSQLSDYPLVDFNGDTRTHCFKEATVGLRIHDELTVN 272
Query: 190 PTLLPKPKTLVDFQSFLANAYNE----------NTNTSSSFHHTKPKLVLVNRNARVGRT 239
+L+ +T+VDF++ L Y+ N ++ KPKLV+++RN R
Sbjct: 273 SSLVIGNQTIVDFRNVLDRGYSHRIQSLTQEETEANVTALDFKKKPKLVILSRNGS-SRA 331
Query: 240 ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGS 299
ILN + + AE+ GF+V + P+++T +A +R +++ M+GVHGA +TH LFL+P +
Sbjct: 332 ILNENLLVELAEKTGFNVEVLRPQKTTEMAKIYRSLNTSDVMIGVHGAAMTHFLFLKPKT 391
Query: 300 VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
V +Q++P+GT W + Y+ +PA+ LGL+Y+ YKI +ESSL E+YG +D V+++P +
Sbjct: 392 VFIQIIPLGTDWAAETYYGEPAKKLGLKYVGYKIAPKESSLYEEYGKDDPVIRDPDSLND 451
Query: 360 ANWS-NMRVYLKTQNVKLDIDRFR 382
W ++YL+ QNVKLD+ RFR
Sbjct: 452 KGWEYTKKIYLQGQNVKLDLRRFR 475
>gi|297824111|ref|XP_002879938.1| hypothetical protein ARALYDRAFT_483244 [Arabidopsis lyrata subsp.
lyrata]
gi|297325777|gb|EFH56197.1| hypothetical protein ARALYDRAFT_483244 [Arabidopsis lyrata subsp.
lyrata]
Length = 498
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 139/391 (35%), Positives = 229/391 (58%), Gaps = 20/391 (5%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
+I CDR+ D+C + G + +S+ F + ++P K +PY RK + S M
Sbjct: 96 TICCDRTGLRSDICEMKGDIRTNSASSSIFLFTSSTKNNTKPE---KIKPYTRKWETSVM 152
Query: 73 SKVKELTITT---SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
V+EL + T ++ + C V H PA+ FS GGY GN +HEF D +PLFIT S
Sbjct: 153 DTVQELNLITKDSNSSSDRVCDVYHDVPAVFFSTGGYTGNVYHEFNDGIIPLFIT--SQH 210
Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
++ V+ I + +D W KY +++ + + +P+++ + THCF+ T+GL H + +N
Sbjct: 211 YNKKVVFVIVEYHDWWEMKYGDIVSQLSDYPLVDFSGDARTHCFKEATVGLRIHDELTVN 270
Query: 190 PTLLPKPKTLVDFQSFLANAYNE----------NTNTSSSFHHTKPKLVLVNRNARVGRT 239
+L+ +T+VDF++ L Y+ N ++ KPKLV+++RN R
Sbjct: 271 SSLVIGNQTIVDFRNVLDRGYSHRIQSLIQEETEANVTALDFKKKPKLVILSRNGS-SRA 329
Query: 240 ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGS 299
ILN + + AEE GF+V + P+++T +A +R +++ M+GVHGA +TH LFL+P +
Sbjct: 330 ILNENLLVELAEETGFNVEVLRPQKTTEMAKIYRSLNTSDVMIGVHGAAMTHFLFLKPKT 389
Query: 300 VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
V +Q++P+GT W + Y+ +PA+ LGL+Y+ YKI +ESSL E+YG +D ++++P +
Sbjct: 390 VFIQIIPLGTDWAAETYYGEPAKKLGLKYIGYKIAPKESSLYEEYGKDDPIIRDPDSLND 449
Query: 360 ANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
W ++YL+ QNVKLD+ RFR L +Y
Sbjct: 450 KGWEYTKKIYLQGQNVKLDLRRFRETLTRSY 480
>gi|115436744|ref|NP_001043125.1| Os01g0498300 [Oryza sativa Japonica Group]
gi|55297629|dbj|BAD69004.1| HGA1-like protein [Oryza sativa Japonica Group]
gi|56202275|dbj|BAD73716.1| HGA1-like protein [Oryza sativa Japonica Group]
gi|113532656|dbj|BAF05039.1| Os01g0498300 [Oryza sativa Japonica Group]
gi|125526083|gb|EAY74197.1| hypothetical protein OsI_02077 [Oryza sativa Indica Group]
gi|125570517|gb|EAZ12032.1| hypothetical protein OsJ_01912 [Oryza sativa Japonica Group]
gi|215697344|dbj|BAG91338.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 536
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 151/421 (35%), Positives = 222/421 (52%), Gaps = 36/421 (8%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKT-STFFSVGHTDSTPSQPSLRIKTQPY 63
P A +I CDR+ D+C++ G + + S F V D++ + + +PY
Sbjct: 105 PCAAVSNGTICCDRTAMRTDVCIMRGDVRTEAASNSLFLLVPPPDNSTAAAGRDERIRPY 164
Query: 64 PRKSDKSAMSKVKELTITT-----SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCF 118
RK + S MS + EL + +AP SC V H PA+VFS GGY GN +HEF D
Sbjct: 165 TRKWESSIMSTIDELRLRAVPEGGAAPA--SCDVRHDVPAVVFSTGGYTGNVYHEFNDGI 222
Query: 119 VPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTL 178
+PL+IT + ++ V+ + + +D W KY ++ + + + I+ N THCF +
Sbjct: 223 IPLYITARQY--NKKVVFVMLEYHDWWMTKYGHIVEQLSDYAPIDFTNDRRTHCFPEAVV 280
Query: 179 GLISHGRMVINPTLLPKPKTLVDFQSFLANAY-----------------------NENTN 215
GL H + I+ +P +T+ DF+ L +AY +
Sbjct: 281 GLRIHDELAIDAARMPGNRTIQDFRRMLDDAYRGRIQMIIEEEEKAAAVALGTPTQGSIR 340
Query: 216 TSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFI 275
S+ KP+LV+V+RN R I N E+ +AA GF V + +P + T LA +R +
Sbjct: 341 KKSALKDDKPRLVIVSRNG--SRAIENEAELVRAAAGAGFRVAVLQPRQDTELAKMYRAL 398
Query: 276 HSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQ 335
++ MVGVHGA +TH LF+RPGSV +QVVP+GT W + Y+ +PAR LGL Y+ YKIK
Sbjct: 399 NASDVMVGVHGAAMTHFLFMRPGSVFIQVVPLGTDWAAETYYGEPARRLGLRYMPYKIKP 458
Query: 336 EESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
ESSL +Y +D VL +P W +VYL QNV+LD+ RFR L+DAY +
Sbjct: 459 AESSLYRQYAKDDAVLTDPDTVNAKGWQVTKKVYLDGQNVRLDMVRFRRRLRDAYDHWAE 518
Query: 395 L 395
L
Sbjct: 519 L 519
>gi|449518915|ref|XP_004166481.1| PREDICTED: uncharacterized LOC101213872 [Cucumis sativus]
Length = 518
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 144/401 (35%), Positives = 226/401 (56%), Gaps = 24/401 (5%)
Query: 10 PAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDS----TPSQPSLRI-KTQPYP 64
P +I CDRS D+C++ G D +S+ F DS +++ K +PY
Sbjct: 108 PNGTICCDRSSIRSDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYT 167
Query: 65 RKSDKSAMSKVKELTITTSAPPN-----LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
RK +K+ M + EL + N C V H PA+ FS GGY GN +HEF D +
Sbjct: 168 RKWEKNTMDTIDELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGIL 227
Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLG 179
PL+IT SH +++V+ I + + W KYA++L + + +P+I++ THCF V G
Sbjct: 228 PLYIT--SHSMNKEVVFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAG 285
Query: 180 LISHGRMVINPTLLPKPKTLVDFQSFLANAYN---------ENTNTSSSFHHTK-PKLVL 229
L H + ++P+L+ K++VDF++ L AY E S H +K PKLV+
Sbjct: 286 LRIHDELTVDPSLMEGGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPKLVV 345
Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGL 289
++R R I N + + K AE +GF+V + P+++T LA +R ++ + +VGVHGA +
Sbjct: 346 LSRKGS-SRVITNEKLMVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAM 404
Query: 290 THSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDL 349
THSLF+RP +V +Q++P+GT W + Y+ +PA+ LGL+Y+ Y+I +ESSL + +D
Sbjct: 405 THSLFMRPNAVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDP 464
Query: 350 VLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
VL NP + W ++YL QNV+L++ RF L+ AY
Sbjct: 465 VLVNPDSITKKGWEYTKKIYLDGQNVRLNLGRFEKRLERAY 505
>gi|449468035|ref|XP_004151727.1| PREDICTED: uncharacterized protein LOC101213872 [Cucumis sativus]
Length = 518
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 144/401 (35%), Positives = 226/401 (56%), Gaps = 24/401 (5%)
Query: 10 PAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDS----TPSQPSLRI-KTQPYP 64
P +I CDRS D+C++ G D +S+ F DS +++ K +PY
Sbjct: 108 PNGTICCDRSSIRSDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYT 167
Query: 65 RKSDKSAMSKVKELTITTSAPPN-----LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
RK +K+ M + EL + N C V H PA+ FS GGY GN +HEF D +
Sbjct: 168 RKWEKNTMDTIDELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGIL 227
Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLG 179
PL+IT SH +++V+ I + + W KYA++L + + +P+I++ THCF V G
Sbjct: 228 PLYIT--SHNMNKEVVFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAG 285
Query: 180 LISHGRMVINPTLLPKPKTLVDFQSFLANAYN---------ENTNTSSSFHHTK-PKLVL 229
L H + ++P+L+ K++VDF++ L AY E S H +K PKLV+
Sbjct: 286 LRIHDELTVDPSLMEGGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPKLVV 345
Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGL 289
++R R I N + + K AE +GF+V + P+++T LA +R ++ + +VGVHGA +
Sbjct: 346 LSRKGS-SRVITNEKLMVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAM 404
Query: 290 THSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDL 349
THSLF+RP +V +Q++P+GT W + Y+ +PA+ LGL+Y+ Y+I +ESSL + +D
Sbjct: 405 THSLFMRPNAVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDP 464
Query: 350 VLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
VL NP + W ++YL QNV+L++ RF L+ AY
Sbjct: 465 VLVNPDSITKKGWEYTKKIYLDGQNVRLNLGRFEKRLERAY 505
>gi|15228253|ref|NP_187643.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
gi|6056194|gb|AAF02811.1|AC009400_7 unknown protein [Arabidopsis thaliana]
gi|28973746|gb|AAO64189.1| unknown protein [Arabidopsis thaliana]
gi|29824255|gb|AAP04088.1| unknown protein [Arabidopsis thaliana]
gi|110736729|dbj|BAF00327.1| hypothetical protein [Arabidopsis thaliana]
gi|332641370|gb|AEE74891.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
Length = 494
Score = 270 bits (690), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 145/407 (35%), Positives = 220/407 (54%), Gaps = 24/407 (5%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
P +SI+CDR+ D+C + G +S+ F D T Q L+ K +PY
Sbjct: 91 PKTSQKEESISCDRTGYRSDICFMKGDIRTHSPSSSIFLYTSNDLTTDQ-VLQEKIKPYT 149
Query: 65 RKSDKSAMSKVKELTITTSA----PPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
RK + S M + EL + T C V H PA++FS GGY GN +HEF D +P
Sbjct: 150 RKWETSIMETIPELKLVTKDMKLFGDKRKCEVIHEVPAVLFSTGGYTGNLYHEFNDGLIP 209
Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
L+IT S ++ V+ IA+ + W KY ++L + + + +I+ N THCF+ +GL
Sbjct: 210 LYIT--SKRFNKKVVFVIAEYHKWWEMKYGDVLSQLSDYSLIDFNKDKRTHCFKEAIVGL 267
Query: 181 ISHGRMVINPTLLPKPKTLV-DFQSFLANAYNENTN-----TSSSFHH--------TKPK 226
HG + ++P+ + T + +F++ L AY N FH +PK
Sbjct: 268 RIHGELTVDPSQMQDDGTTINEFRNVLDRAYRPRINRLDRLEEQRFHARLAQRRKAKRPK 327
Query: 227 LVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHG 286
L L +R R I N + K A+ +GFD+ + P+ +T LA +R ++S MVGVHG
Sbjct: 328 LALFSRTG--SRGITNEDLMVKMAQRIGFDIEVLRPDRTTELAKIYRVLNSSKVMVGVHG 385
Query: 287 AGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGA 346
A +TH LF++PGS+ +Q++P+GT W + Y+ +PA+ LGL+Y YKI ESSL EKY
Sbjct: 386 AAMTHFLFMKPGSIFIQIIPLGTDWAAETYYGEPAKKLGLDYNGYKILPRESSLYEKYDK 445
Query: 347 NDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKA 392
+D +LK+P + W + +YL Q V+LD+ RF+ L DAY K+
Sbjct: 446 DDPILKDPNSITKKGWQFTKGIYLNDQKVRLDLHRFKKLLIDAYAKS 492
>gi|224139884|ref|XP_002323324.1| predicted protein [Populus trichocarpa]
gi|222867954|gb|EEF05085.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 149/421 (35%), Positives = 230/421 (54%), Gaps = 45/421 (10%)
Query: 11 AKSITCDRSHRSYDLCLINGSALFDPKTST--FFSVGHTDSTPSQPSLRI---------K 59
++I CDRS D+C++ G +S+ F+ + +S ++ S + K
Sbjct: 13 VRTICCDRSSMRSDVCVMKGDVRTHSASSSIFLFTSRNNNSVMNKVSSLVDKDEELHHEK 72
Query: 60 TQPYPRKSDKSAMSKVKELTI---TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMD 116
+PY RK + S M + EL + T ++ N C V H PA+ FS GGY GN +HEF D
Sbjct: 73 IKPYTRKWETSVMDSIDELGLIAKTENSRINHHCDVMHDVPAVFFSTGGYTGNVYHEFND 132
Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
+PL+IT + HF + V+ I D ++ W KY +L + +P I+ + THCF
Sbjct: 133 GILPLYIT-SQHFKKK-VVFVILDYHNWWIMKYGNILSLLSDYPAIDFSGDKKTHCFPEA 190
Query: 177 TLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY-------------------------- 210
GL H + ++P+L+ + K++VDF++FL AY
Sbjct: 191 IAGLRIHDELTVDPSLMQENKSIVDFRNFLDRAYWPRIKSMIKGEERGAQKKLELKAHSS 250
Query: 211 NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLAD 270
+N KPKLV+++RN R I N + K AEE+GF V + PE +T LA
Sbjct: 251 KKNLKQVHEATLKKPKLVILSRNG--SRAITNENLLVKMAEEIGFRVEVMRPEPTTELAR 308
Query: 271 SFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLE 330
+R ++S M+GVHGA +TH LF++PGSV +QV+P+GT+W + Y+ +PAR LGL+Y+
Sbjct: 309 IYRALNSSEVMIGVHGAAMTHFLFMKPGSVFIQVIPLGTEWAADAYYGEPARKLGLKYIG 368
Query: 331 YKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAY 389
Y+I ESSL +KY ND VL++P++ + W + +YL QNV+L++ RF+ L AY
Sbjct: 369 YQILPRESSLYDKYDKNDPVLRDPRSVSDKGWQYTKSIYLDNQNVRLNLGRFQRRLLRAY 428
Query: 390 K 390
+
Sbjct: 429 R 429
>gi|195615916|gb|ACG29788.1| glycosyltransferase [Zea mays]
Length = 494
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 148/414 (35%), Positives = 220/414 (53%), Gaps = 36/414 (8%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
P A ++ CDR+ D+C++ G +++ F + +S S+P+ + +PY
Sbjct: 69 PCADVANDTVCCDRTALRTDVCVMRGDVRTQAASNSLFLLLPPNS--SRPATDERIRPYT 126
Query: 65 RKSDKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF 122
RK + S MS + EL + SAP C V H PA+VFS GGY GN +HEF D +PL+
Sbjct: 127 RKWESSIMSTIDELRLR-SAPEGAPSRCDVRHDVPAVVFSTGGYTGNVYHEFNDGIIPLY 185
Query: 123 ITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS 182
IT H+ ++ V+ + + +D W KY ++ + + +P I+ N THCF +GL
Sbjct: 186 ITAR-HY-NKKVVFVMLEYHDWWMTKYGHIVEQLSDYPPIDFTNDRRTHCFPEAVVGLRI 243
Query: 183 HGRMVINPTLLPKPKTLVDFQSFLANAYNENTNT-------------------------- 216
H + I+ +P +T+ DF+ L +A+ T
Sbjct: 244 HDELAIDAARMPANQTIQDFRLMLDDAHRGRIQTIIEEEKKEAAAARRQEHRGSKRSRQR 303
Query: 217 SSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIH 276
S + KP+L +++RN R I N E+ +AA GF V + +P + T LA +R ++
Sbjct: 304 SKTALAEKPRLTIISRNG--SRAIENEAELVRAAAGAGFRVAVLQPRQDTELARMYRALN 361
Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
+ MVGVHGA +TH LF+RPGSV +QVVP+GT W + Y+ +PAR LGL YL YKI
Sbjct: 362 ASDVMVGVHGAAMTHFLFMRPGSVFIQVVPLGTDWAAETYYGEPARRLGLRYLPYKILPS 421
Query: 337 ESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
ESS+ KY +D VL +P W +VYL QNV+LD+ RFR L+ AY
Sbjct: 422 ESSIYRKYANDDPVLTDPDTVNAKGWQVTKQVYLDGQNVRLDMARFRRRLRQAY 475
>gi|195611628|gb|ACG27644.1| glycosyltransferase [Zea mays]
Length = 528
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 148/414 (35%), Positives = 220/414 (53%), Gaps = 36/414 (8%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
P A ++ CDR+ D+C++ G +++ F + +S S+P+ + +PY
Sbjct: 103 PCADVANDTVCCDRTALRTDVCVMRGDVRTQAASNSLFLLLPPNS--SRPATDERIRPYT 160
Query: 65 RKSDKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF 122
RK + S MS + EL + SAP C V H PA+VFS GGY GN +HEF D +PL+
Sbjct: 161 RKWESSIMSTIDELRLR-SAPEGAPSRCDVRHDVPAVVFSTGGYTGNVYHEFNDGIIPLY 219
Query: 123 ITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS 182
IT H+ ++ V+ + + +D W KY ++ + + +P I+ N THCF +GL
Sbjct: 220 ITAR-HY-NKKVVFVMLEYHDWWMTKYGHIVEQLSDYPPIDFTNDRRTHCFPEAVVGLRI 277
Query: 183 HGRMVINPTLLPKPKTLVDFQSFLANAYNENTNT-------------------------- 216
H + I+ +P +T+ DF+ L +A+ T
Sbjct: 278 HDELAIDAARMPANQTIQDFRLMLDDAHRGRIQTIIEEEKKEAAAARRQEHRGSKRSRQR 337
Query: 217 SSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIH 276
S + KP+L +++RN R I N E+ +AA GF V + +P + T LA +R ++
Sbjct: 338 SKTALAEKPRLTIISRNG--SRAIENEAELVRAAAGAGFRVAVLQPRQDTELARMYRALN 395
Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
+ MVGVHGA +TH LF+RPGSV +QVVP+GT W + Y+ +PAR LGL YL YKI
Sbjct: 396 ASDVMVGVHGAAMTHFLFMRPGSVFIQVVPLGTDWAAETYYGEPARRLGLRYLPYKILPS 455
Query: 337 ESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
ESS+ KY +D VL +P W +VYL QNV+LD+ RFR L+ AY
Sbjct: 456 ESSIYRKYANDDPVLTDPDTVNAKGWQVTKQVYLDGQNVRLDMARFRRRLRQAY 509
>gi|226492060|ref|NP_001145867.1| uncharacterized protein LOC100279381 [Zea mays]
gi|219884767|gb|ACL52758.1| unknown [Zea mays]
Length = 527
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 148/413 (35%), Positives = 219/413 (53%), Gaps = 35/413 (8%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
P A ++ CDR+ D+C++ G +++ F + +S S+P+ +PY
Sbjct: 103 PCADVANDTVCCDRTALRTDVCVMRGHVRTQAASNSLFLLLPPNS--SRPATDEHIRPYT 160
Query: 65 RKSDKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF 122
RK + S MS + EL + SAP C V H PA+VFS GGY GN +HEF D +PL+
Sbjct: 161 RKWESSIMSTIDELRLR-SAPEGAPSRCDVRHDVPAVVFSTGGYTGNVYHEFNDGIIPLY 219
Query: 123 ITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS 182
IT H+ ++ V+ + + +D W KY ++ + + +P I+ N THCF +GL
Sbjct: 220 ITAR-HY-NKKVVFVMLEYHDWWMTKYGHIVEQLSDYPPIDFTNDRRTHCFPEAVVGLRI 277
Query: 183 HGRMVINPTLLPKPKTLVDFQSFLANAYNENTNT-------------------------S 217
H + I+ +P +T+ DF+ L +A+ T S
Sbjct: 278 HDELAIDAARMPANQTIQDFRLMLDDAHRGRIQTIIEEKKEAAAARRQEHRGSKRSRQRS 337
Query: 218 SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHS 277
+ KP+L +++RN R I N E+ +AA GF V + +P + T LA +R +++
Sbjct: 338 KTALAEKPRLTIISRNG--SRAIENEAELVRAAAGAGFRVAVLQPRQDTELARMYRALNA 395
Query: 278 CHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEE 337
MVGVHGA +TH LF+RPGSV +QVVP+GT W + Y+ +PAR LGL YL YKI E
Sbjct: 396 SDVMVGVHGAAMTHFLFMRPGSVFIQVVPLGTDWAAETYYGEPARRLGLRYLPYKILPSE 455
Query: 338 SSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
SS+ KY +D VL +P W +VYL QNV+LD+ RFR L+ AY
Sbjct: 456 SSIYRKYANDDPVLTDPDTVNAKGWQVTKQVYLDGQNVRLDMARFRRRLRQAY 508
>gi|413948188|gb|AFW80837.1| glycosyltransferase [Zea mays]
Length = 527
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 148/413 (35%), Positives = 219/413 (53%), Gaps = 35/413 (8%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
P A ++ CDR+ D+C++ G +++ F + +S S+P+ +PY
Sbjct: 103 PCADVANDTVCCDRTALRTDVCVMRGHVRTQAASNSLFLLLPPNS--SRPATDEHIRPYT 160
Query: 65 RKSDKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF 122
RK + S MS + EL + SAP C V H PA+VFS GGY GN +HEF D +PL+
Sbjct: 161 RKWESSIMSTIDELRLR-SAPEGAPSRCDVRHDVPAVVFSTGGYTGNVYHEFNDGIIPLY 219
Query: 123 ITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS 182
IT H+ ++ V+ + + +D W KY ++ + + +P I+ N THCF +GL
Sbjct: 220 ITAR-HY-NKKVVFVMLEYHDWWMTKYGHIVEQLSDYPPIDFTNDRRTHCFPEAVVGLRI 277
Query: 183 HGRMVINPTLLPKPKTLVDFQSFLANAYNENTNT-------------------------S 217
H + I+ +P +T+ DF+ L +A+ T S
Sbjct: 278 HDELAIDAARMPANQTIQDFRLMLDDAHRGRIQTIIEEKKEAAAARRQEHRGSKRSRQRS 337
Query: 218 SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHS 277
+ KP+L +++RN R I N E+ +AA GF V + +P + T LA +R +++
Sbjct: 338 KTALAEKPRLTIISRNG--SRAIENEAELVRAAAGAGFRVAVLQPRQDTELARMYRALNA 395
Query: 278 CHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEE 337
MVGVHGA +TH LF+RPGSV +QVVP+GT W + Y+ +PAR LGL YL YKI E
Sbjct: 396 SDVMVGVHGAAMTHFLFMRPGSVFIQVVPLGTDWAAETYYGEPARRLGLRYLPYKILPSE 455
Query: 338 SSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
SS+ KY +D VL +P W +VYL QNV+LD+ RFR L+ AY
Sbjct: 456 SSIYRKYANDDPVLTDPDTVNAKGWQVTKQVYLDGQNVRLDMARFRRRLRQAY 508
>gi|297833814|ref|XP_002884789.1| hypothetical protein ARALYDRAFT_478362 [Arabidopsis lyrata subsp.
lyrata]
gi|297330629|gb|EFH61048.1| hypothetical protein ARALYDRAFT_478362 [Arabidopsis lyrata subsp.
lyrata]
Length = 493
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 141/407 (34%), Positives = 224/407 (55%), Gaps = 25/407 (6%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
P +SI+CDR+ D+C + G +S+ + +T + + L K +PY
Sbjct: 91 PKTSQNEESISCDRTGYRSDICFMKGDIRTHSPSSSI--ILYTSNDLTDNVLPEKIKPYT 148
Query: 65 RKSDKSAMSKVKELTITTSAPPNLS----CGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
RK + S M + EL + T C V H PA++FS GGY GN +HEF D +P
Sbjct: 149 RKWETSIMETIHELKLVTKDMKRFGDRCKCEVIHEVPAVLFSTGGYTGNLYHEFNDGLIP 208
Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
L+IT S ++ V+L IA+ + W KY ++L + + +P+I+ + THCF+ +GL
Sbjct: 209 LYIT--SKRFNKKVLLVIAEYHKWWEMKYGDVLSQLSDYPLIDFSKDKRTHCFKEAIVGL 266
Query: 181 ISHGRMVINPTLLPKPKTLV-DFQSFLANAYNENTN-----TSSSFHH--------TKPK 226
HG + ++P+ + +T + +F++ L AY N FH +PK
Sbjct: 267 RIHGELTVDPSQMQDGRTTINEFRNVLDRAYGPRINRLDRLEEQRFHARVAKRRKAQRPK 326
Query: 227 LVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHG 286
L L +R R I N + + A+ +GF+V + P+ +T LA +R ++S MVGVHG
Sbjct: 327 LALFSRTG--SRGITNEDLMVQLAQRIGFEVEVLRPDRTTELAKIYRVLNSSKVMVGVHG 384
Query: 287 AGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGA 346
A +TH LF++PGS+ +Q++P+GT W + Y+ +PA+ LGL+Y+ YKI ESSL EKY
Sbjct: 385 AAMTHFLFMQPGSIFIQIIPLGTDWAAETYYGEPAKKLGLDYIGYKILPRESSLYEKYDK 444
Query: 347 NDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKA 392
+D +L++P + W + +YL Q V+LD+ RF+ L DAY K+
Sbjct: 445 DDPILRDPNSITKKGWQFTKGIYLNDQKVRLDLHRFKKVLVDAYAKS 491
>gi|42565997|ref|NP_191297.2| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
gi|332646127|gb|AEE79648.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
Length = 504
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 143/400 (35%), Positives = 225/400 (56%), Gaps = 32/400 (8%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
+I CDR+ D+C++ G +S+ F S ++ + K +PY RK + S M
Sbjct: 94 TICCDRTGSRSDVCIMKGDVRTHSASSSVFLF---TSLKNKTKITKKIKPYTRKWETSVM 150
Query: 73 SKVKELTITTSAPPNLS---------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
V+EL + N S C V + PA+ FS GGY GN +HEF D +PLFI
Sbjct: 151 QTVQELNLVYRDEENNSLVVSSVNDICDVFYNVPAVFFSTGGYTGNVYHEFNDGIIPLFI 210
Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
T SH ++ V+ I + + W KY +++ + + +P ++ N THCF+ +GL H
Sbjct: 211 T--SHHFNKKVVFVIVEYHSWWIMKYGDIVSQLSDYPPVDFNGDKRTHCFKEAIVGLKIH 268
Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAY-------------NENTNTSSSFHHTKPKLVLV 230
+ + +L+ KT++DF++ L AY T F KP LV++
Sbjct: 269 DELTVESSLMLGNKTILDFRNVLDQAYWPRIHGLIQEEELKAANKTEDGFK--KPILVIL 326
Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
+RN R ILN + + AEE+GF V + P+++T LA +R ++S M+GVHGA +T
Sbjct: 327 SRNG--SREILNESLLVELAEEIGFIVHVLRPDKTTELAKIYRCLNSSDVMIGVHGAAMT 384
Query: 291 HSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
H LFL+P +V +Q++PIGT+W + Y+ KPA+ + L+Y+ YKIK +ESSL ++YG +D +
Sbjct: 385 HLLFLKPKTVFIQIIPIGTEWAAETYYGKPAKKMRLKYIGYKIKPKESSLYDEYGIDDPI 444
Query: 351 LKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
+++P++F W ++YL+ QNVKLD+ RFR L AY
Sbjct: 445 IRDPKSFTQKGWDYTKKIYLERQNVKLDLKRFRKPLSRAY 484
>gi|6735317|emb|CAB68144.1| putative protein [Arabidopsis thaliana]
Length = 470
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 143/400 (35%), Positives = 225/400 (56%), Gaps = 32/400 (8%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
+I CDR+ D+C++ G +S+ F S ++ + K +PY RK + S M
Sbjct: 60 TICCDRTGSRSDVCIMKGDVRTHSASSSVFLF---TSLKNKTKITKKIKPYTRKWETSVM 116
Query: 73 SKVKELTITTSAPPNLS---------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
V+EL + N S C V + PA+ FS GGY GN +HEF D +PLFI
Sbjct: 117 QTVQELNLVYRDEENNSLVVSSVNDICDVFYNVPAVFFSTGGYTGNVYHEFNDGIIPLFI 176
Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
T SH ++ V+ I + + W KY +++ + + +P ++ N THCF+ +GL H
Sbjct: 177 T--SHHFNKKVVFVIVEYHSWWIMKYGDIVSQLSDYPPVDFNGDKRTHCFKEAIVGLKIH 234
Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAY-------------NENTNTSSSFHHTKPKLVLV 230
+ + +L+ KT++DF++ L AY T F KP LV++
Sbjct: 235 DELTVESSLMLGNKTILDFRNVLDQAYWPRIHGLIQEEELKAANKTEDGF--KKPILVIL 292
Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
+RN R ILN + + AEE+GF V + P+++T LA +R ++S M+GVHGA +T
Sbjct: 293 SRNG--SREILNESLLVELAEEIGFIVHVLRPDKTTELAKIYRCLNSSDVMIGVHGAAMT 350
Query: 291 HSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
H LFL+P +V +Q++PIGT+W + Y+ KPA+ + L+Y+ YKIK +ESSL ++YG +D +
Sbjct: 351 HLLFLKPKTVFIQIIPIGTEWAAETYYGKPAKKMRLKYIGYKIKPKESSLYDEYGIDDPI 410
Query: 351 LKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
+++P++F W ++YL+ QNVKLD+ RFR L AY
Sbjct: 411 IRDPKSFTQKGWDYTKKIYLERQNVKLDLKRFRKPLSRAY 450
>gi|357509679|ref|XP_003625128.1| Glycosyltransferase [Medicago truncatula]
gi|355500143|gb|AES81346.1| Glycosyltransferase [Medicago truncatula]
Length = 541
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 147/431 (34%), Positives = 229/431 (53%), Gaps = 57/431 (13%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFF---SVGHTDSTPSQPSLRI----------- 58
+I CDRS D+C++ G +S+ F S+ H ++ R
Sbjct: 100 TICCDRSGYRSDICVMKGDIRTHSSSSSIFLYNSISHGNNVSRTIEARKGEDEEDQVLQH 159
Query: 59 -KTQPYPRKSDKSAMSKVKELTITT---SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEF 114
K +PY RK + S M + EL + + ++P C V H PA+ FS GGY GN +HEF
Sbjct: 160 EKIKPYTRKWETSVMDTIDELNLISKKVNSPSVRGCDVQHDVPAVFFSNGGYTGNVYHEF 219
Query: 115 MDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQ 174
D +PL+IT + HF ++ V+ I + ++ W KY ++L + P IN +N THCF
Sbjct: 220 NDGIIPLYIT-SQHF-NKKVVFVILEYHEWWITKYGDILSHLSDFPPINFSNDNRTHCFP 277
Query: 175 SVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN----------------------- 211
+GL H + ++ L+ K++V F++ L AY+
Sbjct: 278 EAIVGLKIHDELAVDSALMEGNKSIVYFRNLLDEAYSPRIKGLIQDEEREAQEKLRQQQQ 337
Query: 212 --------ENTNTSSSFHH---TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIF 260
++ TS TKPKLV+V+R+ R I N + K AEE+GF V +
Sbjct: 338 QQISLSPSSDSETSQGLQEIARTKPKLVIVSRSG--SRAITNENLLVKMAEEIGFKVNVL 395
Query: 261 EPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKP 320
+P+++T LA +R ++ M+GVHGA +TH +F++P SV +QVVP+GT W + Y+ +P
Sbjct: 396 KPQKTTELAKIYRVLNESDVMIGVHGAAMTHFMFMKPKSVFIQVVPLGTNWAADTYYGEP 455
Query: 321 ARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDID 379
AR LGL+Y+ Y+I +ESSL E+Y +D +L++P++ W ++YL +QNVKLD+
Sbjct: 456 ARKLGLKYIGYEIHPKESSLYERYDKSDPILRDPESINKKGWEYTKKIYLDSQNVKLDLR 515
Query: 380 RFRIYLKDAYK 390
RFR L AY+
Sbjct: 516 RFRKRLHRAYE 526
>gi|317106706|dbj|BAJ53206.1| JHL06B08.8 [Jatropha curcas]
Length = 530
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 144/421 (34%), Positives = 224/421 (53%), Gaps = 47/421 (11%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTP------SQPSLRIKTQPYPRK 66
+I CDRS D C++ G +S+ F ++T + K +PY RK
Sbjct: 99 TICCDRSSFRTDFCIMKGDVRTQYASSSIFLYTSRNTTSLIREDEEEEFQHEKIKPYTRK 158
Query: 67 SDKSAMSKVKELTITTSAPP---NLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
+ S M +++L + + + C V H+ PA+ FS GGY GN +HEF D +PL+I
Sbjct: 159 WETSVMDTIEQLHLISKQEKFAIDHQCDVKHSVPAVFFSTGGYTGNVYHEFNDGILPLYI 218
Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
T + HF Q V+ I + +D W KY ++L + +P I+ + THCF +GL H
Sbjct: 219 T-SQHFNKQ-VVFVILEYHDWWITKYGDILSHLSDYPAIDFSGDKRTHCFPEAIVGLKIH 276
Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYN-------------------------------- 211
+ ++P+L+ K++VDF + L AY
Sbjct: 277 NELTVDPSLMQGNKSIVDFHNLLGKAYKPRINGLIRDEKREAEEKLKQKVLSLSPSSGTL 336
Query: 212 -ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLAD 270
E N +PKLV+++RNA R I N + K AE +GF V + P+ +T LA
Sbjct: 337 LELKNDVQEAKLKRPKLVILSRNA--SRAITNEDLLVKMAEGIGFHVEVLRPDRTTELAK 394
Query: 271 SFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLE 330
+R ++S M+GVHGA +TH LF+RPGSV +QV+P+GT+W + Y+ +PA+ LGL+Y+
Sbjct: 395 IYRALNSSEVMIGVHGAAMTHFLFMRPGSVFIQVIPLGTEWAAETYYGEPAKKLGLKYIG 454
Query: 331 YKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAY 389
YKI ESSL +KY ND VL++P++ + W + +YL +QN++L++ RF+ L AY
Sbjct: 455 YKILPRESSLYDKYDKNDPVLQDPESISKKGWQYTKTIYLDSQNLRLNLGRFQKRLVRAY 514
Query: 390 K 390
+
Sbjct: 515 Q 515
>gi|297817062|ref|XP_002876414.1| hypothetical protein ARALYDRAFT_486181 [Arabidopsis lyrata subsp.
lyrata]
gi|297322252|gb|EFH52673.1| hypothetical protein ARALYDRAFT_486181 [Arabidopsis lyrata subsp.
lyrata]
Length = 512
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 138/405 (34%), Positives = 226/405 (55%), Gaps = 35/405 (8%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
+I CDR+ D+C++ G +S+ F S ++ ++ K +PY RK + S M
Sbjct: 95 TICCDRTGFRSDVCIMKGDVRTHSASSSVFLF---TSLKNKITITGKIKPYTRKWETSVM 151
Query: 73 SKVKELT-----------ITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
V++L ++ N C V + PA+ FS GGY GN +HEF D +PL
Sbjct: 152 QTVQQLNLVYRDEKNNYLVSVDEHNNNICDVFYNVPAVFFSTGGYTGNVYHEFNDGIIPL 211
Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
FIT SH ++ V+ I + + W KY +++ + + +P ++ N T CF+ +GL
Sbjct: 212 FIT--SHHFNKKVVFVIVEYHSWWVMKYGDIVSQLSDYPPVDFNGDKRTQCFKEAIVGLK 269
Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAY----------------NENTNTSSSFHHTKP 225
H + ++ +L+ KT++DF++ L AY N+ KP
Sbjct: 270 IHDELTVDSSLMLGNKTILDFRNVLNQAYWPRIRGLSQEEELEAANKTGKRVQEDGFKKP 329
Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
KLV+++RN R ILN + AEE+GF V + P+++T LA ++ ++S M+GVH
Sbjct: 330 KLVILSRNG--SREILNDGLLVALAEEIGFIVYVLRPDKTTELAKIYKCLNSSDVMIGVH 387
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYG 345
GA +TH LF++P +V +Q++PIGT+W + Y+ KPA+ + L+Y+ YKIK +ESSL ++YG
Sbjct: 388 GAAMTHFLFMKPKTVFIQIIPIGTEWAAETYYGKPAKKMRLKYIGYKIKPKESSLYDEYG 447
Query: 346 ANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
+D ++++P++F W ++YL+ QNVKLD+ RFR L AY
Sbjct: 448 KDDPIIRDPKSFTQKGWDYTKKIYLERQNVKLDLKRFRKPLSRAY 492
>gi|56409840|emb|CAI30145.1| glycosyltransferase [Medicago truncatula]
Length = 541
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 146/431 (33%), Positives = 228/431 (52%), Gaps = 57/431 (13%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFF---SVGHTDSTPSQPSLRI----------- 58
+I CDRS D+C++ G +S+ F S+ H ++ R
Sbjct: 100 TICCDRSGYRSDICVMKGDIRTHSSSSSIFLYNSISHGNNVSRTIEARKGEDEEDQVLQH 159
Query: 59 -KTQPYPRKSDKSAMSKVKELTITT---SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEF 114
K +PY RK + S M + EL + + ++P C V H PA+ FS GGY GN +HEF
Sbjct: 160 EKIKPYTRKWETSVMDTIDELNLISKKVNSPSVRGCDVQHDVPAVFFSNGGYTGNVYHEF 219
Query: 115 MDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQ 174
D +PL+IT + HF ++ V+ I + ++ W KY ++L + P IN +N THCF
Sbjct: 220 NDGIIPLYIT-SQHF-NKKVVFVILEYHEWWITKYGDILSHLSDFPPINFSNDNRTHCFP 277
Query: 175 SVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN----------------------- 211
+GL H + ++ L+ K++V F++ L AY+
Sbjct: 278 EAIVGLKIHDELAVDSALMEGNKSIVYFRNLLDEAYSPRIKGLIQDEEREAQEKLRQQQQ 337
Query: 212 --------ENTNTSSSFHH---TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIF 260
++ TS TKPKLV+V+R+ R I N + K AEE+G V +
Sbjct: 338 QQISLSPSSDSETSQGLQEIARTKPKLVIVSRSG--SRAITNENLLVKMAEEIGLKVNVL 395
Query: 261 EPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKP 320
+P+++T LA +R ++ M+GVHGA +TH +F++P SV +QVVP+GT W + Y+ +P
Sbjct: 396 KPQKTTELAKIYRVLNESDVMIGVHGAAMTHFMFMKPKSVFIQVVPLGTNWAADTYYGEP 455
Query: 321 ARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDID 379
AR LGL+Y+ Y+I +ESSL E+Y +D +L++P++ W ++YL +QNVKLD+
Sbjct: 456 ARKLGLKYIGYEIHPKESSLYERYDKSDPILRDPESINKKGWEYTKKIYLDSQNVKLDLR 515
Query: 380 RFRIYLKDAYK 390
RFR L AY+
Sbjct: 516 RFRKRLHRAYE 526
>gi|255585488|ref|XP_002533436.1| glycosyltransferase, putative [Ricinus communis]
gi|223526710|gb|EEF28943.1| glycosyltransferase, putative [Ricinus communis]
Length = 533
Score = 261 bits (667), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 145/426 (34%), Positives = 225/426 (52%), Gaps = 45/426 (10%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTST--FFSVGHTDS--TPSQPSLRIKTQPYPRKSD 68
++ CDR D+C++ G +S+ ++ +T S ++ K +PY RK +
Sbjct: 104 TMCCDRRSFRSDICIMKGDIRTHSASSSVLLYTSRNTSSLIKDNEEIQHEKIKPYTRKWE 163
Query: 69 KSAMSKVKELTI---TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
S M + +L + + N C V H PA+ FS GGY GN +HEF D VPL+IT
Sbjct: 164 TSVMGTIDQLDLILKQEKSSVNHRCDVKHDVPAVFFSTGGYTGNVYHEFNDGIVPLYIT- 222
Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR 185
S + V+ I + + W KY ++L R + +P I+ + THCF +GL H
Sbjct: 223 -SQHLKRKVVFVILEYHTWWMMKYGDILSRLSDYPAIDYSGDKRTHCFPEAIVGLRIHDE 281
Query: 186 MVINPTLLPKPKTLVDFQSFLANAYN---------------------------------E 212
+ ++ +L+ K++VDF + L AY E
Sbjct: 282 LTVDSSLMKGNKSIVDFHNLLDKAYRPRIKGLIREEEHEALKKLKQKVLPLSPSSETLLE 341
Query: 213 NTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSF 272
H +PKLV+++RNA R I N + K AE +GF V + PE +T LA +
Sbjct: 342 FRKDVQESKHKRPKLVILSRNA--SRAITNEDLLVKMAEGIGFRVEVLRPERTTELAKIY 399
Query: 273 RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
R ++S M+GVHGA +TH LF++PGSV +QV+P+GT+W + Y+ +PAR LGL+Y+ Y+
Sbjct: 400 RALNSSEVMIGVHGAAMTHFLFMKPGSVFIQVIPLGTEWAAETYYGEPARKLGLKYIGYQ 459
Query: 333 IKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKK 391
I ESSL +KY ND VL++P + + W + +YL +QNV+LD++RF+ L AY+
Sbjct: 460 ILPRESSLYDKYDKNDPVLQDPASISNKGWQYTKTIYLDSQNVRLDLERFQKQLVLAYEH 519
Query: 392 AKKLMD 397
+ K M+
Sbjct: 520 SMKKMN 525
>gi|302812177|ref|XP_002987776.1| glycosyltransferase belonging to CAZy family GT61 [Selaginella
moellendorffii]
gi|300144395|gb|EFJ11079.1| glycosyltransferase belonging to CAZy family GT61 [Selaginella
moellendorffii]
Length = 445
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 138/392 (35%), Positives = 228/392 (58%), Gaps = 19/392 (4%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ CD S+ +C + G T T FS +TD + L K +PYPRK K +
Sbjct: 63 VACDLSNYRTAVCRLAGDLRISGSTVTLFSPRNTD----EEILVQKIKPYPRKWQKQLLE 118
Query: 74 KVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
K+ E+TI ++S+ P C V HT A++FS GGY G+ +H+F D +P++IT SH
Sbjct: 119 KISEVTIKVRRSSSSTPQHQCDVNHTQAAMIFSTGGYTGSVYHDFNDGLIPIYIT--SHG 176
Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININN---QTITHCFQSVTLGLISHGRM 186
+ +V+ +++ W +KY ++ + T++P+ + ++ Q HCF V +GL H +
Sbjct: 177 FEGEVVFLVSEFQPWWMKKYGSIVKQMTKYPVQDFSSNPAQHRVHCFPKVVVGLDIHDEL 236
Query: 187 VINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
IN + K++ DFQS L+ A++ +++ + T+PKLV + R R R + N EV
Sbjct: 237 AINAAKMSHGKSIRDFQSILSAAFSA-SDSRTKVPSTRPKLVFITR--RRTRVVTNEEEV 293
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+ AE GFDV EP + +A+ + I S ++GVHGA +TH LF+RPGS+L+Q+VP
Sbjct: 294 VQLAERAGFDVEALEPGFNHEMANLYGIIQSADVLLGVHGAAMTHLLFMRPGSLLLQIVP 353
Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGAN--WSN 364
+GT+ S ++ PA G++Y+EY ++ ESSL++++G N V+ +P G++ WS+
Sbjct: 354 LGTKSPSRSFYGNPAMKAGMQYMEYIVEASESSLLKRFGQNHSVIVSPPENPGSSSGWSD 413
Query: 365 M-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
M ++YL QNV + + RF L++A++K L
Sbjct: 414 MKKIYLDKQNVTISLSRFEPVLREAFEKISSL 445
>gi|224086964|ref|XP_002308019.1| predicted protein [Populus trichocarpa]
gi|222853995|gb|EEE91542.1| predicted protein [Populus trichocarpa]
Length = 413
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 137/362 (37%), Positives = 207/362 (57%), Gaps = 35/362 (9%)
Query: 59 KTQPYPRKSDKSAMSKVKELTI---TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFM 115
K +PY RK + S M + EL + T + N C V H PA+ FS GGY GN +HEF
Sbjct: 43 KVKPYTRKWETSVMDTIDELDLIVKTENFRNNHHCDVKHDVPAVFFSTGGYTGNVYHEFN 102
Query: 116 DCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQS 175
D +PL+IT + HF ++ V+ I + +D W KY ++L + +P I+ + THCF
Sbjct: 103 DGLLPLYIT-SQHF-NKKVVFVILEYHDWWIMKYGDILSHLSDYPAIDFSGDKRTHCFPE 160
Query: 176 VTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY-------------------NENTNT 216
+GL H + ++P+L+ K++VDF++ L AY + ++
Sbjct: 161 AIVGLRIHDELTVDPSLMQGNKSVVDFRNVLDRAYLPRVQSLLKEEERLAQEKLKQKVHS 220
Query: 217 SSSFHHT--------KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSL 268
SSS +PKLV+++RN R I N + K AEE+GF V + PE +T L
Sbjct: 221 SSSEIRKEVQDATLKRPKLVILSRNG--SRAITNEDLLVKMAEEIGFRVEVVRPERTTEL 278
Query: 269 ADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
A + ++S M+GVHGA +TH LF+RPGSV +QV+P+GT+W + Y+ +PAR LGL+Y
Sbjct: 279 ARIYGALNSSEVMIGVHGAAMTHFLFMRPGSVFIQVIPLGTEWAADAYYGEPARKLGLKY 338
Query: 329 LEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKD 387
+ Y+I ESSL +KY ND VL++P++ + W + +YL QNV+L++ RF+ L
Sbjct: 339 IGYQILPRESSLYDKYDRNDPVLRDPESVSEKGWQYTKSIYLDNQNVRLNLGRFQKRLLR 398
Query: 388 AY 389
AY
Sbjct: 399 AY 400
>gi|326491913|dbj|BAJ98181.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326506118|dbj|BAJ91298.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 535
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 151/425 (35%), Positives = 217/425 (51%), Gaps = 51/425 (12%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
P A ++ CDR+ DLC++ G D +T + + + RI+ PY
Sbjct: 106 PCAGVADGTVCCDRTSPRADLCVMRG----DVRTHAASNSLFLLAAAAPADERIR--PYT 159
Query: 65 RKSDKSAMSKVKELTITTSAPPNLS---------CGVTHTSPALVFSAGGYNGNFFHEFM 115
RK + + MS + EL + + PP S C V H PA+VFS GGY GN +HEF
Sbjct: 160 RKWESTVMSTIDELRLR-AVPPEGSDAAAAGPAGCDVRHDVPAVVFSTGGYTGNVYHEFN 218
Query: 116 DCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQS 175
D +PL+IT + D+ V+ + + +D W KY ++ + + P ++ +N T THCF
Sbjct: 219 DGIIPLYITARRY--DRKVVFVMLEYHDWWITKYGHIVDQLSDFPPVDFSNDTRTHCFPE 276
Query: 176 VTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNT------------------- 216
+GL H + I+ + +P + ++DF+ L +A+ N
Sbjct: 277 AVVGLRIHDELAIDASRMPGNQGILDFRQILDDAHRGRINAIIDEEKASPLAAPAAAQAK 336
Query: 217 ---SSSFHHT--------KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEES 265
S T KP+LV+V+RN R I N E+ +AA GF VT+ P
Sbjct: 337 KQQPRSISGTEELLEDEYKPRLVIVSRNG--SRAIENEDELARAAARAGFRVTVLRPRPD 394
Query: 266 TSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLG 325
T LA +R ++ MVGVHGA +TH LF+RPGS +QVVP+GT W + Y+ +PAR LG
Sbjct: 395 TELAQMYRVLNGSDVMVGVHGAAMTHFLFMRPGSAFIQVVPLGTDWAAENYYGEPARRLG 454
Query: 326 LEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIY 384
L Y+ YKI ESSL +Y +D VL +P A W +VYL QNV+LD+ RFR
Sbjct: 455 LRYIPYKILPSESSLFRRYARDDPVLTDPVAVNAKGWQVTKKVYLDGQNVRLDMARFRRR 514
Query: 385 LKDAY 389
L DAY
Sbjct: 515 LSDAY 519
>gi|357116180|ref|XP_003559861.1| PREDICTED: uncharacterized protein LOC100829143 [Brachypodium
distachyon]
Length = 536
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 142/416 (34%), Positives = 217/416 (52%), Gaps = 35/416 (8%)
Query: 6 AALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSV-GHTDSTPS---QPSLRIKTQ 61
AA+ ++ CDR+ D+C G +S+F V G+ +S+P+ + + +
Sbjct: 115 AAVRGNDTLCCDRTSERADMCFARGDVRMHSASSSFLLVSGNKESSPAPGKKEEQEERIR 174
Query: 62 PYPRKSDKSAMSKVKELTITT-----SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMD 116
PY RK + + M+ + E+ I P C V H PA++ S GG+ GN +HEF D
Sbjct: 175 PYTRKWEANVMATIDEVRIRRVHPAHGGPSAPRCDVVHDVPAVLLSTGGFTGNVYHEFND 234
Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
+P+F+T +H + V+ I + +D W KY +++ R + P I+ + HCF +
Sbjct: 235 GLIPMFVTA-AHL-RRRVVFVILEYHDWWITKYGDVVSRLSAFPPIDFSADRRVHCFPEL 292
Query: 177 TLGLISHGRMVINPTLLPKPKT-LVDFQSFLANAYNEN--------------------TN 215
GL HG + ++P P T + DF++ L +AY +
Sbjct: 293 IAGLRIHGELTVDPARTPDGATSIADFRALLDDAYRGRLLYLDRLAAARKHRRRPRRRSA 352
Query: 216 TSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFI 275
+S +P+L +V+R R I N V A E+GF+V + PE ST + +R +
Sbjct: 353 INSVEIEKRPRLTIVSRTG--SRVIENEEAVVSLASEIGFEVRVIRPERSTEMCKIYREL 410
Query: 276 HSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQ 335
+ AMVGVHGA +TH LF+RPG V +QVVP+GT W + Y+ +PA LGL Y+ YKI+
Sbjct: 411 NGSDAMVGVHGAAMTHFLFMRPGKVFIQVVPLGTDWAAGAYYGEPAARLGLRYVGYKIRP 470
Query: 336 EESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYK 390
+ESSL +Y A D VL +P A A W +VYL QNV+LD+ RFR L A++
Sbjct: 471 DESSLAREYPAGDPVLVDPAAVAKRGWDVTKKVYLDRQNVRLDLARFREELVKAHR 526
>gi|414887942|tpg|DAA63956.1| TPA: hypothetical protein ZEAMMB73_235532 [Zea mays]
Length = 555
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 143/419 (34%), Positives = 214/419 (51%), Gaps = 42/419 (10%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
P + S+ CDR+ D+C G +++F V +STP + + +PY
Sbjct: 112 PCSAVGNNSLCCDRTSERADICFARGDLRMHSASASFQLVSTGNSTPGE-----RIRPYT 166
Query: 65 RKSDKSAMSKVKELTITTSAPPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
RK + + M+ + E+ + AP + + C V H PA+VFS GGY GN +HEF D +PLF+
Sbjct: 167 RKWEANVMATIDEVRLRRVAPGDAARCDVRHDVPAVVFSTGGYTGNVYHEFNDGILPLFV 226
Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
T H + V L I + +D W KY +++ + + P I+ + HCF V GL H
Sbjct: 227 TAR-HL-RRRVALVILEYHDWWMTKYGDVVSQLSEFPPIDFSADRRVHCFPEVIAGLRIH 284
Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYNEN------------------------------ 213
G + ++P P+ + + DF+ FL +AY
Sbjct: 285 GELTVDPARTPERRGIGDFRRFLDDAYRGRIEFLERLERRAARRRPHRHHRGGALVPRAP 344
Query: 214 -TNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSF 272
+ +P+LV+V+R R I N +V A ++GFDV + P+ +T L +
Sbjct: 345 PAGPREAEAERRPRLVIVSRTG--SRVIENEADVAALAADVGFDVRVIRPDRTTELCKIY 402
Query: 273 RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
R +++ AMVGVHGA +TH LF+RPG V +QVVP+GT W + Y+ PA +GL Y+ YK
Sbjct: 403 RELNASDAMVGVHGAAMTHFLFMRPGKVFIQVVPLGTDWAAGAYYGDPAARMGLRYVGYK 462
Query: 333 IKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYK 390
I EESSL +Y D VL +P A W +VYL QNV+LD+ RFR L A++
Sbjct: 463 ILPEESSLSREYPTGDPVLTDPAGVAQRGWDVTKKVYLDRQNVRLDLARFREELVAAHR 521
>gi|242046804|ref|XP_002461148.1| hypothetical protein SORBIDRAFT_02g041650 [Sorghum bicolor]
gi|241924525|gb|EER97669.1| hypothetical protein SORBIDRAFT_02g041650 [Sorghum bicolor]
Length = 554
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 137/413 (33%), Positives = 212/413 (51%), Gaps = 39/413 (9%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
S CDR+ D+C G +++F V +ST + + +PY RK + + M
Sbjct: 128 SFCCDRTSERADICFAKGDLRMHSASASFQLVSAGNSTAAAEEEEERIRPYTRKWEANVM 187
Query: 73 SKVKELTITTSAPPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
+ + E+ + P + + C V H PA++FS GGY GN +HEF D +PLF+T +H
Sbjct: 188 ATIDEVRLRRVVPGDAARCDVRHDVPAVLFSTGGYTGNVYHEFNDGILPLFVT--AHHLR 245
Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
+ V+L I + +D W KY +++ + + P I+ HCF V GL HG + ++P
Sbjct: 246 RRVVLVILEYHDWWMTKYGDVVSQLSAFPPIDFTADRRVHCFPEVIAGLRIHGELSVDPA 305
Query: 192 LLPKPKTLVDFQSFLANAYNENT---------------------------------NTSS 218
P+ K++ DF+ L +AY + +
Sbjct: 306 RTPEGKSIGDFRRLLDDAYRGRIEFLERLERRTARKRLRLHRHRRGAVVPRAPPGPSHAD 365
Query: 219 SFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSC 278
+P+LV+V+R R I N ++ A ++GFDV + P+ +T L +R +++
Sbjct: 366 DDRRPRPRLVIVSRTG--SRVIENEADLAALAADVGFDVRVIRPDRTTELCKIYRELNAS 423
Query: 279 HAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEES 338
AM+GVHGA +TH LF+RPG V +QVVP+GT W + Y+ +PA +GL Y+ YKI EES
Sbjct: 424 DAMIGVHGAAMTHFLFMRPGKVFIQVVPLGTDWAAGAYYGEPAARMGLRYVGYKISPEES 483
Query: 339 SLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYK 390
SL +Y D VL +P A W +VYL QNV+LD+ RFR L A++
Sbjct: 484 SLSREYPTGDPVLTDPAGVAQRGWDVTKKVYLDRQNVRLDLARFREELVGAHR 536
>gi|356504769|ref|XP_003521167.1| PREDICTED: uncharacterized glycosyltransferase AGO61-like [Glycine
max]
Length = 537
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 145/431 (33%), Positives = 220/431 (51%), Gaps = 49/431 (11%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFF-----SVGHTDSTPSQPSLRIK 59
P + +I CDRS D+C++ G +S+ F S + + K
Sbjct: 96 PCSSVSTGTICCDRSGYRSDVCVMKGDIRTHSASSSVFLYNSRSNNNVSRNFEEELQHEK 155
Query: 60 TQPYPRKSDKSAMSKVKELTITTSAPPNLS---CGVTHTSPALVFSAGGYNGNFFHEFMD 116
+PY RK + S M + EL + + + S C V H PA+ FS GGY GN +HEF D
Sbjct: 156 IKPYTRKWETSVMDTIDELNLISKKVNSGSVGGCDVQHDVPAVFFSNGGYTGNVYHEFND 215
Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
VPL+IT + HF + V+ I + ++ W KY ++L R + P I+ THCF
Sbjct: 216 GIVPLYIT-SQHFKKK-VVFVILEYHNWWIMKYGDILSRLSDFPPIDFQGDNRTHCFTEA 273
Query: 177 TLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY---------NENTNTSSSFHHT---- 223
+GL H + ++ +L+ K++ DF++ L AY +E
Sbjct: 274 IVGLRIHDELTVDSSLMRGNKSIADFRNLLDKAYWPRIKGLIRDEERKAQEKLREQVSSS 333
Query: 224 -----------------------KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIF 260
KPKLV+++R+ R I N + K A+E+GF V +
Sbjct: 334 ESSEASQQQYITIRQQVQENPMKKPKLVILSRSG--SRAITNENLLVKMAKEIGFLVQVL 391
Query: 261 EPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKP 320
+P+ +T +A +R +++ M+GVHGA +TH LFLRPGSV +QVVP+GT W + Y+ +P
Sbjct: 392 KPDRTTEMAKIYRTLNASDVMIGVHGAAMTHFLFLRPGSVFIQVVPLGTTWAAETYYGEP 451
Query: 321 ARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDID 379
AR LGL+Y+ Y+I ES+L EKY ND +L++P + W ++YL +QNV LD+
Sbjct: 452 ARKLGLKYIGYQILPRESTLYEKYDKNDPILRDPTSINKKGWEYTKKIYLDSQNVMLDLR 511
Query: 380 RFRIYLKDAYK 390
RFR L AY+
Sbjct: 512 RFRKRLHRAYE 522
>gi|242090431|ref|XP_002441048.1| hypothetical protein SORBIDRAFT_09g019360 [Sorghum bicolor]
gi|241946333|gb|EES19478.1| hypothetical protein SORBIDRAFT_09g019360 [Sorghum bicolor]
Length = 546
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 143/404 (35%), Positives = 208/404 (51%), Gaps = 38/404 (9%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ CDRSH D+C + G DP TS+ + ++ K +PY RK + S MS
Sbjct: 129 LCCDRSHYRSDVCYLRGDVRTDPSTSSVLLYNAPRGSAAE-----KVRPYTRKFEDSIMS 183
Query: 74 KVKELTITTSAPP-NLS------------CGVTHTS--PALVFSAGGYNGNFFHEFMDCF 118
+ E+TI A N S C V H PA+VFS GGY GN +HEF D
Sbjct: 184 TIDEVTIQPVAGAYNASASASDGGTLRRRCDVRHPRGVPAVVFSTGGYTGNVYHEFSDGL 243
Query: 119 VPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTL 178
+PLF+T +V+ + + + W +Y +L + T + +++ HCF + +
Sbjct: 244 IPLFVTAQRF--AGEVVFVVLEYHYWWLGRYGAVLEQLTNYKVVDFRYDRRVHCFDEMIV 301
Query: 179 GLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH-------------TKP 225
GL HG +V++P L+P K++ DFQ+ L Y+ T+++S KP
Sbjct: 302 GLRIHGELVVDPKLMPNGKSIKDFQALLHQGYSRTTSSASPPVPLPLAPPSRPCPRPAKP 361
Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
KL++ R + R +LNL V A GF + T L + S AMV VH
Sbjct: 362 KLLIFIR--KQNRVLLNLPHVVTACRRAGFAPHVMNLRRQTPLPVIHAALASADAMVAVH 419
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYG 345
GA +TH LF+RPGSVL+Q+VP+G W + ++ KPA+ LGLEYLEYK+ EESSL +YG
Sbjct: 420 GAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLEYLEYKVAPEESSLAAEYG 479
Query: 346 ANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
N V++ P + W M +VY+ QNV ++I RF L+ A
Sbjct: 480 LNSTVVRTPWVISSRGWWEMKKVYMDRQNVTVNIKRFGELLRAA 523
>gi|168060869|ref|XP_001782415.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666086|gb|EDQ52750.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 142/389 (36%), Positives = 211/389 (54%), Gaps = 18/389 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFS---VGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
CDRS D+C + G D + TF + P+ S+ +PY RK +KS M
Sbjct: 1 CDRSQFRTDICNMKG----DIRMLTFNGNKPIVLYAKDPATSSVTEIVKPYTRKWEKSCM 56
Query: 73 SKVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
V E+T+ S C V H P +VFS GY GN FHEF D +PLFIT S
Sbjct: 57 DTVHEVTLRIVPANSQTDKTPCDVHHKVPGVVFSTSGYTGNLFHEFNDGLIPLFIT--SQ 114
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
+V+ I + ++ W KY E+L + +++ II+ N T HCF + +GL H + +
Sbjct: 115 HLKGEVVFIITEFHNWWLTKYFEVLQQLSQYEIISFENDTRVHCFPELEVGLHIHDDLTV 174
Query: 189 NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH-TKPKLVLVNRNARVGRTILNLREVK 247
+P +P +++ DF+ L Y S +KPKL ++ RN R LNL ++
Sbjct: 175 DPNRMPNHESIRDFRKLLDRGYENALRFDSPIPDVSKPKLSIIVRNGT--RKFLNLGDIV 232
Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
AEELGF+V++ P+ + L F+ ++S ++GVHGA +TH LF++PG VL+QV+P+
Sbjct: 233 TTAEELGFNVSLLSPDPTMELKRLFQLLNSTDVLMGVHGAAMTHFLFMKPGKVLIQVIPL 292
Query: 308 GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP-QAFAGANWSNM- 365
G W ST Y+ KP + +GL YL YKI ESSL +Y A+D +L NP + F W M
Sbjct: 293 GIDWASTTYYGKPTKKMGLHYLPYKILPSESSLSRQYNASDPILVNPDEIFNQQGWWTMK 352
Query: 366 RVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
+++L+ Q+V+ + R R A KK K+
Sbjct: 353 KIFLEGQDVRPSLTRMRKIFMRALKKLKR 381
>gi|302765455|ref|XP_002966148.1| glycosyltransferase in CAZy family GT61 [Selaginella
moellendorffii]
gi|300165568|gb|EFJ32175.1| glycosyltransferase in CAZy family GT61 [Selaginella
moellendorffii]
Length = 486
Score = 251 bits (642), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 134/395 (33%), Positives = 211/395 (53%), Gaps = 21/395 (5%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
+ITCD SH D+C + G + ++ +S + K +PY RK +KS M
Sbjct: 96 NITCDFSHFRTDVCELRGDVRVSGGRAGKVALLDAESPQDSHQVVAKIKPYTRKWEKSCM 155
Query: 73 SKVKELTITT---SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
+ + E+++ S+ P++ C H+ PA++ S GGY GN +HEF D +PLFIT SH
Sbjct: 156 ATIGEVSLEILPLSSSPSMPCDTNHSVPAVILSTGGYTGNVYHEFNDGLIPLFIT--SHK 213
Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
+V+ + + + W KY ++ + + +P+ + + HCF LG+ H + I
Sbjct: 214 FHGEVVFLVLELHKWWMMKYGSIVSKLSNYPVQDFDRSKRIHCFPQAILGMQIHDELAI- 272
Query: 190 PTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH-----------TKPKLVLVNRNARVGR 238
+ P+ ++ DFQ L + N + + PKLVL+ R R
Sbjct: 273 ASQAPEA-SMRDFQQLLKASLNSQKSPLKPMSRVAKVGNKVGGSSSPKLVLLARKG--SR 329
Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
+LN + + A+++GF V + P +SL + +HS H MVGVHGA LTH LF+RP
Sbjct: 330 VLLNQNALVRLAKKIGFRVVVLAPNSHSSLFELHEELHSAHVMVGVHGAALTHFLFMRPA 389
Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFA 358
SV +Q+VP+GT+W + Y+ +PA GL YLEYKI EESSLV+K G + +P+
Sbjct: 390 SVFIQIVPLGTEWAAQTYYGQPAMKAGLRYLEYKIVAEESSLVKKLGRESAAVAHPEEIT 449
Query: 359 GANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKA 392
W M ++YL+ Q+V L + RFR L+ A+K++
Sbjct: 450 SKGWWEMKKIYLQNQDVMLSLHRFRPLLEQAFKES 484
>gi|413949498|gb|AFW82147.1| glycosyltransferase [Zea mays]
Length = 543
Score = 251 bits (641), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 145/408 (35%), Positives = 211/408 (51%), Gaps = 42/408 (10%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ CDRSH D+C + G DP TS+ + + S P K +PY RK + S MS
Sbjct: 128 LCCDRSHYRSDVCYLRGDVRTDPSTSSVLL--YNAARGSGPE---KVRPYTRKFEDSVMS 182
Query: 74 KVKELTIT--------TSAPPNLSCGVTHTS--PALVFSAGGYNGNFFHEFMDCFVPLFI 123
+ E+TI +++ C V H PA+V S GGY GN +HEF D +PLF+
Sbjct: 183 SIDEVTIVPVPGGHNASASALRRRCDVRHPRGVPAVVLSTGGYTGNVYHEFSDGLIPLFV 242
Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
T +V+L + + + W +Y +L + T + +++ + HCF + +GL H
Sbjct: 243 TAQRF--GGEVVLVVLEYHYWWLGRYGAVLEQLTNYKVVDFRSDRRVHCFDEMIVGLRIH 300
Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH--------------------- 222
G +V++P L+P K++ DFQ+ L +Y+ T T S+
Sbjct: 301 GELVVDPKLMPNGKSIKDFQALLHRSYSRTTTTPSASPPVPVPLPLAPPSRSRSRACPRP 360
Query: 223 -TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAM 281
KPKL++ R R +LNL V A GF + T LA + S AM
Sbjct: 361 AAKPKLLIFIRKQ--NRVLLNLPHVVTACRRAGFAPHVMNLRRQTPLAAIHAALASADAM 418
Query: 282 VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLV 341
V VHGA +TH LF+RPGSVL+Q+VP+G W + ++ KPA+ LGLEYLEYK+ EESSL
Sbjct: 419 VAVHGAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLEYLEYKVAPEESSLA 478
Query: 342 EKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
+YG + VL+NP + W M +VY+ QNV ++I RF L+ A
Sbjct: 479 AEYGLDSTVLRNPWVISSRGWWEMKKVYMDRQNVTVNIKRFGELLRTA 526
>gi|356572020|ref|XP_003554168.1| PREDICTED: uncharacterized glycosyltransferase AGO61-like [Glycine
max]
Length = 530
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 146/433 (33%), Positives = 218/433 (50%), Gaps = 54/433 (12%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFF-----SVGHTDSTPSQPSLRIK 59
P + +I CDRS D+C++ G +S+ F S + + K
Sbjct: 90 PCSSVSTGTICCDRSGYRSDVCVMKGDIRTHSASSSVFLYNSRSNNNVSRNFEEELQHEK 149
Query: 60 TQPYPRKSDKSAMSKVKELTITTSAPPNLS------CGVTHTSPALVFSAGGYNGNFFHE 113
+PY RK + S M + EL + P NL+ C V H PA+ FS GGY GN +HE
Sbjct: 150 IKPYTRKWETSVMDTIDELNL---VPKNLNLGGVGGCDVQHDVPAVFFSNGGYTGNVYHE 206
Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
F D +PL+IT + HF + V+ I + + W KY ++L R + P I+ THCF
Sbjct: 207 FNDGIIPLYIT-SQHF-KKKVVFVILEYHSWWIMKYGDILSRLSDFPPIDFRGDNRTHCF 264
Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY---------NENTNTSSSFHHT- 223
+GL H + ++ L+ K++ DF++ L AY +E
Sbjct: 265 PEAIVGLRIHDELTVDSALMRGNKSIADFRNLLDKAYWPRIKGLIRDEERKAQEKLREQV 324
Query: 224 -------------------------KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVT 258
KP LV+++R+ R I N + K A+E+GF V
Sbjct: 325 SSSESSEASQQQYIIRQQVQENPTKKPTLVILSRSG--SRAITNENLLVKMAKEIGFLVQ 382
Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
+ +P+ +T LA +R +++ M+GVHGA +TH LFLRPGSV +QVVP+GT W + Y+
Sbjct: 383 VLKPDRTTELAKVYRSLNASDVMIGVHGAAMTHFLFLRPGSVFIQVVPLGTTWAAETYYG 442
Query: 319 KPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLD 377
+PAR LGL+Y+ Y+I ES+L EKY ND +L++P + W ++YL +QNV LD
Sbjct: 443 EPARKLGLKYIGYQILPRESTLYEKYDKNDPILRDPTSINKKGWEYTKKIYLDSQNVMLD 502
Query: 378 IDRFRIYLKDAYK 390
+ RFR L AY+
Sbjct: 503 LRRFRKRLHRAYE 515
>gi|414591074|tpg|DAA41645.1| TPA: hypothetical protein ZEAMMB73_888474 [Zea mays]
Length = 547
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 138/419 (32%), Positives = 214/419 (51%), Gaps = 40/419 (9%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTF--FSVGHTDSTPSQPSLRIKTQP 62
P + S+ CDR+ D+C G +++F S G++ + + + +P
Sbjct: 119 PCSAVGNDSLCCDRTSERADICFAKGDVRMHSASASFQLVSAGNSSAAAEEEE---RIRP 175
Query: 63 YPRKSDKSAMSKVKELTITTSAPPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
Y RK + + M+ + E+ + P + + C V H +PA++FS GGY GN +HEF D +PL
Sbjct: 176 YTRKWEANVMATIDEVRLRRVLPGDAARCDVRHDAPAVLFSTGGYTGNVYHEFNDGILPL 235
Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
F+T +H + V+L I + +D W KY +++ + + P I+ HCF + GL
Sbjct: 236 FVT--AHHLRRRVVLVILEYHDWWMTKYGDVVSQLSAFPPIDFAADRRVHCFPELIAGLR 293
Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAYNENTN-------------------------- 215
HG + ++P P+ K++ DF+ L +AY
Sbjct: 294 IHGELTVDPARTPEGKSIGDFRRLLDDAYRGRIGFLERLERRAARKRLRQRRHRRGGGGG 353
Query: 216 ---TSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSF 272
+P+LV+V+R R I N +V A ++GFDV + PE +T L +
Sbjct: 354 APPAPGGPREDRPRLVIVSRTG--SRVIENEADVAALAADVGFDVRVIRPERTTELCKIY 411
Query: 273 RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
R ++ AMVGVHGA +TH LF+RPG V +QVVP+GT W + Y+ +PA +GL Y+ Y+
Sbjct: 412 RELNGSDAMVGVHGAAMTHFLFMRPGKVFIQVVPLGTHWAAGAYYGEPAARMGLRYVGYR 471
Query: 333 IKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYK 390
I EESSL +Y D VL +P A W +VYL QNV+LD+ RFR L A++
Sbjct: 472 ILPEESSLSREYPTGDPVLTDPAGVAQRGWDVTKKVYLDRQNVRLDLARFREELVRAHR 530
>gi|219884207|gb|ACL52478.1| unknown [Zea mays]
Length = 543
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 145/408 (35%), Positives = 210/408 (51%), Gaps = 42/408 (10%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ CDRSH D+C + G DP TS+ + + S P K +PY RK + S MS
Sbjct: 128 LCCDRSHYRSDVCYLRGDVRTDPSTSSVLL--YNAARGSGPE---KVRPYTRKFEDSVMS 182
Query: 74 KVKELTIT--------TSAPPNLSCGVTHTS--PALVFSAGGYNGNFFHEFMDCFVPLFI 123
+ E+TI +++ C V H PA+V S GGY GN +HEF D +PLF+
Sbjct: 183 SIDEVTIVPVPGGHNASASALRRRCDVRHPRGVPAVVLSTGGYTGNVYHEFSDGLIPLFV 242
Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
T +V+L + + W +Y +L + T + +++ + HCF + +GL H
Sbjct: 243 TAQRF--GGEVVLVVLGYHYWWLGRYGAVLEQLTNYKVVDFRSDRRVHCFDEMIVGLRIH 300
Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH--------------------- 222
G +V++P L+P K++ DFQ+ L +Y+ T T S+
Sbjct: 301 GELVVDPKLMPNGKSIKDFQALLHRSYSRTTTTPSASPPVPVPLPLAPPSRSRSRACPRP 360
Query: 223 -TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAM 281
KPKL++ R R +LNL V A GF + T LA + S AM
Sbjct: 361 AAKPKLLIFIRKQ--NRVLLNLPHVVTACRRAGFAPHVMNLRRQTPLAAIHAALASADAM 418
Query: 282 VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLV 341
V VHGA +TH LF+RPGSVL+Q+VP+G W + ++ KPA+ LGLEYLEYK+ EESSL
Sbjct: 419 VAVHGAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLEYLEYKVAPEESSLA 478
Query: 342 EKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
+YG + VL+NP + W M +VY+ QNV ++I RF L+ A
Sbjct: 479 AEYGLDSTVLRNPWVISSRGWWEMKKVYMDRQNVTVNIKRFGELLRTA 526
>gi|326512050|dbj|BAJ96006.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 553
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 142/416 (34%), Positives = 207/416 (49%), Gaps = 44/416 (10%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ CDRSH D+C + G A DP TS+ + K +PY RK ++S M
Sbjct: 134 LCCDRSHYRSDVCYLRGDARTDPSTSSVLLYNAPRGAAPE-----KVRPYTRKFEESIMR 188
Query: 74 KVKELTIT------------TSAPPNLSCGVTHTS--PALVFSAGGYNGNFFHEFMDCFV 119
+ E+TI T C V H + PA+VFS GGY GN +HEF D +
Sbjct: 189 SIDEVTIVPVDSTGNGTDKDTRDSLRRRCDVRHPAGVPAVVFSTGGYTGNVYHEFSDGLI 248
Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLG 179
PLF+T +V+ + + + W +Y +L R T + II+ HCF + +G
Sbjct: 249 PLFVTAERF--GGEVVFVVLEYHYWWLGRYGAILERLTNYKIIDFRYDRRVHCFSEMIVG 306
Query: 180 LISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSF------------------- 220
L HG +V++P L+P K++ DFQ+ L Y+ + +S+
Sbjct: 307 LRIHGELVVDPKLMPNGKSIKDFQALLHQGYSGKPSATSAAPLPLPLATPSRPCVRPDDH 366
Query: 221 -HHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCH 279
KPK+++ R + R ILNL V A GF + T L + S
Sbjct: 367 AKAAKPKMLIFIR--KQNRVILNLPHVVTACRRAGFAPHVMNLRRQTPLPVIHAALASAD 424
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
AMV VHGA +TH LF+RPG+VL+Q+VP+G W + ++ KPA+ LGLEYLEY++ EESS
Sbjct: 425 AMVAVHGAAVTHFLFMRPGTVLLQIVPVGLDWAAESFYGKPAQQLGLEYLEYRVAPEESS 484
Query: 340 LVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
L +YG N V+++P + W M +VY+ QNV + + RF L+ A KK
Sbjct: 485 LAAEYGLNSTVVQDPSVISSQGWWEMKKVYMDRQNVTVSVKRFGELLRAAKLHLKK 540
>gi|326496216|dbj|BAJ94570.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 553
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 142/416 (34%), Positives = 207/416 (49%), Gaps = 44/416 (10%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ CDRSH D+C + G A DP TS+ + K +PY RK ++S M
Sbjct: 134 LCCDRSHYRSDVCYLRGDARTDPSTSSVLLYNAPRGAAPE-----KVRPYTRKFEESIMR 188
Query: 74 KVKELTIT------------TSAPPNLSCGVTHTS--PALVFSAGGYNGNFFHEFMDCFV 119
+ E+TI T C V H + PA+VFS GGY GN +HEF D +
Sbjct: 189 SIDEVTIVPVDSTGNGTDKDTRDSLRRRCDVRHPAGVPAVVFSTGGYTGNVYHEFSDGLI 248
Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLG 179
PLF+T +V+ + + + W +Y +L R T + II+ HCF + +G
Sbjct: 249 PLFVTAERF--GGEVVFVVLEYHYWWLGRYGAILERLTNYKIIDFRYDRRVHCFSEMIVG 306
Query: 180 LISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSF------------------- 220
L HG +V++P L+P K++ DFQ+ L Y+ + +S+
Sbjct: 307 LRIHGELVVDPKLMPNGKSIKDFQALLHQGYSGKPSATSAAPLPLPLATPSRPCVRPDDH 366
Query: 221 -HHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCH 279
KPK+++ R + R ILNL V A GF + T L + S
Sbjct: 367 AKAAKPKMLIFIR--KQNRVILNLPHVVTACRRAGFAPHVMNLRRQTPLPVIHAALASAD 424
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
AMV VHGA +TH LF+RPG+VL+Q+VP+G W + ++ KPA+ LGLEYLEY++ EESS
Sbjct: 425 AMVAVHGAAVTHFLFMRPGTVLLQIVPVGLDWAAESFYGKPAQQLGLEYLEYRVAPEESS 484
Query: 340 LVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
L +YG N V+++P + W M +VY+ QNV + + RF L+ A KK
Sbjct: 485 LAAEYGLNSTVVQDPSVISSQGWWEMKKVYMDRQNVTVSVKRFGELLRAAKLHLKK 540
>gi|297739632|emb|CBI29814.3| unnamed protein product [Vitis vinifera]
Length = 486
Score = 247 bits (630), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 142/388 (36%), Positives = 210/388 (54%), Gaps = 17/388 (4%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
+I CDRS D C + G +S+ F G +IK PY RK + S M
Sbjct: 89 TICCDRSAFRTDTCFMKGDVRTHSSSSSIFLYGKEFEEEQLQHEKIK--PYTRKWEASVM 146
Query: 73 SKVKELTITTSAP---PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
+ EL + + + C V H PA+ FS GGY GN +HEF D +PL+IT S
Sbjct: 147 DTIDELHLISKKEGYGTHHHCDVHHDVPAVFFSTGGYTGNIYHEFNDGILPLYIT--SQH 204
Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
++ V+ I + +D W KY +++ + + +P I+ + THCF +GL H + ++
Sbjct: 205 LNKRVVFVILEYHDWWITKYGDVISQLSDYPPIDFSGDNRTHCFPEAIVGLRIHDELTVD 264
Query: 190 PTLLPKPKTLVDFQSFLANAYNENTNT------SSSFHH-TKPKLVLVNRNARVGRTILN 242
+L+ +++ DF++ L AY + H KPKLV+++R R I N
Sbjct: 265 SSLVEGNESIRDFRNLLDQAYLPRIRSLIQAKEQKQVHQLKKPKLVVLSRTG--ARAITN 322
Query: 243 LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
+ + A+E+GF V + P +T LA +R ++S AMVGVHGA +TH LFL+PGSV +
Sbjct: 323 EDLMVQMAKEIGFQVKVLRPNRATELAKIYRVLNSSDAMVGVHGAAMTHILFLQPGSVFI 382
Query: 303 QVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW 362
QV+P+GT+W + Y+ +PA LGL+Y+ YKI ESSL +Y +D VL++P + A W
Sbjct: 383 QVIPLGTEWPAETYYGEPAEKLGLKYMGYKILPRESSLYNEYAKDDPVLRDPDSLAKKGW 442
Query: 363 S-NMRVYLKTQNVKLDIDRFRIYLKDAY 389
RVYL Q V LD+ RFR L AY
Sbjct: 443 EFTKRVYLDHQTVTLDLRRFRKQLVCAY 470
>gi|195616038|gb|ACG29849.1| glycosyltransferase [Zea mays]
Length = 545
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 143/410 (34%), Positives = 209/410 (50%), Gaps = 44/410 (10%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ CDRSH D+C + G DP TS+ + + S P K +PY RK + S MS
Sbjct: 128 LCCDRSHYRSDVCYLRGDVRTDPSTSSVLL--YNAARGSGPE---KVRPYTRKFEDSVMS 182
Query: 74 KVKELTIT--------TSAPPNLSCGVTHTS--PALVFSAGGYNGNFFHEFMDCFVPLFI 123
+ E+TI +++ C V H PA+V S GGY GN +HEF D +PLF+
Sbjct: 183 SIDEVTIVPVPGGHNASASALRRRCDVRHPRGVPAVVLSTGGYTGNVYHEFSDGLIPLFV 242
Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
T +V+L + + + W +Y +L + T + +++ + HCF + +GL H
Sbjct: 243 TAQRF--GGEVVLVVLEYHYWWLGRYGAVLEQLTNYKVVDFRSDRRVHCFDEMIVGLRIH 300
Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAY------------------------NENTNTSSS 219
G +V++P L+P K++ DFQ+ L Y + + + +
Sbjct: 301 GELVVDPKLMPNGKSIKDFQALLHRGYSRTTTTTTPSASPPVPLPLPLAPLSRSRSRACP 360
Query: 220 FHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCH 279
KPKL++ R R +LNL V A GF + T LA + S
Sbjct: 361 RPAAKPKLLIFIRKQ--NRVLLNLPHVVTACRRAGFAPHVMNLRRQTPLAAIHAALASAD 418
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
AMV VHGA +TH LF+RPGSVL+QVVP+G W + ++ KPA+ LGLEYLEYK+ EESS
Sbjct: 419 AMVAVHGAAVTHFLFMRPGSVLLQVVPVGLDWAADAFYGKPAQQLGLEYLEYKVAPEESS 478
Query: 340 LVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
L +YG + VL+NP + W M +VY+ QNV ++I RF L+ A
Sbjct: 479 LAAEYGLDSTVLRNPWVISSRGWWEMKKVYMDRQNVTVNIKRFGELLRTA 528
>gi|357129279|ref|XP_003566292.1| PREDICTED: uncharacterized protein LOC100836135 [Brachypodium
distachyon]
Length = 555
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 143/411 (34%), Positives = 207/411 (50%), Gaps = 45/411 (10%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ CDRSH D+C + G A DP TS+ + K +PY RK + S MS
Sbjct: 131 LCCDRSHYRSDVCYLRGDARTDPSTSSVLLYNAPRGAAPE-----KVRPYTRKFEGSIMS 185
Query: 74 KVKELTITTSAPPN--------------LSCGVTHTS--PALVFSAGGYNGNFFHEFMDC 117
+ E+TI N C V H + PA+VFS GGY GN +HEF D
Sbjct: 186 TIDEVTILPVPDGNGTSDAASRDRDSLRRRCDVRHPAGVPAVVFSTGGYTGNVYHEFSDG 245
Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
+PLFIT S +V+ + + + W +Y +L R T + I++ HCF +
Sbjct: 246 LIPLFIT--SQRFAGEVVFVVLEYHYWWLGRYGAILERLTNYKIVDFRYDRRVHCFSEMI 303
Query: 178 LGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTS-------------------S 218
+GL HG +V++P L+P K++ DFQ+ L Y++ +S +
Sbjct: 304 VGLRIHGELVVDPKLMPNGKSIKDFQALLHQGYSKAPASSSSAPVPLPLVPLSRPCLDHA 363
Query: 219 SFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSC 278
+ KPK+++ R + R ILNL V A GF + T L + S
Sbjct: 364 TTKAAKPKMLIFIR--KQNRVILNLPHVVTACRRAGFAPHVMNLRRQTPLPVIHAALASA 421
Query: 279 HAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEES 338
AMV VHGA +TH LF+RPG+VL+Q+VP+G W + ++ KPA+ LGLEYLEYK+ EES
Sbjct: 422 DAMVAVHGAAVTHFLFMRPGTVLLQIVPVGLDWAAESFYGKPAQQLGLEYLEYKVAPEES 481
Query: 339 SLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
SL +YG + V+++P + W M +VY+ QNV + I RF L+ A
Sbjct: 482 SLAAEYGLDSTVVRDPSVISSRGWWEMKKVYMDRQNVTVGIKRFGELLRAA 532
>gi|125559459|gb|EAZ04995.1| hypothetical protein OsI_27175 [Oryza sativa Indica Group]
Length = 433
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 143/418 (34%), Positives = 211/418 (50%), Gaps = 44/418 (10%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRI-KTQPYPRKSDKSA 71
S+ CDR+ D+C G +++F V + + ++ + +PY RK + +
Sbjct: 5 SLCCDRTSTRADICFARGDVRMHSASASFQLVSSSSGNATAAAVEEERIRPYTRKWEANV 64
Query: 72 MSKVKELTIT-TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
M+ + E+ + A C V H PA+VFS GGY GN +HEF D +PLF+T N H
Sbjct: 65 MATIDEVRLRRVPAGGAARCDVVHDVPAVVFSTGGYTGNVYHEFNDGILPLFVTSN-HL- 122
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
+ V+ I + +D W KY +++ R + P I+ HCF V GL HG + ++P
Sbjct: 123 RRRVVFVILEYHDWWMTKYGDVVSRLSAFPPIDFTADRRVHCFPEVIAGLRIHGELTVDP 182
Query: 191 TLLPKPKTLVDFQSFLANAYNEN------------------------TNTS--------- 217
P+ K++ F++ L +AY T TS
Sbjct: 183 EKTPEGKSIRHFRTLLDDAYRGRIQYLERLERRAARSRKRRAAAAKPTTTSIALPIMAPP 242
Query: 218 ----SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
S +P+LV+V+R R I N +V A ++GFDV + PE +T L +R
Sbjct: 243 AKQASPSPPDRPRLVIVSRTG--SRVIENEADVAALAADVGFDVRVVRPERTTELCKIYR 300
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKI 333
+++ AMVGVHGA +TH LF+RPG V +QVVP+GT W + Y+ +PA LGL Y+ YKI
Sbjct: 301 ELNASDAMVGVHGAAMTHFLFMRPGKVFVQVVPLGTDWAAGAYYGEPAARLGLRYVGYKI 360
Query: 334 KQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYK 390
EESSL +Y D VL +P W +VYL QNV+LD+ RFR L A++
Sbjct: 361 LPEESSLSREYPTGDPVLTDPAGVGKRGWDVTKKVYLDRQNVRLDLPRFREVLVGAHR 418
>gi|125601368|gb|EAZ40944.1| hypothetical protein OsJ_25426 [Oryza sativa Japonica Group]
Length = 555
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 143/418 (34%), Positives = 211/418 (50%), Gaps = 44/418 (10%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRI-KTQPYPRKSDKSA 71
S+ CDR+ D+C G +++F V + + ++ + +PY RK + +
Sbjct: 127 SLCCDRTSTRADICFARGDVRMHSASASFQLVSSSSGNATAAAVEEERIRPYTRKWEANV 186
Query: 72 MSKVKELTIT-TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
M+ + E+ + A C V H PA+VFS GGY GN +HEF D +PLF+T N H
Sbjct: 187 MATIDEVRLRRVPAGGAARCDVVHDVPAVVFSTGGYTGNVYHEFNDGILPLFVTSN-HL- 244
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
+ V+ I + +D W KY +++ R + P I+ HCF V GL HG + ++P
Sbjct: 245 RRRVVFVILEYHDWWMTKYGDVVSRLSAFPPIDFTADRRVHCFPEVIAGLRIHGELTVDP 304
Query: 191 TLLPKPKTLVDFQSFLANAYNEN------------------------TNTS--------- 217
P+ K++ F++ L +AY T TS
Sbjct: 305 EKTPEGKSIRHFRTLLDDAYRGRIQYLERLERRAARSRKRRAAAAKPTTTSIALPIMAPP 364
Query: 218 ----SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
S +P+LV+V+R R I N +V A ++GFDV + PE +T L +R
Sbjct: 365 AKQASPSPPDRPRLVIVSRTG--SRVIENEADVAALAADVGFDVRVVRPERTTELCKIYR 422
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKI 333
+++ AMVGVHGA +TH LF+RPG V +QVVP+GT W + Y+ +PA LGL Y+ YKI
Sbjct: 423 ELNASDAMVGVHGAAMTHFLFMRPGKVFVQVVPLGTDWAAGAYYGEPAARLGLRYVGYKI 482
Query: 334 KQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYK 390
EESSL +Y D VL +P W +VYL QNV+LD+ RFR L A++
Sbjct: 483 LPEESSLSREYPTGDPVLTDPAGVGKRGWDVTKKVYLDRQNVRLDLPRFREVLVGAHR 540
>gi|115473825|ref|NP_001060511.1| Os07g0657400 [Oryza sativa Japonica Group]
gi|22775615|dbj|BAC15469.1| symbiosis-related protein-like protein [Oryza sativa Japonica
Group]
gi|50510132|dbj|BAD31097.1| symbiosis-related protein-like protein [Oryza sativa Japonica
Group]
gi|113612047|dbj|BAF22425.1| Os07g0657400 [Oryza sativa Japonica Group]
Length = 555
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 143/418 (34%), Positives = 211/418 (50%), Gaps = 44/418 (10%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRI-KTQPYPRKSDKSA 71
S+ CDR+ D+C G +++F V + + ++ + +PY RK + +
Sbjct: 127 SLCCDRTSTRADICFARGDVRMHSASASFQLVSSSSGNATAAAVEEERIRPYTRKWEANV 186
Query: 72 MSKVKELTIT-TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
M+ + E+ + A C V H PA+VFS GGY GN +HEF D +PLF+T N H
Sbjct: 187 MATIDEVRLRRVPAGGAARCDVVHDVPAVVFSTGGYTGNVYHEFNDGILPLFVTSN-HL- 244
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
+ V+ I + +D W KY +++ R + P I+ HCF V GL HG + ++P
Sbjct: 245 RRRVVFVILEYHDWWMTKYGDVVSRLSAFPPIDFTADRRVHCFPEVIAGLRIHGELTVDP 304
Query: 191 TLLPKPKTLVDFQSFLANAYNEN------------------------TNTS--------- 217
P+ K++ F++ L +AY T TS
Sbjct: 305 EKTPEGKSIRHFRTLLDDAYRGRIQYLERLERRAARSRKRRAAAAKPTTTSIALPIMAPP 364
Query: 218 ----SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
S +P+LV+V+R R I N +V A ++GFDV + PE +T L +R
Sbjct: 365 AKQASPSPPDRPRLVIVSRTG--SRVIENEADVAALAADVGFDVRVVRPERTTELCKIYR 422
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKI 333
+++ AMVGVHGA +TH LF+RPG V +QVVP+GT W + Y+ +PA LGL Y+ YKI
Sbjct: 423 ELNASDAMVGVHGAAMTHFLFMRPGKVFVQVVPLGTDWAAGAYYGEPAARLGLRYVGYKI 482
Query: 334 KQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYK 390
EESSL +Y D VL +P W +VYL QNV+LD+ RFR L A++
Sbjct: 483 LPEESSLSREYPTGDPVLTDPAGVGKRGWDVTKKVYLDRQNVRLDLPRFREVLVGAHR 540
>gi|168015237|ref|XP_001760157.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688537|gb|EDQ74913.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 131/399 (32%), Positives = 211/399 (52%), Gaps = 19/399 (4%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFD--PKTSTFFSVGHTDSTPSQPSLRIKTQP 62
P + + CDRS D+C + G + T F + +Q + T+P
Sbjct: 69 PGTVRDDQIFQCDRSQWRTDICNLRGDVQLEICNGTKAFVLYANQAVETTQLVRKETTKP 128
Query: 63 YPRKSDKSAMSKVKELTITTSAPPNLS-------CGVTHTSPALVFSAGGYNGNFFHEFM 115
Y RK ++ +MS V E+T+ +L+ C V H P +VFS GY GN +HEF
Sbjct: 129 YSRKWEEDSMSSVNEVTLLRMPALSLAAQATRRPCDVRHKVPGIVFSTAGYTGNLYHEFN 188
Query: 116 DCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQS 175
D +PLFIT S ++ +L I+ ++ W KY E++ + T++ II++ HCF
Sbjct: 189 DGIIPLFIT--SQHLRREAVLIISSFHNWWYSKYREVIEQITKYEIIDLERDERVHCFPE 246
Query: 176 VTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY-----NENTNTSSSFHHTKPKLVLV 230
+ GL HG + I+ +P + + +F+ L AY +E+ + + P+L ++
Sbjct: 247 IETGLHIHGELSIDANRMPLKEGIQEFRDMLNRAYKPGPEDEHKIRARLKNKINPRLTII 306
Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
R R +LNL +V AE +GF V + P+ + L F +++ ++GVHGA +T
Sbjct: 307 VRQG--TRKLLNLDDVIHLAERIGFKVHLLTPDPTMELKKIFWLLNNTDVLLGVHGAAMT 364
Query: 291 HSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
H LF+RPGSV +Q+VP+GT W + YF +P LGL+Y+ YKI+ +ESSL + Y A D V
Sbjct: 365 HFLFMRPGSVFIQIVPLGTDWAANEYFGEPVSKLGLKYMPYKIQPDESSLSDIYNATDPV 424
Query: 351 LKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
L +P W ++ ++YL+ Q+V+ + R R L+ A
Sbjct: 425 LVDPDRITQRGWGDLKKIYLEAQDVRPSLHRLRQVLQQA 463
>gi|224135457|ref|XP_002322078.1| predicted protein [Populus trichocarpa]
gi|222869074|gb|EEF06205.1| predicted protein [Populus trichocarpa]
Length = 378
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 129/368 (35%), Positives = 214/368 (58%), Gaps = 15/368 (4%)
Query: 24 DLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTI--T 81
D C I G D K+ T F V + + +PY RK D++AM V+E T+
Sbjct: 5 DFCEIKGDIRIDGKSYTVFIVSSETDILTAENTSWSIRPYARKGDQAAMGAVREWTVKLV 64
Query: 82 TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADC 141
T A C H+ PA++FSAGGY GN FH F D +PLF+T + + ++ I +
Sbjct: 65 TVASDIPQCTQNHSVPAILFSAGGYAGNHFHAFTDVILPLFLTSRPY--NGEIQFLITNG 122
Query: 142 NDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI--SHGRMVINPTLLPKPKTL 199
W K+ ++ +R+ +I+I+N HC+ S+T+GL ++ + I+P P ++
Sbjct: 123 RPAWISKFETIMKALSRYQLISIDNSQDIHCYDSMTVGLKRRTNKELSIDPDPSSSPYSM 182
Query: 200 VDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNRNARVGRTILNLREVKKAAEELGFD 256
DF+ FL ++Y+ ++ + +P+L++++R + R N+ E+ A+ LG+
Sbjct: 183 KDFRKFLRSSYSLKKAMATKIRNGSKKRPRLLIISR--KRSRAFTNVGEIVTMAKRLGYR 240
Query: 257 VTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTV 315
V + EP+ S + I+SC ++GVHGAGLT+ +FL +VL+QV+P G T+WLS
Sbjct: 241 VVVAEPDADVS--GFAQIINSCDVVMGVHGAGLTNIVFLPENAVLVQVIPFGGTEWLSRT 298
Query: 316 YFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNV 374
YFE+PA+ + + YL+YKI+ EES+L+++Y A+ +VL++P A WS + +YL+ QNV
Sbjct: 299 YFEEPAKGMNIRYLDYKIRLEESTLIQQYPADHVVLRDPSAIWKQGWSAVESIYLRQQNV 358
Query: 375 KLDIDRFR 382
L+++RFR
Sbjct: 359 TLNVNRFR 366
>gi|326520289|dbj|BAK07403.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525054|dbj|BAK07797.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 516
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 141/405 (34%), Positives = 209/405 (51%), Gaps = 18/405 (4%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSV------GHTDSTPSQPSLRI 58
P A I CDRS + D+C + G D + +F G D+ +
Sbjct: 111 PCASMGGHGICCDRSDFNTDVCFMAGDVRTDAASLSFLLFPPQPPNGTADANATVAEKEE 170
Query: 59 KTQPYPRKSDKSAMSKVKELTITTSAPPNLS---CGVTHTSPALVFSAGGYNGNFFHEFM 115
+ QPY RK +K M+K+ E+ + + P C V H +PALV +AGGY GN FH F
Sbjct: 171 RVQPYTRKWEKHLMAKIHEVRLRAARPDEADAHRCDVRHDAPALVMTAGGYTGNLFHAFN 230
Query: 116 DCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQS 175
D F+P ++T+ H + V+LA+ + N WA + EL+ + +I++ THCF
Sbjct: 231 DGFLPAWLTVQ-HL-RRRVVLAVLEYNPWWAGTFRELVSGLSDRLVIDLLRDNRTHCFPG 288
Query: 176 VTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT--KPKLVLVNRN 233
+G HG + + P +TLVDF FLA AY ++ +T +P+L L +R
Sbjct: 289 AIVGTRFHGILSVEPARTRDNRTLVDFHDFLAGAYKDDKSTPEPEKQQPRRPRLGLYSRK 348
Query: 234 ARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSL 293
R I N V + AE +GFDV+I L+ + + +C +VGVHGA LT L
Sbjct: 349 GN--RMIENEAAVARLAESVGFDVSILATANGAPLSSEYAAVSACDVLVGVHGADLTKLL 406
Query: 294 FLRPG-SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLK 352
FLRPG + L+QV P+G ++ +EK ++G+ Y +Y ESSLV KY A+D+VL+
Sbjct: 407 FLRPGHAALLQVAPLGVPHVARGCYEKATNMMGIHYEQYDAAANESSLVRKYAADDVVLR 466
Query: 353 NPQAFAGA-NWS-NMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
+P+A W VYL QNV LD+ RF L+ + +A +L
Sbjct: 467 DPEAATRERGWDLTAHVYLGGQNVTLDLGRFGDVLRKLHSRALRL 511
>gi|115463691|ref|NP_001055445.1| Os05g0391600 [Oryza sativa Japonica Group]
gi|54287512|gb|AAV31256.1| unknown protein [Oryza sativa Japonica Group]
gi|113578996|dbj|BAF17359.1| Os05g0391600 [Oryza sativa Japonica Group]
gi|215766743|dbj|BAG98971.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 581
Score = 241 bits (615), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 143/429 (33%), Positives = 210/429 (48%), Gaps = 62/429 (14%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
++ CDRSH D+C + G DP TS+ + + K +PY RK + S M
Sbjct: 141 ALCCDRSHYRSDVCYLRGDVRTDPSTSSVLLYNAPRGSAPE-----KVRPYTRKFEGSIM 195
Query: 73 SKVKELTI------------TTSAPPN-----LSCGVTHTS--PALVFSAGGYNGNFFHE 113
S + E+TI TT+ C V H PA+VFS GGY GN +HE
Sbjct: 196 STIDEVTIVPVVDAGSGSNGTTAGDTGKDSLRRRCDVRHPPGVPAVVFSTGGYTGNVYHE 255
Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
F D +PLFIT +V+ + + + W +Y +L R T + +++ HCF
Sbjct: 256 FSDGLIPLFITAQRF--AGEVVFVVLEYHYWWLGRYGAVLERLTNYKVVDFRYDRRVHCF 313
Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH----------- 222
+ +GL HG +V++P L+P K + DFQ+ L Y+ + +++
Sbjct: 314 SEMIVGLRIHGELVVDPKLMPNGKGIQDFQALLHQGYSRTPSATAAAAAAQPPVPLALAA 373
Query: 223 ---------------TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTS 267
KPKLV+ R + R +LNL + A GF + T
Sbjct: 374 PPSRPCLRPDDHAKVAKPKLVIFIR--KQNRVLLNLPHIVTACRRAGFAPHVMNLRRQTP 431
Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE 327
L + S AMV VHGA +TH LF+RPGSVL+Q+VP+G W + ++ KPA+ LGL
Sbjct: 432 LPAIHAALSSADAMVAVHGAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLG 491
Query: 328 YLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRF----- 381
YLEYK+ EESSL +YG N V+++P + W M +VY+ QNV ++I RF
Sbjct: 492 YLEYKVAPEESSLAAEYGVNSTVVRDPSVISSRGWWEMKKVYMDRQNVTVNIKRFGELLR 551
Query: 382 --RIYLKDA 388
R++LK+A
Sbjct: 552 SARLHLKNA 560
>gi|125580725|gb|EAZ21656.1| hypothetical protein OsJ_05289 [Oryza sativa Japonica Group]
Length = 510
Score = 237 bits (605), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 130/382 (34%), Positives = 214/382 (56%), Gaps = 17/382 (4%)
Query: 15 TCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
TC + + + C G + ST + +G D KT+PY R D AM+
Sbjct: 137 TCRMTSKRSERCEARGDIRVEGNASTIY-IGGIDK-------EWKTKPYARYHDPVAMAV 188
Query: 75 VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
V+E T+ + +C H+ PA VFS GG++GN +H++ D VPLF++ + F Q V
Sbjct: 189 VREFTLKPVTESSPACTRNHSVPAFVFSNGGFSGNLYHDYTDVLVPLFLSTH-QFKGQ-V 246
Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLP 194
++ W K+ + T++ I++I+N HCF + +G H M ++P P
Sbjct: 247 QFLLSGLKPWWVNKFNLFFRQLTKYDILDIDNDKDVHCFPRIVVGATFHKDMGVDPKRSP 306
Query: 195 KPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAE 251
++VDF+ L A+ +S T KP+L++++R + R LN RE+ +AA
Sbjct: 307 GHVSVVDFKRALRRAFGLERVAASRGGATGNGKPRLLIISR--KNSRRFLNEREMAQAAA 364
Query: 252 ELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-Q 310
+GF+V I EP++ T ++ + ++S M+GVHGAGLT+ +FL G+VL+QVVP G +
Sbjct: 365 AVGFEVRIAEPDQHTDMSTFAQLVNSADVMIGVHGAGLTNMVFLPRGAVLIQVVPFGGLE 424
Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
WL+TV F+ PA+ + + Y++Y ++ EESSL+++Y N VL +P A W ++ YL
Sbjct: 425 WLTTVTFKNPAKDMEVTYMDYNVQLEESSLIDQYPRNHQVLTDPYAVHKQGWDALKTAYL 484
Query: 370 KTQNVKLDIDRFRIYLKDAYKK 391
QN+K+D+DRF+ L++A +
Sbjct: 485 DKQNIKMDMDRFKKTLQEALDR 506
>gi|115444075|ref|NP_001045817.1| Os02g0135500 [Oryza sativa Japonica Group]
gi|42408837|dbj|BAD10097.1| putative HGA1 [Oryza sativa Japonica Group]
gi|42409107|dbj|BAD10357.1| putative HGA1 [Oryza sativa Japonica Group]
gi|113535348|dbj|BAF07731.1| Os02g0135500 [Oryza sativa Japonica Group]
gi|125537988|gb|EAY84383.1| hypothetical protein OsI_05759 [Oryza sativa Indica Group]
gi|215686733|dbj|BAG89583.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 485
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 130/382 (34%), Positives = 214/382 (56%), Gaps = 17/382 (4%)
Query: 15 TCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
TC + + + C G + ST + +G D KT+PY R D AM+
Sbjct: 112 TCRMTSKRSERCEARGDIRVEGNASTIY-IGGIDK-------EWKTKPYARYHDPVAMAV 163
Query: 75 VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
V+E T+ + +C H+ PA VFS GG++GN +H++ D VPLF++ + F Q V
Sbjct: 164 VREFTLKPVTESSPACTRNHSVPAFVFSNGGFSGNLYHDYTDVLVPLFLSTH-QFKGQ-V 221
Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLP 194
++ W K+ + T++ I++I+N HCF + +G H M ++P P
Sbjct: 222 QFLLSGLKPWWVNKFNLFFRQLTKYDILDIDNDKDVHCFPRIVVGATFHKDMGVDPKRSP 281
Query: 195 KPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAE 251
++VDF+ L A+ +S T KP+L++++R + R LN RE+ +AA
Sbjct: 282 GHVSVVDFKRALRRAFGLERVAASRGGATGNGKPRLLIISR--KNSRRFLNEREMAQAAA 339
Query: 252 ELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-Q 310
+GF+V I EP++ T ++ + ++S M+GVHGAGLT+ +FL G+VL+QVVP G +
Sbjct: 340 AVGFEVRIAEPDQHTDMSTFAQLVNSADVMIGVHGAGLTNMVFLPRGAVLIQVVPFGGLE 399
Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
WL+TV F+ PA+ + + Y++Y ++ EESSL+++Y N VL +P A W ++ YL
Sbjct: 400 WLTTVTFKNPAKDMEVTYMDYNVQLEESSLIDQYPRNHQVLTDPYAVHKQGWDALKTAYL 459
Query: 370 KTQNVKLDIDRFRIYLKDAYKK 391
QN+K+D+DRF+ L++A +
Sbjct: 460 DKQNIKMDMDRFKKTLQEALDR 481
>gi|226532070|ref|NP_001147864.1| glycosyltransferase [Zea mays]
gi|195614204|gb|ACG28932.1| glycosyltransferase [Zea mays]
Length = 521
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 140/416 (33%), Positives = 214/416 (51%), Gaps = 30/416 (7%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTST---FFSVGHTDSTPSQPSLRIKTQPYPRKSDK 69
+I CDRS D+C + G D + + F + S+ + +PYPRK ++
Sbjct: 110 TICCDRSSVHTDVCFMAGDVRTDAASLSLLLFPPHQQAQNGTSEEDKEERVRPYPRKWER 169
Query: 70 SAMSKVKELTITTSAP--PNLS-------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
M KV E+ + +AP P+ C V H +P LV SAGGY GN FH F D F+P
Sbjct: 170 FIMDKVPEVRLRVAAPRRPDGEREEVEHRCDVRHDAPLLVMSAGGYTGNLFHAFNDGFLP 229
Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
++T+ H + V+L + N WA + E++ + HP++++ + T THCF +G
Sbjct: 230 SWLTVQ-HL-RRRVVLGVVSYNPWWAGMFVEVISGLSDHPVVDLLHDTRTHCFPGAIVGT 287
Query: 181 ISHGRMVINPTLLPKPKTLVDFQSFLANAYN----------ENTNTSSSFHHTKPKLVLV 230
HG ++++P L KT+VDF LA+AY + + + +P+L +V
Sbjct: 288 RYHGILIVDPARLRDNKTIVDFHQMLADAYEKPPRETTRPAQQQDLRDAEQRRRPRLGIV 347
Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
+R R I N V + A +GFDV I E L+ + + +C A+VGVHGA LT
Sbjct: 348 SRKGT--RVIENQAAVARLASSVGFDVDILETANGLPLSAWYASLRACDALVGVHGADLT 405
Query: 291 HSLFLRPG-SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDL 349
LFLRPG + L Q+ P+G ++ F +PA +GL Y +Y+++ ESSL +Y D
Sbjct: 406 KFLFLRPGRASLTQIAPLGVSPIAREDFGEPAARMGLAYEQYEVRAGESSLARRYAPGDA 465
Query: 350 VLKNPQAFA--GANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDKEIKL 402
VL +P+A W + RVYL QNV LD+ RFR L + A + ++ +L
Sbjct: 466 VLTDPEAAKRDKGGWDLVARVYLGGQNVTLDLARFRQTLARMHAHAMRQRRRQERL 521
>gi|413945240|gb|AFW77889.1| hypothetical protein ZEAMMB73_009888 [Zea mays]
Length = 555
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 142/415 (34%), Positives = 210/415 (50%), Gaps = 48/415 (11%)
Query: 6 AALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPR 65
AA + CDRSH D+C + G DP T++ + S P K +PY R
Sbjct: 124 AAADAEGVLCCDRSHYRSDVCYLRGDVRTDPSTTSVLL--YNAPRGSAPE---KVRPYTR 178
Query: 66 KSDKSAMSKVKELTITTS--------APPNLS---------CGVTHTS--PALVFSAGGY 106
K + S MS + E+TI A N+S C V H A+VFS GGY
Sbjct: 179 KFEGSVMSSIDEVTIVPVPPGGGGGGAHSNMSVGVGGGGRRCDVRHPRGVAAVVFSTGGY 238
Query: 107 NGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININN 166
GN +HEF D VPLF+T +V+ + + + W +Y +L + T + +++ +
Sbjct: 239 TGNVYHEFSDGLVPLFVTAQRF--AGEVVFVVLEYHYWWLGRYGAVLEQLTNYRVVDFRH 296
Query: 167 QTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH---- 222
HCF + +GL HG +V++P L+P K++ DFQ+ L Y+ T T+S+
Sbjct: 297 DRRVHCFDEMVVGLRIHGELVVDPKLMPDGKSIKDFQALLHQGYSRTTTTASASSSSPTP 356
Query: 223 ---------------TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTS 267
KPK+++ R + R +LNL +V A GF + T
Sbjct: 357 LLPLAPPTRPCPRSANKPKMLIFIR--KQNRVLLNLPQVVTACRRAGFAPHVMNLRRQTP 414
Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE 327
LA + S AMV VHGA +TH LF+RPGSVL+Q+VP+G W + ++ KPA+ LGL+
Sbjct: 415 LAAIHAALASADAMVAVHGAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLD 474
Query: 328 YLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRF 381
YLEY++ EESSL +YG N V++ P + W M +VY+ QNV +++ RF
Sbjct: 475 YLEYRVAPEESSLAAEYGLNSTVVRTPWVISSRGWWEMKKVYMDRQNVTVNVKRF 529
>gi|225441880|ref|XP_002284307.1| PREDICTED: uncharacterized protein LOC100245312 isoform 1 [Vitis
vinifera]
Length = 529
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 144/429 (33%), Positives = 212/429 (49%), Gaps = 56/429 (13%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVG---------HTDSTPSQPS-------L 56
+I CDRS D C + G +S+ F G H S +
Sbjct: 89 TICCDRSAFRTDTCFMKGDVRTHSSSSSIFLYGSRNPNAFTDHVSSISGEGKEFEEEQLQ 148
Query: 57 RIKTQPYPRKSDKSAMSKVKELTITTSAP---PNLSCGVTHTSPALVFSAGGYNGNFFHE 113
K +PY RK + S M + EL + + + C V H PA+ FS GGY GN +HE
Sbjct: 149 HEKIKPYTRKWEASVMDTIDELHLISKKEGYGTHHHCDVHHDVPAVFFSTGGYTGNIYHE 208
Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
F D +PL+IT S ++ V+ I + +D W KY +++ + + +P I+ + THCF
Sbjct: 209 FNDGILPLYIT--SQHLNKRVVFVILEYHDWWITKYGDVISQLSDYPPIDFSGDNRTHCF 266
Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY----------------------- 210
+GL H + ++ +L+ +++ DF++ L AY
Sbjct: 267 PEAIVGLRIHDELTVDSSLVEGNESIRDFRNLLDQAYLPRIRSLIQAKEQKVQSKMKEDP 326
Query: 211 --------NENTNTSSSFHH-TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFE 261
T H KPKLV+++R R I N + + A+E+GF V +
Sbjct: 327 SLPPSLKPPLETGKEEQVHQLKKPKLVVLSRTG--ARAITNEDLMVQMAKEIGFQVKVLR 384
Query: 262 PEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPA 321
P +T LA +R ++S AMVGVHGA +TH LFL+PGSV +QV+P+GT+W + Y+ +PA
Sbjct: 385 PNRATELAKIYRVLNSSDAMVGVHGAAMTHILFLQPGSVFIQVIPLGTEWPAETYYGEPA 444
Query: 322 RVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDR 380
LGL+Y+ YKI ESSL +Y +D VL++P + A W RVYL Q V LD+ R
Sbjct: 445 EKLGLKYMGYKILPRESSLYNEYAKDDPVLRDPDSLAKKGWEFTKRVYLDHQTVTLDLRR 504
Query: 381 FRIYLKDAY 389
FR L AY
Sbjct: 505 FRKQLVCAY 513
>gi|242060410|ref|XP_002451494.1| hypothetical protein SORBIDRAFT_04g002850 [Sorghum bicolor]
gi|241931325|gb|EES04470.1| hypothetical protein SORBIDRAFT_04g002850 [Sorghum bicolor]
Length = 499
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 130/386 (33%), Positives = 209/386 (54%), Gaps = 24/386 (6%)
Query: 15 TCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
TC + + + C +G D S + G KT+PY R D AM+
Sbjct: 125 TCYMTSKRSERCDASGDIRVDGNRSAIYVSGIDKE--------WKTKPYARYHDPVAMAH 176
Query: 75 VKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
V+E T+ AP +C H+ P +FS GG++GN +H++ D VPLFI+ +H
Sbjct: 177 VREYTLKPLPAAEAP---ACTRNHSVPGFLFSNGGFSGNLYHDYTDVLVPLFIS--THQF 231
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
V ++ W K+ + T++ +I+++N HCF + G H M ++P
Sbjct: 232 RGRVQFLLSGMKPWWVAKFTPFFRQLTKYDVIDVDNDQEVHCFPRIVAGATFHKDMGVDP 291
Query: 191 TLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
P ++VDF+ L A+ E + + H KP+L++++R R R LN RE+
Sbjct: 292 RRSPGHVSVVDFKRALRRAFGLEREAASRGGATGHGKPRLLIISR--RGSRRFLNEREMA 349
Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
+AA + GF+V + EP++ T +A ++S MVGVHGAGLT+ +FL G+VL+QVVP
Sbjct: 350 RAAADAGFEVRVAEPDQHTDMATFAALVNSADVMVGVHGAGLTNMVFLPRGAVLIQVVPF 409
Query: 308 GT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
G +WL++V F+ PA + + Y++Y +K EESSL+++Y N VL +P A W ++
Sbjct: 410 GGLEWLTSVTFKDPAADMEVNYMDYNVKLEESSLLDQYPRNHQVLTDPYAVHKQGWDALK 469
Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKK 391
YL QN+++D+DRFR L++A +
Sbjct: 470 TAYLDKQNIRMDLDRFRATLREAMSR 495
>gi|42408838|dbj|BAD10098.1| putative HGA1 [Oryza sativa Japonica Group]
gi|42409108|dbj|BAD10358.1| putative HGA1 [Oryza sativa Japonica Group]
Length = 371
Score = 235 bits (599), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 132/383 (34%), Positives = 214/383 (55%), Gaps = 21/383 (5%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+T RS R C G + ST + +G D KT+PY R D AM+
Sbjct: 1 MTSKRSER----CEARGDIRVEGNASTIY-IGGIDK-------EWKTKPYARYHDPVAMA 48
Query: 74 KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
V+E T+ + +C H+ PA VFS GG++GN +H++ D VPLF++ + F Q
Sbjct: 49 VVREFTLKPVTESSPACTRNHSVPAFVFSNGGFSGNLYHDYTDVLVPLFLSTH-QFKGQ- 106
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
V ++ W K+ + T++ I++I+N HCF + +G H M ++P
Sbjct: 107 VQFLLSGLKPWWVNKFNLFFRQLTKYDILDIDNDKDVHCFPRIVVGATFHKDMGVDPKRS 166
Query: 194 PKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAA 250
P ++VDF+ L A+ +S T KP+L++++R + R LN RE+ +AA
Sbjct: 167 PGHVSVVDFKRALRRAFGLERVAASRGGATGNGKPRLLIISR--KNSRRFLNEREMAQAA 224
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT- 309
+GF+V I EP++ T ++ + ++S M+GVHGAGLT+ +FL G+VL+QVVP G
Sbjct: 225 AAVGFEVRIAEPDQHTDMSTFAQLVNSADVMIGVHGAGLTNMVFLPRGAVLIQVVPFGGL 284
Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VY 368
+WL+TV F+ PA+ + + Y++Y ++ EESSL+++Y N VL +P A W ++ Y
Sbjct: 285 EWLTTVTFKNPAKDMEVTYMDYNVQLEESSLIDQYPRNHQVLTDPYAVHKQGWDALKTAY 344
Query: 369 LKTQNVKLDIDRFRIYLKDAYKK 391
L QN+K+D+DRF+ L++A +
Sbjct: 345 LDKQNIKMDMDRFKKTLQEALDR 367
>gi|224118736|ref|XP_002317893.1| predicted protein [Populus trichocarpa]
gi|222858566|gb|EEE96113.1| predicted protein [Populus trichocarpa]
Length = 383
Score = 234 bits (597), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 132/376 (35%), Positives = 211/376 (56%), Gaps = 18/376 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
C RS D C I G D + T F V + + + +PY RK D++AM V
Sbjct: 5 CTIMGRS-DFCEIKGDIRIDGSSYTVFIVSSETDILAAENTSWRIRPYARKGDQTAMGAV 63
Query: 76 KELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
+E T+ A + C H+ P ++FSAGGY GN FH F D VPLF T + + +
Sbjct: 64 REWTLKLVAGGSDIPQCTQNHSVPGILFSAGGYAGNHFHAFTDIIVPLFSTARPY--NGE 121
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI--SHGRMVINPT 191
V I + W K+ +L +R+ +INI+N+ HCF S+T+GL S+ + I+P+
Sbjct: 122 VQFIITNGWSAWIAKFKTILKALSRYELINIDNRKDIHCFGSMTVGLKRPSYKELSIDPS 181
Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSSSFH---HTKPKLVLVNRNARVGRTILNLREVKK 248
P ++ DF+ FL ++Y+ + +P+L++++R + R N+ E+
Sbjct: 182 --KSPYSIKDFRQFLRSSYSLKKTRAIKIRDGMKKRPRLLIISR--KRSRAFTNVGEIVN 237
Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
AE LGF V + EP ++ + I+SC ++GVHGAGLT+ +FL +VL+QV+P G
Sbjct: 238 MAERLGFRVVVAEP--GMDVSGFSQIINSCDVVMGVHGAGLTNIVFLPEKAVLIQVIPFG 295
Query: 309 -TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
+WLS +FE+PA+ + + YL+YKI+ EES+L+++Y A+ VL++P W +
Sbjct: 296 GAEWLSRTFFEEPAKDMNIRYLDYKIRVEESTLIQQYPADHAVLRDPSVIGKQGWLAFQS 355
Query: 367 VYLKTQNVKLDIDRFR 382
+YL+ QNV +D++RFR
Sbjct: 356 IYLQKQNVTIDVNRFR 371
>gi|242059953|ref|XP_002459122.1| hypothetical protein SORBIDRAFT_03g046290 [Sorghum bicolor]
gi|241931097|gb|EES04242.1| hypothetical protein SORBIDRAFT_03g046290 [Sorghum bicolor]
Length = 527
Score = 234 bits (596), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 142/395 (35%), Positives = 209/395 (52%), Gaps = 19/395 (4%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFF----SVGHTDSTPSQPSLRIKTQPYPRKSD 68
+I CDRS D+C ++G D + + D + S +PYPRK +
Sbjct: 126 TICCDRSSVHTDVCFMSGDVRTDAASLSLLLFPPQQHRHDQALNGTSEEETVRPYPRKWE 185
Query: 69 KSAMSKVKELTITTSAP----PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT 124
M KV E+ + +AP + C V H +P LV SAGGY GN FH F D F+P ++T
Sbjct: 186 SFIMDKVPEVRLRVAAPRGAEEDHRCDVQHDAPLLVMSAGGYTGNLFHAFNDGFLPSWVT 245
Query: 125 INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHG 184
+ H + V+L + N WA ++E++ + H ++++ + T THCF +G HG
Sbjct: 246 VQ-HL-RRRVVLGVLSYNPWWAGMFSEVISGLSDHHVVDLLHDTRTHCFPGAIVGTRYHG 303
Query: 185 RMVINPTLLPKPKTLVDFQSFLANAYN----ENTNTSSSFHHTKPKLVLVNRNARVGRTI 240
+V++ L KT+VDF LA+AY E T T++ +P+L +V+R R I
Sbjct: 304 ILVVDSARLRDNKTIVDFHQMLADAYERPPRETTTTTTVEQRRRPRLGIVSRKGT--RVI 361
Query: 241 LNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG-S 299
N V + A +GFDV I E + L+ + + +C A+VGVHGA LT LFLRPG +
Sbjct: 362 ENQAAVARLASSVGFDVEILETADGRPLSTWYASLRACDALVGVHGADLTKFLFLRPGHA 421
Query: 300 VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ-AFA 358
L Q+ P+G ++ F PA +GLEY +Y+++ ESSL Y A+D VL +P+ A
Sbjct: 422 SLTQIAPLGVSPIAQEDFGVPAARMGLEYEQYEVRAGESSLARLYAADDAVLADPEKAMR 481
Query: 359 GANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKA 392
W + RVYL QNV LD+ RFR L + A
Sbjct: 482 EQGWDLVARVYLGGQNVTLDLARFRRTLARMHAHA 516
>gi|302766161|ref|XP_002966501.1| glycosyltransferase, CAZy family GT61 [Selaginella moellendorffii]
gi|300165921|gb|EFJ32528.1| glycosyltransferase, CAZy family GT61 [Selaginella moellendorffii]
Length = 441
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 132/387 (34%), Positives = 203/387 (52%), Gaps = 14/387 (3%)
Query: 7 ALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRK 66
L+ +S++CD SH S +C + G K S + + + ++ K +PY RK
Sbjct: 65 GLASRESVSCDFSHPSTSICSLQGDVRV--KGSNLTVALTSANQSAHSNVLAKIRPYTRK 122
Query: 67 SDKSAMSKVKELT-ITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
+K M + E+ ++ ++C V H+ PA++FS GGY G+ FH+F D VPLFIT
Sbjct: 123 WEK-MMKTIGEVNMVSLPKSKQMACDVRHSVPAVIFSTGGYTGSIFHDFNDGLVPLFITA 181
Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR 185
+V+ + W KYA +L T +P+++ + + + HCF V +GL HG
Sbjct: 182 QRF--KGEVVFMVLQFKHWWPGKYAPILKHLTHYPVVDFDREQLVHCFPKVIVGLRIHGD 239
Query: 186 MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE 245
++I L ++ FQ+ L A N TKP LVLVNR R I+N E
Sbjct: 240 LLIEEGL--AGTSMRSFQNLLDIALNPG---QVVLPKTKPMLVLVNRET--SRVIVNRNE 292
Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
AE+LG++V F P +T L++ + +HS ++GVHGA LTH LF+RPGS L+Q++
Sbjct: 293 TIALAEKLGYEVHTFAPNFNTRLSEIYSLLHSADVLIGVHGAALTHFLFMRPGSTLIQII 352
Query: 306 PIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
P G + F +PA GL Y+ Y+I ES+L +++G N V+ NP W+
Sbjct: 353 PFGLNGPAETCFGRPAEKAGLNYVGYQILPSESTLSDEFGLNHTVIVNPDEVTAKGWTER 412
Query: 366 -RVYLKTQNVKLDIDRFRIYLKDAYKK 391
R+YL QN++L + R LKD +
Sbjct: 413 KRIYLDGQNIRLHLPRLEQILKDVLTR 439
>gi|242093072|ref|XP_002437026.1| hypothetical protein SORBIDRAFT_10g018300 [Sorghum bicolor]
gi|241915249|gb|EER88393.1| hypothetical protein SORBIDRAFT_10g018300 [Sorghum bicolor]
Length = 572
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 139/382 (36%), Positives = 209/382 (54%), Gaps = 21/382 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTST--FFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
CD S+ ++C + G P + F GH SQ K +PYPRK D+ +S
Sbjct: 197 CDFSNFRANVCEMRGDVRVHPNATEIMFMEPGH-----SQRDELWKIKPYPRKGDEFCLS 251
Query: 74 KVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
+ ELT+ +S AP C H PA++FS GY GN FH+F D VPLF T S F +
Sbjct: 252 HITELTVKSSKVAP---ECTKYHDMPAVIFSLTGYTGNLFHDFTDVMVPLFTTA-SEF-N 306
Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
+V I D W KY +L + +++P+I+ + HCF+ V +GL ++ I+ +
Sbjct: 307 GEVQFLITDMALWWTIKYHTVLQKLSKYPVIDFSKDDQVHCFKHVIVGLHAYMEFTIDSS 366
Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVLVNRNARVGRTILNLREVKK 248
P ++VDF F+ AY+ +T + + KP+L+++ R+ R LNL E+
Sbjct: 367 KAPHNYSMVDFNRFMRGAYSLGRDTVTVLGEYPKVKPRLLIIKRHRT--RMFLNLDEIVA 424
Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
AEELGF+V I E S+ ++ R +++ M+GVHGAGLT+ +FL + L+Q+VP G
Sbjct: 425 MAEELGFEVVIDEANVSSDISKFARLVNTVDVMMGVHGAGLTNCVFLPQNATLIQIVPWG 484
Query: 309 T-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
+W+S F PA ++GL Y +Y I +ESSL E+Y + + KNP AF + +R
Sbjct: 485 GLEWVSRTDFGNPAELMGLHYKQYSIGVDESSLTEQYPRDHEIFKNPIAFHKNGFDFIRQ 544
Query: 367 VYLKTQNVKLDIDRFRIYLKDA 388
++ QNVKLD RFR L +A
Sbjct: 545 TFMDKQNVKLDCKRFRPILLEA 566
>gi|115442317|ref|NP_001045438.1| Os01g0956200 [Oryza sativa Japonica Group]
gi|20161877|dbj|BAB90790.1| putative glycosyltransferase [Oryza sativa Japonica Group]
gi|113534969|dbj|BAF07352.1| Os01g0956200 [Oryza sativa Japonica Group]
gi|215741036|dbj|BAG97531.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 491
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 138/406 (33%), Positives = 210/406 (51%), Gaps = 25/406 (6%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQP-----SLRIK 59
P + +I CDRS + D+C + G DP + + P QP ++ +
Sbjct: 89 PCSSLAGDTICCDRSDFNTDVCFMAGDVRTDPSSLSLLLF------PKQPPAANATVEER 142
Query: 60 TQPYPRKSDKSAMSKVKELTITTSAP---PNLSCGVTHTSPALVFSAGGYNGNFFHEFMD 116
+PY RK + MS+V+E+ + + P P C V H +P LV +AGGY GN FH F D
Sbjct: 143 IRPYTRKWEALIMSRVEEVRLRMAPPEEEPGHRCDVRHDAPLLVMTAGGYTGNLFHAFSD 202
Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
FVP ++T+ H + V+L + N WA Y E++ + + ++++ + THCF
Sbjct: 203 GFVPAWLTVQ-HL-RRRVVLGVLSYNPWWAGTYGEIISGLSDYHVVDLLHDKRTHCFPGA 260
Query: 177 TLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT-----KPKLVLVN 231
+G HG + ++P L KT+VDF LA Y +T T +P+L +V+
Sbjct: 261 IVGTRFHGILSVDPARLRDNKTIVDFHDLLAGVYETAGDTVVVDDVTQPAPRRPRLGIVS 320
Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
R R R I N V + A +GFDV I E L S+ + +C +VGVHGA LT
Sbjct: 321 R--RGTRVIENQAAVARLARSVGFDVDILETANGLPLPASYASVSACDVLVGVHGADLTK 378
Query: 292 SLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
LFLRPG+ L+Q+ P+G ++ + + + +GL Y +Y + ESSL KYG D+V+
Sbjct: 379 LLFLRPGAALVQIAPLGVAPIARGCYAEASARMGLHYEQYDAEGHESSLSRKYGLRDVVV 438
Query: 352 KNPQAFA-GANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
+P+A W + RVYL QNV LD+ RFR L + +A ++
Sbjct: 439 SDPEAAKRDKGWGFVARVYLGGQNVTLDLSRFRHTLTRLHARALRV 484
>gi|56409842|emb|CAI30069.1| glycosyltransferase [Triticum aestivum]
Length = 503
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 139/406 (34%), Positives = 211/406 (51%), Gaps = 21/406 (5%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFF--------SVGHTDSTPSQPSL 56
P A I CDRS + D+C + G D + +F ++T ++
Sbjct: 99 PCASMGGHGICCDRSDFNTDVCFMAGDVRTDAASLSFLLFAPQAPNGTASANATVAEKEE 158
Query: 57 RIKTQPYPRKSDKSAMSKVKELTITTSAPPNLS---CGVTHTSPALVFSAGGYNGNFFHE 113
R+ QPY RK +K M+ ++E+ + + P C V H +PAL +AGGY GN FH
Sbjct: 159 RV--QPYTRKWEKHLMANIQEVRLRPARPDEADAQPCQVRHDAPALGMTAGGYTGNLFHA 216
Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
F D F+P ++T+ H + V+LA+ N WA + EL+ + H +I++ + THCF
Sbjct: 217 FNDGFLPAWLTVQ-HL-RRRVVLAVLAYNPWWAGTFRELVSGLSDHHVIDLLHDKRTHCF 274
Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY-NENTNTSSSFHHTKPKLVLVNR 232
+G HG + + P +TLVDF FLA AY +++T +P+L L +R
Sbjct: 275 PGAIVGTRFHGILAVEPARTKDNRTLVDFHDFLAGAYKDDSTPQQQQKPRRRPRLGLYSR 334
Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHS 292
R I N V + AE +GFDV+I E L+ + + +C +VGVHGA LT
Sbjct: 335 KGT--RMIENEAAVARLAESVGFDVSILETANGAPLSSEYAAVSACDVLVGVHGADLTKL 392
Query: 293 LFLRP-GSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
LFLRP + L+Q+ P+G ++ +EK ++ + Y +Y ESSLV KY A+D+VL
Sbjct: 393 LFLRPRRAALLQIAPMGVPHVARGCYEKATAMMEMHYEQYDAAANESSLVRKYPADDVVL 452
Query: 352 KNPQAFAGA-NWS-NMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
++P+A W RVYL QNV LD+ RF L+ + +A +L
Sbjct: 453 RDPEAATRERGWDLTARVYLGGQNVSLDLGRFGDTLRKLHSRALRL 498
>gi|56409848|emb|CAI30072.1| glycosyltransferase [Sorghum bicolor]
Length = 572
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 139/382 (36%), Positives = 209/382 (54%), Gaps = 21/382 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTST--FFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
CD S+ ++C + G P + F GH SQ K +PYPRK D+ +S
Sbjct: 197 CDFSNFRANVCEMRGDVRVHPNATEIMFMEPGH-----SQRDELWKIKPYPRKGDEFCLS 251
Query: 74 KVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
+ ELT+ +S AP C H PA++FS GY GN FH+F D VPLF T S F +
Sbjct: 252 HITELTVKSSKVAP---ECTKYHDMPAVIFSLTGYTGNLFHDFTDVMVPLFTTA-SEF-N 306
Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
+V I D W KY +L + +++P+I+ + HCF+ V +GL ++ I+ +
Sbjct: 307 GEVQFLITDMALWWTIKYHTVLQKLSKYPVIDFSKDDQVHCFKHVIVGLHAYMEFTIDSS 366
Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVLVNRNARVGRTILNLREVKK 248
P ++VDF F+ AY+ +T + + KP+L+++ R+ R LNL E+
Sbjct: 367 KAPHNYSMVDFNRFMRGAYSLGRDTVTVLGEYPKVKPRLLIIKRHRT--RMFLNLDEIVV 424
Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
AEELGF+V I E S+ ++ R +++ M+GVHGAGLT+ +FL + L+Q+VP G
Sbjct: 425 MAEELGFEVVIDEANVSSDISKFARLVNTVDVMMGVHGAGLTNCVFLPQNATLIQIVPWG 484
Query: 309 T-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
+W+S F PA ++GL Y +Y I +ESSL E+Y + + KNP AF + +R
Sbjct: 485 GLEWVSRTDFGNPAELMGLHYKQYSIGVDESSLTEQYPRDHEIFKNPIAFHKNGFDFIRQ 544
Query: 367 VYLKTQNVKLDIDRFRIYLKDA 388
++ QNVKLD RFR L +A
Sbjct: 545 TFMDKQNVKLDCKRFRPILLEA 566
>gi|226507655|ref|NP_001142280.1| uncharacterized protein LOC100274449 [Zea mays]
gi|194707986|gb|ACF88077.1| unknown [Zea mays]
gi|413951273|gb|AFW83922.1| glycosyltransferase [Zea mays]
Length = 529
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 140/419 (33%), Positives = 216/419 (51%), Gaps = 33/419 (7%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFF----SVGHTDSTPSQPSLRIKTQPYPRKSD 68
+I CDRS D+C + G D + + + T S+ + +PYPRK +
Sbjct: 115 TICCDRSSVHTDVCFMAGDVRTDAASLSLLLFPPHQQAQNGTSSEEDKEERVRPYPRKWE 174
Query: 69 KSAMSKVKELTITTSAP--PNLS-------CGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
+ M KV E+ + +AP P+ C V H +P LV SAGGY GN FH F D F+
Sbjct: 175 RFIMDKVPEVRLRVAAPRRPDGEREEEEHRCDVRHDAPLLVMSAGGYTGNLFHAFNDGFL 234
Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLG 179
P ++T+ H + V+L + N WA + E++ + HP++++ + T THCF +G
Sbjct: 235 PSWLTVQ-HL-RRRVVLGVVSYNPWWAGMFGEVISGLSDHPVVDLLHGTRTHCFPGAIVG 292
Query: 180 LISHGRMVINPTLLPKPKTLVDFQSFLANAY------------NENTNTSSSFHHTKPKL 227
HG ++++P L KT+VDF LA+AY ++ + +P+L
Sbjct: 293 TRYHGILIVDPARLRDNKTIVDFHQMLADAYEKPPRETARPAQQQDLRDAEQRRRRRPRL 352
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+V+R R I N V + A +GFDV I E L+ + + +C A+VGVHGA
Sbjct: 353 GIVSRKGT--RVIENQAAVARLASSVGFDVDILETANGLPLSAWYASLRACDALVGVHGA 410
Query: 288 GLTHSLFLRPG-SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGA 346
LT LFLRPG + L Q+ P+G ++ F +PA +GL Y +Y+++ ESSL +Y
Sbjct: 411 DLTKFLFLRPGRASLTQIAPLGVSPIAREDFGEPAARMGLAYEQYEVRAGESSLARRYAP 470
Query: 347 NDLVLKNPQAFA--GANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDKEIKL 402
D+VL +P+A W+ + RVYL QNV LD+ RFR L + A + ++ +L
Sbjct: 471 GDVVLTDPEAAKRDKGGWNLVARVYLGGQNVTLDLARFRQTLARMHAHAMRQRRRQERL 529
>gi|225456057|ref|XP_002277670.1| PREDICTED: uncharacterized glycosyltransferase AGO61-like [Vitis
vinifera]
Length = 462
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 139/389 (35%), Positives = 215/389 (55%), Gaps = 20/389 (5%)
Query: 12 KSITCDRSHRSYDLCLINGSALFDPKTSTFF--SVGHTDSTPSQPSLRIKTQPYPRKSDK 69
+++ C+ RS D C+I+G +ST F S D P S I+ PY RK D
Sbjct: 81 EALVCNIEPRS-DFCVISGDVRVHGNSSTVFIASSAPVDILPENGSWSIR--PYARKGDA 137
Query: 70 SAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
AM +K T+ TT C HT PA++FS GGY+GN FH F D +PL++T S
Sbjct: 138 RAMKHIKNFTVKMTTGRQHLPHCTQNHTVPAILFSLGGYSGNHFHAFSDVLIPLYLT--S 195
Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
+ +V + + W K+ LL +R+PII+I+ + HCF S +GL H +
Sbjct: 196 RQFNGEVQFLVTSKSLWWIAKFRILLQELSRYPIIDIDREEGIHCFSSAIIGLKCHKELD 255
Query: 188 INPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH----TKPKLVLVNRNARVGRTILNL 243
I+P+ P ++ DF+ FL ++Y+ T+ KP+L+++ R + R+ N
Sbjct: 256 IDPS--KSPYSMKDFREFLRSSYSLKRATAIKVRDGTDTKKPRLLIIAR--KKSRSFTND 311
Query: 244 REVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
++ + A LG++V + EP T ++ ++SC ++GVHGAGLT+ +FL +VL+Q
Sbjct: 312 GKIAEMARSLGYEVIVAEPN-GTEISRFAELVNSCDVLMGVHGAGLTNIVFLPENAVLIQ 370
Query: 304 VVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW 362
VVP+G +W++ F PA + + Y+EY+IK+EESSL+EKY VL+ P + W
Sbjct: 371 VVPLGGLEWVARYDFGLPAVDMKIRYIEYQIKEEESSLIEKYPHEHAVLREPHSITKLGW 430
Query: 363 SNMR-VYLKTQNVKLDIDRFRIYLKDAYK 390
++ VYL QNVKLD++RFR L A +
Sbjct: 431 LELKAVYLDKQNVKLDLNRFRNTLLQALQ 459
>gi|56409850|emb|CAI30073.1| glycosyltransferase [Saccharum officinarum]
Length = 573
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 138/382 (36%), Positives = 209/382 (54%), Gaps = 21/382 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTST--FFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
CD S+ ++C + G P ++ F GH SQ K +PYPRK D+ +S
Sbjct: 198 CDFSNFRANVCEMRGDVRVHPNATSIMFMEPGH-----SQRDELWKIKPYPRKGDEFCLS 252
Query: 74 KVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
+ ELT+ +S AP C H PA++FS GY GN FH+F D VPLF T S F +
Sbjct: 253 HITELTVKSSKVAP---ECTKYHDVPAVIFSLTGYTGNLFHDFTDVMVPLFTTA-SEF-N 307
Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
+V I D W KY +L + +++P+I+ + HCF+ V +GL ++ I+ +
Sbjct: 308 GEVQFLITDMALWWTIKYHTVLQKLSKYPVIDFSKDDQVHCFKHVIVGLHAYMEFTIDSS 367
Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVLVNRNARVGRTILNLREVKK 248
P ++VDF F+ AY+ +T + + KP+L+++ R+ R LNL E+
Sbjct: 368 KAPHNYSMVDFNRFMRGAYSLGRDTVTVLGEYPKVKPRLLIIKRHRT--RMFLNLDEIIA 425
Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
AEELGF+V I E S+ ++ +++ M+GVHGAGLT+ +FL + L+Q+VP G
Sbjct: 426 MAEELGFEVVIDEANVSSDISKFASLVNTVDVMMGVHGAGLTNCVFLPQNATLIQIVPWG 485
Query: 309 T-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
+W+S F PA ++GL Y +Y I +ESSL E+Y + + KNP AF + +R
Sbjct: 486 GLEWVSRTDFGNPAELMGLHYKQYSIGVDESSLTEQYPRDHEIFKNPIAFHKKGFDFIRQ 545
Query: 367 VYLKTQNVKLDIDRFRIYLKDA 388
++ QNVKLD RFR L +A
Sbjct: 546 TFMDKQNVKLDCKRFRPILLEA 567
>gi|125529168|gb|EAY77282.1| hypothetical protein OsI_05257 [Oryza sativa Indica Group]
Length = 491
Score = 231 bits (589), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 137/406 (33%), Positives = 210/406 (51%), Gaps = 25/406 (6%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQP-----SLRIK 59
P + +I CDRS + D+C + G DP + + P QP ++ +
Sbjct: 89 PCSSLAGDTICCDRSDFNTDVCFMAGDVRTDPSSLSLLLF------PKQPPAANATVEER 142
Query: 60 TQPYPRKSDKSAMSKVKELTITTSAP---PNLSCGVTHTSPALVFSAGGYNGNFFHEFMD 116
+PY RK + MS+V+E+ + + P P C V H +P LV +AGGY GN FH F D
Sbjct: 143 IRPYTRKWEALIMSRVEEVRLRMAPPEEEPGHRCDVRHDAPLLVMTAGGYTGNLFHAFSD 202
Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
FVP ++T+ H + V+L + N WA Y E++ + + ++++ + THCF
Sbjct: 203 GFVPAWLTVQ-HL-RRRVVLGVLSYNPWWAGTYGEIISGLSDYHVVDLLHDKRTHCFPGA 260
Query: 177 TLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT-----KPKLVLVN 231
+G HG + ++P L KT+VDF LA Y +T T +P+L +V+
Sbjct: 261 IVGTRFHGILSVDPARLRDNKTIVDFHDLLAGVYETAGDTVVVDDVTQPAPRRPRLGIVS 320
Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
R R R I N V + A +GF+V I E L S+ + +C +VGVHGA LT
Sbjct: 321 R--RGTRVIENQAAVARLARSVGFNVDILETANGLPLPASYASVSACDVLVGVHGADLTK 378
Query: 292 SLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
LFLRPG+ L+Q+ P+G ++ + + + +GL Y +Y + ESSL KYG D+V+
Sbjct: 379 LLFLRPGAALVQIAPLGVAPIARGCYAEASARMGLHYEQYDAEGHESSLSRKYGLRDVVV 438
Query: 352 KNPQAFA-GANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
+P+A W + RVYL QNV LD+ RFR L + +A ++
Sbjct: 439 SDPEAAKRDKGWGFVARVYLGGQNVTLDLSRFRHTLTRLHARALRV 484
>gi|53791308|dbj|BAD52573.1| putative HGA6 [Oryza sativa Japonica Group]
gi|53791442|dbj|BAD52494.1| putative HGA6 [Oryza sativa Japonica Group]
Length = 603
Score = 230 bits (587), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 136/391 (34%), Positives = 209/391 (53%), Gaps = 26/391 (6%)
Query: 10 PAKSITCDRSHRSYD-------LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQP 62
P I+CD + +C ++G P TS SV T + + +P
Sbjct: 209 PKSKISCDDKSKDEGFPYARPIVCHLSGDVRVSPATS---SVTLTMPLQQGEAAARRIRP 265
Query: 63 YPRKSDKSAMSKVKELTITTSAPPN--LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
Y R+ D + V+E+ IT++A SC V+H PA++FS GGY GNFFH+ D VP
Sbjct: 266 YARRDD-FLLPLVREVAITSAASEGDAPSCNVSHGVPAVIFSIGGYTGNFFHDMADVLVP 324
Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
L++T HF + V L +A+ W +KY +L R + +++ ++ HCF V +GL
Sbjct: 325 LYLT-TFHFKGK-VQLFVANYKQWWIQKYKPVLRRLSHRAVVDFDSDGDVHCFDHVIVGL 382
Query: 181 ISHGRMVI--NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNAR 235
+ +++ +PT PK T+VDF FL +AY + T KP++++++R R
Sbjct: 383 VRDRDLILGQHPTRNPKGYTMVDFTRFLRHAYGLRRDKPMVLGETSGKKPRMLIISR--R 440
Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHS 292
R +LNLR+V A ELGF+V + E RF ++SC +VGVHGAGLT+
Sbjct: 441 RTRKLLNLRQVAAMARELGFEVVVSEAGVGGGSGGVKRFASAVNSCDVLVGVHGAGLTNQ 500
Query: 293 LFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
FL G V++Q+VP G +W++T ++ PA + L Y+EY + EESSL +Y V
Sbjct: 501 AFLPRGGVVVQIVPWGRMEWMATNFYGAPAAAMELRYVEYHVAAEESSLARRYPREHAVF 560
Query: 352 KNPQAFAGANWSNMRVYLKTQNVKLDIDRFR 382
++P A G W + + TQ+VKL++ RFR
Sbjct: 561 RDPMAIHGQGWKALADIVMTQDVKLNLRRFR 591
>gi|357132085|ref|XP_003567663.1| PREDICTED: uncharacterized protein LOC100828889 [Brachypodium
distachyon]
Length = 657
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 134/382 (35%), Positives = 205/382 (53%), Gaps = 20/382 (5%)
Query: 16 CD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
CD S+R D C ++G T V D+ + R+K PYPRK+D SAM
Sbjct: 278 CDFASNRRIDWCELDGDVRVHGANGT---VTLVDAAMAAEEWRVK--PYPRKADASAMRF 332
Query: 75 VKELTITTSAPPNLS-----CGVTHTS-PALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
V+E+T+ S PPN + C H PALVFS GY GN+FH + D +PLF+T +
Sbjct: 333 VREITVR-STPPNSANAAPACTERHEGVPALVFSDRGYTGNYFHAYTDVILPLFLTARQY 391
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
+V ++D W K+ + + +P+I++ + HCF+ V +GL H I
Sbjct: 392 --SGEVQFMVSDFQMWWIGKFMPVFKSLSNYPLIDLAADSRVHCFKHVQVGLTCHADFSI 449
Query: 189 NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKK 248
+P+ P ++VDF F+ Y + ++ + +P+L+++ R AR R NL E+ +
Sbjct: 450 DPSRSPNGYSMVDFTKFMRQTYKLPRDLAAPINGARPRLLIIAR-ART-RRFDNLAEIVR 507
Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
AE++GF+ + E + +A ++C M+GVHGAGLT+ +FL G ++QVVP+G
Sbjct: 508 GAEKVGFEAVVSEGDHE--VAPFAELSNTCDVMLGVHGAGLTNMIFLPTGGAVIQVVPLG 565
Query: 309 TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-V 367
YF PA +GL YLEY+I ESSL E+Y + V +P+ W +++
Sbjct: 566 GLEFVAGYFRGPAADMGLRYLEYRIAPAESSLSEQYPPDHPVFTDPEGVKSKGWDSLKEA 625
Query: 368 YLKTQNVKLDIDRFRIYLKDAY 389
YL Q+VKLD+ RFR LK A+
Sbjct: 626 YLDKQDVKLDMRRFRPLLKKAF 647
>gi|297595986|ref|NP_001041861.2| Os01g0118600 [Oryza sativa Japonica Group]
gi|255672811|dbj|BAF03775.2| Os01g0118600 [Oryza sativa Japonica Group]
Length = 560
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 136/391 (34%), Positives = 209/391 (53%), Gaps = 26/391 (6%)
Query: 10 PAKSITCDRSHRSYD-------LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQP 62
P I+CD + +C ++G P TS SV T + + +P
Sbjct: 166 PKSKISCDDKSKDEGFPYARPIVCHLSGDVRVSPATS---SVTLTMPLQQGEAAARRIRP 222
Query: 63 YPRKSDKSAMSKVKELTITTSAPPN--LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
Y R+ D + V+E+ IT++A SC V+H PA++FS GGY GNFFH+ D VP
Sbjct: 223 YARRDD-FLLPLVREVAITSAASEGDAPSCNVSHGVPAVIFSIGGYTGNFFHDMADVLVP 281
Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
L++T HF + V L +A+ W +KY +L R + +++ ++ HCF V +GL
Sbjct: 282 LYLT-TFHFKGK-VQLFVANYKQWWIQKYKPVLRRLSHRAVVDFDSDGDVHCFDHVIVGL 339
Query: 181 ISHGRMVI--NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNAR 235
+ +++ +PT PK T+VDF FL +AY + T KP++++++R R
Sbjct: 340 VRDRDLILGQHPTRNPKGYTMVDFTRFLRHAYGLRRDKPMVLGETSGKKPRMLIISR--R 397
Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHS 292
R +LNLR+V A ELGF+V + E RF ++SC +VGVHGAGLT+
Sbjct: 398 RTRKLLNLRQVAAMARELGFEVVVSEAGVGGGSGGVKRFASAVNSCDVLVGVHGAGLTNQ 457
Query: 293 LFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
FL G V++Q+VP G +W++T ++ PA + L Y+EY + EESSL +Y V
Sbjct: 458 AFLPRGGVVVQIVPWGRMEWMATNFYGAPAAAMELRYVEYHVAAEESSLARRYPREHAVF 517
Query: 352 KNPQAFAGANWSNMRVYLKTQNVKLDIDRFR 382
++P A G W + + TQ+VKL++ RFR
Sbjct: 518 RDPMAIHGQGWKALADIVMTQDVKLNLRRFR 548
>gi|449459640|ref|XP_004147554.1| PREDICTED: glycosyltransferase-like domain-containing protein
2-like [Cucumis sativus]
Length = 407
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 132/378 (34%), Positives = 205/378 (54%), Gaps = 12/378 (3%)
Query: 25 LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITTSA 84
+CL N + F+ + DS + S I PY R+ DK + V L I
Sbjct: 35 VCLTNNPTRINNTNLEFYISTNNDSQQNNFS-PILIHPYARQEDKITLRDVTPLQIIFQP 93
Query: 85 PPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCN 142
L C H P L+FS GG+ GN FHEF + +PLFIT + HF + V I D
Sbjct: 94 NKTLLPLCQFIHNVPVLIFSTGGFTGNLFHEFDETIIPLFIT-SYHFQTR-VRFLITDHK 151
Query: 143 DQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDF 202
W +KY +L +R ++N HCF +GL H + +N T +P ++ DF
Sbjct: 152 TWWVQKYNRILSGLSRFNVVNPAEDGSVHCFNGGVIGLKFHNILSLNNTDIPGGYSMSDF 211
Query: 203 QSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEP 262
+SFL YN N S KP ++L++R + R +N E+ + +E+GF+V P
Sbjct: 212 RSFLRQTYNLKVNNVSELSGKKPMVMLISR--QTSRRFMNEGEMVEMMKEVGFEVMTTTP 269
Query: 263 EESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPAR 322
+ ++L ++ C ++G HGAGLT+ +FL G+V++QVVP G W ST +F KPA
Sbjct: 270 QRMSNLDKFSSVVNLCSVIIGAHGAGLTNEVFLANGAVVVQVVPFGLDWPSTYFFGKPAA 329
Query: 323 VLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQA-FAGANWSNMRVYLKTQNVKLDIDRF 381
+ L+YLEYKI+ +ESSL +KYG N V+++P++ FA +++ +Y+ QN+K+++ RF
Sbjct: 330 EMELQYLEYKIEAKESSLWDKYGENHPVIRDPESIFAQGYFASRAIYIDEQNLKINLTRF 389
Query: 382 RIYLKDAYKKAKKLMDKE 399
R D + KKL++++
Sbjct: 390 R----DTMIQVKKLIEEK 403
>gi|449503225|ref|XP_004161896.1| PREDICTED: glycosyltransferase-like domain-containing protein
2-like [Cucumis sativus]
Length = 407
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 132/378 (34%), Positives = 205/378 (54%), Gaps = 12/378 (3%)
Query: 25 LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITTSA 84
+CL N + F+ + DS + S I PY R+ DK + V L I
Sbjct: 35 VCLTNNPTRINNTNLEFYISTNNDSQQNNFS-PILIHPYARQEDKITLRDVTPLQIIFQP 93
Query: 85 PPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCN 142
L C H P L+FS GG+ GN FHEF + +PLFIT + HF + V I D
Sbjct: 94 NKTLLPLCQFIHNVPVLIFSTGGFTGNLFHEFDETIIPLFIT-SYHFQTR-VRFLITDHK 151
Query: 143 DQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDF 202
W +KY +L +R ++N HCF +GL H + +N T +P ++ DF
Sbjct: 152 TWWVQKYNRILSGLSRFNVVNPAEDGSVHCFNGGVIGLKFHNILSLNNTDIPGGYSMSDF 211
Query: 203 QSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEP 262
+SFL YN N S KP ++L++R + R +N E+ + +E+GF+V P
Sbjct: 212 RSFLRQTYNLKVNNVSELSGKKPMVMLISR--QTSRRFMNEGEMVEMMKEVGFEVMTTTP 269
Query: 263 EESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPAR 322
+ ++L ++ C ++G HGAGLT+ +FL G+V++QVVP G W ST +F KPA
Sbjct: 270 QRMSNLDKFSSVVNLCSVIIGAHGAGLTNEVFLANGAVVVQVVPFGLDWPSTYFFGKPAA 329
Query: 323 VLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQA-FAGANWSNMRVYLKTQNVKLDIDRF 381
+ L+YLEYKI+ +ESSL +KYG N V+++P++ FA +++ +Y+ QN+K+++ RF
Sbjct: 330 EMELQYLEYKIEAKESSLWDKYGENHPVIRDPESIFAQGYFASRAIYIDEQNLKINLTRF 389
Query: 382 RIYLKDAYKKAKKLMDKE 399
R D + KKL++++
Sbjct: 390 R----DTMIQVKKLIEEK 403
>gi|223974685|gb|ACN31530.1| unknown [Zea mays]
Length = 572
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 136/382 (35%), Positives = 208/382 (54%), Gaps = 21/382 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTST--FFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
CD S+ ++C + G P ++ F GH SQ K +PYPRK D+ +S
Sbjct: 197 CDFSNFRANVCEMRGDVRVHPNAASIMFMEPGH-----SQRDELWKIKPYPRKGDEFCLS 251
Query: 74 KVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
+ ELT+ +S AP C H PA++FS GY GN FH+F D VPLF T S F +
Sbjct: 252 HITELTVKSSKVAP---ECSKYHDVPAVIFSLTGYTGNLFHDFTDVMVPLFTTA-SEF-N 306
Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
+V I D W KY +L + +++P+I+ + HCF+ V +GL ++ I+ +
Sbjct: 307 GEVQFLITDMALWWTIKYHTVLEKLSKYPVIDFSKDGEVHCFKHVIVGLHAYMEFTIDSS 366
Query: 192 LLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKK 248
P ++VDF F+ AY+ ++ + KP+L+++ R+ R LNL E+
Sbjct: 367 KAPHNYSMVDFNRFMRGAYSLGRDSVTVLGEYPKVKPRLLIIKRHRT--RMFLNLDEIIA 424
Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
AEELGF+V I E S+ ++ +++ M+GVHGAGLT+ +FL + L+Q+VP G
Sbjct: 425 MAEELGFEVVIDEANVSSDISKFAALVNTVDVMMGVHGAGLTNCVFLPQNATLIQIVPWG 484
Query: 309 T-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
+W+S F PA ++GL Y +Y I ESSL E+Y ++ + KNP +F + +R
Sbjct: 485 GLEWVSRTDFGNPAELMGLHYKQYSIGVHESSLTEQYPSDHEIFKNPISFHKKGFDFIRQ 544
Query: 367 VYLKTQNVKLDIDRFRIYLKDA 388
++ QNVKLD RFR L +A
Sbjct: 545 TFMDKQNVKLDCKRFRPILLEA 566
>gi|255558677|ref|XP_002520363.1| glycosyltransferase, putative [Ricinus communis]
gi|223540461|gb|EEF42029.1| glycosyltransferase, putative [Ricinus communis]
Length = 515
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 128/384 (33%), Positives = 207/384 (53%), Gaps = 18/384 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
C+ RS D C + G D +ST F V + + + +PY RK D +AM
Sbjct: 133 CNLMERS-DFCELKGDIRIDANSSTIFIVSSGNDNLAATNTSWSIRPYARKGDAAAMRHT 191
Query: 76 KELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
+E ++ + C H + ++FS GGY+GN FH F D VPLF T + D
Sbjct: 192 REWSVKQVSNHRKIPECTQNHNALGIIFSLGGYSGNHFHAFTDIIVPLFSTARPF--NGD 249
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
V + D W K+ LL +R+ +I+I+ + HCF S+T+GL +N
Sbjct: 250 VQFLVTDRQPWWIAKFRILLKALSRYEVIDIDKREEIHCFTSITIGLKRQSNKELNIDQS 309
Query: 194 PKPKTLVDFQSFLANAYNENTNTSSSF-------HHTKPKLVLVNRNARVGRTILNLREV 246
++ DF+ FL ++Y+ T+ F + +P+L++++R + R N+ E+
Sbjct: 310 KFRYSMKDFRQFLRSSYSLRKTTAIKFMKGTGREKNRRPRLLIISR--KRSRAFTNVGEI 367
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
K A+ LG+ V + EP+ S S + ++SC ++GVHGAGLT+ +FL ++L+QVVP
Sbjct: 368 AKMAKGLGYKVVVDEPDADVS--RSAQVMNSCDVVLGVHGAGLTNMVFLPDNAILIQVVP 425
Query: 307 IG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
G +W+S ++FE+P++ + + YLEYKI EESSLV +Y ++ +VL++P W
Sbjct: 426 FGGAEWVSKIFFEEPSKDMNIRYLEYKISIEESSLVHQYPSDHVVLRDPSVIQNQGWEAF 485
Query: 366 R-VYLKTQNVKLDIDRFRIYLKDA 388
+ +Y QNVK+D++RFR L A
Sbjct: 486 KSIYFDKQNVKIDLNRFRPTLSKA 509
>gi|414876592|tpg|DAA53723.1| TPA: hypothetical protein ZEAMMB73_458729 [Zea mays]
Length = 412
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 138/396 (34%), Positives = 206/396 (52%), Gaps = 26/396 (6%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S D+C + G + S F V + P+ + +PY RK D ++M +V
Sbjct: 18 CDLSGSRADVCDLTGDIRLNATASEFIVVDPA-AADDGPTTAYEIRPYARKGDATSMGRV 76
Query: 76 KELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
+T+ A + + C VTH PA+ FS GGY GN FH+F D VPL+ T + D
Sbjct: 77 TAVTVRVRATADAAPRCTVTHAEPAVAFSVGGYTGNLFHDFTDVIVPLYGTAQRY--RGD 134
Query: 134 VILAIADCNDQ---WARKYAELLPRFTRHPIININ---NQTITHCFQSVTLGLISHGRMV 187
V L + D W KY +L +RHP ++++ HCF V LGL + ++
Sbjct: 135 VRLVVTDAGSSSSWWLAKYDAVLRGLSRHPPLHLHLAKAAGEVHCFGHVVLGLRARRELM 194
Query: 188 INPTLLP---KPKT--LVDFQSFLANAYN------ENTNTSSSFHHTKPKLVLVNRNARV 236
+ P P +P + DF FL A + +SS KP+L++V+R R
Sbjct: 195 VEPDPDPDRGRPDGVGMADFARFLRRALSLPRDAPTRPAAASSDDRRKPRLLVVSR--RG 252
Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R +LN V + AEE+GF+ + E E S A R I+S A+VGVHGAGLT+ +FL
Sbjct: 253 TRRLLNADAVARVAEEVGFEAVVSELEVSRDAAGVGRLINSFDALVGVHGAGLTNMVFLP 312
Query: 297 PGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
PG+ ++Q+VP G QW++ + F PA +GL Y++Y+I ESSL +KY + + NP
Sbjct: 313 PGATVVQIVPWGGLQWIARMDFGDPADAMGLRYIQYEIAVHESSLADKYPRDHEIFTNPT 372
Query: 356 AFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYK 390
A + +R +L Q++ LD+DRFR+ L A +
Sbjct: 373 ALHKKGFKFLRHTFLIGQDITLDVDRFRVVLLQALR 408
>gi|162460428|ref|NP_001105751.1| glycosyltransferase [Zea mays]
gi|56409852|emb|CAI30074.1| glycosyltransferase [Zea mays]
Length = 572
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 136/382 (35%), Positives = 209/382 (54%), Gaps = 21/382 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTST--FFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
CD S+ ++C + G P ++ F GH SQ K +PYPRK D+ +S
Sbjct: 197 CDFSNFRANVCEMRGDVRVHPNAASIMFMEPGH-----SQRDELWKIKPYPRKGDEFCLS 251
Query: 74 KVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
+ ELT+ +S AP C H PA++FS GY GN FH+F D VPLF T S F +
Sbjct: 252 HITELTVKSSKVAP---ECSKYHDVPAVIFSLTGYTGNLFHDFTDVMVPLFTTA-SEF-N 306
Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
+V I D W KY +L + +++P+I+ + HCF+ V +GL ++ I+ +
Sbjct: 307 GEVQFLITDMALWWTIKYHTVLEKLSKYPVIDFSKDGEVHCFKHVIVGLHAYMEFTIDSS 366
Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVLVNRNARVGRTILNLREVKK 248
P ++VDF F+ AY+ ++ + + KP+L+++ R+ R LNL E+
Sbjct: 367 KAPHNYSMVDFNRFMRGAYSLGRDSVTLLGEYPKVKPRLLIIKRHRT--RMFLNLDEIIA 424
Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
AEELGF+V I E S+ ++ +++ M+GVHGAGLT+ +FL + L+Q+VP G
Sbjct: 425 MAEELGFEVVIDEANVSSDISKFAALVNTVDVMMGVHGAGLTNCVFLPQNATLIQIVPWG 484
Query: 309 T-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
+W+S F PA ++GL Y +Y I ESSL E+Y ++ + KNP +F + +R
Sbjct: 485 GLEWVSRTDFGNPAELMGLHYKQYSIGVHESSLTEQYPSDHEIFKNPISFHKKGFDFIRQ 544
Query: 367 VYLKTQNVKLDIDRFRIYLKDA 388
++ QNVKLD RFR L +A
Sbjct: 545 TFMDKQNVKLDCKRFRPILLEA 566
>gi|223945917|gb|ACN27042.1| unknown [Zea mays]
gi|414876578|tpg|DAA53709.1| TPA: hypothetical protein ZEAMMB73_101479 [Zea mays]
Length = 681
Score = 228 bits (581), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 135/393 (34%), Positives = 206/393 (52%), Gaps = 18/393 (4%)
Query: 16 CD-RSHRSYDLCLINGSALFDPKTSTFFSV----GHTDSTPSQPSLRIKTQPYPRKSDKS 70
CD S+R D C ++G ++ V G T S RIK PYPRK+D +
Sbjct: 288 CDITSNRRIDWCELDGDVRVLGANASVTLVAPPGGADGRTFRAESWRIK--PYPRKADPN 345
Query: 71 AMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
AM V+ LT+ + +C H PALVFS GY GN+FH F D +PLF+T +
Sbjct: 346 AMRVVRVLTVRSVPGEAPACTDRHGVPALVFSDRGYTGNYFHAFTDVILPLFLTARQYA- 404
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
+V L + D W K+A + + + +++++ HCF+ V +GL SH I+P
Sbjct: 405 -GEVRLLVTDLQAWWVGKFAPVFRSISNYELVDLDRDPRVHCFRHVQVGLTSHDDFSIDP 463
Query: 191 TLLPKPKTLVDFQSFLANAYN----ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
P +++DF F+ AY + + +P+L+L+ R AR R +N E+
Sbjct: 464 RRAPNGYSMLDFTGFMRAAYGLPRGDVAAAAGPSSKRRPRLLLIAR-ART-RRFVNAEEI 521
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+ AE+LGF+V + E + +A +SC A++GVHGAGLT+ +F+ G V++QVVP
Sbjct: 522 VRGAEKLGFEVVVSE--GTHEVAPFAELANSCDAIMGVHGAGLTNMVFVPTGGVVIQVVP 579
Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
+G YF P+R +GL YLEY+I EES+L+ +Y + + +P W +++
Sbjct: 580 LGGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLINQYPRDHPIFTDPNGIKSKGWESLK 639
Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
YL Q+V LD+ RFR LK A +K K
Sbjct: 640 DAYLDKQDVSLDMKRFRPTLKKAIAHIRKARAK 672
>gi|212275650|ref|NP_001130385.1| uncharacterized protein LOC100191481 [Zea mays]
gi|194688994|gb|ACF78581.1| unknown [Zea mays]
gi|414876577|tpg|DAA53708.1| TPA: glycosyltransferase [Zea mays]
Length = 682
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 135/393 (34%), Positives = 206/393 (52%), Gaps = 18/393 (4%)
Query: 16 CD-RSHRSYDLCLINGSALFDPKTSTFFSV----GHTDSTPSQPSLRIKTQPYPRKSDKS 70
CD S+R D C ++G ++ V G T S RIK PYPRK+D +
Sbjct: 289 CDITSNRRIDWCELDGDVRVLGANASVTLVAPPGGADGRTFRAESWRIK--PYPRKADPN 346
Query: 71 AMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
AM V+ LT+ + +C H PALVFS GY GN+FH F D +PLF+T +
Sbjct: 347 AMRVVRVLTVRSVPGEAPACTDRHGVPALVFSDRGYTGNYFHAFTDVILPLFLTARQYA- 405
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
+V L + D W K+A + + + +++++ HCF+ V +GL SH I+P
Sbjct: 406 -GEVRLLVTDLQAWWVGKFAPVFRSISNYELVDLDRDPRVHCFRHVQVGLTSHDDFSIDP 464
Query: 191 TLLPKPKTLVDFQSFLANAYN----ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
P +++DF F+ AY + + +P+L+L+ R AR R +N E+
Sbjct: 465 RRAPNGYSMLDFTGFMRAAYGLPRGDVAAAAGPSSKRRPRLLLIAR-ART-RRFVNAEEI 522
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+ AE+LGF+V + E + +A +SC A++GVHGAGLT+ +F+ G V++QVVP
Sbjct: 523 VRGAEKLGFEVVVSE--GTHEVAPFAELANSCDAIMGVHGAGLTNMVFVPTGGVVIQVVP 580
Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
+G YF P+R +GL YLEY+I EES+L+ +Y + + +P W +++
Sbjct: 581 LGGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLINQYPRDHPIFTDPNGIKSKGWESLK 640
Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
YL Q+V LD+ RFR LK A +K K
Sbjct: 641 DAYLDKQDVSLDMKRFRPTLKKAIAHIRKARAK 673
>gi|413935496|gb|AFW70047.1| hypothetical protein ZEAMMB73_657787 [Zea mays]
Length = 436
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 130/384 (33%), Positives = 205/384 (53%), Gaps = 19/384 (4%)
Query: 15 TCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
TC + + + C +G D ST + G KT+PY R D AM+
Sbjct: 60 TCYMTSKRSERCDASGDIRVDGNRSTIYVSGI--------DREWKTKPYARYHDPVAMAH 111
Query: 75 VKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
V+E T+ A P +C H+ P +FS GG++GN +H++ D VPLFI+ +H
Sbjct: 112 VREYTLKPLPEAAPAPACTRNHSVPGFLFSNGGFSGNLYHDYTDVLVPLFIS--THQFRG 169
Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
V ++ W K+ + TR+ +I+++N HCF + +G H M ++P
Sbjct: 170 RVQFLLSGMKPWWVAKFTPFFRQLTRYDVIDVDNDQEVHCFPRIVVGATFHKDMGVDPRR 229
Query: 193 LPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA 249
P ++VDF+ L A+ E + + KP+L++++R R R LN RE+ +A
Sbjct: 230 SPGHVSVVDFKRALRRAFGLPREAASRGGATGRGKPRLLIISR--RGSRRFLNEREMARA 287
Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
A GF+V + EP++ T A ++S MVGVHGAGLT+ +FL G+VL+QVVP G
Sbjct: 288 AAGAGFEVRVAEPDQHTDTAAFAALVNSADVMVGVHGAGLTNMVFLPRGAVLVQVVPFGG 347
Query: 310 -QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-V 367
+WL+ V F+ PA + + Y+ Y + EESSL+++Y N VL +P A W ++
Sbjct: 348 LEWLTGVTFKDPAADMEVSYMGYDVTLEESSLIDQYPRNHQVLTDPYAVHKQGWDALKAA 407
Query: 368 YLKTQNVKLDIDRFRIYLKDAYKK 391
YL QN+++D+DRFR L++A +
Sbjct: 408 YLDKQNIRMDLDRFRATLREAMSR 431
>gi|226492922|ref|NP_001147068.1| glycosyltransferase [Zea mays]
gi|224028515|gb|ACN33333.1| unknown [Zea mays]
gi|413935495|gb|AFW70046.1| glycosyltransferase [Zea mays]
Length = 488
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 130/384 (33%), Positives = 205/384 (53%), Gaps = 19/384 (4%)
Query: 15 TCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
TC + + + C +G D ST + G KT+PY R D AM+
Sbjct: 112 TCYMTSKRSERCDASGDIRVDGNRSTIYVSGI--------DREWKTKPYARYHDPVAMAH 163
Query: 75 VKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
V+E T+ A P +C H+ P +FS GG++GN +H++ D VPLFI+ +H
Sbjct: 164 VREYTLKPLPEAAPAPACTRNHSVPGFLFSNGGFSGNLYHDYTDVLVPLFIS--THQFRG 221
Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
V ++ W K+ + TR+ +I+++N HCF + +G H M ++P
Sbjct: 222 RVQFLLSGMKPWWVAKFTPFFRQLTRYDVIDVDNDQEVHCFPRIVVGATFHKDMGVDPRR 281
Query: 193 LPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA 249
P ++VDF+ L A+ E + + KP+L++++R R R LN RE+ +A
Sbjct: 282 SPGHVSVVDFKRALRRAFGLPREAASRGGATGRGKPRLLIISR--RGSRRFLNEREMARA 339
Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
A GF+V + EP++ T A ++S MVGVHGAGLT+ +FL G+VL+QVVP G
Sbjct: 340 AAGAGFEVRVAEPDQHTDTAAFAALVNSADVMVGVHGAGLTNMVFLPRGAVLVQVVPFGG 399
Query: 310 -QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-V 367
+WL+ V F+ PA + + Y+ Y + EESSL+++Y N VL +P A W ++
Sbjct: 400 LEWLTGVTFKDPAADMEVSYMGYDVTLEESSLIDQYPRNHQVLTDPYAVHKQGWDALKAA 459
Query: 368 YLKTQNVKLDIDRFRIYLKDAYKK 391
YL QN+++D+DRFR L++A +
Sbjct: 460 YLDKQNIRMDLDRFRATLREAMSR 483
>gi|242095990|ref|XP_002438485.1| hypothetical protein SORBIDRAFT_10g020470 [Sorghum bicolor]
gi|241916708|gb|EER89852.1| hypothetical protein SORBIDRAFT_10g020470 [Sorghum bicolor]
Length = 498
Score = 227 bits (579), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 137/399 (34%), Positives = 211/399 (52%), Gaps = 27/399 (6%)
Query: 9 SPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSD 68
+P +TC+ D C + G K++T + V + P S+ I+ PY RK +
Sbjct: 105 APISKLTCNFDTPHSDTCTMEGDLRIHGKSATVYVVAASTHRPENSSITIR--PYTRKWE 162
Query: 69 KSAMSKVKELTITTSAPPNLS------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF 122
+ MS+++E+T+ S PP S C V+H PA+VFS GG NFFH D VPL+
Sbjct: 163 QETMSRIREITMR-SMPPAFSFIIPPKCTVSHGVPAVVFSTGGCGTNFFHAMSDLIVPLY 221
Query: 123 ITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS 182
IT + + V L IAD +W K+ +L + +P+I+ + T CF S +GL S
Sbjct: 222 ITSREY--NGRVQLLIADYEPKWVAKFRPILAALSMYPVIDFDADTAVRCFPSAHVGLES 279
Query: 183 HGRMV-INPTLLPKPKTLVDFQSFLANAYNEN-------TNTSSSFHHTKPKLVLVNRNA 234
H +M+ I+P+L T++ F+ FL + ++ + SSS KP+LV+V R
Sbjct: 280 HNKMLAIDPSLSRNGYTMMGFRDFLRSVFSLQRPWSKPVSRRSSSGRQQKPRLVMVLR-- 337
Query: 235 RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
R R + N + A E+LGF+V PE+ + ++SC MVGVHGAGLT+ +F
Sbjct: 338 RHSRELTNEADAISAMEDLGFEVVAALPEDVRDMGHFAGVVNSCDVMVGVHGAGLTNMVF 397
Query: 295 LRPGSVLMQVVPI-GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKN 353
L ++Q+VP G +W F +P +GL Y+EY+ EE++L EKY + V +
Sbjct: 398 LPHNGTVVQIVPWGGMKWACWYAFGEPVPAMGLRYVEYEATAEETTLKEKYPRDHPVFAD 457
Query: 354 PQAF--AGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
PQ+ G W+ +L QNV LDIDRF+ ++ ++
Sbjct: 458 PQSIHRQGKTWAT---FLNGQNVTLDIDRFKGVMQQVFQ 493
>gi|357133695|ref|XP_003568459.1| PREDICTED: uncharacterized protein LOC100837177 [Brachypodium
distachyon]
Length = 574
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 133/378 (35%), Positives = 205/378 (54%), Gaps = 13/378 (3%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S+ ++C + G P+ + S+ + SQ K +PYPRK D+ +S +
Sbjct: 199 CDFSNFRANVCEMRGDVRVHPRAT---SILFMEPDGSQRDEVWKIKPYPRKGDEFCLSHI 255
Query: 76 KELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVI 135
ELT+ +S C H PA++FS GY GN FH+F D VPLF T S F + +V
Sbjct: 256 TELTVKSSKVA-AECTKYHDVPAVIFSLTGYTGNLFHDFTDVLVPLFTTA-SEF-NGEVQ 312
Query: 136 LAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPK 195
I D W KY +L + +++P+I+ + HCF+ +G ++ I+ + P
Sbjct: 313 FLITDMALWWTIKYQTVLQKLSKYPLIDFSKDDQVHCFKHAIVGTHAYMEFTIDSSKAPH 372
Query: 196 PKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAEE 252
T+VDF F+ AY+ +T+ T KP+L+++ R+ R LNL E+ AEE
Sbjct: 373 GLTMVDFNRFMRGAYSLGKDTAVVLGETPKVKPRLLIIKRHRT--RMFLNLEEIITMAEE 430
Query: 253 LGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QW 311
LGF+V I E S+ ++ + ++S M+GVHGAGLT+ +FL + L+Q+VP G W
Sbjct: 431 LGFEVVIDEANVSSDISGFAKLVNSVDVMMGVHGAGLTNCVFLPHNATLIQIVPWGGLDW 490
Query: 312 LSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLK 370
+S F PA ++GL Y +Y I +ESSL ++Y + + KNP +F + + R ++
Sbjct: 491 ISRTDFGNPAEMMGLRYKQYAISVDESSLTDQYPKDHEIFKNPISFHKRGFEFIRRTFMD 550
Query: 371 TQNVKLDIDRFRIYLKDA 388
QNVKLD RFR L +A
Sbjct: 551 KQNVKLDCKRFRPILLEA 568
>gi|326497163|dbj|BAK02166.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 548
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 131/392 (33%), Positives = 205/392 (52%), Gaps = 10/392 (2%)
Query: 7 ALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRK 66
A +P CD S D+C + G K++T + V +D + + +K +PYPRK
Sbjct: 153 ATAPRSKSICDFSSERTDICAMQGDVRMHGKSATVYIVLASDDSYRPENGTVKIRPYPRK 212
Query: 67 SDKSAMSKVKELTITTSAPPNL-SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
S++ M ++E+TI S ++ C VTH PA+VFS G Y NFFH D +PLF T+
Sbjct: 213 SEEGTMHSIREVTIRWSGLEDVPRCTVTHDVPAVVFSTGAYLDNFFHAMTDGIIPLFNTV 272
Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR 185
+ + V L + D N +W K+ +L + +P+I+ + CF SV +G H
Sbjct: 273 REY--EGRVQLVVTDYNRKWVDKFQGILGALSIYPVIDFDADDKVRCFPSVQVGTEGHKE 330
Query: 186 MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILN 242
M I P L K T+ DF++FL + Y+ S + T +P+L++V R R R N
Sbjct: 331 MGIIPALSRKGYTMTDFRAFLRSVYSLKREWSVPVNRTSSDRPRLLMVLR--RNSRAFAN 388
Query: 243 LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
E AA E+GF+V + PE + +A ++SC MVGVHGAGLT+ +FL + L+
Sbjct: 389 EAEAVSAATEVGFEVVLGAPEALSDMARFAEVVNSCDVMVGVHGAGLTNLVFLPRNATLV 448
Query: 303 QVVPI-GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGAN 361
Q+VP G W S F P+ +GL Y++Y+ EE++L KY + + +
Sbjct: 449 QIVPWGGMSWGSNAAFGAPSADMGLRYVQYETTAEETTLKYKYPKEHAIFTDVASIKRQG 508
Query: 362 WS-NMRVYLKTQNVKLDIDRFRIYLKDAYKKA 392
+ ++L QN+ LDIDR++ L+ Y+ +
Sbjct: 509 YHVTWELFLNGQNITLDIDRYKGVLQKIYQDS 540
>gi|326520259|dbj|BAK07388.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 625
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 133/384 (34%), Positives = 205/384 (53%), Gaps = 21/384 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S R D+C G D STFF V D+ + S +++ PYPRK D + M +V
Sbjct: 245 CDFSDRRSDVCDFTGDIRMDANASTFFLV--VDAATAAQSHKVR--PYPRKGDPTCMGRV 300
Query: 76 KELTITT---SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
E+T+ T S+PP C TH PA+ FS GGY GN FH+F D VPL+ T++ +
Sbjct: 301 PEITVRTTSSSSPPE--CTRTHGVPAVTFSIGGYTGNIFHDFSDVLVPLYNTVHRY--RG 356
Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTI---THCFQSVTLGLISHGRMVIN 189
DV L +A+ W KY +LL +RH +++ THCF+ + L +H ++I
Sbjct: 357 DVQLVMANVAPWWLVKYDKLLRELSRHAPLDLAAAAAKGETHCFRHAVVSLRAHRELIIE 416
Query: 190 PTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREV 246
P DF F+ A + + + KP+L+++ R+ R +LNL ++
Sbjct: 417 RDRSPDGLATPDFTRFIRRAISLPRDAPTRLADGTGRKPRLLIIARHRT--RILLNLGDM 474
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+ AEE GF+ + E + ++ I+S +VGVHGAGLT+ +FL PG+ ++QVVP
Sbjct: 475 MRVAEEAGFEAAVSESDVGDPISRVGAEINSADVLVGVHGAGLTNMMFLAPGATMVQVVP 534
Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
G QW++ + + PA +GL Y++Y+I +ESSL +KY + +P + + M
Sbjct: 535 WGGLQWIARMDYGDPAEAMGLRYVQYEIGVDESSLKDKYPRGHKIFTDPTSLHKKGFGFM 594
Query: 366 -RVYLKTQNVKLDIDRFRIYLKDA 388
R + QN+ LD+ RFR L+ A
Sbjct: 595 RRTLMDGQNITLDLGRFRGVLQQA 618
>gi|326487780|dbj|BAK05562.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 125/382 (32%), Positives = 211/382 (55%), Gaps = 21/382 (5%)
Query: 10 PAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDK 69
P S CD S+ +D+C + G A ++ST V T ++ S+ I+ Q +
Sbjct: 169 PLNSTVCDLSNPRFDICELCGDARTIGQSSTVMYVPRTQTSDSE-EWSIRAQ------SR 221
Query: 70 SAMSKVKELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
+ +KE+T+ + ++ P C H PA+VF+ GG N +H+F D VPLF+T
Sbjct: 222 KNLPWIKEVTVKSLNTSQPAPRCTSKHAMPAIVFALGGLTANVWHDFSDVLVPLFLTARQ 281
Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
D+DV L + + +++KY +L + TR+ II+ ++ C+ V +GL SHG +
Sbjct: 282 F--DRDVQLLVTNNQPWFSKKYMTILSKLTRYDIIDFDSDDQVRCYPYVIVGLRSHGDLG 339
Query: 188 INPTLLPKPKTLVDFQSFLANAYNENT-------NTSSSFHHTKPKLVLVNRNARVGRTI 240
I P L P+ T++DF+ F+ AY KP+++L++R R
Sbjct: 340 IYPNLSPQNYTMMDFRLFVREAYGLPAAKVAIPYKADRDDPDKKPRIMLIDRGKT--RRF 397
Query: 241 LNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSV 300
+N + + E GF+V +P+ +SL + R + SC A++G HGAGLT+ +FLR G V
Sbjct: 398 INAPYIVQGLEWFGFEVVKVDPKMDSSLDEFARLVDSCDAIMGAHGAGLTNMVFLRSGGV 457
Query: 301 LMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGA 360
++ +VP G ++++ ++ KPAR +GL +++Y I EES+L+EKYG N V+K+P+A +
Sbjct: 458 VVHIVPYGIEFMADGFYGKPARDMGLGHVKYGISPEESTLLEKYGWNHTVIKDPEAIRSS 517
Query: 361 NWSNM-RVYLKTQNVKLDIDRF 381
W + VY+ Q++ L++ RF
Sbjct: 518 GWDKVGEVYMSKQDIVLNMTRF 539
>gi|242061414|ref|XP_002451996.1| hypothetical protein SORBIDRAFT_04g013150 [Sorghum bicolor]
gi|241931827|gb|EES04972.1| hypothetical protein SORBIDRAFT_04g013150 [Sorghum bicolor]
Length = 589
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 130/383 (33%), Positives = 211/383 (55%), Gaps = 15/383 (3%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S+ + C + G+ P S+ V T S+P + + K +PYPRK D+ ++ +
Sbjct: 212 CDFSNFRANTCEMRGNIRIHPNGSSVIYVEPTSSSPKRNE-QWKIKPYPRKGDELCLNHI 270
Query: 76 KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
E+T+ +S AP C H PA++F+ GY GN FH+F D VPLF T S F + +
Sbjct: 271 TEVTVKSSKVAP---ECSKYHHVPAVIFALTGYTGNLFHDFTDVLVPLFTTA-SEF-NGE 325
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
V I D W RKYA + + T++P+I+ N HCF+ +GL ++ I+P
Sbjct: 326 VQFLITDMAIWWTRKYAVVFEKLTKYPLIDFNKDNEVHCFKHAIVGLHAYMEFTIDPLKA 385
Query: 194 PKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNRNARVGRTILNLREVKKAA 250
P ++VDF F+ Y+ + ++ T+P+L++++R R LNL+E+ A
Sbjct: 386 PHNYSMVDFNRFMRRTYSLPRDAVTALGEIPKTRPRLLIISRQR--TRMFLNLKEIVAMA 443
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT- 309
+E+G+DV + E ++++ + ++S M+GVHGAGLT+ +FL ++L+Q+VP G
Sbjct: 444 DEIGYDVVVEEANVNSNVTHFAKVVNSVDVMMGVHGAGLTNCVFLPHDAILIQIVPWGAL 503
Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VY 368
+ + F PA +GL Y Y I ESSL E+Y + + +NP AF + +R +
Sbjct: 504 DGICRIDFGYPAEQMGLRYKHYSIGVHESSLTEQYPLDHEIFRNPLAFHKNGFEFVRQTF 563
Query: 369 LKTQNVKLDIDRFRIYLKDAYKK 391
+ TQNV+LD +RFR L +A +
Sbjct: 564 MDTQNVRLDCNRFRPILLEALDQ 586
>gi|195607024|gb|ACG25342.1| glycosyltransferase [Zea mays]
Length = 488
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 132/389 (33%), Positives = 206/389 (52%), Gaps = 23/389 (5%)
Query: 10 PAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDK 69
P +T RS R C +G D ST + G KT+PY R D
Sbjct: 111 PTSYMTSKRSER----CDASGDIRVDGNRSTIYVSGI--------DREWKTKPYARYHDP 158
Query: 70 SAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
AM+ V+E T+ A P +C H+ P +FS GG++GN +H++ D VPLFI+ +
Sbjct: 159 VAMAHVREYTLKPLPEAAPAPACTRNHSVPGFLFSNGGFSGNLYHDYTDVLVPLFIS--T 216
Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
H V ++ W K+ + TR+ +I+++N HCF + +G H M
Sbjct: 217 HQFRGRVQFLLSGMKPWWVAKFTPFFRQLTRYDVIDVDNDQEVHCFPRIVVGATFHKDMG 276
Query: 188 INPTLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLR 244
++P P ++VDF+ L A+ E + + KP+L++++R R R LN R
Sbjct: 277 VDPRRSPGHVSVVDFKRALRRAFGLPREAASRGGATGRGKPRLLIISR--RGSRRFLNER 334
Query: 245 EVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
E+ +AA GF+V + EP++ T A ++S MVGVHGAGLT+ +FL G+VL+QV
Sbjct: 335 EMARAAAGAGFEVRVAEPDQHTDTAAFAALVNSADVMVGVHGAGLTNMVFLPRGAVLVQV 394
Query: 305 VPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS 363
VP G +WL+ V F+ PA + + ++ Y + EESSL+++Y N VL +P A W
Sbjct: 395 VPFGGLEWLTGVTFKDPAADMEVSHMGYDVTLEESSLIDQYPRNHQVLTDPYAVHKQGWD 454
Query: 364 NMR-VYLKTQNVKLDIDRFRIYLKDAYKK 391
++ YL QN+++D+DRFR L++A +
Sbjct: 455 ALKAAYLDKQNIRMDLDRFRATLREAMSR 483
>gi|115453815|ref|NP_001050508.1| Os03g0567600 [Oryza sativa Japonica Group]
gi|13957626|gb|AAK50581.1|AC084404_6 unknown protein [Oryza sativa Japonica Group]
gi|108709375|gb|ABF97170.1| glycosyltransferase, putative, expressed [Oryza sativa Japonica
Group]
gi|113548979|dbj|BAF12422.1| Os03g0567600 [Oryza sativa Japonica Group]
gi|125586859|gb|EAZ27523.1| hypothetical protein OsJ_11476 [Oryza sativa Japonica Group]
gi|215704366|dbj|BAG93800.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 576
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 134/380 (35%), Positives = 205/380 (53%), Gaps = 17/380 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S+ ++C + G P + SV + SQ K +PYPRK D+ +S +
Sbjct: 201 CDFSNFRANVCEMRGDVRIHPTAT---SVLFMEPEGSQRDEVWKIKPYPRKGDEFCLSHI 257
Query: 76 KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
E+T+ +S AP C H PA++FS GY GN FH+F D VPLF T S F + +
Sbjct: 258 TEVTVKSSKVAP---ECTKYHDVPAVIFSLTGYTGNLFHDFTDVLVPLFTTA-SEF-NGE 312
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
V I D W KY +L + +++P+I+ + HCF+ +GL ++ I+ T
Sbjct: 313 VQFLITDMALWWTIKYQTVLQKLSKYPVIDFSKDDQVHCFKHAIVGLHAYMEFTIDSTKA 372
Query: 194 PKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
P ++ DF F+ AY+ ++ + KP+L+++ R+ R LNL E+ A
Sbjct: 373 PHNYSMADFNRFMRGAYSLGRDSVTVLGEYPKIKPRLLIIKRHRT--RMFLNLDEIISMA 430
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT- 309
EELGF+V I E S+ ++ R ++S M+GVHGAGLT+ +FL + L+Q+VP G
Sbjct: 431 EELGFEVVIDEANVSSDISRFARLVNSVDVMMGVHGAGLTNCVFLPQHATLIQIVPWGGL 490
Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VY 368
W+S F PA ++GL Y +Y I +ESSL ++Y + + KNP +F + +R +
Sbjct: 491 DWISRTDFGNPAELMGLRYKQYSIGVDESSLTDQYPRDHEIFKNPISFHQRGFDFIRQTF 550
Query: 369 LKTQNVKLDIDRFRIYLKDA 388
+ QNVKLD RFR L +A
Sbjct: 551 MDKQNVKLDCKRFRPILLEA 570
>gi|125544555|gb|EAY90694.1| hypothetical protein OsI_12297 [Oryza sativa Indica Group]
Length = 576
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 134/380 (35%), Positives = 205/380 (53%), Gaps = 17/380 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S+ ++C + G P + SV + SQ K +PYPRK D+ +S +
Sbjct: 201 CDFSNFRANVCEMRGDVRIHPTAT---SVLFMEPEGSQRDEVWKIKPYPRKGDEFCLSHI 257
Query: 76 KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
E+T+ +S AP C H PA++FS GY GN FH+F D VPLF T S F + +
Sbjct: 258 TEVTVKSSKVAP---ECTKYHDVPAVIFSLTGYTGNLFHDFTDVLVPLFTTA-SEF-NGE 312
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
V I D W KY +L + +++P+I+ + HCF+ +GL ++ I+ T
Sbjct: 313 VQFLITDMALWWTIKYQTVLQKLSKYPVIDFSKDDQVHCFKHAIVGLHAYMEFTIDSTKA 372
Query: 194 PKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
P ++ DF F+ AY+ ++ + KP+L+++ R+ R LNL E+ A
Sbjct: 373 PHNYSMADFNRFMRGAYSLGRDSVTVLGEYPKIKPRLLIIKRHRT--RMFLNLDEIISMA 430
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT- 309
EELGF+V I E S+ ++ R ++S M+GVHGAGLT+ +FL + L+Q+VP G
Sbjct: 431 EELGFEVVIDEANVSSDISRFARLVNSVDVMMGVHGAGLTNCVFLPQHATLIQIVPWGGL 490
Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VY 368
W+S F PA ++GL Y +Y I +ESSL ++Y + + KNP +F + +R +
Sbjct: 491 DWISRTDFGNPAELMGLRYKQYSIGVDESSLTDQYPRDHEIFKNPISFHQRGFDFIRQTF 550
Query: 369 LKTQNVKLDIDRFRIYLKDA 388
+ QNVKLD RFR L +A
Sbjct: 551 MDKQNVKLDCKRFRPILLEA 570
>gi|242095992|ref|XP_002438486.1| hypothetical protein SORBIDRAFT_10g020480 [Sorghum bicolor]
gi|241916709|gb|EER89853.1| hypothetical protein SORBIDRAFT_10g020480 [Sorghum bicolor]
Length = 478
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 135/396 (34%), Positives = 209/396 (52%), Gaps = 24/396 (6%)
Query: 9 SPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSD 68
+P +TC+ S+ D C + G K++T + V + P + ++ PY RK +
Sbjct: 88 APVSKLTCNFSNPHSDTCTMEGDLRIHGKSATVYVVAASTHRPENSTFIVR--PYTRKWE 145
Query: 69 KSAMSKVKELTITTSAPPNLS------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF 122
+ MS+++E+T+ S PP S C V H PA+VFS GG GNFFH D VPL+
Sbjct: 146 QETMSRIREVTMR-SMPPAFSFIIPPKCTVRHDVPAVVFSTGGCGGNFFHAMSDLIVPLY 204
Query: 123 ITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS 182
IT + + V I D +W K+ +L + +P+I+ + T CF S +GL S
Sbjct: 205 ITSREY--NGHVQFLITDYRPEWVAKFRPILAALSMYPVIDFDADTAVRCFPSAHVGLQS 262
Query: 183 HGRMV-INPTLLPKPKTLVDFQSFLANAYNEN----TNTSSSFHHTKPKLVLVNRNARVG 237
H +M+ I+P+L T++ F+ FL + ++ S S KP+LV+V R R
Sbjct: 263 HNKMLAIDPSLSRNGYTMMGFRDFLRSVFSLQRPWVEPVSRSSGRQKPRLVMVLR--RHS 320
Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
R + N E A E+LGF+V PE+ + ++SC MVGVHGAGLT+ +FL
Sbjct: 321 RELTNEAETITAMEDLGFEVVAALPEDVRDMGHFAGVVNSCDVMVGVHGAGLTNMVFLPH 380
Query: 298 GSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQA 356
++Q+VP G +W F +P +GL Y+EY++ EE++L EKY + V +P++
Sbjct: 381 NGTVVQIVPWGDMKWACWYDFGEPVPGMGLRYVEYEVTAEETTLKEKYPRDHPVFTDPRS 440
Query: 357 F--AGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
G WS +L QN+ LDIDRFR ++ ++
Sbjct: 441 IHRQGKAWST---FLNGQNITLDIDRFRAVMQQVFQ 473
>gi|326532652|dbj|BAJ89171.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 577
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 133/378 (35%), Positives = 203/378 (53%), Gaps = 13/378 (3%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S+ ++C + G PK + SV + SQ K +PYPRK D+ +S +
Sbjct: 202 CDFSNFRANVCEMRGDVRVHPKAT---SVLFMEPEGSQRDEVWKIKPYPRKGDEFCLSHI 258
Query: 76 KELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVI 135
ELT+ +S C H P ++FS GY GN FH+F D VPLF T S F D +V
Sbjct: 259 TELTVKSSKVA-AECTRYHDVPVVIFSLTGYTGNLFHDFTDVIVPLFTTA-SQF-DGEVQ 315
Query: 136 LAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPK 195
I D W KY LL + +++P+I+ HCF+ +G ++ I+ + P
Sbjct: 316 FLITDMALWWTIKYHTLLQKLSKYPLIDFGKDDQVHCFKHAIVGTHAYMEFTIDASKSPH 375
Query: 196 PKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAEE 252
T+VDF F+ +AY+ T+++ + KP+L+++ R+ R LNL E+ AEE
Sbjct: 376 GVTMVDFNRFMRDAYSLPKETAAALGESPKVKPRLLIIKRHRT--RMFLNLEEIIAMAEE 433
Query: 253 LGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QW 311
LGF+V I E S+ + + ++S M+GVHGAGLT+ +FL + L+Q+VP G W
Sbjct: 434 LGFEVVIDEANVSSDINGFAKLVNSVDVMMGVHGAGLTNCVFLPQNATLIQIVPFGGLDW 493
Query: 312 LSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLK 370
+S F P+ ++GL Y +Y I +ESSL + Y + + K+P +F + + R ++
Sbjct: 494 ISRTDFGNPSEMMGLRYKQYAITVDESSLTDHYPRDHKIFKDPISFHKRGFEFIRRTFMD 553
Query: 371 TQNVKLDIDRFRIYLKDA 388
QNVKLD RFR L +A
Sbjct: 554 KQNVKLDCKRFRPVLLEA 571
>gi|413954173|gb|AFW86822.1| hypothetical protein ZEAMMB73_198004 [Zea mays]
Length = 486
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 135/390 (34%), Positives = 209/390 (53%), Gaps = 23/390 (5%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+TCD + D C + G K++T + V + P ++ ++ PY RK ++ MS
Sbjct: 102 LTCDFGNPHSDTCTMEGDLRIHGKSATVYVVAASTHRPENSTITVR--PYTRKWEQETMS 159
Query: 74 KVKELTITTSAPPNLS------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
+++E+T+ S PP S C V H PA+VFS GG NFFH D VPL+IT
Sbjct: 160 RIREVTMR-SMPPAFSFVIPPKCTVRHDVPAVVFSTGGCGTNFFHGMSDLVVPLYITSRE 218
Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
+ D V L IAD N +W K+ +L + P+I+++ CF SV +GL SH +M+
Sbjct: 219 Y--DGRVQLLIADYNAEWVAKFRPVLAALSTFPVIDLDADAAVRCFPSVHVGLESHKKML 276
Query: 188 -INPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH---HTKPKLVLVNRNARVGRTILNL 243
I+P+L T++ F+ FL + ++ ++ KP+L++V R R R + N
Sbjct: 277 AIDPSLSRNGYTMMGFRDFLRSVFSLQRPWATPVRLGSGQKPRLLMVLR--RRSRELSNE 334
Query: 244 REVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
E A EE+GF+V PE+ + ++ ++SC MVGVHGAGLT+ +FL ++Q
Sbjct: 335 AETIAAMEEIGFEVVAARPEDVSDMSRFAGVVNSCDVMVGVHGAGLTNMVFLPHNGTVVQ 394
Query: 304 VVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF--AGA 360
+VP G +W F +P +GL Y+EY+ EE++L EKY + V +PQ+ G
Sbjct: 395 IVPWGNMKWACWYAFGEPVPGMGLRYVEYEATAEETTLKEKYPRDHPVFTDPQSIHRQGK 454
Query: 361 NWSNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
WS +L QNV LDIDRF+ + ++
Sbjct: 455 AWST---FLNGQNVTLDIDRFKAVMLQVFQ 481
>gi|238011450|gb|ACR36760.1| unknown [Zea mays]
gi|413926794|gb|AFW66726.1| hypothetical protein ZEAMMB73_527920 [Zea mays]
Length = 491
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 129/389 (33%), Positives = 207/389 (53%), Gaps = 24/389 (6%)
Query: 15 TCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
TC + + + C +G D ST + VG + +T+PY R D AM+
Sbjct: 111 TCYMTSKRSERCDASGDIRVDGNRSTIY-VGGIERE-------WRTKPYARYHDPVAMAH 162
Query: 75 VKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
V+E T+ A +C H+ P +FS GG++GN +H++ D VPLFI+ +H
Sbjct: 163 VREYTLKALPEPGAAAAPACTRNHSVPGFLFSNGGFSGNLYHDYTDVLVPLFIS--THQF 220
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
V ++ W K+ + TRH +I+++ HCF + +G H M ++P
Sbjct: 221 RGRVQFLVSGMKPWWVGKFTPFFRQLTRHDVIDVDKDGEVHCFPRIVVGATFHRDMGVDP 280
Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSS------SFHHTKPKLVLVNRNARVGRTILNLR 244
P + VDF+ L A+ +S + KP+L++++R R R LN R
Sbjct: 281 RRAPGHVSAVDFKRALRAAFGLKREAASRGGGGGATGDGKPRLLIISR--RGSRRFLNSR 338
Query: 245 EVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
E+ AA + GF+V + EP++ T +A ++S AMVGVHGAGLT+ +FL G+VL+QV
Sbjct: 339 EMAVAAGDAGFEVRVAEPDQRTDMAAFAALVNSADAMVGVHGAGLTNMVFLPRGAVLVQV 398
Query: 305 VPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS 363
VP G +WL+ V F++PA + + Y++Y ++ EESSLV++Y VL +P A W
Sbjct: 399 VPFGGLEWLTGVTFKEPAADMEVSYMDYHVRLEESSLVDQYPRGHQVLTDPYAVHRQGWD 458
Query: 364 NMR-VYLKTQNVKLDIDRFRIYLKDAYKK 391
++ YL QN+++D+DRFR L++ +
Sbjct: 459 ALKTAYLDKQNIRMDLDRFRATLREVMAR 487
>gi|242052307|ref|XP_002455299.1| hypothetical protein SORBIDRAFT_03g008000 [Sorghum bicolor]
gi|241927274|gb|EES00419.1| hypothetical protein SORBIDRAFT_03g008000 [Sorghum bicolor]
Length = 568
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 134/389 (34%), Positives = 209/389 (53%), Gaps = 28/389 (7%)
Query: 10 PAKSITCDRSHRSYD-------LCLINGSALFDPKTSTFFSVGHTDSTPS-QPSLRIKTQ 61
P I CD + +C ++G P++S+ +TP Q +
Sbjct: 180 PKSKIYCDDKSKDDGFPYARPIICQMSGDVRVSPESSSV-----ALNTPMLQGEEERRIT 234
Query: 62 PYPRKSDKSAMSKVKELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
PY RK D S +S V+E+ I A N + C ++H PA++FS GGY GNFFH+ D +
Sbjct: 235 PYARKDD-SLLSLVREVVIRAVANENDAPKCSISHDVPAVIFSVGGYTGNFFHDMSDVLI 293
Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLG 179
PL++T + V I + W +KY +L R + H II+ ++ HCFQ V LG
Sbjct: 294 PLYLTSFQY--KGRVKFFITNYKQWWIQKYKPVLRRLSHHDIIDFDSNKDVHCFQHVILG 351
Query: 180 LISHGRMVI--NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNA 234
L +++ +PT PK +++DF FL ++Y N KP++++++R
Sbjct: 352 LTRDRDLILRPHPTRNPKGYSMLDFTRFLRHSYGLKRNRPLVLGEQPGKKPRMLIISR-- 409
Query: 235 RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
R R +LNLR+V + LGFDV I E +L ++SC ++ VHGAGLT+ +F
Sbjct: 410 RGTRKLLNLRQVAATSRALGFDVII--SEARGNLKRFATMVNSCDVLLAVHGAGLTNQVF 467
Query: 295 LRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKN 353
L P +V++Q+VP G W++T ++ +PAR + L YLEY + ++ESSL ++Y + LV K+
Sbjct: 468 LPPQAVVVQIVPWGKMDWMATNFYGQPARGMNLRYLEYYVSEKESSLAQRYPRDHLVFKD 527
Query: 354 PQAFAGANWSNMRVYLKTQNVKLDIDRFR 382
P A G W+ + Q+VKL+I RFR
Sbjct: 528 PMAIHGQGWNALADIFMAQDVKLNIRRFR 556
>gi|224135465|ref|XP_002322080.1| predicted protein [Populus trichocarpa]
gi|222869076|gb|EEF06207.1| predicted protein [Populus trichocarpa]
Length = 387
Score = 224 bits (571), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 128/376 (34%), Positives = 203/376 (53%), Gaps = 14/376 (3%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKS--DKSA 71
+ C+ RS + C I G D +ST F V + + +PY RK +
Sbjct: 7 LRCNIMGRS-EFCEIKGDIRIDGNSSTAFIVSSETDILTAENTSWSIRPYARKEALGEKD 65
Query: 72 MSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
++ + + T P C H+ PA++FS GGY+GNFFH F D VPL+ T +
Sbjct: 66 FARKWSVKLVTDRPDIPRCTRNHSVPAILFSVGGYSGNFFHAFTDIIVPLYSTAQPF--N 123
Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
++V I + W K+ LL +R+ II+I+++ HCFQS+T+GL ++
Sbjct: 124 REVQFLITNRKSSWIAKFKTLLEALSRYEIIDIDDRHDMHCFQSLTIGLKGRNNKELSID 183
Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSSSFH---HTKPKLVLVNRNARVGRTILNLREVKK 248
P ++ DF FL Y+ T++ KP+L++++R + R N+ E+ +
Sbjct: 184 SSTSPYSMKDFTQFLRRWYSLKKITAAKIRDGDKRKPRLLIISR--KRSRAFTNVGEIAE 241
Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
AE L + V + EP ++ + I+SC ++GVHGAGLT+ +FL ++L+QVVP G
Sbjct: 242 LAESLSYQVIVAEP--GPDVSGFAKIINSCDVVMGVHGAGLTNIVFLPENAILIQVVPFG 299
Query: 309 -TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
+W S V FE PA+ + + YL+YKIK EES+L+++Y A+ +VL++P W R
Sbjct: 300 RVEWASRVSFEDPAKDMNIRYLDYKIKVEESTLIQQYPADHVVLRDPSVIGKQGWLAFRS 359
Query: 367 VYLKTQNVKLDIDRFR 382
+YL QNV LD++RFR
Sbjct: 360 IYLDKQNVTLDVNRFR 375
>gi|195657307|gb|ACG48121.1| glycosyltransferase [Zea mays]
Length = 681
Score = 224 bits (571), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 133/392 (33%), Positives = 205/392 (52%), Gaps = 17/392 (4%)
Query: 16 CD-RSHRSYDLCLINGSALFDPKTSTFFSV----GHTDSTPSQPSLRIKTQPYPRKSDKS 70
CD S+R D C ++G ++ V G T S RIK PYPRK+D +
Sbjct: 289 CDITSNRRIDWCELDGDVRVLGANASVTLVAPPGGADGRTFRAESWRIK--PYPRKADPN 346
Query: 71 AMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
AM V+ LT+ + +C H PALVFS GY GN+FH F D +PLF+T +
Sbjct: 347 AMRVVRVLTVRSVPGEAPACTDRHGVPALVFSDRGYTGNYFHAFTDVILPLFLTARQYA- 405
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
+V L + D W K+A + + + +++++ HCF+ V +GL SH I+P
Sbjct: 406 -GEVRLLVTDLQAWWVGKFAPVFRSISNYELVDLDRDPRVHCFRHVQVGLTSHDDFSIDP 464
Query: 191 TLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
P +++DF F+ AY + + +P+L+++ R AR R +N E+
Sbjct: 465 LRAPNGYSMLDFTGFMRAAYGLPRGDVAAAGPSSKRRPRLLVIAR-ART-RRFVNAEEIV 522
Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
+ A +LGF+V + E + +A +SC A++GVHGAGLT+ +F+ G V++QVVP+
Sbjct: 523 RGAVKLGFEVVVSE--GTHEVAPFAELANSCDAIMGVHGAGLTNMVFVPTGGVVIQVVPL 580
Query: 308 GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
G YF P+R +GL YLEY+I EES+L+ +Y + + +P W +++
Sbjct: 581 GGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLINQYPRDHPIFTDPNGIKSKGWESLKD 640
Query: 367 VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
YL Q+V LD+ RFR LK A +K K
Sbjct: 641 AYLDKQDVSLDMKRFRPTLKKAIAHIRKARAK 672
>gi|383100766|emb|CCG47997.1| glycosyltransferase, HGA-like,putative,expressed [Triticum
aestivum]
Length = 633
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 130/389 (33%), Positives = 203/389 (52%), Gaps = 21/389 (5%)
Query: 16 CDR-SHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
CD S+R D C ++G T VG P R++ PYPRK D +AM
Sbjct: 256 CDMTSNRRIDWCELDGDVRVHGARGTVTLVG----APRAEEWRVR--PYPRKVDPNAMRH 309
Query: 75 VKELTI--TTSAP---PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
V +T+ TT+ P C + H+ PAL+FS GY GN+FH + D +PLF+T +
Sbjct: 310 VTNITVRSTTTLPGAAEEEECAIKHSVPALLFSDRGYTGNYFHAYTDVILPLFLTAKRY- 368
Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
+V ++D W K+ + + + ++++ T CFQ V +GL H I+
Sbjct: 369 -GGEVQFLVSDMQMWWVGKFLPVFKSLSNYDLVDLAADNRTRCFQHVQVGLTCHADFSID 427
Query: 190 PTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA 249
P P ++VDF + Y + +P+L+L+ R + R +N E+ +A
Sbjct: 428 PLRAPNGYSMVDFTRHMRGVYGLPRGLAVPAAGARPRLLLIARAST--RRFVNADEIVRA 485
Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
A+++GF+V + E + +A ++C AM+GVHGAGLT+ +FL G V++QVVP+G
Sbjct: 486 AQKVGFEVVV--SEGTHEVAPFAELANTCDAMLGVHGAGLTNMVFLPTGGVVIQVVPLGG 543
Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VY 368
YF P+R +GL YLEY+I ES+L E+Y + + +P W +++ VY
Sbjct: 544 LEFVAGYFRTPSRDMGLRYLEYRIAPAESTLTEQYPPDHPIFTDPDGVKSKGWDSLKQVY 603
Query: 369 LKTQNVKLDIDRFRIYLKD--AYKKAKKL 395
L Q+V+LD+ RFR LK A+ +A KL
Sbjct: 604 LDKQDVRLDLKRFRPLLKKAIAHIRANKL 632
>gi|308080322|ref|NP_001183140.1| uncharacterized protein LOC100501509 [Zea mays]
gi|238009592|gb|ACR35831.1| unknown [Zea mays]
Length = 378
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 131/390 (33%), Positives = 207/390 (53%), Gaps = 28/390 (7%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+T RS R C +G D ST + VG + +T+PY R D AM+
Sbjct: 1 MTSKRSER----CDASGDIRVDGNRSTIY-VGGIERE-------WRTKPYARYHDPVAMA 48
Query: 74 KVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
V+E T+ A +C H+ P +FS GG++GN +H++ D VPLFI+ +H
Sbjct: 49 HVREYTLKALPEPGAAAAPACTRNHSVPGFLFSNGGFSGNLYHDYTDVLVPLFIS--THQ 106
Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
V ++ W K+ + TRH +I+++ HCF + +G H M ++
Sbjct: 107 FRGRVQFLVSGMKPWWVGKFTPFFRQLTRHDVIDVDKDGEVHCFPRIVVGATFHRDMGVD 166
Query: 190 PTLLPKPKTLVDFQSFLANAYNENTNTSS------SFHHTKPKLVLVNRNARVGRTILNL 243
P P + VDF+ L A+ +S + KP+L++++R R R LN
Sbjct: 167 PRRAPGHVSAVDFKRALRAAFGLKREAASRGGGGGATGDGKPRLLIISR--RGSRRFLNS 224
Query: 244 REVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
RE+ AA + GF+V + EP++ T +A ++S AMVGVHGAGLT+ +FL G+VL+Q
Sbjct: 225 REMAVAAGDAGFEVRVAEPDQRTDMAAFAALVNSADAMVGVHGAGLTNMVFLPRGAVLVQ 284
Query: 304 VVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW 362
VVP G +WL+ V F++PA + + Y++Y ++ EESSLV++Y VL +P A W
Sbjct: 285 VVPFGGLEWLTGVTFKEPAADMEVSYMDYHVRLEESSLVDQYPRGHQVLTDPYAVHRQGW 344
Query: 363 SNMR-VYLKTQNVKLDIDRFRIYLKDAYKK 391
++ YL QN+++D+DRFR L++ +
Sbjct: 345 DALKTAYLDKQNIRMDLDRFRATLREVMAR 374
>gi|413925803|gb|AFW65735.1| hypothetical protein ZEAMMB73_558289 [Zea mays]
Length = 596
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 131/384 (34%), Positives = 211/384 (54%), Gaps = 16/384 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S+ ++C + G+ P S+ + T S+ S+ + + K +PYPRK D+ ++ +
Sbjct: 218 CDFSNFRANVCEMRGNIRIHPNGSSVIYMEPT-SSGSKRNEQWKIKPYPRKGDELCLNHI 276
Query: 76 KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
E+T+ +S AP C H PA++F+ GY GN FH+F D VPLF T S F + +
Sbjct: 277 TEVTVKSSIVAP---ECSKYHDVPAVIFALTGYTGNLFHDFTDVLVPLFTTA-SEF-NGE 331
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
V I D W RKYA + + T++P+I+ N HCF +GL ++ I+P
Sbjct: 332 VQFLITDMAIWWTRKYAVVFEKLTKYPLIDFNKDNEVHCFNHAIVGLHAYMEFTIDPLKA 391
Query: 194 PKPKTLVDFQSFLANAYNENTNTSSSFHHT----KPKLVLVNRNARVGRTILNLREVKKA 249
P ++VDF F+ Y+ + + T KP+L++++R R LNL+E+
Sbjct: 392 PHNYSMVDFNQFMRRTYSLPRDAVRAPGETPQTRKPRLLIISRQR--TRMFLNLKEIVAM 449
Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
AEE+G++V + E ++++ + ++S M+GVHGAGLT+ +FL G+ L+Q+VP G
Sbjct: 450 AEEIGYEVVVEEANVNSNVGHFAKVVNSVDVMMGVHGAGLTNCVFLPHGATLIQIVPWGA 509
Query: 310 -QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-V 367
+ + F PA +GL Y Y I ESSL E+Y + + +NP AF + +R
Sbjct: 510 LDGICRIDFGYPAEQMGLRYKHYSIGVHESSLTEQYPLDHEIFRNPLAFHKNGFEFVRQT 569
Query: 368 YLKTQNVKLDIDRFRIYLKDAYKK 391
++ TQNV+LD +RFR L +A +
Sbjct: 570 FMDTQNVRLDCNRFRPVLLEALDQ 593
>gi|413943929|gb|AFW76578.1| hypothetical protein ZEAMMB73_859887 [Zea mays]
Length = 478
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 128/397 (32%), Positives = 209/397 (52%), Gaps = 24/397 (6%)
Query: 11 AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
A +TC+ S+R D C + G +++T + V + + + +PY RK ++
Sbjct: 84 ASRLTCNLSNRHSDTCTMEGDVRIHGRSATVYVVAAASNHRRPENSTVTVRPYTRKWEQE 143
Query: 71 AMSKVKELTITTSAPPNL--------SCGVTHT-SPALVFSAGGYNGNFFHEFMDCFVPL 121
M++++E+T+ + PP C H +PA+VFS GG NFFH D VPL
Sbjct: 144 TMARIREVTVRYTPPPAPFGSGVIPPRCTAVHDGAPAVVFSTGGCGTNFFHAMSDLVVPL 203
Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
+IT + V L + D +W K+ +L + HP++++++ CF + +GL
Sbjct: 204 YITAREY--GGRVHLLVTDYRPEWVAKFRPVLDALSAHPVVDLDSDAAVRCFPAARVGLE 261
Query: 182 SH-GRMVINPTLLPKPKTLVDFQSFLANAYN----ENTNTSSSFHHTKPKLVLVNRNARV 236
SH G + I+PTL T+V F+ FL + ++ ++S P+LV+V R R
Sbjct: 262 SHNGMLAIDPTLSRHGYTMVGFRDFLRSVFSLPRPWAWSSSRPVISRPPRLVMVLR--RH 319
Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R + N + A E+LGFDV PE+ + R ++SC MVGVHGAGLT+ +FL
Sbjct: 320 SRALTNEADTVAAMEDLGFDVVPARPEDVADMGRFARVVNSCDVMVGVHGAGLTNMVFLP 379
Query: 297 PGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
+ ++Q+VP G +W F +P + +GL Y+EY++ EE++L EKY + V +PQ
Sbjct: 380 HNATVVQIVPWGDMKWACWYDFGEPVQGMGLRYVEYEVTAEETTLKEKYARDHPVFTDPQ 439
Query: 356 AF--AGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
+ G W+ +L QNV LDIDRF+ ++ ++
Sbjct: 440 SIHRQGKAWAT---FLDGQNVTLDIDRFKAAMQQVFQ 473
>gi|326514418|dbj|BAJ96196.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326529397|dbj|BAK04645.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 503
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 122/340 (35%), Positives = 193/340 (56%), Gaps = 11/340 (3%)
Query: 59 KTQPYPRKSDKSAMSKVKE--LTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMD 116
+T+PY R D AM+ V+E L + P +C H+ P VFS G++GN +H++ D
Sbjct: 163 RTKPYARYHDPVAMTHVREFVLKPFPADAPPPACTKNHSVPGFVFSNRGFSGNLYHDYTD 222
Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
VPLF++ +V ++D W K+ L + + + ++++NN HC +
Sbjct: 223 VLVPLFLSTRKF--KGEVQFLLSDLKPWWVAKFRPLFRQLSNYEVVDVNNDLEVHCVPRI 280
Query: 177 TLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRN 233
+G H M I P+ ++VDF+ L +A+ +S T KP+L++++R
Sbjct: 281 VVGSDFHKDMGIIPSKAAGHVSIVDFKRTLRDAFGLERAAASRGGATGAGKPRLLIISR- 339
Query: 234 ARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSL 293
+ R LN RE+ AA +GFDV I EP++ T ++ R ++S MVGVHGAGLT+ +
Sbjct: 340 -KNSRRFLNEREMAAAATAMGFDVRIAEPDQHTDMSTFARLVNSADVMVGVHGAGLTNMV 398
Query: 294 FLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLK 352
FL G+VL+QVVP G +WL+ V F+ PA + + Y++Y ++ EESSL+E+Y N VL
Sbjct: 399 FLPAGAVLIQVVPFGGLEWLTGVTFKNPAADMEVTYMDYNVQLEESSLLEQYPRNHQVLT 458
Query: 353 NPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKK 391
+P A W ++ YL QNV+LD+D+FR L+DA +
Sbjct: 459 DPYAVHKQGWDALKAAYLDKQNVRLDLDKFRATLRDALSR 498
>gi|388512767|gb|AFK44445.1| unknown [Lotus japonicus]
Length = 365
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 118/319 (36%), Positives = 175/319 (54%), Gaps = 36/319 (11%)
Query: 89 SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARK 148
+C V H PA+ FS GGY GN +HEF D +PL+IT S ++ V+ I + +D W K
Sbjct: 21 TCDVQHDVPAVFFSNGGYTGNVYHEFNDGIIPLYIT--SQRFNKKVVFVILEYHDWWMMK 78
Query: 149 YAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLAN 208
Y ++L R + P I+ THCF +GL H + ++ L+ K++VDF++ +
Sbjct: 79 YGDILSRLSDFPAIDFRGDNRTHCFPEAVVGLKIHDELTVDSKLMEGNKSIVDFRNLMDK 138
Query: 209 AY---------NENTN---------TSSSFHHT-------------KPKLVLVNRNARVG 237
AY +E +SS T KPKLV+V+R+
Sbjct: 139 AYWPRIKGLIQDEEREAQGKLREQISSSPLSETPLIKQEVQENPKKKPKLVIVSRSG--S 196
Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
R I N + K AEE+GF V + +P ++T LA +R ++ M+GVHGA +TH +FL+
Sbjct: 197 RAITNENLLVKMAEEIGFFVEVLKPAKTTELAKIYRVLNGSDVMIGVHGAAMTHFMFLKS 256
Query: 298 GSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
GSV +Q VP+GT W + Y+ +PAR LGL+Y+ Y+I ESSL EKY ND +L++P +
Sbjct: 257 GSVFIQAVPLGTNWAAETYYGEPARKLGLKYIGYEIHPRESSLYEKYDKNDPILRDPASI 316
Query: 358 AGANWS-NMRVYLKTQNVK 375
W ++YL +QNVK
Sbjct: 317 NEKGWEYTKKIYLDSQNVK 335
>gi|357127186|ref|XP_003565265.1| PREDICTED: uncharacterized glycosyltransferase AER61-like
[Brachypodium distachyon]
Length = 498
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 138/408 (33%), Positives = 222/408 (54%), Gaps = 55/408 (13%)
Query: 14 ITCDRSHRSYDLC------LINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKS 67
+ C+R D C INGSAL + T ++T + +I+ PY R++
Sbjct: 110 VVCNRDGPFSDTCEVTGDVRINGSAL----SVTLLPSSRQETTSRRQQWKIR--PYSRRT 163
Query: 68 DKSAMSKVKELTIT-------TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
M+ ++E+T+T SA P +C VTH PA+VF+ GG GN+FH+F D VP
Sbjct: 164 ----MTDIREVTVTQLASADEASAAP--ACTVTHEVPAIVFALGGLTGNYFHDFSDALVP 217
Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
LF+ + +V+ +++ W KY ++ R +++ ++++ T CF+ V++GL
Sbjct: 218 LFVASRRY--GGEVLFLVSNIQPWWLAKYGAVVRRLSKYDAVDLDKDNQTRCFRHVSVGL 275
Query: 181 -------ISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTN---------TSSSFHHTK 224
I+ G+ NP +P DF +FL Y+ N T S+ + K
Sbjct: 276 RLTKEFDIAAGKN--NPLSMP------DFTAFLRETYSLPRNSPTKISLGATGSNDDNQK 327
Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
P+L+L++R+ R +LN+ E+ AAE GF VTI +P ++D + ++S ++GV
Sbjct: 328 PRLMLIHRSHY--RKLLNVPEIVAAAESAGFAVTISDPRFDVRISDLAKSVNSFDVLMGV 385
Query: 285 HGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEK 343
HGAGLT++ FL PG V++QVVP G + L+ F +P + +GLEYLEY + +ESSL+E
Sbjct: 386 HGAGLTNAAFLPPGGVVIQVVPYGKMEGLARTDFGEPVKDMGLEYLEYSVSAQESSLLEM 445
Query: 344 YGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYK 390
G LV+K+P+A + W + YL Q+V+LD++RFR L A +
Sbjct: 446 LGPEHLVIKDPEAVHRSGWDKVAEYYLGKQDVRLDVERFRPTLDKAME 493
>gi|242052309|ref|XP_002455300.1| hypothetical protein SORBIDRAFT_03g008020 [Sorghum bicolor]
gi|241927275|gb|EES00420.1| hypothetical protein SORBIDRAFT_03g008020 [Sorghum bicolor]
Length = 626
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 129/386 (33%), Positives = 200/386 (51%), Gaps = 20/386 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSV-GHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
CD S D+C I G D S+F V ++ Q K +PYPRK D++ M +
Sbjct: 236 CDFSDFRSDICDIAGDVRLDANVSSFVVVVDPASASGQQEEEEHKVRPYPRKGDETCMGR 295
Query: 75 VKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
+ E+T+ T A C TH +PA+VFS GGY GN FH+F D VPL+ T+ +
Sbjct: 296 ITEITVRATRGAAGAPRCTRTHAAPAVVFSIGGYTGNIFHDFSDVLVPLYNTVRRY--GG 353
Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTIT---HCFQSVTLGLISHGRMVIN 189
DV L +A+ W KY LL +RH +++ HCF+ + L +H ++I
Sbjct: 354 DVQLVMANSASWWLVKYDRLLRELSRHAPLDLAGAGAAREVHCFRHAVVSLRAHKELIIE 413
Query: 190 PTLLPKPKTLVDFQSFLANAYNENTNTSSSF--------HHTKPKLVLVNRNARVGRTIL 241
DF FL A + + KP+L++++R+ R +L
Sbjct: 414 RDRSLDGLATPDFTRFLRRALGLPRDAPTRLVVGGGDGTGRKKPRLLIISRHRT--RLLL 471
Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
NL V +AAEE+GF+ + E + + +A I+SC AMVGVHGAGLT+ +FL PG+ L
Sbjct: 472 NLDAVVRAAEEVGFEAIVNESDVANDIAQVGGLINSCDAMVGVHGAGLTNMMFLPPGAAL 531
Query: 302 MQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGA 360
+Q+VP G QW++ + PA +GL+Y++Y+I +ES+L +K+ + + NP
Sbjct: 532 VQIVPWGGLQWMARADYGDPAEAMGLKYIQYEIGVDESTLKDKFPSGHKIFTNPTELHKK 591
Query: 361 NWSNMR-VYLKTQNVKLDIDRFRIYL 385
+ +R + Q++ +D+ RFR L
Sbjct: 592 GFMFIRQTLMDGQDITVDVARFREVL 617
>gi|326487412|dbj|BAJ89690.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 644
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 128/386 (33%), Positives = 203/386 (52%), Gaps = 18/386 (4%)
Query: 16 CD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
CD S+R D C ++G T VG T R+K PYPRK D +AM
Sbjct: 270 CDLTSNRRIDWCELDGDVRVHGAQGTVTLVG----TAKAEEWRVK--PYPRKVDPNAMRH 323
Query: 75 VKELTITTSAPP--NLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
V+E+ + ++ P + C V H+ PAL+FS GY GN+FH + D +PLF+T +
Sbjct: 324 VREIAVRSTTLPGGDEECAVKHSVPALLFSDRGYTGNYFHAYTDVILPLFLTAKRY--GG 381
Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
+V ++D W K+ + + + ++++ T CF V +GL H I+P
Sbjct: 382 EVQFLVSDLQMWWIGKFLPVFKSLSNYDLVDLAADNRTRCFAHVQVGLTCHADFSIDPLR 441
Query: 193 LPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEE 252
P ++VDF + Y + +P+L+L+ R + R +N E+ +AA++
Sbjct: 442 APNGYSMVDFTRHMRGTYGLPRGLAVPAAGARPRLLLIARAST--RRFVNADEIVRAAQK 499
Query: 253 LGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWL 312
+GF+V + E + +A ++C AM+GVHGAGLT+ +FL V++QVVP+G
Sbjct: 500 VGFEVVVSE--GTHEVAPFAELANTCDAMLGVHGAGLTNMVFLPTRGVVIQVVPLGGLEF 557
Query: 313 STVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKT 371
YF P+R +GL+YLEY+I ES+L E+Y + + +P W +++ VYL
Sbjct: 558 VAGYFRTPSRDMGLKYLEYRISPAESTLTEQYPPDHPIFTDPDGVKSKGWESLKQVYLDK 617
Query: 372 QNVKLDIDRFRIYLKD--AYKKAKKL 395
Q+V+LD+ RFR LK A+ +A KL
Sbjct: 618 QDVRLDLKRFRPLLKKAIAHIRANKL 643
>gi|413936952|gb|AFW71503.1| hypothetical protein ZEAMMB73_965201 [Zea mays]
Length = 541
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 129/383 (33%), Positives = 204/383 (53%), Gaps = 27/383 (7%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQ----PSLRIKTQPYPRKSD 68
+ CD S++ YD+C + G A ++ST V ++ + P+ K+ P+ +K
Sbjct: 160 ATVCDLSNQRYDICELCGDARAIGRSSTVMYVPQPLTSNGEEWNIPAQSRKSLPWIKKV- 218
Query: 69 KSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
VK L + AP SC HT PA+VF+ GG+ GN +H+ D VPLF+T
Sbjct: 219 -----TVKTLKASQQAP---SCTSRHTMPAIVFALGGFTGNVWHDVSDVLVPLFLTARQF 270
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
DQDV L + D + +KY+ +L R TRH +I+ + C+ V +GL SH + I
Sbjct: 271 --DQDVQLLVTDNRPWFIKKYSAILRRLTRHSVIDFDADDEVRCYPHVIVGLRSHRDLGI 328
Query: 189 NPTLLP-KPKTLVDFQSFLANAYN--------ENTNTSSSFHHTKPKLVLVNRNARVGRT 239
+P P K T+VDF+ F+ AY KP+++L+ R R
Sbjct: 329 DPDSTPQKNYTMVDFRLFVREAYGLPAPGVDIPYRADKDEPEKKKPRMMLIERGKT--RR 386
Query: 240 ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGS 299
++N +V GF+V +P + L + R + SC AM+GVHGAGLT+ +FLR G+
Sbjct: 387 LVNTPDVLLGLGWFGFEVVRADPRTHSDLEEFARLVDSCDAMMGVHGAGLTNMVFLRSGA 446
Query: 300 VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
L+ VVP G ++++ ++ PAR +GL ++ Y + +ES+L+EKYG N V+++P+A
Sbjct: 447 ALVHVVPYGVEFMADGFYGAPARDMGLRHVRYSVGPDESTLLEKYGRNHAVVRDPEAVRS 506
Query: 360 ANWSNM-RVYLKTQNVKLDIDRF 381
W + VY+ Q+V L++ RF
Sbjct: 507 GGWEKVGEVYMTEQDVVLNVTRF 529
>gi|357144077|ref|XP_003573162.1| PREDICTED: uncharacterized protein LOC100845100 [Brachypodium
distachyon]
Length = 575
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 131/373 (35%), Positives = 203/373 (54%), Gaps = 16/373 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S+ ++C + G+ P S SV + + S+ + K +PYPRK D+ +S +
Sbjct: 201 CDFSNFRANVCEMRGNIKIHPNAS---SVMYMEPAGSKRDEQWKIKPYPRKGDELCLSHI 257
Query: 76 KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
ELT+ +S AP C H PA+VF+ GY GN FH+F D VPLF T S F + +
Sbjct: 258 TELTVKSSKVAP---ECTKYHNVPAVVFALTGYTGNLFHDFTDVLVPLFTTA-SEF-NGE 312
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
V I D W RKY + + +++P+I+ N HCF +GL ++ I+ +
Sbjct: 313 VQFLITDMAIWWTRKYHVVFEKLSKYPLIDFNKDDQVHCFNHAIVGLHAYMEFTIDSSKA 372
Query: 194 PKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNRNARVGRTILNLREVKKAA 250
P ++VDF F+ Y+ + S+ TKP+L++++R R LNL+EV A
Sbjct: 373 PHNYSMVDFNRFMRQTYSLPRDAVSALGEIPKTKPRLLIISRQRT--RMFLNLQEVVAMA 430
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT- 309
EELGF+V + E S+ L+ + ++S M+GVHGAGLT+ +FL + L+Q+VP G
Sbjct: 431 EELGFEVVVEEANVSSDLSHFSKAVNSVDVMMGVHGAGLTNCVFLPHNATLIQIVPWGGL 490
Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYL 369
+ + + F P+ +GL Y +Y I ESSL ++Y + + KNP AF + ++
Sbjct: 491 EGVCRIDFGDPSEQMGLRYKQYSISVHESSLTDQYPLDHEIFKNPLAFHKGFEFIRQTFM 550
Query: 370 KTQNVKLDIDRFR 382
QNV+LD +RFR
Sbjct: 551 DKQNVRLDCNRFR 563
>gi|242052291|ref|XP_002455291.1| hypothetical protein SORBIDRAFT_03g007910 [Sorghum bicolor]
gi|241927266|gb|EES00411.1| hypothetical protein SORBIDRAFT_03g007910 [Sorghum bicolor]
Length = 631
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 131/389 (33%), Positives = 204/389 (52%), Gaps = 14/389 (3%)
Query: 16 CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDSTPSQPSLRIKTQPYPRKSDKSA 71
CD S+R D C ++G ++ V G D T S RIK PYPRK+D +A
Sbjct: 244 CDLTSNRRIDWCELDGDVRVLGANASVTLVAPPGADDRTFRAESWRIK--PYPRKADPNA 301
Query: 72 M-SKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
M + S +C + PALVFS GY GN+FH F D +PLF+T +
Sbjct: 302 MHVVRVVTVQSVSGGGAPACTDRYDVPALVFSDRGYTGNYFHAFTDVILPLFLTARQYA- 360
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
+V L + D W K++ + + + +++++ CF+ V +GL SH I+P
Sbjct: 361 -GEVRLLVTDLQAWWVGKFSPVFKAISNYELVDLDKDPRVQCFRHVQVGLTSHDDFSIDP 419
Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
P ++VDF +F+ Y +++ +P+L+L+ R AR R +N E+ + A
Sbjct: 420 RRAPNGYSMVDFTAFMRATYGLPRGVAAADATKRPRLLLIAR-ART-RRFVNTEEIVRGA 477
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
E LGF+V + E + +A +SC A++GVHGAGLT+ +F+ G V++QVVP+G
Sbjct: 478 ERLGFEVVV--SEGTHEVAPFAELANSCDAIMGVHGAGLTNMVFVPTGGVVIQVVPLGGL 535
Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
YF P+R +GL YLEY+I EES+L+++Y + + +P W +++ YL
Sbjct: 536 EFVAGYFRGPSRDMGLRYLEYRITPEESTLIDQYPRDHPIFTDPNGIKSKGWESLKDAYL 595
Query: 370 KTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
Q+V+LD+ RFR LK A +K K
Sbjct: 596 DKQDVRLDMKRFRPTLKKAIAHLRKAKAK 624
>gi|342186837|emb|CCC14964.1| putative glycosyl transferase [Triticum aestivum]
Length = 578
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 133/376 (35%), Positives = 199/376 (52%), Gaps = 16/376 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S+ ++C + G+ P S SV + + S+ + K +PYPRK D+ +SKV
Sbjct: 204 CDFSNFRANVCEMRGNIRIHPNAS---SVMYMEPASSKRNELWKLKPYPRKGDELCLSKV 260
Query: 76 KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
ELT+ +S AP C H P +VF+ GY GN FH+F D VPLF T S F + +
Sbjct: 261 TELTVKSSKVAP---ECTKYHNVPVVVFALTGYTGNLFHDFTDALVPLFTTA-SEF-NGE 315
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
V I D W RKY + + + +P+I+ N T HC + +GL ++ I+P+
Sbjct: 316 VQFLITDMAIWWTRKYHVVFKKLSNYPLIDFNKDTDVHCVKHAIVGLHAYMEFTIDPSKA 375
Query: 194 PKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
P T+VDF F+ Y E + KP+L++++R R LNL E+ A
Sbjct: 376 PHNYTMVDFNRFMRRTYELPREAVSALGEIPKAKPRLLIISRQR--TRMFLNLPEIIAMA 433
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI-GT 309
E LGF+V + E S+ L+ + ++S M+GVHGAGLT+ +FL + L+Q+VP G
Sbjct: 434 EGLGFEVVVEEANVSSDLSQFSKVVNSVDVMMGVHGAGLTNCVFLPHNATLIQIVPWGGI 493
Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYL 369
+ + + F PA +GL Y +Y I ESSL ++Y + + KNP AF ++
Sbjct: 494 EGVCRIDFGDPAEQMGLRYKQYSIAVHESSLTDQYPLDHEIFKNPLAFHKGFEFIKETFM 553
Query: 370 KTQNVKLDIDRFRIYL 385
QNV+LD +RFR L
Sbjct: 554 DKQNVRLDCNRFRPVL 569
>gi|326524055|dbj|BAJ97038.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326528547|dbj|BAJ93455.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326533864|dbj|BAJ93705.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 582
Score = 220 bits (561), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 131/373 (35%), Positives = 199/373 (53%), Gaps = 16/373 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S+ ++C + G+ P S SV + + S+ + K +PYPRK D+ +SKV
Sbjct: 208 CDFSNFRANVCEMRGNIRIHPNAS---SVMYMEPASSKRNELWKLKPYPRKGDELCLSKV 264
Query: 76 KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
ELT+ +S AP C H P +VF+ GY GN FH+F D VPLF T S F + +
Sbjct: 265 TELTVKSSKVAP---ECTKYHNVPVVVFALTGYTGNLFHDFTDALVPLFTTA-SEF-NGE 319
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
V I D W RKY + + +++P+I+ N T HC + +GL ++ I+P+
Sbjct: 320 VQFLITDMAIWWTRKYHVVFKKLSKYPLIDFNKDTDVHCVKHAIVGLHAYMEFTIDPSKA 379
Query: 194 PKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
P T+VDF F+ Y E + KP+L++++R R LNL E+ A
Sbjct: 380 PHNYTMVDFNRFMRRTYELPREAVSALGEIPKAKPRLLIISRQR--TRMFLNLPEIIAMA 437
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI-GT 309
E LGF+V + E S+ L+ + ++S M+GVHGAGLT+ +FL + L+Q+VP G
Sbjct: 438 EGLGFEVVVEEANVSSDLSQFSKVVNSVDVMMGVHGAGLTNCVFLPHNATLIQIVPWGGI 497
Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYL 369
+ + + F PA +GL Y +Y I ESSL ++Y + + KNP AF ++
Sbjct: 498 EGVCRIDFGDPAEQMGLRYKQYSIAVHESSLTDQYPLDHEIFKNPLAFHKGFEFIKETFM 557
Query: 370 KTQNVKLDIDRFR 382
QNV+LD +RF+
Sbjct: 558 DKQNVRLDCNRFK 570
>gi|115445881|ref|NP_001046720.1| Os02g0330200 [Oryza sativa Japonica Group]
gi|46390159|dbj|BAD15592.1| putative HGA6 [Oryza sativa Japonica Group]
gi|113536251|dbj|BAF08634.1| Os02g0330200 [Oryza sativa Japonica Group]
gi|218190637|gb|EEC73064.1| hypothetical protein OsI_07022 [Oryza sativa Indica Group]
gi|222622748|gb|EEE56880.1| hypothetical protein OsJ_06523 [Oryza sativa Japonica Group]
Length = 583
Score = 220 bits (561), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 128/377 (33%), Positives = 202/377 (53%), Gaps = 17/377 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S+ ++C + G+ P S SV + + S+ K +PYPRK D+ + +
Sbjct: 208 CDFSNFRANVCEMRGNIRIHPNAS---SVMYMEPASSKREEIWKVKPYPRKGDELCLGHI 264
Query: 76 KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
E+T+ +S AP C H PA+VF+ GY GN FH+F D VPLF T S F + +
Sbjct: 265 TEITVKSSKVAP---ECSKYHNVPAVVFALTGYTGNLFHDFTDVLVPLFTTA-SEF-NGE 319
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
V I D W RKY + + +++P+I+ NN HCF+ +GL ++ I+ +
Sbjct: 320 VQFLITDMAIWWTRKYKVVFDKLSKYPLIDFNNDDQVHCFKHAIVGLHAYMEFTIDSSKA 379
Query: 194 PKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
P ++VDF F+ Y+ + KP+L++++R R LNL E+ A
Sbjct: 380 PHNYSMVDFNRFMRRTYSLPRDFVTALGEIPKAKPRLLIISRQRT--RMFLNLNEIVAMA 437
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT- 309
EE+G++V + E S+ L+ + ++S M+GVHGAGLT+ +FL + L+Q+VP G
Sbjct: 438 EEIGYEVVVEEANVSSDLSHFGKVVNSVDVMMGVHGAGLTNCVFLPQNATLIQIVPWGGL 497
Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VY 368
W+S + F PA +GL Y +Y I ESSL ++Y + + NP +F + +R +
Sbjct: 498 DWISRIDFGNPAEQMGLRYKQYSIGVHESSLTDQYPLDHEIFTNPLSFHKHGFEFIRQTF 557
Query: 369 LKTQNVKLDIDRFRIYL 385
+ QNVKLD +RF+ L
Sbjct: 558 MDKQNVKLDCNRFKPVL 574
>gi|300681544|emb|CBH32641.1| glycosyltransferase, HGA-like, putative,expressed [Triticum
aestivum]
Length = 638
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 127/388 (32%), Positives = 203/388 (52%), Gaps = 20/388 (5%)
Query: 16 CD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
CD S+R D C ++G +T VG + RI+ PYPRK D +AM
Sbjct: 261 CDMTSNRRIDWCELDGDVRVHGAKATVTMVGAARAE----EWRIR--PYPRKVDPNAMRH 314
Query: 75 VKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
V +T+ T C + H+ PAL+FS GY GN+FH + D +PLF+T +
Sbjct: 315 VTNITVRSTMTLPGAGEGECAIKHSVPALLFSDRGYTGNYFHAYTDVILPLFLTAKQY-- 372
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
+V L ++D W K+ + + + ++++ T CF+ V +GL H I+P
Sbjct: 373 GGEVQLLVSDMQMWWIGKFLPVFKSLSNYDLVDLAADNRTRCFRHVQVGLTCHADFSIDP 432
Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
P ++VDF + Y + +P+L+L+ R + R +N ++ +AA
Sbjct: 433 LRAPNGYSMVDFTKHMRGVYGLPRGLAVPAAGARPRLLLIARAST--RRFVNADDIVRAA 490
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
+++GF+V + E + +A ++C M+GVHGAGLT+ +FL G V++QVVP+G
Sbjct: 491 QKVGFEVVVSE--GTHEVAPFAELANTCDVMLGVHGAGLTNMVFLPTGGVVIQVVPLGGL 548
Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
YF P+R +GL+YLEY+I ES+L E+Y A+ + +P W +++ VYL
Sbjct: 549 EFVAGYFRAPSRDMGLKYLEYRIAPAESTLTEQYPADHPIFTDPDGVKSKGWESLKQVYL 608
Query: 370 KTQNVKLDIDRFRIYLKD--AYKKAKKL 395
Q+V+LD+ RFR LK A+ +A KL
Sbjct: 609 DKQDVRLDLKRFRPLLKKAIAHIRANKL 636
>gi|326500646|dbj|BAJ94989.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 503
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 121/340 (35%), Positives = 192/340 (56%), Gaps = 11/340 (3%)
Query: 59 KTQPYPRKSDKSAMSKVKE--LTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMD 116
+T+PY R D AM+ V+ L + P +C H+ P VFS G++GN +H++ D
Sbjct: 163 RTKPYARYHDPVAMTHVRGFVLKPFPADAPPPACTKNHSVPGFVFSNRGFSGNLYHDYTD 222
Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
VPLF++ +V ++D W K+ L + + + ++++NN HC +
Sbjct: 223 VLVPLFLSTRKF--KGEVQFLLSDLKPWWVAKFRPLFRQLSNYEVVDVNNDLEVHCVPRI 280
Query: 177 TLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRN 233
+G H M I P+ ++VDF+ L +A+ +S T KP+L++++R
Sbjct: 281 VVGSDFHKDMGIIPSKAAGHVSIVDFKRTLRDAFGLERAAASRGGATGAGKPRLLIISR- 339
Query: 234 ARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSL 293
+ R LN RE+ AA +GFDV I EP++ T ++ R ++S MVGVHGAGLT+ +
Sbjct: 340 -KNSRRFLNEREMAAAATAMGFDVRIAEPDQHTDMSTFARLVNSADVMVGVHGAGLTNMV 398
Query: 294 FLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLK 352
FL G+VL+QVVP G +WL+ V F+ PA + + Y++Y ++ EESSL+E+Y N VL
Sbjct: 399 FLPAGAVLIQVVPFGGLEWLTGVTFKNPAADMEVTYMDYNVQLEESSLLEQYPRNHQVLT 458
Query: 353 NPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKK 391
+P A W ++ YL QNV+LD+D+FR L+DA +
Sbjct: 459 DPYAVHKQGWDALKAAYLDKQNVRLDLDKFRATLRDALSR 498
>gi|77554062|gb|ABA96858.1| HGA6, putative, expressed [Oryza sativa Japonica Group]
gi|218186635|gb|EEC69062.1| hypothetical protein OsI_37919 [Oryza sativa Indica Group]
Length = 584
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 132/395 (33%), Positives = 206/395 (52%), Gaps = 31/395 (7%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S D+C + G D S F V P+ +++ PYPRK D ++M +V
Sbjct: 187 CDLSDHRTDVCDLAGDIRMDANASAFVVVVDPAVGADGPTYKVR--PYPRKGDATSMGRV 244
Query: 76 KELTITTSAP--PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
E+T+ T+A P C TH +PA+VFS GY GN FH+F D VPL+ T + D
Sbjct: 245 TEITVRTTAAGAPPPRCTTTHAAPAVVFSISGYTGNLFHDFTDVIVPLYNTAARYC--GD 302
Query: 134 VILAIADCN---DQWARKYAELLPRFTRHPIINI-----NNQTITHCFQSVTLGLISHGR 185
V L + D N +W +Y +L +RH +++ HCF +GL +HG
Sbjct: 303 VQLVVTDGNAATRRWLARYGAVLRGLSRHVPLDLAAEAAAGGGEVHCFGHTVVGLRAHGE 362
Query: 186 MVINPTLLPKPKTLVDFQSFLANAYN--ENTNTSSSFHHT-------KPKLVLVNRNARV 236
++I+ P + DF FL A + + T H +P+L++++R R
Sbjct: 363 LIIDRERSPDGLGMPDFTRFLRRALSLPRDAPTRPGGGHGDATKPQPRPRLLIISR--RG 420
Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSF----RFIHSCHAMVGVHGAGLTHS 292
R +LN V +AAE++GF+ E + + + D R ++S A+VGVHGAGLT+
Sbjct: 421 TRLLLNTDAVARAAEQVGFEAVASELDMAGADHDDVARVARLVNSFDAVVGVHGAGLTNM 480
Query: 293 LFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
+FL PG+ +Q+VP G +WL+ F +PA +GL Y++Y++ ES+L +KY + +
Sbjct: 481 VFLPPGAAAVQIVPWGGLRWLARADFGEPAVAMGLRYIQYEVAAGESTLKDKYPRDHEIF 540
Query: 352 KNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYL 385
NP A ++ MR +L Q++ +DIDRF+ L
Sbjct: 541 TNPTALHKKGFTFMRHTFLNGQDIIVDIDRFKPVL 575
>gi|219362665|ref|NP_001137085.1| uncharacterized protein LOC100217259 [Zea mays]
gi|194698284|gb|ACF83226.1| unknown [Zea mays]
gi|414876764|tpg|DAA53895.1| TPA: hypothetical protein ZEAMMB73_695029 [Zea mays]
Length = 547
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 138/418 (33%), Positives = 206/418 (49%), Gaps = 32/418 (7%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTF--FSVGHTDSTPSQPSLRIKTQP 62
P + P +I CDRS D+C + G D + F S P P+ + +P
Sbjct: 117 PCSSLPNHTICCDRSDYHSDVCFMAGDVRTDAASLALLLFPPRAASSAPEPPAAEERIRP 176
Query: 63 YPRKSDKSAMSKVKELTITTSAPPNLS-----CGVTHTSPALVFSAGGYNGNFFHEFMDC 117
Y RK D + E+T+ + P + C V H +P LV +AGGY+ N FH F D
Sbjct: 177 YTRKWDAYITKTIHEVTLRVARPEEAAAAAHRCDVRHDAPVLVVTAGGYSHNMFHVFNDG 236
Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
F+PL++T H + +LA+ + +WA Y E+L +R+ I++ THCF
Sbjct: 237 FLPLWLTAQ-HL-RRRAVLAVLSYSPRWAGTYGEILAGLSRYHAIDLLRDKRTHCFPGAV 294
Query: 178 LGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNEN--------TNTSSS-------FHH 222
+G H + +N T L KT+ DF FLA Y+++ +SSS +
Sbjct: 295 VGTRYHDYLAVNSTRLRDNKTIADFHDFLAGVYSDDDVRNDKAAGGSSSSRRPEMAWYER 354
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
+P+L +V+R R R + N V + A +GFDV I E L+ + + S A+V
Sbjct: 355 RRPRLGIVSRKGR--RVVENQAAVAQLAASVGFDVDIMETANGAPLSAVYASVSSYDALV 412
Query: 283 GVHGAGLTHSLFLRPG-SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLV 341
GVHGA LT LFLRPG + L Q+ P+G LS F PA +GL Y++Y + ESSL
Sbjct: 413 GVHGADLTTFLFLRPGRAALAQIAPLGITMLSRNLFGVPAARMGLHYVQYDVSARESSLS 472
Query: 342 EKYGANDLVLKNP----QAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
+Y + +V+ +P + W + VYL+ QNV LD+ RFR L + + K+
Sbjct: 473 RRYPLDHVVVADPARARREQGKQEWELVEHVYLRGQNVSLDLGRFRETLARIHSRLKE 530
>gi|223942755|gb|ACN25461.1| unknown [Zea mays]
Length = 529
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 128/382 (33%), Positives = 203/382 (53%), Gaps = 27/382 (7%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQ----PSLRIKTQPYPRKSD 68
+ CD S++ YD+C + G A ++ST V ++ + P+ K+ P+ +K
Sbjct: 160 ATVCDLSNQRYDICELCGDARAIGRSSTVMYVPQPLTSNGEEWNIPAQSRKSLPWIKKV- 218
Query: 69 KSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
VK L + AP SC HT PA+VF+ GG+ GN +H+ D VPLF+T
Sbjct: 219 -----TVKTLKASQQAP---SCTSRHTMPAIVFALGGFTGNVWHDVSDVLVPLFLTARQF 270
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
DQDV L + D + +KY+ +L R TRH +I+ + C+ V +GL SH + I
Sbjct: 271 --DQDVQLLVTDNRPWFIKKYSAILRRLTRHSVIDFDADDEVRCYPHVIVGLRSHRDLGI 328
Query: 189 NPTLLP-KPKTLVDFQSFLANAYN--------ENTNTSSSFHHTKPKLVLVNRNARVGRT 239
+P P K T+VDF+ F+ AY KP+++L+ R R
Sbjct: 329 DPDSTPQKNYTMVDFRLFVREAYGLPAPGVDIPYRADKDEPEKKKPRMMLIERGKT--RR 386
Query: 240 ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGS 299
++N +V GF+V +P + L + R + SC AM+GVHGAGLT+ +FLR G+
Sbjct: 387 LVNTPDVLLGLGWFGFEVVRADPRTHSDLEEFARLVDSCDAMMGVHGAGLTNMVFLRSGA 446
Query: 300 VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
L+ VVP G ++++ ++ PAR +GL ++ Y + +ES+L+EKYG N V+++P+A
Sbjct: 447 ALVHVVPYGVEFMADGFYGAPARDMGLRHVRYSVGPDESTLLEKYGRNHAVVRDPEAVRS 506
Query: 360 ANWSNM-RVYLKTQNVKLDIDR 380
W + VY+ Q+V L++ R
Sbjct: 507 GGWEKVGEVYMTEQDVVLNVTR 528
>gi|413947215|gb|AFW79864.1| hypothetical protein ZEAMMB73_561794 [Zea mays]
Length = 628
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 132/393 (33%), Positives = 210/393 (53%), Gaps = 18/393 (4%)
Query: 16 CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDSTPSQPSLRIKTQPYPRKSDKSA 71
CD S+R D C ++G ++ V G + T S RIK PYPRK+D +A
Sbjct: 237 CDVTSNRRIDWCELDGDVRVVGANASVTLVAPPGADNRTFRAESWRIK--PYPRKADPNA 294
Query: 72 MSKVKELTITT--SAPPNLSC--GVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
M V+ LT+ + S +C G PALVFS GY GN+FH F D +PLF+T
Sbjct: 295 MRFVRVLTVRSVASGSGEAACTDGGDDDVPALVFSDRGYTGNYFHAFTDVILPLFLTARR 354
Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
+ +V L +AD W K+ + +++ +++++ CF+ V +GL SH
Sbjct: 355 YA--GEVRLLVADLQPWWVGKFLPVFRSISKYELVDLDRDPRVRCFRHVQVGLTSHADFS 412
Query: 188 INPTLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
I+P P +++DF F+ AY + ++ +P+L++V R AR R +N E+
Sbjct: 413 IDPRRAPNGYSMLDFTRFMRAAYGLPRGDVVAAAPARRPRLLVVAR-ART-RRFVNTEEI 470
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+ AE +GF+ + E + +A + C A++GVHGAGLT+ +FL G V++QVVP
Sbjct: 471 VRGAEAVGFEAVV--SEGTHEVAPFAELANGCDAIMGVHGAGLTNMVFLPTGGVVIQVVP 528
Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
+G YF P+ +GL YLEY+I+ EES+LV++Y + + +P W +++
Sbjct: 529 LGGLEFVAGYFRGPSVDMGLRYLEYRIEPEESTLVDQYPRDHPIFTDPNGIKSKGWESLK 588
Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
YL Q+V+LD++RFR L++A +K K
Sbjct: 589 DAYLDKQDVRLDMERFRPTLQEAIAHLRKAKAK 621
>gi|212275101|ref|NP_001130987.1| hypothetical protein [Zea mays]
gi|194690638|gb|ACF79403.1| unknown [Zea mays]
gi|224030713|gb|ACN34432.1| unknown [Zea mays]
gi|413947219|gb|AFW79868.1| hypothetical protein ZEAMMB73_561794 [Zea mays]
Length = 625
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 132/393 (33%), Positives = 210/393 (53%), Gaps = 18/393 (4%)
Query: 16 CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDSTPSQPSLRIKTQPYPRKSDKSA 71
CD S+R D C ++G ++ V G + T S RIK PYPRK+D +A
Sbjct: 234 CDVTSNRRIDWCELDGDVRVVGANASVTLVAPPGADNRTFRAESWRIK--PYPRKADPNA 291
Query: 72 MSKVKELTITT--SAPPNLSC--GVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
M V+ LT+ + S +C G PALVFS GY GN+FH F D +PLF+T
Sbjct: 292 MRFVRVLTVRSVASGSGEAACTDGGDDDVPALVFSDRGYTGNYFHAFTDVILPLFLTARR 351
Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
+ +V L +AD W K+ + +++ +++++ CF+ V +GL SH
Sbjct: 352 YA--GEVRLLVADLQPWWVGKFLPVFRSISKYELVDLDRDPRVRCFRHVQVGLTSHADFS 409
Query: 188 INPTLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
I+P P +++DF F+ AY + ++ +P+L++V R AR R +N E+
Sbjct: 410 IDPRRAPNGYSMLDFTRFMRAAYGLPRGDVVAAAPARRPRLLVVAR-ART-RRFVNTEEI 467
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+ AE +GF+ + E + +A + C A++GVHGAGLT+ +FL G V++QVVP
Sbjct: 468 VRGAEAVGFEAVV--SEGTHEVAPFAELANGCDAIMGVHGAGLTNMVFLPTGGVVIQVVP 525
Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
+G YF P+ +GL YLEY+I+ EES+LV++Y + + +P W +++
Sbjct: 526 LGGLEFVAGYFRGPSVDMGLRYLEYRIEPEESTLVDQYPRDHPIFTDPNGIKSKGWESLK 585
Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
YL Q+V+LD++RFR L++A +K K
Sbjct: 586 DAYLDKQDVRLDMERFRPTLQEAIAHLRKAKAK 618
>gi|413947218|gb|AFW79867.1| hypothetical protein ZEAMMB73_561794 [Zea mays]
Length = 641
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 132/393 (33%), Positives = 210/393 (53%), Gaps = 18/393 (4%)
Query: 16 CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDSTPSQPSLRIKTQPYPRKSDKSA 71
CD S+R D C ++G ++ V G + T S RIK PYPRK+D +A
Sbjct: 250 CDVTSNRRIDWCELDGDVRVVGANASVTLVAPPGADNRTFRAESWRIK--PYPRKADPNA 307
Query: 72 MSKVKELTITT--SAPPNLSC--GVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
M V+ LT+ + S +C G PALVFS GY GN+FH F D +PLF+T
Sbjct: 308 MRFVRVLTVRSVASGSGEAACTDGGDDDVPALVFSDRGYTGNYFHAFTDVILPLFLTARR 367
Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
+ +V L +AD W K+ + +++ +++++ CF+ V +GL SH
Sbjct: 368 YA--GEVRLLVADLQPWWVGKFLPVFRSISKYELVDLDRDPRVRCFRHVQVGLTSHADFS 425
Query: 188 INPTLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
I+P P +++DF F+ AY + ++ +P+L++V R AR R +N E+
Sbjct: 426 IDPRRAPNGYSMLDFTRFMRAAYGLPRGDVVAAAPARRPRLLVVAR-ART-RRFVNTEEI 483
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+ AE +GF+ + E + +A + C A++GVHGAGLT+ +FL G V++QVVP
Sbjct: 484 VRGAEAVGFEAVV--SEGTHEVAPFAELANGCDAIMGVHGAGLTNMVFLPTGGVVIQVVP 541
Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
+G YF P+ +GL YLEY+I+ EES+LV++Y + + +P W +++
Sbjct: 542 LGGLEFVAGYFRGPSVDMGLRYLEYRIEPEESTLVDQYPRDHPIFTDPNGIKSKGWESLK 601
Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
YL Q+V+LD++RFR L++A +K K
Sbjct: 602 DAYLDKQDVRLDMERFRPTLQEAIAHLRKAKAK 634
>gi|242061682|ref|XP_002452130.1| hypothetical protein SORBIDRAFT_04g020210 [Sorghum bicolor]
gi|241931961|gb|EES05106.1| hypothetical protein SORBIDRAFT_04g020210 [Sorghum bicolor]
Length = 542
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 121/378 (32%), Positives = 208/378 (55%), Gaps = 25/378 (6%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQ----PSLRIKTQPYPRKSDKSA 71
CD S++ +D+C + G+A ++ST V + ++ + P+ K+ P+ +K
Sbjct: 166 CDLSNQRFDICELCGNARTIGRSSTVMYVPQSLTSNGEEWNIPAQSRKSLPWIKKV---- 221
Query: 72 MSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
VK L + P C H PA+VF+ GG+ N +H+ D VPLF+T D
Sbjct: 222 --TVKTLKASQQVP---RCTSRHAIPAIVFALGGFTANVWHDVSDVLVPLFLTAQQF--D 274
Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
+DV L I + + +KY+ + R TRH II+ + C+ V +GL SH + I+P
Sbjct: 275 RDVQLLITNNQPWFIKKYSAIFHRLTRHNIIDFDADDEVRCYPHVIVGLRSHRDLGIDPN 334
Query: 192 LLPKPKTLVDFQSFLANAYNENT-------NTSSSFHHTKPKLVLVNRNARVGRTILNLR 244
P+ T++DF+ F+ AY KP+++L++R R +N+
Sbjct: 335 STPQNYTMMDFRLFVREAYGLPAPEVDIPYRVDKDDPEKKPRIMLIDRGKT--RRFMNMP 392
Query: 245 EVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
+V + + GF+V +P ++L + R + SC AM+GVHGAGLT+ +FLR G+V++ +
Sbjct: 393 DVLRGLDWFGFEVVRADPRIDSTLDEFVRLVDSCDAMMGVHGAGLTNMVFLRSGAVVVHI 452
Query: 305 VPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
VP G ++++ ++ PAR +GL +++Y I +ES+L+EKYG N +V+K+P+A + W
Sbjct: 453 VPYGVEFMANGFYGAPARDMGLRHVQYSISPDESTLLEKYGENHMVIKDPEAIRNSGWEK 512
Query: 365 M-RVYLKTQNVKLDIDRF 381
+ +Y+ Q+V L++ RF
Sbjct: 513 VGELYMTKQDVVLNMTRF 530
>gi|413947216|gb|AFW79865.1| hypothetical protein ZEAMMB73_561794 [Zea mays]
Length = 532
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 132/393 (33%), Positives = 210/393 (53%), Gaps = 18/393 (4%)
Query: 16 CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDSTPSQPSLRIKTQPYPRKSDKSA 71
CD S+R D C ++G ++ V G + T S RIK PYPRK+D +A
Sbjct: 141 CDVTSNRRIDWCELDGDVRVVGANASVTLVAPPGADNRTFRAESWRIK--PYPRKADPNA 198
Query: 72 MSKVKELTITT--SAPPNLSC--GVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
M V+ LT+ + S +C G PALVFS GY GN+FH F D +PLF+T
Sbjct: 199 MRFVRVLTVRSVASGSGEAACTDGGDDDVPALVFSDRGYTGNYFHAFTDVILPLFLTARR 258
Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
+ +V L +AD W K+ + +++ +++++ CF+ V +GL SH
Sbjct: 259 YA--GEVRLLVADLQPWWVGKFLPVFRSISKYELVDLDRDPRVRCFRHVQVGLTSHADFS 316
Query: 188 INPTLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
I+P P +++DF F+ AY + ++ +P+L++V R AR R +N E+
Sbjct: 317 IDPRRAPNGYSMLDFTRFMRAAYGLPRGDVVAAAPARRPRLLVVAR-ART-RRFVNTEEI 374
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+ AE +GF+ + E + +A + C A++GVHGAGLT+ +FL G V++QVVP
Sbjct: 375 VRGAEAVGFEAVV--SEGTHEVAPFAELANGCDAIMGVHGAGLTNMVFLPTGGVVIQVVP 432
Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
+G YF P+ +GL YLEY+I+ EES+LV++Y + + +P W +++
Sbjct: 433 LGGLEFVAGYFRGPSVDMGLRYLEYRIEPEESTLVDQYPRDHPIFTDPNGIKSKGWESLK 492
Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
YL Q+V+LD++RFR L++A +K K
Sbjct: 493 DAYLDKQDVRLDMERFRPTLQEAIAHLRKAKAK 525
>gi|357127184|ref|XP_003565264.1| PREDICTED: uncharacterized protein LOC100843951 [Brachypodium
distachyon]
Length = 621
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 136/386 (35%), Positives = 210/386 (54%), Gaps = 23/386 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S + D+C G D TS+F V ++P +R PYPRK D++ M +V
Sbjct: 238 CDFSDQRSDVCDFTGDIRMDANTSSFVLVVGQGTSPQSHKVR----PYPRKGDETCMGRV 293
Query: 76 KEL----TITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
E+ T T S+PP L C TH +PA+ FS GGY GN FH+F D VPLF T++ +
Sbjct: 294 TEINVRTTTTASSPPPL-CTQTHDAPAVTFSIGGYTGNIFHDFSDVIVPLFNTVHKYA-- 350
Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIINI---NNQTITHCFQSVTLGLISHGRMVI 188
DV L +A+ W KY +LL +R+ +++ + HCF+ + L +H ++I
Sbjct: 351 GDVQLVMANVAPWWLIKYDKLLRSISRYAPLDLAAAGTRGEVHCFRHAVVSLRAHRELII 410
Query: 189 NPTLLPKPKTLVDFQSFLANAY----NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLR 244
DF FL +A + TN + KP+L++++R+ R +LNL
Sbjct: 411 EKDRSLDGLATPDFTRFLCSALSLPRDAPTNIADGSGR-KPRLLIISRHRT--RILLNLA 467
Query: 245 EVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
V AAEE GF+ + E + + +++ R I+S +VGVHGAGLT+ +FL PG+ ++QV
Sbjct: 468 AVVLAAEEAGFEAVVNESDVANDISEVGRLINSADVLVGVHGAGLTNMMFLPPGATMVQV 527
Query: 305 VPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS 363
VP G QW++ + + PA +GL Y++Y+I +ESSL EKY + + NP +
Sbjct: 528 VPWGGLQWIARMDYGDPAEAMGLRYVQYEITVDESSLKEKYPRDHEIFTNPTGLHKKGFG 587
Query: 364 NMR-VYLKTQNVKLDIDRFRIYLKDA 388
MR + QN+ +D+ RF+ L++A
Sbjct: 588 FMRQTLMDGQNITVDVARFKGVLQEA 613
>gi|31339690|gb|AAP49177.1| HGA6 [Hordeum vulgare subsp. vulgare]
Length = 637
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 130/378 (34%), Positives = 198/378 (52%), Gaps = 22/378 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S R D+C G D STFF V + S K +PYPRK D + M +V
Sbjct: 265 CDFSDRRSDVCDFTGDIRMDANASTFFLVVDAATAASH-----KVRPYPRKGDPTCMGRV 319
Query: 76 KELTITT---SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
E+T+ T S+PP C TH PA+ FS GGY GN FH+F D VPL+ T++ +
Sbjct: 320 PEITMRTTSSSSPPE--CTRTHGVPAVTFSIGGYTGNIFHDFSDVLVPLYNTVHRY--RG 375
Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIIN---INNQTITHCFQSVTLGLISHGRMVIN 189
DV L +A+ W KY +LL +RH ++ + + THCF+ + L +H ++I
Sbjct: 376 DVQLVMANVAPWWLVKYDKLLRELSRHAPLDLAAVAAKGETHCFRHAVVSLRAHRELIIE 435
Query: 190 PTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREV 246
P DF F+ A + + + KP+L+++ R+ R +LNL ++
Sbjct: 436 RDRSPDGLATPDFTRFIRRALSLPRDAPTRLADGTGRKPRLLIIARHRT--RILLNLGDM 493
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+ AEE GF+ + E + ++ I+S +VGVHGAGLT+ + L PG+ ++QVVP
Sbjct: 494 MRVAEEAGFEAAVSELDVGDPISRVGAEINSADVLVGVHGAGLTNMMSLAPGATMVQVVP 553
Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
G QW + + + PA LGL Y++Y+I +ESSL +KY + +P + + M
Sbjct: 554 WGGLQWFARMDYGDPAEALGLRYVQYEIGVDESSLKDKYPRGHKIFTDPTSLHKKGFGFM 613
Query: 366 -RVYLKTQNVKLDIDRFR 382
R + QN+ +D+ RFR
Sbjct: 614 RRTLMDGQNITIDLGRFR 631
>gi|56409846|emb|CAI30071.1| glycosyltransferase [Triticum aestivum]
Length = 468
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 132/376 (35%), Positives = 199/376 (52%), Gaps = 16/376 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S+ ++C + G+ P S SV + + S+ + K +PYPRK D+ +SKV
Sbjct: 94 CDFSNFRANVCEMRGNIRIHPNAS---SVMYMEPASSKRNELWKLKPYPRKGDELCLSKV 150
Query: 76 KELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
ELT+ +S AP C H P +VF+ GY GN FH+F D VPLF T S F + +
Sbjct: 151 TELTVKSSKVAP---ECTKYHNVPVVVFALTGYTGNLFHDFTDALVPLFTTA-SEF-NGE 205
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
V I D W RKY + + + +P+I+ N T HC + +GL ++ I+P+
Sbjct: 206 VQFLITDMAIWWTRKYHVVFKKLSNYPLIDFNKDTDVHCAKHAIVGLHAYMEFTIDPSKA 265
Query: 194 PKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
P T+VDF F+ Y E + KP+L++++R R LNL E+ A
Sbjct: 266 PHNYTMVDFNRFMRRTYELPREAVSALGEIPKAKPRLLIISRQRT--RMFLNLPEIIAMA 323
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI-GT 309
E LGF+V + E S+ L+ + ++S M+GVHGAGLT+ +FL + L+Q+VP G
Sbjct: 324 EGLGFEVVVEEANVSSDLSQFSKVVNSVDVMMGVHGAGLTNCVFLPHNATLIQIVPWGGI 383
Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYL 369
+ + + F PA +GL Y +Y I ESSL ++Y + + KNP AF ++
Sbjct: 384 EGVCRIDFGDPAEQMGLRYKQYSIAVHESSLTDQYPLDHEIFKNPLAFHKGFEFIKETFM 443
Query: 370 KTQNVKLDIDRFRIYL 385
QNV+LD +RF+ L
Sbjct: 444 DKQNVRLDCNRFKPVL 459
>gi|357147066|ref|XP_003574209.1| PREDICTED: uncharacterized protein LOC100835033 [Brachypodium
distachyon]
Length = 501
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 120/338 (35%), Positives = 200/338 (59%), Gaps = 13/338 (3%)
Query: 59 KTQPYPRKSDKSAMSKVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEF 114
KT+PY R D AM+ V+E T+ ++S+PP +C H+ P +FS GG++GN +H++
Sbjct: 154 KTKPYARYHDPVAMAHVREFTLKPFSSSSSPPPPACTKNHSVPGFLFSNGGFSGNLYHDY 213
Query: 115 MDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQ 174
D +PLF+T S +V ++ W K+ L + T + +++++N HCF
Sbjct: 214 TDVLIPLFLTTRSF--RGEVRFLLSGLKPWWVTKFTPLFRQLTNYDVLDVDNDGEIHCFP 271
Query: 175 SVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVN 231
+ +G H M ++P+ P ++VDF+ L A++ + + + + KP+L++++
Sbjct: 272 RIVVGSTFHKDMGVDPSKSPGGVSVVDFKRTLRAAFDLPRASASRAGARGDGKPRLLIIS 331
Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
R + R LN +E+ A +GF V I EP++ T +A R ++S MVGVHGAGLT+
Sbjct: 332 RKS--SRRFLNEKEMAAAGAAMGFQVRIAEPDQHTDMATFARLVNSADVMVGVHGAGLTN 389
Query: 292 SLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
+FL G+VL+QVVP G +WL+ V F++PA + + Y++Y ++ EESSL+++Y + V
Sbjct: 390 MVFLPAGAVLVQVVPFGGLEWLTRVTFKEPAADMEVRYMDYNVQLEESSLLDQYPRSHQV 449
Query: 351 LKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKD 387
L +P A W ++ YL QNV+LD+DRFR L+D
Sbjct: 450 LADPYAVHKQGWDALKTAYLDKQNVRLDLDRFRATLRD 487
>gi|326501348|dbj|BAJ98905.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 548
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 135/396 (34%), Positives = 203/396 (51%), Gaps = 22/396 (5%)
Query: 9 SPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSD 68
+P ++C+ S ++C++ G K T + V +D + + + +PYPRK +
Sbjct: 156 APRSKLSCNFSFYRMNICVMEGDVRMHGKAGTVYVVSASDDSYRPENGTVTIRPYPRKWE 215
Query: 69 KSAMSKVKELTITTSAPPNLS-----CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
K M +E+TI +S P C TH PA+VFS GGY+ NFFH D +PL+
Sbjct: 216 KPTMQMAREVTIRSSGPGATDMAPPPCTATHDVPAVVFSTGGYSSNFFHAVTDIVIPLYN 275
Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
T + D V L + D + +W KY +L + +P I+ + CF V +G+ SH
Sbjct: 276 TAREY--DGRVQLVVTDYSRKWIAKYRHVLAALSDYPAIDFDADDTVRCFPKVHVGIESH 333
Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTI 240
+ I P L K TL+DF+ FL +AY+ S+ + T +P+LV++ R R R
Sbjct: 334 KELGIIPVLSHKDYTLMDFRDFLRSAYSLKRAWSTPVNRTSGGRPRLVMLLR--RHSRAF 391
Query: 241 LNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSV 300
N E AA E+GF+V PE +A ++SC MVGVHGAGLT+ +FL
Sbjct: 392 TNEAEAVAAAAEVGFEVVAAGPEAVRDMAQFAEVVNSCDVMVGVHGAGLTNMVFLPHNGT 451
Query: 301 LMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP----- 354
MQ++P G +W F + +GL Y+EY+ EE++L + Y + V NP
Sbjct: 452 AMQIIPWGEMKWACWSIFGETVPDMGLRYVEYEATAEETTLKDVYPRDHPVFTNPISIHK 511
Query: 355 QAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
Q F G W +++L QNV LDI+RFR ++ YK
Sbjct: 512 QGF-GQLW---KIFLDGQNVTLDINRFRGVMQQIYK 543
>gi|222619895|gb|EEE56027.1| hypothetical protein OsJ_04811 [Oryza sativa Japonica Group]
Length = 471
Score = 218 bits (554), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 133/403 (33%), Positives = 199/403 (49%), Gaps = 39/403 (9%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQP-----SLRIK 59
P + +I CDRS + D+C + G DP + + P QP ++ +
Sbjct: 89 PCSSLAGDTICCDRSDFNTDVCFMAGDVRTDPSSLSLLLF------PKQPPAANATVEER 142
Query: 60 TQPYPRKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
+PY RK P C V H +P LV +AGGY GN FH F D FV
Sbjct: 143 IRPYTRKRS-----------------PGHRCDVRHDAPLLVMTAGGYTGNLFHAFSDGFV 185
Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLG 179
P ++T+ H + V+L + N WA Y E++ + + ++++ + THCF +G
Sbjct: 186 PAWLTVQ-HL-RRRVVLGVLSYNPWWAGTYGEIISGLSDYHVVDLLHDKRTHCFPGAIVG 243
Query: 180 LISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT-----KPKLVLVNRNA 234
HG + ++P L KT+VDF LA Y +T T +P+L +V+R
Sbjct: 244 TRFHGILSVDPARLRDNKTIVDFHDLLAGVYETAGDTVVVDDVTQPAPRRPRLGIVSR-- 301
Query: 235 RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
R R I N V + A +GFDV I E L S+ + +C +VGVHGA LT LF
Sbjct: 302 RGTRVIENQAAVARLARSVGFDVDILETANGLPLPASYASVSACDVLVGVHGADLTKLLF 361
Query: 295 LRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
LRPG+ L+Q+ P+G ++ + + + +GL Y +Y + ESSL KYG D+V+ +P
Sbjct: 362 LRPGAALVQIAPLGVAPIARGCYAEASARMGLHYEQYDAEGHESSLSRKYGLRDVVVSDP 421
Query: 355 QAFA-GANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
+A W + RVYL QNV LD+ RFR L + +A ++
Sbjct: 422 EAAKRDKGWGFVARVYLGGQNVTLDLSRFRHTLTRLHARALRV 464
>gi|218187403|gb|EEC69830.1| hypothetical protein OsI_00151 [Oryza sativa Indica Group]
Length = 562
Score = 218 bits (554), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 130/385 (33%), Positives = 201/385 (52%), Gaps = 22/385 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S D+C +G TS F V + +R PYPRK D + M +V
Sbjct: 181 CDFSDFRTDICDFSGDVRMAANTSEFVVVVDPAAAAVSHKVR----PYPRKGDATCMGRV 236
Query: 76 KELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
E+T+ + ++ C H PA+ FS GGY GN FH+F D VPL+ T+ +
Sbjct: 237 TEITVRATGDAAVAPRCTRAHDVPAVAFSIGGYTGNIFHDFSDVIVPLYNTVQRY--RGG 294
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTIT---HCFQSVTLGLISHGRMVINP 190
V L +A+ W KY +LL +RH I++ HCF S + L +H ++I
Sbjct: 295 VQLVMANVASWWLVKYDKLLRELSRHAPIDLAKAGAAGDVHCFPSAVVSLRAHRELIIER 354
Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRT--ILNLRE 245
DF FL A + + + +P+L++++R GRT +LNL
Sbjct: 355 ERSLDGLATPDFTRFLRRALSLPRDAPTRLGDGTGRRPRLLVISR----GRTRLLLNLDA 410
Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
V +AAEE+GF+V + E + S+ + I+SC A++GVHGAGLT+ +FL PG+ ++QVV
Sbjct: 411 VVRAAEEVGFEVVVNESDVGNSIEQVGKLINSCDALLGVHGAGLTNMMFLPPGATMVQVV 470
Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
P G QW++ + + +PA +GL Y++Y+I ESSL +KY A D + NP ++
Sbjct: 471 PWGGLQWIARMDYGEPAAAMGLNYIQYEIAVAESSLKDKYPAGDEIFTNPTGLHKRGFAF 530
Query: 365 MR-VYLKTQNVKLDIDRFRIYLKDA 388
M+ + Q++ +D+ RFR L+ A
Sbjct: 531 MKQTLMDGQDITIDVTRFRPVLQQA 555
>gi|297830470|ref|XP_002883117.1| hypothetical protein ARALYDRAFT_479312 [Arabidopsis lyrata subsp.
lyrata]
gi|297328957|gb|EFH59376.1| hypothetical protein ARALYDRAFT_479312 [Arabidopsis lyrata subsp.
lyrata]
Length = 464
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 120/334 (35%), Positives = 199/334 (59%), Gaps = 17/334 (5%)
Query: 62 PYPRKSDKSAMSKVKELTITTSAPPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
PY RK +AM +V+E T+ +LS C H+ PA++FS GG++ N FH+F D +P
Sbjct: 132 PYARKDQGAAMKRVREWTVKLIQNASLSRCVKNHSVPAILFSLGGFSLNNFHDFTDIVIP 191
Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
L+ T D +V + + N W K+ L+ +F+ + +I I+ + THCF SV +GL
Sbjct: 192 LYTTARRF--DGEVQFLVTNKNPWWISKFKGLVRKFSNYEVIYIDEEDETHCFGSVIVGL 249
Query: 181 ISH----GRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARV 236
H + I+ L ++ DF+ +L +AY+ S++ +P+L++++R+ +
Sbjct: 250 NRHRDYDKELTID--LSNSECSMSDFRKYLRDAYSLRNAAVSTWR--RPRLLILSRS--I 303
Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R +N E+ +AA ++GF+V + E +T +A + ++SC M+GVHGAGLT+ +FL
Sbjct: 304 SRAFVNADEIARAARQMGFNVVVVEA--NTGIASFAQTVNSCDVMLGVHGAGLTNMVFLP 361
Query: 297 PGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
+V++Q++PIG +WL+ FE P++ + L YLEYKI EES+L ++YG + V+++P
Sbjct: 362 ENAVVIQILPIGGFEWLANTDFEDPSKGMNLRYLEYKIAAEESTLAKQYGRDHEVVRDPS 421
Query: 356 AFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
A A W + VYL QNV +DI+RF+ L A
Sbjct: 422 AVAKRGWGTFKSVYLVQQNVTVDINRFKPVLVKA 455
>gi|56409844|emb|CAI30070.1| glycosyltransferase [Triticum aestivum]
Length = 580
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 129/378 (34%), Positives = 200/378 (52%), Gaps = 13/378 (3%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S+ ++C + G PK + SV + SQ K +PYPRK D+ +S +
Sbjct: 205 CDFSNFRANVCEMRGDVRVHPKAT---SVLFMEPEGSQRDEVWKIKPYPRKGDEFCLSHI 261
Query: 76 KELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVI 135
ELT+ +S C H P ++FS GY GN FH+F D VPLF T + F D +V
Sbjct: 262 TELTVKSSKVA-AECTRYHDVPVVIFSLTGYTGNLFHDFTDVIVPLFTTA-AQF-DGEVQ 318
Query: 136 LAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPK 195
I D W KY LL + +++P+I+ + CF+ +G ++ I+ P
Sbjct: 319 FLITDMALWWTVKYHTLLQKLSKYPLIDFSKDDQVRCFKHAIVGTHAYMEFTIDAAKSPN 378
Query: 196 PKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAEE 252
T+VDF F+ AY+ +++ + KP+L+++ R+ R LNL E+ AEE
Sbjct: 379 GVTMVDFNRFMRAAYSLPKAAAAALGESPRVKPRLLIIKRHRT--RMFLNLEEIIGMAEE 436
Query: 253 LGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QW 311
LGF+V I E S+ + + ++S M+GVHGAGLT+ +FL + L+Q+VP G W
Sbjct: 437 LGFEVVIDEANVSSDINGFAKLVNSVDVMMGVHGAGLTNCVFLPQNATLIQIVPFGGLDW 496
Query: 312 LSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLK 370
+S F P+ ++GL Y +Y I +ESSL + Y + + K+P +F + + R ++
Sbjct: 497 ISRTDFGNPSEMMGLRYKQYAITVDESSLTDHYPRDHKIFKDPISFHKRGFEFIRRTFMD 556
Query: 371 TQNVKLDIDRFRIYLKDA 388
QNV+LD RFR L +A
Sbjct: 557 KQNVRLDCKRFRPVLLEA 574
>gi|53791309|dbj|BAD52574.1| putative HGA6 [Oryza sativa Japonica Group]
gi|53791443|dbj|BAD52495.1| putative HGA6 [Oryza sativa Japonica Group]
Length = 700
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 134/397 (33%), Positives = 205/397 (51%), Gaps = 26/397 (6%)
Query: 16 CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDS-TPSQPSLRIKTQPYPRKSDKS 70
CD S+R D C ++G T V G D T S IK PYPRK+D +
Sbjct: 302 CDLWSNRRIDWCELDGDVRVAGANGTVSLVAPPGPADERTFRAESWHIK--PYPRKADPN 359
Query: 71 AMSKVKELTITT-----SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
AM V+ LT+ + ++ +C H P LVFS GY GN+FH + D +PLF+T
Sbjct: 360 AMRHVRVLTVQSLPAPAASAAAPACTERHDVPGLVFSDRGYTGNYFHAYTDVILPLFLTA 419
Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR 185
+ +V L ++D W K+ + + + +IN+++ HCF+ V +GL H
Sbjct: 420 RQY--SGEVKLLVSDFQMWWLGKFLPVFKAVSNYDLINLDDDRRVHCFRHVQVGLTCHAD 477
Query: 186 MVINPTLLPKPKTLVDFQSFLANAYN-------ENTNTSSSFHHTKPKLVLVNRNARVGR 238
I+P+ P ++VDF F+ Y + +P+L+++ R AR R
Sbjct: 478 FSIDPSRAPNGYSMVDFTRFMRATYRLPRDAPFPASGEQQPRRPWRPRLLVIAR-ART-R 535
Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
+N E+ + AE GF+V + E E +A ++C AMVGVHGAGLT+ +FL G
Sbjct: 536 RFVNADEIVRGAERAGFEVVVSEGEHE--VAPFAELANTCDAMVGVHGAGLTNMVFLPTG 593
Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFA 358
V++QVVP+G YF P+R +GL YLEY+I EES+L+++Y + + +P
Sbjct: 594 GVVIQVVPLGGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLIDQYPRDHPIFTDPDGVK 653
Query: 359 GANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
W++++ YL Q+V+LD+ RFR LK A +K
Sbjct: 654 SKGWNSLKEAYLDKQDVRLDMKRFRPILKKAIAHLRK 690
>gi|414876595|tpg|DAA53726.1| TPA: hypothetical protein ZEAMMB73_648118 [Zea mays]
Length = 624
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 128/380 (33%), Positives = 198/380 (52%), Gaps = 15/380 (3%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S D+C + G D S F V ++ S K +PYPRK D++ M ++
Sbjct: 241 CDFSDFRSDICDLAGDVRMDANVSAFVVVVDPATSGSDQEEH-KVRPYPRKGDETCMGRI 299
Query: 76 KELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
E+T+ T A C TH +PA+VFS GGY GN FH+F D VPL+ T + D
Sbjct: 300 TEVTVRATGDAAGAPRCTETHGAPAVVFSIGGYTGNIFHDFSDVLVPLYNTARRYR--GD 357
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTIT---HCFQSVTLGLISHGRMVINP 190
V L +A+ W KY LL +RH +++ HCF + L +H ++I
Sbjct: 358 VQLVMANAAPWWLVKYDRLLRALSRHAPLDLARAGAAREVHCFPRAVVSLRAHKELIIER 417
Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVK 247
DF FL A + + + KP+L++++R+ R +LNL V
Sbjct: 418 DRSLDGLATPDFTRFLRRALSLPRDAPTRLGDGTGRKPRLLVISRHRT--RLLLNLDAVV 475
Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
+AAEE+GF+ + E + + ++ I+SC AMVGVHGAGLT+ +FL PG+ L+Q+VP
Sbjct: 476 RAAEEVGFEAVVNESDVANDISQVGGLINSCDAMVGVHGAGLTNMMFLPPGAALVQIVPW 535
Query: 308 GT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
G QW++ + PA +GL+Y++Y+I ES+L +K+ + + NP A + +R
Sbjct: 536 GGLQWMARADYGDPAEAMGLKYIQYEIGVAESTLKDKFPSGHKIFTNPTALHKKGFMFIR 595
Query: 367 -VYLKTQNVKLDIDRFRIYL 385
+ Q++ +D+ RFR L
Sbjct: 596 QTLMDGQDITVDVGRFREVL 615
>gi|218187405|gb|EEC69832.1| hypothetical protein OsI_00156 [Oryza sativa Indica Group]
Length = 671
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 134/397 (33%), Positives = 205/397 (51%), Gaps = 26/397 (6%)
Query: 16 CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDS-TPSQPSLRIKTQPYPRKSDKS 70
CD S+R D C ++G T V G D T S IK PYPRK+D +
Sbjct: 273 CDLWSNRRIDWCELDGDVRVAGANGTVSLVAPPGPADERTFRAESWHIK--PYPRKADPN 330
Query: 71 AMSKVKELTITT-----SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
AM V+ LT+ + ++ +C H P LVFS GY GN+FH + D +PLF+T
Sbjct: 331 AMRHVRVLTVQSLPAPAASAAAPACTERHDVPGLVFSDRGYTGNYFHAYTDVILPLFLTA 390
Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR 185
+ +V L ++D W K+ + + + +IN+++ HCF+ V +GL H
Sbjct: 391 RQY--SGEVKLLVSDFQMWWLGKFLPVFKAVSNYDLINLDDDRRVHCFRHVQVGLTCHAD 448
Query: 186 MVINPTLLPKPKTLVDFQSFLANAYN-------ENTNTSSSFHHTKPKLVLVNRNARVGR 238
I+P+ P ++VDF F+ Y + +P+L+++ R AR R
Sbjct: 449 FSIDPSRAPNGYSMVDFTRFMRATYRLPRDAPFPASGEQQPRRPWRPRLLVIAR-ART-R 506
Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
+N E+ + AE GF+V + E E +A ++C AMVGVHGAGLT+ +FL G
Sbjct: 507 RFVNADEIVRGAERAGFEVVVSEGEHE--VAPFAELANTCDAMVGVHGAGLTNMVFLPTG 564
Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFA 358
V++QVVP+G YF P+R +GL YLEY+I EES+L+++Y + + +P
Sbjct: 565 GVVIQVVPLGGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLIDQYPRDHPIFTDPDGVK 624
Query: 359 GANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
W++++ YL Q+V+LD+ RFR LK A +K
Sbjct: 625 SKGWNSLKEAYLDKQDVRLDMKRFRPILKKAIAHLRK 661
>gi|297595988|ref|NP_001041864.2| Os01g0119100 [Oryza sativa Japonica Group]
gi|255672812|dbj|BAF03778.2| Os01g0119100, partial [Oryza sativa Japonica Group]
Length = 570
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 134/397 (33%), Positives = 205/397 (51%), Gaps = 26/397 (6%)
Query: 16 CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDS-TPSQPSLRIKTQPYPRKSDKS 70
CD S+R D C ++G T V G D T S IK PYPRK+D +
Sbjct: 172 CDLWSNRRIDWCELDGDVRVAGANGTVSLVAPPGPADERTFRAESWHIK--PYPRKADPN 229
Query: 71 AMSKVKELTITT-----SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
AM V+ LT+ + ++ +C H P LVFS GY GN+FH + D +PLF+T
Sbjct: 230 AMRHVRVLTVQSLPAPAASAAAPACTERHDVPGLVFSDRGYTGNYFHAYTDVILPLFLTA 289
Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR 185
+ +V L ++D W K+ + + + +IN+++ HCF+ V +GL H
Sbjct: 290 RQY--SGEVKLLVSDFQMWWLGKFLPVFKAVSNYDLINLDDDRRVHCFRHVQVGLTCHAD 347
Query: 186 MVINPTLLPKPKTLVDFQSFLANAYN-------ENTNTSSSFHHTKPKLVLVNRNARVGR 238
I+P+ P ++VDF F+ Y + +P+L+++ R AR R
Sbjct: 348 FSIDPSRAPNGYSMVDFTRFMRATYRLPRDAPFPASGEQQPRRPWRPRLLVIAR-ART-R 405
Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
+N E+ + AE GF+V + E E +A ++C AMVGVHGAGLT+ +FL G
Sbjct: 406 RFVNADEIVRGAERAGFEVVVSEGEHE--VAPFAELANTCDAMVGVHGAGLTNMVFLPTG 463
Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFA 358
V++QVVP+G YF P+R +GL YLEY+I EES+L+++Y + + +P
Sbjct: 464 GVVIQVVPLGGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLIDQYPRDHPIFTDPDGVK 523
Query: 359 GANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
W++++ YL Q+V+LD+ RFR LK A +K
Sbjct: 524 SKGWNSLKEAYLDKQDVRLDMKRFRPILKKAIAHLRK 560
>gi|326495222|dbj|BAJ85707.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 369
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 128/365 (35%), Positives = 195/365 (53%), Gaps = 16/365 (4%)
Query: 24 DLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITTS 83
++C + G+ P S SV + + S+ + K +PYPRK D+ +SKV ELT+ +S
Sbjct: 3 NVCEMRGNIRIHPNAS---SVMYMEPASSKRNELWKLKPYPRKGDELCLSKVTELTVKSS 59
Query: 84 --APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADC 141
AP C H P +VF+ GY GN FH+F D VPLF T S F + +V I D
Sbjct: 60 KVAP---ECTKYHNVPVVVFALTGYTGNLFHDFTDALVPLFTTA-SEF-NGEVQFLITDM 114
Query: 142 NDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVD 201
W RKY + + +++P+I+ N T HC + +GL ++ I+P+ P T+VD
Sbjct: 115 AIWWTRKYHVVFKKLSKYPLIDFNKDTDVHCVKHAIVGLHAYMEFTIDPSKAPHNYTMVD 174
Query: 202 FQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVT 258
F F+ Y E + KP+L++++R R LNL E+ AE LGF+V
Sbjct: 175 FNRFMRRTYELPREAVSALGEIPKAKPRLLIISRQRT--RMFLNLPEIIAMAEGLGFEVV 232
Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI-GTQWLSTVYF 317
+ E S+ L+ + ++S M+GVHGAGLT+ +FL + L+Q+VP G + + + F
Sbjct: 233 VEEANVSSDLSQFSKVVNSVDVMMGVHGAGLTNCVFLPHNATLIQIVPWGGIEGVCRIDF 292
Query: 318 EKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLD 377
PA +GL Y +Y I ESSL ++Y + + KNP AF ++ QNV+LD
Sbjct: 293 GDPAEQMGLRYKQYSIAVHESSLTDQYPLDHEIFKNPLAFHKGFEFIKETFMDKQNVRLD 352
Query: 378 IDRFR 382
+RF+
Sbjct: 353 CNRFK 357
>gi|413947274|gb|AFW79923.1| hypothetical protein ZEAMMB73_646561 [Zea mays]
Length = 592
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 133/390 (34%), Positives = 207/390 (53%), Gaps = 21/390 (5%)
Query: 16 CD-RSHRSYDLCLINGSALF----DPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
CD S+R D C + G + + + G + + + S IK PYPRK+D +
Sbjct: 203 CDLTSNRRIDWCELEGDVRVVGGSNGSVTLVAAPGADERSFHEESWSIK--PYPRKADPN 260
Query: 71 AMSKVKELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
AM V+ L + + +APP ++C H PALVFSA GY GN+FH + D +PLF+T +
Sbjct: 261 AMRSVRALAVRSVATAPP-VACTDWHGVPALVFSARGYTGNYFHAYTDVILPLFLTARQY 319
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTI--THCFQSVTLGLISHGRM 186
+V+L + W KY + + + ++++ CF+ V +GL SH
Sbjct: 320 A--GEVLLLVTGFQAWWVGKYLPVFRSLSNYEPVDLDRDRDPRVRCFRRVQVGLTSHDDF 377
Query: 187 VINPTLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE 245
I+P P +++DF F+ Y +LV AR R LN +
Sbjct: 378 SIDPRRAPNGYSMLDFTRFMRATYGLPRDAVPRRGRGRPRPRLLVIARART-RRFLNAED 436
Query: 246 VKKAAEELGFDVTIFE-PEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
+ + A ++GF+V + E +E + A+ +++C A+VGVHGAGLT+ +FL PG V++QV
Sbjct: 437 IVRGARKVGFEVVVSEVAQEVAAFAE---LVNTCDAVVGVHGAGLTNMVFLPPGGVVIQV 493
Query: 305 VPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW-S 363
+P+G YF PAR +GL YLEY+I EES+L+++Y + VL +P + +W S
Sbjct: 494 LPLGPLEFVASYFRGPARDMGLSYLEYRISPEESTLLDQYPRDHPVLTDPMSVQAKDWVS 553
Query: 364 NMRVYLKTQNVKLDIDRFRIYLKDAYKKAK 393
M VYL Q+V+LD+ RFR LK A + +
Sbjct: 554 FMGVYLFKQDVRLDMKRFRPVLKKALARLR 583
>gi|297830468|ref|XP_002883116.1| hypothetical protein ARALYDRAFT_341951 [Arabidopsis lyrata subsp.
lyrata]
gi|297328956|gb|EFH59375.1| hypothetical protein ARALYDRAFT_341951 [Arabidopsis lyrata subsp.
lyrata]
Length = 538
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 133/380 (35%), Positives = 212/380 (55%), Gaps = 23/380 (6%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFS-VGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
C + R+ + C ++G K++T + + S S +R PY RK D AM++
Sbjct: 159 CTKLART-EFCELSGDVRIHGKSATVLAAITFAFSGNSTWHMR----PYARKGDLVAMNR 213
Query: 75 VKELTITTS------APPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
V++ T+ N S C H+ PA++FS GGY N FH+F D +PL+ T
Sbjct: 214 VRKWTVKLEQNADQLENANFSRCVRNHSVPAMIFSLGGYTMNNFHDFTDVVIPLYTTARR 273
Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
+ +V + + N W K+ EL+ + + + +I I+ + THCF SVT+GLI H
Sbjct: 274 F--NGEVQFLVTNRNPWWINKFKELVKKLSNYEVIYIDEEDETHCFSSVTVGLIRHREYF 331
Query: 188 INPTLLPKPK--TLVDFQSFLANAYN-ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLR 244
T+ P ++ DF+SFL + Y+ N +S + +L+ +R R N
Sbjct: 332 TELTIDPSNSEYSMSDFRSFLRDTYSLRNAAVITSQIRRRRPRILILSRSR-SRAFENTG 390
Query: 245 EVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
E+ +AA ++GF+V + E +T +A+ + ++SC M+GVHGAGLT+ +FL +V++Q+
Sbjct: 391 EIARAARQIGFEVVV--AEANTGVANFAQTVNSCDVMLGVHGAGLTNMVFLPDNAVVIQI 448
Query: 305 VPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS 363
+PIG +WL+ FE+P+ + L YLEYKI EESSLV+KYG + V+++P A A W
Sbjct: 449 LPIGGFEWLAKTDFERPSEGMNLRYLEYKIAAEESSLVKKYGRDHEVVRDPSAVAKHGWD 508
Query: 364 NMR-VYLKTQNVKLDIDRFR 382
+ VYL QNV +DI+RF+
Sbjct: 509 MFKSVYLVHQNVSIDINRFK 528
>gi|30684813|ref|NP_188445.2| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
gi|27808590|gb|AAO24575.1| At3g18170 [Arabidopsis thaliana]
gi|110736165|dbj|BAF00054.1| hypothetical protein [Arabidopsis thaliana]
gi|332642538|gb|AEE76059.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
Length = 384
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/384 (33%), Positives = 209/384 (54%), Gaps = 19/384 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
C + R+ + C +NG K++T S T + + I+ PY RK D AM +V
Sbjct: 5 CTKLART-EFCELNGDVRVHGKSATV-SAAITFAFSGNSTWHIR--PYARKGDTVAMKRV 60
Query: 76 KELTITTS------APPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
+E T+ N S C H+ PA++FS GGY+ N FH+F D +PL+ T
Sbjct: 61 REWTVKLEQNADQLENANFSRCVRNHSVPAMIFSLGGYSMNNFHDFTDIVIPLYTTARRF 120
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
+ +V + + + W K+ EL+ + + + +I I+ + THCF SVT+GL H
Sbjct: 121 --NGEVQFLVTNKSPSWINKFKELVRKLSNYEVIYIDEEDETHCFSSVTVGLTRHREYFK 178
Query: 189 NPTLLPKPK--TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
T+ P ++ DF+SFL + Y+ + ++ + + ++ R +N E+
Sbjct: 179 ELTIDPSNSEYSMSDFRSFLRDTYSLRNDAVATRQIRRRRPRILILARGRSRAFVNTGEI 238
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+AA ++GF V + E + +A + ++SC M+GVHGAGLT+ +FL +V++QV+P
Sbjct: 239 ARAARQIGFKVVV--AEANIGIAKFAQTVNSCDVMLGVHGAGLTNMVFLPENAVVIQVLP 296
Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
IG +WL+ FEKP+ + L YLEYKI EES+LV+KYG + ++++P A A W
Sbjct: 297 IGGFEWLAKTDFEKPSEGMNLRYLEYKIAVEESTLVKKYGRDHEIVRDPSAVAKHGWEMF 356
Query: 366 R-VYLKTQNVKLDIDRFRIYLKDA 388
+ VYL QNV +DI+RF+ L A
Sbjct: 357 KSVYLVQQNVSIDINRFKPVLVKA 380
>gi|297595984|ref|NP_001041860.2| Os01g0118400 [Oryza sativa Japonica Group]
gi|255672810|dbj|BAF03774.2| Os01g0118400 [Oryza sativa Japonica Group]
Length = 393
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/385 (34%), Positives = 199/385 (51%), Gaps = 22/385 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S D+C +G TS F V + K +PYPRK D + M +V
Sbjct: 12 CDFSDFRTDICDFSGDVRMAANTSEFVVVVDPAAAAVSH----KVRPYPRKGDATCMGRV 67
Query: 76 KELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
E+T+ T A C H PA+ FS GGY GN FH+F D VPL+ T+ +
Sbjct: 68 TEITVRATGDAAGAPRCTRAHDVPAVAFSIGGYTGNIFHDFSDVIVPLYNTVQRY--RGG 125
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTI---THCFQSVTLGLISHGRMVINP 190
V L + + W KY +LL +RH I++ HCF S + L +H ++I
Sbjct: 126 VQLVMVNVASWWLVKYDKLLRELSRHAPIDLAKAGAAGEVHCFPSAVVSLRAHRELIIER 185
Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRT--ILNLRE 245
DF FL A + + + +P+L++++R GRT +LNL
Sbjct: 186 ERSLDGLATPDFTRFLRRALSLPRDAPTRLGDGTGRRPRLLVISR----GRTRLLLNLDA 241
Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
V +AAEE+GF+V + E + S+ + I+SC A++GVHGAGLT+ +FL PG+ ++QVV
Sbjct: 242 VVRAAEEVGFEVVVNESDVGNSIEQVGKLINSCDALLGVHGAGLTNMMFLPPGATMVQVV 301
Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
P G QW++ + + +PA +GL Y++Y+I ESSL +KY A D + NP ++
Sbjct: 302 PWGGLQWIARMDYGEPAAAMGLNYIQYEIAVAESSLKDKYPAGDEIFTNPTGLHKRGFAF 361
Query: 365 MR-VYLKTQNVKLDIDRFRIYLKDA 388
M+ + Q++ +D+ RFR L+ A
Sbjct: 362 MKQTLMDGQDITIDVTRFRPVLQQA 386
>gi|53791307|dbj|BAD52572.1| putative HGA6 [Oryza sativa Japonica Group]
gi|53791441|dbj|BAD52493.1| putative HGA6 [Oryza sativa Japonica Group]
Length = 522
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/385 (34%), Positives = 199/385 (51%), Gaps = 22/385 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S D+C +G TS F V + +R PYPRK D + M +V
Sbjct: 141 CDFSDFRTDICDFSGDVRMAANTSEFVVVVDPAAAAVSHKVR----PYPRKGDATCMGRV 196
Query: 76 KELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
E+T+ T A C H PA+ FS GGY GN FH+F D VPL+ T+ +
Sbjct: 197 TEITVRATGDAAGAPRCTRAHDVPAVAFSIGGYTGNIFHDFSDVIVPLYNTVQRY--RGG 254
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTI---THCFQSVTLGLISHGRMVINP 190
V L + + W KY +LL +RH I++ HCF S + L +H ++I
Sbjct: 255 VQLVMVNVASWWLVKYDKLLRELSRHAPIDLAKAGAAGEVHCFPSAVVSLRAHRELIIER 314
Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRT--ILNLRE 245
DF FL A + + + +P+L++++R GRT +LNL
Sbjct: 315 ERSLDGLATPDFTRFLRRALSLPRDAPTRLGDGTGRRPRLLVISR----GRTRLLLNLDA 370
Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
V +AAEE+GF+V + E + S+ + I+SC A++GVHGAGLT+ +FL PG+ ++QVV
Sbjct: 371 VVRAAEEVGFEVVVNESDVGNSIEQVGKLINSCDALLGVHGAGLTNMMFLPPGATMVQVV 430
Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
P G QW++ + + +PA +GL Y++Y+I ESSL +KY A D + NP ++
Sbjct: 431 PWGGLQWIARMDYGEPAAAMGLNYIQYEIAVAESSLKDKYPAGDEIFTNPTGLHKRGFAF 490
Query: 365 MR-VYLKTQNVKLDIDRFRIYLKDA 388
M+ + Q++ +D+ RFR L+ A
Sbjct: 491 MKQTLMDGQDITIDVTRFRPVLQQA 515
>gi|9294073|dbj|BAB02030.1| unnamed protein product [Arabidopsis thaliana]
Length = 535
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/384 (33%), Positives = 209/384 (54%), Gaps = 19/384 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
C + R+ + C +NG K++T S T + + I+ PY RK D AM +V
Sbjct: 156 CTKLART-EFCELNGDVRVHGKSATV-SAAITFAFSGNSTWHIR--PYARKGDTVAMKRV 211
Query: 76 KELTITTS------APPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
+E T+ N S C H+ PA++FS GGY+ N FH+F D +PL+ T
Sbjct: 212 REWTVKLEQNADQLENANFSRCVRNHSVPAMIFSLGGYSMNNFHDFTDIVIPLYTTARRF 271
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
+ +V + + + W K+ EL+ + + + +I I+ + THCF SVT+GL H
Sbjct: 272 --NGEVQFLVTNKSPSWINKFKELVRKLSNYEVIYIDEEDETHCFSSVTVGLTRHREYFK 329
Query: 189 NPTLLPKPK--TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
T+ P ++ DF+SFL + Y+ + ++ + + ++ R +N E+
Sbjct: 330 ELTIDPSNSEYSMSDFRSFLRDTYSLRNDAVATRQIRRRRPRILILARGRSRAFVNTGEI 389
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+AA ++GF V + E + +A + ++SC M+GVHGAGLT+ +FL +V++QV+P
Sbjct: 390 ARAARQIGFKVVV--AEANIGIAKFAQTVNSCDVMLGVHGAGLTNMVFLPENAVVIQVLP 447
Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
IG +WL+ FEKP+ + L YLEYKI EES+LV+KYG + ++++P A A W
Sbjct: 448 IGGFEWLAKTDFEKPSEGMNLRYLEYKIAVEESTLVKKYGRDHEIVRDPSAVAKHGWEMF 507
Query: 366 R-VYLKTQNVKLDIDRFRIYLKDA 388
+ VYL QNV +DI+RF+ L A
Sbjct: 508 KSVYLVQQNVSIDINRFKPVLVKA 531
>gi|300681531|emb|CBH32628.1| hga5 [Triticum aestivum]
Length = 412
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 134/389 (34%), Positives = 206/389 (52%), Gaps = 28/389 (7%)
Query: 10 PAKSITCDRSHRSYD-------LCLINGSALFDPKTSTFFSVGHTDSTP-SQPSLRIKTQ 61
P SITCD + D +C ++G P +S+ S P Q + + +
Sbjct: 22 PKGSITCDDKSKDNDFPYARPVVCKMSGDVRIAPGSSSVIL-----SMPLYQSAEGRRVR 76
Query: 62 PYPRKSDKSAMSKVKELTITTSAPPNLS--CGVTHTS-PALVFSAGGYNGNFFHEFMDCF 118
PY R D S V+E+ I T A + + C V H PA+VFS GGY NFFH+ D
Sbjct: 77 PYARHDD-SLPPLVREVAIKTVANGSDAPECSVGHGDIPAVVFSVGGYTRNFFHDMSDVL 135
Query: 119 VPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTL 178
+PL++T F + V + D W +KY +L R +R+ I++ ++ HCF V L
Sbjct: 136 IPLYLTA-FQFKGR-VQFFVTDYKQWWLKKYKPILRRLSRYDIVDFDSNNDVHCFHHVIL 193
Query: 179 GLISHGRMVI--NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRN 233
GL+ +++ +PT PK ++V F FL +AY N KP++++++R
Sbjct: 194 GLVRDRDLILRRHPTRNPKGYSMVGFTRFLRHAYGLRRNRPFVLGENPGKKPRMLIISR- 252
Query: 234 ARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSL 293
R R +LNL V+ A LGFDVT+ E S+ ++SC +V VHG GLT+ +
Sbjct: 253 -RGTRRLLNLHRVEAMATALGFDVTVSE-AGGNSVKRFAETVNSCDVLVAVHGGGLTNQM 310
Query: 294 FLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLK 352
FL +V++Q+VP G +W++T ++ +PAR +GL YLEY + EESSL +Y + V +
Sbjct: 311 FLPAKAVVVQIVPWGGMEWMATNFYGEPARGMGLRYLEYHVAGEESSLARRYPRDHAVFR 370
Query: 353 NPQAFAGANWSNMRVYLKTQNVKLDIDRF 381
+P A W + + TQ+V+LD+DRF
Sbjct: 371 DPMAIHAQGWKALAEVVMTQDVRLDLDRF 399
>gi|357131745|ref|XP_003567495.1| PREDICTED: uncharacterized protein LOC100826127 [Brachypodium
distachyon]
Length = 508
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 129/391 (32%), Positives = 205/391 (52%), Gaps = 26/391 (6%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
I CDRS + D+C ++G + +F + Q + +PY RK ++ M+
Sbjct: 110 ICCDRSDFNTDVCFMSGDVRTHAASLSFLLFNANATVEEQ-----RIRPYTRKWERHLMA 164
Query: 74 KVKELTI---TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
+ E+ + T S C V H +PALV +AGGY GN FH F D F+P ++T +SH
Sbjct: 165 SIHEVRLRAPTASESETSKCDVVHEAPALVMTAGGYTGNLFHAFNDGFLPAWLT-SSHL- 222
Query: 131 DQDVILAIADCNDQWARKYAELLPRFT-RHPIININNQTITHCF-QSVTLGLISHGRMVI 188
V+LA+ N WA + EL+ + R ++++ + T THCF +G HG + +
Sbjct: 223 RHGVVLAVLAYNPWWAGTFRELISELSGRRGVVDLVHDTRTHCFPAGAIVGSRFHGVLSV 282
Query: 189 NPTLLPKPKTLVDFQSFLANAYNENTNT--SSSFHHTKPKLVLVNRNARVGRTILNLREV 246
+P K+L+DF +FLA AY + +P+L ++ R R I N V
Sbjct: 283 DPARTRDHKSLLDFHTFLARAYEADNAALKQEEQQGRRPRLGILARKGN--RVIENQGAV 340
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG-------- 298
+ AE +GF+V+I E L+ S+ + +C ++GVHGA LT LFLRP
Sbjct: 341 ARLAESIGFEVSILETANGAPLSASYAAVSACDVLLGVHGADLTKLLFLRPSNNTNSNST 400
Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ-AF 357
+ ++Q+ P+G ++ + + +GL Y +Y + ESSL KY A+D+++ +P+ A
Sbjct: 401 AAVLQIAPLGVGPIARGCYAEATVSMGLRYEQYDVVAGESSLRLKYAADDVIVADPETAK 460
Query: 358 AGANWSNM-RVYLKTQNVKLDIDRFRIYLKD 387
GA W + +VYL +QNV LD+DRF L+D
Sbjct: 461 KGAGWELVAKVYLGSQNVTLDLDRFGDTLRD 491
>gi|31296712|gb|AAP46641.1| HGA1 [Hordeum vulgare]
Length = 505
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 127/387 (32%), Positives = 208/387 (53%), Gaps = 31/387 (8%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ C D C + G + T V T++ S K QPY R+ MS
Sbjct: 117 VVCTTEGPFSDTCDVFGDVRTNGTAHTVTLVPATETE----SQEWKIQPYARR----GMS 168
Query: 74 KVKELTI-----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
+ E T+ T++A P +C VTH PA+VF+ GG GN+FH+F D VPLF+ +
Sbjct: 169 GISEFTVMQLDSTSAASPAPACTVTHRVPAIVFALGGLTGNYFHDFSDALVPLFVASRRY 228
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
+V L ++ W KY ++ + +++ ++++++ CF SVT+GL H I
Sbjct: 229 --GGEVQLLASNIQPWWLGKYEAVVRKLSKYDVVDLDHDDQIRCFPSVTVGLRMHKEFDI 286
Query: 189 NPTLLPK--PKTLVDFQSFLANAYN----------ENTNTSSSFHHTKPKLVLVNRNARV 236
P L+P P ++VDF +FL Y+ ++ + KP+L+L++R
Sbjct: 287 TPELVPGGVPLSMVDFTAFLRETYSLPRAGPISLMKDISPPEDQEKRKPRLMLLHRGHY- 345
Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R ++N+ E+ KAAE+ GF+V I +P + + R ++S ++GVHGAGLT+++FL
Sbjct: 346 -RKLVNVPEIVKAAEKAGFEVAIADPRFDVRVEELARSVNSFDVLLGVHGAGLTNAVFLP 404
Query: 297 PGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
+V++QVVP G + ++ V F P +GL YLEY I EES+L+E G + V+K+P+
Sbjct: 405 TAAVVIQVVPYGNLERMAKVDFGDPVADMGLRYLEYSITAEESTLLEMLGPDHPVIKDPE 464
Query: 356 AFAGANWSNM-RVYLKTQNVKLDIDRF 381
+ + W + YL Q+V++D+DRF
Sbjct: 465 SVHRSGWDKVAEYYLGKQDVRVDVDRF 491
>gi|300681543|emb|CBH32640.1| glycosyltransferase, HGA-like, putative,expressed [Triticum
aestivum]
Length = 526
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 127/387 (32%), Positives = 209/387 (54%), Gaps = 31/387 (8%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ C D C + G + T V T++ S K QPY R+ MS
Sbjct: 138 VVCTTEGPFSDTCDVFGDVRTNGTAHTVTLVPATETE----SREWKIQPYVRR----GMS 189
Query: 74 KVKELTIT-----TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
+ E+T+T ++A P +C VTH PA+VF+ GG GN+FH+F D VPLF+ +
Sbjct: 190 GISEVTVTQLDSTSAASPAPACTVTHRVPAIVFALGGLTGNYFHDFSDALVPLFVASRRY 249
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
+V L ++ W KY ++ R +++ ++++++ CF SVT+GL H I
Sbjct: 250 --GGEVQLLASNIQPWWLGKYEAVVRRLSKYDVVDLDHDDQIRCFPSVTVGLRMHKEFDI 307
Query: 189 NPTLLP--KPKTLVDFQSFLANAYN----------ENTNTSSSFHHTKPKLVLVNRNARV 236
P L+P P ++VDF +FL Y ++ + KP+L+L++R
Sbjct: 308 VPELVPGGAPLSMVDFTAFLRETYTLPRAAPISLMKDISPPEDQEKRKPRLMLLHRGHY- 366
Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R +N+ E+ KAAE+ GF+V+I +P + + R ++S ++GVHGAGLT+++F+
Sbjct: 367 -RKFVNVPEIVKAAEKAGFEVSIADPRFDVRVEELARSVNSFDVLLGVHGAGLTNAVFMP 425
Query: 297 PGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
G+V++QVVP G + ++ V F P +GL YLEY I EES+L+E G + V+K+P+
Sbjct: 426 TGAVVIQVVPYGNLEHMAKVDFGDPVADMGLRYLEYSITAEESTLLEMLGPDHPVIKDPE 485
Query: 356 AFAGANWSNM-RVYLKTQNVKLDIDRF 381
+ + W + YL Q+V++D++RF
Sbjct: 486 SVHRSGWDKVAEYYLGKQDVRVDVERF 512
>gi|300681530|emb|CBH32627.1| glycosyltransferase, HGA-like, putative,expressed [Triticum
aestivum]
Length = 629
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 129/385 (33%), Positives = 201/385 (52%), Gaps = 20/385 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S R D+C G + TS+F V + +R PYPRK D++ M +V
Sbjct: 246 CDFSDRRSDVCDFTGDIRMEANTSSFVVVVDAATAAQSHKVR----PYPRKGDQTCMGRV 301
Query: 76 KELTITTSAPPNL----SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
E+T+ T++ + C TH+ PA+ FS GGY GN FH+F D VPL+ T+ H
Sbjct: 302 PEITVRTASSSSTPPPPQCTRTHSVPAVTFSIGGYTGNIFHDFSDVLVPLYNTV--HRYR 359
Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININ---NQTITHCFQSVTLGLISHGRMVI 188
DV L +A+ W KY +LL +RH +++ + THCF+ + L +H ++I
Sbjct: 360 GDVQLVMANVVPWWLVKYDKLLRELSRHAPLDLAVAAAKGETHCFRHAVVSLRAHRELII 419
Query: 189 NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLRE 245
P DF F+ A + + + KP+L+++ R+ R +LNL +
Sbjct: 420 ERDRSPDGLATPDFTRFIRRALSLPRDAPTRLADGMGRKPRLLIIARHRT--RILLNLGD 477
Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
+ + AEE GF+ + E + S++ I+S ++GVHGAGLT+ +FL PG+ L+QVV
Sbjct: 478 MLRVAEEAGFEAAVSESDVGDSISRVGAEINSADVLLGVHGAGLTNMMFLAPGATLVQVV 537
Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
P G QW++ + + PA +GL Y++Y+I EESSL + Y + +P + +
Sbjct: 538 PWGGLQWIARMDYGDPAEAMGLRYVQYEIGVEESSLKDTYPRGHKIFTDPTSLHKKGFGF 597
Query: 365 M-RVYLKTQNVKLDIDRFRIYLKDA 388
M R + QN+ LD+ RFR L A
Sbjct: 598 MRRTLMDGQNITLDLGRFRGVLHQA 622
>gi|55792425|gb|AAV65333.1| HGA1 [Hordeum vulgare]
Length = 505
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 126/387 (32%), Positives = 208/387 (53%), Gaps = 31/387 (8%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ C D C + G + T V T++ S K QPY R+ MS
Sbjct: 117 VVCTTEGPFSDTCDVFGDVRTNGTAHTVTLVPATETE----SQEWKIQPYARR----GMS 168
Query: 74 KVKELTI-----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
+ E T+ T++A P +C VTH PA+VF+ GG GN+FH+F D VPLF+ +
Sbjct: 169 GISEFTVMQLDSTSAASPAPACTVTHRVPAIVFALGGLTGNYFHDFSDALVPLFVASRRY 228
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
+V L ++ W KY ++ + +++ ++++++ CF +VT+GL H I
Sbjct: 229 --GGEVQLLASNIQPWWLGKYEAVVRKLSKYDVVDLDHDDQIRCFPNVTVGLRMHKEFDI 286
Query: 189 NPTLLPK--PKTLVDFQSFLANAYN----------ENTNTSSSFHHTKPKLVLVNRNARV 236
P L+P P ++VDF +FL Y+ ++ + KP+L+L++R
Sbjct: 287 TPELVPGGVPLSMVDFTAFLRETYSLPRAAPISLMKDISPPEDQEKRKPRLMLLHRGHY- 345
Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R ++N+ E+ KAAE+ GF+V I +P + + R ++S ++GVHGAGLT+++FL
Sbjct: 346 -RKLVNVPEIVKAAEKAGFEVAIADPRFDVRVEELARSVNSFDVLLGVHGAGLTNAVFLP 404
Query: 297 PGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
+V++QVVP G + ++ V F P +GL YLEY I EES+L+E G + V+K+P+
Sbjct: 405 TAAVVIQVVPYGNLERMAKVDFGDPVADMGLRYLEYSITAEESTLLEMLGPDHPVIKDPE 464
Query: 356 AFAGANWSNM-RVYLKTQNVKLDIDRF 381
+ + W + YL Q+V++D+DRF
Sbjct: 465 SVHRSGWDKVAEYYLGKQDVRVDVDRF 491
>gi|334353407|emb|CCA61105.1| xylan arabinosyl transferase [Triticum aestivum]
Length = 506
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 122/383 (31%), Positives = 202/383 (52%), Gaps = 21/383 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPYPRKSDKSAMSK 74
C S R D C G G + + +P R K +PY RK D A+S
Sbjct: 125 CYESGRRSDTCEAAGDVRVQ---------GRSQTIQVRPLDREWKVKPYCRKQDAYALSH 175
Query: 75 VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
VKE T+ + C V ++ A V S GG+ GN FH++ D VP FIT +H +V
Sbjct: 176 VKEWTLRPLSGGGPHCTVNSSATAFVISTGGFTGNLFHDYTDVLVPAFIT--AHRFGGEV 233
Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLP 194
++ W +Y E+ + +RH +I+I+N C+ SV +G H + ++ + P
Sbjct: 234 QFLVSSFKSWWTNRYLEIFQQLSRHEVIDIDNDDEVRCYGSVVVGPTFHKELGVDASKTP 293
Query: 195 KPKTLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLVNRNARVGRTILNLREVKKA 249
++VDF++ L A+ + + +P+L++++R + R LN R +
Sbjct: 294 TGASMVDFRAMLRGAFGLSRAAAEPSGDRWDIRRRPRLLIISR--KNSRAFLNERAMADM 351
Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
A LG+DV + EP+ +T ++ R ++S MVGVHGAGLT+ +FL G+VL+QV+P G
Sbjct: 352 AMSLGYDVRVGEPDTNTEVSRFARLVNSADVMVGVHGAGLTNMVFLPAGAVLIQVIPYGG 411
Query: 310 -QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-V 367
+WL+ FE+P++ + L Y+ YKI+ +E++L E+Y + VL +P + W ++ V
Sbjct: 412 LEWLARGTFEEPSKDMQLHYIGYKIQLDETTLSEQYPKDHPVLTDPLSIHKQGWEALKTV 471
Query: 368 YLKTQNVKLDIDRFRIYLKDAYK 390
YL+ QNV+ + R ++ +A K
Sbjct: 472 YLEKQNVRPHLGRLKLTFLEALK 494
>gi|413954174|gb|AFW86823.1| hypothetical protein ZEAMMB73_859783 [Zea mays]
Length = 388
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 129/388 (33%), Positives = 203/388 (52%), Gaps = 21/388 (5%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+TC+ S+R D C + G T + + ++ P+ ++ I+ PY RK ++ M
Sbjct: 4 VTCNWSNRHSDTCKMEGDLRIHGMAGTVYVLSSSNFRPNNSTITIR--PYTRKWEQETML 61
Query: 74 KVKELTITTSAPPNLS------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
+++E++I ++AP S C V H PA+VFS GG NFFH D VPL+IT
Sbjct: 62 RIREVSIRSTAPEPFSFVIPPKCTVRHDVPAVVFSTGGCGTNFFHAMTDLIVPLYITARE 121
Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
H + V L +AD +W K+ +L + +P+I+ + T CF S +GL SH +
Sbjct: 122 H--NGHVQLLVADYQPEWVAKFRPILTALSIYPVIDFDADTAVRCFPSAHVGLESHRILG 179
Query: 188 INPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLR 244
INP L T++ F+ FL + ++ ++ + KP+LV V R R R + N
Sbjct: 180 INPALSRNSYTMMGFRDFLRDVFSLRRPWATPVSRSSGQKPRLVFVLR--RHSREVTNEV 237
Query: 245 EVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
+ A LGF+V + PE+ +A ++SC MVGVHGAGLT+ +FL ++Q+
Sbjct: 238 DAIAALAGLGFEVVVAGPEDVRDMAKIAGVVNSCDVMVGVHGAGLTNMVFLPHNGTIVQI 297
Query: 305 VPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF--AGAN 361
+P G ++ F PA +GL Y+EY+ EE++L KY + V +P + +G
Sbjct: 298 IPWGNLKYPCRFDFGDPAPDMGLRYVEYEANAEETTLKYKYPRDHPVFTDPISIERSGKL 357
Query: 362 WSNMRVYLKTQNVKLDIDRFRIYLKDAY 389
W +L+ QNV LDIDRFR ++ Y
Sbjct: 358 WET---FLEGQNVTLDIDRFREAMQQVY 382
>gi|242095988|ref|XP_002438484.1| hypothetical protein SORBIDRAFT_10g020460 [Sorghum bicolor]
gi|241916707|gb|EER89851.1| hypothetical protein SORBIDRAFT_10g020460 [Sorghum bicolor]
Length = 495
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 128/386 (33%), Positives = 205/386 (53%), Gaps = 19/386 (4%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ C+ S+R D C + G K++T + + + P+ ++ I+ PY RK ++ M+
Sbjct: 115 LACNWSNRHSDTCRMEGDLRIHGKSATVYVLSSSTFNPNNSTITIR--PYTRKWEQETMA 172
Query: 74 KVKELTITTSAPPNLS------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
+++E+TI ++AP S C V H PA+VFS GG NFFH D +PL+IT
Sbjct: 173 RIREVTIRSTAPEPYSFVIPPKCTVRHDVPAVVFSTGGCGTNFFHAMTDLIIPLYITARE 232
Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
+ + V L +AD ++ K+ +L + +PII+ + T CF S +GL SH +
Sbjct: 233 Y--NGHVQLLVADYQPEFVAKFRPILAALSIYPIIDFDADTAVRCFPSAHVGLESHRILG 290
Query: 188 INPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
INP L T++ F+ FL + ++ ++ KP+LV V R R R + N +
Sbjct: 291 INPGLSRNGYTMMGFRDFLRDVFSLPRPWATPVSR-KPRLVFVVR--RHSRAVTNEADAI 347
Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
A +LGF+V PE+ +A ++SC MVGVHGAGLT+ +FL ++Q++P
Sbjct: 348 AAVADLGFEVVAAGPEDVGDMAKIAAVVNSCDVMVGVHGAGLTNMVFLPHNGTIVQIIPW 407
Query: 308 GT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF--AGANWSN 364
G ++ F P +GL Y+EY++ EE++L KY + V +P + +G W
Sbjct: 408 GNLKYPCRFDFGDPVPDMGLHYVEYEVNAEETTLKYKYPRDHPVFTDPLSIERSGKLWE- 466
Query: 365 MRVYLKTQNVKLDIDRFRIYLKDAYK 390
+L+ QNV LDIDRFR ++ YK
Sbjct: 467 --TFLEGQNVTLDIDRFREAMQQVYK 490
>gi|357521533|ref|XP_003631055.1| Glycosyltransferase, putative [Medicago truncatula]
gi|355525077|gb|AET05531.1| Glycosyltransferase, putative [Medicago truncatula]
Length = 420
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/379 (34%), Positives = 206/379 (54%), Gaps = 25/379 (6%)
Query: 24 DLCLINGSALFDPKTSTFFSVGH-TDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITT 82
+ C G K+S+ + V T+S S I+ PY RKSD MS V + ++ T
Sbjct: 47 EYCQTQGDIRVHGKSSSVYIVSRKTNSLAENVSWIIR--PYARKSDAYTMSSVTKWSVKT 104
Query: 83 SAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIAD 140
P + C H+ PA++FS GY GN FHEF D +PLF+T F Q V L I D
Sbjct: 105 VKPTHQVSQCTKYHSIPAVIFSTAGYTGNHFHEFSDIVIPLFLTCR-QFNGQ-VQLIITD 162
Query: 141 CNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS-HGRMVINPTLLPKPKTL 199
W K+ L + + + II+I+ HCF V +GL H + I+P ++
Sbjct: 163 KKSWWISKHQAFLKKLSNYEIIDIDRDDELHCFPKVIIGLKRYHKELSIDPQ--KYSYSI 220
Query: 200 VDFQSFLANAYNENTNTSSSF-----HHTKPKLVLVNRNARVGRTILNLREVKKAAEELG 254
DF+ FL ++Y+ ++ KP+L++++R + R+ N ++ K A+ LG
Sbjct: 221 KDFRDFLRSSYSLKRVSAIKIRDIGNQSKKPRLLILSR--KTSRSFTNTNQIAKMAKGLG 278
Query: 255 FDVTIFEPEEST-SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG---TQ 310
F V + E + S+A+ ++SC ++GVHGAGLT+ LFL ++ +QVVP G +
Sbjct: 279 FRVIVMEAGRNMRSIAN---VVNSCDVLMGVHGAGLTNILFLPQNAIFIQVVPFGGMQVE 335
Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
WL+T F +P+ + ++YLEYKI+ +ES+L+++Y + +++K+P + W R VY
Sbjct: 336 WLATNDFARPSEDMNIKYLEYKIRLDESTLIQQYPLDHMIIKDPSSIEKQGWEAFRSVYF 395
Query: 370 KTQNVKLDIDRFRIYLKDA 388
QNV+LD++RFR L+ A
Sbjct: 396 DKQNVRLDVNRFRPTLQKA 414
>gi|383100765|emb|CCG47996.1| glycosyltransferase, HGA-like,putative,expressed [Triticum
aestivum]
Length = 527
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 132/388 (34%), Positives = 210/388 (54%), Gaps = 41/388 (10%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
S TCD + NG+A T T V T+S K QPY R+ M
Sbjct: 148 SDTCD----VFGDVRTNGTA----HTVTLVPVTQTESR------EWKIQPYTRR----GM 189
Query: 73 SKVKELTIT----TSA-PPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
S + E+T+T TSA P +C VTH P +VF+ GG GN+FH+F D VPLF+
Sbjct: 190 SGISEVTVTQLDSTSADSPAPACTVTHRVPGIVFALGGLTGNYFHDFSDALVPLFVASRR 249
Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
+ +V L ++ W KY ++ R +++ ++++++ CF SVT+GL H
Sbjct: 250 Y--GGEVQLLASNIQPWWLGKYEAVVRRLSKYDVVDLDHDDQIRCFPSVTVGLRMHKEFD 307
Query: 188 INPTLLP--KPKTLVDFQSFLANAYN----------ENTNTSSSFHHTKPKLVLVNRNAR 235
I P L+P P ++VDF +FL Y ++ + KP+L+L++R
Sbjct: 308 IVPELVPGGAPLSMVDFTAFLRETYTLPRAAPISLMKDISPPEDQEKRKPRLMLLHRGHY 367
Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R +N+ E+ KAAE+ GF+V+I +P + + R ++S ++GVHGAGLT+++F+
Sbjct: 368 --RKFVNVPEIVKAAEKAGFEVSIADPRFDVKVEELARSVNSFDVLLGVHGAGLTNAVFM 425
Query: 296 RPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
G+V++QVVP G + ++ V F P +GL YLEY I EES+L+E G + V+K+P
Sbjct: 426 PTGAVVIQVVPYGNLEHMAKVDFGDPVADMGLRYLEYSITAEESTLLEMLGPDHPVIKDP 485
Query: 355 QAFAGANWSNM-RVYLKTQNVKLDIDRF 381
++ + W + YL Q+V++D++RF
Sbjct: 486 ESVHRSGWDKVAEYYLGKQDVRVDVERF 513
>gi|297734272|emb|CBI15519.3| unnamed protein product [Vitis vinifera]
Length = 446
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 133/389 (34%), Positives = 204/389 (52%), Gaps = 40/389 (10%)
Query: 12 KSITCDRSHRSYDLCLINGSALFDPKTSTFF--SVGHTDSTPSQPSLRIKTQPYPRKSDK 69
+++ C+ RS D C+I+G +ST F S D P S I+ PY RK D
Sbjct: 85 EALVCNIEPRS-DFCVISGDVRVHGNSSTVFIASSAPVDILPENGSWSIR--PYARKGDA 141
Query: 70 SAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
AM +K T+ TT C HT PA++FS GGY+GN FH F D +PL++T S
Sbjct: 142 RAMKHIKNFTVKMTTGRQHLPHCTQNHTVPAILFSLGGYSGNHFHAFSDVLIPLYLT--S 199
Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
+ +V + + W K+ LL +R+PII+I+ + HCF S +GL
Sbjct: 200 RQFNGEVQFLVTSKSLWWIAKFRILLQELSRYPIIDIDREEGIHCFSSAIIGL------- 252
Query: 188 INPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH----TKPKLVLVNRNARVGRTILNL 243
+ FL ++Y+ T+ KP+L+++ R + R+ N
Sbjct: 253 ---------------KEFLRSSYSLKRATAIKVRDGTDTKKPRLLIIAR--KKSRSFTND 295
Query: 244 REVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
++ + A LG++V + EP T ++ ++SC ++GVHGAGLT+ +FL +VL+Q
Sbjct: 296 GKIAEMARSLGYEVIVAEPN-GTEISRFAELVNSCDVLMGVHGAGLTNIVFLPENAVLIQ 354
Query: 304 VVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW 362
VVP+G +W++ F PA + + Y+EY+IK+EESSL+EKY VL+ P + W
Sbjct: 355 VVPLGGLEWVARYDFGLPAVDMKIRYIEYQIKEEESSLIEKYPHEHAVLREPHSITKLGW 414
Query: 363 SNMR-VYLKTQNVKLDIDRFRIYLKDAYK 390
++ VYL QNVKLD++RFR L A +
Sbjct: 415 LELKAVYLDKQNVKLDLNRFRNTLLQALQ 443
>gi|356511269|ref|XP_003524349.1| PREDICTED: uncharacterized glycosyltransferase AGO61-like [Glycine
max]
Length = 462
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 134/391 (34%), Positives = 205/391 (52%), Gaps = 28/391 (7%)
Query: 12 KSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSA 71
+ + C R+ C G K+ST + V + +T S+ ++ +PY R+ D A
Sbjct: 80 EELLCTSEERT-KFCQARGDIRVHGKSSTVYIVS-SKTTMSEKNMSWNLKPYARRDDVDA 137
Query: 72 MSKVKE-----LTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN 126
M +V+E + ++ AP C H PA++FS GGY GN FHEF D +PLF+T
Sbjct: 138 MIRVREWSVKAVNVSQKAP---QCTQYHNIPAVLFSTGGYAGNHFHEFTDIVIPLFLTAR 194
Query: 127 SHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR- 185
+ +V I D W K+ LL + + + ++I+ HCF VT+GL + +
Sbjct: 195 QF--NGEVQFIITDKRPWWISKHKPLLKKLSNYETMDIDGDDEVHCFPRVTVGLKRYQKE 252
Query: 186 MVINPTLLPKPKTLVDFQSFLANAYN----ENTNTSSSFHHTKPKLVLVNRNARVGRTIL 241
+ I P ++ DF+ L ++Y E T KP+L++++R + R
Sbjct: 253 LSIEPQ--KYSYSMKDFRDLLRSSYALKRVEAIKTRDGL-RGKPRLMILSR--KRSRFFT 307
Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
N E+ K AE LGFDV I E S+ ++SC ++GVHGAGLT+ LFL +V
Sbjct: 308 NTDEIAKMAESLGFDVII--KEAGWSMWGFANVVNSCDVLLGVHGAGLTNILFLPENAVF 365
Query: 302 MQVVPIG---TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFA 358
+QVVP G WL+T F P++ + ++YLEYKI EES+L+++Y + + +K+P
Sbjct: 366 VQVVPYGGVTLDWLATNDFGNPSKDMNIKYLEYKISLEESTLIQQYPLDHMFIKDPPLIE 425
Query: 359 GANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
W + VYL QNVKLD+DRFR L+ A
Sbjct: 426 KIGWEEFKSVYLDKQNVKLDVDRFRPTLQKA 456
>gi|357521531|ref|XP_003631054.1| Glycosyltransferase, putative [Medicago truncatula]
gi|355525076|gb|AET05530.1| Glycosyltransferase, putative [Medicago truncatula]
Length = 530
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 129/379 (34%), Positives = 206/379 (54%), Gaps = 25/379 (6%)
Query: 24 DLCLINGSALFDPKTSTFFSVGH-TDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITT 82
+ C G K+S+ + V T+S S I+ PY RKSD MS V + ++ T
Sbjct: 157 EYCQTQGDIRVHGKSSSVYIVSRKTNSLAENVSWIIR--PYARKSDAYTMSSVTKWSVKT 214
Query: 83 SAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIAD 140
P + C H+ PA++FS GY GN FHEF D +PLF+T F Q V L I D
Sbjct: 215 VKPTHQVSQCTKYHSIPAVIFSTAGYTGNHFHEFSDIVIPLFLTCR-QFNGQ-VQLIITD 272
Query: 141 CNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS-HGRMVINPTLLPKPKTL 199
W K+ L + + + II+I+ HCF V +GL H + I+P ++
Sbjct: 273 KKSWWISKHQAFLKKLSNYEIIDIDRDDELHCFPKVIIGLKRYHKELSIDPQ--KYSYSI 330
Query: 200 VDFQSFLANAYNENTNTSSSF-----HHTKPKLVLVNRNARVGRTILNLREVKKAAEELG 254
DF+ FL ++Y+ ++ KP+L++++R + R+ N ++ K A+ LG
Sbjct: 331 KDFRDFLRSSYSLKRVSAIKIRDIGNQSKKPRLLILSR--KTSRSFTNTNQIAKMAKGLG 388
Query: 255 FDVTIFEPEEST-SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG---TQ 310
F V + E + S+A+ ++SC ++GVHGAGLT+ LFL ++ +QVVP G +
Sbjct: 389 FRVIVMEAGRNMRSIAN---VVNSCDVLMGVHGAGLTNILFLPQNAIFIQVVPFGGMQVE 445
Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
WL+T F +P+ + ++YLEYKI+ +ES+L+++Y + +++K+P + W R VY
Sbjct: 446 WLATNDFARPSEDMNIKYLEYKIRLDESTLIQQYPLDHMIIKDPSSIEKQGWEAFRSVYF 505
Query: 370 KTQNVKLDIDRFRIYLKDA 388
QNV+LD++RFR L+ A
Sbjct: 506 DKQNVRLDVNRFRPTLQKA 524
>gi|56409866|emb|CAI30081.1| glycosyltransferase [Triticum aestivum]
Length = 439
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 122/383 (31%), Positives = 203/383 (53%), Gaps = 21/383 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPYPRKSDKSAMSK 74
C S R D C G G + + +P R K +PY RK D A+S
Sbjct: 58 CYESGRRSDTCEAAGDVRVQ---------GRSQTIQVRPLDREWKVKPYCRKQDAYALSH 108
Query: 75 VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
VKE T+ + C V ++ A V S GG+ GN FH++ D VP FIT +H +V
Sbjct: 109 VKEWTLRPLSGGGPHCTVNSSATAFVISTGGFTGNPFHDYTDVLVPAFIT--AHRFGGEV 166
Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLP 194
++ W +Y E+ + +R+ +I+I+N C+ SV +G H + ++P+ P
Sbjct: 167 QFLVSSFKSWWTNRYLEIFQQLSRYEVIDIDNDDEVRCYGSVVVGPTFHKELGVDPSKTP 226
Query: 195 KPKTLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLVNRNARVGRTILNLREVKKA 249
++VDF++ L A+ + + +P+L++++R + R LN R +
Sbjct: 227 TGASMVDFRAMLRGAFGLSRAAAEPSGDRWDIRRRPRLLIISR--KNSRAFLNERAMADM 284
Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
A LG+DV + EP+ +T ++ R ++S MVGVHGAGLT+ +FL G+VL+QV+P G
Sbjct: 285 AMSLGYDVRVGEPDTNTEVSRFARLVNSADVMVGVHGAGLTNMVFLPAGAVLIQVIPYGG 344
Query: 310 -QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-V 367
+WL+ FE+P++ + L Y+ YKI+ +E++L E+Y + VL +P + W ++ V
Sbjct: 345 LEWLARGTFEEPSKDMQLHYIGYKIQLDETTLSEQYPKDHPVLTDPLSIHKQGWEALKTV 404
Query: 368 YLKTQNVKLDIDRFRIYLKDAYK 390
YL+ QNV+ + R ++ +A K
Sbjct: 405 YLEKQNVRPHLGRLKLTFLEALK 427
>gi|242056669|ref|XP_002457480.1| hypothetical protein SORBIDRAFT_03g008010 [Sorghum bicolor]
gi|241929455|gb|EES02600.1| hypothetical protein SORBIDRAFT_03g008010 [Sorghum bicolor]
Length = 558
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 144/403 (35%), Positives = 210/403 (52%), Gaps = 35/403 (8%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S D+C G D S F V D T + K +PY RK D ++MS+V
Sbjct: 158 CDLSGSRSDVCDFTGDIRLDANASAFIVV---DPTGDANAPTYKVRPYARKGDATSMSRV 214
Query: 76 KELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVI 135
E+T+ T+ C TH PA+VFS GGY GN FH+F D VPL+ T + V
Sbjct: 215 TEVTVRTTTADAPRCTATHAEPAVVFSIGGYAGNLFHDFTDVIVPLYGTAQRY--GGVVR 272
Query: 136 LAIADCN---DQWARKYAELLPRFTRHPIININNQT---ITHCFQSVTLGL-ISHGRMVI 188
L +AD +W KY +L +RHP +++ THCF V +GL +H ++I
Sbjct: 273 LVVADAGAGPSRWLAKYDAVLRGLSRHPPLDLAATAPGEETHCFGHVVVGLRAAHRELMI 332
Query: 189 NP--TLLPKPKT----LVDFQSFLANAYN----------ENTNTSSSFHHTKPKLVLVNR 232
+ P +VDF FL A + + ++ KP+L++V R
Sbjct: 333 DERDERSSGPDAVGVGMVDFARFLRRALSLPRDAVTTRPSSDAVATGTKKPKPRLLIVAR 392
Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTS---LADSFRFIHSCHAMVGVHGAGL 289
R R +LN V + AEE+GF+ + E E S S +A+ R I+S A+VGVHGAGL
Sbjct: 393 --RGTRRLLNADAVARVAEEVGFEAVVSELEVSKSDDGIAEVGRRINSFDAVVGVHGAGL 450
Query: 290 THSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
T+ +FL G+ ++QVVP G QW++ + F PA +GL Y++Y+I ESSL +KY ++
Sbjct: 451 TNMVFLPRGATVVQVVPWGGLQWIARMDFGDPAEAMGLRYVQYEIAVHESSLRDKYPSDH 510
Query: 349 LVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYK 390
+ NP A + +R +L Q+V LD+DRFR+ L A++
Sbjct: 511 EIFTNPTALHRKGFKFLRHTFLIGQDVTLDVDRFRVVLLQAFQ 553
>gi|449484981|ref|XP_004157036.1| PREDICTED: glycosyltransferase-like domain-containing protein
2-like [Cucumis sativus]
Length = 372
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 119/334 (35%), Positives = 179/334 (53%), Gaps = 7/334 (2%)
Query: 25 LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITTSA 84
+CL N + F+ + DS + S I PY R+ DK + V L I
Sbjct: 35 VCLTNNPTRINNTNLEFYISTNNDSQQNNFS-PILIHPYARQEDKITLRDVTPLQIIFQP 93
Query: 85 PPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCN 142
L C H P L+FS GG+ GN FHEF + +PLFIT + HF + V I D
Sbjct: 94 NKTLLPLCQFIHNVPVLIFSTGGFTGNLFHEFDETIIPLFIT-SYHFQTR-VRFLITDHK 151
Query: 143 DQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDF 202
W +KY +L +R ++N+ HCF +GL H + +N T +P ++ DF
Sbjct: 152 TWWVQKYNRILSGLSRFNVVNLAEDGSVHCFNGGVIGLKFHNILSLNNTDIPGGYSMSDF 211
Query: 203 QSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEP 262
+SFL YN N S KP ++L++R + R +N E+ + +E+GF+V P
Sbjct: 212 RSFLRQTYNLKVNNVSELSGKKPMVMLISR--QTSRRFMNEGEMVEMMKEVGFEVMTTTP 269
Query: 263 EESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPAR 322
+ ++L ++ C ++G HGAGLT+ +FL G+V++QVVP G W ST +F KPA
Sbjct: 270 QRMSNLDKFSSVVNLCSVIIGAHGAGLTNEVFLANGAVVVQVVPFGLDWPSTYFFGKPAA 329
Query: 323 VLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQA 356
+ L+YLEYKI+ +ESSL +KYG N V+++P++
Sbjct: 330 EMELQYLEYKIEAKESSLWDKYGENHPVIRDPES 363
>gi|326527195|dbj|BAK04539.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 519
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 124/391 (31%), Positives = 205/391 (52%), Gaps = 24/391 (6%)
Query: 10 PAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPYPRKSD 68
P K + C S R D C G G + + +P R K +PY RK D
Sbjct: 131 PGKPV-CYESGRRSDTCEAAGDVRVQ---------GRSQTIQVRPLDREWKVKPYCRKQD 180
Query: 69 KSAMSKVKELTIT--TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN 126
A+S VKE T+ +S+ P C V ++ A V S GG+ GN FH++ D VP FIT
Sbjct: 181 AYALSHVKEWTLRPLSSSGPVPHCTVNSSATAFVLSTGGFTGNLFHDYTDVLVPAFITAR 240
Query: 127 SHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRM 186
+V ++ W +Y E+ + +++ +I+I+N C+ V +G H +
Sbjct: 241 RF--GGEVQFLVSSFKSWWTNRYLEIFQQLSKYEVIDIDNDDEVRCYGGVVVGPTFHKEL 298
Query: 187 VINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLVNRNARVGRTIL 241
++ + P ++VDF++ L A+ + + +P+L++++R + R L
Sbjct: 299 GVDASKTPAGYSMVDFRAMLRGAFGLSRAAAEPSGDRWDIRRRPRLLIISR--KNSRAFL 356
Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
N R + A LG+DV + EP+ +T ++ R ++S MVGVHGAGLT+ LFL G+VL
Sbjct: 357 NERAMADMAMSLGYDVRVGEPDSNTDVSKFARLVNSADVMVGVHGAGLTNMLFLPAGAVL 416
Query: 302 MQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGA 360
+QVVP G +WL+ FE+P++ + L Y+ YKI+ +E++L E+Y + VL +P +
Sbjct: 417 IQVVPYGGLEWLARGTFEEPSKDMQLHYIGYKIQLDETTLSEQYPKDHPVLTDPLSIHKQ 476
Query: 361 NWSNMR-VYLKTQNVKLDIDRFRIYLKDAYK 390
W ++ VYL+ QNV+ + R ++ +A K
Sbjct: 477 GWEALKTVYLEKQNVRPHLGRLKLTFMEALK 507
>gi|357521535|ref|XP_003631056.1| Glycosyltransferase, putative [Medicago truncatula]
gi|355525078|gb|AET05532.1| Glycosyltransferase, putative [Medicago truncatula]
Length = 568
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 129/379 (34%), Positives = 206/379 (54%), Gaps = 25/379 (6%)
Query: 24 DLCLINGSALFDPKTSTFFSVGH-TDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITT 82
+ C G K+S+ + V T+S S I+ PY RKSD MS V + ++ T
Sbjct: 195 EYCQTQGDIRVHGKSSSVYIVSRKTNSLAENVSWIIR--PYARKSDAYTMSSVTKWSVKT 252
Query: 83 SAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIAD 140
P + C H+ PA++FS GY GN FHEF D +PLF+T F Q V L I D
Sbjct: 253 VKPTHQVSQCTKYHSIPAVIFSTAGYTGNHFHEFSDIVIPLFLTCR-QFNGQ-VQLIITD 310
Query: 141 CNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS-HGRMVINPTLLPKPKTL 199
W K+ L + + + II+I+ HCF V +GL H + I+P ++
Sbjct: 311 KKSWWISKHQAFLKKLSNYEIIDIDRDDELHCFPKVIIGLKRYHKELSIDPQ--KYSYSI 368
Query: 200 VDFQSFLANAYNENTNTSSSF-----HHTKPKLVLVNRNARVGRTILNLREVKKAAEELG 254
DF+ FL ++Y+ ++ KP+L++++R + R+ N ++ K A+ LG
Sbjct: 369 KDFRDFLRSSYSLKRVSAIKIRDIGNQSKKPRLLILSR--KTSRSFTNTNQIAKMAKGLG 426
Query: 255 FDVTIFEPEEST-SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG---TQ 310
F V + E + S+A+ ++SC ++GVHGAGLT+ LFL ++ +QVVP G +
Sbjct: 427 FRVIVMEAGRNMRSIAN---VVNSCDVLMGVHGAGLTNILFLPQNAIFIQVVPFGGMQVE 483
Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
WL+T F +P+ + ++YLEYKI+ +ES+L+++Y + +++K+P + W R VY
Sbjct: 484 WLATNDFARPSEDMNIKYLEYKIRLDESTLIQQYPLDHMIIKDPSSIEKQGWEAFRSVYF 543
Query: 370 KTQNVKLDIDRFRIYLKDA 388
QNV+LD++RFR L+ A
Sbjct: 544 DKQNVRLDVNRFRPTLQKA 562
>gi|357521529|ref|XP_003631053.1| Glycosyltransferase, putative [Medicago truncatula]
gi|355525075|gb|AET05529.1| Glycosyltransferase, putative [Medicago truncatula]
Length = 566
Score = 211 bits (537), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 129/379 (34%), Positives = 206/379 (54%), Gaps = 25/379 (6%)
Query: 24 DLCLINGSALFDPKTSTFFSVGH-TDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITT 82
+ C G K+S+ + V T+S S I+ PY RKSD MS V + ++ T
Sbjct: 193 EYCQTQGDIRVHGKSSSVYIVSRKTNSLAENVSWIIR--PYARKSDAYTMSSVTKWSVKT 250
Query: 83 SAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIAD 140
P + C H+ PA++FS GY GN FHEF D +PLF+T F Q V L I D
Sbjct: 251 VKPTHQVSQCTKYHSIPAVIFSTAGYTGNHFHEFSDIVIPLFLTCR-QFNGQ-VQLIITD 308
Query: 141 CNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS-HGRMVINPTLLPKPKTL 199
W K+ L + + + II+I+ HCF V +GL H + I+P ++
Sbjct: 309 KKSWWISKHQAFLKKLSNYEIIDIDRDDELHCFPKVIIGLKRYHKELSIDPQ--KYSYSI 366
Query: 200 VDFQSFLANAYNENTNTSSSF-----HHTKPKLVLVNRNARVGRTILNLREVKKAAEELG 254
DF+ FL ++Y+ ++ KP+L++++R + R+ N ++ K A+ LG
Sbjct: 367 KDFRDFLRSSYSLKRVSAIKIRDIGNQSKKPRLLILSR--KTSRSFTNTNQIAKMAKGLG 424
Query: 255 FDVTIFEPEEST-SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG---TQ 310
F V + E + S+A+ ++SC ++GVHGAGLT+ LFL ++ +QVVP G +
Sbjct: 425 FRVIVMEAGRNMRSIAN---VVNSCDVLMGVHGAGLTNILFLPQNAIFIQVVPFGGMQVE 481
Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
WL+T F +P+ + ++YLEYKI+ +ES+L+++Y + +++K+P + W R VY
Sbjct: 482 WLATNDFARPSEDMNIKYLEYKIRLDESTLIQQYPLDHMIIKDPSSIEKQGWEAFRSVYF 541
Query: 370 KTQNVKLDIDRFRIYLKDA 388
QNV+LD++RFR L+ A
Sbjct: 542 DKQNVRLDVNRFRPTLQKA 560
>gi|242094138|ref|XP_002437559.1| hypothetical protein SORBIDRAFT_10g029380 [Sorghum bicolor]
gi|241915782|gb|EER88926.1| hypothetical protein SORBIDRAFT_10g029380 [Sorghum bicolor]
Length = 462
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 122/343 (35%), Positives = 191/343 (55%), Gaps = 17/343 (4%)
Query: 59 KTQPYPRKSDKSAMSKVKELTITTSAP--PNLS-----CGVTHTSPALVFSAGGYNGNFF 111
KT+PY R D AM V+E T+ P PN + C H+ P +FS+GG+ GN +
Sbjct: 117 KTKPYARLHDPVAMDDVREFTLVPFGPGSPNGTVVPPLCTRNHSVPGFLFSSGGFAGNLY 176
Query: 112 HEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITH 171
H++ D VPLF + + HF +V +AD D WA K+ L + +R+ +I++NN H
Sbjct: 177 HDYADVLVPLFASTH-HF-GGEVQFLLADIKDWWADKFKPLFRQLSRYDVIDVNNDREVH 234
Query: 172 CFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNEN----TNTSSSFHHTKPKL 227
CF + +G H M I+ + P +T+ DF+ L A+ + + + +P+L
Sbjct: 235 CFPRIVIGSTFHRAMGIDASRSPGGETVADFKRVLRRAFKLERAVASRSGAPRRKDRPRL 294
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
++++R + R +N R + +AA FDV I EP+ T + + R ++S M+GVHGA
Sbjct: 295 LIISRKS--SRRFVNERAMARAAAAAKFDVRIAEPDNHTDMPNFARLVNSADVMMGVHGA 352
Query: 288 GLTHSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGA 346
GLT+ +FL +VL+QVVP G +WL+ V F+ PAR + + Y+EY + EESSL + Y
Sbjct: 353 GLTNMVFLPSRAVLVQVVPFGGLEWLTRVTFKDPARDMDVTYMEYNVSLEESSLRDLYPE 412
Query: 347 NDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
+ LK+P W ++ VYL QNV+L++ RF L+ A
Sbjct: 413 DHFYLKHPYDVHKKGWDAIKTVYLDKQNVRLNLTRFTRTLEQA 455
>gi|212274959|ref|NP_001130300.1| uncharacterized protein LOC100191394 [Zea mays]
gi|194688782|gb|ACF78475.1| unknown [Zea mays]
gi|224030831|gb|ACN34491.1| unknown [Zea mays]
gi|414876588|tpg|DAA53719.1| TPA: hypothetical protein ZEAMMB73_549999 [Zea mays]
Length = 506
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 122/371 (32%), Positives = 200/371 (53%), Gaps = 31/371 (8%)
Query: 29 NGSALFDPKTSTFFSVGHTDSTPSQPSLR--IKTQPYPRKSDKSAMSKVKELTITTSAPP 86
NGSAL + TF P+ PS R K +PY R++ M V +T+T P
Sbjct: 147 NGSAL----SVTF--------VPASPSERREWKVRPYSRRT----MVGVDRVTVTRLGSP 190
Query: 87 N---LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCND 143
+ C V H+ P ++F+ GG GN++H+F D +PL+ + +V+ +++
Sbjct: 191 DDPAAPCAVAHSVPGVLFALGGLTGNYWHDFSDVLMPLYAASRRY--GGEVLFLVSNMQP 248
Query: 144 QWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL-LPKPKTLVDF 202
W KY ++ R +R+ ++++ CF+ +T+GL H + + P L P T DF
Sbjct: 249 WWLGKYEAVVRRLSRYDAVDLDRDARVRCFRRLTVGLRLHKELGVAPELTAPDRLTTADF 308
Query: 203 QSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTI 259
+FL Y ++ T KP+L+L++R R +N+ E+ +AAE GF+V +
Sbjct: 309 TAFLRETYALQRGAPAAVPTTEGRKPRLMLIHRAHY--RRFVNVPEITRAAEAAGFEVAV 366
Query: 260 FEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QWLSTVYFE 318
P + ++ R ++SC A++GVHGAGLT+++FL PG+VL+QVVP G + ++ F
Sbjct: 367 ASPRGDAPVEETARAVNSCDALLGVHGAGLTNAVFLPPGAVLIQVVPYGRLERMARRDFG 426
Query: 319 KPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLD 377
PA +GL YLEY + EES+L+E G V+++P + + W + YL Q+V++D
Sbjct: 427 DPAEDMGLRYLEYSVSAEESTLLEMLGPEHPVIRDPDSVHRSGWDKVAEYYLGKQDVRID 486
Query: 378 IDRFRIYLKDA 388
+DRF L A
Sbjct: 487 VDRFAPTLAQA 497
>gi|162460449|ref|NP_001105165.1| glycosyltransferase [Zea mays]
gi|56409864|emb|CAI30080.1| glycosyltransferase [Zea mays]
gi|414879190|tpg|DAA56321.1| TPA: glycosyltransferase [Zea mays]
Length = 492
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 123/391 (31%), Positives = 207/391 (52%), Gaps = 26/391 (6%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
C + R D C G T T + DS + KT+PY RK D A++ V
Sbjct: 112 CYETSRRSDTCEAAGDVRVVGSTQTVY----VDSLDRE----WKTKPYCRKHDNFALAHV 163
Query: 76 KELTIT---TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
KE ++ + A P C V ++ A V S GG+ GN FH++ D +P FIT +H
Sbjct: 164 KEWSLKPLPSGAAPQ--CTVNSSATAFVLSTGGFTGNPFHDYTDVLIPAFIT--AHPLRG 219
Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
+V ++ W +Y ++ + +RH +++I+ C++SV +G H + ++ +
Sbjct: 220 EVQFLVSSYKSWWMNRYIQIFQQMSRHEVVDIDADDEVRCYRSVVVGATFHRELGVDASK 279
Query: 193 LPKPKTLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLVNRNARVGRTILNLREVK 247
P + DF+ L +A+ T++ +P+L++++R GR +N R +
Sbjct: 280 SPPGYSTADFRKMLRDAFGLERATATPSGDRWDIRRRPRLLIISRRPSRGRAFMNERAMA 339
Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
A LGFDV I EP+ ST + R ++SC MVGVHGAGLT+ +FL G+V++QVVP
Sbjct: 340 DMAASLGFDVRIGEPDTSTDTSKFARLVNSCDVMVGVHGAGLTNMVFLPAGAVVVQVVPY 399
Query: 308 GT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
G +WL+ F +P+ + + YLEY ++ +E++L E+Y ++ VL++P A W+ ++
Sbjct: 400 GRLEWLARNTFAEPSAGMEVHYLEYVVQLDETTLSEQYPSDHPVLRDPMAIHKQGWNALK 459
Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
YL QNV+ + R LK+ + +A K++
Sbjct: 460 TTYLDKQNVRPHLGR----LKNTFLQALKML 486
>gi|414879189|tpg|DAA56320.1| TPA: hypothetical protein ZEAMMB73_962821 [Zea mays]
Length = 488
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 123/391 (31%), Positives = 207/391 (52%), Gaps = 26/391 (6%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
C + R D C G T T + DS + KT+PY RK D A++ V
Sbjct: 108 CYETSRRSDTCEAAGDVRVVGSTQTVY----VDSLDRE----WKTKPYCRKHDNFALAHV 159
Query: 76 KELTIT---TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
KE ++ + A P C V ++ A V S GG+ GN FH++ D +P FIT +H
Sbjct: 160 KEWSLKPLPSGAAPQ--CTVNSSATAFVLSTGGFTGNPFHDYTDVLIPAFIT--AHPLRG 215
Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
+V ++ W +Y ++ + +RH +++I+ C++SV +G H + ++ +
Sbjct: 216 EVQFLVSSYKSWWMNRYIQIFQQMSRHEVVDIDADDEVRCYRSVVVGATFHRELGVDASK 275
Query: 193 LPKPKTLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLVNRNARVGRTILNLREVK 247
P + DF+ L +A+ T++ +P+L++++R GR +N R +
Sbjct: 276 SPPGYSTADFRKMLRDAFGLERATATPSGDRWDIRRRPRLLIISRRPSRGRAFMNERAMA 335
Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
A LGFDV I EP+ ST + R ++SC MVGVHGAGLT+ +FL G+V++QVVP
Sbjct: 336 DMAASLGFDVRIGEPDTSTDTSKFARLVNSCDVMVGVHGAGLTNMVFLPAGAVVVQVVPY 395
Query: 308 GT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
G +WL+ F +P+ + + YLEY ++ +E++L E+Y ++ VL++P A W+ ++
Sbjct: 396 GRLEWLARNTFAEPSAGMEVHYLEYVVQLDETTLSEQYPSDHPVLRDPMAIHKQGWNALK 455
Query: 367 -VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
YL QNV+ + R LK+ + +A K++
Sbjct: 456 TTYLDKQNVRPHLGR----LKNTFLQALKML 482
>gi|356527872|ref|XP_003532530.1| PREDICTED: uncharacterized glycosyltransferase AGO61-like [Glycine
max]
Length = 462
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 130/392 (33%), Positives = 211/392 (53%), Gaps = 26/392 (6%)
Query: 12 KSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSA 71
+ + C R+ + C G K+ST S+ + +T + ++ +PY R+ D A
Sbjct: 80 EELLCTSEERT-EFCQARGDIRVHGKSSTV-SIVSSKTTMLEKTMSRSLKPYARRGDIDA 137
Query: 72 MSKVKELTI-----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN 126
M++V+E ++ + AP C +H A++FS GGY+GN FHEF D +PLF+T
Sbjct: 138 MNRVREWSVKAVNASQKAP---QCTQSHNITAVLFSTGGYSGNHFHEFTDIVIPLFLTAR 194
Query: 127 SHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR- 185
+ +V I D W K+ LL + + + ++I+ HCF SVT+GL + +
Sbjct: 195 QF--NGEVQFIITDKRPWWISKHKPLLKKLSNYETMDIDGDDQVHCFPSVTVGLKRYQKE 252
Query: 186 MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH---HTKPKLVLVNRNARVGRTILN 242
+ I+P ++ DF+ L ++Y + KP+L++++R + R+ N
Sbjct: 253 LSIDPQ--KYSYSMKDFRDLLRSSYALKRVEAMKIRDGLRGKPRLMILSR--KRSRSFTN 308
Query: 243 LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
E+ K A LGFDV + E S+ ++SC ++GVHGAGLT+ LFL +V +
Sbjct: 309 TDEIAKMAASLGFDVIV--KEAGWSMWGFANVVNSCDVLLGVHGAGLTNILFLPENAVFI 366
Query: 303 QVVPIG---TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
QVVP G WL+T F KP++ + L+YLEYKI +ES+L+++Y + + +K+P
Sbjct: 367 QVVPYGGFTLDWLATNDFGKPSKDMNLKYLEYKIGLKESTLIQQYPLDHIFIKDPPLVEK 426
Query: 360 ANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYK 390
W + VYL QNVKLD+DRFR L+ A++
Sbjct: 427 IGWEEFKSVYLDKQNVKLDVDRFRPTLQKAFE 458
>gi|195649525|gb|ACG44230.1| glycosyltransferase [Zea mays]
gi|414876589|tpg|DAA53720.1| TPA: glycosyltransferase [Zea mays]
Length = 505
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 122/371 (32%), Positives = 200/371 (53%), Gaps = 31/371 (8%)
Query: 29 NGSALFDPKTSTFFSVGHTDSTPSQPSLR--IKTQPYPRKSDKSAMSKVKELTITTSAPP 86
NGSAL + TF P+ PS R K +PY R++ M V +T+T P
Sbjct: 146 NGSAL----SVTF--------VPASPSERREWKVRPYSRRT----MVGVDRVTVTRLGSP 189
Query: 87 N---LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCND 143
+ C V H+ P ++F+ GG GN++H+F D +PL+ + +V+ +++
Sbjct: 190 DDPAAPCAVAHSVPGVLFALGGLTGNYWHDFSDVLMPLYAASRRY--GGEVLFLVSNMQP 247
Query: 144 QWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL-LPKPKTLVDF 202
W KY ++ R +R+ ++++ CF+ +T+GL H + + P L P T DF
Sbjct: 248 WWLGKYEAVVRRLSRYDAVDLDRDARVRCFRRLTVGLRLHKELGVAPELTAPDRLTTADF 307
Query: 203 QSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTI 259
+FL Y ++ T KP+L+L++R R +N+ E+ +AAE GF+V +
Sbjct: 308 TAFLRETYALQRGAPAAVPTTEGRKPRLMLIHRAHY--RRFVNVPEITRAAEAAGFEVAV 365
Query: 260 FEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QWLSTVYFE 318
P + ++ R ++SC A++GVHGAGLT+++FL PG+VL+QVVP G + ++ F
Sbjct: 366 ASPRGDAPVEETARAVNSCDALLGVHGAGLTNAVFLPPGAVLIQVVPYGRLERMARRDFG 425
Query: 319 KPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLD 377
PA +GL YLEY + EES+L+E G V+++P + + W + YL Q+V++D
Sbjct: 426 DPAEDMGLRYLEYSVSAEESTLLEMLGPEHPVIRDPDSVHRSGWDKVAEYYLGKQDVRID 485
Query: 378 IDRFRIYLKDA 388
+DRF L A
Sbjct: 486 VDRFAPTLAQA 496
>gi|326512144|dbj|BAJ96053.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 498
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 123/406 (30%), Positives = 210/406 (51%), Gaps = 30/406 (7%)
Query: 4 RPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPY 63
+ A + A C + R D C G ++ T + D+ + KT+PY
Sbjct: 104 KAAVVVDASKPVCYETSRRSDTCEAAGDVRLVGRSQTIY----IDTLERE----WKTRPY 155
Query: 64 PRKSDKSAMSKVKELTIT------TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDC 117
RK D A+S VKE ++ +AP C ++ A V S GG+ GN FH++ D
Sbjct: 156 CRKHDTYALSHVKEWSLKPFPSGDDAAP---KCTSNSSATAFVISTGGFTGNPFHDYTDV 212
Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
+P FIT +H +V ++ W +Y ++ + +R+ +++I+ C+ S
Sbjct: 213 LIPAFIT--AHRFAGEVQFLVSSYKSWWMSRYIQIFQQMSRYEVVDIDADDEVRCYPSAV 270
Query: 178 LGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLVNR 232
+G H + ++P+ P ++ DF+ L NA+ T++ +P+L++++R
Sbjct: 271 VGPTFHKELGVDPSKAPSGASMADFRKMLRNAFGLERATATPSGDRWDIRRRPRLLIISR 330
Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHS 292
GR +N R + A LGFDV I +P+ ++ + R ++SC MVGVHGAGLT+
Sbjct: 331 RTSRGRAFMNERAMADMAGSLGFDVRIGDPDTTSDTSKFARLVNSCDVMVGVHGAGLTNM 390
Query: 293 LFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
+FL G+VL+QVVP G +WL+ F +P+ + + YLEY ++ +E++L E+Y ++ LVL
Sbjct: 391 VFLPAGAVLVQVVPYGRLEWLARNTFAEPSAGMEVHYLEYVVQLDETTLSEQYPSDHLVL 450
Query: 352 KNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
K+P A W ++ YL QNV+ + R LK + +A K++
Sbjct: 451 KDPMAIHKQGWDALKTTYLDKQNVRPHLGR----LKKTFLQALKML 492
>gi|15229594|ref|NP_188446.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
gi|9294074|dbj|BAB02031.1| unnamed protein product [Arabidopsis thaliana]
gi|332642539|gb|AEE76060.1| Glycosyltransferase family 61 protein [Arabidopsis thaliana]
Length = 470
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 125/373 (33%), Positives = 209/373 (56%), Gaps = 18/373 (4%)
Query: 22 SYDLCLINGSALFDPKTSTFFS-VGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTI 80
S + C ++G K++T + V S S +R PY RK AM +V+E T+
Sbjct: 101 SKEFCDVSGDVRIHGKSATVLAAVTFAFSGNSTWYMR----PYARKDQVPAMKRVREWTV 156
Query: 81 TTSAPPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIA 139
+LS C H+ PA++FS GG++ N FH+F D +PL+ T +V +
Sbjct: 157 KLVQNASLSRCVRNHSVPAILFSLGGFSLNNFHDFTDIVIPLYTTARRF--SGEVQFLVT 214
Query: 140 DCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPK-- 197
+ N W K+ EL+ + + + +I I+ + THCF SV +GL H T P
Sbjct: 215 NKNLLWINKFKELVRKLSNYEVIYIDEEDETHCFSSVIVGLNRHRDYDKELTTDPSNSEY 274
Query: 198 TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDV 257
++ DF+ FL + Y+ + ++ KP++++++R+ R +N E+ +AA ++GF V
Sbjct: 275 SMSDFRKFLRDTYSLRNSAVTT--RRKPRILILSRSR--SRAFVNAGEIARAARQIGFKV 330
Query: 258 TIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QWLSTVY 316
+ E +T +A ++SC M+GVHGAG+T+ +FL ++++Q++PIG +WL+ +
Sbjct: 331 VV--AEANTEIASFAITVNSCDVMLGVHGAGMTNMVFLPDNAIVIQILPIGGFEWLAKMD 388
Query: 317 FEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVK 375
FE P++ + L YLEYKI EES+LV++YG + +++P A A W + VYL QNV
Sbjct: 389 FEYPSKGMNLRYLEYKITAEESTLVKQYGRDHEFVRDPLAVAKRGWGTFKSVYLVQQNVS 448
Query: 376 LDIDRFRIYLKDA 388
+DI+RF++ L A
Sbjct: 449 VDINRFKLVLVKA 461
>gi|357161022|ref|XP_003578952.1| PREDICTED: uncharacterized protein LOC100833330 [Brachypodium
distachyon]
Length = 484
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 127/394 (32%), Positives = 210/394 (53%), Gaps = 31/394 (7%)
Query: 15 TCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
C + R D C + G ++ T + D + KT+PY RK D A+S
Sbjct: 104 VCYETSRRSDTCEVAGDVRLVGRSQTIY----VDVLKQE----WKTKPYCRKHDTFALSH 155
Query: 75 VKELTIT----TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
VKE ++ SA P C ++ A V S GG+ GN FH++ D +P FI+ +H
Sbjct: 156 VKEWSLKPAGDGSAVPE--CTSNSSATAFVLSTGGFTGNPFHDYTDVLIPAFIS--AHRF 211
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
+V ++ W KY ++ + +R+ +I+I+ C++SV +G H + ++P
Sbjct: 212 AGEVQFLVSSYKPWWMNKYIQIFQQMSRYEVIDIDADDEVRCYRSVVVGPTFHKELGVDP 271
Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLVNRNARVGRTILNLRE 245
+ ++VDF+ L NA+ T++ +P+L++++R A GR +N R
Sbjct: 272 S---SGISVVDFRKMLRNAFGLERATATPSGDRWDIRRRPRLLIISRRASRGRAFMNERA 328
Query: 246 VKKAAEELGFDVTIFEPEESTSLADSF-RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
+ A LGFDV I +P+ ++ F R ++SC MVGVHGAGLT+ +FL G+VL+QV
Sbjct: 329 MADMAGSLGFDVRIGDPDTGSTDTSKFARLVNSCDVMVGVHGAGLTNMVFLPAGAVLVQV 388
Query: 305 VPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS 363
VP G +WL+ F +P+ + + YLEY ++ +E++L E+Y +N LVLK+P A W
Sbjct: 389 VPYGRLEWLARNTFAEPSAGMEILYLEYVVQLDETTLSEQYPSNHLVLKDPMAIHKQGWD 448
Query: 364 NMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
++ YL QNV+ + R LK+ + +A K++
Sbjct: 449 ALKTTYLDKQNVRPHLGR----LKNTFLQALKML 478
>gi|242052233|ref|XP_002455262.1| hypothetical protein SORBIDRAFT_03g007370 [Sorghum bicolor]
gi|241927237|gb|EES00382.1| hypothetical protein SORBIDRAFT_03g007370 [Sorghum bicolor]
Length = 664
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 123/349 (35%), Positives = 189/349 (54%), Gaps = 21/349 (6%)
Query: 61 QPYPRKSDKSAMSKVKELTI--------TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFH 112
+PYPRK D +AM V+ LT+ T A P +C H PALVFS GY GN+FH
Sbjct: 313 KPYPRKVDPNAMHSVRALTVRSVVAMATATDAAPPPACMDWHDVPALVFSVRGYTGNYFH 372
Query: 113 EFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHC 172
+ D +PLF+T + +V+L + W KY + +++ +++++ HC
Sbjct: 373 AYTDVILPLFLTARQY--SGEVLLLVTGFQAWWVGKYLPVFRSLSKYAAVDLDSDPRVHC 430
Query: 173 FQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTS-----SSFHHTKPKL 227
F+ V +GL SH I+P P +++DF F+ Y + + S +
Sbjct: 431 FRRVQVGLTSHHDFSIDPRRAPNGYSMLDFAQFMRATYALPRDVALSPSPSPERRRRRPR 490
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+LV AR R LN E+ + A ++GF+V + E + +A +++C A+VGVHGA
Sbjct: 491 LLVIARART-RRFLNTEEIVRGARKVGFEVVVSE--GTHEVAPFAELVNTCDAVVGVHGA 547
Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
GLT+ +FL G V++QV+P+G YF PA +GL YLEY+I EES+LV++Y +
Sbjct: 548 GLTNMVFLPRGGVVVQVLPLGPLEFVASYFRGPAGDMGLTYLEYRISPEESTLVDQYPRD 607
Query: 348 DLVLKNPQAFA--GANW-SNMRVYLKTQNVKLDIDRFRIYLKDAYKKAK 393
VL +P + + +W S M VYL Q+V+LD+ RFR LK A + +
Sbjct: 608 HPVLTDPMSLSSKAKDWVSFMGVYLFKQDVRLDMKRFRPVLKKALARLR 656
>gi|413934852|gb|AFW69403.1| glycosyltransferase [Zea mays]
Length = 455
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 122/341 (35%), Positives = 191/341 (56%), Gaps = 15/341 (4%)
Query: 59 KTQPYPRKSDKSAMSKVKELTITTSAPPNLS-----CGVTHTSPALVFSAGGYNGNFFHE 113
+T+PY R+ D AM V+E T+ PN + C TH+ P +FS+GG+ GN +H+
Sbjct: 112 RTKPYARRHDAVAMDDVREFTLVPFGGPNDTAVPPLCTRTHSVPGFLFSSGGFAGNLYHD 171
Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
+ D VPLF + N H +V +AD D WA K+ L + +R+ +I++NN HCF
Sbjct: 172 YADVLVPLFASTN-HL-GGEVQFLLADIKDWWADKFRPLFRQLSRYDVIDVNNDREVHCF 229
Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNEN----TNTSSSFHHTKPKLVL 229
+ +G H M I+P+ P T+ DF+ L A+ + + + +P+L++
Sbjct: 230 PRIIIGSTFHRAMGIDPSRSPGGVTVADFKRLLRRAFRLERAVASRSGAPRRRDRPRLLI 289
Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGL 289
++R + R +N R + +AA FDV I EP+ T + + R ++S M+GVHGAGL
Sbjct: 290 ISRKS--SRRFVNERAMARAAAAARFDVRIAEPDNHTDMPNFARLVNSADVMMGVHGAGL 347
Query: 290 THSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
T+ +FL +VL+QVVP G +WL+ V F+ PAR + + Y+EY + EESSL + Y +
Sbjct: 348 TNMVFLPSRAVLVQVVPFGGLEWLTRVTFKDPARDMDVTYMEYNVSLEESSLRDLYPEDH 407
Query: 349 LVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
LK+P W ++ VYL QNV+L++ RF L+ A
Sbjct: 408 FYLKHPYDVHKKGWDAIKTVYLDKQNVRLNLTRFTRTLEQA 448
>gi|115482674|ref|NP_001064930.1| Os10g0492200 [Oryza sativa Japonica Group]
gi|10140717|gb|AAG13551.1|AC023240_24 unknown protein [Oryza sativa Japonica Group]
gi|31432816|gb|AAP54403.1| glycosyltransferase, putative, expressed [Oryza sativa Japonica
Group]
gi|113639539|dbj|BAF26844.1| Os10g0492200 [Oryza sativa Japonica Group]
gi|215766456|dbj|BAG98764.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222613059|gb|EEE51191.1| hypothetical protein OsJ_31998 [Oryza sativa Japonica Group]
Length = 500
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 122/360 (33%), Positives = 197/360 (54%), Gaps = 25/360 (6%)
Query: 59 KTQPYPRKSDKSAMSKVKELTI---------TTSAPPNLSCGVTHTSPALVFSAGGYNGN 109
KT+PY RK D A+S VKE T+ A C T+ A V S GG+ GN
Sbjct: 144 KTKPYCRKHDAFALSHVKEWTLRPLPSGDGGGGGAAVAPRCTTNSTATAFVLSTGGFTGN 203
Query: 110 FFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTI 169
FH++ D +P FIT +H +V ++ W KY ++ + +RH +++++
Sbjct: 204 PFHDYTDVLIPAFIT--AHRFGGEVQFLVSSYKSWWMNKYIQIFQQMSRHDVVDVDADGD 261
Query: 170 T-HCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH-----HT 223
C++S +G H + ++PT P +++DF+ L A+ + T++
Sbjct: 262 EVRCYRSAVVGPEFHRELGVDPTKTPSGYSVLDFRKMLRGAFGLDRATATPSGDRWDIRR 321
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
+P+L++++R A GR +N R + A LGFDV + EP+ ST + R ++SC MVG
Sbjct: 322 RPRLLIISRRAARGRAFMNERAMADMAASLGFDVRVGEPDASTDTSKFARLVNSCDVMVG 381
Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVE 342
VHGAGLT+ +FL G+VL+QVVP G +WL+ F +P+ + + YLEY ++ +E++L E
Sbjct: 382 VHGAGLTNMVFLPAGAVLVQVVPYGKLEWLARNTFAEPSSAMEIHYLEYAVQLDETTLSE 441
Query: 343 KYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM--DKE 399
+Y A+ VL++P A W ++ YL QNV+ + R LK+ + +A KL+ DKE
Sbjct: 442 QYPADHPVLRDPMAIHKQGWEALKTTYLDKQNVRPHLGR----LKNTFLQALKLLPHDKE 497
>gi|414868242|tpg|DAA46799.1| TPA: hypothetical protein ZEAMMB73_066234 [Zea mays]
Length = 468
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 126/393 (32%), Positives = 203/393 (51%), Gaps = 22/393 (5%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ C+ S+ + C + G K++T + V + P + IK +PY RK ++ M
Sbjct: 77 LICNLSNHQSNTCTMKGDVRIHGKSATVYVVSASTYCPENST--IKLRPYARKWEEQVML 134
Query: 74 KVKELTITTSAPPNLS----------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
V+E+T+ +S+PP C V H PA+VFS GGYN NFFH D +PL++
Sbjct: 135 LVREVTVRSSSPPAGGGSAHDPPPPQCSVRHDMPAVVFSTGGYNRNFFHVMTDVIIPLYL 194
Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
T + D V L D +W KY +L + +P+I+++ + CFQS +GL SH
Sbjct: 195 TAREY--DGHVQLLATDYEPKWIAKYKAILAALSSYPVIDMDTEDTVRCFQSAHVGLESH 252
Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYN----ENTNTSSSFHHTKPKLVLVNRNARVGRT 239
+ I P L T+V F+ F+ +AY+ T S S KP+LV+V R R R
Sbjct: 253 KELGIVPALSRNGYTMVSFRDFIRSAYSLQRARVTPVSRSTTGRKPRLVMVLR--RNSRQ 310
Query: 240 ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGS 299
+ N + AA +GF+V P++ + L ++SC ++GVHGAGL + LFL +
Sbjct: 311 LKNEADAIAAAAGVGFEVVAAGPDDVSDLERFPGVVNSCDVLMGVHGAGLANMLFLPHNA 370
Query: 300 VLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFA 358
++Q++P G +W + P +GL YL+Y+ EE++L E Y + V +P +
Sbjct: 371 TVVQIIPWGELRWACRHSYGDPVPDMGLRYLDYEASAEETTLKETYPRDHAVFTDPLSIH 430
Query: 359 GANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYK 390
+ M +++ Q+V +DIDRF ++K Y+
Sbjct: 431 HQGFDKMWNIFINGQHVIVDIDRFTGFMKQLYQ 463
>gi|357123674|ref|XP_003563533.1| PREDICTED: uncharacterized protein LOC100826633 [Brachypodium
distachyon]
Length = 455
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 128/378 (33%), Positives = 195/378 (51%), Gaps = 28/378 (7%)
Query: 15 TCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK 74
TC + + + C G FD S + P R T+PY R+ D AM
Sbjct: 82 TCYNTSKRSERCAAVGDIRFDGNHSKIYI------NPLDKEWR--TKPYARRHDAVAMDD 133
Query: 75 VKELTI---------TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
V+E T+ TT PP C H+ PA +FS+GG+ GN +H++ D VPLF T
Sbjct: 134 VREFTLLPFDTESSNTTVVPP--LCTRNHSVPAFLFSSGGFAGNLYHDYTDVLVPLF-TS 190
Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR 185
HF + V + D D W K+ L + + + +I+ +N HCF+ + +G H
Sbjct: 191 THHFRGE-VQFLLTDIKDWWLDKFTPLFRQLSNYDVIDADNDQQVHCFRRIIIGATFHRA 249
Query: 186 MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE 245
M I+P P +T+ DF+ L +A++ +S P+L++++R + R LN R
Sbjct: 250 MGIDPKRSPGGETVADFKRLLRHAFHLTRPVAS---RDNPRLLIISRKS--SRRFLNERA 304
Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
+ AA FDV I EP+ T + + R ++S M+GVHGAGLT+ +FL +VL+QVV
Sbjct: 305 MAHAAALAKFDVRIAEPDNHTDMPNFARLVNSADIMMGVHGAGLTNMVFLPSRAVLLQVV 364
Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
P G +WLS V F+ PA+ + + Y+EY + EESSL Y LK+P W
Sbjct: 365 PFGGLEWLSRVTFKDPAKDMDVNYMEYNVSLEESSLRNLYPEGHFYLKHPYDVHKKGWDA 424
Query: 365 MR-VYLKTQNVKLDIDRF 381
++ VYL Q+V+L++ +F
Sbjct: 425 IKTVYLDKQSVRLNLTKF 442
>gi|388494572|gb|AFK35352.1| unknown [Lotus japonicus]
Length = 197
Score = 207 bits (527), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 98/189 (51%), Positives = 133/189 (70%)
Query: 208 NAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTS 267
N+ NT +S H KP+L+L++R+ R ILN EV K AEE+GF+V + EP +S
Sbjct: 5 NSSRLNTTDQNSDEHPKPRLILLSRSGNASRVILNQDEVIKLAEEVGFNVHVLEPSRKSS 64
Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE 327
+A+ + IH+ H ++GVHGAGLT+SLFLRPGSVL+QVVPIGT W S Y+EKP +LGL+
Sbjct: 65 MANIYNMIHTSHVLLGVHGAGLTNSLFLRPGSVLVQVVPIGTDWASKTYYEKPTEILGLQ 124
Query: 328 YLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKD 387
Y+EYKI+ ESSL YGA+ LV+K+P + +N R+YLK QN++++I RFR L
Sbjct: 125 YIEYKIEANESSLSLSYGADSLVIKDPATYLKEKGANKRIYLKKQNLEINIFRFRKCLAK 184
Query: 388 AYKKAKKLM 396
AY+KAK M
Sbjct: 185 AYEKAKIFM 193
>gi|56409862|emb|CAI30079.1| glycosyltransferase [Saccharum officinarum]
Length = 484
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 114/348 (32%), Positives = 194/348 (55%), Gaps = 18/348 (5%)
Query: 59 KTQPYPRKSDKSAMSKVKELTIT---TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFM 115
KT+PY RK D A++ VKE ++ + A P C V ++ A V S GG+ GN FH++
Sbjct: 139 KTKPYCRKHDNFALAHVKEWSLKPLPSGAAPQ--CTVNSSATAFVLSTGGFTGNPFHDYT 196
Query: 116 DCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQS 175
D +P FIT +H +V ++ W +Y ++ + +RH +++I+ C++S
Sbjct: 197 DVLIPAFIT--AHPLRGEVQFLVSSYKSWWMNRYIQIFQQMSRHEVVDIDADDEVRCYRS 254
Query: 176 VTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLV 230
V +G H + ++ + P + DF+ L +A+ T++ +P+L+++
Sbjct: 255 VVVGPTFHRELGVDASKSPSGYSTADFRKMLRDAFGLERATATPSGDRWDIRRRPRLLII 314
Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
+R GR +N R + A LGFDV I EP+ ST + R ++SC MVGVHGAGLT
Sbjct: 315 SRRPSRGRAFMNERAMADMAASLGFDVRIGEPDSSTDTSKFARLVNSCDVMVGVHGAGLT 374
Query: 291 HSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDL 349
+ +FL G+V++QVVP G +WL+ F +P+ + + YLEY ++ +E++L E+Y ++
Sbjct: 375 NMVFLPAGAVVVQVVPYGRLEWLARNTFAEPSAGMEVHYLEYVVQMDETTLSEQYPSDHP 434
Query: 350 VLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
VL++P A + + R YL QNV+ + R LK+ + +A K++
Sbjct: 435 VLRDPMAIHKQGYRALNRTYLDKQNVRPHLGR----LKNTFLQALKML 478
>gi|147840299|emb|CAN66138.1| hypothetical protein VITISV_006754 [Vitis vinifera]
Length = 395
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 112/316 (35%), Positives = 169/316 (53%), Gaps = 37/316 (11%)
Query: 107 NGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININN 166
NGN +HEF D +PL+IT S ++ V+ I + +D W KY +++ + + +P I+ +
Sbjct: 69 NGNIYHEFNDGILPLYIT--SQHLNKRVVFVILEYHDWWITKYGDVISQLSDYPPIDFSG 126
Query: 167 QTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY---------------- 210
THCF +GL H + ++ +L+ +++ DF++ L AY
Sbjct: 127 DNRTHCFPEAIVGLRIHDELTVDSSLVEGNESIRDFRNLLDQAYLPRIRSLIQAKEQKVQ 186
Query: 211 ----------------NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELG 254
E KPKLV+++R R I N + + A+E+G
Sbjct: 187 SKMKEEPSLPPSLKPPLETGKEEQVDQLKKPKLVVLSRTG--ARAITNEDLMVQMAKEIG 244
Query: 255 FDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLST 314
F V + P +T LA +R ++S AMVGVHGA +TH LFL+PGSV +QV+P+GT+W +
Sbjct: 245 FQVKVLRPNRATELAKIYRVLNSSDAMVGVHGAAMTHILFLQPGSVFIQVIPLGTEWPAE 304
Query: 315 VYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQN 373
Y+ +PA LGL+Y+ YKI ESSL +Y +D VL++P++ A W RVYL Q
Sbjct: 305 TYYGEPAEKLGLKYMGYKILPRESSLYNEYAKDDPVLRDPESLAKKGWEFTKRVYLDRQT 364
Query: 374 VKLDIDRFRIYLKDAY 389
V LD+ RFR L AY
Sbjct: 365 VTLDLRRFRKQLVCAY 380
>gi|125581917|gb|EAZ22848.1| hypothetical protein OsJ_06527 [Oryza sativa Japonica Group]
Length = 481
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 128/393 (32%), Positives = 208/393 (52%), Gaps = 28/393 (7%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
C S R D C G ++ T +P + ++K PY RK D A+S V
Sbjct: 99 CYESSRRSDTCEATGDVRVHGRSQTIHI------SPLEQEWKVK--PYCRKHDAFALSHV 150
Query: 76 KE-----LTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
KE L+ + P C V ++ A V S GG+ GN FH++ D +P FIT +H
Sbjct: 151 KEWALRPLSTADTYPAVPHCTVNSSATAFVLSTGGFTGNPFHDYTDVLIPAFIT--AHRF 208
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
+V ++ W +Y ++ R +++ +++I+N C++SV +G H + ++
Sbjct: 209 AGEVQFLVSSFKSWWTNRYMQIFQRLSKYELVDIDNDDEVRCYRSVVVGPTFHKELGVDA 268
Query: 191 TLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLRE 245
+ P ++VDF++ L A E S +P+L++++R R R LN R
Sbjct: 269 SRTPSGYSMVDFRTMLRGALGLERAVAEPSGDRWDIRRRPRLLIISR--RNSRAFLNERA 326
Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
+ A LGF V + EP+ ST ++ R ++S MVGVHGAGLT+ +FL G+VL+QVV
Sbjct: 327 MADMAMSLGFYVRLGEPDISTDVSKFARLVNSADVMVGVHGAGLTNMVFLPAGAVLIQVV 386
Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
P G +WL+ F++PA + + YLEY I+ +E++L E+Y +D VLK+P + W+
Sbjct: 387 PYGGLEWLARGTFKEPAADMQIHYLEYMIQLDETTLSEQYPKDDPVLKDPYSIHKQGWNA 446
Query: 365 MR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
++ VYL QNV+ + R LK+ + +A KL+
Sbjct: 447 LKMVYLDKQNVRPHLGR----LKNTFMEALKLL 475
>gi|414876581|tpg|DAA53712.1| TPA: hypothetical protein ZEAMMB73_101479 [Zea mays]
Length = 335
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 114/332 (34%), Positives = 178/332 (53%), Gaps = 11/332 (3%)
Query: 72 MSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
M V+ LT+ + +C H PALVFS GY GN+FH F D +PLF+T +
Sbjct: 1 MRVVRVLTVRSVPGEAPACTDRHGVPALVFSDRGYTGNYFHAFTDVILPLFLTARQY--A 58
Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
+V L + D W K+A + + + +++++ HCF+ V +GL SH I+P
Sbjct: 59 GEVRLLVTDLQAWWVGKFAPVFRSISNYELVDLDRDPRVHCFRHVQVGLTSHDDFSIDPR 118
Query: 192 LLPKPKTLVDFQSFLANAYN----ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
P +++DF F+ AY + + +P+L+L+ R AR R +N E+
Sbjct: 119 RAPNGYSMLDFTGFMRAAYGLPRGDVAAAAGPSSKRRPRLLLIAR-ART-RRFVNAEEIV 176
Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
+ AE+LGF+V + E + +A +SC A++GVHGAGLT+ +F+ G V++QVVP+
Sbjct: 177 RGAEKLGFEVVV--SEGTHEVAPFAELANSCDAIMGVHGAGLTNMVFVPTGGVVIQVVPL 234
Query: 308 GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR- 366
G YF P+R +GL YLEY+I EES+L+ +Y + + +P W +++
Sbjct: 235 GGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLINQYPRDHPIFTDPNGIKSKGWESLKD 294
Query: 367 VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
YL Q+V LD+ RFR LK A +K K
Sbjct: 295 AYLDKQDVSLDMKRFRPTLKKAIAHIRKARAK 326
>gi|46390169|dbj|BAD15602.1| putative HGA1 [Oryza sativa Japonica Group]
gi|46390975|dbj|BAD16510.1| putative HGA1 [Oryza sativa Japonica Group]
gi|215686668|dbj|BAG88921.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 503
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 128/393 (32%), Positives = 208/393 (52%), Gaps = 28/393 (7%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
C S R D C G ++ T +P + ++K PY RK D A+S V
Sbjct: 121 CYESSRRSDTCEATGDVRVHGRSQTIHI------SPLEQEWKVK--PYCRKHDAFALSHV 172
Query: 76 KE-----LTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
KE L+ + P C V ++ A V S GG+ GN FH++ D +P FIT +H
Sbjct: 173 KEWALRPLSTADTYPAVPHCTVNSSATAFVLSTGGFTGNPFHDYTDVLIPAFIT--AHRF 230
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
+V ++ W +Y ++ R +++ +++I+N C++SV +G H + ++
Sbjct: 231 AGEVQFLVSSFKSWWTNRYMQIFQRLSKYELVDIDNDDEVRCYRSVVVGPTFHKELGVDA 290
Query: 191 TLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLRE 245
+ P ++VDF++ L A E S +P+L++++R R R LN R
Sbjct: 291 SRTPSGYSMVDFRTMLRGALGLERAVAEPSGDRWDIRRRPRLLIISR--RNSRAFLNERA 348
Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
+ A LGF V + EP+ ST ++ R ++S MVGVHGAGLT+ +FL G+VL+QVV
Sbjct: 349 MADMAMSLGFYVRLGEPDISTDVSKFARLVNSADVMVGVHGAGLTNMVFLPAGAVLIQVV 408
Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
P G +WL+ F++PA + + YLEY I+ +E++L E+Y +D VLK+P + W+
Sbjct: 409 PYGGLEWLARGTFKEPAADMQIHYLEYMIQLDETTLSEQYPKDDPVLKDPYSIHKQGWNA 468
Query: 365 MR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
++ VYL QNV+ + R LK+ + +A KL+
Sbjct: 469 LKMVYLDKQNVRPHLGR----LKNTFMEALKLL 497
>gi|115445889|ref|NP_001046724.1| Os02g0331200 [Oryza sativa Japonica Group]
gi|46390170|dbj|BAD15603.1| putative HGA1 [Oryza sativa Japonica Group]
gi|46390974|dbj|BAD16509.1| putative HGA1 [Oryza sativa Japonica Group]
gi|113536255|dbj|BAF08638.1| Os02g0331200 [Oryza sativa Japonica Group]
gi|215712340|dbj|BAG94467.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737036|dbj|BAG95965.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 128/393 (32%), Positives = 208/393 (52%), Gaps = 28/393 (7%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
C S R D C G ++ T +P + ++K PY RK D A+S V
Sbjct: 122 CYESSRRSDTCEATGDVRVHGRSQTIHI------SPLEQEWKVK--PYCRKHDAFALSHV 173
Query: 76 KE-----LTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
KE L+ + P C V ++ A V S GG+ GN FH++ D +P FIT +H
Sbjct: 174 KEWALRPLSTADTYPAVPHCTVNSSATAFVLSTGGFTGNPFHDYTDVLIPAFIT--AHRF 231
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
+V ++ W +Y ++ R +++ +++I+N C++SV +G H + ++
Sbjct: 232 AGEVQFLVSSFKSWWTNRYMQIFQRLSKYELVDIDNDDEVRCYRSVVVGPTFHKELGVDA 291
Query: 191 TLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLRE 245
+ P ++VDF++ L A E S +P+L++++R R R LN R
Sbjct: 292 SRTPSGYSMVDFRTMLRGALGLERAVAEPSGDRWDIRRRPRLLIISR--RNSRAFLNERA 349
Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
+ A LGF V + EP+ ST ++ R ++S MVGVHGAGLT+ +FL G+VL+QVV
Sbjct: 350 MADMAMSLGFYVRLGEPDISTDVSKFARLVNSADVMVGVHGAGLTNMVFLPAGAVLIQVV 409
Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
P G +WL+ F++PA + + YLEY I+ +E++L E+Y +D VLK+P + W+
Sbjct: 410 PYGGLEWLARGTFKEPAADMQIHYLEYMIQLDETTLSEQYPKDDPVLKDPYSIHKQGWNA 469
Query: 365 MR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
++ VYL QNV+ + R LK+ + +A KL+
Sbjct: 470 LKMVYLDKQNVRPHLGR----LKNTFMEALKLL 498
>gi|226531412|ref|NP_001151219.1| glycosyltransferase [Zea mays]
gi|195645114|gb|ACG42025.1| glycosyltransferase [Zea mays]
Length = 455
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 121/341 (35%), Positives = 190/341 (55%), Gaps = 15/341 (4%)
Query: 59 KTQPYPRKSDKSAMSKVKELTITTSAPPNLS-----CGVTHTSPALVFSAGGYNGNFFHE 113
+T+PY R+ D AM V+E T+ PN + C TH+ P +FS+GG+ GN +H+
Sbjct: 112 RTKPYARRHDAVAMDDVREFTLVPFGGPNDTAVPPLCTRTHSVPGFLFSSGGFAGNLYHD 171
Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
+ D VPLF + N H +V +AD D WA K+ + + +R+ +I++NN HCF
Sbjct: 172 YADVLVPLFASTN-HL-GGEVQFLLADIKDWWADKFRPVFRQLSRYDVIDVNNDREVHCF 229
Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNEN----TNTSSSFHHTKPKLVL 229
+G H M I+P+ P T+ DF+ L A+ + + + +P+L++
Sbjct: 230 PRTIIGSTFHRAMGIDPSRSPGGVTVADFKRLLRRAFRLERAVASRSGAPRRRDRPRLLI 289
Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGL 289
++R + R +N R + +AA FDV I EP+ T + + R ++S M+GVHGAGL
Sbjct: 290 ISRKS--SRRFVNERAMARAAAAARFDVRIAEPDNHTDMPNFARLVNSADVMMGVHGAGL 347
Query: 290 THSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
T+ +FL +VL+QVVP G +WL+ V F+ PAR + + Y+EY + EESSL + Y +
Sbjct: 348 TNMVFLPSRAVLVQVVPFGGLEWLTRVTFKDPARDMDVTYMEYNVSLEESSLRDLYPEDH 407
Query: 349 LVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
LK+P W ++ VYL QNV+L++ RF L+ A
Sbjct: 408 FYLKHPYDVHKKGWDAIKTVYLDKQNVRLNLTRFTRTLEQA 448
>gi|218184799|gb|EEC67226.1| hypothetical protein OsI_34143 [Oryza sativa Indica Group]
Length = 504
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 123/363 (33%), Positives = 198/363 (54%), Gaps = 31/363 (8%)
Query: 59 KTQPYPRKSDKSAMSKVKELTI------------TTSAPPNLSCGVTHTSPALVFSAGGY 106
KT+PY RK D A+S VKE T+ AP C T+ A V S GG+
Sbjct: 148 KTKPYCRKHDAFALSHVKEWTLRPLPSGDGDGGGAAVAP---RCTTNSTATAFVLSTGGF 204
Query: 107 NGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININN 166
GN FH++ D +P FIT +H +V ++ W KY ++ + +RH +++++
Sbjct: 205 TGNPFHDYTDVLIPAFIT--AHRFGGEVQFLVSSYKSWWMNKYIQIFQQMSRHDVVDVDA 262
Query: 167 QTIT-HCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH---- 221
C++S +G H + ++PT P +++DF+ L A+ + T++
Sbjct: 263 DGDEVRCYRSAVVGPEFHRELGVDPTKTPSGYSVLDFRKMLRGAFGLDRATATPSGDRWD 322
Query: 222 -HTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHA 280
+P+L++++R A GR L+ R + A LGFDV + EP+ ST + R ++SC
Sbjct: 323 IRRRPRLLIISRRAARGRAFLHERAMADMAASLGFDVRVGEPDASTDTSKFARLVNSCDV 382
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
MVGVHGAGLT+ +FL G+VL+QVVP G +WL+ F +P+ + + YLEY ++ +E++
Sbjct: 383 MVGVHGAGLTNMVFLPAGAVLVQVVPYGKLEWLARNTFAEPSSAMEIHYLEYAVQLDETT 442
Query: 340 LVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM-- 396
L E+Y A+ VL++P A W ++ YL QNV+ + R LK+ + +A KL+
Sbjct: 443 LSEQYPADHPVLRDPMAIHKQGWEALKTTYLDKQNVRPHLGR----LKNTFLQALKLLPH 498
Query: 397 DKE 399
DKE
Sbjct: 499 DKE 501
>gi|218190631|gb|EEC73058.1| hypothetical protein OsI_07015 [Oryza sativa Indica Group]
Length = 534
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 118/376 (31%), Positives = 202/376 (53%), Gaps = 21/376 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S+ +D+C + G A ++ST V ++ + + R + + +
Sbjct: 158 CDLSNPRFDICELCGDARTIGQSSTVVYVPQNRASNGEEWII-------RAQSRKHLPWI 210
Query: 76 KELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
K++TI + S+ P C H PA+VF+ GG N +H+F D VPLF+T ++D
Sbjct: 211 KKVTIKSVNSSEPEPICTSKHHIPAIVFALGGLTANVWHDFSDVLVPLFLTARQF--NRD 268
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
V L I + + +KY+ + R TRH II+ ++ C+ V +GL SH + I+P+
Sbjct: 269 VQLIITNNQPWFIKKYSAIFSRLTRHEIIDFDSDGQIRCYPHVIVGLRSHRDLGIDPSSS 328
Query: 194 PKPKTLVDFQSFLANAYNENT-------NTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
P+ T+VDF+ F+ AY KP+++L++R R +N+ V
Sbjct: 329 PQNYTMVDFRLFVREAYGLPAAEVDIPYKADKDDPDKKPRIMLIDRGK--SRRFVNVAHV 386
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+ + GF+V +P+ ++L + R + SC A++GVHGAGLT+ +FLR G V++ +VP
Sbjct: 387 VQGLDWFGFEVVKADPKIDSNLDEFVRLVDSCDAIMGVHGAGLTNMVFLRSGGVVVHIVP 446
Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM- 365
G ++++ ++ PAR +GL ++EY I EES+L+EKYG N V+ +P+ W +
Sbjct: 447 YGIKFMADGFYGAPARDMGLRHVEYSISPEESTLLEKYGWNHTVINDPETIRKGGWEKVA 506
Query: 366 RVYLKTQNVKLDIDRF 381
Y+ Q++ L++ RF
Sbjct: 507 EFYMSKQDIVLNMTRF 522
>gi|115487966|ref|NP_001066470.1| Os12g0238900 [Oryza sativa Japonica Group]
gi|113648977|dbj|BAF29489.1| Os12g0238900, partial [Oryza sativa Japonica Group]
Length = 520
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 127/393 (32%), Positives = 201/393 (51%), Gaps = 42/393 (10%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S D+C + G D S F V P+ +++ PYPRK D ++M +V
Sbjct: 138 CDLSDHRTDVCDLAGDIRMDANASAFVVVVDPAVGADGPTYKVR--PYPRKGDATSMGRV 195
Query: 76 KELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVI 135
E+T+ T+A A+VFS GY GN FH+F D VPL+ T + DV
Sbjct: 196 TEITVRTTA-------------AVVFSISGYTGNLFHDFTDVIVPLYNTAARYC--GDVQ 240
Query: 136 LAIADCN---DQWARKYAELLPRFTRHPIININNQTI-----THCFQSVTLGLISHGRMV 187
L + D N +W +Y +L +RH +++ + HCF +GL +HG ++
Sbjct: 241 LVVTDGNAATRRWLARYGAVLRGLSRHVPLDLAAEAAAGGGEVHCFGHTVVGLRAHGELI 300
Query: 188 INPTLLPKPKTLVDFQSFLANAYN--ENTNTSSSFHHT-------KPKLVLVNRNARVGR 238
I+ P + DF FL A + + T H +P+L++++R R R
Sbjct: 301 IDRERSPDGLGMPDFTRFLRRALSLPRDAPTRPGGGHGDATKPQPRPRLLIISR--RGTR 358
Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSF----RFIHSCHAMVGVHGAGLTHSLF 294
+LN V +AAE++GF+ E + + + D R ++S A+VGVHGAGLT+ +F
Sbjct: 359 LLLNTDAVARAAEQVGFEAVASELDMAGADHDDVARVARLVNSFDAVVGVHGAGLTNMVF 418
Query: 295 LRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKN 353
L PG+ +Q+VP G +WL+ F +PA +GL Y++Y++ ES+L +KY + + N
Sbjct: 419 LPPGAAAVQIVPWGGLRWLARADFGEPAVAMGLRYIQYEVAAGESTLKDKYPRDHEIFTN 478
Query: 354 PQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYL 385
P A ++ MR +L Q++ +DIDRF+ L
Sbjct: 479 PTALHKKGFTFMRHTFLNGQDIIVDIDRFKPVL 511
>gi|242094622|ref|XP_002437801.1| hypothetical protein SORBIDRAFT_10g002800 [Sorghum bicolor]
gi|241916024|gb|EER89168.1| hypothetical protein SORBIDRAFT_10g002800 [Sorghum bicolor]
Length = 499
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 128/393 (32%), Positives = 204/393 (51%), Gaps = 34/393 (8%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPYPRKSDKSAMSK 74
C S R D C G VG + P R K +PY RK D A+S
Sbjct: 120 CYESSRRSDTCEAAGDVRV---------VGRAQTVLVSPLEREWKVKPYCRKHDAFALSH 170
Query: 75 VKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
VKE T+ + AP C V ++ A V S GG+ GN FH++ D +P FIT +
Sbjct: 171 VKEWTLRPVGSDDAP---RCTVNSSATAFVLSTGGFTGNLFHDYTDVLIPAFITARRYA- 226
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
DV L ++ W KY ++L + +RH +++ + C+ V +G H + ++
Sbjct: 227 -GDVQLLVSSYKPWWTTKYLQVLQQLSRHEVVDADADAEVRCYPRVVVGPTFHRELGVDA 285
Query: 191 TLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLRE 245
+ ++ +F++ L +A+ E + S +P+L++++R R R +LN R
Sbjct: 286 S--SSSPSMPEFRAMLRDAFGLERAAAAPSGDRWDIRRRPRLLIISR--RTSRRLLNERA 341
Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
+ A LGFDV +PE ST + R ++S MVGVHG GLT+ +FL G+VL+QVV
Sbjct: 342 MADMATSLGFDVRTGDPEVSTDVGRFARLVNSADVMVGVHGDGLTNMVFLPAGAVLVQVV 401
Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
P G +WL+ F +PA + + YLEY ++++E++L E+YG +D V+++P A W
Sbjct: 402 PYGGLEWLARGTFREPAEGMEVHYLEYVVQKDETTLSEEYGEDDPVIRDPAAIHKQGWDA 461
Query: 365 MR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
++ VYL QNV+ + R LK+ + +A KL+
Sbjct: 462 LKAVYLDKQNVRPHLGR----LKNTFVQALKLL 490
>gi|222622742|gb|EEE56874.1| hypothetical protein OsJ_06512 [Oryza sativa Japonica Group]
Length = 549
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 118/376 (31%), Positives = 202/376 (53%), Gaps = 21/376 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S+ +D+C + G A ++ST V ++ + + R + + +
Sbjct: 173 CDLSNPRFDICELCGDARTIGQSSTVVYVPQNRASNGEEWII-------RAQSRKHLPWI 225
Query: 76 KELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
K++TI + S+ P C H PA+VF+ GG N +H+F D VPLF+T ++D
Sbjct: 226 KKVTIKSVNSSEPEPICTSKHHIPAIVFALGGLTANVWHDFSDVLVPLFLTARQF--NRD 283
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
V L I + + +KY+ + R TRH II+ ++ C+ V +GL SH + I+P+
Sbjct: 284 VQLIITNNQPWFIKKYSAIFSRLTRHEIIDFDSDGQIRCYPHVIVGLRSHRDLGIDPSSS 343
Query: 194 PKPKTLVDFQSFLANAYNENT-------NTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
P+ T+VDF+ F+ AY KP+++L++R R +N+ V
Sbjct: 344 PQNYTMVDFRLFVREAYGLPAAEVDIPYKADKDDPDKKPRIMLIDRGK--SRRFVNVAHV 401
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+ + GF+V +P+ ++L + R + SC A++GVHGAGLT+ +FLR G V++ +VP
Sbjct: 402 VQGLDWFGFEVVKADPKIDSNLDEFVRLVDSCDAIMGVHGAGLTNMVFLRSGGVVVHIVP 461
Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM- 365
G ++++ ++ PAR +GL ++EY I EES+L+EKYG N V+ +P+ W +
Sbjct: 462 YGIKFMADGFYGAPARDMGLRHVEYSISPEESTLLEKYGWNHTVINDPETIRKGGWEKVA 521
Query: 366 RVYLKTQNVKLDIDRF 381
Y+ Q++ L++ RF
Sbjct: 522 EFYMSKQDIVLNMTRF 537
>gi|54291090|dbj|BAD61765.1| putative HGA6 [Oryza sativa Japonica Group]
gi|125596993|gb|EAZ36773.1| hypothetical protein OsJ_21109 [Oryza sativa Japonica Group]
Length = 526
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 126/403 (31%), Positives = 207/403 (51%), Gaps = 29/403 (7%)
Query: 7 ALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRK 66
A + ++C+ S D C ++G ++ + V +D P + I+ PYPRK
Sbjct: 125 AAAKTSHLSCNFSSAHMDTCAMDGDIRIHGRSGVVYVVASSDYRPENATAVIR--PYPRK 182
Query: 67 SDKSAMSKVKELTITTSAPPN--------------LSCGVTHTSPALVFSAGGYNGNFFH 112
+++ M +V+++TI ++APP L C V PA+VFS GGY+ NFFH
Sbjct: 183 WEQATMERVRQITIRSTAPPGAAVADTDGGGAIIPLRCTVARDMPAVVFSTGGYSVNFFH 242
Query: 113 EFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHC 172
D +PL+IT H V L A+ + +W KY L + +P+++++ C
Sbjct: 243 TMNDILLPLYITAREH--GGRVQLLAANYDRRWTAKYQHALAALSMYPVVDLDADAAVRC 300
Query: 173 FQSVTLGLISHGRMVIN-PTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVN 231
F S +G+ SH + I+ P T+V F +FL +AY+ + + +P++V+V
Sbjct: 301 FPSARVGVESHRVLGIDTPLTGSNGYTMVGFLAFLRSAYSLPRHAVTRTTPRRPRVVMVL 360
Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
R R R + N EV A E GF+V PEE+ +A ++SC MVGVHGAGLT+
Sbjct: 361 R--RKSRALTNEAEVVAAVAEAGFEVVAAGPEEAGDVAGFAATVNSCDVMVGVHGAGLTN 418
Query: 292 SLFLRPGSVLMQVVPI-GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
+FL ++Q++P G +W + +P +GL Y+EY++ E++L E+Y + V
Sbjct: 419 MVFLPRNGTVVQIIPWGGMKWPCWYDYGEPVPAMGLRYVEYEVAANETTLRERYPMDHPV 478
Query: 351 LKNPQAF--AGAN--WSNMRVYLKTQNVKLDIDRFRIYLKDAY 389
+P + G N WS +L QN+ LD++RF+ + + Y
Sbjct: 479 FADPVSIHRKGFNHLWST---FLNGQNLTLDVNRFKAVMAEVY 518
>gi|125555079|gb|EAZ00685.1| hypothetical protein OsI_22705 [Oryza sativa Indica Group]
Length = 526
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 127/402 (31%), Positives = 209/402 (51%), Gaps = 28/402 (6%)
Query: 7 ALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRK 66
A + ++C+ S D C ++G ++ + V +D P + I+ PYPRK
Sbjct: 126 AAAKTSHLSCNFSSAHMDTCAMDGDIRIHGRSGVVYVVASSDYRPENATAVIR--PYPRK 183
Query: 67 SDKSAMSKVKELTITTSAPPN-----------LSCGVTHTSPALVFSAGGYNGNFFHEFM 115
+++ M +V+++TI ++APP L C V PA+VFS GGY+ NFFH
Sbjct: 184 WEQATMERVRQITIRSTAPPGADTDGGGAIIPLRCTVARDMPAVVFSTGGYSVNFFHTMN 243
Query: 116 DCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQS 175
D +PL+IT H V L A+ + +W KY L + +P+++++ CF S
Sbjct: 244 DILLPLYITAREH--GGRVQLLAANYDRRWTAKYQHALAALSMYPVVDLDADAAVRCFPS 301
Query: 176 VTLGLISHGRMVIN-PTLLPKPKTLVDFQSFLANAYN--ENTNTSSSFHHTKPKLVLVNR 232
+G+ SH + I+ P T+V F +FL +AY+ + T+ + +P++V+V R
Sbjct: 302 ARVGVESHRVLGIDTPLTGSNGYTMVGFLAFLRSAYSLPRHAVTTHTPSPRRPRVVMVLR 361
Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHS 292
R R + N EV A E GF+V PEE+ +A ++SC MVGVHGAGLT+
Sbjct: 362 --RKSRALTNEAEVVAAVAEAGFEVVAAGPEEAGDVAGFAATVNSCDVMVGVHGAGLTNM 419
Query: 293 LFLRPGSVLMQVVPI-GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
+FL ++Q++P G +W + +P +GL Y+EY++ E++L E+Y + V
Sbjct: 420 VFLPRNGTVVQIIPWGGMKWPCWYDYGEPVPAMGLRYVEYEVAANETTLRERYPMDHPVF 479
Query: 352 KNPQAF--AGAN--WSNMRVYLKTQNVKLDIDRFRIYLKDAY 389
+P + G N WS +L QN+ LD++RF+ + + Y
Sbjct: 480 ADPVSIHRKGFNHLWST---FLNGQNLTLDVNRFKAVMAEVY 518
>gi|56409854|emb|CAI30075.1| glycosyltransferase [Triticum aestivum]
Length = 516
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 129/388 (33%), Positives = 206/388 (53%), Gaps = 32/388 (8%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ C D C + G + T V T + S K QPY R++ MS
Sbjct: 127 VVCTTEGPFSDTCDVFGDVRTNGTAHTVTLVPATQTE----SREWKIQPYARRT----MS 178
Query: 74 KVKELTIT----TSAP-PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
+ E+T+T TSA P +C VTH P +VF+ GG GN+FH+F D VPL I +
Sbjct: 179 GISEVTVTQLDSTSAEYPAPACTVTHNIPGIVFALGGLTGNYFHDFSDALVPLVIASRGY 238
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
+V L ++ W KY L+ R +++ ++++++ CF SVT+GL H I
Sbjct: 239 --GGEVQLLASNIQPWWLGKYEALVQRLSKYDVVDLDHDDQIRCFPSVTVGLNMHKEFNI 296
Query: 189 NPTLLPK--PKTLVDFQSFLANAY-----------NENTNTSSSFHHTKPKLVLVNRNAR 235
P L+P P ++++F +FL Y N+ ++ +L+L++R
Sbjct: 297 VPELVPGGVPLSMLNFTAFLRETYSLPRAAPIRLTNKKSSPPVDGKKRSRRLMLLDRGHY 356
Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R ++N+ E+ KAAE+ GF+VTI +P + + + ++S ++GVHGAGLT+S FL
Sbjct: 357 --RKLVNVPEIVKAAEKAGFEVTIADPRFNVRVKELALSVNSFDVLLGVHGAGLTNSAFL 414
Query: 296 RPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
PG+V++QVVP G + ++ F PA +GL YLEY I EES+L+E G + V+K+P
Sbjct: 415 PPGAVVIQVVPYGNLEHMAKREFGDPAANMGLRYLEYSITAEESTLLEMLGPDHPVIKDP 474
Query: 355 QAFAGANWSNM-RVYLKTQNVKLDIDRF 381
++ + W + YL Q+V++D+ RF
Sbjct: 475 ESVHRSGWDKVAEYYLGKQDVRVDVQRF 502
>gi|115445865|ref|NP_001046712.1| Os02g0327700 [Oryza sativa Japonica Group]
gi|46390538|dbj|BAD16025.1| putative HGA1 [Oryza sativa Japonica Group]
gi|46390878|dbj|BAD16395.1| putative HGA1 [Oryza sativa Japonica Group]
gi|113536243|dbj|BAF08626.1| Os02g0327700 [Oryza sativa Japonica Group]
gi|215701502|dbj|BAG92926.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 554
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 119/376 (31%), Positives = 204/376 (54%), Gaps = 21/376 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S+ +D+C + G A ++ST V ++ + + I+ Q + + +
Sbjct: 178 CDLSNPRFDICELCGDARTIGQSSTVVYVPQNRASNGEEWI-IRAQ------SRKHLPWI 230
Query: 76 KELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
K++TI + S+ P C H PA+VF+ GG N +H+F D VPLF+T ++D
Sbjct: 231 KKVTIKSVNSSEPEPICTSKHHIPAIVFALGGLTANVWHDFSDVLVPLFLTARQF--NRD 288
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLL 193
V L I + + +KY+ + R TRH II+ ++ C+ V +GL SH + I+P+
Sbjct: 289 VQLIITNNQPWFIKKYSAIFSRLTRHEIIDFDSDGQIRCYPHVIVGLRSHRDLGIDPSSS 348
Query: 194 PKPKTLVDFQSFLANAYNENT-------NTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
P+ T+VDF+ F+ AY KP+++L++R R +N+ V
Sbjct: 349 PQNYTMVDFRLFVREAYGLPAAEVDIPYKADKDDPDKKPRIMLIDRGK--SRRFVNVAHV 406
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+ + GF+V +P+ ++L + R + SC A++GVHGAGLT+ +FLR G V++ +VP
Sbjct: 407 VQGLDWFGFEVVKADPKIDSNLDEFVRLVDSCDAIMGVHGAGLTNMVFLRSGGVVVHIVP 466
Query: 307 IGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM- 365
G ++++ ++ PAR +GL ++EY I EES+L+EKYG N V+ +P+ W +
Sbjct: 467 YGIKFMADGFYGAPARDMGLRHVEYSISPEESTLLEKYGWNHTVINDPETIRKGGWEKVA 526
Query: 366 RVYLKTQNVKLDIDRF 381
Y+ Q++ L++ RF
Sbjct: 527 EFYMSKQDIVLNMTRF 542
>gi|125555301|gb|EAZ00907.1| hypothetical protein OsI_22937 [Oryza sativa Indica Group]
Length = 524
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 126/403 (31%), Positives = 209/403 (51%), Gaps = 28/403 (6%)
Query: 4 RPAALSPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSL 56
+P+ +I+CD + + +C + G PK T F V + + +
Sbjct: 126 KPSGDEAEATISCDENGVDEGFPYARPPVCELTGDIRISPKEKTMFFVNPSSAGAFDGNG 185
Query: 57 RIKTQPYPRKSDKSAMSKVKELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEF 114
K +PY RK D + V E+ I + + P ++ C TH PA+VFS GY NFFH+
Sbjct: 186 EKKIRPYARKDD-FLLPGVVEVIIKSVSSPAIAPACTRTHNVPAVVFSVAGYTDNFFHDN 244
Query: 115 MDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQ 174
D +PLF+T SH + V I + W K+ LL + + + +IN + HCF+
Sbjct: 245 TDVMIPLFLT-TSHLAGE-VQFLITNFKPWWVHKFTPLLKKLSNYGVINFDKDDEVHCFR 302
Query: 175 SVTLGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVL 229
LGL ++I+P T P+ ++VD+ FL A+ ++ + KPK+++
Sbjct: 303 RGHLGLYRDRDLIISPHPTRNPRNYSMVDYNRFLRRAFGLPRDSPAVLGDKTGAKPKMLM 362
Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHG 286
+ R R +LNLR+V E+LGF VT+ E + AD F +++ ++ VHG
Sbjct: 363 IERKGT--RKLLNLRDVAALCEDLGFAVTVAE-----AGADVRGFAEKVNAADVLLAVHG 415
Query: 287 AGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYG 345
AGLT+ +FL G+VL+Q+VP G W++T ++ +PAR + L Y+EY + +EE++L +KY
Sbjct: 416 AGLTNQIFLPTGAVLVQIVPWGKMDWMATNFYGQPARDMRLRYVEYYVSEEETTLKDKYP 475
Query: 346 ANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
+ V K+P A W + + Q+V +++ RF+ +L A
Sbjct: 476 RDHYVFKDPMAIHAQGWPALAEIVMKQDVTVNVTRFKPFLLKA 518
>gi|297724843|ref|NP_001174785.1| Os06g0470150 [Oryza sativa Japonica Group]
gi|51090436|dbj|BAD35388.1| putative HGA4 [Oryza sativa Japonica Group]
gi|125597196|gb|EAZ36976.1| hypothetical protein OsJ_21315 [Oryza sativa Japonica Group]
gi|215769459|dbj|BAH01688.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255677040|dbj|BAH93513.1| Os06g0470150 [Oryza sativa Japonica Group]
Length = 524
Score = 204 bits (520), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 126/403 (31%), Positives = 209/403 (51%), Gaps = 28/403 (6%)
Query: 4 RPAALSPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSL 56
+P+ +I+CD + + +C + G PK T F V + + +
Sbjct: 126 KPSGDEAEATISCDENGVDEGFPYARPPVCELTGDIRISPKEKTMFFVNPSSAGAFDGNG 185
Query: 57 RIKTQPYPRKSDKSAMSKVKELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEF 114
K +PY RK D + V E+ I + + P ++ C TH PA+VFS GY NFFH+
Sbjct: 186 EKKIRPYARKDD-FLLPGVVEVIIKSVSSPAIAPACTRTHNVPAVVFSVAGYTDNFFHDN 244
Query: 115 MDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQ 174
D +PLF+T SH + V I + W K+ LL + + + +IN + HCF+
Sbjct: 245 TDVMIPLFLT-TSHLAGE-VQFLITNFKPWWVHKFTPLLKKLSNYGVINFDKDDEVHCFR 302
Query: 175 SVTLGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVL 229
LGL ++I+P T P+ ++VD+ FL A+ ++ + KPK+++
Sbjct: 303 RGHLGLYRDRDLIISPHPTRNPRNYSMVDYNRFLRRAFGLPRDSPAVLGDKTGAKPKMLM 362
Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHG 286
+ R R +LNLR+V E+LGF VT+ E + AD F +++ ++ VHG
Sbjct: 363 IERKGT--RKLLNLRDVAALCEDLGFAVTVAE-----AGADVRGFAEKVNAADVLLAVHG 415
Query: 287 AGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYG 345
AGLT+ +FL G+VL+Q+VP G W++T ++ +PAR + L Y+EY + +EE++L +KY
Sbjct: 416 AGLTNQIFLPTGAVLVQIVPWGKMDWMATNFYGQPARDMRLRYVEYYVSEEETTLKDKYP 475
Query: 346 ANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
+ V K+P A W + + Q+V +++ RF+ +L A
Sbjct: 476 RDHYVFKDPMAIHAQGWPALAEIVMKQDVTVNVTRFKPFLLKA 518
>gi|242052289|ref|XP_002455290.1| hypothetical protein SORBIDRAFT_03g007900 [Sorghum bicolor]
gi|241927265|gb|EES00410.1| hypothetical protein SORBIDRAFT_03g007900 [Sorghum bicolor]
Length = 465
Score = 204 bits (520), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 131/394 (33%), Positives = 197/394 (50%), Gaps = 27/394 (6%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ CD S D+C + G P + + S R+K P+ RK+D+ A++
Sbjct: 75 VLCDFSSSRSDMCELKGDVRVLPNATIVLR----HPWARRQSWRMK--PHGRKNDRHALA 128
Query: 74 KVKELTITTS-----APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
+V E+T+ +S A C ++T+PA+VFS GGY GN FH+ D VPLFIT
Sbjct: 129 RVTEVTVASSHSHHTAGAAPRCTASYTAPAVVFSVGGYAGNMFHDLTDVLVPLFITTRRF 188
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ--NQTITHCFQSVTLGLISHGRM 186
DV L ++D W K+ LL +RH ++++N C+ V LGL H M
Sbjct: 189 --GGDVHLLVSDAQPWWLDKFRPLLGALSRHAVVDMNRGGSRGVLCYPHVILGLKFHKEM 246
Query: 187 VINPTLLPKPKTLVDFQSFLANAYNENTNTS--------SSFHHTKPKLVLVNRNARVGR 238
++ ++ DF +Y +T+ SS +P+L+L++R + R
Sbjct: 247 SVDAARTAGEYSMADFTLLARRSYGLTRDTAIRLGDGNRSSSAAVRPRLLLISRKST--R 304
Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
N V +AA LGF+V + EP L R ++SC +VGVHGAGL + +FL G
Sbjct: 305 AFTNAGAVARAAAALGFEVVVGEPARHADLPSFARVVNSCDVLVGVHGAGLANLVFLPAG 364
Query: 299 SVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
+V++QVVP+G ++ F PA GL Y+ Y I EESSL +Y + VL++P A
Sbjct: 365 AVVVQVVPLGGLDAMAAEDFGAPATDAGLRYVHYGIAVEESSLARRYPRDHRVLRDPAAV 424
Query: 358 AGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYK 390
W +R YL QNV +D+ RF L+ A +
Sbjct: 425 RREGWMALRAAYLVGQNVTIDVRRFGGALRRAME 458
>gi|357123670|ref|XP_003563531.1| PREDICTED: uncharacterized protein LOC100826010 [Brachypodium
distachyon]
Length = 476
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 133/419 (31%), Positives = 217/419 (51%), Gaps = 46/419 (10%)
Query: 10 PAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQ--------------PS 55
P ++CD S D C + G+ TS F V +T + +
Sbjct: 59 PEPHVSCDFSGNRSDYCELTGAIRIRGSTSEVFLVTPRRATAGEFLGPNATGWGMIAANA 118
Query: 56 LRIKTQPYPRKSDKSAMSKVKELTITTSA-----PPNLSCGVTHTS-PALVFSAGGYNGN 109
+ QPY RK + M+ + +LT+ S+ PP +C V H PA+V+S GGY GN
Sbjct: 119 TSWEMQPYTRKGESRIMTGITKLTVRLSSTAEDSPP--ACDVMHDDVPAVVYSNGGYCGN 176
Query: 110 FFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTI 169
++H+F D +PLFIT H + L + W KYAE++ ++H ++++
Sbjct: 177 YYHDFNDNIIPLFITAR-HLAGE-AQLVVTQKQAWWFGKYAEIVSGLSKHEPVDMDADAR 234
Query: 170 THCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPK 226
HC++ +GL SH + I+P P ++VDF+ FL Y+ E+ + H KP+
Sbjct: 235 VHCYRRAFVGLRSHKDLSIDPRRAPNNVSMVDFKRFLMWRYSLPREHAIRTDDEHQRKPR 294
Query: 227 LVLVNRNARVGRTILNLREVKKAAEELGFDVTIFE-------PEESTS--------LADS 271
L++V R +R R +NL E+ AAEE+GF+VT + ES++ +A++
Sbjct: 295 LLIVTRRSR--RRFVNLEEIVAAAEEVGFEVTASDLITSSKKKGESSADDSKMQARMAEA 352
Query: 272 FRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLE 330
+++ AM+ VHG+GLT+ LFL +V++QVVP+G + L+ + P R + + Y++
Sbjct: 353 SATVNAYDAMLAVHGSGLTNLLFLPMNAVVVQVVPLGRMEGLAMDEYGVPPRDMNMRYIQ 412
Query: 331 YKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
Y I EES+L E Y V +P +WS ++ +YL Q+V+LD++RFR L+ A
Sbjct: 413 YNITAEESTLSEMYPRGHPVFLDPGPIHKQSWSLVKDIYLGKQDVRLDLNRFRPVLQKA 471
>gi|357112811|ref|XP_003558200.1| PREDICTED: uncharacterized protein LOC100844505 isoform 2
[Brachypodium distachyon]
Length = 543
Score = 204 bits (519), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 125/404 (30%), Positives = 212/404 (52%), Gaps = 23/404 (5%)
Query: 9 SPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPS-QPSLRIKT 60
+P +TCD + + +C ++G PK T + V + + + +
Sbjct: 145 TPPAKMTCDENGVDEGFPYARPAVCELSGDIRVSPKDKTMYLVNPSGAAAGFDENGEKRL 204
Query: 61 QPYPRKSDKSAMSKVKELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCF 118
+PY RK D+ + V E+T+ + SA C H PA+VFS GY NFFH+ D
Sbjct: 205 RPYARK-DEFLLPAVVEVTVKSVPSASGAPRCTKRHRVPAVVFSVAGYTDNFFHDNTDAL 263
Query: 119 VPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQT--ITHCFQSV 176
+PLF+T +H +V L I + W +KY +L + + + +IN +++ HCF
Sbjct: 264 IPLFLT-TAHL-KGEVQLLITNYKPWWVQKYTPVLRKLSNYDVINFDDEDGGAVHCFPDG 321
Query: 177 TLGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVN 231
LGL ++I+P T P+ T+VD+ FL A E +P++++++
Sbjct: 322 YLGLYRDRDLIISPHPTRNPRNYTMVDYNKFLRGALELPREKPAVLGEEPGMRPRMLIIS 381
Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
R+ R +LNL EV AA ELGF+VT+ E + ++S ++ VHGAGLT+
Sbjct: 382 RSGT--RRLLNLDEVSAAASELGFNVTVAEAGGEADVPAFAAMVNSADVLLAVHGAGLTN 439
Query: 292 SLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
+FL +V++Q+VP G W++T ++ +PAR + L Y+EY + +EE+SL +KY + +V
Sbjct: 440 QIFLPTNAVVLQIVPWGNMDWMATNFYGQPAREMQLRYVEYYVGEEETSLKDKYPRDHVV 499
Query: 351 LKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
++P+A W + + Q+V++D+ RFR +L A K ++
Sbjct: 500 FRDPKALHTQGWETLADTIMKQDVQVDLSRFRPFLLQAIDKLQE 543
>gi|242056793|ref|XP_002457542.1| hypothetical protein SORBIDRAFT_03g009110 [Sorghum bicolor]
gi|241929517|gb|EES02662.1| hypothetical protein SORBIDRAFT_03g009110 [Sorghum bicolor]
Length = 574
Score = 204 bits (519), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 136/420 (32%), Positives = 203/420 (48%), Gaps = 36/420 (8%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFF----SVGHTDSTPSQPSLRIKT 60
P + P+ +I CDRS D+C ++G D + + G S+ + RI+
Sbjct: 145 PCSSLPSHTICCDRSDYHSDVCFMSGDVRTDAASLSLLLFPPPQGQGASSTAVTEERIR- 203
Query: 61 QPYPRKSDKSAMSKVKELTI--------TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFH 112
PY RK D + E+ + +A C V H +P V +AGGY+ N FH
Sbjct: 204 -PYTRKWDAYITKTIHEVRLRRVVRTEAEAAAGAAHRCDVRHDAPVFVVTAGGYSHNMFH 262
Query: 113 EFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHC 172
F D F+PL++T + + V+LA+ + +WA Y E+L +R+ +I++ T THC
Sbjct: 263 VFNDGFLPLWLT--AQHLRRRVVLAVLSYSPRWAGTYGEILSGLSRYRVIDLLRDTQTHC 320
Query: 173 FQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSF----------HH 222
F +G H + +N T L +T+VDF FLA Y++ S
Sbjct: 321 FPGAVVGTRYHDYLAVNSTRLRDNRTIVDFHDFLAGVYDDGGGGGGSSSTTEETTPAPRD 380
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
+P+L +V+R R R I N V A +GFDV + E L+ + + S A+V
Sbjct: 381 RRPRLGIVSRKGR--RVIENQAAVAALAASVGFDVDVMETATGVPLSAVYASVSSYDALV 438
Query: 283 GVHGAGLTHSLFLRP---GSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
GVHGA LT LFLRP G+ L+Q+ P+G LS F PA +GL Y +Y + ESS
Sbjct: 439 GVHGADLTAFLFLRPGGGGAALVQIAPLGIAMLSRNLFGVPAARMGLRYEQYDVSARESS 498
Query: 340 LVEKYGANDLVLKNP----QAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
L +Y A +V+ +P + W + VYL+ QNV LD+ RFR L + + K
Sbjct: 499 LSRRYPAGHVVVADPARARREQGKQEWELVEHVYLRGQNVSLDLGRFRETLARIHSRLLK 558
>gi|357112809|ref|XP_003558199.1| PREDICTED: uncharacterized protein LOC100844505 isoform 1
[Brachypodium distachyon]
Length = 565
Score = 204 bits (518), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 125/404 (30%), Positives = 212/404 (52%), Gaps = 23/404 (5%)
Query: 9 SPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPS-QPSLRIKT 60
+P +TCD + + +C ++G PK T + V + + + +
Sbjct: 167 TPPAKMTCDENGVDEGFPYARPAVCELSGDIRVSPKDKTMYLVNPSGAAAGFDENGEKRL 226
Query: 61 QPYPRKSDKSAMSKVKELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCF 118
+PY RK D+ + V E+T+ + SA C H PA+VFS GY NFFH+ D
Sbjct: 227 RPYARK-DEFLLPAVVEVTVKSVPSASGAPRCTKRHRVPAVVFSVAGYTDNFFHDNTDAL 285
Query: 119 VPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQT--ITHCFQSV 176
+PLF+T +H +V L I + W +KY +L + + + +IN +++ HCF
Sbjct: 286 IPLFLT-TAHL-KGEVQLLITNYKPWWVQKYTPVLRKLSNYDVINFDDEDGGAVHCFPDG 343
Query: 177 TLGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVN 231
LGL ++I+P T P+ T+VD+ FL A E +P++++++
Sbjct: 344 YLGLYRDRDLIISPHPTRNPRNYTMVDYNKFLRGALELPREKPAVLGEEPGMRPRMLIIS 403
Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
R+ R +LNL EV AA ELGF+VT+ E + ++S ++ VHGAGLT+
Sbjct: 404 RSGT--RRLLNLDEVSAAASELGFNVTVAEAGGEADVPAFAAMVNSADVLLAVHGAGLTN 461
Query: 292 SLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
+FL +V++Q+VP G W++T ++ +PAR + L Y+EY + +EE+SL +KY + +V
Sbjct: 462 QIFLPTNAVVLQIVPWGNMDWMATNFYGQPAREMQLRYVEYYVGEEETSLKDKYPRDHVV 521
Query: 351 LKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
++P+A W + + Q+V++D+ RFR +L A K ++
Sbjct: 522 FRDPKALHTQGWETLADTIMKQDVQVDLSRFRPFLLQAIDKLQE 565
>gi|308080932|ref|NP_001183369.1| uncharacterized protein LOC100501781 [Zea mays]
gi|238011052|gb|ACR36561.1| unknown [Zea mays]
gi|413934851|gb|AFW69402.1| hypothetical protein ZEAMMB73_940636 [Zea mays]
Length = 488
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 137/424 (32%), Positives = 211/424 (49%), Gaps = 52/424 (12%)
Query: 10 PAKSITCDRSHRSYDLCLINGSALFDPKTSTFF----------SVGHTDSTPSQPSLRIK 59
P + CD + D+C + G+ TS F + T P S +
Sbjct: 71 PEPRVRCDFADPRSDVCELEGAIRVRGSTSEVFVLAPARADGLAANVTGLAPGTTSWTV- 129
Query: 60 TQPYPRKSDKSAMSKVKELTITT----SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFM 115
QPY RK + M + LT+ +APP C V H PA+V+S GGY GN++H+F
Sbjct: 130 -QPYTRKGEARVMRGISTLTVRVVPAGAAPP---CTVRHDVPAVVYSNGGYCGNYYHDFN 185
Query: 116 DCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIINI--NNQTITHCF 173
D +PLF+T H +V L +A W KY E++ T + +++ + + CF
Sbjct: 186 DNIIPLFVTAR-HL-GGEVQLLVAQKQAWWFGKYREIVDGLTNYEAVDLGADAEGEVRCF 243
Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY-----------NENTNTSSSFHH 222
++ TLGL SH + I+P P+ ++VDF+ FL Y E
Sbjct: 244 RAATLGLRSHKDLSIDPRRAPRNLSMVDFKRFLMWRYALPREHAIRTDEEEAGPGPGRAR 303
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFE--------------PEESTSL 268
T+P+L++V R +R R +NL E+ AEE+GFDVT + E + +
Sbjct: 304 TRPRLLVVARRSR--RRFVNLPEIVALAEEVGFDVTASDLMSGTASKSKSGAGDEGHSRM 361
Query: 269 ADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLE 327
AD+ + ++S AMV VHG+GLT+ +FL +V++QVVP+G + L+ + P R + +
Sbjct: 362 ADASKLVNSFDAMVAVHGSGLTNLVFLPMNAVVVQVVPLGRMEGLAMDEYGVPPRDMNMR 421
Query: 328 YLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLK 386
YL+Y I EES+L E Y VL +P +W+ ++ VYL Q+V+LD+ RFR L
Sbjct: 422 YLQYNITAEESTLSEVYPRAHPVLMDPMPIHEQSWTLVKDVYLGKQDVRLDVRRFRPVLL 481
Query: 387 DAYK 390
A +
Sbjct: 482 KAIQ 485
>gi|56409860|emb|CAI30078.1| glycosyltransferase [Sorghum bicolor]
Length = 491
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 121/392 (30%), Positives = 207/392 (52%), Gaps = 27/392 (6%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
C + R D C G + T + DS + KT+PY RK D A++ V
Sbjct: 110 CYETSRRSDTCEAAGDVRVLGSSQTVY----VDSLDRE----WKTKPYCRKHDNFALAHV 161
Query: 76 KELTIT---TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
KE ++ + A P C V ++ A V S GG+ GN FH++ D +P FIT +H
Sbjct: 162 KEWSLKPLPSGAAPQ--CTVNSSATAFVLSTGGFTGNPFHDYTDVLIPAFIT--AHPLRG 217
Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
+V ++ W +Y ++ + +RH +++I+ C+++V +G H + ++ +
Sbjct: 218 EVQFLVSSYKSWWMNRYIQIFQQMSRHEVVDIDADDEVRCYRNVVVGPTFHRELGVDASK 277
Query: 193 LPKPK-TLVDFQSFLANAYNENTNTSSSFH-----HTKPKLVLVNRNARVGRTILNLREV 246
P P + DF+ L +A+ T++ +P+L++++R GR +N R +
Sbjct: 278 SPSPGYSTADFRKMLRDAFGLERATATPSGDRWDIRRRPRLLIISRRPSRGRAFMNERAM 337
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
A LGFDV I EP+ ST + R ++S MVGVHGAGLT+ +FL G+V++QVVP
Sbjct: 338 ADMAASLGFDVRIGEPDSSTDTSKFARLVNSADVMVGVHGAGLTNMVFLPAGAVVVQVVP 397
Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
G +WL+ F +P+ + + YLEY ++ +E++L E+Y ++ VL++P A W+ +
Sbjct: 398 YGRLEWLARNTFAEPSAGMEVHYLEYVVQLDETTLSEQYPSDHPVLRDPMAIHKQGWNAL 457
Query: 366 R-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
+ YL QNV+ + R LK+ + +A K++
Sbjct: 458 KTTYLDKQNVRPHLGR----LKNTFLQALKML 485
>gi|383100768|emb|CCG47999.1| glycosyltransferase, HGA-like,putative,expressed [Triticum
aestivum]
Length = 450
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 130/388 (33%), Positives = 204/388 (52%), Gaps = 27/388 (6%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S D+C + G P T ++ H + S+ S R+K P+ RK+D A++ V
Sbjct: 73 CDMSSLRSDVCELKGDVRVIPSNITIIALLHPSVSESRRSWRMK--PHARKNDGHALASV 130
Query: 76 KEL--TITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
E+ ++T S+P C +PA+VFS G Y GN FH+F D +PLFIT S F D
Sbjct: 131 TEVLVSVTPSSPHVPECKAESAAPAVVFSVGAYAGNMFHDFTDVLIPLFIT-ASRF-RSD 188
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTI-THCFQSVTLGLISHGRMVINPT- 191
V L ++D W KY LL + H +I+++ Q+ C+ V +GL H M I+
Sbjct: 189 VHLLVSDAPPWWLDKYRPLLRGLSHHAVIDMDRQSAEVLCYPHVVVGLSFHKEMSIDTAK 248
Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSSSFHHT--------KPKLVLVNRNARVGRTILNL 243
+ ++ DF +Y +T+ H +P+L++++R + R N+
Sbjct: 249 TVGGHYSMADFARLARRSYGLERDTAIRLLHGSDNIKSPRRPRLLIISR--KTTRAFTNM 306
Query: 244 REVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
V +AA LG++V + E E+ + L R ++SC +VGVHGAGLT+ +FL PG+V++Q
Sbjct: 307 GAVAQAAAMLGYEVIVGEAEQHSDLPAFARLVNSCDVLVGVHGAGLTNLVFLPPGAVVVQ 366
Query: 304 VVPI-GTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW 362
VVP+ G + ++ F +PA +GL Y++Y I ES+L E Y + VL+ +
Sbjct: 367 VVPLGGLEAMARDDFGEPAGDMGLGYVQYGISVGESTLAELYPRDRRVLR--------DL 418
Query: 363 SNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
+ YL +QNV LD+ RF L A +
Sbjct: 419 ALRSEYLVSQNVTLDVARFSGALSRALE 446
>gi|326508758|dbj|BAJ95901.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516460|dbj|BAJ92385.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326523597|dbj|BAJ92969.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 462
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 121/341 (35%), Positives = 186/341 (54%), Gaps = 16/341 (4%)
Query: 59 KTQPYPRKSDKSAMSKVKELTI------TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFH 112
+T+PY R+ D AM V+E + SA P L C H+ PA +FS+GG+ GN +H
Sbjct: 117 RTKPYARRHDAVAMDDVREFALLPFGGGNDSAVPPL-CTRNHSVPAFLFSSGGFAGNLYH 175
Query: 113 EFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHC 172
++ D VPLF T HF +V + D D W K+ L + + + +I+++N HC
Sbjct: 176 DYTDVLVPLF-TSTHHF-GGEVQFLLTDIKDWWLDKFTPLFRQLSNYDVIDVDNDQEVHC 233
Query: 173 FQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVL 229
F + +G H M I+ T P +T+ DF+ L A+ + +S S KP+L++
Sbjct: 234 FPRIVIGSTFHRPMGIDGTRSPGGETVADFKRLLRRAFRLDRVVASHDGSASLGKPRLLI 293
Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGL 289
++R + R LN R + AA FDV I EP+ T + + R ++S M+GVHGAGL
Sbjct: 294 ISRKS--SRRFLNERAMAHAAALAQFDVRIAEPDNHTDMPNFARLVNSADVMMGVHGAGL 351
Query: 290 THSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
T+ +FL +VL+QVVP G +WLS V F+ PA+ + Y+EY + EESSL Y +
Sbjct: 352 TNMVFLPSRAVLLQVVPFGGLEWLSRVTFKDPAKDFDVTYMEYNVSLEESSLKNLYPKDH 411
Query: 349 LVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
L++P W+ ++ VYL Q+V+LD+ + L+ A
Sbjct: 412 FYLQHPYDVHKKGWNAIKTVYLDKQSVRLDLAKLTRTLEHA 452
>gi|357153314|ref|XP_003576411.1| PREDICTED: uncharacterized protein LOC100832680 [Brachypodium
distachyon]
Length = 525
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 127/413 (30%), Positives = 205/413 (49%), Gaps = 34/413 (8%)
Query: 6 AALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPR 65
AA + +++C+ S ++C + G K ST + V +D P ++ I+ PYPR
Sbjct: 114 AAPAMRSNLSCNFSSVRMNICGMEGDVRIQGKASTVYVVSVSDYRPENSTVIIR--PYPR 171
Query: 66 KSDKSAMSKVKELTITTSAPPNL------------------SCGVTHTSPALVFSAGGYN 107
K + M V+E+T+ ++APP C VTH PA+VFS GGY+
Sbjct: 172 KWEIPTMELVREITVRSTAPPREPGSSATNAMEDDTAPPAPRCTVTHDVPAVVFSTGGYS 231
Query: 108 GNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQ 167
NFFH D VPL+ T + + V L + + +W K+ +L + +P+I+++
Sbjct: 232 MNFFHAMTDVVVPLYNTARRY--EGRVQLLATNYDRKWIAKFRHVLGALSSYPVIDLDAD 289
Query: 168 TITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---- 223
CF S +G+ SH + I+P L T++DF+ FL +AY+ + ++
Sbjct: 290 AAVRCFPSAHVGIESHMELSIDPALSFHGNTMMDFRDFLRSAYSLKRSWTTPVSRNNSSS 349
Query: 224 ----KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCH 279
KP+LV++ R R R + N + AA E GF+V PE + ++SC
Sbjct: 350 SSSRKPRLVILLR--RHSRAMTNEGDAMAAATEAGFEVVPAGPEVFGDMGKFAEVVNSCD 407
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPI-GTQWLSTVYFEKPARVLGLEYLEYKIKQEES 338
MVGVHGAGLT+ +FL ++Q++P G +W +P +GL Y+EY+ EE+
Sbjct: 408 VMVGVHGAGLTNMVFLPHNGTVVQIIPWGGMKWPCFHALGRPVPDMGLRYVEYEASAEET 467
Query: 339 SLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYK 390
+L + Y + V +P + + M +L QNV LDI RFR ++ Y+
Sbjct: 468 TLKDVYPRDHAVFTDPLSIHRQGYGMMWATFLDGQNVTLDIGRFRGVMEQLYQ 520
>gi|115445877|ref|NP_001046718.1| Os02g0329800 [Oryza sativa Japonica Group]
gi|46389948|dbj|BAD15800.1| putative HGA1 [Oryza sativa Japonica Group]
gi|113536249|dbj|BAF08632.1| Os02g0329800 [Oryza sativa Japonica Group]
gi|125539253|gb|EAY85648.1| hypothetical protein OsI_07021 [Oryza sativa Indica Group]
gi|125581910|gb|EAZ22841.1| hypothetical protein OsJ_06519 [Oryza sativa Japonica Group]
Length = 566
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 124/383 (32%), Positives = 202/383 (52%), Gaps = 25/383 (6%)
Query: 25 LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTI---- 80
+C + G PK T + V + + + + +PY RK D + V E+TI
Sbjct: 196 VCELYGDVRVSPKQKTIYVVNPSGAGGFDENGEKRLRPYARKDD-FLLPGVVEVTIKSVP 254
Query: 81 TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIAD 140
+ +A P C H PA+VFS GY NFFH+ D +PLF+T +H + IL I +
Sbjct: 255 SEAAAPK--CTKQHAVPAVVFSVAGYTDNFFHDMTDAMIPLFLT-TAHLKGEVQIL-ITN 310
Query: 141 CNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP--TLLPKPKT 198
W +KY LL + + + +IN + HCF LGL ++I+P T P+ T
Sbjct: 311 YKPWWVQKYTPLLRKLSNYDVINFDEDAGVHCFPQGYLGLYRDRDLIISPHPTRNPRNYT 370
Query: 199 LVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAEELGF 255
+VD+ FL +A + S +P++++++R R +LNL EV AA ELGF
Sbjct: 371 MVDYNRFLRDALELRRDRPSVLGEEPGMRPRMLIISRAGT--RKLLNLEEVAAAATELGF 428
Query: 256 DVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QW 311
+VT+ E + AD F ++S ++ VHGAGLT+ +FL +V++Q+VP G W
Sbjct: 429 NVTVAE-----AGADVPAFAALVNSADVLLAVHGAGLTNQIFLPAEAVVVQIVPWGNMDW 483
Query: 312 LSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKT 371
++T ++ +PAR + L Y+EY + +EE+SL Y + +V K+P+A W + +
Sbjct: 484 MATNFYGQPARDMQLRYVEYYVGEEETSLKHNYSRDHMVFKDPKALHAQGWQTLAATIMK 543
Query: 372 QNVKLDIDRFRIYLKDAYKKAKK 394
Q+V++++ RFR L A + ++
Sbjct: 544 QDVEVNLTRFRPILLQALDRLQQ 566
>gi|242052293|ref|XP_002455292.1| hypothetical protein SORBIDRAFT_03g007920 [Sorghum bicolor]
gi|241927267|gb|EES00412.1| hypothetical protein SORBIDRAFT_03g007920 [Sorghum bicolor]
Length = 468
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 123/392 (31%), Positives = 204/392 (52%), Gaps = 15/392 (3%)
Query: 11 AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
+ + C R D C ++G + K + V S+ + +I+ PY R+S +
Sbjct: 84 SNKVVCSTEERFSDYCELDGDVRINGKAWSVDIVPPGWSSEQRREWKIR--PYSRRSASN 141
Query: 71 AMS-KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF 129
+ V +L SAP +C VTH +P +VF+ GGY+GN FH+ D +PL++T + +
Sbjct: 142 VDTLNVTQLQDPASAP---ACTVTHHAPGVVFALGGYSGNAFHDHADVLLPLYLT-SLRY 197
Query: 130 PDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVIN 189
+ +L I W KY L R +++ ++N++ CF +T+GL H +
Sbjct: 198 DGEVQLLVINRVQPWWLGKYRLALRRMSKYDVVNLDGDAHVRCFPHLTVGLRLHMDFGVV 257
Query: 190 PTLLP----KPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE 245
P ++P + ++ DF FL AY +P+L+L+ R R LN E
Sbjct: 258 PEMVPGQGHRRVSMPDFTRFLREAYALPRGAPVKPGKNRPRLMLIQRQRT--RRFLNEAE 315
Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
+ +AAE GF+V + + ++ + R ++S MVG+HGAG+T+ +FL PG VL+QVV
Sbjct: 316 MVRAAEAAGFEVAVTDLLIDAAVDEQARVVNSFDVMVGIHGAGMTNEVFLPPGGVLVQVV 375
Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
P G ++ V + +PA +GL+YL Y + EESSL E G + +K+P + W+
Sbjct: 376 PWGKLDLMARVEYGEPAADMGLKYLCYNVTLEESSLPELLGRDHPAIKDPDSIHRKGWAA 435
Query: 365 M-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
M +Y+ Q+V+LDI+RF + L +A + L
Sbjct: 436 MFDIYMTKQDVRLDIERFALTLAEAMDHLRSL 467
>gi|115469852|ref|NP_001058525.1| Os06g0707200 [Oryza sativa Japonica Group]
gi|53792589|dbj|BAD53604.1| putative HGA1 [Oryza sativa Japonica Group]
gi|53792782|dbj|BAD53817.1| putative HGA1 [Oryza sativa Japonica Group]
gi|113596565|dbj|BAF20439.1| Os06g0707200 [Oryza sativa Japonica Group]
gi|125598448|gb|EAZ38228.1| hypothetical protein OsJ_22603 [Oryza sativa Japonica Group]
gi|215765287|dbj|BAG86984.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 460
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 117/342 (34%), Positives = 186/342 (54%), Gaps = 18/342 (5%)
Query: 59 KTQPYPRKSDKSAMSKVKELTIT-------TSAPPNLSCGVTHTSPALVFSAGGYNGNFF 111
+T+PY R D AM V+E T+ T+ PP C H+ PA +FS+GG+ GN +
Sbjct: 118 RTKPYARLHDAVAMDDVREFTLVPFGGANHTAVPP--LCTRNHSVPAFLFSSGGFAGNLY 175
Query: 112 HEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITH 171
H++ D VPLF + N HF + V ++ D W K+ L + +R+ +I+++N H
Sbjct: 176 HDYTDVLVPLFTSTN-HFGGE-VQFLLSGIKDWWLDKFTPLFRQLSRYDVIDVDNDQEVH 233
Query: 172 CFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLV 228
CF + +G H M I+P P T+ DF+ L + +S + KP+L+
Sbjct: 234 CFPRIFIGATFHRAMGIDPARSPGGVTVADFKRLLRRTFRLERAVASRTGAPRRDKPRLL 293
Query: 229 LVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAG 288
+++R + R LN R + AA FDV I EP+ T + + R ++S M+GVHGAG
Sbjct: 294 IISRKS--SRRFLNERAMAHAAALARFDVRIAEPDNHTDMPNFARLVNSADVMMGVHGAG 351
Query: 289 LTHSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
LT+ +FL +VL+QVVP G +WL+ V F+ PA+ + + Y+EY + +ESSL E Y +
Sbjct: 352 LTNMVFLPSRAVLIQVVPFGGLEWLTRVTFKDPAKDMDVNYMEYNVSFDESSLRELYPRD 411
Query: 348 DLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
+++P W ++ VYL QNV+L++ + L+ A
Sbjct: 412 HFYIQHPYDVHKKGWDAIKTVYLDKQNVELNLTKLTNTLERA 453
>gi|326488805|dbj|BAJ98014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 462
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 121/341 (35%), Positives = 186/341 (54%), Gaps = 16/341 (4%)
Query: 59 KTQPYPRKSDKSAMSKVKELTI------TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFH 112
+T+PY R+ D AM V+E + SA P L C H+ PA +FS+GG+ GN +H
Sbjct: 117 RTKPYARRHDAVAMDDVREFALLPFGGGNDSAVPPL-CTRNHSVPAFLFSSGGFAGNLYH 175
Query: 113 EFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHC 172
++ D VPLF T HF +V + D D W K+ L + + + +I+++N HC
Sbjct: 176 DYTDVLVPLF-TSTHHF-GGEVQFLLTDIKDWWLDKFTPLFRQLSNYDVIDVDNDQEVHC 233
Query: 173 FQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVL 229
F + +G H M I+ T P +T+ DF+ L A+ + +S S KP+L++
Sbjct: 234 FPRIVIGSTFHRPMGIDGTRSPGGETVADFKRLLRRAFRLDRVVASHDGSASLGKPRLLI 293
Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGL 289
++R + R LN R + AA FDV I EP+ T + + R ++S M+GVHGAGL
Sbjct: 294 ISRKS--SRRFLNERAMAHAAALAQFDVRIAEPDNHTDMPNFARLVNSADVMMGVHGAGL 351
Query: 290 THSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
T+ +FL +VL+QVVP G +WLS V F+ PA+ + Y+EY + EESSL Y +
Sbjct: 352 TNMVFLPSRAVLLQVVPFGGLEWLSRVTFKDPAKDFYVTYMEYNVSLEESSLKNLYPKDH 411
Query: 349 LVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
L++P W+ ++ VYL Q+V+LD+ + L+ A
Sbjct: 412 FYLQHPYDVHKKGWNAIKTVYLDKQSVRLDLAKLTRTLEHA 452
>gi|242096984|ref|XP_002438982.1| hypothetical protein SORBIDRAFT_10g029360 [Sorghum bicolor]
gi|241917205|gb|EER90349.1| hypothetical protein SORBIDRAFT_10g029360 [Sorghum bicolor]
Length = 496
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 133/422 (31%), Positives = 206/422 (48%), Gaps = 45/422 (10%)
Query: 10 PAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGH-----------TDSTPSQPSLRI 58
P + CD + D+C + G+ TS F V T P +
Sbjct: 76 PESRVRCDFADPRSDVCELEGAIRIRGSTSEVFVVAGAGADGLLAANVTGLAPGMNATSW 135
Query: 59 KTQPYPRKSDKSAMSKVKELTITTSAPPNLS-CGVTHTSPALVFSAGGYNGNFFHEFMDC 117
QPY RK + M + LT+ +P + C V H PA+V+S GGY GN++H+F D
Sbjct: 136 TIQPYTRKGEVRVMRGITTLTVRVVSPGDAPPCTVRHDVPAVVYSNGGYCGNYYHDFNDN 195
Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
+PLF+T H +V L +A W KY E++ T + ++++ CF+ T
Sbjct: 196 IIPLFVT-TRHL-GGEVQLLVAQKQAWWFHKYREIVDGLTNYDAVDLDGAGEVRCFRKAT 253
Query: 178 LGLISHGRMVINPTLLPKPKTLVDFQSFLANAY----------NENTNTSSSFHHTKPKL 227
LGL S + I+P P+ ++VDF+ FL Y +E + H +P+L
Sbjct: 254 LGLRSLKDLSIDPRRAPRNLSMVDFKRFLMWRYALPREHAIRTDEEEAAGAGGGHRRPRL 313
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFE-----------------PEESTSLAD 270
+LV R +R R +N+ E+ AEE+GFDVT + E + +AD
Sbjct: 314 LLVTRRSR--RRFVNVPEIVALAEEVGFDVTTSDLMSASAKNNNKAGAGVGDEGHSRMAD 371
Query: 271 SFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYL 329
+ + ++S MV VHG+GLT+ +FL +V++QVVP+G + L+ + P R + + Y+
Sbjct: 372 ASKLVNSFDVMVAVHGSGLTNLVFLPMNAVVVQVVPLGRMESLAMDEYGVPPRDMNMRYI 431
Query: 330 EYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
+Y I EES+L E Y VL +P +WS + VYL Q+V+LD+ RFR L A
Sbjct: 432 QYNITAEESTLSEVYPRAHPVLLDPMPIHEQSWSLVNDVYLGKQDVRLDVRRFRPVLLKA 491
Query: 389 YK 390
+
Sbjct: 492 IQ 493
>gi|242052299|ref|XP_002455295.1| hypothetical protein SORBIDRAFT_03g007960 [Sorghum bicolor]
gi|56409858|emb|CAI30077.1| glycosyltransferase [Sorghum bicolor]
gi|241927270|gb|EES00415.1| hypothetical protein SORBIDRAFT_03g007960 [Sorghum bicolor]
Length = 513
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 122/386 (31%), Positives = 207/386 (53%), Gaps = 25/386 (6%)
Query: 16 CDRSHRSYDLCLINGSALFDPKT-STFFSVGHTDSTPSQPSLR-IKTQPYPRKSDKSAMS 73
C+ + C ++G D +T + SV ++PS R K +PY R++ MS
Sbjct: 127 CNTDGPVSETCELDG----DVRTNGSARSVTFVPASPSSSERREWKVRPYSRRT----MS 178
Query: 74 KVKELTIT------TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
V +T+T ++PP +C VTH P ++F+ GG GN++H+F D VPLF
Sbjct: 179 GVDRVTVTQLESPAAASPPPAACAVTHDVPGVLFALGGLTGNYWHDFSDVLVPLFAASRR 238
Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
+ +V+ +++ W KY ++ +R+ ++++ CF+ +T+GL H +
Sbjct: 239 Y--KGEVLFLVSNIQPWWLGKYEAVVRALSRYDAVDLDRDARVRCFRHLTVGLRLHKELT 296
Query: 188 INPTLLPKPKTLVDFQSFLANAY---NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLR 244
I P L P T+ DF +FL Y ++ KP+L+L++R R +N+
Sbjct: 297 IVPDLAPDRLTMADFTAFLRETYALPRGAPAIPTTEEGRKPRLLLIHRAHY--RRFVNVP 354
Query: 245 EVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
E+++AAE GF+VT+ P + ++ R ++S ++GVHGAGLT+++FL G V++QV
Sbjct: 355 EIRRAAESAGFEVTVASPRGDAPVEETARTVNSHDVLLGVHGAGLTNAVFLPAGGVVIQV 414
Query: 305 VPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWS 363
VP G + ++ F +P +GL YLEY + EES+L+E G V+K+P+A + W
Sbjct: 415 VPYGRLERMARTDFGEPVADMGLRYLEYGVAAEESTLLEMLGPEHPVIKDPEAIHRSGWD 474
Query: 364 NM-RVYLKTQNVKLDIDRFRIYLKDA 388
+ YL Q+V++D++RF L A
Sbjct: 475 KVAEYYLGKQDVRIDVNRFAPTLAQA 500
>gi|242034639|ref|XP_002464714.1| hypothetical protein SORBIDRAFT_01g025290 [Sorghum bicolor]
gi|241918568|gb|EER91712.1| hypothetical protein SORBIDRAFT_01g025290 [Sorghum bicolor]
Length = 466
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 123/391 (31%), Positives = 201/391 (51%), Gaps = 21/391 (5%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+TC+ S+ + C + G K++T + V + P + IK +PY RK + M
Sbjct: 78 LTCNFSNHQSNTCSMEGDLRIHGKSATVYVVSASTYRPENAT--IKLRPYARKWEDQVML 135
Query: 74 KVKELTITTSAPPN----------LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
V+E+T+ +S P C V H PA+VFS GGYN NFFH D +PL++
Sbjct: 136 LVREVTMRSSPPAGSAAADDPPPPPQCSVRHDVPAVVFSTGGYNRNFFHVMTDVIIPLYL 195
Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININN--QTITHCFQSVTLGLI 181
T + + V L D +W KY +L + +P+I++++ + CF S +GL
Sbjct: 196 TAREY--NGHVQLLATDYEPKWIAKYKAILAALSSYPVIDLDSEPEDTVRCFPSAHVGLE 253
Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTIL 241
SH + I P L K T+V F+ F+ +AY+ S+ KP+LV++ R R R +
Sbjct: 254 SHKELGIVPGLSHKGYTMVSFRDFIRSAYSLQRPRVSA-GRKKPRLVMILR--RNSRQLK 310
Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
N + AA +GF+V P++ + L ++SC ++GVHGAGL + LFL + +
Sbjct: 311 NEDDAIAAAANVGFEVVAAGPDDVSDLERFPGVVNSCDVLMGVHGAGLANMLFLPHNATV 370
Query: 302 MQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGA 360
+Q++P G +W + P +GL YLEY+ EE+SL + Y + V +P +
Sbjct: 371 VQIIPWGELKWACRHSYGDPVPDMGLRYLEYEATAEETSLKDSYPRDHAVFTDPLSIHRQ 430
Query: 361 NWSNM-RVYLKTQNVKLDIDRFRIYLKDAYK 390
+ M +++ Q+V +DIDRF ++K Y+
Sbjct: 431 GFDKMWNIFINGQHVIVDIDRFTGFMKQLYQ 461
>gi|326497271|dbj|BAK02220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 480
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 111/335 (33%), Positives = 183/335 (54%), Gaps = 7/335 (2%)
Query: 62 PYPRKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
PY R V +L +APP C VTHT PA++F GGY GN++H++ D VPL
Sbjct: 150 PYTRSGQSLRAVTVTQLQDRAAAPP---CTVTHTMPAILFGIGGYVGNYWHDYADILVPL 206
Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
F+ + + +++ +W KY LL +++ ++++++ CF VT+GL
Sbjct: 207 FVASRQYHGEVTFLVSNIKHLPRWLVKYKTLLQGLSKYGVVDMDHDAYVRCFPRVTVGLR 266
Query: 182 SHGRMVINPTLLPKPK-TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTI 240
I P L+P + T+ DF F+ Y + KP+L+L++R R
Sbjct: 267 LDKDFSIVPELVPGGRLTMADFTQFVRETYALPRGAVIREPYKKPRLLLIHRGTF--RRF 324
Query: 241 LNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSV 300
LN E+ +AAE +GF+ + E + S + R ++S ++GVHGAGLT+++ L PG V
Sbjct: 325 LNEPEIVQAAEAVGFEAVVTELRLNGSEVEQARVVNSFDVVLGVHGAGLTNAVHLPPGGV 384
Query: 301 LMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
L+QVVP G + ++ + F +PA +GL+YL+Y + EESSL+EK G + +K+P +
Sbjct: 385 LIQVVPFGKIEVMARLDFSEPATDMGLKYLDYSVSAEESSLLEKLGPDHPAIKDPDSIHR 444
Query: 360 ANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
+ W+ M + QNV+++ RF L+ A+ +K
Sbjct: 445 SGWTTMYDFYLMQNVRINTTRFAPTLEQAFNHLRK 479
>gi|218198859|gb|EEC81286.1| hypothetical protein OsI_24404 [Oryza sativa Indica Group]
Length = 460
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 184/340 (54%), Gaps = 14/340 (4%)
Query: 59 KTQPYPRKSDKSAMSKVKELTITTSAPPNLS-----CGVTHTSPALVFSAGGYNGNFFHE 113
+T+PY R D AM V+E T+ N S C H+ PA +FS+GG+ GN +H+
Sbjct: 118 RTKPYARLHDAVAMDDVREFTLVPFGGANHSAVPPLCTRNHSVPAFLFSSGGFAGNLYHD 177
Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
+ D VPLF + N HF + V ++ D W K+ L + +R+ +I+++N HCF
Sbjct: 178 YTDVLVPLFTSTN-HFGGE-VQFLLSGIKDWWLDKFTPLFRQLSRYDVIDVDNDQEVHCF 235
Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSS---SFHHTKPKLVLV 230
+ +G H M I+P P T+ DF+ L + +S + KP+L+++
Sbjct: 236 PRIFIGATFHRAMGIDPARSPGGVTVADFKRLLRRTFRLERAVASRTGAPRRDKPRLLII 295
Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
+R + R LN R + AA FDV I EP+ T + + R ++S M+GVHGAGLT
Sbjct: 296 SRKS--SRRFLNERAMAHAAALARFDVRIAEPDNHTDMPNFARLVNSADVMMGVHGAGLT 353
Query: 291 HSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDL 349
+ +FL +VL+QVVP G +WL+ V F+ PA+ + + Y+EY + +ESSL E Y +
Sbjct: 354 NMVFLPSRAVLIQVVPFGGLEWLTRVTFKDPAKDMDVNYMEYNVSFDESSLRELYPRDHF 413
Query: 350 VLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
+++P W ++ VYL QNV+L++ + L+ A
Sbjct: 414 YIQHPYDVHKKGWDAIKTVYLDKQNVELNLTKLTNTLERA 453
>gi|125556695|gb|EAZ02301.1| hypothetical protein OsI_24402 [Oryza sativa Indica Group]
Length = 465
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 134/402 (33%), Positives = 208/402 (51%), Gaps = 33/402 (8%)
Query: 10 PAKSITCD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQP------------SL 56
P + CD R +RS D+C + G+ +TS F V + ++ S +
Sbjct: 69 PEPHLRCDFRDNRS-DVCEMEGAIRILGRTSEVFLVAPSLASISGGGGGVNATGVDANAT 127
Query: 57 RIKTQPYPRKSDKSAMSKVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFH 112
R K QPY RK + M + E+T+ APP C H PA+V+S GGY GN++H
Sbjct: 128 RWKIQPYTRKGESRVMPGITEVTVRLVTADEAPP---CDEWHDVPAIVYSNGGYCGNYYH 184
Query: 113 EFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHC 172
+F D +PLFIT S +V L + W KY E++ T++ ++++ + C
Sbjct: 185 DFNDNIIPLFIT--SRHLAGEVQLLVTQKQRWWFGKYREIVEGLTKYEPVDLDAEQRVRC 242
Query: 173 FQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY----NENTNTSSSFHHTKPKLV 228
++ T+GL SH + I+P P ++VDF+ FL Y + KP+L+
Sbjct: 243 YRRATVGLHSHKDLSIDPRRAPNNYSMVDFKRFLMWRYALPREHAIRMEEEDNSKKPRLL 302
Query: 229 LVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAG 288
++NR +R R +NL E+ AAE +GF+V E + A S ++S AMV VHG+G
Sbjct: 303 VINRRSR--RRFVNLDEIVAAAEGVGFEVAAAELDAHIPAAAS--AVNSYDAMVAVHGSG 358
Query: 289 LTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
LT+ +FL +V++QVVP+G + L+ + P R + + YL+Y I EES+L E Y
Sbjct: 359 LTNLVFLPMNAVVIQVVPLGRMEGLAMDEYGVPPRDMNMRYLQYNITAEESTLSEVYPRA 418
Query: 348 DLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
V +P +WS ++ +YL Q+V+LD+ RFR L A
Sbjct: 419 HPVFLDPLPIHKQSWSLVKDIYLGQQDVRLDVRRFRPVLLKA 460
>gi|413942817|gb|AFW75466.1| hypothetical protein ZEAMMB73_930495 [Zea mays]
Length = 496
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 128/397 (32%), Positives = 196/397 (49%), Gaps = 29/397 (7%)
Query: 11 AKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKS 70
A+ C S R D C G + T VG D K +PY RK D
Sbjct: 102 ARRPVCYESSRRSDTCEAAGDVRVVGRAQTVL-VGALDRE-------WKVKPYCRKHDAF 153
Query: 71 AMSKVKELTI-----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
A+S VKE T+ + C V ++ A V S GG+ GN FH++ D +P FIT
Sbjct: 154 ALSHVKEWTLRPLVGGSDGDEAPRCTVNSSATAFVLSTGGFTGNLFHDYTDVLIPAFITA 213
Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR 185
+ DV L ++ WA KY +L + +RH +++ + C+ V +G H
Sbjct: 214 RRY--GGDVQLLVSSHKPWWAAKYMPVLQQLSRHELVDADADGEVRCYPRVVVGPTFHRE 271
Query: 186 MVIN-----PTLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNAR 235
+ + P + ++ +F++ L A E + S +P+L++++R R
Sbjct: 272 LGVGAETKAPGGEEEGVSMPEFRAMLRRALGLERAAAAPSGDRWDVRRRPRLLIISR--R 329
Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R +LN R + A LGFDV + +PE ST +A R ++S MVGVHG GLT+ +FL
Sbjct: 330 QSRRLLNERAMADMATSLGFDVRVGDPEASTDVARFARLVNSADVMVGVHGDGLTNMVFL 389
Query: 296 RPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
G+VL+QVVP G +WL+ F PA + + YLEY ++ E++L E+YG +D V+++P
Sbjct: 390 PAGAVLVQVVPYGGLEWLARGMFRDPAAGMQVHYLEYVVRAGETTLSEEYGEDDPVIRDP 449
Query: 355 QAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYK 390
A W ++ VYL Q+V+ + R R L A K
Sbjct: 450 AAVHRKGWDALKAVYLDKQDVRPHLGRLRNTLLQALK 486
>gi|115469848|ref|NP_001058523.1| Os06g0707000 [Oryza sativa Japonica Group]
gi|113596563|dbj|BAF20437.1| Os06g0707000 [Oryza sativa Japonica Group]
Length = 446
Score = 201 bits (510), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 134/402 (33%), Positives = 208/402 (51%), Gaps = 33/402 (8%)
Query: 10 PAKSITCD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQP------------SL 56
P + CD R +RS D+C + G+ +TS F V + ++ S +
Sbjct: 50 PEPHLRCDFRDNRS-DVCEMEGAIRILGRTSEVFLVAPSLASISGGGGGVNATGVDANAT 108
Query: 57 RIKTQPYPRKSDKSAMSKVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFH 112
R K QPY RK + M + E+T+ APP C H PA+V+S GGY GN++H
Sbjct: 109 RWKIQPYTRKGESRVMPGITEVTVRLVTADEAPP---CDEWHDVPAIVYSNGGYCGNYYH 165
Query: 113 EFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHC 172
+F D +PLFIT + H + V L + W KY E++ T++ ++++ + C
Sbjct: 166 DFNDNIIPLFIT-SRHLAGE-VQLLVTQKQRWWFGKYREIVEGLTKYEPVDLDAEQRVRC 223
Query: 173 FQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY----NENTNTSSSFHHTKPKLV 228
++ T+GL SH + I+P P ++VDF+ FL Y KP+L+
Sbjct: 224 YRRATVGLHSHKDLSIDPRRAPNNYSMVDFKRFLMWRYALPREHAIRMEEEDKSKKPRLL 283
Query: 229 LVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAG 288
++NR +R R +NL E+ AAE +GF+V E + A S ++S AMV VHG+G
Sbjct: 284 VINRRSR--RRFVNLDEIVAAAEGVGFEVAAAELDAHIPAAAS--AVNSYDAMVAVHGSG 339
Query: 289 LTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
LT+ +FL +V++QVVP+G + L+ + P R + + YL+Y I EES+L E Y
Sbjct: 340 LTNLVFLPMNAVVIQVVPLGRMEGLAMDEYGVPPRDMNMRYLQYNITAEESTLSEVYPRA 399
Query: 348 DLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
V +P +WS ++ +YL Q+V+LD+ RFR L A
Sbjct: 400 HPVFLDPLPIHKQSWSLVKDIYLGQQDVRLDVRRFRPVLLKA 441
>gi|300681542|emb|CBH32639.1| glycosyltransferase, HGA-like, putative,expressed [Triticum
aestivum]
Length = 524
Score = 201 bits (510), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 116/323 (35%), Positives = 186/323 (57%), Gaps = 20/323 (6%)
Query: 75 VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
V +L T+ A P +C VTH PA+V + GG+ GN+FH+F D VPLF+ + +V
Sbjct: 192 VTQLDSTSVASPAPACTVTHRVPAIVLALGGHLGNYFHDFSDALVPLFVASRRY--GGEV 249
Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLP 194
L + W KY ++ R T++ ++++++ CF+ VT+GL H I P L+P
Sbjct: 250 QLLAGNIQPWWLGKYEAVVRRLTKYEVLDLDHDDQIRCFRHVTVGLNMHKEFNIVPELVP 309
Query: 195 K--PKTLVDFQSFLANAYN-------ENTNTSSS-----FHHTKPKLVLVNRNARVGRTI 240
P ++++F +FL Y+ TN SS + KP+L+L++R R +
Sbjct: 310 GGVPLSMLNFTAFLRETYSLPRAAPISLTNNKSSPPVDDNKNKKPRLMLLDRGHY--RKL 367
Query: 241 LNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSV 300
+N+ E+ KAAE+ GF+VTI +P + + + ++S ++GVHGAGLT+S FL PG+V
Sbjct: 368 VNVPEIVKAAEKAGFEVTIADPRFNVRVKELALSVNSFDVLLGVHGAGLTNSAFLPPGAV 427
Query: 301 LMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
++QVVP G + ++ F PA +GL YLEY I EES+L+E G + +K+P +
Sbjct: 428 VIQVVPYGKLEPMAQREFGDPAANMGLRYLEYSISVEESTLLETLGPDHPAIKDPDSVHR 487
Query: 360 ANWSNM-RVYLKTQNVKLDIDRF 381
+ W + YL QNV++D++RF
Sbjct: 488 SGWDKVAEYYLGKQNVRVDVERF 510
>gi|53792588|dbj|BAD53603.1| putative HGA1 [Oryza sativa Japonica Group]
gi|53792781|dbj|BAD53816.1| putative HGA1 [Oryza sativa Japonica Group]
gi|125598446|gb|EAZ38226.1| hypothetical protein OsJ_22601 [Oryza sativa Japonica Group]
Length = 465
Score = 201 bits (510), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 134/402 (33%), Positives = 208/402 (51%), Gaps = 33/402 (8%)
Query: 10 PAKSITCD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQP------------SL 56
P + CD R +RS D+C + G+ +TS F V + ++ S +
Sbjct: 69 PEPHLRCDFRDNRS-DVCEMEGAIRILGRTSEVFLVAPSLASISGGGGGVNATGVDANAT 127
Query: 57 RIKTQPYPRKSDKSAMSKVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFH 112
R K QPY RK + M + E+T+ APP C H PA+V+S GGY GN++H
Sbjct: 128 RWKIQPYTRKGESRVMPGITEVTVRLVTADEAPP---CDEWHDVPAIVYSNGGYCGNYYH 184
Query: 113 EFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHC 172
+F D +PLFIT + H + V L + W KY E++ T++ ++++ + C
Sbjct: 185 DFNDNIIPLFIT-SRHLAGE-VQLLVTQKQRWWFGKYREIVEGLTKYEPVDLDAEQRVRC 242
Query: 173 FQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAY----NENTNTSSSFHHTKPKLV 228
++ T+GL SH + I+P P ++VDF+ FL Y KP+L+
Sbjct: 243 YRRATVGLHSHKDLSIDPRRAPNNYSMVDFKRFLMWRYALPREHAIRMEEEDKSKKPRLL 302
Query: 229 LVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAG 288
++NR +R R +NL E+ AAE +GF+V E + A S ++S AMV VHG+G
Sbjct: 303 VINRRSR--RRFVNLDEIVAAAEGVGFEVAAAELDAHIPAAAS--AVNSYDAMVAVHGSG 358
Query: 289 LTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
LT+ +FL +V++QVVP+G + L+ + P R + + YL+Y I EES+L E Y
Sbjct: 359 LTNLVFLPMNAVVIQVVPLGRMEGLAMDEYGVPPRDMNMRYLQYNITAEESTLSEVYPRA 418
Query: 348 DLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
V +P +WS ++ +YL Q+V+LD+ RFR L A
Sbjct: 419 HPVFLDPLPIHKQSWSLVKDIYLGQQDVRLDVRRFRPVLLKA 460
>gi|383100764|emb|CCG47995.1| glycosyltransferase, HGA-like,putative,expressed [Triticum
aestivum]
Length = 518
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 114/321 (35%), Positives = 184/321 (57%), Gaps = 18/321 (5%)
Query: 75 VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
V +L T++ P +C VTH PA+V + GG+ GN+FH+F D VPLF+ + +V
Sbjct: 188 VTQLDSTSAGSPAPACTVTHRVPAIVLALGGHLGNYFHDFSDALVPLFVASRRY--GGEV 245
Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLP 194
L + W KY ++ R T++ ++++++ CF+ VT+GL H I P L+P
Sbjct: 246 QLLAGNIQPWWLGKYEAVVRRLTKYEVLDLDHDDQIRCFRHVTVGLNMHKEFNIVPELVP 305
Query: 195 K--PKTLVDFQSFLANAYN-------ENTNTSS---SFHHTKPKLVLVNRNARVGRTILN 242
P ++ +F +FL Y+ TN SS KP+L+L++R R ++N
Sbjct: 306 GGVPLSMANFTAFLRETYSLPRAAPISLTNDSSPPVDAKKRKPRLMLLDRGHY--RKLVN 363
Query: 243 LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
+ E+ KAAE+ GF+VTI +P + + + ++S ++GVHGAGLT+S FL PG+V++
Sbjct: 364 VPEIVKAAEKAGFEVTIADPRFNVRVKELAMSVNSFDVLLGVHGAGLTNSAFLPPGAVVI 423
Query: 303 QVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGAN 361
QVVP G + ++ F PA +GL YLEY I +ES+L+E G + +K+P + +
Sbjct: 424 QVVPYGKLEPMAQREFGDPAANMGLRYLEYSISVDESTLLETLGPDHPAIKDPDSVHRSG 483
Query: 362 WSNM-RVYLKTQNVKLDIDRF 381
W + YL QNV++D++RF
Sbjct: 484 WDKVAEYYLGKQNVRVDVERF 504
>gi|115434208|ref|NP_001041862.1| Os01g0118700 [Oryza sativa Japonica Group]
gi|11034624|dbj|BAB17148.1| putative HGA6 [Oryza sativa Japonica Group]
gi|14090224|dbj|BAB55485.1| putative HGA6 [Oryza sativa Japonica Group]
gi|113531393|dbj|BAF03776.1| Os01g0118700 [Oryza sativa Japonica Group]
gi|125524185|gb|EAY72299.1| hypothetical protein OsI_00154 [Oryza sativa Indica Group]
gi|125568814|gb|EAZ10329.1| hypothetical protein OsJ_00164 [Oryza sativa Japonica Group]
gi|215687353|dbj|BAG91918.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 518
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 119/390 (30%), Positives = 202/390 (51%), Gaps = 18/390 (4%)
Query: 9 SPAKSITCD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPYPRK 66
+PAK CD + + D+C I+G + S P+ R + PY RK
Sbjct: 130 APAKKPACDIQGPWASDVCSIDGDVRIHGAAHDVVIPPPIEGGGSNPNPREWRVVPYSRK 189
Query: 67 SDKSAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT 124
M +KE+ + SA +C V PALVF+ GG GN++H+F D +PL++
Sbjct: 190 H----MGGLKEVAVREVASAAEAPACDVRSPVPALVFAMGGLTGNYWHDFSDVLIPLYLQ 245
Query: 125 INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHG 184
D +V L + + + KY +L R +RH I++++ CF +G+ H
Sbjct: 246 ARRF--DGEVQLVVENIQMWYVGKYKRVLDRLSRHDIVDMDRDDKVRCFPGAVVGIRMHK 303
Query: 185 RMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSF----HHTKPKLVLVNRNARVGRTI 240
I+P P ++ +F FL + ++ + S +P+L++++R R R +
Sbjct: 304 EFSIDPARDPTGHSMPEFTKFLRDTFSLPRDAPVSLVDNAAAVRPRLMIISR--RHPRKL 361
Query: 241 LNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSV 300
+N+ EV +AAE +GF+V I +P + + + + ++ ++GVHGAGLT+S+FL G+V
Sbjct: 362 MNVEEVVRAAERIGFEVVIGDPPFNVDVGEFAKEVNRADVLMGVHGAGLTNSVFLPTGAV 421
Query: 301 LMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
L+QVVP G + + V F PA + L+Y+ Y EES+LVE G + +++P++
Sbjct: 422 LIQVVPYGKMEHIGKVDFGDPAEDMRLKYMAYSAGVEESTLVETLGRDHPAVRDPESVHR 481
Query: 360 ANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
+ W + YL Q+++LD+ RF L+DA
Sbjct: 482 SGWGKVAEYYLGKQDIRLDLARFEPLLRDA 511
>gi|383100757|emb|CCG47989.1| glycosyltransferase, HGA-like, expressed [Triticum aestivum]
Length = 523
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 125/389 (32%), Positives = 207/389 (53%), Gaps = 20/389 (5%)
Query: 9 SPAKSITCD-RSHRSYDLCLI---NGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPY 63
+PAK CD + + D+C + +G T T +S S P+ + + PY
Sbjct: 134 APAKKPACDIQGPWASDVCDLAGGSGGVRIHGSTHTVLVPPTIESGGSNPNPQEWRVLPY 193
Query: 64 PRKSDKSAMSKVKELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
RK MS +KE+T+ +A C VT PALVF+ GG GN++H+F D +PL
Sbjct: 194 SRKH----MSGIKEITVRELPTAADAPRCAVTSQVPALVFAMGGLTGNYWHDFSDVMIPL 249
Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
++ S F D +V L + + +A KY ++ R +R+ +++++ CF S +G+
Sbjct: 250 YLQA-SRF-DGEVQLVVTNIQPWYAGKYRHIIARLSRYDVVDMDKDDQVRCFPSAVVGIR 307
Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAY---NENTNTSSSFHHTKPKLVLVNRNARVGR 238
H I+PT P ++ +F FL N + ++ KP++++++R R R
Sbjct: 308 MHKEFSIDPTKEPTGHSMPEFTKFLRNVFALPRAAPMRVTAGSDKKPRMMIISR--RHPR 365
Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
++N+ EV A+ +GF+V I +P + + D R +++ ++GVHGAGLT+SLFL G
Sbjct: 366 KLVNVAEVVALAKRIGFEVVIGDPPFNVDVGDFAREVNAADVLMGVHGAGLTNSLFLPTG 425
Query: 299 SVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
+V +QV P G + + V F PA +GL+Y+ Y EES+LV+ G + +K+P++
Sbjct: 426 AVFIQVNPFGKMEHIGEVDFGTPAVDMGLKYMAYSCGMEESTLVDTLGRDHPAVKDPESI 485
Query: 358 AGANWSNM-RVYLKTQNVKLDIDRFRIYL 385
+ WS + YL Q+VKLD+ RF L
Sbjct: 486 HRSGWSKVAEYYLGKQDVKLDLQRFEPVL 514
>gi|383100754|emb|CCG47984.1| glycosyltransferase, HGA-like,putative,expressed [Triticum
aestivum]
Length = 618
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 124/382 (32%), Positives = 187/382 (48%), Gaps = 18/382 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S R D+C G D S+F V + +R PYPRK D++ M +V
Sbjct: 239 CDFSDRRSDVCDFTGDIRMDANASSFVVVVDAATAAQSHKVR----PYPRKGDQTCMGRV 294
Query: 76 KELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
E+T+ ++S PP C TH+ PA+ FS GGY GN FH+F D VPL+ T+ H
Sbjct: 295 PEITVRTTSSSSTPPPPQCTRTHSVPAVTFSIGGYTGNIFHDFSDVLVPLYNTV--HRYR 352
Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTI---THCFQSVTLGLISHGRMVI 188
DV L +A+ W KY +LL +RH +++ THCF+ + L +H ++I
Sbjct: 353 GDVQLVMANVAPWWLVKYDKLLRELSRHAPLDLAAAAAKGETHCFRHAVVSLRAHRELII 412
Query: 189 NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKK 248
P D R+ +LNL ++ +
Sbjct: 413 ERNRSPDGLATPDXXXXXXXXXXXXXXXXXXXXXXXXXXXXARHRTRI---LLNLGDMMR 469
Query: 249 AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
AEE GF+ + E + S++ I+S ++GVHGAGLT+ +FL PG+ ++QVVP G
Sbjct: 470 VAEEAGFEAAVSESDVGDSISRVGAEINSADVLLGVHGAGLTNMMFLAPGATMVQVVPWG 529
Query: 309 T-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-R 366
QW++ + + PA +GL Y++Y+I EESSL +KY + +P + + M R
Sbjct: 530 GLQWIARMDYGDPAEAMGLRYVQYEIGVEESSLKDKYPRGHKIFTDPTSLHKKGFGFMRR 589
Query: 367 VYLKTQNVKLDIDRFRIYLKDA 388
+ QN+ LD+ RFR L+ A
Sbjct: 590 TLMDGQNITLDLGRFRGVLQQA 611
>gi|226500048|ref|NP_001149318.1| HGA4 [Zea mays]
gi|195626358|gb|ACG35009.1| HGA4 [Zea mays]
Length = 536
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 127/385 (32%), Positives = 202/385 (52%), Gaps = 25/385 (6%)
Query: 25 LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITT-- 82
+C ++G PK T + V + + +PY RK D + V E+T+ +
Sbjct: 162 VCELSGDVRVSPKQRTVYLVNPSGGGGFDERGEKRVRPYARKDD-FLLPGVTEVTVKSVP 220
Query: 83 SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCN 142
SA C HT+PA+VFS GY NFFH+ D +PLF+T +H D DV L + +
Sbjct: 221 SAAVAPKCTRQHTAPAVVFSVAGYTDNFFHDMADALIPLFLT-TAHL-DGDVQLLVTNYK 278
Query: 143 DQWARKYAELLPRFTRHPIININ---NQTITHCFQSVT-LGLISHGRMVINP--TLLPKP 196
W +KY +L + +RH +++ + HCF + LGL ++++P T P+
Sbjct: 279 PWWVQKYTPVLRKLSRHDVMSFDAADEDDAVHCFPAGAFLGLYRDRDLILSPHPTRNPRN 338
Query: 197 KTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNRNARVGRTILNLREVKKAAEEL 253
T+VDF FL A + ++ +P++++V+R R +LNL EV AA+EL
Sbjct: 339 LTMVDFARFLRGALALPRDRPAALGEQPGARPRMLIVSRAGT--RRLLNLDEVAAAADEL 396
Query: 254 GFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-T 309
GF+VT E + AD F +++ +VGVHGAGLT+ +FL +V++Q+VP G
Sbjct: 397 GFNVTAAE-----AGADVPAFAAQVNAADVLVGVHGAGLTNQVFLPTEAVVVQIVPWGKM 451
Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYL 369
W++T ++ +PA LGL YLEY + EE+SL +KY +V +P A W + +
Sbjct: 452 DWMATNFYARPAAGLGLRYLEYYVGAEETSLKDKYPREHVVFSDPMALHKQGWQALAQTV 511
Query: 370 KTQNVKLDIDRFRIYLKDAYKKAKK 394
Q+V +++ FR L A K ++
Sbjct: 512 MKQDVTVNLTMFRPVLLQALDKLQQ 536
>gi|226494337|ref|NP_001149180.1| HGA4 [Zea mays]
gi|195625278|gb|ACG34469.1| HGA4 [Zea mays]
gi|413953957|gb|AFW86606.1| HGA4 [Zea mays]
Length = 521
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 124/400 (31%), Positives = 204/400 (51%), Gaps = 28/400 (7%)
Query: 7 ALSPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIK 59
A P ITCD + + +C + G PK T + V + + P + K
Sbjct: 126 ATKPEAKITCDENGVDEGFPYARPPVCELAGDIRVSPKEKTMYFVNPSGAGPFDSNGEKK 185
Query: 60 TQPYPRKSDKSAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDC 117
+P+ RK D + V E+TI +SA C H P +VFS GY NFFH+ D
Sbjct: 186 IRPFARKDD-FLLPGVVEVTIKSVSSAAAAPRCTRRHDVPVVVFSVAGYTDNFFHDNTDV 244
Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
+PLF+T +H +V L + + W RK+ LL + + + +I+ HCF++
Sbjct: 245 LIPLFLT-TAHL-QGEVQLLVTNYKPWWVRKFTPLLRKLSNYDVIDFGKDDEVHCFRAGH 302
Query: 178 LGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNR 232
LG+ ++I+P T P+ ++VD+ FL A+N + + KP+++++ R
Sbjct: 303 LGMYRDRDLIISPHPTRNPRNYSMVDYNRFLRRAFNLPRDAPAVLGEETSAKPQMLIIER 362
Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGL 289
R +LNL EV E LGF VT+ E + AD F +++ ++ VHGAGL
Sbjct: 363 KGT--RKLLNLPEVSALCEALGFAVTVAEAD-----ADVRVFAEKVNAADVLLAVHGAGL 415
Query: 290 THSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
T+ +FL G+VL+Q+VP G W++T ++ +PAR + L Y+EY + +EE++L +KY +
Sbjct: 416 TNQIFLPTGAVLVQIVPWGKMDWMATNFYGQPARDMRLRYVEYYVSEEETTLKDKYPRDH 475
Query: 349 LVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
V K+P W + + Q+V +++ RF+ +L A
Sbjct: 476 YVFKDPMHIHAQGWPAIAEIIMKQDVMVNMTRFKPFLLQA 515
>gi|357144041|ref|XP_003573147.1| PREDICTED: uncharacterized protein LOC100838010 isoform 3
[Brachypodium distachyon]
Length = 507
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 120/386 (31%), Positives = 197/386 (51%), Gaps = 23/386 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
C + R D C +G +T T P + ++K PY RK D A S V
Sbjct: 124 CYETSRRSDTCEASGDVRVQGRTQTVHI------RPLEKEWKVK--PYCRKHDAFAQSHV 175
Query: 76 KELTITTSAPPNLSCGVTHTSPAL---VFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
KE T+ + + C V + L V S GG+ GN FH++ D +P FIT S
Sbjct: 176 KEWTLRPLSSSSPHCTVNSSGSGLTAFVLSTGGFTGNLFHDYTDVLIPAFIT--SRRFHG 233
Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
+V ++ W +Y ++ + + + +++I+ C+ +G H + ++ +
Sbjct: 234 EVQFLVSSYKSWWTSRYLQIFQQLSNYEVLDIDADDEVRCYDKAVVGPTFHKELGVDASK 293
Query: 193 LPKPK-TLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLREV 246
P P ++VDF++ L A+ E S +P+L++++R + R LN R +
Sbjct: 294 TPSPGYSMVDFRAMLRQAFGLERAAAEPSGDRWDIRRRPRLLIISR--KHTRAFLNERAM 351
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
A LGFDV + EP+ ST + R ++S MVGVHGAGLT+ LFL G+VL+Q+VP
Sbjct: 352 ADMAMSLGFDVRVGEPDVSTDVGKFARLVNSADVMVGVHGAGLTNMLFLPAGAVLVQIVP 411
Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
G +WL+ F++P+ + + YLEY I+ +E++L E+Y +D VL++P + W +
Sbjct: 412 YGGLEWLARGTFKEPSPDMRIHYLEYMIQLDETTLSEQYAKDDPVLRDPASIHKQGWDKL 471
Query: 366 R-VYLKTQNVKLDIDRFRIYLKDAYK 390
+ VYL QNV+ + R +I +A K
Sbjct: 472 KTVYLDKQNVRPHLGRLKITFMEALK 497
>gi|357144037|ref|XP_003573145.1| PREDICTED: uncharacterized protein LOC100838010 isoform 1
[Brachypodium distachyon]
Length = 499
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 120/386 (31%), Positives = 197/386 (51%), Gaps = 23/386 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
C + R D C +G +T T P + ++K PY RK D A S V
Sbjct: 116 CYETSRRSDTCEASGDVRVQGRTQTVHI------RPLEKEWKVK--PYCRKHDAFAQSHV 167
Query: 76 KELTITTSAPPNLSCGVTHTSPAL---VFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
KE T+ + + C V + L V S GG+ GN FH++ D +P FIT S
Sbjct: 168 KEWTLRPLSSSSPHCTVNSSGSGLTAFVLSTGGFTGNLFHDYTDVLIPAFIT--SRRFHG 225
Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
+V ++ W +Y ++ + + + +++I+ C+ +G H + ++ +
Sbjct: 226 EVQFLVSSYKSWWTSRYLQIFQQLSNYEVLDIDADDEVRCYDKAVVGPTFHKELGVDASK 285
Query: 193 LPKPK-TLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLREV 246
P P ++VDF++ L A+ E S +P+L++++R + R LN R +
Sbjct: 286 TPSPGYSMVDFRAMLRQAFGLERAAAEPSGDRWDIRRRPRLLIISR--KHTRAFLNERAM 343
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
A LGFDV + EP+ ST + R ++S MVGVHGAGLT+ LFL G+VL+Q+VP
Sbjct: 344 ADMAMSLGFDVRVGEPDVSTDVGKFARLVNSADVMVGVHGAGLTNMLFLPAGAVLVQIVP 403
Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
G +WL+ F++P+ + + YLEY I+ +E++L E+Y +D VL++P + W +
Sbjct: 404 YGGLEWLARGTFKEPSPDMRIHYLEYMIQLDETTLSEQYAKDDPVLRDPASIHKQGWDKL 463
Query: 366 R-VYLKTQNVKLDIDRFRIYLKDAYK 390
+ VYL QNV+ + R +I +A K
Sbjct: 464 KTVYLDKQNVRPHLGRLKITFMEALK 489
>gi|300681532|emb|CBH32629.1| glycosyltransferase, HGA-like, putative,expressed [Triticum
aestivum]
Length = 526
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 125/391 (31%), Positives = 206/391 (52%), Gaps = 22/391 (5%)
Query: 9 SPAKSITCD-RSHRSYDLCLI---NGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPY 63
+PAK CD + + D+C + +G T T +S S P+ + + PY
Sbjct: 135 APAKKPACDIQGPWASDVCDLAGGSGGVRIHGSTHTVLVPPTIESGGSNPNPQEWRVLPY 194
Query: 64 PRKSDKSAMSKVKELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
RK MS +KE+T+ +A C VT PALVF+ GG GN++H+F D +PL
Sbjct: 195 SRKH----MSGIKEITVRELPTAADAPRCAVTSQVPALVFAMGGLTGNYWHDFSDVMIPL 250
Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
++ S F D +V L + + +A KY ++ R +R+ +++++ CF S +G+
Sbjct: 251 YLQA-SRF-DGEVQLVVTNIQPWYAGKYRHIIARLSRYDVVDMDKDDQVRCFPSAIVGIR 308
Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAYN-----ENTNTSSSFHHTKPKLVLVNRNARV 236
H I+P P ++ +F FL N + T+ KP++++++R R
Sbjct: 309 MHKEFSIDPAKEPTGHSMPEFTKFLRNVFALPRAAPMRVTAGLISDKKPRMMIISR--RH 366
Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R ++N+ EV A+ +GF+V I +P + + D R +++ ++GVHGAGLT+SLFL
Sbjct: 367 PRKLVNVAEVVALAKRIGFEVVIGDPPFNVDVGDFAREVNAADVLMGVHGAGLTNSLFLP 426
Query: 297 PGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
G+V +QV P G + + V F PA +GL+Y+ Y EES+LV+ G + +K+P+
Sbjct: 427 TGAVFIQVNPFGKMEHIGEVDFGTPAVDMGLKYMAYSCGMEESTLVDTLGRDHPAVKDPE 486
Query: 356 AFAGANWSNM-RVYLKTQNVKLDIDRFRIYL 385
+ + WS + YL Q+VKLD+ RF L
Sbjct: 487 SIHRSGWSKVAEYYLGKQDVKLDLQRFEPVL 517
>gi|357144039|ref|XP_003573146.1| PREDICTED: uncharacterized protein LOC100838010 isoform 2
[Brachypodium distachyon]
Length = 506
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 120/386 (31%), Positives = 197/386 (51%), Gaps = 23/386 (5%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
C + R D C +G +T T P + ++K PY RK D A S V
Sbjct: 123 CYETSRRSDTCEASGDVRVQGRTQTVHI------RPLEKEWKVK--PYCRKHDAFAQSHV 174
Query: 76 KELTITTSAPPNLSCGVTHTSPAL---VFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
KE T+ + + C V + L V S GG+ GN FH++ D +P FIT S
Sbjct: 175 KEWTLRPLSSSSPHCTVNSSGSGLTAFVLSTGGFTGNLFHDYTDVLIPAFIT--SRRFHG 232
Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
+V ++ W +Y ++ + + + +++I+ C+ +G H + ++ +
Sbjct: 233 EVQFLVSSYKSWWTSRYLQIFQQLSNYEVLDIDADDEVRCYDKAVVGPTFHKELGVDASK 292
Query: 193 LPKPK-TLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLREV 246
P P ++VDF++ L A+ E S +P+L++++R + R LN R +
Sbjct: 293 TPSPGYSMVDFRAMLRQAFGLERAAAEPSGDRWDIRRRPRLLIISR--KHTRAFLNERAM 350
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
A LGFDV + EP+ ST + R ++S MVGVHGAGLT+ LFL G+VL+Q+VP
Sbjct: 351 ADMAMSLGFDVRVGEPDVSTDVGKFARLVNSADVMVGVHGAGLTNMLFLPAGAVLVQIVP 410
Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
G +WL+ F++P+ + + YLEY I+ +E++L E+Y +D VL++P + W +
Sbjct: 411 YGGLEWLARGTFKEPSPDMRIHYLEYMIQLDETTLSEQYAKDDPVLRDPASIHKQGWDKL 470
Query: 366 R-VYLKTQNVKLDIDRFRIYLKDAYK 390
+ VYL QNV+ + R +I +A K
Sbjct: 471 KTVYLDKQNVRPHLGRLKITFMEALK 496
>gi|238009766|gb|ACR35918.1| unknown [Zea mays]
Length = 548
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 127/385 (32%), Positives = 202/385 (52%), Gaps = 25/385 (6%)
Query: 25 LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITT-- 82
+C ++G PK T + V + + +PY RK D + V E+T+ +
Sbjct: 174 VCELSGDVRVSPKQRTVYLVNPSGGGGFDERGEKRVRPYARKDD-FLLPGVTEVTVKSVP 232
Query: 83 SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCN 142
SA C HT+PA+VFS GY NFFH+ D +PLF+T +H D DV L + +
Sbjct: 233 SAAVAPKCTRQHTAPAVVFSVAGYTDNFFHDMADALIPLFLT-TAHL-DGDVQLLVTNYK 290
Query: 143 DQWARKYAELLPRFTRHPIININ---NQTITHCFQSVT-LGLISHGRMVINP--TLLPKP 196
W +KY +L + +RH +++ + HCF + LGL ++++P T P+
Sbjct: 291 PWWVQKYTPVLRKLSRHDVMSFDAADEDDAVHCFPAGAFLGLYRDRDLILSPHPTRNPRN 350
Query: 197 KTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNRNARVGRTILNLREVKKAAEEL 253
T+VDF FL A + ++ +P++++V+R R +LNL EV AA+EL
Sbjct: 351 LTMVDFARFLRGALALPRDRPAALGEQPGARPRMLIVSRAGT--RRLLNLDEVAAAADEL 408
Query: 254 GFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-T 309
GF+VT E + AD F +++ +VGVHGAGLT+ +FL +V++Q+VP G
Sbjct: 409 GFNVTSAE-----AGADVPAFAAQVNAADVLVGVHGAGLTNQVFLPTEAVVVQIVPWGKM 463
Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYL 369
W++T ++ +PA LGL YLEY + EE+SL +KY +V +P A W + +
Sbjct: 464 DWMATNFYARPAAGLGLRYLEYYVGAEETSLKDKYPREHVVFSDPMALHKQGWQALAQTV 523
Query: 370 KTQNVKLDIDRFRIYLKDAYKKAKK 394
Q+V +++ FR L A K ++
Sbjct: 524 MKQDVTVNLTMFRPVLLQALDKLQQ 548
>gi|413953958|gb|AFW86607.1| hypothetical protein ZEAMMB73_638297 [Zea mays]
Length = 460
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 124/400 (31%), Positives = 204/400 (51%), Gaps = 28/400 (7%)
Query: 7 ALSPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIK 59
A P ITCD + + +C + G PK T + V + + P + K
Sbjct: 65 ATKPEAKITCDENGVDEGFPYARPPVCELAGDIRVSPKEKTMYFVNPSGAGPFDSNGEKK 124
Query: 60 TQPYPRKSDKSAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDC 117
+P+ RK D + V E+TI +SA C H P +VFS GY NFFH+ D
Sbjct: 125 IRPFARKDD-FLLPGVVEVTIKSVSSAAAAPRCTRRHDVPVVVFSVAGYTDNFFHDNTDV 183
Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
+PLF+T +H +V L + + W RK+ LL + + + +I+ HCF++
Sbjct: 184 LIPLFLT-TAHL-QGEVQLLVTNYKPWWVRKFTPLLRKLSNYDVIDFGKDDEVHCFRAGH 241
Query: 178 LGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNR 232
LG+ ++I+P T P+ ++VD+ FL A+N + + KP+++++ R
Sbjct: 242 LGMYRDRDLIISPHPTRNPRNYSMVDYNRFLRRAFNLPRDAPAVLGEETSAKPQMLIIER 301
Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGL 289
R +LNL EV E LGF VT+ E + AD F +++ ++ VHGAGL
Sbjct: 302 KGT--RKLLNLPEVSALCEALGFAVTVAEAD-----ADVRVFAEKVNAADVLLAVHGAGL 354
Query: 290 THSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
T+ +FL G+VL+Q+VP G W++T ++ +PAR + L Y+EY + +EE++L +KY +
Sbjct: 355 TNQIFLPTGAVLVQIVPWGKMDWMATNFYGQPARDMRLRYVEYYVSEEETTLKDKYPRDH 414
Query: 349 LVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
V K+P W + + Q+V +++ RF+ +L A
Sbjct: 415 YVFKDPMHIHAQGWPAIAEIIMKQDVMVNMTRFKPFLLQA 454
>gi|300681547|emb|CBH32644.1| glycosyltransferase, HGA-like, putative,expressed [Triticum
aestivum]
Length = 450
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 133/396 (33%), Positives = 208/396 (52%), Gaps = 31/396 (7%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S D+C + G T ++ H + + + S R+K P+ RK+D A++ V
Sbjct: 73 CDMSSLRSDVCELRGDVRVILSNITIIALLHPNVSVRRRSWRMK--PHARKNDGHALANV 130
Query: 76 KEL--TITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
E+ ++T S+P C +PA+VFS GGY GN FH+F D +PLFIT S F D
Sbjct: 131 TEVLVSVTPSSPYAPGCTAESAAPAVVFSVGGYAGNMFHDFTDVLIPLFIT-ASRF-RSD 188
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTI-THCFQSVTLGLISHGRMVINPT- 191
V L +++ W KY LL +RH +I+++ Q+ C+ V +GL H M I+
Sbjct: 189 VHLLVSNAPPWWLDKYGPLLRGLSRHAVIDMDRQSEEVLCYPHVVVGLSFHKEMSIDTVK 248
Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSSSFHHT--------KPKLVLVNRNARVGRTILNL 243
+ ++ DF +Y +T+ H +P+L++++R + R N+
Sbjct: 249 TVGGHYSMADFARLARRSYGLERDTAIRLLHGGDNIKSPRRPRLLIISR--KTTRAFTNM 306
Query: 244 REVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
V +AA LG++V + E E+ + L R ++SC +VGVHGAGLT+ +FL PG+V++Q
Sbjct: 307 GAVAQAAAMLGYEVIVGEAEQHSDLPAFARLVNSCGVLVGVHGAGLTNLVFLPPGAVVVQ 366
Query: 304 VVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANW 362
VVP+G + ++ F PA +GL Y++Y I ES+L E Y + VL+ A A
Sbjct: 367 VVPLGGLEAMAGDDFGVPAGDMGLGYVQYGIAVGESTLAELYPRDHRVLR-----ALALR 421
Query: 363 SNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
S YL QNV LD+ RF A +A +L+ +
Sbjct: 422 SE---YLVGQNVTLDVARF----SGALSRALELLHR 450
>gi|413947214|gb|AFW79863.1| hypothetical protein ZEAMMB73_417176 [Zea mays]
Length = 451
Score = 197 bits (502), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 123/384 (32%), Positives = 199/384 (51%), Gaps = 20/384 (5%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKT-STFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
+ C+ RS D C ++G D +T T SV ++ S+ +PY R+
Sbjct: 72 VVCNMEGRS-DTCEVDG----DVRTNGTALSVTLVPASRSERH-EWMIRPYSRRFASVRK 125
Query: 73 SKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
V +L A P C VTH PA++F+ GGY GN++H++ D VPLF+ + +
Sbjct: 126 VTVTQLQDRADAAP---CAVTHDVPAVLFAIGGYAGNYWHDYADILVPLFVASRRYNGEV 182
Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
+++ +W KY L + + ++++ CF VT+GL I P L
Sbjct: 183 KFLISNIRFQPRWLAKYKAFLQGLSLYDAVDMDGDAQVRCFPHVTVGLRLDKEFSIVPEL 242
Query: 193 LPKPK--TLVDFQSFLANAYNENTNTSSSFH----HTKPKLVLVNRNARVGRTILNLREV 246
+P + ++ DF FL Y +++S H KP+L+L++R R I N EV
Sbjct: 243 VPGGRRLSMADFTRFLRETYALPRGSAASRDREQPHKKPRLLLIHRGHY--RRITNEPEV 300
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+AAE GF+ + E + A+ R ++S ++GVHGAGLT+++FL PG VL+QVVP
Sbjct: 301 ARAAEAAGFEAVVAELRGDATEAEQARVVNSFDVVLGVHGAGLTNAVFLPPGGVLIQVVP 360
Query: 307 IG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
G ++++ F +PA +GL+YL+Y + EESSL+E G +K+P + + W +
Sbjct: 361 YGKMEYIARAEFSEPATDMGLKYLDYSVSAEESSLMETLGPEHPAVKDPDSVHRSGWDQV 420
Query: 366 -RVYLKTQNVKLDIDRFRIYLKDA 388
+YL QNV++++ RF L A
Sbjct: 421 FELYLAKQNVRINVTRFAPTLAQA 444
>gi|357144127|ref|XP_003573181.1| PREDICTED: uncharacterized protein LOC100825423 isoform 1
[Brachypodium distachyon]
Length = 565
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 126/400 (31%), Positives = 209/400 (52%), Gaps = 33/400 (8%)
Query: 13 SITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDS-TPSQPSLRIKTQPYP 64
++TCD + + +C ++G PK T + V + + T + +PY
Sbjct: 175 NMTCDENGVDEGFPYARPAVCELSGDIRVSPKQKTMYLVNPSGAATGFDEKGEKRLRPYA 234
Query: 65 RKSDKSAMSKVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
R +D + V E+T+ +T+A P C H PA+VFS GY NFFH+ D +P
Sbjct: 235 R-NDDFLLPGVVEVTVKSVPSTAAAPQ--CTKQHRVPAVVFSVAGYTDNFFHDNTDALIP 291
Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
L++T +H +V L I + W +KY +L + + + +IN + HCF LGL
Sbjct: 292 LYVT-TAHL-KGEVQLLITNYKPWWVQKYTPVLRKLSSYDVINFDEDAGVHCFHEGYLGL 349
Query: 181 ISHGRMVINP--TLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNAR 235
++I+P T P+ T+VD+ FL + E +P++++++R+
Sbjct: 350 YRDRDLIISPHPTRNPRNYTMVDYNRFLRGVFELRRERPAVLGEEPGMRPRMLIISRSGT 409
Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHS 292
R +LNL EV A ELGF+VT+ E + AD F ++S ++ VHGAGLT+
Sbjct: 410 --RKLLNLDEVAAEASELGFNVTVAE-----AGADVPAFAALVNSADVLLAVHGAGLTNQ 462
Query: 293 LFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
+FL +V++Q+VP G W +T ++ +PAR + L Y+EY + +EE+SL +KY + +V
Sbjct: 463 IFLPTDAVVLQIVPWGNMDWQATNFYGQPAREMQLRYVEYYVGEEETSLKDKYPRDHMVF 522
Query: 352 KNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKK 391
K+P+A W + + Q+V+++I RFR +L A K
Sbjct: 523 KDPKALHKQGWQTLANTIMKQDVQVNITRFRPFLLQAIDK 562
>gi|55792424|gb|AAV65332.1| HGA2 [Hordeum vulgare]
Length = 529
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 113/340 (33%), Positives = 191/340 (56%), Gaps = 27/340 (7%)
Query: 61 QPYPRKSDKSAMSKVKELTIT-----TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFM 115
QPY R + M+ + +T+T ++A P +C VTH PA+V GG+ GN+FH+F
Sbjct: 184 QPYARYN----MTGIPNITVTQLDSTSAASPAPACTVTHRVPAIVIGLGGHLGNYFHDFS 239
Query: 116 DCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQS 175
D VPLF+ + +V L ++ W KY ++ R T++ ++++++ CF+
Sbjct: 240 DALVPLFVASRRY--GGEVQLLASNIQPWWLGKYEAVVRRLTKYDVVDLDHDDQIRCFRR 297
Query: 176 VTLGLISHGRMVINPTLLPK--PKTLVDFQSFLANAYN----------ENTNTSSSFHHT 223
VT+GL H I P L+P P ++ +F +FL Y+ + ++
Sbjct: 298 VTVGLNMHREFDIVPELVPGDVPLSMANFTAFLRETYSLPRAAPIRLTKGSSPPVDKKKK 357
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
KP+L+L++R R ++N+ E+ KAAE+ GF+V I +P + + + ++S ++G
Sbjct: 358 KPRLMLLDRGHY--RKLVNVPEIVKAAEKAGFEVVIADPRFNVRVKELAMSVNSFDVLLG 415
Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVE 342
VHGAGLT++ FL PG+V++QVVP G + ++ F PA +GL YL+Y I EES+L+E
Sbjct: 416 VHGAGLTNAAFLPPGAVVIQVVPYGKLEPMAQREFGDPAADMGLRYLQYSITAEESTLLE 475
Query: 343 KYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRF 381
G + +K+P + + W + YL QNV++D++RF
Sbjct: 476 TLGPDHPAIKDPDSVHRSGWDKVAEFYLGKQNVRVDVERF 515
>gi|357144129|ref|XP_003573182.1| PREDICTED: uncharacterized protein LOC100825423 isoform 2
[Brachypodium distachyon]
Length = 542
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 123/397 (30%), Positives = 206/397 (51%), Gaps = 27/397 (6%)
Query: 13 SITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDS-TPSQPSLRIKTQPYP 64
++TCD + + +C ++G PK T + V + + T + +PY
Sbjct: 152 NMTCDENGVDEGFPYARPAVCELSGDIRVSPKQKTMYLVNPSGAATGFDEKGEKRLRPYA 211
Query: 65 RKSDKSAMSKVKELTI----TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
R +D + V E+T+ +T+A P C H PA+VFS GY NFFH+ D +P
Sbjct: 212 R-NDDFLLPGVVEVTVKSVPSTAAAPQ--CTKQHRVPAVVFSVAGYTDNFFHDNTDALIP 268
Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
L++T +H +V L I + W +KY +L + + + +IN + HCF LGL
Sbjct: 269 LYVT-TAHL-KGEVQLLITNYKPWWVQKYTPVLRKLSSYDVINFDEDAGVHCFHEGYLGL 326
Query: 181 ISHGRMVINP--TLLPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNAR 235
++I+P T P+ T+VD+ FL + E +P++++++R+
Sbjct: 327 YRDRDLIISPHPTRNPRNYTMVDYNRFLRGVFELRRERPAVLGEEPGMRPRMLIISRSGT 386
Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R +LNL EV A ELGF+VT+ E + ++S ++ VHGAGLT+ +FL
Sbjct: 387 --RKLLNLDEVAAEASELGFNVTVAE--AGADVPAFAALVNSADVLLAVHGAGLTNQIFL 442
Query: 296 RPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
+V++Q+VP G W +T ++ +PAR + L Y+EY + +EE+SL +KY + +V K+P
Sbjct: 443 PTDAVVLQIVPWGNMDWQATNFYGQPAREMQLRYVEYYVGEEETSLKDKYPRDHMVFKDP 502
Query: 355 QAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKK 391
+A W + + Q+V+++I RFR +L A K
Sbjct: 503 KALHKQGWQTLANTIMKQDVQVNITRFRPFLLQAIDK 539
>gi|326504684|dbj|BAK06633.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 522
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 126/390 (32%), Positives = 202/390 (51%), Gaps = 14/390 (3%)
Query: 9 SPAKS-ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDST-PSQPSLRIKTQPYPRK 66
+P +S ++C+ S ++C + G K +T + V +D P ++ I+ PYPRK
Sbjct: 132 APTRSKLSCNLSSYRTNMCAMQGDVRVHGKAATVYVVSASDDNRPDNGTITIR--PYPRK 189
Query: 67 SDKSAMSKVKELTITTSAPPNLS---CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
+ M V+E+TI APP C VT+ PA+VFS GGY N FH D +PL+
Sbjct: 190 WETPTMQLVREVTIRWRAPPGPGAPRCTVTYDVPAVVFSTGGYGVNIFHAITDIIIPLYN 249
Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
T + D V L + + +W KY L + +PII+++ CF S +G SH
Sbjct: 250 TAREY--DGRVRLIATNYDRKWIAKYRHALSLISIYPIIDLDADNEVRCFPSAHVGTESH 307
Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH-TKPKLVLVNRNARVGRTILN 242
+ I+ L K T++ F+ + +AY+ + +H +KP+LV+V R R R + N
Sbjct: 308 KELGIDSALSGKGYTMMGFRGLIRSAYSLKREWVTPINHGSKPRLVMVLR--RNSRALTN 365
Query: 243 LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
+V AA E+GF+V PE L ++SC +VGVHGAGLT+ +FL ++
Sbjct: 366 EAQVVAAAAEVGFEVVAAGPEVVRDLGKFAETVNSCDVLVGVHGAGLTNMVFLPRNGTVL 425
Query: 303 QVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGAN 361
Q+VP G +W + + +P +GL Y EY++ EE++L + Y N V +P +
Sbjct: 426 QIVPWGEMKWPAWTSYGEPVAPMGLRYAEYEVTAEETTLKDVYPRNHTVFTDPVSIHKQG 485
Query: 362 WSNM-RVYLKTQNVKLDIDRFRIYLKDAYK 390
++ + +L QN+ LD+ RF L+ Y+
Sbjct: 486 FNMLWETFLNGQNLTLDVHRFTGVLQHIYQ 515
>gi|326489348|dbj|BAK01657.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326527847|dbj|BAK08167.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 527
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 125/395 (31%), Positives = 199/395 (50%), Gaps = 32/395 (8%)
Query: 9 SPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQ-PSLRIKTQPYPRKS 67
+P+K I CD S YD+C I+G A T V D + P IK Q RK
Sbjct: 130 APSKPI-CDLSDPRYDICEISGDARAIGTNRTVLYVPPVDERGADGPEWAIKDQS--RKY 186
Query: 68 DKSAMS-KVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT 124
+ M KVK L+ S AP C H PA+VF+ G GN +H+ D +PLFIT
Sbjct: 187 LEDIMEVKVKTLSAAQSLVAP---ECTSKHDVPAIVFAMNGIIGNPWHDLSDVLIPLFIT 243
Query: 125 INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHG 184
++ D +V + + + KY +L +R+ I++ N + C+ + +GL SHG
Sbjct: 244 ARAY--DGEVQFLVTELQPWFVEKYRLILTNLSRYDIVDFNKDSGVRCYPHIVVGLHSHG 301
Query: 185 RMVINPTLLPKPKTLVDFQSFL-----------------ANAYNENTNTSSSFHHTKPKL 227
+ I+P P+ TL+DF+ ++ AN N + ++ KP+L
Sbjct: 302 DLDIDPARTPRNYTLLDFRLYIRDIFSLPSKGIGIPYKEANKKNSTDDNTTVTEKQKPRL 361
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+L+NR + R +NL E+ A + GF+V + EP SL + R + S ++GVHGA
Sbjct: 362 LLINRG--MSRKFVNLPEITAAVQAAGFEVLVVEPHRDMSLEEFARMLDSSDVLMGVHGA 419
Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
LT+ FLR +V++QVV +G + + Y+ A + L+++EY I EES+L EKYG +
Sbjct: 420 ALTNFFFLRTNAVVLQVVGLGLEREAMHYYGAQAMKVMLQHVEYFIAAEESTLYEKYGKD 479
Query: 348 DLVLKNPQAFAGANWSNMRVYL-KTQNVKLDIDRF 381
+++P + W + Y Q+++L++ RF
Sbjct: 480 HPAVRDPDSIHKQGWQGAKQYFWAEQDIRLNVTRF 514
>gi|326531726|dbj|BAJ97867.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 521
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 125/395 (31%), Positives = 199/395 (50%), Gaps = 32/395 (8%)
Query: 9 SPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQ-PSLRIKTQPYPRKS 67
+P+K I CD S YD+C I+G A T V D + P IK Q RK
Sbjct: 124 APSKPI-CDLSDPRYDICEISGDARAIGTNRTVLYVPPVDERGADGPEWAIKDQS--RKY 180
Query: 68 DKSAMS-KVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT 124
+ M KVK L+ S AP C H PA+VF+ G GN +H+ D +PLFIT
Sbjct: 181 LEDIMEVKVKTLSAAQSLVAP---ECTSKHDVPAIVFAMNGIIGNPWHDLSDVLIPLFIT 237
Query: 125 INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHG 184
++ D +V + + + KY +L +R+ I++ N + C+ + +GL SHG
Sbjct: 238 ARAY--DGEVQFLVTELQPWFVEKYRLILTNLSRYDIVDFNKDSGVRCYPHIVVGLHSHG 295
Query: 185 RMVINPTLLPKPKTLVDFQSFL-----------------ANAYNENTNTSSSFHHTKPKL 227
+ I+P P+ TL+DF+ ++ AN N + ++ KP+L
Sbjct: 296 DLDIDPARTPRNYTLLDFRLYIRDIFSLPSKGIGIPYKEANKKNSTDDNTTVTEKQKPRL 355
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+L+NR + R +NL E+ A + GF+V + EP SL + R + S ++GVHGA
Sbjct: 356 LLINRG--MSRKFVNLPEITAAVQAAGFEVLVVEPHRDMSLEEFARMLDSSDVLMGVHGA 413
Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
LT+ FLR +V++QVV +G + + Y+ A + L+++EY I EES+L EKYG +
Sbjct: 414 ALTNFFFLRTNAVVLQVVGLGLEREAMHYYGAQAMKVMLQHVEYFIAAEESTLYEKYGKD 473
Query: 348 DLVLKNPQAFAGANWSNMRVYL-KTQNVKLDIDRF 381
+++P + W + Y Q+++L++ RF
Sbjct: 474 HPAVRDPDSIHKQGWQGAKQYFWAEQDIRLNVTRF 508
>gi|326490115|dbj|BAJ94131.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 394
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 123/387 (31%), Positives = 206/387 (53%), Gaps = 31/387 (8%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ C S SY C ++G + T V T T + QPY R + M+
Sbjct: 6 VVCRASPFSY-TCDVSGDVRTNGTAHTVTLVPATSLTERR---EWSIQPYARYN----MT 57
Query: 74 KVKELTIT-----TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
+ +T+T ++A P +C VTH PA+V GG+ GN+FH+F D VPLF+ +
Sbjct: 58 GIPNITVTQLDSTSAASPAPACTVTHRVPAIVIGLGGHLGNYFHDFSDALVPLFVASRRY 117
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
+V L ++ W KY ++ R T++ ++++++ CF+ VT+GL H I
Sbjct: 118 --GGEVQLLASNIQPWWLGKYEAVVRRLTKYDVLDLDHDDQIRCFRRVTVGLNMHREFDI 175
Query: 189 NPTLLPK--PKTLVDFQSFLANAYN----------ENTNTSSSFHHTKPKLVLVNRNARV 236
P L+P P ++ +F +FL Y+ + ++ KP+L+L++R
Sbjct: 176 VPELVPGDVPLSMANFTAFLRETYSLPRAAPIRLTKGSSPPVDKKKKKPRLMLLDRGHY- 234
Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R ++N+ E+ KAAE+ GF+V I +P + + + ++S ++GVHGAGLT++ FL
Sbjct: 235 -RKLVNVPEIVKAAEKAGFEVVIADPRFNVRVKELAMSVNSFDVLLGVHGAGLTNAAFLP 293
Query: 297 PGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
PG+V++QVVP G + ++ F PA +GL YL+Y I EES+L+E G + +K+P
Sbjct: 294 PGAVVIQVVPYGKLEPMAQREFGDPAADMGLRYLQYSITAEESTLLETLGPDHPAIKDPD 353
Query: 356 AFAGANWSNM-RVYLKTQNVKLDIDRF 381
+ + W + YL QNV++D++RF
Sbjct: 354 SVHRSGWDKVAEFYLGKQNVRVDVERF 380
>gi|56409856|emb|CAI30076.1| glycosyltransferase [Triticum aestivum]
Length = 525
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 125/394 (31%), Positives = 207/394 (52%), Gaps = 22/394 (5%)
Query: 9 SPAKSITCD-RSHRSYDLCLI---NGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPY 63
+PAK CD + + D+C + +G T T +S S P+ + + PY
Sbjct: 134 APAKKPACDIQGPWASDVCDLAGGSGGVRIHGSTHTVLVPPTIESGGSNPNPQEWRVLPY 193
Query: 64 PRKSDKSAMSKVKELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
RK MS +KE+T+ +A C VT PALVF+ GG GN++H+F D +PL
Sbjct: 194 SRKH----MSGIKEITVRELPTAADAPRCAVTSQVPALVFAMGGLTGNYWHDFSDVMIPL 249
Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
++ + F D +V L + + +A KY ++ R +R+ +++++ CF S +G+
Sbjct: 250 YLQA-ARF-DGEVQLVVTNIQPWYAGKYRHIIARLSRYDVVDMDKGRQVRCFPSAVVGIR 307
Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAYN-----ENTNTSSSFHHTKPKLVLVNRNARV 236
H I+PT P ++ +F FL N + T+ KP++++++R R
Sbjct: 308 MHKEFSIDPTKEPTGHSMPEFTKFLRNVFALPRAAPMRVTAGLISDKKPRMMIISR--RH 365
Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R ++N+ EV A+ +GF+V I +P + + D R +++ ++GVHGAGLT+SLFL
Sbjct: 366 PRKLVNVAEVVALAKRIGFEVVIGDPPFNVDVGDFAREVNAADVLMGVHGAGLTNSLFLP 425
Query: 297 PGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
G+V +QV P G + + V F PA +GL+Y+ Y EES+LV+ G + +K+P+
Sbjct: 426 TGAVFIQVNPFGKMEHIGEVDFGTPAVDMGLKYMAYSCGMEESTLVDTLGRDHPAVKDPE 485
Query: 356 AFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
+ + WS + YL Q+VKLD+ R L A
Sbjct: 486 SIHRSGWSKVAEYYLGKQDVKLDLQRVEPVLIKA 519
>gi|413953955|gb|AFW86604.1| hypothetical protein ZEAMMB73_638297 [Zea mays]
Length = 528
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 125/401 (31%), Positives = 207/401 (51%), Gaps = 29/401 (7%)
Query: 6 AALSPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRI 58
A ++ AK ITCD + + +C + G PK T + V + + P +
Sbjct: 133 AIITEAK-ITCDENGVDEGFPYARPPVCELAGDIRVSPKEKTMYFVNPSGAGPFDSNGEK 191
Query: 59 KTQPYPRKSDKSAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMD 116
K +P+ RK D + V E+TI +SA C H P +VFS GY NFFH+ D
Sbjct: 192 KIRPFARKDD-FLLPGVVEVTIKSVSSAAAAPRCTRRHDVPVVVFSVAGYTDNFFHDNTD 250
Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
+PLF+T +H +V L + + W RK+ LL + + + +I+ HCF++
Sbjct: 251 VLIPLFLT-TAHL-QGEVQLLVTNYKPWWVRKFTPLLRKLSNYDVIDFGKDDEVHCFRAG 308
Query: 177 TLGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVN 231
LG+ ++I+P T P+ ++VD+ FL A+N + + KP+++++
Sbjct: 309 HLGMYRDRDLIISPHPTRNPRNYSMVDYNRFLRRAFNLPRDAPAVLGEETSAKPQMLIIE 368
Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAG 288
R R +LNL EV E LGF VT+ E + AD F +++ ++ VHGAG
Sbjct: 369 RKGT--RKLLNLPEVSALCEALGFAVTVAEAD-----ADVRVFAEKVNAADVLLAVHGAG 421
Query: 289 LTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
LT+ +FL G+VL+Q+VP G W++T ++ +PAR + L Y+EY + +EE++L +KY +
Sbjct: 422 LTNQIFLPTGAVLVQIVPWGKMDWMATNFYGQPARDMRLRYVEYYVSEEETTLKDKYPRD 481
Query: 348 DLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
V K+P W + + Q+V +++ RF+ +L A
Sbjct: 482 HYVFKDPMHIHAQGWPAIAEIIMKQDVMVNMTRFKPFLLQA 522
>gi|242060158|ref|XP_002451368.1| hypothetical protein SORBIDRAFT_04g000840 [Sorghum bicolor]
gi|241931199|gb|EES04344.1| hypothetical protein SORBIDRAFT_04g000840 [Sorghum bicolor]
Length = 571
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 128/402 (31%), Positives = 211/402 (52%), Gaps = 31/402 (7%)
Query: 14 ITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRK 66
+TCD + + +C ++G PK T + V + + S + +PY RK
Sbjct: 180 MTCDENGVDEGFPYARPTVCELSGDVRVSPKQKTVYLVNPSGAGGFDESGEKRLRPYARK 239
Query: 67 SDKSAMSKVKELTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT 124
D M V E+T+ + SA C HT PA++FS GY NFFH+ +D VPLF+T
Sbjct: 240 DD-FLMPGVTEVTVKSVPSAAVAPKCTKHHTVPAVLFSIAGYTDNFFHDMVDAMVPLFLT 298
Query: 125 INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTI--THCFQSVT-LGLI 181
SH +V L I + W +KY LL + + H +IN + + HCF + +GL
Sbjct: 299 -TSHL-KGEVQLLITNYKPWWVQKYTPLLRKMSLHDVINFDAEDADDVHCFPAGAFVGLY 356
Query: 182 SHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARV 236
++++P T P+ T+VDF F+ A + + +P++++++R
Sbjct: 357 RDRDLILSPHPTRNPRNLTMVDFSRFMRGALALPRDRPAVLGEAPGMRPRMLIISRAGT- 415
Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHSL 293
R +LNL EV K A+ELGF+VTI E + AD F +++ +VGVHGAGL + +
Sbjct: 416 -RRLLNLDEVAKVADELGFNVTIAE-----AGADVPAFAAQVNAADVLVGVHGAGLANVV 469
Query: 294 FLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLK 352
FL +V++Q+VP G W++T ++ +PA + L YLEY + +EE+SL +KY + +V +
Sbjct: 470 FLPTEAVVVQIVPWGKMDWMATNFYARPAAGMALRYLEYYVGEEETSLKDKYPRDHVVFR 529
Query: 353 NPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
+P + W + + Q+V +++ +FR L A K ++
Sbjct: 530 DPMSLHTQGWQALAQTIMKQDVAVNLTKFRPVLLQALDKLQQ 571
>gi|31296713|gb|AAP46642.1| HGA4 [Hordeum vulgare]
Length = 525
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 123/392 (31%), Positives = 206/392 (52%), Gaps = 23/392 (5%)
Query: 9 SPAKSITCD-RSHRSYDLCLI---NGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPY 63
+PAK CD + + D+C + +G T +S S P+ + + PY
Sbjct: 133 APAKKPACDIQGPWASDVCDLAGGSGGVRIHGSAHTVLVPPTIESGGSNPNPQEWRVLPY 192
Query: 64 PRKSDKSAMSKVKELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
RK MS +KE+T+ A P + C VT PALVF+ GG GN++H+F D +PL
Sbjct: 193 SRKH----MSGIKEITVRELATPADAPRCAVTSQVPALVFAMGGLTGNYWHDFSDVMIPL 248
Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
++ S F D +V L + + +A KY +++ R +R+ +++++ CF S +G+
Sbjct: 249 YLQA-SRF-DGEVQLVVTNYQRWYAGKYRDIIARLSRYDVVDMDKDDQVRCFPSAVVGIR 306
Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAYN------ENTNTSSSFHHTKPKLVLVNRNAR 235
I+P P ++ +F FL N + S KP++++++R R
Sbjct: 307 MPKEFSIDPAKEPTGHSMPEFTKFLRNVFALPRAGPMGVTAGPSDGKKKPRMMIISR--R 364
Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R ++N+ EV A+ +GF+V I +P + +AD R +++ ++GVHGAG+T+SLFL
Sbjct: 365 HPRKLVNVDEVVALAKRIGFEVVIGDPPFNVDVADFAREVNAADVLMGVHGAGMTNSLFL 424
Query: 296 RPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
G+V +QV P G + + V F PA +GL+Y+ Y EES+LV+ G + +K+P
Sbjct: 425 PTGAVFIQVNPFGKMEHIGEVDFGTPAVDMGLKYMSYSCGMEESTLVDTLGRDHPAVKDP 484
Query: 355 QAFAGANWSNM-RVYLKTQNVKLDIDRFRIYL 385
++ + W+ + YL Q+VKLD+ RF L
Sbjct: 485 ESIHRSGWNKVAEFYLGKQDVKLDLQRFEPVL 516
>gi|357127198|ref|XP_003565271.1| PREDICTED: uncharacterized protein LOC100846082 [Brachypodium
distachyon]
Length = 456
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 136/403 (33%), Positives = 206/403 (51%), Gaps = 43/403 (10%)
Query: 13 SITCDRSHRSYDLCLINGSA-LFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSA 71
++ CD S D+C + G +F P T+ F T + S R+K P+ RK+D+ A
Sbjct: 68 ALFCDMSSPRSDVCELKGDVRVFLPNTTIVF----LHPTIRRRSWRMK--PHARKNDRHA 121
Query: 72 MSKVKELTITTS-------APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT 124
+S V E++++ AP C +PA++FSAGGY GN FH+F D VPLFIT
Sbjct: 122 LSSVTEVSLSVVASSSLRHAPSG--CTAESAAPAVIFSAGGYAGNMFHDFTDVLVPLFIT 179
Query: 125 INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTI-THCFQSVTLGLISH 183
S F +V ++D W KY LL +RH II++N ++ C++ V +GL H
Sbjct: 180 -ASRF-HGEVHFLVSDAPSWWLDKYQPLLRMLSRHAIIDMNRRSSEVLCYRHVIVGLRFH 237
Query: 184 GRMVINPTLLPKPK-TLVDFQSFLANAY------------NENTNTSSSFH-HTKPKLVL 229
M I+ + ++ DF +Y N+N N S H +P+L++
Sbjct: 238 KEMSIDAAKTVGGRYSMADFARLARTSYGLERDRAIQLPRNDNNNGGSGVESHHRPRLLI 297
Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGL 289
++R A R N+ + + A LG++V + E ++ + LA R ++SC +V +HGA L
Sbjct: 298 ISRKAT--RAFTNVDAIARTASILGYNVVVGEADQQSDLAALARLVNSCDVLVCLHGAVL 355
Query: 290 THSLFLRPGSVLMQVVPIGTQWLSTV-YFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
T+ +FL G+V++QVVP+G + V F PAR +GL Y++Y I EESS +
Sbjct: 356 TNLVFLPAGAVVVQVVPLGGLEAAAVEAFGAPARDMGLGYVQYNIAVEESSQAAR----- 410
Query: 349 LVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYK 390
VL P A W + YL QNV LD+ RFR L A +
Sbjct: 411 -VLAEPPAVRKEGWLALWSAYLVGQNVTLDVARFRGALSRALE 452
>gi|413934853|gb|AFW69404.1| hypothetical protein ZEAMMB73_073843 [Zea mays]
Length = 331
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 116/328 (35%), Positives = 182/328 (55%), Gaps = 15/328 (4%)
Query: 72 MSKVKELTITTSAPPNLS-----CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN 126
M V+E T+ PN + C TH+ P +FS+GG+ GN +H++ D VPLF + N
Sbjct: 1 MDDVREFTLVPFGGPNDTAVPPLCTRTHSVPGFLFSSGGFAGNLYHDYADVLVPLFASTN 60
Query: 127 SHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRM 186
H + V +AD D WA K+ L + +R+ +I++NN HCF + +G H M
Sbjct: 61 -HLGGE-VQFLLADIKDWWADKFRPLFRQLSRYDVIDVNNDREVHCFPRIIIGSTFHRAM 118
Query: 187 VINPTLLPKPKTLVDFQSFLANAYNEN----TNTSSSFHHTKPKLVLVNRNARVGRTILN 242
I+P+ P T+ DF+ L A+ + + + +P+L++++R + R +N
Sbjct: 119 GIDPSRSPGGVTVADFKRLLRRAFRLERAVASRSGAPRRRDRPRLLIISRKS--SRRFVN 176
Query: 243 LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
R + +AA FDV I EP+ T + + R ++S M+GVHGAGLT+ +FL +VL+
Sbjct: 177 ERAMARAAAAARFDVRIAEPDNHTDMPNFARLVNSADVMMGVHGAGLTNMVFLPSRAVLV 236
Query: 303 QVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGAN 361
QVVP G +WL+ V F+ PAR + + Y+EY + EESSL + Y + LK+P
Sbjct: 237 QVVPFGGLEWLTRVTFKDPARDMDVTYMEYNVSLEESSLRDLYPEDHFYLKHPYDVHKKG 296
Query: 362 WSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
W ++ VYL QNV+L++ RF L+ A
Sbjct: 297 WDAIKTVYLDKQNVRLNLTRFTRTLEQA 324
>gi|326515946|dbj|BAJ87996.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516144|dbj|BAJ88095.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 525
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 126/397 (31%), Positives = 205/397 (51%), Gaps = 33/397 (8%)
Query: 9 SPAKSITCD-RSHRSYDLC---------LINGSALFDPKTSTFFSVGHTDSTPSQPSLRI 58
+PAK CD + + D+C I+GSA T S G S P+ R+
Sbjct: 133 APAKKPACDIQGPWASDVCDLAGGSGGVRIHGSAHTVLVPPTIESGG---SNPNPQEWRV 189
Query: 59 KTQPYPRKSDKSAMSKVKELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMD 116
PY RK MS +KE+T+ P + C VT PALVF+ GG GN++H+F D
Sbjct: 190 --LPYSRKH----MSGIKEITVRELPTPADAPRCAVTSQVPALVFAMGGLTGNYWHDFSD 243
Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
+PL++ + D +V L + + +A KY ++ R +R+ +++++ CF S
Sbjct: 244 VMIPLYLQASCF--DGEVQLVVTNYQRWYAGKYRHIIARLSRYDVVDMDKDDQVRCFPSA 301
Query: 177 TLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN------ENTNTSSSFHHTKPKLVLV 230
+G+ H I+P P ++ +F FL N + S KP+++++
Sbjct: 302 VVGIRMHKEFSIDPAKEPTGHSMPEFTKFLRNVFALPRAAPMRVTAGPSDGKKKPRMMII 361
Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
+R R R ++N+ EV A+ +GF+V I +P + + D R +++ ++GVHGAGLT
Sbjct: 362 SR--RHPRKLVNVDEVVALAKRIGFEVVIGDPPFNVDVGDFAREVNAADVLMGVHGAGLT 419
Query: 291 HSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDL 349
+SLFL G+V +QV P G + + V F PA +GL+Y+ Y EES+LV+ G +
Sbjct: 420 NSLFLPTGAVFIQVNPFGKMEHIGEVDFGTPAVDMGLKYMAYSCGTEESTLVDTLGRDHP 479
Query: 350 VLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYL 385
+K+P++ + W+ + YL Q+VKLD+ RF L
Sbjct: 480 AVKDPESIHRSGWNKVAEFYLGKQDVKLDLQRFEPVL 516
>gi|242093064|ref|XP_002437022.1| hypothetical protein SORBIDRAFT_10g018270 [Sorghum bicolor]
gi|241915245|gb|EER88389.1| hypothetical protein SORBIDRAFT_10g018270 [Sorghum bicolor]
Length = 529
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 123/403 (30%), Positives = 203/403 (50%), Gaps = 28/403 (6%)
Query: 4 RPAALSPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSL 56
+PA ITCD + + +C + G PK + V + + P +
Sbjct: 131 KPAGAVTEAKITCDENGVDEGFPYARPPVCELAGDIRISPKEKAMYLVNPSGAGPFDSNG 190
Query: 57 RIKTQPYPRKSDKSAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEF 114
K +P+ R +D + V E+TI +SA C H P +VFS GY NFFH+
Sbjct: 191 EKKIRPFAR-NDGFLLPGVVEVTIKSVSSAAAAPQCTRRHDVPVVVFSVAGYTDNFFHDN 249
Query: 115 MDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQ 174
D +PLF+T +H +V I + W K+ LL + + + +IN + HCF+
Sbjct: 250 TDVLIPLFLT-TAHL-KGEVQFLITNFKPWWVNKFTPLLKKLSNYDVINFDEDKEVHCFR 307
Query: 175 SVTLGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTSSSFH---HTKPKLVL 229
+ LG+ ++I+P T P ++VD+ FL A++ + + KPK+++
Sbjct: 308 AGHLGMYRDRDLIISPHPTRNPHNYSMVDYNRFLRRAFSLPRDAPAVLGAETSAKPKMLI 367
Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHG 286
+ R R +LNLREV E LGF VT+ E + AD F +++ ++ VHG
Sbjct: 368 IERKGT--RKLLNLREVAAMCEALGFAVTVAE-----AGADVRGFAERVNAADVLLAVHG 420
Query: 287 AGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYG 345
AGLT+ +FL G+VL+Q+VP G W++T ++ +PAR + L Y+EY + +EE++L +KY
Sbjct: 421 AGLTNQIFLPTGAVLVQIVPWGKMDWMATNFYGQPARDMRLRYVEYYVSEEETTLKDKYP 480
Query: 346 ANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
+ V K+P W + + Q+V +++ RF+ +L A
Sbjct: 481 RDHYVFKDPMRIHAQGWPAIAEIIMKQDVMVNMTRFKPFLLKA 523
>gi|326532542|dbj|BAK05200.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 559
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 122/395 (30%), Positives = 204/395 (51%), Gaps = 23/395 (5%)
Query: 9 SPAKSITCD-RSHRSYDLCLI---NGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQPY 63
+PAK CD + + D+C + +G T +S S P+ + + PY
Sbjct: 167 APAKKPACDIQGPWASDVCDLAGGSGGVRIHGSAHTVLVPPTIESGGSNPNPQEWRVLPY 226
Query: 64 PRKSDKSAMSKVKELTITTSAPPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
RK MS +KE+T+ P + C VT PALVF+ GG GN++H+F D +PL
Sbjct: 227 SRKH----MSGIKEITVRELPTPADAPRCAVTSQVPALVFAMGGLTGNYWHDFSDVMIPL 282
Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLI 181
++ + D +V L + + +A KY ++ R +R+ +++++ CF S +G+
Sbjct: 283 YLQASCF--DGEVQLVVTNYQRWYAGKYRHIIARLSRYDVVDMDKDDQVRCFPSAVVGIR 340
Query: 182 SHGRMVINPTLLPKPKTLVDFQSFLANAYN------ENTNTSSSFHHTKPKLVLVNRNAR 235
H I+P P ++ +F FL N + S KP++++++R R
Sbjct: 341 MHKEFSIDPAKEPTGHSMPEFTKFLRNVFALPRAAPMRVTAGPSDGKKKPRMMIISR--R 398
Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R ++N+ EV A+ +GF+V I +P + + D R +++ ++GVHGAGLT+SLFL
Sbjct: 399 HPRKLVNVDEVVALAKRIGFEVVIGDPPFNVDVGDFAREVNAADVLMGVHGAGLTNSLFL 458
Query: 296 RPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
G+V +QV P G + + V F PA +GL+Y+ Y EES+LV+ G + +K+P
Sbjct: 459 PTGAVFIQVNPFGKMEHIGEVDFGTPAVDMGLKYMAYSCGTEESTLVDTLGRDHPAVKDP 518
Query: 355 QAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
++ + W+ + YL Q+VKLD+ RF L A
Sbjct: 519 ESIHRSGWNKVAEFYLGKQDVKLDLQRFEPVLLKA 553
>gi|326517042|dbj|BAJ96513.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 123/396 (31%), Positives = 203/396 (51%), Gaps = 33/396 (8%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ C+ RS D C ++G + T+ ++ S P + I PY R+ +
Sbjct: 107 VVCNMEGRS-DTCEVDGDVRTN-GTALSVTLVPAASWPERHEWMIT--PYSRR-----FA 157
Query: 74 KVKELTIT-----TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSH 128
V+++T+T +APP C VTH PA++F+ GGY N++H + D VPLF+ +
Sbjct: 158 SVRKVTVTQVQDRAAAPP---CTVTHGMPAVLFAVGGYAANYWHAYADILVPLFVASRRY 214
Query: 129 FPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVI 188
+ +++ +W +Y L +++ +++++ CF VT+GL + I
Sbjct: 215 HGEVTFLISNIQFRPRWPVQYRAFLQGLSKYELVDMDGDAEVRCFPRVTVGLRLDKELSI 274
Query: 189 NPTLLPKPK-TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
P L+P + T+ DF FL Y + KP+L+L++R R ILN EV
Sbjct: 275 VPELVPGGRLTMADFTGFLRETYALPRGAARE-PEKKPRLLLIHRGHY--RRILNEPEVA 331
Query: 248 KAAEELGFDVTIFE-------PEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSV 300
+AAEE GF+ + E PE A R ++S ++G+HGAGLT+++FL PG V
Sbjct: 332 RAAEEAGFEAVVTELRGGGDTPEAEVEQA---RVVNSFDVVLGLHGAGLTNAMFLPPGGV 388
Query: 301 LMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
L+QVVP G + ++ F +PA +GL YL+Y + EESSL+E G +K+P +
Sbjct: 389 LIQVVPYGNMEDIARAEFSEPATDMGLRYLDYSVGAEESSLMETLGPEHPAIKDPASVHR 448
Query: 360 ANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
+ W + +YL QNV++++ RF L A ++
Sbjct: 449 SGWDKVFELYLAKQNVRINVTRFAPTLAQALHHLRR 484
>gi|414876585|tpg|DAA53716.1| TPA: hypothetical protein ZEAMMB73_742502 [Zea mays]
Length = 341
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 112/342 (32%), Positives = 180/342 (52%), Gaps = 26/342 (7%)
Query: 72 MSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
M V +L +SAPP C VT+ PA++F+ GG GNF+H+F D VPLFI + D
Sbjct: 6 MVTVTQLEDKSSAPP---CTVTYNIPAVLFALGGLTGNFWHDFGDVLVPLFIASRRY--D 60
Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQT-----ITHCFQSVTLGLISHGRM 186
+V I++ W Y +L R +++ ++++ + CF VT+G+ H +
Sbjct: 61 GEVQFLISNMKPWWPAAYKTILQRLSKYDAVDLDGDGDGDAHVVRCFPHVTVGIHMHNGL 120
Query: 187 VINPTLLPKPK-----TLVDFQSFLANAYNENTNTSSSF-------HHTKPKLVLVNRNA 234
I P P P T+ DF F+ Y + +S P+L+L++R
Sbjct: 121 SIVPEWAPGPPGGRGLTMADFTRFMREVYALPRDAPASLVREEPGKQSPPPRLLLIHREH 180
Query: 235 RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
R +N RE+ +AAE GF+ + ++ R ++S ++GVHGAGLT+S+F
Sbjct: 181 --SRRFMNEREILQAAEAAGFEAVALDLRRDVTVDAQARVVNSFDVLLGVHGAGLTNSVF 238
Query: 295 LRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKN 353
L PG+VL+QVVP G ++T+ F PA+ +GL+YL+Y + EES+L+E G +K+
Sbjct: 239 LPPGAVLVQVVPYGKMDVIATLEFGLPAKEMGLKYLDYVVSAEESTLLEMLGPEHPAIKD 298
Query: 354 PQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
P + + W M YL Q+V++D+ RF L A+ ++
Sbjct: 299 PDSIHRSGWDKMTEFYLNMQDVRIDVARFAPVLTQAFDHLRQ 340
>gi|357157870|ref|XP_003577941.1| PREDICTED: uncharacterized protein LOC100833625 [Brachypodium
distachyon]
Length = 532
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 122/388 (31%), Positives = 199/388 (51%), Gaps = 33/388 (8%)
Query: 25 LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRI-------KTQPYPRKSDKSAMSKVKE 77
+C + G P+ T F V TPS K +PY RK D + V E
Sbjct: 160 VCELTGDIRISPREKTMFFV-----TPSAAGAAALDANGEKKIRPYARK-DTFLLPGVVE 213
Query: 78 LTITT--SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVI 135
+TI + SA +C H PA+VFS GY NFFH+ D +PLF+T +H +V
Sbjct: 214 VTIKSVPSAEAAPACTRQHDVPAVVFSVAGYTDNFFHDNTDVMIPLFLT-TAHL-RGEVQ 271
Query: 136 LAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP--TLL 193
L I + W +K+ LL + + + +IN + CF+ LGL ++++P T
Sbjct: 272 LLITNFKPWWVKKFTPLLKKLSNYEVINFDKDEEVRCFRQGNLGLYRDRDLILSPHPTRN 331
Query: 194 PKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNRNARVGRTILNLREVKKAA 250
P+ T+VD+ FL A+ + + +PK++++ R R +LNL V
Sbjct: 332 PRNYTMVDYNRFLRGAFGLPRDAPAVLGEKTSARPKMLMIERKGT--RKLLNLAAVVAMC 389
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHA---MVGVHGAGLTHSLFLRPGSVLMQVVPI 307
EELGF VT+ E + AD F + +A ++ VHGAGLT+ +FL G+V++Q+VP
Sbjct: 390 EELGFAVTVAE-----AGADVRGFAETVNAADVLLAVHGAGLTNQIFLPTGAVMVQIVPW 444
Query: 308 G-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR 366
G W++T ++ +PAR + L Y+EY + +EE++L +++ + V K+P A W +
Sbjct: 445 GKMDWMATNFYGQPARDMQLRYVEYYVSEEETTLKDRFPRDHYVFKDPMAIHAQGWPALA 504
Query: 367 VYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
+ Q+V +++ RF+ +L A K ++
Sbjct: 505 DIVMKQDVMVNVTRFKPFLLSALDKLQE 532
>gi|125563028|gb|EAZ08408.1| hypothetical protein OsI_30669 [Oryza sativa Indica Group]
Length = 410
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 118/386 (30%), Positives = 196/386 (50%), Gaps = 26/386 (6%)
Query: 9 SPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSV------GHTDSTPSQPSLRIKTQP 62
+P K I CD YD+C I+G A T V G D + + S+R +++
Sbjct: 25 APRKPI-CDLLDPRYDICEISGDARTMGTNRTILYVPPVGERGLADDS-HEWSIRDQSRK 82
Query: 63 YPRKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF 122
Y ++KV ++ A P C H PA+VF+ G N +H+F D +PLF
Sbjct: 83 Y-----LEYINKVTVRSLDAQAAP--GCTSRHAVPAVVFAMNGLTSNPWHDFSDVLIPLF 135
Query: 123 ITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS 182
IT + + +V ++D + KY +L +R+ I++ N + C+ +T+GL S
Sbjct: 136 ITTRVY--EGEVQFLVSDLQPWFVDKYRLILTNLSRYDIVDFNQDSGVRCYPKITVGLRS 193
Query: 183 HGRMVINPTLLPKPKTLVDFQSFLANAYN-----ENTNTSSSFHHTKPKLVLVNRNARVG 237
H + I+P P+ T++DF+ ++ Y+ + S +P+ +L+NR
Sbjct: 194 HRDLGIDPARTPRNYTMLDFRLYIREVYSLPPAGVDIPFKESSMQRRPRAMLINRGR--T 251
Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
R +N +E+ A GF+V EP S+ + R + SC ++G HGAGLT+ FLR
Sbjct: 252 RKFVNFQEIAAAVVAAGFEVVPVEPRRDLSIEEFSRVVDSCDVLMGAHGAGLTNFFFLRT 311
Query: 298 GSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQA 356
+V++QVVP G + S V++ PAR + L +EY I EES+L +KYG + +++P++
Sbjct: 312 NAVMLQVVPWGHMEHPSMVFYGGPAREMRLRDVEYSIAAEESTLYDKYGKDHPAIRDPES 371
Query: 357 FAGANWS-NMRVYLKTQNVKLDIDRF 381
W M+ Y Q++KL++ RF
Sbjct: 372 IHKQGWQFGMKYYWIEQDIKLNVTRF 397
>gi|212275672|ref|NP_001130824.1| uncharacterized protein LOC100191928 [Zea mays]
gi|194690210|gb|ACF79189.1| unknown [Zea mays]
gi|413947220|gb|AFW79869.1| hypothetical protein ZEAMMB73_561794 [Zea mays]
Length = 586
Score = 191 bits (484), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 118/389 (30%), Positives = 192/389 (49%), Gaps = 49/389 (12%)
Query: 16 CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDSTPSQPSLRIKTQPYPRKSDKSA 71
CD S+R D C ++G ++ V G + T S RIK PYPRK+D+
Sbjct: 234 CDVTSNRRIDWCELDGDVRVVGANASVTLVAPPGADNRTFRAESWRIK--PYPRKADR-- 289
Query: 72 MSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
GY GN+FH F D +PLF+T +
Sbjct: 290 ---------------------------------GYTGNYFHAFTDVILPLFLTARRYA-- 314
Query: 132 QDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT 191
+V L +AD W K+ + +++ +++++ CF+ V +GL SH I+P
Sbjct: 315 GEVRLLVADLQPWWVGKFLPVFRSISKYELVDLDRDPRVRCFRHVQVGLTSHADFSIDPR 374
Query: 192 LLPKPKTLVDFQSFLANAYN-ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
P +++DF F+ AY + ++ +P+L++V R AR R +N E+ + A
Sbjct: 375 RAPNGYSMLDFTRFMRAAYGLPRGDVVAAAPARRPRLLVVAR-ART-RRFVNTEEIVRGA 432
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
E +GF+ + E + +A + C A++GVHGAGLT+ +FL G V++QVVP+G
Sbjct: 433 EAVGFEAVV--SEGTHEVAPFAELANGCDAIMGVHGAGLTNMVFLPTGGVVIQVVPLGGL 490
Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYL 369
YF P+ +GL YLEY+I+ EES+LV++Y + + +P W +++ YL
Sbjct: 491 EFVAGYFRGPSVDMGLRYLEYRIEPEESTLVDQYPRDHPIFTDPNGIKSKGWESLKDAYL 550
Query: 370 KTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
Q+V+LD++RFR L++A +K K
Sbjct: 551 DKQDVRLDMERFRPTLQEAIAHLRKAKAK 579
>gi|115457236|ref|NP_001052218.1| Os04g0196600 [Oryza sativa Japonica Group]
gi|38344223|emb|CAE03693.2| OSJNBb0026E15.11 [Oryza sativa Japonica Group]
gi|113563789|dbj|BAF14132.1| Os04g0196600 [Oryza sativa Japonica Group]
gi|215768520|dbj|BAH00749.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 550
Score = 190 bits (483), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 122/384 (31%), Positives = 201/384 (52%), Gaps = 19/384 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKS-DKSAMSK 74
CD S YD+C+++G +V T +P K +PY RK D
Sbjct: 170 CDTSFGKYDICVLDGDTRAQGGGGAGAAV-VTLVSPRAAPREWKIKPYSRKYLDGLKPVT 228
Query: 75 VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
V+ + APP C PA+V GG GN++H+F D VPLFI +V
Sbjct: 229 VRSVPNPEDAPP---CTTRLNVPAMVIELGGLTGNYWHDFTDVLVPLFIGARRF--GGEV 283
Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIINI---NNQTITHCFQSVTLGLISHGRMVINPT 191
L + + W KY + + +RH I+++ +++ + C+ V +G S I+P+
Sbjct: 284 QLLVVNLLPFWVDKYRRIFSQISRHDIVDLEKDDDRGVVRCYPHVVVGYGSRKEFTIDPS 343
Query: 192 L--LPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
L T+V+F FL +Y+ + + H +P+++++ R R ++NL EV
Sbjct: 344 LDDTGGGYTMVNFTEFLRQSYSLPRDRPIKLGTNHGARPRMMILERTN--SRKLMNLPEV 401
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
AA GF+VT+ +++ + R ++S MVGVHGAGLT+ +FL G+VL+Q+VP
Sbjct: 402 AAAARAAGFEVTVAGGRPTSTYDEFAREVNSYDVMVGVHGAGLTNCVFLPTGAVLLQIVP 461
Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
G + ++ F +PAR +GL Y+EY I +ESSL++ +G + ++K+P A + W N+
Sbjct: 462 YGRLESIAQTDFGEPARDMGLRYIEYDIAADESSLMDVFGKDHPMIKDPVAVHLSGWGNV 521
Query: 366 -RVYLKTQNVKLDIDRFRIYLKDA 388
YL Q+V+++I+RFR +L A
Sbjct: 522 AEWYLGKQDVRVNIERFRPFLTQA 545
>gi|226498526|ref|NP_001142120.1| uncharacterized protein LOC100274284 [Zea mays]
gi|194707210|gb|ACF87689.1| unknown [Zea mays]
gi|413947221|gb|AFW79870.1| hypothetical protein ZEAMMB73_137872 [Zea mays]
Length = 476
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 131/411 (31%), Positives = 204/411 (49%), Gaps = 35/411 (8%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
+ CD S D+C + G P + + S R+K P+ RK+D+ A++
Sbjct: 76 VLCDFSSSRSDVCELRGDVRVLPNATIVLH----HPLARRQSWRMK--PHGRKNDRHALA 129
Query: 74 KVKELTITT---SAPPNLS----------CGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
+V E+T+T SA P+ + C HT+PA+VFS GGY GN FH+ D VP
Sbjct: 130 RVTEVTVTVATASASPHHTSGAAAAAAPRCTANHTAPAVVFSVGGYAGNMFHDLTDVLVP 189
Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITH---CFQSVT 177
LFIT DV L + D W K+ LL +RH +++++ + + C+ V
Sbjct: 190 LFITARRF--GGDVHLLVGDAQPWWLDKFRPLLGGLSRHAVVDMSRGSSSGGVLCYPHVI 247
Query: 178 LGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTS-----SSFHHTKPKLVLV 230
LGL H M ++ T ++ DF +Y + + +P+L+L+
Sbjct: 248 LGLEFHKEMSVDAARTAGGGEYSMADFTLLARRSYGLPRDKAIRVHGRGGGGVRPRLLLI 307
Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
+R + R N + +AA LG++V + EP LA R ++SC +VGVHGAGL
Sbjct: 308 SRKST--RAFTNAGSIARAAASLGYEVVVGEPARHADLASFARVVNSCDVLVGVHGAGLA 365
Query: 291 HSLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDL 349
+ +FL G+V++QVVP+G ++ F PAR GL Y+ Y I + ES+L +Y +
Sbjct: 366 NLVFLPAGAVVVQVVPLGGLDAMAADDFGAPARDAGLRYVHYGIAEAESTLATRYPRDHR 425
Query: 350 VLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
VL++P A W +R YL QNV +D+ RF L+ A + ++ D E
Sbjct: 426 VLRDPAAVRSEGWMALRAAYLVGQNVTIDVRRFSGALRRAMELLRQTHDHE 476
>gi|226503944|ref|NP_001140978.1| uncharacterized protein LOC100273057 [Zea mays]
gi|194702014|gb|ACF85091.1| unknown [Zea mays]
gi|413953934|gb|AFW86583.1| hypothetical protein ZEAMMB73_356444 [Zea mays]
Length = 567
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 127/404 (31%), Positives = 200/404 (49%), Gaps = 35/404 (8%)
Query: 3 LRPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFF---SVGHTDSTPSQPSLRIK 59
+ PAA +P K I CD S YD+C I+G A T VG S + S+R +
Sbjct: 161 MEPAATAPRKPI-CDLSDPRYDICEISGDARTMGANRTVLYVPPVGELGSDGQEWSIRDQ 219
Query: 60 TQPYPRKSDKSAMSKVKELTITTS-APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCF 118
++ + +K VK L + S A P C H ALVF+ G N +H+F D
Sbjct: 220 SRKHLEFVNKVT---VKSLNASQSLAAP--ECTSRHAVRALVFAMNGLTSNPWHDFSDVL 274
Query: 119 VPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTL 178
+PLFIT + + +V ++D + KY +L +R+ II+ N C+ VT+
Sbjct: 275 IPLFITTRAL--EGEVQFLVSDLQPWFVDKYRLVLKNLSRYDIIDFNQDGGVRCYPHVTV 332
Query: 179 GLISHGRMVINPTLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHHT-------------- 223
GL SH + I+P P+ T++DF+ ++ Y + S +
Sbjct: 333 GLRSHRDLGIDPARAPRNYTMLDFRLYIREIYRLPSAGVSIPYKEANSNAAAAAPGAPAE 392
Query: 224 ----KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCH 279
KP+L+L+NR R +N E+ A ++ GF+V EP S+ D R + SC
Sbjct: 393 QLQRKPRLMLINRGRT--RKFVNFPEIVGAVQKAGFEVIPIEPRRDLSVEDFARTVDSCD 450
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEES 338
++G HGAGLT+ FLR +V++QVVP G + S ++ PA+ + L +EY I ES
Sbjct: 451 VLMGAHGAGLTNFFFLRTNAVMLQVVPWGHMEHPSMAFYGGPAKEMRLRDVEYSIADVES 510
Query: 339 SLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRF 381
+L +KYG ++ V+ +P++ W MR Y Q+++L++ RF
Sbjct: 511 TLYDKYGKDNPVVSDPESIHKQGWQLGMRYYWIEQDIRLNVTRF 554
>gi|413953933|gb|AFW86582.1| glycosyltransferase [Zea mays]
Length = 576
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 129/406 (31%), Positives = 202/406 (49%), Gaps = 39/406 (9%)
Query: 3 LRPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFF---SVGHTDSTPSQPSLRIK 59
+ PAA +P K I CD S YD+C I+G A T VG S + S+R +
Sbjct: 170 MEPAATAPRKPI-CDLSDPRYDICEISGDARTMGANRTVLYVPPVGELGSDGQEWSIRDQ 228
Query: 60 TQPYPRKSDKSAMSKVKELTITTS-APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCF 118
++ + +K VK L + S A P C H ALVF+ G N +H+F D
Sbjct: 229 SRKHLEFVNKVT---VKSLNASQSLAAP--ECTSRHAVRALVFAMNGLTSNPWHDFSDVL 283
Query: 119 VPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTL 178
+PLFIT + + +V ++D + KY +L +R+ II+ N C+ VT+
Sbjct: 284 IPLFITTRAL--EGEVQFLVSDLQPWFVDKYRLVLKNLSRYDIIDFNQDGGVRCYPHVTV 341
Query: 179 GLISHGRMVINPTLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHHT-------------- 223
GL SH + I+P P+ T++DF+ ++ Y + S +
Sbjct: 342 GLRSHRDLGIDPARAPRNYTMLDFRLYIREIYRLPSAGVSIPYKEANSNAAAAAPGAPAE 401
Query: 224 ----KPKLVLVNRNARVGRT--ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHS 277
KP+L+L+NR GRT +N E+ A ++ GF+V EP S+ D R + S
Sbjct: 402 QLQRKPRLMLINR----GRTRKFVNFPEIVGAVQKAGFEVIPIEPRRDLSVEDFARTVDS 457
Query: 278 CHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQE 336
C ++G HGAGLT+ FLR +V++QVVP G + S ++ PA+ + L +EY I
Sbjct: 458 CDVLMGAHGAGLTNFFFLRTNAVMLQVVPWGHMEHPSMAFYGGPAKEMRLRDVEYSIADV 517
Query: 337 ESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRF 381
ES+L +KYG ++ V+ +P++ W MR Y Q+++L++ RF
Sbjct: 518 ESTLYDKYGKDNPVVSDPESIHKQGWQLGMRYYWIEQDIRLNVTRF 563
>gi|218194401|gb|EEC76828.1| hypothetical protein OsI_14971 [Oryza sativa Indica Group]
Length = 548
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 121/384 (31%), Positives = 200/384 (52%), Gaps = 19/384 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKS-DKSAMSK 74
CD S YD+C+++G + T +P K +PY RK D
Sbjct: 168 CDTSFGKYDICVLDGDTRAQGGGGAGAAA-VTLVSPRAAPREWKIKPYSRKYLDGLKPVT 226
Query: 75 VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
V+ + APP C PA+V GG GN++H+F D VPLFI +V
Sbjct: 227 VRSVPNPEDAPP---CTTRLNVPAMVIELGGLTGNYWHDFTDVLVPLFIGARRF--GGEV 281
Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIINI---NNQTITHCFQSVTLGLISHGRMVINPT 191
L + + W KY + + +RH I+++ +++ + C+ V +G S I+P+
Sbjct: 282 QLLVVNLLPFWVDKYRRIFSQISRHDIVDLEKDDDRGVVRCYPHVVVGYGSRKEFTIDPS 341
Query: 192 L--LPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
L T+V+F FL +Y+ + + H +P+++++ R R ++NL EV
Sbjct: 342 LDDTGGGYTMVNFTEFLRQSYSLPRDRPIKLGTNHGARPRMMILERTN--SRKLMNLPEV 399
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
AA GF+VT+ +++ + R ++S MVGVHGAGLT+ +FL G+VL+Q+VP
Sbjct: 400 AAAARAAGFEVTVAGGRPTSTYDEFAREVNSYDVMVGVHGAGLTNCVFLPTGAVLLQIVP 459
Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
G + ++ F +PAR +GL Y+EY I +ESSL++ +G + ++K+P A + W N+
Sbjct: 460 YGRLESIAQTDFGEPARDMGLRYIEYDIAADESSLMDVFGKDHPMIKDPVAVHLSGWGNV 519
Query: 366 -RVYLKTQNVKLDIDRFRIYLKDA 388
YL Q+V+++I+RFR +L A
Sbjct: 520 AEWYLGKQDVRVNIERFRPFLTQA 543
>gi|116317832|emb|CAH65867.1| OSIGBa0132I10.3 [Oryza sativa Indica Group]
Length = 550
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 121/384 (31%), Positives = 200/384 (52%), Gaps = 19/384 (4%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKS-DKSAMSK 74
CD S YD+C+++G + T +P K +PY RK D
Sbjct: 170 CDTSFGKYDICVLDGDTRAQGGGGAGAAA-VTLVSPRAAPREWKIKPYSRKYLDGLKPVT 228
Query: 75 VKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDV 134
V+ + APP C PA+V GG GN++H+F D VPLFI +V
Sbjct: 229 VRSVPNPEDAPP---CTTRLNVPAMVIELGGLTGNYWHDFTDVLVPLFIGARRF--GGEV 283
Query: 135 ILAIADCNDQWARKYAELLPRFTRHPIINI---NNQTITHCFQSVTLGLISHGRMVINPT 191
L + + W KY + + +RH I+++ +++ + C+ V +G S I+P+
Sbjct: 284 QLLVVNLLPFWVDKYRRIFSQISRHDIVDLEKDDDRGVVRCYPHVVVGYGSRKEFTIDPS 343
Query: 192 L--LPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
L T+V+F FL +Y+ + + H +P+++++ R R ++NL EV
Sbjct: 344 LDDTGGGYTMVNFTEFLRQSYSLPRDRPIKLGTNHGARPRMMILERTN--SRKLMNLPEV 401
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
AA GF+VT+ +++ + R ++S MVGVHGAGLT+ +FL G+VL+Q+VP
Sbjct: 402 AAAARAAGFEVTVAGGRPTSTYDEFAREVNSYDVMVGVHGAGLTNCVFLPTGAVLLQIVP 461
Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
G + ++ F +PAR +GL Y+EY I +ESSL++ +G + ++K+P A + W N+
Sbjct: 462 YGRLESIAQTDFGEPARDMGLRYIEYDIAADESSLMDVFGKDHPMIKDPVAVHLSGWGNV 521
Query: 366 -RVYLKTQNVKLDIDRFRIYLKDA 388
YL Q+V+++I+RFR +L A
Sbjct: 522 AEWYLGKQDVRVNIERFRPFLTQA 545
>gi|242052297|ref|XP_002455294.1| hypothetical protein SORBIDRAFT_03g007950 [Sorghum bicolor]
gi|241927269|gb|EES00414.1| hypothetical protein SORBIDRAFT_03g007950 [Sorghum bicolor]
Length = 469
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 126/384 (32%), Positives = 199/384 (51%), Gaps = 20/384 (5%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTS-TFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
+ C+ RS D C ++G D +T+ T SV ++ S+ S +PY R+
Sbjct: 90 VVCNMEGRS-DTCEVDG----DVRTNGTALSVTLVPASRSEHS-EWMIRPYSRRFASLRK 143
Query: 73 SKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
V +L +A P C TH PA++F+ GGY GN++H++ D VPLF+ + +
Sbjct: 144 VTVTQLQDRAAAAP---CTATHDMPAVLFAIGGYAGNYWHDYADILVPLFVASRRYNGEV 200
Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTL 192
+++ A QW KY L +RH +++++ CF VT+GL I P L
Sbjct: 201 KFLISNAQFQPQWLVKYRAFLRGLSRHDVVDMDGDAEVRCFPHVTVGLRLDKEFSIVPEL 260
Query: 193 LPKPK--TLVDFQSFLANAYN--ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKK 248
+P + T+ DF FL Y +S KP+L+L++R R I N EV +
Sbjct: 261 VPGDRRLTMADFTRFLRETYALPRGAVAASRSRGQKPRLLLIHRGHY--RRITNEAEVAR 318
Query: 249 AAEELGFDVTIFE--PEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
AAE GF+ + E A+ R +++ ++G+HGAGLT+++FL PG VL+QVVP
Sbjct: 319 AAEAAGFEAVVAELGGGGGGDEAEQARVVNAFDVVLGMHGAGLTNAVFLPPGGVLIQVVP 378
Query: 307 IG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
G + ++ F +PA +GLEYL+Y + EESSL+E G +K+P + + W +
Sbjct: 379 YGKMEHIARAEFGEPAADMGLEYLDYSVSAEESSLMETLGPEHPAVKDPDSVHRSGWGQV 438
Query: 366 -RVYLKTQNVKLDIDRFRIYLKDA 388
+YL QNV++++ RF L A
Sbjct: 439 FELYLAKQNVRVNVTRFAPTLAQA 462
>gi|326503016|dbj|BAJ99133.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 574
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 119/408 (29%), Positives = 201/408 (49%), Gaps = 33/408 (8%)
Query: 9 SPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSV---GHTDSTPSQPSLRIKTQPYPR 65
+P K + CD S YD+C I+G A T V G + + +++ +++ Y
Sbjct: 178 APPKPV-CDLSDPRYDICEISGDARAIGGNRTVLYVPPAGERRADGQEWAIKDQSRKYLE 236
Query: 66 KSDKSAMSKVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
DK VK L+ S AP C H PA+VF+ G N +H+F D VPLFI
Sbjct: 237 YIDKVT---VKTLSAAQSLVAP---ECTSRHAVPAVVFAMNGLTSNPWHDFSDVIVPLFI 290
Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
T ++ D +V + D + KY +L +R+ I++ N C + +GL SH
Sbjct: 291 TARAY--DGEVQFLVTDLQPWFVDKYRLILANLSRYDIVDFNKDAGVRCHPRIVVGLRSH 348
Query: 184 GRMVINPTLLPKPKTLVDFQSFLANAYN---------------ENTNTSSSFHHTKPKLV 228
+ I+P P+ TL+DF+ ++ + ++ N N ++ KP+L+
Sbjct: 349 RDLGIDPARTPRNYTLLDFRMYIRDIFSLPPDGLGIPYKQKQEANRNATAGTEKRKPRLM 408
Query: 229 LVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAG 288
L+NR R +N+ E+ A E GF+V + EP L + + + SC ++G HGAG
Sbjct: 409 LINRGR--NRKFVNIPEISAAVEAAGFEVVVVEPRRDLRLEEFSKAVDSCDVLMGAHGAG 466
Query: 289 LTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
LT+ FLR + ++QVVP G + + +++ A+ + L +EY I EES+L +KYG +
Sbjct: 467 LTNFFFLRTNATMLQVVPWGHMEHSAMIFYGVQAKEMRLRDVEYSITAEESTLYDKYGKD 526
Query: 348 DLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
+ +P++ W M+ Y Q+++L++ RF L + A++
Sbjct: 527 HPAVSDPESIHKQGWQLGMKYYWLEQDIRLNVTRFAPTLHRVLRTAEE 574
>gi|115468072|ref|NP_001057635.1| Os06g0475400 [Oryza sativa Japonica Group]
gi|51090816|dbj|BAD35293.1| putative HGA1 [Oryza sativa Japonica Group]
gi|113595675|dbj|BAF19549.1| Os06g0475400 [Oryza sativa Japonica Group]
gi|215734833|dbj|BAG95555.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222635589|gb|EEE65721.1| hypothetical protein OsJ_21358 [Oryza sativa Japonica Group]
Length = 534
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 119/388 (30%), Positives = 197/388 (50%), Gaps = 30/388 (7%)
Query: 9 SPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSV------GHTDSTPSQPSLRIKTQP 62
+P K I CD YD+C I+G A T V G D + + S+R +++
Sbjct: 149 APRKPI-CDLLDPRYDICEISGDARTMGTNRTILYVPPVGERGLADDS-HEWSIRDQSRK 206
Query: 63 YPRKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF 122
Y ++KV ++ A P C H PA+VF+ G N +H+F D +PLF
Sbjct: 207 Y-----LEYINKVTVRSLDAQAAPG--CTSRHAVPAVVFAMNGLTSNPWHDFSDVLIPLF 259
Query: 123 ITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLIS 182
IT + + +V ++D + KY +L +R+ I++ N + C+ +T+GL S
Sbjct: 260 ITTRVY--EGEVQFLVSDLQPWFVDKYRLILTNLSRYDIVDFNQDSDVRCYPKITVGLRS 317
Query: 183 HGRMVINPTLLPKPKTLVDFQSFLANAYN-----ENTNTSSSFHHTKPKLVLVNRNARVG 237
H + I+P + T++DF+ ++ Y+ + S +P+ +L+NR G
Sbjct: 318 HRDLGIDPARTQRNYTMLDFRLYIREVYSLPPAGVDIPFKESSMQRRPRAMLINR----G 373
Query: 238 RT--ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
RT +N +E+ A GF+V EP S+ + R + SC ++G HGAGLT+ FL
Sbjct: 374 RTRKFVNFQEIAAAVVAAGFEVVPVEPRRDLSIEEFSRVVDSCDVLMGAHGAGLTNFFFL 433
Query: 296 RPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
R +V++QVVP G + S V++ PAR + L +EY I EES+L +KYG + +++P
Sbjct: 434 RTNAVMLQVVPWGHMEHPSMVFYGGPAREMRLRDVEYSIAAEESTLYDKYGKDHPAIRDP 493
Query: 355 QAFAGANWS-NMRVYLKTQNVKLDIDRF 381
++ W M+ Y Q++KL++ RF
Sbjct: 494 ESIHKQGWQFGMKYYWIEQDIKLNVTRF 521
>gi|219884553|gb|ACL52651.1| unknown [Zea mays]
Length = 531
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 112/394 (28%), Positives = 197/394 (50%), Gaps = 19/394 (4%)
Query: 9 SPAKSITCD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKS 67
+PAK CD + + D+C I+G T +S + + +PY RK
Sbjct: 141 APAKKPACDIQGPWASDVCDIDGDVRIHGSAGTVLIAPSIESGGNPNPQEWRVRPYSRKH 200
Query: 68 DKSAMS-KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN 126
V+EL + AP +C V PA+V + GG GN++H+F D +PL++
Sbjct: 201 QGGIKEVTVRELASSADAP---ACDVRSPVPAMVLAMGGLTGNYWHDFSDIMIPLYLQA- 256
Query: 127 SHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRM 186
+ F + +V L + + + KY +L R +RH I++++ CF +G+ H
Sbjct: 257 ARF-EGEVQLVVENIQPWYVGKYRAILRRLSRHDIVDMDRDDRVRCFPGAVVGIRMHKEF 315
Query: 187 VINPTLLPKPKTLVDFQSFLANAYNENTNTSSSF--------HHTKPKLVLVNRNARVGR 238
I+P P ++ +F +FL Y+ + +P++++++R R R
Sbjct: 316 SIDPAREPLGHSMPEFTAFLRETYSLPRAAPARLAGADGEEDERARPRMMVISR--RHPR 373
Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
++NL V A +GF+V I +P + + + + +++ +VGVHGAGLT+SLFL G
Sbjct: 374 KLVNLDAVVALARRVGFEVVIGDPPFNVDVGEFAKEVNAVDVLVGVHGAGLTNSLFLPTG 433
Query: 299 SVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
+V +Q+ P G + + V F PA +GL+Y+ Y EE++LV+ G + +K+P++
Sbjct: 434 AVFIQINPYGKMEHIGEVDFGIPAVDMGLKYIAYSAGVEETTLVDTLGRDHPAVKDPESI 493
Query: 358 AGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYK 390
+ W + YL Q+V+LD+ RF L+ A +
Sbjct: 494 HRSGWGKVAEYYLGKQDVRLDLQRFEPVLRKAMQ 527
>gi|226529433|ref|NP_001140834.1| hypothetical protein [Zea mays]
gi|194690360|gb|ACF79264.1| unknown [Zea mays]
gi|194701354|gb|ACF84761.1| unknown [Zea mays]
gi|224031103|gb|ACN34627.1| unknown [Zea mays]
gi|414876590|tpg|DAA53721.1| TPA: hypothetical protein ZEAMMB73_455614 [Zea mays]
Length = 531
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 112/394 (28%), Positives = 197/394 (50%), Gaps = 19/394 (4%)
Query: 9 SPAKSITCD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKS 67
+PAK CD + + D+C I+G T +S + + +PY RK
Sbjct: 141 APAKKPACDIQGPWASDVCDIDGDVRIHGSAGTVLIAPSIESGGNPNPQEWRVRPYSRKH 200
Query: 68 DKSAMS-KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN 126
V+EL + AP +C V PA+V + GG GN++H+F D +PL++
Sbjct: 201 QGGIKEVTVRELASSADAP---ACDVRSPVPAMVLAMGGLTGNYWHDFSDIMIPLYLQA- 256
Query: 127 SHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRM 186
+ F + +V L + + + KY +L R +RH I++++ CF +G+ H
Sbjct: 257 ARF-EGEVQLVVENIQPWYVGKYRAILRRLSRHDIVDMDRDDRVRCFPGAVVGIRMHKEF 315
Query: 187 VINPTLLPKPKTLVDFQSFLANAYNENTNTSSSF--------HHTKPKLVLVNRNARVGR 238
I+P P ++ +F +FL Y+ + +P++++++R R R
Sbjct: 316 SIDPAREPLGHSMPEFTAFLRETYSLPRAAPARLAGADGEEDERARPRMMVISR--RHPR 373
Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
++NL V A +GF+V I +P + + + + +++ +VGVHGAGLT+SLFL G
Sbjct: 374 KLVNLDAVVALARRVGFEVVIGDPPFNVDVGEFAKEVNAVDVLVGVHGAGLTNSLFLPTG 433
Query: 299 SVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
+V +Q+ P G + + V F PA +GL+Y+ Y EE++LV+ G + +K+P++
Sbjct: 434 AVFIQINPYGKMEHIGEVDFGIPAVDMGLKYIAYSAGVEETTLVDTLGRDHPAVKDPESI 493
Query: 358 AGANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYK 390
+ W + YL Q+V+LD+ RF L+ A +
Sbjct: 494 HRSGWGKVAEYYLGKQDVRLDLQRFEPVLRKAMQ 527
>gi|414876582|tpg|DAA53713.1| TPA: hypothetical protein ZEAMMB73_516543 [Zea mays]
Length = 465
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 128/397 (32%), Positives = 202/397 (50%), Gaps = 15/397 (3%)
Query: 5 PAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYP 64
P+ A + C R D C ++G + K +SV + K +PY
Sbjct: 77 PSKTEKANKVVCSTEERLSDYCEVDGDVRVNGKA---WSVDIVPPSGWSERREWKIRPYS 133
Query: 65 RKSDKSAMS-KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
R+S + V +L +APP C VTH P +VF+ GGY+GN FH+ D +PLF+
Sbjct: 134 RRSAANVDRLNVTQLQDPAAAPP---CTVTHHVPGVVFALGGYSGNAFHDHADVLLPLFL 190
Query: 124 TINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISH 183
+ + + L I W KY L R +R+ ++N++ CF +T+GL H
Sbjct: 191 A-SLRYGREVQFLVINRVQPWWLGKYRLALRRLSRYDVVNLDGDAHVRCFPHLTVGLRLH 249
Query: 184 GRMVINPTLLPKPK--TLVDFQSFLANAYNENTNTSSSFHHTK-PKLVLVNRNARVGRTI 240
+ P +P + ++ DF FL AY S K P+L+L+ R R
Sbjct: 250 KDFGVVPEWVPGKRRVSMPDFTRFLREAYALPRGAPVSREPGKRPRLMLIQRQR--SRRF 307
Query: 241 LNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSV 300
LN E+ +AAE GF+V + E ++ + R ++S MVG+HGAG+T+ +FL PG V
Sbjct: 308 LNGEEMARAAEAAGFEVVVTELMLDAAVDEQARVVNSFDVMVGIHGAGMTNEVFLPPGGV 367
Query: 301 LMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
L+QVVP G ++ + + +PA +GL+YL Y + EESSL+E G + +K+P +
Sbjct: 368 LIQVVPWGKLDLMARIEYGEPATDMGLKYLCYNVTLEESSLLELLGRDHPAIKDPDSVHR 427
Query: 360 ANWSNM-RVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
+ W+ M +Y+ Q+V+LDI RF + L +A ++L
Sbjct: 428 SGWAAMYDIYMTKQDVRLDITRFALTLAEAMDHLRRL 464
>gi|125539258|gb|EAY85653.1| hypothetical protein OsI_07023 [Oryza sativa Indica Group]
Length = 481
Score = 184 bits (467), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 125/393 (31%), Positives = 199/393 (50%), Gaps = 51/393 (12%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
C S R D C G ++ T +P + ++K PY RK D A+S V
Sbjct: 122 CYESSRRSDTCEATGDVRVHGRSQTIHI------SPLEQEWKVK--PYCRKHDAFALSHV 173
Query: 76 KE-----LTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP 130
KE L+ + P C V ++ A V S GG+ GN FH++ D +P FIT + HF
Sbjct: 174 KEWALRPLSTADTYPAVPHCTVNSSATAFVLSTGGFTGNPFHDYTDVLIPAFITAH-HFA 232
Query: 131 DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINP 190
+ V ++ W +Y ++ R +++ +++I+N +G
Sbjct: 233 GE-VQFLVSSFKSWWTNRYMQIFQRLSKYELVDIDND------DEDAVGY---------- 275
Query: 191 TLLPKPKTLVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLRE 245
++VDF++ L A E S +P+L++++R R R LN R
Sbjct: 276 -------SMVDFRTMLRGALGLERAVAEPSGDRWDIRRRPRLLIISR--RNSRAFLNERA 326
Query: 246 VKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
+ A LGFDV + EP+ ST ++ R ++S MVGVHGAGLT+ +FL G+VL+QVV
Sbjct: 327 MADMAMSLGFDVRLGEPDISTDVSKFARLVNSADVMVGVHGAGLTNMVFLPAGAVLIQVV 386
Query: 306 PIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN 364
P G +WL+ F++PA + + YLEY I+ +E++L E+Y +D VLK+P + W+
Sbjct: 387 PYGGLEWLARGTFKEPAADMQIHYLEYMIQLDETTLSEQYPKDDPVLKDPYSIHKQGWNA 446
Query: 365 MR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLM 396
++ VYL QNV+ + R LK+ + +A KL+
Sbjct: 447 LKMVYLDKQNVRPHLGR----LKNTFMEALKLL 475
>gi|363543453|ref|NP_001241737.1| glycosyltransferase [Zea mays]
gi|195620094|gb|ACG31877.1| glycosyltransferase [Zea mays]
Length = 577
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 127/407 (31%), Positives = 198/407 (48%), Gaps = 41/407 (10%)
Query: 3 LRPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFF---SVGHTDSTPSQPSLRIK 59
+ PAA +P K I CD S YD+C I+G A T VG S + S+R +
Sbjct: 171 MEPAATAPRKPI-CDLSDPRYDICEISGDARTMGANRTVLYVPPVGELGSDGQEWSIRDQ 229
Query: 60 TQPYPRKSDKSAMSKVKELTITTS--APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDC 117
++ + +K VK L + S AP C H ALVF+ G N +H+F D
Sbjct: 230 SRKHLEFVNKVT---VKSLNASQSLTAP---ECTSRHAVRALVFAMNGLTSNPWHDFSDV 283
Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
+PLFIT + + +V ++D + KY +L +R+ II+ N C+ VT
Sbjct: 284 LIPLFITTRAL--EGEVQFLVSDLQPWFVDKYRLVLKNLSRYNIIDFNQDGGVRCYPHVT 341
Query: 178 LGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN-------------------ENTNTSS 218
+GL SH + I+P + T++DF+ ++ Y +
Sbjct: 342 VGLRSHRDLGIDPARTARNYTMLDFRLYIREIYRLPPAGVSIPYKEANSNAAAAAPGAPA 401
Query: 219 SFHHTKPKLVLVNRNARVGRT--ILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIH 276
KP+L+L+NR GRT +N E+ A + GF+V EP S+ D R +
Sbjct: 402 EQLQRKPRLMLINR----GRTRKFVNFPEIVGAVQNAGFEVIPIEPRRDLSVEDFARTVD 457
Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQ 335
SC ++G HGAGLT+ FLR +V++QVVP G + S ++ PA+ + L +EY I
Sbjct: 458 SCDVLMGAHGAGLTNFFFLRTNAVMLQVVPWGHMEHPSMAFYGGPAKEMRLRDVEYSIAD 517
Query: 336 EESSLVEKYGANDLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRF 381
ES+L +KYG ++ V+ +P++ W MR Y Q+++L++ RF
Sbjct: 518 VESTLYDKYGKDNPVVSDPESIHKQGWQLGMRYYWIEQDIRLNVTRF 564
>gi|357131990|ref|XP_003567616.1| PREDICTED: uncharacterized glycosyltransferase AGO61-like
[Brachypodium distachyon]
Length = 518
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 119/395 (30%), Positives = 201/395 (50%), Gaps = 18/395 (4%)
Query: 5 PAALSPAKSITCD-RSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLR-IKTQP 62
P +PAK CD + + D+C I+ T +S + P+ + +
Sbjct: 125 PDESAPAKKPACDIQGPWASDVCAIDAGVRIQGSARTVLITPPIESGGANPNPQSWQIVA 184
Query: 63 YPRKSDKSAMS-KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPL 121
Y RK + V+EL AP +C VT PA+VF+ GG GN++H+F D +PL
Sbjct: 185 YSRKHQAGMIPITVRELATAAEAP---ACDVTSEVPAMVFAMGGLTGNYWHDFSDVMIPL 241
Query: 122 FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF-QSVTLGL 180
++ S F +V L + + +A KY ++L + +++ II+++N C+ + +G+
Sbjct: 242 YLQ-ASKF-QGEVQLVVTNLQPWYAGKYRQILGKLSKYQIIDMDNDKQVRCYPRGSVVGI 299
Query: 181 ISHGRMVINPTLLPKPKTLVDFQSFLANAYN-----ENTNTSSSFHHTKPKLVLVNRNAR 235
H I+P P ++ +F +FL + ++ + KP++++++R R
Sbjct: 300 RMHKEFSIDPEKAPTGHSMPEFTAFLRDVFSLPRAKPTPPAAIVSGEKKPRMMIISR--R 357
Query: 236 VGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R ++N+ VK AE +GF+V I +P S + +++ ++GVHGAGLT+SLFL
Sbjct: 358 HPRALVNVAAVKAMAERVGFEVVIGDPPFSQDVGAFAAEVNTADVLLGVHGAGLTNSLFL 417
Query: 296 RPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNP 354
G+V +Q+VP G + ++ F PA +GL Y+ Y EESSLVE G + + +P
Sbjct: 418 PTGAVFIQIVPYGKMEHIAETDFGIPAFDMGLHYVAYSAGVEESSLVETLGRGHVAVADP 477
Query: 355 QAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
+A + W + YL Q+VKLD+ RF L A
Sbjct: 478 EAVHRSGWDKVAEYYLGRQDVKLDLARFEPVLLKA 512
>gi|218187404|gb|EEC69831.1| hypothetical protein OsI_00153 [Oryza sativa Indica Group]
Length = 534
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 115/338 (34%), Positives = 177/338 (52%), Gaps = 26/338 (7%)
Query: 7 ALSPAKSITCDRSHRSYD-------LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIK 59
A P I+CD + +C ++G P TS SV T + +
Sbjct: 163 APKPKSKISCDDKSKDEGFPYARPIVCHLSGDVRVSPATS---SVTLTMPLQQGEAAARR 219
Query: 60 TQPYPRKSDKSAMSKVKELTITTSAPPN--LSCGVTHTSPALVFSAGGYNGNFFHEFMDC 117
+PY R+ D + V+E+ IT++A SC V+H PA++FS GGY GNFFH+ D
Sbjct: 220 IRPYARRDD-FLLPLVREVAITSAASEGDAPSCNVSHGVPAVIFSIGGYTGNFFHDMADV 278
Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
VPL++T HF + V L +A+ W +KY +L R + +++ ++ HCF V
Sbjct: 279 LVPLYLT-TFHFKGK-VQLFVANYKQWWIQKYKPVLRRLSHRAVVDFDSDGDVHCFDHVI 336
Query: 178 LGLISHGRMVIN--PTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNR 232
+GL+ +++ PT PK T+VDF FL +AY + T KP++++++R
Sbjct: 337 VGLVRDRDLILGQHPTRNPKGYTMVDFTRFLRHAYGLRRDKPMVLGETSGKKPRMLIISR 396
Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGL 289
R R +LNLR+V A ELGF+V + E RF ++SC +VGVHGAGL
Sbjct: 397 --RRTRKLLNLRQVAAMARELGFEVVVSEAGVGGGSGGVKRFASAVNSCDVLVGVHGAGL 454
Query: 290 THSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGL 326
T+ FL G V++Q+VP G +W++T ++ PA + L
Sbjct: 455 TNQAFLPRGGVVVQIVPWGRMEWMATNFYGAPAAAMEL 492
>gi|222617641|gb|EEE53773.1| hypothetical protein OsJ_00163 [Oryza sativa Japonica Group]
Length = 514
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 125/392 (31%), Positives = 187/392 (47%), Gaps = 68/392 (17%)
Query: 10 PAKSITCDRSHRSYD-------LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQP 62
P I+CD + +C ++G P TS SV T + + +P
Sbjct: 166 PKSKISCDDKSKDEGFPYARPIVCHLSGDVRVSPATS---SVTLTMPLQQGEAAARRIRP 222
Query: 63 YPRKSDKSAMSKVKELTITTSAPPN--LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVP 120
Y R+ D + V+E+ IT++A SC V+H PA++FS GGY GNFFH+ D VP
Sbjct: 223 YARRDD-FLLPLVREVAITSAASEGDAPSCNVSHGVPAVIFSIGGYTGNFFHDMADVLVP 281
Query: 121 LFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGL 180
L++T HF + V L +A+ Y + + R T
Sbjct: 282 LYLT-TFHFKGK-VQLFVAN--------YKQCIRRGT----------------------- 308
Query: 181 ISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVG 237
P+ +VDF FL +AY + T KP++++++R R
Sbjct: 309 -------------PRGTPMVDFTRFLRHAYGLRRDKPMVLGETSGKKPRMLIISR--RRT 353
Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHSLF 294
R +LNLR+V A ELGF+V + E RF ++SC +VGVHGAGLT+ F
Sbjct: 354 RKLLNLRQVAAMARELGFEVVVSEAGVGGGSGGVKRFASAVNSCDVLVGVHGAGLTNQAF 413
Query: 295 LRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKN 353
L G V++Q+VP G +W++T ++ PA + L Y+EY + EESSL +Y V ++
Sbjct: 414 LPRGGVVVQIVPWGRMEWMATNFYGAPAAAMELRYVEYHVAAEESSLARRYPREHAVFRD 473
Query: 354 PQAFAGANWSNMRVYLKTQNVKLDIDRFRIYL 385
P A G W + + TQ+VKL++ RFR L
Sbjct: 474 PMAIHGQGWKALADIVMTQDVKLNLRRFRPTL 505
>gi|115434210|ref|NP_001041863.1| Os01g0119000 [Oryza sativa Japonica Group]
gi|11034626|dbj|BAB17150.1| putative HGA6 [Oryza sativa Japonica Group]
gi|14090226|dbj|BAB55487.1| putative HGA6 [Oryza sativa Japonica Group]
gi|113531394|dbj|BAF03777.1| Os01g0119000 [Oryza sativa Japonica Group]
gi|215740963|dbj|BAG97458.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 482
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 108/329 (32%), Positives = 183/329 (55%), Gaps = 22/329 (6%)
Query: 59 KTQPYPRKSDKSAMSKVKELTIT-----TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHE 113
K QPYPR++ +S + E+T+T +AP +C VTH P +VF+ GG GN++H+
Sbjct: 148 KIQPYPRRT----VSGIAEVTVTRQQDRAAAP---ACTVTHGVPGVVFALGGLTGNYWHD 200
Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
F D VPLF+ + +V +++ W KY ++ R +R+ ++++ T CF
Sbjct: 201 FSDVLVPLFVASRRY--GGEVQFLVSNIQPWWLGKYEAVVRRLSRYDAVDLDRDTEVRCF 258
Query: 174 QSVTLGLISHGRMVINPTLLPKPK--TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVN 231
+ V +GL H + P L P + T+ DF +FL + Y ++ + ++V
Sbjct: 259 RRVAVGLRMHKEFSVKPELAPGGQRLTMADFAAFLRDTYALPRAAAAG---ARRPRLVVI 315
Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
R A R I+N+ EV +AAE GF+ + P + + R +++ AMVGVHGAGLT+
Sbjct: 316 RRAHY-RKIVNMDEVVRAAEAAGFEAAVMSPRFDEPVEEVARKVNAFDAMVGVHGAGLTN 374
Query: 292 SLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
++FL G+V++QVVP G + ++ F +P +GL Y+EY + +ES+L+E G V
Sbjct: 375 AVFLPAGAVVIQVVPYGRLERMARADFGEPVADMGLRYMEYSVAADESTLLEMLGPEHQV 434
Query: 351 LKNPQAFAGANWSNM-RVYLKTQNVKLDI 378
+K+P+A + W + YL Q+V++++
Sbjct: 435 VKDPEAVHRSGWDKVAEYYLGKQDVRINV 463
>gi|125568815|gb|EAZ10330.1| hypothetical protein OsJ_00165 [Oryza sativa Japonica Group]
Length = 482
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 108/329 (32%), Positives = 183/329 (55%), Gaps = 22/329 (6%)
Query: 59 KTQPYPRKSDKSAMSKVKELTIT-----TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHE 113
K QPYPR++ +S + E+T+T +AP +C VTH P +VF+ GG GN++H+
Sbjct: 148 KIQPYPRRT----VSGIAEVTVTRQQDRAAAP---ACTVTHGVPGVVFALGGLTGNYWHD 200
Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
F D VPLF+ + +V +++ W KY ++ R +R+ ++++ T CF
Sbjct: 201 FSDVLVPLFVASRRY--GGEVQFLVSNIQPWWLGKYEAVVRRLSRYDAVDLDRDTEVRCF 258
Query: 174 QSVTLGLISHGRMVINPTLLPKPK--TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVN 231
+ V +GL H + P L P + T+ DF +FL + Y ++ + ++V
Sbjct: 259 RRVAVGLRMHKEFSVKPELAPGGQRLTMADFAAFLRDTYALPRAAAAG---ARRPRLVVI 315
Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
R A R I+N+ EV +AAE GF+ + P + + R +++ AMVGVHGAGLT+
Sbjct: 316 RRAHY-RKIVNMDEVVRAAEAAGFEAAVMSPRFDEPVEEVARKVNAFDAMVGVHGAGLTN 374
Query: 292 SLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
++FL G+V++QVVP G + ++ F +P +GL Y+EY + +ES+L+E G V
Sbjct: 375 AVFLPAGAVVIQVVPYGRLERMARADFGEPVADMGLRYMEYSVAADESTLLEMLGPEHQV 434
Query: 351 LKNPQAFAGANWSNM-RVYLKTQNVKLDI 378
+K+P+A + W + YL Q+V++++
Sbjct: 435 VKDPEAVHRSGWDKVAEYYLGKQDVRINV 463
>gi|125524186|gb|EAY72300.1| hypothetical protein OsI_00155 [Oryza sativa Indica Group]
Length = 443
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 107/329 (32%), Positives = 183/329 (55%), Gaps = 22/329 (6%)
Query: 59 KTQPYPRKSDKSAMSKVKELTIT-----TSAPPNLSCGVTHTSPALVFSAGGYNGNFFHE 113
K QPYPR++ +S + E+T+T +AP +C VTH P +VF+ GG GN++H+
Sbjct: 109 KIQPYPRRT----VSGIAEVTVTRQQDRAAAP---ACTVTHGVPGVVFALGGLTGNYWHD 161
Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
F D VPLF+ + +V +++ W KY ++ R +R+ ++++ T C
Sbjct: 162 FSDVLVPLFVASRRY--GGEVQFLVSNIQPWWLGKYEAVVRRLSRYDAVDLDRDTEVRCL 219
Query: 174 QSVTLGLISHGRMVINPTLLPKPK--TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVN 231
+ V +GL H + P L P + T+ DF +FL + Y ++ + ++V
Sbjct: 220 RRVAVGLRMHKEFSVKPELAPGGQRLTMADFAAFLRDTYALPRAAAAGARRPR---LVVI 276
Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTH 291
R A R I+N+ EV +AAE GF+ + P ++ + R +++ AMVGVHGAGLT+
Sbjct: 277 RRAHY-RKIVNMDEVVRAAEAAGFEAAVMSPRFDEAVEEVARKVNAFDAMVGVHGAGLTN 335
Query: 292 SLFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLV 350
++FL G+V++QVVP G + ++ F +P +GL Y+EY + +ES+L+E G V
Sbjct: 336 AVFLPAGAVVIQVVPYGRLERMARADFGEPVADMGLRYMEYSVAADESTLLEMLGPEHQV 395
Query: 351 LKNPQAFAGANWSNM-RVYLKTQNVKLDI 378
+K+P+A + W + YL Q+V++++
Sbjct: 396 VKDPEAVHRSGWDKVAEYYLGKQDVRINV 424
>gi|326504780|dbj|BAK06681.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 123/382 (32%), Positives = 194/382 (50%), Gaps = 27/382 (7%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S+ D+C + G T ++ H ++ + S R+K P+ RK D ++ V
Sbjct: 74 CDMSNLRSDVCELKGDVRVILSNITIIALVHPSASLRRRSRRMK--PHARKKDGHVLASV 131
Query: 76 KEL--TITTSAPPNLSCGVTHTSPALVFS-AGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
++ ++T S+P C +PA+VFS GGY GN FH+F D +PLFIT S F
Sbjct: 132 TDVLVSVTPSSPHVPGCMAESAAPAVVFSVGGGYEGNMFHDFTDVLIPLFIT-ASRF-RS 189
Query: 133 DVILAIADCNDQWARKYAELLPRFTRHPIININNQ-TITHCFQSVTLGLISHGRMVINPT 191
DV L +D W KY LL + H +I+++ Q T C+ V +GL H M IN
Sbjct: 190 DVHLLASDAPSWWLDKYRPLLRGLSGHAVIDMDRQSTEVLCYPHVVVGLSFHKEMSINDA 249
Query: 192 LLPKPK-TLVDFQSFLANAYNENTNTSSSFHHT---------KPKLVLVNRNARVGRTIL 241
++ F +Y +T+ H +P+L++++R + R
Sbjct: 250 KTAGGHYSMAAFARLARRSYGLERDTAIRLLHGSSDNVKSPRRPRLLIISR--KTTRAFT 307
Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
N+ V +AA LG++V + E E+ + L+ R ++SC +VGVHG GL + +FL PG+V+
Sbjct: 308 NMGTVAQAAAMLGYEVIVGEAEQRSDLSALARLVNSCDVLVGVHGTGLANLVFLPPGAVV 367
Query: 302 MQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGA 360
+QVVP+G + ++ F PA +GL Y+ Y + ES+L E + ++ NP A
Sbjct: 368 VQVVPLGGLEAMAGEDFGVPAGDMGLGYVRYTVAIGESTLAELHPSD-----NPAAVRSQ 422
Query: 361 NWSNMR-VYLKTQNVKLDIDRF 381
+R YL QNV L++ RF
Sbjct: 423 ASLALRPAYLAGQNVTLNVTRF 444
>gi|357139224|ref|XP_003571184.1| PREDICTED: uncharacterized protein LOC100840829 [Brachypodium
distachyon]
Length = 546
Score = 177 bits (449), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 116/395 (29%), Positives = 197/395 (49%), Gaps = 29/395 (7%)
Query: 9 SPAKSITCDRSHRSYDLCLINGSA--LFDPKTSTFFSVGHTDSTPS--QPSLRIKTQPYP 64
+P K I CD S YD+C ++G A + P + + G + ++R +++ +
Sbjct: 146 APRKPI-CDLSDPRYDVCDMSGDARAIGGPNRTVLYISGDAAGAEEGHEWAIRDQSRKHL 204
Query: 65 RKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT 124
+ A+ + + P C H PA+VF+ G N +H+F D +PLFIT
Sbjct: 205 EYINTVAVKSLSAAQAQAQSAP--ECTSRHAVPAVVFAMNGLTSNPWHDFSDVLIPLFIT 262
Query: 125 INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTIT-HCFQSVTLGLISH 183
D V + D + KY +L +RH I++++ ++ + C V +GL SH
Sbjct: 263 ARGF--DGGVQFLVTDIQPWFLDKYRLILANLSRHDIVDLDKESGSVRCHPRVIVGLRSH 320
Query: 184 GRMVINPTLLP---KPKTLVDFQSFLANAYN------------ENTNTSSSFHHTKPKLV 228
+ I+P P K T++DF+ ++ ++ ++ ++ KP+L+
Sbjct: 321 RDLGIDPARFPAGNKNYTMLDFRMYIRELFSLPPASVDIPYKEQSAAAAAEKQRKKPRLM 380
Query: 229 LVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAG 288
L+NR R +NL E+ AAE GF+ + EP L + R + SC ++G HGAG
Sbjct: 381 LINRGR--NRKFVNLPEIAAAAEAAGFETVVVEPRRDLKLEEFSRVVDSCDVLMGAHGAG 438
Query: 289 LTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
LT+ FLR G+V++QVVP G + S ++ PA+ + L +EY I EES+L EKYG +
Sbjct: 439 LTNFFFLRTGAVMLQVVPWGHMERPSMEFYGVPAKEMRLRDVEYSITAEESTLYEKYGKD 498
Query: 348 DLVLKNPQAFAGANWS-NMRVYLKTQNVKLDIDRF 381
+++P++ W MR Y Q+++L++ RF
Sbjct: 499 HPAVRDPESIHRQGWQLGMRYYWLEQDIRLNVTRF 533
>gi|238007990|gb|ACR35030.1| unknown [Zea mays]
Length = 260
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 93/254 (36%), Positives = 147/254 (57%), Gaps = 9/254 (3%)
Query: 145 WARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQS 204
W K+ + TRH +I+++ HCF + +G H M ++P P + VDF+
Sbjct: 5 WVGKFTPFFRQLTRHDVIDVDKDGEVHCFPRIVVGATFHRDMGVDPRRAPGHVSAVDFKR 64
Query: 205 FLANAYNENTNTSS-----SFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTI 259
L A+ +S + H KP+L++++R R R LN RE+ AA + GF+V +
Sbjct: 65 ALRAAFGLKREAASRGGGGATGHGKPRLLIISR--RGSRRFLNSREMAVAAGDAGFEVRV 122
Query: 260 FEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QWLSTVYFE 318
EPE+ T +A ++S AMVGVHGAGLT+ +FL G+VL+QVVP G +WL+ V F+
Sbjct: 123 AEPEQRTDMAAFAALVNSADAMVGVHGAGLTNMVFLPRGAVLVQVVPFGGLEWLTGVTFK 182
Query: 319 KPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLD 377
+PA + + Y++Y ++ EESSLV++Y VL +P A W ++ YL QN+++D
Sbjct: 183 EPAADMEVSYMDYHVRLEESSLVDQYLRGHQVLTDPYAVHRQGWDALKTAYLDKQNIRMD 242
Query: 378 IDRFRIYLKDAYKK 391
+DRFR L++ +
Sbjct: 243 LDRFRATLREVMAR 256
>gi|222628421|gb|EEE60553.1| hypothetical protein OsJ_13906 [Oryza sativa Japonica Group]
Length = 527
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 114/383 (29%), Positives = 197/383 (51%), Gaps = 40/383 (10%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S YD+C+++G +V T +P K +PY RK + +
Sbjct: 170 CDTSFGKYDICVLDGDTRAQGGCGAGAAV-VTLVSPRAAPREWKIKPYSRK----YLDGL 224
Query: 76 KELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVI 135
K +T++++ P GN++H+F D VPLFI +V
Sbjct: 225 KPVTVSSAGSP---------------------GNYWHDFTDVLVPLFIGARRF--GGEVQ 261
Query: 136 LAIADCNDQWARKYAELLPRFTRHPIINI---NNQTITHCFQSVTLGLISHGRMVINPTL 192
L + + W KY + + +RH I+++ +++ + C+ V +G S I+P+L
Sbjct: 262 LLVVNLLPFWVDKYRRIFSQISRHDIVDLEKDDDRGVVRCYPHVVVGYGSRKEFTIDPSL 321
Query: 193 --LPKPKTLVDFQSFLANAYN---ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
T+V+F FL +Y+ + + H +P+++++ R R ++NL EV
Sbjct: 322 DDTGGGYTMVNFTEFLRQSYSLPRDRPIKLGTNHGARPRMMILERTN--SRKLMNLPEVA 379
Query: 248 KAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPI 307
AA GF+VT+ +++ + R ++S MVGVHGAGLT+ +FL G+VL+Q+VP
Sbjct: 380 AAARAAGFEVTVAGGRPTSTYDEFAREVNSYDVMVGVHGAGLTNCVFLPTGAVLLQIVPY 439
Query: 308 GT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM- 365
G + ++ F +PAR +GL Y+EY I +ESSL++ +G + ++K+P A + W N+
Sbjct: 440 GRLESIAQTDFGEPARDMGLRYIEYDIAADESSLMDVFGKDHPMIKDPVAVHLSGWGNVA 499
Query: 366 RVYLKTQNVKLDIDRFRIYLKDA 388
YL Q+V+++I+RFR +L A
Sbjct: 500 EWYLGKQDVRVNIERFRPFLTQA 522
>gi|219888573|gb|ACL54661.1| unknown [Zea mays]
gi|413953956|gb|AFW86605.1| hypothetical protein ZEAMMB73_638297 [Zea mays]
Length = 501
Score = 170 bits (431), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 115/397 (28%), Positives = 188/397 (47%), Gaps = 42/397 (10%)
Query: 7 ALSPAKSITCDRS-------HRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIK 59
A P ITCD + + +C + G PK T + V + + P + K
Sbjct: 126 ATKPEAKITCDENGVDEGFPYARPPVCELAGDIRVSPKEKTMYFVNPSGAGPFDSNGEKK 185
Query: 60 TQPYPRKSDKSAMSKVKELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDC 117
+P+ RK D + V E+TI +SA C H P +VFS GY NFFH+ D
Sbjct: 186 IRPFARKDD-FLLPGVVEVTIKSVSSAAAAPRCTRRHDVPVVVFSVAGYTDNFFHDNTDV 244
Query: 118 FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVT 177
+PLF+T +H +V L + + W RK+ LL + + + +I+ HCF++
Sbjct: 245 LIPLFLT-TAHL-QGEVQLLVTNYKPWWVRKFTPLLRKLSNYDVIDFGKDDEVHCFRAGH 302
Query: 178 LGLISHGRMVINP--TLLPKPKTLVDFQSFLANAYNENTNTSSSFHH---TKPKLVLVNR 232
LG+ ++I+P T P+ ++VD+ FL A+N + + KP+++++ R
Sbjct: 303 LGMYRDRDLIISPHPTRNPRNYSMVDYNRFLRRAFNLPRDAPAVLGEETSAKPQMLIIER 362
Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHS 292
R +LNL EV E LGF VT+ E + AD F +A
Sbjct: 363 KGT--RKLLNLPEVSALCEALGFAVTVAEAD-----ADVRVFAEKVNA------------ 403
Query: 293 LFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
VL+Q+VP G W++T ++ +PAR + L Y+EY + +EE++L +KY + V
Sbjct: 404 -----ADVLVQIVPWGKMDWMATNFYGQPARDMRLRYVEYYVSEEETTLKDKYPRDHYVF 458
Query: 352 KNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
K+P W + + Q+V +++ RF+ +L A
Sbjct: 459 KDPMHIHAQGWPAIAEIIMKQDVMVNMTRFKPFLLQA 495
>gi|222617640|gb|EEE53772.1| hypothetical protein OsJ_00161 [Oryza sativa Japonica Group]
Length = 546
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 117/380 (30%), Positives = 173/380 (45%), Gaps = 28/380 (7%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S D+C +G TS F V + +R PYPRK D + M +V
Sbjct: 181 CDFSDFRTDICDFSGDVRMAANTSEFVVVVDPAAAAVSHKVR----PYPRKGDATCMGRV 236
Query: 76 KELTI--TTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQD 133
E+T+ T A C H PA+ FS GGY GN FH+F D VPL+ T+ +
Sbjct: 237 TEITVRATGDAAGAPRCTRAHDVPAVAFSIGGYTGNIFHDFSDVIVPLYNTVQRY--RGG 294
Query: 134 VILAIADCNDQWARKYAELLPRFTRHPIININNQTIT---HCFQSVTLGLISHGRMVINP 190
V L + + W KY +LL +RH I++ HCF S + L +H ++I
Sbjct: 295 VQLVMVNVASWWLVKYDKLLRELSRHAPIDLAKAGAAGEVHCFPSAVVSLRAHRELIIER 354
Query: 191 TLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAA 250
DF FL A + + + +L +AA
Sbjct: 355 ERSLDGLATPDFTRFLRRALSLPRERADAARRRHRAAAAPPG---------HLPGAHQAA 405
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT- 309
E L I+SC A++GVHGAGLT+ +FL PG+ ++QVVP G
Sbjct: 406 PEPRRRGPRGGGGRVGKL------INSCDALLGVHGAGLTNMMFLPPGATMVQVVPWGGL 459
Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VY 368
QW++ + + +PA +GL Y++Y+I ESSL +KY A D + NP ++ M+
Sbjct: 460 QWIARMDYGEPAAAMGLNYIQYEIAVAESSLKDKYPAGDEIFTNPTGLHKRGFAFMKQTL 519
Query: 369 LKTQNVKLDIDRFRIYLKDA 388
+ Q++ +D+ RFR L+ A
Sbjct: 520 MDGQDITIDVTRFRPVLQQA 539
>gi|413950202|gb|AFW82851.1| hypothetical protein ZEAMMB73_004474 [Zea mays]
Length = 329
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 103/338 (30%), Positives = 165/338 (48%), Gaps = 37/338 (10%)
Query: 72 MSKVKELTITTSAPPNLSCGVT----HTSPALVFSAGGYNGNFFHEFMDCFVPLFITINS 127
M+ V+E T+ P HT+P +FS GG++GN +H++ D VPLFI+ +
Sbjct: 1 MAHVREYTLKALPEPGAGAAPASTRNHTNPGFLFSNGGFSGNMYHDYTDVLVPLFISTHQ 60
Query: 128 HFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMV 187
F ++ ++ W K+ + TRH +I+++N HCF + +G H M
Sbjct: 61 -FRER-----VSGMKPWWMGKFTPFFRQLTRHDVIDVDNDREVHCFPRIVVGATFHRDMG 114
Query: 188 INPTLLPKPKTLVDFQSFLANAYN---------------ENTNTSSSFHHTKPKLVLVNR 232
++P P ++VD + L A+ +++ HH L LV
Sbjct: 115 VDPRRSPGHISVVDLKRALRAAFRLKREAASRGGVPQRRHGARQTAAAHHLPVWLTLVPE 174
Query: 233 NARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHS 292
A G R +T +A ++S +MVGVHGAGLT+
Sbjct: 175 RAGDGARRGRRRVRGAGGGA----------RPATDMATFVALVNSADSMVGVHGAGLTNM 224
Query: 293 LFLRPGSVLMQVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
+FL G VL+QVVP G +WL+ V F++PA + + Y++Y ++ EESSLV++Y VL
Sbjct: 225 VFLPCGVVLVQVVPFGGLEWLTGVTFKEPASDMEVSYMDYNVRLEESSLVDQYPRGHQVL 284
Query: 352 KNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDA 388
+P A W ++ YL QN+++D+DRFR L++A
Sbjct: 285 TDPYAVHRQGWDALKTAYLDKQNIQMDLDRFRATLREA 322
>gi|449526607|ref|XP_004170305.1| PREDICTED: glycosyltransferase-like domain-containing protein
2-like [Cucumis sativus]
Length = 335
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 101/301 (33%), Positives = 150/301 (49%), Gaps = 8/301 (2%)
Query: 25 LCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKELTITTSA 84
+CL N + F+ + DS + S I PY R+ DK + V L I
Sbjct: 35 VCLTNNPTRINNTNLEFYISTNNDSQQNNFS-PILIHPYARQEDKITLRDVTPLQIIFQP 93
Query: 85 PPNLS--CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCN 142
L C H P L+FS GG+ GN FHEF + +PLFIT + HF + V I D
Sbjct: 94 NKTLLPLCQFIHNVPVLIFSTGGFTGNLFHEFDETIIPLFIT-SYHFQTR-VRFLITDHK 151
Query: 143 DQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDF 202
W +KY +L +R ++N HCF +GL H + +N T +P ++ DF
Sbjct: 152 TWWVQKYNRILSGLSRFNVVNPAEDGSVHCFNGGVIGLKFHNILSLNNTDIPGGYSMSDF 211
Query: 203 QSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEP 262
+SFL YN N S KP ++L++R + R +N E+ + +E+GF+V P
Sbjct: 212 RSFLRQTYNLKVNNVSELSGKKPMVMLISR--QTSRRFMNEGEMVEMMKEVGFEVMTTTP 269
Query: 263 EESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPAR 322
+ ++L ++ C ++G HGAGLT+ +FL G+V++QVVP G W ST YF R
Sbjct: 270 QRMSNLDKFSSVVNLCSVIIGAHGAGLTNEVFLANGAVVVQVVPFGLDWPST-YFLGSQR 328
Query: 323 V 323
+
Sbjct: 329 L 329
>gi|297737475|emb|CBI26676.3| unnamed protein product [Vitis vinifera]
Length = 420
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 91/256 (35%), Positives = 137/256 (53%), Gaps = 3/256 (1%)
Query: 136 LAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPK 195
I D +Y +L + + + IN HCF +GL H + +N + +P
Sbjct: 158 FVITDDKPWMVERYKRILKQLSAYNAINPAEDGSVHCFPGAVIGLKYHDNLALNTSDIPG 217
Query: 196 PKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGF 255
++ DF+ FL +Y+ T+S H KP L+L++R R R LN E E+LGF
Sbjct: 218 GYSMSDFKHFLRKSYSLKITTASEIEHPKPVLILISR--RTTRKFLNEDETVGLMEDLGF 275
Query: 256 DVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTV 315
V I P +SL + ++SC MVG HGAGL +S+ L G+V +QVVP+G S
Sbjct: 276 QVIITPPYNMSSLDKFAQVVNSCSVMVGAHGAGLANSVLLPAGAVTVQVVPLGLDRASAA 335
Query: 316 YFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNV 374
+ PAR +G++YLEYKI+ EESSL + YG + ++ +PQ+ + R VYL Q++
Sbjct: 336 NYGDPARKIGVQYLEYKIEAEESSLFDLYGRDQPIIADPQSIHLKGYDVARAVYLDGQDM 395
Query: 375 KLDIDRFRIYLKDAYK 390
K+++ R R L A K
Sbjct: 396 KINLVRLREILVQARK 411
>gi|242066748|ref|XP_002454663.1| hypothetical protein SORBIDRAFT_04g035100 [Sorghum bicolor]
gi|241934494|gb|EES07639.1| hypothetical protein SORBIDRAFT_04g035100 [Sorghum bicolor]
Length = 457
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 113/394 (28%), Positives = 183/394 (46%), Gaps = 29/394 (7%)
Query: 6 AALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDST-----PSQPSLRIKT 60
A + P CD S R YD C + G D +T++ G S P Q
Sbjct: 62 APVHPPLEPICDLSDRRYDGCEMWG----DARTAS----GADKSVVYFIPPPQQLATAAA 113
Query: 61 QPYPRKSDKSAMSKVKELTITTSAPPNL----SCGVTHTSPALVFSAGGYNGNFFHEFMD 116
+ +S + V+E+ + + +L C V PA+VF+ GG N++H F D
Sbjct: 114 ATWSIRSQSRKIVGVREVIVRSLDASSLHEAPGCTVRRDVPAVVFALGGLTSNYWHAFSD 173
Query: 117 CFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ-NQTITHCFQS 175
VPLF T + D D++ A + KY +L +R+ +++++ + + C+
Sbjct: 174 VLVPLFTTARAFGGDVDLLATGAGGQAWFLGKYDRVLRALSRYDVVDLDADGDVVRCYHH 233
Query: 176 VTLGLISHGRMVINPTLLPKPKTLVDFQSFL------ANAYNENTNTSSSFHHTKPKLVL 229
V +GL H I+ P ++ F+ F+ S T+P+L+L
Sbjct: 234 VVVGLRGHRDFDIDAARAPNGYDMLAFREFVRAAYSLPPPPAAALPCKSGGGGTRPRLML 293
Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEES-TSLADSFRFIHSCHAMVGVHGAG 288
V R R +N + A E GF+V + S S+ R + +C +VG HGAG
Sbjct: 294 VLRGRT--RRFVNEGAIVDAIERAGFEVARMDETASWGSVGAVAREVDACDVLVGAHGAG 351
Query: 289 LTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
LT+ +FLR G+V++QV+P G + +F PA +GL ++ Y I EES+L E+YG +
Sbjct: 352 LTNMVFLRAGAVVVQVIPWGKMEPYGEGFFGAPAAHMGLRHVAYSIAAEESTLYERYGKD 411
Query: 348 DLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRF 381
V+ +P F N SN + Y Q+++L+ RF
Sbjct: 412 HPVMADPDVFY-RNGSNAKFYWWEQSIRLNTTRF 444
>gi|226503289|ref|NP_001147801.1| glycosyltransferase [Zea mays]
gi|195613814|gb|ACG28737.1| glycosyltransferase [Zea mays]
Length = 551
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 122/395 (30%), Positives = 189/395 (47%), Gaps = 56/395 (14%)
Query: 25 LCLINGSALFDPKTSTFFSVGHTDSTPS-QPSLRIKTQPYPRKSDKSAMS-KVKELTITT 82
+C ++G PK T + V + S+ + +PY RK D S VK +
Sbjct: 188 VCELSGDVRVSPKQRTVYLVNPSGGGGGFDESVEKRLRPYARKDDSSMPGVTVKSVASGA 247
Query: 83 SAPPNLSCGVTHTSPALVFSAGGYN--GNFFHEFM-DCFVPLFITINSHFPDQDVILAIA 139
+AP C H PA+VFS GGYN N F + M D VPLF+T +H D +V L +A
Sbjct: 248 AAP---ECTKRHAVPAVVFSVGGYNTDNNLFDDDMTDALVPLFLT-TAHL-DGEVQLVVA 302
Query: 140 DCNDQWARKYAELLPRFTRHPIININNQT---------ITHCF--QSVTLGLISHGRMVI 188
D +W RKYA LL + +RH +++++ HCF + +GL +
Sbjct: 303 DYKPRWVRKYAPLLRKLSRHGVVSLDGDAEGEEEGPLDGVHCFPAGAFVVGLYRDRDRDL 362
Query: 189 N----PTLLPKPKTLVDFQSFLANAY---NENTNTSSSFHHTKPKLVLVNRNARVGRTIL 241
+ P P+ T+ DF FL A + +P+L++V+ R +L
Sbjct: 363 DLSPHPARNPRNVTMADFARFLRGALALPRDRPAVLGGAPGMRPRLLVVSP-----RRLL 417
Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
NL EV AA+ LGFDV E + +A+ +++ +VGV GAGLT+ +FL +VL
Sbjct: 418 NLEEVAAAADALGFDVASAE-AGADDVAEFAARVNAADVLVGVRGAGLTNQVFLPTEAVL 476
Query: 302 MQVVPIGT--QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAG 359
+Q+VP G +W++T + + A +GL YLEY + +EE+ L +KY
Sbjct: 477 VQIVPWGKKIEWVTTSSYGRAAAGMGLRYLEYYVGEEETRLRDKY--------------- 521
Query: 360 ANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
R + Q+V +++ RFR L A K ++
Sbjct: 522 -----PRETVMEQDVVVNLTRFRPVLLQALDKLQQ 551
>gi|413935206|gb|AFW69757.1| hypothetical protein ZEAMMB73_948511 [Zea mays]
Length = 479
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 123/399 (30%), Positives = 192/399 (48%), Gaps = 60/399 (15%)
Query: 25 LCLINGSALFDPKTSTFFSVGHTDSTPS--QPSLRIKTQPYPRKSDKSAMS-KVKELTIT 81
+C ++G PK T + V + S+ + +PY RK D S VK +
Sbjct: 112 VCELSGDVRVSPKQRTVYLVNPSGGGGGGFDESVEKRLRPYARKDDSSMPHITVKSVASG 171
Query: 82 TSAPPNLSCGVTHTSPALVFSAGGYN--GNFFHEFM-DCFVPLFITINSHFPDQDVILAI 138
+AP C H PA+VFS GGYN N F + M D VPLF+T +H D +V L +
Sbjct: 172 AAAP---ECTKRHAVPAVVFSVGGYNTDNNLFDDDMTDALVPLFLT-TAHL-DGEVQLVV 226
Query: 139 ADCNDQWARKYAELLPRFTRHPIININNQTI---------THCF--QSVTLGLISHGR-- 185
AD +W RKYA LL + +RH +++++ HCF + +GL+ R
Sbjct: 227 ADYKPRWVRKYAPLLRKLSRHGVVSLDGDAEGEEEGPLDGVHCFPAGAFVVGLLYRDRDR 286
Query: 186 -MVINP----TLLPKPKTLVDFQSFLANAY---NENTNTSSSFHHTKPKLVLVNRNARVG 237
+ ++P P+ T+ DF FL A + +P+L++V+
Sbjct: 287 DLDLSPHPARNRNPRNVTMADFARFLRGALALPRDRPAVLGGAPGMRPRLLVVSP----- 341
Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
R +LNL EV AA+ LGFDV E + +A+ +++ +VGV GAGLT+ +FL
Sbjct: 342 RRLLNLEEVAAAADALGFDVASAE-AGADDVAEFAARVNAADVLVGVRGAGLTNQVFLPT 400
Query: 298 GSVLMQVVPIGT--QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
+VL+Q+VP G +W++T + + A +GL YLEY + +EE+ L +KY
Sbjct: 401 EAVLVQIVPWGKKIEWVTTSSYGRAAAGMGLRYLEYYVGEEETRLRDKY----------- 449
Query: 356 AFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
R + Q+V +++ RFR L A K ++
Sbjct: 450 ---------PRETVMEQDVVVNLTRFRPVLLQALDKLQQ 479
>gi|413935205|gb|AFW69756.1| glycosyltransferase [Zea mays]
Length = 555
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 123/399 (30%), Positives = 192/399 (48%), Gaps = 60/399 (15%)
Query: 25 LCLINGSALFDPKTSTFFSVGHTDSTPS--QPSLRIKTQPYPRKSDKSAMS-KVKELTIT 81
+C ++G PK T + V + S+ + +PY RK D S VK +
Sbjct: 188 VCELSGDVRVSPKQRTVYLVNPSGGGGGGFDESVEKRLRPYARKDDSSMPHITVKSVASG 247
Query: 82 TSAPPNLSCGVTHTSPALVFSAGGYN--GNFFHEFM-DCFVPLFITINSHFPDQDVILAI 138
+AP C H PA+VFS GGYN N F + M D VPLF+T +H D +V L +
Sbjct: 248 AAAP---ECTKRHAVPAVVFSVGGYNTDNNLFDDDMTDALVPLFLT-TAHL-DGEVQLVV 302
Query: 139 ADCNDQWARKYAELLPRFTRHPIININNQT---------ITHCF--QSVTLGLISHGR-- 185
AD +W RKYA LL + +RH +++++ HCF + +GL+ R
Sbjct: 303 ADYKPRWVRKYAPLLRKLSRHGVVSLDGDAEGEEEGPLDGVHCFPAGAFVVGLLYRDRDR 362
Query: 186 -MVINP----TLLPKPKTLVDFQSFLANAY---NENTNTSSSFHHTKPKLVLVNRNARVG 237
+ ++P P+ T+ DF FL A + +P+L++V+
Sbjct: 363 DLDLSPHPARNRNPRNVTMADFARFLRGALALPRDRPAVLGGAPGMRPRLLVVSP----- 417
Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
R +LNL EV AA+ LGFDV E + +A+ +++ +VGV GAGLT+ +FL
Sbjct: 418 RRLLNLEEVAAAADALGFDVASAE-AGADDVAEFAARVNAADVLVGVRGAGLTNQVFLPT 476
Query: 298 GSVLMQVVPIG--TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
+VL+Q+VP G +W++T + + A +GL YLEY + +EE+ L +KY
Sbjct: 477 EAVLVQIVPWGKKIEWVTTSSYGRAAAGMGLRYLEYYVGEEETRLRDKY----------- 525
Query: 356 AFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
R + Q+V +++ RFR L A K ++
Sbjct: 526 ---------PRETVMEQDVVVNLTRFRPVLLQALDKLQQ 555
>gi|413935207|gb|AFW69758.1| hypothetical protein ZEAMMB73_948511 [Zea mays]
Length = 386
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 123/399 (30%), Positives = 192/399 (48%), Gaps = 60/399 (15%)
Query: 25 LCLINGSALFDPKTSTFFSVGHTDSTPS--QPSLRIKTQPYPRKSDKSAMS-KVKELTIT 81
+C ++G PK T + V + S+ + +PY RK D S VK +
Sbjct: 19 VCELSGDVRVSPKQRTVYLVNPSGGGGGGFDESVEKRLRPYARKDDSSMPHITVKSVASG 78
Query: 82 TSAPPNLSCGVTHTSPALVFSAGGYN--GNFFHEFM-DCFVPLFITINSHFPDQDVILAI 138
+AP C H PA+VFS GGYN N F + M D VPLF+T +H D +V L +
Sbjct: 79 AAAP---ECTKRHAVPAVVFSVGGYNTDNNLFDDDMTDALVPLFLT-TAHL-DGEVQLVV 133
Query: 139 ADCNDQWARKYAELLPRFTRHPIININNQTI---------THCF--QSVTLGLISHGR-- 185
AD +W RKYA LL + +RH +++++ HCF + +GL+ R
Sbjct: 134 ADYKPRWVRKYAPLLRKLSRHGVVSLDGDAEGEEEGPLDGVHCFPAGAFVVGLLYRDRDR 193
Query: 186 -MVINP----TLLPKPKTLVDFQSFLANAY---NENTNTSSSFHHTKPKLVLVNRNARVG 237
+ ++P P+ T+ DF FL A + +P+L++V+
Sbjct: 194 DLDLSPHPARNRNPRNVTMADFARFLRGALALPRDRPAVLGGAPGMRPRLLVVS-----P 248
Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
R +LNL EV AA+ LGFDV E + +A+ +++ +VGV GAGLT+ +FL
Sbjct: 249 RRLLNLEEVAAAADALGFDVASAE-AGADDVAEFAARVNAADVLVGVRGAGLTNQVFLPT 307
Query: 298 GSVLMQVVPIG--TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQ 355
+VL+Q+VP G +W++T + + A +GL YLEY + +EE+ L +KY
Sbjct: 308 EAVLVQIVPWGKKIEWVTTSSYGRAAAGMGLRYLEYYVGEEETRLRDKY----------- 356
Query: 356 AFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
R + Q+V +++ RFR L A K ++
Sbjct: 357 ---------PRETVMEQDVVVNLTRFRPVLLQALDKLQQ 386
>gi|414876591|tpg|DAA53722.1| TPA: hypothetical protein ZEAMMB73_455614 [Zea mays]
Length = 302
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 88/302 (29%), Positives = 161/302 (53%), Gaps = 14/302 (4%)
Query: 99 LVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTR 158
+V + GG GN++H+F D +PL++ + F + +V L + + + KY +L R +R
Sbjct: 1 MVLAMGGLTGNYWHDFSDIMIPLYLQ-AARF-EGEVQLVVENIQPWYVGKYRAILRRLSR 58
Query: 159 HPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSS 218
H I++++ CF +G+ H I+P P ++ +F +FL Y+ +
Sbjct: 59 HDIVDMDRDDRVRCFPGAVVGIRMHKEFSIDPAREPLGHSMPEFTAFLRETYSLPRAAPA 118
Query: 219 SF--------HHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLAD 270
+P++++++R R R ++NL V A +GF+V I +P + + +
Sbjct: 119 RLAGADGEEDERARPRMMVISR--RHPRKLVNLDAVVALARRVGFEVVIGDPPFNVDVGE 176
Query: 271 SFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYL 329
+ +++ +VGVHGAGLT+SLFL G+V +Q+ P G + + V F PA +GL+Y+
Sbjct: 177 FAKEVNAVDVLVGVHGAGLTNSLFLPTGAVFIQINPYGKMEHIGEVDFGIPAVDMGLKYI 236
Query: 330 EYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
Y EE++LV+ G + +K+P++ + W + YL Q+V+LD+ RF L+ A
Sbjct: 237 AYSAGVEETTLVDTLGRDHPAVKDPESIHRSGWGKVAEYYLGKQDVRLDLQRFEPVLRKA 296
Query: 389 YK 390
+
Sbjct: 297 MQ 298
>gi|49660109|gb|AAT68345.1| hypothetical protein At2g03360 [Arabidopsis thaliana]
Length = 214
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 76/158 (48%), Positives = 105/158 (66%), Gaps = 5/158 (3%)
Query: 2 KLRPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQ 61
K P ++ + ITCDRSH SYDLC INGS + +PKT T + T +T S P + K +
Sbjct: 57 KQNPKVMAASAKITCDRSHTSYDLCSINGSCILNPKTGTLTLMDRTLTT-SAPLVE-KIR 114
Query: 62 PYPRKSDKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
PYPRKS+ M +++EL +T S P +L SC +TH SPA+VFSAGGY G+ +H+F+D F+
Sbjct: 115 PYPRKSENWIMPRIRELKLT-SGPSDLTRSCDITHDSPAIVFSAGGYTGSIYHDFIDGFI 173
Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFT 157
PLFIT NS +PD+D IL + + + W KY ++L T
Sbjct: 174 PLFITANSVYPDRDFILVVVNPKEWWMPKYIDILDMVT 211
>gi|55740543|gb|AAV63864.1| hypothetical protein At2g03360 [Arabidopsis thaliana]
Length = 210
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/158 (48%), Positives = 105/158 (66%), Gaps = 5/158 (3%)
Query: 2 KLRPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQ 61
K P ++ + ITCDRSH SYDLC INGS + +PKT T + T +T S P + K +
Sbjct: 53 KQNPKVMAASAKITCDRSHTSYDLCSINGSCILNPKTGTLTLMDRTLTT-SAPLVE-KIR 110
Query: 62 PYPRKSDKSAMSKVKELTITTSAPPNL--SCGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
PYPRKS+ M +++EL +T S P +L SC +TH SPA+VFSAGGY G+ +H+F+D F+
Sbjct: 111 PYPRKSENWIMPRIRELKLT-SGPSDLTRSCDITHDSPAIVFSAGGYTGSIYHDFIDGFI 169
Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFT 157
PLFIT NS +PD+D IL + + + W KY ++L T
Sbjct: 170 PLFITANSVYPDRDFILVVVNPKEWWMPKYIDILDMVT 207
>gi|224035277|gb|ACN36714.1| unknown [Zea mays]
Length = 250
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 84/226 (37%), Positives = 125/226 (55%), Gaps = 22/226 (9%)
Query: 176 VTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH------------- 222
+ +GL HG +V++P L+P K++ DFQ+ L Y+ T T+S+
Sbjct: 1 MVVGLRIHGELVVDPKLMPDGKSIKDFQALLHQGYSRTTTTASASSSSPTPLLPLAPPTR 60
Query: 223 ------TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIH 276
KPK+++ R + R +LNL +V A GF + T LA +
Sbjct: 61 PCPRSANKPKMLIFIR--KQNRVLLNLPQVVTACRRAGFAPHVMNLRRQTPLAAIHAALA 118
Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
S AMV VHGA +TH LF+RPGSVL+Q+VP+G W + ++ KPA+ LGL+YLEY++ E
Sbjct: 119 SADAMVAVHGAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLDYLEYRVAPE 178
Query: 337 ESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRF 381
ESSL +YG N V++ P + W M +VY+ QNV +++ RF
Sbjct: 179 ESSLAAEYGLNSTVVRTPWVISSRGWWEMKKVYMDRQNVTVNVKRF 224
>gi|357481795|ref|XP_003611183.1| Glycosyltransferase, putative [Medicago truncatula]
gi|355512518|gb|AES94141.1| Glycosyltransferase, putative [Medicago truncatula]
Length = 341
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 105/354 (29%), Positives = 165/354 (46%), Gaps = 58/354 (16%)
Query: 1 MKLRPAALSPAKSITCDRSHRSYDLCLINGSALFDPKTSTFFSVGH-TDSTPSQPSLRIK 59
++ R L + + + Y C G K+S+ + V H T+S S I+
Sbjct: 28 LETRKVELEQEQPLCVSEARTEY--CQPQGDIRVHGKSSSVYIVSHKTNSLAENVSWIIR 85
Query: 60 TQPYPRKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFV 119
PY K+ K + KV + C H+ PA++FS GY GN FHEF D +
Sbjct: 86 --PYSVKTVKPS-HKVSQ------------CTKYHSIPAVIFSTAGYTGNHFHEFSDIVI 130
Query: 120 PLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLG 179
PLF+T + V L I D W K+ L + + + II+I+ HCF V +G
Sbjct: 131 PLFLT--CRLFNGQVQLIITDKKSWWISKHQAFLKKLSNYEIIDIDRDDEVHCFSKVIIG 188
Query: 180 LIS-HGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT-----KPKLVLVNRN 233
L H + I+P ++ DF FL ++Y+ + KP+L++++R
Sbjct: 189 LKRYHKELSIDPQ--KYSYSIKDFMEFLRSSYSLKRVGAIKIRDIGNKSKKPRLLILSR- 245
Query: 234 ARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSL 293
+ R+ +N ++ K A+ LGF V VHGAGLT+ L
Sbjct: 246 -KTSRSFINTNQITKMAKGLGFRVI-------------------------VHGAGLTNIL 279
Query: 294 FLRPGSVLMQVVPIG---TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKY 344
FL ++ +QVVP G +WL+T F +P + ++YLEYKI+ EES+L+++Y
Sbjct: 280 FLPQNAIFIQVVPFGGMQVEWLATNDFARPLENMNIKYLEYKIRLEESTLIQQY 333
>gi|226528202|ref|NP_001145173.1| uncharacterized protein LOC100278409 [Zea mays]
gi|195652185|gb|ACG45560.1| hypothetical protein [Zea mays]
Length = 248
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 79/240 (32%), Positives = 130/240 (54%), Gaps = 8/240 (3%)
Query: 159 HPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN---ENTN 215
+ +++++ HCF+ V +GL SH I+P P +++DF F+ AY +
Sbjct: 4 YELVDLDRDPRVHCFRHVQVGLTSHDDFSIDPLRAPNGYSMLDFTGFMRAAYGLPRGDVA 63
Query: 216 TSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFI 275
+ +P+L+++ R AR R +N E+ + AE+LGF+V + E + +A
Sbjct: 64 AAGPSSKRRPRLLVIAR-ART-RRFVNAEEIVRGAEKLGFEVVV--SEGTHEVAPFAELA 119
Query: 276 HSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQ 335
+SC A++GV GAGLT+ + G V++QVVP+G YF P+R +GL YLEY+I
Sbjct: 120 NSCDAIMGVXGAGLTNMXXVPTGGVVIQVVPLGGLEFVAGYFRGPSRDMGLRYLEYRITP 179
Query: 336 EESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
EES+L+++ + +P W +++ YL Q+V LD+ RFR LK A +K
Sbjct: 180 EESTLIDQXPRXHPIFTDPNGIKSKGWXSLKDAYLDKQDVSLDMKRFRPTLKKAIAHIRK 239
>gi|125552212|gb|EAY97921.1| hypothetical protein OsI_19837 [Oryza sativa Indica Group]
Length = 265
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 73/174 (41%), Positives = 101/174 (58%), Gaps = 10/174 (5%)
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
KPKLV+ R + R +LNL + A GF + T L + S AMV
Sbjct: 78 AKPKLVIFIR--KQNRVLLNLPHIVTACRRAGFAPHVMNLRRQTPLPAIHAALSSADAMV 135
Query: 283 GVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVE 342
VHGA +TH LF+RPGSVL+Q+VP+G W + ++ KPA+ LGL YLEYK+ EESSL
Sbjct: 136 AVHGAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLGYLEYKVAPEESSLAA 195
Query: 343 KYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRF-------RIYLKDA 388
+YG N V+++P + W M +VY+ QNV ++I RF R++LK+A
Sbjct: 196 EYGVNSTVVRDPSVISSRGWWEMKKVYMDRQNVTVNIKRFGELLRSARLHLKNA 249
>gi|77551625|gb|ABA94422.1| glycosyltransferase, putative [Oryza sativa Japonica Group]
Length = 295
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 76/213 (35%), Positives = 111/213 (52%), Gaps = 8/213 (3%)
Query: 94 HTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELL 153
H +P LV +AGGY GN FH F D FVP ++T+ H + V+L + N WA Y E++
Sbjct: 87 HDAPLLVMTAGGYTGNLFHAFSDGFVPAWLTVQ-HL-RRRVVLGVLLYNPWWAGTYGEII 144
Query: 154 PRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNEN 213
+ ++++ + HCF +G HG + +NP L KT+VDF LA+ Y
Sbjct: 145 SGLLDYHVVDLLHDKRKHCFPGAIIGTRFHGILSVNPARLRDNKTIVDFHDLLADVYETA 204
Query: 214 TNTS----SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLA 269
+T +P+L +V+ R R I N V + A +GFDV I E + L
Sbjct: 205 GDTVVVDVPQPAPRRPRLGIVS--CRGKRVIENQAAVARLARTVGFDVDILETADGLQLP 262
Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
S+ + +C +VGVH A LT LFLRPG+ L+
Sbjct: 263 ASYASVSACDVLVGVHSADLTKLLFLRPGAALV 295
>gi|125539259|gb|EAY85654.1| hypothetical protein OsI_07024 [Oryza sativa Indica Group]
Length = 205
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 122/205 (59%), Gaps = 13/205 (6%)
Query: 199 LVDFQSFLANAYN-ENTNTSSSFHH----TKPKLVLVNRNARVGRTILNLREVKKAAEEL 253
+VDF++ L A E S +P+L++++R R R LN R + A L
Sbjct: 1 MVDFRTMLRGALGLERAVAEPSGDRWDIRRRPRLLIISR--RNSRAFLNERAMADMAMSL 58
Query: 254 GFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QWL 312
GFDV + EP+ ST ++ R ++S MVGVHGAGLT+ +FL G+VL+QVVP G +WL
Sbjct: 59 GFDVRLGEPDISTDVSKFARLVNSADVMVGVHGAGLTNMVFLPAGAVLIQVVPYGGLEWL 118
Query: 313 STVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMR-VYLKT 371
+ F++PA + + YLEY I+ +E++L E+Y +D VLK+P + W+ ++ VYL
Sbjct: 119 ARGTFKEPAADMQIHYLEYMIQLDETTLSEQYPKDDPVLKDPYSIHKQGWNALKMVYLDK 178
Query: 372 QNVKLDIDRFRIYLKDAYKKAKKLM 396
QNV+ + R LK+ + +A KL+
Sbjct: 179 QNVRPHLGR----LKNTFMEALKLL 199
>gi|222617642|gb|EEE53774.1| hypothetical protein OsJ_00166 [Oryza sativa Japonica Group]
Length = 203
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 68/172 (39%), Positives = 104/172 (60%), Gaps = 5/172 (2%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
+P+L+++ R AR R +N E+ + AE GF+V + E E +A ++C AMVG
Sbjct: 26 RPRLLVIAR-ART-RRFVNADEIVRGAERAGFEVVVSEGEHE--VAPFAELANTCDAMVG 81
Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEK 343
VHGAGLT+ +FL G V++QVVP+G YF P+R +GL YLEY+I EES+L+++
Sbjct: 82 VHGAGLTNMVFLPTGGVVIQVVPLGGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLIDQ 141
Query: 344 YGANDLVLKNPQAFAGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKK 394
Y + + +P W++++ YL Q+V+LD+ RFR LK A +K
Sbjct: 142 YPRDHPIFTDPDGVKSKGWNSLKEAYLDKQDVRLDMKRFRPILKKAIAHLRK 193
>gi|297612065|ref|NP_001068134.2| Os11g0575500 [Oryza sativa Japonica Group]
gi|125577583|gb|EAZ18805.1| hypothetical protein OsJ_34334 [Oryza sativa Japonica Group]
gi|255680206|dbj|BAF28497.2| Os11g0575500 [Oryza sativa Japonica Group]
Length = 202
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 106/206 (51%), Gaps = 8/206 (3%)
Query: 101 FSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHP 160
+AGGY GN FH F D FVP ++T+ H + V+L + N WA Y E++ +
Sbjct: 1 MTAGGYTGNLFHAFSDGFVPAWLTVQ-HL-RRRVVLGVLLYNPWWAGTYGEIISGLLDYH 58
Query: 161 IININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTS--- 217
++++ + HCF +G HG + +NP L KT+VDF LA+ Y +T
Sbjct: 59 VVDLLHDKRKHCFPGAIIGTRFHGILSVNPARLRDNKTIVDFHDLLADVYETAGDTVVVD 118
Query: 218 -SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIH 276
+P+L +V+ R R I N V + A +GFDV I E + L S+ +
Sbjct: 119 VPQPAPRRPRLGIVS--CRGKRVIENQAAVARLARTVGFDVDILETADGLQLPASYASVS 176
Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLM 302
+C +VGVH A LT LFLRPG+ L+
Sbjct: 177 ACDVLVGVHSADLTKLLFLRPGAALV 202
>gi|21393001|gb|AAL47575.2| symbiosis-related protein [Daucus carota]
Length = 249
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 101/169 (59%), Gaps = 5/169 (2%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
KPK+V++ RN R ILN + K A+ + F V + P+++T +R ++S M+G
Sbjct: 69 KPKVVIIARND--SRAILNEASLVKMAQGIKFQVEVLIPQKTTEAIRIYRALNSSDVMIG 126
Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEK 343
VHGA +T F+RP S +Q++P+GT W + Y+ PAR LG +L YKI +ESSL +
Sbjct: 127 VHGAAMTRFAFMRPDSACIQIIPLGTDWAADTYYGLPARKLGSRWLIYKILPQESSLYNE 186
Query: 344 YGANDLVLKNPQAF--AGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
Y D VL +P + G ++ ++YL QNVKL++ RF L+ AY+
Sbjct: 187 YEKVDPVLTDPDSVNNRGCEFTK-KIYLDRQNVKLNLRRFLKRLQRAYR 234
>gi|222631478|gb|EEE63610.1| hypothetical protein OsJ_18427 [Oryza sativa Japonica Group]
Length = 265
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 58/122 (47%), Positives = 81/122 (66%), Gaps = 8/122 (6%)
Query: 275 IHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIK 334
+ S AMV VHGA +TH LF+RPGSVL+Q+VP+G W + ++ KPA+ LGL YLEYK+
Sbjct: 128 LSSADAMVAVHGAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLGYLEYKVA 187
Query: 335 QEESSLVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRF-------RIYLK 386
EESSL +YG N V+++P + W M +VY+ QNV ++I RF R++LK
Sbjct: 188 PEESSLAAEYGVNSTVVRDPSVISSRGWWEMKKVYMDRQNVTVNIKRFGELLRSARLHLK 247
Query: 387 DA 388
+A
Sbjct: 248 NA 249
>gi|413949496|gb|AFW82145.1| hypothetical protein ZEAMMB73_426058 [Zea mays]
gi|413949497|gb|AFW82146.1| hypothetical protein ZEAMMB73_426058 [Zea mays]
Length = 147
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 57/110 (51%), Positives = 76/110 (69%), Gaps = 1/110 (0%)
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
AMV VHGA +TH LF+RPGSVL+Q+VP+G W + ++ KPA+ LGLEYLEYK+ EESS
Sbjct: 21 AMVAVHGAAVTHFLFMRPGSVLLQIVPVGLDWAADAFYGKPAQQLGLEYLEYKVAPEESS 80
Query: 340 LVEKYGANDLVLKNPQAFAGANWSNM-RVYLKTQNVKLDIDRFRIYLKDA 388
L +YG + VL+NP + W M +VY+ QNV ++I RF L+ A
Sbjct: 81 LAAEYGLDSTVLRNPWVISSRGWWEMKKVYMDRQNVTVNIKRFGELLRTA 130
>gi|449463274|ref|XP_004149359.1| PREDICTED: glycosyltransferase-like domain-containing protein
2-like [Cucumis sativus]
Length = 155
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 60/150 (40%), Positives = 100/150 (66%), Gaps = 5/150 (3%)
Query: 251 EELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
+E+GF+V P+ ++L ++ C ++G HGAGLT+ +FL G+V++QVVP G
Sbjct: 6 KEVGFEVMTTTPQRMSNLDKFSSVVNLCSVIIGAHGAGLTNEVFLANGAVVVQVVPFGLD 65
Query: 311 WLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQA-FAGANWSNMRVYL 369
W ST +F KPA + L+YLEYKI+ +ESSL +KYG N V+++P++ FA +++ +Y+
Sbjct: 66 WPSTYFFGKPAAEMELQYLEYKIEAKESSLWDKYGENHPVIRDPESIFAQGYFASRAIYI 125
Query: 370 KTQNVKLDIDRFRIYLKDAYKKAKKLMDKE 399
QN+K+++ RFR D + KKL++++
Sbjct: 126 DEQNLKINLTRFR----DTMIQVKKLIEEK 151
>gi|414876580|tpg|DAA53711.1| TPA: hypothetical protein ZEAMMB73_101479 [Zea mays]
Length = 239
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/162 (38%), Positives = 94/162 (58%), Gaps = 3/162 (1%)
Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
R +N E+ + AE+LGF+V + E + +A +SC A++GVHGAGLT+ +F+
Sbjct: 71 RRFVNAEEIVRGAEKLGFEVVV--SEGTHEVAPFAELANSCDAIMGVHGAGLTNMVFVPT 128
Query: 298 GSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
G V++QVVP+G YF P+R +GL YLEY+I EES+L+ +Y + + +P
Sbjct: 129 GGVVIQVVPLGGLEFVAGYFRGPSRDMGLRYLEYRITPEESTLINQYPRDHPIFTDPNGI 188
Query: 358 AGANWSNMR-VYLKTQNVKLDIDRFRIYLKDAYKKAKKLMDK 398
W +++ YL Q+V LD+ RFR LK A +K K
Sbjct: 189 KSKGWESLKDAYLDKQDVSLDMKRFRPTLKKAIAHIRKARAK 230
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 38/68 (55%)
Query: 72 MSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD 131
M V+ LT+ + +C H PALVFS GY GN+FH F D +PLF+T + +
Sbjct: 1 MRVVRVLTVRSVPGEAPACTDRHGVPALVFSDRGYTGNYFHAFTDVILPLFLTARQYAGE 60
Query: 132 QDVILAIA 139
+++A A
Sbjct: 61 VRLLIARA 68
>gi|218196736|gb|EEC79163.1| hypothetical protein OsI_19838 [Oryza sativa Indica Group]
Length = 348
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 65/211 (30%), Positives = 101/211 (47%), Gaps = 26/211 (12%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
++ CDRSH D+C + G DP TS+ + + K +PY RK + S M
Sbjct: 142 ALCCDRSHYRSDVCYLRGDVRTDPSTSSVLLYNAPRGSAPE-----KVRPYTRKFEGSIM 196
Query: 73 SKVKELTI------------TTSAPPN-----LSCGVTHTS--PALVFSAGGYNGNFFHE 113
S + E+TI TT+ C V H PA+VFS GGY GN +HE
Sbjct: 197 STIDEVTIVPVVDAGSGSNGTTAGDTGKDSLRRRCDVRHPPGVPAVVFSTGGYTGNVYHE 256
Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
F D +PLFIT +V+ + + + W +Y +L R T + +++ HCF
Sbjct: 257 FSDGLIPLFITAQRFA--GEVVFVVLEYHYWWLGRYGAVLERLTNYKVVDFRYDRRVHCF 314
Query: 174 QSVTLGLISHGRMVINPTLLPKPKTLVDFQS 204
+ +GL HG +V++P L+P ++ D ++
Sbjct: 315 SEMIVGLRIHGELVVDPKLMPNGESCGDHEA 345
>gi|222631479|gb|EEE63611.1| hypothetical protein OsJ_18428 [Oryza sativa Japonica Group]
Length = 368
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 96/201 (47%), Gaps = 26/201 (12%)
Query: 13 SITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAM 72
++ CDRSH D+C + G DP TS+ + + K +PY RK + S M
Sbjct: 141 ALCCDRSHYRSDVCYLRGDVRTDPSTSSVLLYNAPRGSAPE-----KVRPYTRKFEGSIM 195
Query: 73 SKVKELTI------------TTSAPPN-----LSCGVTHTS--PALVFSAGGYNGNFFHE 113
S + E+TI TT+ C V H PA+VFS GGY GN +HE
Sbjct: 196 STIDEVTIVPVVDAGSGSNGTTAGDTGKDSLRRRCDVRHPPGVPAVVFSTGGYTGNVYHE 255
Query: 114 FMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCF 173
F D +PLFIT +V+ + + + W +Y +L R T + +++ HCF
Sbjct: 256 FSDGLIPLFITAQRF--AGEVVFVVLEYHYWWLGRYGAVLERLTNYKVVDFRYDRRVHCF 313
Query: 174 QSVTLGLISHGRMVINPTLLP 194
+ +GL HG +V++P L+P
Sbjct: 314 SEMIVGLRIHGELVVDPKLMP 334
>gi|222616867|gb|EEE52999.1| hypothetical protein OsJ_35683 [Oryza sativa Japonica Group]
Length = 424
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 84/141 (59%), Gaps = 6/141 (4%)
Query: 251 EELGFDVTIFEPEESTSLADSF----RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+ +GF+ E + + + D R ++S A+VGVHGAGLT+ +FL PG+ +Q+VP
Sbjct: 275 QRVGFEAVASELDMAGADHDDVARVARLVNSFDAVVGVHGAGLTNMVFLPPGAAAVQIVP 334
Query: 307 IGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNM 365
G +WL+ F +PA +GL Y++Y++ ES+L +KY + + NP A ++ M
Sbjct: 335 WGGLRWLARADFGEPAVAMGLRYIQYEVAAGESTLKDKYPRDHEIFTNPTALHKKGFTFM 394
Query: 366 R-VYLKTQNVKLDIDRFRIYL 385
R +L Q++ +DIDRF+ L
Sbjct: 395 RHTFLNGQDIIVDIDRFKPVL 415
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 21/65 (32%), Positives = 32/65 (49%), Gaps = 2/65 (3%)
Query: 16 CDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKV 75
CD S D+C + G D S F V P+ +++ PYPRK D ++M +V
Sbjct: 187 CDLSDHRTDVCDLAGDIRMDANASAFVVVVDPAVGADGPTYKVR--PYPRKGDATSMGRV 244
Query: 76 KELTI 80
E+T+
Sbjct: 245 TEITV 249
>gi|125568812|gb|EAZ10327.1| hypothetical protein OsJ_00162 [Oryza sativa Japonica Group]
Length = 168
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 52/139 (37%), Positives = 79/139 (56%), Gaps = 9/139 (6%)
Query: 199 LVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAEELGF 255
+VDF FL +AY + T KP++++++R R R +LNLR+V A ELGF
Sbjct: 1 MVDFTRFLRHAYGLRRDKPMVLGETSGKKPRMLIISR--RRTRKLLNLRQVAAMARELGF 58
Query: 256 DVTIFEPEESTSLADSFRF---IHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG-TQW 311
+V + E RF ++SC +VGVHGAGLT+ FL G V++Q+VP G +W
Sbjct: 59 EVVVSEAGVGGGSGGVKRFASAVNSCDVLVGVHGAGLTNQAFLPRGGVVVQIVPWGRMEW 118
Query: 312 LSTVYFEKPARVLGLEYLE 330
++T ++ PA G + L+
Sbjct: 119 MATNFYGAPAAAHGAQGLK 137
>gi|194703492|gb|ACF85830.1| unknown [Zea mays]
Length = 175
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 53/151 (35%), Positives = 84/151 (55%), Gaps = 11/151 (7%)
Query: 199 LVDFQSFLANAY---NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGF 255
+ DF FL A + +P+L++V+ R +LNL EV AA+ LGF
Sbjct: 1 MADFARFLRGALALPRDRPAVLGGAPGMRPRLLVVS-----PRRLLNLEEVAAAADALGF 55
Query: 256 DVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG--TQWLS 313
DV E + +A+ +++ +VGV GAGLT+ +FL +VL+Q+VP G +W++
Sbjct: 56 DVASAEAG-ADDVAEFAARVNAADVLVGVRGAGLTNQVFLPTEAVLVQIVPWGKKIEWVT 114
Query: 314 TVYFEKPARVLGLEYLEYKIKQEESSLVEKY 344
T + + A +GL YLEY + +EE+ L +KY
Sbjct: 115 TSSYGRAAAGMGLRYLEYYVGEEETRLRDKY 145
>gi|218198152|gb|EEC80579.1| hypothetical protein OsI_22912 [Oryza sativa Indica Group]
Length = 301
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 74/146 (50%), Gaps = 6/146 (4%)
Query: 161 IININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTS--- 217
++++ + HCF +G HG + +NP L KT+VDF LA+ Y +T
Sbjct: 158 VVDLLHDKRKHCFPGAIIGTRFHGILSVNPARLRDNKTIVDFHDLLADVYETAGDTVVVD 217
Query: 218 -SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIH 276
+P+L +V+ R R I N V + A +GFDV I E + L S+ +
Sbjct: 218 VPQPAPRRPRLGIVS--CRGKRVIENQAAVARLARTVGFDVDILETADGLPLPASYASVS 275
Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLM 302
+C +VGVH A LT LFLRPG+ L+
Sbjct: 276 ACDVLVGVHSADLTKLLFLRPGAALV 301
>gi|335929271|gb|AEH75985.1| putative glycosyltransferase [Wolffia australiana]
Length = 254
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 59/114 (51%), Gaps = 11/114 (9%)
Query: 14 ITCDRSHRSYDLCLINGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMS 73
I C+ S+ ++C I + + F+ G+ K +PYPRK + M
Sbjct: 133 ILCNNSNYRTNVCEIFTEVKIQGSSFSVFAAGNNS--------LWKIRPYPRKWEPGLME 184
Query: 74 KVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNG-NFFHEFMDCFVPLFITIN 126
++KE T+ A P C V H+ PA+VFS GG G NFFH+ D +PLF+T+N
Sbjct: 185 QIKEYTVKAEAGP--PCSVIHSVPAVVFSTGGLLGKNFFHDLSDVLIPLFLTVN 236
>gi|156408145|ref|XP_001641717.1| predicted protein [Nematostella vectensis]
gi|156228857|gb|EDO49654.1| predicted protein [Nematostella vectensis]
Length = 451
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 68/278 (24%), Positives = 124/278 (44%), Gaps = 20/278 (7%)
Query: 69 KSAMSKVKELTITTSAPPN-LSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--I 125
+S ++++ T P N +C V P +V N +H F D FV L+ + I
Sbjct: 131 QSWFAELQHFTSLPYHPMNDGACDVVIDKPTMVIKLDA-GVNMYHHFCD-FVNLYASQHI 188
Query: 126 NSHFPDQDVILAIADCND-QWARKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISH 183
N F DV + + D +D +A ++E+ F++HP++ +N+ Q C + L++
Sbjct: 189 NGSFS-TDVNILVWDTSDMHYADFFSEMWMVFSKHPLLRLNSFQGKRVCLKDAVFSLLA- 246
Query: 184 GRMVI----NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRT 239
RM+ N L+P F SF + + + ++ L+ R + R
Sbjct: 247 -RMIFGLYYNMPLVPNCHGSGLFHSFTHHTLERLGIIQDEYEEDEFRITLIERKTQY-RN 304
Query: 240 ILNLREVKKAAEEL-GFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
ILN E+ A + + F V + D + H+ M+G+HGAGLTH LFL
Sbjct: 305 ILNQDELIGAMKSVPNFKVKVVHYNREIPFLDQLKVTHNTDIMIGMHGAGLTHLLFLPDW 364
Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
+V+ ++ + AR+ G+ Y+ ++ +++
Sbjct: 365 AVIFEIYNTE----DPDCYGDLARLRGVTYMTWEKREK 398
>gi|198412913|ref|XP_002126731.1| PREDICTED: similar to AER61 glycosyltransferase [Ciona
intestinalis]
Length = 507
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 78/297 (26%), Positives = 131/297 (44%), Gaps = 57/297 (19%)
Query: 90 CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--INSHFPDQDVILAIAD-CNDQWA 146
C + P ++F + GN FH F D F LF++ +N ++DV + + D + +
Sbjct: 182 CDIVINHP-VIFMKMDFGGNMFHHFCD-FFNLFVSLHVNGSSFNKDVQIVMWDTASSNYY 239
Query: 147 RKYAELLPRFTRHPI---ININNQTITHCFQSVTLGLI--SHGRMVINPTLLPKPKTLVD 201
++ FT P+ ++ + + + CF+ L+ G + N + P+ V
Sbjct: 240 DPFSSSWKAFTSRPVTPLVDWDKKKV--CFREAYFSLLPRMRGGLYYNTYV---PQNCVG 294
Query: 202 ---FQSF------------LANAYNENTNTSSSFHHTKPKL--VLVNR----NARVGRTI 240
F+SF L + + N KPKL L+ R N RV R I
Sbjct: 295 SNLFRSFSKFFLQQMKVRQLGPVFVQGRNP-------KPKLRVTLLQRGTPDNDRVYRKI 347
Query: 241 LNLREVKKAAEEL-GFDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
N RE++K E ++T+ E + S D H+ M+G+HGAGLTH LFL P
Sbjct: 348 KNQRELEKVFGEFEDLELTVVEYDWRKMSFKDQLLMTHNSDIMIGMHGAGLTHFLFLPPW 407
Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYL--------EYKIKQEESSLVEKYGAN 347
+V ++ G + Y++ AR+ G++Y+ +++ K E KYG+N
Sbjct: 408 AVAFELYNCGDK---NCYYD-LARLGGIKYMTWSDGGNPKFEPKPSEKGKHHKYGSN 460
>gi|242060484|ref|XP_002451531.1| hypothetical protein SORBIDRAFT_04g003343 [Sorghum bicolor]
gi|241931362|gb|EES04507.1| hypothetical protein SORBIDRAFT_04g003343 [Sorghum bicolor]
Length = 152
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 60/101 (59%), Gaps = 6/101 (5%)
Query: 244 REVKKAAEELGFDVTIFEPEESTSLADSFR-FIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
RE+ +AA + G +V + ST +F ++S MVGVHGAGLT+ +FL G VL+
Sbjct: 56 REMARAATDAGLEVCGWRSPTSTRTWATFAALVNSADVMVGVHGAGLTNMVFLPRGGVLI 115
Query: 303 QVVPIGT-QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVE 342
QVVP G +WL+ V F+ PA + + Y EESSL++
Sbjct: 116 QVVPFGGLKWLTGVTFKDPAVDMEVTYGLL----EESSLID 152
>gi|414876596|tpg|DAA53727.1| TPA: hypothetical protein ZEAMMB73_648118 [Zea mays]
Length = 105
Score = 66.2 bits (160), Expect = 3e-08, Method: Composition-based stats.
Identities = 31/92 (33%), Positives = 55/92 (59%), Gaps = 2/92 (2%)
Query: 293 LFLRPGSVLMQVVPIG-TQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVL 351
+FL PG+ L+Q+VP G QW++ + PA +GL+Y++Y+I ES+L +K+ + +
Sbjct: 2 MFLPPGAALVQIVPWGGLQWMARADYGDPAEAMGLKYIQYEIGVAESTLKDKFPSGHKIF 61
Query: 352 KNPQAFAGANWSNMR-VYLKTQNVKLDIDRFR 382
NP A + +R + Q++ +D+ RFR
Sbjct: 62 TNPTALHKKGFMFIRQTLMDGQDITVDVGRFR 93
>gi|222617643|gb|EEE53775.1| hypothetical protein OsJ_00167 [Oryza sativa Japonica Group]
Length = 443
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 69/144 (47%), Gaps = 14/144 (9%)
Query: 16 CD-RSHRSYDLCLINGSALFDPKTSTFFSV---GHTDS-TPSQPSLRIKTQPYPRKSDKS 70
CD S+R D C ++G T V G D T S IK PYPRK+D +
Sbjct: 273 CDLWSNRRIDWCELDGDVRVAGANGTVSLVAPPGPADERTFRAESWHIK--PYPRKADPN 330
Query: 71 AMSKVKELTITT-----SAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITI 125
AM V+ LT+ + ++ +C H P LVFS GY GN+FH + D +PLF+T
Sbjct: 331 AMRHVRVLTVQSLPAPAASAAAPACTERHDVPGLVFSDRGYTGNYFHAYTDVILPLFLTA 390
Query: 126 NSHFPDQDVILAIADCNDQWARKY 149
+ +V L ++D W K+
Sbjct: 391 RQY--SAEVKLLVSDFQMWWLGKF 412
>gi|242070281|ref|XP_002450417.1| hypothetical protein SORBIDRAFT_05g005021 [Sorghum bicolor]
gi|241936260|gb|EES09405.1| hypothetical protein SORBIDRAFT_05g005021 [Sorghum bicolor]
Length = 279
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 50/91 (54%), Gaps = 1/91 (1%)
Query: 255 FDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT-QWLS 313
+V + EP L R ++SC +VG+HG L + +FL G+V++QV P+G ++
Sbjct: 122 IEVVVGEPARHADLPSFARVVNSCDVLVGMHGVRLANLVFLPAGAVVVQVAPLGGLDAMA 181
Query: 314 TVYFEKPARVLGLEYLEYKIKQEESSLVEKY 344
F PAR + Y+ Y I EES+L +Y
Sbjct: 182 AEDFGAPARDAWIRYVHYGIAVEESTLARRY 212
>gi|357521537|ref|XP_003631057.1| Glycosyltransferase [Medicago truncatula]
gi|355525079|gb|AET05533.1| Glycosyltransferase [Medicago truncatula]
Length = 145
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 11/105 (10%)
Query: 20 HRSYDLCL-INGSALFDPKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKEL 78
H S LC ++G K+S + V H + ++R PY R+ D AM V++
Sbjct: 46 HYSTTLCFRVHG------KSSNVYIVSHKTTENMSWTIR----PYARREDAYAMRHVRKW 95
Query: 79 TITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
++ S C H+ PA++FS GY GN FHEF D +PLF+
Sbjct: 96 SVKASHHQVPQCTENHSIPAVIFSTAGYTGNHFHEFSDIIIPLFL 140
>gi|149412730|ref|XP_001510490.1| PREDICTED: uncharacterized glycosyltransferase AER61-like isoform 1
[Ornithorhynchus anatinus]
Length = 527
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 111/236 (47%), Gaps = 15/236 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + ++E FT + II++
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVNIVMWDTSSYGYGDLFSETWKAFTDYDIIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ V L+ R + N L+ + F++F + + T +
Sbjct: 303 TYDSKRVCFKEVVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQAGPKE 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ KA + + FDV + + + + D R H+
Sbjct: 363 GKIRITILARSTEY-RKILNQDELVKALKTVSAFDVQVVDYKYKKLGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
VG+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++ K +
Sbjct: 422 FVGMHGAGLTHLLFLPDWATVFELYNCEDE---RCYLDL-ARLRGIHYVTWQKKNK 473
>gi|194466153|gb|ACF74307.1| glycosyltransferase [Arachis hypogaea]
Length = 217
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 53/120 (44%), Gaps = 16/120 (13%)
Query: 11 AKSITCDRSHRSYDLCLINGSALFDPKTSTFF------------SVGHTDSTPSQPSLRI 58
+ +I CDR+ D+CL+ G +S+ F +VG Q
Sbjct: 97 SGTICCDRTGYRSDICLMKGDIRTHSPSSSIFLYNSGIINNVSRNVGAEKGKGDQILQHE 156
Query: 59 KTQPYPRKSDKSAMSKVKELTITTS----APPNLSCGVTHTSPALVFSAGGYNGNFFHEF 114
+ +PY RK +KS M + EL + + C V H PA+ FS GGY GN +HEF
Sbjct: 157 RIRPYTRKWEKSVMETIDELNLISKRVNFGNSGHGCDVKHEVPAVFFSNGGYTGNVYHEF 216
>gi|123496899|ref|XP_001327065.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121909989|gb|EAY14842.1| hypothetical protein TVAG_411080 [Trichomonas vaginalis G3]
Length = 445
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 68/293 (23%), Positives = 129/293 (44%), Gaps = 41/293 (13%)
Query: 110 FFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTI 169
+H +D +P++ +++ D + D + ++ YA L + H NI
Sbjct: 159 LWHNLVDFVIPIYRAMHTTNISSDCTIHAFDNDGKYGLFYANALCQNIIHENSNI----- 213
Query: 170 THCFQSVTLGLISHG------RMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT 223
+C + + +G+ G ++ +N + P+ + L+ + + N S H
Sbjct: 214 -YCHKRMIIGVPKTGGSQSERKLFLNYDI-PRNE-LIGLRELMLKEANSTGCMPSREH-- 268
Query: 224 KPKLVLVNRNARV-GRTILNLREVKKAAEELGFDVTIFEPEESTSLAD-----SFRFIHS 277
PK++L+ R + R ++N EV KA E V F +T L D F S
Sbjct: 269 -PKVLLIKRRTKEEKRRLINSDEVSKAIHE----VCPFCEVLNTDLQDFNKMQQVSFTCS 323
Query: 278 CHAMVGVHGAGLTHSLFLRPGS-----VLMQVVPIGTQWLSTV--YFEKPARVLGLEYLE 330
++G+HG+GLTH ++ P S +++++P +L T ++ K A + G+EY
Sbjct: 324 VSLLIGLHGSGLTHLMWQYPSSKEQKTAVIEILP----YLYTCRDWYSKLASMAGVEYFS 379
Query: 331 YKIKQEESSLVEKYG-ANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFR 382
K ++ S EK + + G W + +L+ Q+V +DID+F+
Sbjct: 380 LKTLRKNQSRWEKVSDERERSCHSGSEMCGKGWCHD--FLRDQSVIVDIDQFK 430
>gi|407196281|gb|AFT64228.1| secreted O-linked N-acetylglucosamine transferase [Sus scrofa]
Length = 527
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 58/232 (25%), Positives = 109/232 (46%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D FV L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 245 NMYHHFCD-FVNLYITQHVNNSFS-TDVYVVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F +A + T
Sbjct: 303 TYDSKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHALHRLNITQQGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + + + D R H+
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVRIVDYKYKELAFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ G + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCGDE---RCYLDL-ARLRGVHYITWR 469
>gi|170047460|ref|XP_001851238.1| glycosyltransferase [Culex quinquefasciatus]
gi|167869905|gb|EDS33288.1| glycosyltransferase [Culex quinquefasciatus]
Length = 551
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 58/255 (22%), Positives = 114/255 (44%), Gaps = 23/255 (9%)
Query: 90 CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD-----QDVILAIADCNDQ 144
C VT PA + N +H F D F+ L+++ +++ D DV + + + +
Sbjct: 227 CDVTIERPAFIMKIDA-TINMYHHFCD-FINLYMSQHANLSDPDGFSTDVEMLVWE-SYT 283
Query: 145 WARKYAELLPRFTRHPIININ---NQTITHCFQSVTLGLISHGRMVI----NPTLLPKPK 197
++ +AE FTRHPI ++ Q + CF+++ L L+ RM+ N ++ +
Sbjct: 284 YSSPFAEAFKVFTRHPIADLKTYAGQVV--CFKNLVLPLLP--RMIFGLYYNTPIISGCE 339
Query: 198 TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDV 257
FQ+F + + S K ++ ++R+ + R + +++ +E+ + V
Sbjct: 340 NSGLFQAFSEHVLHRLRIPLKSHTDRKLRITFLSRDTKFRRVLNEHDLLEEISEDERYLV 399
Query: 258 TIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYF 317
+ R + +G+HGAGLTH LFL + L ++ T +
Sbjct: 400 NRVSYSYKMDFREQLRITRNTDIFIGMHGAGLTHLLFLPKWAALFELYHCE----DTNCY 455
Query: 318 EKPARVLGLEYLEYK 332
+ AR+ G+ YL ++
Sbjct: 456 KDLARLKGVRYLTWE 470
>gi|170055913|ref|XP_001863794.1| glycosyltransferase [Culex quinquefasciatus]
gi|167875762|gb|EDS39145.1| glycosyltransferase [Culex quinquefasciatus]
Length = 495
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 57/254 (22%), Positives = 111/254 (43%), Gaps = 21/254 (8%)
Query: 90 CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADC----NDQW 145
C VT PA + N +H F D F+ L+++ +++ D D + + +
Sbjct: 178 CDVTIERPAFIMKIDA-TINMYHHFCD-FINLYMSQHANLSDPDGFSTDVEVLVWESYTY 235
Query: 146 ARKYAELLPRFTRHPIININ---NQTITHCFQSVTLGLISHGRMVI----NPTLLPKPKT 198
+ +AE FTRHPI ++ Q + CF+++ L L+ RM+ N ++ +
Sbjct: 236 SSPFAEAFKVFTRHPIADLKTYAGQVV--CFKNLVLPLLP--RMIFGLYYNTPIISGCEN 291
Query: 199 LVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVT 258
FQ+F + + S K ++ ++R+ + R + +++ +E+ + V
Sbjct: 292 SGLFQAFSEHVLHRLRIPLKSHTDRKLRITFLSRDTKFRRVLNEHDLLEEISEDERYLVN 351
Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
+ R + +G+HGAGLTH LFL + L ++ T ++
Sbjct: 352 RVSYSYKMDFREQLRITRNTDIFIGMHGAGLTHLLFLPKWAALFELYHCE----DTNCYK 407
Query: 319 KPARVLGLEYLEYK 332
AR+ G+ YL ++
Sbjct: 408 DLARLKGVRYLTWE 421
>gi|395516462|ref|XP_003762407.1| PREDICTED: uncharacterized glycosyltransferase AER61 [Sarcophilus
harrisii]
Length = 539
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 60/253 (23%), Positives = 115/253 (45%), Gaps = 16/253 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ- 165
N +H F D F+ L+IT +N+ F I+ + + ++E FT + +I++
Sbjct: 244 NMYHHFCD-FINLYITQHVNNSFSTDVNIIMWDTSSYGYGDLFSETWKAFTDYDVIHLKV 302
Query: 166 NQTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT 223
+ T CF+ L+ R + N L+ + F++F + T
Sbjct: 303 YDSKTVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLYRLNITRDGPQDG 362
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAM 281
K ++ ++ R+ R ILN E+ A + + F+V I + + + + R H+
Sbjct: 363 KIRVTILARSTEY-RKILNQNELVNALKTVSSFEVRIVDYKYKEIGFLEQLRITHNTDIF 421
Query: 282 VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE---ES 338
+G+HGAGLTH LFL +V+ ++ + Y + AR+ G+ Y+ +K K + +
Sbjct: 422 IGMHGAGLTHLLFLPDWAVVFELYNCEDE---HCYLDL-ARLRGIHYVTWKKKNKVFPQD 477
Query: 339 SLVEKYGANDLVL 351
+E++ D V+
Sbjct: 478 KRMEEFNFRDRVV 490
>gi|423062505|ref|ZP_17051295.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
gi|406716413|gb|EKD11564.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
Length = 805
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 66/288 (22%), Positives = 117/288 (40%), Gaps = 44/288 (15%)
Query: 36 PKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK--------VKELTITTS---- 83
P T + S+G T QPS +K P R + + VK+L+ S
Sbjct: 484 PTTVSKSSLGFTSHRTQQPSGAVKIFPQARGLNTGFLHTIITANNQVVKDLSSDNSGIAL 543
Query: 84 APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCND 143
A NL + H + F + N+FH ++ L + + S P +++ +
Sbjct: 544 AAHNLP-EIIHLEGNIAFVSASCGQNYFHWMVEVIPRLHLVLASGVPVDKIVV------N 596
Query: 144 QWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQ 203
++ KY + P Q + CF+ V + +++ LP PK DF
Sbjct: 597 KFGHKYEDETLAMFDIP----EYQKMFGCFRHVQAEV-----LIVQSRTLPTPKWACDFV 647
Query: 204 SFLANAY-----NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVT 258
L + + N SS K+ + NA + R ++N +E+ + LGF+V
Sbjct: 648 KDLVLKHPLLEEDNQCNYSS-------KIYISRANAYI-RKVINEQELIDILKPLGFEVV 699
Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
E + S+ +H ++ HGAGLT+ +F PG+ ++++ P
Sbjct: 700 YLE---NMSVKQQALCLHHAEVVISPHGAGLTNLVFCEPGTKVIELFP 744
>gi|118150874|ref|NP_001071350.1| EGF domain-specific O-linked N-acetylglucosamine transferase
precursor [Bos taurus]
gi|158512478|sp|A0JND3.1|EOGT_BOVIN RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
transferase; AltName: Full=Extracellular O-linked
N-acetylglucosamine transferase; Flags: Precursor
gi|117306286|gb|AAI26626.1| Chromosome 3 open reading frame 64 ortholog [Bos taurus]
gi|296474967|tpg|DAA17082.1| TPA: AER61 glycosyltransferase precursor [Bos taurus]
Length = 527
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 107/232 (46%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYVVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TYDAKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKG 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + + D R H+
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ G + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCGDE---RCYLDL-ARLRGVHYITWR 469
>gi|355667377|gb|AER93846.1| glycosyltransferase AER61 [Mustela putorius furo]
Length = 527
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 68/286 (23%), Positives = 125/286 (43%), Gaps = 18/286 (6%)
Query: 57 RIKTQPYPRKSD-KSAMSKVKELTITTSAP-PNLSCGVTHTSPALVFSAGGYNGNFFHEF 114
R+ T RKS +S ++++ T P + C V P N +H F
Sbjct: 192 RMLTSQGQRKSPLQSWFAELQSYTQLNFRPLEDAKCDVVIEKPTYFMKLDA-GVNMYHHF 250
Query: 115 MDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININNQTITH 171
D F+ L+IT IN+ F DV + + D + + +++ FT + +I++
Sbjct: 251 CD-FINLYITQHINNSFS-TDVYIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLKTYDSKR 308
Query: 172 -CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLV 228
CF+ L+ R + N L+ + F++F + + T K ++
Sbjct: 309 VCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNVTQEGPKDGKIRVT 368
Query: 229 LVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHG 286
++ R+ R ILN E+ KA + + F+V I + + + D R H+ +G+HG
Sbjct: 369 ILARSTEY-RKILNQNELVKALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDIFIGMHG 427
Query: 287 AGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
AGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 428 AGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIHYVTWR 469
>gi|440910004|gb|ELR59843.1| Putative glycosyltransferase AER61, partial [Bos grunniens mutus]
Length = 530
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 107/232 (46%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 248 NMYHHFCD-FINLYITQHVNNSF-STDVYVVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 305
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 306 TYDAKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKG 365
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + + D R H+
Sbjct: 366 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDI 424
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ G + Y + AR+ G+ Y+ ++
Sbjct: 425 FIGMHGAGLTHLLFLPDWAAVFELYNCGDE---RCYLDL-ARLRGVHYITWR 472
>gi|86606503|ref|YP_475266.1| hypothetical protein CYA_1852 [Synechococcus sp. JA-3-3Ab]
gi|86555045|gb|ABD00003.1| TPR repeat protein [Synechococcus sp. JA-3-3Ab]
Length = 641
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 54/104 (51%), Gaps = 6/104 (5%)
Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
P+ V ++R A R ++N EV +A GF E SLA+ + A++G+
Sbjct: 501 PRRVYISRRAARWRRVINEAEVLEALRPWGFVPVQLE---VLSLAEQIALMQKAEAVMGI 557
Query: 285 HGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
HGAGLT+ F +PG+V +++ P YF A+V+GL Y
Sbjct: 558 HGAGLTNLAFCQPGTVAIEIFPSNA---VLPYFWSLAQVVGLNY 598
>gi|404255179|ref|ZP_10959147.1| capsular polysaccharide biosynthesis protein-like protein
[Sphingomonas sp. PAMC 26621]
Length = 398
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 53/96 (55%), Gaps = 5/96 (5%)
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
T P L+ V+R A+ GRT+ N +V + E+ GF T+ +P S S + R ++
Sbjct: 266 TGPTLLFVDRAAQHGRTLSNHADVLRLVEQRGF--TVVDPA-SLSFVEQVRLFSGAQVVI 322
Query: 283 GVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
G GA +T+++F RP + ++ + P G W+ Y++
Sbjct: 323 GQMGAAMTNTMFCRPSTTVIYLAPSG--WIEPFYWD 356
>gi|426249299|ref|XP_004018387.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
transferase [Ovis aries]
Length = 527
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 107/232 (46%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYVVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TYDSKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKG 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + + D R H+
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ G + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCGDE---RCYLDL-ARLRGVHYITWR 469
>gi|156399367|ref|XP_001638473.1| predicted protein [Nematostella vectensis]
gi|156225594|gb|EDO46410.1| predicted protein [Nematostella vectensis]
Length = 620
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 78/327 (23%), Positives = 130/327 (39%), Gaps = 37/327 (11%)
Query: 109 NFFHEFMDCFVPLFITI----NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIINI 164
N H D +PLF T+ NS D+ + Y+EL F+R ++
Sbjct: 195 NIMHAIHDDLLPLFHTMKQYSNSGSSQIDLNSRLVFMEGYELGPYSELYQLFSRLQLVIK 254
Query: 165 NNQTITH---CFQSVTLGL------ISHGRMVIN---PTLLPKPKTLVDFQSFLANAYNE 212
+N T+ + CFQ+ +GL +G I P + K + +F F+
Sbjct: 255 DNLTVNNTLKCFQNAVVGLSKFTTWYQYGFDQIQGPLPEIQITAKQIYEFTRFVRERLGI 314
Query: 213 NTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSF 272
N + S H P++VL R+ R ILN +E+ A E+ SL
Sbjct: 315 NESVS---HTQSPQVVLCTRHHN--RLILNDQEISNAIITKMNKRVAKVSFETHSLERMI 369
Query: 273 RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLE 330
R I ++G+HG+ L ++FL GS LM++ P G + ++ A + G L Y +
Sbjct: 370 RIIGRSSGLIGMHGSILVMAMFLPQGSFLMELFPYGVVPWNYRPYKTLAGLPGMNLVYQD 429
Query: 331 YKIKQEESSL--------------VEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKL 376
+ EE+++ + K D++ +Y Q+ K+
Sbjct: 430 WINTNEENTVTHPDRLPAFGGIAHLSKDEQEDIINTKEVPLHYCCNDPYWLYRIYQDTKI 489
Query: 377 DIDRFRIYLKDAYKKAKKLMDKEIKLA 403
DID F L A + + KL + + A
Sbjct: 490 DIDSFSASLNTAIEGSHKLFTQGFQAA 516
>gi|123380767|ref|XP_001298478.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121879056|gb|EAX85548.1| hypothetical protein TVAG_527360 [Trichomonas vaginalis G3]
Length = 560
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 52/244 (21%), Positives = 111/244 (45%), Gaps = 20/244 (8%)
Query: 111 FHEFMDCFVPLFITINSHFP--DQDVILAIADCNDQWARKYAELLPRFTRHPIININNQT 168
+H D +P++ + + D+D ++ I + + K + + T +INI
Sbjct: 268 WHSLFDFTLPIWWSKQMFWGHNDEDRVMVIDENKN--TEKGYQFIDILTHKNVINIKLDE 325
Query: 169 ITH---CFQSVTLGLISHGRMVINPTLLPKPKTL------VDFQSFLANAYNE-NTNTSS 218
H CF S+ LG + + + P+ P L + F F ++ + N N S
Sbjct: 326 KYHNKTCFSSIILG-VPKTELEVTPSKWPNGYQLPYEFADIAFHQFREHSISTYNVNNSL 384
Query: 219 SFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTI-FEPEESTSLADSFRFIHS 277
TKP+++ +NR+ R I+N +++ +E DV I + + ++ + +
Sbjct: 385 CGKTTKPRVIFINRDTN-KRYIINSQDLINKMKEWAPDVDIDYVVYTNQTIGEQIAQFCN 443
Query: 278 CHAMVGVHGAGLTHSLFLRPG-SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
++ +HG+ L+H L++R S ++++ P ++E+ A+ +G++Y + K
Sbjct: 444 ASLIISIHGSALSHMLWMRRNRSAIIEIFPYNYD--CRDWYEQVAKGMGIKYFFWINKIP 501
Query: 337 ESSL 340
E+S
Sbjct: 502 ENSF 505
>gi|432093450|gb|ELK25518.1| hypothetical protein MDA_GLEAN10017367 [Myotis davidii]
Length = 523
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 107/232 (46%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L++T +N+ F DV + + D + + +++ FT + +I++
Sbjct: 247 NMYHHFCD-FLNLYLTQHVNNSF-STDVYVVMWDTSTYGYGDLFSDTWKAFTDYDVIHLK 304
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 305 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCHSTGLFRAFSQHVLHRLNVTQEGPKD 364
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + + D R H+
Sbjct: 365 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDI 423
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ G Q Y + AR+ G+ Y+ ++
Sbjct: 424 FIGMHGAGLTHLLFLPDWAAVFELYNCGDQ---RCYLDL-ARLRGVHYVTWR 471
>gi|74096367|ref|NP_001027841.1| glycosyltransferase aer61 precursor [Ciona intestinalis]
gi|56550354|emb|CAI30563.1| glycosyltransferase [Ciona intestinalis]
Length = 512
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 76/318 (23%), Positives = 132/318 (41%), Gaps = 42/318 (13%)
Query: 90 CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQWAR 147
C V P L F + GN +H F D F +++T N+ + DV + D + ++
Sbjct: 203 CDVIVDKPML-FVQLDFGGNMYHHFCD-FFNIYLTQMANNSWFGTDVQIVRWDLSYRYGE 260
Query: 148 KYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLA 207
+ E FT +++ + + S I P + S L
Sbjct: 261 VFRESWDAFTNRDHVSLREYMGKRV--CIADAMFSFLPRTILGLFYNTPVEVNCRGSSLF 318
Query: 208 NAYNEN----------TNTSSSFHHTKPKLVLVNRNA----RVGRTILNLREVKKAAEEL 253
A++E+ TS K ++ L+ R + ++ R ILN+ E+ A ++
Sbjct: 319 KAFSEHFLHRMGITSHPPTSLPNQPNKIRVTLLERGSNPRYKIYRRILNVDELGNAIRKI 378
Query: 254 -GFDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQW 311
G +V + E + S D H+ M+G+HGAGLTH LFL P +V ++ G
Sbjct: 379 PGLEVNVVEYDWRKMSFKDQLSMTHNSDIMIGMHGAGLTHFLFLPPWAVAFELYNCGD-- 436
Query: 312 LSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSN------- 364
Y + P R+ G+ Y+ + E+ S +E + N+ +P + N
Sbjct: 437 -IRCYRDLP-RLRGVRYITW----EDDSKLEAFDQNE----HPHYGNQPKFWNYKFDVPE 486
Query: 365 -MRVYLKTQNVKLDIDRF 381
+R+ LK ++ L+ DRF
Sbjct: 487 FVRLVLKARSWVLEHDRF 504
>gi|404448828|ref|ZP_11013820.1| capsular polysaccharide biosynthesis protein [Indibacter
alkaliphilus LW1]
gi|403765552|gb|EJZ26430.1| capsular polysaccharide biosynthesis protein [Indibacter
alkaliphilus LW1]
Length = 337
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 50/101 (49%), Gaps = 3/101 (2%)
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+ ++R R +LN EV+ + G+++ + E SL D I C + G+HGA
Sbjct: 206 IYISRKDAQYRKVLNEPEVESVFSDFGYEIQVME---KFSLKDQVNMIRQCSHLAGLHGA 262
Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
GL++ +F+ G +++ +G W + F A LG EY
Sbjct: 263 GLSNMIFMPEGGKVLEFRNMGDSWSLSQSFFAMASDLGHEY 303
>gi|348575414|ref|XP_003473484.1| PREDICTED: uncharacterized glycosyltransferase AER61-like isoform 1
[Cavia porcellus]
Length = 527
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 108/232 (46%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVHIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T +
Sbjct: 303 TYDSKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQAGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + + D R H+
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL +V+ ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAVVFELYNCEDE---HCYLDL-ARLRGIHYITWR 469
>gi|157117440|ref|XP_001658768.1| glycosyltransferase [Aedes aegypti]
gi|108876053|gb|EAT40278.1| AAEL007982-PA [Aedes aegypti]
Length = 533
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 56/254 (22%), Positives = 111/254 (43%), Gaps = 19/254 (7%)
Query: 89 SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPD-----QDVILAIADCND 143
+C VT PA + N +H F D F+ L+ +++++ D DV + + + +
Sbjct: 219 TCDVTIDRPAFIMKIDA-TINMYHHFCD-FINLYGSLHANLSDPYGFSTDVQIMVWE-SY 275
Query: 144 QWARKYAELLPRFTRHPIININNQT-ITHCFQSVTLGLISHGRMVI----NPTLLPKPKT 198
+ +AE FT+HPI ++ CF+++ L L+ RM+ N ++ +
Sbjct: 276 TYDSPFAETFKVFTKHPIADLKTYAGKVVCFKNLVLPLLP--RMIFGLYYNTPIITGCEN 333
Query: 199 LVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVT 258
FQ+F + + S K ++ ++R + R + +++ +E + V
Sbjct: 334 SGFFQAFSEHVLHRLKVPQRSRSDRKLRITFLSRQTKFRRVLNENALLEEISENEDYLVN 393
Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
T + + + +G+HGAGLTH LFL +VL ++ ++
Sbjct: 394 QASFTYKTDFREQLKITRNTDIFIGMHGAGLTHLLFLPKWAVLFELYHCE----DPNCYK 449
Query: 319 KPARVLGLEYLEYK 332
AR+ G+ YL ++
Sbjct: 450 DLARLKGVRYLSWE 463
>gi|354465590|ref|XP_003495262.1| PREDICTED: uncharacterized glycosyltransferase AER61-like
[Cricetulus griseus]
gi|344248916|gb|EGW05020.1| Uncharacterized glycosyltransferase AER61 [Cricetulus griseus]
gi|443496408|gb|AGC92969.1| EGF-O-GlcNAc transferase [Cricetulus griseus]
Length = 527
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 245 NMYHHFCD-FLNLYITQHVNNSFS-TDVYIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TYDFKKVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + D R H+
Sbjct: 363 GKVRVTILARSTEY-RKILNQNELVNALKTVSTFEVRIVDYKYRELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 469
>gi|158261735|dbj|BAF83045.1| unnamed protein product [Homo sapiens]
Length = 426
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 144 NMYHHFCD-FINLYITQHVNNSFS-TDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLK 201
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 202 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGPKD 261
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + D R H+
Sbjct: 262 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 320
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 321 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLD-LARLRGVHYITWR 368
>gi|345569145|gb|EGX52013.1| hypothetical protein AOL_s00043g403 [Arthrobotrys oligospora ATCC
24927]
Length = 637
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 76/153 (49%), Gaps = 16/153 (10%)
Query: 213 NTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSF 272
++N + K K++ +N ++ R ++ ++K ++ ++T E + T +
Sbjct: 474 DSNPKTVEEKIKIKIIGRQKNRKILRLFDYVKILQKTYPDVDIEITKLEKLDVT---EQL 530
Query: 273 RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVY--FEKPARVLGLEYLE 330
R I ++GV GAGLTH+LFLR G+ L+++ TQ VY F AR++GL Y +
Sbjct: 531 RMIRKTDILIGVTGAGLTHTLFLRKGAALIEL----TQPEPFVYFGFGNLARMIGLGYFQ 586
Query: 331 YKIKQEESSLVEKYGAN---DLVLKNPQAFAGA 360
+ ++ E K G N D VL + F A
Sbjct: 587 IQAEKREG----KSGINWQEDNVLVDEHTFIDA 615
>gi|222619896|gb|EEE56028.1| hypothetical protein OsJ_04812 [Oryza sativa Japonica Group]
Length = 153
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 41/71 (57%), Gaps = 2/71 (2%)
Query: 324 LGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFA-GANWSNM-RVYLKTQNVKLDIDRF 381
+GL Y +Y + ESSL KYG D+V+ +P+A W + RVYL QNV LD+ RF
Sbjct: 1 MGLHYEQYDAEGHESSLSRKYGLRDVVVSDPEAAKRDKGWGFVARVYLGGQNVTLDLSRF 60
Query: 382 RIYLKDAYKKA 392
R L + +A
Sbjct: 61 RHTLTRLHARA 71
>gi|296225632|ref|XP_002758583.1| PREDICTED: uncharacterized glycosyltransferase AER61 isoform 1
[Callithrix jacchus]
Length = 527
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 106/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ FT + II++
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTDYDIIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + + D R H+
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWQ 469
>gi|86606467|ref|YP_475230.1| hypothetical protein CYA_1816 [Synechococcus sp. JA-3-3Ab]
gi|86555009|gb|ABC99967.1| tetratricopeptide repeat protein [Synechococcus sp. JA-3-3Ab]
Length = 720
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 62/136 (45%), Gaps = 6/136 (4%)
Query: 214 TNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
S S +P+ + ++R + R ++N EV GF E + SL +
Sbjct: 568 AQPSFSIDPLQPRRIYISRRSARWRRVINEAEVLACLHPWGFVPVQME---TLSLQEQIA 624
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKI 333
+ A++G+HGAGLT+ F +PG+ +++++P YF A+V GL Y
Sbjct: 625 LMQGAEAVIGIHGAGLTNLAFCQPGTTVIEILPSNA---VLPYFWSIAQVAGLRYFPLVA 681
Query: 334 KQEESSLVEKYGANDL 349
+ +LV + DL
Sbjct: 682 PVCDPALVALLSSPDL 697
>gi|57114206|ref|NP_001009171.1| EGF domain-specific O-linked N-acetylglucosamine transferase
precursor [Pan troglodytes]
gi|75040721|sp|Q5NDL1.1|EOGT_PANTR RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
transferase; AltName: Full=Extracellular O-linked
N-acetylglucosamine transferase; Flags: Precursor
gi|56550368|emb|CAI30570.1| glycosyltransferase [Pan troglodytes]
Length = 527
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + D R H+
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 469
>gi|74708096|sp|Q5NDL2.1|EOGT_HUMAN RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
transferase; AltName: Full=Extracellular O-linked
N-acetylglucosamine transferase; Flags: Precursor
gi|56550366|emb|CAI30569.1| glycosyltransferase [Homo sapiens]
gi|443496410|gb|AGC92970.1| EGF-O-GlcNAc transferase [Homo sapiens]
Length = 527
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + D R H+
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 469
>gi|397480750|ref|XP_003811634.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
transferase isoform 1 [Pan paniscus]
Length = 527
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTNYDVIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + D R H+
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWQ 469
>gi|409992949|ref|ZP_11276112.1| hypothetical protein APPUASWS_17665 [Arthrospira platensis str.
Paraca]
gi|291567436|dbj|BAI89708.1| TPR domain protein [Arthrospira platensis NIES-39]
gi|409936195|gb|EKN77696.1| hypothetical protein APPUASWS_17665 [Arthrospira platensis str.
Paraca]
Length = 883
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 68/291 (23%), Positives = 119/291 (40%), Gaps = 44/291 (15%)
Query: 36 PKTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSK--------VKELTITTS---- 83
P T + +G T QP+ +K P R + + VK+L+ S
Sbjct: 562 PTTVSNNLLGFTSHRTQQPASSVKILPQARGLNTGFLHTIITANNQVVKDLSSDNSGIAL 621
Query: 84 APPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCND 143
A NL + H + F + N+FH ++ L + + S P +++ +
Sbjct: 622 AAHNLPETI-HLKGNIAFVSAYCGQNYFHWMVEVIPRLHLVLASGLPIDKIVV------N 674
Query: 144 QWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQ 203
++ KY + P NQ + CF+ V ++ MV + TL PK DF
Sbjct: 675 KFGHKYEDETLAMFDIP----ENQKMFGCFRHVEADIL----MVPSRTL-STPKWACDFL 725
Query: 204 SFLANAY-----NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVT 258
L + + N SS K+ + NA + R ++N +E+ + LGF V
Sbjct: 726 KDLVLKHPLLEEDNRCNYSS-------KIYISRANAYI-RQVINEQELMDILKPLGFQVV 777
Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
E + S+ + +H ++ HGAGLT+ +F PG+ ++++ P G
Sbjct: 778 YLE---NMSVKEQALCLHHAEVVISPHGAGLTNLVFCEPGTKVIELFPPGA 825
>gi|338714518|ref|XP_003363099.1| PREDICTED: uncharacterized glycosyltransferase AER61-like [Equus
caballus]
Length = 527
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 106/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVHIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + + D R H+
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKQLGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 469
>gi|351698200|gb|EHB01119.1| hypothetical protein GW7_02761 [Heterocephalus glaber]
Length = 527
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + T
Sbjct: 303 TYDSKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLYRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + + D R H+
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIHYITWR 469
>gi|410951550|ref|XP_003982458.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
transferase [Felis catus]
Length = 527
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 54/232 (23%), Positives = 106/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSFS-TDVYIVMWDTSSYGYGDLFSDTWRAFTDYDVIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TYDSKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
+ ++ ++ R+ R ILN E+ A + + +V I + + + D R H+
Sbjct: 363 GRIRVTILARSTEY-RKILNQNELVNALKTVSTLEVRIVDYKYKELGFLDQLRISHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ G + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCGDE---RCYLDL-ARLRGIHYITWR 469
>gi|322711774|gb|EFZ03347.1| DUF563 domain protein [Metarhizium anisopliae ARSEF 23]
Length = 477
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 54/101 (53%), Gaps = 10/101 (9%)
Query: 238 RTILNLREV----KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSL 293
R ++N E+ K+A L ++ F S A+ R + ++GVHGAGLTH++
Sbjct: 338 RKLINETELIESAKRAVPHLNIEIVDFA---GFSFAEQLRIVRETDLLIGVHGAGLTHAM 394
Query: 294 FLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIK 334
FL PGS +++++P + + F A++LG +Y +K
Sbjct: 395 FLPPGSAVVEILPRD---FAHMGFRNLAQLLGHQYHRTHVK 432
>gi|403297322|ref|XP_003939521.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
transferase [Saimiri boliviensis boliviensis]
Length = 527
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + D R H+
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWQ 469
>gi|449474080|ref|XP_002192616.2| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
transferase [Taeniopygia guttata]
Length = 527
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 107/232 (46%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D FV L+IT IN+ F DV + + D + + ++E FT + II++
Sbjct: 245 NMYHHFCD-FVNLYITQHINNSFS-TDVNIVMWDTSSYGYGDLFSETWKAFTDYEIIHLK 302
Query: 166 N-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
+ CF+ L+ R + N L+ F++F + + T
Sbjct: 303 TFDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCHGTGLFRAFSQHVLHRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + +V + + + + ++ R H+
Sbjct: 363 GKIRVTILARSTDY-RKILNQNELVNALKTVSTLEVRVVDYKYKELEFSEQLRITHNSDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL +V+ ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGIHGAGLTHLLFLPDWAVVFELYNCEDE---RCYLD-LARLRGIHYITWR 469
>gi|119585869|gb|EAW65465.1| hCG1777401 [Homo sapiens]
Length = 290
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 8 NMYHHFCD-FINLYITQHVNNSFS-TDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLK 65
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 66 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGPKD 125
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + D R H+
Sbjct: 126 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 184
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 185 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLD-LARLRGVHYITWR 232
>gi|86607926|ref|YP_476688.1| hypothetical protein CYB_0430 [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86556468|gb|ABD01425.1| tetratricopeptide repeat protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 718
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 61/136 (44%), Gaps = 6/136 (4%)
Query: 214 TNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
S S +P+ + ++R + R ++N EV GF E + SL +
Sbjct: 566 AQPSLSIDPLQPRRIYISRRSARWRRVINEAEVLACLRPWGFVPVQME---TLSLPEQIA 622
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKI 333
+ A++G+HGAGLT+ F PG+ +++++P YF A+V GL Y
Sbjct: 623 LMQGAEAVIGIHGAGLTNLAFCPPGTTVIEILPSNA---VLPYFWSIAQVAGLRYFPLVA 679
Query: 334 KQEESSLVEKYGANDL 349
+ +LV + DL
Sbjct: 680 PVCDPALVALLSSPDL 695
>gi|291393983|ref|XP_002713346.1| PREDICTED: AER61 glycosyltransferase isoform 1 [Oryctolagus
cuniculus]
Length = 524
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 104/232 (44%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L++T +N+ F DV + + D + + ++E FT + +I++
Sbjct: 242 NMYHHFCD-FLNLYLTQHVNNSFS-TDVYIVMWDTSTYGYGDLFSETWKAFTDYEVIHLK 299
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ F++F + +
Sbjct: 300 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCHGTGLFRAFSQHVLHRLNIAQEGPQD 359
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + FDV I + + + D R H+
Sbjct: 360 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFDVQIVDYKYKELGFLDQLRITHNTDI 418
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 419 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 466
>gi|426341148|ref|XP_004035914.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
transferase [Gorilla gorilla gorilla]
Length = 527
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 100/232 (43%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDV-TIFEPEESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN EV + F V I D R H+
Sbjct: 363 GKIRVTILARSTEY-RKILNQNEVMNCLGTDFWFKVHVILYVHRELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 469
>gi|407426125|gb|EKF39586.1| hypothetical protein MOQ_000183 [Trypanosoma cruzi marinkellei]
Length = 578
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 67/131 (51%), Gaps = 14/131 (10%)
Query: 209 AYNENTNTSSSFHHT---KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEES 265
AYN N T + T +P+L+L+NRN R I + + V AE +GF+V + E+
Sbjct: 382 AYN-NMETGNKMQETFVYRPRLLLINRNYR---EIHDYQAVAALAERIGFNVQVVYFEK- 436
Query: 266 TSLADSFRFIHSCHAMVGVHGAGLTHSLFL----RPGS-VLMQVVPIGTQWLSTVYFEKP 320
SL + M+G+HG GLTH L++ RP L++++P G +++
Sbjct: 437 MSLEEQVHVSRHADVMMGMHGMGLTHVLWMDGRRRPRCRALLELMPFGCPQKLIHFYKTF 496
Query: 321 ARVLGLEYLEY 331
+ +G+ Y EY
Sbjct: 497 SDAIGIHY-EY 506
>gi|209523074|ref|ZP_03271631.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
gi|209496661|gb|EDZ96959.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
Length = 891
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/213 (20%), Positives = 91/213 (42%), Gaps = 20/213 (9%)
Query: 94 HTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELL 153
H + F + Y N+FH ++ L + + S P +++ +++ KY +
Sbjct: 637 HLEGNIAFVSALYGHNYFHWMVEVIPRLHLVLASGVPVDKIVV------NKFGHKYEDET 690
Query: 154 PRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNEN 213
P +Q + CF+ V + +++ LP PK DF L +
Sbjct: 691 LAMFDIP----EHQKMFGCFRHVQAEV-----LIVPCRTLPTPKWACDFVKDLVLKHPSL 741
Query: 214 TNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
+ +++ + ++R R ++N +E+ + LGF+V E + S+
Sbjct: 742 LEEDNRCNYSSK--IYISRANSYYRKVINEQELMDILKPLGFEVVYLE---NMSVKQQAL 796
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+H ++ HGAGLT+ +F PG+ ++++ P
Sbjct: 797 CLHHAEVVISPHGAGLTNLVFCEPGTKVIELFP 829
>gi|109036798|ref|XP_001088832.1| PREDICTED: uncharacterized glycosyltransferase AER61-like [Macaca
mulatta]
gi|355559515|gb|EHH16243.1| hypothetical protein EGK_11501 [Macaca mulatta]
Length = 527
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L++T +N+ F DV + + D + + +++ FT + II++
Sbjct: 245 NMYHHFCD-FINLYLTQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTDYDIIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + D R H+
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVPTFEVQIVDYKYRELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIHYITWQ 469
>gi|58036800|emb|CAE45897.2| hypothetical protein [Homo sapiens]
Length = 459
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 104/232 (44%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N H F D F+ L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 177 NMCHHFCD-FINLYITQHVNNSFS-TDVYIVMWDTSSYGYGDLFSDTWNAFTDYDVIHLK 234
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 235 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGPKD 294
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + D R H+
Sbjct: 295 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 353
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 354 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLD-LARLRGVHYITWR 401
>gi|57163723|ref|NP_001009187.1| EGF domain-specific O-linked N-acetylglucosamine transferase
precursor [Canis lupus familiaris]
gi|75040722|sp|Q5NDL9.1|EOGT_CANFA RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
transferase; AltName: Full=Extracellular O-linked
N-acetylglucosamine transferase; Flags: Precursor
gi|56550352|emb|CAI30562.1| glycosyltransferase [Canis lupus familiaris]
Length = 527
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D FV L+IT +N+ F DV + + D + + +++ FT + +I++
Sbjct: 245 NMYHHFCD-FVNLYITQHVNNSFS-TDVYIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + +V I + + + D R H+
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTLEVQIVDYKYKELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 469
>gi|355746583|gb|EHH51197.1| hypothetical protein EGM_10532 [Macaca fascicularis]
Length = 527
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 105/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L++T +N+ F DV + + D + + +++ FT + II++
Sbjct: 245 NMYHHFCD-FINLYLTQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTDYDIIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + D R H+
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVPTFEVQIVDYKYRELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIHYITWQ 469
>gi|395824615|ref|XP_003785558.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
transferase [Otolemur garnettii]
Length = 523
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/235 (22%), Positives = 105/235 (44%), Gaps = 13/235 (5%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININN 166
N +H F D F+ L+IT +N+ F + I+ + + +++ FT + +I++
Sbjct: 241 NMYHHFCD-FINLYITQHLNNSFSTEVHIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLKT 299
Query: 167 QTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT 223
CF+ L+ R + N L+ + F++F + + T
Sbjct: 300 YDSKRVCFREAIFSLLPRMRYGLFYNTPLISGCQKTGLFRAFSQHVLHRLNITQEGPKDG 359
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAM 281
K ++ ++ R+ R ILN E+ A + + F+V I + + + D R H+
Sbjct: 360 KIRVTILARSTEY-RKILNQNELVNALKTVSMFEVRIVDYKYKELGFLDQLRITHNTDIF 418
Query: 282 VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y ++ K +
Sbjct: 419 IGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIHYTTWRRKNK 469
>gi|449278758|gb|EMC86527.1| Putative glycosyltransferase AER61 [Columba livia]
Length = 527
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 108/232 (46%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D FV L+IT IN+ F DV + + D + + ++E FT + I+++
Sbjct: 245 NMYHHFCD-FVNLYITQHINNSFS-TDVNIVMWDTSSYGYGDLFSETWKAFTDYEIMHLK 302
Query: 166 N-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
+ CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TFDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCHSTGLFRAFSQHVLHRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + +V + + + + ++ R H+
Sbjct: 363 GKIRVTILARSTDY-RKILNQNELVNALKTVSTLEVKVVDYKYKELEFSEQLRITHNSDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL +V+ ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAVVFELYNCEDE---RCYLD-LARLRGIHYITWR 469
>gi|326928104|ref|XP_003210223.1| PREDICTED: uncharacterized glycosyltransferase AER61-like
[Meleagris gallopavo]
Length = 530
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 107/232 (46%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D FV L+IT IN+ F DV + + D + + ++E FT + II++
Sbjct: 248 NMYHHFCD-FVNLYITQHINNSFS-TDVNIVMWDTSSYGYGDLFSETWKAFTDYDIIHLK 305
Query: 166 N-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
+ CF+ L+ R + N L+ F++F + + T
Sbjct: 306 TFDSKRVCFREAVFSLLPRMRYGLFYNTPLISGCHGTGLFRAFSQHVLHRLNITQEGPKD 365
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + +V + + + + ++ R H+
Sbjct: 366 GKIRVTILARSTDY-RKILNQNELVNALKTVSTLEVKVVDYKYKELEFSEQLRITHNSDI 424
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL +V+ ++ + Y + AR+ G+ Y+ ++
Sbjct: 425 FIGMHGAGLTHLLFLPDWAVVFELYNCEDE---RCYLD-LARLRGIHYITWR 472
>gi|218189757|gb|EEC72184.1| hypothetical protein OsI_05258 [Oryza sativa Indica Group]
Length = 81
Score = 53.1 bits (126), Expect = 2e-04, Method: Composition-based stats.
Identities = 30/74 (40%), Positives = 43/74 (58%), Gaps = 2/74 (2%)
Query: 324 LGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFA-GANWSNM-RVYLKTQNVKLDIDRF 381
+GL Y +Y + ESSL KYG D+V+ +P+A W + RVYL QNV LD+ RF
Sbjct: 1 MGLHYEQYDAEGHESSLSRKYGLRDVVVSDPEAAKRDKGWGFVARVYLGGQNVTLDLSRF 60
Query: 382 RIYLKDAYKKAKKL 395
R L + +A ++
Sbjct: 61 RHTLTRLHARALRV 74
>gi|453086759|gb|EMF14801.1| glycosyltransferase family 61 protein [Mycosphaerella populorum
SO2202]
Length = 417
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 46/78 (58%), Gaps = 4/78 (5%)
Query: 275 IHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLE-YKI 333
+ S ++GVHGAGLTH+LFL P S ++++ P G ++ F A+ LG YL+ Y
Sbjct: 325 LRSTDILLGVHGAGLTHTLFLPPKSTVVEIQPPGLRYFG---FAALAKFLGHRYLQVYGE 381
Query: 334 KQEESSLVEKYGANDLVL 351
++E + + A+D+ L
Sbjct: 382 EREYEGMTHNWQADDVFL 399
>gi|322692964|gb|EFY84844.1| DUF563 domain protein [Metarhizium acridum CQMa 102]
Length = 477
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 54/115 (46%), Gaps = 6/115 (5%)
Query: 243 LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
+ V+KA L ++ F S A+ + + ++GVHGAGLTH++FL PGS ++
Sbjct: 347 IESVQKAIPHLDLEIVDFA---GFSFAEQLKIVRETDLLIGVHGAGLTHTMFLPPGSAVV 403
Query: 303 QVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAF 357
+++P + + F A++LG Y + + E D V Q F
Sbjct: 404 EILPGD---FAHMGFRNLAQILGHRYYRTHAEMHGDASGESQWQFDAVEMEEQQF 455
>gi|71897159|ref|NP_001026580.1| EGF domain-specific O-linked N-acetylglucosamine transferase
[Gallus gallus]
gi|56550364|emb|CAI30568.1| glycosyltransferase [Gallus gallus]
Length = 530
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 106/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D FV L+IT IN+ F DV + + D + + ++E FT + II +
Sbjct: 248 NMYHHFCD-FVNLYITQHINNSFS-TDVNIVMWDTSSYGYGDLFSETWKAFTDYDIIYLK 305
Query: 166 N-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
+ CF+ L+ R + N L+ F++F + + T
Sbjct: 306 TFDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCHGTGLFRAFSQHVLHRLNITQEGPKD 365
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + +V + + + + ++ R H+
Sbjct: 366 GKIRVTILARSTDY-RKILNQNELVNALKTVSTLEVKVVDYKYKELEFSEQLRITHNSDI 424
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL +V+ ++ + Y + AR+ G+ Y+ ++
Sbjct: 425 FIGMHGAGLTHLLFLPDWAVVFELYNCEDE---RCYLD-LARLRGIHYITWR 472
>gi|148666934|gb|EDK99350.1| RIKEN cDNA A130022J15, isoform CRA_b [Mus musculus]
Length = 533
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 53/232 (22%), Positives = 106/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L++T +N+ F DV + + D + + +++ FT + +I++
Sbjct: 251 NMYHHFCD-FLNLYLTQHVNNSFS-TDVYIVMWDTSTYGYGDLFSDTWKAFTDYDVIHLK 308
Query: 166 N-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
+ CF+ L+ R + N L+ + F++F + + T
Sbjct: 309 TYDSKKVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 368
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V + + + D R H+
Sbjct: 369 GKVRVTILARSTEY-RKILNQDELVNALKTVSTFEVRVVDYKYRELGFLDQLRITHNTDI 427
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 428 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIHYITWR 475
>gi|194376580|dbj|BAG57436.1| unnamed protein product [Homo sapiens]
Length = 290
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 104/232 (44%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + + + FT + +I++
Sbjct: 8 NMYHHFCD-FINLYITQHVNNSFS-TDVYIVMWDTSSYGYGDLFFDTWNAFTDYDVIHLK 65
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 66 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHRLNITQEGPKD 125
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + D R H+
Sbjct: 126 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 184
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 185 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLD-LARLRGVHYITWR 232
>gi|158563925|sp|Q5NDL3.2|EOGT_CHICK RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
transferase; AltName: Full=Extracellular O-linked
N-acetylglucosamine transferase; Flags: Precursor
Length = 535
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 106/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D FV L+IT IN+ F DV + + D + + ++E FT + II +
Sbjct: 253 NMYHHFCD-FVNLYITQHINNSFS-TDVNIVMWDTSSYGYGDLFSETWKAFTDYDIIYLK 310
Query: 166 N-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
+ CF+ L+ R + N L+ F++F + + T
Sbjct: 311 TFDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCHGTGLFRAFSQHVLHRLNITQEGPKD 370
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + +V + + + + ++ R H+
Sbjct: 371 GKIRVTILARSTDY-RKILNQNELVNALKTVSTLEVKVVDYKYKELEFSEQLRITHNSDI 429
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL +V+ ++ + Y + AR+ G+ Y+ ++
Sbjct: 430 FIGMHGAGLTHLLFLPDWAVVFELYNCEDE---RCYLD-LARLRGIHYITWR 477
>gi|30424992|ref|NP_780522.1| EGF domain-specific O-linked N-acetylglucosamine transferase
precursor [Mus musculus]
gi|81898120|sp|Q8BYW9.1|EOGT_MOUSE RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
transferase; AltName: Full=Extracellular O-linked
N-acetylglucosamine transferase; Flags: Precursor
gi|26332182|dbj|BAC29821.1| unnamed protein product [Mus musculus]
gi|29294677|gb|AAH48939.1| RIKEN cDNA A130022J15 gene [Mus musculus]
gi|148666933|gb|EDK99349.1| RIKEN cDNA A130022J15, isoform CRA_a [Mus musculus]
Length = 527
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 53/232 (22%), Positives = 106/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L++T +N+ F DV + + D + + +++ FT + +I++
Sbjct: 245 NMYHHFCD-FLNLYLTQHVNNSFS-TDVYIVMWDTSTYGYGDLFSDTWKAFTDYDVIHLK 302
Query: 166 N-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
+ CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TYDSKKVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V + + + D R H+
Sbjct: 363 GKVRVTILARSTEY-RKILNQDELVNALKTVSTFEVRVVDYKYRELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIHYITWR 469
>gi|196015587|ref|XP_002117650.1| hypothetical protein TRIADDRAFT_32818 [Trichoplax adhaerens]
gi|190579819|gb|EDV19908.1| hypothetical protein TRIADDRAFT_32818 [Trichoplax adhaerens]
Length = 484
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 57/244 (23%), Positives = 109/244 (44%), Gaps = 23/244 (9%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIINI-- 164
N FH F D FV L++T +N F I+ + + + F+RHP+ +
Sbjct: 196 NMFHHFCD-FVNLYVTQHVNGSFTTDINIVMWDTSSMNYGDFFTLTWKAFSRHPVKRLSD 254
Query: 165 -NNQTITHCFQSVTLGL---ISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSF 220
+N+ + CF+ L + +G + N L+P ++F + + T+ +
Sbjct: 255 YDNKRV--CFKDAIFTLPPRMFYG-LYYNMPLIPGCYNSGLMKAFSEHVVHRLNITTKPY 311
Query: 221 HHTKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPEESTSLADSFRFIHSCH 279
+ ++ L++R+ + R ILN ++ A + + +DVT + H+
Sbjct: 312 DPNRYRITLLSRSTKY-RRILNEDKLIGALKTIWNYDVTRVDYTYQMPFEKQLEITHNSD 370
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYL-----EYKIK 334
+G+HG+GLTH LF V ++ + Y++ AR+ G++YL +Y I+
Sbjct: 371 IFIGMHGSGLTHMLFQPHWGVAFELYNCEDE---GCYYD-LARLRGVKYLTWEKKKYLIQ 426
Query: 335 QEES 338
Q+E
Sbjct: 427 QDEG 430
>gi|441665604|ref|XP_004091823.1| PREDICTED: LOW QUALITY PROTEIN: EGF domain-specific O-linked
N-acetylglucosamine transferase [Nomascus leucogenys]
Length = 527
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 104/232 (44%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +++ T + II++
Sbjct: 245 NMYHHFCD-FINLYITQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAXTDYDIIHLK 302
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
CF+ L+ R + N L+ + F++F + + T
Sbjct: 303 TYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V I + + D R H+
Sbjct: 363 GKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 469
>gi|327266162|ref|XP_003217875.1| PREDICTED: uncharacterized glycosyltransferase AER61-like [Anolis
carolinensis]
Length = 626
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 58/240 (24%), Positives = 105/240 (43%), Gaps = 23/240 (9%)
Query: 109 NFFHEFMDCFVPLFIT---INSHFPDQDVILAIADCNDQWARKYAELLPR----FTRHPI 161
N +H F D F L+IT INS D ++++ D A Y +L FT + I
Sbjct: 344 NMYHHFCD-FANLYITQHMINSFSTDVNIVMW-----DTSAYGYGDLFSETWKAFTDYEI 397
Query: 162 ININNQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSS 218
+++ + CF+ L+ R + N L+ F++F + + T
Sbjct: 398 VHLKSYDSKRVCFKEAVFTLLPRMRYGLFYNTPLISGCHGTGLFRAFSQHVLHRLNVTQE 457
Query: 219 SFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIH 276
K ++ ++ R+ R ILN E+ A + L F+V I + + + +
Sbjct: 458 GPKDGKIRVTILARSTEY-RKILNQNELANALKTLSLFEVQIVNYKYKELDFKEQLKITQ 516
Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
+ +G+HGAGLTH LFL +V+ ++ + Y + AR+ G+ Y+ ++ K +
Sbjct: 517 NSDIFIGMHGAGLTHLLFLPDWAVIFELYNCEDE---RCYLDL-ARLRGVHYITWEKKDK 572
>gi|300175586|emb|CBK20897.2| unnamed protein product [Blastocystis hominis]
Length = 606
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 45/82 (54%), Gaps = 4/82 (4%)
Query: 237 GRTILN----LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHS 292
GR ++N L+E+++ A E GF++ +F + +L D F + ++G HG +
Sbjct: 478 GRDVVNENALLKELEQWASEKGFELVVFTASDYKTLDDLFMLLADVDVVLGPHGGAFYNM 537
Query: 293 LFLRPGSVLMQVVPIGTQWLST 314
LF+R G +++ VP +LST
Sbjct: 538 LFMRRGITVIEFVPDSRMFLST 559
>gi|402859641|ref|XP_003894256.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
transferase [Papio anubis]
Length = 535
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 57/240 (23%), Positives = 107/240 (44%), Gaps = 23/240 (9%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIINI- 164
N +H F D FV L++T +N+ F DV + + D + + +++ FT + II++
Sbjct: 245 NMYHHFCD-FVNLYLTQHVNNSF-STDVYIVMWDTSSYGYGDLFSDTWNAFTDYDIIHLK 302
Query: 165 --------NNQTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENT 214
N + CF+ L+ R + N L+ + F++F + +
Sbjct: 303 TYDSKGYWNFPEMFVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLN 362
Query: 215 NTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPE-ESTSLADSF 272
T K ++ ++ R+ R ILN E+ A + + F+V I + + D
Sbjct: 363 ITQEGPKDGKIRVTILARSTEY-RKILNQNELVNALKTVPTFEVQIVDYKYRELGFLDQL 421
Query: 273 RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
R H+ +G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 RITHNTDIFIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIHYITWQ 477
>gi|434392082|ref|YP_007127029.1| capsular polysaccharide biosynthesis protein-like protein
[Gloeocapsa sp. PCC 7428]
gi|428263923|gb|AFZ29869.1| capsular polysaccharide biosynthesis protein-like protein
[Gloeocapsa sp. PCC 7428]
Length = 422
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 37/135 (27%), Positives = 67/135 (49%), Gaps = 11/135 (8%)
Query: 222 HTKPKL-----VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIH 276
HT PK+ + ++R R + N +V E+ GF + E S S+ + +
Sbjct: 272 HTSPKVASEEYIYISRGDARKRRVANEGDVINYLEKFGFKIVTLE---SMSVLEQAQLFS 328
Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQE 336
S A+V HGAGLT+++F +PG+ L++ ++ +Y++ V GLEY + +Q+
Sbjct: 329 SAKAIVAPHGAGLTNAVFCQPGTKLVEF--FSPNYVHPLYWDLSNHV-GLEYYYFLGEQQ 385
Query: 337 ESSLVEKYGANDLVL 351
A+D+V+
Sbjct: 386 HLPSYHNPNADDIVI 400
>gi|148235443|ref|NP_001085557.1| EGF domain-specific O-linked N-acetylglucosamine transferase
precursor [Xenopus laevis]
gi|82184549|sp|Q6GQ23.1|EOGT_XENLA RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
transferase; AltName: Full=Extracellular O-linked
N-acetylglucosamine transferase; Flags: Precursor
gi|49117992|gb|AAH72925.1| MGC80419 protein [Xenopus laevis]
Length = 525
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 55/233 (23%), Positives = 105/233 (45%), Gaps = 17/233 (7%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIINI-- 164
N +H F D FV L+IT +N+ F I+ + +++ FT + I ++
Sbjct: 243 NMYHHFCD-FVNLYITQHVNNSFSTDINIVMWTTSVYGYGDLFSDTWKAFTDYEITHLKA 301
Query: 165 -NNQTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH 221
+N+ + CF+ L+ R + N L+ F++F + + T
Sbjct: 302 YDNKRV--CFKDAVFALLPRMRYGLFYNTPLISHCHGSGLFRAFSQHVLHRLNITQHPAT 359
Query: 222 HTKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPEEST-SLADSFRFIHSCH 279
K ++ ++ R+ R ILNL E+ +A E + F V + + + + H+
Sbjct: 360 EAKIRVTILVRSTEF-RKILNLDELVQALEAVPTFQVKVVDYKYRVLGFLEQLSITHNSD 418
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL +V+ ++ + Y + AR+ G++Y+ ++
Sbjct: 419 IFIGMHGAGLTHLLFLPDWAVVFELYNCED---ARCYLD-LARLRGIQYMTWE 467
>gi|57222245|ref|NP_001009502.1| EGF domain-specific O-linked N-acetylglucosamine transferase
precursor [Rattus norvegicus]
gi|81862277|sp|Q5NDL0.1|EOGT_RAT RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
transferase; AltName: Full=Extracellular O-linked
N-acetylglucosamine transferase; Flags: Precursor
gi|56550370|emb|CAI30571.1| glycosyltransferase [Rattus norvegicus]
gi|149036803|gb|EDL91421.1| glycosyltransferase Aer61, isoform CRA_b [Rattus norvegicus]
Length = 527
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 53/232 (22%), Positives = 106/232 (45%), Gaps = 15/232 (6%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L++T IN+ F DV + + D + + +++ FT + +I++
Sbjct: 245 NMYHHFCD-FLNLYLTQHINNSFS-TDVYIVMWDTSSYGYGDLFSDTWKAFTDYDVIHLK 302
Query: 166 N-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
+ CF+ L+ R + N L+ + F++F + + +
Sbjct: 303 TYDSKKVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFSQHVLHRLNISQEGPKD 362
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHA 280
K ++ ++ R+ R ILN E+ A + + F+V + + + D R H+
Sbjct: 363 GKLRVTILARSTEY-RKILNQNELVNALKTVSTFEVRVVDYKYRELGFLDQLRITHNTDI 421
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ ++
Sbjct: 422 FIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIYYITWQ 469
>gi|224006746|ref|XP_002292333.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971975|gb|EED90308.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 848
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 81/185 (43%), Gaps = 23/185 (12%)
Query: 229 LVNRNARVGRTILNLREVKKA------AEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
L R R R+ LNL EV KA A+E + E + S + S A++
Sbjct: 588 LAQRTYR--RSWLNLAEVMKACDTAEYAKEAKVLCVVVNVEHTASPYEQLLLHRSVDALI 645
Query: 283 GVHGAGLTHSLFLRPGSVLMQVVP-----IGTQWLSTVYFEKPARVL----GLEYLEYKI 333
GVHGA LT ++ L P +++++P I W+ T + P ++ L +L Y +
Sbjct: 646 GVHGAQLTQAVLLPPHGHILELLPWVPNYIRGAWVQTRHTPTPLGIIYHNTDLNHLGYSL 705
Query: 334 KQEESSLVEKYGANDL---VLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
++ L E + L LKN + F W ++ V I+RF +Y+K +
Sbjct: 706 GRDSVPLCEGVDDDQLEHCFLKNKKNFM---WDVRDFNVRPDIVIQYIERFVLYMKGEQR 762
Query: 391 KAKKL 395
+ K +
Sbjct: 763 QCKDV 767
>gi|346319407|gb|EGX89009.1| DUF563 domain protein [Cordyceps militaris CM01]
Length = 477
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
+L ++ A + V F S L + H ++GVHGAGLTHS+F++PG+VL
Sbjct: 347 HLERLRAAVPHMKLTVVDFA---SMPLHQQVQVAHQTDVLLGVHGAGLTHSMFMKPGAVL 403
Query: 302 MQVVPIGTQWLSTVYFEKPARVLGLEYL 329
++++P + F A++LG Y
Sbjct: 404 IEILPAD---FTHKGFRNLAQMLGHRYF 428
>gi|118404476|ref|NP_001072691.1| EGF domain-specific O-linked N-acetylglucosamine transferase
precursor [Xenopus (Silurana) tropicalis]
gi|123884480|sp|Q08CY9.1|EOGT_XENTR RecName: Full=EGF domain-specific O-linked N-acetylglucosamine
transferase; AltName: Full=Extracellular O-linked
N-acetylglucosamine transferase; Flags: Precursor
gi|115313758|gb|AAI24029.1| glycosyltransferase AER61 [Xenopus (Silurana) tropicalis]
Length = 525
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/233 (24%), Positives = 103/233 (44%), Gaps = 17/233 (7%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIINI-- 164
N +H F D FV L+IT +N+ F I+ + +++ FT + I ++
Sbjct: 243 NMYHHFCD-FVNLYITQHVNNSFSTDINIVMWTTSVYGYGDLFSDTWKAFTDYDITHLKA 301
Query: 165 -NNQTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH 221
+N+ + CF+ L+ R + N L+ F++F + + T
Sbjct: 302 YDNKRV--CFKDAVFALLPRMRYGLFYNTPLISNCHGSGLFRAFSQHVLHRLNITQQLPK 359
Query: 222 HTKPKLVLVNRNARVGRTILNLREVKKAAE-ELGFDVTIFEPEEST-SLADSFRFIHSCH 279
K ++ ++ R+ R ILNL E+ A E E F V + + + + H+
Sbjct: 360 EAKIRITILVRSTEF-RKILNLDELVHALEAEPTFQVKVVDYKYRVLGFLEQLEITHNSD 418
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL +V+ ++ + Y + AR+ G+ Y+ ++
Sbjct: 419 IFIGMHGAGLTHLLFLPDWAVVFELYNCEDE---RCYLDL-ARLRGIRYMTWE 467
>gi|226482514|emb|CAX73856.1| glycoprotein 2-beta-D-xylosyltransferase [Schistosoma japonicum]
Length = 453
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 60/128 (46%), Gaps = 14/128 (10%)
Query: 193 LPKPKTLVDFQSFLANAYNENTNTSSSFHHTK----PKLVLVNR----------NARVGR 238
L K + +F+ F+ +YN N + + T PK+V+V+R N + R
Sbjct: 259 LIKNMFIEEFRQFIFQSYNINNDEDTCQKRTSIRFLPKIVIVSRRDYIAHPRNINGTIHR 318
Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
I N E+ +LGF + ++ + + I S ++G+HGA LT+SL L
Sbjct: 319 KITNELELLNKLNQLGFQNSKVVCFTDLTMQEQLKLIMSTDILIGMHGAALTYSLLLSNT 378
Query: 299 SVLMQVVP 306
S ++++ P
Sbjct: 379 SCVIELFP 386
>gi|432858928|ref|XP_004069008.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
transferase-like [Oryzias latipes]
Length = 528
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 62/256 (24%), Positives = 114/256 (44%), Gaps = 26/256 (10%)
Query: 90 CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCN-DQWA 146
C + P VF N +H F D FV L+I+ IN+ F + D+++ + D + +
Sbjct: 228 CDIIIEKPT-VFMKLDAGVNMYHHFCD-FVNLYISQHINNSF-NSDILIVMWDTSFYGYG 284
Query: 147 RKYAELLPRFTRHPIININNQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQ 203
++E FT + I+++ CF+ L+ R + N L+ T F+
Sbjct: 285 DLFSETWRAFTDYDIVHLKTYDSKRVCFKDSFFSLLPRMRYGLFYNTPLVSNCYTEGMFR 344
Query: 204 SFLANAYNENTNTSSSFHHTKPK-----LVLVNRNARVGRTILNLREVKKAAEELGF-DV 257
+F +++ H KPK L L+ R+ R ILN ++ A + + +V
Sbjct: 345 AF-----SQHVLHRLHIHQNKPKERHVRLTLLARSTEY-RRILNQEKLVNALKTVPLLEV 398
Query: 258 TIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVY 316
T+ + + + S H+ +G+HGAGLTH LFL +V+ ++ +
Sbjct: 399 TVVDYKYKDVSFLKQLEISHNSDIFIGMHGAGLTHLLFLPDWAVIFELYNCQDE----SC 454
Query: 317 FEKPARVLGLEYLEYK 332
+ AR+ G+ Y+ ++
Sbjct: 455 YRDLARLRGIRYVTWQ 470
>gi|427703345|ref|YP_007046567.1| hypothetical protein Cyagr_2107 [Cyanobium gracile PCC 6307]
gi|427346513|gb|AFY29226.1| hypothetical protein Cyagr_2107 [Cyanobium gracile PCC 6307]
Length = 435
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 57/235 (24%), Positives = 95/235 (40%), Gaps = 38/235 (16%)
Query: 103 AGGYNGNFFHEFMDCFVPLFITINSHFPDQ--DVILAIADCNDQWARKYAELLPRFT--R 158
AG ++ N+FH M ++P+ + P D L D E L RF
Sbjct: 192 AGPWSDNYFHWLMQ-YLPILKIASQWQPLSTIDHFLVRGPVKD----FQREALARFGIPA 246
Query: 159 HPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSS 218
H +I IN + + + + L I P ++ + L T S
Sbjct: 247 HKVIGINKNQV-YIIEDLLLTSIPCDNFTYEPWVIRMLREL----------------TES 289
Query: 219 SFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGF---DVTIFEPEESTSLADSFRFI 275
+P + +R A R I+N+ E+ E G D + +E +LA S
Sbjct: 290 GSRKGRPTSLFFDRRAPAPRRIVNMAEISDLLENYGIQSIDCSTISFQEQINLASSSSL- 348
Query: 276 HSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLE 330
++GVHGA L +S+F PG+VL++++P + YF + A L++L+
Sbjct: 349 -----LIGVHGASLANSVFSAPGTVLIELLPRNYR---PRYFSELASACQLQHLK 395
>gi|332709295|ref|ZP_08429257.1| hypothetical protein LYNGBM3L_39730 [Moorea producens 3L]
gi|332351841|gb|EGJ31419.1| hypothetical protein LYNGBM3L_39730 [Moorea producens 3L]
Length = 959
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 47/81 (58%), Gaps = 3/81 (3%)
Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
P+ + + R R +LN EVK+ + GF +TI ES S+ + S A+V
Sbjct: 810 PERIYITRAQARHRQVLNETEVKELLSQFGF-ITI--ALESLSVREQVALFASVKAIVAP 866
Query: 285 HGAGLTHSLFLRPGSVLMQVV 305
HGAGL++++F RPG+ ++++V
Sbjct: 867 HGAGLSNTVFCRPGTQIIELV 887
>gi|407860860|gb|EKG07547.1| hypothetical protein TCSYLVIO_001321 [Trypanosoma cruzi]
Length = 579
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 65/127 (51%), Gaps = 11/127 (8%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
+P+L+L+NRN R I + + V AE +GF+V + E+ L D M+G
Sbjct: 400 RPRLLLINRNFR---EIHDYQAVAALAERVGFNVQVVYFEK-MPLEDQVHVSRHADVMMG 455
Query: 284 VHGAGLTHSLFL----RPGS-VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEES 338
+HG GLTH L++ RP L++++P G +++ A +G+ Y EY I+ +
Sbjct: 456 MHGMGLTHVLWMDGRRRPRCRALLELMPFGCPGTLIHFYKLFADTVGIHY-EY-IEAADM 513
Query: 339 SLVEKYG 345
L E G
Sbjct: 514 YLKETGG 520
>gi|449133953|ref|ZP_21769462.1| capsular polysaccharide biosynthesis protein [Rhodopirellula
europaea 6C]
gi|448887370|gb|EMB17750.1| capsular polysaccharide biosynthesis protein [Rhodopirellula
europaea 6C]
Length = 313
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 56/108 (51%), Gaps = 8/108 (7%)
Query: 196 PKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGF 255
P+T+ ++++ N ++ F+ + V ++R R + N REV+ + GF
Sbjct: 152 PQTMESLRNWVWN-----SSQLDRFNVVPHRRVYISRQFAQMRRVENEREVQGFMRDEGF 206
Query: 256 DVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
++ E+ S D R A+VG+HGAGLT+ LF+ PG+ +++
Sbjct: 207 EIV---HTENLSFDDQVRMFFETKALVGIHGAGLTNLLFMHPGTSVLE 251
>gi|326432569|gb|EGD78139.1| hypothetical protein PTSG_12822 [Salpingoeca sp. ATCC 50818]
Length = 623
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 56/109 (51%), Gaps = 9/109 (8%)
Query: 226 KLVLVNRNARVGRTILNLREVKKA------AEELGFDVTIFEPEESTSLADSFRFIHSCH 279
++V+VNR A GR+IL + + A A + +V + E E+ SL+ ++
Sbjct: 426 RVVIVNRPASSGRSILGVDAIAGAVRATLAALNISHEVIVEEHLETKSLSAQIALFNAAT 485
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
A++ HGA T+ LFLRP ++ ++V P + AR++G +Y
Sbjct: 486 AVISAHGAANTNWLFLRPNALAIEVFPFAYH---PRQYSTIARLIGAKY 531
>gi|71664664|ref|XP_819310.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70884606|gb|EAN97459.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 578
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 59/113 (52%), Gaps = 10/113 (8%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
+P+L+L+NRN R I + + V AE +GF+V + E+ L D M+G
Sbjct: 399 RPRLLLINRNFR---EIHDYQAVAALAERVGFNVQVVYFEK-MPLEDQVHVSRHADVMMG 454
Query: 284 VHGAGLTHSLFL----RPGS-VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY 331
+HG GLTH L++ RP L++++P G +++ A +G+ Y EY
Sbjct: 455 MHGMGLTHVLWMDGRRRPRCRALLELMPFGCPRTLIHFYKLFADTVGIHY-EY 506
>gi|71419571|ref|XP_811209.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70875847|gb|EAN89358.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 579
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 59/113 (52%), Gaps = 10/113 (8%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
+P+L+L+NRN R I + + V AE +GF+V + E+ L D M+G
Sbjct: 400 RPRLLLINRNFR---EIHDYQAVAALAERVGFNVQVVYFEK-MPLEDQVHVSRHADVMMG 455
Query: 284 VHGAGLTHSLFL----RPGS-VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY 331
+HG GLTH L++ RP L++++P G +++ A +G+ Y EY
Sbjct: 456 MHGMGLTHVLWMDGRRRPRCRALLELMPFGCPRTLIHFYKLFADTVGIHY-EY 507
>gi|322709092|gb|EFZ00668.1| DUF563 domain protein [Metarhizium anisopliae ARSEF 23]
Length = 362
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 10/95 (10%)
Query: 238 RTILNLREV----KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSL 293
R ++N E+ K+A L ++ F S A+ + + ++GVHGAGLTH++
Sbjct: 223 RKLINETELIASAKRAVPHLNIEIVDFA---EFSFAEQLKIVRETDLLIGVHGAGLTHTM 279
Query: 294 FLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
FL PGS +++++P + + F A++LG Y
Sbjct: 280 FLPPGSAVVEILPGD---FAHMGFRNLAQLLGHRY 311
>gi|301778056|ref|XP_002924441.1| PREDICTED: uncharacterized glycosyltransferase AER61-like
[Ailuropoda melanoleuca]
gi|281342540|gb|EFB18124.1| hypothetical protein PANDA_013786 [Ailuropoda melanoleuca]
Length = 527
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/273 (21%), Positives = 118/273 (43%), Gaps = 17/273 (6%)
Query: 69 KSAMSKVKELTITTSAP-PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--I 125
+S ++++ T + P + C + PA N +H F D F+ L+IT +
Sbjct: 205 QSWFAELQSYTQLSFRPLEDAKCDIVIEKPAYFMKLDA-GVNMYHHFCD-FINLYITQHV 262
Query: 126 NSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININNQTITH-CFQSVTLGLISH 183
N+ F DV + + D + + + + FT + +I++ CF+ L+
Sbjct: 263 NNSF-GTDVYIVMWDTSSYGYGDLFLDTWKAFTDYDVIHLKTYDSKRVCFKEAVFSLLPR 321
Query: 184 GR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTIL 241
R + N L+ + F++F + + T K ++ ++ R+ R IL
Sbjct: 322 MRHGLFYNTPLISGCQNTGLFRAFSQHVLHRLNITQEGPKDGKIRVTILARSTEY-RKIL 380
Query: 242 NLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGS 299
N E+ A + + +V I + + + D + H+ +G+HGAGLTH LFL +
Sbjct: 381 NQNELVNALKTVSTLEVQIVDYKYKELGFLDQLQITHNTDIFIGMHGAGLTHLLFLPDWA 440
Query: 300 VLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+ ++ + Y + AR+ G+ Y+ ++
Sbjct: 441 AVFELYNCEDE---RCYLDL-ARLRGVHYVTWR 469
>gi|291239169|ref|XP_002739505.1| PREDICTED: Uncharacterized glycosyltransferase AER61-like
[Saccoglossus kowalevskii]
Length = 588
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/203 (25%), Positives = 90/203 (44%), Gaps = 10/203 (4%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCND-QWARKYAELLPRFTRHPIININ 165
N +H F D FV ++I+ IN+ F DV + + D + + ++ F+ +PII I
Sbjct: 303 NMYHHFCD-FVNIYISQHINNSFS-SDVNIVMWDTSGLSYGDFFSATWQAFSDYPIIPIK 360
Query: 166 NQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
C + L+ + N L+P ++F + +
Sbjct: 361 RWDGKKVCMKEAVFSLLPRMQRGFYYNMPLVPSCHGSGIIKAFSQHLMHRLKIPQEGPLK 420
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAM 281
K ++ L+ RN + R I+N E+ KA +E V + E + ++ H+
Sbjct: 421 NKVRVTLLARNTK-HRNIINQNELVKAMKKEKDLTVKVVEYNRNMPFLKQLKYTHNSDIF 479
Query: 282 VGVHGAGLTHSLFLRPGSVLMQV 304
+G+HGAGLTHSLFL +V+ ++
Sbjct: 480 IGMHGAGLTHSLFLPDWAVVFEL 502
>gi|358385599|gb|EHK23196.1| glycosyltransferase family 61 protein [Trichoderma virens Gv29-8]
Length = 454
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 39/67 (58%), Gaps = 3/67 (4%)
Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE 327
++ + + +VGVHGAGLTH +FL+PGS +++++P G Q F A++LG+
Sbjct: 346 FSEQIQIVRETDLLVGVHGAGLTHLMFLQPGSAVLEILPEGLQHKG---FRNLAQMLGIG 402
Query: 328 YLEYKIK 334
+ K
Sbjct: 403 FFRAHAK 409
>gi|56550356|emb|CAI30564.1| glycosyltransferase [Ciona intestinalis]
Length = 503
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 55/111 (49%), Gaps = 12/111 (10%)
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPEEST-SLADSFRFIHSCHA 280
TKP + +V R I+N +++ K + GF V + E + T S D H+
Sbjct: 348 TKPDYL-----GKVYRQIVNEKDLIKVLNNIPGFQVKVVEYHQDTMSFKDQISMSHNSDI 402
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY 331
M+G+HGAGLTH LFL P +V ++ + + AR+ G+ ++ +
Sbjct: 403 MIGMHGAGLTHFLFLPPWAVAFELYNCQAK-----CYRDLARLRGVRHMTW 448
>gi|160358325|ref|NP_001027840.1| glycosyltransferase aer61b precursor [Ciona intestinalis]
Length = 498
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 55/111 (49%), Gaps = 12/111 (10%)
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPEEST-SLADSFRFIHSCHA 280
TKP + +V R I+N +++ K + GF V + E + T S D H+
Sbjct: 343 TKPDYL-----GKVYRQIVNEKDLIKVLNNIPGFQVKVVEYHQDTMSFKDQISMSHNSDI 397
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY 331
M+G+HGAGLTH LFL P +V ++ + + AR+ G+ ++ +
Sbjct: 398 MIGMHGAGLTHFLFLPPWAVAFELYNCQAK-----CYRDLARLRGVRHMTW 443
>gi|384488169|gb|EIE80349.1| hypothetical protein RO3G_05054 [Rhizopus delemar RA 99-880]
Length = 198
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/187 (24%), Positives = 81/187 (43%), Gaps = 19/187 (10%)
Query: 215 NTSSSFH---HTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTI--FEPEESTSLA 269
N+ +H T K ++NR + R I N+ +V +A + DV+I +E ++
Sbjct: 11 NSVREYHSGSKTNIKFAILNR--KQSRHITNIPDVIEAMLKEFKDVSIKLINYDEGCNIR 68
Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYL 329
+ + + + HG GL LF++ GS V+ I ++W S +F P + + + L
Sbjct: 69 STAQLVEDIDVFISPHGNGLGSGLFMKKGST---VISIDSRWYSEDWFYWPMKAVNVRIL 125
Query: 330 EYKIK----QEESSLVEKYGANDLVLKNP-----QAFAGANWSNMRVYLKTQNVKLDIDR 380
Y QE + A DL + Q + + Y K ++DI+R
Sbjct: 126 YYDCNNPSCQEFDEQLLMKLAPDLTEEQKKEIMTQEVTSVDRLLVEQYRKDSKRRIDIER 185
Query: 381 FRIYLKD 387
F ++LKD
Sbjct: 186 FIVFLKD 192
>gi|347759425|ref|YP_004866986.1| hypothetical protein GLX_02040 [Gluconacetobacter xylinus NBRC
3288]
gi|347578395|dbj|BAK82616.1| hypothetical protein GLX_02040 [Gluconacetobacter xylinus NBRC
3288]
Length = 579
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 61/127 (48%), Gaps = 20/127 (15%)
Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
+LV ++R R ILN + LGF++ + TSLAD R +V H
Sbjct: 454 RLVYISRQDTKARRILNEDALIDRLRSLGFEILV---ATGTSLADQIRIFREARLVVAGH 510
Query: 286 GAGLTHSLFLRPGSVLMQVV--------PI------GTQWLSTVYFEKP---ARVLGLEY 328
GAG+++ LF R G+ L++++ P+ G ++ S ++FE + ++
Sbjct: 511 GAGVSNMLFAREGTALLELIQASYLNVGPMRLAQIAGCRYYSMLFFEDGPHNGWYVDIDR 570
Query: 329 LEYKIKQ 335
+E+ I+Q
Sbjct: 571 VEWAIRQ 577
>gi|440795174|gb|ELR16310.1| glycosyltransferase AER61, putative [Acanthamoeba castellanii str.
Neff]
Length = 524
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 61/266 (22%), Positives = 117/266 (43%), Gaps = 45/266 (16%)
Query: 84 APPNLSCGVTHTSPALVFSAGGYNG-NFFH---EFMDCFVPLFITINSHFPDQDVILAIA 139
A + CGV + +P +V Y N +H E+++ F L + D++ + +
Sbjct: 234 ADQSTECGVVYENPVVVIMR--YEAWNMYHQLGEWINAFTTLEVVDKL---DKNTQVLLL 288
Query: 140 DCNDQWARKYAELLPRFT-RHPIININNQTITH---CFQSVTLGLISHGRMVIN------ 189
D +++ + ++L F+ HP++ + + + CF+ + +G + N
Sbjct: 289 DMHEK-TEPFTDMLKVFSPDHPLV-LGKELVGKGKVCFKDAIMPWEGYGTFIHNNVWRAS 346
Query: 190 ---PTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHT---KPKLVLVNRNARVGRT---- 239
P L L F F+ N H +P++ L+ R +GR
Sbjct: 347 HGEPCL--DSDILEAFSHFVLNKL-------GMLKHNIPNEPRITLILRKDYMGRKLDRK 397
Query: 240 ILNLREVKKAAEELGFDVTIFEPE--ESTSLADSFRFIHS-CHAMVGVHGAGLTHSLFLR 296
I N +V KA EE+ F E+ + + ++S + ++GVHGAGL+H++FL
Sbjct: 398 ISNEDQVVKALEEVSRGRASFSSVQLETMTFKEQVELMYSKTNILIGVHGAGLSHTVFLP 457
Query: 297 PGSVLMQVVPIGTQWLSTVYFEKPAR 322
P ++L++++P + S YF A+
Sbjct: 458 PEAILIELLPDSVK--SFTYFRNLAK 481
>gi|348510387|ref|XP_003442727.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized glycosyltransferase
AER61-like [Oreochromis niloticus]
Length = 528
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 56/251 (22%), Positives = 110/251 (43%), Gaps = 16/251 (6%)
Query: 90 CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCN-DQWA 146
C + P VF N +H F D F+ L+I+ IN+ F D+ + + D + +
Sbjct: 228 CDIIVEKPT-VFMKLDAGVNMYHHFCD-FINLYISQHINNSF-SSDINIVMWDTSFYGYG 284
Query: 147 RKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQ 203
++E F+ + II++ + CF+ L+ R + N L+ + F+
Sbjct: 285 DLFSETWRAFSEYDIIHLKTFDSKRVCFKDAFFSLLPRMRYGLFYNTPLISDCYSEGMFR 344
Query: 204 SFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTI--LNLREVKKAAEELGFDVTIFE 261
+F + + T + ++ L+ R+ R + + L K A L +V ++
Sbjct: 345 AFSQHILHRLNITRDKPQEGRVRVTLLARSTEYRRILNHMELVNALKTAPLLEVNVVDYK 404
Query: 262 PEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPA 321
++ L + R H+ +G+HGAGLTH LFL +V+ ++ + + A
Sbjct: 405 YKDVPFL-EQLRITHNSDIFIGMHGAGLTHLLFLPDWAVIFELYNCQDE----SCYRDLA 459
Query: 322 RVLGLEYLEYK 332
R+ G+ Y+ ++
Sbjct: 460 RLRGIRYVTWQ 470
>gi|240141135|ref|YP_002965615.1| hypothetical protein MexAM1_META1p4710 [Methylobacterium extorquens
AM1]
gi|240011112|gb|ACS42338.1| hypothetical protein MexAM1_META1p4710 [Methylobacterium extorquens
AM1]
Length = 414
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 50/93 (53%), Gaps = 5/93 (5%)
Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
+ + V+R+ GR + N V LGF+ F+PE ++ RF + +VG+
Sbjct: 282 RRLFVDRHPGRGRGLANRDAVLGLVRGLGFEA--FDPELTSVRQQVVRF-SAAEIVVGIA 338
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
GAG+T+++F RPG+ ++ +VP G W Y E
Sbjct: 339 GAGMTNTVFCRPGTPVIHLVPEG--WEDRFYGE 369
>gi|91083375|ref|XP_966996.1| PREDICTED: similar to glycosyltransferase [Tribolium castaneum]
gi|270006906|gb|EFA03354.1| hypothetical protein TcasGA2_TC013339 [Tribolium castaneum]
Length = 518
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 116/266 (43%), Gaps = 25/266 (9%)
Query: 89 SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF--ITINSHFPDQ---DVILAIADCND 143
C V P + N +H F D F+ L+ I +NS D DV + I +
Sbjct: 210 DCDVVIEKPTFIMKIDA-TVNMYHHFCD-FLNLYASIHLNSTQWDAFSTDVHVLIWETY- 266
Query: 144 QWARKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGRMVI----NPTLLPKPKT 198
+ + + FT HP+ ++ + T CF++V L+ RM+ N ++ +
Sbjct: 267 TYRSAFGDTWEAFTDHPVWDLKTFRGETVCFKNVVFPLLP--RMIFGLYYNTPIIYGCEN 324
Query: 199 LVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDV 257
FQ+F + + ++ K ++ L+ R+ + R ILN E+ +A A+ ++V
Sbjct: 325 SGLFQAFSQHILHRLKIPFHPRNNRKIRITLLARDTKY-RRILNEDELVEALAKNHDYEV 383
Query: 258 TIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYF 317
+ + ++GVHGAGLTH LFL + + ++ + YF
Sbjct: 384 QKVVYNKDVPFKKQLEITRNSDVLIGVHGAGLTHLLFLPDWAAVFELYNCED---ANCYF 440
Query: 318 EKPARVLGLEYLEY----KIKQEESS 339
+ AR+ GL+Y+ + K+K +E
Sbjct: 441 DL-ARLRGLKYVTWEKLDKLKSQEDG 465
>gi|418064455|ref|ZP_12701932.1| hypothetical protein MetexDRAFT_6668, partial [Methylobacterium
extorquens DSM 13060]
gi|373546436|gb|EHP73201.1| hypothetical protein MetexDRAFT_6668, partial [Methylobacterium
extorquens DSM 13060]
Length = 328
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 50/93 (53%), Gaps = 5/93 (5%)
Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
+ + V+R+ GR + N V LGF+ F+PE ++ RF + +VG+
Sbjct: 196 RRLFVDRHPGRGRGLANRDAVLGLVRGLGFEA--FDPELTSVRQQVVRF-SAAEIVVGIA 252
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
GAG+T+++F RPG+ ++ +VP G W Y E
Sbjct: 253 GAGMTNTVFCRPGTPVIHLVPEG--WEDRFYGE 283
>gi|70994078|ref|XP_751886.1| DUF563 domain protein [Aspergillus fumigatus Af293]
gi|66849520|gb|EAL89848.1| DUF563 domain protein [Aspergillus fumigatus Af293]
Length = 488
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 95/216 (43%), Gaps = 23/216 (10%)
Query: 149 YAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLAN 208
Y +L F + P++ IN+ T+ F+++ + L NP + S L
Sbjct: 260 YFDLWKIFAQRPLLRINDLHPTNEFENLIVPLAGGS----NPLWQGDWEIHSCEDSALVR 315
Query: 209 AYNENTNTSSSFH------HTKPKLVLVNRNARVGRTILNLRE----VKKAAEELGFDVT 258
++ S FH P++V+ + R ++N ++ VKK + +
Sbjct: 316 TFSRRI--LSHFHVEFRRPRQGPQIVVTFIDRTGSRKLINQKDYFNTVKKQFPHITVQMI 373
Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
F S + R +VGVHGAGLTH +FL GSV+++++P G L+ F
Sbjct: 374 DFA---SIPFQEQLRIAQGSDILVGVHGAGLTHGIFLPSGSVMVEILPPG---LNHKGFR 427
Query: 319 KPARVLG-LEYLEYKIKQEESSLVEKYGANDLVLKN 353
A +LG L + + K ++ + + ND+ L++
Sbjct: 428 NLASLLGHLYFSAHATKPAKTVKRDDWHNNDVYLEH 463
>gi|433590319|ref|YP_007279815.1| hypothetical protein Natpe_0999 [Natrinema pellirubrum DSM 15624]
gi|433305099|gb|AGB30911.1| hypothetical protein Natpe_0999 [Natrinema pellirubrum DSM 15624]
Length = 406
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 93/206 (45%), Gaps = 17/206 (8%)
Query: 104 GGYNGNFFHEFMDCFVPLFITINSHFPDQD-VILAIADCNDQWARKYAELLPRFTRHPII 162
G Y+ N++H F + ++P + H+P D V L I +W R ELL F + I+
Sbjct: 168 GPYDKNYYHWFGN-YLPRLEGLQ-HYPGADEVTLMIPADPPEWLRDSLELL-GFGDYNIV 224
Query: 163 NINNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLV--DFQSFLANAYNENTNTSSSF 220
+ + H + + +S R + +P + +F+ ++ + ENT ++
Sbjct: 225 EWPGKPV-HADRYI----LSRYRSEASSETGNRPPIMSKKNFK-WILDRIGENTPNNTR- 277
Query: 221 HHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHA 280
+P++ + +A RT LN +EV E F+ I SLA+ R +
Sbjct: 278 EKDRPRIYISREDATARRT-LNEQEVMATLSEYDFEKYILS---EYSLAEQIRLFSNAEI 333
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVP 306
++G HGAGL + LF +V+ + P
Sbjct: 334 IIGAHGAGLINMLFASDATVIELLGP 359
>gi|118344554|ref|NP_001072048.1| glycosyltransferase precursor [Takifugu rubripes]
gi|56550374|emb|CAI30573.1| glycosyltransferase [Takifugu rubripes]
Length = 525
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 63/275 (22%), Positives = 123/275 (44%), Gaps = 21/275 (7%)
Query: 69 KSAMSKVKELTITTSAP-PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--I 125
+S ++++ T S P N C +T P VF N +H F D FV L+I+ I
Sbjct: 203 QSWYAELQTYTELDSDPIVNGQCDLTVDKPT-VFMKLDAGVNMYHHFCD-FVNLYISQHI 260
Query: 126 NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIINI---NNQTITHCFQSVTLGLIS 182
N+ F I+ + ++ ++E F+++ II++ +N+ + CF+ L+
Sbjct: 261 NNSFSSDINIVMWDTSSYEYGDLFSETWRAFSQNDIIHLKVYDNKRV--CFRDALFSLLP 318
Query: 183 HGR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTI 240
R + N L+ + F++F + + + ++ L+ R+ R I
Sbjct: 319 RMRYGLFYNTPLISDCYSEGMFRAFSQHILHRLHVPQDGPKDGRVRVTLLARSTEY-RKI 377
Query: 241 LNLREVKKAAE---ELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
LN E+ A + L +V ++ ++ L + H+ +G+HGAGLTH LFL
Sbjct: 378 LNQVELVNALKTVPNLEVNVVDYKYKDVPFLV-QLKTTHNSDIFIGMHGAGLTHLLFLPD 436
Query: 298 GSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+V+ ++ + + AR+ G+ Y+ ++
Sbjct: 437 WAVIFELYNCQDE----SCYRDLARLRGVRYVTWQ 467
>gi|254563646|ref|YP_003070741.1| hypothetical protein METDI5316 [Methylobacterium extorquens DM4]
gi|254270924|emb|CAX26929.1| hypothetical protein METDI5316 [Methylobacterium extorquens DM4]
Length = 414
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 51/95 (53%), Gaps = 5/95 (5%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
+ + + V+R+ GR + N V LGF+ F+PE ++ RF + +VG
Sbjct: 280 RGRRLFVDRHPGRGRGLANRDAVLGLLRGLGFEA--FDPELTSVRQQVVRF-SAAEIVVG 336
Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
+ GAG+T+++F RPG+ ++ +VP G W Y E
Sbjct: 337 IAGAGMTNTVFCRPGTPVIHLVPEG--WEDRFYGE 369
>gi|443724229|gb|ELU12341.1| hypothetical protein CAPTEDRAFT_168806 [Capitella teleta]
Length = 519
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 54/244 (22%), Positives = 110/244 (45%), Gaps = 12/244 (4%)
Query: 69 KSAMSKVKELTITTSAP-PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--I 125
+S +++++ T +S P +C V P +F N +H F D ++ L+++ +
Sbjct: 192 QSWYAELEKYTQLSSDPFEEKTCEVIIDKPT-IFIKLDAGVNMYHHFCD-YINLYLSQHM 249
Query: 126 NSHFPDQDVILAIADCND-QWARKYAELLPRFTRHPIININNQTITH-CFQSVTLGLISH 183
N+ F DV + + D + + + F+ H I+ + CF+ V ++
Sbjct: 250 NNSFS-TDVYIVMWDTSPMHYGDFFHVTWKAFSDHEIVPLKEYDGKKVCFKDVVFSFLAR 308
Query: 184 GR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTIL 241
R + N L+P + F++F + + T K ++ L++R+ + R IL
Sbjct: 309 MRYGLYYNMPLIPGCQGSSFFRAFNQHILHRLNITQDGPLLDKVRITLLSRSTKF-RKIL 367
Query: 242 NLREVKKAAEELG-FDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSV 300
N E+ A + + + V + + T + + ++ +G+HGAGLTH LF +V
Sbjct: 368 NEDELVTALKSVDDYKVQVVDFNYKTPFLEQLQVTYNSDFFIGMHGAGLTHVLFQPDWAV 427
Query: 301 LMQV 304
L ++
Sbjct: 428 LFEI 431
>gi|56550372|emb|CAI30572.1| glycosyltransferase [Tetraodon nigroviridis]
Length = 525
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 64/274 (23%), Positives = 121/274 (44%), Gaps = 19/274 (6%)
Query: 69 KSAMSKVKELTITTSAP-PNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--I 125
+S ++++ T S P N C +T P VF N +H F D FV L+I+ I
Sbjct: 203 QSWYAELQTYTELDSDPIVNGQCDLTVDKPT-VFMKLDAGVNMYHHFCD-FVNLYISQHI 260
Query: 126 NSHFPDQDVILAIADCN-DQWARKYAELLPRFTRHPIININNQTITH-CFQSVTLGLISH 183
N+ F D+ + + D + ++ ++E F+ + II++ CF+ L+
Sbjct: 261 NNSF-SSDINIVMWDTSFYEYGDLFSETWRAFSENDIIHLKTYDSKRVCFRDAFFSLLPR 319
Query: 184 GR--MVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTIL 241
R + N L+ + F++F + + + ++ L+ R+ R IL
Sbjct: 320 MRYGLFYNTPLISDCYSEGMFRAFSQHVLHRLNIPQDGPKDGRVRVTLLARSTEY-RKIL 378
Query: 242 NLREVKKAAE---ELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
N E+ A + L +V F+ ++ L + H+ +G+HGAGLTH LFL
Sbjct: 379 NQVELVNALKTVPHLKVNVVDFKYKDVPFLV-QLKITHNSDIFIGMHGAGLTHLLFLPDW 437
Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+V+ ++ + + AR+ G+ Y+ ++
Sbjct: 438 AVIFELYNCQDE----SCYRDLARLRGVRYVTWQ 467
>gi|312377513|gb|EFR24326.1| hypothetical protein AND_11153 [Anopheles darlingi]
Length = 533
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 58/254 (22%), Positives = 109/254 (42%), Gaps = 21/254 (8%)
Query: 90 CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHF--PD---QDVILAIADCNDQ 144
C V PA + N +H F D F+ L+ +++++ PD D + I + + +
Sbjct: 223 CDVVIEKPAFIMKIDA-AINMYHHFCD-FLNLYGSLHANLSHPDGFTTDTQVLIWE-SFR 279
Query: 145 WARKYAELLPRFTRHPIININNQT-ITHCFQSVTLGLISHGRMVI----NPTLLPKPKTL 199
+ +A+ FTRHPI ++ CF++V L L+ RM+ N ++ +
Sbjct: 280 YISPFADTFKVFTRHPIADLKTYAGKVVCFRNVVLPLLP--RMIFGLYYNTPIIHGCENS 337
Query: 200 VDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE-VKKAAEELGFDVT 258
F +F + + + ++ ++R R R +LN E V ++ + V
Sbjct: 338 GLFHAFSEHVLHRMRIRLVKRPDRRLRITFLSRQTRY-RRVLNENELVASISDNPDYSVQ 396
Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
+ + S A+ + + +G+HGAGLTH LFL L ++ +
Sbjct: 397 LVTYGQEMSFAEQLKITRNTDIFIGMHGAGLTHLLFLPKWGTLFELYHCE----DPNCYR 452
Query: 319 KPARVLGLEYLEYK 332
AR+ G+ YL ++
Sbjct: 453 DLARLRGVHYLSWE 466
>gi|188584016|ref|YP_001927461.1| capsular polysaccharide biosynthesis protein-like protein
[Methylobacterium populi BJ001]
gi|179347514|gb|ACB82926.1| capsular polysaccharide biosynthesis protein-like protein
[Methylobacterium populi BJ001]
Length = 426
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 5/95 (5%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
+ + + V+R+ GR + N V +LGF+ F+PE ++ RF +VG
Sbjct: 292 RGRRLFVDRHPGRGRGLANREAVLARLRDLGFEA--FDPELTSVRQQVVRFA-GAEIVVG 348
Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
+ GAG+ +++F RPG+ ++ +VP G W Y E
Sbjct: 349 IAGAGMANTVFCRPGTPVIHLVPEG--WEDLFYGE 381
>gi|334335498|ref|XP_003341780.1| PREDICTED: uncharacterized glycosyltransferase AER61-like
[Monodelphis domestica]
Length = 319
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 46/193 (23%), Positives = 89/193 (46%), Gaps = 10/193 (5%)
Query: 149 YAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSF 205
++E FT + +I++ + T CF+ L+ R + N L+ + F++F
Sbjct: 78 FSETWKAFTDYDVIHLKAYDSKTVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTGLFRAF 137
Query: 206 LANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPE- 263
+ + T K ++ ++ R+ R ILN E+ A + + F+V I + +
Sbjct: 138 SQHVLHRLNITWDGPKDGKIRVTILARSTEY-RKILNQNELVNALKTVSAFEVQIVDYKY 196
Query: 264 ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARV 323
+ + R H+ +G+HGAGLTH LFL +V+ ++ + Y + AR+
Sbjct: 197 KEMGFLEQLRITHNTDIFIGMHGAGLTHLLFLPDWAVIFELYNCEDE---RCYLDL-ARL 252
Query: 324 LGLEYLEYKIKQE 336
G+ Y+ +K K +
Sbjct: 253 RGIHYVTWKKKSK 265
>gi|397614564|gb|EJK62878.1| hypothetical protein THAOC_16495 [Thalassiosira oceanica]
Length = 719
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 57/122 (46%), Gaps = 17/122 (13%)
Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVP-----IGTQWLSTVYFEKPARVL----GLE 327
S A+VGVHGA LT S+ L PG +++++P I +W+ T + P ++ L
Sbjct: 491 SLDALVGVHGAQLTQSVLLPPGGHVLELLPWVPQYIRGRWVQTTHAPTPLGIIFHNSDLN 550
Query: 328 YLEYKIKQEESSLVEKYGAND-----LVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFR 382
+L Y + ++ L EK+G L N + F W N + V ++RF
Sbjct: 551 HLGYSLGRDSIPLCEKFGEGSAEEEACFLGNRKRFI---WENRDFAVDPGAVVYYVERFV 607
Query: 383 IY 384
+Y
Sbjct: 608 LY 609
>gi|428315211|ref|YP_007113093.1| Tetratricopeptide TPR_2 repeat-containing protein [Oscillatoria
nigro-viridis PCC 7112]
gi|428238891|gb|AFZ04677.1| Tetratricopeptide TPR_2 repeat-containing protein [Oscillatoria
nigro-viridis PCC 7112]
Length = 949
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 47/81 (58%), Gaps = 3/81 (3%)
Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
PK + ++R+ R +LN + + E+ GF V+I ES SLA+ + +V
Sbjct: 809 PKRIYISRSKARYRRVLNEEDAIEVLEKFGF-VSILP--ESMSLAEQIAHFYHAEVIVAA 865
Query: 285 HGAGLTHSLFLRPGSVLMQVV 305
HG+GLT+++F R G+ ++++V
Sbjct: 866 HGSGLTNTIFCRQGTKVIELV 886
>gi|427407648|ref|ZP_18897850.1| hypothetical protein HMPREF9718_00324 [Sphingobium yanoikuyae ATCC
51230]
gi|425714152|gb|EKU77163.1| hypothetical protein HMPREF9718_00324 [Sphingobium yanoikuyae ATCC
51230]
Length = 828
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 57/109 (52%), Gaps = 9/109 (8%)
Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
P+ V ++R A R +LN ++ A GF++ F + L H+ ++
Sbjct: 693 PRRVYISRRAVPRRPMLNESHIEDHARSAGFEILDFA---TLPLWHQIAISHNAETIMSP 749
Query: 285 HGAGLTHSLFLRPGSVLMQVVPI--GTQWLSTVYFEKPARVLGLEYLEY 331
HGAGL+H +F +PG+ +++++PI GT L Y AR+ L+ L+Y
Sbjct: 750 HGAGLSHLIFAKPGTQVIELLPIQDGTYQLRFNY----ARLSILKGLDY 794
>gi|334117762|ref|ZP_08491853.1| Tetratricopeptide TPR_1 repeat-containing protein [Microcoleus
vaginatus FGP-2]
gi|333460871|gb|EGK89479.1| Tetratricopeptide TPR_1 repeat-containing protein [Microcoleus
vaginatus FGP-2]
Length = 971
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 55/101 (54%), Gaps = 7/101 (6%)
Query: 205 FLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEE 264
FL + + SSS+ PK + ++R+ R +LN +V + E+ GF V+I E
Sbjct: 815 FLRREFLKGILPSSSY----PKRIYISRSKARYRRVLNEEDVVEVLEKFGF-VSILP--E 867
Query: 265 STSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
S SLA+ + +V HG+GLT+++F G+ ++++V
Sbjct: 868 SLSLAEQIAHFYHAEVIVAAHGSGLTNTIFCSQGTKVIELV 908
>gi|154421576|ref|XP_001583801.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121918045|gb|EAY22815.1| hypothetical protein TVAG_075620 [Trichomonas vaginalis G3]
Length = 454
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 29/128 (22%), Positives = 67/128 (52%), Gaps = 13/128 (10%)
Query: 264 ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTV--YFEKPA 321
E+ + + + V +HG+GL+H L+++PG+ ++++ WL T +++K A
Sbjct: 329 ENVPIEKQMKIMSKASLFVSIHGSGLSHILWMKPGTCVIEL----KTWLHTCNDWYQKAA 384
Query: 322 RVLGLEYLEYKIKQ--EESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDID 379
R G+ Y+ Y + ++ S + Y + + + + F G+ + + L+ QN+ ++ +
Sbjct: 385 RATGIHYMAYYPHETLDKPSYISPYLQHCI---DNRIFCGS--KHCKDALRDQNITVNAE 439
Query: 380 RFRIYLKD 387
RF +K+
Sbjct: 440 RFENGIKE 447
>gi|428212874|ref|YP_007086018.1| capsular polysaccharide biosynthesis protein [Oscillatoria
acuminata PCC 6304]
gi|428001255|gb|AFY82098.1| capsular polysaccharide biosynthesis protein [Oscillatoria
acuminata PCC 6304]
Length = 791
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 57/118 (48%), Gaps = 13/118 (11%)
Query: 189 NPTLLPKPKTLVDF--QSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV 246
P + PK + DF + F + A ++ + T KL + R A+V R +N E+
Sbjct: 615 GPFIEGPPKWVCDFLRERFASLADSQGSETPE-------KLYISRRKAKV-RRFINEDEI 666
Query: 247 KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
E GF I E S S+ + + ++ HGAGLT+++F +PG+ L+++
Sbjct: 667 STLLEFYGFKTVILE---SLSVQEQITLLAGAKTIIAPHGAGLTNTIFCQPGTQLLEI 721
>gi|428184519|gb|EKX53374.1| hypothetical protein GUITHDRAFT_101078 [Guillardia theta CCMP2712]
Length = 1432
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 61/245 (24%), Positives = 97/245 (39%), Gaps = 29/245 (11%)
Query: 108 GNFFHEFMDCFVPLFITI------------NSHFPDQDVILAIADCNDQWARKYAELLPR 155
G+ HE F LF TI S F D +I + D D+ +K++ LL
Sbjct: 1129 GHLLHE---TFFALFHTILTFSGSDQQNLEESVFDDVILISDVIDA-DKSLKKFSFLLST 1184
Query: 156 FTRHPIININNQTITH---CFQSVTLGLISHGRMVINPTLLPKPKT------LVDFQSFL 206
+ HP + CF+ + +GL M + + KT D Q
Sbjct: 1185 LSNHPWFTFSTLRTAGRRVCFRQLVVGLSDGMNMFATEETVKEDKTSSPIPNYRDVQMLR 1244
Query: 207 ANAYNENTNTSSSFHHTKPKLV-LVNRNARVG--RTILNLREVKKAAEELGFDVTIFEPE 263
+ ++V V+R+ + R I NL ++ ++ E L V I E E
Sbjct: 1245 SRVLKFVETAGRRRKSGGERVVAFVHRSEELTSRRGIFNLDQLVRSVESLNSRVMIIEFE 1304
Query: 264 ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARV 323
+ SL + + A+VGV G GL ++L++R G+ +Q+ P G + FE R
Sbjct: 1305 K-MSLVEQVEVVQDVDALVGVTGTGLWNALWMRRGAAGIQIFPFGVGYKGGREFENAIRY 1363
Query: 324 LGLEY 328
EY
Sbjct: 1364 GPGEY 1368
>gi|148224592|ref|NP_001086091.1| glycosyltransferase-like domain-containing protein 2 precursor
[Xenopus laevis]
gi|82076799|sp|Q5NDE7.1|GTDC2_XENLA RecName: Full=Glycosyltransferase-like domain-containing protein 2;
AltName: Full=Extracellular O-linked N-acetylglucosamine
transferase-like; Flags: Precursor
gi|56605348|emb|CAI30871.1| glycosyltransferase [Xenopus laevis]
Length = 578
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 63/249 (25%), Positives = 111/249 (44%), Gaps = 27/249 (10%)
Query: 109 NFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRF--TRHPIININN 166
N H F D +P+F TI FPD D + + W L +F + P++
Sbjct: 162 NLMHVFHDDLLPIFYTIQ-QFPDLDFESRLF-FMEGWNEGLHFELYKFMSNKQPLLKEQL 219
Query: 167 QTITH--CFQSVTLGL----ISHGRMVINPTLLPKPKTLVD------FQSFLANAYNENT 214
+T+ CF +GL + + P PK LV F F+ N
Sbjct: 220 KTLGRLLCFTKSYVGLSKITTWYQYGFVQPQG-PKANILVSGNEIRHFAKFMMGKLNITL 278
Query: 215 NTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFR 273
+ +++ + +VL +R+ + R I+N E+ A A+E TI E S +D R
Sbjct: 279 DQNAA----EAYIVLFSRS--MNRLIVNEAELLLALAQEFQMK-TITVSLEDHSFSDIVR 331
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEY 331
+ + +V +HGA L SLFL G+V++++ P G ++ + + G+E Y+ +
Sbjct: 332 LLSNATMLVSMHGAQLVTSLFLPKGAVVVELFPYGINPEHYTPYKTLSTLPGMELQYVAW 391
Query: 332 KIKQEESSL 340
+ +EE+++
Sbjct: 392 QNTEEENTI 400
>gi|213625326|gb|AAI70355.1| Glycosyltransferase [Xenopus laevis]
gi|213626901|gb|AAI70357.1| Ago61 protein [Xenopus laevis]
Length = 578
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 63/249 (25%), Positives = 111/249 (44%), Gaps = 27/249 (10%)
Query: 109 NFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRF--TRHPIININN 166
N H F D +P+F TI FPD D + + W L +F + P++
Sbjct: 162 NLMHVFHDDLLPIFYTIQ-QFPDLDFESRLF-FMEGWNEGLHFELYKFMSNKQPLLKEQL 219
Query: 167 QTITH--CFQSVTLGL----ISHGRMVINPTLLPKPKTLVD------FQSFLANAYNENT 214
+T+ CF +GL + + P PK LV F F+ N
Sbjct: 220 KTLGRLLCFTKSYVGLSKITTWYQYGFVQPQG-PKANILVSGNEIRHFAKFMMGKLNITL 278
Query: 215 NTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFR 273
+ +++ + +VL +R+ + R I+N E+ A A+E TI E S +D R
Sbjct: 279 DQNAA----EAYIVLFSRS--MNRLIVNEAELLLALAQEFQMK-TITVSLEDHSFSDIVR 331
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEY 331
+ + +V +HGA L SLFL G+V++++ P G ++ + + G+E Y+ +
Sbjct: 332 LLSNATMLVSMHGAQLVTSLFLPKGAVVVELFPYGINPEHYTPYKTLSTLPGMELQYVAW 391
Query: 332 KIKQEESSL 340
+ +EE+++
Sbjct: 392 QNTEEENTI 400
>gi|428212875|ref|YP_007086019.1| hypothetical protein Oscil6304_2480 [Oscillatoria acuminata PCC
6304]
gi|428001256|gb|AFY82099.1| tetratricopeptide repeat protein [Oscillatoria acuminata PCC 6304]
Length = 692
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 44/77 (57%), Gaps = 3/77 (3%)
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+ ++R R +N E+ + + GF + I E S S+++ + S A+V HGA
Sbjct: 547 IYISRQRSQFRRFVNEEEILECLGKFGFKMVILE---SFSVSEQIAIMASAKAIVAPHGA 603
Query: 288 GLTHSLFLRPGSVLMQV 304
GLT+++F +PG+ L+++
Sbjct: 604 GLTNAVFCQPGTKLIEI 620
>gi|152967606|ref|YP_001363390.1| capsular polysaccharide biosynthesis protein-like protein
[Kineococcus radiotolerans SRS30216]
gi|151362123|gb|ABS05126.1| Capsular polysaccharide biosynthesis protein-like [Kineococcus
radiotolerans SRS30216]
Length = 398
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 43/81 (53%), Gaps = 3/81 (3%)
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
V V+R R +LN +V A GF + S ++ + S +VGVHGA
Sbjct: 262 VFVDRPETDRRRMLNREDVVAALRGRGFRIVDLA---SLTVQEEAVLFGSAAVVVGVHGA 318
Query: 288 GLTHSLFLRPGSVLMQVVPIG 308
GLT+ +F RPG+V++++ P G
Sbjct: 319 GLTNLVFCRPGTVVLELAPRG 339
>gi|452821526|gb|EME28555.1| transferase, transferring glycosyl groups [Galdieria sulphuraria]
Length = 433
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 32/116 (27%), Positives = 60/116 (51%), Gaps = 8/116 (6%)
Query: 226 KLVLVNRNA--RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
K++++ RN R+ R + +E++K G V+I E ++L + + +H ++
Sbjct: 280 KIIMIQRNTTRRIRRAEIMTKELEKV---FGVKVSIILFEYLSALQQA-QLMHRVSIVIA 335
Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
HGA L++ +F++ GSVL+++ P YF A+ LGL Y +ES+
Sbjct: 336 AHGASLSNIIFMKRGSVLIELSPSLCS--GEGYFGALAKFLGLTYFHISGMHQESN 389
>gi|365898907|ref|ZP_09436839.1| hypothetical protein BRAO3843_3160069 [Bradyrhizobium sp. STM 3843]
gi|365420397|emb|CCE09381.1| hypothetical protein BRAS3843_3160069 [Bradyrhizobium sp. STM 3843]
Length = 386
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 57/115 (49%), Gaps = 6/115 (5%)
Query: 230 VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGL 289
++R R +LN V + ++ GF+V E + S+ + + SC ++G HG G+
Sbjct: 252 ISRGLAGYRRVLNEDIVIEMLKKRGFEVAAME---TMSVPEQAAVMASCEVVIGPHGGGM 308
Query: 290 THSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKY 344
++ +F PG+ +++ I + L YF K + LGL+Y K ++L Y
Sbjct: 309 SNVIFCSPGT---KIIEIYSPELVATYFWKLSNQLGLQYYYMLGKGHPTTLGTDY 360
>gi|58332110|ref|NP_001011203.1| glycosyltransferase-like domain-containing protein 2 precursor
[Xenopus (Silurana) tropicalis]
gi|82076796|sp|Q5NDE6.1|GTDC2_XENTR RecName: Full=Glycosyltransferase-like domain-containing protein 2;
AltName: Full=Extracellular O-linked N-acetylglucosamine
transferase-like; Flags: Precursor
gi|56605350|emb|CAI30872.1| glycosyltransferase [Xenopus (Silurana) tropicalis]
gi|134024387|gb|AAI35938.1| ago61 protein [Xenopus (Silurana) tropicalis]
Length = 576
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 63/249 (25%), Positives = 110/249 (44%), Gaps = 27/249 (10%)
Query: 109 NFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRF--TRHPIININN 166
N H F D +P+F TI F D D + + W L +F + P++
Sbjct: 162 NLMHVFHDDLIPIFYTIQ-QFADLDFESRLFFM-EGWNEGLHFELYKFMSNKQPLLKEQL 219
Query: 167 QTITH--CFQSVTLGL----ISHGRMVINPTLLPKPKTLVD------FQSFLANAYNENT 214
+T+ CF +GL + + P PK LV F F+ N
Sbjct: 220 KTLGRLLCFTKSYVGLSKITTWYQYGFVQPQG-PKANILVSGNEIRHFAKFMMGKLNITK 278
Query: 215 NTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFR 273
+ +++ + +VL +R+ + R I+N E+ A A+E TI E S AD R
Sbjct: 279 DQNAA----EAYIVLFSRS--MNRLIVNEAELLLALAQEFQMK-TITVSLEDHSFADIVR 331
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEY 331
I + +V +HGA L SLFL G++++++ P G ++ + + G+E Y+ +
Sbjct: 332 LISNATMLVSMHGAQLITSLFLPKGAIVVELFPYGVNPEHYTPYKTLSTLPGMELQYVAW 391
Query: 332 KIKQEESSL 340
+ +EE+++
Sbjct: 392 QNTEEENTI 400
>gi|89272827|emb|CAJ82077.1| novel protein [Xenopus (Silurana) tropicalis]
Length = 576
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 63/249 (25%), Positives = 110/249 (44%), Gaps = 27/249 (10%)
Query: 109 NFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRF--TRHPIININN 166
N H F D +P+F TI F D D + + W L +F + P++
Sbjct: 162 NLMHVFHDDLIPIFYTIQ-QFADLDFESRLF-FMEGWNEGLHFELYKFMSNKQPLLKEQL 219
Query: 167 QTITH--CFQSVTLGL----ISHGRMVINPTLLPKPKTLVD------FQSFLANAYNENT 214
+T+ CF +GL + + P PK LV F F+ N
Sbjct: 220 KTLGRLLCFTKSYVGLSKITTWYQYGFVQPQG-PKANILVSGNEIRHFAKFMMGKLNITK 278
Query: 215 NTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFR 273
+ +++ + +VL +R+ + R I+N E+ A A+E TI E S AD R
Sbjct: 279 DQNAA----EAYIVLFSRS--MNRLIVNEAELLLALAQEFQMK-TITVSLEDHSFADIVR 331
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEY 331
I + +V +HGA L SLFL G++++++ P G ++ + + G+E Y+ +
Sbjct: 332 LISNATMLVSMHGAQLITSLFLPKGAIVVELFPYGVNPEHYTPYKTLSTLPGMELQYVAW 391
Query: 332 KIKQEESSL 340
+ +EE+++
Sbjct: 392 QNTEEENTI 400
>gi|193632082|ref|XP_001951734.1| PREDICTED: uncharacterized glycosyltransferase AER61-like
[Acyrthosiphon pisum]
Length = 499
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 54/259 (20%), Positives = 115/259 (44%), Gaps = 30/259 (11%)
Query: 90 CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN-----SHFPDQDVILAIADCNDQ 144
C + T P ++ N +H F D F+ L+ + + + +D+ + + + +
Sbjct: 197 CDLIITEPTVIMKIDA-TVNMYHHFCD-FLNLYASQHVNGSGASMFSKDIHILVWE-SFA 253
Query: 145 WARKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGRMVI----NPTLLPKPKTL 199
+ +++ FT+HP+ N+N + CF + L L+ RM+ N L+ +
Sbjct: 254 YESAFSDTFQAFTKHPVWNLNTFRGKVVCFNDIVLPLLP--RMIFGLYYNTPLIDGCENS 311
Query: 200 VDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTI 259
F++F + + + K ++ ++RN + R +LN E+ A + + +
Sbjct: 312 GLFKAFSQHVLHRLNIDQKPNANEKIRITFLSRNTKY-RNVLNENELITALK----NYSQ 366
Query: 260 FEPEESTSLADSFRFIHSCHA------MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLS 313
+E ++ + F H +G+HGAGLTH LFL +VL ++ +
Sbjct: 367 YEVKKVVYSGNFLTFKEQVHITYNTDIFIGMHGAGLTHLLFLPEWAVLFELYNCEDEHC- 425
Query: 314 TVYFEKPARVLGLEYLEYK 332
++ AR+ G++Y+ ++
Sbjct: 426 ---YKDLARLRGVKYITWR 441
>gi|242049810|ref|XP_002462649.1| hypothetical protein SORBIDRAFT_02g029517 [Sorghum bicolor]
gi|241926026|gb|EER99170.1| hypothetical protein SORBIDRAFT_02g029517 [Sorghum bicolor]
Length = 114
Score = 47.4 bits (111), Expect = 0.014, Method: Composition-based stats.
Identities = 31/83 (37%), Positives = 45/83 (54%), Gaps = 7/83 (8%)
Query: 310 QWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGAN-WS-NMRV 367
++L + + PA+V +E I +ESSL +Y D VL +P A A W +V
Sbjct: 17 EYLLSGWCSSPAKVASVE-----IAPKESSLSREYPTGDPVLTDPAGVAHARGWDVTKKV 71
Query: 368 YLKTQNVKLDIDRFRIYLKDAYK 390
YL QNV+LD+ RFR L A++
Sbjct: 72 YLDRQNVRLDLARFREELVGAHR 94
>gi|72057504|ref|XP_791565.1| PREDICTED: uncharacterized glycosyltransferase AGO61-like
[Strongylocentrotus purpuratus]
Length = 610
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 94/220 (42%), Gaps = 28/220 (12%)
Query: 108 GNFFHEFMDCFVPLFIT------INSHFPDQDVILAIADCNDQWARKYAELLPRFTRH-P 160
GN H F D +P+F +N D L +D DQ + +L ++ P
Sbjct: 161 GNLMHVFHDDLLPVFYMQLQLGLLNLRSGVNDATLLTSD--DQEEGPFFQLYKHISKKTP 218
Query: 161 II----NINNQTITHCFQSVTLGLISHGRMVINPTLLPK---------PKTLVDFQSFLA 207
I+ + + + CF+ V +GL P+ K + F SF
Sbjct: 219 ILTHQLSQDGSSRIACFEEVHVGLSKFTTWYQYGFDKPQGPLDDIAVTSKEITLFTSFYK 278
Query: 208 NAYN-ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEEST 266
+ N + ++++ +T V+++R R R ILN E+ A + + + E
Sbjct: 279 SKLNIDGCDSNAEIENT---FVILSR--RTNRLILNEVELSLALAQHFDALVVVASLEMY 333
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
SL++ I +V VHG+ L+ S+FL PGSVL+++ P
Sbjct: 334 SLSELIGLISCSKGLVAVHGSLLSLSIFLPPGSVLVEIFP 373
>gi|159464961|ref|XP_001690707.1| glycosyltransferase-like protein [Chlamydomonas reinhardtii]
gi|158270384|gb|EDO96234.1| glycosyltransferase-like protein [Chlamydomonas reinhardtii]
Length = 579
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 53/104 (50%), Gaps = 26/104 (25%)
Query: 226 KLVLVNRN---ARVGRTILNLREVKKAA------------------EELGFDVTIFEPEE 264
KLV ++RN + VGRTILN +E+ + + L F T E
Sbjct: 328 KLVFISRNDDQSAVGRTILNEQELVQLCNAAPAGDLPPAAWGSPYKKYLCFAHTFGE--- 384
Query: 265 STSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
+ L D + + A++G+HGAGLT+ L+++PG +++V P G
Sbjct: 385 -SQLMDVW-VMRKVDAILGMHGAGLTNGLYMKPGGAIIEVRPFG 426
>gi|339247853|ref|XP_003375560.1| conserved hypothetical protein [Trichinella spiralis]
gi|316971062|gb|EFV54900.1| conserved hypothetical protein [Trichinella spiralis]
Length = 554
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 68/295 (23%), Positives = 123/295 (41%), Gaps = 32/295 (10%)
Query: 90 CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--INSHF-PDQDVILAIADCNDQWA 146
C V P +V N +H F D F+ L+++ +N F D D+IL D + +
Sbjct: 254 CDVIFDQPTIVIKLDA-GINMYHHFCD-FINLYLSQHLNGSFHQDVDIILWDTDAS-PYF 310
Query: 147 RKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGRMVINPTL-LPKPKTLVDFQS 204
+ E FT P+I++ + CF+ V +++ R V +P P S
Sbjct: 311 DIFRETWLAFTTKPLIDLQDFDGKRVCFREVMFPVLA--RKVFGLYYNMPMPDYCR--AS 366
Query: 205 FLANAYNENTNTSSSFHHTKP-----KLVLVNRNARVGRTILNLRE-VKKAAEELGFDVT 258
L A++ + P ++ + R+++ R I+N E VK+ + F VT
Sbjct: 367 GLVQAFSHHLIHRLQLKQNGPLREKLRVTYLVRSSQY-RIIMNTNEIVKRLKADPQFSVT 425
Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
+ + + + ++ H+ + +HGAGLTH FL +V+ ++ G +
Sbjct: 426 VAKYTLDIPVLEQYQMSHNTDIFMSIHGAGLTHMFFLPDWAVVFELYHCG----DPECYR 481
Query: 319 KPARVLGLEYL----EYKIKQEESSL-----VEKYGANDLVLKNPQAFAGANWSN 364
A + GL++ E K++ +E + Y D V++ A WSN
Sbjct: 482 DLATLRGLKFFGWEDETKVQYQEKDVKNPKFTNFYLDADEVMRIVHLAAEYVWSN 536
>gi|400530656|gb|AFP86479.1| glycosyltransferase, partial [Halosauropsis macrochir]
Length = 274
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 70/156 (44%), Gaps = 14/156 (8%)
Query: 194 PKPKTLV------DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
PK LV F +FLA N T +V+ +R+ V R ILN E+
Sbjct: 2 PKANILVSGNEVRQFSAFLAEKLN--VTVGEQVEKTDEYIVVFSRS--VNRLILNEAELI 57
Query: 248 KA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
A A+E V EE S AD R I +V +HGA L SLFL G+ ++++ P
Sbjct: 58 LALAQEFQLKVVTVSLEEQ-SFADIIRVISRASMLVSMHGAQLVTSLFLPRGAAVVELFP 116
Query: 307 IGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
++ + G L+Y+ ++ +EE+S+
Sbjct: 117 YAVNPEHYAPYKTLTSLPGMDLQYVAWRNTKEENSV 152
>gi|56756779|gb|AAW26561.1| SJCHGC00943 protein [Schistosoma japonicum]
Length = 369
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 60/128 (46%), Gaps = 14/128 (10%)
Query: 193 LPKPKTLVDFQSFLANAYNENTNTSSSFHHTK----PKLVLVNR----------NARVGR 238
L K + +F+ F+ +YN N + + T PK+V+V+R N + R
Sbjct: 175 LIKNMFIEEFRQFIFQSYNINNDEDTCQKRTSIRFLPKIVIVSRRDYIAHPRNINGTIHR 234
Query: 239 TILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
I N E+ +LGF + ++ + + I S ++G+HGA LT+SL L
Sbjct: 235 KITNELELLNELNQLGFQNSKVVCFTDLTMQEQLKLIMSTDILIGMHGAALTYSLLLSNT 294
Query: 299 SVLMQVVP 306
S ++++ P
Sbjct: 295 SCVIELFP 302
>gi|163853710|ref|YP_001641753.1| capsular polysaccharide biosynthesis protein-like protein
[Methylobacterium extorquens PA1]
gi|163665315|gb|ABY32682.1| Capsular polysaccharide biosynthesis protein-like protein
[Methylobacterium extorquens PA1]
Length = 414
Score = 47.0 bits (110), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 5/95 (5%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
+ + + V+R+ GR + N V LGF+ F+PE ++ RF + +VG
Sbjct: 280 RGRRLFVDRHPGRGRGLANRDAVLGLLRGLGFEA--FDPELTSVRQQVVRF-SAAEIVVG 336
Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
+ GAG+T+++F +PG+ ++ +VP G W Y E
Sbjct: 337 IAGAGMTNTVFCQPGTPVIHLVPEG--WEDRFYGE 369
>gi|218532568|ref|YP_002423384.1| capsular polysaccharide biosynthesis protein-like protein
[Methylobacterium extorquens CM4]
gi|218524871|gb|ACK85456.1| capsular polysaccharide biosynthesis protein-like protein
[Methylobacterium extorquens CM4]
Length = 414
Score = 47.0 bits (110), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 5/95 (5%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
+ + + V+R+ GR + N V LGF+ F+PE ++ RF + +VG
Sbjct: 280 RGRRLFVDRHPGRGRGLANRDAVLGLLHGLGFEA--FDPELTSVRQQVVRF-SAAEIVVG 336
Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
+ GAG+T+++F +PG+ ++ +VP G W Y E
Sbjct: 337 IAGAGMTNTVFCQPGTPVIHLVPEG--WEDRFYGE 369
>gi|384261549|ref|YP_005416735.1| Capsular polysaccharide biosynthesis protein-like protein
[Rhodospirillum photometricum DSM 122]
gi|378402649|emb|CCG07765.1| Capsular polysaccharide biosynthesis protein-like protein
[Rhodospirillum photometricum DSM 122]
Length = 384
Score = 46.6 bits (109), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 73/173 (42%), Gaps = 23/173 (13%)
Query: 169 ITHCFQSVTLGL-----------ISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTS 217
IT FQ TL L I GR V+ L+ P + F N +EN +
Sbjct: 176 ITLPFQQETLALAAGARPLRFHEIDRGRTVVVDDLVFVP----NLNDFGWNLRHENLDLY 231
Query: 218 SSFHHTKP--KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFI 275
P + + + R R ++N E+ GFD+ P E S+ + R +
Sbjct: 232 PGLRPRGPGERKLYLARGPSAKRPLVNEPEIIARLAAAGFDII---PLEGVSVHEQARLM 288
Query: 276 HSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
+V HGAGLT+ +F PG+ + ++ +L+ V F + A +LGL Y
Sbjct: 289 AEARVIVAPHGAGLTNLVFCGPGTTVCEL--HMDSYLNWV-FRRMANLLGLRY 338
>gi|357509677|ref|XP_003625127.1| Glycosyltransferase [Medicago truncatula]
gi|355500142|gb|AES81345.1| Glycosyltransferase [Medicago truncatula]
Length = 87
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 27/45 (60%)
Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWL 312
+ ++ C + GVHGA LTH +F++P V +Q+V +G W+
Sbjct: 39 IGKDLELLNECDVVFGVHGAALTHFMFMKPSFVFIQIVSLGIHWI 83
>gi|339021440|ref|ZP_08645518.1| hypothetical protein ATPR_1826 [Acetobacter tropicalis NBRC 101654]
gi|338751504|dbj|GAA08822.1| hypothetical protein ATPR_1826 [Acetobacter tropicalis NBRC 101654]
Length = 393
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 42/79 (53%), Gaps = 3/79 (3%)
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+ + R R + N ++ +A E GF T+ PE + S+AD R +VG GA
Sbjct: 256 LFIERGGASNRLMPNEADLAQALAEAGF--TVVRPE-TLSVADQMRLFAKARLVVGALGA 312
Query: 288 GLTHSLFLRPGSVLMQVVP 306
G+ + + RPG+V+ ++VP
Sbjct: 313 GMANLAWCRPGTVICELVP 331
>gi|119500630|ref|XP_001267072.1| hypothetical protein NFIA_106630 [Neosartorya fischeri NRRL 181]
gi|119415237|gb|EAW25175.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 488
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 60/137 (43%), Gaps = 10/137 (7%)
Query: 225 PKLVLVNRNARVGRTILN----LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHA 280
P++V+ + R ++N + VKK + + F S + R
Sbjct: 336 PQIVVTFIDRTGSRKLINQEDYFKTVKKQFPHITVQMIDFA---SIPFREQLRIAQESDI 392
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSL 340
+VGVHGAGLTH +FL GSV+++++P G L+ F A +LG Y + ++
Sbjct: 393 LVGVHGAGLTHGIFLPSGSVMVEILPPG---LNHKGFRNLASLLGHLYFSAHATKPAKTV 449
Query: 341 VEKYGANDLVLKNPQAF 357
ND V + F
Sbjct: 450 KRDDWHNDDVYLEHEKF 466
>gi|242056795|ref|XP_002457543.1| hypothetical protein SORBIDRAFT_03g009115 [Sorghum bicolor]
gi|241929518|gb|EES02663.1| hypothetical protein SORBIDRAFT_03g009115 [Sorghum bicolor]
Length = 107
Score = 46.6 bits (109), Expect = 0.022, Method: Composition-based stats.
Identities = 18/34 (52%), Positives = 24/34 (70%)
Query: 90 CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFI 123
C V H +P V +AGGYNGN+FH F D F+P ++
Sbjct: 74 CDVRHDAPVHVVTAGGYNGNYFHAFNDGFLPSWV 107
>gi|386397552|ref|ZP_10082330.1| hypothetical protein Bra1253DRAFT_03074 [Bradyrhizobium sp.
WSM1253]
gi|385738178|gb|EIG58374.1| hypothetical protein Bra1253DRAFT_03074 [Bradyrhizobium sp.
WSM1253]
Length = 366
Score = 46.6 bits (109), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 43/81 (53%), Gaps = 7/81 (8%)
Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
+ V V R + R + N +E+ A + GF + F+ E D F C +VG H
Sbjct: 235 RRVYVPRKGQ--RKVANEQELMPALRKFGFQIYDFDDVE-----DEAAFFSECAIVVGPH 287
Query: 286 GAGLTHSLFLRPGSVLMQVVP 306
GAGLT+ +F PG+ +++++P
Sbjct: 288 GAGLTNLVFCSPGTKVLELIP 308
>gi|300865615|ref|ZP_07110389.1| Tetratricopeptide TPR_2 repeat protein (fragment) [Oscillatoria sp.
PCC 6506]
gi|300336370|emb|CBN55539.1| Tetratricopeptide TPR_2 repeat protein (fragment) [Oscillatoria sp.
PCC 6506]
Length = 438
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 73/163 (44%), Gaps = 14/163 (8%)
Query: 204 SFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE 263
+FL + T SS+ P + ++RN R + N +V + + GF V IF
Sbjct: 282 NFLRRVFLAGITTLSSY----PDRIYISRNKARYRRVFNEEDVMEELSKFGF-VRIFL-- 334
Query: 264 ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP---IG-TQWLSTVYFEK 319
ES L + +V HG+GLT+++F +PG+ ++++V +G W S+ Y
Sbjct: 335 ESMPLQEQIALFFHAKMIVTPHGSGLTNTIFCQPGTKVIELVSPHYVGHYYWPSSHYLHL 394
Query: 320 PARVLGLEYLE-YKIKQ--EESSLVEKYGANDLVLKNPQAFAG 359
L E E Y I+Q ++SL E N LK G
Sbjct: 395 DHYYLTGEVFECYPIRQLMYQNSLTEDIMINLSSLKKMMELIG 437
>gi|357606973|gb|EHJ65302.1| hypothetical protein KGM_14784 [Danaus plexippus]
Length = 272
Score = 46.2 bits (108), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 62/266 (23%), Positives = 112/266 (42%), Gaps = 26/266 (9%)
Query: 55 SLRIKTQPYPRKSDKSAMSKVKELTITTSAP-PNLSCGVTHTSPALVFSAGGYNGNFFHE 113
S R+K + + +S + T P P+ C +T P + N +H
Sbjct: 17 STRLKNEAEHMSALQSWAPEFVNFVKTPGRPIPDGMCDITIDKPTYIMKLDA-TVNMYHH 75
Query: 114 FMDCF-VPLFITINSHFP-----DQDVILAIADCNDQWARKYAELLPRFTRHPIININN- 166
F D F + + +NS P D +++ D + + FT +PI ++
Sbjct: 76 FCDFFNLYASLHVNSTHPSTFSRDNHILVWETFTYDS---AFKDAFKAFTSNPIWDLKEF 132
Query: 167 QTITHCFQSVTLGLISHGRMVI----NPTLLPKPKTLVDFQSF---LANAYNENTNTSSS 219
+ T CF++ L+ RM+ N L+ +T F SF + ++ N + +
Sbjct: 133 RGKTVCFKNAVFPLLP--RMIFGLYYNTPLIYGCETSGLFHSFSKHILHSLNVKLHLRTD 190
Query: 220 FHHTKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPEESTSLADSFRFIHSC 278
+ ++ L++R RTILN +E+ +A ++ G+ V + + H+
Sbjct: 191 ---DRVRITLLSRGTTY-RTILNEQEIVEALLKVKGYYVQRVVYDRTVPFTKQLDITHNT 246
Query: 279 HAMVGVHGAGLTHSLFLRPGSVLMQV 304
+G+HGAGLTH LFL + L +V
Sbjct: 247 DVFIGMHGAGLTHLLFLPDWAALFEV 272
>gi|376004134|ref|ZP_09781892.1| putative tetratricopeptide repeats containing glycosyl transferase,
N-acetylglucosaminyl-transferase P110 family
[Arthrospira sp. PCC 8005]
gi|375327468|emb|CCE17645.1| putative tetratricopeptide repeats containing glycosyl transferase,
N-acetylglucosaminyl-transferase P110 family
[Arthrospira sp. PCC 8005]
Length = 1679
Score = 46.2 bits (108), Expect = 0.027, Method: Composition-based stats.
Identities = 24/81 (29%), Positives = 46/81 (56%), Gaps = 3/81 (3%)
Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
PKL+ ++R++ R +LN + ++ GF T PE + S+A + +V
Sbjct: 638 PKLIYISRHSAKYRRLLNEEAIASLLQQRGF--TTINPE-TLSVAQQAQVFSQAKVIVAS 694
Query: 285 HGAGLTHSLFLRPGSVLMQVV 305
HG+GLT+ +F +PGS +++++
Sbjct: 695 HGSGLTNLIFCQPGSTIIELM 715
>gi|209524105|ref|ZP_03272656.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
gi|209495480|gb|EDZ95784.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
Length = 1676
Score = 46.2 bits (108), Expect = 0.027, Method: Composition-based stats.
Identities = 24/81 (29%), Positives = 46/81 (56%), Gaps = 3/81 (3%)
Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
PKL+ ++R++ R +LN + ++ GF T PE + S+A + +V
Sbjct: 638 PKLIYISRHSAKYRRLLNEEAIASLLQQRGF--TTINPE-TLSVAQQAQVFSQAKVIVAS 694
Query: 285 HGAGLTHSLFLRPGSVLMQVV 305
HG+GLT+ +F +PGS +++++
Sbjct: 695 HGSGLTNLIFCQPGSTIIELM 715
>gi|407844226|gb|EKG01856.1| hypothetical protein TCSYLVIO_007137 [Trypanosoma cruzi]
Length = 591
Score = 46.2 bits (108), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 74/356 (20%), Positives = 131/356 (36%), Gaps = 69/356 (19%)
Query: 90 CGVTHTSPAL-VFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVIL-AIADCN----- 142
C T+PA+ +F G++ +H + + P F T+ F +D + I D
Sbjct: 233 CKYVVTAPAVFIFRMSGHST--YHLWRNNLGPFFDTLRDDFGIKDRLRKGIFDGESPVVI 290
Query: 143 --DQWARK-------YAELLPRFTRHPIININNQTITHCFQSVTLGLI--SHGRMVINPT 191
D+ R ELL F P++ + T CF LG+ S + +
Sbjct: 291 TVDKKPRSGPKAPHLLDELLHYFIDLPVLEASEFTEPTCFTRAILGISTNSFSQAALRHW 350
Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSSSFHHT----------KPKLVLVNR--------- 232
LLP+ F FL + T S H T K L+L N+
Sbjct: 351 LLPRVALQHRFLQFLRKRNKRDVATISGVHATTLRCLMPQYDKRSLILFNQWFAWKLRQW 410
Query: 233 -----------------NARVGRTILNLREVKKAAEELGFDVT------IFEPEESTSLA 269
N GR ++N +V A E +T +F E +
Sbjct: 411 QGLPQRPNVLYLSRNHPNITRGRRVVNEEDVMPALEAAVLAMTGGSLRRVF--LEEMAYV 468
Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE-----KPARVL 324
D + + ++ HG G+ + +++ PGSV+++ VP L +Y + VL
Sbjct: 469 DQIAAMLETNILIAPHGGGIANCVWMPPGSVVVEFVPPAGATLPEMYHKMCRDAAGGGVL 528
Query: 325 GLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDR 380
++++ + +Q+ + L + A + K + W + R ++ LD+ R
Sbjct: 529 PIQHISFVAEQDPAELEPDFAATNPAWKRNKRLFSNIWVSKRQLIEYFAKALDLYR 584
>gi|434398683|ref|YP_007132687.1| Glycosyltransferase AER61, uncharacterized [Stanieria cyanosphaera
PCC 7437]
gi|428269780|gb|AFZ35721.1| Glycosyltransferase AER61, uncharacterized [Stanieria cyanosphaera
PCC 7437]
Length = 900
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 39/84 (46%), Gaps = 3/84 (3%)
Query: 221 HHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHA 280
HH P LV ++R GR ++N EV +LGF E S+ +
Sbjct: 755 HHRYPDLVYISRAQARGRQVINETEVTDLLNQLGFKTVFL---EKMSVLEQVALFAHAKV 811
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQV 304
+V HG+ LT+ +F PG ++++
Sbjct: 812 IVSPHGSSLTNLVFCNPGVTVVEL 835
>gi|110637270|ref|YP_677477.1| capsular polysaccharide biosynthesis protein [Cytophaga
hutchinsonii ATCC 33406]
gi|110279951|gb|ABG58137.1| capsular polysaccharide biosynthesis protein [Cytophaga
hutchinsonii ATCC 33406]
Length = 366
Score = 46.2 bits (108), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 47/215 (21%), Positives = 98/215 (45%), Gaps = 25/215 (11%)
Query: 100 VFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRH 159
V++ ++ N+FH F D LF+ ++ D +L + + + ELL H
Sbjct: 125 VWAHDSWSNNYFHWFNDTLPRLFL-LSKQIEDSVAVLPVELSKITFIVESLELLK--IEH 181
Query: 160 PIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSS 219
I+ Q +H F+S+++ + + INP L + + V F A +
Sbjct: 182 QWID---QKKSHRFESLSVLHTATLQPDINPLLQKQMRDAV----FSAMKIDP------- 227
Query: 220 FHHTKP-KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSC 278
+P + + ++R R I+N +E+ ++ G+D+ E+ S + +
Sbjct: 228 --QERPFRKIYISRAHARYRKIINEQELLPVLKKYGYDIIY---PETYSFKEQVKLFAES 282
Query: 279 HAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLS 313
+A++ +HGAG T+ +F++ + +M++ T+W S
Sbjct: 283 NALISIHGAGHTNCMFMKQDAKVMEIR--NTEWES 315
>gi|167522571|ref|XP_001745623.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775972|gb|EDQ89594.1| predicted protein [Monosiga brevicollis MX1]
Length = 526
Score = 45.8 bits (107), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 64/125 (51%), Gaps = 11/125 (8%)
Query: 229 LVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+ N+NA++ + + K A+ G D ++ + T L D+ S ++GVHG
Sbjct: 385 MCNQNAKMRQLRDESQLWAKLAQRYGHVDFVTYKSRDYT-LTDTIELFKSAEMVLGVHGG 443
Query: 288 GLTHSLFLRPGSVLMQV-VPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGA 346
L + LF RPG+ L+++ +P T T ++E A L L+Y+ ++ ++ + GA
Sbjct: 444 ALANVLFSRPGATLVELTIPRNT----TRHYEHMAAALDLQYVAVDVQPDQRGV----GA 495
Query: 347 NDLVL 351
++VL
Sbjct: 496 PEVVL 500
>gi|444705665|gb|ELW47065.1| hypothetical protein TREES_T100015279 [Tupaia chinensis]
Length = 440
Score = 45.8 bits (107), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 101/230 (43%), Gaps = 40/230 (17%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQ-WARKYAELLPRFTRHPIININ 165
N +H F D F+ L+IT +N+ F DV + + D + + +AE FT + +I++
Sbjct: 187 NMYHHFCD-FINLYITQHVNNSFS-TDVYVVMWDTSSYGYGDLFAETWKAFTDYDVIHLK 244
Query: 166 NQTITH-CFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTK 224
CF+ +LLP+ + + + + L K
Sbjct: 245 TYDSKRVCFKEAVF------------SLLPRMRYGLFYNTPLDG---------------K 277
Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMV 282
++ ++ R+ R ILN E+ A + + F+V I + + + D + H+ +
Sbjct: 278 IRVTILARSTEY-RKILNQNELVNALKTVSTFEVRIVDYKYKQLGFLDQLKITHNTDIFI 336
Query: 283 GVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+ +K
Sbjct: 337 GMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLDL-ARLRGIYYITWK 382
>gi|423066047|ref|ZP_17054837.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
gi|406712546|gb|EKD07731.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
Length = 985
Score = 45.8 bits (107), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 46/81 (56%), Gaps = 3/81 (3%)
Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
PKL+ ++R++ R +LN + ++ GF T PE + S+A + +V
Sbjct: 93 PKLIYISRHSAKYRRLLNEEAIASLLQQRGF--TTINPE-TLSVAQQAQVFSQAKVIVAS 149
Query: 285 HGAGLTHSLFLRPGSVLMQVV 305
HG+GLT+ +F +PGS +++++
Sbjct: 150 HGSGLTNLIFCQPGSTIIELM 170
>gi|317419036|emb|CBN81074.1| Uncharacterized glycosyltransferase [Dicentrarchus labrax]
Length = 525
Score = 45.8 bits (107), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 58/253 (22%), Positives = 109/253 (43%), Gaps = 18/253 (7%)
Query: 89 SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCN-DQW 145
C + P VF N +H F D FV L+I+ IN+ F D+ + + D + +
Sbjct: 224 QCDIIIEKPT-VFMKLDAGVNMYHHFCD-FVNLYISQHINNSF-SSDINIVMWDTSFYGY 280
Query: 146 ARKYAELLPRFTRHPIININNQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDF 202
++E F+ + II++ CF+ L+ R + N L+ + F
Sbjct: 281 GDLFSETWRAFSEYDIIHLKTYDSKRVCFKDAFFSLLPRMRYGLFYNTPLILDCYSEGMF 340
Query: 203 QSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEE---LGFDVTI 259
++F + + + ++ L+ R+ R ILN E+ A + L +V
Sbjct: 341 RAFSQHVLHRLNIPQDGPKEGRVRVTLLARSTEY-RRILNQVELVNALKTVPLLEVNVVD 399
Query: 260 FEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEK 319
++ ++ L R H+ +G+HGAGLTH LFL +V+ ++ + +
Sbjct: 400 YKYKDVPFLV-QLRITHNSDIFIGMHGAGLTHLLFLPDWAVIFELYNCQDE----SCYRD 454
Query: 320 PARVLGLEYLEYK 332
AR+ G+ Y+ ++
Sbjct: 455 LARLRGIRYVTWQ 467
>gi|376001680|ref|ZP_09779540.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375329948|emb|CCE15293.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 883
Score = 45.4 bits (106), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 48/219 (21%), Positives = 91/219 (41%), Gaps = 33/219 (15%)
Query: 94 HTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELL 153
H + F + N+FH ++ L + + S P +++ +++ KY +
Sbjct: 631 HLKGNIAFVSAYCGQNYFHWMVEVIPRLHLVLASGVPIDKIVV------NKFGHKYEDET 684
Query: 154 PRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPT-LLPKPKTLVDFQSFLA----- 207
P +Q + CF+ + ++I P+ L PK +F L
Sbjct: 685 LAMFDIP----EHQKMFGCFRHI------QAEVLIVPSRTLSTPKWACNFVKDLVLKHPF 734
Query: 208 NAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTS 267
A + N SS K+ + NA + R ++N +E+ + LGF V E + S
Sbjct: 735 MAEDNRCNYSS-------KIYISRANAYI-RKVINEQELMDILKPLGFQVVYLE---NMS 783
Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+ +H ++ HGAGLT+ +F PG+ ++++ P
Sbjct: 784 VKQQALCLHHAEVVISPHGAGLTNLVFCEPGTKVIELFP 822
>gi|357607839|gb|EHJ65710.1| hypothetical protein KGM_02173 [Danaus plexippus]
Length = 289
Score = 45.4 bits (106), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 52/210 (24%), Positives = 95/210 (45%), Gaps = 36/210 (17%)
Query: 149 YAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGRMVI----NPTLLPKPKTLVDFQ 203
+ + FT +PI ++ + T CF++ L+ RM+ N L+ +T F
Sbjct: 43 FKDAFKAFTSNPIWDLKEFRGKTVCFKNAVFPLLP--RMIFGLYYNTPLIYGCETSGLFH 100
Query: 204 SF---LANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTI 259
SF + ++ N + + + ++ L++R RTILN +E+ +A ++ G+ V
Sbjct: 101 SFSKHILHSLNVKLHLRTD---DRVRITLLSRGTTY-RTILNEQEIVEALLKVKGYYVQR 156
Query: 260 FEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEK 319
+ + H+ +G+HGAGLTH LFL + L +V E
Sbjct: 157 VVYDRTVPFTKQLDITHNTDVFIGMHGAGLTHLLFLPDWAALFEVYNC----------ED 206
Query: 320 P------ARVLGLEYLEYK-----IKQEES 338
P AR+ GL+Y+ ++ ++QE+S
Sbjct: 207 PNCYADLARLRGLKYVTWEDKSKLVQQEQS 236
>gi|443691665|gb|ELT93454.1| hypothetical protein CAPTEDRAFT_191413 [Capitella teleta]
Length = 501
Score = 45.4 bits (106), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 52/223 (23%), Positives = 93/223 (41%), Gaps = 30/223 (13%)
Query: 130 PDQDVILAIADCNDQWARKY----AELLPRF-------TRHPIININNQTITH------- 171
P+ +I I +++W + E LPR +P I I+ +++ H
Sbjct: 253 PEWTLIDKIFVISERWGEAFFHSLVENLPRLLPWLKYLQENPTIKIHIKSMYHNIIQKVF 312
Query: 172 --CFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVL 229
+ VT G I ++I P +P K V L Y + + + LVL
Sbjct: 313 NISYSRVTAGNI-RANLIILPAGIPCGKPAVYPTQLLW--YQIHKQLPAIPMEERMNLVL 369
Query: 230 VNRNARVGRTILNLREVKK----AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
+ R R R +++ + AE+ V +F+ + S D+ + +V H
Sbjct: 370 IKRLPRKNRYFAFHKDIHRLLVIEAEKHNMTVVVFDDQHMPSFEDTMLMFNQAFMVVSPH 429
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
GAGLT+ LF +PG++L++ + + + F ++VLG Y
Sbjct: 430 GAGLTNMLFSQPGTILIEGL---CRAALNLCFGAMSQVLGFRY 469
>gi|326502512|dbj|BAJ95319.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 45.4 bits (106), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 20/72 (27%), Positives = 40/72 (55%), Gaps = 8/72 (11%)
Query: 235 RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
+V +L + V+ + E + F E + + R++ + H +V VHGAG+T+ F
Sbjct: 323 QVQALLLKTQHVRSSMENVTF--------EHLDIPSTIRYMSNVHILVSVHGAGMTNMFF 374
Query: 295 LRPGSVLMQVVP 306
+ PGS +++++P
Sbjct: 375 MNPGSAVVEIIP 386
>gi|384247851|gb|EIE21336.1| hypothetical protein COCSUDRAFT_43090 [Coccomyxa subellipsoidea
C-169]
Length = 436
Score = 45.4 bits (106), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 52/111 (46%), Gaps = 11/111 (9%)
Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELG------FDVTIFEPEESTSLADSFRFIHSCH 279
++ L+ R + R ILNL EV + L T+ ++ R + +
Sbjct: 244 QVALIERLTQ--RRILNLEEVTQRCRGLAPADGSALTCTVISFDDIIDYPALLRELQTID 301
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVP--IGTQWLSTVYFEKPARVLGLEY 328
+VG+HGAGL +S F+RPG+ +++ P GT+ Y+ P+ V Y
Sbjct: 302 ILVGMHGAGLVNSYFMRPGTAFVEIFPCDFGTE-HHRYYYWHPSHVEAQTY 351
>gi|156353079|ref|XP_001622904.1| predicted protein [Nematostella vectensis]
gi|156209537|gb|EDO30804.1| predicted protein [Nematostella vectensis]
Length = 375
Score = 45.1 bits (105), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 52/208 (25%), Positives = 85/208 (40%), Gaps = 16/208 (7%)
Query: 109 NFFHEFMDCFVPLFIT--INSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININN 166
N +H F D F L+ T +N F I+ ++ FTRHP++ + +
Sbjct: 172 NMYHHFCD-FFNLYATQHVNGSFSTDVNIVLWEAYKRGGLGNFSPTWRVFTRHPLLYLGH 230
Query: 167 QTITH--CFQSVTLGLISHGRMVI----NPTLLPKPKTLVDFQSF---LANAYNENTNTS 217
CF+ L+ RMV N L P F++F L +
Sbjct: 231 DFAGKRVCFKRAIFSLLP--RMVFGLFYNTPLTPGCSGSGLFKAFSNHLVKRLGIVQERN 288
Query: 218 SSFHHTKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIH 276
S ++ L++R + R ILN E+ +A + + I + D + H
Sbjct: 289 ESDVDAPVRVTLLSRGTKY-RDILNENELVEALSSHPAISLKIAKFSWDVPFLDQIKVTH 347
Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
+ +G+HGAGLTH+LFL +VL ++
Sbjct: 348 NTDVFLGMHGAGLTHALFLPDWAVLFEL 375
>gi|443696729|gb|ELT97366.1| hypothetical protein CAPTEDRAFT_185193 [Capitella teleta]
Length = 562
Score = 45.1 bits (105), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 59/119 (49%), Gaps = 5/119 (4%)
Query: 215 NTSSSFHHTKPK-LVLVNRNARVGRTILN----LREVKKAAEELGFDVTIFEPEESTSLA 269
N S F+ K +VL+ R+ + GR + LR ++ LG + ++ SLA
Sbjct: 412 NLRSRFNTNKQDTVVLIRRSKQKGRYFRHHGGILRMLQADVAPLGMKIDVYGDNPLPSLA 471
Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
++ R H ++ HGAG ++ LF +PG++L++ + + + F ++ LGL Y
Sbjct: 472 ETRRVFHQAAVVISPHGAGASNLLFSQPGTLLIEGLHSPFRTRPNLCFLVMSQALGLRY 530
>gi|158300077|ref|XP_320074.4| AGAP009279-PA [Anopheles gambiae str. PEST]
gi|157013827|gb|EAA14911.4| AGAP009279-PA [Anopheles gambiae str. PEST]
Length = 523
Score = 45.1 bits (105), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 51/253 (20%), Positives = 104/253 (41%), Gaps = 19/253 (7%)
Query: 90 CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP-----DQDVILAIADCNDQ 144
C V PA + N +H F D F+ L+ +++ + D D + + + +
Sbjct: 212 CDVVIERPAFIMKIDA-AINMYHHFCD-FLNLYASLHVNLSHAGGFDTDTQVLVWE-SFT 268
Query: 145 WARKYAELLPRFTRHPIININNQT-ITHCFQSVTLGLISHGRMVI----NPTLLPKPKTL 199
+A +A+ F++HPI ++ CF++V L L+ RM+ N ++ +
Sbjct: 269 YASPFADTFKVFSKHPIADLKTYAGKVVCFKNVVLPLLP--RMIFGLYYNTPIIYGCENS 326
Query: 200 VDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTI 259
F +F + + ++ + ++ ++R R R + + + A+ + V
Sbjct: 327 GLFHAFSEHVLHRLKVRMTTRPDERVRITFLSRQTRYRRVLNEDELMGRIAKNPNYAVQR 386
Query: 260 FEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEK 319
+ R + +G+HGAGLTH LFL + L ++ +
Sbjct: 387 VSYGHDLPFVEQLRITRNTDIFIGMHGAGLTHLLFLPKWATLFELY----HCEDPNCYRD 442
Query: 320 PARVLGLEYLEYK 332
AR+ G+ YL ++
Sbjct: 443 LARLRGVHYLTWE 455
>gi|428210488|ref|YP_007100701.1| capsular polysaccharide biosynthesis protein [Oscillatoria
acuminata PCC 6304]
gi|428004938|gb|AFY85468.1| capsular polysaccharide biosynthesis protein [Oscillatoria
acuminata PCC 6304]
Length = 1588
Score = 45.1 bits (105), Expect = 0.067, Method: Composition-based stats.
Identities = 29/102 (28%), Positives = 50/102 (49%), Gaps = 6/102 (5%)
Query: 204 SFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE 263
+FL A+ + S H P+ + + R+ R ++N EV + + GF+ EPE
Sbjct: 615 TFLRQAFLNSVPASDCEH---PERIYIRRSTARNRRLINEEEVIEFLAQFGFEP--IEPE 669
Query: 264 ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
S S + +V HG+GLT+ +F RPG+ +++V
Sbjct: 670 -SLSFVEQIAVFSQAKIIVAPHGSGLTNLVFCRPGTQAIELV 710
>gi|428214106|ref|YP_007087250.1| hypothetical protein Oscil6304_3772 [Oscillatoria acuminata PCC
6304]
gi|428002487|gb|AFY83330.1| tetratricopeptide repeat protein [Oscillatoria acuminata PCC 6304]
Length = 589
Score = 45.1 bits (105), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 29/114 (25%), Positives = 54/114 (47%), Gaps = 6/114 (5%)
Query: 215 NTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF 274
+ +++ P + ++R R ILN E GF + E S S +
Sbjct: 435 DKAATLRRQTPSRIYISRRHASYRRILNEEETIARLSPYGFVPIVLE---SLSFLEQVAL 491
Query: 275 IHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
+ A++ HGAGLT++LF PG+ Q++ I + + +V + + ++GL+Y
Sbjct: 492 FANAKAIIAPHGAGLTNTLFCNPGT---QLIEIFSPDMVSVNYWVVSNIIGLDY 542
>gi|407404675|gb|EKF30041.1| hypothetical protein MOQ_006156 [Trypanosoma cruzi marinkellei]
Length = 590
Score = 45.1 bits (105), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 57/254 (22%), Positives = 98/254 (38%), Gaps = 58/254 (22%)
Query: 151 ELLPRFTRHPIININNQTITHCFQSVTLGLI--SHGRMVINPTLLPKPKTLVDFQSFLAN 208
ELL FT P++ + T CF LG+ S + + LLP+ F FL+
Sbjct: 307 ELLHYFTDLPVLEASEFTEPTCFTRAILGISASSFSQAALRHWLLPRIALRHRFLQFLSK 366
Query: 209 AYNENTNTSSSFHHT---------------------------------KPKLVLVNRN-- 233
+T S H T +P ++ ++RN
Sbjct: 367 RNKREVSTVSRAHATTLRCLIPQYDRRSRIIFNQWFEWKLRQWQELPQRPNVLYLSRNHP 426
Query: 234 -ARVGRTILNLREVKKAAEELGFDVT------IFEPEESTSLADSFRFIHSCHAMVGVHG 286
GR ++N EV A E +T +F E + D + + ++ HG
Sbjct: 427 NITRGRRVVNEEEVIPALEAAVLAMTGGSLRRVFL--EEMAYVDQIATVLETNILIAPHG 484
Query: 287 AGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPAR------VLGLEYLEYKIKQEESSL 340
G+ + +++ PGSV+++ VP L +Y +K R VL ++++ + +Q+ + L
Sbjct: 485 GGIANCVWMPPGSVVVEFVPPAGATLPEMY-DKMCRDAAGGGVLPIQHISFVAEQDPAEL 543
Query: 341 VEKYGANDLVLKNP 354
D V NP
Sbjct: 544 -----EPDFVATNP 552
>gi|311748554|ref|ZP_07722339.1| hypothetical protein ALPR1_19833 [Algoriphagus sp. PR1]
gi|126577073|gb|EAZ81321.1| hypothetical protein ALPR1_19833 [Algoriphagus sp. PR1]
Length = 324
Score = 45.1 bits (105), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 50/213 (23%), Positives = 86/213 (40%), Gaps = 31/213 (14%)
Query: 95 TSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELL- 153
T P ++ ++ N+FH DC L++ +N+ D+ VIL + + + + ELL
Sbjct: 85 TIPHGIWVKDEWSANYFHWMTDCLPRLWLGLNTGLSDR-VILHDSYRHLPYVSQSLELLG 143
Query: 154 --PRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN 211
P + +QS V N L P+ + +F L
Sbjct: 144 IQPTY----------------YQS------QENIWVKNLVLTPRTSSFPNFHEDLTKMTR 181
Query: 212 ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADS 271
E + S T +L+ ++R R N +V+ GF+V E SL +
Sbjct: 182 ERLSVSPK--STPSRLIYISRKYANKRKTHNEIDVELLMIRHGFEVIY---TEKMSLKEQ 236
Query: 272 FRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
+ +V +HGA LT+ LFL GS ++++
Sbjct: 237 IDLMSETKILVSLHGAALTNMLFLPEGSKVVEL 269
>gi|224013993|ref|XP_002296660.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968710|gb|EED87055.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 408
Score = 45.1 bits (105), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 63/140 (45%), Gaps = 21/140 (15%)
Query: 180 LISHGRMVINP---TLLPKPKTLVDFQSFLANAY----NENTNTSSSFHHTKPKLVLVNR 232
LI ++I P LL ++F+ + ++ + ++ + SS P ++R
Sbjct: 200 LIEQADVLIEPPTNGLLWDLAWDMNFECYSSDMFKTFVSQFVDKSSKLDPPDPLACWISR 259
Query: 233 NARVGRTILNL-------REVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
R R+ NL REV A EL F + S+ D ++ C + G+H
Sbjct: 260 QGRDVRSASNLNDTLGVMREVFTNARELRF-------TKEMSIDDVANALNDCRVIFGIH 312
Query: 286 GAGLTHSLFLRPGSVLMQVV 305
GAG ++LF RPG +++++
Sbjct: 313 GAGHMNALFARPGVAVIEII 332
>gi|82232390|sp|Q5NDE3.1|GTDC2_TETNG RecName: Full=Glycosyltransferase-like domain-containing protein 2;
AltName: Full=Extracellular O-linked N-acetylglucosamine
transferase-like; Flags: Precursor
gi|47213764|emb|CAF95593.1| unnamed protein product [Tetraodon nigroviridis]
gi|56605356|emb|CAI30875.1| glycosyltransferase [Tetraodon nigroviridis]
Length = 579
Score = 44.7 bits (104), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 58/244 (23%), Positives = 101/244 (41%), Gaps = 16/244 (6%)
Query: 109 NFFHEFMDCFVPLFITINSHFP-DQDVILAIADCNDQWARKYAELLPRFTRHPII--NIN 165
N H F D +P F T+ D+D L + D+ + L + P++ +
Sbjct: 162 NLMHVFHDDLLPAFYTMKQFLDLDEDARLVFMEGWDEGPHFHLYRLLS-DKQPLLKEQLR 220
Query: 166 NQTITHCFQSVTLGL----ISHGRMVINPTLLPKPKTLVDFQSF--LANAYNENTNTSSS 219
N CF +GL + + P PK LV A E N + +
Sbjct: 221 NFGKLMCFTKSYIGLSKMTTWYQYGFVQPQG-PKANILVSGNEIRHFAKVLMEKMNVTRA 279
Query: 220 FHHTKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSC 278
+ + ++V + R ILN E+ A A+E V EE S A + I +
Sbjct: 280 EGGQEDEYIVVFSRSST-RLILNQAELVMALAQEFQMRVVTVSLEEQ-SFASIVQVIGAA 337
Query: 279 HAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQE 336
+V +HGA L +LFL PG+V++++ P + A + G++ Y+ ++ +E
Sbjct: 338 SMLVSMHGAQLITALFLPPGAVVVELFPFAVNPDQYTPYRTLAALPGMDLHYISWRNTEE 397
Query: 337 ESSL 340
E+++
Sbjct: 398 ENTI 401
>gi|71418840|ref|XP_810985.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70875598|gb|EAN89134.1| hypothetical protein Tc00.1047053506627.90 [Trypanosoma cruzi]
Length = 591
Score = 44.7 bits (104), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 56/279 (20%), Positives = 104/279 (37%), Gaps = 51/279 (18%)
Query: 151 ELLPRFTRHPIININNQTITHCFQSVTLGLI--SHGRMVINPTLLPKPKTLVDFQSFLAN 208
ELL F P++ + T CF LG+ S + + LLP+ F FL
Sbjct: 308 ELLHYFIDLPVLEASEFTEPTCFTRAILGISANSFSQAALRHWLLPRIALQHRFLQFLRK 367
Query: 209 AYNENTNTSSSFHHT---------------------------------KPKLVLVNRN-- 233
+ T S H T +P ++ ++RN
Sbjct: 368 RNKRDVATVSGVHATTLRCLMPQYDRRSPILFNQWFAWKLRQWQWLPRRPNVLYLSRNHP 427
Query: 234 -ARVGRTILNLREVKKAAEELGFDVT------IFEPEESTSLADSFRFIHSCHAMVGVHG 286
GR ++N +V A E +T +F E + D + + ++ HG
Sbjct: 428 NITRGRRVVNEEDVIPALEAAVLAMTGGSLRRVFL--EEMAYVDQIAAMLETNILIAPHG 485
Query: 287 AGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE-----KPARVLGLEYLEYKIKQEESSLV 341
G+ + +++ PGSV+++ VP L +Y + VL ++++ + +Q+ + L
Sbjct: 486 GGIANCVWMPPGSVVVEFVPPAGATLPELYHKMCRDAAGGGVLPIQHISFVAEQDPAELE 545
Query: 342 EKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDR 380
+ A + K + W + R ++ LD+ R
Sbjct: 546 PDFAATNPAWKRNKRLFSNIWVSKRQLIEYFAKALDLYR 584
>gi|302835343|ref|XP_002949233.1| hypothetical protein VOLCADRAFT_104177 [Volvox carteri f.
nagariensis]
gi|300265535|gb|EFJ49726.1| hypothetical protein VOLCADRAFT_104177 [Volvox carteri f.
nagariensis]
Length = 569
Score = 44.7 bits (104), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 50/101 (49%), Gaps = 20/101 (19%)
Query: 226 KLVLVNR---NARVGRTILNLREV-KKAAEELGFDVTIFEPEESTSLADSFRFIH----- 276
K+V ++R + VGRTILN E+ ++ G DV + + + F F H
Sbjct: 314 KIVFISRTPHQSAVGRTILNEAEIIQRCNNAEGADVP--QADWNVPYQKYFCFAHVFGDN 371
Query: 277 ---------SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
++G+HGAGLT+S +++PG +++V P G
Sbjct: 372 QLMDVWLMRQVDVILGMHGAGLTNSFYMKPGGSVIEVRPFG 412
>gi|194041279|ref|XP_001925360.1| PREDICTED: uncharacterized glycosyltransferase AER61-like [Sus
scrofa]
Length = 443
Score = 44.7 bits (104), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 51/97 (52%), Gaps = 6/97 (6%)
Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R ILN E+ A + + F+V I + + + + D R H+ +G+HGAGLTH LFL
Sbjct: 293 RKILNQNELVNALKTVSTFEVRIVDYKYKELAFLDQLRITHNTDIFIGMHGAGLTHLLFL 352
Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+ + ++ G + Y + AR+ G+ Y+ ++
Sbjct: 353 PDWAAVFELYNCGDE---RCYLDL-ARLRGVHYITWR 385
>gi|400530660|gb|AFP86481.1| glycosyltransferase, partial [Anguilla rostrata]
Length = 277
Score = 44.7 bits (104), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 73/160 (45%), Gaps = 22/160 (13%)
Query: 194 PKPKTLV------DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
PK LV F SFLA N T +V+ +R+ + R ILN E+
Sbjct: 2 PKANILVSGNEIRQFSSFLAEKLN--VTVGEQTEKTDEYIVVFSRS--LNRLILNEAELI 57
Query: 248 KA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
A A+E V EE S AD R + +V +HGA L SLFL G+ ++++ P
Sbjct: 58 LALAQEFQMKVVTVSLEEQ-SFADIVRVLSRASMLVSMHGAQLVTSLFLPRGAAVVELYP 116
Query: 307 IGTQWLSTVYFEKPARVL------GLEYLEYKIKQEESSL 340
++ ++ P R L L+Y+ ++ +EE+S+
Sbjct: 117 YA---VNPEHY-APYRTLTSLPGMDLQYVAWRNTKEENSV 152
>gi|242008735|ref|XP_002425156.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212508850|gb|EEB12418.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 508
Score = 44.7 bits (104), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 57/246 (23%), Positives = 101/246 (41%), Gaps = 52/246 (21%)
Query: 109 NFFHEFMDCFVPLFITINSHFPDQDVILA--------------IADCNDQWARKYAELLP 154
N H D PLF+T+ DV + D +++ LLP
Sbjct: 200 NIMHVIHDDLFPLFLTLEFLCMKNDVCMQSFKLIFHDNFPTGPFFDLYKIFSKGNPILLP 259
Query: 155 RFTRHPIININNQTITHCFQSVTLGLI------------SHGRMVINPTLLPKPKTLVDF 202
+ H NNQ + C + + GLI HG +N L +V F
Sbjct: 260 QLLLHN----NNQIL--CIEEMHAGLILDSIWYQYGFNEPHGP--VNNFFLNN-HDIVRF 310
Query: 203 QSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREV--------KKAAEELG 254
S++ N ++N++ + P +V+++R R ILN+ EV KK +
Sbjct: 311 TSYIKTKLNVHSNSTKN-----PDIVIISREK--TRKILNVNEVTEKVKNIMKKLLRKNE 363
Query: 255 FDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLST 314
+V + S + + + +C ++G+HGA + ++F++P S+++++ P Q
Sbjct: 364 INVMCIDLLNS-NFTFFIKILSNCDLVIGMHGAEMIFTIFMKPHSLIIELFPFAIQ-SDI 421
Query: 315 VYFEKP 320
V F KP
Sbjct: 422 VSFIKP 427
>gi|449671770|ref|XP_004207561.1| PREDICTED: glycosyltransferase-like domain-containing protein
2-like isoform 3 [Hydra magnipapillata]
Length = 516
Score = 44.7 bits (104), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 98/236 (41%), Gaps = 24/236 (10%)
Query: 109 NFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ-NQ 167
N H F D + +F T + +P +V L + D D+ Y +L ++ +IN
Sbjct: 111 NLMHVFHDDLMTIFYTKSLFYPHSEVNLVMTDRRDE--GPYFDLYKLYSNSIYTSINFTD 168
Query: 168 TITHCFQSVTLGL------ISHGRMV----INPTLLPKPKTLVDFQSFLANAYNENTNTS 217
T CFQS +GL +G + +N T PK L FL
Sbjct: 169 TDFVCFQSALVGLSKQLTWYQYGFKIPQGPLNNTHYPKNDLLYFKNDFLTRM------NL 222
Query: 218 SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE-ESTSLADSFRFIH 276
S TK L+L + R ILN ++ F + I+ E+ +L + I
Sbjct: 223 SDIPDTKCILLL---SRTTSRKILNEAQLLFKLSTF-FRLPIYSVSLETDALNNIISLIL 278
Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
++ +HGA L +F++PG+VL ++ P + ++ A ++ + Y+ +K
Sbjct: 279 RASLVISMHGAQLILGIFMKPGAVLAELFPYAVPPENYTPYKTLADLISVRYVAWK 334
>gi|123481170|ref|XP_001323514.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121906380|gb|EAY11291.1| hypothetical protein TVAG_061950 [Trichomonas vaginalis G3]
Length = 535
Score = 44.7 bits (104), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 86/188 (45%), Gaps = 34/188 (18%)
Query: 225 PKLVLVNRNARVGRTILNLREVKK-AAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
P +V++ R R I N+ +++ +E + + +S ++ S A++G
Sbjct: 352 PTIVIIERKGS-SRDITNIDDIEDFVKKECDYCKVVRVDLKSLAINAQISLFTSASAIIG 410
Query: 284 VHGAGLTHSLFLRPGS-----VLMQVVPIGTQWLSTVYFEKPARVLGLEYL-----EYKI 333
+HG+GL + L+++P S + +V+P W ++E + V +EY EY I
Sbjct: 411 LHGSGLANVLWMKPTSPEFPTAMFEVMPYN-YWCRD-WYETASNVADVEYFSIMNSEYSI 468
Query: 334 KQEESSL--VEKYGANDLVLKNPQAFAGANWSNMRV--YLKTQNVKLDIDRFRIYLKDAY 389
++E S+ E Y + + +N+ +L+ + VKL+ID LK A+
Sbjct: 469 LKDEDSIRAFECYSSRSMC------------TNIVCHDFLRDKKVKLNID----LLKKAW 512
Query: 390 KKAKKLMD 397
+K +D
Sbjct: 513 SPFQKKLD 520
>gi|194385256|dbj|BAG65005.1| unnamed protein product [Homo sapiens]
Length = 251
Score = 44.3 bits (103), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 42/182 (23%), Positives = 80/182 (43%), Gaps = 10/182 (5%)
Query: 156 FTRHPIININNQTITH-CFQSVTLGLISHGR--MVINPTLLPKPKTLVDFQSFLANAYNE 212
FT + +I++ CF+ L+ R + N L+ + F++F + +
Sbjct: 17 FTDYDVIHLKTYDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCQNTGLFRAFAQHVLHR 76
Query: 213 NTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLAD 270
T K ++ ++ R+ R ILN E+ A + + F+V I + + D
Sbjct: 77 LNITQEGPKDGKIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLD 135
Query: 271 SFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLE 330
R H+ +G+HGAGLTH LFL + + ++ + Y + AR+ G+ Y+
Sbjct: 136 QLRITHNTDIFIGMHGAGLTHLLFLPDWAAVFELYNCEDE---RCYLD-LARLRGVHYIT 191
Query: 331 YK 332
++
Sbjct: 192 WR 193
>gi|113476439|ref|YP_722500.1| hypothetical protein Tery_2853 [Trichodesmium erythraeum IMS101]
gi|110167487|gb|ABG52027.1| TPR repeat [Trichodesmium erythraeum IMS101]
Length = 3172
Score = 44.3 bits (103), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 53/105 (50%), Gaps = 8/105 (7%)
Query: 203 QSFLANAYNE--NTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIF 260
Q FL+ E N +SSF+ P+ + ++RN R + N EV E+GF
Sbjct: 625 QVFLSRIKQEIFTENKTSSFY---PERIYISRNNSRYRRVFNEEEVLLKLSEIGF--VCI 679
Query: 261 EPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
+P+ S +L + ++G HG+GLT+ +F G+ ++++V
Sbjct: 680 QPD-SMNLKEQIAIFSHAKVIIGAHGSGLTNIIFSPRGTKVIELV 723
>gi|448459513|ref|ZP_21596671.1| capsular polysaccharide biosynthesis protein-like protein
[Halorubrum lipolyticum DSM 21995]
gi|445808307|gb|EMA58378.1| capsular polysaccharide biosynthesis protein-like protein
[Halorubrum lipolyticum DSM 21995]
Length = 414
Score = 44.3 bits (103), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 53/120 (44%), Gaps = 12/120 (10%)
Query: 211 NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEE--STSL 268
NE T+ T P V V+R R + N E++ EE G EP E + SL
Sbjct: 270 NEATDGLDLSDDTGPDWVYVSRQNADTRRVANYAEIQPVLEEYGI-----EPVECETLSL 324
Query: 269 ADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
+ R S +VG HGAGLT ++ S + VV I + Y+ A VLG +Y
Sbjct: 325 DEQIRLFSSVEGIVGPHGAGLTGLVW----SSDLDVVEIFNNVVKGPYYVL-AHVLGHDY 379
>gi|113474031|ref|YP_720092.1| group 1 glycosyl transferase [Trichodesmium erythraeum IMS101]
gi|110165079|gb|ABG49619.1| glycosyl transferase, group 1 [Trichodesmium erythraeum IMS101]
Length = 3301
Score = 44.3 bits (103), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 53/105 (50%), Gaps = 8/105 (7%)
Query: 203 QSFLANAYNE--NTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIF 260
Q FL+ E N +SSF+ P+ + ++RN R + N EV E+GF
Sbjct: 625 QVFLSRIKQEIFTENKTSSFY---PERIYISRNNSRYRRVFNEEEVLLKLSEIGF--VCI 679
Query: 261 EPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
+P+ S +L + ++G HG+GLT+ +F G+ ++++V
Sbjct: 680 QPD-SMNLKEQIAIFSHAKVIIGAHGSGLTNIIFSPRGTKVIELV 723
>gi|221129933|ref|XP_002163521.1| PREDICTED: glycosyltransferase-like domain-containing protein
2-like isoform 1 [Hydra magnipapillata]
gi|449671768|ref|XP_004207560.1| PREDICTED: glycosyltransferase-like domain-containing protein
2-like isoform 2 [Hydra magnipapillata]
Length = 551
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 98/236 (41%), Gaps = 24/236 (10%)
Query: 109 NFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININ-NQ 167
N H F D + +F T + +P +V L + D D+ Y +L ++ +IN
Sbjct: 146 NLMHVFHDDLMTIFYTKSLFYPHSEVNLVMTDRRDE--GPYFDLYKLYSNSIYTSINFTD 203
Query: 168 TITHCFQSVTLGL------ISHGRMV----INPTLLPKPKTLVDFQSFLANAYNENTNTS 217
T CFQS +GL +G + +N T PK L FL
Sbjct: 204 TDFVCFQSALVGLSKQLTWYQYGFKIPQGPLNNTHYPKNDLLYFKNDFLTRM------NL 257
Query: 218 SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE-ESTSLADSFRFIH 276
S TK L+L + R ILN ++ F + I+ E+ +L + I
Sbjct: 258 SDIPDTKCILLL---SRTTSRKILNEAQLLFKLSTF-FRLPIYSVSLETDALNNIISLIL 313
Query: 277 SCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
++ +HGA L +F++PG+VL ++ P + ++ A ++ + Y+ +K
Sbjct: 314 RASLVISMHGAQLILGIFMKPGAVLAELFPYAVPPENYTPYKTLADLISVRYVAWK 369
>gi|291566530|dbj|BAI88802.1| TPR domain protein [Arthrospira platensis NIES-39]
Length = 1687
Score = 44.3 bits (103), Expect = 0.11, Method: Composition-based stats.
Identities = 41/162 (25%), Positives = 76/162 (46%), Gaps = 13/162 (8%)
Query: 218 SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHS 277
S PKL+ ++R++ R +LN + ++ GF +TI E+ S+A+ +
Sbjct: 631 SKLGQKTPKLIYISRHSAKYRRLLNEEAIAALLQQRGF-ITI--SPETLSVAEQAQVFSQ 687
Query: 278 CHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEE 337
+V HG+GLT+ +F +P + ++++ + ++ YF +R LGL + Y I E
Sbjct: 688 AEVIVASHGSGLTNLIFCQPETTIIEL--MSPHYIRP-YFWGLSRQLGLHH--YYITGEA 742
Query: 338 SSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDID 379
++ Y L+ P A W N + L+ + L D
Sbjct: 743 ---LDCYPLRQLMY--PTALTEDIWINPQTLLQLLDSVLTTD 779
>gi|321252819|ref|XP_003192530.1| hypothetical protein CGB_C0550W [Cryptococcus gattii WM276]
gi|317458999|gb|ADV20743.1| Hypothetical Protein CGB_C0550W [Cryptococcus gattii WM276]
Length = 462
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 23/91 (25%), Positives = 50/91 (54%), Gaps = 12/91 (13%)
Query: 224 KPKLVLVNR-------NARVGRTILN-LREVKKAAEELGFDVTIFEPEESTSLADSFRFI 275
KPK+V V+R + V + +L L++++KA + + D + + E+ + F
Sbjct: 302 KPKIVYVDRQRTERKFDMEVHKQLLKRLKKIEKAKKAVVVDAVLDDLEKK----EQFEMF 357
Query: 276 HSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
++G+HG GL+H L++ G ++++++P
Sbjct: 358 SDADIILGIHGNGLSHELWMPEGGIIIEILP 388
>gi|400530662|gb|AFP86482.1| glycosyltransferase, partial [Conger oceanicus]
Length = 277
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 73/160 (45%), Gaps = 22/160 (13%)
Query: 194 PKPKTLV------DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
PK LV F +FLA N +V+ +R+ + R ILN E+
Sbjct: 2 PKANILVSGNEIRQFSAFLAEKLN--VTVGEPIEKMDEYIVVFSRS--LNRLILNEAELI 57
Query: 248 KA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
A A+E V EE S AD R + +V +HGA L SLFL G+ ++++ P
Sbjct: 58 LALAQEFQMKVVTISLEEQ-SFADIIRALSRASMLVSMHGAQLVCSLFLPRGAAVVELFP 116
Query: 307 IGTQWLSTVYFEKPARVL------GLEYLEYKIKQEESSL 340
G ++ ++ P R L L+Y+ ++ +EE+S+
Sbjct: 117 YG---VNPEHY-APYRTLTSLPGMDLQYVAWRNTKEENSV 152
>gi|400530668|gb|AFP86485.1| glycosyltransferase, partial [Myrichthys maculosus]
Length = 277
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 73/156 (46%), Gaps = 14/156 (8%)
Query: 194 PKPKTLV------DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
PK LV F +FLA N + + +V+ +R+ + R ILN E+
Sbjct: 2 PKANILVSGNEIRQFSAFLAEKLNVSVGEPAEM--IDEYIVVFSRS--LNRLILNEAELI 57
Query: 248 KA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
A A+E V EE S AD R I +V +HGA L +LFL G+ ++++ P
Sbjct: 58 LALAQEYQMKVVTVSLEEQ-SFADIVRVISRASMLVSMHGAQLVTALFLPRGAAVVELFP 116
Query: 307 IGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
G ++ A + G++ Y+ ++ +EE+S+
Sbjct: 117 YGVNPEHYAPYKTLASLPGMDLHYVAWRNTKEENSV 152
>gi|123432397|ref|XP_001308412.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121890091|gb|EAX95482.1| hypothetical protein TVAG_222780 [Trichomonas vaginalis G3]
Length = 295
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 33/165 (20%), Positives = 80/165 (48%), Gaps = 16/165 (9%)
Query: 171 HCFQSVTLGL--ISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLV 228
HCF+S + G+ I+ G KP ++ ++ L + + + K V
Sbjct: 85 HCFESFSFGIDKIAIGSESYYYNYDIKPNQTINLRNILLSQLD--------IKKSNEKKV 136
Query: 229 LVNRNARVGRTILNLREVKKAAEEL--GFDVTIFEPEESTSLADSFRFIHSCHAMVGVHG 286
++++N G I N+ +V ++L ++++I E + + + + R + ++G
Sbjct: 137 IIHQNTGTG-VIRNINDVVDVIKKLFPDYNISIIELNKLSKI-EQIREVSQADVLIGPTS 194
Query: 287 AGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY 331
L+ +F++P S+L+++ P L+ +++ A+ GL++++Y
Sbjct: 195 PALSSLVFMKPSSILIEINPYKYSCLN--WYQTAAKGAGLKFIKY 237
>gi|334120934|ref|ZP_08495010.1| hypothetical protein MicvaDRAFT_2953 [Microcoleus vaginatus FGP-2]
gi|333455653|gb|EGK84296.1| hypothetical protein MicvaDRAFT_2953 [Microcoleus vaginatus FGP-2]
Length = 375
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 50/103 (48%), Gaps = 9/103 (8%)
Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
KL L R + + +REV K FD I E + S ++ + R +VG H
Sbjct: 246 KLYLSRSGRRKVKNEVQVREVLKE-----FDFEILE-DISRTVDEQIRLFAEAAVVVGPH 299
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
GAG T+ L+ +PG+ +++ G + YF ++LGLEY
Sbjct: 300 GAGFTNLLWCQPGTKVLEFFYGG---YTPPYFYYICQLLGLEY 339
>gi|242806215|ref|XP_002484699.1| DUF563 domain protein [Talaromyces stipitatus ATCC 10500]
gi|218715324|gb|EED14746.1| DUF563 domain protein [Talaromyces stipitatus ATCC 10500]
Length = 459
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 40/81 (49%), Gaps = 6/81 (7%)
Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE 327
L + + I +VGVHGAGLTH L+L S +++++P G Q F A LG +
Sbjct: 347 LQEQVQIIRGTDVLVGVHGAGLTHGLWLPQRSAMVEILPEGFQHKG---FRNLAGALGHD 403
Query: 328 YLEYKIKQEESSLVEKYGAND 348
Y ++S GAND
Sbjct: 404 YFSTHASTLQTS---SRGAND 421
>gi|443704319|gb|ELU01420.1| hypothetical protein CAPTEDRAFT_225629 [Capitella teleta]
Length = 515
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 65/141 (46%), Gaps = 23/141 (16%)
Query: 233 NARVGRTILNLREV----KKAAEEL---GFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
+ V R I N RE+ + ++L G + FE + D R I S +VG+H
Sbjct: 379 DGHVVRKIHNEREILNHLQSKYQQLRVAGVQIDKFE------MVDQLRQIASTDILVGMH 432
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLS-TVYFEKPARVLGLEYLEYKIKQEESSLVEKY 344
GAGLTH+LFL + L+++ P+ W S +F+ A L Y ++ + + +
Sbjct: 433 GAGLTHALFLPRHAALVELFPL--YWSSINAHFKSIAAWRNLTYTSWENRDQSLEKANHF 490
Query: 345 -----GANDLVLKNPQAFAGA 360
G D ++KN A+ G
Sbjct: 491 THVPVGIADTLIKN--AYKGV 509
>gi|404328964|ref|ZP_10969412.1| Capsular polysaccharide biosynthesis protein-like protein
[Sporolactobacillus vineae DSM 21990 = SL153]
Length = 354
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 39/152 (25%), Positives = 74/152 (48%), Gaps = 16/152 (10%)
Query: 205 FLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEE 264
FL +++ +++N+ K + ++R + R I N + + + GF+ E
Sbjct: 208 FLRDSFLKDSNSIGR------KRIYISR--KWSRKITNEDLLMEIIYKYGFEKVELE--- 256
Query: 265 STSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVL 324
S S+ + S A++GVHGA LT+ F RPG+ ++++ +F + +
Sbjct: 257 SLSVEEQVHLFSSAEAIIGVHGAALTNLTFCRPGTKVLEIFAPN---YIIAHFYGISSMG 313
Query: 325 GLEYLEYKIKQEESSLVEKY-GANDLVLKNPQ 355
L+Y Y I ++ S L + + GA D+VL P+
Sbjct: 314 DLDYY-YYIGEKSSRLSKGWPGAEDIVLNIPR 344
>gi|405962463|gb|EKC28135.1| Beta-(1,2)-xylosyltransferase [Crassostrea gigas]
Length = 504
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 57/116 (49%), Gaps = 4/116 (3%)
Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGL 289
N V R + N +E++ + +L + I + E+ S+ D I + ++G+HGAGL
Sbjct: 364 NPKGTVSRKVKNEKEIEDSVAKLYPNHLIESFQTENMSMTDQLSAIVNTDILIGMHGAGL 423
Query: 290 THSLFLRPGSVLMQVVPIGTQWLS-TVYFEKPARVLGLEYLEYKIKQEESSLVEKY 344
T +LFL + L+++ P W S +F AR L Y+ ++ + ++ Y
Sbjct: 424 TLALFLPKHAGLIEIYP--KYWSSDNAHFRAIARWRNLHYIHWQNVESKNEFTNHY 477
>gi|400530658|gb|AFP86480.1| glycosyltransferase, partial [Elops saurus]
Length = 278
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 70/156 (44%), Gaps = 13/156 (8%)
Query: 194 PKPKTLV------DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
PK LV F +FLA N + +V+ +R+ + R ILN E+
Sbjct: 2 PKANILVSGNEIRQFAAFLAEKLNVTVEERAE-EQDDAYIVVFSRS--INRLILNEAELI 58
Query: 248 KA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
A A+E V EE SLAD R + +V +HGA L SLFL G+ ++++ P
Sbjct: 59 LALAQEFQMRVVTVSLEEH-SLADIVRVVGGASMLVSMHGAQLITSLFLPRGAAVVELFP 117
Query: 307 IGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
+ A + G L+Y+ ++ EE+S+
Sbjct: 118 YAVNPEHYAPYRTLASLPGMDLQYVVWRNTMEENSV 153
>gi|409076566|gb|EKM76937.1| hypothetical protein AGABI1DRAFT_78054 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 530
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 52/105 (49%), Gaps = 11/105 (10%)
Query: 211 NENTNTSSSFHHTKPKLVLVNRNARVGRTI-----LNLREVKKAAEELGF-DVTIFEPEE 264
N + S +KP ++ +NR + GR + +L E +A EE G ++ I EE
Sbjct: 350 NRTVISPSHVKSSKPVVIYLNRQ-KTGRRLDEESHESLVEALEALEEGGVCELQIVRMEE 408
Query: 265 STSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP---GSVLMQVVP 306
L + R + MVGVHG GLTH L++ P SV+ V+P
Sbjct: 409 -IGLREQIRLVSQSTIMVGVHGNGLTHQLWMPPSKRSSVIEIVMP 452
>gi|389631144|ref|XP_003713225.1| hypothetical protein MGG_07979 [Magnaporthe oryzae 70-15]
gi|351645557|gb|EHA53418.1| hypothetical protein MGG_07979 [Magnaporthe oryzae 70-15]
Length = 517
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 56/119 (47%), Gaps = 13/119 (10%)
Query: 222 HTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEEST-SLADSFRFI-HSCH 279
K ++ ++R R R +L + AA DV + + +T S + R + H
Sbjct: 377 QKKLRVTFIDR--RGSRKLLGQERLLDAARRAYPDVQVRSVDFATLSFVEQIRLVRHETD 434
Query: 280 AMVGVHGAGLTHSLFLRP------GSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+VGVHGAGLTH +FLR G +++++P ++ F A +LG +Y K
Sbjct: 435 VLVGVHGAGLTHVMFLRAASEAVGGGAIVEILP---DVMNYKGFRNLAYMLGHDYFSVK 490
>gi|78183683|ref|YP_376117.1| capsular polysaccharide biosynthesis protein-like [Synechococcus
sp. CC9902]
gi|78167977|gb|ABB25074.1| Capsular polysaccharide biosynthesis protein-like [Synechococcus
sp. CC9902]
Length = 351
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 41/77 (53%), Gaps = 3/77 (3%)
Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
KL + NA+ RTI N+ EVK + G+ ++ +P E SL + +VGV
Sbjct: 235 EKLFIKRGNAKNNRTIDNIEEVKLLCIDKGY--SVIDPGE-MSLLEQIEVFRKSTKVVGV 291
Query: 285 HGAGLTHSLFLRPGSVL 301
HG+ + L+++ GSV+
Sbjct: 292 HGSAFVNMLYMKSGSVI 308
>gi|220910531|ref|YP_002485842.1| Capsular polysaccharide biosynthesis protein-like protein
[Cyanothece sp. PCC 7425]
gi|219867142|gb|ACL47481.1| Capsular polysaccharide biosynthesis protein-like protein
[Cyanothece sp. PCC 7425]
Length = 423
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 22/81 (27%), Positives = 44/81 (54%), Gaps = 3/81 (3%)
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
T+ +L+ ++R R +LN E+ + E GF+ + E S+ + + + +V
Sbjct: 275 TQRRLIYISRARATHRRVLNEAELLQFLEGFGFEPVVLE---QLSVVEQAQLFSAADIIV 331
Query: 283 GVHGAGLTHSLFLRPGSVLMQ 303
HGAGLT+ +F +PG+ +++
Sbjct: 332 APHGAGLTNLVFCQPGTKVIE 352
>gi|242057407|ref|XP_002457849.1| hypothetical protein SORBIDRAFT_03g015771 [Sorghum bicolor]
gi|241929824|gb|EES02969.1| hypothetical protein SORBIDRAFT_03g015771 [Sorghum bicolor]
Length = 49
Score = 43.5 bits (101), Expect = 0.18, Method: Composition-based stats.
Identities = 19/43 (44%), Positives = 28/43 (65%)
Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
LA +R +++ MVGVH +TH LF+RPGS+ +Q + I Q
Sbjct: 4 LAKMYRALNASDIMVGVHDMAMTHFLFMRPGSLFIQALGIWHQ 46
>gi|356558433|ref|XP_003547511.1| PREDICTED: V-type proton ATPase 116 kDa subunit a isoform 1-like
[Glycine max]
Length = 822
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 62/148 (41%), Gaps = 9/148 (6%)
Query: 58 IKTQ-PYPRKSDKSAMSKVKELTITTSAPPNLSC--GVTHTSPALVFSAGGYNGNFFHEF 114
IK Q PYP D S EL+ S +S GV H + ++ S +N +FF
Sbjct: 520 IKYQDPYPFGVDPSWRGSRSELSFLNSLKMKMSILFGVAHMNLGIILSY--FNAHFFQNS 577
Query: 115 MDC---FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITH 171
+D FVP I +NS F V++ I C A Y ++ F P N+ +
Sbjct: 578 LDIRYQFVPQMIFLNSLFGYLSVLIVIKWCTGSQADLYHVMIYMFLS-PTDNLGENQLFW 636
Query: 172 CFQSVTLGLISHGRMVINPTLLPKPKTL 199
+ + + L+ + + L PKP L
Sbjct: 637 GQRPLQIVLLLLAVIAVPWMLFPKPFIL 664
>gi|443728436|gb|ELU14792.1| hypothetical protein CAPTEDRAFT_132480 [Capitella teleta]
Length = 548
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 62/260 (23%), Positives = 106/260 (40%), Gaps = 26/260 (10%)
Query: 108 GNFFHEFMDCFVPLFITI------NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPI 161
N H F D +P+F T+ +S DV + D D + + L +R PI
Sbjct: 137 SNIMHAFHDDLLPMFHTLRLITLHDSTTMPFDVQIVFMDNADHGSFSHLYGLYSSSR-PI 195
Query: 162 I--NINNQTITHCFQSVTLGL------ISHGRMVIN---PTLLPKPKTLVDFQSFLANAY 210
+I + T CF+ +GL +G P + + F +F +
Sbjct: 196 TKRDIQDAGETICFREAHVGLDKSMTWYQYGFFEPQGPLPDIHVTSSHIAHFANFTRHRL 255
Query: 211 NENTNTSSSFHHTKPKLVLVNRNARVGRTILN-LREVKKAAEELGFDVTIFEPEESTSLA 269
N S S TK ++L ++ R+ ILN L A++ V + E T+ A
Sbjct: 256 NLTNQASPS---TKIAVILSRKHNRL---ILNELSLSSNLAQQFNLKVVLVSLETHTA-A 308
Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYL 329
+ I +VG+HG+ SLFL PGSVL+++ P G ++ A + + Y
Sbjct: 309 EIIETIGQADLLVGMHGSLFIMSLFLPPGSVLLELFPYGVNPKHYTPYKTLANIQRITYH 368
Query: 330 EYKIKQEESSLVEKYGANDL 349
++ ++ ++ Y +L
Sbjct: 369 SWRNLNKDHAVPHPYRPREL 388
>gi|409991065|ref|ZP_11274360.1| hypothetical protein APPUASWS_08635, partial [Arthrospira platensis
str. Paraca]
gi|409938067|gb|EKN79436.1| hypothetical protein APPUASWS_08635, partial [Arthrospira platensis
str. Paraca]
Length = 965
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 58/111 (52%), Gaps = 6/111 (5%)
Query: 218 SSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHS 277
S PKL+ ++R++ R +LN + ++ GF +TI E+ S+A+ +
Sbjct: 629 SKLGQKTPKLIYISRHSAKYRRLLNEEAIAALLQQRGF-ITI--SPETLSVAEQAQVFSQ 685
Query: 278 CHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
+V HG+GLT+ +F +P + ++++ + ++ YF +R LGL +
Sbjct: 686 AEVIVASHGSGLTNLIFCQPETTIIEL--MSPHYIRP-YFWGLSRQLGLHH 733
>gi|400530670|gb|AFP86486.1| glycosyltransferase, partial [Saccopharynx ampullaceus]
Length = 277
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 68/156 (43%), Gaps = 14/156 (8%)
Query: 194 PKPKTLV------DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
PK LV F SFLA N T +V+ +R+ + R ILN E+
Sbjct: 2 PKANILVSGNEIRQFSSFLAEKLN--VTVGEPVEKTNEYIVVFSRS--LNRLILNEAELI 57
Query: 248 KA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
A A+E V EE S D R + +V +HGA L SLFL G+ ++++ P
Sbjct: 58 LALAQEFQMKVVTVSLEEQ-SFTDIVRVLGRASMLVSMHGAQLVTSLFLPRGAAVVELYP 116
Query: 307 IGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
+ A + G L+Y+ ++ EE+S+
Sbjct: 117 YAVNPEHYAPYRTLASLPGMDLQYVAWRNTMEENSV 152
>gi|428205792|ref|YP_007090145.1| capsular polysaccharide biosynthesis protein-like protein
[Chroococcidiopsis thermalis PCC 7203]
gi|428007713|gb|AFY86276.1| capsular polysaccharide biosynthesis protein-like protein
[Chroococcidiopsis thermalis PCC 7203]
Length = 423
Score = 43.5 bits (101), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 47/101 (46%), Gaps = 8/101 (7%)
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+ + R + GRTI N EV +LGF + E S AD R +V HGA
Sbjct: 272 IYIVRTKKTGRTITNEEEVLSILTQLGF---VGYTLEELSFADQVRLFSQAEMVVAAHGA 328
Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
GL + +F ++V+ + + + YF +++LG +Y
Sbjct: 329 GLANIMFAEN----LKVIELFGSYGTAAYF-VLSKMLGFDY 364
>gi|119485065|ref|ZP_01619450.1| TPR repeat protein [Lyngbya sp. PCC 8106]
gi|119457293|gb|EAW38418.1| TPR repeat protein [Lyngbya sp. PCC 8106]
Length = 815
Score = 43.5 bits (101), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 22/83 (26%), Positives = 43/83 (51%), Gaps = 3/83 (3%)
Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
K + ++RN R +N + V + E+ GF E + S+ + ++++ +V H
Sbjct: 680 KRIYIDRNKAGYRKTINNQAVVQLLEKYGFQRIYLE---TLSIVEQAHYLNAAEVVVAPH 736
Query: 286 GAGLTHSLFLRPGSVLMQVVPIG 308
GAGLT+ F PG+ ++++ G
Sbjct: 737 GAGLTNLTFCNPGTKVIEIFSPG 759
>gi|374329929|ref|YP_005080113.1| Capsular polysaccharide biosynthesis protein-like protein
[Pseudovibrio sp. FO-BEG1]
gi|359342717|gb|AEV36091.1| Capsular polysaccharide biosynthesis protein-like protein
[Pseudovibrio sp. FO-BEG1]
Length = 399
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 51/111 (45%), Gaps = 8/111 (7%)
Query: 220 FHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCH 279
H T + + + RN R ILN + + E GF + P + T + + S
Sbjct: 263 MHKTPTQKLYIARNDAKTRRILNETTLIEQLEARGFQSVV--PGKLTH-REQVKLFSSAK 319
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPI-----GTQWLSTVYFEKPARVLG 325
+VG HGAGLT+ LF + G L+++ P WLS V + A V+G
Sbjct: 320 IIVGTHGAGLTNLLFTQAGGKLVEIFPADYIQSAYAWLSHVRGLEYAPVIG 370
>gi|402813093|ref|ZP_10862688.1| capsular polysaccharide biosynthesis-like protein [Paenibacillus
alvei DSM 29]
gi|402509036|gb|EJW19556.1| capsular polysaccharide biosynthesis-like protein [Paenibacillus
alvei DSM 29]
Length = 290
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 46/103 (44%), Gaps = 6/103 (5%)
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+ V+R R ++N EV + E GF + P S A+ S +V G+
Sbjct: 160 IYVSRQDAFARHVINEEEVMQVLAEKGFVRIVLTP---LSTAEKIAIYSSAQVIVSPFGS 216
Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLE 330
G +++F PGS L+++ PI YF K + G+ Y E
Sbjct: 217 GSINTVFCNPGSTLIELTPITVM---DGYFWKISNHAGMNYYE 256
>gi|427717364|ref|YP_007065358.1| hypothetical protein Cal7507_2081 [Calothrix sp. PCC 7507]
gi|427349800|gb|AFY32524.1| hypothetical protein Cal7507_2081 [Calothrix sp. PCC 7507]
Length = 394
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 65/245 (26%), Positives = 101/245 (41%), Gaps = 38/245 (15%)
Query: 95 TSPALVFS---AGGYNGNFFHEFMDCFVPL-FITINSHFPDQDVILAIADCNDQWARKYA 150
T A+VFS G N+FH DC L + ++ + D D+ L + KY+
Sbjct: 137 TFDAVVFSLLTGGAGQYNYFHWLFDCLPRLHLLKLSGLYDDIDLFLVPS-------LKYS 189
Query: 151 ELLPRFTRHPI--ININ-NQTITHCFQ-SVTLGLISHGRMVINPTLLPKPKTLVDF--QS 204
F R + +NI+ +TI + ++T+ + I + P P + DF QS
Sbjct: 190 -----FQRETLSYLNIDLARTIDSIYNPNITVKKLVVSDHPIEASSEPIPAWICDFLRQS 244
Query: 205 FLANAYNENTNTSSSFHHTKPKLVLVNR-NARVGRTILNLREVKKAAEELGFDVTIFEPE 263
FL A K + +NR +A R ILN E+ +A +LGF I
Sbjct: 245 FLHKAI---------IPQLKSNKIYINRKDASNNRKILNEDELVEALTKLGF---ISISA 292
Query: 264 ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARV 323
SL + + +V HGAGL + +F + + +VV I +E +
Sbjct: 293 TDYSLCEQIGIFANAEVIVAAHGAGLGNLVFCQAET---KVVEIFNYEFQPKMYEHLSHR 349
Query: 324 LGLEY 328
LGL Y
Sbjct: 350 LGLPY 354
>gi|443702643|gb|ELU00569.1| hypothetical protein CAPTEDRAFT_211593 [Capitella teleta]
Length = 481
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 62/281 (22%), Positives = 108/281 (38%), Gaps = 49/281 (17%)
Query: 108 GNFFHEFMDCFVPLFITINS-------------HFPDQDVILAIADCNDQWARKYAELLP 154
GNF H + D FV L+ +N + D+ + DC+D KY + LP
Sbjct: 201 GNFHHFWNDEFVMLYSLMNISNRLRPEARNHVLYRSPTDLSPSEVDCDD--IHKYQDFLP 258
Query: 155 RF-TRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNEN 213
HP CF++ G+ + L LV Q L+ A
Sbjct: 259 YINVVHPHEVFYKMPSKTCFRNGVFGMAN----------LNYDSKLV-VQHVLSKA---- 303
Query: 214 TNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
+ S H K L++ R R ++N E+ +AA + GF ++ S+ + R
Sbjct: 304 GISPSECAHLKRVLIVQRRQ----RRLMNANELLQAAIDEGFPDSVVIDFAEYSVEEQIR 359
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE------ 327
M G+ GAGL ++F+ GS L+++ W YF G++
Sbjct: 360 LSACSTIMAGIQGAGLQWAIFMPVGSTLIEIAWPQKHW--GFYFNSFVTGYGIDHQTLAT 417
Query: 328 ---YLEYKIKQE---ESSLVEKYGANDLVLKNPQAFAGANW 362
++ +++ QE +V + D++ +P++ W
Sbjct: 418 DRVHVNWRVYQERVRRGQVVPEEERLDMLHSSPKSIIDNVW 458
>gi|426194527|gb|EKV44458.1| hypothetical protein AGABI2DRAFT_208690 [Agaricus bisporus var.
bisporus H97]
Length = 538
Score = 43.1 bits (100), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 52/105 (49%), Gaps = 11/105 (10%)
Query: 211 NENTNTSSSFHHTKPKLVLVNRNARVGRTI-----LNLREVKKAAEELGF-DVTIFEPEE 264
N + S +KP ++ +NR + GR + +L E +A EE G ++ I EE
Sbjct: 359 NRTVISPSHVKSSKPVVIYLNRQ-KTGRRLDEESHESLVEALEALEEGGVCELQIVRMEE 417
Query: 265 STSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP---GSVLMQVVP 306
L + R + MVGVHG GLTH L++ P SV+ V+P
Sbjct: 418 -IGLREQIRLVSRSTIMVGVHGNGLTHQLWMPPSKRSSVIEIVMP 461
>gi|348575416|ref|XP_003473485.1| PREDICTED: uncharacterized glycosyltransferase AER61-like isoform 2
[Cavia porcellus]
Length = 443
Score = 43.1 bits (100), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 57/111 (51%), Gaps = 7/111 (6%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAM 281
K ++ ++ R+ R ILN E+ A + + F+V I + + + D R H+
Sbjct: 280 KIRVTILARSTEY-RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDIF 338
Query: 282 VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+G+HGAGLTH LFL +V+ ++ + Y + AR+ G+ Y+ ++
Sbjct: 339 IGMHGAGLTHLLFLPDWAVVFELYNCEDE---HCYLDL-ARLRGIHYITWR 385
>gi|393243584|gb|EJD51098.1| hypothetical protein AURDEDRAFT_143041 [Auricularia delicata
TFB-10046 SS5]
Length = 469
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 11/91 (12%)
Query: 246 VKKAAEELGFDVTIFEPEEST--SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
++K A+ G ++ I +P ++ S + I ++G A L ++FLRPG+ LM+
Sbjct: 336 LEKLAQATGAELHILDPTDTRTRSFKNRAELIARSRILIGSPSAALDDAVFLRPGATLME 395
Query: 304 VVPIGTQWLSTVYFEK---PARVLGLEYLEY 331
P G V PAR++G+EY+ +
Sbjct: 396 FFPPG------VMLHDHGLPARMMGVEYIAW 420
>gi|374601970|ref|ZP_09674966.1| Capsular polysaccharide biosynthesis protein-like protein
[Paenibacillus dendritiformis C454]
gi|374392412|gb|EHQ63738.1| Capsular polysaccharide biosynthesis protein-like protein
[Paenibacillus dendritiformis C454]
Length = 361
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 57/126 (45%), Gaps = 11/126 (8%)
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+ V+R R ++N EV + E GF + P S+AD S +V G+
Sbjct: 231 IYVSRQDAHVRHVINEEEVMQVLAEKGFVRIVLTP---LSMADKIAIYSSAQVIVSPFGS 287
Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY--KIKQEESSLVEKYG 345
G ++ F PGS L+++ P+ YF K + G+ Y E I+Q L G
Sbjct: 288 GSINTAFCNPGSTLIELTPVTVM---DGYFWKISNHAGMNYYEVVCDIEQPPKPLG---G 341
Query: 346 ANDLVL 351
A++L++
Sbjct: 342 ADNLIV 347
>gi|291393985|ref|XP_002713347.1| PREDICTED: AER61 glycosyltransferase isoform 2 [Oryctolagus
cuniculus]
Length = 440
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 49/97 (50%), Gaps = 6/97 (6%)
Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R ILN E+ A + + FDV I + + + D R H+ +G+HGAGLTH LFL
Sbjct: 290 RKILNQNELVNALKTVSTFDVQIVDYKYKELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 349
Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+ + ++ + Y + AR+ G+ Y+ ++
Sbjct: 350 PDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 382
>gi|242007553|ref|XP_002424604.1| glycosyltransferase, putative [Pediculus humanus corporis]
gi|212508047|gb|EEB11866.1| glycosyltransferase, putative [Pediculus humanus corporis]
Length = 509
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 58/260 (22%), Positives = 115/260 (44%), Gaps = 25/260 (9%)
Query: 90 CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF--ITINSHFP---DQDVILAIADCNDQ 144
C + P + N +H F D F L+ + +N P D+ + I +
Sbjct: 204 CDIIIMKPTFIMKIDA-TVNMYHHFCD-FFNLYASLHVNMSHPLTFSTDINIIIWE-TFP 260
Query: 145 WARKYAELLPRFTRHPIININNQT-ITHCFQSVTLGLISHGRMVI----NPTLLPKPKTL 199
+ + E+ F+ +PI+ + N T CF++V L+ RM+ N L+ K
Sbjct: 261 YHSNFDEMWRVFSNNPILTLRNFIGKTVCFKNVVFPLLP--RMIFGLYYNTPLISGCKKS 318
Query: 200 VDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE-VKKAAEELGFDV- 257
F++F + +++ K+ L++R R ILN ++ + ++ ++V
Sbjct: 319 GLFKAFSEFVLHRLKIKEHERENSQIKITLLSRETSF-RNILNEKDLINSLSQNKSYNVK 377
Query: 258 -TIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVY 316
T+F ++ + I + ++G+HGAGLTH LFL + + ++ + Y
Sbjct: 378 KTVFN--KNMRFSSQLEIIRNTDILIGMHGAGLTHLLFLPDWAGVFELYNCEDE---NCY 432
Query: 317 FEKPARVLGLEYLEYKIKQE 336
+ AR+ G+EY+ ++ K +
Sbjct: 433 MD-LARLRGVEYITWEDKNK 451
>gi|392965786|ref|ZP_10331205.1| hypothetical protein BN8_02330 [Fibrisoma limi BUZ 3]
gi|387844850|emb|CCH53251.1| hypothetical protein BN8_02330 [Fibrisoma limi BUZ 3]
Length = 395
Score = 42.7 bits (99), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 59/121 (48%), Gaps = 12/121 (9%)
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+ ++R R R I N E+ A + LGF + +P S+A+ R S ++G HGA
Sbjct: 263 IYISRAGR--RRIRNEAELIVALKSLGFQIIEDKPR---SIAEQVRLYKSASFIIGPHGA 317
Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAN 347
T+ ++ PG+ L ++ + + YF A+V+ L Y Y + + +++G +
Sbjct: 318 SFTNIIWCEPGTYLFELF---SPTYTPDYFLYLAQVMNLRYTAYSHGEAD----DRFGIS 370
Query: 348 D 348
D
Sbjct: 371 D 371
>gi|428320698|ref|YP_007118580.1| hypothetical protein Osc7112_5970 [Oscillatoria nigro-viridis PCC
7112]
gi|428244378|gb|AFZ10164.1| hypothetical protein Osc7112_5970 [Oscillatoria nigro-viridis PCC
7112]
Length = 376
Score = 42.7 bits (99), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 50/103 (48%), Gaps = 9/103 (8%)
Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
KL L R + + +REV K FD I E + S ++ + R +VG H
Sbjct: 247 KLYLSRSGRRKVKNEVQVREVLKE-----FDFEILE-DISRTVDEQIRLFAEAAVVVGPH 300
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
GAG T+ L+ +PG+ +++ G + YF ++LGLEY
Sbjct: 301 GAGFTNLLWSQPGTKVLEFFYGG---YTPPYFYYICQLLGLEY 340
>gi|443705176|gb|ELU01832.1| hypothetical protein CAPTEDRAFT_196589 [Capitella teleta]
Length = 478
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 31/134 (23%), Positives = 67/134 (50%), Gaps = 15/134 (11%)
Query: 224 KPKLVLVNRNA----RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCH 279
+ K+VL+ R+ R ++I + E + A E G ++ IF + S+ + R +
Sbjct: 338 RNKIVLIKRSRKRFFRQHKSIAAMLEAQ--ASEHGLELFIFRDDPVPSINLTRRMFNEAI 395
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
++ HGAG ++ ++ +PG+V+++ + ++ + ++ A++LG+ Y
Sbjct: 396 MIIAPHGAGESNMIYAQPGTVIIEGMCFESRIKVNMAYKLTAQLLGMRYY---------G 446
Query: 340 LVEKYGANDLVLKN 353
L+ KYG D+ K
Sbjct: 447 LLFKYGCFDITAKQ 460
>gi|297793023|ref|XP_002864396.1| AtXylT [Arabidopsis lyrata subsp. lyrata]
gi|297310231|gb|EFH40655.1| AtXylT [Arabidopsis lyrata subsp. lyrata]
Length = 534
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 42/80 (52%), Gaps = 6/80 (7%)
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
S+ D R I ++G HGAGLTH + P + + ++V + Q +FE A+ GL
Sbjct: 448 SMKDQVRAIQEASVIIGAHGAGLTHIVSATPNTTIFEIVSVEFQ---RPHFELIAKWKGL 504
Query: 327 EYLEYKI---KQEESSLVEK 343
EY + + E ++++EK
Sbjct: 505 EYHAMHLANSRAEPTAVIEK 524
>gi|241043295|ref|XP_002407103.1| glycosyltransferase, putative [Ixodes scapularis]
gi|215492101|gb|EEC01742.1| glycosyltransferase, putative [Ixodes scapularis]
Length = 482
Score = 42.7 bits (99), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 54/108 (50%), Gaps = 6/108 (5%)
Query: 226 KLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
++ L++RN + R I+N +E+ AA L G V + + ST + ++G+
Sbjct: 316 RVTLLSRNTK-HRRIVNEQELVAAARSLPGVRVRLVDFGHSTDFMHQLEVTANTDVLIGM 374
Query: 285 HGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
HGAGLTH LF VL ++ Y + AR+ G+EY+ ++
Sbjct: 375 HGAGLTHVLFQPDWGVLFEIFNCED---PGCYLDL-ARLRGVEYVTWE 418
>gi|392576589|gb|EIW69719.1| hypothetical protein TREMEDRAFT_43468 [Tremella mesenterica DSM
1558]
Length = 445
Score = 42.7 bits (99), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 42/167 (25%), Positives = 66/167 (39%), Gaps = 26/167 (15%)
Query: 162 ININNQTITHCFQSVTLGLISHG---------RMVINPTLLPKPKT-LVDFQSFLANAYN 211
I+ N T F V +H +M ++ LP P T +V ++ L
Sbjct: 208 IDKKNSGTTFVFDRVRFRFAAHNHGNKVRVWNKMTMDLPDLPVPSTWMVPLRNALKAYAL 267
Query: 212 ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE-------- 263
N H K + +V V R + + R V AE+L + E
Sbjct: 268 AN---GCDLHRHKAGIPIVTY---VNRQMTSRRLVASDAEDLVMSLQDLEDRGEIEFNNA 321
Query: 264 --ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
ES A+ F +VGVHG GL+H L++RPG +++++ G
Sbjct: 322 FMESIPRAEQFCLALKTDILVGVHGNGLSHQLWMRPGGAVLELMDTG 368
>gi|400530672|gb|AFP86487.1| glycosyltransferase, partial [Eurypharynx pelecanoides]
Length = 277
Score = 42.7 bits (99), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 72/160 (45%), Gaps = 22/160 (13%)
Query: 194 PKPKTLV------DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
PK LV F SFLA N T +V+ +R+ + R ILN E+
Sbjct: 2 PKANILVSGNEIRQFSSFLAEKLN--VTVGEQAEKTGEYIVVFSRS--LNRLILNEAELI 57
Query: 248 KA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
A A+E EE S AD R + +V +HGA L SLFL G+ ++++ P
Sbjct: 58 LALAQEFQMKAVTVSLEEQ-SFADIVRVLSRASMLVSMHGAQLVTSLFLPRGAAVVELYP 116
Query: 307 IGTQWLSTVYFEKPARVL------GLEYLEYKIKQEESSL 340
++ ++ P R L L+Y+ ++ +EE+S+
Sbjct: 117 YA---VNPEHY-APYRTLTSLPGMDLQYVAWRNTREENSV 152
>gi|443727464|gb|ELU14205.1| hypothetical protein CAPTEDRAFT_191673 [Capitella teleta]
Length = 481
Score = 42.4 bits (98), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 62/281 (22%), Positives = 108/281 (38%), Gaps = 49/281 (17%)
Query: 108 GNFFHEFMDCFVPLFITINS-------------HFPDQDVILAIADCNDQWARKYAELLP 154
GNF H + D FV L+ +N + D+ + DC+D KY + LP
Sbjct: 201 GNFHHFWNDEFVMLYSLMNISNRLRPEARNHVLYRSPTDLSPSEVDCDD--IHKYQDFLP 258
Query: 155 RF-TRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNEN 213
HP CF++ G+ + L LV Q L+ A
Sbjct: 259 YINVVHPHEVFYKMPSKTCFRNGVFGMAN----------LNYDSKLV-VQHVLSKA---- 303
Query: 214 TNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
+ S H K L++ R R ++N E+ +AA + GF ++ S+ + R
Sbjct: 304 GISPSECAHLKRVLIVQRRQ----RRLMNANELLQAAIDEGFPDSVVIDFAVYSVEEQIR 359
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE------ 327
M G+ GAGL ++F+ GS L+++ W YF G++
Sbjct: 360 LSACSTIMAGIQGAGLQWAIFMPVGSTLIEIAWPQKHW--GFYFNSFVTGYGIDHQTLTT 417
Query: 328 ---YLEYKIKQE---ESSLVEKYGANDLVLKNPQAFAGANW 362
++ +++ QE +V + D++ +P++ W
Sbjct: 418 DRVHVNWRVYQERVRRGQVVPEEERLDMLHSSPKSIIDNVW 458
>gi|56550358|emb|CAI30565.1| glycosyltransferase [Caenorhabditis briggsae]
Length = 541
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 59/265 (22%), Positives = 105/265 (39%), Gaps = 40/265 (15%)
Query: 90 CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF--ITINSHFPDQDVILAIADC-----N 142
C V P +V N +H F D FV L+ + IN F +QD+ + + D N
Sbjct: 208 CDVIFEKPTVVMKLDA-AVNLYHHFCD-FVNLYASLHINQTF-NQDIDIILWDTHPGGYN 264
Query: 143 DQW---------------ARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGR-- 185
D + +++ + L NI + CF++V + L++ R
Sbjct: 265 DHYYGITWKAFSRNEPFELKEFGKSLEFLNYEITENILSDQKRVCFKNVMMPLLARQRSG 324
Query: 186 MVINPTLL---PKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILN 242
+ N L+ L F F+ + + K ++V+++R+ R ILN
Sbjct: 325 LFYNSPLVYGCSGSTLLKTFSQFILHRLRLGIKPQKA-ELEKVRIVILSRST-AYRRILN 382
Query: 243 LREVKKAAEEL-GFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
++E+ K+ L + + E +G+HGAGLTH LFL + +
Sbjct: 383 IKELLKSLGHLPNVTTRVVDYNERIPFLRQLNTTSQTDIFIGMHGAGLTHLLFLPDWAAI 442
Query: 302 MQVVPIGTQW-------LSTVYFEK 319
+++ +G + S +YF K
Sbjct: 443 FEIISLGLKRKLLLYSSYSDIYFSK 467
>gi|392562853|gb|EIW56033.1| hypothetical protein TRAVEDRAFT_152588 [Trametes versicolor
FP-101664 SS1]
Length = 486
Score = 42.4 bits (98), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 44/81 (54%), Gaps = 7/81 (8%)
Query: 223 TKPKLVLVNRNARVGRTILN------LREVKKAAEELGFDVTIFEPEESTSLADSFRFIH 276
T+P + ++R R ++ ++E+ + +E G++V + E ++ T + + FR
Sbjct: 326 TRPVITYISRQGWNRRMLVQADHDKLVQELYRLRDEHGYEVNVVEMDKLTRI-EQFRLAA 384
Query: 277 SCHAMVGVHGAGLTHSLFLRP 297
M+GVHG GLT L++RP
Sbjct: 385 RTTIMMGVHGNGLTALLWMRP 405
>gi|242826030|ref|XP_002488559.1| glycosyltransferase, putative [Talaromyces stipitatus ATCC 10500]
gi|218712377|gb|EED11803.1| glycosyltransferase, putative [Talaromyces stipitatus ATCC 10500]
Length = 275
Score = 42.4 bits (98), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 24/26 (92%)
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVP 306
+VGVHGAGL H++FL+P SV+++++P
Sbjct: 180 LVGVHGAGLPHAMFLKPKSVVVEILP 205
>gi|58265524|ref|XP_569918.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134108887|ref|XP_776558.1| hypothetical protein CNBC0520 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50259238|gb|EAL21911.1| hypothetical protein CNBC0520 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57226150|gb|AAW42611.1| expressed protein [Cryptococcus neoformans var. neoformans JEC21]
Length = 461
Score = 42.4 bits (98), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 23/91 (25%), Positives = 45/91 (49%), Gaps = 12/91 (13%)
Query: 224 KPKLVLVNR-------NARVGRTIL-NLREVKKAAEELGFDVTIFEPEESTSLADSFRFI 275
KPK+V V+R + V +L L+ ++KA + + D + E + F
Sbjct: 302 KPKIVYVDRQRTQRKFDVEVHTQLLKQLKNIEKAKKAVVVDAVL----EDLEKKEQFEMF 357
Query: 276 HSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
++G+HG GL H L++ G ++++++P
Sbjct: 358 SDADIILGIHGNGLAHELWMPEGGIIIEILP 388
>gi|296533968|ref|ZP_06896486.1| TPR repeat protein, partial [Roseomonas cervicalis ATCC 49957]
gi|296265699|gb|EFH11806.1| TPR repeat protein [Roseomonas cervicalis ATCC 49957]
Length = 313
Score = 42.4 bits (98), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 48/96 (50%), Gaps = 6/96 (6%)
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
T P+L L R+ R I N E+++A LGF V F+ LA R +V
Sbjct: 169 TGPRLYLSRRDV-PRRGIENEAELEEALRRLGFTVMTFQ---GVPLARQIRAFQEAEVVV 224
Query: 283 GVHGAGLTHSLFLRPGSVLMQVVP--IGTQWLSTVY 316
HGAGL H + RPG + +++P +GT+ L + Y
Sbjct: 225 SPHGAGLAHLVAARPGLKVFELMPAWLGTEKLRSCY 260
>gi|440796868|gb|ELR17969.1| hypothetical protein ACA1_208470 [Acanthamoeba castellanii str.
Neff]
Length = 574
Score = 42.4 bits (98), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 53/100 (53%), Gaps = 15/100 (15%)
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAM- 281
T+ + +++R+ + R +LN +E+ A +L + L D+ F A+
Sbjct: 394 TESHVGIISRSFK--RFLLNEQELLHATLQLNVSAELL-------LFDTLPFYQQVQALR 444
Query: 282 -----VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVY 316
VG+HG+GLT++L+L+ G+VL+Q++P T + Y
Sbjct: 445 RTTVLVGMHGSGLTNALYLQRGAVLLQIMPFKTGGGAAAY 484
>gi|254471926|ref|ZP_05085327.1| hypothetical protein PJE062_312 [Pseudovibrio sp. JE062]
gi|211959128|gb|EEA94327.1| hypothetical protein PJE062_312 [Pseudovibrio sp. JE062]
Length = 399
Score = 42.4 bits (98), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 52/111 (46%), Gaps = 8/111 (7%)
Query: 220 FHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCH 279
H T + + + R+ R ILN + + E GF + P + T + + +S
Sbjct: 263 MHKTPTQKLYIARDDAKTRRILNETTLIEQLEARGFQSVV--PGKLTH-REQVKLFNSAK 319
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPI-----GTQWLSTVYFEKPARVLG 325
+VG HGAGLT+ LF + G L+++ P WLS V + A V+G
Sbjct: 320 IIVGTHGAGLTNLLFTQAGGKLVEIFPADYIQSAYAWLSHVRGLEYAPVIG 370
>gi|400530678|gb|AFP86490.1| glycosyltransferase, partial [Hiodon tergisus]
Length = 276
Score = 42.4 bits (98), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 52/109 (47%), Gaps = 7/109 (6%)
Query: 201 DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE-VKKAAEELGFDVTI 259
F +FL N SS + +V+ +R+ V R ILN E + A+E V
Sbjct: 15 QFATFLMEKLNITRQGSS---RDEEYIVVFSRS--VNRLILNEAELILTLAQEFQMKVIT 69
Query: 260 FEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
EE S +D + I +V +HGA L SLFL G+V++++ P G
Sbjct: 70 VSLEEH-SFSDIIQAISGASILVSMHGAQLVTSLFLPRGAVVVELFPYG 117
>gi|376005002|ref|ZP_09782572.1| hypothetical protein ARTHRO_370013 [Arthrospira sp. PCC 8005]
gi|375326596|emb|CCE18325.1| hypothetical protein ARTHRO_370013 [Arthrospira sp. PCC 8005]
Length = 915
Score = 42.4 bits (98), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 50/108 (46%), Gaps = 6/108 (5%)
Query: 221 HHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHA 280
+H P + V+R R +LN +V LGF + E S +
Sbjct: 771 NHDYPNRIYVSRAQARHRRVLNEEQVMAQLSRLGFVRVL---PEKLSFSQQVALFSRAEI 827
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
++G HG+GLT+ +F +PG ++VV + + Y+ ++ LGLE+
Sbjct: 828 VIGPHGSGLTNLVFCQPG---VRVVELMSPHYDRHYYWVISQALGLEH 872
>gi|427702431|ref|YP_007045653.1| hypothetical protein Cyagr_1133 [Cyanobium gracile PCC 6307]
gi|427345599|gb|AFY28312.1| hypothetical protein Cyagr_1133 [Cyanobium gracile PCC 6307]
Length = 337
Score = 42.4 bits (98), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 43/81 (53%), Gaps = 3/81 (3%)
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+ ++R + R I N E+ + E GF T+ + E+ + L + F + +VG HGA
Sbjct: 202 IFLSRGVKGARAIHNEWELAEQMESAGF--TVIDAEKYSVLQQIYIFSRA-RTIVGFHGA 258
Query: 288 GLTHSLFLRPGSVLMQVVPIG 308
GLT+ +F P L+++ +G
Sbjct: 259 GLTNLIFADPSCELIELAIVG 279
>gi|241111502|ref|XP_002399294.1| glycosyltransferase, putative [Ixodes scapularis]
gi|215492952|gb|EEC02593.1| glycosyltransferase, putative [Ixodes scapularis]
Length = 521
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 43/216 (19%), Positives = 81/216 (37%), Gaps = 19/216 (8%)
Query: 109 NFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIINI---- 164
N H F D +P+F T H +++C D + + P+ + +
Sbjct: 104 NLMHVFHDDLIPIFATAREH-RGCSTGEEVSNCLDNLTLFFTDNRPKGPYWYLYQVLTKD 162
Query: 165 ------NNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQS------FLANAYNE 212
+ T +CF +GL +P+ + QS + +
Sbjct: 163 LLLVPPSETTQLYCFNKAIVGLQKQSTWYQYGFRMPQGPLERNLQSAGKEIKLFTKEFLK 222
Query: 213 NTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSF 272
N S V+V+R+ R ILN E+ + V + E +L+
Sbjct: 223 MLNIQPSLSVEAGYAVIVSRSR--NRLILNEEELLDMVKTHAALVPVVVDLEREALSKVL 280
Query: 273 RFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
+ + +V +HG+ L S+F++PG V++++ P G
Sbjct: 281 QLLVGAKLLVAMHGSALILSMFMKPGGVVLEMFPYG 316
>gi|443685144|gb|ELT88859.1| hypothetical protein CAPTEDRAFT_190671 [Capitella teleta]
Length = 416
Score = 42.0 bits (97), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 5/106 (4%)
Query: 237 GRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
GR I+N RE +E F F + ST L M+GVHGAGL + L
Sbjct: 288 GRRIINKRETVHFLKE-RFPKNFFVFKGSTDLDKHASLFARTKLMIGVHGAGLYNMLLCN 346
Query: 297 PGSVLMQVVPIGTQW----LSTVYFEKPARVLGLEYLEYKIKQEES 338
+ +++++PI LST F K A +G Y +++ +S
Sbjct: 347 SKTTILEIMPIDKGSLLPNLSTDIFWKMASSIGQNYWRIYVERIDS 392
>gi|440464716|gb|ELQ34090.1| DUF563 domain-containing protein [Magnaporthe oryzae Y34]
gi|440480711|gb|ELQ61362.1| DUF563 domain-containing protein [Magnaporthe oryzae P131]
Length = 506
Score = 42.0 bits (97), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 10/91 (10%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEEST-SLADSFRFI-HSCHAM 281
K ++ ++R R R +L + AA DV + + +T S + R + H +
Sbjct: 327 KLRVTFIDR--RGSRKLLGQERLLDAARRAYPDVQVRSVDFATLSFVEQIRLVRHETDVL 384
Query: 282 VGVHGAGLTHSLFLRP------GSVLMQVVP 306
VGVHGAGLTH +FLR G +++++P
Sbjct: 385 VGVHGAGLTHVMFLRAASEAVGGGAIVEILP 415
>gi|428180864|gb|EKX49730.1| hypothetical protein GUITHDRAFT_135893 [Guillardia theta CCMP2712]
Length = 1399
Score = 42.0 bits (97), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 71/310 (22%), Positives = 127/310 (40%), Gaps = 46/310 (14%)
Query: 20 HRSYDLCLINGSALFDP-KTSTFFSVGHTDSTPSQPSLRIKTQPYPRKSDKSAMSKVKE- 77
+R ++C+ N S LF P +S +F + H P L + + + S K +K
Sbjct: 975 YRLRNVCIANRSILFFPIPSSPWFQLDH------DPVLLLGRHWWGQWSSKYCSVTLKRG 1028
Query: 78 ---LTITTSAPPNLSCGVTHTSPA--LVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQ 132
+ NL G + A + + G + +F H F D +PLF TI+ F +Q
Sbjct: 1029 KSLVDFMKQNGINLDDGDAGEAGAGPVYVTLEGPSSHFGHTFTDSMLPLFATIS--FVEQ 1086
Query: 133 DV------ILAIADC-------NDQWAR-KYAELLPRFTRHPIININNQTI------THC 172
+ I+ C D+ ++ ++ +L + HP +++ + + C
Sbjct: 1087 ERRRERRKIILWPHCANFCPIKGDRRSQGRFRDLFYVASSHPQVDLLSLSSFSLSLRCRC 1146
Query: 173 FQSVTLGLISHGRMVI---NPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPK-LV 228
F+ V LG + R+ N +P + L D L N S++F +
Sbjct: 1147 FKEVVLGELIDTRVGYQRDNRRTVPFLEDLADAVRML----RRNILGSTAFRKVDQRERR 1202
Query: 229 LVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGA 287
+ R R ++N+ +++ +ELGF F+ + E L + + +VGV+GA
Sbjct: 1203 RMTLVRRRDRLVVNIHGLERLGQELGF--LAFQVDWEHLRLFQQIQVMEQTDVLVGVYGA 1260
Query: 288 GLTHSLFLRP 297
GL F RP
Sbjct: 1261 GLVKIAFARP 1270
>gi|345569002|gb|EGX51871.1| hypothetical protein AOL_s00043g605 [Arthrobotrys oligospora ATCC
24927]
Length = 531
Score = 42.0 bits (97), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 52/110 (47%), Gaps = 8/110 (7%)
Query: 222 HTKPKLVLVNRNARVGRTILNLREVKKA--AEELGFDVTIFEPEESTSLADSFRFIHSCH 279
H+ L ++R R + N +E+ A A +V + + + T L + + +
Sbjct: 376 HSPINLTFIDRKG--SRKLTNSKELTDALIAAYPKVNVKVVDMADLT-LKEQISLVVNTD 432
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYL 329
+VGVHGAG TH+ FL P S L++++P L F A + G+ Y
Sbjct: 433 VLVGVHGAGHTHAFFLPPQSSLVEILPAD---LKHKGFRNLAGLRGIRYF 479
>gi|113476309|ref|YP_722370.1| hypothetical protein Tery_2708 [Trichodesmium erythraeum IMS101]
gi|110167357|gb|ABG51897.1| Tetratricopeptide TPR_2 [Trichodesmium erythraeum IMS101]
Length = 979
Score = 42.0 bits (97), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 54/115 (46%), Gaps = 10/115 (8%)
Query: 214 TNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
TN S F P+ + + R R ++N EV + GF T PE S SL +
Sbjct: 832 TNNKSYF----PEKIYIGRGNAKYRRVMNEAEVVDILRQFGF--TYVTPE-SISLENQIS 884
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
+V HG+GLT+ +F PG+ +V+ + + Y+ +++LGLE+
Sbjct: 885 TFAHAKIIVAPHGSGLTNIVFCNPGT---KVIELFSPHYLRYYYWHISQLLGLEH 936
>gi|428205793|ref|YP_007090146.1| capsular polysaccharide biosynthesis protein-like protein
[Chroococcidiopsis thermalis PCC 7203]
gi|428007714|gb|AFY86277.1| capsular polysaccharide biosynthesis protein-like protein
[Chroococcidiopsis thermalis PCC 7203]
Length = 412
Score = 42.0 bits (97), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 36/74 (48%), Gaps = 3/74 (4%)
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+ ++R GR ++N EV A LGF + E S AD R +V HGA
Sbjct: 272 IYISRTKTTGRQVINEEEVLAALAPLGF---VSYTMERMSFADQVRLFSQAEIVVSPHGA 328
Query: 288 GLTHSLFLRPGSVL 301
LT++LF + SV+
Sbjct: 329 ALTNTLFAQNLSVI 342
>gi|18423754|ref|NP_568825.1| beta-(1,2)-xylosyltransferase [Arabidopsis thaliana]
gi|17369167|sp|Q9LDH0.1|XYLT_ARATH RecName: Full=Beta-(1,2)-xylosyltransferase
gi|15724166|gb|AAL06475.1|AF411785_1 AT5g55500/MTE17_21 [Arabidopsis thaliana]
gi|7671513|emb|CAB89489.1| beta1,2-Xylosyltransferase [Arabidopsis thaliana]
gi|7768105|emb|CAB90610.1| beta1,2-xylosyltransferase [Arabidopsis thaliana]
gi|9758182|dbj|BAB08567.1| unnamed protein product [Arabidopsis thaliana]
gi|23505973|gb|AAN28846.1| At5g55500/MTE17_21 [Arabidopsis thaliana]
gi|332009252|gb|AED96635.1| beta-(1,2)-xylosyltransferase [Arabidopsis thaliana]
Length = 534
Score = 42.0 bits (97), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 42/80 (52%), Gaps = 6/80 (7%)
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
S+ D R I ++G HGAGLTH + P + + +++ + Q +FE A+ GL
Sbjct: 448 SMKDQVRAIQDASVIIGAHGAGLTHIVSATPNTTIFEIISVEFQ---RPHFELIAKWKGL 504
Query: 327 EYLEYKI---KQEESSLVEK 343
EY + + E ++++EK
Sbjct: 505 EYHAMHLANSRAEPTAVIEK 524
>gi|423065545|ref|ZP_17054335.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
gi|406712988|gb|EKD08163.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
Length = 917
Score = 42.0 bits (97), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 49/107 (45%), Gaps = 6/107 (5%)
Query: 222 HTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAM 281
H P + V+R R +LN +V LGF + E S + +
Sbjct: 774 HDYPNRIYVSRAQARHRRVLNEEQVMAQLSRLGFVRVL---PEKLSFSQQVALFSRAEIV 830
Query: 282 VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
+G HG+GLT+ +F +PG ++VV + + Y+ ++ LGLE+
Sbjct: 831 IGPHGSGLTNLVFCQPG---VRVVELMSPHYDRHYYWVISQALGLEH 874
>gi|209528249|ref|ZP_03276714.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
gi|209491321|gb|EDZ91711.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
Length = 915
Score = 42.0 bits (97), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 49/107 (45%), Gaps = 6/107 (5%)
Query: 222 HTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAM 281
H P + V+R R +LN +V LGF + E S + +
Sbjct: 772 HDYPNRIYVSRAQARHRRVLNEEQVMAQLSRLGFVRVL---PEKLSFSQQVALFSRAEIV 828
Query: 282 VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
+G HG+GLT+ +F +PG ++VV + + Y+ ++ LGLE+
Sbjct: 829 IGPHGSGLTNLVFCQPG---VRVVELMSPHYDRHYYWVISQALGLEH 872
>gi|8572250|gb|AAF77064.1| beta 1,2-xylosyltransferase [Arabidopsis thaliana]
Length = 533
Score = 42.0 bits (97), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 42/80 (52%), Gaps = 6/80 (7%)
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
S+ D R I ++G HGAGLTH + P + + +++ + Q +FE A+ GL
Sbjct: 447 SMKDQVRAIQDASVIIGAHGAGLTHIVSATPNTTIFEIISVEFQ---RPHFELIAKWKGL 503
Query: 327 EYLEYKI---KQEESSLVEK 343
EY + + E ++++EK
Sbjct: 504 EYHAMHLANSRAEPTAVIEK 523
>gi|344276098|ref|XP_003409846.1| PREDICTED: uncharacterized glycosyltransferase AER61-like
[Loxodonta africana]
Length = 443
Score = 42.0 bits (97), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 50/99 (50%), Gaps = 6/99 (6%)
Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R ILN E+ A + + F+V I + + + D R H+ +G+HGAGLTH LFL
Sbjct: 293 RKILNQNELVNALKTVSTFEVRIVDYKYKELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 352
Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIK 334
+ + ++ + Y + AR+ G+ Y+ ++ K
Sbjct: 353 PDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWRKK 387
>gi|393213882|gb|EJC99377.1| hypothetical protein FOMMEDRAFT_160990 [Fomitiporia mediterranea
MF3/22]
Length = 537
Score = 42.0 bits (97), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 42/94 (44%), Gaps = 4/94 (4%)
Query: 243 LREVKKAAEELGFDVTIFE----PEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG 298
+RE+KK +DV I E E T + R M+GV+G LT+S+ LRPG
Sbjct: 402 VRELKKLGSSQKWDVHIIEIGGDGGEPTPWVEHIRAAAQSSIMLGVYGDSLTNSILLRPG 461
Query: 299 SVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
V+ T E R LG+ Y+ ++
Sbjct: 462 PPTPAVIEFFPDGRYTNEHEFVTRALGIHYVAWR 495
>gi|323451963|gb|EGB07838.1| hypothetical protein AURANDRAFT_71684 [Aureococcus anophagefferens]
Length = 980
Score = 42.0 bits (97), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 63/142 (44%), Gaps = 20/142 (14%)
Query: 187 VINPTLLPKPKTLVD---FQSFL--ANAYNENTNTSSSFHHTKPKLVL--------VNRN 233
V++ T +P+ L D ++FL A+ + + + + T+P + +NR
Sbjct: 170 VVHATYVPRATYLRDPDATRAFLEAADGLHGGGSVADAAPSTRPPRTVTLLLRSANLNRP 229
Query: 234 A-RVGRTILNLREVKKAAEEL------GFDVTIFEPEESTSLADSFRFIHSCHAMVGVHG 286
A R G N RE+ + E G+ + + S A FR +VG HG
Sbjct: 230 AKRAGPAWANSRELAASLERWLSSEHPGWVLQTVALAHAFSFASQFRTAQHSGVLVGAHG 289
Query: 287 AGLTHSLFLRPGSVLMQVVPIG 308
A LT+ +F R G+ L++V+ G
Sbjct: 290 ASLTNGVFQRDGAALLEVLNCG 311
>gi|220926108|ref|YP_002501410.1| capsular polysaccharide biosynthesis protein-like protein
[Methylobacterium nodulans ORS 2060]
gi|219950715|gb|ACL61107.1| Capsular polysaccharide biosynthesis protein-like protein
[Methylobacterium nodulans ORS 2060]
Length = 350
Score = 42.0 bits (97), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 48/105 (45%), Gaps = 6/105 (5%)
Query: 231 NRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLT 290
+R R +N V +LG+ + F E S D H C ++VGV GA LT
Sbjct: 225 SRGRGASRDFVNEPAVADHLSKLGY-INKFAGE--LSFTDQISVFHGCSSVVGVMGAALT 281
Query: 291 HSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQ 335
+ +F +PG+ + +P V F + A L+Y+E + ++
Sbjct: 282 NIVFCKPGTTVYCFMPSSA---CEVLFWRIAEARRLKYIEIRSRE 323
>gi|403235324|ref|ZP_10913910.1| hypothetical protein B1040_06025 [Bacillus sp. 10403023]
Length = 386
Score = 42.0 bits (97), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 20/69 (28%), Positives = 36/69 (52%), Gaps = 3/69 (4%)
Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
R+I+N E+ +A ++ F E ++ HS ++ HGAGLT+ F RP
Sbjct: 261 RSIVNEDEILEALKDYKFKTIRLE---EFRVSKQIEIFHSASVIISPHGAGLTNLAFCRP 317
Query: 298 GSVLMQVVP 306
G+ ++++ P
Sbjct: 318 GTKVLEIYP 326
>gi|397641682|gb|EJK74793.1| hypothetical protein THAOC_03511 [Thalassiosira oceanica]
Length = 945
Score = 42.0 bits (97), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 44/82 (53%), Gaps = 6/82 (7%)
Query: 229 LVNRNARVGRTILNLREVKKAAEE--LGFDVTIFEP--EESTSLADSFRFIHSCHAMVGV 284
L R++R R +N+ V A E G V I E E +++ ++ F S AMVGV
Sbjct: 415 LAQRSSR--RRWINMPSVMNACREEFAGGGVVIVEVNVEHTSTPSEQFLLHRSLDAMVGV 472
Query: 285 HGAGLTHSLFLRPGSVLMQVVP 306
HGA LT + L P S +++++P
Sbjct: 473 HGAQLTQGVLLPPYSHILELLP 494
>gi|334117761|ref|ZP_08491852.1| Tetratricopeptide TPR_1 repeat-containing protein [Microcoleus
vaginatus FGP-2]
gi|333460870|gb|EGK89478.1| Tetratricopeptide TPR_1 repeat-containing protein [Microcoleus
vaginatus FGP-2]
Length = 1363
Score = 42.0 bits (97), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 3/77 (3%)
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+ ++R R I+N EV K E+LGF E + S+A+ + + +V HG
Sbjct: 1229 IYISRQQASYRRIVNDEEVIKYLEKLGFRSVKLE---TMSIAEQASCLAAAKVVVAPHGG 1285
Query: 288 GLTHSLFLRPGSVLMQV 304
GLT+ +F PG+ ++++
Sbjct: 1286 GLTNLVFCSPGTKVIEI 1302
>gi|330792027|ref|XP_003284092.1| hypothetical protein DICPUDRAFT_147835 [Dictyostelium purpureum]
gi|325086021|gb|EGC39418.1| hypothetical protein DICPUDRAFT_147835 [Dictyostelium purpureum]
Length = 515
Score = 42.0 bits (97), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 38/67 (56%), Gaps = 6/67 (8%)
Query: 233 NARVGRTILNLRE----VKKAAEELGFDVTI-FEPEESTSLADSFRFIHSCHAMVGVHGA 287
N +G+T++N+R+ +++ E+G + EP+E T LADS I C A GV GA
Sbjct: 405 NYNIGKTLINIRQSFYRIRELMREMGAIADVPLEPKEQTELADSTMNIKGCVA-AGVPGA 463
Query: 288 GLTHSLF 294
G +LF
Sbjct: 464 GGFDALF 470
>gi|86564130|ref|NP_506677.3| Protein H12D21.10, isoform a [Caenorhabditis elegans]
gi|72058551|emb|CAB07427.3| Protein H12D21.10, isoform a [Caenorhabditis elegans]
Length = 523
Score = 42.0 bits (97), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 53/253 (20%), Positives = 101/253 (39%), Gaps = 22/253 (8%)
Query: 90 CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLF--ITINSHFPDQDVILAIADC-----N 142
C + P ++ N +H F D F L+ + +N F DQDV + + D N
Sbjct: 206 CDIIFEKPTIIMKLDA-AVNLYHHFCD-FFNLYASLHLNQTF-DQDVDIILWDTHPGGYN 262
Query: 143 DQWARKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGR--MVINPTLLPKPKTL 199
D + Y F+++ + CF+ V + L++ R + N ++
Sbjct: 263 DHY---YGVTWKAFSKNQPFELKEFDQKKVCFKRVMMPLLARQRTGLFYNSPVVEGCSGS 319
Query: 200 VDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVT 258
F++F + K ++V+++R+ R ILN++E+ ++ L
Sbjct: 320 KMFRTFSQFILHRLGIRQPKADLEKARIVILSRSTAF-RKILNIKEILRSLGHLPNVSTR 378
Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFE 318
+ + E +G+HGAGLTH LFL + + ++ G +
Sbjct: 379 VVDYNERIPFEKQLNITSKTDIFIGMHGAGLTHLLFLPDWAAVFEIYNCGDPGC----YS 434
Query: 319 KPARVLGLEYLEY 331
AR+ G++Y +
Sbjct: 435 DLARLRGVKYYTW 447
>gi|384247850|gb|EIE21335.1| hypothetical protein COCSUDRAFT_56558 [Coccomyxa subellipsoidea
C-169]
Length = 394
Score = 42.0 bits (97), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 25/93 (26%), Positives = 43/93 (46%), Gaps = 11/93 (11%)
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEEL---------GFDVTIFEPEESTSLADSFR 273
T K+ + R R ILNL+++ + EL T ++ +
Sbjct: 219 TAIKVAFIERTK--SRHILNLKDLIRRCNELQVGEAPRNRSVQCTAVSFDDVGNFTGLLA 276
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+ + +VGVHGAGL ++ F+RPGS +++ P
Sbjct: 277 ELQTIDILVGVHGAGLVNTYFMRPGSAFLEIFP 309
>gi|395326538|gb|EJF58946.1| hypothetical protein DICSQDRAFT_65917 [Dichomitus squalens LYAD-421
SS1]
Length = 411
Score = 41.6 bits (96), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 42/82 (51%), Gaps = 7/82 (8%)
Query: 222 HTKPKLVLVNRNARVGRTILN------LREVKKAAEELGFDVTIFEPEESTSLADSFRFI 275
T+P + ++R R + + E+ + +E G++V + E ++ T + + FR
Sbjct: 250 ETRPVISYISRQGWNRRKLRQEDHERLVEELYRLRDEYGYEVNVVEMDKLTRM-EQFRLA 308
Query: 276 HSCHAMVGVHGAGLTHSLFLRP 297
M+GVHG GLT L++RP
Sbjct: 309 GRTTIMMGVHGNGLTALLWMRP 330
>gi|242825040|ref|XP_002488357.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218712175|gb|EED11601.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 461
Score = 41.6 bits (96), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 48/101 (47%), Gaps = 6/101 (5%)
Query: 243 LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLM 302
LREV++ + + F S + R +VGVHG L+H++FL+P SV++
Sbjct: 338 LREVQERFPYVKVQIVDFS---SIPFKEQIRIAQRTDILVGVHGVELSHAIFLKPQSVVV 394
Query: 303 QVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEK 343
+++P + F+ +R+ G Y + E LV +
Sbjct: 395 EILPSSC---NLTMFQHLSRLRGHTYYSTHGAEVEFDLVRR 432
>gi|71663377|ref|XP_818682.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70883947|gb|EAN96831.1| hypothetical protein Tc00.1047053509733.95 [Trypanosoma cruzi]
Length = 362
Score = 41.6 bits (96), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 18/59 (30%), Positives = 37/59 (62%), Gaps = 3/59 (5%)
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
F+H+ + HGAG+ + + + PGSV++++ P G ++ +++ A +LGL Y+ Y+
Sbjct: 234 FMHNSSIFIAAHGAGMVNIMSMSPGSVVVELFPHGFRY---AMYQELAELLGLHYIAYE 289
>gi|392397499|ref|YP_006434100.1| hypothetical protein Fleli_1917 [Flexibacter litoralis DSM 6794]
gi|390528577|gb|AFM04307.1| hypothetical protein Fleli_1917 [Flexibacter litoralis DSM 6794]
Length = 337
Score = 41.6 bits (96), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 19/79 (24%), Positives = 41/79 (51%), Gaps = 3/79 (3%)
Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
K + ++R R I N EV++ ++ ++ FE + + +++G+H
Sbjct: 207 KKIYISRQKSRFRKINNEDEVQQLLKKYNYETHYFE---DYNFEKQIELMQQTTSLIGLH 263
Query: 286 GAGLTHSLFLRPGSVLMQV 304
GAGLT+ LF+ P + ++++
Sbjct: 264 GAGLTNMLFMNPNTKILEI 282
>gi|302765551|ref|XP_002966196.1| glycoprotein beta-1,2-xylosyltransferase [Selaginella
moellendorffii]
gi|300165616|gb|EFJ32223.1| glycoprotein beta-1,2-xylosyltransferase [Selaginella
moellendorffii]
Length = 495
Score = 41.6 bits (96), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 28/38 (73%)
Query: 268 LADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVV 305
LA+ + + S +VGVHGAGLTH +F RPG+ +++++
Sbjct: 411 LAEQIKAVKSSSIIVGVHGAGLTHIVFARPGTSVVEML 448
>gi|307109772|gb|EFN58009.1| hypothetical protein CHLNCDRAFT_57085 [Chlorella variabilis]
Length = 1084
Score = 41.6 bits (96), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 59/138 (42%), Gaps = 7/138 (5%)
Query: 198 TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEEL--GF 255
L ++ L Y + +T + +++V+R GR +LN+R V+K E G
Sbjct: 291 ALQSWRERLGACYGLDVHTPARVQGKT--VLIVDRPYEAGRHMLNVRSVEKFVRERFPGD 348
Query: 256 DVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP--IGTQWLS 313
DV + + E D R I +V VHG+ + FL P +V + + P L
Sbjct: 349 DVRL-QYFEDVPFPDQLRIISGVSILVMVHGSAIALWPFLPPKAVAIHISPDIESAHSLQ 407
Query: 314 TVYFEKPARVLGLEYLEY 331
V+ E AR G L +
Sbjct: 408 RVWAEAYARDWGFTGLTF 425
>gi|428777835|ref|YP_007169622.1| hypothetical protein PCC7418_3293 [Halothece sp. PCC 7418]
gi|428692114|gb|AFZ45408.1| hypothetical protein PCC7418_3293 [Halothece sp. PCC 7418]
Length = 467
Score = 41.6 bits (96), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 44/83 (53%), Gaps = 9/83 (10%)
Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGF---DVTIFEPEESTSLADSFRFIHSCHAMV 282
K + ++R R I+N +E+ E +GF ++T EE S+ + +V
Sbjct: 333 KYIYISRGDAKRRPIINQKEIISVLEAMGFSSVEMTYRSFEEQISI------VRDASIIV 386
Query: 283 GVHGAGLTHSLFLRPGSVLMQVV 305
HGAGLT+ +F +PG+ +++++
Sbjct: 387 APHGAGLTNMVFCKPGTKIVELI 409
>gi|428165975|gb|EKX34960.1| hypothetical protein GUITHDRAFT_118891 [Guillardia theta CCMP2712]
Length = 337
Score = 41.6 bits (96), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 12/121 (9%)
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVT---IFEP-----EESTSLADSFRFIHSCH 279
VLV R+A GR ILN EV A + DV + +P +E LA + F H+
Sbjct: 200 VLVQRSANNGRNILNHDEVLLALKRTFKDVCDWRVMKPLWGGDKELGMLAAASLFRHA-S 258
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
+V HG L + +F P + L++++P + + +E+ +R LGL Y + S
Sbjct: 259 LIVAPHGGSLANLMFASPHACLVEILP---AFRPNLCYERLSRALGLRYFGLVVPGSFSQ 315
Query: 340 L 340
L
Sbjct: 316 L 316
>gi|431899810|gb|ELK07757.1| hypothetical protein PAL_GLEAN10022130 [Pteropus alecto]
Length = 474
Score = 41.6 bits (96), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 6/97 (6%)
Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R ILN E+ A + + F+V I + + + D R H+ +G+HGAGLTH LFL
Sbjct: 324 RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 383
Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+ + ++ + Y + AR+ G+ Y+ ++
Sbjct: 384 PDWAAVFELYNCEDE---RCYLD-LARLRGVHYITWR 416
>gi|400530666|gb|AFP86484.1| glycosyltransferase, partial [Echidna rhodochilus]
Length = 277
Score = 41.6 bits (96), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 54/123 (43%), Gaps = 12/123 (9%)
Query: 194 PKPKTLV------DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVK 247
PK LV F +F+A N T + +V+ +R+ + R ILN E+
Sbjct: 2 PKANILVSGNEIRQFSAFVAEKLN--VTTEEQAEKMEEYIVVFSRS--LNRLILNEAELI 57
Query: 248 KA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
A A E V EE S +D R I +V +HGA L SLFL G+ ++++ P
Sbjct: 58 LALAREYQMKVVTVSLEEQ-SFSDIVRIISRASMLVSMHGAQLVTSLFLPRGAAVVELFP 116
Query: 307 IGT 309
Sbjct: 117 FAV 119
>gi|428315210|ref|YP_007113092.1| Tetratricopeptide TPR_2 repeat-containing protein [Oscillatoria
nigro-viridis PCC 7112]
gi|428238890|gb|AFZ04676.1| Tetratricopeptide TPR_2 repeat-containing protein [Oscillatoria
nigro-viridis PCC 7112]
Length = 1363
Score = 41.6 bits (96), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 40/77 (51%), Gaps = 3/77 (3%)
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+ + R R ILN EV K E++GF E + S+A+ + + +V HG
Sbjct: 1229 IYITRQQASYRRILNDEEVIKYLEKIGFRSVKLE---TMSVAEQASCLAAAKVVVAPHGG 1285
Query: 288 GLTHSLFLRPGSVLMQV 304
GLT+ +F PG+ ++++
Sbjct: 1286 GLTNLVFCSPGTKVIEI 1302
>gi|301097784|ref|XP_002897986.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262106431|gb|EEY64483.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 477
Score = 41.6 bits (96), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 51/103 (49%), Gaps = 16/103 (15%)
Query: 237 GRTILNLREVKKA----------AEELGFDVTIFEPEESTSLADSFRFIHS-CHAMVGVH 285
GR ILN EV +A E+L + F +E L D F+ M+G H
Sbjct: 327 GRQILNEPEVFEAISAMLNATGRPEQLQY----FRHDEFNGLEDVASFMRDRVKMMIGPH 382
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
GA ++ F +P + L++++P ++ ++E+ AR+LG +Y
Sbjct: 383 GAAFYNARFAQPRTALIEIIPDPGKFFVPCFWEQ-ARLLGQDY 424
>gi|23006630|ref|ZP_00048862.1| hypothetical protein Magn03002098 [Magnetospirillum magnetotacticum
MS-1]
Length = 113
Score = 41.6 bits (96), Expect = 0.80, Method: Composition-based stats.
Identities = 27/76 (35%), Positives = 41/76 (53%), Gaps = 6/76 (7%)
Query: 253 LGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWL 312
LGF+ F+PE + + RF A+VG+ GAG+ +++F PG+ ++ +VP G W
Sbjct: 8 LGFEA--FDPELADMRQQAVRFA-GAEAVVGLAGAGMANTVFCAPGTPILHLVPEG--WE 62
Query: 313 STVYFEKPARVLGLEY 328
Y E A G EY
Sbjct: 63 DPFYAEL-AMACGQEY 77
>gi|450221959|ref|ZP_21896674.1| region 2 capsular polysaccharide biosynthesis protein [Escherichia
coli O08]
gi|449315599|gb|EMD05740.1| region 2 capsular polysaccharide biosynthesis protein [Escherichia
coli O08]
Length = 378
Score = 41.2 bits (95), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 60/118 (50%), Gaps = 13/118 (11%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGF---DVTIFEPEESTSLADSFRFIHSCHA 280
K K+ + RNA R I N ++ K EE G+ D + P E +L F H+
Sbjct: 246 KHKIFISRRNA-TQRKIHNEIDLYKYFEEQGYYICDPGDYTPSEQIAL-----FSHA-EI 298
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEES 338
+VG HG G+ +++F + +L++++P T W Y+ + +++ +Y Y I++ E
Sbjct: 299 IVGSHGMGIANAVFSKNLKILIEIMP--TDWNRVSYY-RTTQLMNCKYGCYWIEKNED 353
>gi|302846443|ref|XP_002954758.1| hypothetical protein VOLCADRAFT_118856 [Volvox carteri f.
nagariensis]
gi|300259941|gb|EFJ44164.1| hypothetical protein VOLCADRAFT_118856 [Volvox carteri f.
nagariensis]
Length = 888
Score = 41.2 bits (95), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 37/64 (57%), Gaps = 7/64 (10%)
Query: 281 MVGVHGAGLTHSLFLRPGSVLMQVVPIG----TQWLSTVYFEKPARVLGLEYLEYKIKQE 336
+VGVHG+ LT++LF+RPGS L+++ P G W +Y + R G+E + I
Sbjct: 604 LVGVHGSALTNALFMRPGSSLIELRPYGFSGRESW-PNIYMKSQTR--GMEVFWFGIDVM 660
Query: 337 ESSL 340
++L
Sbjct: 661 SANL 664
>gi|452824307|gb|EME31311.1| transferase, transferring glycosyl groups [Galdieria sulphuraria]
Length = 637
Score = 41.2 bits (95), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 45/182 (24%), Positives = 76/182 (41%), Gaps = 31/182 (17%)
Query: 162 ININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFH 221
ININ CF+SV G S + ++N + F N + T+S +
Sbjct: 327 ININASM--KCFRSVVTGKGSDVKYLLNS----------ENSFFKVNNLRKEARTASYWC 374
Query: 222 HTKPKLVLVNRNARVGRTIL-------NLREVKKAAE--------ELGFDVTIFEPEEST 266
++ +++R RT++ N+R+++ E + F I EE T
Sbjct: 375 EKPIQVTILSRKTNNARTLVGADNFAENIRKLQVTKESTQDKKTCHMTFHCQIVYFEEMT 434
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
L + + ++ VHGAG T+ +FL SVL+++ P + FE+ AR L
Sbjct: 435 FL-EQVSIMQKTDILIAVHGAGNTNIVFLPENSVLIEIYPFAYK---ANIFEELARKYLL 490
Query: 327 EY 328
Y
Sbjct: 491 RY 492
>gi|348665751|gb|EGZ05580.1| hypothetical protein PHYSODRAFT_261939 [Phytophthora sojae]
Length = 463
Score = 41.2 bits (95), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 38/133 (28%), Positives = 65/133 (48%), Gaps = 12/133 (9%)
Query: 231 NRNARVGRTILNLREVKKA-AEELGF-----DVTIFEPEESTSLADSFRFIHS-CHAMVG 283
R A GR ILN ++ A A L + F+ +E L + F+ M+G
Sbjct: 302 GRAANGGRQILNEPKLFDAIAARLNATGRPEQLMYFKHDEFNGLEEVATFMRDRVKMMIG 361
Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY--KIKQEESSLV 341
HGA ++ F +P + L+++VP Q+ ++E+ AR+LG +Y + + E++ ++
Sbjct: 362 RHGAAFYNARFAQPRTALIEIVPDPEQFFVPCFWEQ-ARLLGQDYSAHVGNTQNEQNDMI 420
Query: 342 --EKYGANDLVLK 352
E DLVLK
Sbjct: 421 VDEVQDVVDLVLK 433
>gi|291230250|ref|XP_002735084.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 649
Score = 41.2 bits (95), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 49/224 (21%), Positives = 92/224 (41%), Gaps = 38/224 (16%)
Query: 109 NFFHEFMDCFVPLFITI------NSHFPDQDVILAIADCNDQWA-RKYAELLPRFTRHPI 161
N H D +PLF T+ + H + D + + + W +Y +L F+ P+
Sbjct: 215 NLMHVLHDDLLPLFTTLSQITSSDEHLFNLDTRIILME---GWKPGEYKDLYQLFSDVPV 271
Query: 162 ININNQTITH-------CFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENT 214
+ +N T CF++ LGL V KP Q +AN T
Sbjct: 272 VLKSNLKSTESPGNNLICFRNAFLGL--SKSTVWYDYGFTKP------QGRVANVKTTAT 323
Query: 215 NTSSSFHHTKPKLVLVNRNAR-----------VGRTILNLREVKKA-AEELGFDVTIFEP 262
+ +L +V+ ++R + R +LN E+ A ++ G V
Sbjct: 324 EIRQFTKFVEKRLGIVSDDSRAEEFIVILSRKINRLLLNEGELMFALIQQFGLKVMSLSV 383
Query: 263 EESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
E + ++ + + ++GVHG+ ++ S+FLR +V++++ P
Sbjct: 384 E-THAIREQIELVSKASVLIGVHGSLMSLSMFLRESAVVIEIFP 426
>gi|296225634|ref|XP_002758584.1| PREDICTED: uncharacterized glycosyltransferase AER61 isoform 2
[Callithrix jacchus]
Length = 443
Score = 41.2 bits (95), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 6/97 (6%)
Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R ILN E+ A + + F+V I + + + D R H+ +G+HGAGLTH LFL
Sbjct: 293 RKILNQNELVNALKTVSTFEVQIVDYKYKELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 352
Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+ + ++ + Y + AR+ G+ Y+ ++
Sbjct: 353 PDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWQ 385
>gi|356547204|ref|XP_003542006.1| PREDICTED: V-type proton ATPase 116 kDa subunit a isoform 1-like
[Glycine max]
Length = 853
Score = 41.2 bits (95), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 61/148 (41%), Gaps = 9/148 (6%)
Query: 58 IKTQ-PYPRKSDKSAMSKVKELTITTSAPPNLSC--GVTHTSPALVFSAGGYNGNFFHEF 114
IK Q PYP D S EL S +S GV H + +V S +N +FF
Sbjct: 550 IKYQDPYPFGVDPSWRGSRSELPFLNSLKMKMSILFGVAHMNLGIVLSY--FNAHFFRNS 607
Query: 115 MDC---FVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITH 171
+D FVP I +NS F +++ I C A Y ++ F P N+ +
Sbjct: 608 LDIRYQFVPQMIFLNSLFGYLSLLIVIKWCTGSQADLYHVMIYMFLS-PTDNLGENQLFW 666
Query: 172 CFQSVTLGLISHGRMVINPTLLPKPKTL 199
+ + + L+ + + L PKP L
Sbjct: 667 GQRPLQIVLLLLAVIAVPWMLFPKPFIL 694
>gi|328766557|gb|EGF76611.1| hypothetical protein BATDEDRAFT_92595 [Batrachochytrium
dendrobatidis JAM81]
Length = 778
Score = 41.2 bits (95), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 28/114 (24%), Positives = 57/114 (50%), Gaps = 4/114 (3%)
Query: 227 LVLVNRNARVGRTILNLREVKKAAEE-LGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
+V+++R R ILN ++ K E+ G+ V IF E + + F+ ++G+H
Sbjct: 491 IVVLSRKG--NRLILNEEQMAKDLEKAFGYKV-IFVSNEQHTFEEQIVFLRKARIVIGMH 547
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESS 339
G+ L +F R G+V++++ P ++ A + G++ + K + + SS
Sbjct: 548 GSILVMVMFCRRGTVVIEMYPFAVPGDHYTPYKTLASLNGMDLVYRKWENKHSS 601
>gi|421861291|ref|ZP_16293335.1| capsular polysaccharide biosynthesis protein [Paenibacillus
popilliae ATCC 14706]
gi|410829184|dbj|GAC43772.1| capsular polysaccharide biosynthesis protein [Paenibacillus
popilliae ATCC 14706]
Length = 361
Score = 41.2 bits (95), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 45/103 (43%), Gaps = 6/103 (5%)
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+ V+R R ++N EV + E GF + P S D S +V G+
Sbjct: 231 IYVSRQDAHARYVINEEEVMQVLAEKGFVRIVLTP---LSTVDKISIYSSAQVIVSPFGS 287
Query: 288 GLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLE 330
G ++ F PGS L+++ P+ + YF K + G+ Y E
Sbjct: 288 GSINAAFCNPGSTLIELTPVT---VVDGYFWKISNHAGMNYFE 327
>gi|427721067|ref|YP_007069061.1| hypothetical protein Cal7507_5914 [Calothrix sp. PCC 7507]
gi|427353503|gb|AFY36227.1| hypothetical protein Cal7507_5914 [Calothrix sp. PCC 7507]
Length = 451
Score = 41.2 bits (95), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 40/163 (24%), Positives = 78/163 (47%), Gaps = 15/163 (9%)
Query: 183 HGRMVINPTLLPKPKTLVDFQ-SFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTIL 241
+I P+L+ P + + FL + + +N + +P + +NR+ V R +
Sbjct: 275 QAEQIIAPSLVGVPVQVPGWACDFLRHNFIDNKVAKNR----QPLRIYLNRSQVVHRKVE 330
Query: 242 NLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
N E+ + GF E+ S+ + + S ++ HGAGLT+ +F +PG+ +
Sbjct: 331 NEGEIIEFLSRFGFCNI---ATENLSIWEQAELLSSAEIVIAPHGAGLTNLVFCKPGTKV 387
Query: 302 MQVV-PIGTQWLSTVYFEKPARVLGLEYLEYKIKQEESSLVEK 343
++++ P G +ST+++ V GL+Y Y + E L E+
Sbjct: 388 IELLSPKG---VSTLFWALGNHV-GLDY--YYLLGEGEGLPEE 424
>gi|356566724|ref|XP_003551579.1| PREDICTED: beta-(1,2)-xylosyltransferase-like [Glycine max]
Length = 503
Score = 41.2 bits (95), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 35/62 (56%), Gaps = 3/62 (4%)
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
S+ D + IH ++G HGAGLTH + P +V+++++ + + +F +R GL
Sbjct: 417 SMKDQVQAIHDASVIIGAHGAGLTHIVSALPKTVILEII---SSYYRRPHFAYISRWRGL 473
Query: 327 EY 328
EY
Sbjct: 474 EY 475
>gi|154284636|ref|XP_001543113.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150406754|gb|EDN02295.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 495
Score = 41.2 bits (95), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 21/75 (28%), Positives = 36/75 (48%), Gaps = 3/75 (4%)
Query: 265 STSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVL 324
S + + + + GVHGAGLTH +FL P S + +++P L F A+ +
Sbjct: 384 SLPFREQIKLVRHTDILAGVHGAGLTHGMFLPPHSTIAEILPPK---LENKGFRNLAKKM 440
Query: 325 GLEYLEYKIKQEESS 339
G +Y + ++S
Sbjct: 441 GHKYFSSHAAEHQTS 455
>gi|149728372|ref|XP_001498814.1| PREDICTED: uncharacterized glycosyltransferase AER61-like isoform 2
[Equus caballus]
Length = 443
Score = 41.2 bits (95), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 6/97 (6%)
Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R ILN E+ A + + F+V I + + + D R H+ +G+HGAGLTH LFL
Sbjct: 293 RKILNQNELVNALKTVSTFEVQIVDYKYKQLGFLDQLRITHNTDIFIGMHGAGLTHLLFL 352
Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+ + ++ + Y + AR+ G+ Y+ ++
Sbjct: 353 PDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 385
>gi|300865616|ref|ZP_07110390.1| hypothetical protein OSCI_2080002 [Oscillatoria sp. PCC 6506]
gi|300336371|emb|CBN55540.1| hypothetical protein OSCI_2080002 [Oscillatoria sp. PCC 6506]
Length = 1330
Score = 41.2 bits (95), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 47/91 (51%), Gaps = 8/91 (8%)
Query: 219 SFHHTKPKL-----VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFR 273
S +KPK+ + ++R R +N EV ++ GF+ E + S+A+
Sbjct: 1184 SREKSKPKIDKFRRIYISRELASYRRFINEEEVLNFLDKWGFESVKLE---TMSVAEQAS 1240
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV 304
+ + +V HGAGLT+ +F +PG+ ++++
Sbjct: 1241 CLANAEVVVAPHGAGLTNIVFCKPGTKIIEI 1271
>gi|428777356|ref|YP_007169143.1| hypothetical protein PCC7418_2797 [Halothece sp. PCC 7418]
gi|428691635|gb|AFZ44929.1| hypothetical protein PCC7418_2797 [Halothece sp. PCC 7418]
Length = 397
Score = 41.2 bits (95), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 59/139 (42%), Gaps = 15/139 (10%)
Query: 170 THCFQSVTLGLISHGRMVINPTLLPK--PKTLVDFQSFLANAYNENTNTSSSFHHTKPKL 227
+ CF S + GR++ TLL P T ++ +L S P
Sbjct: 205 SKCFWSYQ----NKGRVLYTDTLLAPTFPGTWRNYPQWLPKFLQREITPSPPL----PTR 256
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
L R ++N +K + GF+ I+ PEE + F +VG HGA
Sbjct: 257 RLYISRTGYRRNVVNEEAIKPILIKYGFE--IYNPEEHENQPYDFA---EATIIVGPHGA 311
Query: 288 GLTHSLFLRPGSVLMQVVP 306
GLT+ F +PG+ +++++P
Sbjct: 312 GLTNLAFCQPGTKVLELMP 330
>gi|39930531|ref|NP_775925.1| EGF domain-specific O-linked N-acetylglucosamine transferase
precursor [Homo sapiens]
gi|38173838|gb|AAH60887.1| Chromosome 3 open reading frame 64 [Homo sapiens]
gi|312153034|gb|ADQ33029.1| chromosome 3 open reading frame 64 [synthetic construct]
Length = 443
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R ILN E+ A + + F+V I + + D R H+ +G+HGAGLTH LFL
Sbjct: 293 RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 352
Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+ + ++ + Y + AR+ G+ Y+ ++
Sbjct: 353 PDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 385
>gi|440790833|gb|ELR12100.1| hypothetical protein ACA1_077520, partial [Acanthamoeba castellanii
str. Neff]
Length = 484
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 51/93 (54%), Gaps = 15/93 (16%)
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAM- 281
T+ + +++R+ + R +LN +E+ A ++ + L D+ F A+
Sbjct: 161 TESHVGIISRSFK--RFLLNEQELLHATLQMNVSAELL-------LFDTLPFYQQVQALR 211
Query: 282 -----VGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
VG+HG+GLT++L+L+ G+VL+Q++P T
Sbjct: 212 RTTVLVGMHGSGLTNALYLQRGAVLLQIMPFKT 244
>gi|224008206|ref|XP_002293062.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971188|gb|EED89523.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 428
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 56/127 (44%), Gaps = 11/127 (8%)
Query: 213 NTNTSSSFHHTKPKLVLVNRNAR--VGRTILNLREVKKAAEELGFDVTIFEPEESTSLAD 270
N++ S T L++ R R VG + EV A LG V + E E+ S
Sbjct: 267 NSSPLSGVEDTGHVLIVQRRGDRRLVGSRNGSFVEVVDAICNLGIPVKVIEFED-LSAEQ 325
Query: 271 SFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV------VPIGTQWLSTVYFEKPARVL 324
+ S + +V HGA L+H+ +++PG +++V V G + AR
Sbjct: 326 QIQSAQSANVLVAAHGAALSHAAWMKPGGAIVEVLMRQGFVEFGD--YHKADYANLARFF 383
Query: 325 GLEYLEY 331
GL+Y+ Y
Sbjct: 384 GLKYVYY 390
>gi|297670905|ref|XP_002813591.1| PREDICTED: uncharacterized glycosyltransferase AER61 isoform 2
[Pongo abelii]
Length = 444
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R ILN E+ A + + F+V I + + D R H+ +G+HGAGLTH LFL
Sbjct: 294 RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 353
Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+ + ++ + Y + AR+ G+ Y+ ++
Sbjct: 354 PDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWR 386
>gi|225556661|gb|EEH04949.1| DUF563 domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 501
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/70 (28%), Positives = 35/70 (50%), Gaps = 3/70 (4%)
Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYL 329
+ + + + GVHGAGLTH +FL P S + +++P L F A+ +G +Y
Sbjct: 395 EQIKLVRHTDILTGVHGAGLTHGMFLPPHSTIAEILPPK---LENKGFRNLAKKMGHKYF 451
Query: 330 EYKIKQEESS 339
+ ++S
Sbjct: 452 SSHAAEHQTS 461
>gi|240281524|gb|EER45027.1| DUF563 domain-containing protein [Ajellomyces capsulatus H143]
Length = 501
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/70 (28%), Positives = 35/70 (50%), Gaps = 3/70 (4%)
Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYL 329
+ + + + GVHGAGLTH +FL P S + +++P L F A+ +G +Y
Sbjct: 395 EQIKLVRHTDILTGVHGAGLTHGMFLPPHSTIAEILPPK---LENKGFRNLAKKMGHKYF 451
Query: 330 EYKIKQEESS 339
+ ++S
Sbjct: 452 SSHAAEHQTS 461
>gi|407409787|gb|EKF32485.1| hypothetical protein MOQ_003667 [Trypanosoma cruzi marinkellei]
Length = 479
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 17/59 (28%), Positives = 36/59 (61%), Gaps = 3/59 (5%)
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
F+H+ + HGAG+ + + + PG V++++ P G ++ +++ A +LGL Y+ Y+
Sbjct: 351 FMHNSSIFIAAHGAGMVNIMSMSPGGVVVELFPYGFRY---AMYQELAELLGLHYIAYE 406
>gi|219123206|ref|XP_002181920.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406521|gb|EEC46460.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 467
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 42/82 (51%), Gaps = 6/82 (7%)
Query: 232 RNARVGRTILNLREVKKAAEELGFDVTIFEPEESTS------LADSFRFIHSCHAMVGVH 285
+ AR GR N EV +A + G + F+ ++++ + D + + A+VGVH
Sbjct: 312 KEARQGRRCSNEEEVVEALKRNGVNTIFFDLAKASTSSGSDVIEDLMALLDNACAIVGVH 371
Query: 286 GAGLTHSLFLRPGSVLMQVVPI 307
G GL + F + L++++PI
Sbjct: 372 GGGLYNQYFAPATTALVELIPI 393
>gi|324510177|gb|ADY44259.1| Glycosyltransferase AER61 [Ascaris suum]
Length = 522
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 57/261 (21%), Positives = 114/261 (43%), Gaps = 36/261 (13%)
Query: 90 CGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFIT--INSHFPDQDVILAIADC------ 141
C V P +V + N +H F D F+ L+++ +N F D D+ + D
Sbjct: 214 CDVIFERPTVVMKLDA-SINMYHHFCD-FINLYLSQHLNGSF-DSDIDILWWDTFPGGFI 270
Query: 142 ----NDQWARKYAELLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPK 197
W R ++ P + +I+++ + + CF+ V L L++ R+ + + P
Sbjct: 271 DASFGATW-RAFSLRQP----YELISLDQKRV--CFRQVMLPLLARQRLGLYYNM---PV 320
Query: 198 TLVDFQSFLANAYNENTNTSSSFHHTKP-----KLVLVNRNARVGRTILNLREVKKAAEE 252
S L +A++++ + T P ++ L++R+ R I+N E+
Sbjct: 321 IDGCEGSGLFHAFSKHILHRLRVNQTGPLLNSVRVTLLSRSTNF-RRIVNENELLDVLNG 379
Query: 253 L-GFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQW 311
+ G + + + D H+ +G+HG+GLTH LFL +V+ ++
Sbjct: 380 ISGVVAQKVDYNSNVAFFDQLSITHNTDIFIGMHGSGLTHLLFLPDWAVIFEIYNCED-- 437
Query: 312 LSTVYFEKPARVLGLEYLEYK 332
+ Y++ AR+ G++Y +K
Sbjct: 438 -VSCYYD-LARLRGVKYFTWK 456
>gi|325087670|gb|EGC40980.1| DUF563 domain-containing protein [Ajellomyces capsulatus H88]
Length = 501
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/70 (28%), Positives = 35/70 (50%), Gaps = 3/70 (4%)
Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYL 329
+ + + + GVHGAGLTH +FL P S + +++P L F A+ +G +Y
Sbjct: 395 EQIKLVRHTDILTGVHGAGLTHGMFLPPHSTIAEILPPK---LENKGFRNLAKKMGHKYF 451
Query: 330 EYKIKQEESS 339
+ ++S
Sbjct: 452 SSHAAEHQTS 461
>gi|307104008|gb|EFN52264.1| expressed protein [Chlorella variabilis]
Length = 534
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 37/135 (27%), Positives = 61/135 (45%), Gaps = 9/135 (6%)
Query: 202 FQSFLANAYNENTNTSSSFHHTKPKLVLV-NRNARVGRTILNLREVKKAAEEL--GFDVT 258
++ L Y + +T + + K+VL+ +R GR +LN+R V+K + G DV
Sbjct: 313 WRERLGACYGLDVHTPAPVQN---KIVLILDRPYEAGRHMLNVRSVEKFVRQRFPGDDVR 369
Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG--TQWLSTVY 316
+ + E D R + S +V VHG+ + FL P +V + + P T L V+
Sbjct: 370 L-QYFEDVPFPDQLRIMGSVSILVMVHGSAIALWPFLPPKAVAIHISPDAESTDSLQRVW 428
Query: 317 FEKPARVLGLEYLEY 331
E R G L +
Sbjct: 429 AEAYVRYWGFTGLSF 443
>gi|397480752|ref|XP_003811635.1| PREDICTED: EGF domain-specific O-linked N-acetylglucosamine
transferase isoform 2 [Pan paniscus]
Length = 443
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R ILN E+ A + + F+V I + + D R H+ +G+HGAGLTH LFL
Sbjct: 293 RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 352
Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+ + ++ + Y + AR+ G+ Y+ ++
Sbjct: 353 PDWAAVFELYNCEDE---RCYLDL-ARLRGVHYITWQ 385
>gi|159473064|ref|XP_001694659.1| predicted protein [Chlamydomonas reinhardtii]
gi|158276471|gb|EDP02243.1| predicted protein [Chlamydomonas reinhardtii]
Length = 911
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 15/35 (42%), Positives = 25/35 (71%)
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIG 308
+ +VGVHG+ LT+++F+RPGS L+++ P G
Sbjct: 752 LVRQMDVLVGVHGSALTNAMFMRPGSSLIELRPFG 786
>gi|405122953|gb|AFR97718.1| hypothetical protein CNAG_01513 [Cryptococcus neoformans var.
grubii H99]
Length = 461
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/91 (24%), Positives = 47/91 (51%), Gaps = 12/91 (13%)
Query: 224 KPKLVLVNR-------NARVGRTIL-NLREVKKAAEELGFDVTIFEPEESTSLADSFRFI 275
KPK+V V+R + V +L L+++++A + + D + + E+ F
Sbjct: 302 KPKIVYVDRQRTTRKFDVEVHTQLLKQLKKIERAKKAVVVDAVLEDLEKKKQ----FEMF 357
Query: 276 HSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
++G+HG GL H L++ G ++++++P
Sbjct: 358 TDADIILGIHGNGLAHELWMPEGGIVIEILP 388
>gi|158335347|ref|YP_001516519.1| hypothetical protein AM1_2192 [Acaryochloris marina MBIC11017]
gi|158305588|gb|ABW27205.1| hypothetical protein AM1_2192 [Acaryochloris marina MBIC11017]
Length = 394
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 45/91 (49%), Gaps = 7/91 (7%)
Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
R I+N E+ A + LGF+ I+E E+ SL + + +VG +G G+TH LF
Sbjct: 282 RHIVNEAELIAALKPLGFE--IYELED-YSLEEKVELFYDAEIVVGAYGGGITHVLFSES 338
Query: 298 GSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
+L V TQ Y+ A+ LG +Y
Sbjct: 339 VKILELQVMAKTQ----TYYYYLAKALGHDY 365
>gi|340723323|ref|XP_003400040.1| PREDICTED: uncharacterized glycosyltransferase AER61-like [Bombus
terrestris]
Length = 557
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 55/262 (20%), Positives = 112/262 (42%), Gaps = 25/262 (9%)
Query: 85 PP--NLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN---SH---FPDQDVIL 136
PP N C + P + N +H F D F L+ +++ SH F + I+
Sbjct: 211 PPIVNHDCDIVIEKPTYIMKIDAI-VNMYHHFCD-FFNLYASLHVNLSHPAAFSTDNHIM 268
Query: 137 AIADCNDQWARKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGRMVI----NPT 191
+ + + + FTR+P+ +++ + T CF+++ L+ RM+ N
Sbjct: 269 IWESYS--YRSAFQDAFQAFTRNPLWDLHTFRGETVCFKNLVFPLLP--RMIFGLYYNTP 324
Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAE 251
L+ + F++F + + + + ++ L++R+ + R ILN E+ KA +
Sbjct: 325 LIYGCEKSGLFKAFGDHVLHRLRTPHHERKNQRIRVTLLSRDTQY-RRILNEDELTKALK 383
Query: 252 E-LGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
E + V + S + +G+HGAGLTH +FL + + ++
Sbjct: 384 ENPEYKVRKVIYNKKISFKKQLEITRNSDIFIGIHGAGLTHLMFLPDWAAVFEIYNCEDP 443
Query: 311 WLSTVYFEKPARVLGLEYLEYK 332
++ AR+ G++Y ++
Sbjct: 444 GC----YKDLARLRGVKYFTWE 461
>gi|407849199|gb|EKG04022.1| hypothetical protein TCSYLVIO_004922 [Trypanosoma cruzi]
Length = 479
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 17/59 (28%), Positives = 37/59 (62%), Gaps = 3/59 (5%)
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
F+H+ + HGAG+ + + + PG+V++++ P G ++ +++ A +LGL Y+ Y+
Sbjct: 351 FMHNSSIFIAAHGAGMVNIMSMSPGTVVVELFPHGFRY---AMYQELAELLGLHYIAYE 406
>gi|356566720|ref|XP_003551577.1| PREDICTED: beta-(1,2)-xylosyltransferase-like [Glycine max]
Length = 503
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 35/67 (52%), Gaps = 13/67 (19%)
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKP-----A 321
S+ D R I ++G HGAGLTH + P +V+++++ + YF +P +
Sbjct: 417 SMKDQVRAIQDASVIIGAHGAGLTHIVSALPKTVILEII--------SSYFRRPHFAYIS 468
Query: 322 RVLGLEY 328
R GLEY
Sbjct: 469 RWKGLEY 475
>gi|87309616|ref|ZP_01091750.1| hypothetical protein DSM3645_02353 [Blastopirellula marina DSM
3645]
gi|87287380|gb|EAQ79280.1| hypothetical protein DSM3645_02353 [Blastopirellula marina DSM
3645]
Length = 397
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 68/173 (39%), Gaps = 18/173 (10%)
Query: 171 HCFQSVTLGLISHGRMVINPT---------LLPKPKTLVDFQSFLANAY-NENTNTSSSF 220
H F S +L L+ R I P L P F AY E T +
Sbjct: 190 HAFASESLQLLGIERSQILPMGHYTHIQAKQLIIPSVACYFPQPAGVAYLREKMQTQAWS 249
Query: 221 HHTKP-KLVL-VNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSC 278
H+ + +L L + R R I+N E+ A + LGF E L +
Sbjct: 250 HYERDDRLKLYIARRRFTSRYIVNEAELFAALKPLGFQQVYLE---DLPLKKQIQLFQQA 306
Query: 279 HAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY 331
+VG HGAG ++ ++ RPG+ + ++ P ++F A + L+Y Y
Sbjct: 307 GVIVGPHGAGFSNLVYSRPGTAVFEITPTCR---PPLFFHYLAEINALQYAVY 356
>gi|400530674|gb|AFP86488.1| glycosyltransferase, partial [Arapaima gigas]
Length = 276
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 59/117 (50%), Gaps = 6/117 (5%)
Query: 227 LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
LV+ +R+ + R ILN E+ A A+E EE SLA+ R I +V +H
Sbjct: 38 LVVFSRS--INRLILNEAELILALAQEFQMKAITVSLEEH-SLAEIIRVISRASVLVSMH 94
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
GA L SLFL G+ ++++ P ++ A + G L+Y+ +K EE+S+
Sbjct: 95 GAQLITSLFLPRGAAVVELFPYAVNPDHYTPYKTLASLPGMDLQYVAWKNTIEENSV 151
>gi|414341636|ref|YP_006983157.1| hypothetical protein B932_0621 [Gluconobacter oxydans H24]
gi|411026971|gb|AFW00226.1| hypothetical protein B932_0621 [Gluconobacter oxydans H24]
Length = 363
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 52/102 (50%), Gaps = 7/102 (6%)
Query: 202 FQSFLANAYNENTNTSSSFHHTKPKLVLVNRN--ARVGRTILNLREVKKAAEELGFDVTI 259
F SF+ + YN + K + +LV+R+ + R N + A E G+++T
Sbjct: 208 FHSFVRDFYNSLKPDIAP--EKKSRKILVSRSDFSHSNRAFPNRAAFETMAAERGYEIT- 264
Query: 260 FEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
+PE+ SL + + A++G HG+G+ +LF PG+V+
Sbjct: 265 -QPEK-LSLEEQIQIFSEAAAVIGEHGSGMHSTLFSSPGTVV 304
>gi|219117675|ref|XP_002179628.1| betaxylosyltransferase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408681|gb|EEC48614.1| betaxylosyltransferase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 483
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 62/137 (45%), Gaps = 31/137 (22%)
Query: 193 LPKPK-----TLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNA----------RVG 237
LP+P+ + V SF+ Y NT S H ++V++ R +V
Sbjct: 294 LPRPRESNCRSPVQMSSFVD--YVLNTYNLSHIHRIPGRVVILERIPYLDHPRRDLNKVE 351
Query: 238 RTILNLREVKKAAEE-----LGFDVTIFEPEESTSLAD-----SFRFIHSCHAMVGVHGA 287
R + NLR + ++ + G +VT+ E +L D R I H +V HGA
Sbjct: 352 RLLTNLRPLAESLPQRLTPYYGVNVTV----EVAALVDKPMNEQIRTIREAHVLVANHGA 407
Query: 288 GLTHSLFLRPGSVLMQV 304
GLTHS+FL + ++++
Sbjct: 408 GLTHSIFLDDATHVVEM 424
>gi|394337573|gb|AFN27774.1| glycosyltransferase, partial [Ostracion cubicus]
Length = 274
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 78/190 (41%), Gaps = 43/190 (22%)
Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R ILN E+ A A+EL V EE S + I +V +HGA L SLFL
Sbjct: 52 RLILNEAELIMALAQELQMRVLTVSLEEQ-SFPSIVQVIGGASMLVSMHGAQLITSLFLP 110
Query: 297 PGSVLMQVVPIGTQ----------------------WLST------VYFEKPARVLGLEY 328
PG+V++++ P W +T + ++P G+ +
Sbjct: 111 PGAVVVELFPFAVNPDQYTPYRTLASLPGMDLHYIPWRNTEEENTVTHPDRPWEQGGIAH 170
Query: 329 LEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
LE +++E + K L +NP+ R+Y Q+ +DI F L++
Sbjct: 171 LEK--EEQERIMASKEVPRHLCCRNPEWL-------FRIY---QDTLVDIPSFLEVLQEG 218
Query: 389 YKKAKKLMDK 398
KAK ++ K
Sbjct: 219 L-KAKPILKK 227
>gi|359459096|ref|ZP_09247659.1| hypothetical protein ACCM5_10223 [Acaryochloris sp. CCMEE 5410]
Length = 385
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 45/91 (49%), Gaps = 7/91 (7%)
Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
R I+N E+ A + LGF+ I+E E+ SL + + +VG +G G+TH LF
Sbjct: 273 RHIVNEAELIAALKPLGFE--IYELED-FSLEEKVELFYDAEIVVGAYGGGITHVLFSES 329
Query: 298 GSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
+L V TQ Y+ A+ LG +Y
Sbjct: 330 VKILELQVMAKTQ----TYYYYLAKALGHDY 356
>gi|400530664|gb|AFP86483.1| glycosyltransferase, partial [Heteroconger hassi]
Length = 245
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 65/140 (46%), Gaps = 8/140 (5%)
Query: 204 SFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLRE-VKKAAEELGFDVTIFEP 262
+FLA N + +V+ +R+ + R ILN E + + A+E V
Sbjct: 1 AFLAEMLN--VTVEEPAEKSDEYIVVFSRS--LNRLILNEAELILELAQEFQMKVLTVSL 56
Query: 263 EESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPAR 322
EE AD R I +V +HGA L SLFL G+ ++++ P G ++
Sbjct: 57 EEQ-PFADIVRVISKASMLVSMHGAQLVTSLFLPRGAAVVELFPYGVNPEHYAPYKTLTS 115
Query: 323 VLG--LEYLEYKIKQEESSL 340
+ G L+Y+ ++ +EE+S+
Sbjct: 116 LPGMDLQYVAWRNTKEENSV 135
>gi|389634703|ref|XP_003715004.1| hypothetical protein MGG_08052 [Magnaporthe oryzae 70-15]
gi|351647337|gb|EHA55197.1| hypothetical protein MGG_08052 [Magnaporthe oryzae 70-15]
Length = 623
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 55/116 (47%), Gaps = 12/116 (10%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE-ESTSLADSFRFIHS-CHAM 281
K KL ++R R R +L ++ +A DV I + S S + R + S +
Sbjct: 398 KIKLTFIDR--RGSRKLLGQEKLLEATRRAHPDVEIRSIDFASLSFVEQIRLVRSETDIL 455
Query: 282 VGVHGAGLTHSLFLR-----PGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
VG HGAGLTH +FL+ G ++++ P + +S F A +LG EY K
Sbjct: 456 VGAHGAGLTHIMFLQGEWEGGGKAVVEIQP---EKMSYKGFRNLAYMLGHEYFVAK 508
>gi|451818782|ref|YP_007454983.1| hypothetical protein DUF563 [Clostridium saccharoperbutylacetonicum
N1-4(HMT)]
gi|451784761|gb|AGF55729.1| hypothetical protein DUF563 [Clostridium saccharoperbutylacetonicum
N1-4(HMT)]
Length = 375
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 40/167 (23%), Positives = 78/167 (46%), Gaps = 13/167 (7%)
Query: 171 HCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENT-NTSSSFHHTKPKLVL 229
+C+ L IS ++I P L K TL+ ++ ++ N N + + S + T + +
Sbjct: 191 YCYDIEKLIYISD--LLIGPIDLKK-NTLLRYEDYIMNNLAANLLHDNLSINGTASRKLF 247
Query: 230 VNRNARVG-RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAG 288
++R +G R +LN ++++ +E G+++ E S D + + G GAG
Sbjct: 248 MSRKNCLGTRLLLNQDKIEQIFKEYGYEIIY---PEIMSFQDQLKIFSEAEFIAGASGAG 304
Query: 289 LTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKI 333
T+ LF ++ ++P + + + A VLG + YL+ KI
Sbjct: 305 FTNILFANKSVKIICILP---KEIKLSCYSNIAGVLGQQCYYLDAKI 348
>gi|356524323|ref|XP_003530779.1| PREDICTED: beta-(1,2)-xylosyltransferase-like [Glycine max]
Length = 503
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 35/62 (56%), Gaps = 3/62 (4%)
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
S+ D + +H ++G HGAGLTH + P +V+++++ + + +F +R GL
Sbjct: 417 SMKDQVQAVHDASVIIGAHGAGLTHIVSALPKTVILEII---SSYYRRPHFAYISRWRGL 473
Query: 327 EY 328
EY
Sbjct: 474 EY 475
>gi|394337315|gb|AFN27645.1| glycosyltransferase, partial [Apogon lateralis]
Length = 278
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 6/117 (5%)
Query: 227 LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
+V+++R+ R ILN E+ A A+E V EE S + I C +V +H
Sbjct: 52 IVVLSRST--TRLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSIVQVISGCSMLVSMH 108
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
GA L SLFL G+V++++ P ++ A + G L Y+ ++ +EE ++
Sbjct: 109 GAQLITSLFLPRGAVVVELFPFAVNPERYTPYKTLASLPGIDLHYISWRNTEEEDTV 165
>gi|440470541|gb|ELQ39608.1| DUF563 domain-containing protein [Magnaporthe oryzae Y34]
gi|440488522|gb|ELQ68247.1| DUF563 domain-containing protein [Magnaporthe oryzae P131]
Length = 614
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 55/116 (47%), Gaps = 12/116 (10%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE-ESTSLADSFRFIHS-CHAM 281
K KL ++R R R +L ++ +A DV I + S S + R + S +
Sbjct: 389 KIKLTFIDR--RGSRKLLGQEKLLEATRRAHPDVEIRSIDFASLSFVEQIRLVRSETDIL 446
Query: 282 VGVHGAGLTHSLFLR-----PGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
VG HGAGLTH +FL+ G ++++ P + +S F A +LG EY K
Sbjct: 447 VGAHGAGLTHIMFLQGEWEGGGKAVVEIQP---EKMSYKGFRNLAYMLGHEYFVAK 499
>gi|300121625|emb|CBK22143.2| unnamed protein product [Blastocystis hominis]
Length = 549
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 41/84 (48%), Gaps = 1/84 (1%)
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
+K L V+RN R I N+ E+ + + + + S A+ + S +
Sbjct: 388 SKLALRFVDRNT-ANRGIKNVEEIVELLKNQNGTEFTRQVSQQMSFAEQVISMFSIDILF 446
Query: 283 GVHGAGLTHSLFLRPGSVLMQVVP 306
VHGAGLT +F+ PGS ++++ P
Sbjct: 447 SVHGAGLTSVVFMLPGSAVLEIFP 470
>gi|152967605|ref|YP_001363389.1| capsular polysaccharide biosynthesis protein-like protein
[Kineococcus radiotolerans SRS30216]
gi|151362122|gb|ABS05125.1| Capsular polysaccharide biosynthesis protein-like [Kineococcus
radiotolerans SRS30216]
Length = 381
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 3/79 (3%)
Query: 228 VLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGA 287
+ ++R A R +LN V A GF+V + S+AD S + GV GA
Sbjct: 248 IFLDRPATDKRRLLNRDRVLTAVRSGGFEVV---EAGALSVADQAALFASADVVAGVLGA 304
Query: 288 GLTHSLFLRPGSVLMQVVP 306
GLT+ ++ PG+ +++++P
Sbjct: 305 GLTNLVYCYPGTTVVEILP 323
>gi|307111760|gb|EFN59994.1| hypothetical protein CHLNCDRAFT_49458 [Chlorella variabilis]
Length = 656
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/116 (20%), Positives = 56/116 (48%), Gaps = 9/116 (7%)
Query: 199 LVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDV- 257
+V + A ++ +++ + ++V + R+ GR +LN E+ + ++
Sbjct: 399 VVGLEPRPAGGGEGGSSDAAAGGTQRLRVVFLQRDGE-GRQLLNAEELLERCNAWRYEPA 457
Query: 258 ---TIFEPE----ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
T F E + ++A+ +VG+HGA + ++ F+RPGS +++++P
Sbjct: 458 GSGTTFAAECRQARTPTVAEGVAVAQEADVLVGMHGANMANAFFMRPGSSVVELMP 513
>gi|11191827|emb|CAC16419.1| hypothetical protein [Verticillium dahliae]
Length = 291
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 36/66 (54%), Gaps = 4/66 (6%)
Query: 265 STSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPG-SVLMQVVPIGTQWLSTVYFEKPARV 323
+ + A+ R + +VGVHGAGLTH +F+R +++V P GT ++ F A +
Sbjct: 158 ALTFAEQMRTVRETDVLVGVHGAGLTHVMFMRENVGAIVEVQP-GT--MTHAGFRNVAAM 214
Query: 324 LGLEYL 329
GL Y
Sbjct: 215 RGLGYF 220
>gi|193786960|dbj|BAG52283.1| unnamed protein product [Homo sapiens]
Length = 206
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 238 RTILNLREVKKAAEELG-FDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R ILN E+ A + + F+V I + + D R H+ +G+HGAGLTH LFL
Sbjct: 56 RKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 115
Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+ + ++ + Y + AR+ G+ Y+ ++
Sbjct: 116 PDWAAVFELYNCEDE---RCYLD-LARLRGVHYITWR 148
>gi|118344568|ref|NP_001072049.1| glycosyltransferase-like domain-containing protein 2 precursor
[Takifugu rubripes]
gi|82076790|sp|Q5NDE4.1|GTDC2_TAKRU RecName: Full=Glycosyltransferase-like domain-containing protein 2;
AltName: Full=Extracellular O-linked N-acetylglucosamine
transferase-like; Flags: Precursor
gi|56605354|emb|CAI30874.1| glycosyltransferase [Takifugu rubripes]
Length = 590
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 61/123 (49%), Gaps = 9/123 (7%)
Query: 224 KPK---LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCH 279
KPK +V+ +R+ R ILN E+ A A+E V EE S + I
Sbjct: 293 KPKDEYIVVFSRST--TRLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSIVQVISGAS 349
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEE 337
+V +HGA L SLFL PG+V++++ P + A + G++ Y+ ++ +EE
Sbjct: 350 MLVSMHGAQLITSLFLPPGAVVVELYPFAVNPDQYTPYRTLASLPGMDLHYIPWRNTEEE 409
Query: 338 SSL 340
+++
Sbjct: 410 NTV 412
>gi|119493195|ref|ZP_01624070.1| TPR repeat protein [Lyngbya sp. PCC 8106]
gi|119452760|gb|EAW33937.1| TPR repeat protein [Lyngbya sp. PCC 8106]
Length = 899
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 52/104 (50%), Gaps = 6/104 (5%)
Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
P+ + ++R R ++N EV + + LGF V +F ES S ++ + +V
Sbjct: 758 PERIYISRAKARHRQVMNETEVMEVLKPLGF-VEVFL--ESMSFSEQVQLFSQAKVIVAA 814
Query: 285 HGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
HG+GLT+ +F S ++V+ + Y+ ++ LGLE+
Sbjct: 815 HGSGLTNIVFC---SKEVKVIEFISPHYERYYYRVISQYLGLEH 855
>gi|400530920|gb|AFP86611.1| glycosyltransferase, partial [Tetraodon miurus]
Length = 284
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 59/117 (50%), Gaps = 6/117 (5%)
Query: 227 LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
+V+ +R+A R ILN E+ A A+E V EE S + I +V +H
Sbjct: 48 IVVFSRSA--TRLILNEAELIMALAQEFQMRVLTVSLEEQ-SFPSIIQVISGASMLVSMH 104
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
GA L SLFL PG+V++++ P + A + G++ Y+ ++ +E+++L
Sbjct: 105 GAQLITSLFLPPGAVVVELFPFAVNPDQYTPYRTLASLPGMDLHYIPWRNTEEQNTL 161
>gi|322786227|gb|EFZ12831.1| hypothetical protein SINV_09199 [Solenopsis invicta]
Length = 579
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 63/301 (20%), Positives = 130/301 (43%), Gaps = 28/301 (9%)
Query: 109 NFFHEFMDCFVPLFITINSHFP-----DQDVILAIADCNDQWARKYAELLPRFTRHPIIN 163
N +H F D F L+ +++ + D D + I + + + + + FTR+P+ +
Sbjct: 260 NMYHHFCD-FFNLYASLHVNLSHPSAFDTDNHVMIWE-SYSYRSAFQDTFEAFTRNPLWD 317
Query: 164 INN-QTITHCFQSVTLGLISHGRMVI----NPTLLPKPKTLVDFQSFLANAYNENTNTSS 218
+ + T CF+++ L+ RM+ N L+ + F++F + +
Sbjct: 318 LKTFRGETVCFRNLVFPLLP--RMIFGLFYNTPLIYGCEKSGLFKAFSEHLLHRLQIPLH 375
Query: 219 SFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELG-FDVTIFEPEESTSLADSFRFIHS 277
K ++ L++R+ + R ILN E+ KA +E + V ++ +
Sbjct: 376 ERKDRKIRVTLLSRDTQY-RKILNENELVKALKENPVYKVRKVVYNKNIPFKKQLEITRN 434
Query: 278 CHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEE 337
+G+HGAGLTH +FL + + ++ ++ AR+ G++Y +K +
Sbjct: 435 SDIFIGIHGAGLTHLMFLPDWAAVFEIYNCE----DPNCYQDLARLRGVKYFTWK---DT 487
Query: 338 SSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKLMD 397
S LV++ +P A A ++N +K + + R + +A+KK + +
Sbjct: 488 SKLVQQDPGT-----HPDGGAHAKFTNYSFDVKEFLRIVSLARNHVKHHNAFKKFNDIHN 542
Query: 398 K 398
K
Sbjct: 543 K 543
>gi|394337317|gb|AFN27646.1| glycosyltransferase, partial [Cheilodipterus quinquelineatus]
Length = 288
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 75/190 (39%), Gaps = 43/190 (22%)
Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R ILN E+ A A+E V EE S + I C +V +HGA L SLFL
Sbjct: 59 RLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSIVQVISGCSMLVSMHGAQLITSLFLP 117
Query: 297 PGSVLMQVVPIGTQ----------------------WLST------VYFEKPARVLGLEY 328
G+V++++ P W +T + ++P G+ +
Sbjct: 118 RGAVVVELFPFAVNPEQYTPYKTLASLPGIDLHYISWRNTDEENTVTHPDRPWEQGGISH 177
Query: 329 LEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
LE ++E L K L +NP+ R+Y Q+ +DI F LK+
Sbjct: 178 LEK--DEQERILASKDVPRHLCCRNPEWL-------FRIY---QDTLVDIPSFLDALKEG 225
Query: 389 YKKAKKLMDK 398
K K +M K
Sbjct: 226 M-KTKPVMKK 234
>gi|453329814|dbj|GAC88062.1| hypothetical protein NBRC3255_1723 [Gluconobacter thailandicus NBRC
3255]
Length = 363
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 52/103 (50%), Gaps = 9/103 (8%)
Query: 202 FQSFLANAYNE-NTNTSSSFHHTKPKLVLVNRN--ARVGRTILNLREVKKAAEELGFDVT 258
F SF+ + YN N S K + +LV+R+ + R N + A E G++
Sbjct: 208 FHSFVRDFYNSLKPNISI---EKKSRKILVSRSDFSHSNRAFPNRAAFETMAAERGYE-- 262
Query: 259 IFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
I +PE+ SL + + A++G HG+G+ +LF PG+V+
Sbjct: 263 IIQPEK-LSLEEQIQIFSEAAAVIGEHGSGMHSTLFSSPGTVV 304
>gi|354594765|ref|ZP_09012802.1| hypothetical protein CIN_14980 [Commensalibacter intestini A911]
gi|353671604|gb|EHD13306.1| hypothetical protein CIN_14980 [Commensalibacter intestini A911]
Length = 372
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 20/82 (24%), Positives = 41/82 (50%), Gaps = 3/82 (3%)
Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
PK ++R R + N E+ + E+ GF + E +L + + ++G
Sbjct: 241 PKRFYIDRRTASNRVLHNEDELVEVLEQRGFAIVQLE---YLTLDEQAGLFSNAEYIIGA 297
Query: 285 HGAGLTHSLFLRPGSVLMQVVP 306
HGAGL + +F +P + +++++P
Sbjct: 298 HGAGLANMVFCQPETKIIELMP 319
>gi|448310078|ref|ZP_21499930.1| capsular polysaccharide biosynthesis protein-like protein
[Natronorubrum bangense JCM 10635]
gi|445588408|gb|ELY42651.1| capsular polysaccharide biosynthesis protein-like protein
[Natronorubrum bangense JCM 10635]
Length = 225
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 18/70 (25%), Positives = 35/70 (50%), Gaps = 3/70 (4%)
Query: 225 PKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGV 284
P + ++R R + N E +K + G + E + SL + F+ +S ++G+
Sbjct: 92 PDWIYISRQKANRRKVKNYEEFEKTLGKFGIKPVVLEDQ---SLEEQFKLFNSADGVIGL 148
Query: 285 HGAGLTHSLF 294
HGAG T+ ++
Sbjct: 149 HGAGFTNLIW 158
>gi|380796235|gb|AFE69993.1| putative glycosyltransferase AER61 precursor, partial [Macaca
mulatta]
Length = 153
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 238 RTILNLREVKKAAEEL-GFDVTIFEPE-ESTSLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
R ILN E+ A + + F+V I + + D R H+ +G+HGAGLTH LFL
Sbjct: 3 RKILNQNELVNALKTVPTFEVQIVDYKYRELGFLDQLRITHNTDIFIGMHGAGLTHLLFL 62
Query: 296 RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYK 332
+ + ++ + Y + AR+ G+ Y+ ++
Sbjct: 63 PDWAAVFELYNCEDE---RCYLDL-ARLRGIHYITWQ 95
>gi|428205791|ref|YP_007090144.1| capsular polysaccharide biosynthesis protein-like protein
[Chroococcidiopsis thermalis PCC 7203]
gi|428007712|gb|AFY86275.1| capsular polysaccharide biosynthesis protein-like protein
[Chroococcidiopsis thermalis PCC 7203]
Length = 339
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 42/87 (48%), Gaps = 4/87 (4%)
Query: 215 NTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF 274
N SS P+ + ++R GR ++N +V +A GF + PE+ S AD R
Sbjct: 185 NIESSQRSFSPR-IYISRAKTTGRKVINEDDVLEALSPFGF--VAYAPEK-MSFADEVRL 240
Query: 275 IHSCHAMVGVHGAGLTHSLFLRPGSVL 301
+V HG+GL + +F + SV+
Sbjct: 241 FSQAEMVVAPHGSGLVNIIFAQNLSVI 267
>gi|400530654|gb|AFP86478.1| glycosyltransferase, partial [Amia calva]
Length = 276
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 60/120 (50%), Gaps = 4/120 (3%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
+P+ +V + + R ILN E+ A A+E +E S AD + I +V
Sbjct: 33 EPEEYIVVFSRSINRLILNEAELILALAQEFQMKTITVSLDEH-SFADIVQIISKASMLV 91
Query: 283 GVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
+HGA L SLFL G++++++ P ++ A + G L+Y+ ++ +EE+S+
Sbjct: 92 SMHGAQLVTSLFLPRGAIVVELFPYAVNPEHYTPYKTIASLPGMDLQYVAWQNTKEENSI 151
>gi|400530688|gb|AFP86495.1| glycosyltransferase, partial [Heterotis niloticus]
Length = 276
Score = 39.7 bits (91), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 55/112 (49%), Gaps = 12/112 (10%)
Query: 236 VGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
+ R ILN E+ A A+E EE SLA+ R I +V +HGA L SLF
Sbjct: 45 INRLILNEAELILALAQEFQMKAITVSLEEH-SLAEIIRVISGASMLVSMHGAQLITSLF 103
Query: 295 LRPGSVLMQVVPIGTQWLSTVYFEKPARVLG------LEYLEYKIKQEESSL 340
L G+ ++++ P +S ++ P R L L+Y+ +K EE+S+
Sbjct: 104 LPRGAAVVELFPYA---VSPDHY-TPYRTLASLPGMDLQYVAWKNTIEENSV 151
>gi|307180926|gb|EFN68718.1| Uncharacterized glycosyltransferase AER61 [Camponotus floridanus]
Length = 554
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 52/246 (21%), Positives = 110/246 (44%), Gaps = 23/246 (9%)
Query: 109 NFFHEFMDCFVPLFITINSHFP-----DQDVILAIADCNDQWARKYAELLPRFTRHPIIN 163
N +H F D F L+ +++ + D D + I + + + + + FT++P+ +
Sbjct: 234 NMYHHFCD-FFNLYASLHVNLSHPSVFDTDNHIMIWE-SYSYRSAFQDTFEAFTKNPLWD 291
Query: 164 INN-QTITHCFQSVTLGLISHGRMVI----NPTLLPKPKTLVDFQSFLANAYNENTNTSS 218
+ + T CF+++ L+ RM+ N L+ + F++F + +
Sbjct: 292 LKTFKGKTVCFKNLVFPLLP--RMIFGLYYNTPLIYGCEKSGLFKAFSDHVLHRLRIPLH 349
Query: 219 SFHHTKPKLVLVNRNARVGRTILNLREVKKAAEEL-GFDVTIFEPEESTSLADSFRFIHS 277
K ++ L++R+ + R ILN E+ KA +E + V ++ S +
Sbjct: 350 QRKDRKIRITLLSRDTQY-RKILNENELLKALKENPEYKVKKVVYNKNLSFRKQLEITRN 408
Query: 278 CHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKIKQEE 337
+G+HGAGLTH +FL + L ++ ++ AR+ G++Y + ++
Sbjct: 409 SDIFIGIHGAGLTHLMFLPDWAALFEIYNCE----DPNCYKDLARLRGVKYFTW---EDT 461
Query: 338 SSLVEK 343
S L+++
Sbjct: 462 SKLIQQ 467
>gi|350403304|ref|XP_003486762.1| PREDICTED: uncharacterized glycosyltransferase AER61-like [Bombus
impatiens]
Length = 557
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 55/262 (20%), Positives = 112/262 (42%), Gaps = 25/262 (9%)
Query: 85 PP--NLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITIN---SH---FPDQDVIL 136
PP N C + P + N +H F D F L+ +++ SH F + I+
Sbjct: 211 PPIVNHDCDIVIEKPTYIMKIDAI-VNMYHHFCD-FFNLYASLHVNLSHPATFSTDNHIM 268
Query: 137 AIADCNDQWARKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGRMVI----NPT 191
+ + + + FTR+P+ +++ + T CF+++ L+ RM+ N
Sbjct: 269 IWESYS--YRSAFQDAFQAFTRNPLWDLHTFRGETVCFKNLVFPLLP--RMIFGLYYNTP 324
Query: 192 LLPKPKTLVDFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAE 251
L+ + F++F + + + + ++ L++R+ + R ILN E+ KA +
Sbjct: 325 LIYGCEKSGLFKAFGDHVLHRLRIPHHERKNQRIRVTLLSRDTQY-RRILNEDELTKALK 383
Query: 252 E-LGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQ 310
E + V + S + +G+HGAGLTH +FL + + ++
Sbjct: 384 ENPEYKVRKVVYNKKISFKKQLEITRNSDIFIGIHGAGLTHLMFLPDWAAVFEIYNCEDP 443
Query: 311 WLSTVYFEKPARVLGLEYLEYK 332
++ AR+ G++Y ++
Sbjct: 444 GC----YKDLARLRGVKYFTWE 461
>gi|127459561|gb|ABO28372.1| glycosyltransferase [Amia calva]
Length = 276
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 60/120 (50%), Gaps = 4/120 (3%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
+P+ +V + + R ILN E+ A A+E +E S AD + I +V
Sbjct: 33 EPEEYIVVFSRSINRLILNEAELILALAQEFQMKTITVSLDEH-SFADIVQIISKASMLV 91
Query: 283 GVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
+HGA L SLFL G++++++ P ++ A + G L+Y+ ++ +EE+S+
Sbjct: 92 SMHGAQLVTSLFLPRGAIVVELFPYAVNPEHYTPYKTIASLPGMDLQYVAWQNTKEENSI 151
>gi|443692860|gb|ELT94365.1| hypothetical protein CAPTEDRAFT_192992 [Capitella teleta]
Length = 459
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 26/107 (24%), Positives = 54/107 (50%), Gaps = 6/107 (5%)
Query: 226 KLVLVNRNA----RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAM 281
K+VL+ R+ R ++I + E A E G ++ +F + S+ + R + +
Sbjct: 321 KIVLIKRSLKRFFRQHKSIAAMLEA--LASEHGLELFVFRDDPVPSIDLTRRMFNEAIMV 378
Query: 282 VGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
+ HGAG ++ L+ +PG+V+++ + + F+ A +LG+ Y
Sbjct: 379 IAPHGAGESNMLYAQPGTVILEGMCFTSNVRVNTLFKVVANLLGMRY 425
>gi|443728034|gb|ELU14515.1| hypothetical protein CAPTEDRAFT_192300 [Capitella teleta]
Length = 534
Score = 39.7 bits (91), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 4/109 (3%)
Query: 224 KPKLVLVNRNARVGRTILNLREVKKA----AEELGFDVTIFEPEESTSLADSFRFIHSCH 279
+ ++L+ R+A R E+ A A +G + +F + SL ++
Sbjct: 363 RDTMLLIQRSAAKTRKFKYHDEIVAALRNEARLVGLRMAVFSDDPLPSLTETVDLFSRAV 422
Query: 280 AMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
++ HGAG + LF PG+VL++ + + + F A+ LGL Y
Sbjct: 423 IVISPHGAGAANLLFAHPGTVLIEALHPSSSHYPNLCFASMAQALGLRY 471
>gi|412991363|emb|CCO16208.1| predicted protein [Bathycoccus prasinos]
Length = 782
Score = 39.7 bits (91), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/87 (25%), Positives = 43/87 (49%), Gaps = 6/87 (6%)
Query: 256 DVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTV 315
+V + + S + + + C +GVHGAG+ + ++ SV+ ++VP
Sbjct: 679 EVELLNIQASDGVEQTRASMQKCGIFIGVHGAGMINEIYTLKDSVIFEMVPDN----RPA 734
Query: 316 YFEKPARVLGLEYLEYKIK--QEESSL 340
Y+ A + GL+Y+++ I QE S+
Sbjct: 735 YYRNCAVLRGLKYVDFGISGTQETKSV 761
>gi|260809811|ref|XP_002599698.1| hypothetical protein BRAFLDRAFT_119381 [Branchiostoma floridae]
gi|229284979|gb|EEN55710.1| hypothetical protein BRAFLDRAFT_119381 [Branchiostoma floridae]
Length = 309
Score = 39.7 bits (91), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 41/77 (53%), Gaps = 5/77 (6%)
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
S + + + ++G+HGAGLTHSLFL VL ++ ++ AR+ G+
Sbjct: 182 SFPEQLKVTQNSDLLIGMHGAGLTHSLFLPDWGVLFELYNCE----DPRCYKDLARLRGV 237
Query: 327 EYLEYKIKQEESSLVEK 343
+Y ++ KQE+ + EK
Sbjct: 238 KYYTWE-KQEKITQTEK 253
>gi|58039897|ref|YP_191861.1| hypothetical protein GOX1456 [Gluconobacter oxydans 621H]
gi|58002311|gb|AAW61205.1| Hypothetical protein GOX1456 [Gluconobacter oxydans 621H]
Length = 367
Score = 39.7 bits (91), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 21/81 (25%), Positives = 44/81 (54%), Gaps = 3/81 (3%)
Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
+ + + R + R + N E+ E+LGF+ PE + SL++ R ++G+
Sbjct: 230 RRLYIERGSSANRHVPNEAELAAGLEKLGFERV--HPE-TLSLSEQIRLFSDAGMVMGML 286
Query: 286 GAGLTHSLFLRPGSVLMQVVP 306
GAG+ + + +PG+++ ++VP
Sbjct: 287 GAGMANIAWCQPGTLVYELVP 307
>gi|22797413|emb|CAD22108.1| beta 1,2-xylosyltransferase [Physcomitrella patens]
Length = 595
Score = 39.7 bits (91), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 20/76 (26%), Positives = 40/76 (52%), Gaps = 2/76 (2%)
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
+L + + + ++G HGAGL+H LF P + +V + + +L +FE ++ +G+
Sbjct: 499 ALHEQLKIVRESSIIIGAHGAGLSHLLFAMPRETV--IVELSSPFLVRPHFEAMSQWMGM 556
Query: 327 EYLEYKIKQEESSLVE 342
EY + + E+ E
Sbjct: 557 EYHKTDMAISEADCSE 572
>gi|428205790|ref|YP_007090143.1| hypothetical protein Chro_0728 [Chroococcidiopsis thermalis PCC
7203]
gi|428007711|gb|AFY86274.1| hypothetical protein Chro_0728 [Chroococcidiopsis thermalis PCC
7203]
Length = 424
Score = 39.7 bits (91), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 79/192 (41%), Gaps = 23/192 (11%)
Query: 106 YNGNFFHEFMDCFVPLFITINSHFPDQDVI---LAIADCNDQWARKYAELLPRFTRHPII 162
+N +FH +DC L ++ Q I L I +W ++ LL + I
Sbjct: 165 WNSGYFHWLIDCLTRLEGI--EYYQAQTGIKPTLIIHANPQEWQKESLRLLG-YNPDDCI 221
Query: 163 NINNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
N+ + + + + +I+PT + Q L+N ++NT+ S S
Sbjct: 222 EWNDSGVK--VKKLVVPSFRREHNIISPTACQWLR-----QRLLSN-LSDNTSISLS--- 270
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
P+ + ++R GR I+N +V LGF + E S AD R +V
Sbjct: 271 --PR-IYISRPKSAGRNIINEADVMAILAPLGF---VAYTMEELSFADEIRLFSQAEIVV 324
Query: 283 GVHGAGLTHSLF 294
HGAGLT+ +F
Sbjct: 325 APHGAGLTNIIF 336
>gi|356524325|ref|XP_003530780.1| PREDICTED: beta-(1,2)-xylosyltransferase-like [Glycine max]
Length = 501
Score = 39.7 bits (91), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 35/67 (52%), Gaps = 13/67 (19%)
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKP-----A 321
SL D + I ++G HGAGLTH + P +V+++++ + YF +P +
Sbjct: 415 SLKDQVQAIQDASVIIGAHGAGLTHIVSALPKTVILEII--------SSYFRRPHFAYIS 466
Query: 322 RVLGLEY 328
R GLEY
Sbjct: 467 RWKGLEY 473
>gi|434392192|ref|YP_007127139.1| capsular polysaccharide biosynthesis protein-like protein
[Gloeocapsa sp. PCC 7428]
gi|428264033|gb|AFZ29979.1| capsular polysaccharide biosynthesis protein-like protein
[Gloeocapsa sp. PCC 7428]
Length = 405
Score = 39.7 bits (91), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 75/192 (39%), Gaps = 22/192 (11%)
Query: 106 YNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCND-QWARKYAELLPRFTRHPIINI 164
YN N+FH +DC + L H L I D N W + +LL + I
Sbjct: 163 YNRNYFHWIVDCLLRLEGIEYYHAQTGRKPLLIIDSNPPAWKIESLKLLG-YKPEDYIYW 221
Query: 165 NNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLAN--AYNENTNTSSSFHH 222
N I + + + + +I+P + Q L+N + + SS +
Sbjct: 222 NGFGIN--IKRLVVSSFRREKSLISPVACHWLR-----QRVLSNLPKSQDKKSYSSRIYI 274
Query: 223 TKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMV 282
++PK GR ++N EV KA LGF + E+ S D +V
Sbjct: 275 SRPK--------SAGRQVINEEEVLKALTPLGF---VSYTLENMSFTDQVALFSQAEIVV 323
Query: 283 GVHGAGLTHSLF 294
HGAGLT+ +F
Sbjct: 324 AAHGAGLTNIVF 335
>gi|394337421|gb|AFN27698.1| glycosyltransferase, partial [Hypoplectrus puella]
Length = 288
Score = 39.3 bits (90), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 48/182 (26%), Positives = 72/182 (39%), Gaps = 42/182 (23%)
Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R ILN E+ A A+E V EE S + I +V +HGA L SLFL
Sbjct: 59 RLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSIVQVISGASMLVSMHGAQLITSLFLP 117
Query: 297 PGSVLMQVVPIGTQ----------------------WLST------VYFEKPARVLGLEY 328
G+V++++ P G W +T Y ++P G+ +
Sbjct: 118 RGAVVVELFPFGVNPEQYTPYKTLATLPGMDLHYVSWRNTKEENTITYPDRPWEQGGIVH 177
Query: 329 LEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDA 388
LE +++E L K L +NP+ R+Y Q+ +DI F LK+
Sbjct: 178 LEK--EEKERILASKEVPRHLCCRNPEWL-------FRIY---QDTLVDIPSFLEVLKEG 225
Query: 389 YK 390
K
Sbjct: 226 MK 227
>gi|168055999|ref|XP_001780010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668615|gb|EDQ55219.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 595
Score = 39.3 bits (90), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 19/76 (25%), Positives = 39/76 (51%), Gaps = 2/76 (2%)
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
+L + + + ++G HGAGL+H LF P + +V + + + +FE ++ +G+
Sbjct: 499 ALHEQLKIVRESSIIIGAHGAGLSHLLFAMPRETV--IVELSSPFFVRPHFEAMSQWMGM 556
Query: 327 EYLEYKIKQEESSLVE 342
EY + + E+ E
Sbjct: 557 EYHKIDMASSEADCSE 572
>gi|223996669|ref|XP_002288008.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977124|gb|EED95451.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 458
Score = 39.3 bits (90), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 24/96 (25%), Positives = 44/96 (45%), Gaps = 13/96 (13%)
Query: 258 TIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP-----IGTQWL 312
T + EE+TS + S H +VG+HGA LT + L + +++++P + W+
Sbjct: 267 TKVDVEEATSPEEQLLLHRSLHGLVGIHGASLTQGVLLPSHASILELLPWIPNYLQGSWV 326
Query: 313 ST--------VYFEKPARVLGLEYLEYKIKQEESSL 340
T + F K + L ++ Y + +E L
Sbjct: 327 QTKHRPTPLGIIFHKTGELSNLNHVGYGLDRESVPL 362
>gi|394337531|gb|AFN27753.1| glycosyltransferase, partial [Diademichthys lineatus]
Length = 285
Score = 39.3 bits (90), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 52/106 (49%), Gaps = 4/106 (3%)
Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R I+N E+ A +EL V EE S + + I +V +HGA L SLFL
Sbjct: 56 RLIVNEAELILALVQELQLKVVTVSLEEQ-SFSSIIQVISGASILVSMHGAQLITSLFLP 114
Query: 297 PGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
G+V++++ P G + A + G++ YL ++ EE+++
Sbjct: 115 RGAVVVELFPFGVNPEQYTPYRTLATLPGMDLHYLSWRNTNEENTI 160
>gi|53748600|emb|CAF02095.2| beta 1,2-xylosyltransferase [Physcomitrella patens]
Length = 595
Score = 39.3 bits (90), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 19/76 (25%), Positives = 39/76 (51%), Gaps = 2/76 (2%)
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
+L + + + ++G HGAGL+H LF P + +V + + + +FE ++ +G+
Sbjct: 499 ALHEQLKIVRESSIIIGAHGAGLSHLLFAMPRETV--IVELSSPFFVRPHFEAMSQWMGM 556
Query: 327 EYLEYKIKQEESSLVE 342
EY + + E+ E
Sbjct: 557 EYHKIDMASSEADCSE 572
>gi|123420687|ref|XP_001305813.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121887353|gb|EAX92883.1| hypothetical protein TVAG_467300 [Trichomonas vaginalis G3]
Length = 530
Score = 39.3 bits (90), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 55/291 (18%), Positives = 112/291 (38%), Gaps = 36/291 (12%)
Query: 59 KTQPYPRKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCF 118
+ P+ + +D+ A + P N + TS L++ +H D F
Sbjct: 190 RAPPFDKVTDRFAKEPIVTNITFDEIPRNGKQVIEETS--LIYGVFYNFYMLWHMIYDFF 247
Query: 119 VPL--FITINSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPIININNQTITHCFQSV 176
+P FI + F +D + W ++ L +R+ I + N+ + F+
Sbjct: 248 IPYYSFINVRHQFATRDTRRLFVRSDGFW--RFQALANWISRYEPIVLENRYKPYFFKYA 305
Query: 177 TLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH-------------T 223
+G P P P + S L YN N++T+ F T
Sbjct: 306 IIGCEK-------PEANPDPYRNNEINSILF-KYNLNSSTAPGFREDVIKYNNIHIENST 357
Query: 224 KPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPE-ESTSLADSFRFIHSCHAMV 282
KP ++ + R R + NL ++ + ++ + I E ++ + + I ++
Sbjct: 358 KPIVLFIKRGG-AKRDVKNLDKIVELSKTICTFCDIRTVELQNLDIREQIELIAPAQVLI 416
Query: 283 GVHGAGLTHSLFLRPG-----SVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
G HG+GL H +++ + L++++P ++ ++E A V + Y
Sbjct: 417 GFHGSGLAHVIWMEESRKQRQTHLIEIMP--HEYNCRDWYETAASVARVNY 465
>gi|410503822|ref|YP_006941227.1| hypothetical protein BN8_p06856 [Fibrisoma limi BUZ 3]
gi|387510265|emb|CCH57653.1| hypothetical protein BN8_p06856 [Fibrisoma limi BUZ 3]
Length = 400
Score = 39.3 bits (90), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 33/138 (23%), Positives = 67/138 (48%), Gaps = 12/138 (8%)
Query: 209 AYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSL 268
A ++ S TK + ++R GR I+N E+ + ++ GF VTI ++ S+
Sbjct: 245 ALKKHIQGRVSIGATKQSRIYISRQG--GRRIVNEDELIQMLQQFGF-VTI--EDKPRSI 299
Query: 269 ADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
+ + ++G HGA T+ ++ +PG+ L+++ ++ YF +LGL Y
Sbjct: 300 VEQVSIYANASFIIGPHGASFTNVIWCQPGTHLVEL--FSANYVPD-YFLYLCTLLGLRY 356
Query: 329 LEY----KIKQEESSLVE 342
Y +K++ +++E
Sbjct: 357 SAYCQGTDVKEDRRAILE 374
>gi|384253738|gb|EIE27212.1| hypothetical protein COCSUDRAFT_26591 [Coccomyxa subellipsoidea
C-169]
Length = 493
Score = 38.9 bits (89), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 30/116 (25%), Positives = 59/116 (50%), Gaps = 11/116 (9%)
Query: 240 ILNLREVKKAAEELGFDVTI-FEPEESTSLADSFRFIHSCHAMVGV----HGAGLTHSLF 294
++N E+ A +++G + + P +T+ A F S A GV HG L +S+F
Sbjct: 309 VVNREELMGALQDIGHTLGLAVRPYTATARA-PFESYLSVMARTGVLVSRHGPLLANSIF 367
Query: 295 LRPGSVLMQVVPIGTQW--LSTVYFEKPARVLGLEYLEYKIKQEESSLVEKYGAND 348
L PG+V+M+++P +W +S +Y + + ++ ++ + +L YG+ D
Sbjct: 368 LPPGAVVMELLPYNWEWKGISKIYVNLTRSLGDIHHIAWRATHPKWAL---YGSAD 420
>gi|400530896|gb|AFP86599.1| glycosyltransferase, partial [Antennarius striatus]
Length = 287
Score = 38.9 bits (89), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 53/106 (50%), Gaps = 4/106 (3%)
Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R ILN E+ A A+E V EE S + + I S +V +HGA L SLFL
Sbjct: 61 RLILNEAELIMALAQEFQMRVVTLSLEEQ-SFSSIVQMISSATILVSMHGAQLITSLFLP 119
Query: 297 PGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
GSV++++ P ++ A + G++ Y+ ++ E++S+
Sbjct: 120 RGSVVVELFPYAVNPEQYTPYKTLASLPGMDLHYISWRNNNEDNSI 165
>gi|440803308|gb|ELR24216.1| hypothetical protein ACA1_377010 [Acanthamoeba castellanii str.
Neff]
Length = 469
Score = 38.9 bits (89), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 79/185 (42%), Gaps = 23/185 (12%)
Query: 223 TKPKLVLVNRNARVGRTILN----LREVKKA----AEELGFDVTIFEPEESTSLADSFRF 274
T+P++VL+ R +TILN L+ ++ A A+ + + P E SL
Sbjct: 296 TRPRVVLIRRTK--DKTILNSDTLLQNLRSALRNDADVFQVSMELLTPREVVSL------ 347
Query: 275 IHSCHAMVGVHGAGLTHSLFL-RPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEYKI 333
+ ++G HG+GL H FL R S+++ P LS F A L ++Y
Sbjct: 348 MQGTTVLIGTHGSGLAHLWFLPRTSSMIVIHAPGADTLLSPKGFSDGAPWL-IDYQTIAS 406
Query: 334 KQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAK 393
K + ++ V K +++ A W Y+K NV+++ D K+A A
Sbjct: 407 KLKMNTFVHKVEKQAESIEDCGA---GRWGLYCAYVK--NVEVEADALATLTKEAINAAL 461
Query: 394 KLMDK 398
D+
Sbjct: 462 ARKDE 466
>gi|384247587|gb|EIE21073.1| hypothetical protein COCSUDRAFT_56993 [Coccomyxa subellipsoidea
C-169]
Length = 515
Score = 38.9 bits (89), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 59/131 (45%), Gaps = 12/131 (9%)
Query: 228 VLVNRNARVGRTILNLREVKKAAEE----LGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
VLV + V R LN+ ++ K E G V + LA + + S +V
Sbjct: 297 VLVEQRHGVVRNFLNMDQLLKDCGESGKLQGTSVACRSATFAGDLAWNIAAVSSADVLVT 356
Query: 284 VHGAGLTHSLFLRPGSVLMQVVP--IGT---QWLSTVYFEKPARVLG--LEYLEYKIKQE 336
+HG+G + LF+ GS L++V P GT +W + + K A G ++Y+ I E
Sbjct: 357 IHGSGSNNVLFMHEGSTLIEVRPYKFGTETPEW-ANFFIPKVAEANGYRVQYIGLNILDE 415
Query: 337 ESSLVEKYGAN 347
E S + AN
Sbjct: 416 ELSRPGGFEAN 426
>gi|449273371|gb|EMC82866.1| Putative glycosyltransferase AGO61 [Columba livia]
Length = 578
Score = 38.9 bits (89), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 57/226 (25%), Positives = 94/226 (41%), Gaps = 26/226 (11%)
Query: 97 PALVFSAGGYN-GNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAELLPR 155
P + A +N N H F D +P++ T+ F D D + A + W+ L +
Sbjct: 149 PDVALIANRFNPDNLMHVFHDDLLPIYYTMQ-QFSDLD-LEARLFFMEGWSEGVHFDLYK 206
Query: 156 F--TRHPIININNQTITH--CFQSVTLGL----ISHGRMVINPTLLPKPKTLV------D 201
+ P++ +T+ CF +GL + + P PK LV
Sbjct: 207 LLSNKQPLLREQLKTLGRLLCFTKSYVGLSKITTWYQYGFVQPQG-PKANILVSGNEIRQ 265
Query: 202 FQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA-AEELGFDVTIF 260
F F+ E N S ++ +V+ +R + R ILN E+ A A+E
Sbjct: 266 FTKFMM----EKLNVSLEESSSEEYIVVFSRT--INRLILNEAELILALAQEFQMKTITV 319
Query: 261 EPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
EE S +D R + + +V +HGA L SLFL G+ ++++ P
Sbjct: 320 SLEEH-SFSDIVRLLSNASMLVSMHGAQLVMSLFLPRGATVVELFP 364
>gi|400530878|gb|AFP86590.1| glycosyltransferase, partial [Siganus spinus]
Length = 267
Score = 38.9 bits (89), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 54/106 (50%), Gaps = 4/106 (3%)
Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R ILN E+ A A+E V E+ T L + I +V +HGA L SLFL
Sbjct: 51 RLILNEPELIMALAQEFQMKVITVSLEDQT-LPSIVQVISGATMLVSMHGAQLITSLFLP 109
Query: 297 PGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
G+V++++ P ++ A + G++ Y+ ++ K+EE+++
Sbjct: 110 RGAVVVELFPFAVNPEQYTPYKTLASLPGMDLHYISWRNKKEENTI 155
>gi|312113531|ref|YP_004011127.1| hypothetical protein Rvan_0752 [Rhodomicrobium vannielii ATCC
17100]
gi|311218660|gb|ADP70028.1| hypothetical protein Rvan_0752 [Rhodomicrobium vannielii ATCC
17100]
Length = 404
Score = 38.9 bits (89), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 49/117 (41%), Gaps = 6/117 (5%)
Query: 193 LPKPKTLVDFQSFLANAYNENTN---TSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA 249
+P P + QS L A ++ S PK + ++RN R +LN E+
Sbjct: 221 VPVPNSPRGLQSPLGLARLDDLGRFVAGGSERPDDPKRLYISRNDARLRRVLNEHELMPI 280
Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
GF+ + FR + A+V HGAGL H + +PG+ +++ P
Sbjct: 281 LRSFGFERVVLGKLPIERQVALFR---NAEAVVAPHGAGLAHIAWAKPGTKVVEFFP 334
>gi|400530814|gb|AFP86558.1| glycosyltransferase, partial [Melamphaes polylepis]
Length = 290
Score = 38.9 bits (89), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 53/108 (49%), Gaps = 4/108 (3%)
Query: 236 VGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
V R ILN E+ A A+E V EE T +A + I +V +HGA L SLF
Sbjct: 59 VTRLILNEAELIMALAQEFQMRVVTVSLEEQT-VASIVQLISGASMLVSMHGAQLVTSLF 117
Query: 295 LRPGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
L G+ ++++ P ++ A + G L+Y+ ++ EE+++
Sbjct: 118 LPRGAAVVELFPYAVNPEQYTPYKTLASLPGMDLQYVAWRNTMEENTI 165
>gi|345483345|ref|XP_001599568.2| PREDICTED: uncharacterized glycosyltransferase AER61-like [Nasonia
vitripennis]
Length = 537
Score = 38.9 bits (89), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 43/183 (23%), Positives = 84/183 (45%), Gaps = 13/183 (7%)
Query: 156 FTRHPIININN-QTITHCFQSVTLGLISHGRMVI----NPTLLPKPKTLVDFQSFLANAY 210
FT++P+ ++ + T CF++V L+ RM+ N L+ + F +F +
Sbjct: 292 FTKNPLWDLKTFRGETVCFKNVVFPLLP--RMIFGLFYNTPLIYGCEKSGLFTAFSDHVL 349
Query: 211 NENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEE-LGFDVTIFEPEESTSLA 269
+ +TK ++ L++R+ + R ILN E+ KA E+ ++V +
Sbjct: 350 HRLRIPLHVRKNTKIRVTLLSRDTQY-RRILNEDELVKALEKNPRYEVKKVVYNKHMPFK 408
Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYL 329
+ +G+HGAGLTH +FL +V ++ ++ AR+ G++YL
Sbjct: 409 KQLEITRNSDIFIGIHGAGLTHFMFLPEWAVGFELYNCE----DASCYKDLARLKGIKYL 464
Query: 330 EYK 332
++
Sbjct: 465 TWE 467
>gi|400530724|gb|AFP86513.1| glycosyltransferase, partial [Bathylagus euryops]
Length = 277
Score = 38.9 bits (89), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 70/153 (45%), Gaps = 8/153 (5%)
Query: 194 PKPKTLV---DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA- 249
PK LV + + F A++ E N + + + +V + + R ILN E+ A
Sbjct: 2 PKANILVSGNEIRQF-ASSLMEKLNITREESAEEKEEYIVVFSRSINRLILNEAELILAL 60
Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
A+E V EE S A + + +V +HGA L SLFL G+ ++++ P
Sbjct: 61 AQEFQMKVVTVSLEEQ-SFAAIVKVVSEATMLVSMHGAQLVSSLFLPRGAAVVELFPYAV 119
Query: 310 QWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
++ A + G L+Y+ ++ EE+SL
Sbjct: 120 NPEQYAPYKTLASLPGMDLQYVSWRNTIEENSL 152
>gi|397572376|gb|EJK48230.1| hypothetical protein THAOC_32991 [Thalassiosira oceanica]
Length = 696
Score = 38.9 bits (89), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 48/89 (53%), Gaps = 18/89 (20%)
Query: 271 SFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQV----VPIGTQW--LSTVYFEKPARVL 324
+ + HS +VG HG L +S+F RPG+ ++++ VP +Q+ LST L
Sbjct: 564 TVKLFHSATVVVGAHGEALVNSIFSRPGTTVIEMGFESVPQASQYRHLST--------SL 615
Query: 325 GLEYLEYKIKQEESSLVEKYGANDLVLKN 353
GL++++ + ++ S +GA + +L+
Sbjct: 616 GLKHVDINLAKDSRS----FGAAEYMLRE 640
>gi|307106684|gb|EFN54929.1| expressed protein [Chlorella variabilis]
Length = 627
Score = 38.9 bits (89), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 35/161 (21%), Positives = 74/161 (45%), Gaps = 17/161 (10%)
Query: 235 RVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
R R ++N + K E G +V + E ST + S +V VH + L ++ F
Sbjct: 436 RANRRVVNEEALLKMLAEFG-EVRVVEFNASTPFRQQLETMASTSVLVSVHTSNLANAQF 494
Query: 295 LRPGSVLMQVVPIGTQW--LSTVYFEKPARVLGLEYLEYKIK-QEESSLVE-----KYGA 346
++PGS + +++ W L + + A + + + ++ + + E+ ++ ++G
Sbjct: 495 MQPGSAVFEIIQRNWFWHGLDKSFQVQTAMMGDIHHYAWRARLRNETEYIQERDAYRFGE 554
Query: 347 NDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKD 387
+ + N + A+ +N+ +V++DID FR L D
Sbjct: 555 WEPLQCNTEECVEAH-TNV-------DVRVDIDAFRALLAD 587
>gi|400530722|gb|AFP86512.1| glycosyltransferase, partial [Argentina silus]
Length = 277
Score = 38.5 bits (88), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 70/153 (45%), Gaps = 8/153 (5%)
Query: 194 PKPKTLV---DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA- 249
PK LV + + F A++ E N + + + +V + + R ILN E+ A
Sbjct: 2 PKANILVSGNEIRQF-ASSLMEKLNITREESVEEKEEYIVVFSRSINRLILNEAELILAL 60
Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
A+E V EE S A + + +V +HGA L SLFL G+ ++++ P
Sbjct: 61 AQEFQMKVVTVSLEEQ-SFASIVKMVSEASMLVSMHGAQLVSSLFLPRGAAVVELFPYAV 119
Query: 310 QWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
++ A + G L+Y+ ++ EE+S+
Sbjct: 120 NPEQYAPYKTLASLPGMDLQYVAWRNMIEENSV 152
>gi|326922047|ref|XP_003207263.1| PREDICTED: uncharacterized glycosyltransferase AGO61-like
[Meleagris gallopavo]
Length = 577
Score = 38.5 bits (88), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 2/72 (2%)
Query: 236 VGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
+ R ILN E+ A A+E EE S +D R I + +V +HGA L SLF
Sbjct: 294 INRLILNEAELILALAQEFQMKTITVSLEEH-SFSDIVRLISNASMLVSMHGAQLVMSLF 352
Query: 295 LRPGSVLMQVVP 306
L G+ ++++ P
Sbjct: 353 LPRGATVVELFP 364
>gi|400530916|gb|AFP86609.1| glycosyltransferase, partial [Ranzania laevis]
Length = 267
Score = 38.5 bits (88), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 53/106 (50%), Gaps = 4/106 (3%)
Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R ILN E+ A A+E V EE S R I S ++ +HGA L SLFL
Sbjct: 45 RLILNEAELIMALAQEFQMRVVTVSLEEH-SFPSIVRVISSATMLISMHGAQLITSLFLP 103
Query: 297 PGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
G+V++++ P ++ A + G++ Y+ ++ +EE+++
Sbjct: 104 RGAVVVELFPFAVNPEQYTPYKTLAFLPGMDLHYISWRNNKEENTI 149
>gi|394337263|gb|AFN27619.1| glycosyltransferase, partial [Lophius americanus]
Length = 289
Score = 38.5 bits (88), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 53/106 (50%), Gaps = 4/106 (3%)
Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R ILN E+ A A+E V EE S + + I +V +HGA L SLFL
Sbjct: 60 RLILNEAELIMALAQEFEMRVVTVSLEEQ-SFSSIIQVISRASMLVSMHGAQLITSLFLP 118
Query: 297 PGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
G+V++++ P ++ A + G++ Y+ +K +EE+++
Sbjct: 119 RGAVVVELFPFAVNPEQYTPYKTLASLPGMDLHYISWKNTKEENTI 164
>gi|307208002|gb|EFN85561.1| Uncharacterized glycosyltransferase AER61 [Harpegnathos saltator]
Length = 556
Score = 38.5 bits (88), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 66/315 (20%), Positives = 130/315 (41%), Gaps = 31/315 (9%)
Query: 89 SCGVTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFP-----DQDVILAIADCND 143
C V P + N +H F D F L+ +++ + D D + I + +
Sbjct: 215 DCDVVIEKPTFIMKIDAI-VNMYHHFCD-FFNLYASLHVNLSHPSAFDTDNHIMIWE-SY 271
Query: 144 QWARKYAELLPRFTRHPIININN-QTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDF 202
+ + + FTR+P+ ++ + CF+++ L+ RM+ P
Sbjct: 272 SYRSAFQDTFDVFTRNPLWDLKTFRGKIVCFRNLVFPLLP--RMIFG-LYYNTPLIYGCE 328
Query: 203 QSFLANAYNENT--NTSSSFHHTKPKLV---LVNRNARVGRTILNLREVKKAAEEL-GFD 256
+S L A+ ++ H K + + L++R+ R R ILN E+ KA E ++
Sbjct: 329 KSGLFKAFGDHVLHRLRIPLHQRKDRRIHVTLLSRDTRY-RKILNEDELVKALRENPEYE 387
Query: 257 VTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVY 316
V ++ S + + +G+HGAGLTH +FL + + ++
Sbjct: 388 VRKVVYNKNVSFKEQLEITKNSDIYIGMHGAGLTHLMFLPDWAAVFEIYNCE----DPNC 443
Query: 317 FEKPARVLGLEYLEYKIKQEESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKL 376
++ AR+ G++Y + + S LV++ +P A A ++N +K +
Sbjct: 444 YKDLARLRGVKYFTW---ENSSKLVQQDPGT-----HPDGGAHAKFTNYSFDVKEFLRIV 495
Query: 377 DIDRFRIYLKDAYKK 391
+ R + +A+KK
Sbjct: 496 SLARNHVKHHNAFKK 510
>gi|59709491|ref|NP_001012294.1| glycosyltransferase-like domain-containing protein 2 precursor
[Gallus gallus]
gi|82076801|sp|Q5NDE8.1|GTDC2_CHICK RecName: Full=Glycosyltransferase-like domain-containing protein 2;
AltName: Full=Extracellular O-linked N-acetylglucosamine
transferase-like; Flags: Precursor
gi|56605346|emb|CAI30870.1| glycosyltransferase [Gallus gallus]
Length = 577
Score = 38.5 bits (88), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 2/72 (2%)
Query: 236 VGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
+ R ILN E+ A A+E EE S +D R I + +V +HGA L SLF
Sbjct: 294 INRLILNEAELILALAQEFQMKTITVSLEEH-SFSDIVRLISNASMLVSMHGAQLVMSLF 352
Query: 295 LRPGSVLMQVVP 306
L G+ ++++ P
Sbjct: 353 LPRGATVVELFP 364
>gi|157688958|gb|ABV65039.1| glycosyltransferase [Argentina sialis]
Length = 277
Score = 38.5 bits (88), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 70/153 (45%), Gaps = 8/153 (5%)
Query: 194 PKPKTLV---DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA- 249
PK LV + + F A++ E N + + + +V + + R ILN E+ A
Sbjct: 2 PKANILVSGNEIRQF-ASSLMEKLNITREESVEEKEEYIVVFSRSINRLILNEAELILAL 60
Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
A+E V EE S A + + +V +HGA L SLFL G+ ++++ P
Sbjct: 61 AQEFQMKVVTVSLEEQ-SFASIVKMVSEASMLVSMHGAQLVSSLFLPRGAAVVELFPYAV 119
Query: 310 QWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
++ A + G L+Y+ ++ EE+S+
Sbjct: 120 NPEQYAPYKTLASLPGMDLQYVAWRNMIEENSV 152
>gi|86147246|ref|ZP_01065561.1| Capsular polysaccharide biosynthesis protein-like [Vibrio sp.
MED222]
gi|85834961|gb|EAQ53104.1| Capsular polysaccharide biosynthesis protein-like [Vibrio sp.
MED222]
Length = 432
Score = 38.5 bits (88), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 24/92 (26%), Positives = 42/92 (45%), Gaps = 3/92 (3%)
Query: 215 NTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRF 274
NT S+ + K K + + R R + N+ ++++ GFD F S + +
Sbjct: 289 NTLSNNYILKNKKIYLQRENTKLRKLSNVLDLERLLYRKGFD---FVDPGSLDFFEQYNL 345
Query: 275 IHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVP 306
+VG GA T+ LF++PGS + + P
Sbjct: 346 FSQAEVIVGASGAAFTNLLFMKPGSTAISLYP 377
>gi|34499919|gb|AAQ73546.1| beta 1,2 xylosyltransferase [Medicago sativa]
Length = 511
Score = 38.5 bits (88), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 33/62 (53%), Gaps = 3/62 (4%)
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
S+ + R I ++G HGAGLTH + P +V+++++ + +F AR GL
Sbjct: 425 SMKEQVRAIQDASVIIGAHGAGLTHIVSALPKTVILEII---SSQFRRPHFAYIARWKGL 481
Query: 327 EY 328
EY
Sbjct: 482 EY 483
>gi|410942805|ref|ZP_11374546.1| hypothetical protein GfraN1_00115 [Gluconobacter frateurii NBRC
101659]
Length = 363
Score = 38.5 bits (88), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 48/212 (22%), Positives = 84/212 (39%), Gaps = 34/212 (16%)
Query: 100 VFSAGGYNGNFFHEFMDCFVPLFITINS----HFPDQDVILAIADCNDQWARKYAELLPR 155
V A G + + ++ ++P F + + HF D ++L + + +W ++ L
Sbjct: 117 VIVADGISFEIWGHWLIDYLPRFGVVKAVKGEHFADLKILLPHS--SPEWIYRFLALACG 174
Query: 156 FTRHPIININ-NQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNE-- 212
R II N I C +++ G F SF+ + YN
Sbjct: 175 VNRDTIIGYNPGSDIVLCREAILPSYCYTGEF--------------SFHSFVRDFYNSLK 220
Query: 213 -NTNTSSSFHHTKPKLVLVNRN--ARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLA 269
N T K + +LV+R + R N + A E G++ I PE+ SL
Sbjct: 221 PNIGTEK-----KSRKILVSRADFSHGNRPFPNRAAFESMAAERGYE--IIHPEK-LSLE 272
Query: 270 DSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVL 301
+ A++G HG+G+ +LF P +V+
Sbjct: 273 EQIEIFSDAAAVIGEHGSGMHSTLFSGPDTVV 304
>gi|400530880|gb|AFP86591.1| glycosyltransferase, partial [Siganus vulpinus]
Length = 289
Score = 38.5 bits (88), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 54/106 (50%), Gaps = 4/106 (3%)
Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R ILN E+ A A+E V E+ SL + I +V +HGA L SLFL
Sbjct: 60 RLILNEPELIMALAQEFQMKVITVSLEDQ-SLPSIVQVISGATMLVSMHGAQLITSLFLP 118
Query: 297 PGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
G+V++++ P ++ A + G++ Y+ ++ K+EE+++
Sbjct: 119 RGAVVVELFPFAVNPEQYTPYKTLASLPGMDLHYISWRNKKEENTI 164
>gi|224045437|ref|XP_002198311.1| PREDICTED: glycosyltransferase-like domain-containing protein 2
isoform 2 [Taeniopygia guttata]
gi|224045439|ref|XP_002198300.1| PREDICTED: glycosyltransferase-like domain-containing protein 2
isoform 1 [Taeniopygia guttata]
gi|224045441|ref|XP_002198318.1| PREDICTED: glycosyltransferase-like domain-containing protein 2
isoform 3 [Taeniopygia guttata]
gi|449492799|ref|XP_004175421.1| PREDICTED: glycosyltransferase-like domain-containing protein 2
[Taeniopygia guttata]
gi|449492803|ref|XP_004175422.1| PREDICTED: glycosyltransferase-like domain-containing protein 2
[Taeniopygia guttata]
Length = 578
Score = 38.5 bits (88), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 2/72 (2%)
Query: 236 VGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
+ R ILN E+ A A+E TI E S +D R I + +V +HGA L SLF
Sbjct: 294 INRLILNEAELILALAQEFQMK-TISVSLEEHSFSDIVRLISNASMLVSMHGAQLVMSLF 352
Query: 295 LRPGSVLMQVVP 306
L G+ ++++ P
Sbjct: 353 LPRGATVVELFP 364
>gi|222147143|ref|YP_002548100.1| hypothetical protein Avi_0161 [Agrobacterium vitis S4]
gi|221734133|gb|ACM35096.1| conserved hypothetical protein [Agrobacterium vitis S4]
Length = 349
Score = 38.5 bits (88), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 51/237 (21%), Positives = 95/237 (40%), Gaps = 23/237 (9%)
Query: 92 VTHTSPALVFSAGGYNGNFFHEFMDCFVPLFITINSHFPDQDVILAIADCNDQWARKYAE 151
++ SP ++ G+ N++H + L I H D +++LA+ N W R A
Sbjct: 95 ISDISPLII--GNGWYKNYYHWTLQAIGCLLIY-KKHRDDGNILLAMPRLNG-W-RNEAL 149
Query: 152 LLPRFTRHPIININNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYN 211
L F ++ ++ H +T L G LP P + +F++ A
Sbjct: 150 SLTGFNGQ-LLEMDYHHFLHVDDCITSNLTGGGF-----AHLPHPAVMAEFEALAARVTI 203
Query: 212 ENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADS 271
E + K+ + +A R ++N + V E GF+ I P E S+ +
Sbjct: 204 ERRFSR--------KIYVSRLDAGDVRRVVNEKAVCTLLESRGFE--IITPGE-LSVKEQ 252
Query: 272 FRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
+V HGAGL + ++ +PG + +V+ + + + + GL+Y
Sbjct: 253 VVAFRDAEVIVAPHGAGLANLVYCQPG-IKTRVIELFQASCINACYARVCQAKGLDY 308
>gi|385234654|ref|YP_005795996.1| hypothetical protein KVU_2161 [Ketogulonicigenium vulgare WSH-001]
gi|343463565|gb|AEM42000.1| hypothetical protein KVU_2161 [Ketogulonicigenium vulgare WSH-001]
Length = 565
Score = 38.5 bits (88), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 28/106 (26%), Positives = 48/106 (45%), Gaps = 3/106 (2%)
Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
K + V R + R I+ E+ E LGF FE + T L + + + M+ H
Sbjct: 298 KRIYVGRKSGKSRNIVGEDELIARLETLGFKTLYFE--DYTPLEQA-QIMQGVDIMISGH 354
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY 331
GAG T+ L+ + G+ L+++ + T F A V G ++ +
Sbjct: 355 GAGFTNMLYAKKGATLIEISHLQTARHRFGDFHMHAGVSGARHIHF 400
>gi|310817073|ref|YP_003965037.1| hypothetical protein EIO_2660 [Ketogulonicigenium vulgare Y25]
gi|308755808|gb|ADO43737.1| conserved hypothetical protein [Ketogulonicigenium vulgare Y25]
Length = 566
Score = 38.1 bits (87), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 28/106 (26%), Positives = 48/106 (45%), Gaps = 3/106 (2%)
Query: 226 KLVLVNRNARVGRTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
K + V R + R I+ E+ E LGF FE + T L + + + M+ H
Sbjct: 299 KRIYVGRKSGKSRNIVGEDELIARLETLGFKTLYFE--DYTPLEQA-QIMQGVDIMISGH 355
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEYLEY 331
GAG T+ L+ + G+ L+++ + T F A V G ++ +
Sbjct: 356 GAGFTNMLYAKKGATLIEISHLQTARHRFGDFHMHAGVSGARHIHF 401
>gi|345653323|gb|AEO15506.1| glycosyltransferase [Chionobathyscus dewitti]
Length = 286
Score = 38.1 bits (87), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 32/117 (27%), Positives = 59/117 (50%), Gaps = 6/117 (5%)
Query: 227 LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
+VL +R+ R ILN E+ A A+E V EE S + + I ++ +H
Sbjct: 48 IVLFSRST--TRLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSTIQVISGASMLISMH 104
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
GA L SLFL G+V++++ P ++ A + G++ Y+ ++ +EE+++
Sbjct: 105 GAQLITSLFLPRGAVVVELFPFAVNPEQYTPYKTLATLPGMDLHYVSWRNTKEENTI 161
>gi|357459889|ref|XP_003600225.1| Beta 1 2 xylosyltransferase [Medicago truncatula]
gi|355489273|gb|AES70476.1| Beta 1 2 xylosyltransferase [Medicago truncatula]
Length = 509
Score = 38.1 bits (87), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 33/62 (53%), Gaps = 3/62 (4%)
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
S+ + R I ++G HGAGLTH + P +V+++++ + +F AR GL
Sbjct: 423 SMKEQVRAIQDASVIIGAHGAGLTHIVSALPKTVILEII---SSQFRRPHFAYIARWKGL 479
Query: 327 EY 328
EY
Sbjct: 480 EY 481
>gi|345653375|gb|AEO15532.1| glycosyltransferase [Bathydraco marri]
Length = 284
Score = 38.1 bits (87), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 63/129 (48%), Gaps = 10/129 (7%)
Query: 219 SFHHTKPK----LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFR 273
SF K K +VL +R+ R ILN E+ A A+E V EE S + +
Sbjct: 34 SFEDEKGKRDEYIVLFSRST--TRLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSTVQ 90
Query: 274 FIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEY 331
I ++ +HGA L SLFL G+V++++ P ++ A + G++ Y+ +
Sbjct: 91 VISGASMLISMHGAQLITSLFLPRGAVVVELFPFAVNPEQYTPYKTLATLPGMDLHYVSW 150
Query: 332 KIKQEESSL 340
+ +EE+++
Sbjct: 151 RNTKEENTI 159
>gi|345653309|gb|AEO15499.1| glycosyltransferase [Chaenodraco wilsoni]
Length = 286
Score = 38.1 bits (87), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 63/131 (48%), Gaps = 10/131 (7%)
Query: 217 SSSFHHTKPK----LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADS 271
SF K K +VL +R+ R ILN E+ A A+E V EE S +
Sbjct: 34 GGSFEDEKGKRDEYIVLFSRST--TRLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPST 90
Query: 272 FRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YL 329
+ I ++ +HGA L SLFL G+V++++ P ++ A + G++ Y+
Sbjct: 91 VQVISGASMLISMHGAQLITSLFLPRGAVVVELFPFAVNPEQYTPYKTLATLPGMDLHYV 150
Query: 330 EYKIKQEESSL 340
++ +EE+++
Sbjct: 151 SWRNTKEENTI 161
>gi|123421413|ref|XP_001305986.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121887536|gb|EAX93056.1| hypothetical protein TVAG_220310 [Trichomonas vaginalis G3]
Length = 553
Score = 38.1 bits (87), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 57/269 (21%), Positives = 94/269 (34%), Gaps = 49/269 (18%)
Query: 54 PSLRIKTQPYPRKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYN-GNFFH 112
P L + +QP P + + + + + P L G F G N G +H
Sbjct: 211 PFLSLSSQPIPNDVVDTRLDN--QPVVVKTNP--LGKGGNKIQGISYFYCRGVNLGMLWH 266
Query: 113 EFMDCFVPLFITI----------NSHFPDQDVILAIADCNDQWARKYAELLPRFTRHPII 162
D PL+ TI + HF D+ + + L FT++PI
Sbjct: 267 NIFDNVAPLYHTIQEVEGSVTGKDRHFIVTDIYVV---------EVFVLFLKMFTKYPIH 317
Query: 163 NINNQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQSFLANAYNENTNTSSSFHH 222
NI + + F LGL + P T DF +Y+ + F
Sbjct: 318 NIQLEKMDIQFDICVLGL---RKFNTRPLPFRNEFTTFDF------SYDPKDISIPGFRE 368
Query: 223 T---------------KPKLVLVNRNARVGRTILNLREVKKAAEE-LGFDVTIFEPEEST 266
T KP +++ R R I+N+ V K E F ++ +S
Sbjct: 369 TILKELRFTIPIPDPKKPLVIIPLRKNNNTRFIVNMNSVIKEMERRCKFCEFLYLNLDSL 428
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFL 295
++ I + G+HG+GL H +++
Sbjct: 429 TIEWQIELISHATVLAGIHGSGLAHQIWM 457
>gi|400530870|gb|AFP86586.1| glycosyltransferase, partial [Acanthurus nigricans]
Length = 290
Score = 38.1 bits (87), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 47/179 (26%), Positives = 75/179 (41%), Gaps = 26/179 (14%)
Query: 238 RTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLR 296
R ILN E+ A A+E V EE S + I S +V +HGA L SLFL
Sbjct: 61 RLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSIVQVISSATMLVSMHGAQLITSLFLP 119
Query: 297 PGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSLV------EKYGAND 348
G+V++++ P ++ A + G L Y+ ++ +EE+++ E+ G
Sbjct: 120 RGAVVVELFPFAVNPEQYTPYKTLASLPGIDLHYISWRNTKEENTVTHPDRPWEQGGITH 179
Query: 349 LVLKNPQAFAGA------------NWSNMRVYLKTQNVKLDIDRFRIYLKDAYKKAKKL 395
L + + G+ W R+Y Q+ +DI F LK+ K L
Sbjct: 180 LEKEEQERILGSKDVPRHLCCRNPEWL-FRIY---QDTLVDIPSFLEVLKEGMKTKPSL 234
>gi|345653315|gb|AEO15502.1| glycosyltransferase [Champsocephalus gunnari]
Length = 286
Score = 38.1 bits (87), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 32/117 (27%), Positives = 59/117 (50%), Gaps = 6/117 (5%)
Query: 227 LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
+VL +R+ R ILN E+ A A+E V EE S + + I ++ +H
Sbjct: 48 IVLFSRST--TRLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSTIQVISGASMLISMH 104
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
GA L SLFL G+V++++ P ++ A + G++ Y+ ++ +EE+++
Sbjct: 105 GAQLITSLFLPRGAVVVELFPFAVNPEQYTPYKTLATLPGMDLHYVSWRNTKEENTI 161
>gi|400530902|gb|AFP86602.1| glycosyltransferase, partial [Himantolophus sagamius]
Length = 276
Score = 38.1 bits (87), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 54/120 (45%), Gaps = 7/120 (5%)
Query: 194 PKPKTLV---DFQSFLANAYNENTNTSSSFHHTKPKLVLVNRNARVGRTILNLREVKKA- 249
PK LV + + F A A E N + K + ++V + R ILN E+ A
Sbjct: 2 PKANMLVSGNEIRQF-AKALMEKMNITGVEEVEKDEYIVVFSRS-TTRLILNEAELIMAL 59
Query: 250 AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGT 309
A+E V EE S + I S +V +HGA L SLFL G+V++++ P
Sbjct: 60 AQEFQMRVVTVSLEEQ-SFPSIIQVISSATMLVSMHGAQLITSLFLPRGAVVVELFPFAV 118
>gi|61845107|emb|CAI70375.1| beta 1,2 xylosyltransferase [Populus tremula x Populus alba]
Length = 468
Score = 38.1 bits (87), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 17/62 (27%), Positives = 34/62 (54%), Gaps = 3/62 (4%)
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
S+ + R + ++G HGAGLTH + P +++++++ + + +F+ A GL
Sbjct: 382 SMKEQVRAVQDASVIIGAHGAGLTHIVSATPKTIILEII---SSYFRRPHFQLIAEWKGL 438
Query: 327 EY 328
EY
Sbjct: 439 EY 440
>gi|345653327|gb|AEO15508.1| glycosyltransferase [Chionodraco hamatus]
Length = 286
Score = 38.1 bits (87), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 32/117 (27%), Positives = 59/117 (50%), Gaps = 6/117 (5%)
Query: 227 LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
+VL +R+ R ILN E+ A A+E V EE S + + I ++ +H
Sbjct: 48 IVLFSRST--TRLILNEAELIMALAQEFQMRVVTVSLEEQ-SFPSTIQVISGASMLISMH 104
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLE--YLEYKIKQEESSL 340
GA L SLFL G+V++++ P ++ A + G++ Y+ ++ +EE+++
Sbjct: 105 GAQLITSLFLPRGAVVVELFPFAVNPEQYTPYKTLATLPGMDLHYVSWRNTKEENTI 161
>gi|296533944|ref|ZP_06896465.1| tetratricopeptide repeat protein [Roseomonas cervicalis ATCC 49957]
gi|296265725|gb|EFH11829.1| tetratricopeptide repeat protein [Roseomonas cervicalis ATCC 49957]
Length = 380
Score = 38.1 bits (87), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 34/67 (50%), Gaps = 3/67 (4%)
Query: 238 RTILNLREVKKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRP 297
R +LN EV + LGF+ E SL++ S + GV+G GL ++LF R
Sbjct: 261 RNLLNEAEVAALLDRLGFESVALE---DFSLSEQVALFRSAECVAGVYGGGLFNTLFCRA 317
Query: 298 GSVLMQV 304
G+ ++ +
Sbjct: 318 GTRVLSL 324
>gi|392576608|gb|EIW69738.1| hypothetical protein TREMEDRAFT_73612 [Tremella mesenterica DSM
1558]
Length = 465
Score = 38.1 bits (87), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 47/95 (49%), Gaps = 8/95 (8%)
Query: 223 TKPKLVLVNRNARVGRTILN-----LREVKKAAEELGFDVTIFEPEESTSLADSFRFIHS 277
++P + ++R + GR + N L E + + G +E E TS+ + + I
Sbjct: 305 SRPVITYLSRQS-TGRRLANASHDALVEELETLSKSGRAEVRYEVFEDTSIPEQWARIAR 363
Query: 278 CHAMVGVHGAGLTHSLFL--RPGSVLMQVVPIGTQ 310
++GVHG GLTH +FL PGS + ++ P Q
Sbjct: 364 TTILLGVHGNGLTHIIFLPINPGSAVYELQPENCQ 398
>gi|224061108|ref|XP_002300350.1| predicted protein [Populus trichocarpa]
gi|222847608|gb|EEE85155.1| predicted protein [Populus trichocarpa]
Length = 508
Score = 38.1 bits (87), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 17/62 (27%), Positives = 34/62 (54%), Gaps = 3/62 (4%)
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
S+ + R + ++G HGAGLTH + P +++++++ + + +F+ A GL
Sbjct: 422 SMKEQVRAVQDASVIIGAHGAGLTHIVSATPKTIILEII---SSYFRRPHFQLIAEWKGL 478
Query: 327 EY 328
EY
Sbjct: 479 EY 480
>gi|400530728|gb|AFP86515.1| glycosyltransferase, partial [Macropinna microstoma]
Length = 271
Score = 38.1 bits (87), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 52/108 (48%), Gaps = 4/108 (3%)
Query: 236 VGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLF 294
+ R ILN E+ A A+E V EE S A + + +V +HGA L SLF
Sbjct: 40 ISRLILNEAELILALAQEFQMKVVTVSLEEQ-SFASIVKMVSEASMLVSMHGAQLVSSLF 98
Query: 295 LRPGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
L G+ ++++ P ++ A + G L+Y+ ++ EE+S+
Sbjct: 99 LPRGAAVVELFPYAVNPEQYAPYKTLASLPGMDLQYVAWRNMIEENSV 146
>gi|400530758|gb|AFP86530.1| glycosyltransferase, partial [Coregonus clupeaformis]
Length = 282
Score = 37.7 bits (86), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 58/235 (24%), Positives = 89/235 (37%), Gaps = 53/235 (22%)
Query: 194 PKPKTLV------DFQSFLANAYNENTN---TSSSFHHTKPKLVLVNRNARVGRTILNLR 244
PK LV F S L N NT S+ +V+ +R+ + R ILN
Sbjct: 2 PKANILVSGNEIRQFASSLMEKLNINTQGGGEESAAEEKDEYIVVFSRS--INRLILNEA 59
Query: 245 EVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQ 303
E+ A A+E EE T + + I +V +HGA L SLFL G+V+++
Sbjct: 60 ELILALAQEFQMRAVTVSLEEQT-FPNIVKVISGASMLVSMHGAQLVSSLFLPRGAVVVE 118
Query: 304 VVPIGTQ----------------------WLSTV------YFEKPARVLGLEYLEYKIKQ 335
+ P W + V Y E+P G+ +LE ++
Sbjct: 119 LFPYAVNPEQYTPYKTLASLPGMDLQYVAWXNMVEENSVAYPERPWEQGGIAHLEK--EE 176
Query: 336 EESSLVEKYGANDLVLKNPQAFAGANWSNMRVYLKTQNVKLDIDRFRIYLKDAYK 390
+E L K L +NP+ R+Y Q+ +DI L++ K
Sbjct: 177 QEHILASKEVPRHLCCRNPEWL-------YRIY---QDTIVDIPSLLEALRETVK 221
>gi|321478050|gb|EFX89008.1| hypothetical protein DAPPUDRAFT_304713 [Daphnia pulex]
Length = 537
Score = 37.7 bits (86), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 47/105 (44%), Gaps = 7/105 (6%)
Query: 226 KLVLVNRNARVGRTILNLREV--KKAAEELGFDVTIFEPEESTSLADSFRFIHSCHAMVG 283
K+ L++RN + R ILN + + K + + + E T I +G
Sbjct: 367 KITLLSRNTQY-RRILNEKALLDKLRSSPRQYTIQRVEFTHETDFRQQLNIIQDTDIFIG 425
Query: 284 VHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGLEY 328
+HGAGLTH LFL + + ++ G + + AR+ G+ Y
Sbjct: 426 MHGAGLTHLLFLPDWASVFELYNCGDEHC----YADLARLRGVHY 466
>gi|154413981|ref|XP_001580019.1| TKL family protein kinase [Trichomonas vaginalis G3]
gi|121914232|gb|EAY19033.1| TKL family protein kinase [Trichomonas vaginalis G3]
Length = 881
Score = 37.7 bits (86), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 73/160 (45%), Gaps = 22/160 (13%)
Query: 61 QPYPRKSDKSAMSKVKELTITTSAPPNLSCGVTHTSPALVFSAGGYNGNFFHEFMDCFV- 119
+P P+ S VK + S P N + T T P V Y+ F+ F++ +
Sbjct: 368 KPSPKPSQAPQTHAVKPSVL--SGPQNTADNTT-TDPVSVLK--NYHNVNFNAFVEGYAK 422
Query: 120 -------PLFITINSHFPDQ---DVILAIA-DCNDQWARKYAELLPRFTRHP---IININ 165
P F T+ +F D +VI IA +CN R + + +F P I IN
Sbjct: 423 TIGIEHFPFFSTVFQNFNDSTPPEVIRFIAKNCNSVMERN-PDFIYQFGGSPYYTTIPIN 481
Query: 166 -NQTITHCFQSVTLGLISHGRMVINPTLLPKPKTLVDFQS 204
N+++ +S + +S ++ I+P L+P+ K L+DF++
Sbjct: 482 SNESVDEIIESFRILFVSAPQIAIHPNLIPQIKKLLDFRA 521
>gi|83715787|emb|CAJ47424.1| Beta-1,2-xylosyltransferase [Vitis vinifera]
Length = 470
Score = 37.7 bits (86), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 3/62 (4%)
Query: 267 SLADSFRFIHSCHAMVGVHGAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLGL 326
S+ + R I ++G HGAGLTH + RP +V+++++ + +F ++ GL
Sbjct: 384 SMKEQVRAIQDASVIIGAHGAGLTHIISARPKTVILEII---SSLYRRPHFALISQWKGL 440
Query: 327 EY 328
EY
Sbjct: 441 EY 442
>gi|400530816|gb|AFP86559.1| glycosyltransferase, partial [Poromitra crassiceps]
Length = 290
Score = 37.7 bits (86), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 6/117 (5%)
Query: 227 LVLVNRNARVGRTILNLREVKKA-AEELGFDVTIFEPEESTSLADSFRFIHSCHAMVGVH 285
+V+ +R+A R ILN E+ A A+E V EE T A + I +V +H
Sbjct: 52 IVVFSRSA--TRLILNEAELIMALAQEFQMRVVTVSLEEQT-FASIVQVISGASMLVSMH 108
Query: 286 GAGLTHSLFLRPGSVLMQVVPIGTQWLSTVYFEKPARVLG--LEYLEYKIKQEESSL 340
GA L SLFL G+ ++++ P ++ A + G L+Y+ ++ EE+++
Sbjct: 109 GAQLVTSLFLPRGAAVVELFPYAVNPEQYTPYKTLASLPGMDLQYVSWRNTMEENTI 165
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.134 0.400
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,420,428,658
Number of Sequences: 23463169
Number of extensions: 270739144
Number of successful extensions: 638064
Number of sequences better than 100.0: 747
Number of HSP's better than 100.0 without gapping: 393
Number of HSP's successfully gapped in prelim test: 354
Number of HSP's that attempted gapping in prelim test: 636119
Number of HSP's gapped (non-prelim): 823
length of query: 404
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 259
effective length of database: 8,957,035,862
effective search space: 2319872288258
effective search space used: 2319872288258
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)