BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 007079
(619 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225432848|ref|XP_002279944.1| PREDICTED: uncharacterized protein LOC100256346 [Vitis vinifera]
Length = 574
Score = 722 bits (1863), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/605 (63%), Positives = 452/605 (74%), Gaps = 35/605 (5%)
Query: 1 MASQLSHYPRATGHRANPPLIFTTRRTTPQQINFWSRRTGAKVG--VSNSEGGGSYLDMW 58
MAS L RAT A P + R + RR+G + + S+GG SYLDMW
Sbjct: 1 MASHLGASLRATARSAVP---VSHRHKHRVAVTVLVRRSGGRGASRIRVSDGGDSYLDMW 57
Query: 59 QKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDVSKEERD 118
+KAVD++RK +EFQ+IAG+ G+ DG + E LE+KS EF KIL+VSKEERD
Sbjct: 58 KKAVDQERKGMEFQRIAGN--SGGEDDG-------ESAEALERKSGEFMKILEVSKEERD 108
Query: 119 RIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESS-GTAEVSRFVKKNSESSGAAEISP 177
++QR+QVIDRAAAAIAAARAIL+E K+GE G + V E SG+ +
Sbjct: 109 KVQRIQVIDRAAAAIAAARAILQES-----KSGEQELGYSRV--------EGSGSETMHD 155
Query: 178 FVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDMRDVRD 237
+NS VP G + +FVP+S T N TP GPDFWSW+PP D + D +
Sbjct: 156 VFQNSV---IFIVP--GTQNGILFVPQSRTSVNSTP--GPDFWSWTPPMDSEGKSDDAGN 208
Query: 238 LQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSE 297
LQ A SS Y TP ++EK +SVD L IPFES+ SE +P LPP QSL V +VS
Sbjct: 209 LQTARTSSPYLTPAESLMEKEQSVDFLSIPFESRFSESSHNPPLPPLQSLTEVGTVDVSS 268
Query: 298 TNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGV 357
++LE PSL++E +LG LF HAAEA HALD+VD + G++PDGSRWW+ETGIEQRPDGV
Sbjct: 269 SSLEMPSLKKEDELGVLFLGHAAEAVHALDEVDGALSHGVSPDGSRWWRETGIEQRPDGV 328
Query: 358 VCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ 417
VCRWT+ RGVSAD +EW+EKFWEAAD+ +KELGSEKSGRDATGNVWRE+W ESMWQ+
Sbjct: 329 VCRWTLIRGVSADHVVEWEEKFWEAADKFQYKELGSEKSGRDATGNVWREYWKESMWQDC 388
Query: 418 GLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWH 477
GL+H+EKTADKWGKNG GDEW EKWWE YDASGKA+KWAHKWCSIDPNTQL+AGHAHVWH
Sbjct: 389 GLMHMEKTADKWGKNGKGDEWHEKWWEQYDASGKADKWAHKWCSIDPNTQLEAGHAHVWH 448
Query: 478 ERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGE 537
ERWGE+YDGHGGSMKYTDKWAERCEGD W+KWGDKWDENFDPNSHGVKQGETWW GK+GE
Sbjct: 449 ERWGERYDGHGGSMKYTDKWAERCEGDAWTKWGDKWDENFDPNSHGVKQGETWWEGKHGE 508
Query: 538 RWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
RWNRTWGE HNGSGWVHKYGKSSSGE WDTHE+Q+TWYERFPH+GFYHCF+NSVQLREV+
Sbjct: 509 RWNRTWGEGHNGSGWVHKYGKSSSGEHWDTHEEQDTWYERFPHYGFYHCFENSVQLREVQ 568
Query: 598 KPSEF 602
P +
Sbjct: 569 TPPQL 573
>gi|84468392|dbj|BAE71279.1| hypothetical protein [Trifolium pratense]
Length = 563
Score = 693 bits (1789), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/556 (64%), Positives = 417/556 (75%), Gaps = 36/556 (6%)
Query: 49 EGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSK 108
+ SYLDMW+KA++R+R F K+A S ++ E LEKK+EEF K
Sbjct: 43 QASSSYLDMWKKAIERERNTTNFNKLASS-------------NDNNVEENLEKKTEEFQK 89
Query: 109 ILDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSE 168
+L VS EERDRIQRLQVIDRA+AAIAAARA+L++ N + V++ + S ++KN
Sbjct: 90 LLQVSSEERDRIQRLQVIDRASAAIAAARALLKDANSNSVRSDKDS--------LQKNES 141
Query: 169 SSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDD 228
SG S FV+ E G + +FVP+SGT + PGPDFWSW+PP D
Sbjct: 142 DSGKKNDSIFVQ-----------ESGTQNGTLFVPKSGT--QKDGIPGPDFWSWTPPADS 188
Query: 229 DRDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLL 288
D D L++ KSSV PT NPVVEK RS L IPFES L++ K P LPP QS L
Sbjct: 189 DVPPNDANGLKLNPKSSVNPTLSNPVVEKERSSQSLSIPFESLLTQSKTFPTLPPLQSSL 248
Query: 289 GVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKET 348
V E S +N+E+PSLEEE G L S HAAE AL+ + + G+N DG+RWW+ET
Sbjct: 249 EVV--EASASNVESPSLEEELKRGVLSSDHAAEVVRALETDSKSSPVGVNVDGTRWWRET 306
Query: 349 GIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREF 408
GIEQRPDGV+CRWT+ RGVSAD+ALEWQEKFWEA+DE G+KELGSEKSGRDATGNVWREF
Sbjct: 307 GIEQRPDGVICRWTLIRGVSADKALEWQEKFWEASDEFGYKELGSEKSGRDATGNVWREF 366
Query: 409 WTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQL 468
W ESM Q GL+H+EKTADKWG+NG GDEWQEKW+EHY+ASG+AEKWAHKWCSIDPNT L
Sbjct: 367 WRESMRQENGLMHMEKTADKWGRNGQGDEWQEKWFEHYNASGQAEKWAHKWCSIDPNTPL 426
Query: 469 DAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGE 528
DAGHAHVWHERWGE YDG+GGS+KYTDKWAER GW KWGDKWDENFDPNSHG+KQGE
Sbjct: 427 DAGHAHVWHERWGETYDGYGGSIKYTDKWAERSSDGGWEKWGDKWDENFDPNSHGIKQGE 486
Query: 529 TWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFD 588
TWW GKYGERWNRTWGE+HNGSGWVHKYGKSSSGE WDTHE Q+TWYERFPHFGF+HCF+
Sbjct: 487 TWWEGKYGERWNRTWGEQHNGSGWVHKYGKSSSGEHWDTHEPQDTWYERFPHFGFFHCFE 546
Query: 589 NSVQLREVRKPSEFQE 604
NSVQLREV+KPSE QE
Sbjct: 547 NSVQLREVKKPSERQE 562
>gi|449465699|ref|XP_004150565.1| PREDICTED: uncharacterized protein LOC101218256 [Cucumis sativus]
Length = 579
Score = 680 bits (1754), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/603 (61%), Positives = 429/603 (71%), Gaps = 30/603 (4%)
Query: 1 MASQLSHYPRATGHRANPPLIFTTRRTTPQQINFWSRRT--GAKVGVSNSEGGGSYLDMW 58
M +L PR T H P L P Q + R + + S+ G SYL MW
Sbjct: 2 MPLRLPLSPRPTLHHHFPRLYHHNFLLLPLQPHIQIRHATPARTLRIRASDEGESYLGMW 61
Query: 59 QKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDVSKEERD 118
+ AV+R RK +EFQK+ + G+ D N G D QLEKKSEEFSKIL V EERD
Sbjct: 62 KNAVERQRKAVEFQKVVENT--EGNDDRNAGDPSSD---QLEKKSEEFSKILQVPPEERD 116
Query: 119 RIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGAAEISPF 178
RIQR+QVI RAAAAIAAARA++ E V + ++ V NS
Sbjct: 117 RIQRMQVIHRAAAAIAAARALVGETGTLAVGDSDTC--------VNLNS----------- 157
Query: 179 VKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDMRDVRDL 238
N E E S +P T + TP GPDFWSW+PP DDD + +L
Sbjct: 158 -TNDEGLLDREEALSEFQSENALLPEFETSQSWTP--GPDFWSWTPPPDDDGNDNAFGEL 214
Query: 239 QMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSET 298
Q KS YP N V EK R +D L IPF+S++SE +PLLPPFQSL+G+EK E SET
Sbjct: 215 QPLGKSQAYPKLSNFVEEKERPIDFLSIPFQSEISE-SVNPLLPPFQSLVGMEKLESSET 273
Query: 299 NLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVV 358
+ ET SLEE+ ++G FS HAAEA+ AL VD+ +T+GI+PDGSRWWKETGIEQRPDGV+
Sbjct: 274 STETHSLEEDENVGIEFSVHAAEASQALSSVDKESTKGIDPDGSRWWKETGIEQRPDGVI 333
Query: 359 CRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQG 418
C+WT+TRGVSAD A EWQ K+WEAADE G+KELGSEKSGRDA GNVWRE+W ESM Q QG
Sbjct: 334 CKWTLTRGVSADLATEWQNKYWEAADEFGYKELGSEKSGRDAYGNVWREYWRESMRQEQG 393
Query: 419 LVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHE 478
LVHLEKTADKWG NG+G EWQEKWWE+Y+ SG+AEK AHKWC IDPNT +D GHAH+W+E
Sbjct: 394 LVHLEKTADKWGINGSGTEWQEKWWEYYNTSGQAEKNAHKWCKIDPNTYVDPGHAHIWNE 453
Query: 479 RWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGER 538
RWGEKYDG GGS+KYTDKWAERCEGDGW+KWGDKWDENFDPN HG+KQGETWW G++GER
Sbjct: 454 RWGEKYDGQGGSIKYTDKWAERCEGDGWTKWGDKWDENFDPNGHGIKQGETWWEGRHGER 513
Query: 539 WNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRK 598
WNRTWGE HNGSGWVHKYGKSSSGE WDTH QQETWYERFPHFGFYHCF+NSVQLREV+K
Sbjct: 514 WNRTWGEGHNGSGWVHKYGKSSSGEHWDTHAQQETWYERFPHFGFYHCFNNSVQLREVQK 573
Query: 599 PSE 601
PSE
Sbjct: 574 PSE 576
>gi|255552009|ref|XP_002517049.1| conserved hypothetical protein [Ricinus communis]
gi|223543684|gb|EEF45212.1| conserved hypothetical protein [Ricinus communis]
Length = 566
Score = 666 bits (1719), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/615 (60%), Positives = 430/615 (69%), Gaps = 61/615 (9%)
Query: 1 MASQLSHYPRATGHRANPPLIFTTRRTTPQQINFW-------SRRTGAKVGVSNSEGGGS 53
M S LS + RAT PL +TTPQQ+ +T + ++ G S
Sbjct: 1 MTSNLSSFSRAT------PLF---PKTTPQQLFLPFEQPVLPPNKTSSYRAKASINNGES 51
Query: 54 YLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDVS 113
YLDMW+ AVDRD+K +EFQK+A ++ D + RD + +K+++F KI+D S
Sbjct: 52 YLDMWKSAVDRDKKSVEFQKLAERFSQI-DNTSDSTSVKRD---DVNRKTKDFKKIVDFS 107
Query: 114 KEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGAA 173
K+ERDRIQR+QV+DRAAAAIAAARAIL+E+ G+
Sbjct: 108 KDERDRIQRMQVVDRAAAAIAAARAILKERRSENANVGD--------------------- 146
Query: 174 EISPFVKNSESNGTAEVPE-RGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDM 232
NG E G + IFV RS T GN PGPDFW+W+PP D+ R
Sbjct: 147 -----------NGNLETESGEGTTNESIFVSRSETSGN--GVPGPDFWTWTPPPDN-RTQ 192
Query: 233 RDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEK 292
D +L A+KSS P V K RS+ L IP +SKLS +P LPP QSL+ V+K
Sbjct: 193 YDF-ELMEAQKSSASPISTRNVAMKERSLSYLDIPLQSKLSPSDLNPPLPPLQSLMEVKK 251
Query: 293 EEVSETNLETPSL-EEERDLGALFSAHAAEAAHAL--DKVDELATRGINPDGSRWWKETG 349
EE SE E PSL EEER+L F+AHA EA + L +K DE A+ G+ DGSRWWKE G
Sbjct: 252 EEDSEFRPEMPSLKEEERELDLEFTAHAIEAGYVLATEKEDE-ASSGMELDGSRWWKEKG 310
Query: 350 IEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFW 409
IE+RPDGV+CRWTM RGVS DE +EWQEKFWEA DE G+KELGSEKSGRDATGNVWRE+W
Sbjct: 311 IERRPDGVICRWTMIRGVSVDEDVEWQEKFWEATDEFGYKELGSEKSGRDATGNVWREYW 370
Query: 410 TESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLD 469
ESMWQ GLVHLEKTA+KWGKNG GDEW+EKWWEHYDAS KAEKWAHKWC+IDP QL+
Sbjct: 371 RESMWQESGLVHLEKTANKWGKNGEGDEWEEKWWEHYDASNKAEKWAHKWCTIDPTRQLE 430
Query: 470 AGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGET 529
AGHAH+WHERWGE YDGHGGSMKYTDKWAERCEGDGW+KWGDKWDE+FDPN HGVKQGET
Sbjct: 431 AGHAHIWHERWGENYDGHGGSMKYTDKWAERCEGDGWTKWGDKWDEHFDPNGHGVKQGET 490
Query: 530 WWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDN 589
WWAGK+GERWNRTWGERHNGSGWVHKYGKSSSGE WDTHE+QETWYERFPH+GFYHCF+N
Sbjct: 491 WWAGKHGERWNRTWGERHNGSGWVHKYGKSSSGEHWDTHEEQETWYERFPHYGFYHCFEN 550
Query: 590 SVQLREVRKPSEFQE 604
S LREV+ PS+ E
Sbjct: 551 SGILREVQIPSDSHE 565
>gi|297816898|ref|XP_002876332.1| hypothetical protein ARALYDRAFT_486012 [Arabidopsis lyrata subsp.
lyrata]
gi|297322170|gb|EFH52591.1| hypothetical protein ARALYDRAFT_486012 [Arabidopsis lyrata subsp.
lyrata]
Length = 580
Score = 654 bits (1687), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/565 (61%), Positives = 419/565 (74%), Gaps = 33/565 (5%)
Query: 38 RTGAKVGVSNSEGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTE 97
R+G ++ ++EG SYLDMW+ AVDR++KE F+KIA ++ VDG + GG
Sbjct: 48 RSGVRILRVSNEGRESYLDMWKNAVDREKKEKAFEKIAENVVA---VDGEKEKGG----- 99
Query: 98 QLEKKSEEFSKILDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTA 157
+EKKS+EF KIL+VS EERDRIQR+QV+DRAAAAI+AARAIL N K G
Sbjct: 100 DMEKKSDEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAILASNNSGDGKEG------ 153
Query: 158 EVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGP 217
N E++ +E++ KN++ G S ++VPRS T G TP GP
Sbjct: 154 ------FPNEENTVTSEVTETPKNAK---------LGMWSRTVYVPRSETSGTETP--GP 196
Query: 218 DFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKP 277
DFWSW+PP+ + DLQ EK + +PT NPV+EK +S D L IP+ES LS +
Sbjct: 197 DFWSWTPPQGSEISSNMNVDLQAVEKPAEFPTLPNPVLEKDKSADSLSIPYESMLSSERH 256
Query: 278 DPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGI 337
+PPF+SL+ V KE ++ + ET S E + DL + SA+A EAA LD +DE +T G+
Sbjct: 257 SFTIPPFESLIEVRKEAETKPSSETSSTEHDLDL--ISSANAEEAARVLDSLDESSTHGV 314
Query: 338 NPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSG 397
+ DG +WWK+TG+E+RPDGVVCRWTM RGV+AD +EWQ+K+WEA+D+ G KELGSEKSG
Sbjct: 315 SEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEASDDFGFKELGSEKSG 374
Query: 398 RDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAH 457
RDATGNVWREFW ESM Q G+VH+EKTADKWGK+G GDEWQEKWWEHYDA+GK+EKWAH
Sbjct: 375 RDATGNVWREFWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWWEHYDATGKSEKWAH 434
Query: 458 KWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENF 517
KWCSID NT LDAGHAHVWHERWGEKYDG GGS KYTDKWAER GDGW KWGDKWDENF
Sbjct: 435 KWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWAERWVGDGWDKWGDKWDENF 494
Query: 518 DPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYER 577
+P++ GVKQGETWW GK+G+RWNRTWGE HNGSGWVHKYGKSSSGE WDTH QETWYER
Sbjct: 495 NPSAQGVKQGETWWEGKHGDRWNRTWGEGHNGSGWVHKYGKSSSGEHWDTHVPQETWYER 554
Query: 578 FPHFGFYHCFDNSVQLREVRKPSEF 602
FPHFGF+HCFDNSVQLR V+KPS+
Sbjct: 555 FPHFGFFHCFDNSVQLRAVKKPSDM 579
>gi|15228186|ref|NP_191135.1| uncharacterized protein [Arabidopsis thaliana]
gi|30694316|ref|NP_850708.1| uncharacterized protein [Arabidopsis thaliana]
gi|334186001|ref|NP_001190098.1| uncharacterized protein [Arabidopsis thaliana]
gi|58652076|gb|AAW80863.1| At3g55760 [Arabidopsis thaliana]
gi|332645910|gb|AEE79431.1| uncharacterized protein [Arabidopsis thaliana]
gi|332645911|gb|AEE79432.1| uncharacterized protein [Arabidopsis thaliana]
gi|332645912|gb|AEE79433.1| uncharacterized protein [Arabidopsis thaliana]
Length = 578
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/565 (61%), Positives = 419/565 (74%), Gaps = 36/565 (6%)
Query: 38 RTGAKVGVSNSEGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTE 97
RTG ++ ++EG SYLDMW+ AVDR++KE F+KIA ++ VDG + GG
Sbjct: 49 RTGVRILRVSNEGRESYLDMWKNAVDREKKEKAFEKIAENVVA---VDGEKEKGG----- 100
Query: 98 QLEKKSEEFSKILDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTA 157
LEKKS+EF KIL+VS EERDRIQR+QV+DRAAAAI+AARAIL N K G
Sbjct: 101 DLEKKSDEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAILASNNSGDGKEG------ 154
Query: 158 EVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGP 217
N +++ +E++ KN++ G S ++VPRS T G TP GP
Sbjct: 155 ------FPNEDNTVTSEVTETPKNAK---------LGMWSRTVYVPRSETSGTETP--GP 197
Query: 218 DFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKP 277
DFWSW+PP+ + + V DLQ EK + +PT NPV+EK +S D L IP+ES LS +
Sbjct: 198 DFWSWTPPQGSE--ISSV-DLQAVEKPAEFPTLPNPVLEKDKSADSLSIPYESMLSSERH 254
Query: 278 DPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGI 337
+PPF+SL+ V KE +ET + +L E DL + SA+A E A LD +DE +T G+
Sbjct: 255 SFTIPPFESLIEVRKE--AETKPSSETLSTEHDLDLISSANAEEVARVLDSLDESSTHGV 312
Query: 338 NPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSG 397
+ DG +WWK+TG+E+RPDGVVCRWTM RGV+AD +EWQ+K+WEA+D+ G KELGSEKSG
Sbjct: 313 SEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEASDDFGFKELGSEKSG 372
Query: 398 RDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAH 457
RDATGNVWREFW ESM Q G+VH+EKTADKWGK+G GDEWQEKWWEHYDA+GK+EKWAH
Sbjct: 373 RDATGNVWREFWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWWEHYDATGKSEKWAH 432
Query: 458 KWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENF 517
KWCSID NT LDAGHAHVWHERWGEKYDG GGS KYTDKWAER GDGW KWGDKWDENF
Sbjct: 433 KWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWAERWVGDGWDKWGDKWDENF 492
Query: 518 DPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYER 577
+P++ GVKQGETWW GK+G+RWNR+WGE HNGSGWVHKYGKSSSGE WDTH QETWYE+
Sbjct: 493 NPSAQGVKQGETWWEGKHGDRWNRSWGEGHNGSGWVHKYGKSSSGEHWDTHVPQETWYEK 552
Query: 578 FPHFGFYHCFDNSVQLREVRKPSEF 602
FPHFGF+HCFDNSVQLR V+KPS+
Sbjct: 553 FPHFGFFHCFDNSVQLRAVKKPSDM 577
>gi|22655111|gb|AAM98146.1| putative protein [Arabidopsis thaliana]
Length = 578
Score = 648 bits (1672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/565 (61%), Positives = 419/565 (74%), Gaps = 36/565 (6%)
Query: 38 RTGAKVGVSNSEGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTE 97
RTG ++ ++EG SYLDMW+ AVDR++KE F+KIA ++ VDG + GG
Sbjct: 49 RTGVRILRVSNEGRESYLDMWKNAVDREKKEKAFEKIAENVVA---VDGEKEKGG----- 100
Query: 98 QLEKKSEEFSKILDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTA 157
LEKKS+EF KIL+VS EERDRIQR+QV+DRAAAAI+AARAIL N K G
Sbjct: 101 DLEKKSDEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAILASNNSGDGKEG------ 154
Query: 158 EVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGP 217
N +++ +E++ KN++ G S ++VPRS T G TP GP
Sbjct: 155 ------FPNEDNTVTSEVTETPKNAK---------LGMWSRTVYVPRSETSGTETP--GP 197
Query: 218 DFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKP 277
DFWSW+PP+ + + V DLQ EK + +PT NPV+EK +S D L IP+ES LS +
Sbjct: 198 DFWSWTPPQGSE--ISSV-DLQAVEKPAEFPTLPNPVLEKDKSADSLSIPYESMLSSERH 254
Query: 278 DPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGI 337
+PPF+SL+ V KE +ET + +L E DL + SA+A E A LD +DE +T G+
Sbjct: 255 SFTIPPFESLIEVRKE--AETKPSSETLSTEHDLDLISSANAEEVARVLDSLDESSTHGV 312
Query: 338 NPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSG 397
+ DG +WWK+TG+E+RPDGVVCRWTM RGV+AD +EWQ+K+WEA+D+ G KELGSEKSG
Sbjct: 313 SEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEASDDSGFKELGSEKSG 372
Query: 398 RDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAH 457
RDATGNVWREFW ESM Q G+VH+EKTADKWGK+G GDEWQEKWWEHYDA+GK+EKWAH
Sbjct: 373 RDATGNVWREFWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWWEHYDATGKSEKWAH 432
Query: 458 KWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENF 517
KWCSID NT LDAGHAHVWHERWGEKYDG GGS KYTDKWAER GDGW KWGDKWDENF
Sbjct: 433 KWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWAERWVGDGWDKWGDKWDENF 492
Query: 518 DPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYER 577
+P++ GVKQGETWW GK+G+RWNR+WGE HNGSGWVHKYGKSSSGE WDTH QETWYE+
Sbjct: 493 NPSAQGVKQGETWWEGKHGDRWNRSWGEGHNGSGWVHKYGKSSSGEHWDTHVPQETWYEK 552
Query: 578 FPHFGFYHCFDNSVQLREVRKPSEF 602
FPHFGF+HCFDNSVQLR V+KPS+
Sbjct: 553 FPHFGFFHCFDNSVQLRAVKKPSDM 577
>gi|224102135|ref|XP_002312560.1| predicted protein [Populus trichocarpa]
gi|222852380|gb|EEE89927.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 643 bits (1659), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/569 (65%), Positives = 426/569 (74%), Gaps = 43/569 (7%)
Query: 41 AKVGVSNSEGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLE 100
++ VSN +G SYLDMW+ AVDR+RK +EFQ+IAG+LA++ + ++ D+T LE
Sbjct: 4 TRIRVSN-DGSNSYLDMWKTAVDRERKTVEFQQIAGNLAQTDNDSDDDD---DDVTVDLE 59
Query: 101 KKSEEFSKILDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVS 160
KKSE+F+KIL+VSKEERDRIQR+QVIDRAAAAIAAAR I+ EK +
Sbjct: 60 KKSEDFNKILEVSKEERDRIQRVQVIDRAAAAIAAARDIVREKKSA-------------- 105
Query: 161 RFVKKNSESSGAAEISPFVKNSESNGTAEVPERG----ALSAGIFVPRSGTPGNRTPAPG 216
K+ ES G ++G + S I V RS + + PG
Sbjct: 106 ---------------DKDFKSHESMGGEVEDQQGGKFWSYSRSILVSRSESSAD--GVPG 148
Query: 217 PDFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPK 276
PDFWSW+PP D + D D+ KSS P + PV K RS D L IPFESKL +
Sbjct: 149 PDFWSWTPPLSSDGNSDDSSDVLKVRKSSDTPLTI-PVAMKERSADFLSIPFESKLLDTN 207
Query: 277 PDPLLPPFQSLLGVEKEEVSETNLETPSL-EEERDLGALFSAHAAEAAHAL--DKVDELA 333
+PP QSL+ VE EVSE+ LE PS EEER+LG FSA+AAEAAHAL DKVDEL+
Sbjct: 208 HSSPIPPLQSLVEVEGVEVSESILEMPSKNEEERELGVQFSAYAAEAAHALEKDKVDELS 267
Query: 334 TRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGS 393
+ G+ DGSR W+ETGIEQRPDGV+CRWTMTRGVSAD+ +EWQEKFWEAAD+ G+KELGS
Sbjct: 268 SYGVTADGSRCWRETGIEQRPDGVICRWTMTRGVSADQEVEWQEKFWEAADDFGYKELGS 327
Query: 394 EKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAE 453
EKSGRDATGNVWREFW ESM Q GL+HLEKTADKWGKNG GDEWQEKWWEHY ASG+AE
Sbjct: 328 EKSGRDATGNVWREFWRESMRQESGLLHLEKTADKWGKNGQGDEWQEKWWEHYGASGQAE 387
Query: 454 KWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKW 513
KWAHKWCSIDP T L+AGHAHVWHERWGEKYDGHGGS KYTDKWAERCEGDGW+KWGDKW
Sbjct: 388 KWAHKWCSIDPTTNLEAGHAHVWHERWGEKYDGHGGSTKYTDKWAERCEGDGWAKWGDKW 447
Query: 514 DENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQET 573
DENFD N HGVKQGE WW GK+GERWNRTWGERHNGSGWVHKYGKSS GE WDTH QQ+T
Sbjct: 448 DENFDLNGHGVKQGEAWWEGKHGERWNRTWGERHNGSGWVHKYGKSSCGEHWDTHTQQDT 507
Query: 574 WYERFPHFGFYHCFDNSVQLREVRKPSEF 602
WYERFPH+GFYHCF+NSVQLREV+KPSE
Sbjct: 508 WYERFPHYGFYHCFENSVQLREVQKPSEI 536
>gi|297737133|emb|CBI26334.3| unnamed protein product [Vitis vinifera]
Length = 414
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 300/413 (72%), Positives = 343/413 (83%), Gaps = 2/413 (0%)
Query: 190 VPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPT 249
V + G + +FVP+S T N TP GPDFWSW+PP D + D +LQ A SS Y T
Sbjct: 3 VQQGGTQNGILFVPQSRTSVNSTP--GPDFWSWTPPMDSEGKSDDAGNLQTARTSSPYLT 60
Query: 250 PVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSETNLETPSLEEER 309
P ++EK +SVD L IPFES+ SE +P LPP QSL V +VS ++LE PSL++E
Sbjct: 61 PAESLMEKEQSVDFLSIPFESRFSESSHNPPLPPLQSLTEVGTVDVSSSSLEMPSLKKED 120
Query: 310 DLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSA 369
+LG LF HAAEA HALD+VD + G++PDGSRWW+ETGIEQRPDGVVCRWT+ RGVSA
Sbjct: 121 ELGVLFLGHAAEAVHALDEVDGALSHGVSPDGSRWWRETGIEQRPDGVVCRWTLIRGVSA 180
Query: 370 DEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKW 429
D +EW+EKFWEAAD+ +KELGSEKSGRDATGNVWRE+W ESMWQ+ GL+H+EKTADKW
Sbjct: 181 DHVVEWEEKFWEAADKFQYKELGSEKSGRDATGNVWREYWKESMWQDCGLMHMEKTADKW 240
Query: 430 GKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGG 489
GKNG GDEW EKWWE YDASGKA+KWAHKWCSIDPNTQL+AGHAHVWHERWGE+YDGHGG
Sbjct: 241 GKNGKGDEWHEKWWEQYDASGKADKWAHKWCSIDPNTQLEAGHAHVWHERWGERYDGHGG 300
Query: 490 SMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNG 549
SMKYTDKWAERCEGD W+KWGDKWDENFDPNSHGVKQGETWW GK+GERWNRTWGE HNG
Sbjct: 301 SMKYTDKWAERCEGDAWTKWGDKWDENFDPNSHGVKQGETWWEGKHGERWNRTWGEGHNG 360
Query: 550 SGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPSEF 602
SGWVHKYGKSSSGE WDTHE+Q+TWYERFPH+GFYHCF+NSVQLREV+ P +
Sbjct: 361 SGWVHKYGKSSSGEHWDTHEEQDTWYERFPHYGFYHCFENSVQLREVQTPPQL 413
>gi|356500589|ref|XP_003519114.1| PREDICTED: uncharacterized protein LOC100810326 [Glycine max]
Length = 575
Score = 603 bits (1554), Expect = e-169, Method: Compositional matrix adjust.
Identities = 328/590 (55%), Positives = 403/590 (68%), Gaps = 54/590 (9%)
Query: 38 RTGAKVGVSNSEG-GGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLT 96
R A VG +G SYLDMW+KAV+R+RK F IA +A + D +
Sbjct: 32 RANASVGGGKGDGETASYLDMWKKAVERERKSASFNSIADRVAANTDDN---------ND 82
Query: 97 EQLEKKSEEFSKILDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGT 156
+ LEKK+ EF K+L VS EERDR+QR+QVIDRAAAAIAAAR +L+E++ + + E++
Sbjct: 83 DDLEKKTSEFQKLLQVSAEERDRVQRMQVIDRAAAAIAAARQLLQERSAADSSHHEATAD 142
Query: 157 AEVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPG 216
+++ SG G S GI V S T GN PG
Sbjct: 143 E------RRDESGSGM---------------------GVQSEGIRVSESETRGN--GVPG 173
Query: 217 PDFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPK 276
PDFWSW+PP + D D LQ+ KSSV PT + VVEK + L IPFES LS+ +
Sbjct: 174 PDFWSWTPPVESDVPSDDGSGLQLDTKSSVRPTLPSAVVEKEWTPQFLSIPFESLLSQSE 233
Query: 277 PDPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAH-------AAEAAHALDKV 329
LPPFQS L VE+ E + H AAEAAHAL +
Sbjct: 234 RSDTLPPFQSFLEVEEAETTSDPESLSESPSLSLSLEEEQIHGESSFDYAAEAAHALSEA 293
Query: 330 DELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHK 389
++ + G+NPDGSRWWKETGIE+RPDGV+CRWTMTRGVSAD+A+EWQEK+WEA+D+ G+K
Sbjct: 294 NKSSPIGVNPDGSRWWKETGIERRPDGVICRWTMTRGVSADKAIEWQEKYWEASDDFGYK 353
Query: 390 ELGSEKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDAS 449
ELGSEKSGRDA GN+WREFW ES+ GL+ EKTADKWG+N NG+EWQEKW E Y+A+
Sbjct: 354 ELGSEKSGRDANGNIWREFWRESLCLENGLMSFEKTADKWGRNVNGNEWQEKWGERYNAA 413
Query: 450 GKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKW 509
G+ EKWAHKWCSIDPNT L+ GHAHVWHERWG KYDG+GGS+KYTDKWAER GW KW
Sbjct: 414 GQTEKWAHKWCSIDPNTPLEPGHAHVWHERWGGKYDGYGGSIKYTDKWAERFVDGGWDKW 473
Query: 510 GDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHE 569
GDKWDENFDPN++GVKQGE+WW G++G+RWNRTWGE+HNGSGW+HKYG+SSSGE WDTH
Sbjct: 474 GDKWDENFDPNANGVKQGESWWEGRHGDRWNRTWGEQHNGSGWIHKYGQSSSGEHWDTHA 533
Query: 570 QQETWYERFPHFGFYHCFDNSVQLREVRKPSEFQEEPFEIQDKRSELQEP 619
+++TWYE+FPH+GF++CF+NSVQLREV KPSE EIQ ++QEP
Sbjct: 534 REDTWYEKFPHYGFFNCFENSVQLREVPKPSEI----LEIQ----QVQEP 575
>gi|326520025|dbj|BAK03937.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 605
Score = 570 bits (1470), Expect = e-160, Method: Compositional matrix adjust.
Identities = 320/569 (56%), Positives = 390/569 (68%), Gaps = 40/569 (7%)
Query: 53 SYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDV 112
SYLDMW+KAVDR+R+ E +A L S E +E+++ F ++L V
Sbjct: 51 SYLDMWKKAVDRERRSAE---LAYRLQPSPPPAEAEAE--APPQADVERRTARFEEMLRV 105
Query: 113 SKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVV-----KNGESSGTAEV-----SRF 162
+EERDR+QR QVIDRAAAA+AAARA+L+E S K ++G AE SR
Sbjct: 106 PREERDRVQRTQVIDRAAAALAAARAVLKEPPQSSPTPQPHKPTPATGVAEPGNVFDSRK 165
Query: 163 VKKNSESSGAAEIS-PFVKNSE--SNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDF 219
K E G+ + S P NSE +N P + A S + GTPG PDF
Sbjct: 166 AAKGLEDQGSGQDSLPAASNSEKVTNSGDSYPSKQASS------KLGTPG-------PDF 212
Query: 220 WSWSPPEDDDRDMRDVRD-LQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPD 278
WSW PP D + R+ L+ ++K + + ++EK RS D L +PF + E K D
Sbjct: 213 WSWLPPVDSSSEPRESNTVLKPSKKVDSFSSQPEMLMEKERSADFLSLPFVASFFEKKED 272
Query: 279 PLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGIN 338
LPPFQS + E + + P + E FS +AAE A AL D+ ++ GI+
Sbjct: 273 RSLPPFQSFVEPENTD----SKAKPVADAEEAFETQFSQNAAETARALSTSDDKSSHGID 328
Query: 339 PDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGR 398
PDGS+WWKETG+EQRPDGVVC+WT+ RGVSAD ++E+++K+WEA+D HKELGSEKSGR
Sbjct: 329 PDGSKWWKETGVEQRPDGVVCKWTVVRGVSADGSVEFEDKYWEASDRFDHKELGSEKSGR 388
Query: 399 DATGNVWREFWTESMWQN--QGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWA 456
DA GNVWRE+W ESMWQ+ GL+H+EKTADKWGKNG G++WQEKWWE YD+SGKAEK A
Sbjct: 389 DARGNVWREYWKESMWQDFTSGLMHMEKTADKWGKNGKGEQWQEKWWEQYDSSGKAEKSA 448
Query: 457 HKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDEN 516
KWCS+DPNT LDAGHAHVWHERWGE YDG GGS+KYTDKWAER EGDGWSKWGDKWDE+
Sbjct: 449 DKWCSLDPNTPLDAGHAHVWHERWGETYDGCGGSVKYTDKWAERSEGDGWSKWGDKWDEH 508
Query: 517 FDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYE 576
FDPN HGVKQGETWW GKYG+RWNRTWGE HNGSGWVHKYG+SSSGE WDTHE QETWYE
Sbjct: 509 FDPNRHGVKQGETWWEGKYGDRWNRTWGEGHNGSGWVHKYGRSSSGEHWDTHEPQETWYE 568
Query: 577 RFPHFGFYHCFDNSVQLREVRK--PSEFQ 603
+PHFGF+HCF+NSVQL V + P F+
Sbjct: 569 SYPHFGFHHCFENSVQLLSVSRQPPKNFK 597
>gi|242071493|ref|XP_002451023.1| hypothetical protein SORBIDRAFT_05g022830 [Sorghum bicolor]
gi|241936866|gb|EES10011.1| hypothetical protein SORBIDRAFT_05g022830 [Sorghum bicolor]
Length = 605
Score = 558 bits (1438), Expect = e-156, Method: Compositional matrix adjust.
Identities = 301/552 (54%), Positives = 387/552 (70%), Gaps = 18/552 (3%)
Query: 54 YLDMWQKAVDRDRKEIEF-QKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDV 112
YLDMW+KAV+R+R+ E +++ + S + + E + +++ F ++L V
Sbjct: 59 YLDMWRKAVERERRSAELARRLQEAPPTSPAAEADAPPAPGAPVEDVRRRTARFEEMLRV 118
Query: 113 SKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNG--ESSGTAEVSRFVKKNSESS 170
EERDR+QR QVIDRAAAA+AAARA+L+E + + + + TA+V +
Sbjct: 119 PPEERDRVQRRQVIDRAAAALAAARAVLKEPPPASPPSTPPQVAETAKVGSVAAGAAARG 178
Query: 171 GAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPP-EDDD 229
P + S+ +AEVP+ G S S ++ PGPDFWSW PP +D
Sbjct: 179 SDRSSRPSARAQLSSPSAEVPDSGVSSP------SKQSSSKLGTPGPDFWSWLPPVQDSS 232
Query: 230 RDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLG 289
+ L+ ++K + + + ++EK RS D L +PFE+ + K D LPPFQS
Sbjct: 233 KQKESGTGLKPSKKMDAFSSQPDLLMEKERSADSLSLPFETAFFKKKEDRSLPPFQSF-- 290
Query: 290 VEKEEV-SETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKET 348
E E V S+ +L + +++ FS +AAE A AL + E ++ GI+ DGS WWKET
Sbjct: 291 AEPENVDSKADL---AADKKDTFEEQFSKNAAEVARALSESTEKSSHGIHLDGSMWWKET 347
Query: 349 GIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREF 408
G+EQRPDGVVC+WT+ RGVSAD A+E+++K+WEA+D HKELGSEKSGRDA GNVWRE+
Sbjct: 348 GVEQRPDGVVCKWTVIRGVSADGAVEFEDKYWEASDRFDHKELGSEKSGRDAAGNVWREY 407
Query: 409 WTESMWQNQ--GLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNT 466
W ESMWQ+ G++H+EKTADKWG+NG G++WQE+W+EHYD++GK EKWA KWCS+DPNT
Sbjct: 408 WKESMWQDYTCGVMHMEKTADKWGQNGKGEQWQEQWFEHYDSTGKTEKWADKWCSLDPNT 467
Query: 467 QLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQ 526
LD GHAHVWHERWGE YDG+GGS KYTDKWAER EGDGWSKWGDKWDE+FDPN HG KQ
Sbjct: 468 PLDVGHAHVWHERWGENYDGYGGSTKYTDKWAERSEGDGWSKWGDKWDEHFDPNGHGTKQ 527
Query: 527 GETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHC 586
GETWWAGKYG+RWNRTWGE HNGSGWVHKYG+SSSGE WDTH Q+TWYERFPHFGFYHC
Sbjct: 528 GETWWAGKYGDRWNRTWGEGHNGSGWVHKYGRSSSGEHWDTHVPQDTWYERFPHFGFYHC 587
Query: 587 FDNSVQLREVRK 598
F+NS QLR V++
Sbjct: 588 FENSAQLRSVKR 599
>gi|77551654|gb|ABA94451.1| expressed protein [Oryza sativa Japonica Group]
gi|125577642|gb|EAZ18864.1| hypothetical protein OsJ_34403 [Oryza sativa Japonica Group]
Length = 618
Score = 556 bits (1434), Expect = e-156, Method: Compositional matrix adjust.
Identities = 316/590 (53%), Positives = 400/590 (67%), Gaps = 31/590 (5%)
Query: 39 TGAKVGVSNSEGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNE------GGGG 92
T + +GG SYLDMW+KAV+R+R+ E IA L +S G
Sbjct: 35 TCVRATARGGDGGSSYLDMWKKAVERERRSAE---IAHRLQQSSSAAAAAVKEEEGEGKA 91
Query: 93 RDLTEQLEKKSEEFSKILDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGE 152
+E+++ F ++L V +EERDR+QR QVIDRAAAA+AAARA+L++ +
Sbjct: 92 AAAAGDVERRTARFEEMLRVPREERDRVQRRQVIDRAAAALAAARAVLKDPPPPPPPSPP 151
Query: 153 SSGTAE-------VSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRS 205
S+ E + ++ SES + +P ++ ++ V E +A + VP S
Sbjct: 152 STPPQEREQQQKPAATAIQAGSESGLVSRTAPG-ESDRASPPPPVTETATEAAKVSVPDS 210
Query: 206 GTPGNRTP------APGPDFWSWSPPEDDDRDMRDV-RDLQMAEKSSVYPTPVNPVVEKA 258
G PGPDFWSW PP ++ + ++ L+ +EK + + ++EK
Sbjct: 211 GDSSPFKKSSSKLGTPGPDFWSWLPPVENSTKLGEIDTGLKPSEKLDSFAGQPDLLMEKE 270
Query: 259 RSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAH 318
+S DIL +PFE+ + K D LPPFQS E E SE ++ + + E FS +
Sbjct: 271 QSEDILSLPFETSFFK-KEDRSLPPFQSFAEPENVE-SEPSI---TADAEETFEDQFSKN 325
Query: 319 AAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEK 378
AAEAA AL DE ++ G+ PDGS WWKETG+EQRPDGV C+WT+ RGVSAD A+EW++K
Sbjct: 326 AAEAARALSASDEKSSHGVRPDGSLWWKETGVEQRPDGVTCKWTVIRGVSADGAVEWEDK 385
Query: 379 FWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQN--QGLVHLEKTADKWGKNGNGD 436
+WEA+D HKELGSEKSGRDATGNVWRE+W ESMWQ+ G++H+EKTADKWG+NG G+
Sbjct: 386 YWEASDRFDHKELGSEKSGRDATGNVWREYWKESMWQDFTCGVMHMEKTADKWGQNGKGE 445
Query: 437 EWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDK 496
+WQE+WWEHYD+SGKAEKWA KWCS+DPNT LD GHAHVWHERWGEKYDG GGS KYTDK
Sbjct: 446 QWQEQWWEHYDSSGKAEKWADKWCSLDPNTPLDVGHAHVWHERWGEKYDGCGGSAKYTDK 505
Query: 497 WAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKY 556
WAER EGDGWSKWGDKWDE+FDPN HGVKQGETWWAGKYG+RWNRTWGE HN +GWVHKY
Sbjct: 506 WAERSEGDGWSKWGDKWDEHFDPNGHGVKQGETWWAGKYGDRWNRTWGEHHNCTGWVHKY 565
Query: 557 GKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPSEFQEEP 606
G+SSSGE WDTH Q+TWYERFPHFGF HCF+NSVQLR V++ + +P
Sbjct: 566 GRSSSGEHWDTHVPQDTWYERFPHFGFEHCFNNSVQLRSVKRQTPKNTKP 615
>gi|413925397|gb|AFW65329.1| hypothetical protein ZEAMMB73_910657 [Zea mays]
Length = 583
Score = 553 bits (1426), Expect = e-155, Method: Compositional matrix adjust.
Identities = 306/562 (54%), Positives = 386/562 (68%), Gaps = 39/562 (6%)
Query: 50 GGGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKI 109
GGGSYLDMW+KAV+R+R+ + + + D E + +++ F ++
Sbjct: 41 GGGSYLDMWKKAVERERRSADLARRLQAPPAEADAPAPA-----PPVEDVRRRTARFEEM 95
Query: 110 LDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNG--------SVVKNGE-SSGTAEVS 160
L V +EERDR+QR QVIDRAAAA+AAARA+L+E V + G+ S +
Sbjct: 96 LRVPREERDRVQRNQVIDRAAAALAAARAVLKEPPAFSPPPTPPQVAETGKVGSAGGGAA 155
Query: 161 RFVKKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFW 220
R +NS S+ A++SP +AEV + G S ++ PGPDFW
Sbjct: 156 RGSDRNSRSAARAQLSP---------SAEVQDSGGSSP------HNQSSSKLGTPGPDFW 200
Query: 221 SWSPPEDDDRDMRDVRD-LQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDP 279
SW PP D ++ L+ ++K + + + ++EK R D LP+PFE+ + K D
Sbjct: 201 SWLPPVQDSSKQKESNTGLKPSKKLDTFSSQPDLLMEKERLADSLPLPFETAFFK-KEDR 259
Query: 280 LLPPFQSLLGVEKEEV-SETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGIN 338
LPPFQS E E V S +L + ++E FS +AAE A AL + + GI+
Sbjct: 260 SLPPFQSF--AEPENVDSSADL---AADKEDTFEEQFSKNAAEVARALSESVGKPSHGIH 314
Query: 339 PDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGR 398
DGS WWKE G+E+RPDGVVC+WT+ RGVSAD A+EW++K+WEA+D HKELGSEKSGR
Sbjct: 315 LDGSMWWKEVGVERRPDGVVCKWTVIRGVSADGAVEWEDKYWEASDRFDHKELGSEKSGR 374
Query: 399 DATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWA 456
DA GNVWRE+W ESMWQ+ G++H+EK ADKWG+NG G++WQE+W+EHYD++GK EKWA
Sbjct: 375 DAAGNVWREYWKESMWQDYTCGVMHMEKNADKWGQNGKGEQWQEQWFEHYDSTGKTEKWA 434
Query: 457 HKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDEN 516
KWCS+DPNT LD GHAHVWHERWGEKYDG GGS KYTDKWAER EGDGWSKWGDKWDE+
Sbjct: 435 DKWCSLDPNTPLDVGHAHVWHERWGEKYDGLGGSEKYTDKWAERSEGDGWSKWGDKWDEH 494
Query: 517 FDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYE 576
FD N HGVKQGETWWAGK+G+RWNRTWGERHNGSGWVHKYG+SSSGE WDTH Q+TWYE
Sbjct: 495 FDLNGHGVKQGETWWAGKHGDRWNRTWGERHNGSGWVHKYGRSSSGEHWDTHAPQDTWYE 554
Query: 577 RFPHFGFYHCFDNSVQLREVRK 598
RFPHFGFYHCF+NS QLR V++
Sbjct: 555 RFPHFGFYHCFENSPQLRSVKR 576
>gi|357156312|ref|XP_003577413.1| PREDICTED: uncharacterized protein LOC100825548 [Brachypodium
distachyon]
Length = 627
Score = 547 bits (1409), Expect = e-153, Method: Compositional matrix adjust.
Identities = 315/559 (56%), Positives = 383/559 (68%), Gaps = 22/559 (3%)
Query: 53 SYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDV 112
SYLDMW+KAVDR+R+ E +A L S + + +++ F ++L V
Sbjct: 64 SYLDMWKKAVDRERRSAE---LAYRLQSSPPPPADPEAEASAPAPDVARRTARFEEMLRV 120
Query: 113 SKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGA 172
+EERDR+QR QVIDRAAAA+AAARA+L++ E E SG
Sbjct: 121 PREERDRVQRTQVIDRAAAALAAARAVLKDPPQQNPPPPPPPPMQEQKPGTDVELEGSGD 180
Query: 173 AEIS---PFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTP------APGPDFWSWS 223
S P + S+ AEV A S VP +G PGPDFWSW
Sbjct: 181 GLGSWKAPGGSDWSSSSLAEVEPPPAPSQSAKVPNTGDSSPSKQSSSKLGTPGPDFWSWL 240
Query: 224 PPEDDDRDMRDVRD-LQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLP 282
PP ++ + R+ L+ ++K+ + + + ++EK RS D L +PF + E K D LP
Sbjct: 241 PPVENSSEPRESNTGLKPSKKAESFSSQPD-LLEKERSADFLSLPFVTSFFEKKEDRSLP 299
Query: 283 PFQSLLGVEKEEV-SETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDG 341
PFQS E E V SE P+ + E FS +AAEAA AL DE ++ GI+PDG
Sbjct: 300 PFQSF--AEPENVDSEVK---PAADAEEVFETQFSKNAAEAARALSTSDEKSSHGIDPDG 354
Query: 342 SRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDAT 401
S+WWKETG+EQRPDGV+C+WT+ RGVSAD A+E+++K+WEA+D HKELGSEKSGRDA
Sbjct: 355 SKWWKETGVEQRPDGVICKWTVIRGVSADGAVEFEDKYWEASDRFEHKELGSEKSGRDAR 414
Query: 402 GNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKW 459
GNVWRE+W ESMW++ GL+H+EKTADKWGKNG G++WQE+WWE YD+SGKAEKWA KW
Sbjct: 415 GNVWREYWKESMWEDSTSGLMHMEKTADKWGKNGKGEQWQEQWWEQYDSSGKAEKWADKW 474
Query: 460 CSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDP 519
CS+DPNT LD GHAHVWHERWGE YDG GGS+KYTDKWAER EGDGWSKWGDKWDE+FDP
Sbjct: 475 CSLDPNTPLDVGHAHVWHERWGETYDGSGGSVKYTDKWAERSEGDGWSKWGDKWDEHFDP 534
Query: 520 NSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFP 579
N HGVKQGETWW GKYG+RWNRTWGE HNGSGWVHKYG+SSSGE WDTHE QETWYER+P
Sbjct: 535 NGHGVKQGETWWEGKYGDRWNRTWGEGHNGSGWVHKYGRSSSGEHWDTHEPQETWYERYP 594
Query: 580 HFGFYHCFDNSVQLREVRK 598
HFGF HCF+NSVQLR V K
Sbjct: 595 HFGFDHCFENSVQLRSVPK 613
>gi|125534904|gb|EAY81452.1| hypothetical protein OsI_36623 [Oryza sativa Indica Group]
Length = 566
Score = 540 bits (1390), Expect = e-150, Method: Compositional matrix adjust.
Identities = 308/572 (53%), Positives = 391/572 (68%), Gaps = 31/572 (5%)
Query: 57 MWQKAVDRDRKEIEFQKIAGSLAESGDVDGNE------GGGGRDLTEQLEKKSEEFSKIL 110
MW+KAV+R+R+ E IA L +S G +E+++ F ++L
Sbjct: 1 MWKKAVERERRSAE---IAHRLQQSSSAAAAAVKEEEGEGKAAAAAGDVERRTARFEEML 57
Query: 111 DVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAE-------VSRFV 163
V +EERDR+QR QVIDRAAAA+AAARA+L++ + S+ E + +
Sbjct: 58 RVPREERDRVQRRQVIDRAAAALAAARAVLKDPPPPPPPSPPSTPPQEREQQQKPAATAI 117
Query: 164 KKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTP------APGP 217
+ SES + +P ++ ++ V E +A + VP SG PGP
Sbjct: 118 QAGSESGLVSRTAP-GESDRASPPPPVTETATEAAKVSVPDSGDSSPFKKSSSKLGTPGP 176
Query: 218 DFWSWSPPEDDDRDMRDV-RDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPK 276
DFWSW PP ++ + ++ L+ +EK + + ++EK +S DIL +PFE+ + K
Sbjct: 177 DFWSWLPPVENSTKLGEIDTGLKPSEKLDSFAGQPDLLMEKEQSEDILSLPFETSFFK-K 235
Query: 277 PDPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRG 336
D LPPFQS E E SE ++ + + E FS +AAEAA AL +E ++ G
Sbjct: 236 EDRSLPPFQSFAEPENVE-SEPSI---TADAEETFEDQFSKNAAEAARALSASNEKSSHG 291
Query: 337 INPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKS 396
+ PDGS WWKETG+EQRPDGV C+WT+ RGVSAD A+EW++K+WEA+D HKELGSEKS
Sbjct: 292 VRPDGSLWWKETGVEQRPDGVTCKWTVIRGVSADGAVEWEDKYWEASDRFDHKELGSEKS 351
Query: 397 GRDATGNVWREFWTESMWQN--QGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEK 454
GRDATGNVWRE+W ESMWQ+ G++H+EKTADKWG+NG G++WQE+WWEHYD+SGKAEK
Sbjct: 352 GRDATGNVWREYWKESMWQDFTCGVMHMEKTADKWGQNGKGEQWQEQWWEHYDSSGKAEK 411
Query: 455 WAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWD 514
WA KWCS+DPNT LD GHAHVWHERWGEKYDG GGS KYTDKWAER EGDGWSKWGDKWD
Sbjct: 412 WADKWCSLDPNTPLDVGHAHVWHERWGEKYDGCGGSAKYTDKWAERSEGDGWSKWGDKWD 471
Query: 515 ENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETW 574
E+FDPN HGVKQGETWWAGKYG+RWNRTWGE HN +GWVHKYG+SSSGE WDTH Q+TW
Sbjct: 472 EHFDPNGHGVKQGETWWAGKYGDRWNRTWGEHHNCTGWVHKYGRSSSGEHWDTHVPQDTW 531
Query: 575 YERFPHFGFYHCFDNSVQLREVRKPSEFQEEP 606
YERFPHFGF HCF+NSVQLR V++ + +P
Sbjct: 532 YERFPHFGFEHCFNNSVQLRSVKRQTPKNTKP 563
>gi|115486049|ref|NP_001068168.1| Os11g0586300 [Oryza sativa Japonica Group]
gi|113645390|dbj|BAF28531.1| Os11g0586300, partial [Oryza sativa Japonica Group]
Length = 537
Score = 512 bits (1318), Expect = e-142, Method: Compositional matrix adjust.
Identities = 268/453 (59%), Positives = 330/453 (72%), Gaps = 15/453 (3%)
Query: 163 VKKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTP------APG 216
++ SES + +P ++ ++ V E +A + VP SG PG
Sbjct: 88 IQAGSESGLVSRTAP-GESDRASPPPPVTETATEAAKVSVPDSGDSSPFKKSSSKLGTPG 146
Query: 217 PDFWSWSPPEDDDRDMRDV-RDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEP 275
PDFWSW PP ++ + ++ L+ +EK + + ++EK +S DIL +PFE+ +
Sbjct: 147 PDFWSWLPPVENSTKLGEIDTGLKPSEKLDSFAGQPDLLMEKEQSEDILSLPFETSFFK- 205
Query: 276 KPDPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATR 335
K D LPPFQS E E SE ++ + + E FS +AAEAA AL DE ++
Sbjct: 206 KEDRSLPPFQSFAEPENVE-SEPSI---TADAEETFEDQFSKNAAEAARALSASDEKSSH 261
Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
G+ PDGS WWKETG+EQRPDGV C+WT+ RGVSAD A+EW++K+WEA+D HKELGSEK
Sbjct: 262 GVRPDGSLWWKETGVEQRPDGVTCKWTVIRGVSADGAVEWEDKYWEASDRFDHKELGSEK 321
Query: 396 SGRDATGNVWREFWTESMWQN--QGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAE 453
SGRDATGNVWRE+W ESMWQ+ G++H+EKTADKWG+NG G++WQE+WWEHYD+SGKAE
Sbjct: 322 SGRDATGNVWREYWKESMWQDFTCGVMHMEKTADKWGQNGKGEQWQEQWWEHYDSSGKAE 381
Query: 454 KWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKW 513
KWA KWCS+DPNT LD GHAHVWHERWGEKYDG GGS KYTDKWAER EGDGWSKWGDKW
Sbjct: 382 KWADKWCSLDPNTPLDVGHAHVWHERWGEKYDGCGGSAKYTDKWAERSEGDGWSKWGDKW 441
Query: 514 DENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQET 573
DE+FDPN HGVKQGETWWAGKYG+RWNRTWGE HN +GWVHKYG+SSSGE WDTH Q+T
Sbjct: 442 DEHFDPNGHGVKQGETWWAGKYGDRWNRTWGEHHNCTGWVHKYGRSSSGEHWDTHVPQDT 501
Query: 574 WYERFPHFGFYHCFDNSVQLREVRKPSEFQEEP 606
WYERFPHFGF HCF+NSVQLR V++ + +P
Sbjct: 502 WYERFPHFGFEHCFNNSVQLRSVKRQTPKNTKP 534
>gi|302764726|ref|XP_002965784.1| hypothetical protein SELMODRAFT_21913 [Selaginella moellendorffii]
gi|300166598|gb|EFJ33204.1| hypothetical protein SELMODRAFT_21913 [Selaginella moellendorffii]
Length = 522
Score = 492 bits (1267), Expect = e-136, Method: Compositional matrix adjust.
Identities = 281/559 (50%), Positives = 359/559 (64%), Gaps = 59/559 (10%)
Query: 53 SYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDV 112
SYL MW+ A +R +E + Q+ A ++ + D + R++ Q EK +F+++LDV
Sbjct: 1 SYLSMWKNAKERYERE-QLQQNASTVQQDRQPDQD-----REIQSQREK---DFARLLDV 51
Query: 113 SKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGA 172
+EERDR+ RLQVIDRAAAA+AAA A+ +S S V+K E + A
Sbjct: 52 PQEERDRVHRLQVIDRAAAALAAAEAL------------LASRPRAPSTIVEKKWEEAAA 99
Query: 173 AEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDM 232
G + G L +F+P +PGPDFW+W+PP
Sbjct: 100 ------------RGWEGTKKLGKLQTNLFLP-----ATTVVSPGPDFWTWTPPPPPSPVE 142
Query: 233 RDVRDLQMAEKSSVYPTPV-NPVVEKARSVDILPIPFESK--------LSEPKPDPLLPP 283
++ K+S T N V+EK R L +PFE++ + + + P LPP
Sbjct: 143 DPSEKAALSPKTSQSETQASNSVLEKEREAQTLELPFETENARSVLPLVFQSRAAPSLPP 202
Query: 284 FQSLLGVEKEEVSETN----LETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINP 339
QSL+ + KE V+ T E P+ ERD A H L +T G+NP
Sbjct: 203 LQSLVEI-KENVAATRKKQITEVPTAVLERDKLADSVVH-----QDLQTNKTKSTTGVNP 256
Query: 340 DGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRD 399
DGSRWWKETG E R +GVVC W++TRGVS++ +EW+EKFWEA D+ +KELGSEKSGRD
Sbjct: 257 DGSRWWKETGEEDRGNGVVCSWSVTRGVSSEGVVEWEEKFWEACDDFDYKELGSEKSGRD 316
Query: 400 ATGNVWREFWTESMWQN--QGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAH 457
A+GNVWREFW E++WQ+ GL+H+EK+A+KWGKNG G +W EKWWEHYDASG+AEKWA
Sbjct: 317 ASGNVWREFWKETIWQDAKSGLLHMEKSAEKWGKNGTGAQWDEKWWEHYDASGRAEKWAD 376
Query: 458 KWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENF 517
KW IDPNT L+ GH HVWHERWGE++DG GG+MKYTDKWAER + GW+KWGDKWDE F
Sbjct: 377 KWSVIDPNTPLEPGHGHVWHERWGEEFDGQGGAMKYTDKWAERSDFGGWTKWGDKWDERF 436
Query: 518 DPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYER 577
D N G KQGETWWAG G+RWNRTWGE+HNG+GWVHKYG SSSGE WDTHE+QETWYE+
Sbjct: 437 DKNGVGKKQGETWWAGTNGDRWNRTWGEQHNGTGWVHKYGSSSSGEFWDTHEEQETWYEK 496
Query: 578 FPHFGFYHCFDNSVQLREV 596
FPHFGF+HC +NS +L +V
Sbjct: 497 FPHFGFHHCMENSQELHKV 515
>gi|302805366|ref|XP_002984434.1| hypothetical protein SELMODRAFT_11669 [Selaginella moellendorffii]
gi|300147822|gb|EFJ14484.1| hypothetical protein SELMODRAFT_11669 [Selaginella moellendorffii]
Length = 526
Score = 489 bits (1259), Expect = e-135, Method: Compositional matrix adjust.
Identities = 282/559 (50%), Positives = 359/559 (64%), Gaps = 55/559 (9%)
Query: 53 SYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDV 112
SYL MW+ A +R +E + Q+ A ++ + D + R++ Q EK +F+++LDV
Sbjct: 1 SYLSMWKNAKERYERE-QLQQHASTVQQDRQPDQD-----REIQSQREK---DFARLLDV 51
Query: 113 SKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGA 172
+EERDR+ RLQVIDRAAAA+AAA A+ +S S +K E + A
Sbjct: 52 PQEERDRVHRLQVIDRAAAALAAAEAL------------LASRPQAPSTIAEKKWEEAAA 99
Query: 173 AEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDM 232
K ++P R ++ S P +PGPDFW+WSPP
Sbjct: 100 RGWEGTKK------LGDLPRRSVVA-------SAEPATTVVSPGPDFWTWSPPPPPSPVE 146
Query: 233 RDVRDLQMAEKSSVYPTPV-NPVVEKARSVDILPIPFESK--------LSEPKPDPLLPP 283
++ K+S T V N V+EK R L +PFE++ + + + P LPP
Sbjct: 147 DPSEKAALSPKTSQSETQVSNSVLEKEREAQTLELPFETENARSVLPLVFQSRAAPSLPP 206
Query: 284 FQSLLGVEKEEVSETN----LETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINP 339
QSL+ + KE V+ T E P+ ERD A H L +T G+NP
Sbjct: 207 LQSLVEI-KENVAATRKKQITEVPTAVLERDKLADSVVH-----QDLQTNKTKSTTGVNP 260
Query: 340 DGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRD 399
DGSRWWKETG E R +GVVC W++TRGVS++ +EW+EKFWEA D+ +KELGSEKSGRD
Sbjct: 261 DGSRWWKETGEEDRGNGVVCSWSVTRGVSSEGVVEWEEKFWEACDDFDYKELGSEKSGRD 320
Query: 400 ATGNVWREFWTESMWQN--QGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAH 457
A+GNVWREFW E++WQ+ GL+H+EK+A+KWGKNG G +W EKWWEHYDASG+AEKWA
Sbjct: 321 ASGNVWREFWKETIWQDAKSGLLHMEKSAEKWGKNGTGAQWDEKWWEHYDASGRAEKWAD 380
Query: 458 KWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENF 517
KW IDPNT L+ GH HVWHERWGE++DG GG+MKYTDKWAER E GW+KWGDKWDE F
Sbjct: 381 KWSVIDPNTPLEPGHGHVWHERWGEEFDGQGGAMKYTDKWAERSEFGGWTKWGDKWDERF 440
Query: 518 DPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYER 577
D N G KQGETWWAG G+RWNRTWGE+HNG+GWV KYG SSSGE WDTHE+QETWYE+
Sbjct: 441 DKNGIGKKQGETWWAGTNGDRWNRTWGEQHNGTGWVRKYGSSSSGEFWDTHEEQETWYEK 500
Query: 578 FPHFGFYHCFDNSVQLREV 596
FPHFGF+HC +NS +L +V
Sbjct: 501 FPHFGFHHCMENSQELHKV 519
>gi|7263564|emb|CAB81601.1| putative protein [Arabidopsis thaliana]
Length = 497
Score = 460 bits (1184), Expect = e-127, Method: Compositional matrix adjust.
Identities = 259/457 (56%), Positives = 322/457 (70%), Gaps = 36/457 (7%)
Query: 38 RTGAKVGVSNSEGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTE 97
RTG ++ ++EG SYLDMW+ AVDR++KE F+KIA ++ VDG + GG
Sbjct: 49 RTGVRILRVSNEGRESYLDMWKNAVDREKKEKAFEKIAENVVA---VDGEKEKGG----- 100
Query: 98 QLEKKSEEFSKILDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTA 157
LEKKS+EF KIL+VS EERDRIQR+QV+DRAAAAI+AARAIL N K G
Sbjct: 101 DLEKKSDEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAILASNNSGDGKEG------ 154
Query: 158 EVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGP 217
N +++ +E++ KN++ G S ++VPRS T G TP GP
Sbjct: 155 ------FPNEDNTVTSEVTETPKNAK---------LGMWSRTVYVPRSETSGTETP--GP 197
Query: 218 DFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKP 277
DFWSW+PP+ + + V DLQ EK + +PT NPV+EK +S D L IP+ES LS +
Sbjct: 198 DFWSWTPPQGSE--ISSV-DLQAVEKPAEFPTLPNPVLEKDKSADSLSIPYESMLSSERH 254
Query: 278 DPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGI 337
+PPF+SL+ V KE +ET + +L E DL + SA+A E A LD +DE +T G+
Sbjct: 255 SFTIPPFESLIEVRKE--AETKPSSETLSTEHDLDLISSANAEEVARVLDSLDESSTHGV 312
Query: 338 NPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSG 397
+ DG +WWK+TG+E+RPDGVVCRWTM RGV+AD +EWQ+K+WEA+D+ G KELGSEKSG
Sbjct: 313 SEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEASDDFGFKELGSEKSG 372
Query: 398 RDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAH 457
RDATGNVWREFW ESM Q G+VH+EKTADKWGK+G GDEWQEKWWEHYDA+GK+EKWAH
Sbjct: 373 RDATGNVWREFWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWWEHYDATGKSEKWAH 432
Query: 458 KWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
KWCSID NT LDAGHAHVWHERWGEKYDG GGS KYT
Sbjct: 433 KWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYT 469
>gi|449530534|ref|XP_004172249.1| PREDICTED: uncharacterized protein LOC101231355, partial [Cucumis
sativus]
Length = 453
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 261/480 (54%), Positives = 312/480 (65%), Gaps = 30/480 (6%)
Query: 1 MASQLSHYPRATGHRANPPLIFTTRRTTPQQINFWSRRT--GAKVGVSNSEGGGSYLDMW 58
M +L PR T H P L P Q + R + + S+ G SYL MW
Sbjct: 2 MPLRLPLSPRPTLHHHFPRLYHHNFLLLPLQPHIQIRHATPARTLRIRASDEGESYLGMW 61
Query: 59 QKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDVSKEERD 118
+ AV+R RK +EFQK+ + G+ D N G D QLEKKSEEFSKIL V EERD
Sbjct: 62 KNAVERQRKAVEFQKVVENT--EGNDDRNAGDPSSD---QLEKKSEEFSKILQVPPEERD 116
Query: 119 RIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGAAEISPF 178
RIQR+QVI RAAAAIAAARA++ E V + ++ V NS
Sbjct: 117 RIQRMQVIHRAAAAIAAARALVGETGTLAVGDSDTC--------VNLNS----------- 157
Query: 179 VKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDMRDVRDL 238
N E E S +P T + TP GPDFWSW+PP DDD + +L
Sbjct: 158 -TNDEGLLDREEALSEFQSENALLPEFETSQSWTP--GPDFWSWTPPPDDDGNDNAFGEL 214
Query: 239 QMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSET 298
Q KS YP N V EK R +D L IPF+S++SE +PLLPPFQSL+G+EK E SET
Sbjct: 215 QPLGKSQAYPKLSNFVEEKERPIDFLSIPFQSEISE-SVNPLLPPFQSLVGMEKLESSET 273
Query: 299 NLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVV 358
+ ET SLEE+ ++G FS HAAEA+ AL VD+ +T+GI+PDGSRWWKETGIEQRPDGV+
Sbjct: 274 STETHSLEEDENVGIEFSVHAAEASQALSSVDKESTKGIDPDGSRWWKETGIEQRPDGVI 333
Query: 359 CRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQG 418
C+WT+TRGVSAD A EWQ K+WEAADE G+KELGSEKSGRDA GNVWRE+W ESM Q QG
Sbjct: 334 CKWTLTRGVSADLATEWQNKYWEAADEFGYKELGSEKSGRDAYGNVWREYWRESMRQEQG 393
Query: 419 LVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHE 478
LVHLEKTADKWG NG+G EWQEKWWE+Y+ SG+AEK AHKWC IDPNT +D GHAH+W+E
Sbjct: 394 LVHLEKTADKWGINGSGTEWQEKWWEYYNTSGQAEKNAHKWCKIDPNTYVDPGHAHIWNE 453
>gi|388514461|gb|AFK45292.1| unknown [Lotus japonicus]
Length = 243
Score = 414 bits (1063), Expect = e-113, Method: Compositional matrix adjust.
Identities = 196/242 (80%), Positives = 219/242 (90%)
Query: 363 MTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLVHL 422
MTRGVSAD+A+EWQEKFWEA+DE+G+KELGSEKSGRDA+GNVW EFW ESM + GL+H+
Sbjct: 1 MTRGVSADKAVEWQEKFWEASDEVGYKELGSEKSGRDASGNVWHEFWRESMHEENGLMHM 60
Query: 423 EKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGE 482
EKTADKWG NG G+EWQEKWWE Y+ASG+AEKWAHKWCSIDPNT L+AGHAHVWHERWGE
Sbjct: 61 EKTADKWGSNGQGNEWQEKWWERYNASGQAEKWAHKWCSIDPNTPLEAGHAHVWHERWGE 120
Query: 483 KYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRT 542
YDG+GGS KYTDKWAER + GW KWGDKWDENFD N HG+KQGETWW GK+GERWNRT
Sbjct: 121 TYDGYGGSTKYTDKWAERSQDGGWEKWGDKWDENFDLNGHGIKQGETWWEGKHGERWNRT 180
Query: 543 WGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPSEF 602
WGE+ NGSGWVHKYGKSSSGE WDTHE Q+TWYERFPHFGF+HC++NSVQLREV KPSE
Sbjct: 181 WGEQRNGSGWVHKYGKSSSGEHWDTHEGQDTWYERFPHFGFFHCYENSVQLREVPKPSEI 240
Query: 603 QE 604
Q+
Sbjct: 241 QD 242
>gi|168049231|ref|XP_001777067.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671510|gb|EDQ58060.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 267
Score = 389 bits (1000), Expect = e-105, Method: Compositional matrix adjust.
Identities = 190/267 (71%), Positives = 220/267 (82%), Gaps = 3/267 (1%)
Query: 330 DELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHK 389
D T G++ DGSRWWKETG+E R +GV C WT+ RGVSAD ++EW+EKFWEAAD K
Sbjct: 1 DASDTSGVHEDGSRWWKETGVENRANGVTCTWTVMRGVSADGSVEWEEKFWEAADAYDFK 60
Query: 390 ELGSEKSGRDATGNVWREFWTESMWQN--QGLVHLEKTADKWGKNGNGDEWQEKWWEHYD 447
ELG+EKSGRDA+G VWREFW ESMWQ+ GL+H++K+ADKW K+G+G +W EKW E YD
Sbjct: 61 ELGAEKSGRDASGGVWREFWQESMWQDATTGLMHIQKSADKWAKDGHGGQWHEKWMEKYD 120
Query: 448 ASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCE-GDGW 506
ASG+AEKWA KW ID T L+ GHAHVWHERWGE+YDG GGSMKYTDKWAER E G GW
Sbjct: 121 ASGRAEKWADKWSQIDLTTPLEPGHAHVWHERWGEEYDGQGGSMKYTDKWAERLESGGGW 180
Query: 507 SKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWD 566
+KWGDKWDE FD N HGVKQGETWW G +GERWNRTWGE HNGSGWVHKYG+SSSGE WD
Sbjct: 181 TKWGDKWDERFDQNGHGVKQGETWWEGLHGERWNRTWGEGHNGSGWVHKYGQSSSGEHWD 240
Query: 567 THEQQETWYERFPHFGFYHCFDNSVQL 593
TH Q+ET+Y+R+PHFGF CF+NS +L
Sbjct: 241 THSQEETFYDRYPHFGFRECFENSREL 267
>gi|308801813|ref|XP_003078220.1| RNA polymerase II transcription elongation factor DSIF/SUPT5H/SPT5
(ISS) [Ostreococcus tauri]
gi|116056671|emb|CAL52960.1| RNA polymerase II transcription elongation factor DSIF/SUPT5H/SPT5
(ISS) [Ostreococcus tauri]
Length = 480
Score = 214 bits (546), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 142/398 (35%), Positives = 212/398 (53%), Gaps = 37/398 (9%)
Query: 215 PGPDFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSE 274
PG DFW+WSPPE +D + + S V E + L + F+S +
Sbjct: 65 PGSDFWTWSPPEVEDNGPTPKLQRKTETRVSAAVAEAERVPEAS-----LQLKFQSDIET 119
Query: 275 PKPDPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKV-DELA 333
PK L F+S VE E LGA +A E A A+ ++ +
Sbjct: 120 PKE--LKLEFESDGVVELEPAP--------------LGATPTAELEETATAVRELGTDGE 163
Query: 334 TRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGS 393
T G+ +GSRWW+E+G ++ G +CRWT+ RG SAD ++EW+EK+WE +D ++ELG+
Sbjct: 164 TEGVLDNGSRWWRESGEDELAGGRLCRWTLVRGASADGSVEWEEKWWETSDAFNYRELGA 223
Query: 394 EKSGRDATGNVWREFWTESMWQN------QGLVHLEKTADKWGKNGNGDEWQEKWWEHYD 447
KSGRDA+GNVW+E W E + + H+ + A+KWG +G EW E W E+Y
Sbjct: 224 IKSGRDASGNVWQESWREHITHDTTTGFSNASKHIMREANKWGAQADGTEWHEVWDENYW 283
Query: 448 ASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERC---EGD 504
G+ ++ K +I + GH + W +WGE++DGHGG +K+ D +A+R +G
Sbjct: 284 GDGRVKRTCTKKGAIGSGISPEDGHGNRWTHKWGEEWDGHGGCVKWNDSYADRDKSEDGG 343
Query: 505 GWSKWGDKWDENFDPNSH----GVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSS 560
WG++W+E + +H G + G T W + G ++ +TWGE H G VHKYG ++
Sbjct: 344 SGRSWGERWEERWGSFAHNGSAGTRNGST-WDDRDGHKFEKTWGEEHWHDGRVHKYGSTT 402
Query: 561 SG-ELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
G + WDT E + W+ER P FG+ +S QL VR
Sbjct: 403 DGSDGWDTWEDSQGWWERAPSFGWDEAVSHSPQLLSVR 440
>gi|115450547|ref|NP_001048874.1| Os03g0133300 [Oryza sativa Japonica Group]
gi|113547345|dbj|BAF10788.1| Os03g0133300, partial [Oryza sativa Japonica Group]
Length = 371
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 144/362 (39%), Positives = 200/362 (55%), Gaps = 48/362 (13%)
Query: 250 PVNPVVEKARSVDILP-IPFESKLSE---------PKPDPLLPPFQSLLGVEKEEVSETN 299
P E RS+ +P +PF S S P+ P P QS
Sbjct: 9 PTTAATEPVRSLTAMPSLPFPSPRSRRQWKQQNFYPRCTPRGPAPQSR------------ 56
Query: 300 LETPSLEEERDLGALFSAHAAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVV 358
+TP +RD G A+E ++ +DE + G N DGS W++E+G ++ +G
Sbjct: 57 -DTPP---KRDTGI-----ASEKEWGINLLDEAVKESGTNEDGSTWYRESGDDRGDNGYR 107
Query: 359 CRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ- 417
CRW G S D EW+E +WE +D G+KELG+EKSG++ G+ W E W E ++Q++
Sbjct: 108 CRWARMGGQSHDGTTEWKETWWEKSDWTGYKELGAEKSGKNGEGDSWWEKWKEVLYQDEW 167
Query: 418 -GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHV 475
L +E++A+K K+G + W EKWWE YDA G EK AHK+ ++ +
Sbjct: 168 SNLARIERSAEKQAKSGAENAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS--------- 218
Query: 476 WHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKY 535
W ERWGE YDG G +K+TDKWAE G +KWGDKW+E F G +QGETW
Sbjct: 219 WWERWGEHYDGRGFVLKWTDKWAETDLG---TKWGDKWEEKFFAGI-GSRQGETWHVSPG 274
Query: 536 GERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLRE 595
G+RW+RTWGE H G+G VHKYGKS++GE WD +ET+YE PH+G+ +S QL
Sbjct: 275 GDRWSRTWGEEHFGNGKVHKYGKSTTGESWDLVVDEETYYEAEPHYGWADVVGDSTQLLS 334
Query: 596 VR 597
++
Sbjct: 335 IQ 336
>gi|326518522|dbj|BAJ88290.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 428
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 133/305 (43%), Positives = 182/305 (59%), Gaps = 21/305 (6%)
Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
A+E ++ +DE + G N DGS W++E+G + +G CRW G + D EW+E
Sbjct: 126 ASEKEWGINLLDEAVKESGTNEDGSTWYRESGEDLGENGYRCRWARMGGQTHDGTTEWKE 185
Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
+WE +D G+KELG+EKSG++A G+ W E W E + Q++ L LEK+A+K K+G
Sbjct: 186 TWWEKSDWTGYKELGAEKSGKNAEGDSWWEKWKEVLHQDEWSNLARLEKSAEKQAKSGIE 245
Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
+ W EKWWE YDA G EK AHK+ ++ + W ERWGE YDG G +K+T
Sbjct: 246 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWERWGEHYDGRGSVLKWT 296
Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
DKWAE G ++WGDKW+E F G +QGETW A G+RW+RTWGE H G+G VH
Sbjct: 297 DKWAETDLG---TRWGDKWEEKFFAGI-GSRQGETWHASPGGDRWSRTWGEEHFGNGKVH 352
Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREV----RKPSEFQEEPFEIQ 610
KYGKS++GE WD ++ET+YE PH+G+ +S QL + R P F F
Sbjct: 353 KYGKSTTGESWDLVVEEETYYEADPHYGWADVVGDSSQLLSIQPVERPPGVFPTIDFSSS 412
Query: 611 DKRSE 615
R+E
Sbjct: 413 PPRTE 417
>gi|242042337|ref|XP_002468563.1| hypothetical protein SORBIDRAFT_01g048110 [Sorghum bicolor]
gi|241922417|gb|EER95561.1| hypothetical protein SORBIDRAFT_01g048110 [Sorghum bicolor]
Length = 348
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 128/283 (45%), Positives = 177/283 (62%), Gaps = 17/283 (6%)
Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
A+E ++ DE + GIN DGS W++E+G + +G CRWT G + D + EW+E
Sbjct: 45 ASEKEWGINLPDEAVKESGINEDGSTWYRESGEDVGENGYRCRWTRMGGQNHDGSTEWKE 104
Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
+WE +D G+KELG+EKSG++A G+ W E W E ++Q++ L +E++A+K K+G
Sbjct: 105 TWWEKSDWTGYKELGAEKSGKNAEGDSWWEKWKEVLYQDEWSNLARIERSAEKQAKSGVE 164
Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
+ W EKWWE YDA G EK AHK+ ++ + W ERWGE YDG G +K+T
Sbjct: 165 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWERWGEHYDGRGFVLKWT 215
Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
DKWAE G +KWGDKW+E F G +QGETW GERW+RTWGE H G+G VH
Sbjct: 216 DKWAETDLG---TKWGDKWEEKFFAGI-GSRQGETWHVSPGGERWSRTWGEEHFGNGKVH 271
Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
KYGKS++GE WD +ET+YE PH+G+ +S QL ++
Sbjct: 272 KYGKSTTGESWDLVVDEETYYEAEPHYGWADVVGDSTQLLSIQ 314
>gi|225459860|ref|XP_002285931.1| PREDICTED: uncharacterized protein LOC100252277 [Vitis vinifera]
Length = 432
Score = 209 bits (533), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 124/264 (46%), Positives = 164/264 (62%), Gaps = 16/264 (6%)
Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
G N DGS W++E+G + +G CRWT G S D + EW+E +WE +D G+KELG EK
Sbjct: 148 GTNEDGSAWYRESGEDLGENGYRCRWTRMGGQSHDGSSEWKEMWWEKSDWTGYKELGVEK 207
Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
SGR+A G+ W E W E + Q++ L +E++A K K+G + W EKWWE YDA G
Sbjct: 208 SGRNAEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEKWWEKYDAKGST 267
Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
EK AHK+ ++ + W E+WGE YDG G +K+TDKWAE G +KWGDK
Sbjct: 268 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 315
Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
W+E F G +QGETW G+RW+RTWGE H G+G VHKYGKS++GE WD +E
Sbjct: 316 WEEKFFAGI-GSRQGETWHLSPSGDRWSRTWGEEHFGNGKVHKYGKSTTGESWDIVVDEE 374
Query: 573 TWYERFPHFGFYHCFDNSVQLREV 596
T+YE PH+G+ NS QL +
Sbjct: 375 TYYEAEPHYGWADVVGNSSQLLSI 398
>gi|413956953|gb|AFW89602.1| hypothetical protein ZEAMMB73_256684 [Zea mays]
Length = 450
Score = 209 bits (531), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 131/297 (44%), Positives = 179/297 (60%), Gaps = 21/297 (7%)
Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
A+E ++ +DE + GIN DGS W++E+G + +G CRW G + D + EW+E
Sbjct: 145 ASEKEWGINLLDEAVKESGINEDGSTWYRESGEDTGENGYRCRWARMGGQNHDGSTEWKE 204
Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
+WE +D G+KELG+EKSG++A G+ W E W E ++Q++ L +EK+A+K K+G
Sbjct: 205 TWWEKSDWTGYKELGAEKSGKNAEGDSWWEKWKEVLYQDEWSNLARIEKSAEKQAKSGAE 264
Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
+ W EKWWE YDA G EK AHK+ ++ + W ERWGE YDG G +K+T
Sbjct: 265 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWERWGEHYDGRGFVLKWT 315
Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
DKWAE G +KWGDKW+E F G +QGETW ERW+RTWGE H G+G VH
Sbjct: 316 DKWAETDLG---TKWGDKWEEKFFAGI-GSRQGETWHVSPGRERWSRTWGEEHFGNGKVH 371
Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREV----RKPSEFQEEPF 607
KYGKS++GE WD +ET+YE PH+G+ +S QL + R P F F
Sbjct: 372 KYGKSTTGESWDLVVDEETYYEAEPHYGWADVVGDSTQLLSIQPVERPPGVFPAIDF 428
>gi|125606382|gb|EAZ45418.1| hypothetical protein OsJ_30067 [Oryza sativa Japonica Group]
Length = 401
Score = 209 bits (531), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 135/323 (41%), Positives = 187/323 (57%), Gaps = 29/323 (8%)
Query: 279 PLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDE-LATRGI 337
P LPP G S +PSL L + A+E ++ +DE + G
Sbjct: 69 PPLPP-----GAGVAPASPQCRHSPSL-------LLDTGIASEKEWGINLLDEAVKESGT 116
Query: 338 NPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSG 397
N DGS W++E+G ++ +G CRW G S D EW+E +WE +D G+KELG+EKSG
Sbjct: 117 NEDGSTWYRESGDDRGDNGYRCRWARMGGQSHDGTTEWKETWWEKSDWTGYKELGAEKSG 176
Query: 398 RDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKAEK 454
++ G+ W E W E ++Q++ L +E++A+K K+G + W EKWWE YDA G EK
Sbjct: 177 KNGEGDSWWEKWKEVLYQDEWSNLARIERSAEKQAKSGAENAGWYEKWWEKYDAKGWTEK 236
Query: 455 WAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWD 514
AHK+ ++ + W ERWGE YDG G +K+TDKWAE G +KWGDKW+
Sbjct: 237 GAHKYGRLNEQS---------WWERWGEHYDGRGFVLKWTDKWAETDLG---TKWGDKWE 284
Query: 515 ENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETW 574
E F G +QGETW G+RW+RTWGE H G+G VHKYGKS++GE WD +ET+
Sbjct: 285 EKFFAGI-GSRQGETWHVSPGGDRWSRTWGEEHFGNGKVHKYGKSTTGESWDLVVDEETY 343
Query: 575 YERFPHFGFYHCFDNSVQLREVR 597
YE PH+G+ +S QL ++
Sbjct: 344 YEAEPHYGWADVVGDSTQLLSIQ 366
>gi|255539182|ref|XP_002510656.1| conserved hypothetical protein [Ricinus communis]
gi|223551357|gb|EEF52843.1| conserved hypothetical protein [Ricinus communis]
Length = 428
Score = 208 bits (529), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 124/265 (46%), Positives = 163/265 (61%), Gaps = 16/265 (6%)
Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
G N DGS W++E+G + +G CRWT G S D EW+E +WE +D G+KELG EK
Sbjct: 144 GTNEDGSTWYRESGEDLGDNGFRCRWTRMGGRSHDATSEWKETWWEKSDWTGYKELGVEK 203
Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
SGR+A G+ W E W E + Q++ L +E++A K K+G + W EKWWE YDA G
Sbjct: 204 SGRNAEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWT 263
Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
EK AHK+ ++ + W E+WGE YDG G +K+TDKWAE G +KWGDK
Sbjct: 264 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 311
Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
W+E F G +QGETW GERW+RTWGE H G+G VHKYGKS++GE WD +E
Sbjct: 312 WEEKFFAGI-GSRQGETWHVSPGGERWSRTWGEEHFGNGKVHKYGKSTTGESWDIVVDEE 370
Query: 573 TWYERFPHFGFYHCFDNSVQLREVR 597
T YE PH+G+ +S QL ++
Sbjct: 371 TCYEAEPHYGWADVVGDSTQLLSIK 395
>gi|168036746|ref|XP_001770867.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677926|gb|EDQ64391.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 359
Score = 207 bits (528), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 127/274 (46%), Positives = 174/274 (63%), Gaps = 22/274 (8%)
Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
G+N DGS W+ E+G++ +G CRWT+ G SAD + EW+E +WE +D G+KELG+EK
Sbjct: 58 GVNEDGSSWYSESGVDLGENGYRCRWTVMGGRSADGSSEWKEAWWEKSDWTGYKELGAEK 117
Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
SG++A G+ W E W E + ++ L +EK+A K K+G G W EKWWE Y+A G +
Sbjct: 118 SGKNAQGDTWWETWQEVLRVDELSNLARIEKSAQKQAKSGTGSAGWFEKWWEKYNAKGWS 177
Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
EK AHK+ ++ D G W E+W E+YDG G +K+TDKWAE G +KWGDK
Sbjct: 178 EKGAHKYGRLN-----DQG----WWEKWEEQYDGRGAVLKWTDKWAESDTG---TKWGDK 225
Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
W+E FD + G +QGETW + G W+RTWGE H G+G VHKYG+S+SGE WD ++
Sbjct: 226 WEEKFD-HGVGTRQGETWHNDEKG--WSRTWGEEHFGNGKVHKYGRSTSGENWDNVVEEG 282
Query: 573 TWYERFPHFGFYHCFDNSVQLREV----RKPSEF 602
T+Y+ PH+G+ NSVQL + R P F
Sbjct: 283 TYYQAEPHYGWADAIGNSVQLLSIQPLERPPGTF 316
>gi|414864653|tpg|DAA43210.1| TPA: hypothetical protein ZEAMMB73_868366 [Zea mays]
Length = 448
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 143/379 (37%), Positives = 203/379 (53%), Gaps = 59/379 (15%)
Query: 248 PTPVNPVVEKARSVDILPIPFES-----KLSEPKPDPLLPPFQSLLGVEKEEVSETNLET 302
P+ P + + P+PF + ++ +P P P S + + +T
Sbjct: 67 PSAAAPGRRRTSLTAMPPLPFPAPRSRRQVKQPDFYPRCTPRGS---------APQSRDT 117
Query: 303 PSLEEERDLGALFSAHAAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRW 361
P +RD G A+E ++ +DE + GIN DGS W++E+G + +G CRW
Sbjct: 118 PP---KRDTGI-----ASEKEWGINLLDEAVKESGINEDGSTWYRESGDDIGENGYRCRW 169
Query: 362 TMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ---- 417
G S D + EW+E +WE +D G+KELG+EKSGR+A G+ W E W E ++Q++
Sbjct: 170 ARMGGQSHDGSTEWKETWWEKSDWTGYKELGAEKSGRNAEGDSWWEKWKEVLYQDEWSQK 229
Query: 418 ------------------GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKAEKWAHK 458
L +E++A+K K+G + W EKWWE YDA G EK AHK
Sbjct: 230 LEQSHLRGSDIVYLVEYSNLARIERSAEKQAKSGIENAGWYEKWWEKYDAKGWTEKGAHK 289
Query: 459 WCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFD 518
+ ++ + W ERWGE YDG G +K+TDKWAE G +KWGDKW+E F
Sbjct: 290 YGRLNEQS---------WWERWGEHYDGRGFVLKWTDKWAETDLG---TKWGDKWEEKFF 337
Query: 519 PNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERF 578
G +QGETW GERW+RTWGE H G+G VHKYGKS++GE WD +ET+YE
Sbjct: 338 AGI-GSRQGETWHVCPGGERWSRTWGEEHFGNGKVHKYGKSTTGERWDLVVDEETYYEAE 396
Query: 579 PHFGFYHCFDNSVQLREVR 597
PH+G+ +S QL ++
Sbjct: 397 PHYGWADVVGDSTQLLSIQ 415
>gi|125542273|gb|EAY88412.1| hypothetical protein OsI_09872 [Oryza sativa Indica Group]
Length = 431
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 125/283 (44%), Positives = 175/283 (61%), Gaps = 17/283 (6%)
Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
A+E ++ +DE + G N DGS W++E+G ++ +G CRW G S D EW+E
Sbjct: 127 ASEKEWGINLLDEAVKESGTNEDGSTWYRESGDDRGDNGYRCRWARMGGQSHDGTTEWKE 186
Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
+WE +D G+KELG+EKSG++ G+ W E W E ++Q++ L +E++A+K K+G
Sbjct: 187 TWWEKSDWTGYKELGAEKSGKNGAGDSWWEKWKEVLYQDEWSNLARIERSAEKQAKSGAE 246
Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
+ W EKWWE YDA G EK AHK+ ++ + W ERWGE YDG G +K+T
Sbjct: 247 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWERWGEHYDGRGFVLKWT 297
Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
DKWAE G +KWGDKW+E F G +QGETW G+RW+RTWGE H G+G VH
Sbjct: 298 DKWAETDLG---TKWGDKWEEKFFAGI-GSRQGETWHVSPGGDRWSRTWGEEHFGNGKVH 353
Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
KYGKS++GE WD +ET+YE PH+G+ +S QL ++
Sbjct: 354 KYGKSTTGESWDLVVDEETYYEAEPHYGWADVVGDSTQLLSIQ 396
>gi|302792138|ref|XP_002977835.1| hypothetical protein SELMODRAFT_107335 [Selaginella moellendorffii]
gi|300154538|gb|EFJ21173.1| hypothetical protein SELMODRAFT_107335 [Selaginella moellendorffii]
Length = 367
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 125/278 (44%), Positives = 168/278 (60%), Gaps = 21/278 (7%)
Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
G N DGS W++E G + +G CR+T+ G S+D + EW+E +WE D G+KELG+EK
Sbjct: 97 GTNEDGSTWFRECGEDLGENGYRCRYTVMGGRSSDGSTEWKETWWEKCDWTGYKELGAEK 156
Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
SG++A G+ W E W E + Q++ L +E+TA K K GNG+ W EKWWE Y+A G
Sbjct: 157 SGKNAGGDAWWETWQEILRQDELSNLARIERTAQKQAKQGNGEAGWYEKWWEKYNAKGWT 216
Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
EK AHK+ ++ + W E+WGE+YDG G +K+TDKWAE G+ KWGDK
Sbjct: 217 EKGAHKYGRLNEQS---------WWEKWGEQYDGRGAVLKWTDKWAENATGE---KWGDK 264
Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
W+E F N G +QGETW + E W+RTWGE H G G VHKYGKS+SGE WD+ +
Sbjct: 265 WEEKFQ-NGAGTRQGETWHSAN-AESWSRTWGEEHFGDGKVHKYGKSTSGENWDSVVTET 322
Query: 573 TWYERFPHFGFYHCFDNSVQLREV----RKPSEFQEEP 606
T Y PH+G+ S QL + R P + + P
Sbjct: 323 TVYNAEPHYGWVDAIGQSTQLLSIEPRPRPPGVYPDLP 360
>gi|22758282|gb|AAN05510.1| Hypothetical protein [Oryza sativa Japonica Group]
gi|108706037|gb|ABF93832.1| expressed protein [Oryza sativa Japonica Group]
Length = 431
Score = 207 bits (526), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 125/283 (44%), Positives = 175/283 (61%), Gaps = 17/283 (6%)
Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
A+E ++ +DE + G N DGS W++E+G ++ +G CRW G S D EW+E
Sbjct: 127 ASEKEWGINLLDEAVKESGTNEDGSTWYRESGDDRGDNGYRCRWARMGGQSHDGTTEWKE 186
Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
+WE +D G+KELG+EKSG++ G+ W E W E ++Q++ L +E++A+K K+G
Sbjct: 187 TWWEKSDWTGYKELGAEKSGKNGEGDSWWEKWKEVLYQDEWSNLARIERSAEKQAKSGAE 246
Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
+ W EKWWE YDA G EK AHK+ ++ + W ERWGE YDG G +K+T
Sbjct: 247 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWERWGEHYDGRGFVLKWT 297
Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
DKWAE G +KWGDKW+E F G +QGETW G+RW+RTWGE H G+G VH
Sbjct: 298 DKWAETDLG---TKWGDKWEEKFFAGI-GSRQGETWHVSPGGDRWSRTWGEEHFGNGKVH 353
Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
KYGKS++GE WD +ET+YE PH+G+ +S QL ++
Sbjct: 354 KYGKSTTGESWDLVVDEETYYEAEPHYGWADVVGDSTQLLSIQ 396
>gi|424513213|emb|CCO66797.1| predicted protein [Bathycoccus prasinos]
Length = 657
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 145/400 (36%), Positives = 202/400 (50%), Gaps = 33/400 (8%)
Query: 214 APGPDFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPTPVNPVVEKARS--VDILPIPFESK 271
A G DFWSW+PPE R D + + T + V+ A L + F+S+
Sbjct: 258 ATGMDFWSWTPPE---RKKSDTDGAPVPKLQKQMQTRIEQAVQVAERGVTQTLNLDFQSQ 314
Query: 272 LSEPKPDPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDE 331
+ L F+S + +EV E D A A A A L E
Sbjct: 315 VQAKGTKELPLAFESQTATQADEV------------EADQRAPTEAEFASAVRELGADGE 362
Query: 332 LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKEL 391
T G DG+RWW+E G + +G VC WT+ RG SAD ++EW+EK+W AD +KEL
Sbjct: 363 --THGELSDGTRWWREAGTSELENGRVCEWTLVRGQSADGSVEWEEKWWSTADAFDYKEL 420
Query: 392 GSEKSGRDATGNVWREFWTESMWQN--QGLV-----HLEKTADKWGKNGNGDEWQEKWWE 444
G+ KSGRD GNVW+E W+E + +G +E++A+KWG N +G EW E W E
Sbjct: 421 GAVKSGRDGHGNVWQESWSEISCSDVSRGFFTDASKKIERSANKWGANASGAEWHEDWRE 480
Query: 445 HYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEG- 503
Y G ++ K + N + GHA W+ W EK+DGHGG MK D WA+R G
Sbjct: 481 AYWGDGVVDRECFKKSCVGKNEIPEDGHASRWNHNWKEKWDGHGGCMKTNDSWADRDVGE 540
Query: 504 DGWS--KWGDKWDENFDPNSHGVKQGE---TWWAGKYGERWNRTWGERHNGSGWVHKYGK 558
DG S WG++W E + + +QGE + W + G + ++ WGE H G V KYG
Sbjct: 541 DGGSGRSWGERWSERWGSYASHGRQGEREGSTWNDRDGHKVSKDWGEEHWPDGRVRKYGH 600
Query: 559 SSSG-ELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
SS G + WD E + W+ER P FG+ ++S QL ++
Sbjct: 601 SSDGSDHWDVWEDTDGWWERHPSFGWAEAVNHSPQLMGIK 640
>gi|302141667|emb|CBI18870.3| unnamed protein product [Vitis vinifera]
Length = 347
Score = 206 bits (523), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 124/264 (46%), Positives = 164/264 (62%), Gaps = 16/264 (6%)
Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
G N DGS W++E+G + +G CRWT G S D + EW+E +WE +D G+KELG EK
Sbjct: 63 GTNEDGSAWYRESGEDLGENGYRCRWTRMGGQSHDGSSEWKEMWWEKSDWTGYKELGVEK 122
Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
SGR+A G+ W E W E + Q++ L +E++A K K+G + W EKWWE YDA G
Sbjct: 123 SGRNAEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEKWWEKYDAKGST 182
Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
EK AHK+ ++ + W E+WGE YDG G +K+TDKWAE G +KWGDK
Sbjct: 183 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 230
Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
W+E F G +QGETW G+RW+RTWGE H G+G VHKYGKS++GE WD +E
Sbjct: 231 WEEKFFAGI-GSRQGETWHLSPSGDRWSRTWGEEHFGNGKVHKYGKSTTGESWDIVVDEE 289
Query: 573 TWYERFPHFGFYHCFDNSVQLREV 596
T+YE PH+G+ NS QL +
Sbjct: 290 TYYEAEPHYGWADVVGNSSQLLSI 313
>gi|449452965|ref|XP_004144229.1| PREDICTED: uncharacterized protein LOC101214256 [Cucumis sativus]
Length = 429
Score = 204 bits (520), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 122/270 (45%), Positives = 166/270 (61%), Gaps = 16/270 (5%)
Query: 330 DELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHK 389
+ ++ G N DGS W++E+G + +G CRWT G S D EW+E +WE +D G+K
Sbjct: 139 ENVSESGTNEDGSTWYRESGEDLGENGYRCRWTRMGGQSHDGYSEWKETWWEKSDWTGYK 198
Query: 390 ELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHY 446
ELG EKSG++ G+ W E W E + Q++ L +E++A K K+G + W EKWWE Y
Sbjct: 199 ELGVEKSGKNVEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWHEKWWEKY 258
Query: 447 DASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGW 506
DA G EK AHK+ ++ + W E+WGE YDG G +K+TDKWAE G
Sbjct: 259 DAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG--- 306
Query: 507 SKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWD 566
+KWGDKW+E F + G +QGETW GERW+RTWGE H G+G VHKYGKS++GE WD
Sbjct: 307 TKWGDKWEEKFF-SGIGSRQGETWHVSPSGERWSRTWGEEHFGNGKVHKYGKSTTGESWD 365
Query: 567 THEQQETWYERFPHFGFYHCFDNSVQLREV 596
+ET+YE PH+G+ +S QL +
Sbjct: 366 IVVDEETYYEAEPHYGWADVVGDSSQLLSI 395
>gi|308811228|ref|XP_003082922.1| RNA polymerase II transcription elongation factor DSIF/SUPT5H/SPT5
(ISS) [Ostreococcus tauri]
gi|116054800|emb|CAL56877.1| RNA polymerase II transcription elongation factor DSIF/SUPT5H/SPT5
(ISS) [Ostreococcus tauri]
Length = 501
Score = 204 bits (519), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 160/479 (33%), Positives = 242/479 (50%), Gaps = 55/479 (11%)
Query: 139 ILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERGALSA 198
++ G +G +SG+A R + +SGAA +G R A +
Sbjct: 20 VVTTGGGEGATSGSASGSA--PRSTGTTTGASGAA-----------DGAVMKGGRSANDS 66
Query: 199 GI--FVPRSGTPGNRTPAPGPDFWSWSPPEDDDRD-MRDVRDLQMAEKSSVYPTPVNPVV 255
G+ ++ P T PG DFW+W+PPE D + ++ ++ + V
Sbjct: 67 GLKNTAKKASKPKAETFNPGSDFWTWTPPEAAGSDKVAAAAAPKLQRQTETRVSAAVAVA 126
Query: 256 EKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSETNLETPSLEEE----RDL 311
E+A L + F+S + P L F+S + + KE ++ T LEE R+L
Sbjct: 127 ERAPEAS-LQLKFQSDVE--TPSELKLEFESDVVLAKEPAPLSDTPTVELEETATAVREL 183
Query: 312 GALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADE 371
GA + T G +GSRWW+E+G E+ G +CRWT+ RG SAD
Sbjct: 184 GA-----------------DGETEGTLENGSRWWRESGEEELEGGKLCRWTLVRGASADG 226
Query: 372 ALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQN------QGLVHLEKT 425
++EW+EK+WE +D ++ELG+ KSGRDA GNVW+E W E + + H+ +
Sbjct: 227 SVEWEEKWWETSDAFNYRELGAIKSGRDAKGNVWQESWREQVTHDTTTGFSNASKHIMRE 286
Query: 426 ADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYD 485
A+KWG +G EW E W E+Y G+ ++ K +I D GH + W +WGE++D
Sbjct: 287 ANKWGAQADGAEWHEVWDENYWGDGQVKRTCTKKGAIANGATPDDGHGNRWTHKWGEEWD 346
Query: 486 GHGGSMKYTDKWAERCEG-DGWS--KWGDKWDENFDPNSH----GVKQGETWWAGKYGER 538
GHGG +K+TD +A+R + DG S WG+KW+E + +H G + G T W + G +
Sbjct: 347 GHGGCVKWTDSFADRDQSEDGGSGRAWGEKWEERWGGYAHNGSAGNRNGST-WDDRDGHK 405
Query: 539 WNRTWGERHNGSGWVHKYGKSSSG-ELWDTHEQQETWYERFPHFGFYHCFDNSVQLREV 596
+ +TWGE H G VHK+G ++ G + WDT E W+ER P FG+ +S QL V
Sbjct: 406 FEKTWGEEHWHDGRVHKWGATTDGSDGWDTWEDSAGWWERAPSFGWDEAVSHSPQLLNV 464
>gi|356517440|ref|XP_003527395.1| PREDICTED: uncharacterized protein LOC100788155 [Glycine max]
Length = 449
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 124/279 (44%), Positives = 166/279 (59%), Gaps = 20/279 (7%)
Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
G N DGS W++E+G E +G CRWT G S D + EW+E +WE +D G+KELG EK
Sbjct: 163 GTNEDGSTWYRESGEELGENGYKCRWTRMGGQSHDGSSEWKETWWEKSDWTGYKELGVEK 222
Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
SGR++ G+ W E W E++ Q++ + +E++A K K+G + W EKWWE YDA G
Sbjct: 223 SGRNSEGDTWWETWQENLHQDEWSNIARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWT 282
Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
EK AHK+ ++ + W E+WGE YDG G +K+TDKWAE G +KWGDK
Sbjct: 283 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 330
Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
W+E F G + GETW ERW+RTWGE H G+G VHKYG S++GE WD +E
Sbjct: 331 WEERFF-KGIGSRHGETWHVSPSSERWSRTWGEEHFGNGKVHKYGNSTTGESWDIVVDEE 389
Query: 573 TWYERFPHFGFYHCFDNSVQLREV----RKPSEFQEEPF 607
T+YE PH+G+ +S QL + R P F F
Sbjct: 390 TYYEAEPHYGWADVVGDSSQLLSIEPRERPPGVFPNLDF 428
>gi|357114188|ref|XP_003558882.1| PREDICTED: uncharacterized protein LOC100823512 [Brachypodium
distachyon]
Length = 429
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 130/303 (42%), Positives = 181/303 (59%), Gaps = 18/303 (5%)
Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
A+E ++ +DE + G N DGS W++E+G + +G RW G + D ++EW+E
Sbjct: 127 ASEKEWGINLLDEAVKESGTNEDGSTWYRESGEDVGENGYRSRWARMGGQTHDGSVEWKE 186
Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
+WE +D G+KELG+EKSG++A G+ W E W E + Q++ L +E++A+K K+G
Sbjct: 187 TWWEKSDWTGYKELGAEKSGKNAEGDSWWEKWKEVLHQDEWSNLARIERSAEKQAKSGAE 246
Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
+ W EKWWE YDA G EK AHK+ ++ + W ERWGE YDG G +K+T
Sbjct: 247 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWERWGEHYDGRGSVLKWT 297
Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
DKWAE G ++WGDKW+E F G +QGETW A G+RW+RTWGE H G+G VH
Sbjct: 298 DKWAETDLG---TRWGDKWEEKFFAGI-GSRQGETWHASIGGDRWSRTWGEEHYGNGKVH 353
Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPSEFQEEPFEIQDKRS 614
KYGKS++GE WD +ET YE PH+G+ +S QL + +P E F D S
Sbjct: 354 KYGKSTTGESWDLVVDEETCYEAEPHYGWADVVGDSTQLLSI-QPVERPPGVFPTIDFSS 412
Query: 615 ELQ 617
Q
Sbjct: 413 SPQ 415
>gi|297846712|ref|XP_002891237.1| hypothetical protein ARALYDRAFT_473738 [Arabidopsis lyrata subsp.
lyrata]
gi|297337079|gb|EFH67496.1| hypothetical protein ARALYDRAFT_473738 [Arabidopsis lyrata subsp.
lyrata]
Length = 422
Score = 204 bits (518), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 125/283 (44%), Positives = 172/283 (60%), Gaps = 17/283 (6%)
Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
A E +D ++E + G N DGS W++E+G + +G CRWT G S D + EW E
Sbjct: 123 ANEKDWGIDLLNENVNESGTNEDGSSWFRESGHDLGDNGYRCRWTRMGGRSHDGSSEWTE 182
Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
+WE +D G+KELG EKSG++A G+ W E W E + Q++ L +E++A K K+G
Sbjct: 183 TWWEKSDWTGYKELGVEKSGKNAEGDTWWETWQEVLHQDEWSNLARIERSAQKQAKSGTE 242
Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
+ W EKWWE YDA G EK AHK+ ++ + W E+WGE YDG G +K+T
Sbjct: 243 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWT 293
Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
DKWAE G +KWGDKW+E F + G +QGETW +RW+RTWGE H G+G VH
Sbjct: 294 DKWAETELG---TKWGDKWEEKF-FSGIGSRQGETWHVSPNSDRWSRTWGEEHFGNGKVH 349
Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
KYGKS++GE WD +ET+YE PH+G+ +S QL ++
Sbjct: 350 KYGKSTTGESWDIVVDEETYYEAEPHYGWADVVGDSTQLLSIQ 392
>gi|449489311|ref|XP_004158275.1| PREDICTED: uncharacterized protein LOC101230928 [Cucumis sativus]
Length = 382
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 125/282 (44%), Positives = 172/282 (60%), Gaps = 17/282 (6%)
Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
A E ++ ++E ++ G N DGS W++E+G + +G CRWT G S D EW+E
Sbjct: 80 ANEKDWGINLLNENVSESGTNEDGSTWYRESGEDLGENGYRCRWTRMGGQSHDGYSEWKE 139
Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
+WE +D G+KELG EKSG++ G+ W E W E + Q++ L +E++A K K+G
Sbjct: 140 TWWEKSDWTGYKELGVEKSGKNVEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTE 199
Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
+ W EKWWE YDA G EK AHK+ ++ + W E+WGE YDG G +K+T
Sbjct: 200 NAGWHEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWT 250
Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
DKWAE G +KWGDKW+E F + G +QGETW GERW+RTWGE H G+G VH
Sbjct: 251 DKWAETELG---TKWGDKWEEKFF-SGIGSRQGETWHVSPSGERWSRTWGEEHFGNGKVH 306
Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREV 596
KYGKS++GE WD +ET+YE PH+G+ +S QL +
Sbjct: 307 KYGKSTTGESWDIVVDEETYYEAEPHYGWADVVGDSSQLLSI 348
>gi|302830957|ref|XP_002947044.1| hypothetical protein VOLCADRAFT_103268 [Volvox carteri f.
nagariensis]
gi|300267451|gb|EFJ51634.1| hypothetical protein VOLCADRAFT_103268 [Volvox carteri f.
nagariensis]
Length = 647
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 120/275 (43%), Positives = 158/275 (57%), Gaps = 25/275 (9%)
Query: 340 DGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWE---------AADELGHKE 390
DG+R+ K +G + PDG V +W + RGV+ D ++W+E +W+ A++ G +E
Sbjct: 363 DGTRFEKLSGTDTGPDGYVKKWEVLRGVTGDGQVQWEECWWQVWGLGLGGKASNRYGLRE 422
Query: 391 LGSEKSGRDATGNVWREFWTESMWQ---NQGLVHLEKTADKWGKNGNGDEWQEKWWEHYD 447
LG+ K G +G W E W E ++ N LV +E+TA KW ++ N DEW+EKW E ++
Sbjct: 423 LGAFKKGTAESGAAWVEEWKEVLYTHPTNLRLV-IERTAHKWARDENLDEWEEKWGECFE 481
Query: 448 ASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWS 507
+G+ KWA KW N VWHERWGE YDG G K+TDKWAER DG
Sbjct: 482 EAGRVHKWADKWAKAGSN---------VWHERWGEDYDGKGACQKWTDKWAERLLPDGGQ 532
Query: 508 -KWGDKWDENFDPNSHGVKQGETWWAGK-YGERWNRTWGERHNGSGWVHKYGKSSSGELW 565
+WGDKW E F + G K GE W + G R+NR W E H G G V K+G S+SGE W
Sbjct: 533 EQWGDKWTETFG-HGTGTKHGEVWSSSSSCGSRYNRWWNEEHYGDGRVRKWGNSTSGEHW 591
Query: 566 DTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPS 600
DT E +T+Y PHFGF H +S QL V PS
Sbjct: 592 DTVEHMDTYYNPVPHFGFQHAVGHSPQLWAVPLPS 626
>gi|356508780|ref|XP_003523132.1| PREDICTED: uncharacterized protein LOC100820367 isoform 1 [Glycine
max]
Length = 449
Score = 203 bits (516), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 124/279 (44%), Positives = 166/279 (59%), Gaps = 20/279 (7%)
Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
G N DGS W++E+G E +G CRWT G S D + EW+E +WE +D G+KELG EK
Sbjct: 163 GTNEDGSAWYRESGEELGENGYRCRWTRMGGQSHDGSSEWKETWWEKSDWTGYKELGVEK 222
Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
SGR++ G+ W E W E++ Q++ + +E++A K K+G + W EKWWE YDA G
Sbjct: 223 SGRNSEGDTWWETWQENLHQDEWSNIARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWT 282
Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
EK AHK+ ++ + W E+WGE YDG G +K+TDKWAE G +KWGDK
Sbjct: 283 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 330
Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
W+E F G + GETW ERW+RTWGE H G+G VHKYG S++GE WD +E
Sbjct: 331 WEERFF-KGIGSRHGETWHVSPSSERWSRTWGEEHFGNGKVHKYGNSTTGESWDIVVDEE 389
Query: 573 TWYERFPHFGFYHCFDNSVQLREV----RKPSEFQEEPF 607
T+YE PH+G+ +S QL + R P F F
Sbjct: 390 TYYEAEPHYGWADVVGDSTQLLSIEPRERPPGVFPNLDF 428
>gi|356508782|ref|XP_003523133.1| PREDICTED: uncharacterized protein LOC100820367 isoform 2 [Glycine
max]
Length = 434
Score = 202 bits (515), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 124/279 (44%), Positives = 166/279 (59%), Gaps = 20/279 (7%)
Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
G N DGS W++E+G E +G CRWT G S D + EW+E +WE +D G+KELG EK
Sbjct: 148 GTNEDGSAWYRESGEELGENGYRCRWTRMGGQSHDGSSEWKETWWEKSDWTGYKELGVEK 207
Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
SGR++ G+ W E W E++ Q++ + +E++A K K+G + W EKWWE YDA G
Sbjct: 208 SGRNSEGDTWWETWQENLHQDEWSNIARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWT 267
Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
EK AHK+ ++ + W E+WGE YDG G +K+TDKWAE G +KWGDK
Sbjct: 268 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 315
Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
W+E F G + GETW ERW+RTWGE H G+G VHKYG S++GE WD +E
Sbjct: 316 WEERFF-KGIGSRHGETWHVSPSSERWSRTWGEEHFGNGKVHKYGNSTTGESWDIVVDEE 374
Query: 573 TWYERFPHFGFYHCFDNSVQLREV----RKPSEFQEEPF 607
T+YE PH+G+ +S QL + R P F F
Sbjct: 375 TYYEAEPHYGWADVVGDSTQLLSIEPRERPPGVFPNLDF 413
>gi|240254218|ref|NP_174971.5| uncharacterized protein [Arabidopsis thaliana]
gi|332193795|gb|AEE31916.1| uncharacterized protein [Arabidopsis thaliana]
Length = 426
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 123/283 (43%), Positives = 172/283 (60%), Gaps = 17/283 (6%)
Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
A E +D ++E + G N DGS W++E+G + +G CRW+ G S D + EW E
Sbjct: 127 ANEKDWGIDLLNENVNEAGTNEDGSSWFRESGHDLGDNGYRCRWSRMGGRSHDGSSEWTE 186
Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
+WE +D G+KELG EKSG+++ G+ W E W E + Q++ L +E++A K K+G
Sbjct: 187 TWWEKSDWTGYKELGVEKSGKNSEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTE 246
Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
+ W EKWWE YDA G EK AHK+ ++ + W E+WGE YDG G +K+T
Sbjct: 247 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWT 297
Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
DKWAE G +KWGDKW+E F + G +QGETW +RW+RTWGE H G+G VH
Sbjct: 298 DKWAETELG---TKWGDKWEEKF-FSGIGSRQGETWHVSPNSDRWSRTWGEEHFGNGKVH 353
Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
KYGKS++GE WD +ET+YE PH+G+ +S QL ++
Sbjct: 354 KYGKSTTGESWDIVVDEETYYEAEPHYGWADVVGDSTQLLSIQ 396
>gi|334183065|ref|NP_001185147.1| uncharacterized protein [Arabidopsis thaliana]
gi|332193796|gb|AEE31917.1| uncharacterized protein [Arabidopsis thaliana]
Length = 409
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 123/283 (43%), Positives = 172/283 (60%), Gaps = 17/283 (6%)
Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
A E +D ++E + G N DGS W++E+G + +G CRW+ G S D + EW E
Sbjct: 110 ANEKDWGIDLLNENVNEAGTNEDGSSWFRESGHDLGDNGYRCRWSRMGGRSHDGSSEWTE 169
Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
+WE +D G+KELG EKSG+++ G+ W E W E + Q++ L +E++A K K+G
Sbjct: 170 TWWEKSDWTGYKELGVEKSGKNSEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTE 229
Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
+ W EKWWE YDA G EK AHK+ ++ + W E+WGE YDG G +K+T
Sbjct: 230 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWT 280
Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
DKWAE G +KWGDKW+E F + G +QGETW +RW+RTWGE H G+G VH
Sbjct: 281 DKWAETELG---TKWGDKWEEKF-FSGIGSRQGETWHVSPNSDRWSRTWGEEHFGNGKVH 336
Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
KYGKS++GE WD +ET+YE PH+G+ +S QL ++
Sbjct: 337 KYGKSTTGESWDIVVDEETYYEAEPHYGWADVVGDSTQLLSIQ 379
>gi|168037924|ref|XP_001771452.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677179|gb|EDQ63652.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 337
Score = 198 bits (504), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 120/264 (45%), Positives = 168/264 (63%), Gaps = 18/264 (6%)
Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
G+N DGS W+ E+G++ +G CRWT+ G S D + EW+E +WE +D G+KELG+EK
Sbjct: 34 GVNEDGSTWYNESGVDFGENGYRCRWTVMGGRSGDGSSEWKEAWWEKSDWTGYKELGAEK 93
Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
+G++A G+ W E W E + ++ L +EK+A K K+G G W EKWWE Y+A G +
Sbjct: 94 TGKNAQGDTWWETWQEVLRVDELSNLARIEKSAQKQAKSGTGSAGWFEKWWEKYNAKGWS 153
Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
EK AHK+ ++ D G W E+W E+YDG G +K+TDKWAE G +KWGDK
Sbjct: 154 EKGAHKYGRLN-----DQG----WWEKWEEQYDGRGAVLKWTDKWAENGTG---TKWGDK 201
Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
W+E F+ + G +QGETW G W+RTWGE H G+G VHKYG+S+SGE WD ++
Sbjct: 202 WEEKFN-HGVGTRQGETWHNDDKG--WSRTWGEEHFGNGKVHKYGRSTSGENWDNIVEEG 258
Query: 573 TWYERFPHFGFYHCFDNSVQLREV 596
T+Y+ PH+G+ NS QL +
Sbjct: 259 TYYQAEPHYGWADAIGNSEQLLNI 282
>gi|224086016|ref|XP_002307779.1| predicted protein [Populus trichocarpa]
gi|222857228|gb|EEE94775.1| predicted protein [Populus trichocarpa]
Length = 328
Score = 195 bits (495), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 119/270 (44%), Positives = 164/270 (60%), Gaps = 16/270 (5%)
Query: 330 DELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHK 389
+ ++ G N DGS W++E+G + +G CRWT G S D++ +W+E +WE +D G+K
Sbjct: 48 ENVSETGTNEDGSTWFRESGEDLGANGYRCRWTKMGGRSHDDSTQWEETWWEKSDWTGYK 107
Query: 390 ELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHY 446
ELG EKSGR+A G+ W E W E + Q++ L +E++A K K+G + W EKWWE Y
Sbjct: 108 ELGVEKSGRNAEGDSWWETWQEMLHQDEWSNLARIERSAQKQAKSGTENAGWYEKWWEKY 167
Query: 447 DASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGW 506
DA G EK A+K+ ++ + W E+WGE YDG G K+TDKWAE G
Sbjct: 168 DAKGWTEKGANKYGRLNEQS---------WWEKWGEHYDGRGSVTKWTDKWAETELG--- 215
Query: 507 SKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWD 566
+KWGDKW+E F G + GETW G RW+RTWGE H G+G VHKYGKS++ E WD
Sbjct: 216 TKWGDKWEEKFFAGI-GSRHGETWHVSPIGGRWSRTWGEEHFGNGKVHKYGKSTTSESWD 274
Query: 567 THEQQETWYERFPHFGFYHCFDNSVQLREV 596
+ET+YE PH+G+ +S QL +
Sbjct: 275 IVVDEETYYEAEPHYGWADVVGDSSQLLSI 304
>gi|159485472|ref|XP_001700768.1| hypothetical protein CHLREDRAFT_167685 [Chlamydomonas reinhardtii]
gi|158281267|gb|EDP07022.1| predicted protein, partial [Chlamydomonas reinhardtii]
Length = 350
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 123/326 (37%), Positives = 166/326 (50%), Gaps = 22/326 (6%)
Query: 263 ILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSETNLETP---SLEEERDLGALFSAHA 319
+L P + + +L E + + E P EE D A
Sbjct: 41 VLAAPLRAVRTSESSSGTPLSLNDILVFESDMPAMLIQEQPPERKAEEIVDRAEEVVGRA 100
Query: 320 AEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKF 379
A D LA DG+R+ K +G + PDG V +W + RGV+ D ++W+E +
Sbjct: 101 ALGRQLADGAGRLA------DGTRFEKLSGTDTGPDGYVKKWEVLRGVTGDGTVQWEECW 154
Query: 380 WEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLVHL--EKTADKWGKNGNGDE 437
W A++ G +E+G+ K G G W E W E ++ + + L E+TA KW ++ + DE
Sbjct: 155 WTASNRYGLREMGAFKKGSTEAGAAWVEEWKEVLYTHATNLRLVIERTAHKWARDESSDE 214
Query: 438 WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKW 497
W+EKW E Y+ +G+ K+A KW N VWHERWGE YDG G K+TDKW
Sbjct: 215 WEEKWGECYEEAGRVHKFADKWAKAGIN---------VWHERWGEDYDGRGACQKWTDKW 265
Query: 498 AERCEGDGWS-KWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKY 556
AER DG +WGDKW E F G K GE W AG GER+NR W E H G G V ++
Sbjct: 266 AERLLPDGGQEQWGDKWTETFGAGK-GTKHGEVWSAGGGGERYNRWWNEEHYGDGRVRRW 324
Query: 557 GKSSSGELWDTHEQQETWYERFPHFG 582
G S+SGE WD E +T+Y PHFG
Sbjct: 325 GNSTSGEYWDGVEHMDTYYNPVPHFG 350
>gi|307104332|gb|EFN52586.1| hypothetical protein CHLNCDRAFT_8819, partial [Chlorella
variabilis]
Length = 234
Score = 188 bits (478), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 116/241 (48%), Positives = 147/241 (60%), Gaps = 12/241 (4%)
Query: 346 KETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVW 405
+ +G E G RWT RG A++W EK+WE +D G KELG+EK G +A G+ W
Sbjct: 3 RRSGEETGSHGYWYRWTEVRGCDETGAVQWYEKWWEVSDWRGMKELGAEKWGCNARGDAW 62
Query: 406 REFWTESMWQNQGLVH--LEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSID 463
RE W E++ G +E++A KW KNG G EW+EKW E Y ++G+A+KWA KW
Sbjct: 63 RETWREAIVVEAGSTQPSVERSAHKWAKNGMGHEWEEKWGERYWSAGRADKWADKWAREG 122
Query: 464 PNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAER-CEGDGWSKWGDKWDENFDPNSH 522
A VWHE+WGE YDG GG +KYTDKWAER EG +WGDKW+ENF
Sbjct: 123 ---------ADVWHEKWGENYDGSGGCVKYTDKWAEREVEGGAREQWGDKWEENFKDGRG 173
Query: 523 GVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFG 582
+QGETW GER+NR WGE H G V K+G S++GE WD EQ +T+Y PHFG
Sbjct: 174 TKQQGETWSVSAGGERYNRWWGENHLGDRLVQKHGSSNTGEHWDVTEQMDTYYNPIPHFG 233
Query: 583 F 583
+
Sbjct: 234 Y 234
>gi|224061899|ref|XP_002300654.1| predicted protein [Populus trichocarpa]
gi|222842380|gb|EEE79927.1| predicted protein [Populus trichocarpa]
Length = 328
Score = 185 bits (470), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 123/283 (43%), Positives = 165/283 (58%), Gaps = 23/283 (8%)
Query: 327 DKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADEL 386
+KV+E G N DGS W++++G + +G CRW G S D + +W+E +WE D
Sbjct: 48 EKVNE---SGTNEDGSSWFRKSGEDLGENGYRCRWKKMGGRSHDTSSQWEETWWEKGDWT 104
Query: 387 GHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWW 443
G+KELG EKSGR+A G+ W E W E + Q++ L +E++A K K G + W EKWW
Sbjct: 105 GYKELGVEKSGRNAEGDTWWETWQEMLHQDEWSNLARIERSAQKQAKLGTENAGWYEKWW 164
Query: 444 EHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEG 503
E YDA G EK A+K+ ++ + W E+WGE YDG G K+TDKWAE G
Sbjct: 165 EKYDAKGWTEKGANKYGRLNEQS---------WWEKWGEHYDGRGSVTKWTDKWAETELG 215
Query: 504 DGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGE 563
+KWGDKW+E F G + GETW G W+RTWGE H G+G VHKYGK ++GE
Sbjct: 216 ---TKWGDKWEEKFFAGI-GSRHGETWHGSPSGGGWSRTWGEEHLGNGKVHKYGKGTTGE 271
Query: 564 LWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPSEFQEEP 606
WD +ET+YE PH+G+ +S QL + E QE P
Sbjct: 272 SWDIVVDEETYYEAEPHYGWADVVGDSSQLLSI----EPQERP 310
>gi|255071597|ref|XP_002499473.1| predicted protein [Micromonas sp. RCC299]
gi|226514735|gb|ACO60731.1| predicted protein [Micromonas sp. RCC299]
Length = 940
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 142/448 (31%), Positives = 203/448 (45%), Gaps = 68/448 (15%)
Query: 215 PGPDFWSWSPPEDDDRDMRDVRDL-----QMAEKSSVYPTPVNPVVEKARSV--DILPIP 267
P DFW WSPP +K+ Y V VE L +
Sbjct: 379 PASDFWEWSPPSMPAAGPAGAASSYYPAEMQRQKAPAYTRRVEAAVEVMERAPEQTLDLQ 438
Query: 268 FESKL------------------SEPKPDPLLPPFQSLLGVEKEEVSETN--LETPSLEE 307
FE+ + S PK +P P S LG++ ++ + +T ++ +
Sbjct: 439 FETTIEQQNATLPQFQSQVKPASSPPKVEPAKPASASPLGLQDAQLQQLREVYQTSTVSD 498
Query: 308 ERDLGA----------------LFSAHAAEAAHALDK------VDELATRGINPDGSRWW 345
L A S+ A AL V++ A G G+RWW
Sbjct: 499 AAILAAEQRYDVDVAAPAPAGLAGSSVEASLEDALASPVRELGVEDGAKEGTLRSGARWW 558
Query: 346 KETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVW 405
+E G E DG V WT+ RG SAD ++EW+EKFWE +D ++ELG+ KSGRD+ G W
Sbjct: 559 REEGKEYLEDGKVMSWTVIRGTSADGSVEWEEKFWETSDPFTYRELGAIKSGRDSNGQAW 618
Query: 406 REFWTESMWQNQG-LVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDP 464
+E W E + L + + A KW G W E W E Y A G +++ K S++
Sbjct: 619 QESWKELYNHDANQLPFIHREASKWSHTPKGKCWSEGWTEDYRADGVVDRYCEKTGSLED 678
Query: 465 NTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDG---------WS-KWGDKWD 514
+ GHA+ W ++WGEK+DG GG +K+TD WA R +G W KW +KW
Sbjct: 679 GAAPEDGHANRWTQKWGEKWDGQGGCIKWTDTWASRDHAEGGMANAPSRSWGEKWEEKWG 738
Query: 515 ENFDPNSH-GVKQGETW--WAGKYGERWNRTWGERHNGSGWVHKYGKSSSG-ELWDT-HE 569
+N++ N G++QG W G + E RTWGE H G +HKYG S+ G + WD +
Sbjct: 739 DNYNENGRAGLRQGLAWDELGGNHKE---RTWGEEHYPDGRLHKYGNSNDGSQYWDEWCD 795
Query: 570 QQETWYERFPHFGFYHCFDNSVQLREVR 597
W+E P FG++ +S L VR
Sbjct: 796 GAGGWWETAPSFGWHEAIGHSPSLMNVR 823
>gi|303272745|ref|XP_003055734.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463708|gb|EEH60986.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 716
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 141/426 (33%), Positives = 203/426 (47%), Gaps = 51/426 (11%)
Query: 218 DFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPT---------PVNPVVEKARSVDILPIPF 268
DFW W+PPE D SS YP P P VE R V ++
Sbjct: 232 DFWDWTPPEAP-VDFTPQPGGSPTSASSYYPPKMQKKRLEFPTAPAVE--REVMLMERAP 288
Query: 269 ESKLSEPKP-----DPLLPPFQSLLGVEKEEVSE----TNLETPSLE-----------EE 308
E L + + + +LP FQS++ +EVS+ NL E+
Sbjct: 289 ERTLPQFQSVVEAQNAVLPEFQSVVESGSDEVSQLTSAMNLAAAPGAPAPMMAAPATAEQ 348
Query: 309 RDLGALFSAHA----AEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMT 364
GA A A A L + G+ G+RWW+E G ++ G V WT
Sbjct: 349 IAAGASIEASIEDALASAVRELGAGEGAEKEGVLSSGARWWREEGEDKLEGGKVMSWTCI 408
Query: 365 RGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTES-MWQNQGLVHLE 423
RG SAD A+EW+E++W+ +D ++ELG+ KSGRD+ G W+E W E + + + ++
Sbjct: 409 RGTSADGAVEWEERWWKTSDSFTYRELGAVKSGRDSNGQAWQESWKEMYVHEVNKIPYIH 468
Query: 424 KTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEK 483
+ A KW G W E W E Y A G +++ K +++ + GH + W E+WGEK
Sbjct: 469 REASKWSHTPKGAAWSEGWTEDYRADGTVDRFCEKTGALEDGAAPEDGHGNRWTEKWGEK 528
Query: 484 YDGHGGSMKYTDKWAERCEGDGWSK------WGDKWDENFDP--NSH---GVKQGETWWA 532
+DGHGG +K+TD WA R +G + WG+KW+E + N H G +QG T W
Sbjct: 529 WDGHGGCIKWTDTWASRDHSEGGMENAPGRSWGEKWEEKWGEGYNEHGRAGSRQGLT-WD 587
Query: 533 GKYGERWNRTWGERHNGSGWVHKYGKSSSG-ELWDTHEQQE-TWYERFPHFGFYHCFDNS 590
G+ ++WGE H G +HKYG SS G + WDT E W+ER P FG+ +S
Sbjct: 588 ETMGQHTLKSWGEEHYPDGRLHKYGNSSDGSQYWDTWEDGAGGWWERNPSFGWIEALHHS 647
Query: 591 VQLREV 596
L +
Sbjct: 648 PDLMNL 653
>gi|145344308|ref|XP_001416678.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576904|gb|ABO94971.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 285
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 113/275 (41%), Positives = 164/275 (59%), Gaps = 15/275 (5%)
Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
G +GSRWW+E+G E+ G +CRWT+ RG SAD ++EW+EK+WE +D ++ELG+ K
Sbjct: 1 GTLENGSRWWRESGEEELEGGKLCRWTLVRGASADGSVEWEEKWWETSDAFNYRELGAIK 60
Query: 396 SGRDATGNVWREFWTESMWQN------QGLVHLEKTADKWGKNGNGDEWQEKWWEHYDAS 449
SGRDA GNVW+E W E + + H+ + A+KWG +G EW E W E+Y
Sbjct: 61 SGRDAKGNVWQESWREQVTHDTTTGFSNASKHIMREANKWGAQADGAEWHEVWDENYWGD 120
Query: 450 GKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEG-DGWS- 507
G+ ++ K +I D GH + W +WGE++DGHGG +K+TD +A+R + DG S
Sbjct: 121 GQVKRTCTKKGAIANGATPDDGHGNRWTHKWGEEWDGHGGCVKWTDSFADRDQSEDGGSG 180
Query: 508 -KWGDKWDENFDPNSH----GVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSG 562
WG+KW+E + +H G + G T W + G ++ +TWGE H G VHK+G ++ G
Sbjct: 181 RAWGEKWEERWGGYAHNGSAGNRNGST-WDDRDGHKFEKTWGEEHWHDGRVHKWGATTDG 239
Query: 563 -ELWDTHEQQETWYERFPHFGFYHCFDNSVQLREV 596
+ WDT E W+ER P FG+ +S QL V
Sbjct: 240 SDGWDTWEDSAGWWERAPSFGWDEAVSHSPQLLNV 274
>gi|384246267|gb|EIE19758.1| hypothetical protein COCSUDRAFT_19282 [Coccomyxa subellipsoidea
C-169]
Length = 271
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 119/266 (44%), Positives = 158/266 (59%), Gaps = 20/266 (7%)
Query: 335 RGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSE 394
+G DG+++ +E+G + P+G RWT +GVSA +EW+E++WE +D G +ELG+E
Sbjct: 5 KGQLADGTKYLRESGEDFGPNGFWRRWTCLKGVSAAGKVEWEERWWEESDWAGMRELGAE 64
Query: 395 KSGRDATGNVWREFWTESMW--QNQGLVHLEKTADKWGKNGNGD-EWQEKWWEHYDASGK 451
KSG A G W E W E++ Q G +E++A KW +G EW+E+W E Y + G+
Sbjct: 65 KSGCRADGAAWFETWREAIAFDQTNGEPIVERSAHKWACDGKVRCEWEERWGEQYWSLGR 124
Query: 452 AEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERC-EGDGWSKWG 510
A K+A KW N VWHERWGE YDG GG WAER EG G +WG
Sbjct: 125 ANKYADKWGKEGNN---------VWHERWGEDYDGDGGC------WAERLLEGGGNEQWG 169
Query: 511 DKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQ 570
DKW+E F N G KQGETW G+R + WGE H G+GWV K+G SS+GE WD EQ
Sbjct: 170 DKWEERF-KNGAGSKQGETWTVSAGGDRHQQWWGEDHFGNGWVRKHGNSSTGEQWDVSEQ 228
Query: 571 QETWYERFPHFGFYHCFDNSVQLREV 596
+T+Y PHFG+ D+S L+ V
Sbjct: 229 MDTYYNPIPHFGYKLALDHSPTLKNV 254
>gi|303283944|ref|XP_003061263.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457614|gb|EEH54913.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 391
Score = 159 bits (402), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 106/282 (37%), Positives = 154/282 (54%), Gaps = 25/282 (8%)
Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
G++ DG+ ++++G++ G CRWT+ + D + E +E WE D G KELG+EK
Sbjct: 122 GVDADGNAVFRKSGVDVGDHGYRCRWTIQGRSAQDASWETRETHWEKCDASGFKELGAEK 181
Query: 396 SGRDATGNVWREFWTE-------SMWQNQGLVH---LEKTADKWGKNGNGDEWQEKWWEH 445
SG + G+ W E W E + G + +E++ADKW ++ EW EKWWE
Sbjct: 182 SGFNEDGDTWWETWKEVYRVERDDRDDDSGPIRAEFIERSADKWARDKTNHEWHEKWWEQ 241
Query: 446 YDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGG-SMKYTDKWAERCEGD 504
Y SG E+ K + L A W E+WGE++ GG + K+TDKWA+ G
Sbjct: 242 YSPSGYVERQVEK-------SGLHG--AQAWWEKWGEQHGADGGETRKWTDKWAQNGAG- 291
Query: 505 GWSKWGDKWDENFDPNS-HGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGE 563
++WGDKW+E F + G K+GETW GERW+RTWGE + SG V KYG+S++GE
Sbjct: 292 --TRWGDKWEERFSADCISGDKKGETWRVAASGERWSRTWGETIDSSGEVRKYGESTTGE 349
Query: 564 LWDTHEQQET-WYERFPHFGFYHCFDNSVQLREVRKPSEFQE 604
WD E E +Y+R P + + + S +L + P E E
Sbjct: 350 TWDKTETLEKFYYDRTPEYSWDDIKNISKRLLSIETPDEKDE 391
>gi|255079328|ref|XP_002503244.1| predicted protein [Micromonas sp. RCC299]
gi|226518510|gb|ACO64502.1| predicted protein [Micromonas sp. RCC299]
Length = 290
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 104/282 (36%), Positives = 150/282 (53%), Gaps = 42/282 (14%)
Query: 337 INPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKS 396
+ DG+ ++ ++G++ G CRWT+T + D E++ WE AD G+KELG+EKS
Sbjct: 13 TDDDGNAFFSKSGVDTGDGGYRCRWTVTGRTAKDGTWEYRATHWEKADWSGYKELGAEKS 72
Query: 397 G-RDATGNVWREFWTESMWQNQGLVH----------LEKTADKWGKNGNGDEWQEKWWEH 445
G DA+G+ W E W + + G +E++ADKW ++ + EWQEKWWE
Sbjct: 73 GFDDASGDTWWETWRQVYRRENGDASGSSDTSGPALIERSADKWARDKHKKEWQEKWWER 132
Query: 446 YDASGKAEKWAHKWCSIDPNTQLDAGHAHV--WHERWGEKYD---GHGGSMKYTDKWAER 500
Y +G E+ K +G V W E+WGE+ D G G +K+TDKWAE
Sbjct: 133 YSDAGLVERGVEK-----------SGRQGVQAWWEKWGEQRDDSDGGGDVIKWTDKWAEN 181
Query: 501 CEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSS 560
G ++WGDKW+E F + G K GETW GERW+RTWGE G + YG+S+
Sbjct: 182 GAG---TRWGDKWEERFGADGSGKKVGETWRVNAGGERWSRTWGESVGSDGEIRTYGQST 238
Query: 561 SGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPSEF 602
SGE WDT EQ + DNS + + ++ +E+
Sbjct: 239 SGEQWDTTEQGNS------------SRDNSSRWEDAKEAAEY 268
>gi|308810753|ref|XP_003082685.1| unnamed protein product [Ostreococcus tauri]
gi|116061154|emb|CAL56542.1| unnamed protein product, partial [Ostreococcus tauri]
Length = 332
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 96/239 (40%), Positives = 131/239 (54%), Gaps = 19/239 (7%)
Query: 346 KETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVW 405
K+ G+E G RW T+ D E W +D G+KELG EKSG + TG W
Sbjct: 104 KKRGVETGEGGYRSRWWKTKRECPDGRSGSSETRWAKSDFSGYKELGFEKSGFNETGETW 163
Query: 406 REFWTESMWQNQ--GLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSID 463
E W E ++ GL +E++ADKW ++ EWQEKWWE Y A+G E+ K
Sbjct: 164 WETWREIYSRDDYTGLERIERSADKWARDAQSKEWQEKWWERYYANGAVERGLEK----- 218
Query: 464 PNTQLDAGHA--HVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNS 521
+G W E+WGE+YDG G ++K++DKWAE G G ++WGDKW+E
Sbjct: 219 ------SGREVRQAWWEKWGEQYDGEGATLKWSDKWAE---GSG-TRWGDKWEERRSKFG 268
Query: 522 HGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPH 580
G K GETW G+ GER++RTWGE + G V K+G S++GE WDT ++ + R
Sbjct: 269 SGRKSGETWRVGQDGERFSRTWGEVISPDGSVRKFGTSTTGESWDTTVKENVYLTRISR 327
>gi|412987666|emb|CCO20501.1| predicted protein [Bathycoccus prasinos]
Length = 335
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 97/277 (35%), Positives = 141/277 (50%), Gaps = 47/277 (16%)
Query: 326 LDKVDELATRGINPD-GSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAAD 384
+D+ +GI+P+ G+ W++E+G+++ G CRWT+ G + D++ E++E WE AD
Sbjct: 24 IDEEHTKKMKGIDPETGNSWFRESGVDRGEGGYRCRWTVKGGAAPDKSWEYRETHWEKAD 83
Query: 385 ELGHKELGSEKSGRDATGNVWREFWTESM----------------------------WQN 416
G++ELG+EKSG + G W E W E
Sbjct: 84 LSGYRELGAEKSGFNEKGETWWETWRELYNTSESSSSSSEENNNENNNNSNDDDHHPLSA 143
Query: 417 QGLVHLEKTADKWGK--NGNGD---EWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAG 471
+E++ADKW + + N D EWQEKWWE + + ++ K
Sbjct: 144 SCCQMVERSADKWARHVDTNSDSSREWQEKWWERFSSENTCDRGVEK---------SGRE 194
Query: 472 HAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNS-HGVKQGETW 530
+ H W E+WGE YD +G S+++TDKWAE +G +WGDKW+E S G K GETW
Sbjct: 195 NRHAWWEKWGEHYDPNGISLRWTDKWAENDKG---VRWGDKWEERLKSTSGDGAKSGETW 251
Query: 531 WAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDT 567
GE W RTWGE +G V KYG+S++ E WD
Sbjct: 252 REEPNGEVWRRTWGEEVFENGEVRKYGESTTDEKWDV 288
>gi|145353645|ref|XP_001421117.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|145357258|ref|XP_001422837.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581353|gb|ABO99410.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144583081|gb|ABP01196.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 355
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 83/253 (32%), Positives = 118/253 (46%), Gaps = 37/253 (14%)
Query: 346 KETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVW 405
++ G ++ G RW T+ D E W D G+KELG EKSG + TG W
Sbjct: 115 RKRGFDRGEGGYQSRWWTTKRDCPDGRSGSSETRWAKCDFSGYKELGFEKSGFNDTGETW 174
Query: 406 REFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPN 465
E W E ++ WWE Y A+G E+ K
Sbjct: 175 WETWREIYCRDDFT---------------------GWWERYYANGAVERGVEK------- 206
Query: 466 TQLDAGH--AHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHG 523
+G W E+WGE+YDG G ++K+TDKWAE G +WGDKW+E G
Sbjct: 207 ----SGREVRQAWWEKWGEQYDGEGATLKWTDKWAENGMG---MRWGDKWEERRSKIGSG 259
Query: 524 VKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGF 583
K GETW G+ GER++RTWGE + G V K+G S++GE WDT + ++++ +
Sbjct: 260 RKSGETWRVGEDGERFSRTWGEVLSPDGSVRKFGNSTTGESWDTTVVENVYFDKSKPPTW 319
Query: 584 YHCFDNSVQLREV 596
+S +L +
Sbjct: 320 QEVLSSSERLMSI 332
>gi|6691197|gb|AAF24535.1|AC007534_16 F7F22.5 [Arabidopsis thaliana]
Length = 322
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 79/213 (37%), Positives = 108/213 (50%), Gaps = 44/213 (20%)
Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
A E +D ++E + G N DGS W++E+G + +G CRW+ G S D + EW E
Sbjct: 127 ANEKDWGIDLLNENVNEAGTNEDGSSWFRESGHDLGDNGYRCRWSRMGGRSHDGSSEWTE 186
Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDE 437
W +G EKSG+++ G+ W E W E + Q DE
Sbjct: 187 T-WSVL------FVGVEKSGKNSEGDSWWETWQEVLHQ--------------------DE 219
Query: 438 WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKW 497
W+ WE YDA G EK AHK+ ++ + W E+WGE YDG G +K+TDKW
Sbjct: 220 WR---WEKYDAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKW 267
Query: 498 AERCEGDGWSKWGDKWDENFDPNSHGVKQGETW 530
AE G +KWGDKW+E F + G +QGETW
Sbjct: 268 AETELG---TKWGDKWEEKF-FSGIGSRQGETW 296
>gi|302795454|ref|XP_002979490.1| hypothetical protein SELMODRAFT_56868 [Selaginella moellendorffii]
gi|300152738|gb|EFJ19379.1| hypothetical protein SELMODRAFT_56868 [Selaginella moellendorffii]
Length = 175
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 68/192 (35%), Positives = 97/192 (50%), Gaps = 48/192 (25%)
Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
G N DGS W++E G + +G CR+T+ G S+D + EW+E + +EK
Sbjct: 32 GTNEDGSTWFRECGEDLGENGYRCRYTVMGGRSSDGSTEWKET------------VRAEK 79
Query: 396 SGRDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKW 455
SG++A G+ W E W E + Q++ WE Y+A G EK
Sbjct: 80 SGKNAEGDAWWETWQEILRQDELR-----------------------WEKYNAKGWTEKG 116
Query: 456 AHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDE 515
AHK+ ++ + W E+WGE+YDG G +K+TDKWAE G+ KWGDKW+E
Sbjct: 117 AHKYGRLNEQS---------WWEKWGEQYDGRGAVLKWTDKWAENATGE---KWGDKWEE 164
Query: 516 NFDPNSHGVKQG 527
F N G +QG
Sbjct: 165 KF-QNGAGTRQG 175
>gi|221481921|gb|EEE20287.1| AT hook motif-containing protein, putative [Toxoplasma gondii GT1]
Length = 1282
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 92/204 (45%), Gaps = 31/204 (15%)
Query: 309 RDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVS 368
R++ F E AH+ +K RG + G W ++ +RP+ + T+ S
Sbjct: 1096 REVTDWFEDKFGEVAHSQEKW--AYKRGHSASGDNWLEK--WNERPEEK----SATKSGS 1147
Query: 369 ADEALEWQEKFWEAADELGHKELG-SEKSGRDATGNVWREFWTE--SMWQNQGLVHLEKT 425
EW E++ E DE G K +EK+GR+A G+ W E W E S W K
Sbjct: 1148 NARGDEWSEQWKETFDENGEKSTTWAEKTGRNAQGDAWYETWLERRSNW---------KM 1198
Query: 426 ADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYD 485
A K G+N G+EWQEKW E G EKW KW + + H W +RWG+ D
Sbjct: 1199 AIKEGRNARGEEWQEKWGEDLHEDGSGEKWCQKWAKDNAGNR----HGKSWGDRWGK--D 1252
Query: 486 GHGGSMKYTDKWAERCEGDGWSKW 509
G GG +W E D +KW
Sbjct: 1253 GKGGH-----RWGEEWSNDDVNKW 1271
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/178 (32%), Positives = 81/178 (45%), Gaps = 28/178 (15%)
Query: 371 EALEW-QEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKW 429
E +W ++KF E A +E + K G A+G+ W E W E EK+A K
Sbjct: 1097 EVTDWFEDKFGEVAH---SQEKWAYKRGHSASGDNWLEKWNERP--------EEKSATKS 1145
Query: 430 GKNGNGDEWQEKWWEHYDASG-KAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHG 488
G N GDEW E+W E +D +G K+ WA K N Q DA W+E W E+
Sbjct: 1146 GSNARGDEWSEQWKETFDENGEKSTTWAEK---TGRNAQGDA-----WYETWLERRS--- 1194
Query: 489 GSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGER 546
+ K K G+ +W +KW E+ + G K + W G R ++WG+R
Sbjct: 1195 -NWKMAIKEGRNARGE---EWQEKWGEDLHEDGSGEKWCQKWAKDNAGNRHGKSWGDR 1248
>gi|221501376|gb|EEE27155.1| AT hook motif-containing protein, putative [Toxoplasma gondii VEG]
Length = 1282
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 92/204 (45%), Gaps = 31/204 (15%)
Query: 309 RDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVS 368
R++ F E AH+ +K RG + G W ++ +RP+ + T+ S
Sbjct: 1096 REVTDWFEDKFGEVAHSQEKW--AYKRGHSASGDNWLEK--WNERPEEK----SATKSGS 1147
Query: 369 ADEALEWQEKFWEAADELGHKELG-SEKSGRDATGNVWREFWTE--SMWQNQGLVHLEKT 425
EW E++ E DE G K +EK+GR+A G+ W E W E S W K
Sbjct: 1148 NARGDEWSEQWKETFDENGEKSTTWAEKTGRNAQGDAWYETWLERRSNW---------KM 1198
Query: 426 ADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYD 485
A K G+N G+EWQEKW E G EKW KW + + H W +RWG+ D
Sbjct: 1199 AIKEGRNARGEEWQEKWGEDLHEDGSGEKWCQKWAKDNAGNR----HGKSWGDRWGK--D 1252
Query: 486 GHGGSMKYTDKWAERCEGDGWSKW 509
G GG +W E D +KW
Sbjct: 1253 GKGGH-----RWGEEWSNDDVNKW 1271
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/178 (32%), Positives = 81/178 (45%), Gaps = 28/178 (15%)
Query: 371 EALEW-QEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKW 429
E +W ++KF E A +E + K G A+G+ W E W E EK+A K
Sbjct: 1097 EVTDWFEDKFGEVAH---SQEKWAYKRGHSASGDNWLEKWNERP--------EEKSATKS 1145
Query: 430 GKNGNGDEWQEKWWEHYDASG-KAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHG 488
G N GDEW E+W E +D +G K+ WA K N Q DA W+E W E+
Sbjct: 1146 GSNARGDEWSEQWKETFDENGEKSTTWAEK---TGRNAQGDA-----WYETWLERRS--- 1194
Query: 489 GSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGER 546
+ K K G+ +W +KW E+ + G K + W G R ++WG+R
Sbjct: 1195 -NWKMAIKEGRNARGE---EWQEKWGEDLHEDGSGEKWCQKWAKDNAGNRHGKSWGDR 1248
>gi|237837095|ref|XP_002367845.1| AT hook motif-containing protein [Toxoplasma gondii ME49]
gi|211965509|gb|EEB00705.1| AT hook motif-containing protein [Toxoplasma gondii ME49]
Length = 1282
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 92/204 (45%), Gaps = 31/204 (15%)
Query: 309 RDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVS 368
R++ F E AH+ +K RG + G W ++ +RP+ + T+ S
Sbjct: 1096 REVTDWFEDKFGEVAHSQEKW--AYKRGHSASGDNWLEK--WNERPEEK----SATKSGS 1147
Query: 369 ADEALEWQEKFWEAADELGHKELG-SEKSGRDATGNVWREFWTE--SMWQNQGLVHLEKT 425
EW E++ E DE G K +EK+GR+A G+ W E W E S W K
Sbjct: 1148 NARGDEWSEQWKETFDENGEKSTTWAEKTGRNAQGDAWYETWLERRSNW---------KM 1198
Query: 426 ADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYD 485
A K G+N G+EWQEKW E G EKW KW + + H W +RWG+ D
Sbjct: 1199 AIKEGRNARGEEWQEKWGEDLHEDGSGEKWCQKWAKDNAGNR----HGKSWGDRWGK--D 1252
Query: 486 GHGGSMKYTDKWAERCEGDGWSKW 509
G GG +W E D +KW
Sbjct: 1253 GKGGH-----RWGEEWSNDDVNKW 1271
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/178 (32%), Positives = 81/178 (45%), Gaps = 28/178 (15%)
Query: 371 EALEW-QEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKW 429
E +W ++KF E A +E + K G A+G+ W E W E EK+A K
Sbjct: 1097 EVTDWFEDKFGEVAH---SQEKWAYKRGHSASGDNWLEKWNERP--------EEKSATKS 1145
Query: 430 GKNGNGDEWQEKWWEHYDASG-KAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHG 488
G N GDEW E+W E +D +G K+ WA K N Q DA W+E W E+
Sbjct: 1146 GSNARGDEWSEQWKETFDENGEKSTTWAEK---TGRNAQGDA-----WYETWLERRS--- 1194
Query: 489 GSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGER 546
+ K K G+ +W +KW E+ + G K + W G R ++WG+R
Sbjct: 1195 -NWKMAIKEGRNARGE---EWQEKWGEDLHEDGSGEKWCQKWAKDNAGNRHGKSWGDR 1248
>gi|401403071|ref|XP_003881402.1| putative AT hook motif-containing protein [Neospora caninum
Liverpool]
gi|325115814|emb|CBZ51369.1| putative AT hook motif-containing protein [Neospora caninum
Liverpool]
Length = 1316
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 51/140 (36%), Positives = 69/140 (49%), Gaps = 19/140 (13%)
Query: 362 TMTRGVSADEALEWQEKFWEAADELGHKELG-SEKSGRDATGNVWREFWTE--SMWQNQG 418
T T+ S W E++ E DE G K + +EK+GR+A G+ W E W E + W
Sbjct: 1175 TATKSGSNARGDAWSEQWKETFDENGEKNITWAEKTGRNAQGDSWYETWLERRANW---- 1230
Query: 419 LVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHE 478
K A K G+N G+EWQEKW E G EKW KW + H W +
Sbjct: 1231 -----KMAIKEGRNARGEEWQEKWGEDLHEDGSGEKWCQKWAKDHAGNR----HGKSWGD 1281
Query: 479 RWGEKYDGHGGSMKYTDKWA 498
RWG+ DG GG K+ ++W+
Sbjct: 1282 RWGK--DGKGG-HKWGEEWS 1298
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 59/177 (33%), Positives = 81/177 (45%), Gaps = 29/177 (16%)
Query: 395 KSGRDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASG-KAE 453
K GR+A+G+ W E W E EKTA K G N GD W E+W E +D +G K
Sbjct: 1153 KQGRNASGDQWLEKWNEKP--------EEKTATKSGSNARGDAWSEQWKETFDENGEKNI 1204
Query: 454 KWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKW 513
WA K N Q D+ W+E W E+ + K K G+ +W +KW
Sbjct: 1205 TWAEK---TGRNAQGDS-----WYETWLERR----ANWKMAIKEGRNARGE---EWQEKW 1249
Query: 514 DENFDPNSHGVKQGETWWAGKYGERWNRTWGER--HNGSGWVHKYGKSSSGELWDTH 568
E+ + G K + W G R ++WG+R +G G HK+G+ S E D H
Sbjct: 1250 GEDLHEDGSGEKWCQKWAKDHAGNRHGKSWGDRWGKDGKG-GHKWGEEWSNE--DVH 1303
Score = 38.9 bits (89), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 67/149 (44%), Gaps = 18/149 (12%)
Query: 424 KTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHK-WCSIDPNTQLDAGHAHVWHERWGE 482
+T +K+G +G EW+E W G + W K W + + + WGE
Sbjct: 1026 RTGEKFGSKTDGTEWREAWGRQASDEGPEDSWIEKRWKECNQGEGV---------KEWGE 1076
Query: 483 KYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRT 542
+G G ++ KW ++ G ++ +KW+++ N+ VKQG T W G R
Sbjct: 1077 -TEGSEGRKRWNQKWWKKESWQGGDEFVEKWEDDGYGNTSTVKQGST-WKHHEGGREVTD 1134
Query: 543 WGE------RHNGSGWVHKYGKSSSGELW 565
W E H+ W +K G+++SG+ W
Sbjct: 1135 WFEDKFGVVEHSQEKWAYKQGRNASGDQW 1163
>gi|424042845|ref|ZP_17780513.1| UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--2,6-diaminopimelate
ligase [Vibrio cholerae HENC-02]
gi|408886232|gb|EKM24913.1| UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--2,6-diaminopimelate
ligase [Vibrio cholerae HENC-02]
Length = 485
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 38/84 (45%), Gaps = 12/84 (14%)
Query: 1 MASQLSHYPRATGHRANPPLIFTT----RRTTPQQINFWSRRTGAKVGVSNSEGGGSYLD 56
+A QL HYP N LI T + T Q I W GAK V + G G +LD
Sbjct: 94 IAGQLYHYP-------NMELIGVTGTNGKTTITQLIAQWIDLVGAKAAVMGTTGNG-FLD 145
Query: 57 MWQKAVDRDRKEIEFQKIAGSLAE 80
Q+A + +E QK SLAE
Sbjct: 146 NLQEAANTTGNAVEIQKTLASLAE 169
>gi|424031999|ref|ZP_17771420.1| UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--2,6-diaminopimelate
ligase [Vibrio cholerae HENC-01]
gi|408876411|gb|EKM15528.1| UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--2,6-diaminopimelate
ligase [Vibrio cholerae HENC-01]
Length = 491
Score = 43.9 bits (102), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 38/84 (45%), Gaps = 12/84 (14%)
Query: 1 MASQLSHYPRATGHRANPPLIFTT----RRTTPQQINFWSRRTGAKVGVSNSEGGGSYLD 56
+A QL HYP N LI T + T Q I W GAK V + G G +LD
Sbjct: 100 IAGQLYHYP-------NMELIGVTGTNGKTTITQLIAQWIDLVGAKAAVMGTTGNG-FLD 151
Query: 57 MWQKAVDRDRKEIEFQKIAGSLAE 80
Q+A + +E QK SLAE
Sbjct: 152 NLQEAANTTGNAVEIQKTLASLAE 175
>gi|219117189|ref|XP_002179389.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409280|gb|EEC49212.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1843
Score = 42.0 bits (97), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 35/145 (24%), Positives = 62/145 (42%), Gaps = 9/145 (6%)
Query: 302 TPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRW 361
T SLE E +G EA A D + L +N + GI Q V+ R
Sbjct: 1148 TSSLESEFCVG--LECELFEATIAQDPLQRL--HSLNNGALCLERYAGISQEGSAVIDRE 1203
Query: 362 TMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLVH 421
+T V A ++ + A ++ L S ++ D+ N ++ ++ ++ ++ L
Sbjct: 1204 ALTEQVLCSRA----KRMKDEACQIESLYLASARATHDSCKNHFQNISSKRLYHDEALGK 1259
Query: 422 LEKTADKWGKNGNGDEWQEKWWEHY 446
L + ++G GD W EKWW+ +
Sbjct: 1260 LSQQGTHLSQSG-GDCWDEKWWDEF 1283
>gi|393909952|gb|EJD75659.1| hypothetical protein LOAG_17239 [Loa loa]
Length = 522
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 63/252 (25%), Positives = 110/252 (43%), Gaps = 46/252 (18%)
Query: 9 PRATGHRANPPLIFTTRRTTPQQINFWSRRTGAKVGVSNSEGGG--------SYLDMWQK 60
P++TGH A TTP + S T VGV+ + G + +
Sbjct: 214 PQSTGHPA----------TTPAVLKSMSIHT---VGVAEIDQNGQTAYEVDACLISQLKD 260
Query: 61 AVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDVSKEERDRI 120
+D+ K+I + + +L + + GNE + E LEK +E ++++D E R
Sbjct: 261 ELDKADKKIGYLEKELTLTKRA-IYGNEQFNIKGQIEALEKDKKELTRVIDSQTERLTRF 319
Query: 121 Q-RLQVIDRAAAAIAAARAILEEKNGSVVKNGESS------GTAEVSRFVKKNSESSGA- 172
+ +L+V++R A+ LE N V+ E+SR +K+ E S A
Sbjct: 320 EDQLRVVNREKEALQRKYTELERANNLAVREKLEQLEIVERQRKELSRLEEKHLEKSKAH 379
Query: 173 -AEISPF---VKNSESNGTAEVPERGALSAGIFVP-RSGTPGNRTPAPGPDFWSWSPPED 227
A+I + +K + ERGAL A + R GTP + + ++ + D
Sbjct: 380 DAQIEAYQQLIKQKDD-------ERGALIAELCATNRLGTPLHDSINETESRYAAT---D 429
Query: 228 DDRD-MRDVRDL 238
++R+ +++VRDL
Sbjct: 430 ENRELLKEVRDL 441
>gi|297526966|ref|YP_003668990.1| phosphoesterase DHHA1 [Staphylothermus hellenicus DSM 12710]
gi|297255882|gb|ADI32091.1| phosphoesterase DHHA1 [Staphylothermus hellenicus DSM 12710]
Length = 328
Score = 39.7 bits (91), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 52/130 (40%), Gaps = 11/130 (8%)
Query: 474 HVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPN-----SHGVKQGE 528
HVW E W K G G + Y D+ C +K+ +K+ N D GV G+
Sbjct: 95 HVWDEDWINKLRGLGVKI-YIDR--STCAVGVVAKYAEKYRNNIDEEFVSELVKGVCAGD 151
Query: 529 TWWAGKYGERWNRTWGERHNGSGWVHKYG-KSSSGELWDTHEQQETWYERF-PHFGFYHC 586
W + W RH+ W K K SSG LWD E + ERF Y
Sbjct: 152 LWRFDHWRGPWYLRLVRRHDDPEWRLKVLEKISSGVLWDD-EFTDKVVERFEKELIGYKM 210
Query: 587 FDNSVQLREV 596
D ++ RE+
Sbjct: 211 VDKTILTREI 220
>gi|358388353|gb|EHK25946.1| hypothetical protein TRIVIDRAFT_218117 [Trichoderma virens Gv29-8]
Length = 509
Score = 39.3 bits (90), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 45/96 (46%), Gaps = 8/96 (8%)
Query: 345 WKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDA---T 401
W T + R VCR +R + A ++ + EA D + H+ L S K G + +
Sbjct: 399 WCLTRLALRGTYAVCRMVKSRLLYAVSPAKFHDLEVEAQD-IAHRIL-SLKEGMASFKDS 456
Query: 402 GNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDE 437
VW +F + W +G++ +T D WGK G G E
Sbjct: 457 ALVWNQFMAQGSWIAKGII---ETKDAWGKEGEGHE 489
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.311 0.130 0.415
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,737,318,775
Number of Sequences: 23463169
Number of extensions: 590064556
Number of successful extensions: 1259380
Number of sequences better than 100.0: 379
Number of HSP's better than 100.0 without gapping: 76
Number of HSP's successfully gapped in prelim test: 303
Number of HSP's that attempted gapping in prelim test: 1256267
Number of HSP's gapped (non-prelim): 1015
length of query: 619
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 470
effective length of database: 8,863,183,186
effective search space: 4165696097420
effective search space used: 4165696097420
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 80 (35.4 bits)