BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 007079
         (619 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225432848|ref|XP_002279944.1| PREDICTED: uncharacterized protein LOC100256346 [Vitis vinifera]
          Length = 574

 Score =  722 bits (1863), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/605 (63%), Positives = 452/605 (74%), Gaps = 35/605 (5%)

Query: 1   MASQLSHYPRATGHRANPPLIFTTRRTTPQQINFWSRRTGAKVG--VSNSEGGGSYLDMW 58
           MAS L    RAT   A P    + R      +    RR+G +    +  S+GG SYLDMW
Sbjct: 1   MASHLGASLRATARSAVP---VSHRHKHRVAVTVLVRRSGGRGASRIRVSDGGDSYLDMW 57

Query: 59  QKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDVSKEERD 118
           +KAVD++RK +EFQ+IAG+    G+ DG       +  E LE+KS EF KIL+VSKEERD
Sbjct: 58  KKAVDQERKGMEFQRIAGN--SGGEDDG-------ESAEALERKSGEFMKILEVSKEERD 108

Query: 119 RIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESS-GTAEVSRFVKKNSESSGAAEISP 177
           ++QR+QVIDRAAAAIAAARAIL+E      K+GE   G + V        E SG+  +  
Sbjct: 109 KVQRIQVIDRAAAAIAAARAILQES-----KSGEQELGYSRV--------EGSGSETMHD 155

Query: 178 FVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDMRDVRD 237
             +NS       VP  G  +  +FVP+S T  N TP  GPDFWSW+PP D +    D  +
Sbjct: 156 VFQNSV---IFIVP--GTQNGILFVPQSRTSVNSTP--GPDFWSWTPPMDSEGKSDDAGN 208

Query: 238 LQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSE 297
           LQ A  SS Y TP   ++EK +SVD L IPFES+ SE   +P LPP QSL  V   +VS 
Sbjct: 209 LQTARTSSPYLTPAESLMEKEQSVDFLSIPFESRFSESSHNPPLPPLQSLTEVGTVDVSS 268

Query: 298 TNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGV 357
           ++LE PSL++E +LG LF  HAAEA HALD+VD   + G++PDGSRWW+ETGIEQRPDGV
Sbjct: 269 SSLEMPSLKKEDELGVLFLGHAAEAVHALDEVDGALSHGVSPDGSRWWRETGIEQRPDGV 328

Query: 358 VCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ 417
           VCRWT+ RGVSAD  +EW+EKFWEAAD+  +KELGSEKSGRDATGNVWRE+W ESMWQ+ 
Sbjct: 329 VCRWTLIRGVSADHVVEWEEKFWEAADKFQYKELGSEKSGRDATGNVWREYWKESMWQDC 388

Query: 418 GLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWH 477
           GL+H+EKTADKWGKNG GDEW EKWWE YDASGKA+KWAHKWCSIDPNTQL+AGHAHVWH
Sbjct: 389 GLMHMEKTADKWGKNGKGDEWHEKWWEQYDASGKADKWAHKWCSIDPNTQLEAGHAHVWH 448

Query: 478 ERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGE 537
           ERWGE+YDGHGGSMKYTDKWAERCEGD W+KWGDKWDENFDPNSHGVKQGETWW GK+GE
Sbjct: 449 ERWGERYDGHGGSMKYTDKWAERCEGDAWTKWGDKWDENFDPNSHGVKQGETWWEGKHGE 508

Query: 538 RWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
           RWNRTWGE HNGSGWVHKYGKSSSGE WDTHE+Q+TWYERFPH+GFYHCF+NSVQLREV+
Sbjct: 509 RWNRTWGEGHNGSGWVHKYGKSSSGEHWDTHEEQDTWYERFPHYGFYHCFENSVQLREVQ 568

Query: 598 KPSEF 602
            P + 
Sbjct: 569 TPPQL 573


>gi|84468392|dbj|BAE71279.1| hypothetical protein [Trifolium pratense]
          Length = 563

 Score =  693 bits (1789), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 359/556 (64%), Positives = 417/556 (75%), Gaps = 36/556 (6%)

Query: 49  EGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSK 108
           +   SYLDMW+KA++R+R    F K+A S                ++ E LEKK+EEF K
Sbjct: 43  QASSSYLDMWKKAIERERNTTNFNKLASS-------------NDNNVEENLEKKTEEFQK 89

Query: 109 ILDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSE 168
           +L VS EERDRIQRLQVIDRA+AAIAAARA+L++ N + V++ + S        ++KN  
Sbjct: 90  LLQVSSEERDRIQRLQVIDRASAAIAAARALLKDANSNSVRSDKDS--------LQKNES 141

Query: 169 SSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDD 228
            SG    S FV+           E G  +  +FVP+SGT   +   PGPDFWSW+PP D 
Sbjct: 142 DSGKKNDSIFVQ-----------ESGTQNGTLFVPKSGT--QKDGIPGPDFWSWTPPADS 188

Query: 229 DRDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLL 288
           D    D   L++  KSSV PT  NPVVEK RS   L IPFES L++ K  P LPP QS L
Sbjct: 189 DVPPNDANGLKLNPKSSVNPTLSNPVVEKERSSQSLSIPFESLLTQSKTFPTLPPLQSSL 248

Query: 289 GVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKET 348
            V   E S +N+E+PSLEEE   G L S HAAE   AL+   + +  G+N DG+RWW+ET
Sbjct: 249 EVV--EASASNVESPSLEEELKRGVLSSDHAAEVVRALETDSKSSPVGVNVDGTRWWRET 306

Query: 349 GIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREF 408
           GIEQRPDGV+CRWT+ RGVSAD+ALEWQEKFWEA+DE G+KELGSEKSGRDATGNVWREF
Sbjct: 307 GIEQRPDGVICRWTLIRGVSADKALEWQEKFWEASDEFGYKELGSEKSGRDATGNVWREF 366

Query: 409 WTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQL 468
           W ESM Q  GL+H+EKTADKWG+NG GDEWQEKW+EHY+ASG+AEKWAHKWCSIDPNT L
Sbjct: 367 WRESMRQENGLMHMEKTADKWGRNGQGDEWQEKWFEHYNASGQAEKWAHKWCSIDPNTPL 426

Query: 469 DAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGE 528
           DAGHAHVWHERWGE YDG+GGS+KYTDKWAER    GW KWGDKWDENFDPNSHG+KQGE
Sbjct: 427 DAGHAHVWHERWGETYDGYGGSIKYTDKWAERSSDGGWEKWGDKWDENFDPNSHGIKQGE 486

Query: 529 TWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFD 588
           TWW GKYGERWNRTWGE+HNGSGWVHKYGKSSSGE WDTHE Q+TWYERFPHFGF+HCF+
Sbjct: 487 TWWEGKYGERWNRTWGEQHNGSGWVHKYGKSSSGEHWDTHEPQDTWYERFPHFGFFHCFE 546

Query: 589 NSVQLREVRKPSEFQE 604
           NSVQLREV+KPSE QE
Sbjct: 547 NSVQLREVKKPSERQE 562


>gi|449465699|ref|XP_004150565.1| PREDICTED: uncharacterized protein LOC101218256 [Cucumis sativus]
          Length = 579

 Score =  680 bits (1754), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/603 (61%), Positives = 429/603 (71%), Gaps = 30/603 (4%)

Query: 1   MASQLSHYPRATGHRANPPLIFTTRRTTPQQINFWSRRT--GAKVGVSNSEGGGSYLDMW 58
           M  +L   PR T H   P L        P Q +   R       + +  S+ G SYL MW
Sbjct: 2   MPLRLPLSPRPTLHHHFPRLYHHNFLLLPLQPHIQIRHATPARTLRIRASDEGESYLGMW 61

Query: 59  QKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDVSKEERD 118
           + AV+R RK +EFQK+  +    G+ D N G    D   QLEKKSEEFSKIL V  EERD
Sbjct: 62  KNAVERQRKAVEFQKVVENT--EGNDDRNAGDPSSD---QLEKKSEEFSKILQVPPEERD 116

Query: 119 RIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGAAEISPF 178
           RIQR+QVI RAAAAIAAARA++ E     V + ++         V  NS           
Sbjct: 117 RIQRMQVIHRAAAAIAAARALVGETGTLAVGDSDTC--------VNLNS----------- 157

Query: 179 VKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDMRDVRDL 238
             N E     E       S    +P   T  + TP  GPDFWSW+PP DDD +     +L
Sbjct: 158 -TNDEGLLDREEALSEFQSENALLPEFETSQSWTP--GPDFWSWTPPPDDDGNDNAFGEL 214

Query: 239 QMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSET 298
           Q   KS  YP   N V EK R +D L IPF+S++SE   +PLLPPFQSL+G+EK E SET
Sbjct: 215 QPLGKSQAYPKLSNFVEEKERPIDFLSIPFQSEISE-SVNPLLPPFQSLVGMEKLESSET 273

Query: 299 NLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVV 358
           + ET SLEE+ ++G  FS HAAEA+ AL  VD+ +T+GI+PDGSRWWKETGIEQRPDGV+
Sbjct: 274 STETHSLEEDENVGIEFSVHAAEASQALSSVDKESTKGIDPDGSRWWKETGIEQRPDGVI 333

Query: 359 CRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQG 418
           C+WT+TRGVSAD A EWQ K+WEAADE G+KELGSEKSGRDA GNVWRE+W ESM Q QG
Sbjct: 334 CKWTLTRGVSADLATEWQNKYWEAADEFGYKELGSEKSGRDAYGNVWREYWRESMRQEQG 393

Query: 419 LVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHE 478
           LVHLEKTADKWG NG+G EWQEKWWE+Y+ SG+AEK AHKWC IDPNT +D GHAH+W+E
Sbjct: 394 LVHLEKTADKWGINGSGTEWQEKWWEYYNTSGQAEKNAHKWCKIDPNTYVDPGHAHIWNE 453

Query: 479 RWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGER 538
           RWGEKYDG GGS+KYTDKWAERCEGDGW+KWGDKWDENFDPN HG+KQGETWW G++GER
Sbjct: 454 RWGEKYDGQGGSIKYTDKWAERCEGDGWTKWGDKWDENFDPNGHGIKQGETWWEGRHGER 513

Query: 539 WNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRK 598
           WNRTWGE HNGSGWVHKYGKSSSGE WDTH QQETWYERFPHFGFYHCF+NSVQLREV+K
Sbjct: 514 WNRTWGEGHNGSGWVHKYGKSSSGEHWDTHAQQETWYERFPHFGFYHCFNNSVQLREVQK 573

Query: 599 PSE 601
           PSE
Sbjct: 574 PSE 576


>gi|255552009|ref|XP_002517049.1| conserved hypothetical protein [Ricinus communis]
 gi|223543684|gb|EEF45212.1| conserved hypothetical protein [Ricinus communis]
          Length = 566

 Score =  666 bits (1719), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/615 (60%), Positives = 430/615 (69%), Gaps = 61/615 (9%)

Query: 1   MASQLSHYPRATGHRANPPLIFTTRRTTPQQINFW-------SRRTGAKVGVSNSEGGGS 53
           M S LS + RAT      PL     +TTPQQ+            +T +    ++   G S
Sbjct: 1   MTSNLSSFSRAT------PLF---PKTTPQQLFLPFEQPVLPPNKTSSYRAKASINNGES 51

Query: 54  YLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDVS 113
           YLDMW+ AVDRD+K +EFQK+A   ++  D   +     RD    + +K+++F KI+D S
Sbjct: 52  YLDMWKSAVDRDKKSVEFQKLAERFSQI-DNTSDSTSVKRD---DVNRKTKDFKKIVDFS 107

Query: 114 KEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGAA 173
           K+ERDRIQR+QV+DRAAAAIAAARAIL+E+       G+                     
Sbjct: 108 KDERDRIQRMQVVDRAAAAIAAARAILKERRSENANVGD--------------------- 146

Query: 174 EISPFVKNSESNGTAEVPE-RGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDM 232
                      NG  E     G  +  IFV RS T GN    PGPDFW+W+PP D+ R  
Sbjct: 147 -----------NGNLETESGEGTTNESIFVSRSETSGN--GVPGPDFWTWTPPPDN-RTQ 192

Query: 233 RDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEK 292
            D  +L  A+KSS  P     V  K RS+  L IP +SKLS    +P LPP QSL+ V+K
Sbjct: 193 YDF-ELMEAQKSSASPISTRNVAMKERSLSYLDIPLQSKLSPSDLNPPLPPLQSLMEVKK 251

Query: 293 EEVSETNLETPSL-EEERDLGALFSAHAAEAAHAL--DKVDELATRGINPDGSRWWKETG 349
           EE SE   E PSL EEER+L   F+AHA EA + L  +K DE A+ G+  DGSRWWKE G
Sbjct: 252 EEDSEFRPEMPSLKEEERELDLEFTAHAIEAGYVLATEKEDE-ASSGMELDGSRWWKEKG 310

Query: 350 IEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFW 409
           IE+RPDGV+CRWTM RGVS DE +EWQEKFWEA DE G+KELGSEKSGRDATGNVWRE+W
Sbjct: 311 IERRPDGVICRWTMIRGVSVDEDVEWQEKFWEATDEFGYKELGSEKSGRDATGNVWREYW 370

Query: 410 TESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLD 469
            ESMWQ  GLVHLEKTA+KWGKNG GDEW+EKWWEHYDAS KAEKWAHKWC+IDP  QL+
Sbjct: 371 RESMWQESGLVHLEKTANKWGKNGEGDEWEEKWWEHYDASNKAEKWAHKWCTIDPTRQLE 430

Query: 470 AGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGET 529
           AGHAH+WHERWGE YDGHGGSMKYTDKWAERCEGDGW+KWGDKWDE+FDPN HGVKQGET
Sbjct: 431 AGHAHIWHERWGENYDGHGGSMKYTDKWAERCEGDGWTKWGDKWDEHFDPNGHGVKQGET 490

Query: 530 WWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDN 589
           WWAGK+GERWNRTWGERHNGSGWVHKYGKSSSGE WDTHE+QETWYERFPH+GFYHCF+N
Sbjct: 491 WWAGKHGERWNRTWGERHNGSGWVHKYGKSSSGEHWDTHEEQETWYERFPHYGFYHCFEN 550

Query: 590 SVQLREVRKPSEFQE 604
           S  LREV+ PS+  E
Sbjct: 551 SGILREVQIPSDSHE 565


>gi|297816898|ref|XP_002876332.1| hypothetical protein ARALYDRAFT_486012 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322170|gb|EFH52591.1| hypothetical protein ARALYDRAFT_486012 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 580

 Score =  654 bits (1687), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/565 (61%), Positives = 419/565 (74%), Gaps = 33/565 (5%)

Query: 38  RTGAKVGVSNSEGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTE 97
           R+G ++   ++EG  SYLDMW+ AVDR++KE  F+KIA ++     VDG +  GG     
Sbjct: 48  RSGVRILRVSNEGRESYLDMWKNAVDREKKEKAFEKIAENVVA---VDGEKEKGG----- 99

Query: 98  QLEKKSEEFSKILDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTA 157
            +EKKS+EF KIL+VS EERDRIQR+QV+DRAAAAI+AARAIL   N    K G      
Sbjct: 100 DMEKKSDEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAILASNNSGDGKEG------ 153

Query: 158 EVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGP 217
                   N E++  +E++   KN++          G  S  ++VPRS T G  TP  GP
Sbjct: 154 ------FPNEENTVTSEVTETPKNAK---------LGMWSRTVYVPRSETSGTETP--GP 196

Query: 218 DFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKP 277
           DFWSW+PP+  +       DLQ  EK + +PT  NPV+EK +S D L IP+ES LS  + 
Sbjct: 197 DFWSWTPPQGSEISSNMNVDLQAVEKPAEFPTLPNPVLEKDKSADSLSIPYESMLSSERH 256

Query: 278 DPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGI 337
              +PPF+SL+ V KE  ++ + ET S E + DL  + SA+A EAA  LD +DE +T G+
Sbjct: 257 SFTIPPFESLIEVRKEAETKPSSETSSTEHDLDL--ISSANAEEAARVLDSLDESSTHGV 314

Query: 338 NPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSG 397
           + DG +WWK+TG+E+RPDGVVCRWTM RGV+AD  +EWQ+K+WEA+D+ G KELGSEKSG
Sbjct: 315 SEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEASDDFGFKELGSEKSG 374

Query: 398 RDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAH 457
           RDATGNVWREFW ESM Q  G+VH+EKTADKWGK+G GDEWQEKWWEHYDA+GK+EKWAH
Sbjct: 375 RDATGNVWREFWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWWEHYDATGKSEKWAH 434

Query: 458 KWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENF 517
           KWCSID NT LDAGHAHVWHERWGEKYDG GGS KYTDKWAER  GDGW KWGDKWDENF
Sbjct: 435 KWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWAERWVGDGWDKWGDKWDENF 494

Query: 518 DPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYER 577
           +P++ GVKQGETWW GK+G+RWNRTWGE HNGSGWVHKYGKSSSGE WDTH  QETWYER
Sbjct: 495 NPSAQGVKQGETWWEGKHGDRWNRTWGEGHNGSGWVHKYGKSSSGEHWDTHVPQETWYER 554

Query: 578 FPHFGFYHCFDNSVQLREVRKPSEF 602
           FPHFGF+HCFDNSVQLR V+KPS+ 
Sbjct: 555 FPHFGFFHCFDNSVQLRAVKKPSDM 579


>gi|15228186|ref|NP_191135.1| uncharacterized protein [Arabidopsis thaliana]
 gi|30694316|ref|NP_850708.1| uncharacterized protein [Arabidopsis thaliana]
 gi|334186001|ref|NP_001190098.1| uncharacterized protein [Arabidopsis thaliana]
 gi|58652076|gb|AAW80863.1| At3g55760 [Arabidopsis thaliana]
 gi|332645910|gb|AEE79431.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332645911|gb|AEE79432.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332645912|gb|AEE79433.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 578

 Score =  649 bits (1674), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/565 (61%), Positives = 419/565 (74%), Gaps = 36/565 (6%)

Query: 38  RTGAKVGVSNSEGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTE 97
           RTG ++   ++EG  SYLDMW+ AVDR++KE  F+KIA ++     VDG +  GG     
Sbjct: 49  RTGVRILRVSNEGRESYLDMWKNAVDREKKEKAFEKIAENVVA---VDGEKEKGG----- 100

Query: 98  QLEKKSEEFSKILDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTA 157
            LEKKS+EF KIL+VS EERDRIQR+QV+DRAAAAI+AARAIL   N    K G      
Sbjct: 101 DLEKKSDEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAILASNNSGDGKEG------ 154

Query: 158 EVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGP 217
                   N +++  +E++   KN++          G  S  ++VPRS T G  TP  GP
Sbjct: 155 ------FPNEDNTVTSEVTETPKNAK---------LGMWSRTVYVPRSETSGTETP--GP 197

Query: 218 DFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKP 277
           DFWSW+PP+  +  +  V DLQ  EK + +PT  NPV+EK +S D L IP+ES LS  + 
Sbjct: 198 DFWSWTPPQGSE--ISSV-DLQAVEKPAEFPTLPNPVLEKDKSADSLSIPYESMLSSERH 254

Query: 278 DPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGI 337
              +PPF+SL+ V KE  +ET   + +L  E DL  + SA+A E A  LD +DE +T G+
Sbjct: 255 SFTIPPFESLIEVRKE--AETKPSSETLSTEHDLDLISSANAEEVARVLDSLDESSTHGV 312

Query: 338 NPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSG 397
           + DG +WWK+TG+E+RPDGVVCRWTM RGV+AD  +EWQ+K+WEA+D+ G KELGSEKSG
Sbjct: 313 SEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEASDDFGFKELGSEKSG 372

Query: 398 RDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAH 457
           RDATGNVWREFW ESM Q  G+VH+EKTADKWGK+G GDEWQEKWWEHYDA+GK+EKWAH
Sbjct: 373 RDATGNVWREFWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWWEHYDATGKSEKWAH 432

Query: 458 KWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENF 517
           KWCSID NT LDAGHAHVWHERWGEKYDG GGS KYTDKWAER  GDGW KWGDKWDENF
Sbjct: 433 KWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWAERWVGDGWDKWGDKWDENF 492

Query: 518 DPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYER 577
           +P++ GVKQGETWW GK+G+RWNR+WGE HNGSGWVHKYGKSSSGE WDTH  QETWYE+
Sbjct: 493 NPSAQGVKQGETWWEGKHGDRWNRSWGEGHNGSGWVHKYGKSSSGEHWDTHVPQETWYEK 552

Query: 578 FPHFGFYHCFDNSVQLREVRKPSEF 602
           FPHFGF+HCFDNSVQLR V+KPS+ 
Sbjct: 553 FPHFGFFHCFDNSVQLRAVKKPSDM 577


>gi|22655111|gb|AAM98146.1| putative protein [Arabidopsis thaliana]
          Length = 578

 Score =  648 bits (1672), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/565 (61%), Positives = 419/565 (74%), Gaps = 36/565 (6%)

Query: 38  RTGAKVGVSNSEGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTE 97
           RTG ++   ++EG  SYLDMW+ AVDR++KE  F+KIA ++     VDG +  GG     
Sbjct: 49  RTGVRILRVSNEGRESYLDMWKNAVDREKKEKAFEKIAENVVA---VDGEKEKGG----- 100

Query: 98  QLEKKSEEFSKILDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTA 157
            LEKKS+EF KIL+VS EERDRIQR+QV+DRAAAAI+AARAIL   N    K G      
Sbjct: 101 DLEKKSDEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAILASNNSGDGKEG------ 154

Query: 158 EVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGP 217
                   N +++  +E++   KN++          G  S  ++VPRS T G  TP  GP
Sbjct: 155 ------FPNEDNTVTSEVTETPKNAK---------LGMWSRTVYVPRSETSGTETP--GP 197

Query: 218 DFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKP 277
           DFWSW+PP+  +  +  V DLQ  EK + +PT  NPV+EK +S D L IP+ES LS  + 
Sbjct: 198 DFWSWTPPQGSE--ISSV-DLQAVEKPAEFPTLPNPVLEKDKSADSLSIPYESMLSSERH 254

Query: 278 DPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGI 337
              +PPF+SL+ V KE  +ET   + +L  E DL  + SA+A E A  LD +DE +T G+
Sbjct: 255 SFTIPPFESLIEVRKE--AETKPSSETLSTEHDLDLISSANAEEVARVLDSLDESSTHGV 312

Query: 338 NPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSG 397
           + DG +WWK+TG+E+RPDGVVCRWTM RGV+AD  +EWQ+K+WEA+D+ G KELGSEKSG
Sbjct: 313 SEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEASDDSGFKELGSEKSG 372

Query: 398 RDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAH 457
           RDATGNVWREFW ESM Q  G+VH+EKTADKWGK+G GDEWQEKWWEHYDA+GK+EKWAH
Sbjct: 373 RDATGNVWREFWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWWEHYDATGKSEKWAH 432

Query: 458 KWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENF 517
           KWCSID NT LDAGHAHVWHERWGEKYDG GGS KYTDKWAER  GDGW KWGDKWDENF
Sbjct: 433 KWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWAERWVGDGWDKWGDKWDENF 492

Query: 518 DPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYER 577
           +P++ GVKQGETWW GK+G+RWNR+WGE HNGSGWVHKYGKSSSGE WDTH  QETWYE+
Sbjct: 493 NPSAQGVKQGETWWEGKHGDRWNRSWGEGHNGSGWVHKYGKSSSGEHWDTHVPQETWYEK 552

Query: 578 FPHFGFYHCFDNSVQLREVRKPSEF 602
           FPHFGF+HCFDNSVQLR V+KPS+ 
Sbjct: 553 FPHFGFFHCFDNSVQLRAVKKPSDM 577


>gi|224102135|ref|XP_002312560.1| predicted protein [Populus trichocarpa]
 gi|222852380|gb|EEE89927.1| predicted protein [Populus trichocarpa]
          Length = 536

 Score =  643 bits (1659), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/569 (65%), Positives = 426/569 (74%), Gaps = 43/569 (7%)

Query: 41  AKVGVSNSEGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLE 100
            ++ VSN +G  SYLDMW+ AVDR+RK +EFQ+IAG+LA++ +   ++     D+T  LE
Sbjct: 4   TRIRVSN-DGSNSYLDMWKTAVDRERKTVEFQQIAGNLAQTDNDSDDDD---DDVTVDLE 59

Query: 101 KKSEEFSKILDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVS 160
           KKSE+F+KIL+VSKEERDRIQR+QVIDRAAAAIAAAR I+ EK  +              
Sbjct: 60  KKSEDFNKILEVSKEERDRIQRVQVIDRAAAAIAAARDIVREKKSA-------------- 105

Query: 161 RFVKKNSESSGAAEISPFVKNSESNGTAEVPERG----ALSAGIFVPRSGTPGNRTPAPG 216
                              K+ ES G     ++G    + S  I V RS +  +    PG
Sbjct: 106 ---------------DKDFKSHESMGGEVEDQQGGKFWSYSRSILVSRSESSAD--GVPG 148

Query: 217 PDFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPK 276
           PDFWSW+PP   D +  D  D+    KSS  P  + PV  K RS D L IPFESKL +  
Sbjct: 149 PDFWSWTPPLSSDGNSDDSSDVLKVRKSSDTPLTI-PVAMKERSADFLSIPFESKLLDTN 207

Query: 277 PDPLLPPFQSLLGVEKEEVSETNLETPSL-EEERDLGALFSAHAAEAAHAL--DKVDELA 333
               +PP QSL+ VE  EVSE+ LE PS  EEER+LG  FSA+AAEAAHAL  DKVDEL+
Sbjct: 208 HSSPIPPLQSLVEVEGVEVSESILEMPSKNEEERELGVQFSAYAAEAAHALEKDKVDELS 267

Query: 334 TRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGS 393
           + G+  DGSR W+ETGIEQRPDGV+CRWTMTRGVSAD+ +EWQEKFWEAAD+ G+KELGS
Sbjct: 268 SYGVTADGSRCWRETGIEQRPDGVICRWTMTRGVSADQEVEWQEKFWEAADDFGYKELGS 327

Query: 394 EKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAE 453
           EKSGRDATGNVWREFW ESM Q  GL+HLEKTADKWGKNG GDEWQEKWWEHY ASG+AE
Sbjct: 328 EKSGRDATGNVWREFWRESMRQESGLLHLEKTADKWGKNGQGDEWQEKWWEHYGASGQAE 387

Query: 454 KWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKW 513
           KWAHKWCSIDP T L+AGHAHVWHERWGEKYDGHGGS KYTDKWAERCEGDGW+KWGDKW
Sbjct: 388 KWAHKWCSIDPTTNLEAGHAHVWHERWGEKYDGHGGSTKYTDKWAERCEGDGWAKWGDKW 447

Query: 514 DENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQET 573
           DENFD N HGVKQGE WW GK+GERWNRTWGERHNGSGWVHKYGKSS GE WDTH QQ+T
Sbjct: 448 DENFDLNGHGVKQGEAWWEGKHGERWNRTWGERHNGSGWVHKYGKSSCGEHWDTHTQQDT 507

Query: 574 WYERFPHFGFYHCFDNSVQLREVRKPSEF 602
           WYERFPH+GFYHCF+NSVQLREV+KPSE 
Sbjct: 508 WYERFPHYGFYHCFENSVQLREVQKPSEI 536


>gi|297737133|emb|CBI26334.3| unnamed protein product [Vitis vinifera]
          Length = 414

 Score =  634 bits (1635), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 300/413 (72%), Positives = 343/413 (83%), Gaps = 2/413 (0%)

Query: 190 VPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPT 249
           V + G  +  +FVP+S T  N TP  GPDFWSW+PP D +    D  +LQ A  SS Y T
Sbjct: 3   VQQGGTQNGILFVPQSRTSVNSTP--GPDFWSWTPPMDSEGKSDDAGNLQTARTSSPYLT 60

Query: 250 PVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSETNLETPSLEEER 309
           P   ++EK +SVD L IPFES+ SE   +P LPP QSL  V   +VS ++LE PSL++E 
Sbjct: 61  PAESLMEKEQSVDFLSIPFESRFSESSHNPPLPPLQSLTEVGTVDVSSSSLEMPSLKKED 120

Query: 310 DLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSA 369
           +LG LF  HAAEA HALD+VD   + G++PDGSRWW+ETGIEQRPDGVVCRWT+ RGVSA
Sbjct: 121 ELGVLFLGHAAEAVHALDEVDGALSHGVSPDGSRWWRETGIEQRPDGVVCRWTLIRGVSA 180

Query: 370 DEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKW 429
           D  +EW+EKFWEAAD+  +KELGSEKSGRDATGNVWRE+W ESMWQ+ GL+H+EKTADKW
Sbjct: 181 DHVVEWEEKFWEAADKFQYKELGSEKSGRDATGNVWREYWKESMWQDCGLMHMEKTADKW 240

Query: 430 GKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGG 489
           GKNG GDEW EKWWE YDASGKA+KWAHKWCSIDPNTQL+AGHAHVWHERWGE+YDGHGG
Sbjct: 241 GKNGKGDEWHEKWWEQYDASGKADKWAHKWCSIDPNTQLEAGHAHVWHERWGERYDGHGG 300

Query: 490 SMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNG 549
           SMKYTDKWAERCEGD W+KWGDKWDENFDPNSHGVKQGETWW GK+GERWNRTWGE HNG
Sbjct: 301 SMKYTDKWAERCEGDAWTKWGDKWDENFDPNSHGVKQGETWWEGKHGERWNRTWGEGHNG 360

Query: 550 SGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPSEF 602
           SGWVHKYGKSSSGE WDTHE+Q+TWYERFPH+GFYHCF+NSVQLREV+ P + 
Sbjct: 361 SGWVHKYGKSSSGEHWDTHEEQDTWYERFPHYGFYHCFENSVQLREVQTPPQL 413


>gi|356500589|ref|XP_003519114.1| PREDICTED: uncharacterized protein LOC100810326 [Glycine max]
          Length = 575

 Score =  603 bits (1554), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 328/590 (55%), Positives = 403/590 (68%), Gaps = 54/590 (9%)

Query: 38  RTGAKVGVSNSEG-GGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLT 96
           R  A VG    +G   SYLDMW+KAV+R+RK   F  IA  +A + D +           
Sbjct: 32  RANASVGGGKGDGETASYLDMWKKAVERERKSASFNSIADRVAANTDDN---------ND 82

Query: 97  EQLEKKSEEFSKILDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGT 156
           + LEKK+ EF K+L VS EERDR+QR+QVIDRAAAAIAAAR +L+E++ +   + E++  
Sbjct: 83  DDLEKKTSEFQKLLQVSAEERDRVQRMQVIDRAAAAIAAARQLLQERSAADSSHHEATAD 142

Query: 157 AEVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPG 216
                  +++   SG                      G  S GI V  S T GN    PG
Sbjct: 143 E------RRDESGSGM---------------------GVQSEGIRVSESETRGN--GVPG 173

Query: 217 PDFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPK 276
           PDFWSW+PP + D    D   LQ+  KSSV PT  + VVEK  +   L IPFES LS+ +
Sbjct: 174 PDFWSWTPPVESDVPSDDGSGLQLDTKSSVRPTLPSAVVEKEWTPQFLSIPFESLLSQSE 233

Query: 277 PDPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAH-------AAEAAHALDKV 329
               LPPFQS L VE+ E +                     H       AAEAAHAL + 
Sbjct: 234 RSDTLPPFQSFLEVEEAETTSDPESLSESPSLSLSLEEEQIHGESSFDYAAEAAHALSEA 293

Query: 330 DELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHK 389
           ++ +  G+NPDGSRWWKETGIE+RPDGV+CRWTMTRGVSAD+A+EWQEK+WEA+D+ G+K
Sbjct: 294 NKSSPIGVNPDGSRWWKETGIERRPDGVICRWTMTRGVSADKAIEWQEKYWEASDDFGYK 353

Query: 390 ELGSEKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDAS 449
           ELGSEKSGRDA GN+WREFW ES+    GL+  EKTADKWG+N NG+EWQEKW E Y+A+
Sbjct: 354 ELGSEKSGRDANGNIWREFWRESLCLENGLMSFEKTADKWGRNVNGNEWQEKWGERYNAA 413

Query: 450 GKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKW 509
           G+ EKWAHKWCSIDPNT L+ GHAHVWHERWG KYDG+GGS+KYTDKWAER    GW KW
Sbjct: 414 GQTEKWAHKWCSIDPNTPLEPGHAHVWHERWGGKYDGYGGSIKYTDKWAERFVDGGWDKW 473

Query: 510 GDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHE 569
           GDKWDENFDPN++GVKQGE+WW G++G+RWNRTWGE+HNGSGW+HKYG+SSSGE WDTH 
Sbjct: 474 GDKWDENFDPNANGVKQGESWWEGRHGDRWNRTWGEQHNGSGWIHKYGQSSSGEHWDTHA 533

Query: 570 QQETWYERFPHFGFYHCFDNSVQLREVRKPSEFQEEPFEIQDKRSELQEP 619
           +++TWYE+FPH+GF++CF+NSVQLREV KPSE      EIQ    ++QEP
Sbjct: 534 REDTWYEKFPHYGFFNCFENSVQLREVPKPSEI----LEIQ----QVQEP 575


>gi|326520025|dbj|BAK03937.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 605

 Score =  570 bits (1470), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 320/569 (56%), Positives = 390/569 (68%), Gaps = 40/569 (7%)

Query: 53  SYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDV 112
           SYLDMW+KAVDR+R+  E   +A  L  S      E          +E+++  F ++L V
Sbjct: 51  SYLDMWKKAVDRERRSAE---LAYRLQPSPPPAEAEAE--APPQADVERRTARFEEMLRV 105

Query: 113 SKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVV-----KNGESSGTAEV-----SRF 162
            +EERDR+QR QVIDRAAAA+AAARA+L+E   S       K   ++G AE      SR 
Sbjct: 106 PREERDRVQRTQVIDRAAAALAAARAVLKEPPQSSPTPQPHKPTPATGVAEPGNVFDSRK 165

Query: 163 VKKNSESSGAAEIS-PFVKNSE--SNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDF 219
             K  E  G+ + S P   NSE  +N     P + A S      + GTPG       PDF
Sbjct: 166 AAKGLEDQGSGQDSLPAASNSEKVTNSGDSYPSKQASS------KLGTPG-------PDF 212

Query: 220 WSWSPPEDDDRDMRDVRD-LQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPD 278
           WSW PP D   + R+    L+ ++K   + +    ++EK RS D L +PF +   E K D
Sbjct: 213 WSWLPPVDSSSEPRESNTVLKPSKKVDSFSSQPEMLMEKERSADFLSLPFVASFFEKKED 272

Query: 279 PLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGIN 338
             LPPFQS +  E  +    +   P  + E      FS +AAE A AL   D+ ++ GI+
Sbjct: 273 RSLPPFQSFVEPENTD----SKAKPVADAEEAFETQFSQNAAETARALSTSDDKSSHGID 328

Query: 339 PDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGR 398
           PDGS+WWKETG+EQRPDGVVC+WT+ RGVSAD ++E+++K+WEA+D   HKELGSEKSGR
Sbjct: 329 PDGSKWWKETGVEQRPDGVVCKWTVVRGVSADGSVEFEDKYWEASDRFDHKELGSEKSGR 388

Query: 399 DATGNVWREFWTESMWQN--QGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWA 456
           DA GNVWRE+W ESMWQ+   GL+H+EKTADKWGKNG G++WQEKWWE YD+SGKAEK A
Sbjct: 389 DARGNVWREYWKESMWQDFTSGLMHMEKTADKWGKNGKGEQWQEKWWEQYDSSGKAEKSA 448

Query: 457 HKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDEN 516
            KWCS+DPNT LDAGHAHVWHERWGE YDG GGS+KYTDKWAER EGDGWSKWGDKWDE+
Sbjct: 449 DKWCSLDPNTPLDAGHAHVWHERWGETYDGCGGSVKYTDKWAERSEGDGWSKWGDKWDEH 508

Query: 517 FDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYE 576
           FDPN HGVKQGETWW GKYG+RWNRTWGE HNGSGWVHKYG+SSSGE WDTHE QETWYE
Sbjct: 509 FDPNRHGVKQGETWWEGKYGDRWNRTWGEGHNGSGWVHKYGRSSSGEHWDTHEPQETWYE 568

Query: 577 RFPHFGFYHCFDNSVQLREVRK--PSEFQ 603
            +PHFGF+HCF+NSVQL  V +  P  F+
Sbjct: 569 SYPHFGFHHCFENSVQLLSVSRQPPKNFK 597


>gi|242071493|ref|XP_002451023.1| hypothetical protein SORBIDRAFT_05g022830 [Sorghum bicolor]
 gi|241936866|gb|EES10011.1| hypothetical protein SORBIDRAFT_05g022830 [Sorghum bicolor]
          Length = 605

 Score =  558 bits (1438), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 301/552 (54%), Positives = 387/552 (70%), Gaps = 18/552 (3%)

Query: 54  YLDMWQKAVDRDRKEIEF-QKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDV 112
           YLDMW+KAV+R+R+  E  +++  +   S   + +         E + +++  F ++L V
Sbjct: 59  YLDMWRKAVERERRSAELARRLQEAPPTSPAAEADAPPAPGAPVEDVRRRTARFEEMLRV 118

Query: 113 SKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNG--ESSGTAEVSRFVKKNSESS 170
             EERDR+QR QVIDRAAAA+AAARA+L+E   +   +   + + TA+V       +   
Sbjct: 119 PPEERDRVQRRQVIDRAAAALAAARAVLKEPPPASPPSTPPQVAETAKVGSVAAGAAARG 178

Query: 171 GAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPP-EDDD 229
                 P  +   S+ +AEVP+ G  S       S    ++   PGPDFWSW PP +D  
Sbjct: 179 SDRSSRPSARAQLSSPSAEVPDSGVSSP------SKQSSSKLGTPGPDFWSWLPPVQDSS 232

Query: 230 RDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLG 289
           +       L+ ++K   + +  + ++EK RS D L +PFE+   + K D  LPPFQS   
Sbjct: 233 KQKESGTGLKPSKKMDAFSSQPDLLMEKERSADSLSLPFETAFFKKKEDRSLPPFQSF-- 290

Query: 290 VEKEEV-SETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKET 348
            E E V S+ +L   + +++      FS +AAE A AL +  E ++ GI+ DGS WWKET
Sbjct: 291 AEPENVDSKADL---AADKKDTFEEQFSKNAAEVARALSESTEKSSHGIHLDGSMWWKET 347

Query: 349 GIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREF 408
           G+EQRPDGVVC+WT+ RGVSAD A+E+++K+WEA+D   HKELGSEKSGRDA GNVWRE+
Sbjct: 348 GVEQRPDGVVCKWTVIRGVSADGAVEFEDKYWEASDRFDHKELGSEKSGRDAAGNVWREY 407

Query: 409 WTESMWQNQ--GLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNT 466
           W ESMWQ+   G++H+EKTADKWG+NG G++WQE+W+EHYD++GK EKWA KWCS+DPNT
Sbjct: 408 WKESMWQDYTCGVMHMEKTADKWGQNGKGEQWQEQWFEHYDSTGKTEKWADKWCSLDPNT 467

Query: 467 QLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQ 526
            LD GHAHVWHERWGE YDG+GGS KYTDKWAER EGDGWSKWGDKWDE+FDPN HG KQ
Sbjct: 468 PLDVGHAHVWHERWGENYDGYGGSTKYTDKWAERSEGDGWSKWGDKWDEHFDPNGHGTKQ 527

Query: 527 GETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHC 586
           GETWWAGKYG+RWNRTWGE HNGSGWVHKYG+SSSGE WDTH  Q+TWYERFPHFGFYHC
Sbjct: 528 GETWWAGKYGDRWNRTWGEGHNGSGWVHKYGRSSSGEHWDTHVPQDTWYERFPHFGFYHC 587

Query: 587 FDNSVQLREVRK 598
           F+NS QLR V++
Sbjct: 588 FENSAQLRSVKR 599


>gi|77551654|gb|ABA94451.1| expressed protein [Oryza sativa Japonica Group]
 gi|125577642|gb|EAZ18864.1| hypothetical protein OsJ_34403 [Oryza sativa Japonica Group]
          Length = 618

 Score =  556 bits (1434), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 316/590 (53%), Positives = 400/590 (67%), Gaps = 31/590 (5%)

Query: 39  TGAKVGVSNSEGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNE------GGGG 92
           T  +      +GG SYLDMW+KAV+R+R+  E   IA  L +S              G  
Sbjct: 35  TCVRATARGGDGGSSYLDMWKKAVERERRSAE---IAHRLQQSSSAAAAAVKEEEGEGKA 91

Query: 93  RDLTEQLEKKSEEFSKILDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGE 152
                 +E+++  F ++L V +EERDR+QR QVIDRAAAA+AAARA+L++       +  
Sbjct: 92  AAAAGDVERRTARFEEMLRVPREERDRVQRRQVIDRAAAALAAARAVLKDPPPPPPPSPP 151

Query: 153 SSGTAE-------VSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRS 205
           S+   E        +  ++  SES   +  +P  ++  ++    V E    +A + VP S
Sbjct: 152 STPPQEREQQQKPAATAIQAGSESGLVSRTAPG-ESDRASPPPPVTETATEAAKVSVPDS 210

Query: 206 GTPGNRTP------APGPDFWSWSPPEDDDRDMRDV-RDLQMAEKSSVYPTPVNPVVEKA 258
           G              PGPDFWSW PP ++   + ++   L+ +EK   +    + ++EK 
Sbjct: 211 GDSSPFKKSSSKLGTPGPDFWSWLPPVENSTKLGEIDTGLKPSEKLDSFAGQPDLLMEKE 270

Query: 259 RSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAH 318
           +S DIL +PFE+   + K D  LPPFQS    E  E SE ++   + + E      FS +
Sbjct: 271 QSEDILSLPFETSFFK-KEDRSLPPFQSFAEPENVE-SEPSI---TADAEETFEDQFSKN 325

Query: 319 AAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEK 378
           AAEAA AL   DE ++ G+ PDGS WWKETG+EQRPDGV C+WT+ RGVSAD A+EW++K
Sbjct: 326 AAEAARALSASDEKSSHGVRPDGSLWWKETGVEQRPDGVTCKWTVIRGVSADGAVEWEDK 385

Query: 379 FWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQN--QGLVHLEKTADKWGKNGNGD 436
           +WEA+D   HKELGSEKSGRDATGNVWRE+W ESMWQ+   G++H+EKTADKWG+NG G+
Sbjct: 386 YWEASDRFDHKELGSEKSGRDATGNVWREYWKESMWQDFTCGVMHMEKTADKWGQNGKGE 445

Query: 437 EWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDK 496
           +WQE+WWEHYD+SGKAEKWA KWCS+DPNT LD GHAHVWHERWGEKYDG GGS KYTDK
Sbjct: 446 QWQEQWWEHYDSSGKAEKWADKWCSLDPNTPLDVGHAHVWHERWGEKYDGCGGSAKYTDK 505

Query: 497 WAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKY 556
           WAER EGDGWSKWGDKWDE+FDPN HGVKQGETWWAGKYG+RWNRTWGE HN +GWVHKY
Sbjct: 506 WAERSEGDGWSKWGDKWDEHFDPNGHGVKQGETWWAGKYGDRWNRTWGEHHNCTGWVHKY 565

Query: 557 GKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPSEFQEEP 606
           G+SSSGE WDTH  Q+TWYERFPHFGF HCF+NSVQLR V++ +    +P
Sbjct: 566 GRSSSGEHWDTHVPQDTWYERFPHFGFEHCFNNSVQLRSVKRQTPKNTKP 615


>gi|413925397|gb|AFW65329.1| hypothetical protein ZEAMMB73_910657 [Zea mays]
          Length = 583

 Score =  553 bits (1426), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 306/562 (54%), Positives = 386/562 (68%), Gaps = 39/562 (6%)

Query: 50  GGGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKI 109
           GGGSYLDMW+KAV+R+R+  +  +   +     D             E + +++  F ++
Sbjct: 41  GGGSYLDMWKKAVERERRSADLARRLQAPPAEADAPAPA-----PPVEDVRRRTARFEEM 95

Query: 110 LDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNG--------SVVKNGE-SSGTAEVS 160
           L V +EERDR+QR QVIDRAAAA+AAARA+L+E            V + G+  S     +
Sbjct: 96  LRVPREERDRVQRNQVIDRAAAALAAARAVLKEPPAFSPPPTPPQVAETGKVGSAGGGAA 155

Query: 161 RFVKKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFW 220
           R   +NS S+  A++SP         +AEV + G  S            ++   PGPDFW
Sbjct: 156 RGSDRNSRSAARAQLSP---------SAEVQDSGGSSP------HNQSSSKLGTPGPDFW 200

Query: 221 SWSPPEDDDRDMRDVRD-LQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDP 279
           SW PP  D    ++    L+ ++K   + +  + ++EK R  D LP+PFE+   + K D 
Sbjct: 201 SWLPPVQDSSKQKESNTGLKPSKKLDTFSSQPDLLMEKERLADSLPLPFETAFFK-KEDR 259

Query: 280 LLPPFQSLLGVEKEEV-SETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGIN 338
            LPPFQS    E E V S  +L   + ++E      FS +AAE A AL +     + GI+
Sbjct: 260 SLPPFQSF--AEPENVDSSADL---AADKEDTFEEQFSKNAAEVARALSESVGKPSHGIH 314

Query: 339 PDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGR 398
            DGS WWKE G+E+RPDGVVC+WT+ RGVSAD A+EW++K+WEA+D   HKELGSEKSGR
Sbjct: 315 LDGSMWWKEVGVERRPDGVVCKWTVIRGVSADGAVEWEDKYWEASDRFDHKELGSEKSGR 374

Query: 399 DATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWA 456
           DA GNVWRE+W ESMWQ+   G++H+EK ADKWG+NG G++WQE+W+EHYD++GK EKWA
Sbjct: 375 DAAGNVWREYWKESMWQDYTCGVMHMEKNADKWGQNGKGEQWQEQWFEHYDSTGKTEKWA 434

Query: 457 HKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDEN 516
            KWCS+DPNT LD GHAHVWHERWGEKYDG GGS KYTDKWAER EGDGWSKWGDKWDE+
Sbjct: 435 DKWCSLDPNTPLDVGHAHVWHERWGEKYDGLGGSEKYTDKWAERSEGDGWSKWGDKWDEH 494

Query: 517 FDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYE 576
           FD N HGVKQGETWWAGK+G+RWNRTWGERHNGSGWVHKYG+SSSGE WDTH  Q+TWYE
Sbjct: 495 FDLNGHGVKQGETWWAGKHGDRWNRTWGERHNGSGWVHKYGRSSSGEHWDTHAPQDTWYE 554

Query: 577 RFPHFGFYHCFDNSVQLREVRK 598
           RFPHFGFYHCF+NS QLR V++
Sbjct: 555 RFPHFGFYHCFENSPQLRSVKR 576


>gi|357156312|ref|XP_003577413.1| PREDICTED: uncharacterized protein LOC100825548 [Brachypodium
           distachyon]
          Length = 627

 Score =  547 bits (1409), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 315/559 (56%), Positives = 383/559 (68%), Gaps = 22/559 (3%)

Query: 53  SYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDV 112
           SYLDMW+KAVDR+R+  E   +A  L  S     +           + +++  F ++L V
Sbjct: 64  SYLDMWKKAVDRERRSAE---LAYRLQSSPPPPADPEAEASAPAPDVARRTARFEEMLRV 120

Query: 113 SKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGA 172
            +EERDR+QR QVIDRAAAA+AAARA+L++               E         E SG 
Sbjct: 121 PREERDRVQRTQVIDRAAAALAAARAVLKDPPQQNPPPPPPPPMQEQKPGTDVELEGSGD 180

Query: 173 AEIS---PFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTP------APGPDFWSWS 223
              S   P   +  S+  AEV    A S    VP +G              PGPDFWSW 
Sbjct: 181 GLGSWKAPGGSDWSSSSLAEVEPPPAPSQSAKVPNTGDSSPSKQSSSKLGTPGPDFWSWL 240

Query: 224 PPEDDDRDMRDVRD-LQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLP 282
           PP ++  + R+    L+ ++K+  + +  + ++EK RS D L +PF +   E K D  LP
Sbjct: 241 PPVENSSEPRESNTGLKPSKKAESFSSQPD-LLEKERSADFLSLPFVTSFFEKKEDRSLP 299

Query: 283 PFQSLLGVEKEEV-SETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDG 341
           PFQS    E E V SE     P+ + E      FS +AAEAA AL   DE ++ GI+PDG
Sbjct: 300 PFQSF--AEPENVDSEVK---PAADAEEVFETQFSKNAAEAARALSTSDEKSSHGIDPDG 354

Query: 342 SRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDAT 401
           S+WWKETG+EQRPDGV+C+WT+ RGVSAD A+E+++K+WEA+D   HKELGSEKSGRDA 
Sbjct: 355 SKWWKETGVEQRPDGVICKWTVIRGVSADGAVEFEDKYWEASDRFEHKELGSEKSGRDAR 414

Query: 402 GNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKW 459
           GNVWRE+W ESMW++   GL+H+EKTADKWGKNG G++WQE+WWE YD+SGKAEKWA KW
Sbjct: 415 GNVWREYWKESMWEDSTSGLMHMEKTADKWGKNGKGEQWQEQWWEQYDSSGKAEKWADKW 474

Query: 460 CSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDP 519
           CS+DPNT LD GHAHVWHERWGE YDG GGS+KYTDKWAER EGDGWSKWGDKWDE+FDP
Sbjct: 475 CSLDPNTPLDVGHAHVWHERWGETYDGSGGSVKYTDKWAERSEGDGWSKWGDKWDEHFDP 534

Query: 520 NSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFP 579
           N HGVKQGETWW GKYG+RWNRTWGE HNGSGWVHKYG+SSSGE WDTHE QETWYER+P
Sbjct: 535 NGHGVKQGETWWEGKYGDRWNRTWGEGHNGSGWVHKYGRSSSGEHWDTHEPQETWYERYP 594

Query: 580 HFGFYHCFDNSVQLREVRK 598
           HFGF HCF+NSVQLR V K
Sbjct: 595 HFGFDHCFENSVQLRSVPK 613


>gi|125534904|gb|EAY81452.1| hypothetical protein OsI_36623 [Oryza sativa Indica Group]
          Length = 566

 Score =  540 bits (1390), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 308/572 (53%), Positives = 391/572 (68%), Gaps = 31/572 (5%)

Query: 57  MWQKAVDRDRKEIEFQKIAGSLAESGDVDGNE------GGGGRDLTEQLEKKSEEFSKIL 110
           MW+KAV+R+R+  E   IA  L +S              G        +E+++  F ++L
Sbjct: 1   MWKKAVERERRSAE---IAHRLQQSSSAAAAAVKEEEGEGKAAAAAGDVERRTARFEEML 57

Query: 111 DVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAE-------VSRFV 163
            V +EERDR+QR QVIDRAAAA+AAARA+L++       +  S+   E        +  +
Sbjct: 58  RVPREERDRVQRRQVIDRAAAALAAARAVLKDPPPPPPPSPPSTPPQEREQQQKPAATAI 117

Query: 164 KKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTP------APGP 217
           +  SES   +  +P  ++  ++    V E    +A + VP SG              PGP
Sbjct: 118 QAGSESGLVSRTAP-GESDRASPPPPVTETATEAAKVSVPDSGDSSPFKKSSSKLGTPGP 176

Query: 218 DFWSWSPPEDDDRDMRDV-RDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPK 276
           DFWSW PP ++   + ++   L+ +EK   +    + ++EK +S DIL +PFE+   + K
Sbjct: 177 DFWSWLPPVENSTKLGEIDTGLKPSEKLDSFAGQPDLLMEKEQSEDILSLPFETSFFK-K 235

Query: 277 PDPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRG 336
            D  LPPFQS    E  E SE ++   + + E      FS +AAEAA AL   +E ++ G
Sbjct: 236 EDRSLPPFQSFAEPENVE-SEPSI---TADAEETFEDQFSKNAAEAARALSASNEKSSHG 291

Query: 337 INPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKS 396
           + PDGS WWKETG+EQRPDGV C+WT+ RGVSAD A+EW++K+WEA+D   HKELGSEKS
Sbjct: 292 VRPDGSLWWKETGVEQRPDGVTCKWTVIRGVSADGAVEWEDKYWEASDRFDHKELGSEKS 351

Query: 397 GRDATGNVWREFWTESMWQN--QGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEK 454
           GRDATGNVWRE+W ESMWQ+   G++H+EKTADKWG+NG G++WQE+WWEHYD+SGKAEK
Sbjct: 352 GRDATGNVWREYWKESMWQDFTCGVMHMEKTADKWGQNGKGEQWQEQWWEHYDSSGKAEK 411

Query: 455 WAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWD 514
           WA KWCS+DPNT LD GHAHVWHERWGEKYDG GGS KYTDKWAER EGDGWSKWGDKWD
Sbjct: 412 WADKWCSLDPNTPLDVGHAHVWHERWGEKYDGCGGSAKYTDKWAERSEGDGWSKWGDKWD 471

Query: 515 ENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETW 574
           E+FDPN HGVKQGETWWAGKYG+RWNRTWGE HN +GWVHKYG+SSSGE WDTH  Q+TW
Sbjct: 472 EHFDPNGHGVKQGETWWAGKYGDRWNRTWGEHHNCTGWVHKYGRSSSGEHWDTHVPQDTW 531

Query: 575 YERFPHFGFYHCFDNSVQLREVRKPSEFQEEP 606
           YERFPHFGF HCF+NSVQLR V++ +    +P
Sbjct: 532 YERFPHFGFEHCFNNSVQLRSVKRQTPKNTKP 563


>gi|115486049|ref|NP_001068168.1| Os11g0586300 [Oryza sativa Japonica Group]
 gi|113645390|dbj|BAF28531.1| Os11g0586300, partial [Oryza sativa Japonica Group]
          Length = 537

 Score =  512 bits (1318), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 268/453 (59%), Positives = 330/453 (72%), Gaps = 15/453 (3%)

Query: 163 VKKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTP------APG 216
           ++  SES   +  +P  ++  ++    V E    +A + VP SG              PG
Sbjct: 88  IQAGSESGLVSRTAP-GESDRASPPPPVTETATEAAKVSVPDSGDSSPFKKSSSKLGTPG 146

Query: 217 PDFWSWSPPEDDDRDMRDV-RDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEP 275
           PDFWSW PP ++   + ++   L+ +EK   +    + ++EK +S DIL +PFE+   + 
Sbjct: 147 PDFWSWLPPVENSTKLGEIDTGLKPSEKLDSFAGQPDLLMEKEQSEDILSLPFETSFFK- 205

Query: 276 KPDPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATR 335
           K D  LPPFQS    E  E SE ++   + + E      FS +AAEAA AL   DE ++ 
Sbjct: 206 KEDRSLPPFQSFAEPENVE-SEPSI---TADAEETFEDQFSKNAAEAARALSASDEKSSH 261

Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
           G+ PDGS WWKETG+EQRPDGV C+WT+ RGVSAD A+EW++K+WEA+D   HKELGSEK
Sbjct: 262 GVRPDGSLWWKETGVEQRPDGVTCKWTVIRGVSADGAVEWEDKYWEASDRFDHKELGSEK 321

Query: 396 SGRDATGNVWREFWTESMWQN--QGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAE 453
           SGRDATGNVWRE+W ESMWQ+   G++H+EKTADKWG+NG G++WQE+WWEHYD+SGKAE
Sbjct: 322 SGRDATGNVWREYWKESMWQDFTCGVMHMEKTADKWGQNGKGEQWQEQWWEHYDSSGKAE 381

Query: 454 KWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKW 513
           KWA KWCS+DPNT LD GHAHVWHERWGEKYDG GGS KYTDKWAER EGDGWSKWGDKW
Sbjct: 382 KWADKWCSLDPNTPLDVGHAHVWHERWGEKYDGCGGSAKYTDKWAERSEGDGWSKWGDKW 441

Query: 514 DENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQET 573
           DE+FDPN HGVKQGETWWAGKYG+RWNRTWGE HN +GWVHKYG+SSSGE WDTH  Q+T
Sbjct: 442 DEHFDPNGHGVKQGETWWAGKYGDRWNRTWGEHHNCTGWVHKYGRSSSGEHWDTHVPQDT 501

Query: 574 WYERFPHFGFYHCFDNSVQLREVRKPSEFQEEP 606
           WYERFPHFGF HCF+NSVQLR V++ +    +P
Sbjct: 502 WYERFPHFGFEHCFNNSVQLRSVKRQTPKNTKP 534


>gi|302764726|ref|XP_002965784.1| hypothetical protein SELMODRAFT_21913 [Selaginella moellendorffii]
 gi|300166598|gb|EFJ33204.1| hypothetical protein SELMODRAFT_21913 [Selaginella moellendorffii]
          Length = 522

 Score =  492 bits (1267), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 281/559 (50%), Positives = 359/559 (64%), Gaps = 59/559 (10%)

Query: 53  SYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDV 112
           SYL MW+ A +R  +E + Q+ A ++ +    D +     R++  Q EK   +F+++LDV
Sbjct: 1   SYLSMWKNAKERYERE-QLQQNASTVQQDRQPDQD-----REIQSQREK---DFARLLDV 51

Query: 113 SKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGA 172
            +EERDR+ RLQVIDRAAAA+AAA A+             +S     S  V+K  E + A
Sbjct: 52  PQEERDRVHRLQVIDRAAAALAAAEAL------------LASRPRAPSTIVEKKWEEAAA 99

Query: 173 AEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDM 232
                        G     + G L   +F+P          +PGPDFW+W+PP       
Sbjct: 100 ------------RGWEGTKKLGKLQTNLFLP-----ATTVVSPGPDFWTWTPPPPPSPVE 142

Query: 233 RDVRDLQMAEKSSVYPTPV-NPVVEKARSVDILPIPFESK--------LSEPKPDPLLPP 283
                  ++ K+S   T   N V+EK R    L +PFE++        + + +  P LPP
Sbjct: 143 DPSEKAALSPKTSQSETQASNSVLEKEREAQTLELPFETENARSVLPLVFQSRAAPSLPP 202

Query: 284 FQSLLGVEKEEVSETN----LETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINP 339
            QSL+ + KE V+ T      E P+   ERD  A    H       L      +T G+NP
Sbjct: 203 LQSLVEI-KENVAATRKKQITEVPTAVLERDKLADSVVH-----QDLQTNKTKSTTGVNP 256

Query: 340 DGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRD 399
           DGSRWWKETG E R +GVVC W++TRGVS++  +EW+EKFWEA D+  +KELGSEKSGRD
Sbjct: 257 DGSRWWKETGEEDRGNGVVCSWSVTRGVSSEGVVEWEEKFWEACDDFDYKELGSEKSGRD 316

Query: 400 ATGNVWREFWTESMWQN--QGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAH 457
           A+GNVWREFW E++WQ+   GL+H+EK+A+KWGKNG G +W EKWWEHYDASG+AEKWA 
Sbjct: 317 ASGNVWREFWKETIWQDAKSGLLHMEKSAEKWGKNGTGAQWDEKWWEHYDASGRAEKWAD 376

Query: 458 KWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENF 517
           KW  IDPNT L+ GH HVWHERWGE++DG GG+MKYTDKWAER +  GW+KWGDKWDE F
Sbjct: 377 KWSVIDPNTPLEPGHGHVWHERWGEEFDGQGGAMKYTDKWAERSDFGGWTKWGDKWDERF 436

Query: 518 DPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYER 577
           D N  G KQGETWWAG  G+RWNRTWGE+HNG+GWVHKYG SSSGE WDTHE+QETWYE+
Sbjct: 437 DKNGVGKKQGETWWAGTNGDRWNRTWGEQHNGTGWVHKYGSSSSGEFWDTHEEQETWYEK 496

Query: 578 FPHFGFYHCFDNSVQLREV 596
           FPHFGF+HC +NS +L +V
Sbjct: 497 FPHFGFHHCMENSQELHKV 515


>gi|302805366|ref|XP_002984434.1| hypothetical protein SELMODRAFT_11669 [Selaginella moellendorffii]
 gi|300147822|gb|EFJ14484.1| hypothetical protein SELMODRAFT_11669 [Selaginella moellendorffii]
          Length = 526

 Score =  489 bits (1259), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 282/559 (50%), Positives = 359/559 (64%), Gaps = 55/559 (9%)

Query: 53  SYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDV 112
           SYL MW+ A +R  +E + Q+ A ++ +    D +     R++  Q EK   +F+++LDV
Sbjct: 1   SYLSMWKNAKERYERE-QLQQHASTVQQDRQPDQD-----REIQSQREK---DFARLLDV 51

Query: 113 SKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGA 172
            +EERDR+ RLQVIDRAAAA+AAA A+             +S     S   +K  E + A
Sbjct: 52  PQEERDRVHRLQVIDRAAAALAAAEAL------------LASRPQAPSTIAEKKWEEAAA 99

Query: 173 AEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDM 232
                  K        ++P R  ++       S  P     +PGPDFW+WSPP       
Sbjct: 100 RGWEGTKK------LGDLPRRSVVA-------SAEPATTVVSPGPDFWTWSPPPPPSPVE 146

Query: 233 RDVRDLQMAEKSSVYPTPV-NPVVEKARSVDILPIPFESK--------LSEPKPDPLLPP 283
                  ++ K+S   T V N V+EK R    L +PFE++        + + +  P LPP
Sbjct: 147 DPSEKAALSPKTSQSETQVSNSVLEKEREAQTLELPFETENARSVLPLVFQSRAAPSLPP 206

Query: 284 FQSLLGVEKEEVSETN----LETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINP 339
            QSL+ + KE V+ T      E P+   ERD  A    H       L      +T G+NP
Sbjct: 207 LQSLVEI-KENVAATRKKQITEVPTAVLERDKLADSVVH-----QDLQTNKTKSTTGVNP 260

Query: 340 DGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRD 399
           DGSRWWKETG E R +GVVC W++TRGVS++  +EW+EKFWEA D+  +KELGSEKSGRD
Sbjct: 261 DGSRWWKETGEEDRGNGVVCSWSVTRGVSSEGVVEWEEKFWEACDDFDYKELGSEKSGRD 320

Query: 400 ATGNVWREFWTESMWQN--QGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAH 457
           A+GNVWREFW E++WQ+   GL+H+EK+A+KWGKNG G +W EKWWEHYDASG+AEKWA 
Sbjct: 321 ASGNVWREFWKETIWQDAKSGLLHMEKSAEKWGKNGTGAQWDEKWWEHYDASGRAEKWAD 380

Query: 458 KWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENF 517
           KW  IDPNT L+ GH HVWHERWGE++DG GG+MKYTDKWAER E  GW+KWGDKWDE F
Sbjct: 381 KWSVIDPNTPLEPGHGHVWHERWGEEFDGQGGAMKYTDKWAERSEFGGWTKWGDKWDERF 440

Query: 518 DPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYER 577
           D N  G KQGETWWAG  G+RWNRTWGE+HNG+GWV KYG SSSGE WDTHE+QETWYE+
Sbjct: 441 DKNGIGKKQGETWWAGTNGDRWNRTWGEQHNGTGWVRKYGSSSSGEFWDTHEEQETWYEK 500

Query: 578 FPHFGFYHCFDNSVQLREV 596
           FPHFGF+HC +NS +L +V
Sbjct: 501 FPHFGFHHCMENSQELHKV 519


>gi|7263564|emb|CAB81601.1| putative protein [Arabidopsis thaliana]
          Length = 497

 Score =  460 bits (1184), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 259/457 (56%), Positives = 322/457 (70%), Gaps = 36/457 (7%)

Query: 38  RTGAKVGVSNSEGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTE 97
           RTG ++   ++EG  SYLDMW+ AVDR++KE  F+KIA ++     VDG +  GG     
Sbjct: 49  RTGVRILRVSNEGRESYLDMWKNAVDREKKEKAFEKIAENVVA---VDGEKEKGG----- 100

Query: 98  QLEKKSEEFSKILDVSKEERDRIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTA 157
            LEKKS+EF KIL+VS EERDRIQR+QV+DRAAAAI+AARAIL   N    K G      
Sbjct: 101 DLEKKSDEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAILASNNSGDGKEG------ 154

Query: 158 EVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGP 217
                   N +++  +E++   KN++          G  S  ++VPRS T G  TP  GP
Sbjct: 155 ------FPNEDNTVTSEVTETPKNAK---------LGMWSRTVYVPRSETSGTETP--GP 197

Query: 218 DFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKP 277
           DFWSW+PP+  +  +  V DLQ  EK + +PT  NPV+EK +S D L IP+ES LS  + 
Sbjct: 198 DFWSWTPPQGSE--ISSV-DLQAVEKPAEFPTLPNPVLEKDKSADSLSIPYESMLSSERH 254

Query: 278 DPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGI 337
              +PPF+SL+ V KE  +ET   + +L  E DL  + SA+A E A  LD +DE +T G+
Sbjct: 255 SFTIPPFESLIEVRKE--AETKPSSETLSTEHDLDLISSANAEEVARVLDSLDESSTHGV 312

Query: 338 NPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSG 397
           + DG +WWK+TG+E+RPDGVVCRWTM RGV+AD  +EWQ+K+WEA+D+ G KELGSEKSG
Sbjct: 313 SEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEASDDFGFKELGSEKSG 372

Query: 398 RDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAH 457
           RDATGNVWREFW ESM Q  G+VH+EKTADKWGK+G GDEWQEKWWEHYDA+GK+EKWAH
Sbjct: 373 RDATGNVWREFWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWWEHYDATGKSEKWAH 432

Query: 458 KWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
           KWCSID NT LDAGHAHVWHERWGEKYDG GGS KYT
Sbjct: 433 KWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYT 469


>gi|449530534|ref|XP_004172249.1| PREDICTED: uncharacterized protein LOC101231355, partial [Cucumis
           sativus]
          Length = 453

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 261/480 (54%), Positives = 312/480 (65%), Gaps = 30/480 (6%)

Query: 1   MASQLSHYPRATGHRANPPLIFTTRRTTPQQINFWSRRT--GAKVGVSNSEGGGSYLDMW 58
           M  +L   PR T H   P L        P Q +   R       + +  S+ G SYL MW
Sbjct: 2   MPLRLPLSPRPTLHHHFPRLYHHNFLLLPLQPHIQIRHATPARTLRIRASDEGESYLGMW 61

Query: 59  QKAVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDVSKEERD 118
           + AV+R RK +EFQK+  +    G+ D N G    D   QLEKKSEEFSKIL V  EERD
Sbjct: 62  KNAVERQRKAVEFQKVVENT--EGNDDRNAGDPSSD---QLEKKSEEFSKILQVPPEERD 116

Query: 119 RIQRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGAAEISPF 178
           RIQR+QVI RAAAAIAAARA++ E     V + ++         V  NS           
Sbjct: 117 RIQRMQVIHRAAAAIAAARALVGETGTLAVGDSDTC--------VNLNS----------- 157

Query: 179 VKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDMRDVRDL 238
             N E     E       S    +P   T  + TP  GPDFWSW+PP DDD +     +L
Sbjct: 158 -TNDEGLLDREEALSEFQSENALLPEFETSQSWTP--GPDFWSWTPPPDDDGNDNAFGEL 214

Query: 239 QMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSET 298
           Q   KS  YP   N V EK R +D L IPF+S++SE   +PLLPPFQSL+G+EK E SET
Sbjct: 215 QPLGKSQAYPKLSNFVEEKERPIDFLSIPFQSEISE-SVNPLLPPFQSLVGMEKLESSET 273

Query: 299 NLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVV 358
           + ET SLEE+ ++G  FS HAAEA+ AL  VD+ +T+GI+PDGSRWWKETGIEQRPDGV+
Sbjct: 274 STETHSLEEDENVGIEFSVHAAEASQALSSVDKESTKGIDPDGSRWWKETGIEQRPDGVI 333

Query: 359 CRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQG 418
           C+WT+TRGVSAD A EWQ K+WEAADE G+KELGSEKSGRDA GNVWRE+W ESM Q QG
Sbjct: 334 CKWTLTRGVSADLATEWQNKYWEAADEFGYKELGSEKSGRDAYGNVWREYWRESMRQEQG 393

Query: 419 LVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHE 478
           LVHLEKTADKWG NG+G EWQEKWWE+Y+ SG+AEK AHKWC IDPNT +D GHAH+W+E
Sbjct: 394 LVHLEKTADKWGINGSGTEWQEKWWEYYNTSGQAEKNAHKWCKIDPNTYVDPGHAHIWNE 453


>gi|388514461|gb|AFK45292.1| unknown [Lotus japonicus]
          Length = 243

 Score =  414 bits (1063), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 196/242 (80%), Positives = 219/242 (90%)

Query: 363 MTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLVHL 422
           MTRGVSAD+A+EWQEKFWEA+DE+G+KELGSEKSGRDA+GNVW EFW ESM +  GL+H+
Sbjct: 1   MTRGVSADKAVEWQEKFWEASDEVGYKELGSEKSGRDASGNVWHEFWRESMHEENGLMHM 60

Query: 423 EKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGE 482
           EKTADKWG NG G+EWQEKWWE Y+ASG+AEKWAHKWCSIDPNT L+AGHAHVWHERWGE
Sbjct: 61  EKTADKWGSNGQGNEWQEKWWERYNASGQAEKWAHKWCSIDPNTPLEAGHAHVWHERWGE 120

Query: 483 KYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRT 542
            YDG+GGS KYTDKWAER +  GW KWGDKWDENFD N HG+KQGETWW GK+GERWNRT
Sbjct: 121 TYDGYGGSTKYTDKWAERSQDGGWEKWGDKWDENFDLNGHGIKQGETWWEGKHGERWNRT 180

Query: 543 WGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPSEF 602
           WGE+ NGSGWVHKYGKSSSGE WDTHE Q+TWYERFPHFGF+HC++NSVQLREV KPSE 
Sbjct: 181 WGEQRNGSGWVHKYGKSSSGEHWDTHEGQDTWYERFPHFGFFHCYENSVQLREVPKPSEI 240

Query: 603 QE 604
           Q+
Sbjct: 241 QD 242


>gi|168049231|ref|XP_001777067.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162671510|gb|EDQ58060.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 267

 Score =  389 bits (1000), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 190/267 (71%), Positives = 220/267 (82%), Gaps = 3/267 (1%)

Query: 330 DELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHK 389
           D   T G++ DGSRWWKETG+E R +GV C WT+ RGVSAD ++EW+EKFWEAAD    K
Sbjct: 1   DASDTSGVHEDGSRWWKETGVENRANGVTCTWTVMRGVSADGSVEWEEKFWEAADAYDFK 60

Query: 390 ELGSEKSGRDATGNVWREFWTESMWQN--QGLVHLEKTADKWGKNGNGDEWQEKWWEHYD 447
           ELG+EKSGRDA+G VWREFW ESMWQ+   GL+H++K+ADKW K+G+G +W EKW E YD
Sbjct: 61  ELGAEKSGRDASGGVWREFWQESMWQDATTGLMHIQKSADKWAKDGHGGQWHEKWMEKYD 120

Query: 448 ASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCE-GDGW 506
           ASG+AEKWA KW  ID  T L+ GHAHVWHERWGE+YDG GGSMKYTDKWAER E G GW
Sbjct: 121 ASGRAEKWADKWSQIDLTTPLEPGHAHVWHERWGEEYDGQGGSMKYTDKWAERLESGGGW 180

Query: 507 SKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWD 566
           +KWGDKWDE FD N HGVKQGETWW G +GERWNRTWGE HNGSGWVHKYG+SSSGE WD
Sbjct: 181 TKWGDKWDERFDQNGHGVKQGETWWEGLHGERWNRTWGEGHNGSGWVHKYGQSSSGEHWD 240

Query: 567 THEQQETWYERFPHFGFYHCFDNSVQL 593
           TH Q+ET+Y+R+PHFGF  CF+NS +L
Sbjct: 241 THSQEETFYDRYPHFGFRECFENSREL 267


>gi|308801813|ref|XP_003078220.1| RNA polymerase II transcription elongation factor DSIF/SUPT5H/SPT5
           (ISS) [Ostreococcus tauri]
 gi|116056671|emb|CAL52960.1| RNA polymerase II transcription elongation factor DSIF/SUPT5H/SPT5
           (ISS) [Ostreococcus tauri]
          Length = 480

 Score =  214 bits (546), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 142/398 (35%), Positives = 212/398 (53%), Gaps = 37/398 (9%)

Query: 215 PGPDFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSE 274
           PG DFW+WSPPE +D         +   + S        V E +     L + F+S +  
Sbjct: 65  PGSDFWTWSPPEVEDNGPTPKLQRKTETRVSAAVAEAERVPEAS-----LQLKFQSDIET 119

Query: 275 PKPDPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKV-DELA 333
           PK   L   F+S   VE E                 LGA  +A   E A A+ ++  +  
Sbjct: 120 PKE--LKLEFESDGVVELEPAP--------------LGATPTAELEETATAVRELGTDGE 163

Query: 334 TRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGS 393
           T G+  +GSRWW+E+G ++   G +CRWT+ RG SAD ++EW+EK+WE +D   ++ELG+
Sbjct: 164 TEGVLDNGSRWWRESGEDELAGGRLCRWTLVRGASADGSVEWEEKWWETSDAFNYRELGA 223

Query: 394 EKSGRDATGNVWREFWTESMWQN------QGLVHLEKTADKWGKNGNGDEWQEKWWEHYD 447
            KSGRDA+GNVW+E W E +  +          H+ + A+KWG   +G EW E W E+Y 
Sbjct: 224 IKSGRDASGNVWQESWREHITHDTTTGFSNASKHIMREANKWGAQADGTEWHEVWDENYW 283

Query: 448 ASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERC---EGD 504
             G+ ++   K  +I      + GH + W  +WGE++DGHGG +K+ D +A+R    +G 
Sbjct: 284 GDGRVKRTCTKKGAIGSGISPEDGHGNRWTHKWGEEWDGHGGCVKWNDSYADRDKSEDGG 343

Query: 505 GWSKWGDKWDENFDPNSH----GVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSS 560
               WG++W+E +   +H    G + G T W  + G ++ +TWGE H   G VHKYG ++
Sbjct: 344 SGRSWGERWEERWGSFAHNGSAGTRNGST-WDDRDGHKFEKTWGEEHWHDGRVHKYGSTT 402

Query: 561 SG-ELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
            G + WDT E  + W+ER P FG+     +S QL  VR
Sbjct: 403 DGSDGWDTWEDSQGWWERAPSFGWDEAVSHSPQLLSVR 440


>gi|115450547|ref|NP_001048874.1| Os03g0133300 [Oryza sativa Japonica Group]
 gi|113547345|dbj|BAF10788.1| Os03g0133300, partial [Oryza sativa Japonica Group]
          Length = 371

 Score =  211 bits (536), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 144/362 (39%), Positives = 200/362 (55%), Gaps = 48/362 (13%)

Query: 250 PVNPVVEKARSVDILP-IPFESKLSE---------PKPDPLLPPFQSLLGVEKEEVSETN 299
           P     E  RS+  +P +PF S  S          P+  P  P  QS             
Sbjct: 9   PTTAATEPVRSLTAMPSLPFPSPRSRRQWKQQNFYPRCTPRGPAPQSR------------ 56

Query: 300 LETPSLEEERDLGALFSAHAAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVV 358
            +TP    +RD G      A+E    ++ +DE +   G N DGS W++E+G ++  +G  
Sbjct: 57  -DTPP---KRDTGI-----ASEKEWGINLLDEAVKESGTNEDGSTWYRESGDDRGDNGYR 107

Query: 359 CRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ- 417
           CRW    G S D   EW+E +WE +D  G+KELG+EKSG++  G+ W E W E ++Q++ 
Sbjct: 108 CRWARMGGQSHDGTTEWKETWWEKSDWTGYKELGAEKSGKNGEGDSWWEKWKEVLYQDEW 167

Query: 418 -GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHV 475
             L  +E++A+K  K+G  +  W EKWWE YDA G  EK AHK+  ++  +         
Sbjct: 168 SNLARIERSAEKQAKSGAENAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS--------- 218

Query: 476 WHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKY 535
           W ERWGE YDG G  +K+TDKWAE   G   +KWGDKW+E F     G +QGETW     
Sbjct: 219 WWERWGEHYDGRGFVLKWTDKWAETDLG---TKWGDKWEEKFFAGI-GSRQGETWHVSPG 274

Query: 536 GERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLRE 595
           G+RW+RTWGE H G+G VHKYGKS++GE WD    +ET+YE  PH+G+     +S QL  
Sbjct: 275 GDRWSRTWGEEHFGNGKVHKYGKSTTGESWDLVVDEETYYEAEPHYGWADVVGDSTQLLS 334

Query: 596 VR 597
           ++
Sbjct: 335 IQ 336


>gi|326518522|dbj|BAJ88290.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 428

 Score =  210 bits (534), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 133/305 (43%), Positives = 182/305 (59%), Gaps = 21/305 (6%)

Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
           A+E    ++ +DE +   G N DGS W++E+G +   +G  CRW    G + D   EW+E
Sbjct: 126 ASEKEWGINLLDEAVKESGTNEDGSTWYRESGEDLGENGYRCRWARMGGQTHDGTTEWKE 185

Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
            +WE +D  G+KELG+EKSG++A G+ W E W E + Q++   L  LEK+A+K  K+G  
Sbjct: 186 TWWEKSDWTGYKELGAEKSGKNAEGDSWWEKWKEVLHQDEWSNLARLEKSAEKQAKSGIE 245

Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
           +  W EKWWE YDA G  EK AHK+  ++  +         W ERWGE YDG G  +K+T
Sbjct: 246 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWERWGEHYDGRGSVLKWT 296

Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
           DKWAE   G   ++WGDKW+E F     G +QGETW A   G+RW+RTWGE H G+G VH
Sbjct: 297 DKWAETDLG---TRWGDKWEEKFFAGI-GSRQGETWHASPGGDRWSRTWGEEHFGNGKVH 352

Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREV----RKPSEFQEEPFEIQ 610
           KYGKS++GE WD   ++ET+YE  PH+G+     +S QL  +    R P  F    F   
Sbjct: 353 KYGKSTTGESWDLVVEEETYYEADPHYGWADVVGDSSQLLSIQPVERPPGVFPTIDFSSS 412

Query: 611 DKRSE 615
             R+E
Sbjct: 413 PPRTE 417


>gi|242042337|ref|XP_002468563.1| hypothetical protein SORBIDRAFT_01g048110 [Sorghum bicolor]
 gi|241922417|gb|EER95561.1| hypothetical protein SORBIDRAFT_01g048110 [Sorghum bicolor]
          Length = 348

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 128/283 (45%), Positives = 177/283 (62%), Gaps = 17/283 (6%)

Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
           A+E    ++  DE +   GIN DGS W++E+G +   +G  CRWT   G + D + EW+E
Sbjct: 45  ASEKEWGINLPDEAVKESGINEDGSTWYRESGEDVGENGYRCRWTRMGGQNHDGSTEWKE 104

Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
            +WE +D  G+KELG+EKSG++A G+ W E W E ++Q++   L  +E++A+K  K+G  
Sbjct: 105 TWWEKSDWTGYKELGAEKSGKNAEGDSWWEKWKEVLYQDEWSNLARIERSAEKQAKSGVE 164

Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
           +  W EKWWE YDA G  EK AHK+  ++  +         W ERWGE YDG G  +K+T
Sbjct: 165 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWERWGEHYDGRGFVLKWT 215

Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
           DKWAE   G   +KWGDKW+E F     G +QGETW     GERW+RTWGE H G+G VH
Sbjct: 216 DKWAETDLG---TKWGDKWEEKFFAGI-GSRQGETWHVSPGGERWSRTWGEEHFGNGKVH 271

Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
           KYGKS++GE WD    +ET+YE  PH+G+     +S QL  ++
Sbjct: 272 KYGKSTTGESWDLVVDEETYYEAEPHYGWADVVGDSTQLLSIQ 314


>gi|225459860|ref|XP_002285931.1| PREDICTED: uncharacterized protein LOC100252277 [Vitis vinifera]
          Length = 432

 Score =  209 bits (533), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 124/264 (46%), Positives = 164/264 (62%), Gaps = 16/264 (6%)

Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
           G N DGS W++E+G +   +G  CRWT   G S D + EW+E +WE +D  G+KELG EK
Sbjct: 148 GTNEDGSAWYRESGEDLGENGYRCRWTRMGGQSHDGSSEWKEMWWEKSDWTGYKELGVEK 207

Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
           SGR+A G+ W E W E + Q++   L  +E++A K  K+G  +  W EKWWE YDA G  
Sbjct: 208 SGRNAEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEKWWEKYDAKGST 267

Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
           EK AHK+  ++  +         W E+WGE YDG G  +K+TDKWAE   G   +KWGDK
Sbjct: 268 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 315

Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
           W+E F     G +QGETW     G+RW+RTWGE H G+G VHKYGKS++GE WD    +E
Sbjct: 316 WEEKFFAGI-GSRQGETWHLSPSGDRWSRTWGEEHFGNGKVHKYGKSTTGESWDIVVDEE 374

Query: 573 TWYERFPHFGFYHCFDNSVQLREV 596
           T+YE  PH+G+     NS QL  +
Sbjct: 375 TYYEAEPHYGWADVVGNSSQLLSI 398


>gi|413956953|gb|AFW89602.1| hypothetical protein ZEAMMB73_256684 [Zea mays]
          Length = 450

 Score =  209 bits (531), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 131/297 (44%), Positives = 179/297 (60%), Gaps = 21/297 (7%)

Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
           A+E    ++ +DE +   GIN DGS W++E+G +   +G  CRW    G + D + EW+E
Sbjct: 145 ASEKEWGINLLDEAVKESGINEDGSTWYRESGEDTGENGYRCRWARMGGQNHDGSTEWKE 204

Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
            +WE +D  G+KELG+EKSG++A G+ W E W E ++Q++   L  +EK+A+K  K+G  
Sbjct: 205 TWWEKSDWTGYKELGAEKSGKNAEGDSWWEKWKEVLYQDEWSNLARIEKSAEKQAKSGAE 264

Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
           +  W EKWWE YDA G  EK AHK+  ++  +         W ERWGE YDG G  +K+T
Sbjct: 265 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWERWGEHYDGRGFVLKWT 315

Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
           DKWAE   G   +KWGDKW+E F     G +QGETW      ERW+RTWGE H G+G VH
Sbjct: 316 DKWAETDLG---TKWGDKWEEKFFAGI-GSRQGETWHVSPGRERWSRTWGEEHFGNGKVH 371

Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREV----RKPSEFQEEPF 607
           KYGKS++GE WD    +ET+YE  PH+G+     +S QL  +    R P  F    F
Sbjct: 372 KYGKSTTGESWDLVVDEETYYEAEPHYGWADVVGDSTQLLSIQPVERPPGVFPAIDF 428


>gi|125606382|gb|EAZ45418.1| hypothetical protein OsJ_30067 [Oryza sativa Japonica Group]
          Length = 401

 Score =  209 bits (531), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 135/323 (41%), Positives = 187/323 (57%), Gaps = 29/323 (8%)

Query: 279 PLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDE-LATRGI 337
           P LPP     G      S     +PSL        L +  A+E    ++ +DE +   G 
Sbjct: 69  PPLPP-----GAGVAPASPQCRHSPSL-------LLDTGIASEKEWGINLLDEAVKESGT 116

Query: 338 NPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSG 397
           N DGS W++E+G ++  +G  CRW    G S D   EW+E +WE +D  G+KELG+EKSG
Sbjct: 117 NEDGSTWYRESGDDRGDNGYRCRWARMGGQSHDGTTEWKETWWEKSDWTGYKELGAEKSG 176

Query: 398 RDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKAEK 454
           ++  G+ W E W E ++Q++   L  +E++A+K  K+G  +  W EKWWE YDA G  EK
Sbjct: 177 KNGEGDSWWEKWKEVLYQDEWSNLARIERSAEKQAKSGAENAGWYEKWWEKYDAKGWTEK 236

Query: 455 WAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWD 514
            AHK+  ++  +         W ERWGE YDG G  +K+TDKWAE   G   +KWGDKW+
Sbjct: 237 GAHKYGRLNEQS---------WWERWGEHYDGRGFVLKWTDKWAETDLG---TKWGDKWE 284

Query: 515 ENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETW 574
           E F     G +QGETW     G+RW+RTWGE H G+G VHKYGKS++GE WD    +ET+
Sbjct: 285 EKFFAGI-GSRQGETWHVSPGGDRWSRTWGEEHFGNGKVHKYGKSTTGESWDLVVDEETY 343

Query: 575 YERFPHFGFYHCFDNSVQLREVR 597
           YE  PH+G+     +S QL  ++
Sbjct: 344 YEAEPHYGWADVVGDSTQLLSIQ 366


>gi|255539182|ref|XP_002510656.1| conserved hypothetical protein [Ricinus communis]
 gi|223551357|gb|EEF52843.1| conserved hypothetical protein [Ricinus communis]
          Length = 428

 Score =  208 bits (529), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 124/265 (46%), Positives = 163/265 (61%), Gaps = 16/265 (6%)

Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
           G N DGS W++E+G +   +G  CRWT   G S D   EW+E +WE +D  G+KELG EK
Sbjct: 144 GTNEDGSTWYRESGEDLGDNGFRCRWTRMGGRSHDATSEWKETWWEKSDWTGYKELGVEK 203

Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
           SGR+A G+ W E W E + Q++   L  +E++A K  K+G  +  W EKWWE YDA G  
Sbjct: 204 SGRNAEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWT 263

Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
           EK AHK+  ++  +         W E+WGE YDG G  +K+TDKWAE   G   +KWGDK
Sbjct: 264 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 311

Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
           W+E F     G +QGETW     GERW+RTWGE H G+G VHKYGKS++GE WD    +E
Sbjct: 312 WEEKFFAGI-GSRQGETWHVSPGGERWSRTWGEEHFGNGKVHKYGKSTTGESWDIVVDEE 370

Query: 573 TWYERFPHFGFYHCFDNSVQLREVR 597
           T YE  PH+G+     +S QL  ++
Sbjct: 371 TCYEAEPHYGWADVVGDSTQLLSIK 395


>gi|168036746|ref|XP_001770867.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677926|gb|EDQ64391.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 359

 Score =  207 bits (528), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 127/274 (46%), Positives = 174/274 (63%), Gaps = 22/274 (8%)

Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
           G+N DGS W+ E+G++   +G  CRWT+  G SAD + EW+E +WE +D  G+KELG+EK
Sbjct: 58  GVNEDGSSWYSESGVDLGENGYRCRWTVMGGRSADGSSEWKEAWWEKSDWTGYKELGAEK 117

Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
           SG++A G+ W E W E +  ++   L  +EK+A K  K+G G   W EKWWE Y+A G +
Sbjct: 118 SGKNAQGDTWWETWQEVLRVDELSNLARIEKSAQKQAKSGTGSAGWFEKWWEKYNAKGWS 177

Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
           EK AHK+  ++     D G    W E+W E+YDG G  +K+TDKWAE   G   +KWGDK
Sbjct: 178 EKGAHKYGRLN-----DQG----WWEKWEEQYDGRGAVLKWTDKWAESDTG---TKWGDK 225

Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
           W+E FD +  G +QGETW   + G  W+RTWGE H G+G VHKYG+S+SGE WD   ++ 
Sbjct: 226 WEEKFD-HGVGTRQGETWHNDEKG--WSRTWGEEHFGNGKVHKYGRSTSGENWDNVVEEG 282

Query: 573 TWYERFPHFGFYHCFDNSVQLREV----RKPSEF 602
           T+Y+  PH+G+     NSVQL  +    R P  F
Sbjct: 283 TYYQAEPHYGWADAIGNSVQLLSIQPLERPPGTF 316


>gi|414864653|tpg|DAA43210.1| TPA: hypothetical protein ZEAMMB73_868366 [Zea mays]
          Length = 448

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 143/379 (37%), Positives = 203/379 (53%), Gaps = 59/379 (15%)

Query: 248 PTPVNPVVEKARSVDILPIPFES-----KLSEPKPDPLLPPFQSLLGVEKEEVSETNLET 302
           P+   P   +     + P+PF +     ++ +P   P   P  S         +  + +T
Sbjct: 67  PSAAAPGRRRTSLTAMPPLPFPAPRSRRQVKQPDFYPRCTPRGS---------APQSRDT 117

Query: 303 PSLEEERDLGALFSAHAAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRW 361
           P    +RD G      A+E    ++ +DE +   GIN DGS W++E+G +   +G  CRW
Sbjct: 118 PP---KRDTGI-----ASEKEWGINLLDEAVKESGINEDGSTWYRESGDDIGENGYRCRW 169

Query: 362 TMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ---- 417
               G S D + EW+E +WE +D  G+KELG+EKSGR+A G+ W E W E ++Q++    
Sbjct: 170 ARMGGQSHDGSTEWKETWWEKSDWTGYKELGAEKSGRNAEGDSWWEKWKEVLYQDEWSQK 229

Query: 418 ------------------GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKAEKWAHK 458
                              L  +E++A+K  K+G  +  W EKWWE YDA G  EK AHK
Sbjct: 230 LEQSHLRGSDIVYLVEYSNLARIERSAEKQAKSGIENAGWYEKWWEKYDAKGWTEKGAHK 289

Query: 459 WCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFD 518
           +  ++  +         W ERWGE YDG G  +K+TDKWAE   G   +KWGDKW+E F 
Sbjct: 290 YGRLNEQS---------WWERWGEHYDGRGFVLKWTDKWAETDLG---TKWGDKWEEKFF 337

Query: 519 PNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERF 578
               G +QGETW     GERW+RTWGE H G+G VHKYGKS++GE WD    +ET+YE  
Sbjct: 338 AGI-GSRQGETWHVCPGGERWSRTWGEEHFGNGKVHKYGKSTTGERWDLVVDEETYYEAE 396

Query: 579 PHFGFYHCFDNSVQLREVR 597
           PH+G+     +S QL  ++
Sbjct: 397 PHYGWADVVGDSTQLLSIQ 415


>gi|125542273|gb|EAY88412.1| hypothetical protein OsI_09872 [Oryza sativa Indica Group]
          Length = 431

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 125/283 (44%), Positives = 175/283 (61%), Gaps = 17/283 (6%)

Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
           A+E    ++ +DE +   G N DGS W++E+G ++  +G  CRW    G S D   EW+E
Sbjct: 127 ASEKEWGINLLDEAVKESGTNEDGSTWYRESGDDRGDNGYRCRWARMGGQSHDGTTEWKE 186

Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
            +WE +D  G+KELG+EKSG++  G+ W E W E ++Q++   L  +E++A+K  K+G  
Sbjct: 187 TWWEKSDWTGYKELGAEKSGKNGAGDSWWEKWKEVLYQDEWSNLARIERSAEKQAKSGAE 246

Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
           +  W EKWWE YDA G  EK AHK+  ++  +         W ERWGE YDG G  +K+T
Sbjct: 247 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWERWGEHYDGRGFVLKWT 297

Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
           DKWAE   G   +KWGDKW+E F     G +QGETW     G+RW+RTWGE H G+G VH
Sbjct: 298 DKWAETDLG---TKWGDKWEEKFFAGI-GSRQGETWHVSPGGDRWSRTWGEEHFGNGKVH 353

Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
           KYGKS++GE WD    +ET+YE  PH+G+     +S QL  ++
Sbjct: 354 KYGKSTTGESWDLVVDEETYYEAEPHYGWADVVGDSTQLLSIQ 396


>gi|302792138|ref|XP_002977835.1| hypothetical protein SELMODRAFT_107335 [Selaginella moellendorffii]
 gi|300154538|gb|EFJ21173.1| hypothetical protein SELMODRAFT_107335 [Selaginella moellendorffii]
          Length = 367

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 125/278 (44%), Positives = 168/278 (60%), Gaps = 21/278 (7%)

Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
           G N DGS W++E G +   +G  CR+T+  G S+D + EW+E +WE  D  G+KELG+EK
Sbjct: 97  GTNEDGSTWFRECGEDLGENGYRCRYTVMGGRSSDGSTEWKETWWEKCDWTGYKELGAEK 156

Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
           SG++A G+ W E W E + Q++   L  +E+TA K  K GNG+  W EKWWE Y+A G  
Sbjct: 157 SGKNAGGDAWWETWQEILRQDELSNLARIERTAQKQAKQGNGEAGWYEKWWEKYNAKGWT 216

Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
           EK AHK+  ++  +         W E+WGE+YDG G  +K+TDKWAE   G+   KWGDK
Sbjct: 217 EKGAHKYGRLNEQS---------WWEKWGEQYDGRGAVLKWTDKWAENATGE---KWGDK 264

Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
           W+E F  N  G +QGETW +    E W+RTWGE H G G VHKYGKS+SGE WD+   + 
Sbjct: 265 WEEKFQ-NGAGTRQGETWHSAN-AESWSRTWGEEHFGDGKVHKYGKSTSGENWDSVVTET 322

Query: 573 TWYERFPHFGFYHCFDNSVQLREV----RKPSEFQEEP 606
           T Y   PH+G+      S QL  +    R P  + + P
Sbjct: 323 TVYNAEPHYGWVDAIGQSTQLLSIEPRPRPPGVYPDLP 360


>gi|22758282|gb|AAN05510.1| Hypothetical protein [Oryza sativa Japonica Group]
 gi|108706037|gb|ABF93832.1| expressed protein [Oryza sativa Japonica Group]
          Length = 431

 Score =  207 bits (526), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 125/283 (44%), Positives = 175/283 (61%), Gaps = 17/283 (6%)

Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
           A+E    ++ +DE +   G N DGS W++E+G ++  +G  CRW    G S D   EW+E
Sbjct: 127 ASEKEWGINLLDEAVKESGTNEDGSTWYRESGDDRGDNGYRCRWARMGGQSHDGTTEWKE 186

Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
            +WE +D  G+KELG+EKSG++  G+ W E W E ++Q++   L  +E++A+K  K+G  
Sbjct: 187 TWWEKSDWTGYKELGAEKSGKNGEGDSWWEKWKEVLYQDEWSNLARIERSAEKQAKSGAE 246

Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
           +  W EKWWE YDA G  EK AHK+  ++  +         W ERWGE YDG G  +K+T
Sbjct: 247 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWERWGEHYDGRGFVLKWT 297

Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
           DKWAE   G   +KWGDKW+E F     G +QGETW     G+RW+RTWGE H G+G VH
Sbjct: 298 DKWAETDLG---TKWGDKWEEKFFAGI-GSRQGETWHVSPGGDRWSRTWGEEHFGNGKVH 353

Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
           KYGKS++GE WD    +ET+YE  PH+G+     +S QL  ++
Sbjct: 354 KYGKSTTGESWDLVVDEETYYEAEPHYGWADVVGDSTQLLSIQ 396


>gi|424513213|emb|CCO66797.1| predicted protein [Bathycoccus prasinos]
          Length = 657

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 145/400 (36%), Positives = 202/400 (50%), Gaps = 33/400 (8%)

Query: 214 APGPDFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPTPVNPVVEKARS--VDILPIPFESK 271
           A G DFWSW+PPE   R   D     + +      T +   V+ A       L + F+S+
Sbjct: 258 ATGMDFWSWTPPE---RKKSDTDGAPVPKLQKQMQTRIEQAVQVAERGVTQTLNLDFQSQ 314

Query: 272 LSEPKPDPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSAHAAEAAHALDKVDE 331
           +       L   F+S    + +EV            E D  A   A  A A   L    E
Sbjct: 315 VQAKGTKELPLAFESQTATQADEV------------EADQRAPTEAEFASAVRELGADGE 362

Query: 332 LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKEL 391
             T G   DG+RWW+E G  +  +G VC WT+ RG SAD ++EW+EK+W  AD   +KEL
Sbjct: 363 --THGELSDGTRWWREAGTSELENGRVCEWTLVRGQSADGSVEWEEKWWSTADAFDYKEL 420

Query: 392 GSEKSGRDATGNVWREFWTESMWQN--QGLV-----HLEKTADKWGKNGNGDEWQEKWWE 444
           G+ KSGRD  GNVW+E W+E    +  +G        +E++A+KWG N +G EW E W E
Sbjct: 421 GAVKSGRDGHGNVWQESWSEISCSDVSRGFFTDASKKIERSANKWGANASGAEWHEDWRE 480

Query: 445 HYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEG- 503
            Y   G  ++   K   +  N   + GHA  W+  W EK+DGHGG MK  D WA+R  G 
Sbjct: 481 AYWGDGVVDRECFKKSCVGKNEIPEDGHASRWNHNWKEKWDGHGGCMKTNDSWADRDVGE 540

Query: 504 DGWS--KWGDKWDENFDPNSHGVKQGE---TWWAGKYGERWNRTWGERHNGSGWVHKYGK 558
           DG S   WG++W E +   +   +QGE   + W  + G + ++ WGE H   G V KYG 
Sbjct: 541 DGGSGRSWGERWSERWGSYASHGRQGEREGSTWNDRDGHKVSKDWGEEHWPDGRVRKYGH 600

Query: 559 SSSG-ELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
           SS G + WD  E  + W+ER P FG+    ++S QL  ++
Sbjct: 601 SSDGSDHWDVWEDTDGWWERHPSFGWAEAVNHSPQLMGIK 640


>gi|302141667|emb|CBI18870.3| unnamed protein product [Vitis vinifera]
          Length = 347

 Score =  206 bits (523), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 124/264 (46%), Positives = 164/264 (62%), Gaps = 16/264 (6%)

Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
           G N DGS W++E+G +   +G  CRWT   G S D + EW+E +WE +D  G+KELG EK
Sbjct: 63  GTNEDGSAWYRESGEDLGENGYRCRWTRMGGQSHDGSSEWKEMWWEKSDWTGYKELGVEK 122

Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
           SGR+A G+ W E W E + Q++   L  +E++A K  K+G  +  W EKWWE YDA G  
Sbjct: 123 SGRNAEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEKWWEKYDAKGST 182

Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
           EK AHK+  ++  +         W E+WGE YDG G  +K+TDKWAE   G   +KWGDK
Sbjct: 183 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 230

Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
           W+E F     G +QGETW     G+RW+RTWGE H G+G VHKYGKS++GE WD    +E
Sbjct: 231 WEEKFFAGI-GSRQGETWHLSPSGDRWSRTWGEEHFGNGKVHKYGKSTTGESWDIVVDEE 289

Query: 573 TWYERFPHFGFYHCFDNSVQLREV 596
           T+YE  PH+G+     NS QL  +
Sbjct: 290 TYYEAEPHYGWADVVGNSSQLLSI 313


>gi|449452965|ref|XP_004144229.1| PREDICTED: uncharacterized protein LOC101214256 [Cucumis sativus]
          Length = 429

 Score =  204 bits (520), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 122/270 (45%), Positives = 166/270 (61%), Gaps = 16/270 (5%)

Query: 330 DELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHK 389
           + ++  G N DGS W++E+G +   +G  CRWT   G S D   EW+E +WE +D  G+K
Sbjct: 139 ENVSESGTNEDGSTWYRESGEDLGENGYRCRWTRMGGQSHDGYSEWKETWWEKSDWTGYK 198

Query: 390 ELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHY 446
           ELG EKSG++  G+ W E W E + Q++   L  +E++A K  K+G  +  W EKWWE Y
Sbjct: 199 ELGVEKSGKNVEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWHEKWWEKY 258

Query: 447 DASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGW 506
           DA G  EK AHK+  ++  +         W E+WGE YDG G  +K+TDKWAE   G   
Sbjct: 259 DAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG--- 306

Query: 507 SKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWD 566
           +KWGDKW+E F  +  G +QGETW     GERW+RTWGE H G+G VHKYGKS++GE WD
Sbjct: 307 TKWGDKWEEKFF-SGIGSRQGETWHVSPSGERWSRTWGEEHFGNGKVHKYGKSTTGESWD 365

Query: 567 THEQQETWYERFPHFGFYHCFDNSVQLREV 596
               +ET+YE  PH+G+     +S QL  +
Sbjct: 366 IVVDEETYYEAEPHYGWADVVGDSSQLLSI 395


>gi|308811228|ref|XP_003082922.1| RNA polymerase II transcription elongation factor DSIF/SUPT5H/SPT5
           (ISS) [Ostreococcus tauri]
 gi|116054800|emb|CAL56877.1| RNA polymerase II transcription elongation factor DSIF/SUPT5H/SPT5
           (ISS) [Ostreococcus tauri]
          Length = 501

 Score =  204 bits (519), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 160/479 (33%), Positives = 242/479 (50%), Gaps = 55/479 (11%)

Query: 139 ILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERGALSA 198
           ++    G    +G +SG+A   R     + +SGAA           +G      R A  +
Sbjct: 20  VVTTGGGEGATSGSASGSA--PRSTGTTTGASGAA-----------DGAVMKGGRSANDS 66

Query: 199 GI--FVPRSGTPGNRTPAPGPDFWSWSPPEDDDRD-MRDVRDLQMAEKSSVYPTPVNPVV 255
           G+     ++  P   T  PG DFW+W+PPE    D +      ++  ++    +    V 
Sbjct: 67  GLKNTAKKASKPKAETFNPGSDFWTWTPPEAAGSDKVAAAAAPKLQRQTETRVSAAVAVA 126

Query: 256 EKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSETNLETPSLEEE----RDL 311
           E+A     L + F+S +    P  L   F+S + + KE    ++  T  LEE     R+L
Sbjct: 127 ERAPEAS-LQLKFQSDVE--TPSELKLEFESDVVLAKEPAPLSDTPTVELEETATAVREL 183

Query: 312 GALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADE 371
           GA                 +  T G   +GSRWW+E+G E+   G +CRWT+ RG SAD 
Sbjct: 184 GA-----------------DGETEGTLENGSRWWRESGEEELEGGKLCRWTLVRGASADG 226

Query: 372 ALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQN------QGLVHLEKT 425
           ++EW+EK+WE +D   ++ELG+ KSGRDA GNVW+E W E +  +          H+ + 
Sbjct: 227 SVEWEEKWWETSDAFNYRELGAIKSGRDAKGNVWQESWREQVTHDTTTGFSNASKHIMRE 286

Query: 426 ADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYD 485
           A+KWG   +G EW E W E+Y   G+ ++   K  +I      D GH + W  +WGE++D
Sbjct: 287 ANKWGAQADGAEWHEVWDENYWGDGQVKRTCTKKGAIANGATPDDGHGNRWTHKWGEEWD 346

Query: 486 GHGGSMKYTDKWAERCEG-DGWS--KWGDKWDENFDPNSH----GVKQGETWWAGKYGER 538
           GHGG +K+TD +A+R +  DG S   WG+KW+E +   +H    G + G T W  + G +
Sbjct: 347 GHGGCVKWTDSFADRDQSEDGGSGRAWGEKWEERWGGYAHNGSAGNRNGST-WDDRDGHK 405

Query: 539 WNRTWGERHNGSGWVHKYGKSSSG-ELWDTHEQQETWYERFPHFGFYHCFDNSVQLREV 596
           + +TWGE H   G VHK+G ++ G + WDT E    W+ER P FG+     +S QL  V
Sbjct: 406 FEKTWGEEHWHDGRVHKWGATTDGSDGWDTWEDSAGWWERAPSFGWDEAVSHSPQLLNV 464


>gi|356517440|ref|XP_003527395.1| PREDICTED: uncharacterized protein LOC100788155 [Glycine max]
          Length = 449

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 124/279 (44%), Positives = 166/279 (59%), Gaps = 20/279 (7%)

Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
           G N DGS W++E+G E   +G  CRWT   G S D + EW+E +WE +D  G+KELG EK
Sbjct: 163 GTNEDGSTWYRESGEELGENGYKCRWTRMGGQSHDGSSEWKETWWEKSDWTGYKELGVEK 222

Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
           SGR++ G+ W E W E++ Q++   +  +E++A K  K+G  +  W EKWWE YDA G  
Sbjct: 223 SGRNSEGDTWWETWQENLHQDEWSNIARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWT 282

Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
           EK AHK+  ++  +         W E+WGE YDG G  +K+TDKWAE   G   +KWGDK
Sbjct: 283 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 330

Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
           W+E F     G + GETW      ERW+RTWGE H G+G VHKYG S++GE WD    +E
Sbjct: 331 WEERFF-KGIGSRHGETWHVSPSSERWSRTWGEEHFGNGKVHKYGNSTTGESWDIVVDEE 389

Query: 573 TWYERFPHFGFYHCFDNSVQLREV----RKPSEFQEEPF 607
           T+YE  PH+G+     +S QL  +    R P  F    F
Sbjct: 390 TYYEAEPHYGWADVVGDSSQLLSIEPRERPPGVFPNLDF 428


>gi|357114188|ref|XP_003558882.1| PREDICTED: uncharacterized protein LOC100823512 [Brachypodium
           distachyon]
          Length = 429

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 130/303 (42%), Positives = 181/303 (59%), Gaps = 18/303 (5%)

Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
           A+E    ++ +DE +   G N DGS W++E+G +   +G   RW    G + D ++EW+E
Sbjct: 127 ASEKEWGINLLDEAVKESGTNEDGSTWYRESGEDVGENGYRSRWARMGGQTHDGSVEWKE 186

Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
            +WE +D  G+KELG+EKSG++A G+ W E W E + Q++   L  +E++A+K  K+G  
Sbjct: 187 TWWEKSDWTGYKELGAEKSGKNAEGDSWWEKWKEVLHQDEWSNLARIERSAEKQAKSGAE 246

Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
           +  W EKWWE YDA G  EK AHK+  ++  +         W ERWGE YDG G  +K+T
Sbjct: 247 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWERWGEHYDGRGSVLKWT 297

Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
           DKWAE   G   ++WGDKW+E F     G +QGETW A   G+RW+RTWGE H G+G VH
Sbjct: 298 DKWAETDLG---TRWGDKWEEKFFAGI-GSRQGETWHASIGGDRWSRTWGEEHYGNGKVH 353

Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPSEFQEEPFEIQDKRS 614
           KYGKS++GE WD    +ET YE  PH+G+     +S QL  + +P E     F   D  S
Sbjct: 354 KYGKSTTGESWDLVVDEETCYEAEPHYGWADVVGDSTQLLSI-QPVERPPGVFPTIDFSS 412

Query: 615 ELQ 617
             Q
Sbjct: 413 SPQ 415


>gi|297846712|ref|XP_002891237.1| hypothetical protein ARALYDRAFT_473738 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297337079|gb|EFH67496.1| hypothetical protein ARALYDRAFT_473738 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 422

 Score =  204 bits (518), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 125/283 (44%), Positives = 172/283 (60%), Gaps = 17/283 (6%)

Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
           A E    +D ++E +   G N DGS W++E+G +   +G  CRWT   G S D + EW E
Sbjct: 123 ANEKDWGIDLLNENVNESGTNEDGSSWFRESGHDLGDNGYRCRWTRMGGRSHDGSSEWTE 182

Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
            +WE +D  G+KELG EKSG++A G+ W E W E + Q++   L  +E++A K  K+G  
Sbjct: 183 TWWEKSDWTGYKELGVEKSGKNAEGDTWWETWQEVLHQDEWSNLARIERSAQKQAKSGTE 242

Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
           +  W EKWWE YDA G  EK AHK+  ++  +         W E+WGE YDG G  +K+T
Sbjct: 243 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWT 293

Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
           DKWAE   G   +KWGDKW+E F  +  G +QGETW      +RW+RTWGE H G+G VH
Sbjct: 294 DKWAETELG---TKWGDKWEEKF-FSGIGSRQGETWHVSPNSDRWSRTWGEEHFGNGKVH 349

Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
           KYGKS++GE WD    +ET+YE  PH+G+     +S QL  ++
Sbjct: 350 KYGKSTTGESWDIVVDEETYYEAEPHYGWADVVGDSTQLLSIQ 392


>gi|449489311|ref|XP_004158275.1| PREDICTED: uncharacterized protein LOC101230928 [Cucumis sativus]
          Length = 382

 Score =  203 bits (517), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 125/282 (44%), Positives = 172/282 (60%), Gaps = 17/282 (6%)

Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
           A E    ++ ++E ++  G N DGS W++E+G +   +G  CRWT   G S D   EW+E
Sbjct: 80  ANEKDWGINLLNENVSESGTNEDGSTWYRESGEDLGENGYRCRWTRMGGQSHDGYSEWKE 139

Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
            +WE +D  G+KELG EKSG++  G+ W E W E + Q++   L  +E++A K  K+G  
Sbjct: 140 TWWEKSDWTGYKELGVEKSGKNVEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTE 199

Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
           +  W EKWWE YDA G  EK AHK+  ++  +         W E+WGE YDG G  +K+T
Sbjct: 200 NAGWHEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWT 250

Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
           DKWAE   G   +KWGDKW+E F  +  G +QGETW     GERW+RTWGE H G+G VH
Sbjct: 251 DKWAETELG---TKWGDKWEEKFF-SGIGSRQGETWHVSPSGERWSRTWGEEHFGNGKVH 306

Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREV 596
           KYGKS++GE WD    +ET+YE  PH+G+     +S QL  +
Sbjct: 307 KYGKSTTGESWDIVVDEETYYEAEPHYGWADVVGDSSQLLSI 348


>gi|302830957|ref|XP_002947044.1| hypothetical protein VOLCADRAFT_103268 [Volvox carteri f.
           nagariensis]
 gi|300267451|gb|EFJ51634.1| hypothetical protein VOLCADRAFT_103268 [Volvox carteri f.
           nagariensis]
          Length = 647

 Score =  203 bits (517), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 120/275 (43%), Positives = 158/275 (57%), Gaps = 25/275 (9%)

Query: 340 DGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWE---------AADELGHKE 390
           DG+R+ K +G +  PDG V +W + RGV+ D  ++W+E +W+         A++  G +E
Sbjct: 363 DGTRFEKLSGTDTGPDGYVKKWEVLRGVTGDGQVQWEECWWQVWGLGLGGKASNRYGLRE 422

Query: 391 LGSEKSGRDATGNVWREFWTESMWQ---NQGLVHLEKTADKWGKNGNGDEWQEKWWEHYD 447
           LG+ K G   +G  W E W E ++    N  LV +E+TA KW ++ N DEW+EKW E ++
Sbjct: 423 LGAFKKGTAESGAAWVEEWKEVLYTHPTNLRLV-IERTAHKWARDENLDEWEEKWGECFE 481

Query: 448 ASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWS 507
            +G+  KWA KW     N         VWHERWGE YDG G   K+TDKWAER   DG  
Sbjct: 482 EAGRVHKWADKWAKAGSN---------VWHERWGEDYDGKGACQKWTDKWAERLLPDGGQ 532

Query: 508 -KWGDKWDENFDPNSHGVKQGETWWAGK-YGERWNRTWGERHNGSGWVHKYGKSSSGELW 565
            +WGDKW E F  +  G K GE W +    G R+NR W E H G G V K+G S+SGE W
Sbjct: 533 EQWGDKWTETFG-HGTGTKHGEVWSSSSSCGSRYNRWWNEEHYGDGRVRKWGNSTSGEHW 591

Query: 566 DTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPS 600
           DT E  +T+Y   PHFGF H   +S QL  V  PS
Sbjct: 592 DTVEHMDTYYNPVPHFGFQHAVGHSPQLWAVPLPS 626


>gi|356508780|ref|XP_003523132.1| PREDICTED: uncharacterized protein LOC100820367 isoform 1 [Glycine
           max]
          Length = 449

 Score =  203 bits (516), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 124/279 (44%), Positives = 166/279 (59%), Gaps = 20/279 (7%)

Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
           G N DGS W++E+G E   +G  CRWT   G S D + EW+E +WE +D  G+KELG EK
Sbjct: 163 GTNEDGSAWYRESGEELGENGYRCRWTRMGGQSHDGSSEWKETWWEKSDWTGYKELGVEK 222

Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
           SGR++ G+ W E W E++ Q++   +  +E++A K  K+G  +  W EKWWE YDA G  
Sbjct: 223 SGRNSEGDTWWETWQENLHQDEWSNIARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWT 282

Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
           EK AHK+  ++  +         W E+WGE YDG G  +K+TDKWAE   G   +KWGDK
Sbjct: 283 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 330

Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
           W+E F     G + GETW      ERW+RTWGE H G+G VHKYG S++GE WD    +E
Sbjct: 331 WEERFF-KGIGSRHGETWHVSPSSERWSRTWGEEHFGNGKVHKYGNSTTGESWDIVVDEE 389

Query: 573 TWYERFPHFGFYHCFDNSVQLREV----RKPSEFQEEPF 607
           T+YE  PH+G+     +S QL  +    R P  F    F
Sbjct: 390 TYYEAEPHYGWADVVGDSTQLLSIEPRERPPGVFPNLDF 428


>gi|356508782|ref|XP_003523133.1| PREDICTED: uncharacterized protein LOC100820367 isoform 2 [Glycine
           max]
          Length = 434

 Score =  202 bits (515), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 124/279 (44%), Positives = 166/279 (59%), Gaps = 20/279 (7%)

Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
           G N DGS W++E+G E   +G  CRWT   G S D + EW+E +WE +D  G+KELG EK
Sbjct: 148 GTNEDGSAWYRESGEELGENGYRCRWTRMGGQSHDGSSEWKETWWEKSDWTGYKELGVEK 207

Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
           SGR++ G+ W E W E++ Q++   +  +E++A K  K+G  +  W EKWWE YDA G  
Sbjct: 208 SGRNSEGDTWWETWQENLHQDEWSNIARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWT 267

Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
           EK AHK+  ++  +         W E+WGE YDG G  +K+TDKWAE   G   +KWGDK
Sbjct: 268 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 315

Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
           W+E F     G + GETW      ERW+RTWGE H G+G VHKYG S++GE WD    +E
Sbjct: 316 WEERFF-KGIGSRHGETWHVSPSSERWSRTWGEEHFGNGKVHKYGNSTTGESWDIVVDEE 374

Query: 573 TWYERFPHFGFYHCFDNSVQLREV----RKPSEFQEEPF 607
           T+YE  PH+G+     +S QL  +    R P  F    F
Sbjct: 375 TYYEAEPHYGWADVVGDSTQLLSIEPRERPPGVFPNLDF 413


>gi|240254218|ref|NP_174971.5| uncharacterized protein [Arabidopsis thaliana]
 gi|332193795|gb|AEE31916.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 426

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 123/283 (43%), Positives = 172/283 (60%), Gaps = 17/283 (6%)

Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
           A E    +D ++E +   G N DGS W++E+G +   +G  CRW+   G S D + EW E
Sbjct: 127 ANEKDWGIDLLNENVNEAGTNEDGSSWFRESGHDLGDNGYRCRWSRMGGRSHDGSSEWTE 186

Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
            +WE +D  G+KELG EKSG+++ G+ W E W E + Q++   L  +E++A K  K+G  
Sbjct: 187 TWWEKSDWTGYKELGVEKSGKNSEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTE 246

Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
           +  W EKWWE YDA G  EK AHK+  ++  +         W E+WGE YDG G  +K+T
Sbjct: 247 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWT 297

Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
           DKWAE   G   +KWGDKW+E F  +  G +QGETW      +RW+RTWGE H G+G VH
Sbjct: 298 DKWAETELG---TKWGDKWEEKF-FSGIGSRQGETWHVSPNSDRWSRTWGEEHFGNGKVH 353

Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
           KYGKS++GE WD    +ET+YE  PH+G+     +S QL  ++
Sbjct: 354 KYGKSTTGESWDIVVDEETYYEAEPHYGWADVVGDSTQLLSIQ 396


>gi|334183065|ref|NP_001185147.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332193796|gb|AEE31917.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 409

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 123/283 (43%), Positives = 172/283 (60%), Gaps = 17/283 (6%)

Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
           A E    +D ++E +   G N DGS W++E+G +   +G  CRW+   G S D + EW E
Sbjct: 110 ANEKDWGIDLLNENVNEAGTNEDGSSWFRESGHDLGDNGYRCRWSRMGGRSHDGSSEWTE 169

Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNG 435
            +WE +D  G+KELG EKSG+++ G+ W E W E + Q++   L  +E++A K  K+G  
Sbjct: 170 TWWEKSDWTGYKELGVEKSGKNSEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTE 229

Query: 436 DE-WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYT 494
           +  W EKWWE YDA G  EK AHK+  ++  +         W E+WGE YDG G  +K+T
Sbjct: 230 NAGWYEKWWEKYDAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWT 280

Query: 495 DKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVH 554
           DKWAE   G   +KWGDKW+E F  +  G +QGETW      +RW+RTWGE H G+G VH
Sbjct: 281 DKWAETELG---TKWGDKWEEKF-FSGIGSRQGETWHVSPNSDRWSRTWGEEHFGNGKVH 336

Query: 555 KYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR 597
           KYGKS++GE WD    +ET+YE  PH+G+     +S QL  ++
Sbjct: 337 KYGKSTTGESWDIVVDEETYYEAEPHYGWADVVGDSTQLLSIQ 379


>gi|168037924|ref|XP_001771452.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677179|gb|EDQ63652.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 337

 Score =  198 bits (504), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 120/264 (45%), Positives = 168/264 (63%), Gaps = 18/264 (6%)

Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
           G+N DGS W+ E+G++   +G  CRWT+  G S D + EW+E +WE +D  G+KELG+EK
Sbjct: 34  GVNEDGSTWYNESGVDFGENGYRCRWTVMGGRSGDGSSEWKEAWWEKSDWTGYKELGAEK 93

Query: 396 SGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHYDASGKA 452
           +G++A G+ W E W E +  ++   L  +EK+A K  K+G G   W EKWWE Y+A G +
Sbjct: 94  TGKNAQGDTWWETWQEVLRVDELSNLARIEKSAQKQAKSGTGSAGWFEKWWEKYNAKGWS 153

Query: 453 EKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDK 512
           EK AHK+  ++     D G    W E+W E+YDG G  +K+TDKWAE   G   +KWGDK
Sbjct: 154 EKGAHKYGRLN-----DQG----WWEKWEEQYDGRGAVLKWTDKWAENGTG---TKWGDK 201

Query: 513 WDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQE 572
           W+E F+ +  G +QGETW     G  W+RTWGE H G+G VHKYG+S+SGE WD   ++ 
Sbjct: 202 WEEKFN-HGVGTRQGETWHNDDKG--WSRTWGEEHFGNGKVHKYGRSTSGENWDNIVEEG 258

Query: 573 TWYERFPHFGFYHCFDNSVQLREV 596
           T+Y+  PH+G+     NS QL  +
Sbjct: 259 TYYQAEPHYGWADAIGNSEQLLNI 282


>gi|224086016|ref|XP_002307779.1| predicted protein [Populus trichocarpa]
 gi|222857228|gb|EEE94775.1| predicted protein [Populus trichocarpa]
          Length = 328

 Score =  195 bits (495), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 119/270 (44%), Positives = 164/270 (60%), Gaps = 16/270 (5%)

Query: 330 DELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHK 389
           + ++  G N DGS W++E+G +   +G  CRWT   G S D++ +W+E +WE +D  G+K
Sbjct: 48  ENVSETGTNEDGSTWFRESGEDLGANGYRCRWTKMGGRSHDDSTQWEETWWEKSDWTGYK 107

Query: 390 ELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWWEHY 446
           ELG EKSGR+A G+ W E W E + Q++   L  +E++A K  K+G  +  W EKWWE Y
Sbjct: 108 ELGVEKSGRNAEGDSWWETWQEMLHQDEWSNLARIERSAQKQAKSGTENAGWYEKWWEKY 167

Query: 447 DASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGW 506
           DA G  EK A+K+  ++  +         W E+WGE YDG G   K+TDKWAE   G   
Sbjct: 168 DAKGWTEKGANKYGRLNEQS---------WWEKWGEHYDGRGSVTKWTDKWAETELG--- 215

Query: 507 SKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWD 566
           +KWGDKW+E F     G + GETW     G RW+RTWGE H G+G VHKYGKS++ E WD
Sbjct: 216 TKWGDKWEEKFFAGI-GSRHGETWHVSPIGGRWSRTWGEEHFGNGKVHKYGKSTTSESWD 274

Query: 567 THEQQETWYERFPHFGFYHCFDNSVQLREV 596
               +ET+YE  PH+G+     +S QL  +
Sbjct: 275 IVVDEETYYEAEPHYGWADVVGDSSQLLSI 304


>gi|159485472|ref|XP_001700768.1| hypothetical protein CHLREDRAFT_167685 [Chlamydomonas reinhardtii]
 gi|158281267|gb|EDP07022.1| predicted protein, partial [Chlamydomonas reinhardtii]
          Length = 350

 Score =  190 bits (483), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 123/326 (37%), Positives = 166/326 (50%), Gaps = 22/326 (6%)

Query: 263 ILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSETNLETP---SLEEERDLGALFSAHA 319
           +L  P  +  +             +L  E +  +    E P     EE  D        A
Sbjct: 41  VLAAPLRAVRTSESSSGTPLSLNDILVFESDMPAMLIQEQPPERKAEEIVDRAEEVVGRA 100

Query: 320 AEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKF 379
           A      D    LA      DG+R+ K +G +  PDG V +W + RGV+ D  ++W+E +
Sbjct: 101 ALGRQLADGAGRLA------DGTRFEKLSGTDTGPDGYVKKWEVLRGVTGDGTVQWEECW 154

Query: 380 WEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLVHL--EKTADKWGKNGNGDE 437
           W A++  G +E+G+ K G    G  W E W E ++ +   + L  E+TA KW ++ + DE
Sbjct: 155 WTASNRYGLREMGAFKKGSTEAGAAWVEEWKEVLYTHATNLRLVIERTAHKWARDESSDE 214

Query: 438 WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKW 497
           W+EKW E Y+ +G+  K+A KW     N         VWHERWGE YDG G   K+TDKW
Sbjct: 215 WEEKWGECYEEAGRVHKFADKWAKAGIN---------VWHERWGEDYDGRGACQKWTDKW 265

Query: 498 AERCEGDGWS-KWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKY 556
           AER   DG   +WGDKW E F     G K GE W AG  GER+NR W E H G G V ++
Sbjct: 266 AERLLPDGGQEQWGDKWTETFGAGK-GTKHGEVWSAGGGGERYNRWWNEEHYGDGRVRRW 324

Query: 557 GKSSSGELWDTHEQQETWYERFPHFG 582
           G S+SGE WD  E  +T+Y   PHFG
Sbjct: 325 GNSTSGEYWDGVEHMDTYYNPVPHFG 350


>gi|307104332|gb|EFN52586.1| hypothetical protein CHLNCDRAFT_8819, partial [Chlorella
           variabilis]
          Length = 234

 Score =  188 bits (478), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 116/241 (48%), Positives = 147/241 (60%), Gaps = 12/241 (4%)

Query: 346 KETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVW 405
           + +G E    G   RWT  RG     A++W EK+WE +D  G KELG+EK G +A G+ W
Sbjct: 3   RRSGEETGSHGYWYRWTEVRGCDETGAVQWYEKWWEVSDWRGMKELGAEKWGCNARGDAW 62

Query: 406 REFWTESMWQNQGLVH--LEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSID 463
           RE W E++    G     +E++A KW KNG G EW+EKW E Y ++G+A+KWA KW    
Sbjct: 63  RETWREAIVVEAGSTQPSVERSAHKWAKNGMGHEWEEKWGERYWSAGRADKWADKWAREG 122

Query: 464 PNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAER-CEGDGWSKWGDKWDENFDPNSH 522
                    A VWHE+WGE YDG GG +KYTDKWAER  EG    +WGDKW+ENF     
Sbjct: 123 ---------ADVWHEKWGENYDGSGGCVKYTDKWAEREVEGGAREQWGDKWEENFKDGRG 173

Query: 523 GVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFG 582
             +QGETW     GER+NR WGE H G   V K+G S++GE WD  EQ +T+Y   PHFG
Sbjct: 174 TKQQGETWSVSAGGERYNRWWGENHLGDRLVQKHGSSNTGEHWDVTEQMDTYYNPIPHFG 233

Query: 583 F 583
           +
Sbjct: 234 Y 234


>gi|224061899|ref|XP_002300654.1| predicted protein [Populus trichocarpa]
 gi|222842380|gb|EEE79927.1| predicted protein [Populus trichocarpa]
          Length = 328

 Score =  185 bits (470), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 123/283 (43%), Positives = 165/283 (58%), Gaps = 23/283 (8%)

Query: 327 DKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADEL 386
           +KV+E    G N DGS W++++G +   +G  CRW    G S D + +W+E +WE  D  
Sbjct: 48  EKVNE---SGTNEDGSSWFRKSGEDLGENGYRCRWKKMGGRSHDTSSQWEETWWEKGDWT 104

Query: 387 GHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEKWW 443
           G+KELG EKSGR+A G+ W E W E + Q++   L  +E++A K  K G  +  W EKWW
Sbjct: 105 GYKELGVEKSGRNAEGDTWWETWQEMLHQDEWSNLARIERSAQKQAKLGTENAGWYEKWW 164

Query: 444 EHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEG 503
           E YDA G  EK A+K+  ++  +         W E+WGE YDG G   K+TDKWAE   G
Sbjct: 165 EKYDAKGWTEKGANKYGRLNEQS---------WWEKWGEHYDGRGSVTKWTDKWAETELG 215

Query: 504 DGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGE 563
              +KWGDKW+E F     G + GETW     G  W+RTWGE H G+G VHKYGK ++GE
Sbjct: 216 ---TKWGDKWEEKFFAGI-GSRHGETWHGSPSGGGWSRTWGEEHLGNGKVHKYGKGTTGE 271

Query: 564 LWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPSEFQEEP 606
            WD    +ET+YE  PH+G+     +S QL  +    E QE P
Sbjct: 272 SWDIVVDEETYYEAEPHYGWADVVGDSSQLLSI----EPQERP 310


>gi|255071597|ref|XP_002499473.1| predicted protein [Micromonas sp. RCC299]
 gi|226514735|gb|ACO60731.1| predicted protein [Micromonas sp. RCC299]
          Length = 940

 Score =  185 bits (469), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 142/448 (31%), Positives = 203/448 (45%), Gaps = 68/448 (15%)

Query: 215 PGPDFWSWSPPEDDDRDMRDVRDL-----QMAEKSSVYPTPVNPVVEKARSV--DILPIP 267
           P  DFW WSPP                     +K+  Y   V   VE         L + 
Sbjct: 379 PASDFWEWSPPSMPAAGPAGAASSYYPAEMQRQKAPAYTRRVEAAVEVMERAPEQTLDLQ 438

Query: 268 FESKL------------------SEPKPDPLLPPFQSLLGVEKEEVSETN--LETPSLEE 307
           FE+ +                  S PK +P  P   S LG++  ++ +     +T ++ +
Sbjct: 439 FETTIEQQNATLPQFQSQVKPASSPPKVEPAKPASASPLGLQDAQLQQLREVYQTSTVSD 498

Query: 308 ERDLGA----------------LFSAHAAEAAHALDK------VDELATRGINPDGSRWW 345
              L A                  S+  A    AL        V++ A  G    G+RWW
Sbjct: 499 AAILAAEQRYDVDVAAPAPAGLAGSSVEASLEDALASPVRELGVEDGAKEGTLRSGARWW 558

Query: 346 KETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVW 405
           +E G E   DG V  WT+ RG SAD ++EW+EKFWE +D   ++ELG+ KSGRD+ G  W
Sbjct: 559 REEGKEYLEDGKVMSWTVIRGTSADGSVEWEEKFWETSDPFTYRELGAIKSGRDSNGQAW 618

Query: 406 REFWTESMWQNQG-LVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDP 464
           +E W E    +   L  + + A KW     G  W E W E Y A G  +++  K  S++ 
Sbjct: 619 QESWKELYNHDANQLPFIHREASKWSHTPKGKCWSEGWTEDYRADGVVDRYCEKTGSLED 678

Query: 465 NTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDG---------WS-KWGDKWD 514
               + GHA+ W ++WGEK+DG GG +K+TD WA R   +G         W  KW +KW 
Sbjct: 679 GAAPEDGHANRWTQKWGEKWDGQGGCIKWTDTWASRDHAEGGMANAPSRSWGEKWEEKWG 738

Query: 515 ENFDPNSH-GVKQGETW--WAGKYGERWNRTWGERHNGSGWVHKYGKSSSG-ELWDT-HE 569
           +N++ N   G++QG  W    G + E   RTWGE H   G +HKYG S+ G + WD   +
Sbjct: 739 DNYNENGRAGLRQGLAWDELGGNHKE---RTWGEEHYPDGRLHKYGNSNDGSQYWDEWCD 795

Query: 570 QQETWYERFPHFGFYHCFDNSVQLREVR 597
               W+E  P FG++    +S  L  VR
Sbjct: 796 GAGGWWETAPSFGWHEAIGHSPSLMNVR 823


>gi|303272745|ref|XP_003055734.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463708|gb|EEH60986.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 716

 Score =  184 bits (466), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 141/426 (33%), Positives = 203/426 (47%), Gaps = 51/426 (11%)

Query: 218 DFWSWSPPEDDDRDMRDVRDLQMAEKSSVYPT---------PVNPVVEKARSVDILPIPF 268
           DFW W+PPE    D            SS YP          P  P VE  R V ++    
Sbjct: 232 DFWDWTPPEAP-VDFTPQPGGSPTSASSYYPPKMQKKRLEFPTAPAVE--REVMLMERAP 288

Query: 269 ESKLSEPKP-----DPLLPPFQSLLGVEKEEVSE----TNLETPSLE-----------EE 308
           E  L + +      + +LP FQS++    +EVS+     NL                 E+
Sbjct: 289 ERTLPQFQSVVEAQNAVLPEFQSVVESGSDEVSQLTSAMNLAAAPGAPAPMMAAPATAEQ 348

Query: 309 RDLGALFSAHA----AEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMT 364
              GA   A      A A   L   +     G+   G+RWW+E G ++   G V  WT  
Sbjct: 349 IAAGASIEASIEDALASAVRELGAGEGAEKEGVLSSGARWWREEGEDKLEGGKVMSWTCI 408

Query: 365 RGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTES-MWQNQGLVHLE 423
           RG SAD A+EW+E++W+ +D   ++ELG+ KSGRD+ G  W+E W E  + +   + ++ 
Sbjct: 409 RGTSADGAVEWEERWWKTSDSFTYRELGAVKSGRDSNGQAWQESWKEMYVHEVNKIPYIH 468

Query: 424 KTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEK 483
           + A KW     G  W E W E Y A G  +++  K  +++     + GH + W E+WGEK
Sbjct: 469 REASKWSHTPKGAAWSEGWTEDYRADGTVDRFCEKTGALEDGAAPEDGHGNRWTEKWGEK 528

Query: 484 YDGHGGSMKYTDKWAERCEGDGWSK------WGDKWDENFDP--NSH---GVKQGETWWA 532
           +DGHGG +K+TD WA R   +G  +      WG+KW+E +    N H   G +QG T W 
Sbjct: 529 WDGHGGCIKWTDTWASRDHSEGGMENAPGRSWGEKWEEKWGEGYNEHGRAGSRQGLT-WD 587

Query: 533 GKYGERWNRTWGERHNGSGWVHKYGKSSSG-ELWDTHEQQE-TWYERFPHFGFYHCFDNS 590
              G+   ++WGE H   G +HKYG SS G + WDT E     W+ER P FG+     +S
Sbjct: 588 ETMGQHTLKSWGEEHYPDGRLHKYGNSSDGSQYWDTWEDGAGGWWERNPSFGWIEALHHS 647

Query: 591 VQLREV 596
             L  +
Sbjct: 648 PDLMNL 653


>gi|145344308|ref|XP_001416678.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576904|gb|ABO94971.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 285

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 113/275 (41%), Positives = 164/275 (59%), Gaps = 15/275 (5%)

Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
           G   +GSRWW+E+G E+   G +CRWT+ RG SAD ++EW+EK+WE +D   ++ELG+ K
Sbjct: 1   GTLENGSRWWRESGEEELEGGKLCRWTLVRGASADGSVEWEEKWWETSDAFNYRELGAIK 60

Query: 396 SGRDATGNVWREFWTESMWQN------QGLVHLEKTADKWGKNGNGDEWQEKWWEHYDAS 449
           SGRDA GNVW+E W E +  +          H+ + A+KWG   +G EW E W E+Y   
Sbjct: 61  SGRDAKGNVWQESWREQVTHDTTTGFSNASKHIMREANKWGAQADGAEWHEVWDENYWGD 120

Query: 450 GKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEG-DGWS- 507
           G+ ++   K  +I      D GH + W  +WGE++DGHGG +K+TD +A+R +  DG S 
Sbjct: 121 GQVKRTCTKKGAIANGATPDDGHGNRWTHKWGEEWDGHGGCVKWTDSFADRDQSEDGGSG 180

Query: 508 -KWGDKWDENFDPNSH----GVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSG 562
             WG+KW+E +   +H    G + G T W  + G ++ +TWGE H   G VHK+G ++ G
Sbjct: 181 RAWGEKWEERWGGYAHNGSAGNRNGST-WDDRDGHKFEKTWGEEHWHDGRVHKWGATTDG 239

Query: 563 -ELWDTHEQQETWYERFPHFGFYHCFDNSVQLREV 596
            + WDT E    W+ER P FG+     +S QL  V
Sbjct: 240 SDGWDTWEDSAGWWERAPSFGWDEAVSHSPQLLNV 274


>gi|384246267|gb|EIE19758.1| hypothetical protein COCSUDRAFT_19282 [Coccomyxa subellipsoidea
           C-169]
          Length = 271

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 119/266 (44%), Positives = 158/266 (59%), Gaps = 20/266 (7%)

Query: 335 RGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSE 394
           +G   DG+++ +E+G +  P+G   RWT  +GVSA   +EW+E++WE +D  G +ELG+E
Sbjct: 5   KGQLADGTKYLRESGEDFGPNGFWRRWTCLKGVSAAGKVEWEERWWEESDWAGMRELGAE 64

Query: 395 KSGRDATGNVWREFWTESMW--QNQGLVHLEKTADKWGKNGNGD-EWQEKWWEHYDASGK 451
           KSG  A G  W E W E++   Q  G   +E++A KW  +G    EW+E+W E Y + G+
Sbjct: 65  KSGCRADGAAWFETWREAIAFDQTNGEPIVERSAHKWACDGKVRCEWEERWGEQYWSLGR 124

Query: 452 AEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERC-EGDGWSKWG 510
           A K+A KW     N         VWHERWGE YDG GG       WAER  EG G  +WG
Sbjct: 125 ANKYADKWGKEGNN---------VWHERWGEDYDGDGGC------WAERLLEGGGNEQWG 169

Query: 511 DKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQ 570
           DKW+E F  N  G KQGETW     G+R  + WGE H G+GWV K+G SS+GE WD  EQ
Sbjct: 170 DKWEERF-KNGAGSKQGETWTVSAGGDRHQQWWGEDHFGNGWVRKHGNSSTGEQWDVSEQ 228

Query: 571 QETWYERFPHFGFYHCFDNSVQLREV 596
            +T+Y   PHFG+    D+S  L+ V
Sbjct: 229 MDTYYNPIPHFGYKLALDHSPTLKNV 254


>gi|303283944|ref|XP_003061263.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457614|gb|EEH54913.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 391

 Score =  159 bits (402), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 106/282 (37%), Positives = 154/282 (54%), Gaps = 25/282 (8%)

Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
           G++ DG+  ++++G++    G  CRWT+    + D + E +E  WE  D  G KELG+EK
Sbjct: 122 GVDADGNAVFRKSGVDVGDHGYRCRWTIQGRSAQDASWETRETHWEKCDASGFKELGAEK 181

Query: 396 SGRDATGNVWREFWTE-------SMWQNQGLVH---LEKTADKWGKNGNGDEWQEKWWEH 445
           SG +  G+ W E W E           + G +    +E++ADKW ++    EW EKWWE 
Sbjct: 182 SGFNEDGDTWWETWKEVYRVERDDRDDDSGPIRAEFIERSADKWARDKTNHEWHEKWWEQ 241

Query: 446 YDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGG-SMKYTDKWAERCEGD 504
           Y  SG  E+   K       + L    A  W E+WGE++   GG + K+TDKWA+   G 
Sbjct: 242 YSPSGYVERQVEK-------SGLHG--AQAWWEKWGEQHGADGGETRKWTDKWAQNGAG- 291

Query: 505 GWSKWGDKWDENFDPNS-HGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGE 563
             ++WGDKW+E F  +   G K+GETW     GERW+RTWGE  + SG V KYG+S++GE
Sbjct: 292 --TRWGDKWEERFSADCISGDKKGETWRVAASGERWSRTWGETIDSSGEVRKYGESTTGE 349

Query: 564 LWDTHEQQET-WYERFPHFGFYHCFDNSVQLREVRKPSEFQE 604
            WD  E  E  +Y+R P + +    + S +L  +  P E  E
Sbjct: 350 TWDKTETLEKFYYDRTPEYSWDDIKNISKRLLSIETPDEKDE 391


>gi|255079328|ref|XP_002503244.1| predicted protein [Micromonas sp. RCC299]
 gi|226518510|gb|ACO64502.1| predicted protein [Micromonas sp. RCC299]
          Length = 290

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 104/282 (36%), Positives = 150/282 (53%), Gaps = 42/282 (14%)

Query: 337 INPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKS 396
            + DG+ ++ ++G++    G  CRWT+T   + D   E++   WE AD  G+KELG+EKS
Sbjct: 13  TDDDGNAFFSKSGVDTGDGGYRCRWTVTGRTAKDGTWEYRATHWEKADWSGYKELGAEKS 72

Query: 397 G-RDATGNVWREFWTESMWQNQGLVH----------LEKTADKWGKNGNGDEWQEKWWEH 445
           G  DA+G+ W E W +   +  G             +E++ADKW ++ +  EWQEKWWE 
Sbjct: 73  GFDDASGDTWWETWRQVYRRENGDASGSSDTSGPALIERSADKWARDKHKKEWQEKWWER 132

Query: 446 YDASGKAEKWAHKWCSIDPNTQLDAGHAHV--WHERWGEKYD---GHGGSMKYTDKWAER 500
           Y  +G  E+   K           +G   V  W E+WGE+ D   G G  +K+TDKWAE 
Sbjct: 133 YSDAGLVERGVEK-----------SGRQGVQAWWEKWGEQRDDSDGGGDVIKWTDKWAEN 181

Query: 501 CEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSS 560
             G   ++WGDKW+E F  +  G K GETW     GERW+RTWGE     G +  YG+S+
Sbjct: 182 GAG---TRWGDKWEERFGADGSGKKVGETWRVNAGGERWSRTWGESVGSDGEIRTYGQST 238

Query: 561 SGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPSEF 602
           SGE WDT EQ  +              DNS +  + ++ +E+
Sbjct: 239 SGEQWDTTEQGNS------------SRDNSSRWEDAKEAAEY 268


>gi|308810753|ref|XP_003082685.1| unnamed protein product [Ostreococcus tauri]
 gi|116061154|emb|CAL56542.1| unnamed protein product, partial [Ostreococcus tauri]
          Length = 332

 Score =  142 bits (358), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 96/239 (40%), Positives = 131/239 (54%), Gaps = 19/239 (7%)

Query: 346 KETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVW 405
           K+ G+E    G   RW  T+    D      E  W  +D  G+KELG EKSG + TG  W
Sbjct: 104 KKRGVETGEGGYRSRWWKTKRECPDGRSGSSETRWAKSDFSGYKELGFEKSGFNETGETW 163

Query: 406 REFWTESMWQNQ--GLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSID 463
            E W E   ++   GL  +E++ADKW ++    EWQEKWWE Y A+G  E+   K     
Sbjct: 164 WETWREIYSRDDYTGLERIERSADKWARDAQSKEWQEKWWERYYANGAVERGLEK----- 218

Query: 464 PNTQLDAGHA--HVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNS 521
                 +G      W E+WGE+YDG G ++K++DKWAE   G G ++WGDKW+E      
Sbjct: 219 ------SGREVRQAWWEKWGEQYDGEGATLKWSDKWAE---GSG-TRWGDKWEERRSKFG 268

Query: 522 HGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPH 580
            G K GETW  G+ GER++RTWGE  +  G V K+G S++GE WDT  ++  +  R   
Sbjct: 269 SGRKSGETWRVGQDGERFSRTWGEVISPDGSVRKFGTSTTGESWDTTVKENVYLTRISR 327


>gi|412987666|emb|CCO20501.1| predicted protein [Bathycoccus prasinos]
          Length = 335

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 97/277 (35%), Positives = 141/277 (50%), Gaps = 47/277 (16%)

Query: 326 LDKVDELATRGINPD-GSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAAD 384
           +D+      +GI+P+ G+ W++E+G+++   G  CRWT+  G + D++ E++E  WE AD
Sbjct: 24  IDEEHTKKMKGIDPETGNSWFRESGVDRGEGGYRCRWTVKGGAAPDKSWEYRETHWEKAD 83

Query: 385 ELGHKELGSEKSGRDATGNVWREFWTESM----------------------------WQN 416
             G++ELG+EKSG +  G  W E W E                                 
Sbjct: 84  LSGYRELGAEKSGFNEKGETWWETWRELYNTSESSSSSSEENNNENNNNSNDDDHHPLSA 143

Query: 417 QGLVHLEKTADKWGK--NGNGD---EWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAG 471
                +E++ADKW +  + N D   EWQEKWWE + +    ++   K             
Sbjct: 144 SCCQMVERSADKWARHVDTNSDSSREWQEKWWERFSSENTCDRGVEK---------SGRE 194

Query: 472 HAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNS-HGVKQGETW 530
           + H W E+WGE YD +G S+++TDKWAE  +G    +WGDKW+E     S  G K GETW
Sbjct: 195 NRHAWWEKWGEHYDPNGISLRWTDKWAENDKG---VRWGDKWEERLKSTSGDGAKSGETW 251

Query: 531 WAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDT 567
                GE W RTWGE    +G V KYG+S++ E WD 
Sbjct: 252 REEPNGEVWRRTWGEEVFENGEVRKYGESTTDEKWDV 288


>gi|145353645|ref|XP_001421117.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|145357258|ref|XP_001422837.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581353|gb|ABO99410.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144583081|gb|ABP01196.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 355

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 83/253 (32%), Positives = 118/253 (46%), Gaps = 37/253 (14%)

Query: 346 KETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVW 405
           ++ G ++   G   RW  T+    D      E  W   D  G+KELG EKSG + TG  W
Sbjct: 115 RKRGFDRGEGGYQSRWWTTKRDCPDGRSGSSETRWAKCDFSGYKELGFEKSGFNDTGETW 174

Query: 406 REFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPN 465
            E W E   ++                         WWE Y A+G  E+   K       
Sbjct: 175 WETWREIYCRDDFT---------------------GWWERYYANGAVERGVEK------- 206

Query: 466 TQLDAGH--AHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHG 523
               +G      W E+WGE+YDG G ++K+TDKWAE   G    +WGDKW+E       G
Sbjct: 207 ----SGREVRQAWWEKWGEQYDGEGATLKWTDKWAENGMG---MRWGDKWEERRSKIGSG 259

Query: 524 VKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGF 583
            K GETW  G+ GER++RTWGE  +  G V K+G S++GE WDT   +  ++++     +
Sbjct: 260 RKSGETWRVGEDGERFSRTWGEVLSPDGSVRKFGNSTTGESWDTTVVENVYFDKSKPPTW 319

Query: 584 YHCFDNSVQLREV 596
                +S +L  +
Sbjct: 320 QEVLSSSERLMSI 332


>gi|6691197|gb|AAF24535.1|AC007534_16 F7F22.5 [Arabidopsis thaliana]
          Length = 322

 Score =  100 bits (250), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 79/213 (37%), Positives = 108/213 (50%), Gaps = 44/213 (20%)

Query: 319 AAEAAHALDKVDE-LATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQE 377
           A E    +D ++E +   G N DGS W++E+G +   +G  CRW+   G S D + EW E
Sbjct: 127 ANEKDWGIDLLNENVNEAGTNEDGSSWFRESGHDLGDNGYRCRWSRMGGRSHDGSSEWTE 186

Query: 378 KFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDE 437
             W          +G EKSG+++ G+ W E W E + Q                    DE
Sbjct: 187 T-WSVL------FVGVEKSGKNSEGDSWWETWQEVLHQ--------------------DE 219

Query: 438 WQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKW 497
           W+   WE YDA G  EK AHK+  ++  +         W E+WGE YDG G  +K+TDKW
Sbjct: 220 WR---WEKYDAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKW 267

Query: 498 AERCEGDGWSKWGDKWDENFDPNSHGVKQGETW 530
           AE   G   +KWGDKW+E F  +  G +QGETW
Sbjct: 268 AETELG---TKWGDKWEEKF-FSGIGSRQGETW 296


>gi|302795454|ref|XP_002979490.1| hypothetical protein SELMODRAFT_56868 [Selaginella moellendorffii]
 gi|300152738|gb|EFJ19379.1| hypothetical protein SELMODRAFT_56868 [Selaginella moellendorffii]
          Length = 175

 Score = 93.6 bits (231), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 68/192 (35%), Positives = 97/192 (50%), Gaps = 48/192 (25%)

Query: 336 GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEK 395
           G N DGS W++E G +   +G  CR+T+  G S+D + EW+E             + +EK
Sbjct: 32  GTNEDGSTWFRECGEDLGENGYRCRYTVMGGRSSDGSTEWKET------------VRAEK 79

Query: 396 SGRDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKW 455
           SG++A G+ W E W E + Q++                         WE Y+A G  EK 
Sbjct: 80  SGKNAEGDAWWETWQEILRQDELR-----------------------WEKYNAKGWTEKG 116

Query: 456 AHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDE 515
           AHK+  ++  +         W E+WGE+YDG G  +K+TDKWAE   G+   KWGDKW+E
Sbjct: 117 AHKYGRLNEQS---------WWEKWGEQYDGRGAVLKWTDKWAENATGE---KWGDKWEE 164

Query: 516 NFDPNSHGVKQG 527
            F  N  G +QG
Sbjct: 165 KF-QNGAGTRQG 175


>gi|221481921|gb|EEE20287.1| AT hook motif-containing protein, putative [Toxoplasma gondii GT1]
          Length = 1282

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 67/204 (32%), Positives = 92/204 (45%), Gaps = 31/204 (15%)

Query: 309  RDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVS 368
            R++   F     E AH+ +K      RG +  G  W ++    +RP+      + T+  S
Sbjct: 1096 REVTDWFEDKFGEVAHSQEKW--AYKRGHSASGDNWLEK--WNERPEEK----SATKSGS 1147

Query: 369  ADEALEWQEKFWEAADELGHKELG-SEKSGRDATGNVWREFWTE--SMWQNQGLVHLEKT 425
                 EW E++ E  DE G K    +EK+GR+A G+ W E W E  S W         K 
Sbjct: 1148 NARGDEWSEQWKETFDENGEKSTTWAEKTGRNAQGDAWYETWLERRSNW---------KM 1198

Query: 426  ADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYD 485
            A K G+N  G+EWQEKW E     G  EKW  KW   +   +    H   W +RWG+  D
Sbjct: 1199 AIKEGRNARGEEWQEKWGEDLHEDGSGEKWCQKWAKDNAGNR----HGKSWGDRWGK--D 1252

Query: 486  GHGGSMKYTDKWAERCEGDGWSKW 509
            G GG      +W E    D  +KW
Sbjct: 1253 GKGGH-----RWGEEWSNDDVNKW 1271



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/178 (32%), Positives = 81/178 (45%), Gaps = 28/178 (15%)

Query: 371  EALEW-QEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKW 429
            E  +W ++KF E A     +E  + K G  A+G+ W E W E           EK+A K 
Sbjct: 1097 EVTDWFEDKFGEVAH---SQEKWAYKRGHSASGDNWLEKWNERP--------EEKSATKS 1145

Query: 430  GKNGNGDEWQEKWWEHYDASG-KAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHG 488
            G N  GDEW E+W E +D +G K+  WA K      N Q DA     W+E W E+     
Sbjct: 1146 GSNARGDEWSEQWKETFDENGEKSTTWAEK---TGRNAQGDA-----WYETWLERRS--- 1194

Query: 489  GSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGER 546
             + K   K      G+   +W +KW E+   +  G K  + W     G R  ++WG+R
Sbjct: 1195 -NWKMAIKEGRNARGE---EWQEKWGEDLHEDGSGEKWCQKWAKDNAGNRHGKSWGDR 1248


>gi|221501376|gb|EEE27155.1| AT hook motif-containing protein, putative [Toxoplasma gondii VEG]
          Length = 1282

 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 67/204 (32%), Positives = 92/204 (45%), Gaps = 31/204 (15%)

Query: 309  RDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVS 368
            R++   F     E AH+ +K      RG +  G  W ++    +RP+      + T+  S
Sbjct: 1096 REVTDWFEDKFGEVAHSQEKW--AYKRGHSASGDNWLEK--WNERPEEK----SATKSGS 1147

Query: 369  ADEALEWQEKFWEAADELGHKELG-SEKSGRDATGNVWREFWTE--SMWQNQGLVHLEKT 425
                 EW E++ E  DE G K    +EK+GR+A G+ W E W E  S W         K 
Sbjct: 1148 NARGDEWSEQWKETFDENGEKSTTWAEKTGRNAQGDAWYETWLERRSNW---------KM 1198

Query: 426  ADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYD 485
            A K G+N  G+EWQEKW E     G  EKW  KW   +   +    H   W +RWG+  D
Sbjct: 1199 AIKEGRNARGEEWQEKWGEDLHEDGSGEKWCQKWAKDNAGNR----HGKSWGDRWGK--D 1252

Query: 486  GHGGSMKYTDKWAERCEGDGWSKW 509
            G GG      +W E    D  +KW
Sbjct: 1253 GKGGH-----RWGEEWSNDDVNKW 1271



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/178 (32%), Positives = 81/178 (45%), Gaps = 28/178 (15%)

Query: 371  EALEW-QEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKW 429
            E  +W ++KF E A     +E  + K G  A+G+ W E W E           EK+A K 
Sbjct: 1097 EVTDWFEDKFGEVAH---SQEKWAYKRGHSASGDNWLEKWNERP--------EEKSATKS 1145

Query: 430  GKNGNGDEWQEKWWEHYDASG-KAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHG 488
            G N  GDEW E+W E +D +G K+  WA K      N Q DA     W+E W E+     
Sbjct: 1146 GSNARGDEWSEQWKETFDENGEKSTTWAEK---TGRNAQGDA-----WYETWLERRS--- 1194

Query: 489  GSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGER 546
             + K   K      G+   +W +KW E+   +  G K  + W     G R  ++WG+R
Sbjct: 1195 -NWKMAIKEGRNARGE---EWQEKWGEDLHEDGSGEKWCQKWAKDNAGNRHGKSWGDR 1248


>gi|237837095|ref|XP_002367845.1| AT hook motif-containing protein [Toxoplasma gondii ME49]
 gi|211965509|gb|EEB00705.1| AT hook motif-containing protein [Toxoplasma gondii ME49]
          Length = 1282

 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 67/204 (32%), Positives = 92/204 (45%), Gaps = 31/204 (15%)

Query: 309  RDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVS 368
            R++   F     E AH+ +K      RG +  G  W ++    +RP+      + T+  S
Sbjct: 1096 REVTDWFEDKFGEVAHSQEKW--AYKRGHSASGDNWLEK--WNERPEEK----SATKSGS 1147

Query: 369  ADEALEWQEKFWEAADELGHKELG-SEKSGRDATGNVWREFWTE--SMWQNQGLVHLEKT 425
                 EW E++ E  DE G K    +EK+GR+A G+ W E W E  S W         K 
Sbjct: 1148 NARGDEWSEQWKETFDENGEKSTTWAEKTGRNAQGDAWYETWLERRSNW---------KM 1198

Query: 426  ADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYD 485
            A K G+N  G+EWQEKW E     G  EKW  KW   +   +    H   W +RWG+  D
Sbjct: 1199 AIKEGRNARGEEWQEKWGEDLHEDGSGEKWCQKWAKDNAGNR----HGKSWGDRWGK--D 1252

Query: 486  GHGGSMKYTDKWAERCEGDGWSKW 509
            G GG      +W E    D  +KW
Sbjct: 1253 GKGGH-----RWGEEWSNDDVNKW 1271



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/178 (32%), Positives = 81/178 (45%), Gaps = 28/178 (15%)

Query: 371  EALEW-QEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKW 429
            E  +W ++KF E A     +E  + K G  A+G+ W E W E           EK+A K 
Sbjct: 1097 EVTDWFEDKFGEVAH---SQEKWAYKRGHSASGDNWLEKWNERP--------EEKSATKS 1145

Query: 430  GKNGNGDEWQEKWWEHYDASG-KAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHG 488
            G N  GDEW E+W E +D +G K+  WA K      N Q DA     W+E W E+     
Sbjct: 1146 GSNARGDEWSEQWKETFDENGEKSTTWAEK---TGRNAQGDA-----WYETWLERRS--- 1194

Query: 489  GSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGER 546
             + K   K      G+   +W +KW E+   +  G K  + W     G R  ++WG+R
Sbjct: 1195 -NWKMAIKEGRNARGE---EWQEKWGEDLHEDGSGEKWCQKWAKDNAGNRHGKSWGDR 1248


>gi|401403071|ref|XP_003881402.1| putative AT hook motif-containing protein [Neospora caninum
            Liverpool]
 gi|325115814|emb|CBZ51369.1| putative AT hook motif-containing protein [Neospora caninum
            Liverpool]
          Length = 1316

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 51/140 (36%), Positives = 69/140 (49%), Gaps = 19/140 (13%)

Query: 362  TMTRGVSADEALEWQEKFWEAADELGHKELG-SEKSGRDATGNVWREFWTE--SMWQNQG 418
            T T+  S      W E++ E  DE G K +  +EK+GR+A G+ W E W E  + W    
Sbjct: 1175 TATKSGSNARGDAWSEQWKETFDENGEKNITWAEKTGRNAQGDSWYETWLERRANW---- 1230

Query: 419  LVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHE 478
                 K A K G+N  G+EWQEKW E     G  EKW  KW       +    H   W +
Sbjct: 1231 -----KMAIKEGRNARGEEWQEKWGEDLHEDGSGEKWCQKWAKDHAGNR----HGKSWGD 1281

Query: 479  RWGEKYDGHGGSMKYTDKWA 498
            RWG+  DG GG  K+ ++W+
Sbjct: 1282 RWGK--DGKGG-HKWGEEWS 1298



 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 59/177 (33%), Positives = 81/177 (45%), Gaps = 29/177 (16%)

Query: 395  KSGRDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDASG-KAE 453
            K GR+A+G+ W E W E           EKTA K G N  GD W E+W E +D +G K  
Sbjct: 1153 KQGRNASGDQWLEKWNEKP--------EEKTATKSGSNARGDAWSEQWKETFDENGEKNI 1204

Query: 454  KWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKW 513
             WA K      N Q D+     W+E W E+      + K   K      G+   +W +KW
Sbjct: 1205 TWAEK---TGRNAQGDS-----WYETWLERR----ANWKMAIKEGRNARGE---EWQEKW 1249

Query: 514  DENFDPNSHGVKQGETWWAGKYGERWNRTWGER--HNGSGWVHKYGKSSSGELWDTH 568
             E+   +  G K  + W     G R  ++WG+R   +G G  HK+G+  S E  D H
Sbjct: 1250 GEDLHEDGSGEKWCQKWAKDHAGNRHGKSWGDRWGKDGKG-GHKWGEEWSNE--DVH 1303



 Score = 38.9 bits (89), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 40/149 (26%), Positives = 67/149 (44%), Gaps = 18/149 (12%)

Query: 424  KTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHK-WCSIDPNTQLDAGHAHVWHERWGE 482
            +T +K+G   +G EW+E W       G  + W  K W   +    +         + WGE
Sbjct: 1026 RTGEKFGSKTDGTEWREAWGRQASDEGPEDSWIEKRWKECNQGEGV---------KEWGE 1076

Query: 483  KYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRT 542
              +G  G  ++  KW ++    G  ++ +KW+++   N+  VKQG T W    G R    
Sbjct: 1077 -TEGSEGRKRWNQKWWKKESWQGGDEFVEKWEDDGYGNTSTVKQGST-WKHHEGGREVTD 1134

Query: 543  WGE------RHNGSGWVHKYGKSSSGELW 565
            W E       H+   W +K G+++SG+ W
Sbjct: 1135 WFEDKFGVVEHSQEKWAYKQGRNASGDQW 1163


>gi|424042845|ref|ZP_17780513.1| UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--2,6-diaminopimelate
           ligase [Vibrio cholerae HENC-02]
 gi|408886232|gb|EKM24913.1| UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--2,6-diaminopimelate
           ligase [Vibrio cholerae HENC-02]
          Length = 485

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 38/84 (45%), Gaps = 12/84 (14%)

Query: 1   MASQLSHYPRATGHRANPPLIFTT----RRTTPQQINFWSRRTGAKVGVSNSEGGGSYLD 56
           +A QL HYP       N  LI  T    + T  Q I  W    GAK  V  + G G +LD
Sbjct: 94  IAGQLYHYP-------NMELIGVTGTNGKTTITQLIAQWIDLVGAKAAVMGTTGNG-FLD 145

Query: 57  MWQKAVDRDRKEIEFQKIAGSLAE 80
             Q+A +     +E QK   SLAE
Sbjct: 146 NLQEAANTTGNAVEIQKTLASLAE 169


>gi|424031999|ref|ZP_17771420.1| UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--2,6-diaminopimelate
           ligase [Vibrio cholerae HENC-01]
 gi|408876411|gb|EKM15528.1| UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--2,6-diaminopimelate
           ligase [Vibrio cholerae HENC-01]
          Length = 491

 Score = 43.9 bits (102), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 38/84 (45%), Gaps = 12/84 (14%)

Query: 1   MASQLSHYPRATGHRANPPLIFTT----RRTTPQQINFWSRRTGAKVGVSNSEGGGSYLD 56
           +A QL HYP       N  LI  T    + T  Q I  W    GAK  V  + G G +LD
Sbjct: 100 IAGQLYHYP-------NMELIGVTGTNGKTTITQLIAQWIDLVGAKAAVMGTTGNG-FLD 151

Query: 57  MWQKAVDRDRKEIEFQKIAGSLAE 80
             Q+A +     +E QK   SLAE
Sbjct: 152 NLQEAANTTGNAVEIQKTLASLAE 175


>gi|219117189|ref|XP_002179389.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409280|gb|EEC49212.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1843

 Score = 42.0 bits (97), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 35/145 (24%), Positives = 62/145 (42%), Gaps = 9/145 (6%)

Query: 302  TPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCRW 361
            T SLE E  +G        EA  A D +  L    +N       +  GI Q    V+ R 
Sbjct: 1148 TSSLESEFCVG--LECELFEATIAQDPLQRL--HSLNNGALCLERYAGISQEGSAVIDRE 1203

Query: 362  TMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLVH 421
             +T  V    A    ++  + A ++    L S ++  D+  N ++   ++ ++ ++ L  
Sbjct: 1204 ALTEQVLCSRA----KRMKDEACQIESLYLASARATHDSCKNHFQNISSKRLYHDEALGK 1259

Query: 422  LEKTADKWGKNGNGDEWQEKWWEHY 446
            L +      ++G GD W EKWW+ +
Sbjct: 1260 LSQQGTHLSQSG-GDCWDEKWWDEF 1283


>gi|393909952|gb|EJD75659.1| hypothetical protein LOAG_17239 [Loa loa]
          Length = 522

 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 63/252 (25%), Positives = 110/252 (43%), Gaps = 46/252 (18%)

Query: 9   PRATGHRANPPLIFTTRRTTPQQINFWSRRTGAKVGVSNSEGGG--------SYLDMWQK 60
           P++TGH A          TTP  +   S  T   VGV+  +  G          +   + 
Sbjct: 214 PQSTGHPA----------TTPAVLKSMSIHT---VGVAEIDQNGQTAYEVDACLISQLKD 260

Query: 61  AVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDVSKEERDRI 120
            +D+  K+I + +   +L +   + GNE    +   E LEK  +E ++++D   E   R 
Sbjct: 261 ELDKADKKIGYLEKELTLTKRA-IYGNEQFNIKGQIEALEKDKKELTRVIDSQTERLTRF 319

Query: 121 Q-RLQVIDRAAAAIAAARAILEEKNGSVVKNGESS------GTAEVSRFVKKNSESSGA- 172
           + +L+V++R   A+      LE  N   V+              E+SR  +K+ E S A 
Sbjct: 320 EDQLRVVNREKEALQRKYTELERANNLAVREKLEQLEIVERQRKELSRLEEKHLEKSKAH 379

Query: 173 -AEISPF---VKNSESNGTAEVPERGALSAGIFVP-RSGTPGNRTPAPGPDFWSWSPPED 227
            A+I  +   +K  +        ERGAL A +    R GTP + +       ++ +   D
Sbjct: 380 DAQIEAYQQLIKQKDD-------ERGALIAELCATNRLGTPLHDSINETESRYAAT---D 429

Query: 228 DDRD-MRDVRDL 238
           ++R+ +++VRDL
Sbjct: 430 ENRELLKEVRDL 441


>gi|297526966|ref|YP_003668990.1| phosphoesterase DHHA1 [Staphylothermus hellenicus DSM 12710]
 gi|297255882|gb|ADI32091.1| phosphoesterase DHHA1 [Staphylothermus hellenicus DSM 12710]
          Length = 328

 Score = 39.7 bits (91), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 52/130 (40%), Gaps = 11/130 (8%)

Query: 474 HVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPN-----SHGVKQGE 528
           HVW E W  K  G G  + Y D+    C     +K+ +K+  N D         GV  G+
Sbjct: 95  HVWDEDWINKLRGLGVKI-YIDR--STCAVGVVAKYAEKYRNNIDEEFVSELVKGVCAGD 151

Query: 529 TWWAGKYGERWNRTWGERHNGSGWVHKYG-KSSSGELWDTHEQQETWYERF-PHFGFYHC 586
            W    +   W      RH+   W  K   K SSG LWD  E  +   ERF      Y  
Sbjct: 152 LWRFDHWRGPWYLRLVRRHDDPEWRLKVLEKISSGVLWDD-EFTDKVVERFEKELIGYKM 210

Query: 587 FDNSVQLREV 596
            D ++  RE+
Sbjct: 211 VDKTILTREI 220


>gi|358388353|gb|EHK25946.1| hypothetical protein TRIVIDRAFT_218117 [Trichoderma virens Gv29-8]
          Length = 509

 Score = 39.3 bits (90), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 45/96 (46%), Gaps = 8/96 (8%)

Query: 345 WKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDA---T 401
           W  T +  R    VCR   +R + A    ++ +   EA D + H+ L S K G  +   +
Sbjct: 399 WCLTRLALRGTYAVCRMVKSRLLYAVSPAKFHDLEVEAQD-IAHRIL-SLKEGMASFKDS 456

Query: 402 GNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDE 437
             VW +F  +  W  +G++   +T D WGK G G E
Sbjct: 457 ALVWNQFMAQGSWIAKGII---ETKDAWGKEGEGHE 489


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.311    0.130    0.415 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,737,318,775
Number of Sequences: 23463169
Number of extensions: 590064556
Number of successful extensions: 1259380
Number of sequences better than 100.0: 379
Number of HSP's better than 100.0 without gapping: 76
Number of HSP's successfully gapped in prelim test: 303
Number of HSP's that attempted gapping in prelim test: 1256267
Number of HSP's gapped (non-prelim): 1015
length of query: 619
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 470
effective length of database: 8,863,183,186
effective search space: 4165696097420
effective search space used: 4165696097420
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 80 (35.4 bits)