BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 006558
         (640 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225434251|ref|XP_002276208.1| PREDICTED: uncharacterized protein LOC100257808 [Vitis vinifera]
          Length = 656

 Score =  898 bits (2320), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 455/606 (75%), Positives = 519/606 (85%), Gaps = 9/606 (1%)

Query: 30  VDDSEEKESEDSVDWESEFLGELDPFGYQAPKKRKKQEK-SKVVDDNEGMDWCVRARKVA 88
           VD ++++ESE  +DWE EFLGELDP G+QAPKKRKK+E+ SK+++D +GMDWCV+ARK+A
Sbjct: 56  VDSNDKQESE--MDWELEFLGELDPLGFQAPKKRKKREQGSKLLEDTDGMDWCVKARKMA 113

Query: 89  LKSIEARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVTDDDLDFDLEDDMKMDDIMGS 148
           LKSIEARGL  +MEDLI VKKKK   KK  +K K   K    + + D ++D+++  +   
Sbjct: 114 LKSIEARGLTRTMEDLITVKKKKNNKKKLGKKDKISKKSKVSEEEDDSDEDIELKGV--- 170

Query: 149 GNGYDMND-LRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQ 207
            N  D  D LR+TVSM+AGGMFEEK+EKT++ FV RLSQFSGPS+RRKEINLNK IV+AQ
Sbjct: 171 -NPLDGADRLRKTVSMVAGGMFEEKKEKTMQAFVQRLSQFSGPSDRRKEINLNKAIVEAQ 229

Query: 208 TAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREM 267
           TA+EVLEV AE I AVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMT+ RLAF RQ+EM
Sbjct: 230 TAEEVLEVAAETIMAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTSRRLAFARQKEM 289

Query: 268 SMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVAN 327
           SMLV IAMTALPECSAQGISNI+WALSKIGGELLYLSEMDRVAEVALTKV +FNSQNVAN
Sbjct: 290 SMLVGIAMTALPECSAQGISNISWALSKIGGELLYLSEMDRVAEVALTKVEQFNSQNVAN 349

Query: 328 VAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAF 387
           VAGAFASM+HSAPDLFSEL++RAS+IVH FQEQELAQVLWAFASL EPA PLLESLDN F
Sbjct: 350 VAGAFASMRHSAPDLFSELSERASNIVHNFQEQELAQVLWAFASLNEPAGPLLESLDNVF 409

Query: 388 KDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQM 447
            D  QF CCL++     NE   V+++GD   E    SP L+F RDQLGNIAWSYAVLGQM
Sbjct: 410 NDENQFKCCLDQETLKYNEESVVENNGDLAMEEISGSPALNFKRDQLGNIAWSYAVLGQM 469

Query: 448 DRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKI 507
           DR+FFS +WKT+S FEEQRISEQYREDIMFASQVHLVNQCLKLE+PHL+L+L S LEEK+
Sbjct: 470 DRVFFSHVWKTLSHFEEQRISEQYREDIMFASQVHLVNQCLKLEYPHLRLSLRSDLEEKV 529

Query: 508 ASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPT 567
           A AGKTKRFNQK+TSSFQKEVA LLVSTGL+W+REY VDGYT+DAVLVD+KVA EIDGPT
Sbjct: 530 ARAGKTKRFNQKMTSSFQKEVAHLLVSTGLDWVREYVVDGYTLDAVLVDQKVALEIDGPT 589

Query: 568 HFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIGGEG 627
           HFSRN+GVPLGHTMLKRRYI AAGW + S+SHQEWEELQG FEQLDYLR ILKD+I GEG
Sbjct: 590 HFSRNSGVPLGHTMLKRRYITAAGWKLASVSHQEWEELQGGFEQLDYLREILKDHI-GEG 648

Query: 628 SSNIAE 633
           S+NI +
Sbjct: 649 SANIVQ 654


>gi|147853193|emb|CAN78554.1| hypothetical protein VITISV_042206 [Vitis vinifera]
          Length = 676

 Score =  885 bits (2288), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 455/626 (72%), Positives = 519/626 (82%), Gaps = 29/626 (4%)

Query: 30  VDDSEEKESEDSVDWESEFLGELDPFGYQAPKKRKKQEK-SKVVDDNEGMDWCVRARKVA 88
           VD ++++ESE  +DWE EFLGELDP G+QAPKKRKK+E+ SK+++D +GMDWCV+ARK+A
Sbjct: 56  VDSNDKQESE--MDWELEFLGELDPLGFQAPKKRKKREQGSKLLEDTDGMDWCVKARKMA 113

Query: 89  LKSIEARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVTDDDLDFDLEDDMKMDDIMGS 148
           LKSIEARGL  +MEDLI VKKKK   KK  +K K   K    + + D ++D+++  +   
Sbjct: 114 LKSIEARGLTRTMEDLITVKKKKNNKKKLGKKDKISKKSKVSEEEDDSDEDIELKGV--- 170

Query: 149 GNGYDMND-LRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQ 207
            N  D  D LR+TVSM+AGGMFEEK+EKT++ FV RLSQFSGPS+RRKEINLNK IV+AQ
Sbjct: 171 -NPLDGADRLRKTVSMVAGGMFEEKKEKTMQAFVQRLSQFSGPSDRRKEINLNKAIVEAQ 229

Query: 208 TAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREM 267
           TA+EVLEV AE I AVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMT+ RLAF RQ+EM
Sbjct: 230 TAEEVLEVAAETIMAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTSRRLAFARQKEM 289

Query: 268 SMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVAN 327
           SMLV IAMTALPECSAQGISNI+WALSKIGGELLYLSEMDRVAEVALTKV +FNSQNVAN
Sbjct: 290 SMLVGIAMTALPECSAQGISNISWALSKIGGELLYLSEMDRVAEVALTKVEQFNSQNVAN 349

Query: 328 VAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAF 387
           VAGAFASM+HSAPDLFSEL++RAS+IVH FQEQELAQVLWAFASL EPA PLLESLDN F
Sbjct: 350 VAGAFASMRHSAPDLFSELSERASNIVHNFQEQELAQVLWAFASLNEPAGPLLESLDNVF 409

Query: 388 KDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQM 447
            D  QF CCL++     NE   V+++GD   E    SP L+F RDQLGNIAWSYAVLGQM
Sbjct: 410 NDENQFKCCLDQETLKYNEESVVENNGDLAMEEISGSPALNFKRDQLGNIAWSYAVLGQM 469

Query: 448 DRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKI 507
           DR+FFS +WKT+S FEEQRISEQYREDIMFASQVHLVNQCLKLE+PHL+L+L S LEEK+
Sbjct: 470 DRVFFSHVWKTLSHFEEQRISEQYREDIMFASQVHLVNQCLKLEYPHLRLSLRSDLEEKV 529

Query: 508 ASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPT 567
           A AGKTKRFNQK+TSSFQKEVA LLVSTGL+W+REY VDGYT+DAVLVD+KVA EIDGPT
Sbjct: 530 ARAGKTKRFNQKMTSSFQKEVAHLLVSTGLDWVREYVVDGYTLDAVLVDQKVALEIDGPT 589

Query: 568 HFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQ--------------------EWEELQG 607
           HFSRN+GVPLGHTMLKRRYI AAGW + S+SHQ                    EWEELQG
Sbjct: 590 HFSRNSGVPLGHTMLKRRYITAAGWKLASVSHQERHLLVVFICVSSRGFNTVVEWEELQG 649

Query: 608 SFEQLDYLRVILKDYIGGEGSSNIAE 633
            FEQLDYLR ILKD+I GEGS+NI +
Sbjct: 650 GFEQLDYLREILKDHI-GEGSANIVQ 674


>gi|224117838|ref|XP_002331644.1| predicted protein [Populus trichocarpa]
 gi|222874040|gb|EEF11171.1| predicted protein [Populus trichocarpa]
          Length = 663

 Score =  882 bits (2280), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 440/586 (75%), Positives = 499/586 (85%), Gaps = 6/586 (1%)

Query: 44  WESEFLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKSIEARGLASSMED 103
           W+ EFLGELDP G QA KKRKKQ+ S ++ D +GMDWC+RARKVALKSIEARGL+  MED
Sbjct: 79  WKLEFLGELDPLGCQASKKRKKQQNSGLLKDTDGMDWCLRARKVALKSIEARGLSQRMED 138

Query: 104 LIKVKKKKKKGKKKLEKIKKKNKVTDDDLDFDLEDDMKMDDIMGSGNGYDMNDLRRTVSM 163
           LI VKKKKKK  KK    K K     ++ D D + D  ++   G        DL+R VSM
Sbjct: 139 LINVKKKKKKRNKKKLVGKVKKVKDFEEDDLDFDLDEGVELEEGDA------DLKRMVSM 192

Query: 164 MAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAV 223
           +  GMF+E++EKT+EEF+ RLSQFSGPS+R+KEINLN+ IV+AQTA+EVLE+ AEMI AV
Sbjct: 193 LGDGMFQERKEKTMEEFLQRLSQFSGPSDRKKEINLNRAIVEAQTAEEVLEITAEMIMAV 252

Query: 224 GKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSA 283
           GKGLSPSPLSPLNIATALHRIAKNMEKVSMM T RLAF RQ+E+SMLV IAMTALPECSA
Sbjct: 253 GKGLSPSPLSPLNIATALHRIAKNMEKVSMMNTRRLAFARQKEVSMLVGIAMTALPECSA 312

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
           QGISNI+WALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGA ASMQHSAPDLF
Sbjct: 313 QGISNISWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGALASMQHSAPDLF 372

Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSN 403
           S L+KR S+I+HTFQEQELAQVLWAFASLYEPAD LL++LD  FK+A Q  C L    S 
Sbjct: 373 SALSKRGSEIIHTFQEQELAQVLWAFASLYEPADSLLDALDTVFKNANQLECSLKTKTSY 432

Query: 404 CNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFE 463
            +E    + SGD D+EG L SPVLSFNRDQLGNIAWSYAV+GQ+DRIFFS++W+T+S FE
Sbjct: 433 SDEERSNEDSGDLDAEGPLRSPVLSFNRDQLGNIAWSYAVIGQLDRIFFSNVWRTLSHFE 492

Query: 464 EQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSS 523
           EQR+SEQYREDIMFASQ HLVNQCLKLE+PHL+L+L   LEEKIA AGKTKRFNQK TSS
Sbjct: 493 EQRLSEQYREDIMFASQAHLVNQCLKLEYPHLRLSLGDNLEEKIARAGKTKRFNQKTTSS 552

Query: 524 FQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLK 583
           FQKEVARLLVSTGL+W+REY VDGYTVDAV+VDKK+A EIDGPTHFSRNTG+PLGHTMLK
Sbjct: 553 FQKEVARLLVSTGLDWVREYVVDGYTVDAVVVDKKIALEIDGPTHFSRNTGMPLGHTMLK 612

Query: 584 RRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIGGEGSS 629
           RRYIAAAGWNVVSLSHQEWEE++GS+EQ +YLR ILK++IGG+ SS
Sbjct: 613 RRYIAAAGWNVVSLSHQEWEEIEGSYEQQEYLREILKEHIGGDSSS 658


>gi|255585295|ref|XP_002533346.1| conserved hypothetical protein [Ricinus communis]
 gi|223526811|gb|EEF29031.1| conserved hypothetical protein [Ricinus communis]
          Length = 666

 Score =  850 bits (2195), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 462/613 (75%), Positives = 506/613 (82%), Gaps = 28/613 (4%)

Query: 31  DDSEEKESEDSVDWESEFLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALK 90
           D+ EE E     DWE EFLGELDP GYQAPKKRKKQ+KSK++++ +GMDWC+RARKVALK
Sbjct: 68  DNGEEVE-----DWELEFLGELDPLGYQAPKKRKKQKKSKLLEETDGMDWCLRARKVALK 122

Query: 91  SIEARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVTDDD------------LDFDLED 138
           SIEARGL+ +MEDLI VKKKKKK KKKL    K +K   D             ++F+   
Sbjct: 123 SIEARGLSQNMEDLINVKKKKKKNKKKLVSKSKISKKNKDLEDDSDFDLDDEDVEFEDVA 182

Query: 139 DMKMDDIMGSGNGYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEIN 198
           D+  DD +         DLRRTVS MAGGMFEEK+EK +EEFV RLSQFSGPS+R+KE+N
Sbjct: 183 DLPGDDSI---------DLRRTVSSMAGGMFEEKKEKNMEEFVQRLSQFSGPSDRKKEVN 233

Query: 199 LNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHR 258
           LN+ IV+AQTA+EVLEV A+MI AVGKGLSPSPLSPLNIATALHRIAKNMEKVSMM T R
Sbjct: 234 LNRAIVEAQTAEEVLEVTADMIIAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMKTRR 293

Query: 259 LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVG 318
           LAF RQREMSMLV IAMTALPECSAQGISNI+WALSKIGGELLYLSEMDRVAEVALTKV 
Sbjct: 294 LAFARQREMSMLVGIAMTALPECSAQGISNISWALSKIGGELLYLSEMDRVAEVALTKVD 353

Query: 319 EFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADP 378
           EFNSQNVANVAGAFASMQHSA DLFS L+KRASDI+HTFQEQELAQVLWAFASLYEPAD 
Sbjct: 354 EFNSQNVANVAGAFASMQHSASDLFSALSKRASDIIHTFQEQELAQVLWAFASLYEPADS 413

Query: 379 LLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIA 438
           LLESLD  FKD  QF C       N NE   +K SGD D E     PVL FNRDQLGNIA
Sbjct: 414 LLESLDIVFKDVNQFHCYTKAETLNYNEVDSMKGSGDLDREEVSGPPVLKFNRDQLGNIA 473

Query: 439 WSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLA 498
           WSYAV GQ++R FFS+IW+T+   EEQRISEQYREDIMFASQ HLVNQCLKLEHPH QLA
Sbjct: 474 WSYAVFGQVNRTFFSNIWRTLRNSEEQRISEQYREDIMFASQAHLVNQCLKLEHPHYQLA 533

Query: 499 LSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKK 558
           L   LEEKIA AGKTKRFNQK+TSSFQKEVARLLVSTGL+W+REY VDGYT+DAV+VDKK
Sbjct: 534 LGGDLEEKIARAGKTKRFNQKITSSFQKEVARLLVSTGLDWVREYVVDGYTLDAVVVDKK 593

Query: 559 VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVI 618
           +A EIDGPTHFSRNTGVPLGHTMLKRRYI+AAGW VVSLSHQEWEELQGSFEQLDYLR I
Sbjct: 594 IALEIDGPTHFSRNTGVPLGHTMLKRRYISAAGWKVVSLSHQEWEELQGSFEQLDYLREI 653

Query: 619 LKDYIGGEGSSNI 631
           LK ++G   S+NI
Sbjct: 654 LKVHLG--DSNNI 664


>gi|356506291|ref|XP_003521919.1| PREDICTED: uncharacterized protein LOC100805208 [Glycine max]
          Length = 664

 Score =  838 bits (2164), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 424/594 (71%), Positives = 493/594 (82%), Gaps = 11/594 (1%)

Query: 32  DSEEKESEDSVDWESEFLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKS 91
           DS++K  E S DWE EFLGELDPFGY+APKKR+K+++SK+++  +GMDWCVRARK AL+S
Sbjct: 70  DSDDKGEESSTDWELEFLGELDPFGYRAPKKREKEQRSKLLEATDGMDWCVRARKKALES 129

Query: 92  IEARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVTDD--DLDFDLEDDMKMDDIMGSG 149
           IEARG+A  +ED++ VKKKKKK KKKLE  KK  K  +   DLDF LE+D+    +    
Sbjct: 130 IEARGMAHLVEDMVTVKKKKKKDKKKLESKKKVVKKIEKIEDLDFVLEEDL----LQPMK 185

Query: 150 NGYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTA 209
              D+ DL+R VSM   GMF EK+EKT E FV+RLSQFSGPS+ RKEINLNK I +A+TA
Sbjct: 186 PEIDVGDLKRRVSMFNDGMFIEKKEKTKEAFVNRLSQFSGPSDHRKEINLNKAITEARTA 245

Query: 210 QEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSM 269
            +VLEV  E I AV KGLSPSPLSPLNIATALHRIAKNMEKVSMM T RLAF RQREMSM
Sbjct: 246 DDVLEVTYETIVAVAKGLSPSPLSPLNIATALHRIAKNMEKVSMMRTRRLAFARQREMSM 305

Query: 270 LVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVA 329
           LV+IAMTALPECSAQG+SNI+WALSKIGGELLYLSEMDR+AEVALTKVGEFNSQN+AN+A
Sbjct: 306 LVSIAMTALPECSAQGVSNISWALSKIGGELLYLSEMDRIAEVALTKVGEFNSQNIANIA 365

Query: 330 GAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKD 389
           GAFA+MQHSAPDLFS L++RASDI+HTFQEQELAQ+LWAFASLYEPADP+ +SLD  FKD
Sbjct: 366 GAFAAMQHSAPDLFSVLSERASDIIHTFQEQELAQLLWAFASLYEPADPIFDSLDIVFKD 425

Query: 390 ATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDR 449
            +Q   C  +  SN +E   V  SG ++      SPVL+  RDQLG IAWSYAV GQMDR
Sbjct: 426 HSQLRGCTGERTSNNHEQIRVDRSGASN-----GSPVLTLTRDQLGTIAWSYAVFGQMDR 480

Query: 450 IFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIAS 509
            FFS +WKT+S +EE+RISE YREDIMFASQVHLVNQCLKLE PHLQL+L   LE+K+A 
Sbjct: 481 SFFSHVWKTLSHYEERRISELYREDIMFASQVHLVNQCLKLEFPHLQLSLCGDLEDKVAL 540

Query: 510 AGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHF 569
           A KTKRFNQK+TSSFQKEV RLL+STGL W++EY VDGYT+DAV+VDKK+A EIDGPTHF
Sbjct: 541 ARKTKRFNQKITSSFQKEVGRLLLSTGLEWVKEYVVDGYTLDAVIVDKKLALEIDGPTHF 600

Query: 570 SRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYI 623
           SRNTGVPLGHTMLKRRYI AAGW V S+S QEWEELQG+FEQ++YLR +LK+++
Sbjct: 601 SRNTGVPLGHTMLKRRYITAAGWKVASVSSQEWEELQGAFEQVEYLRNLLKNHL 654


>gi|356522646|ref|XP_003529957.1| PREDICTED: uncharacterized protein LOC100794144 [Glycine max]
          Length = 669

 Score =  830 bits (2145), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 425/593 (71%), Positives = 492/593 (82%), Gaps = 8/593 (1%)

Query: 33  SEEKESEDSVDWESEFLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKSI 92
           S +K    + DWESEFLGELDPFGY+APKKR+K+++S +++  +GMDWCVRARK ALKSI
Sbjct: 70  SNDKGEGSNTDWESEFLGELDPFGYRAPKKREKEKRSMLLEATDGMDWCVRARKEALKSI 129

Query: 93  EARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVTDD--DLDFDLEDDMKMDDIMGSGN 150
           EARG+A  ME+++ VKKKKKK KKKLE  KK  K  +   DLDF LE+D+          
Sbjct: 130 EARGMAHLMENMVTVKKKKKKDKKKLESKKKIVKKIEKIEDLDFSLEEDLPQP----MET 185

Query: 151 GYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQ 210
             D+ DL+R VS+   GMF EK+EKT EEFV+RLSQFSGPS+ RKEINLNK I +AQTA 
Sbjct: 186 EIDVGDLKRRVSIFNDGMFIEKKEKTKEEFVNRLSQFSGPSDHRKEINLNKAITEAQTAD 245

Query: 211 EVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSML 270
           +VLEV  E I AV KGLSPSPLSPLNIATALHRIAKNMEKVSMM T RLAF RQREMSML
Sbjct: 246 DVLEVTYETIVAVAKGLSPSPLSPLNIATALHRIAKNMEKVSMMRTRRLAFARQREMSML 305

Query: 271 VAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAG 330
           V+IAMTALPECSAQG+SNI+WALSKIGGELLYLSEMDR+AEVALTKVGEFNSQN+AN+AG
Sbjct: 306 VSIAMTALPECSAQGVSNISWALSKIGGELLYLSEMDRIAEVALTKVGEFNSQNIANIAG 365

Query: 331 AFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDA 390
           AFA+MQHSAPDLFSE +KRASDI+HTFQEQELAQ+LWAFASLYEPADP+ +SLD  FKD 
Sbjct: 366 AFAAMQHSAPDLFSEFSKRASDIIHTFQEQELAQLLWAFASLYEPADPIFDSLDIVFKDH 425

Query: 391 TQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRI 450
           +Q   C+ +  SN +E   V  SG   S GSL SPVL+  RDQLG IAWSYAV GQM R 
Sbjct: 426 SQLRGCIGEKTSNNHEQISVDRSG--ASNGSLGSPVLTLTRDQLGTIAWSYAVFGQMARS 483

Query: 451 FFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASA 510
           FFS +WKT+S +EEQRISE YREDIMFASQVHLVNQCLKLE PHLQL+L   LE+K+A +
Sbjct: 484 FFSHVWKTLSHYEEQRISELYREDIMFASQVHLVNQCLKLEFPHLQLSLCGELEDKVALS 543

Query: 511 GKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFS 570
           GKTKRFNQK+TSSFQKEV  LLVSTGL W++E+ VDGYT+DAV+VDKK+A EIDGPTHFS
Sbjct: 544 GKTKRFNQKITSSFQKEVGHLLVSTGLEWVKEFVVDGYTLDAVIVDKKLALEIDGPTHFS 603

Query: 571 RNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYI 623
           RNTGVPLGHTMLKRRYI AAGW V S+S+Q+WEELQG+FEQ++YL  +LK+++
Sbjct: 604 RNTGVPLGHTMLKRRYITAAGWKVASISYQKWEELQGAFEQVEYLSNLLKNHL 656


>gi|449505631|ref|XP_004162527.1| PREDICTED: uncharacterized protein LOC101223645 [Cucumis sativus]
          Length = 633

 Score =  799 bits (2064), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 414/605 (68%), Positives = 488/605 (80%), Gaps = 12/605 (1%)

Query: 29  EVDDSEEKESEDSVDWESEFLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVA 88
           E+ DS  + + D+++WE E L ELDP G+Q PKK+KKQ KSK++DD EGMDWC+RARKVA
Sbjct: 28  EIGDS--RGNGDNMEWEGELLQELDPLGFQPPKKKKKQMKSKLLDDTEGMDWCLRARKVA 85

Query: 89  LKSIEARGLASSMEDLIKVKKKKKKGK-------KKLEKIKKKNKVTDDDLDFDLEDDMK 141
           L+SIE RGLAS+ EDL  VKKK KK K        K   +  K  V ++ L+FD ++D++
Sbjct: 86  LRSIEGRGLASTEEDLFSVKKKNKKNKKKKKIMGSKDNGVNTKGDVIEESLEFDSDEDLE 145

Query: 142 MDDIMGSGNGYDMND---LRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEIN 198
           +D  +   +   +ND   L ++VS+M GGMFE+++EKT+EEF+ RLS+FSGPS+R+KE+N
Sbjct: 146 LDMDLDLLDSLAINDSNHLSKSVSIMGGGMFEQRKEKTMEEFIQRLSKFSGPSDRKKEVN 205

Query: 199 LNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHR 258
           LN+ I++AQTA E LEVI++MI AVGKGLSPSPLSPLNIATALHRIAKNM+KV MM +HR
Sbjct: 206 LNRAIIEAQTADEALEVISDMILAVGKGLSPSPLSPLNIATALHRIAKNMDKVLMMKSHR 265

Query: 259 LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVG 318
           LAF R+REMSMLV IAMT LPECSAQGISNIAWALSKIGG+ LYLSEMDRVAEV LTK+ 
Sbjct: 266 LAFARRREMSMLVGIAMTTLPECSAQGISNIAWALSKIGGDQLYLSEMDRVAEVTLTKIE 325

Query: 319 EFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADP 378
           E NSQNVAN+AGAFASMQHSA DLFS LAKRASDIV TF EQELAQVLWAFASL E AD 
Sbjct: 326 ELNSQNVANIAGAFASMQHSASDLFSGLAKRASDIVDTFHEQELAQVLWAFASLNESADL 385

Query: 379 LLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIA 438
           LLESLDN + DA+Q TC L++   N N+   V  S D +S+G++  PVL FNR+QLGNIA
Sbjct: 386 LLESLDNVYNDASQITCYLSEQTVNRNQESTVGVSNDLESDGAVGFPVLKFNRNQLGNIA 445

Query: 439 WSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLA 498
           WSYAV GQ+DR FFS IW+TIS FE++ ISEQ+R DI+FASQ+ LV+ CLK E+ HLQL+
Sbjct: 446 WSYAVFGQVDRSFFSHIWRTISYFEKESISEQHRNDIIFASQLWLVHYCLKREYSHLQLS 505

Query: 499 LSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKK 558
           LS  LEEK   AGKTKRFNQK TSSFQKEVARLLVSTG  W REY  D YT+DAV+VDKK
Sbjct: 506 LSVDLEEKAILAGKTKRFNQKTTSSFQKEVARLLVSTGHEWTREYVFDAYTLDAVIVDKK 565

Query: 559 VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVI 618
           V  EIDGPTHFSRNTG+PLGHT+LKRRYI AAGW VVSLSHQEWEELQG  EQL+YLR I
Sbjct: 566 VVLEIDGPTHFSRNTGIPLGHTVLKRRYITAAGWKVVSLSHQEWEELQGEVEQLNYLREI 625

Query: 619 LKDYI 623
           LKD+I
Sbjct: 626 LKDHI 630


>gi|449442355|ref|XP_004138947.1| PREDICTED: uncharacterized protein LOC101211080 [Cucumis sativus]
          Length = 671

 Score =  799 bits (2064), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 414/605 (68%), Positives = 488/605 (80%), Gaps = 12/605 (1%)

Query: 29  EVDDSEEKESEDSVDWESEFLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVA 88
           E+ DS  + + D+++WE E L ELDP G+Q PKK+KKQ KSK++DD EGMDWC+RARKVA
Sbjct: 66  EIGDS--RGNGDNMEWEGELLQELDPLGFQPPKKKKKQMKSKLLDDTEGMDWCLRARKVA 123

Query: 89  LKSIEARGLASSMEDLIKVKKKKKKGK-------KKLEKIKKKNKVTDDDLDFDLEDDMK 141
           L+SIE RGLAS+ EDL  VKKK KK K        K   +  K  V ++ L+FD ++D++
Sbjct: 124 LRSIEGRGLASTEEDLFSVKKKNKKNKKKKKIMGSKDNGVNTKGDVIEESLEFDSDEDLE 183

Query: 142 MDDIMGSGNGYDMND---LRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEIN 198
           +D  +   +   +ND   L ++VS+M GGMFE+++EKT+EEF+ RLS+FSGPS+R+KE+N
Sbjct: 184 LDMDLDLLDSLAINDSNHLSKSVSIMGGGMFEQRKEKTMEEFIQRLSKFSGPSDRKKEVN 243

Query: 199 LNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHR 258
           LN+ I++AQTA E LEVI++MI AVGKGLSPSPLSPLNIATALHRIAKNM+KV MM +HR
Sbjct: 244 LNRAIIEAQTADEALEVISDMILAVGKGLSPSPLSPLNIATALHRIAKNMDKVLMMKSHR 303

Query: 259 LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVG 318
           LAF R+REMSMLV IAMT LPECSAQGISNIAWALSKIGG+ LYLSEMDRVAEV LTK+ 
Sbjct: 304 LAFARRREMSMLVGIAMTTLPECSAQGISNIAWALSKIGGDQLYLSEMDRVAEVTLTKIE 363

Query: 319 EFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADP 378
           E NSQNVAN+AGAFASMQHSA DLFS LAKRASDIV TF EQELAQVLWAFASL E AD 
Sbjct: 364 ELNSQNVANIAGAFASMQHSASDLFSGLAKRASDIVDTFHEQELAQVLWAFASLNESADL 423

Query: 379 LLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIA 438
           LLESLDN + DA+Q TC L++   N N+   V  S D +S+G++  PVL FNR+QLGNIA
Sbjct: 424 LLESLDNVYNDASQITCYLSEQTVNRNQESTVGVSNDLESDGAVGFPVLKFNRNQLGNIA 483

Query: 439 WSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLA 498
           WSYAV GQ+DR FFS IW+TIS FE++ ISEQ+R DI+FASQ+ LV+ CLK E+ HLQL+
Sbjct: 484 WSYAVFGQVDRSFFSHIWRTISYFEKESISEQHRNDIIFASQLWLVHYCLKREYSHLQLS 543

Query: 499 LSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKK 558
           LS  LEEK   AGKTKRFNQK TSSFQKEVARLLVSTG  W REY  D YT+DAV+VDKK
Sbjct: 544 LSVDLEEKAILAGKTKRFNQKTTSSFQKEVARLLVSTGHEWTREYVFDAYTLDAVIVDKK 603

Query: 559 VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVI 618
           V  EIDGPTHFSRNTG+PLGHT+LKRRYI AAGW VVSLSHQEWEELQG  EQL+YLR I
Sbjct: 604 VVLEIDGPTHFSRNTGIPLGHTVLKRRYITAAGWKVVSLSHQEWEELQGEVEQLNYLREI 663

Query: 619 LKDYI 623
           LKD+I
Sbjct: 664 LKDHI 668


>gi|4887747|gb|AAD32283.1| hypothetical protein [Arabidopsis thaliana]
          Length = 627

 Score =  766 bits (1977), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 408/583 (69%), Positives = 473/583 (81%), Gaps = 20/583 (3%)

Query: 48  FLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKSIEARGLASSMEDLIKV 107
           FLGE+DP   Q PKKRKKQ+ SK ++D EGMDWCVRARK+ALKSIEARGL+S M +++ +
Sbjct: 58  FLGEIDPLDIQPPKKRKKQKNSKALEDTEGMDWCVRARKIALKSIEARGLSSRMAEVMPL 117

Query: 108 KKKKKKGKKKLEKIKKKNKVTDDDLDFDLE-------DDMKMDDIMGSGNGYDMNDLRRT 160
           KKKKKK  KK+   K K K      D           +D  ++D MG        DLR+ 
Sbjct: 118 KKKKKKKSKKVIVKKDKVKSKSIPEDDFDTEDEDLDFEDGFVEDKMG--------DLRKR 169

Query: 161 VSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMI 220
           VS +AGGMFEEK+EK  E+   RLSQFSGPS+R KEINLNK I++AQTA+EVLEV AE I
Sbjct: 170 VSSLAGGMFEEKKEKMKEQLAQRLSQFSGPSDRMKEINLNKAIIEAQTAEEVLEVTAETI 229

Query: 221 TAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPE 280
            AV KGLSPSPLSPLNIATALHRIAKNMEKVSMM T RLAF RQREMSMLVA+AMT LPE
Sbjct: 230 MAVAKGLSPSPLSPLNIATALHRIAKNMEKVSMMRTRRLAFARQREMSMLVALAMTCLPE 289

Query: 281 CSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
           CSAQGISNI+WALSKIGGELLYL+EMDRVAEVA +KVGEFNSQNVAN+AGAFASM+HSAP
Sbjct: 290 CSAQGISNISWALSKIGGELLYLTEMDRVAEVATSKVGEFNSQNVANIAGAFASMRHSAP 349

Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKA 400
           +LF+EL+KRAS I++TF+ QE+AQ+LW+FASLYEPADPLLESLD+AFK + QF C L K 
Sbjct: 350 ELFAELSKRASTIINTFKGQEIAQLLWSFASLYEPADPLLESLDSAFKSSDQFKCYLTKE 409

Query: 401 LSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTIS 460
           ++N +E    + S D        SP LSFNRDQLGNIAWSYAVLGQ++R FF++IW T++
Sbjct: 410 ITNSDEVVDAEVSDDVS-----RSPALSFNRDQLGNIAWSYAVLGQVERPFFANIWNTLT 464

Query: 461 RFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKV 520
             EEQR+SEQYRED+MFASQV+LVNQCLKLE PHLQL+L   LEEKI+ AGKTKRFNQK+
Sbjct: 465 TLEEQRLSEQYREDVMFASQVYLVNQCLKLECPHLQLSLCQELEEKISRAGKTKRFNQKI 524

Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHT 580
           TSSFQKEV RLL+STGL+W +E+ VDGYTVD  LV+KKVA EIDGPTHFSRN+G+PLGHT
Sbjct: 525 TSSFQKEVGRLLISTGLDWAKEHDVDGYTVDVALVEKKVALEIDGPTHFSRNSGLPLGHT 584

Query: 581 MLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYI 623
           MLKRRY+AAAGW VVSLS QEWEE +GS EQL+YLR IL   I
Sbjct: 585 MLKRRYVAAAGWKVVSLSLQEWEEHEGSHEQLEYLREILTGCI 627


>gi|30685105|ref|NP_850176.1| protein RAP [Arabidopsis thaliana]
 gi|18086393|gb|AAL57655.1| At2g31890/F20M17.7 [Arabidopsis thaliana]
 gi|22136584|gb|AAM91078.1| At2g31890/F20M17.7 [Arabidopsis thaliana]
 gi|330253506|gb|AEC08600.1| protein RAP [Arabidopsis thaliana]
          Length = 671

 Score =  763 bits (1971), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 408/583 (69%), Positives = 473/583 (81%), Gaps = 20/583 (3%)

Query: 48  FLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKSIEARGLASSMEDLIKV 107
           FLGE+DP   Q PKKRKKQ+ SK ++D EGMDWCVRARK+ALKSIEARGL+S M +++ +
Sbjct: 102 FLGEIDPLDIQPPKKRKKQKNSKALEDTEGMDWCVRARKIALKSIEARGLSSRMAEVMPL 161

Query: 108 KKKKKKGKKKLEKIKKKNKVTDDDLDFDLE-------DDMKMDDIMGSGNGYDMNDLRRT 160
           KKKKKK  KK+   K K K      D           +D  ++D MG        DLR+ 
Sbjct: 162 KKKKKKKSKKVIVKKDKVKSKSIPEDDFDTEDEDLDFEDGFVEDKMG--------DLRKR 213

Query: 161 VSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMI 220
           VS +AGGMFEEK+EK  E+   RLSQFSGPS+R KEINLNK I++AQTA+EVLEV AE I
Sbjct: 214 VSSLAGGMFEEKKEKMKEQLAQRLSQFSGPSDRMKEINLNKAIIEAQTAEEVLEVTAETI 273

Query: 221 TAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPE 280
            AV KGLSPSPLSPLNIATALHRIAKNMEKVSMM T RLAF RQREMSMLVA+AMT LPE
Sbjct: 274 MAVAKGLSPSPLSPLNIATALHRIAKNMEKVSMMRTRRLAFARQREMSMLVALAMTCLPE 333

Query: 281 CSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
           CSAQGISNI+WALSKIGGELLYL+EMDRVAEVA +KVGEFNSQNVAN+AGAFASM+HSAP
Sbjct: 334 CSAQGISNISWALSKIGGELLYLTEMDRVAEVATSKVGEFNSQNVANIAGAFASMRHSAP 393

Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKA 400
           +LF+EL+KRAS I++TF+ QE+AQ+LW+FASLYEPADPLLESLD+AFK + QF C L K 
Sbjct: 394 ELFAELSKRASTIINTFKGQEIAQLLWSFASLYEPADPLLESLDSAFKSSDQFKCYLTKE 453

Query: 401 LSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTIS 460
           ++N +E    + S D        SP LSFNRDQLGNIAWSYAVLGQ++R FF++IW T++
Sbjct: 454 ITNSDEVVDAEVSDDVS-----RSPALSFNRDQLGNIAWSYAVLGQVERPFFANIWNTLT 508

Query: 461 RFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKV 520
             EEQR+SEQYRED+MFASQV+LVNQCLKLE PHLQL+L   LEEKI+ AGKTKRFNQK+
Sbjct: 509 TLEEQRLSEQYREDVMFASQVYLVNQCLKLECPHLQLSLCQELEEKISRAGKTKRFNQKI 568

Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHT 580
           TSSFQKEV RLL+STGL+W +E+ VDGYTVD  LV+KKVA EIDGPTHFSRN+G+PLGHT
Sbjct: 569 TSSFQKEVGRLLISTGLDWAKEHDVDGYTVDVALVEKKVALEIDGPTHFSRNSGLPLGHT 628

Query: 581 MLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYI 623
           MLKRRY+AAAGW VVSLS QEWEE +GS EQL+YLR IL   I
Sbjct: 629 MLKRRYVAAAGWKVVSLSLQEWEEHEGSHEQLEYLREILTGCI 671


>gi|297826641|ref|XP_002881203.1| hypothetical protein ARALYDRAFT_902227 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327042|gb|EFH57462.1| hypothetical protein ARALYDRAFT_902227 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 668

 Score =  759 bits (1961), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 411/584 (70%), Positives = 477/584 (81%), Gaps = 22/584 (3%)

Query: 48  FLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKSIEARGLASSMEDLIKV 107
           FLGE+DP   Q PKKRKKQ+ SKV++D EGMDWCVRARK+ALKSIEARGL+S M +++ +
Sbjct: 99  FLGEIDPLDIQPPKKRKKQKNSKVLEDTEGMDWCVRARKIALKSIEARGLSSRMAEVMPL 158

Query: 108 KKKKKKGKKKLEKIKKKNKVTD--------DDLDFDLEDDMKMDDIMGSGNGYDMNDLRR 159
           KKKKKK  KK+   K+K K           +D D D ED + ++D MG        DLR+
Sbjct: 159 KKKKKKKSKKVIVKKEKVKTKSILEEDFDTEDEDLDFEDGL-VEDKMG--------DLRK 209

Query: 160 TVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEM 219
            VS +AGGMFEEK+EK  E+   RLSQFSGPS+R KEINLNK I++AQTA+EVLEV +E 
Sbjct: 210 RVSSLAGGMFEEKKEKMKEQLAQRLSQFSGPSDRMKEINLNKAIIEAQTAEEVLEVTSET 269

Query: 220 ITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALP 279
           I AV KGLSPSPLSPLNIATALHRIAKNMEKVSMM T RLAF RQREMSMLVA+AMT LP
Sbjct: 270 IMAVAKGLSPSPLSPLNIATALHRIAKNMEKVSMMRTRRLAFARQREMSMLVALAMTCLP 329

Query: 280 ECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
           ECSAQGISNI+WALSKIGGELLYL+EMDRVAEVA +KVGEFNSQNVAN+AGAFASM+HSA
Sbjct: 330 ECSAQGISNISWALSKIGGELLYLTEMDRVAEVATSKVGEFNSQNVANIAGAFASMRHSA 389

Query: 340 PDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNK 399
           P+LF+EL+KRAS I+ TF+ QE+AQ+LW+FASL EPADPLLESLD+AFK + QF C L K
Sbjct: 390 PELFAELSKRASTIIITFKGQEIAQLLWSFASLNEPADPLLESLDSAFKSSDQFKCYLTK 449

Query: 400 ALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTI 459
            ++N +E   V+ S DA       SP LSFNRDQLGNIAWSYAVLGQ++R FF++IW ++
Sbjct: 450 EITNSDEVVDVEVSDDAS-----GSPPLSFNRDQLGNIAWSYAVLGQVERPFFANIWNSL 504

Query: 460 SRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQK 519
           +  EEQR+SEQYRED+MFASQV LVNQCLKLE PHLQL+L   LEEKI  AGKTKRFNQK
Sbjct: 505 TTLEEQRLSEQYREDVMFASQVFLVNQCLKLECPHLQLSLCHGLEEKITRAGKTKRFNQK 564

Query: 520 VTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGH 579
           ++SSFQKEV RLL+STGL+W +E+ VDGYTVD  LVDKKVA EIDGPTHFSRN+G+PLGH
Sbjct: 565 ISSSFQKEVGRLLISTGLDWAKEHDVDGYTVDVALVDKKVALEIDGPTHFSRNSGIPLGH 624

Query: 580 TMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYI 623
           TMLKRRY+AAAGW VVSLS QEWEE +GS EQL+YLR IL   I
Sbjct: 625 TMLKRRYVAAAGWKVVSLSLQEWEEHEGSHEQLEYLREILNGCI 668


>gi|296084379|emb|CBI24767.3| unnamed protein product [Vitis vinifera]
          Length = 439

 Score =  756 bits (1951), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/466 (78%), Positives = 402/466 (86%), Gaps = 29/466 (6%)

Query: 168 MFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGL 227
           MFEEK+EKT++ FV RLSQFSGPS+RRKEINLNK IV+AQTA+EVLEV AE I AVGKGL
Sbjct: 1   MFEEKKEKTMQAFVQRLSQFSGPSDRRKEINLNKAIVEAQTAEEVLEVAAETIMAVGKGL 60

Query: 228 SPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGIS 287
           SPSPLSPLNIATALHRIAKNMEKVSMMT+ RLAF RQ+EMSMLV IAMTALPECSAQGIS
Sbjct: 61  SPSPLSPLNIATALHRIAKNMEKVSMMTSRRLAFARQKEMSMLVGIAMTALPECSAQGIS 120

Query: 288 NIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELA 347
           NI+WALSKIGGELLYLSEMDRVAEVALTKV +FNSQNVANVAGAFASM+HSAPDLFSEL+
Sbjct: 121 NISWALSKIGGELLYLSEMDRVAEVALTKVEQFNSQNVANVAGAFASMRHSAPDLFSELS 180

Query: 348 KRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNEN 407
           +RAS+IVH FQEQELAQVLWAFASL EPA PLLESLDN F D  QF CCL++        
Sbjct: 181 ERASNIVHNFQEQELAQVLWAFASLNEPAGPLLESLDNVFNDENQFKCCLDQETL----- 235

Query: 408 GGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRI 467
                                  +DQLGNIAWSYAVLGQMDR+FFS +WKT+S FEEQRI
Sbjct: 236 -----------------------KDQLGNIAWSYAVLGQMDRVFFSHVWKTLSHFEEQRI 272

Query: 468 SEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKE 527
           SEQYREDIMFASQVHLVNQCLKLE+PHL+L+L S LEEK+A AGKTKRFNQK+TSSFQKE
Sbjct: 273 SEQYREDIMFASQVHLVNQCLKLEYPHLRLSLRSDLEEKVARAGKTKRFNQKMTSSFQKE 332

Query: 528 VARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYI 587
           VA LLVSTGL+W+REY VDGYT+DAVLVD+KVA EIDGPTHFSRN+GVPLGHTMLKRRYI
Sbjct: 333 VAHLLVSTGLDWVREYVVDGYTLDAVLVDQKVALEIDGPTHFSRNSGVPLGHTMLKRRYI 392

Query: 588 AAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIGGEGSSNIAE 633
            AAGW + S+SHQEWEELQG FEQLDYLR ILKD+I GEGS+NI +
Sbjct: 393 TAAGWKLASVSHQEWEELQGGFEQLDYLREILKDHI-GEGSANIVQ 437


>gi|414875853|tpg|DAA52984.1| TPA: hypothetical protein ZEAMMB73_380323 [Zea mays]
          Length = 641

 Score =  681 bits (1757), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/591 (60%), Positives = 436/591 (73%), Gaps = 15/591 (2%)

Query: 38  SEDSVD----WESEFLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKSIE 93
           SED  D    W+ +FLG        AP    ++E+  ++   E  DWCVRAR+ AL+SIE
Sbjct: 52  SEDRTDSTPQWQLDFLGA----SAVAPDSPVEEEEEDLLP-AEATDWCVRARRSALRSIE 106

Query: 94  ARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVT----DDDLDFDLEDDMKMDDIMGSG 149
            RGLA +++ ++   KK KK K   +K  KK           L     D+   DD     
Sbjct: 107 ERGLAPALQRMVSPPKKTKKKKTAKKKELKKAAAELKRRTKQLADAEGDEDDDDDYDVVD 166

Query: 150 NGYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSG-PSNRRKEINLNKDIVDAQT 208
           +  +M+DL   V+  A GMF+EKR++  E FV  LS+FS  PSNR KE++LN+ IV AQT
Sbjct: 167 DLQNMDDLELRVAQFADGMFDEKRQRNRETFVQTLSRFSAAPSNRSKEVSLNRSIVQAQT 226

Query: 209 AQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMS 268
           A EVL++ AE+ITAV KGLSPSPL+PLNIATALHRIA+NME VSMM THRLAF RQR+MS
Sbjct: 227 ANEVLDLTAEVITAVAKGLSPSPLTPLNIATALHRIARNMEAVSMMQTHRLAFARQRDMS 286

Query: 269 MLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANV 328
           MLV +AM ALPECS QG+SNIAWALSKIGG+LLYL EMDR+A+VA+ KV +FN+QNVANV
Sbjct: 287 MLVGLAMVALPECSPQGVSNIAWALSKIGGDLLYLPEMDRIADVAMAKVQDFNAQNVANV 346

Query: 329 AGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFK 388
           AGAFASM+ SAP LFS LA RA+ ++ TF+EQELAQ LW  ASL E   PLL++LD AF+
Sbjct: 347 AGAFASMRQSAPGLFSSLAMRAAQLLQTFKEQELAQFLWGCASLNECPHPLLDALDTAFQ 406

Query: 389 DATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMD 448
           + T F C +    S+ + +   + SG  D   S S+  L+FNRDQ+GNIAWSYAV+GQMD
Sbjct: 407 NDTSFQCHVTDIKSSAHWSSAEELSGGEDGSTS-SARTLNFNRDQVGNIAWSYAVIGQMD 465

Query: 449 RIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIA 508
           R FFS +W+T+SRFEEQR+S+QYRED+MFASQV+L NQ LKLE+ +L L L S LEEKIA
Sbjct: 466 RPFFSHMWRTLSRFEEQRVSDQYREDMMFASQVYLANQSLKLEYRNLGLCLRSDLEEKIA 525

Query: 509 SAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTH 568
            AGK+KRFNQK TSSFQKEV RLL STG  W+REYA+DGYTVDAVLVD+K+AFEIDGPTH
Sbjct: 526 KAGKSKRFNQKTTSSFQKEVGRLLYSTGHEWVREYAIDGYTVDAVLVDEKLAFEIDGPTH 585

Query: 569 FSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
           FSRN G PLGHT  KRRYI A+GW +VSLS QEWE LQG FEQL+YLR IL
Sbjct: 586 FSRNLGTPLGHTAFKRRYITASGWKLVSLSLQEWENLQGEFEQLEYLRRIL 636


>gi|242056075|ref|XP_002457183.1| hypothetical protein SORBIDRAFT_03g002900 [Sorghum bicolor]
 gi|241929158|gb|EES02303.1| hypothetical protein SORBIDRAFT_03g002900 [Sorghum bicolor]
          Length = 640

 Score =  672 bits (1734), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/553 (62%), Positives = 426/553 (77%), Gaps = 15/553 (2%)

Query: 76  EGMDWCVRARKVALKSIEARGLASSMEDLIK--------VKKKKKKGKKKLEKIKKKNKV 127
           E  DWCVRAR+ AL+SIE RGLA S++ ++            KKK+ KK   ++K++NK 
Sbjct: 89  EATDWCVRARRSALRSIEERGLAPSLQRMVSPPKKKKKKKTAKKKELKKAAAELKRRNKQ 148

Query: 128 TDDDLDFDLEDDMKMDDIMGSGNGYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQF 187
            DD    + +DD  +DD+       +M+DL   V+  A GMF+EKR++  E FV  LS+F
Sbjct: 149 VDDAEGDEDDDDDVVDDLQ------NMDDLELRVAQFADGMFDEKRQRNRETFVQTLSRF 202

Query: 188 SG-PSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAK 246
           S  PSNR KE++LN+ IV AQTA EVL++ AE+ITAV KGLSPSPL+PLNIATALHRIA+
Sbjct: 203 SAAPSNRSKEVSLNRSIVQAQTANEVLDLTAEVITAVAKGLSPSPLTPLNIATALHRIAR 262

Query: 247 NMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEM 306
           NME VSMM THRLAF RQR+MSMLV +AM ALPECS QG+SNIAWALSKIGG+LLYL EM
Sbjct: 263 NMEAVSMMQTHRLAFARQRDMSMLVGLAMVALPECSPQGVSNIAWALSKIGGDLLYLPEM 322

Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVL 366
           DR+A+VA++KV +FN+QNVANVAGAFASM+ SAP LFS LA RA+ I+ TF+EQELAQ L
Sbjct: 323 DRIADVAMSKVQDFNAQNVANVAGAFASMRQSAPGLFSALALRAAQILQTFKEQELAQFL 382

Query: 367 WAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPV 426
           W  ASL E   PLL++LD AF++ T F C ++   S+ +++   +     +   + S+  
Sbjct: 383 WGCASLNECPHPLLDALDTAFQNDTSFQCHVSDLKSSAHQSSAEEELSGGEDGSTSSART 442

Query: 427 LSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQ 486
           L+F+RDQ+GNIAWSYAV+GQMDR FFS +WKT+S+FEEQR+S+QYRED+MFASQV+L NQ
Sbjct: 443 LNFSRDQVGNIAWSYAVIGQMDRPFFSHMWKTLSQFEEQRVSDQYREDMMFASQVYLANQ 502

Query: 487 CLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVD 546
            LKLE+  L L L S LEEK+  AGK+KRFNQK TSSFQKEV RLL STG  W+REYA+D
Sbjct: 503 SLKLEYRDLGLCLRSDLEEKVTKAGKSKRFNQKTTSSFQKEVGRLLYSTGHEWVREYAID 562

Query: 547 GYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQ 606
           GYTVDAVLVD+K+AFEIDGPTHFSRN G PLGHT  KRRYI A+GW +VSLS QEWE+LQ
Sbjct: 563 GYTVDAVLVDEKLAFEIDGPTHFSRNLGTPLGHTAFKRRYITASGWKLVSLSLQEWEDLQ 622

Query: 607 GSFEQLDYLRVIL 619
           G FEQL+YLR IL
Sbjct: 623 GEFEQLEYLRRIL 635


>gi|30089729|gb|AAP20833.1| expressed protein [Oryza sativa Japonica Group]
 gi|108708908|gb|ABF96703.1| expressed protein [Oryza sativa Japonica Group]
 gi|108708909|gb|ABF96704.1| expressed protein [Oryza sativa Japonica Group]
 gi|108708910|gb|ABF96705.1| expressed protein [Oryza sativa Japonica Group]
 gi|125586723|gb|EAZ27387.1| hypothetical protein OsJ_11335 [Oryza sativa Japonica Group]
          Length = 640

 Score =  665 bits (1715), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/552 (60%), Positives = 418/552 (75%), Gaps = 11/552 (1%)

Query: 76  EGMDWCVRARKVALKSIEARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVTDDDLDFD 135
           E  DWCVRAR+ AL+SIEARGL+ S++ ++   KKK K KK  +   K+ K  +     D
Sbjct: 87  ETNDWCVRARRSALRSIEARGLSPSLQRMVASPKKKNKKKKSKKTNLKQKKAAEPKPPRD 146

Query: 136 LEDDMKMDDIMGSG------NGYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFS- 188
            +DD   ++            G +++DL   V+  A GMF+EKR++  E+F+  LS FS 
Sbjct: 147 TDDDEDDEEEADDDLEALLAGGGELDDLELRVAQFADGMFDEKRQRNREQFIQTLSAFSP 206

Query: 189 -GPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKN 247
             PSNR +E++LN+ IV+A+TA EVL + AE++ AV KGLSPSPL+PLNIATALHRIAKN
Sbjct: 207 AAPSNRSQEVSLNRSIVEARTADEVLALTAEVVAAVAKGLSPSPLTPLNIATALHRIAKN 266

Query: 248 MEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMD 307
           ME VSM+ THRL F R R+MSMLV +AM ALPECS QG+SNI+WALSKIGG+LLYL EMD
Sbjct: 267 MEAVSMLQTHRLGFARSRDMSMLVGLAMVALPECSPQGVSNISWALSKIGGDLLYLPEMD 326

Query: 308 RVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLW 367
           R+A+VA+TKV  FN+QNVANVAG+FASM+HSAPDL S L +RA+++V+TF+EQELAQ LW
Sbjct: 327 RIAQVAITKVDSFNAQNVANVAGSFASMRHSAPDLISALTRRAAELVYTFKEQELAQFLW 386

Query: 368 AFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVL 427
             ASL E   PLL++LD A +DA  F C L+  +    ++   ++S   +S  + +   L
Sbjct: 387 GCASLNECPYPLLDALDTACRDAPSFDCHLHDTVPGMWQSSDKEASSLKNSSNAYA---L 443

Query: 428 SFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQC 487
           +F RDQ+GNIAWSYAVLGQMDR FFS IWKT+S+FEE++IS+QYRED+MF SQV+L NQ 
Sbjct: 444 NFTRDQIGNIAWSYAVLGQMDRPFFSGIWKTLSQFEERKISDQYREDMMFVSQVYLANQS 503

Query: 488 LKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDG 547
           LKLE+PHL + L   LEE +   G++KRFNQK+TSSFQKEV RLL STG  W +EY +DG
Sbjct: 504 LKLEYPHLDMCLRGDLEENLTKTGRSKRFNQKMTSSFQKEVGRLLCSTGHEWNKEYTIDG 563

Query: 548 YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQG 607
           YTVDAVLVD+K+AFEIDGP+HFSRN G PLGHT  KRRYIAAAGWN+VSLSHQEWE L+G
Sbjct: 564 YTVDAVLVDEKLAFEIDGPSHFSRNLGTPLGHTAFKRRYIAAAGWNLVSLSHQEWENLEG 623

Query: 608 SFEQLDYLRVIL 619
            FEQL+YLR IL
Sbjct: 624 EFEQLEYLRRIL 635


>gi|125544383|gb|EAY90522.1| hypothetical protein OsI_12123 [Oryza sativa Indica Group]
          Length = 640

 Score =  665 bits (1715), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/552 (60%), Positives = 418/552 (75%), Gaps = 11/552 (1%)

Query: 76  EGMDWCVRARKVALKSIEARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVTDDDLDFD 135
           E  DWCVRAR+ AL+SIEARGL+ S++ ++   KKK K KK  +   K+ K  +     D
Sbjct: 87  ETNDWCVRARRSALRSIEARGLSPSLQRMVASPKKKNKKKKSKKTNLKQKKAAEPKPPRD 146

Query: 136 LEDDMKMDDIMGSG------NGYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFS- 188
            +DD   ++            G +++DL   V+  A GMF+EKR++  E+F+  LS FS 
Sbjct: 147 TDDDEDDEEEADDDLEALLAGGGELDDLELRVAQFADGMFDEKRQRNREQFIQTLSAFSP 206

Query: 189 -GPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKN 247
             PSNR +E++LN+ IV+A+TA EVL + AE++ AV KGLSPSPL+PLNIATALHRIAKN
Sbjct: 207 AAPSNRSQEVSLNRSIVEARTADEVLALTAEVVAAVAKGLSPSPLTPLNIATALHRIAKN 266

Query: 248 MEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMD 307
           ME VSM+ THRL F R R+MSMLV +AM ALPECS QG+SNI+WALSKIGG+LLYL EMD
Sbjct: 267 MEAVSMLQTHRLGFARSRDMSMLVGLAMVALPECSPQGVSNISWALSKIGGDLLYLPEMD 326

Query: 308 RVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLW 367
           R+A+VA+TKV  FN+QNVANVAG+FASM+HSAPDL S L +RA+++V+TF+EQELAQ LW
Sbjct: 327 RIAQVAITKVDSFNAQNVANVAGSFASMRHSAPDLISALTRRAAELVYTFKEQELAQFLW 386

Query: 368 AFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVL 427
             ASL E   PLL++LD A +DA  F C L+  +    ++   ++S   +S  + +   L
Sbjct: 387 GCASLNECPYPLLDALDTACRDAPSFDCHLHDTVPGMWQSSDKEASSLKNSSNAYA---L 443

Query: 428 SFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQC 487
           +F RDQ+GNIAWSYAVLGQMDR FFS IWKT+S+FEE++IS+QYRED+MF SQV+L NQ 
Sbjct: 444 NFTRDQIGNIAWSYAVLGQMDRPFFSGIWKTLSQFEERKISDQYREDMMFVSQVYLANQS 503

Query: 488 LKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDG 547
           LKLE+PHL + L   LEE +   G++KRFNQK+TSSFQKEV RLL STG  W +EY +DG
Sbjct: 504 LKLEYPHLDMCLRGDLEENLTKTGRSKRFNQKMTSSFQKEVGRLLCSTGHEWNKEYTIDG 563

Query: 548 YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQG 607
           YTVDAVLVD+K+AFEIDGP+HFSRN G PLGHT  KRRYIAAAGWN+VSLSHQEWE L+G
Sbjct: 564 YTVDAVLVDEKLAFEIDGPSHFSRNLGTPLGHTAFKRRYIAAAGWNLVSLSHQEWENLEG 623

Query: 608 SFEQLDYLRVIL 619
            FEQL+YLR IL
Sbjct: 624 EFEQLEYLRRIL 635


>gi|357161383|ref|XP_003579073.1| PREDICTED: uncharacterized protein LOC100844423 [Brachypodium
           distachyon]
          Length = 614

 Score =  664 bits (1714), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/588 (59%), Positives = 428/588 (72%), Gaps = 32/588 (5%)

Query: 36  KESEDSVDWESEFLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKSIEAR 95
           KE + +  W+ +FLG         P+ R  +E+       E  DWCVRAR+ AL+SIEAR
Sbjct: 50  KEDDATPQWQLDFLGP-------HPQPRPDEEEDDDPLPAESTDWCVRARRSALRSIEAR 102

Query: 96  GLASSMEDLIKVKKK--KKKGKKKLEKIKKKNKVTDDDLDFDLEDDMKMDDIMGSGNGYD 153
           GL+ S++ ++   KK    K +KK +KI  K K  +D+L  D ED+M  D +        
Sbjct: 103 GLSPSLQRMVSPPKKISNNKKRKKQKKILDKKKKKNDELT-DEEDEMDSDAVP------- 154

Query: 154 MNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSG--PSNRRKEINLNKDIVDAQTAQE 211
            +DL   V+ +A G+F+EKR++  E F+  LS FS   PSNR KE++LN+DIV A+TA+E
Sbjct: 155 -DDLDHRVAQLADGVFDEKRQRNRELFIQTLSSFSAAQPSNRSKEVSLNRDIVQARTAEE 213

Query: 212 VLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLV 271
           VL + AE++ AV KGLSPSPL+PLNIATALHRIAKNME VSM  THRLAF RQR+MSMLV
Sbjct: 214 VLALTAEVMAAVAKGLSPSPLTPLNIATALHRIAKNMETVSMTQTHRLAFARQRDMSMLV 273

Query: 272 AIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGA 331
            +AM +LPECS QG+SNI+WALSKIGG+LLYL EMDR+A+VA++KV +FN+QNVANVAGA
Sbjct: 274 GLAMLSLPECSPQGVSNISWALSKIGGDLLYLPEMDRIAKVAISKVDDFNAQNVANVAGA 333

Query: 332 FASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDAT 391
           FASM+ SAP LF  LA+RA+ +V+TF+EQELAQ LW  ASL E   PLL++LD AF+D  
Sbjct: 334 FASMRQSAPALFLALAQRAAQLVYTFKEQELAQFLWGCASLNECPYPLLDALDAAFQDGL 393

Query: 392 QFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIF 451
                    +S+  +     +S   D   + +   LSF+RDQLGNIAWSY VLGQ+DR F
Sbjct: 394 ---------VSDMRQTSAKDASSGEDVSNAHA---LSFSRDQLGNIAWSYTVLGQIDRQF 441

Query: 452 FSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAG 511
           FS IWKT+ ++EEQR+S+QYREDIMFASQV+L NQ +KLE+PHL  AL   LEEKI  AG
Sbjct: 442 FSHIWKTLKQYEEQRVSDQYREDIMFASQVYLANQSVKLEYPHLDFALRGDLEEKITKAG 501

Query: 512 KTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSR 571
           K+KRFNQK TSSFQKEV  LL  TG  WIREY VDGYT+DAVLVD+KVA EIDG THFSR
Sbjct: 502 KSKRFNQKTTSSFQKEVGHLLYITGHEWIREYTVDGYTLDAVLVDEKVALEIDGTTHFSR 561

Query: 572 NTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
           N G PLGHT LKRRYI  AGW +VSLSHQEWEELQG  EQ++YLR IL
Sbjct: 562 NLGTPLGHTALKRRYITTAGWKLVSLSHQEWEELQGESEQMEYLRRIL 609


>gi|115453599|ref|NP_001050400.1| Os03g0425000 [Oryza sativa Japonica Group]
 gi|113548871|dbj|BAF12314.1| Os03g0425000, partial [Oryza sativa Japonica Group]
          Length = 615

 Score =  664 bits (1712), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/552 (60%), Positives = 418/552 (75%), Gaps = 11/552 (1%)

Query: 76  EGMDWCVRARKVALKSIEARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVTDDDLDFD 135
           E  DWCVRAR+ AL+SIEARGL+ S++ ++   KKK K KK  +   K+ K  +     D
Sbjct: 62  ETNDWCVRARRSALRSIEARGLSPSLQRMVASPKKKNKKKKSKKTNLKQKKAAEPKPPRD 121

Query: 136 LEDDMKMDDIMGSG------NGYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFS- 188
            +DD   ++            G +++DL   V+  A GMF+EKR++  E+F+  LS FS 
Sbjct: 122 TDDDEDDEEEADDDLEALLAGGGELDDLELRVAQFADGMFDEKRQRNREQFIQTLSAFSP 181

Query: 189 -GPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKN 247
             PSNR +E++LN+ IV+A+TA EVL + AE++ AV KGLSPSPL+PLNIATALHRIAKN
Sbjct: 182 AAPSNRSQEVSLNRSIVEARTADEVLALTAEVVAAVAKGLSPSPLTPLNIATALHRIAKN 241

Query: 248 MEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMD 307
           ME VSM+ THRL F R R+MSMLV +AM ALPECS QG+SNI+WALSKIGG+LLYL EMD
Sbjct: 242 MEAVSMLQTHRLGFARSRDMSMLVGLAMVALPECSPQGVSNISWALSKIGGDLLYLPEMD 301

Query: 308 RVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLW 367
           R+A+VA+TKV  FN+QNVANVAG+FASM+HSAPDL S L +RA+++V+TF+EQELAQ LW
Sbjct: 302 RIAQVAITKVDSFNAQNVANVAGSFASMRHSAPDLISALTRRAAELVYTFKEQELAQFLW 361

Query: 368 AFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVL 427
             ASL E   PLL++LD A +DA  F C L+  +    ++   ++S   +S  + +   L
Sbjct: 362 GCASLNECPYPLLDALDTACRDAPSFDCHLHDTVPGMWQSSDKEASSLKNSSNAYA---L 418

Query: 428 SFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQC 487
           +F RDQ+GNIAWSYAVLGQMDR FFS IWKT+S+FEE++IS+QYRED+MF SQV+L NQ 
Sbjct: 419 NFTRDQIGNIAWSYAVLGQMDRPFFSGIWKTLSQFEERKISDQYREDMMFVSQVYLANQS 478

Query: 488 LKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDG 547
           LKLE+PHL + L   LEE +   G++KRFNQK+TSSFQKEV RLL STG  W +EY +DG
Sbjct: 479 LKLEYPHLDMCLRGDLEENLTKTGRSKRFNQKMTSSFQKEVGRLLCSTGHEWNKEYTIDG 538

Query: 548 YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQG 607
           YTVDAVLVD+K+AFEIDGP+HFSRN G PLGHT  KRRYIAAAGWN+VSLSHQEWE L+G
Sbjct: 539 YTVDAVLVDEKLAFEIDGPSHFSRNLGTPLGHTAFKRRYIAAAGWNLVSLSHQEWENLEG 598

Query: 608 SFEQLDYLRVIL 619
            FEQL+YLR IL
Sbjct: 599 EFEQLEYLRRIL 610


>gi|168040935|ref|XP_001772948.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675681|gb|EDQ62173.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 453

 Score =  343 bits (881), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 193/456 (42%), Positives = 269/456 (58%), Gaps = 34/456 (7%)

Query: 173 REKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPS-- 230
           ++  +E    +  +++GP+  R+E  LN+ IV+A  A+ VL  I E +     G  P   
Sbjct: 27  QDTPVERVASKEKEWTGPNQYREERRLNRAIVEAPDAEYVLATIIEALNKPHWG-KPRKI 85

Query: 231 PLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIA 290
           PLSPLN AT LHRIAK M++ SM  + +L F R++EM   +  A+ A PECSAQG++NIA
Sbjct: 86  PLSPLNCATGLHRIAKRMDEASMWKSEKLTFARRQEMKAFLRAAVKAFPECSAQGLANIA 145

Query: 291 WALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRA 350
           WALSKIG   L+  EMD +A+ AL K+ EFN+QN+AN AGAFASM H+AP LF  +A+RA
Sbjct: 146 WALSKIGSSALFEEEMDHLADAALDKLSEFNAQNLANTAGAFASMLHAAPALFDAIAQRA 205

Query: 351 SDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGV 410
            ++  +F+  EL Q+LWAFA L  P DPL +SLD                         V
Sbjct: 206 VEVAGSFRPLELVQILWAFACLNHPLDPLFDSLD-------------------------V 240

Query: 411 KSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQ 470
           +   + D+  +       F++ QL ++AWS AVL Q +R +F  +WK ++       SE 
Sbjct: 241 QLVENPDAAAAT---FRGFSQQQLASMAWSCAVLQQQERPWFISLWKCVNSRATTWTSEA 297

Query: 471 YRED--IMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEV 528
            R+   +    Q++  N  LKLE   L L     LE  +  A + ++   K++S   +EV
Sbjct: 298 DRKPKGVQHMCQLYQANLALKLECADLALTTEKELEIMLEEAWEKEKAANKLSSGDHREV 357

Query: 529 ARLLVST-GLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYI 587
            RLLVST G  W+ EY    Y++D  LVD +VA EIDGPTHFSRNTG+ LGHT+LKRR +
Sbjct: 358 DRLLVSTTGRAWVSEYEGAPYSLDLALVDARVAIEIDGPTHFSRNTGILLGHTVLKRRLL 417

Query: 588 AAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYI 623
            +AGW V  +  QEWEEL+G  E+  +LR +L+  I
Sbjct: 418 RSAGWTVFPIPFQEWEELRGEQERALFLRTLLEGSI 453


>gi|295829058|gb|ADG38198.1| AT2G31890-like protein [Capsella grandiflora]
          Length = 164

 Score =  269 bits (687), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 129/154 (83%), Positives = 140/154 (90%)

Query: 179 EFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIA 238
           +   RLSQFSGPS+R KEINLNK I++AQTA+EVLEV AE I AV KGLSPSPLSPLNIA
Sbjct: 11  QLAQRLSQFSGPSDRMKEINLNKAIIEAQTAEEVLEVTAEXIMAVAKGLSPSPLSPLNIA 70

Query: 239 TALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGG 298
           TALHRIAKNMEKVSMM T RLAF RQREMSMLVA+AMT LPECSAQGISNI+WALSKIGG
Sbjct: 71  TALHRIAKNMEKVSMMRTRRLAFARQREMSMLVALAMTCLPECSAQGISNISWALSKIGG 130

Query: 299 ELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAF 332
           ELLYL+EMDRVAEVA +KVG+FNSQNVAN+AGAF
Sbjct: 131 ELLYLTEMDRVAEVAXSKVGDFNSQNVANIAGAF 164


>gi|295829052|gb|ADG38195.1| AT2G31890-like protein [Capsella grandiflora]
 gi|295829054|gb|ADG38196.1| AT2G31890-like protein [Capsella grandiflora]
 gi|295829056|gb|ADG38197.1| AT2G31890-like protein [Capsella grandiflora]
 gi|295829060|gb|ADG38199.1| AT2G31890-like protein [Capsella grandiflora]
 gi|295829062|gb|ADG38200.1| AT2G31890-like protein [Neslia paniculata]
 gi|345289971|gb|AEN81477.1| AT2G31890-like protein, partial [Capsella rubella]
 gi|345289973|gb|AEN81478.1| AT2G31890-like protein, partial [Capsella rubella]
 gi|345289975|gb|AEN81479.1| AT2G31890-like protein, partial [Capsella rubella]
 gi|345289977|gb|AEN81480.1| AT2G31890-like protein, partial [Capsella rubella]
 gi|345289979|gb|AEN81481.1| AT2G31890-like protein, partial [Capsella rubella]
 gi|345289981|gb|AEN81482.1| AT2G31890-like protein, partial [Capsella rubella]
 gi|345289983|gb|AEN81483.1| AT2G31890-like protein, partial [Capsella rubella]
 gi|345289985|gb|AEN81484.1| AT2G31890-like protein, partial [Capsella rubella]
          Length = 164

 Score =  268 bits (686), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 129/154 (83%), Positives = 140/154 (90%)

Query: 179 EFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIA 238
           +   RLSQFSGPS+R KEINLNK I++AQTA+EVLEV AE I AV KGLSPSPLSPLNIA
Sbjct: 11  QLAQRLSQFSGPSDRMKEINLNKAIIEAQTAEEVLEVTAETIMAVAKGLSPSPLSPLNIA 70

Query: 239 TALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGG 298
           TALHRIAKNMEKVSMM T RLAF RQREMSMLVA+AMT LPECSAQGISNI+WALSKIGG
Sbjct: 71  TALHRIAKNMEKVSMMRTRRLAFARQREMSMLVALAMTCLPECSAQGISNISWALSKIGG 130

Query: 299 ELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAF 332
           ELLYL+EMDRVAEVA +KVG+FNSQNVAN+AGAF
Sbjct: 131 ELLYLTEMDRVAEVATSKVGDFNSQNVANIAGAF 164


>gi|302780623|ref|XP_002972086.1| hypothetical protein SELMODRAFT_412577 [Selaginella moellendorffii]
 gi|300160385|gb|EFJ27003.1| hypothetical protein SELMODRAFT_412577 [Selaginella moellendorffii]
          Length = 296

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 89/190 (46%), Positives = 115/190 (60%), Gaps = 17/190 (8%)

Query: 199 LNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHR 258
           L  D+VD++  + VLE I  +     KG     LS +N+ATALH+I      +SM    R
Sbjct: 78  LTVDLVDSRDVEGVLETIERV-----KG--RFRLSSINVATALHKIVT----LSMSEARR 126

Query: 259 LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVG 318
           L +  Q +++ LVA AM  LPEC+AQG+SNIAWA+SKIGG LLY  EM+ +A  A+ KV 
Sbjct: 127 LKYAMQCDVAELVASAMELLPECNAQGVSNIAWAISKIGGHLLYHGEMEIIARAAVAKVD 186

Query: 319 EFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADP 378
           EFN QN+ANVAG FASMQHS+P LF +L   AS  V +           A   + +P D 
Sbjct: 187 EFNPQNIANVAGTFASMQHSSPALFEKLLDAASRGVSSTGTGP------ASLGMAQPLDS 240

Query: 379 LLESLDNAFK 388
            LESLD A +
Sbjct: 241 FLESLDAALQ 250


>gi|412993721|emb|CCO14232.1| predicted protein [Bathycoccus prasinos]
          Length = 590

 Score =  112 bits (280), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 113/412 (27%), Positives = 179/412 (43%), Gaps = 37/412 (8%)

Query: 232 LSPLNIATALHRIAKN-MEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQ----GI 286
           +SP NIA  + +I  N ++ V M    R    R    + LV + + A  + S +     +
Sbjct: 172 VSP-NIAGKMLQILGNKVQSVKMDRFERAGIRRDPRFAHLVGLTVAAARQNSEEFKTSAV 230

Query: 287 SNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSEL 346
               W L+ + GE    +EM+ ++  A   V E   ++V NVA A AS +H+   LFS +
Sbjct: 231 CQAIWGLAVVSGEAANAAEMEVLSNRAARSVVEMKPKDVTNVAWALASCRHANEGLFSAI 290

Query: 347 AKRASDI-VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCN 405
            + A    +  F   ++  + WA A L    D +++ +               K  SN  
Sbjct: 291 NEYAEQGGLKGFDSFKITTLCWATAHLQMDGDGIIKGV--------------AKWASN-- 334

Query: 406 ENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTI-SRFEE 464
                 + G  + E      V      QL  ++WS   L + D    SDI KT+ S    
Sbjct: 335 ------APGSNEGEDGTQQTVNKLKGAQLCTLSWSLVNL-RNDVGLNSDILKTVWSHVCS 387

Query: 465 QRISEQYREDIMFA----SQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKV 520
           Q   +++ ED        +Q++     +     +    L   L EK ++A   +R    V
Sbjct: 388 QEGIKKFMEDDSIRGRDLNQLYQTAMAISSSDTNKNATLPDALMEKCSNAWAEQR-RPPV 446

Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNT-GVPLGH 579
            S FQ++VA +L   G  +  E  V GY VD +L    V  E+DGP+HF+RN     LG 
Sbjct: 447 ISWFQRDVAAILSYMGEKYEEEAIVAGYRVDVLLESIGVVLEVDGPSHFARNVKDHALGQ 506

Query: 580 TMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIGGEGSSNI 631
           T LKR  + AAG+ +  ++  EW+ L    ++ DY+R  L     GE   +I
Sbjct: 507 TNLKRNLLKAAGYKIFPIAVTEWDLLFNVEDKSDYVRAGLDALANGEDIPDI 558


>gi|255075447|ref|XP_002501398.1| hypothetical protein MICPUN_100065 [Micromonas sp. RCC299]
 gi|226516662|gb|ACO62656.1| hypothetical protein MICPUN_100065 [Micromonas sp. RCC299]
          Length = 571

 Score =  109 bits (272), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 104/378 (27%), Positives = 175/378 (46%), Gaps = 46/378 (12%)

Query: 267 MSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVA 326
           + M VA A       S   ++N AWA+  I  E    +EM+ +A  A     + + + +A
Sbjct: 204 LGMCVAAARRGSDALSPVSVANAAWAVGVISTERANSAEMEVLAARAAQVTEDISKRGIA 263

Query: 327 NVAGAFASMQHSAPDLFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPADPLLESLDN 385
           ++A A AS +H++ +LF ++  RA+   +  F+  +++ +++AFA L   AD  LE LD 
Sbjct: 264 DLAWALASCRHASEELFQQIGIRAAVTGLKGFKAFDISTLVYAFAHLGHGADGFLEGLDQ 323

Query: 386 AFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLG 445
            F                    G  +  G   ++ + +    SF    L N AWS AV+G
Sbjct: 324 WFA------------------GGAEEDVGKEAADANAAKMAASFTAHPLVNTAWSLAVIG 365

Query: 446 --QMDRIFFSDIWKTISRFEEQRISEQYRED-------IMFAS----QVHLVNQCL-KLE 491
              +    F+ +W  I    E   +E    D       I + S     ++ +NQ +  +E
Sbjct: 366 GDALRSRAFAALWGEICARGEAAAAEGATVDPSLDGDRIQYGSWKGKNLNQINQAIVAVE 425

Query: 492 HPHLQLALS-SVLEEKIASAGKTKRFNQK---VTSSFQKEVARLLVSTGLNWIREYAVDG 547
                 AL+     + + +A ++    Q+   V S +Q++VA +L   G     E    G
Sbjct: 426 SAGGAEALALRPAPDSLTAAAESAWMAQRRPPVVSWYQRDVASILSYMGEKHEEEAVCGG 485

Query: 548 YTVDAVLVDK--------KVAFEIDGPTHFSRNTG-VPLGHTMLKRRYIAAAGWNVVSLS 598
           Y VD ++ +          +A E+DGP+HF+RN   + LG T LK R +   G +VVS+S
Sbjct: 486 YRVDLLVPNPVGVPQQSGGIAIEVDGPSHFARNDPELALGQTRLKHRQLRHLGMSVVSVS 545

Query: 599 HQEWEELQGSFEQLDYLR 616
             EWE L+ + E+++YLR
Sbjct: 546 VAEWEYLESAEEKVEYLR 563


>gi|145349861|ref|XP_001419345.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579576|gb|ABO97638.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 554

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 125/504 (24%), Positives = 208/504 (41%), Gaps = 56/504 (11%)

Query: 156 DLRRTVSMMAGGMFEE----KREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQE 211
           D  RT ++    M EE    K E ++EE +   +    P     + ++ K+   A  AQ 
Sbjct: 40  DRARTAAIRGYEMDEEGNYIKPEPSVEELLRGTAWEMDPRQDATQFSMTKEEWKAVKAQA 99

Query: 212 VLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLV 271
                      + +      +SP   A+ L  IA+  +             R   ++ ++
Sbjct: 100 RTATYPHDAVHIFENAGLRRISPEMAASMLKLIAQKAQHSRCDREELAGLRRDPRVAHMI 159

Query: 272 AIAMTA-------LPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQN 324
              + A       LP   A+ ++   WAL  I GE    +E++ +A  A   + + +   
Sbjct: 160 GTCVAAARAKSDTLP---AEEVAKCCWALGVIAGERANSAELEVLANRASELMKKLSPDE 216

Query: 325 VANVAGAFASMQHSAPDLFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPADPLLESL 383
           +A+++ + A  +HS+   F EL   A+      FQ  ++  V WAFA L       L+ +
Sbjct: 217 IADISWSLAISRHSSERFFHELDVHAAMTGFKGFQAYQITTVAWAFAHLGHSHAGFLDGI 276

Query: 384 DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
           D     A       NK L+                + +  + V  FN   L ++AWS+ V
Sbjct: 277 DVWVARAP----ARNKDLT---------------PQQAAEAQVHRFNATILASLAWSFCV 317

Query: 444 L-GQMDRIFFSDIW-KTISRFEEQRISEQYREDIMFASQVHLVNQCL------KLEHPHL 495
           +   +D +FF  +W + I+R E        +E+   +   H            +L   H 
Sbjct: 318 MEDALDSLFFRTLWAEIITRGEHDAQMVHEKENTAASMDEHHNTNVFGPWKGRQLNQLH- 376

Query: 496 QLALSSV------LEEKIASAGKTKRFNQK---VTSSFQKEVARLLVSTGLNWIREYAVD 546
           Q A+++V      L  ++ +A  T    Q    V S FQ++V  +L   G     E  V 
Sbjct: 377 QAAITAVRAGFDPLPTELGAAADTAWNTQNRPPVVSWFQRDVGAILSYMGEKHEEEALVS 436

Query: 547 GYTVDAVLVDKK---VAFEIDGPTHFSRN-TGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
           GY  D +L D K   V  E+DGP+HF+RN   + LG T LK+R +   G+ V  +   EW
Sbjct: 437 GYRCDLLLPDAKPTGVVIEVDGPSHFARNDRKLALGQTRLKQRQLEGEGFAVFPIPIFEW 496

Query: 603 EELQGSFEQLDYLRVILKDYIGGE 626
           + L+ + ++ DYLR  L     GE
Sbjct: 497 DYLEDAQQKSDYLRAGLDAIERGE 520


>gi|303279190|ref|XP_003058888.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460048|gb|EEH57343.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 594

 Score =  105 bits (262), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 126/464 (27%), Positives = 207/464 (44%), Gaps = 74/464 (15%)

Query: 195 KEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMM 254
           KE+  N +  D QTA +  E       A  + +SP      +IA  + ++  ++ K +  
Sbjct: 147 KEVKANAN--DPQTALQAFE------EAGLRRVSP------DIAAGMLKMIADVAKKART 192

Query: 255 TTHRLAFTRQRE-----MSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRV 309
               LA  R+       +   VA A       +   +S  AW+L+ I GE    +EM+ +
Sbjct: 193 DREELAGLRRDSRVAHLLGTCVAAARRNSDALTPNKLSAAAWSLAIISGERANSAEMEVL 252

Query: 310 AEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKR-ASDIVHTFQEQELAQVLWA 368
           AE A   V E   +  A++A A AS +H++P  F+ L  R A++ +  F+  +++ ++WA
Sbjct: 253 AERAALVVSEMKPRACADLAWALASCRHASPAFFNGLDVRFATEGLKKFKVFDVSTLVWA 312

Query: 369 FASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLS 428
           FA L   +D L + L++ F  A+                   +S  DAD+  + S+    
Sbjct: 313 FAHLGHGSDGLRDGLEDWFVGASS------------------ESVSDADAAAAASALAKK 354

Query: 429 FNRDQLGNIAWSYAVLG--QMDRIFFSDIWKTISRF----------EEQRISEQYREDIM 476
           F    L   AWS +V+G   M    F  +W  I R            ++ +  +  + I+
Sbjct: 355 FTPQALVTTAWSLSVIGAEAMRSRAFKALWGEIGRLGGEVNDADAVAKEALLAESGDKIV 414

Query: 477 FAS----QVHLVNQC-LKLEHPHLQLALS-SVLEEKIASAGKTKRFNQK---VTSSFQKE 527
           F       ++ +NQC + ++      AL  + L E +  A       Q+   V S +Q++
Sbjct: 415 FGPWRGKHLNQINQCVVSVDACGGCDALGLAPLAEPLRVAASNAWMAQRRPPVVSWYQRD 474

Query: 528 VARLLVSTGLNWIREYAVDGYTVD-----AVLVD---------KKVAFEIDGPTHFSRNT 573
           VA +L   G     E    GY VD      + +D           VA E+DGP+HF+RN 
Sbjct: 475 VASILSYMGEKHEEEAVCAGYRVDLHIPKPIGIDDATHKAAARAGVAVEVDGPSHFARND 534

Query: 574 G-VPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
               LG T LK R + + G+ VVS+   EWE L+ S E+++YLR
Sbjct: 535 ATTSLGQTRLKHRQLRSLGFAVVSVPVSEWEYLETSEEKVEYLR 578


>gi|302832295|ref|XP_002947712.1| hypothetical protein VOLCADRAFT_87862 [Volvox carteri f. nagariensis]
 gi|300267060|gb|EFJ51245.1| hypothetical protein VOLCADRAFT_87862 [Volvox carteri f. nagariensis]
          Length = 1281

 Score =  105 bits (261), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 102/369 (27%), Positives = 162/369 (43%), Gaps = 30/369 (8%)

Query: 263  RQREMSMLVAIAMTALPECSA---QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGE 319
            R  E SML + A   L + ++   QG+SN AWA +++G     L     ++  AL K+  
Sbjct: 645  RSYEHSMLSSWAAQTLDKLASFEPQGVSNTAWAFARLGFHSPQL--FQALSAAALHKIEG 702

Query: 320  FNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPL 379
            F +Q ++N+A A A+  H  P LF  LA+ A+ +  +F  Q  +  LWA A+L    D L
Sbjct: 703  FTAQGLSNLAWAMATAGHVQPRLFEALARHATSLAPSFNAQNCSVTLWACATLRHHDDEL 762

Query: 380  LESLDNAFKDATQFTCC----LNKALSNCNENGGVKSSGDADSEGSLSSPVLS-FNRDQL 434
              +L    +   +   C    +  AL      G       A      +S +L   N+ +L
Sbjct: 763  FNALLE--RLVAEVDTCEPQNVANALWAVARMGHPLPRERAAPLVCHASRLLGRMNQQEL 820

Query: 435  GNIAWSYAVLGQMDRIFFSDIWKTISRFEE---QRISEQYREDIMFASQV------HLVN 485
             N  W+ A L  MD I F+     + R  +   + + + Y   +M+ S +       L  
Sbjct: 821  CNTMWAVACLDLMDEILFATFCSCLQRLADISPEGMHQAYHAQLMYHSSLARRAGMSLAQ 880

Query: 486  -QCLKLEHPHLQLALSSVLEEK---IASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIR 541
             Q L   +P   L L   L E    +A++           S F +EV+  L   G+    
Sbjct: 881  LQQLAASNPPASLGLLPCLSEPLRTVAASMWAASARDVHVSRFHQEVSGALAGAGVPHAL 940

Query: 542  EYAVD--GYTVDAVLV--DKKVAFEIDGPTHFSRNTG-VPLGHTMLKRRYIAAAGWNVVS 596
            E+  D   ++VD  L    K VA E++G  H++ N     LG T ++RR +   GW+VV 
Sbjct: 941  EWMTDDQHFSVDIGLQVNSKPVAVEVNGSHHYASNAPHRALGDTAVRRRMLEDRGWHVVD 1000

Query: 597  LSHQEWEEL 605
            +   EWE +
Sbjct: 1001 VGFAEWEAM 1009



 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 55/184 (29%), Positives = 85/184 (46%), Gaps = 20/184 (10%)

Query: 283 AQGISNIAWALSKIGGELLYLSE---MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
           A+G++N AWA     G+L Y+        +A+ AL ++ EF+ QN++N+  +F  M H+ 
Sbjct: 363 ARGLANSAWAF----GKLKYVPSGGLPSVIAQAALRRMPEFSPQNLSNLVWSFVYMHHAD 418

Query: 340 PDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLN- 398
             L S  ++     V  F+ QELA ++WAFASL    D +L     A K A +       
Sbjct: 419 EVLLSAASRFVCARVGEFKPQELANIVWAFASLGHRDDQMLHV---AAKQAQRIAPLFKE 475

Query: 399 KALSNCNENGGVKSSGDADS-------EGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIF 451
           + LSN     G  S  D          E  +  P  +F    + N+AW+ A +G  D  F
Sbjct: 476 QELSNMLWALGKMSLRDQPQVLEALMEETRVKLP--AFLPQGISNVAWALASVGHPDMQF 533

Query: 452 FSDI 455
              +
Sbjct: 534 LDQV 537



 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 49/98 (50%), Gaps = 11/98 (11%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVAL----TKVGEFNSQNVANVAGAFASMQHSA 339
           Q +SN+ WAL K+      L +  +V E  +     K+  F  Q ++NVA A AS+ H  
Sbjct: 476 QELSNMLWALGKMS-----LRDQPQVLEALMEETRVKLPAFLPQGISNVAWALASVGHPD 530

Query: 340 PDLFSELAKRASDIVHTFQEQELAQVLWAFASL--YEP 375
                ++  +  + +  F  Q LA ++WA ASL  Y+P
Sbjct: 531 MQFLDQVVAQCGNQLAAFDVQALANLVWAMASLGYYKP 568



 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 41/153 (26%), Positives = 58/153 (37%), Gaps = 40/153 (26%)

Query: 258 RLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKV 317
           RL F   +    L A A+  +   +AQG+SN+AWA++  G     L E   +A  A +  
Sbjct: 680 RLGFHSPQLFQALSAAALHKIEGFTAQGLSNLAWAMATAGHVQPRLFEA--LARHATSLA 737

Query: 318 GEFNSQN-------------------------------------VANVAGAFASMQHSAP 340
             FN+QN                                     VAN   A A M H  P
Sbjct: 738 PSFNAQNCSVTLWACATLRHHDDELFNALLERLVAEVDTCEPQNVANALWAVARMGHPLP 797

Query: 341 -DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
            +  + L   AS ++    +QEL   +WA A L
Sbjct: 798 RERAAPLVCHASRLLGRMNQQELCNTMWAVACL 830


>gi|384250651|gb|EIE24130.1| hypothetical protein COCSUDRAFT_47154 [Coccomyxa subellipsoidea
           C-169]
          Length = 1093

 Score =  101 bits (252), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 88/344 (25%), Positives = 158/344 (45%), Gaps = 47/344 (13%)

Query: 267 MSMLVAIAMTA-LPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNV 325
           MS +VA AM      C+ Q ISN  WA +K+       + +D  A  A  ++ EF+ QN+
Sbjct: 588 MSRVVANAMAERASNCNPQEISNTVWAYAKL--RFYDAAVLDTFANEATRRIEEFSQQNL 645

Query: 326 ANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDN 385
           AN+A A   + H    L   +A+ A+ +V     Q ++ +LW +AS       L  ++ +
Sbjct: 646 ANLAWAMGKLSHFHEGLLDAIAEHATAMVQDLSLQHVSNILWTYASFLH----LKPAMTS 701

Query: 386 AFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLG 445
           AF                    G ++   + ++          FN  QL N+ WS  +  
Sbjct: 702 AFV-------------------GEIERRLNTEA----------FNPQQLSNLLWSLCI-- 730

Query: 446 QMDRIFFSDIWKTI-SRFEEQRISEQ-YREDIMFASQVHLVNQCLKLEHPHLQLALSSVL 503
               +   +IWK I ++ E   I+ +   E+ +  +Q++     ++++ P LQL + + L
Sbjct: 731 --AELCSEEIWKGIMAQIETLGIAAKDLPEEAL--TQIYQAYLLMRVDRPQLQLTMPAQL 786

Query: 504 EEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDG--YTVDAVLVDKKVAF 561
                        N ++ S+  ++VAR+L   G+    E+  +   ++VD  L ++K+A 
Sbjct: 787 LPAAHHTWLESCKNVRI-SALHRDVARVLTEHGIPHNIEHVTEDELFSVDIALPEEKIAI 845

Query: 562 EIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL 605
           E+DGP HF+ NT    G  + +++ + A GW V+S+    W  L
Sbjct: 846 EVDGPHHFTANTLAVTGEMLARQKLLKARGWAVISVPFFRWSGL 889



 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 75/270 (27%), Positives = 131/270 (48%), Gaps = 31/270 (11%)

Query: 185 SQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRI 244
           + F+GP    + +++NK I  AQ+A+ V+ V+ + +              + +ATALH +
Sbjct: 179 TNFAGPV--PECVHINKRITAAQSAEAVIGVVQQELDKFDA---------VCMATALHTL 227

Query: 245 AKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIG---GELL 301
           A              A   + E+  L+ +  T L + +A+ +SN  WAL+K+G   GE +
Sbjct: 228 ASMRASAQQYA----ALFERPEVLRLMHVIGTRLTDFTARNLSNSLWALAKMGHNPGEAM 283

Query: 302 YLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHS-APDLFSELAKRASDIVHTFQEQ 360
            L+ M   AEVA  K+   N+QN+AN+A ++A++ H+   +L   +A +A   +  F  Q
Sbjct: 284 -LNAM--AAEVA-KKLDGCNAQNLANIAWSYATLSHTPGEELLEAIAVKAQKKLAEFSSQ 339

Query: 361 ELAQVLWAFASL-YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSE 419
            ++ +L+AFA L ++P+  L ++   A      FT    +ALSN         + D +  
Sbjct: 340 NISNLLYAFAKLEHKPSTFLEQASRAAMPILGSFT---PQALSNTVWALSKLDTLDEELF 396

Query: 420 GSLSSPVLS----FNRDQLGNIAWSYAVLG 445
            ++   VL     FN   + N  W +A L 
Sbjct: 397 IAIVQQVLGKLTRFNAQNVANTVWGFANLA 426



 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 53/206 (25%), Positives = 92/206 (44%), Gaps = 26/206 (12%)

Query: 282 SAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHS 338
           +AQ ++N  W  + +    G+ L+    D VA+  +  + E++ QN+ANV  ++A M   
Sbjct: 411 NAQNVANTVWGFANLAFDPGQPLW----DAVAQNGIYTMHEYSPQNIANVLWSYAKMGKR 466

Query: 339 APDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESL-DNAFKDATQFT-- 394
              L +  +  A+  + TFQ Q +A   WA+A+L   P+   L +L ++A     QF+  
Sbjct: 467 YEALLTAASAHAAHTMSTFQPQSVANFCWAYATLNVAPSSQCLTALAEHANHTLMQFSPQ 526

Query: 395 -------CCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVL--- 444
                          +    G V S   A   G+ +S   +F+R  L N+ W++A L   
Sbjct: 527 NISNTAWALATLQFKHMGLMGNVASEVTARLSGAEAS---AFSRQHLANLIWAFATLELD 583

Query: 445 --GQMDRIFFSDIWKTISRFEEQRIS 468
               M R+  + + +  S    Q IS
Sbjct: 584 PGAAMSRVVANAMAERASNCNPQEIS 609


>gi|308806908|ref|XP_003080765.1| unnamed protein product [Ostreococcus tauri]
 gi|116059226|emb|CAL54933.1| unnamed protein product [Ostreococcus tauri]
          Length = 652

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 113/493 (22%), Positives = 210/493 (42%), Gaps = 51/493 (10%)

Query: 156 DLRRTVSMMAGGMFE----EKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQE 211
           D +RT ++    M E    +K E +++E +   +    P+    + ++  D   A  A+ 
Sbjct: 142 DRQRTAAVRGYEMDEDGNWQKPEPSVDELLRGTAWEMDPTKDATQFSMTTDEWKAVKAEA 201

Query: 212 VLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLV 271
              +      +V +      ++P   A+ L  IA+  +   +         R   ++ ++
Sbjct: 202 RTVMYPHDAVSVFEKAGLRRINPEMAASMLKVIAQKAQNSRVDREELAGLRRDPRVAHMI 261

Query: 272 AIAMTA-------LPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQN 324
            + ++A       LP   A+ ++   WAL  I GE    +E++ +++ A   + +F+S  
Sbjct: 262 GVCVSAARAKSDMLP---AEEVAKACWALGVIAGERANSAELEVLSDRAADLIVKFSSDE 318

Query: 325 VANVAGAFASMQHSAPDLFSEL-AKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESL 383
           +A++  + AS +  +  L     A +A   +  FQ  +L  V WAFA L       +E L
Sbjct: 319 IADICWSLASSRQGSTFLRQYTHANQALTGLKGFQAYQLTTVAWAFAHLGHKHTGFVEGL 378

Query: 384 DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
           D     A      ++ A +             AD++      +  FN   L ++AWS+ V
Sbjct: 379 DIWVTRAPARAKTMSPAEA-------------ADAQ------IHRFNATILASLAWSFCV 419

Query: 444 L-GQMDRIFFSDIWKTI--------SRFEEQRIS--EQYREDIMFASQVHLVNQCLKLEH 492
           +   +D +FF  +W  I        +   E+  S  E +  ++    +   +NQ  +   
Sbjct: 420 MEDALDSLFFRTLWAEICARGVHDAAVVHEKDPSGDEHHHANVFGPWKGRQLNQLHQASL 479

Query: 493 PHLQLALSSVLEEKIASAGKT--KRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTV 550
             +      +  E  A+A +    +    V S FQ++V  +L   G  +  E  V GY  
Sbjct: 480 TAVSAGFEPLPAELGAAADEAWNTQTRPPVISWFQRDVGAILSYMGEKYEEEALVGGYRC 539

Query: 551 DAVLVDKK---VAFEIDGPTHFSRN-TGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQ 606
           D +L + K   V  E+DGP+HF+RN     LG T LK+R +   G+ V  +   +W+ L+
Sbjct: 540 DLLLPNAKPNGVVIEVDGPSHFARNDRKRALGQTRLKQRQLEGEGYAVFPIPIFDWDFLE 599

Query: 607 GSFEQLDYLRVIL 619
            + ++ DYLR  L
Sbjct: 600 NAEQKSDYLRAGL 612


>gi|397587109|gb|EJK53812.1| hypothetical protein THAOC_26672 [Thalassiosira oceanica]
          Length = 1144

 Score = 95.5 bits (236), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 97/352 (27%), Positives = 160/352 (45%), Gaps = 22/352 (6%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
            + +L    AQ +SN AWA +  G     L +        L  +  F  Q ++N A AFA
Sbjct: 397 GLCSLDSFKAQALSNTAWAFATAGVPHPELFKKIGRHVTGLGSLDSFKPQALSNTAWAFA 456

Query: 334 SMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLD-NAFKDA 390
           + +   P+LF ++    + +  + +F+ QEL+   WA+A+       L E L   A  + 
Sbjct: 457 TAEIPHPELFKKIGDHIAGLGSLDSFKPQELSNTAWAYATARVFHSRLFERLSTGALVER 516

Query: 391 TQFTCC-LNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDR 449
             F    +   L  C   G  + +  +     + S +   N   L NI W+Y+V      
Sbjct: 517 EHFYVQEVANFLWACATVGHTEETLFSAFAPLIESKLEKCNEQDLTNIGWAYSVTNDASE 576

Query: 450 IFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIAS 509
             F++ +      +E   SE   E++    Q  L  + L  E     L L   L+EK  +
Sbjct: 577 GLFNECFVGACASKECEFSE---ENLFQLHQWQLWQRELGSE-----LELPRSLKEKCRN 628

Query: 510 AGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVL-VD--KKVAFEIDG 565
           +  +  +++   S  Q ++   L +TGL+  +E  +  GY +DA++ VD  +KVA E+DG
Sbjct: 629 SFLSANYSE---SKLQNDIVGELKATGLDLEKEILLGSGYRIDALVKVDNGRKVAIEVDG 685

Query: 566 PTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLR 616
           P+HF +    P G T LK R +A      V+S+ + EW EL+ S  +  YLR
Sbjct: 686 PSHFIQRR--PAGRTTLKHRQVATLDCIEVMSVPYWEWNELKNSAAKQHYLR 735



 Score = 42.4 bits (98), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 42/190 (22%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTK-VGEFNSQNVANVAGAF 332
           A+  L E  A+ +SN+ ++       L+  +  + V    +T+ +  F  Q ++NV  A+
Sbjct: 207 ALPILHEFDARSLSNLIYSFG-----LVKYNPTEAVGNHIVTRSLDNFWPQALSNVVWAY 261

Query: 333 ASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDA 390
           A+     P+L  ++    + +  +  F+ QEL+ + WAFA+  EP  P+L      FK  
Sbjct: 262 ATAGVPHPELLRKIGDHVAGLKSLDPFKPQELSNIAWAFATAGEP-HPVL------FKRI 314

Query: 391 TQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRI 450
                 L                 D D          SF    L NIAW++   G +   
Sbjct: 315 GDHVAGL-----------------DLD----------SFKSQSLSNIAWAFVTAGVLHPE 347

Query: 451 FFSDIWKTIS 460
            F  I   I+
Sbjct: 348 LFKKIGDNIA 357


>gi|299472343|emb|CBN77531.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 695

 Score = 94.4 bits (233), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 89/369 (24%), Positives = 148/369 (40%), Gaps = 76/369 (20%)

Query: 282 SAQGISNIAWALSKIGGELLYLSE-----MDRVAEVALTKVGEFNSQNVANVAGAFASMQ 336
           + Q ++ ++W  S +  E L         +D +A+ A   VG F  Q+V+ V+ A A M 
Sbjct: 332 TPQDLAMLSWGFSSLSQECLPCQPAAYRALDVLAKAARECVGNFRPQDVSMVSLALARMS 391

Query: 337 HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCC 396
              P L   +A R ++ +  F+ QEL+   WA+A L+              +D  +F   
Sbjct: 392 WDDPRLMKAMASRTTETLRAFKPQELSNTAWAYARLH-------------VRD-RRFWSA 437

Query: 397 LNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIW 456
           L K      +  G+ +                    ++ N+AW+ AV+G+ D     ++ 
Sbjct: 438 LQKQAKRMLDGPGMSA-------------------QEIANLAWALAVMGEADVELLEEL- 477

Query: 457 KTISRFEEQRISEQYREDIMF--ASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTK 514
                    R ++  R D     + Q++ V      + P L   L       +       
Sbjct: 478 --------LRSAQAQRGDFTLIESHQLYQVYLLWGKDMPELWKELDGEFLMALKRRWTDN 529

Query: 515 RFNQKVTSSFQKEVARLL-------------------VSTGL---NW-IREYA--VDGYT 549
           +   K +S    EV++ L                   V  GL   +W  R ++       
Sbjct: 530 QQRTKRSSCSHLEVSQTLDLMQISHENESEHDIDIEVVGVGLASEDWDFRSFSAGTGPNP 589

Query: 550 VDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQ--G 607
            D   V  K+A E+DGP HF++NT  PLGH +LK R ++  GW VVS+   EW+ +    
Sbjct: 590 ADPAEVRLKLALEVDGPAHFTKNTARPLGHMVLKHRTLSKMGWTVVSIPFLEWDPIPFWS 649

Query: 608 SFEQLDYLR 616
           S E+  YL+
Sbjct: 650 SMEKKRYLQ 658


>gi|302781256|ref|XP_002972402.1| hypothetical protein SELMODRAFT_413123 [Selaginella moellendorffii]
 gi|300159869|gb|EFJ26488.1| hypothetical protein SELMODRAFT_413123 [Selaginella moellendorffii]
          Length = 609

 Score = 91.3 bits (225), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 91/296 (30%), Positives = 133/296 (44%), Gaps = 66/296 (22%)

Query: 202 DIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAF 261
           D+VD++  +EVLE I E +    +      LS +N+ATALHRIAK+M  +SM  T RL +
Sbjct: 216 DLVDSRDVEEVLETI-ERVKGRFR------LSSINVATALHRIAKHMVTLSMSETRRLKY 268

Query: 262 TRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFN 321
            RQ +++ LVA         +A   ++    +SKIGG LLY  EM+ +A  AL KV EFN
Sbjct: 269 ARQCDVAELVA--------WNATHRASPTLPISKIGGHLLYRGEMEIIARAALAKVDEFN 320

Query: 322 SQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLE 381
                             P    EL    S                        A P   
Sbjct: 321 ------------------PRTLPELLLPCST-----------------------ARP--H 337

Query: 382 SLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSY 441
           SL +++    +F     ++  + N       SG       LS     F++++L +I WSY
Sbjct: 338 SLRSSWTLRAEFP----RSFEHRNWPSFFGRSGAWLGLWILSWTHRLFSKNKLWSIVWSY 393

Query: 442 AVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQL 497
           AVLGQ+   FF+ + K I  FE+     Q++  +   +Q++ V   LK E   LQL
Sbjct: 394 AVLGQLQGPFFAHVCKEIRAFEQL---GQHKHMLQL-TQLYQVVLALKREGKDLQL 445


>gi|384245272|gb|EIE18767.1| hypothetical protein COCSUDRAFT_49195 [Coccomyxa subellipsoidea
           C-169]
          Length = 845

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 99/377 (26%), Positives = 169/377 (44%), Gaps = 57/377 (15%)

Query: 278 LPECSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFAS 334
           +P    Q I+N  WA + +G   G +L    +D  A   +  +  F  Q ++N   +++ 
Sbjct: 305 MPHFKPQEIANTLWAFATLGHDPGAIL----LDAAAGQMVDNIAHFRPQAISNSLWSYSK 360

Query: 335 MQHSAPD-LFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESL--------- 383
           + ++    +    A+RA+ ++H +  QE+A  LWAFA+L + P   +L++          
Sbjct: 361 LAYNPGHRVLDVAARRAAGMLHQYTSQEIANTLWAFATLEHNPGSGMLDAAAVQIARRIE 420

Query: 384 DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLS----FNRDQLGNIAW 439
             + +D T    C  +                A+   ++S   L     F   +L N+ W
Sbjct: 421 QFSPQDTTNSVWCFARLFHYPG----------AELLQAISLYCLRHWHRFKAQELANMIW 470

Query: 440 SYAVLGQMDRIFFSDIWKTISRFEE-QRISEQYREDIMFASQVHLVNQC-LKLEHPHLQL 497
           S A+L    R    D W  ++  E+   ++E   +D    + +H + Q  + L+ P L+L
Sbjct: 471 SLALL----RACSHDTW--VALLEKLNTVAEATFDD----ADLHQLYQAYVLLDPPGLRL 520

Query: 498 ALSSVLEEKIASAGKTKRFNQ---------KVTSSFQKEVARLLVSTGL-NWIREYAVDG 547
             SS L EK    G  +R  +           TS  Q++V+ +L S G+ +   E   DG
Sbjct: 521 P-SSSLSEKFPE-GLARRAERVWRAGVHPLARTSKLQEDVSAVLWSLGVAHKTNEVTADG 578

Query: 548 -YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQ 606
            + VD  L   KV  E+DGPTHFS N+  PLG T+ ++  + A G  V S+ + EW  L 
Sbjct: 579 LFCVDIALEGGKVVIEVDGPTHFSVNSRRPLGRTVARKLMVEARGHVVRSIPYYEWCALD 638

Query: 607 GSFEQLDYLRVILKDYI 623
              +Q  Y+  +L   +
Sbjct: 639 SLEQQQAYVWRLLASAV 655



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 52/198 (26%), Positives = 90/198 (45%), Gaps = 26/198 (13%)

Query: 191 SNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEK 250
           SN+ K I   K +  A   Q++L+ +AE +    +         +N+ATALHR+AK    
Sbjct: 117 SNQNKAIT--KRLASAGHYQQILDEVAEWVKVFDE---------VNVATALHRLAKLQPP 165

Query: 251 VSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIG----GELLYLSEM 306
            +      +   R     +LV  +   +P   AQ +SN  WA + +G    G+LL     
Sbjct: 166 GTAGPQSPV--LRSASFQLLVEASQRLVPRFEAQAVSNTLWAFATLGYHPSGDLL----- 218

Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF--SELAKRASDIVHTFQEQELAQ 364
           DR+   A   V  F  Q  +N   A+A + +   + F  +   +  +D+      Q+++ 
Sbjct: 219 DRLGHHAAGIVRTFRPQATSNALWAYAKLAYVPCEPFLAAAALQLLTDLPRCV-PQDISN 277

Query: 365 VLWAFASL-YEPADPLLE 381
             WAFA+L + P + L++
Sbjct: 278 ATWAFATLRHHPGNTLMD 295


>gi|397646149|gb|EJK77145.1| hypothetical protein THAOC_01042 [Thalassiosira oceanica]
          Length = 635

 Score = 90.1 bits (222), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 106/391 (27%), Positives = 170/391 (43%), Gaps = 88/391 (22%)

Query: 273 IAMTALPECSAQGISNIAWALS--KIGGELLYLSEM--------DRVAEVALTKVGEFNS 322
           I    L +   Q +SNIAWA +  ++   +L  S +        D +A   L  +  F  
Sbjct: 277 IVARKLEDFQPQNLSNIAWAYANARVSHPILLESHIPSYSNKIGDHIA--GLISLDSFKP 334

Query: 323 QNVANVAGAFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLL 380
           Q+++N A AFA+   S P+LF ++    + +  + +F+ QEL+ V WAFA   E ++P +
Sbjct: 335 QDLSNTAWAFATAGVSHPELFKKIGDHVAGLGSLDSFKPQELSNVAWAFAKAGE-SNPKV 393

Query: 381 ESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSE-GSLSSPVLSFNRDQLGNIAW 439
                                         K  GD  +E G L S    FN  +L NIAW
Sbjct: 394 -----------------------------FKKIGDHAAELGCLDS----FNPQELSNIAW 420

Query: 440 SYAVLGQMDRIFFSDIWKTIS----RFEEQ---------RISEQYREDIM---------- 476
           + A +G  D+  F  +   I+     F EQ          ++   R+D+           
Sbjct: 421 ACATVGYNDKRLFCAVAPMIASKLDEFIEQDLANIAWAYSVANTPRQDLFDEGYVSALAS 480

Query: 477 ----FASQVHLVNQCLKLEHPHLQ--LALSSVLEEKIASAGKTKRFNQKVTSSFQKEVAR 530
               F+++        +L    L+  + L   L+E+  +A  ++ F++   S  Q +V  
Sbjct: 481 NKKEFSAEGLAQLHQWQLWQQELESGIELPRSLQERCRNAFTSRGFSE---SKLQNDVVG 537

Query: 531 LLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRY 586
            L + GL+   E  +  GY +DA++     +KVA E+DGP HF      P G T LK+R 
Sbjct: 538 ELKAAGLDLEEEVLLGSGYRIDALVKFGNGRKVAVEVDGPFHFIDRR--PAGRTTLKQRQ 595

Query: 587 IAAAG-WNVVSLSHQEWEELQGSFEQLDYLR 616
           +A      VVS+ + EW EL+ S  +  YLR
Sbjct: 596 VARLDRIEVVSVPYWEWNELKNSVTKQRYLR 626



 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 55/202 (27%), Positives = 86/202 (42%), Gaps = 26/202 (12%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYL-SEMDRVAEVALTKVGEFNSQNVANVAGAF 332
           A+  L   ++Q +SN+ WA  K+  +   L  E  RV  +    +G F  Q +AN+  +F
Sbjct: 202 AVKILHTFNSQNLSNVLWAFVKVDADNSRLFQETGRV--ITGMHLGSFKPQELANILWSF 259

Query: 333 ASMQHSAPDLFSELAKR-ASDIVHTFQEQELAQVLWAFASLYEPADPLLE----SLDNAF 387
           +    + P++F  +     +  +  FQ Q L+ + WA+A+       LLE    S  N  
Sbjct: 260 SKSSEADPEIFQAIGNHIVARKLEDFQPQNLSNIAWAYANARVSHPILLESHIPSYSNKI 319

Query: 388 KDATQFTCCLN----KALSNCN---ENGGV------KSSGD-ADSEGSLSSPVLSFNRDQ 433
            D       L+    + LSN        GV      K  GD     GSL     SF   +
Sbjct: 320 GDHIAGLISLDSFKPQDLSNTAWAFATAGVSHPELFKKIGDHVAGLGSLD----SFKPQE 375

Query: 434 LGNIAWSYAVLGQMDRIFFSDI 455
           L N+AW++A  G+ +   F  I
Sbjct: 376 LSNVAWAFAKAGESNPKVFKKI 397



 Score = 44.7 bits (104), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 36/147 (24%), Positives = 66/147 (44%), Gaps = 12/147 (8%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQ 360
            D +A  A+  + EF +++++N+  +F  ++ + PD     LF+   + A  I+HTF  Q
Sbjct: 154 FDSIASSAVGMLNEFEARHLSNLIYSFGLVERN-PDIGGETLFNVFGEAAVKILHTFNSQ 212

Query: 361 ELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEG 420
            L+ VLWAF  +      L +                 + L+N   +    S  D +   
Sbjct: 213 NLSNVLWAFVKVDADNSRLFQETGRVIT-GMHLGSFKPQELANILWSFSKSSEADPEIFQ 271

Query: 421 SLSSPVLS-----FNRDQLGNIAWSYA 442
           ++ + +++     F    L NIAW+YA
Sbjct: 272 AIGNHIVARKLEDFQPQNLSNIAWAYA 298


>gi|384251748|gb|EIE25225.1| hypothetical protein COCSUDRAFT_61463 [Coccomyxa subellipsoidea
           C-169]
          Length = 937

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 91/359 (25%), Positives = 158/359 (44%), Gaps = 28/359 (7%)

Query: 278 LPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVAL---TKVGEFNSQNVANVAGAFAS 334
           L   + Q +SNI W     G  +L   + D     AL    ++G FN Q ++N   AFA 
Sbjct: 324 LSHFATQAVSNILW-----GCAVLNFYDQDMFNAAALEIQHRIGSFNDQEISNSLLAFAK 378

Query: 335 MQH---SAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDAT 391
           M+H   S   +F E  +R    V  F  Q L+ ++W+FA+L    + +LE++        
Sbjct: 379 MEHVDVSLLRVFEEDIRRPQR-VRDFTSQALSNMVWSFATLRWYPEKVLEAISAELLRRM 437

Query: 392 QFTCCLNKALS--NCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDR 449
            +      ++S     + G       A+    +   V  FN     N  W  +VL     
Sbjct: 438 PYLSVQEISVSIWAMAKLGYHPGRSLAEFGRRIEELVPDFNSQACANTLWGLSVLQATQL 497

Query: 450 IFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLA--LSSVLEEKI 507
             F  +   I R     I    + +++   Q+       +LE     LA  + ++ +   
Sbjct: 498 PCFQML---IDRLGSNNID---KVEVLMLHQLFQSLMLARLEARRQNLADPIRTIPDHIY 551

Query: 508 ASAGKTKRFNQK--VTSSFQKEVARLLVSTGLNWIREYAV-DG-YTVDAVLVDKK--VAF 561
           A   +  +   K  ++S F  +V+++L   G+    E+   DG +++D  L   +  VA 
Sbjct: 552 ALLRRVWKATVKNTLSSRFHIDVSKMLRELGVAHDFEFVTEDGLFSLDIALAGPRGPVAI 611

Query: 562 EIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILK 620
           E+DGP HF+ NT  PLG T+++RR + A GW V+S+   ++  L  +  ++ YL  +L+
Sbjct: 612 EVDGPYHFTLNTRQPLGSTLIRRRLLHALGWTVLSVPFYDYYRLGSTAAKMQYLGQLLR 670



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 55/218 (25%), Positives = 96/218 (44%), Gaps = 29/218 (13%)

Query: 318 GEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPA 376
           G+  ++ +AN   AF  + H A D+   L  +     + T+QEQE++  +WA A+L  P 
Sbjct: 212 GKMRARQLANTLWAFGKLGHDAEDVVDALLFQMHRTHIATWQEQEMSNAVWAMATLSRPD 271

Query: 377 DPLLESLDNAFKDATQ--FTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVL----SFN 430
           + LLE++    +DA +   +  + +A+SN      V    +     +++   +     F 
Sbjct: 272 EGLLETMA---RDAMRRGMSAFVPQAISNLVWGFAVLEYNNNPFMLAVAEYFVMDLSHFA 328

Query: 431 RDQLGNIAWSYAVLGQMDRIFFS----DIWKTISRFEEQRISEQYREDIMFASQVHLVNQ 486
              + NI W  AVL   D+  F+    +I   I  F +Q IS      + FA        
Sbjct: 329 TQAVSNILWGCAVLNFYDQDMFNAAALEIQHRIGSFNDQEISNSL---LAFA-------- 377

Query: 487 CLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSF 524
             K+E  H+ ++L  V EE I    + + F  +  S+ 
Sbjct: 378 --KME--HVDVSLLRVFEEDIRRPQRVRDFTSQALSNM 411



 Score = 47.8 bits (112), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 56/245 (22%), Positives = 104/245 (42%), Gaps = 43/245 (17%)

Query: 235 LNIATALHRIAK---------NMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQG 285
           +NI+TA+HR+AK         N+ +   M  H L   +++ +S           +  A+ 
Sbjct: 169 INISTAMHRLAKVSYKNKVPLNVVQAHPMYPHLLTVLKKKVLSG----------KMRARQ 218

Query: 286 ISNIAWALSKIGGEL-----LYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
           ++N  WA  K+G +        L +M R      T +  +  Q ++N   A A++     
Sbjct: 219 LANTLWAFGKLGHDAEDVVDALLFQMHR------THIATWQEQEMSNAVWAMATLSRPDE 272

Query: 341 DLFSELAKRA-SDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAF-KDATQFTCCLN 398
            L   +A+ A    +  F  Q ++ ++W FA L    +P + ++   F  D + F     
Sbjct: 273 GLLETMARDAMRRGMSAFVPQAISNLVWGFAVLEYNNNPFMLAVAEYFVMDLSHFAT--- 329

Query: 399 KALSNCNENGGVKSSGDAD----SEGSLSSPVLSFNRDQLGNIAWSYAVLGQMD----RI 450
           +A+SN      V +  D D    +   +   + SFN  ++ N   ++A +  +D    R+
Sbjct: 330 QAVSNILWGCAVLNFYDQDMFNAAALEIQHRIGSFNDQEISNSLLAFAKMEHVDVSLLRV 389

Query: 451 FFSDI 455
           F  DI
Sbjct: 390 FEEDI 394


>gi|397565912|gb|EJK44819.1| hypothetical protein THAOC_36611, partial [Thalassiosira oceanica]
          Length = 815

 Score = 89.0 bits (219), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 100/347 (28%), Positives = 155/347 (44%), Gaps = 33/347 (9%)

Query: 284 QGISNIAWALSKIGGEL--LYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           Q +SN  WA +  G     L+    D +A   L  +  FNSQ+V++ A AFAS   S P+
Sbjct: 30  QELSNTVWAFATAGASHPELFRKIGDHIA--GLDSLDSFNSQDVSSTAWAFASAGTSHPE 87

Query: 342 LFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDN---AFKDATQFTCC 396
           LF ++    +  D + +F+ Q  +   WA+A+       L E L     A KD  +    
Sbjct: 88  LFRKIGDHVAGLDSLDSFKPQAFSNTAWAYATARVFHSRLFEKLVTEAVAKKDHFESQPI 147

Query: 397 LNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIW 456
            N  L  C   G       +     ++S +  F    L NIAW+Y+V      +F     
Sbjct: 148 AN-FLWACATVGYTDERSFSAFAPVIASKLDKFIEQDLANIAWTYSVANAPQDLFNEGYV 206

Query: 457 KTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQ--LALSSVLEEKIASAGKTK 514
             ++  E +   EQ        +Q+H      +L H  L+  + L   L  K  +A  ++
Sbjct: 207 GALASNENEFSGEQL-------AQLHQ----WQLWHQELESGIELPRSLRAKCRNAFTSQ 255

Query: 515 RFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFS 570
            +++   S  Q +V   L + GL+   E  +  GY +DA++     +KVA E+DGP HF 
Sbjct: 256 GYSE---SKLQNDVVGELKAAGLDLEEEVLLGSGYQIDALVKFGNGRKVAVEVDGPFHFI 312

Query: 571 RNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLR 616
                P G T LK+R +A      VVS+ + EW EL+ S  +  YLR
Sbjct: 313 DRR--PAGRTTLKQRQVARLDRIEVVSVPYWEWNELKNSVTKQRYLR 357


>gi|397601425|gb|EJK57903.1| hypothetical protein THAOC_22012 [Thalassiosira oceanica]
          Length = 1126

 Score = 89.0 bits (219), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 93/345 (26%), Positives = 149/345 (43%), Gaps = 57/345 (16%)

Query: 284  QGISNIAWALSKIGGE--LLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
            Q +SN AWA +K G    +L+    D +A   L  +  F  Q ++N A A+A+ +     
Sbjct: 828  QDLSNTAWAFAKDGASHPVLFKKIGDHIAR--LGSLDSFKPQELSNTAWAYATARVFHSR 885

Query: 342  LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
            LF +L   A      F EQ ++ +LWA A++    + L  +L      A      L K  
Sbjct: 886  LFEKLTTEAVAKKDHFDEQGVSNLLWACATVDYTDERLFSAL------APMIASKLGK-- 937

Query: 402  SNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISR 461
                                       FN  +L N AW+Y+V   + +  F + + +   
Sbjct: 938  ---------------------------FNLQELANFAWAYSVANTLGQGLFDEGYVSALA 970

Query: 462  FEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVT 521
              E+  S +       A          +LE     + L   L+EK  ++  +  +++   
Sbjct: 971  SNEKEFSVE-----QLAQLHQWQLWQQELES---GIELPQSLQEKCRNSFTSASYSE--- 1019

Query: 522  SSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPL 577
            S  Q +V   L +TGL+   E  +  GY +DA++     +KVA E+DGP+HF      P+
Sbjct: 1020 SKLQNDVVDELKATGLDLEEEVLLASGYRIDALVKFNDGRKVAVEVDGPSHFIDRR--PV 1077

Query: 578  GHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLRVILKD 621
            G T+LK R +A      VVS+ + EW++L  S  +  YLRV L D
Sbjct: 1078 GSTILKHRQVARLDRIEVVSVPYWEWDDLMNSVMKQHYLRVKLSD 1122



 Score = 62.4 bits (150), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 53/196 (27%), Positives = 84/196 (42%), Gaps = 23/196 (11%)

Query: 284 QGISNIAWALSKIGGEL--LYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           Q ++NI W+ +K G E   L+ +  + +AE+    +  F  QN++N+A AFA++  S P 
Sbjct: 669 QALANIIWSFAKSGEEYSKLFQAIGNHIAELGC--LNSFGPQNLSNIAWAFATVGKSNPK 726

Query: 342 LFSELAKR--ASDIVHTFQEQELAQVLWAFASLYEPADPLLE-------------SLDNA 386
           LF ++       D +++F+ Q+L+   WAFA+       LLE             SLD+ 
Sbjct: 727 LFKKIGDHIAGQDSLNSFKPQDLSNTAWAFATAGVSHPELLEKDRRSRDHTAELDSLDSF 786

Query: 387 FKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQ 446
                  T          +     K  G    + SL     SF    L N AW++A  G 
Sbjct: 787 NPQTLSITAWAFATAGESHPELFKKIGGHIAGQDSLD----SFKPQDLSNTAWAFAKDGA 842

Query: 447 MDRIFFSDIWKTISRF 462
              + F  I   I+R 
Sbjct: 843 SHPVLFKKIGDHIARL 858



 Score = 46.6 bits (109), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 42/169 (24%), Positives = 82/169 (48%), Gaps = 19/169 (11%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQ 360
            D +A  A+  + EF++++++N+  +F  ++   P+     LF    K A  I+HTF   
Sbjct: 573 FDSIASSAVGMLNEFDARHLSNLIYSFGLVERK-PEIGRETLFDVFGKAALRILHTFNGH 631

Query: 361 ELAQVLWAF-------ASLYEPADPLLESLD-NAFKDATQFTCCLNKALSNCNENGGVKS 412
           +++ +LWAF       + L+E    ++  ++ ++FK         + A S    +   ++
Sbjct: 632 DISNMLWAFVKVDAKNSRLFEVTGGVISGMNLDSFKPQALANIIWSFAKSGEEYSKLFQA 691

Query: 413 SGDADSE-GSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTIS 460
            G+  +E G L+    SF    L NIAW++A +G+ +   F  I   I+
Sbjct: 692 IGNHIAELGCLN----SFGPQNLSNIAWAFATVGKSNPKLFKKIGDHIA 736



 Score = 44.3 bits (103), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 27/96 (28%), Positives = 48/96 (50%), Gaps = 2/96 (2%)

Query: 275 MTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFAS 334
           + +L     Q +SN AWA +     + +    +++   A+ K   F+ Q V+N+  A A+
Sbjct: 858 LGSLDSFKPQELSNTAWAYAT--ARVFHSRLFEKLTTEAVAKKDHFDEQGVSNLLWACAT 915

Query: 335 MQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
           + ++   LFS LA   +  +  F  QELA   WA++
Sbjct: 916 VDYTDERLFSALAPMIASKLGKFNLQELANFAWAYS 951


>gi|397622591|gb|EJK66728.1| hypothetical protein THAOC_12320 [Thalassiosira oceanica]
          Length = 993

 Score = 88.6 bits (218), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 100/354 (28%), Positives = 159/354 (44%), Gaps = 29/354 (8%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
            + +L     Q +SN AWA +K  GE +      R  E   T +  F  Q ++N   A+A
Sbjct: 647 GLDSLDSFKPQELSNTAWAFAK-AGEAVQEDWKSRSLE--QTSLDLFKPQELSNTMWAYA 703

Query: 334 SMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDN--AFKD 389
             + S P+L  ++    +  D + +F  QEL+  +WA+A+       L E L    A ++
Sbjct: 704 KAEVSHPELLRKIGDHIAGLDSLDSFNPQELSNTIWAYATARVLDLGLFEKLATEVAARN 763

Query: 390 ATQF--TCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQM 447
             QF  T  ++  L  C   G       +     + S +   N   L NIAW+Y+V    
Sbjct: 764 G-QFIETQHMSNFLWACATVGYTDERMFSAFAPVIESKLDECNEQDLTNIAWTYSVANAP 822

Query: 448 DRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKI 507
             IF       ++  E +   EQ       A          +LE     + L   L+EK 
Sbjct: 823 QDIFNKGYVGALTSKENEFSCEQ------LAQLHQWQLWQQELES---GIELPQSLQEKC 873

Query: 508 ASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEI 563
            +A  ++ +++   S  Q +V   L + GL+   E  +  GY +DA++    ++KVA E+
Sbjct: 874 RNAFTSRGYSE---SKLQNDVVGELKAAGLDLDEEVLLGSGYRIDALVKIGDERKVAVEV 930

Query: 564 DGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLR 616
           DGP+HF +    P G T LK R +A      VVS+S+ EW+EL+ S  +  YLR
Sbjct: 931 DGPSHFMQRQ--PAGSTTLKHRQVARLDRIEVVSVSYWEWDELRNSETKQHYLR 982



 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 52/212 (24%), Positives = 88/212 (41%), Gaps = 29/212 (13%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
           Q +SNI W+ +K     L L +        +  +  F+ Q ++N A AFA+   S P+LF
Sbjct: 579 QALSNIIWSFAKSDKADLELFQALGNHIANMGSLDSFDPQALSNTAWAFATAGESHPELF 638

Query: 344 SELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
           +++    +  D + +F+ QEL+   WAFA   E    + E   +   + T       + L
Sbjct: 639 NKIGDHVAGLDSLDSFKPQELSNTAWAFAKAGE---AVQEDWKSRSLEQTSLDLFKPQEL 695

Query: 402 SNCNENGGVKSSGDADSEGSLSSPVL---------------SFNRDQLGNIAWSYAVLGQ 446
           SN            A ++  +S P L               SFN  +L N  W+YA    
Sbjct: 696 SNTMW---------AYAKAEVSHPELLRKIGDHIAGLDSLDSFNPQELSNTIWAYATARV 746

Query: 447 MDRIFFSDIWKTISRFEEQRISEQYREDIMFA 478
           +D   F  +   ++    Q I  Q+  + ++A
Sbjct: 747 LDLGLFEKLATEVAARNGQFIETQHMSNFLWA 778



 Score = 43.5 bits (101), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 41/169 (24%), Positives = 76/169 (44%), Gaps = 29/169 (17%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQ 360
            D +A  A+  + EF +++++N+  +F  ++ + PD     LF+     A  I+HTF  Q
Sbjct: 483 FDSIASSAVGMLNEFEARHLSNLIYSFGLVERN-PDIGGETLFNVFGIAAVKILHTFNSQ 541

Query: 361 ELAQVLWAF-------ASLYEPADPLLESLD-NAFKDATQFTCCLNKALSNCNENGGVKS 412
           +++ +LWAF       + L+     ++  +D   FK          +ALSN   +     
Sbjct: 542 DISNMLWAFVKVDADNSRLFHETGGVISGMDLGNFKP---------QALSNIIWSFAKSD 592

Query: 413 SGDADSEGSLSSPVL------SFNRDQLGNIAWSYAVLGQMDRIFFSDI 455
             D +   +L + +       SF+   L N AW++A  G+     F+ I
Sbjct: 593 KADLELFQALGNHIANMGSLDSFDPQALSNTAWAFATAGESHPELFNKI 641


>gi|323450957|gb|EGB06836.1| hypothetical protein AURANDRAFT_65363 [Aureococcus anophagefferens]
          Length = 2492

 Score = 88.6 bits (218), Expect = 9e-15,   Method: Composition-based stats.
 Identities = 94/389 (24%), Positives = 149/389 (38%), Gaps = 66/389 (16%)

Query: 255  TTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVAL 314
            T HR+ F        L   A   L + + QG+SN+AWA +  G      +  + +     
Sbjct: 2068 TKHRVLF------DALADSADHRLRDFNNQGLSNLAWAYASAGASDGNEALFEALGLQVS 2121

Query: 315  TKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKR------ASDIVHTFQEQELAQVLWA 368
             +V EF  Q +AN+  A+A+ +   P++F  +A         +     F  QE+A  +WA
Sbjct: 2122 LRVAEFRPQGLANLVWAYATAELYCPEVFEAVADEIARPSGGARRAFEFNPQEVANTVWA 2181

Query: 369  FASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLS 428
            FA    PA  L ++   A                      G K  GD  + G        
Sbjct: 2182 FAKAAVPAPGLYDAFAAAILKL------------------GAKHGGDLKAAG-------- 2215

Query: 429  FNRDQLGNIAWSYAVLGQMDRIFFSDIWKTI---------------SRF--EEQRISEQY 471
            F   +L N+AW+YA    +D      +W+ I               SRF  EE R  +Q 
Sbjct: 2216 FTPQELANLAWAYACADHVDGDLLLLLWRAIVKEARESPDPGALDGSRFNLEELRQLQQV 2275

Query: 472  REDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARL 531
                 + ++       L  E      A   +L   +A    +    +  + S  +     
Sbjct: 2276 VLHAKYGARRGTTMGGLVAEIARAPPAFVGLLRASLADVDASPSGPRSRSPSAWR----- 2330

Query: 532  LVSTGLNWIRE-YAVDG--YTVDAVLVDKKVAFEIDGPTHFSRNTG-VPLGHTMLKRRYI 587
              + G  W    Y   G  +T   + +  +VA E DGP H+ RN   VP G T  K R +
Sbjct: 2331 --AWGWTWSTNWYCPTGCPWTWLCLPLKWRVAVEFDGPRHYFRNAKRVPTGRTRFKMRLL 2388

Query: 588  AAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
             A GW V+ + + +W +L     + +YL+
Sbjct: 2389 RALGWRVLHVPYFDWAKLDDDAARTEYLK 2417



 Score = 75.9 bits (185), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 55/189 (29%), Positives = 88/189 (46%), Gaps = 30/189 (15%)

Query: 274  AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
            A+  + E +AQ + N AWA +  G +  + +  D +A  A+ +V  F +QN+AN   A+A
Sbjct: 1962 AVRRVDEFNAQELGNTAWAYATAGRD--HPALFDAIAASAMPRVDRFIAQNLANTVWAYA 2019

Query: 334  SMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPA------------DPLLE 381
            +  H+ PDLF  +A+  +     F+ QELA   WA+A+ ++              D L +
Sbjct: 2020 TAGHARPDLFDAVAREVARRADEFKPQELANTAWAYATAHKALPGDRPTKHRVLFDALAD 2079

Query: 382  SLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSL--------SSPVLSFNRDQ 433
            S D+  +D        N+ LSN        S+G +D   +L        S  V  F    
Sbjct: 2080 SADHRLRDFN------NQGLSNL--AWAYASAGASDGNEALFEALGLQVSLRVAEFRPQG 2131

Query: 434  LGNIAWSYA 442
            L N+ W+YA
Sbjct: 2132 LANLVWAYA 2140



 Score = 65.9 bits (159), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 46/139 (33%), Positives = 67/139 (48%), Gaps = 11/139 (7%)

Query: 309  VAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWA 368
            +A  A  +  EF +Q +ANVA AFA+     P+LF+ LA  A+  +  F  QELA   WA
Sbjct: 1775 IANGARHRADEFKAQELANVAWAFATANLDEPELFAALAASATPRLSRFSAQELANTAWA 1834

Query: 369  FASLYEPADPLLESLDNAFKDATQFTCCLNKAL--SNCNENGGVKSSGDADSEGSLSSPV 426
            FA    PA      + +A K+     C L +A+    C+E   ++  G A   G    P+
Sbjct: 1835 FAKRLGPA------VGSAPKNGEDAACRLARAMFAELCDE-ACLRFGGGA--YGPDGEPL 1885

Query: 427  LSFNRDQLGNIAWSYAVLG 445
              F   +L N+ W+ A  G
Sbjct: 1886 DGFKPQELANVCWAMATAG 1904



 Score = 60.1 bits (144), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 51/188 (27%), Positives = 81/188 (43%), Gaps = 24/188 (12%)

Query: 279  PECSAQGISNIAWALSKIGG------ELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAF 332
            P    Q ++NI WA +K G       + L+ +    V   A+ +V EFN+Q + N A A+
Sbjct: 1926 PATQPQNLANICWAFAKSGCGSPDAVDALFAA----VGRSAVRRVDEFNAQELGNTAWAY 1981

Query: 333  ASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAF-KDAT 391
            A+     P LF  +A  A   V  F  Q LA  +WA+A+       L +++     + A 
Sbjct: 1982 ATAGRDHPALFDAIAASAMPRVDRFIAQNLANTVWAYATAGHARPDLFDAVAREVARRAD 2041

Query: 392  QFTC--CLNKALSNCNENGGVKSSGDADSEGSLSSPVLS---------FNRDQLGNIAWS 440
            +F      N A +    +  +   GD  ++  +    L+         FN   L N+AW+
Sbjct: 2042 EFKPQELANTAWAYATAHKAL--PGDRPTKHRVLFDALADSADHRLRDFNNQGLSNLAWA 2099

Query: 441  YAVLGQMD 448
            YA  G  D
Sbjct: 2100 YASAGASD 2107



 Score = 52.4 bits (124), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 68/256 (26%), Positives = 99/256 (38%), Gaps = 58/256 (22%)

Query: 263  RQREMSMLVAIAMTA---LPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALT---K 316
            R    + L AIA  A     E  AQ ++N+AWA +        L E +  A +A +   +
Sbjct: 1765 RDTSTACLRAIANGARHRADEFKAQELANVAWAFATAN-----LDEPELFAALAASATPR 1819

Query: 317  VGEFNSQNVANVAGAFAS----MQHSAPD------------LFSELAKRA---------- 350
            +  F++Q +AN A AFA        SAP             +F+EL   A          
Sbjct: 1820 LSRFSAQELANTAWAFAKRLGPAVGSAPKNGEDAACRLARAMFAELCDEACLRFGGGAYG 1879

Query: 351  --SDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQF------------TCC 396
               + +  F+ QELA V WA A+    A P     D A  +A +               C
Sbjct: 1880 PDGEPLDGFKPQELANVCWAMATAGFEATPRF--WDGAAAEAARIMDAPATQPQNLANIC 1937

Query: 397  LNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIW 456
               A S C     V +   A    ++   V  FN  +LGN AW+YA  G+     F  I 
Sbjct: 1938 WAFAKSGCGSPDAVDALFAAVGRSAVRR-VDEFNAQELGNTAWAYATAGRDHPALFDAIA 1996

Query: 457  KT----ISRFEEQRIS 468
             +    + RF  Q ++
Sbjct: 1997 ASAMPRVDRFIAQNLA 2012


>gi|397611301|gb|EJK61272.1| hypothetical protein THAOC_18274, partial [Thalassiosira oceanica]
          Length = 333

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 100/353 (28%), Positives = 151/353 (42%), Gaps = 57/353 (16%)

Query: 274 AMTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGA 331
            + +L     Q +SNIAWA +  G     L+      VAE     +G F  Q+ +N+A A
Sbjct: 25  GLGSLDSFKPQNLSNIAWAFATAGVSHRELFKKIGCHVAEKG--SLGSFKPQDFSNIAWA 82

Query: 332 FASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDAT 391
           FA+   S   LF +L++ A+      + Q +A  LWA A++    + L  +L        
Sbjct: 83  FATAGVSHMKLFEKLSEAAARKGEFIETQHIANFLWACATVGYTDERLFSAL-------- 134

Query: 392 QFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIF 451
             T  +   L  CNE                          QL NIAW+Y+V     +  
Sbjct: 135 --TSVIASKLDKCNEQ-------------------------QLANIAWTYSVANTPKQDL 167

Query: 452 FSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAG 511
           F+  + +     E+  S +       A          +LE     + L   L+ K  +A 
Sbjct: 168 FNKGYASALASIEKDFSAE-----GLAQLHQWQLWQQELES---GIELPRSLQAKCRNAF 219

Query: 512 KTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAV--LVD-KKVAFEIDGPT 567
            ++ F +   S  Q +V   L +TGL    E  +  GY +DA+  L D +KVA E+DGP+
Sbjct: 220 TSQGFFE---SKLQNDVVDELKATGLVLDEEVLLGSGYRIDALVKLSDGRKVAVEVDGPS 276

Query: 568 HFSRNTGVPLGHTMLKRRYIAAA-GWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
           HF      P G T+LK R +       VVS+ + EW+EL+ S  +  YLRV L
Sbjct: 277 HFIDRR--PTGSTILKHRQVVKLDSIEVVSVPYWEWDELKNSEMKQHYLRVKL 327


>gi|397643193|gb|EJK75706.1| hypothetical protein THAOC_02564 [Thalassiosira oceanica]
          Length = 1004

 Score = 87.0 bits (214), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 97/350 (27%), Positives = 153/350 (43%), Gaps = 32/350 (9%)

Query: 284  QGISNIAWALSKIGGELLYLSEMDRVAE--VALTKVGEFNSQNVANVAGAFASMQHSAPD 341
            Q +SN AWA +  G  +L+     ++      L+ +G F  Q ++N A AFA+   S P 
Sbjct: 669  QELSNTAWAFATAG--VLHPELFKKIGGHVAGLSCLGSFKPQALSNTAWAFATTGDSNPK 726

Query: 342  LFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDN---AFKD--ATQFT 394
            +F ++       D + +F  QEL+ + WA+A+       L E L     A KD    Q T
Sbjct: 727  MFKKIRDHIVRLDNLDSFTPQELSNIAWAYATARRFDLGLFEKLVTGAVAKKDRFGEQAT 786

Query: 395  CCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSD 454
                 A +      G+  S  A     ++S +  +    L NIAW+Y+V     +  F++
Sbjct: 787  SNFLWACATIGYTDGLLFSAFAPV---IASTLDKYGEQHLANIAWAYSVANAPRQDLFNE 843

Query: 455  IWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTK 514
             +          IS     D   A          +LE     + L   L+ K   A  ++
Sbjct: 844  GYVGSLALNRNHIS-----DKELAQLHQWQLWQQELES---GIELPRSLQAKCRYAFTSQ 895

Query: 515  RFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFS 570
               +   S  Q +V   L + GL+   E+ +  GY +DA++     +KVA E+DGP+HF 
Sbjct: 896  GHQE---SKLQDDVVGELRAAGLDLEEEFLLGSGYRIDALVTFSDGRKVAVEVDGPSHFI 952

Query: 571  RNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQEWEELQGSFEQLDYLRVIL 619
                 P G  +LK R +       VVS+ H EW EL+ S  + ++LRV L
Sbjct: 953  DRR--PTGSAVLKHRQVVRLDRIEVVSVPHWEWNELKNSEMKQNFLRVKL 1000



 Score = 46.6 bits (109), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 52/242 (21%), Positives = 97/242 (40%), Gaps = 50/242 (20%)

Query: 284 QGISNIAWALSKIG----------GELLYLSEMDRVAEVALTKV---------------- 317
           Q +SN+ WA  K+G          G ++   ++D     AL  +                
Sbjct: 554 QALSNVMWAFVKVGAKNSRLFRETGGVISGMDLDSFKPQALANILWSFAKSGEADPELFQ 613

Query: 318 -----------GEFNSQNVANVAGAFASMQHSAPDLFSELAK--RASDIVHTFQEQELAQ 364
                       +F  Q+++N+A A+A+ +   P LF ++       D + +F+ QEL+ 
Sbjct: 614 VLGNHIVVRSLNDFWPQDISNIAWAYANGRVPHPILFKKIGDLVAGQDSLDSFKPQELSN 673

Query: 365 VLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSE--GSL 422
             WAFA+       L + +       +       +ALSN        ++GD++ +    +
Sbjct: 674 TAWAFATAGVLHPELFKKIGGHVAGLSCLGSFKPQALSNTAW--AFATTGDSNPKMFKKI 731

Query: 423 SSPVL------SFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIM 476
              ++      SF   +L NIAW+YA   + D   F  +  T +  ++ R  EQ   + +
Sbjct: 732 RDHIVRLDNLDSFTPQELSNIAWAYATARRFDLGLFEKL-VTGAVAKKDRFGEQATSNFL 790

Query: 477 FA 478
           +A
Sbjct: 791 WA 792



 Score = 42.0 bits (97), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 27/99 (27%), Positives = 45/99 (45%), Gaps = 2/99 (2%)

Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           + Q +SNIAWA +        L   +++   A+ K   F  Q  +N   A A++ ++   
Sbjct: 745 TPQELSNIAWAYAT--ARRFDLGLFEKLVTGAVAKKDRFGEQATSNFLWACATIGYTDGL 802

Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLL 380
           LFS  A   +  +  + EQ LA + WA++    P   L 
Sbjct: 803 LFSAFAPVIASTLDKYGEQHLANIAWAYSVANAPRQDLF 841



 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 38/151 (25%), Positives = 62/151 (41%), Gaps = 38/151 (25%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHS----APDLFSELAKRASDIVHTFQEQE 361
            D +A  +   + +F +++++N+  +F  ++ +       LF+   K A  I+ TF+ Q 
Sbjct: 496 FDSIASSSAGMLDKFETRHLSNLIYSFGLVELNPEIGGDTLFNVFGKTAIKILRTFKPQA 555

Query: 362 LAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGS 421
           L+ V+WAF  +               K++  F            E GGV S  D D    
Sbjct: 556 LSNVMWAFVKV-------------GAKNSRLF-----------RETGGVISGMDLD---- 587

Query: 422 LSSPVLSFNRDQLGNIAWSYAVLGQMDRIFF 452
                 SF    L NI WS+A  G+ D   F
Sbjct: 588 ------SFKPQALANILWSFAKSGEADPELF 612



 Score = 41.2 bits (95), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 49/172 (28%), Positives = 74/172 (43%), Gaps = 20/172 (11%)

Query: 295 KIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI- 353
           +IGG+ L+    +   + A+  +  F  Q ++NV  AF  +      LF E     S + 
Sbjct: 530 EIGGDTLF----NVFGKTAIKILRTFKPQALSNVMWAFVKVGAKNSRLFRETGGVISGMD 585

Query: 354 VHTFQEQELAQVLWAFASLYEPADP-LLESLDN--AFKDATQFTCCLNKALSNCNENGGV 410
           + +F+ Q LA +LW+FA   E ADP L + L N    +    F       ++    NG V
Sbjct: 586 LDSFKPQALANILWSFAKSGE-ADPELFQVLGNHIVVRSLNDFWPQDISNIAWAYANGRV 644

Query: 411 ------KSSGD-ADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDI 455
                 K  GD    + SL     SF   +L N AW++A  G +    F  I
Sbjct: 645 PHPILFKKIGDLVAGQDSLD----SFKPQELSNTAWAFATAGVLHPELFKKI 692


>gi|307109857|gb|EFN58094.1| hypothetical protein CHLNCDRAFT_142412 [Chlorella variabilis]
          Length = 962

 Score = 86.3 bits (212), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 77/302 (25%), Positives = 133/302 (44%), Gaps = 38/302 (12%)

Query: 311 EVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
           E  ++++ +F SQ +AN   +FA ++     L   + +     +HT   QE++  +W+FA
Sbjct: 268 ERRVSRLDDFTSQALANTLWSFAYLRWYPVRLLEPITRAVGRKMHTMSSQEISNSIWSFA 327

Query: 371 SL-YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSF 429
              Y P   + +      +   +F     ++L+         S+   ++   L    +  
Sbjct: 328 KFAYHPGPVMAQYQVEVVRRVAEFD---GQSLTTTMWAMAALSATHCEAFVKLVERFVEL 384

Query: 430 NRDQLGNIAWSYAVLGQMDRIFFSDIWKTI--SRFEEQRISEQYREDIMFASQVHLVNQC 487
            R             G    + ++ + + +  ++FE+QR   ++R DI     +  V+  
Sbjct: 385 ERA------------GGFQDVQYNQVLQAVLLAQFEQQRRPGEFRADIDLPDDI--VDTA 430

Query: 488 LKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREY--AV 545
           L+      Q + +     K+              SSFQ EV+  L   G+    EY  AV
Sbjct: 431 LQAWQAQQQASAAGGWAAKL--------------SSFQLEVSEALGQLGIEHELEYLTAV 476

Query: 546 DGYTVDAVLV--DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWE 603
           +  +VD  +V   KKVA E+DGP HFS NT  PLG TM++RR + A GW V+S+ +  W 
Sbjct: 477 NLLSVDIAIVKGGKKVAVEVDGPFHFSVNTSSPLGQTMIRRRLLRAVGWTVISVPYHAWY 536

Query: 604 EL 605
            L
Sbjct: 537 SL 538



 Score = 39.3 bits (90), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 25/105 (23%), Positives = 50/105 (47%), Gaps = 4/105 (3%)

Query: 270 LVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALT--KVGEFNSQNVAN 327
           LVA     +P    +G++N  W L+ +G   +  +E+ R   +A+   +  ++ +Q ++N
Sbjct: 31  LVARVEALVPHYQPRGLANTMWGLAALGD--VQRAELARRLALAIVSHRTAQYRAQELSN 88

Query: 328 VAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
           V  A  ++    P+    L +     +  F  Q L+ ++WA A L
Sbjct: 89  VVWAMGTLGVLCPEALDPLLEGVVSQIDDFIPQGLSNMVWACAHL 133



 Score = 38.9 bits (89), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 31/135 (22%), Positives = 58/135 (42%), Gaps = 31/135 (22%)

Query: 265 REMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQN 324
           R +++ +    TA  +  AQ +SN+ WA+  +G  +L    +D + E  ++++ +F  Q 
Sbjct: 67  RRLALAIVSHRTA--QYRAQELSNVVWAMGTLG--VLCPEALDPLLEGVVSQIDDFIPQG 122

Query: 325 VANVAGAFASMQ---------------------------HSAPDLFSELAKRASDIVHTF 357
           ++N+  A A ++                           H AP     +A  A+  +  F
Sbjct: 123 LSNMVWACAHLRNGTRGCIGPTMGGNPPTHVPRELRPAWHPAPAFLEAVAAAATRKMPDF 182

Query: 358 QEQELAQVLWAFASL 372
           Q Q L+ +LW F  L
Sbjct: 183 QSQTLSNLLWGFCKL 197


>gi|302780209|ref|XP_002971879.1| hypothetical protein SELMODRAFT_412574 [Selaginella moellendorffii]
 gi|300160178|gb|EFJ26796.1| hypothetical protein SELMODRAFT_412574 [Selaginella moellendorffii]
          Length = 465

 Score = 85.9 bits (211), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 47/96 (48%), Positives = 65/96 (67%), Gaps = 8/96 (8%)

Query: 196 EINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMT 255
           EI LN+D+VD++  +EVLE I  +    G       LS +N+ATALHRIAK+M  +SM  
Sbjct: 217 EIRLNQDLVDSRDVEEVLETIERVKGRFG-------LSAINVATALHRIAKHMVTLSMSE 269

Query: 256 THRLAFTRQREMSMLVAIAMTALPECSAQGISNIAW 291
             RL + +Q ++ +LVA AM  LPEC+AQG+SNIA+
Sbjct: 270 RRRLKYAKQCDV-LLVASAMELLPECNAQGVSNIAY 304



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 65/145 (44%), Gaps = 35/145 (24%)

Query: 479 SQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLN 538
           +Q++ V    K E   LQL     +E++ A A + +R ++K TS  QK++ R LV TG  
Sbjct: 335 TQLYQVVLASKREGKDLQLG---GIEKRAAGAWEKERSSRKSTSFLQKDIERFLVCTGRQ 391

Query: 539 WIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLS 598
           WI EY    Y+ +                                 R + AAGW ++S S
Sbjct: 392 WILEYVDADYSHEG--------------------------------RLLGAAGWKIISAS 419

Query: 599 HQEWEELQGSFEQLDYLRVILKDYI 623
           +  WE LQG  E +D+L  +L  +I
Sbjct: 420 YAAWENLQGESEHVDFLHKLLAPHI 444


>gi|397617752|gb|EJK64587.1| hypothetical protein THAOC_14665, partial [Thalassiosira oceanica]
          Length = 315

 Score = 85.1 bits (209), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 83/319 (26%), Positives = 145/319 (45%), Gaps = 25/319 (7%)

Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQ 364
           D +A   L  +  FNSQ ++N A A+A+   S P+LF ++    + +  + + + QEL+ 
Sbjct: 4   DHIA--GLKSLDSFNSQALSNTAWAYATAGVSHPELFKKIGDHVAGLKSLDSLKPQELSN 61

Query: 365 VLWAFASLYEPADPLLESLDN-AFKDATQFTCC-LNKALSNCNENGGVKSSGDADSEGSL 422
             WA+A+       L E +   A  +   F C  +   L  C   G       +     +
Sbjct: 62  TAWAYATARRFDLRLFEKVSTEAVVNREHFGCQEVANFLWACATVGHTDERLFSAFVPVI 121

Query: 423 SSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVH 482
           +S +  FN  +L NIAW+Y+V      +F       ++ +E     E  R+   +     
Sbjct: 122 ASKLDEFNEQELANIAWAYSVANLKQDLFDEGYVSALAAYENVFPEESRRQLHQWQLWQQ 181

Query: 483 LVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIRE 542
            +   ++L            L+EK  +   +  +++   S  Q +V   L + GL++  E
Sbjct: 182 EIESGIELPQS---------LQEKCRNTFISSSYSE---SKLQNDVVGELRAAGLDFDEE 229

Query: 543 YAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSL 597
             +  GY +DA++    ++KVA E+DGP HF  +   P G T+LK R +A   +  VVS+
Sbjct: 230 VLLGSGYRIDALVKIREERKVAVEVDGPFHFIDSR--PAGRTILKHRQVARLDYIEVVSV 287

Query: 598 SHQEWEELQGSFEQLDYLR 616
            + EW+ L+ S  +  YL 
Sbjct: 288 PYWEWDGLKNSVMKQHYLH 306



 Score = 46.2 bits (108), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 45/97 (46%), Gaps = 2/97 (2%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
            + +L     Q +SN AWA +        L   ++V+  A+     F  Q VAN   A A
Sbjct: 47  GLKSLDSLKPQELSNTAWAYAT--ARRFDLRLFEKVSTEAVVNREHFGCQEVANFLWACA 104

Query: 334 SMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
           ++ H+   LFS      +  +  F EQELA + WA++
Sbjct: 105 TVGHTDERLFSAFVPVIASKLDEFNEQELANIAWAYS 141


>gi|145341433|ref|XP_001415814.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576037|gb|ABO94106.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 417

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 96/360 (26%), Positives = 154/360 (42%), Gaps = 52/360 (14%)

Query: 280 ECSAQGISNIAWALSKIGGELLYLSEMDRVAE----VALTKV---------------GEF 320
           E   Q ++N  WA +     +L      R+AE    V L+K+               GEF
Sbjct: 51  EFYPQALTNTLWAYT-----VLKHPRAQRLAEILAPVILSKLPEPDKELMQAESATGGEF 105

Query: 321 NSQNVANVAGAFASMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADP 378
           ++Q V+N     +S+  H   +L   LA         F+ QEL+  +WAFA   + P + 
Sbjct: 106 STQTVSNALWTLSSLGVHPGYELLDRLAIFVVKSSQNFKAQELSNSVWAFAQFAHHPGNE 165

Query: 379 LLESLDNAFKDATQ-FTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLS-------FN 430
            L + + +  +  + +T    +AL+N      V    + D    L + V          N
Sbjct: 166 ALRTFERSLLERREEYT---TQALANTTIGLSVFGGSEDDGLNKLFNDVTPSWFRLSECN 222

Query: 431 RDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYR-EDIMFASQVHLVNQCLK 489
              L NI W+ A +G     F SD++K   R   +R S  ++ E +       L+     
Sbjct: 223 SQDLSNITWAIASVGA----FQSDLYKAAVRELFRRDSMDFQLEGLKMLFHARLMQHDFD 278

Query: 490 LEHPHLQLALSSVLEEKIASAGKTKRFNQK---VTSSFQKEVARLLVSTGLN-WIREYAV 545
            E   + +    V  + +A  G++    Q      S+FQKEV   + S G   ++ E   
Sbjct: 279 PERETVDV----VYPDWVAELGRSAWLQQTEDTRVSTFQKEVLETVKSLGHEPYMEELTD 334

Query: 546 DGY-TVDAVLVDKKVAFEIDGPTHFSRNTGVPLGH-TMLKRRYIAAAGWNVVSLSHQEWE 603
           DG  ++D  L DK+VA E DGP+HF  N    L   T L+ + +A  GW VV++ + EW+
Sbjct: 335 DGLLSMDICLKDKRVAIECDGPSHFYTNLTEGLTQKTKLRDKALAVRGWKVVTVPYFEWQ 394


>gi|397607841|gb|EJK59823.1| hypothetical protein THAOC_19906 [Thalassiosira oceanica]
          Length = 307

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 80/307 (26%), Positives = 144/307 (46%), Gaps = 24/307 (7%)

Query: 320 FNSQNVANVAGAFASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPAD 377
           F  QN++N A AFA+   S  +LF ++    +  D + +F  Q L+   WA+A+      
Sbjct: 6   FKPQNLSNTAWAFATAGESHSELFEKIGDHVAGRDSLDSFNPQNLSNTAWAYATARVFHS 65

Query: 378 PLLESLDNAFKDATQFTCCLNKA--LSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLG 435
            L E L  A     +F    +K+  L  C   G       +     + S +   N  +L 
Sbjct: 66  RLFEKLSTADARKGEFIETQHKSNFLWACATVGYTDERLFSAFAPVMESKLDECNEQELA 125

Query: 436 NIAWSYAVLGQMDRIFFSDIW-KTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPH 494
           NIAW+Y+V     +  F++ +   ++ +E++  ++++R+   +      +   ++L    
Sbjct: 126 NIAWAYSVANVPSKDLFNEGYVGALAAYEKEFSAKEFRQLHQWQLWQQELESGIELPRS- 184

Query: 495 LQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAV 553
                   L+EK  +A  ++ +++   S  Q +V   L +TGL+   E  +  GY +DA+
Sbjct: 185 --------LQEKCRNAFTSQGYSE---SKLQNDVVNELRATGLDLDEEVLLGSGYRIDAL 233

Query: 554 LV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSF 609
           +      +VA E+DGP+HF +    P+G T LK R +A      VVS+ +  W E++ S 
Sbjct: 234 VKVGNGGRVAVEVDGPSHFIQRW--PVGSTTLKHRQVARLDCIEVVSVPYWVWNEMKNSV 291

Query: 610 EQLDYLR 616
            +  YLR
Sbjct: 292 TKQHYLR 298


>gi|397587968|gb|EJK54094.1| hypothetical protein THAOC_26348, partial [Thalassiosira oceanica]
          Length = 1003

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 98/360 (27%), Positives = 160/360 (44%), Gaps = 33/360 (9%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGEL--LYLSEMDRVAEVALTKVGEFNSQNVANVAGA 331
            + +L     Q +SN AWA +  G     L+    D VA   L  +  F  QN++N+A A
Sbjct: 650 GLMSLDSFDPQALSNTAWAFATTGASHPELFKKIGDHVA--GLGSLNSFKPQNLSNIAWA 707

Query: 332 FASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKD 389
           FA+   S P+LF ++    +  D + +F+ QE++  +WA+A+       L E L      
Sbjct: 708 FATAGASHPELFMKIGDHVAGLDSLDSFKPQEISNTVWAYATARVFDLGLFEKLVTVAVI 767

Query: 390 ATQFTCCLNKALSN----CNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLG 445
             ++     +A++N    C   G       +     + S +  FN  +L NIAW+Y++  
Sbjct: 768 KREYFD--GQAVANFLWACATVGHTDERLFSALAPLIGSELDKFNEQELANIAWAYSMAN 825

Query: 446 QMDRIFFSDIWKTISRFEEQRISEQY-REDIMFASQVHLVNQCLKLEHPHLQLALSSVLE 504
               +F       ++  E++   EQ  +       Q  LV          L + L   L+
Sbjct: 826 VPQDLFNEGYVGALASNEKEFSGEQLSQLHQWQLWQQELV----------LGIELPGSLQ 875

Query: 505 EKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVA 560
            K  +A  ++ +++   S+ Q +V   L +  L    E  +  GY +DA +     + VA
Sbjct: 876 AKCRNAFTSQGYSE---STLQNDVVGELKAARLVIDEEVLLGSGYRIDASVKFSDGRIVA 932

Query: 561 FEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLRVIL 619
            E+DGP+HF      P G T+LK R +A      VVS+   EW EL+ S  +  YLRV L
Sbjct: 933 VEVDGPSHFIDRR--PTGSTILKHRQVARLDRIEVVSVPFWEWNELKNSEMKQHYLRVKL 990



 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 54/189 (28%), Positives = 86/189 (45%), Gaps = 24/189 (12%)

Query: 273 IAMTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAG 330
           I    L +   Q +SNIAWA    G    +L+    D VA   L ++  F+SQ ++N+A 
Sbjct: 532 IVARRLNDFQPQALSNIAWAFDTAGVSHPVLFKKIGDHVA--GLVRLNSFDSQALSNIAW 589

Query: 331 AFASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFK 388
           +FA+   S P+LF ++    +  D + +F+ Q L+ + W+FA++ E    L   + N   
Sbjct: 590 SFATAGDSHPELFKKVGYHVAGLDSLDSFEPQHLSNIAWSFATVGESHPKLFNKIGNHIA 649

Query: 389 DATQFTCCLNKALSNCNENGGVKSSGDADSE------------GSLSSPVLSFNRDQLGN 436
                     +ALSN        ++G +  E            GSL+    SF    L N
Sbjct: 650 GLMSLDSFDPQALSNTAW--AFATTGASHPELFKKIGDHVAGLGSLN----SFKPQNLSN 703

Query: 437 IAWSYAVLG 445
           IAW++A  G
Sbjct: 704 IAWAFATAG 712



 Score = 55.5 bits (132), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 45/167 (26%), Positives = 71/167 (42%), Gaps = 15/167 (8%)

Query: 211 EVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSML 270
           E+ + I + +  +G   S  P +  NIA A      +  ++ M     +A          
Sbjct: 678 ELFKKIGDHVAGLGSLNSFKPQNLSNIAWAFATAGASHPELFMKIGDHVA---------- 727

Query: 271 VAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAG 330
               + +L     Q ISN  WA +     +  L   +++  VA+ K   F+ Q VAN   
Sbjct: 728 ---GLDSLDSFKPQEISNTVWAYAT--ARVFDLGLFEKLVTVAVIKREYFDGQAVANFLW 782

Query: 331 AFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPAD 377
           A A++ H+   LFS LA      +  F EQELA + WA++    P D
Sbjct: 783 ACATVGHTDERLFSALAPLIGSELDKFNEQELANIAWAYSMANVPQD 829



 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 93/203 (45%), Gaps = 26/203 (12%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
           A+  L   ++Q +SN+ WA  K+  +   L + +    ++   +G F  Q ++N+  +FA
Sbjct: 457 AVKILHTFNSQELSNMLWAFVKVDADNSRLFQ-ETGGVISGMDLGSFKPQALSNILWSFA 515

Query: 334 SMQHSAPDLFSEL-----AKRASDIVHTFQEQELAQVLWAF--ASLYEPADPLLESLDNA 386
               + P+LF  L     A+R +D    FQ Q L+ + WAF  A +  P   L + + + 
Sbjct: 516 KSGKANPELFGVLGDHIVARRLND----FQPQALSNIAWAFDTAGVSHPV--LFKKIGDH 569

Query: 387 FKDATQFTCCLNKALSNCNENGGVKSSGDADSE---------GSLSSPVLSFNRDQLGNI 437
                +     ++ALSN   +    ++GD+  E           L S + SF    L NI
Sbjct: 570 VAGLVRLNSFDSQALSNIAWS--FATAGDSHPELFKKVGYHVAGLDS-LDSFEPQHLSNI 626

Query: 438 AWSYAVLGQMDRIFFSDIWKTIS 460
           AWS+A +G+     F+ I   I+
Sbjct: 627 AWSFATVGESHPKLFNKIGNHIA 649



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 80/170 (47%), Gaps = 32/170 (18%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQ 360
            DR+A  A+  + EF +++++N+  +F  ++ + PD     LF+     A  I+HTF  Q
Sbjct: 409 FDRIARSAVGMLNEFEARHLSNLIYSFGLVERN-PDIGGETLFNVFGIAAVKILHTFNSQ 467

Query: 361 ELAQVLWAF-------ASLYEPADPLLESLD-NAFKDATQFTCCLNKALSNCNENGGVKS 412
           EL+ +LWAF       + L++    ++  +D  +FK          +ALSN   +     
Sbjct: 468 ELSNMLWAFVKVDADNSRLFQETGGVISGMDLGSFKP---------QALSNILWS--FAK 516

Query: 413 SGDADSE--GSLSSPVLS-----FNRDQLGNIAWSYAVLGQMDRIFFSDI 455
           SG A+ E  G L   +++     F    L NIAW++   G    + F  I
Sbjct: 517 SGKANPELFGVLGDHIVARRLNDFQPQALSNIAWAFDTAGVSHPVLFKKI 566


>gi|384250903|gb|EIE24382.1| hypothetical protein COCSUDRAFT_83686 [Coccomyxa subellipsoidea
           C-169]
          Length = 463

 Score = 83.6 bits (205), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 94/373 (25%), Positives = 149/373 (39%), Gaps = 74/373 (19%)

Query: 266 EMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNV 325
           + S  V +A  ++    AQG++++ WAL+  GG   +  EM+ V EV   +  +F    +
Sbjct: 120 DASGAVKLAGGSVDALGAQGLADLLWALAAFGGRSYFKDEMEAVLEVLDFQQQKFTMSGL 179

Query: 326 ANVAGAFASMQHSAPDLFSELAK--RASDIVHTFQEQ-ELAQVLWAFASLYEPADPLLES 382
            +V  A AS  H  P L ++LA   R    + T ++  +   +LW+FA            
Sbjct: 180 LDVTWALASAAHWTPKL-ADLAAAVRERGGLKTIKKNYQFTGLLWSFA------------ 226

Query: 383 LDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYA 442
                    QF           + N G+        E      V  F   QL    WS  
Sbjct: 227 ---------QF-----------DHNPGLFC------EVLPPKKVAEFETHQLITACWSLC 260

Query: 443 VLGQMDRIFFSDIWKTISRFEEQ-------------RISEQYR--EDIMFASQVHLVNQC 487
           VL +     F  +W+ +   E               +I  ++R  ED++  + VH     
Sbjct: 261 VLQETQSEVFKSLWRELGTRELPATPMKDAIACQLCQIKMEFRGKEDLLLGTDVH----- 315

Query: 488 LKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDG 547
                       + +LE+         R      S+   E  R L   GL  I E    G
Sbjct: 316 ------------AQILEKADRCWKHDLRTTDFHMSAQHAETCRALKGMGLEHIYEDVSTG 363

Query: 548 YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQG 607
           Y VD  + + ++A EIDGPTHF+RN    LG +++K R +   GW+V  L+ ++WE  + 
Sbjct: 364 YAVDIAIPELRIAVEIDGPTHFARNAKRRLGPSIMKHRQLDDMGWHVFPLTAEDWESAES 423

Query: 608 SFEQLDYLRVILK 620
           S   L  LR  ++
Sbjct: 424 SAAALQKLRDFIR 436


>gi|397600696|gb|EJK57702.1| hypothetical protein THAOC_22226 [Thalassiosira oceanica]
          Length = 877

 Score = 83.6 bits (205), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 90/356 (25%), Positives = 151/356 (42%), Gaps = 57/356 (16%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
            + +L     Q  SN AWA +  G   L L  M       L  +  F +Q ++N A +FA
Sbjct: 567 GLDSLNSFKPQNFSNTAWAFASAGVSHLALFNMIGHHVAGLGSLDSFKAQALSNTAWSFA 626

Query: 334 SMQHSAPDLFSELAKRASDIVH--TFQEQELAQVLWAFASLYEPADPLLESLDNAFKDAT 391
           +   S P+LF +++   +++ +  +F+ QEL   +WA AS+    + L  +L        
Sbjct: 627 TAGISCPELFRKISGHVAELGYLDSFKLQELLNTVWACASVGYTDERLFSAL-------- 678

Query: 392 QFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIF 451
                +   L  C+E                           L NIAW+Y+V     +  
Sbjct: 679 --APVIASKLDECSEQ-------------------------HLANIAWTYSVANTPRQDL 711

Query: 452 FSDIW-KTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASA 510
           F+  +   ++  E+   +E   +   +      +   ++L  P         L  K  +A
Sbjct: 712 FNVGYVGALASIEKVFSAEGLAQLHQWQLWQQELESGIQLPGP---------LGAKCLNA 762

Query: 511 GKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGP 566
             ++ F++   S  Q +V   L + GL+   E  +  GY +DA++     +KVA E+DGP
Sbjct: 763 FTSQGFSE---SKLQNDVVGELKAAGLDLDEEVLLGSGYRIDALVKFSDGRKVAVEVDGP 819

Query: 567 THFSRNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
           +HF      P G T+LK R +       VVS+ + EW EL+ S  +  YLRV L +
Sbjct: 820 SHFIDRR--PTGSTILKHRQVTRLDRIEVVSVPYWEWNELKNSEMKQHYLRVKLSN 873



 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 51/202 (25%), Positives = 81/202 (40%), Gaps = 34/202 (16%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
           A+  L   ++QGISN+ +A  K+  +   L E +    ++   +  F  Q +AN+  +FA
Sbjct: 413 ALKILHTFNSQGISNMLFAFVKVDAKNSRLFE-ETCGVISGMDLDNFKPQALANILWSFA 471

Query: 334 SMQHSAPDLFSEL-----AKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFK 388
               + P+LF  L     A+R +D    FQ Q L+ + WAFA+       L   + N   
Sbjct: 472 KSGEAEPELFQALGNHIVARRLND----FQPQHLSNIAWAFATAEVSHPELFNKIGNHIA 527

Query: 389 DATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVL---------------SFNRDQ 433
                    ++ALSN         +  A +   +S  VL               SF    
Sbjct: 528 GPGSLDSFSSQALSN---------TAWAFAAAGVSHTVLMKKIGNHIAGLDSLNSFKPQN 578

Query: 434 LGNIAWSYAVLGQMDRIFFSDI 455
             N AW++A  G      F+ I
Sbjct: 579 FSNTAWAFASAGVSHLALFNMI 600



 Score = 43.9 bits (102), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 52/223 (23%), Positives = 87/223 (39%), Gaps = 50/223 (22%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQ 360
            DR+A  A+  + EF +++++N+  +F  ++ + PD     LF+   K A  I+HTF  Q
Sbjct: 365 FDRIASSAVGMLNEFEARHLSNLIYSFGLVERN-PDIGEETLFNVFGKAALKILHTFNSQ 423

Query: 361 ELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEG 420
            ++ +L+AF  +      L E            TC             GV S  D D   
Sbjct: 424 GISNMLFAFVKVDAKNSRLFEE-----------TC-------------GVISGMDLD--- 456

Query: 421 SLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQ 480
                  +F    L NI WS+A  G+ +   F  +   I          Q+  +I +A  
Sbjct: 457 -------NFKPQALANILWSFAKSGEAEPELFQALGNHIVARRLNDFQPQHLSNIAWAFA 509

Query: 481 VHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSS 523
                   ++ HP     L + +   IA  G    F+ +  S+
Sbjct: 510 T------AEVSHPE----LFNKIGNHIAGPGSLDSFSSQALSN 542


>gi|397605334|gb|EJK58973.1| hypothetical protein THAOC_20863 [Thalassiosira oceanica]
          Length = 1152

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 98/365 (26%), Positives = 160/365 (43%), Gaps = 37/365 (10%)

Query: 274  AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAE--VALTKVGEFNSQNVANVAGA 331
             + +L   + Q +SN  WA +  G  + Y    +++      L  +  FNSQ ++N   A
Sbjct: 804  GLDSLNSFNPQNLSNTIWAFATAG--VSYPELFNKIGNHIAGLGSLDSFNSQALSNTVWA 861

Query: 332  FASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKD 389
            FA+   S P LF+++    +  D + +F  Q L+   WA+A+       L E L  A   
Sbjct: 862  FATAGESNPKLFNKIGDHVTRLDSIDSFNSQNLSNTAWAYATARVFHSRLFEKLTTAVAA 921

Query: 390  ATQF---TCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQ-----LGNIAWSY 441
                   T  +   L  C   G +      +   S  +PV++   DQ     + NIAW+Y
Sbjct: 922  RKAHFIETQHIANLLWACATVGYID-----ERLFSALAPVVASKLDQCNGQDIANIAWAY 976

Query: 442  AVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSS 501
            +V     +  F++ + +     E+  S    E++    Q  L  Q LK       + L  
Sbjct: 977  SVANFPKQDLFNEGYVSALASNEKDFST---EELFQLHQWQLWQQELKS-----GIELPR 1028

Query: 502  VLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DK 557
             L+EK  +      +++   S  Q +V   L + GL+   E  +  GY +DA++     +
Sbjct: 1029 SLQEKCRNVVTYASYSE---SKLQNDVVGELRAAGLDLDEEVLLGSGYRIDALVKFGGGR 1085

Query: 558  KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLR 616
            KVA E+DGP HF      P G  +LK R +A      VV + + EW+EL+ S  +  YLR
Sbjct: 1086 KVAVEVDGPFHFIDRR--PAGRAILKHRQVARLDRIEVVPVPYWEWDELKNSEMKQHYLR 1143

Query: 617  VILKD 621
            V L +
Sbjct: 1144 VKLSN 1148



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 54/197 (27%), Positives = 87/197 (44%), Gaps = 18/197 (9%)

Query: 275 MTALPECSAQGISNIAWALS--KIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAF 332
           M +L     Q +SN AWA +  +     L+    D +A   L  +  FN Q ++N A AF
Sbjct: 727 MGSLDSFKPQDLSNTAWAFATARESNPKLFKKIGDNIA--GLGSLDSFNPQELSNTAWAF 784

Query: 333 ASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDA 390
           A+   S P LF+++    +  D +++F  Q L+  +WAFA+       L   + N     
Sbjct: 785 ATAGDSNPKLFNKIGHHVAGLDSLNSFNPQNLSNTIWAFATAGVSYPELFNKIGNHIAGL 844

Query: 391 TQFTCCLNKALSN-------CNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
                  ++ALSN         E+     +   D    L S + SFN   L N AW+YA 
Sbjct: 845 GSLDSFNSQALSNTVWAFATAGESNPKLFNKIGDHVTRLDS-IDSFNSQNLSNTAWAYAT 903

Query: 444 LGQMDRIFFSDIWKTIS 460
                R+F S +++ ++
Sbjct: 904 A----RVFHSRLFEKLT 916



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 55/197 (27%), Positives = 87/197 (44%), Gaps = 21/197 (10%)

Query: 274 AMTALPECSAQGISNIAWALS------KIGGELLYLSEMDRVAEVALTKVGEFNSQNVAN 327
           A+  L E  A+ +SN+ ++         IG E L+    +   + A+  +  FNSQ+++N
Sbjct: 608 AVEMLNEFDARTLSNLIYSFGLVERNPDIGEETLF----NVFGKAAVKILNTFNSQDISN 663

Query: 328 VAGAFASMQHSAPDLFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPADP-LLESLDN 385
           +  AF  +      LF E     S + +  F+ Q LA +LW+FA   E ADP L ++L N
Sbjct: 664 MLLAFVKVDAKNSRLFHETCGVISGMDLDNFKPQALANILWSFAKSGE-ADPELFQALGN 722

Query: 386 AFKDATQFTCCLNKALSN-------CNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIA 438
                        + LSN         E+         D+   L S + SFN  +L N A
Sbjct: 723 HIAVMGSLDSFKPQDLSNTAWAFATARESNPKLFKKIGDNIAGLGS-LDSFNPQELSNTA 781

Query: 439 WSYAVLGQMDRIFFSDI 455
           W++A  G  +   F+ I
Sbjct: 782 WAFATAGDSNPKLFNKI 798



 Score = 41.2 bits (95), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 46/180 (25%), Positives = 81/180 (45%), Gaps = 41/180 (22%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQ 360
            DR+A  A+  + EF+++ ++N+  +F  ++ + PD     LF+   K A  I++TF  Q
Sbjct: 601 FDRIARSAVEMLNEFDARTLSNLIYSFGLVERN-PDIGEETLFNVFGKAAVKILNTFNSQ 659

Query: 361 ELAQVLWAF-------ASLYEPADPLLESLD-NAFKDATQFTCCLNKALSNCNENGGVKS 412
           +++ +L AF       + L+     ++  +D + FK          +AL+N   +     
Sbjct: 660 DISNMLLAFVKVDAKNSRLFHETCGVISGMDLDNFKP---------QALANILWS--FAK 708

Query: 413 SGDADSE------------GSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTIS 460
           SG+AD E            GSL     SF    L N AW++A   + +   F  I   I+
Sbjct: 709 SGEADPELFQALGNHIAVMGSLD----SFKPQDLSNTAWAFATARESNPKLFKKIGDNIA 764


>gi|397636260|gb|EJK72207.1| hypothetical protein THAOC_06282, partial [Thalassiosira oceanica]
          Length = 569

 Score = 82.8 bits (203), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 102/399 (25%), Positives = 165/399 (41%), Gaps = 66/399 (16%)

Query: 274 AMTALPECSAQGISNIAWALSK--IGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGA 331
            +  L     Q +SN  WA +   +    L+    D +A   L  +  F+ QN++N+  A
Sbjct: 174 GLGCLESFKPQNLSNTVWAFATADMTHPELFKKIGDHIA--GLMSLDSFDPQNLSNIVWA 231

Query: 332 FASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPL---------- 379
           FA+ + S P LF+++    +  D +++F  Q+L+   WAFA+  E    L          
Sbjct: 232 FATAKESHPQLFNKIGHHVAGLDSLNSFNSQDLSLTAWAFATAGESNPELFNKIGNHVAG 291

Query: 380 LESLDNAF-----------------------KDATQF---------TCCLNKALSNCNEN 407
           L+SLD+                         K AT+F         T  ++  L  C   
Sbjct: 292 LDSLDSFMPQDFSNTIWAYATARVFHSRLFEKLATEFVSRKGEFIKTQHMSNFLWACATV 351

Query: 408 GGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRI 467
           G       A     + S +  FN  +L NIAW+Y+V     +  F++ +      +E+  
Sbjct: 352 GHTDERLFAALAPVIGSKLDKFNEQELANIAWAYSVANAPRQDLFNEGYVGALASKEKVF 411

Query: 468 SEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKE 527
           S +    +          +          + L   L  K  +A  ++ F++   S  Q +
Sbjct: 412 SGKELAQLHQLQLWQQELES--------GIELPGSLRAKCRNAFTSQGFSE---SKLQND 460

Query: 528 VARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLK 583
           V   L + GL    E  +  GY +DA++     +KVA E+DGP+HF      P G T LK
Sbjct: 461 VVYELKAAGLVLDEEVLLGSGYRIDALVKFGDGRKVAVEVDGPSHFIDRR--PAGSTTLK 518

Query: 584 RRYIAAAG-WNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
            R +A      VVS+ + +W EL+ S  +  YLRV L D
Sbjct: 519 HRQVARLDRIQVVSVPYWQWNELKNSEMKQHYLRVKLPD 557



 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 66/266 (24%), Positives = 112/266 (42%), Gaps = 35/266 (13%)

Query: 204 VDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTR 263
           VDA+  +   E      + V  G+      P  ++  L   AK+ E V  +         
Sbjct: 2   VDAKNPRPFQEA-----SGVIPGMDLGSFKPQELSNVLWSFAKSCESVPKLF-------- 48

Query: 264 QREMSMLVAIAMTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFN 321
            R +   +A  M +L     Q +SN AWA +  G     L+    D VA   L  +  F+
Sbjct: 49  -RLLGNHIA-NMGSLDSFKTQELSNTAWAFATAGQSNPALFEKIGDHVA--GLESLNSFD 104

Query: 322 SQNVANVAGAFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFAS-------- 371
            Q ++N+A A+A+ + S P+L  ++    + +  + +F+ Q L+   WAFA+        
Sbjct: 105 PQALSNIAWAYATAEVSHPELLKKIGDHIAGLSSLESFKPQNLSNTAWAFATAGVSHPKL 164

Query: 372 LYEPADPL--LESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSF 429
           LY+  D +  L  L+ +FK           A ++       K  GD  + G +S  + SF
Sbjct: 165 LYKIGDYIAGLGCLE-SFKPQNLSNTVWAFATADMTHPELFKKIGDHIA-GLMS--LDSF 220

Query: 430 NRDQLGNIAWSYAVLGQMDRIFFSDI 455
           +   L NI W++A   +     F+ I
Sbjct: 221 DPQNLSNIVWAFATAKESHPQLFNKI 246


>gi|308799013|ref|XP_003074287.1| unnamed protein product [Ostreococcus tauri]
 gi|116000458|emb|CAL50138.1| unnamed protein product, partial [Ostreococcus tauri]
          Length = 478

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 118/484 (24%), Positives = 195/484 (40%), Gaps = 108/484 (22%)

Query: 199 LNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNM---------- 248
           LN++I+DAQ  +E+LE +          +     + +N ATA HR+ +            
Sbjct: 6   LNREIMDAQYPEEILEHVR---------VRSHLYNRVNCATAWHRLGRTSRVNGRPRGWT 56

Query: 249 --EKVSMM--TTHRL--AFTRQREMSMLVAIAMTA------------------LPECSAQ 284
             E+V+ +  TT RL   F  Q   ++  A A+                    + E   Q
Sbjct: 57  SDERVAELEATTRRLMSTFAVQNLTNIAWACAVLKYKPRDDLLGSIAARMGEMVAEFYPQ 116

Query: 285 GISNIAWALSKIGGELLY-LSEM--------------DRVAEVALTKVGEFNSQNVANVA 329
            +SN  WA + +     + L+E               D + +    K G F++Q V+NV 
Sbjct: 117 ALSNALWAYTVLKHPRAFALAEALKPAILATLPENPDDELKQAESAKDGVFSTQTVSNVL 176

Query: 330 GAFASMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDNAF 387
             +A++  H   +L   LA        +F+ QEL+   W++A   + P D +L++ +   
Sbjct: 177 WTYATLGVHPGVELLDRLAAFILKSAGSFKAQELSNSCWSYARFGHYPGDEVLQTFERCL 236

Query: 388 KDATQ-FTCCLNKALSNCNENGGVKSSGDADSEGSLSS-----PVLSF-----NRDQLGN 436
            +  + +T    +AL+N +   G+   G    EG L       P   F     N   + N
Sbjct: 237 LERREEYT---TQALANTSV--GLSYFG-GSGEGGLRKLFDDIPPSWFRLREGNSQDISN 290

Query: 437 IAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLK-LEHPHL 495
           I W+ A +G     F S ++K   R       E +R D+         ++ LK L H  L
Sbjct: 291 IVWAIASVGA----FESQVYKAAVR-------ELFRRDV-----TDFQDEGLKALFHARL 334

Query: 496 QLA--------LSSVLEEKIASAGKTKRFNQ---KVTSSFQKEVARLLVSTGLNWIREYA 544
                      +  V  + +A  G      Q      S+FQ+ V   +   G     E  
Sbjct: 335 MQHDFAPDKDEVDVVYPDWVADKGLKPWLEQAEDTRVSTFQQNVTDAVKRAGYEPTMEAL 394

Query: 545 V-DGY-TVDAVLVDKKVAFEIDGPTHFSRNTGVPLGH-TMLKRRYIAAAGWNVVSLSHQE 601
             DG  ++D  L DKK+A E DGPTHF  N    +   T+++ R++   GW V+ + + E
Sbjct: 395 TEDGLLSMDICLNDKKLAIECDGPTHFYSNAPEKMTQKTLIRNRHLEVRGWKVIMIPYYE 454

Query: 602 WEEL 605
           W E+
Sbjct: 455 WREV 458


>gi|397612272|gb|EJK61674.1| hypothetical protein THAOC_17795 [Thalassiosira oceanica]
          Length = 314

 Score = 80.5 bits (197), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 93/349 (26%), Positives = 139/349 (39%), Gaps = 56/349 (16%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEM--DRVAEVALTKVGEFNSQNVANVAGA 331
            + +L   +   +SN AWA +  G     L E   D VA      +  FN Q ++N A A
Sbjct: 7   GLKSLDSFNPHDLSNTAWAYATAGESHSELFEKIGDHVA--GRISLDSFNPQALSNTAWA 64

Query: 332 FASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDAT 391
           +A+ +     LF EL+  A      F  QE+A  LWA A++    + L            
Sbjct: 65  YATARRFHSRLFEELSTEAVVSREYFGGQEVANFLWACATVVYTGERLF----------L 114

Query: 392 QFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIF 451
            F   +   L  CNE G                         L NIAW+Y+V        
Sbjct: 115 AFAPVVESKLDECNEQG-------------------------LANIAWAYSVANVASEDL 149

Query: 452 FSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAG 511
           F++ +       E+  S +          V L    L  +     + L   L+EK   A 
Sbjct: 150 FNEGYVGAFALNEKDFSAE--------GLVQLHQWQLWQQEIESGIELPQSLQEKCRKAF 201

Query: 512 KTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAV--LVDKKVAFEIDGPTH 568
            +  +++ +    Q  V R L + GL+   E  +  GY VDA+  + D+ VA E+DGP+H
Sbjct: 202 TSASYSESI---LQNGVVRELKAVGLDVDEEVLLGSGYRVDALVNVGDRGVAIEVDGPSH 258

Query: 569 FSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLR 616
           F      P G   LK R +A      VVS+ + EW+ L+ S  +  YL 
Sbjct: 259 FIHRR--PTGSATLKHRQVATLDCIEVVSVPYWEWDGLKNSVMKQHYLH 305


>gi|428181830|gb|EKX50692.1| hypothetical protein GUITHDRAFT_85192 [Guillardia theta CCMP2712]
          Length = 177

 Score = 80.5 bits (197), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 67/106 (63%), Gaps = 2/106 (1%)

Query: 516 FNQKVTSSFQKEVARLLVSTGLNWIREYAVD--GYTVDAVLVDKKVAFEIDGPTHFSRNT 573
             Q   S  QK+VA +L    + ++ E+  +  GY++D +L DK+ A E+DGP+HF   T
Sbjct: 63  MEQHKPSRLQKDVAAILSEMQIEFVEEFIDERSGYSLDLLLRDKRTAIEVDGPSHFIVGT 122

Query: 574 GVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
            +PLG T++K R++   G+++  L + EW++L+G  ++ +Y+R +L
Sbjct: 123 HIPLGKTVMKHRHMQQLGFDLRILPYWEWDQLKGKEQKKEYIRRLL 168


>gi|397618779|gb|EJK65038.1| hypothetical protein THAOC_14163, partial [Thalassiosira oceanica]
          Length = 389

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 93/381 (24%), Positives = 161/381 (42%), Gaps = 56/381 (14%)

Query: 280 ECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
           +C+ Q ++NI W+ +K G     L E      + +  +  F  Q+++N+  A+A++  S 
Sbjct: 10  DCTEQALANILWSFAKSGEASPELFEAIE-NHIVVRSLDGFRPQHLSNIVWAYATVGVSH 68

Query: 340 PDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCL 397
           P+LF ++    +  D +  F  Q L+   WA+A+       L + + N            
Sbjct: 69  PELFKKIGDHVAGLDSLDWFTPQALSNTAWAYATAEASHSELFKKIGNHIAGMGSLDLFN 128

Query: 398 NKALSNCNENGGVKSSGDADSEGSLSSPVL----SFNRDQLGNIAWSYAVLGQMDRIFFS 453
           ++  SN            +     LS+  +     F+  ++ N  W+ A +G  D   FS
Sbjct: 129 SQDFSNTAWAYATARRFHSRLFEKLSTEAIVKGEYFDGQEVANFLWACATVGYSDERLFS 188

Query: 454 DIWKTI-SRFEEQRISEQYREDIMFASQV------HLVNQCL------------------ 488
                I S+ +E   +EQ+  +I +A  V       L N+C                   
Sbjct: 189 AFTPVIESKLDE--CNEQHLANIAWAYSVVNVPSQDLFNECYVGALASRENAFSEEDLSQ 246

Query: 489 ---------KLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNW 539
                    +LE    ++ L   L  K  +A  ++ +++   S  Q +VA  L + GL+ 
Sbjct: 247 LHQWQLWQQELES---RIELPRSLRAKCRNAFTSRGYSE---SKLQNDVAGELRAAGLDL 300

Query: 540 IREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAG-WNV 594
             E  +  GY +DA++     +KVA E+DGP+HF      P G T+LK R +       V
Sbjct: 301 DEEVLLGSGYRIDALVKVGDGRKVAVEVDGPSHFIDRR--PTGSTILKHRQVLRLDRIEV 358

Query: 595 VSLSHQEWEELQGSFEQLDYL 615
           VS+ + EW EL+ S  +  YL
Sbjct: 359 VSVPYWEWNELKNSVTKQHYL 379



 Score = 43.1 bits (100), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 56/124 (45%), Gaps = 10/124 (8%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
            M +L   ++Q  SN AWA +       +    ++++  A+ K   F+ Q VAN   A A
Sbjct: 120 GMGSLDLFNSQDFSNTAWAYAT--ARRFHSRLFEKLSTEAIVKGEYFDGQEVANFLWACA 177

Query: 334 SMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPL--------LESLDN 385
           ++ +S   LFS         +    EQ LA + WA++ +  P+  L        L S +N
Sbjct: 178 TVGYSDERLFSAFTPVIESKLDECNEQHLANIAWAYSVVNVPSQDLFNECYVGALASREN 237

Query: 386 AFKD 389
           AF +
Sbjct: 238 AFSE 241


>gi|397612107|gb|EJK61605.1| hypothetical protein THAOC_17875 [Thalassiosira oceanica]
          Length = 956

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 99/353 (28%), Positives = 160/353 (45%), Gaps = 39/353 (11%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTK-VGEFNSQNVANVAGAFASMQHSAPDL 342
           Q  +NI W+ +K G     L +   +    +T+ V +F  Q+V+N+  A+A+ + S P+L
Sbjct: 623 QDFANIIWSFAKSGKPDPELFQA--LGNHIVTRSVNDFWPQDVSNIVWAYAAAEVSHPEL 680

Query: 343 FSELAKR--ASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDN---AFKDATQFTCCL 397
           F ++       D + +F  Q L+   WA+A+       L E L     A KD        
Sbjct: 681 FKKIGDHIAGRDSLDSFNSQALSNTAWAYATAKVFHSRLFEKLATKVVARKDHFHGQAVA 740

Query: 398 NKALSNCNENGGVKSSGDADSE-GSLSSPVLSFNRDQ-----LGNIAWSYAVLGQMDRIF 451
           N  L  C       + G  D    S  +PV++   D+     L NIAW+Y+V     +  
Sbjct: 741 N-FLWAC------ATVGHTDERLCSALAPVIASKLDECSEHDLANIAWAYSVANTPRQDL 793

Query: 452 FSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAG 511
           F + +       E+   ++   ++    Q  L  Q L        + L   L+EK  +A 
Sbjct: 794 FDEGYLCALASNEKDFPDK---ELFQLHQWQLWQQELGS-----GIELPRSLQEKSRNAF 845

Query: 512 KTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPT 567
            ++ +++   S  Q +V   L + GL+   E  +  GY +DA++     +KVA E+DGP+
Sbjct: 846 TSRGYSE---SKLQNDVVGELKAAGLDLEEEVLLGSGYRIDALVKFSDGRKVAIEVDGPS 902

Query: 568 HFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLRVIL 619
           HF      P G T LK R +A      VVS+ + EW+EL+ S  +L YLR  L
Sbjct: 903 HFIDKR--PAGSTTLKHRQVAMLDRIEVVSVPYWEWDELKNSEMKLHYLRKKL 953



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 60/151 (39%), Gaps = 38/151 (25%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP----DLFSELAKRASDIVHTFQEQE 361
            DR+A  AL  + EF +++++N+  +F  ++         LF    K A  I+HTF+ QE
Sbjct: 527 FDRIARSALGMLNEFEARHLSNLIYSFGLVERKPEIGRETLFDVFGKAALRILHTFKPQE 586

Query: 362 LAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGS 421
           L+ +LWAF  +      L E                        E  GV S  D D    
Sbjct: 587 LSNMLWAFVKVDAKNSRLFE------------------------ETSGVISGMDLD---- 618

Query: 422 LSSPVLSFNRDQLGNIAWSYAVLGQMDRIFF 452
                 SF      NI WS+A  G+ D   F
Sbjct: 619 ------SFKPQDFANIIWSFAKSGKPDPELF 643


>gi|224013862|ref|XP_002296595.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968947|gb|EED87291.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1014

 Score = 79.7 bits (195), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 89/361 (24%), Positives = 160/361 (44%), Gaps = 61/361 (16%)

Query: 265  REMSMLVAIAMTALPECS---AQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFN 321
            R  ++   I+  ++P C+    Q ++++AW+ + +     +   ++ +A  +  +  EF+
Sbjct: 700  RSPALFNYISDVSVPHCNDLKRQEVASLAWSFAALN--FFHRPLLEALAVSSEGRWEEFS 757

Query: 322  SQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLE 381
            +QN+AN+A A+ + Q +   L   +A  A      F  Q  + +LWA+A+   P   L  
Sbjct: 758  AQNLANMAWAYTTAQETRHSLLRGIADAAIKKHDEFTHQGFSNLLWAYAAAGHPHQRLFS 817

Query: 382  SLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSY 441
            +L  +          + + L  CN                            L NIAW++
Sbjct: 818  ALAPS----------VAEVLDTCNGQS-------------------------LANIAWAF 842

Query: 442  AVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSS 501
            AV    D + FSD +  +      +I E   E +    Q+H  N          ++    
Sbjct: 843  AVSNVNDELLFSDRFVDVC---SSKIDEFNSEGL---CQLHQWNIW------RAEIGSDK 890

Query: 502  VLEEKIASAGKTKRFNQKVT-SSFQKEVARLLVSTGLNWIREYAVD-GYTVDAVL-VD-K 557
            VL   IA    T+  ++ +  S+ Q +  ++L S  L+ I E   + GY +D V+ VD +
Sbjct: 891  VLPPMIAKKCYTQFTSRPLQGSNLQSDAMKVLTSMDLHPIEEVQTESGYCLDFVVNVDGE 950

Query: 558  KVAFEIDGPTHF-SRNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQEWEELQGSFEQLDYL 615
            ++  E+DGP HF  R+   P G T+LKRR++       ++SL + E  EL+   ++  YL
Sbjct: 951  ELGIEVDGPHHFVGRD---PTGSTLLKRRHVENVDRIPIISLPYWELNELETLDDKQLYL 1007

Query: 616  R 616
            R
Sbjct: 1008 R 1008



 Score = 52.4 bits (124), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 48/217 (22%), Positives = 97/217 (44%), Gaps = 17/217 (7%)

Query: 238 ATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIG 297
           A  L +IA +  K          +  +R    +    ++ L    A+ ++N+A+A +   
Sbjct: 344 AVTLCQIANSFAKA--------GYNDERLFQSISDATISILTSFDARHLANMAYAFALAR 395

Query: 298 GELLY---LSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIV 354
               Y   L+  D +A   + ++    +Q++AN+  A+A++ H+ PDLF  +A+ A   +
Sbjct: 396 VNPRYDDGLTLFDDIANEFIPRLHTATTQHLANITWAYATIGHANPDLFGAVAEEAMGRL 455

Query: 355 HTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGV---K 411
             F  Q L  + WA +     ++ +L+ +      A    C  ++ ++    +       
Sbjct: 456 KEFSPQHLENLSWALSKFPHSSNEILDRIAEEVV-ARGLQCSTSQGIAMLAHSFATLNHA 514

Query: 412 SSGD--ADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQ 446
           ++GD     E + SS V SF   +   IAW++A +G+
Sbjct: 515 TNGDFWECIENTASSRVSSFGVIECIQIAWAFATIGR 551



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 53/225 (23%), Positives = 95/225 (42%), Gaps = 41/225 (18%)

Query: 151 GYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQ 210
           G+   DL   V+  A G  +E   + +E     LS+F   SN                  
Sbjct: 437 GHANPDLFGAVAEEAMGRLKEFSPQHLENLSWALSKFPHSSN------------------ 478

Query: 211 EVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSML 270
           E+L+ IAE + A G   S S                  + ++M+                
Sbjct: 479 EILDRIAEEVVARGLQCSTS------------------QGIAMLAHSFATLNHATNGDFW 520

Query: 271 VAIAMTALPECSAQGIS---NIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVAN 327
             I  TA    S+ G+     IAWA + IG +   L     +  V+++K+ +FN Q ++N
Sbjct: 521 ECIENTASSRVSSFGVIECIQIAWAFATIGRKADDL--FRGIERVSMSKMDQFNPQGLSN 578

Query: 328 VAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
           +A AF+++++ +P LF+ +A+ +   +  F+ QE A ++ A + +
Sbjct: 579 LAWAFSTLEYDSPTLFNAIAECSERKLDQFKPQEKAMLVLALSRI 623



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/187 (26%), Positives = 81/187 (43%), Gaps = 45/187 (24%)

Query: 274 AMTALPECSAQGISNIAWALSKI---GGELLYLSEMDRVAEVALTKVGEFN-SQNVANVA 329
           AM  L E S Q + N++WALSK      E+L     DR+AE  + +  + + SQ +A +A
Sbjct: 451 AMGRLKEFSPQHLENLSWALSKFPHSSNEIL-----DRIAEEVVARGLQCSTSQGIAMLA 505

Query: 330 GAFASMQHSAP-DLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFK 388
            +FA++ H+   D +  +   AS  V +F   E  Q+ WAFA++   AD L   ++    
Sbjct: 506 HSFATLNHATNGDFWECIENTASSRVSSFGVIECIQIAWAFATIGRKADDLFRGIERV-- 563

Query: 389 DATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMD 448
                      ++S  ++                      FN   L N+AW+++ L    
Sbjct: 564 -----------SMSKMDQ----------------------FNPQGLSNLAWAFSTLEYDS 590

Query: 449 RIFFSDI 455
              F+ I
Sbjct: 591 PTLFNAI 597



 Score = 43.5 bits (101), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 5/68 (7%)

Query: 263 RQREMSMLVAIAMTALPECSAQGISNIAW--ALSKIGGELLYLSEMDRVAEVALTKVGEF 320
            QR  S L       L  C+ Q ++NIAW  A+S +  ELL+    DR  +V  +K+ EF
Sbjct: 812 HQRLFSALAPSVAEVLDTCNGQSLANIAWAFAVSNVNDELLF---SDRFVDVCSSKIDEF 868

Query: 321 NSQNVANV 328
           NS+ +  +
Sbjct: 869 NSEGLCQL 876


>gi|307105016|gb|EFN53267.1| hypothetical protein CHLNCDRAFT_137207 [Chlorella variabilis]
          Length = 1782

 Score = 79.7 bits (195), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 89/389 (22%), Positives = 168/389 (43%), Gaps = 68/389 (17%)

Query: 246 KNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLY-LS 304
           + +  +++   H L    Q  M++L+  AM  LP+   Q +SN+ W+++ +  E  +   
Sbjct: 336 QTISNLALAYAH-LGRKPQLLMALLMKEAMPLLPQFKPQELSNLLWSMASM--EFWHGPG 392

Query: 305 EMDRVAEVALTKVGEFNSQNVANVAGAFASMQH-SAPDLFSELAKRASDIVHTFQEQELA 363
            ++ + + A         Q +AN   A+A+M+     ++   +   A   +  F+ QEL 
Sbjct: 393 AVESITQAACGVADRMKPQEIANCCWAWATMRFFPGAEVLDLMLAHAEAQLDRFKSQELG 452

Query: 364 QVLWAFASL-YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSL 422
            + WA A L Y PA  L+ +       A ++    N A+ +C                  
Sbjct: 453 MLTWAVARLAYMPAASLVRA---CLPLAAEWR---NPAVQDC------------------ 488

Query: 423 SSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWK---TISR--FEEQRISEQYREDIMF 477
                       GN+ W++ VLG +     S +     ++ R  F ++   + Y+  +  
Sbjct: 489 ------------GNLLWAFTVLGILTPEVMSVLGHKMLSLPREAFTQEAYIQLYQAKMSL 536

Query: 478 ASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVT---SSFQKEVARLLVS 534
           +  VH +               ++ +  ++ + G+T+   Q      S+  ++VA  +  
Sbjct: 537 SQAVHDI---------------AAHIPPELLARGETEWRQQAAVLKVSATHRDVAAAMAE 581

Query: 535 TGLNW-IREYAVDGY-TVDAVLVDKKVAFEIDGPTHFSRNTG-VPLGHTMLKRRYIAAAG 591
            G+   I     DG  +VD  L  ++VA E+DG  HF++N   VPLG T+ + R +A+ G
Sbjct: 582 LGIEHDIERRIEDGLVSVDIALRSERVAVEVDGSAHFTQNEPFVPLGRTLWRWRLLASRG 641

Query: 592 WNVVSLSHQEWEELQGSFEQLDYLRVILK 620
           W VVS+ +  W  L+   E+  YL  +L+
Sbjct: 642 WRVVSVPYFRWGLLRSMDEKKRYLYQLLQ 670



 Score = 45.8 bits (107), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 29/114 (25%), Positives = 55/114 (48%), Gaps = 9/114 (7%)

Query: 280 ECSAQGISNIAWALSKIGGELLY-----LSEMDRVAEVALTKV---GEFNSQNVANVAGA 331
           E   QG++NI W + K+G ++ +     +  + R  +  LT     G F  QNV+N    
Sbjct: 197 EFKPQGLANILWGMGKLGVKVSHEVRQMVDALCREVQAQLTHSRHKGSFAPQNVSNTLHG 256

Query: 332 FASMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLD 384
             ++    +P+L S L + A  ++ +F  QEL  ++W+ + ++    P    +D
Sbjct: 257 IVNIGIVPSPELLSALVRAADGMLRSFGAQELTNLVWSLSQMHRCGVPFTPDVD 310


>gi|397605332|gb|EJK58971.1| hypothetical protein THAOC_20861 [Thalassiosira oceanica]
          Length = 2083

 Score = 79.0 bits (193), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 91/353 (25%), Positives = 153/353 (43%), Gaps = 23/353 (6%)

Query: 274  AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
             + +L     Q +SN AWA +  G     L +           +  FN Q+++N+A AFA
Sbjct: 741  GLASLDSFKPQALSNTAWAFATAGESHPELFKKIGGHIAGPGSLCSFNPQDLSNIAWAFA 800

Query: 334  SMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDN---AFK 388
            +   S  +LF+++    +  D + +F+ Q L+   WA+A+       L E L     A K
Sbjct: 801  TAGVSHRELFNKIGHHVAGLDSLDSFEPQALSNTAWAYATARVFHSRLFEKLAKEVAARK 860

Query: 389  DATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMD 448
                 T  +   L  C   G       +     ++S +  FN   L NI W+Y+V     
Sbjct: 861  GELIETQHIANFLWACATVGYTDERSFSAFAPVIASKLDKFNEQGLSNITWAYSVANLPR 920

Query: 449  RIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIA 508
            +  F+  + +     E+  S +       A          ++E     + L   L+ K  
Sbjct: 921  QDLFNKGYVSALASNEKVFSGE-----QLAQLHQWQLWQQEMES---GIELPQSLQAKCR 972

Query: 509  SAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDG-YTVDAVLV---DKKVAFEID 564
            +A  ++ +++   S  Q +V   L + GL    E  ++  Y +DA++     +KVA E+D
Sbjct: 973  NAFTSRGYSE---SKLQNDVVGELKAAGLVLDEEVLLESWYLIDALVEFSDGRKVAVEVD 1029

Query: 565  GPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLR 616
            GP+HF      P G T+LK R +A      VVS+ + EW+EL+ S  +  YLR
Sbjct: 1030 GPSHFIDMR--PTGSTILKHRQVARMDHIEVVSVPYWEWDELKNSEMKQHYLR 1080



 Score = 63.2 bits (152), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 49/178 (27%), Positives = 82/178 (46%), Gaps = 20/178 (11%)

Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQ 364
           D VA   L  +  FN Q ++N A AFA+   S P+LF ++    + +  + +F+ Q L  
Sbjct: 659 DHVA--GLMSLNSFNPQALSNTAWAFATAGVSYPELFKKIGGHVAGLGSLDSFKAQALTN 716

Query: 365 VLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSE----- 419
           ++W+FA+  E    L + + +             +ALSN        ++G++  E     
Sbjct: 717 IVWSFATAGESNPKLFKKIGDYIAGLASLDSFKPQALSNTAW--AFATAGESHPELFKKI 774

Query: 420 -GSLSSP--VLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTIS------RFEEQRIS 468
            G ++ P  + SFN   L NIAW++A  G   R  F+ I   ++       FE Q +S
Sbjct: 775 GGHIAGPGSLCSFNPQDLSNIAWAFATAGVSHRELFNKIGHHVAGLDSLDSFEPQALS 832



 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 53/200 (26%), Positives = 81/200 (40%), Gaps = 47/200 (23%)

Query: 303 LSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTF 357
           LS +DR+A  A+  + EF+++ ++N+  +F   +H+ PD     LF+     A  I+HTF
Sbjct: 479 LSIIDRIASSAVGMLNEFDARCLSNLIYSFGLFEHN-PDIEGETLFNVFGDAAGKILHTF 537

Query: 358 QEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDAD 417
           + Q L+ +LWAF  +      L +                        E G V S  D D
Sbjct: 538 ESQNLSNMLWAFVKVDAKHSRLFQ------------------------ETGRVISGMDLD 573

Query: 418 SEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFF-----SDIWKTISR--FEEQRISEQ 470
                     SF    L NI WS+   G+ D   F     S   KT++    +E R S  
Sbjct: 574 ----------SFKPQALANILWSFTKSGKADPELFQALGNSHCRKTVAPCAVQEDRRSHC 623

Query: 471 YREDIMFASQVHLVNQCLKL 490
           +   + F      V  CL +
Sbjct: 624 WTGQLEFIQAAGPVQYCLGV 643



 Score = 40.0 bits (92), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 39/166 (23%), Positives = 72/166 (43%), Gaps = 16/166 (9%)

Query: 206 AQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQR 265
            ++  E+ + I   I   G   S +P    NIA A           +   +HR  F +  
Sbjct: 764 GESHPELFKKIGGHIAGPGSLCSFNPQDLSNIAWAF---------ATAGVSHRELFNK-- 812

Query: 266 EMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEF-NSQN 324
            +   VA  + +L     Q +SN AWA +     + +    +++A+    + GE   +Q+
Sbjct: 813 -IGHHVA-GLDSLDSFEPQALSNTAWAYAT--ARVFHSRLFEKLAKEVAARKGELIETQH 868

Query: 325 VANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
           +AN   A A++ ++    FS  A   +  +  F EQ L+ + WA++
Sbjct: 869 IANFLWACATVGYTDERSFSAFAPVIASKLDKFNEQGLSNITWAYS 914


>gi|158702076|gb|ABW77414.1| RAP domain protein [Arabidopsis thaliana]
          Length = 49

 Score = 79.0 bits (193), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 36/45 (80%), Positives = 40/45 (88%)

Query: 575 VPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
           +PLGHTMLKRRY+AAAGW VVSLS QEWEE +GS EQL+YLR IL
Sbjct: 1   LPLGHTMLKRRYVAAAGWKVVSLSLQEWEEHEGSHEQLEYLREIL 45


>gi|323450314|gb|EGB06196.1| hypothetical protein AURANDRAFT_65882 [Aureococcus anophagefferens]
          Length = 1499

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 89/355 (25%), Positives = 154/355 (43%), Gaps = 43/355 (12%)

Query: 282  SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQ--HSA 339
            ++Q I+N AWA ++ G     L   D +A VA   +  F SQ +AN+A A+A +     A
Sbjct: 831  NSQNIANCAWAYARAGSRDTAL--FDALARVAEPLLDGFKSQELANLAWAYAKLNLVERA 888

Query: 340  PDLFSELAKRASDIVHTFQEQELAQVLWAFAS-------LYEPAD----PLLESLDNAFK 388
              LF +LA+ A   +  +  Q++   LWAFAS       L+E A     P L +LD  F 
Sbjct: 889  QVLFLQLARVAQAKLGRYNAQDVTNTLWAFASNDLEHVALFEAAARHAAPRLRALDRGFA 948

Query: 389  DATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMD 448
            +  Q    L  + +             A + G++   +L  +   + N+AW++A +G+ D
Sbjct: 949  N-PQKVATLAWSYAKAAVYAPALMDALAAACGAIVDELLPVD---VANVAWAFAAVGETD 1004

Query: 449  RIFFSDIWKTISRFEEQRISEQYREDIMFA-SQVHLVNQCLKLEHPHLQLALSSVLEEKI 507
            R    +  K  +      +S Q   +++++ S +     C +L    L    +  + + +
Sbjct: 1005 RGGLFEALKDRALAVLDDLSSQELANLVWSFSNLDDAAPCRELWLVLLDRGWTPAIFDDV 1064

Query: 508  ASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREY------------------AVDGYT 549
            A   K++     +  +    VA +    G  W R                    A  G +
Sbjct: 1065 A---KSQLQQAYLRLTLDGAVAAVPPLDG-EWARALQAALTTSDCALGSRTQLEARSGLS 1120

Query: 550  VDAVLVDKKVAFEIDGPTHFSRNTGVPL-GHTMLKRRYIAAAGWNVVSLSHQEWE 603
            +D    + KVA E DGP H+  N    L G + LKRR +   GW++V + +++W+
Sbjct: 1121 LDMAKPELKVAVEFDGPVHYFANAKWMLTGRSKLKRRLLDLVGWDIVYVDYRDWD 1175



 Score = 55.5 bits (132), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 53/217 (24%), Positives = 90/217 (41%), Gaps = 45/217 (20%)

Query: 190 PSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNM- 248
           PS R KE+N    +   +T  ++L ++            P+ L+  N  TALHR++K   
Sbjct: 255 PSARDKEVN-TLLLRKCKTVADILALVERE--------GPARLNTFNQVTALHRLSKAGL 305

Query: 249 -----------------------EKVSMMTTHRLAFTRQREMSMLVA----------IAM 275
                                   +  + TT  L  T      M V              
Sbjct: 306 RLGRGGGEPLVEALVASVAGKIGARPGVFTTRHLVNTAYSLGKMKVTDARAYAAIATACG 365

Query: 276 TALPECSAQGISNIAWALS--KIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
             L E +AQ ++N++WA +  ++  +   L+ + R+   A  ++G F SQ ++N   AFA
Sbjct: 366 PRLGEFNAQDVANLSWAYATAEVSDDADCLATLRRLPGAAQRELGSFTSQGLSNTVWAFA 425

Query: 334 SMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
           +M   AP+L + +A      +  +  QELA  +WA+A
Sbjct: 426 TMGLRAPELMAHVAAEGERRLGEYNAQELANTVWAYA 462



 Score = 52.4 bits (124), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 84/340 (24%), Positives = 130/340 (38%), Gaps = 60/340 (17%)

Query: 220 ITAVGKGLSPSPL---SPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMT 276
           I  V  G  P  L   SP N+A+ LH +      VS      + F+R       +     
Sbjct: 601 ICDVAAGDGPCSLDGFSPQNLASLLHAL-----TVSGFDAPDV-FSRAPPRVAAL----- 649

Query: 277 ALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQ 336
            LP C+AQ ISN  W+ +        L +      VA      F +QNV+NVA +FA + 
Sbjct: 650 -LPACNAQDISNTVWSFASNDIRDARLFDAVDAFLVAEGVPETFGAQNVSNVAWSFAKVA 708

Query: 337 HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESL-------DNAFKD 389
             +  LF  L   A+ I+  F  Q  +  L+AFA      D    S+       + A+  
Sbjct: 709 MGSDALFGVLGDFAASIIDQFSNQNCSNTLYAFALANRRHDAFFRSMCGEIVRQEAAWSP 768

Query: 390 ATQFTCCLNKALSNCNENGGVKSSGDADSE-----------GSLSSPVLS---------- 428
           + Q     N A +       V  +GD  S+              ++P  +          
Sbjct: 769 SGQDIA--NSAWALATIGLTVAPAGDDKSQVKRRLEDGPDADYFATPAFAALSRAAVRVC 826

Query: 429 ---FNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVN 485
              FN   + N AW+YA  G  D   F  + +      +   S++        ++++LV 
Sbjct: 827 GRGFNSQNIANCAWAYARAGSRDTALFDALARVAEPLLDGFKSQELANLAWAYAKLNLVE 886

Query: 486 QCLKLEHPHLQLALSSVLEEKIASAGKTKRFN-QKVTSSF 524
           +   L    LQLA       ++A A K  R+N Q VT++ 
Sbjct: 887 RAQVL---FLQLA-------RVAQA-KLGRYNAQDVTNTL 915



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 53/98 (54%), Gaps = 7/98 (7%)

Query: 278 LPECSAQGISNIAWALSKIGGELLYLSE---MDRVAEVALTKVGEFNSQNVANVAGAFAS 334
           L E +AQ ++N  WA +K G E    S+   +  +A  AL K+G+FN QN+ N A AFA+
Sbjct: 446 LGEYNAQELANTVWAYAKCGAE----SQEPFLRAIARAALAKLGDFNPQNLTNTAWAFAT 501

Query: 335 MQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
                P+LF  +A  +   +  F  Q L+   WAFA +
Sbjct: 502 AGVVVPELFDGVAAASVRQLDVFNPQNLSNTGWAFAKV 539



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 4/89 (4%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
           Q ++N AWA +  G  ++     D VA  ++ ++  FN QN++N   AFA + +    LF
Sbjct: 490 QNLTNTAWAFATAG--VVVPELFDGVAAASVRQLDVFNPQNLSNTGWAFAKVGYYDARLF 547

Query: 344 SELAKRAS--DIVHTFQEQELAQVLWAFA 370
             +A R +  D++  F  Q L+ V W+ A
Sbjct: 548 RAIAARVARDDVIGVFNPQNLSNVAWSLA 576



 Score = 44.3 bits (103), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 27/102 (26%), Positives = 50/102 (49%), Gaps = 14/102 (13%)

Query: 284 QGISNIAWALSKIGGE----------LLYLSEMDRVAEVAL----TKVGEFNSQNVANVA 329
           Q +SN+AW+L+K   E          + Y   + ++ +VA       +  F+ QN+A++ 
Sbjct: 566 QNLSNVAWSLAKRLTEGPEVHDGDEKVAYFDCLRKICDVAAGDGPCSLDGFSPQNLASLL 625

Query: 330 GAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFAS 371
            A       APD+FS    R + ++     Q+++  +W+FAS
Sbjct: 626 HALTVSGFDAPDVFSRAPPRVAALLPACNAQDISNTVWSFAS 667



 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 51/110 (46%), Gaps = 7/110 (6%)

Query: 270  LVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKV-----GEFNSQN 324
            L  +A   L   +AQ ++N  WA +    +L +++  +  A  A  ++     G  N Q 
Sbjct: 895  LARVAQAKLGRYNAQDVTNTLWAFAS--NDLEHVALFEAAARHAAPRLRALDRGFANPQK 952

Query: 325  VANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYE 374
            VA +A ++A     AP L   LA     IV      ++A V WAFA++ E
Sbjct: 953  VATLAWSYAKAAVYAPALMDALAAACGAIVDELLPVDVANVAWAFAAVGE 1002


>gi|397586873|gb|EJK53743.1| hypothetical protein THAOC_26753, partial [Thalassiosira oceanica]
          Length = 447

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 84/316 (26%), Positives = 138/316 (43%), Gaps = 30/316 (9%)

Query: 315 TKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASL 372
           T +  F  Q+ +N A AFA+   S P+LF ++    + +  +++F  Q L+   W+FA+ 
Sbjct: 33  TSLNSFKPQDFSNTAWAFATAGASHPELFKKIGNHLAGLMSLNSFNPQALSNTAWSFATA 92

Query: 373 YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSE-GSLSSPVLSFNR 431
                 L   + +   +   F     + LSN        + G  D    S  +PV+    
Sbjct: 93  GISYPELFRKIGDHVAELGCFDSFKPQELSNT--VWACATIGHTDERLFSAFAPVIRSKL 150

Query: 432 DQ-----LGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRIS-EQYREDIMFASQVHLVN 485
           D+     L NIAW+Y+V        F++ +       E   S E++R+   +      + 
Sbjct: 151 DECSEQDLANIAWAYSVANLPRHDLFNEGYAGALASNENEFSVEEFRQLHQWQLWQQELQ 210

Query: 486 QCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV 545
             ++         L   L  K  +A  ++ F++   S  Q +V   L   GL+   E  +
Sbjct: 211 SGIE---------LPRSLRAKCRNAFTSRGFSE---SKLQNDVVDELRIAGLDLEEEVLL 258

Query: 546 -DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQ 600
             GY +DA++     +KVA E+DGP HF      P G T LK R +A      VVS+ + 
Sbjct: 259 GSGYRIDALVKVGDGRKVAIEVDGPFHFIDRR--PAGRTTLKHRQVATLDRIEVVSVPYW 316

Query: 601 EWEELQGSFEQLDYLR 616
           EW+EL+ S  +  YLR
Sbjct: 317 EWDELKNSEMKQHYLR 332



 Score = 42.4 bits (98), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 46/99 (46%), Gaps = 4/99 (4%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGEL--LYLSEMDRVAEVALTKVGEFNSQNVANVAGA 331
            + +L   + Q +SN AW+ +  G     L+    D VAE+       F  Q ++N   A
Sbjct: 70  GLMSLNSFNPQALSNTAWSFATAGISYPELFRKIGDHVAELGC--FDSFKPQELSNTVWA 127

Query: 332 FASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
            A++ H+   LFS  A      +    EQ+LA + WA++
Sbjct: 128 CATIGHTDERLFSAFAPVIRSKLDECSEQDLANIAWAYS 166


>gi|224004716|ref|XP_002296009.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|209586041|gb|ACI64726.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1278

 Score = 76.3 bits (186), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 95/438 (21%), Positives = 164/438 (37%), Gaps = 113/438 (25%)

Query: 264  QREMSMLVAIAMTALPECSAQGISNIAWALSKIG-------------------------- 297
            QR    +V          S+QG+ N  W+ +K G                          
Sbjct: 846  QRIAEHIVGNNGRGFSSFSSQGLGNTLWSFAKQGQLSLDVIELLGDSAKAVSTGRLAVYE 905

Query: 298  ------GELLYLSEMDRVAEVALT-KVGEFNSQNVANVAGAFASMQ--HSA------PDL 342
                  GE L        AE  L+  +  F +Q+++N   A+A++   HS         +
Sbjct: 906  TSCLDIGEKLLKQLFAMAAEAGLSMNLDRFKTQDISNTCWAYATLGLLHSGFFNNVESQV 965

Query: 343  FSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDNAFKDATQFTCCLNKAL 401
             S +    S     F+ QE+A +LW+FA+L  +P   ++++L +                
Sbjct: 966  ISRIGSVPSKSRQIFRGQEMANILWSFATLNAQPQPAMVDALASYIA------------- 1012

Query: 402  SNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTI-- 459
            + C    G     D  S   L      F R +L NIAWS AVLG+  +   + ++  I  
Sbjct: 1013 AGCRGKNGP----DEHSVSRL------FKRQELANIAWSCAVLGRYPKELMNILYTGIVG 1062

Query: 460  SRFEEQRISEQYREDIMFASQV---HLVNQCLKLEHPHLQLALSSVLEE----------- 505
            +R + Q + + + ++ +  S +   + V     +E P L+L L +               
Sbjct: 1063 TRNDPQEMKQIFDDEGLQKSSIMTLYYVQVAADVEAPQLKLKLPNGFPNGWCDDGEGHSV 1122

Query: 506  KIASAGKTKRFNQK-------VTSSFQKEVARLLVSTGLNWIREYAVDG----------- 547
             I+S G      Q          S  Q++V++     G     E+ +D            
Sbjct: 1123 GISSKGDESDLAQVSSSMLTLTVSKLQRDVSKTFDRLGFENEMEHVIDTNEIKDEYGIQL 1182

Query: 548  -------YTVDAVLVDKKVAFEIDGPTHF-------SRNTGVPLGHTMLKRRYIAAAGWN 593
                    ++D   V+++V  E+DGP HF        R      G T+LK R +   GW+
Sbjct: 1183 PKTPQEFLSIDIANVEQRVGIEVDGPGHFVRLIDSKDRGDNRVNGPTLLKHRLLTHLGWD 1242

Query: 594  VVSLSHQEWEELQGSFEQ 611
            ++ L + E++ L G  E+
Sbjct: 1243 IIHLPYWEYQSLGGGEEE 1260



 Score = 45.8 bits (107), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 46/196 (23%), Positives = 90/196 (45%), Gaps = 15/196 (7%)

Query: 270 LVAIAMTALPEC---SAQGISNIAWALSKIGGELLYLSEM-DRVAEVALTKVGEFNSQNV 325
           L  IA +ALP      AQ ++N+AW  +++G       ++ + VA+    ++ +F  Q+V
Sbjct: 666 LETIADSALPRLERFKAQELNNLAWGFARLGHRTEKAEKLFEGVAKQLKQRIHQFKPQDV 725

Query: 326 ANVAGAFASMQHSAPDLFSELAKRAS-DIVHTFQEQELAQVLWAFASL-YEPADPLLESL 383
                +F++ ++   D F   A R +   + +F+ QE++  +WA A+  + P    + + 
Sbjct: 726 GTTLWSFSTAEYFDLDAFRTGASRLNFQHIRSFKPQEMSNTVWALATAGFTPK--YIHAF 783

Query: 384 DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
           D     ATQ        L++  ++   +       E ++  P L F   +L +I WS++ 
Sbjct: 784 DTTLVPATQ-----RPPLNDIKKDPITECFAAVAGE-AMRRP-LDFKDQELKDILWSFSK 836

Query: 444 LGQMDRIFFSDIWKTI 459
           +G      F  I + I
Sbjct: 837 IGVRHPALFQRIAEHI 852



 Score = 45.8 bits (107), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 22/74 (29%), Positives = 41/74 (55%), Gaps = 3/74 (4%)

Query: 301 LYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH---SAPDLFSELAKRASDIVHTF 357
           L    ++ +A+ AL ++  F +Q + N+A  FA + H    A  LF  +AK+    +H F
Sbjct: 661 LVFETLETIADSALPRLERFKAQELNNLAWGFARLGHRTEKAEKLFEGVAKQLKQRIHQF 720

Query: 358 QEQELAQVLWAFAS 371
           + Q++   LW+F++
Sbjct: 721 KPQDVGTTLWSFST 734



 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 46/203 (22%), Positives = 81/203 (39%), Gaps = 33/203 (16%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDR-------------------------VAEVALTKVG 318
           Q +SN  WAL+  G    Y+   D                          VA  A+ +  
Sbjct: 761 QEMSNTVWALATAGFTPKYIHAFDTTLVPATQRPPLNDIKKDPITECFAAVAGEAMRRPL 820

Query: 319 EFNSQNVANVAGAFASMQHSAPDLFSELAKRA----SDIVHTFQEQELAQVLWAFASLYE 374
           +F  Q + ++  +F+ +    P LF  +A+           +F  Q L   LW+FA   +
Sbjct: 821 DFKDQELKDILWSFSKIGVRHPALFQRIAEHIVGNNGRGFSSFSSQGLGNTLWSFAKQGQ 880

Query: 375 PADPLLESLDNAFKDATQFTCCLNKALSNCNENGG--VKSSGDADSEGSLSSPVLSFNRD 432
            +  ++E L ++ K  +  T  L    ++C + G   +K      +E  LS  +  F   
Sbjct: 881 LSLDVIELLGDSAKAVS--TGRLAVYETSCLDIGEKLLKQLFAMAAEAGLSMNLDRFKTQ 938

Query: 433 QLGNIAWSYAVLGQMDRIFFSDI 455
            + N  W+YA LG +   FF+++
Sbjct: 939 DISNTCWAYATLGLLHSGFFNNV 961


>gi|397566229|gb|EJK44967.1| hypothetical protein THAOC_36452, partial [Thalassiosira oceanica]
          Length = 366

 Score = 75.9 bits (185), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 92/339 (27%), Positives = 153/339 (45%), Gaps = 34/339 (10%)

Query: 299 ELLYLSEMDRVAEVALT--KVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI--V 354
           E   LS   R+A ++L    +  FNSQN++N   AFA+   S P+LF+++    + +  +
Sbjct: 42  ECRILSCSRRLAIMSLDCDSLDSFNSQNLSNTVWAFATAGESHPELFNKIGNHIAGLASL 101

Query: 355 HTFQEQELAQVLWAFASLYEPADPLLESLDN-AFKDATQFTCCLNKALSNCNENGGVKSS 413
            +F  Q L+  +WA+A+       L E L   A      F     + +SN          
Sbjct: 102 GSFNPQNLSITVWAYATARVFHSRLFEKLTTEAVAKKDHFD---EQGVSNLLWACATVDY 158

Query: 414 GDADSEGSLSSPVLSF-----NRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRIS 468
            D     +L +P++       N  +L NIAW+Y+V     +  F++ + +     E+  S
Sbjct: 159 IDERLFSAL-APMIGLKLDKCNEQELANIAWAYSVANTPRQDLFNEGYVSALASNEKDFS 217

Query: 469 EQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEV 528
            +       A          +L+     + L   L+ K  +A  +  F++   S FQ +V
Sbjct: 218 AE-----GLAQLHQWQLWQQELKS---GIELPQSLQAKCRNAFTSHGFSE---SKFQNDV 266

Query: 529 ARLLVSTGLNWIREYAV--DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLK 583
              L + GL+ + E A+   GY +DA++     +KVA E+DGP+HF      P G T LK
Sbjct: 267 VYELKAAGLD-LDEEALFGSGYRIDALVKVGDGRKVAVEVDGPSHFIDRR--PAGSTTLK 323

Query: 584 RRYIAAAG-WNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
            R +A      VV + + EW+ L+ S  +  YL + L D
Sbjct: 324 HRQVARLDRIQVVPVPYWEWDNLKNSEMKQHYLHLKLSD 362


>gi|397606443|gb|EJK59317.1| hypothetical protein THAOC_20479, partial [Thalassiosira oceanica]
          Length = 472

 Score = 75.9 bits (185), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 89/342 (26%), Positives = 145/342 (42%), Gaps = 59/342 (17%)

Query: 284 QGISNIAWALSKIGGELLYLSEM--DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           Q +SN  WA +  G     L  M    VAE  L  +  F +Q ++N A A A+   S P+
Sbjct: 117 QDLSNTIWAFATAGVLHPELFNMIGHHVAE--LGSLDSFKAQALSNTAWALATAGVSHPE 174

Query: 342 LFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNK 399
           LF+++    + +  + +F+ QEL+  LWA AS+    + L  +L             +  
Sbjct: 175 LFNKIGNHIAGLGSLDSFKPQELSNTLWACASVCYTDERLFSAL----------APVIAS 224

Query: 400 ALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTI 459
            L  C+E                           L N+AW+Y+V     +  F + + + 
Sbjct: 225 KLDKCSEQ-------------------------DLANVAWAYSVANTPRQDLFDEGYVSA 259

Query: 460 SRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQK 519
               E   S +       A          +LE    ++ L    + K  +A  ++ +++ 
Sbjct: 260 LASNENEFSGK-----ELAQLHQWQLWQQELES---RIELQGPFQAKCRNAFTSRGYSE- 310

Query: 520 VTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGV 575
             S  Q +V   L + GL    E  +  GY +DA++     +KVA E+DGP+HF      
Sbjct: 311 --SKLQNDVVDELKAAGLVLDEEVLLGSGYLIDALVEFNDGRKVAVEVDGPSHFIDRR-- 366

Query: 576 PLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLR 616
           P G T+LK R +A      VVS+ + EW+EL+ S  +  YLR
Sbjct: 367 PAGRTILKHRQVAKMDHIKVVSVPYWEWDELKNSEMKQRYLR 408



 Score = 42.7 bits (99), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 48/107 (44%)

Query: 275 MTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFAS 334
           + +L    AQ +SN AWAL+  G     L          L  +  F  Q ++N   A AS
Sbjct: 147 LGSLDSFKAQALSNTAWALATAGVSHPELFNKIGNHIAGLGSLDSFKPQELSNTLWACAS 206

Query: 335 MQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLE 381
           + ++   LFS LA   +  +    EQ+LA V WA++    P   L +
Sbjct: 207 VCYTDERLFSALAPVIASKLDKCSEQDLANVAWAYSVANTPRQDLFD 253



 Score = 38.9 bits (89), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 39/152 (25%), Positives = 64/152 (42%), Gaps = 18/152 (11%)

Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVH--TFQEQELAQ 364
           D VA   L  +  FN Q ++N A AFAS +   P+L  ++    +  +   +F+ Q L+ 
Sbjct: 4   DHVA--GLDSLNSFNPQTLSNTAWAFASAEVPHPELLRKIGDHIAGQMSLISFEPQNLSN 61

Query: 365 VLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGD-ADSEGSLS 423
             WA+A+  +     L+S D      T +      A +  + +   K  G+     G L 
Sbjct: 62  TAWAYAAAGD-----LDSFDPKVLSITAWAF----ATAGVSHDELFKKIGNHVTGPGGLG 112

Query: 424 SPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDI 455
               SF    L N  W++A  G +    F+ I
Sbjct: 113 ----SFKPQDLSNTIWAFATAGVLHPELFNMI 140


>gi|397589068|gb|EJK54518.1| hypothetical protein THAOC_25847, partial [Thalassiosira oceanica]
          Length = 342

 Score = 75.5 bits (184), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 91/323 (28%), Positives = 140/323 (43%), Gaps = 32/323 (9%)

Query: 314 LTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFAS 371
           L  +  F  QN++N A AFA+   S P LF ++    + +  +  F+  EL+   WAFA 
Sbjct: 12  LDSLDSFKQQNLSNTAWAFATAGESHPGLFRKIGGHVAGLMSLDLFKPLELSNTAWAFAK 71

Query: 372 LYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSE-GSLSSPVLSFN 430
             +    L + + +             +ALSN        + G  D    S  +PV+   
Sbjct: 72  AGKSNPKLFKKICDYIAGLDSLDSFDPQALSNI--VWACATVGYTDERLFSAFAPVIESK 129

Query: 431 RDQ-----LGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQY--REDIMFASQVHL 483
            D+     L NI+W+Y+V     +  F++ +       E+  SE+   +       Q  L
Sbjct: 130 LDECSEQHLANISWAYSVANLPKQDLFNEGYAGALASNEKDFSEEVLCQLHQWQLWQQEL 189

Query: 484 VNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREY 543
           V   L +E P    +L +       SAG ++       S  Q +V   L + GL    E 
Sbjct: 190 V---LGIELPE---SLQAKCRNAFTSAGYSE-------SKLQNDVVGELRAAGLVLDEEV 236

Query: 544 AV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAG-WNVVSLS 598
            +  GY +DA++     +KVA E+DGP HF      P G T LK R +A      VVS+ 
Sbjct: 237 LLGSGYRIDALVKFGDGRKVAVEVDGPFHFIDRR--PAGSTTLKHRQVARLDRIEVVSVP 294

Query: 599 HQEWEELQGSFEQLDYLRVILKD 621
           + EW+EL+ S  +  YL V L D
Sbjct: 295 YWEWDELKNSEMKQHYLLVKLPD 317



 Score = 40.4 bits (93), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 43/87 (49%), Gaps = 4/87 (4%)

Query: 286 ISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
           +SN AWA +K G     L+    D +A   L  +  F+ Q ++N+  A A++ ++   LF
Sbjct: 62  LSNTAWAFAKAGKSNPKLFKKICDYIA--GLDSLDSFDPQALSNIVWACATVGYTDERLF 119

Query: 344 SELAKRASDIVHTFQEQELAQVLWAFA 370
           S  A      +    EQ LA + WA++
Sbjct: 120 SAFAPVIESKLDECSEQHLANISWAYS 146


>gi|397606466|gb|EJK59321.1| hypothetical protein THAOC_20474 [Thalassiosira oceanica]
          Length = 282

 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 80/320 (25%), Positives = 134/320 (41%), Gaps = 55/320 (17%)

Query: 309 VAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVL 366
           +A +AL  +  F    ++N A AFA    S P LF ++    +  D + +F  Q L+ ++
Sbjct: 7   IARLALGSLDLFKPLELSNTAWAFAKAGKSNPKLFKKICDYIAGLDSMDSFDPQALSNIV 66

Query: 367 WAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPV 426
           WA A++    + L           + F   +   L  C+E                    
Sbjct: 67  WACATVGHTDERLF----------SAFAPVIASKLDECSEQ------------------- 97

Query: 427 LSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQ 486
                  L NIAW+Y+V     +  F++ + +     E+  SE+     +          
Sbjct: 98  ------HLANIAWAYSVANTPRQDLFNEGFVSALASNEKDFSEE-----VLCQLHQWQLW 146

Query: 487 CLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV- 545
             +LE     + L   L+EK  +A  +  +++   S  Q +V   L + GL    E  + 
Sbjct: 147 QQELES---GIELPGSLQEKCRNAFTSASYSE---SKLQNDVVGELKAAGLVLDEEVLLG 200

Query: 546 DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAA-GWNVVSLSHQE 601
            GY +DA++     +K+A E+DGP+HF      P G T+LK+R +       VV + + E
Sbjct: 201 SGYRIDALVKISDGRKLAVEVDGPSHFIDRR--PAGRTILKQRQVTRLDSIEVVPVPYWE 258

Query: 602 WEELQGSFEQLDYLRVILKD 621
           W EL  S  +  YLRV L +
Sbjct: 259 WNELMNSVMKQHYLRVKLSN 278



 Score = 44.3 bits (103), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 46/97 (47%), Gaps = 4/97 (4%)

Query: 286 ISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
           +SN AWA +K G     L+    D +A   L  +  F+ Q ++N+  A A++ H+   LF
Sbjct: 23  LSNTAWAFAKAGKSNPKLFKKICDYIA--GLDSMDSFDPQALSNIVWACATVGHTDERLF 80

Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLL 380
           S  A   +  +    EQ LA + WA++    P   L 
Sbjct: 81  SAFAPVIASKLDECSEQHLANIAWAYSVANTPRQDLF 117


>gi|302849501|ref|XP_002956280.1| hypothetical protein VOLCADRAFT_97290 [Volvox carteri f. nagariensis]
 gi|300258392|gb|EFJ42629.1| hypothetical protein VOLCADRAFT_97290 [Volvox carteri f. nagariensis]
          Length = 1331

 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 35/94 (37%), Positives = 58/94 (61%), Gaps = 1/94 (1%)

Query: 513  TKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRN 572
            T  F ++V S +Q+++A  L    L  + E    GY++D  L   ++A E DGPTH SR 
Sbjct: 998  TSGFRRRVQSGYQRQMANSLTGLRLMHLLEDNCTGYSIDITLPQLRIALEADGPTHTSRT 1057

Query: 573  T-GVPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL 605
              G  LG T +KRR++   GW+V++++++EW++L
Sbjct: 1058 PGGAVLGATAMKRRHLQKMGWHVINVTYKEWDKL 1091



 Score = 42.0 bits (97), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 46/197 (23%), Positives = 80/197 (40%), Gaps = 28/197 (14%)

Query: 291 WALSKIGGELLYLSEMDRVAEVAL----------------TKVGEFNSQNVANVAGAFAS 334
           W +S +GG   + +E +    + +                   G  +      V  A  +
Sbjct: 713 WGMSSLGGSPYFQAETEAAVTILVRCLAAVAAAAGGTAATAASGGLSGWQAGQVLWALGN 772

Query: 335 MQHSAPDLFS-ELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLE-SLDNAFKDAT 391
            +H+ P L   E +   S  + + Q ++L++VLW FASL Y P   LL    D ++++ T
Sbjct: 773 SRHATPRLMDLETSILRSGGLSSMQPRDLSRVLWGFASLGYRPERLLLTIRPDWSWRERT 832

Query: 392 QFTCCLN---------KALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYA 442
             T  ++         +A    +  GG +  G    +   S  V SF   QL  + W+ A
Sbjct: 833 TATAVVSEDGRTSPKARARGKRSSRGGGRGGGGRGRQVVQSGDVRSFTPQQLSGVVWALA 892

Query: 443 VLGQMDRIFFSDIWKTI 459
           V+ Q+D + F   W  +
Sbjct: 893 VMEQVDTVPFRSAWTQL 909


>gi|397612992|gb|EJK61975.1| hypothetical protein THAOC_17440 [Thalassiosira oceanica]
          Length = 348

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 90/334 (26%), Positives = 147/334 (44%), Gaps = 31/334 (9%)

Query: 302 YLSEMDRVAE--VALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRAS--DIVHTF 357
           YLS +  + +    L  +  F+ QN++N A AFA+   S P LF  +    +  D + +F
Sbjct: 21  YLSALPGIGDHIAGLDNLDSFDLQNLSNTAWAFATSGMSNPKLFRMIGGHVAGLDSLDSF 80

Query: 358 QEQELAQVLWAFAS--LYEPADPLLESLDN---AFKDATQFTCCLNKALSNCNENGGVKS 412
           + Q+ +   WA+A+  L+ P   L E L     A KD        N  L  C   G    
Sbjct: 81  KPQDASITAWAYATARLFNPR--LFEKLATEMPARKDHFHGQAVAN-FLWACATVGYTDE 137

Query: 413 SGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYR 472
              A     ++S +   +   L NIAW+Y+V      +F       I+  E+   +E+  
Sbjct: 138 RLFAAFAPLIASKLDECSEQDLANIAWAYSVENAPQDLFNEGYASAIASKEKDFSAEELL 197

Query: 473 EDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLL 532
           +   +      +   ++L            L  K  +A  ++ +++   S  Q +V   L
Sbjct: 198 QLHQWQLWQQELESGIELPRS---------LRAKCRNAFTSQGYSE---SKLQNDVVGEL 245

Query: 533 VSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIA 588
            + GL+   E  +  GY +DA++     + VA E+DGP+HF      P G T+LK R +A
Sbjct: 246 KAAGLDLEEEVLLGSGYRIDALVKFSDGRIVAVEVDGPSHFIDRR--PTGSTILKHRQVA 303

Query: 589 AAG-WNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
                 VVS+   EWE+L+ S  +  YLRV L +
Sbjct: 304 RLDRIEVVSVPFWEWEKLKNSEMKQHYLRVKLSN 337


>gi|397579135|gb|EJK51101.1| hypothetical protein THAOC_29762, partial [Thalassiosira oceanica]
          Length = 285

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 82/316 (25%), Positives = 138/316 (43%), Gaps = 55/316 (17%)

Query: 313 ALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFA 370
            L  +  F  Q  +N   AFA+   S P LF ++A  A+  D + +F  QEL+ ++WA A
Sbjct: 7   GLDSLDSFKPQAFSNTVWAFATAGESNPKLFKKIANHAAGLDSLDSFTPQELSNIVWACA 66

Query: 371 SLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFN 430
           ++              + D  +F C +   ++                     S +  F 
Sbjct: 67  TV-------------GYID-ERFFCAVAPMIA---------------------SKLDEFI 91

Query: 431 RDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKL 490
              L +IAW+Y+V        F + + +     E+  S +       A          +L
Sbjct: 92  EQDLSHIAWAYSVANTPRLDLFDEGYASALASNEKEFSAE-----GLAQLHQWQLWQQEL 146

Query: 491 EHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVD-GYT 549
           E   ++L LS  L+ K  +A  ++ +++   S  Q +V   L + GL+   E  ++ GY 
Sbjct: 147 ES-GIELPLS--LQAKCRNAFTSRGYSE---SKLQNDVVGELKAAGLDLDEEVLLESGYR 200

Query: 550 VDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEEL 605
           +DA++     +KVA E+DGP+HF      P G T LK R +       VVS+ + EW++L
Sbjct: 201 IDALVKISDGRKVAVEVDGPSHFIDRR--PTGSTTLKHRQVERLDHIEVVSVPYWEWDKL 258

Query: 606 QGSFEQLDYLRVILKD 621
           + S  +  YLRV L +
Sbjct: 259 KNSEMKQHYLRVKLSN 274


>gi|145352343|ref|XP_001420509.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580743|gb|ABO98802.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1070

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 65/223 (29%), Positives = 97/223 (43%), Gaps = 43/223 (19%)

Query: 267 MSMLVAIAMTALPECSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALT-KVGEFNS 322
            + L   A   L   +AQG++N  W+ SK G    EL   S+  R  E  +T    EFNS
Sbjct: 315 FTTLAKHAERHLSALNAQGLTNTVWSFSKCGHLDAELF--SKFGRSIERRMTANASEFNS 372

Query: 323 QNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL--YEPA---- 376
           Q++AN A AF    H    LF+ LA  +   +  F  Q+L    WAFA L  Y+      
Sbjct: 373 QDIANTAWAFGKACHHDEKLFTSLASLSERCLADFNTQDLVNTTWAFAKLGRYDAKLFVA 432

Query: 377 ------DPLLESLDN--------AFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSL 422
                 D  L  LD          F  A+Q +  L  AL++  E+        AD     
Sbjct: 433 ARKSILDHRLNDLDAPNIANIVWTFDQASQLSEALFVALASAAEH-------QAD----- 480

Query: 423 SSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQ 465
                +FN   L N+AW++A  GQ++   F+ + +++ R  ++
Sbjct: 481 -----NFNAQDLVNVAWTFANSGQVNDALFTALARSVKRLMDE 518



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/191 (26%), Positives = 85/191 (44%), Gaps = 14/191 (7%)

Query: 278 LPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH 337
           L E + QG+SN AW  +K G   + +     +++ A  ++ +FN+Q+ +N+  AFA    
Sbjct: 252 LGEFNTQGLSNTAWGFAKSG--FVDVGLFRAMSQKAQERLDDFNAQDFSNLIYAFAKAGQ 309

Query: 338 SAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFK-----DATQ 392
               LF+ LAK A   +     Q L   +W+F+        L      + +     +A++
Sbjct: 310 YDAKLFTTLAKHAERHLSALNAQGLTNTVWSFSKCGHLDAELFSKFGRSIERRMTANASE 369

Query: 393 FTCCLNKALSNCNENGGVKSSGDA---DSEGSLSSPVLS-FNRDQLGNIAWSYAVLGQMD 448
           F    ++ ++N     G     D     S  SLS   L+ FN   L N  W++A LG+ D
Sbjct: 370 FN---SQDIANTAWAFGKACHHDEKLFTSLASLSERCLADFNTQDLVNTTWAFAKLGRYD 426

Query: 449 RIFFSDIWKTI 459
              F    K+I
Sbjct: 427 AKLFVAARKSI 437



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 63/242 (26%), Positives = 93/242 (38%), Gaps = 56/242 (23%)

Query: 283 AQGISNIAWALSKIGGELLYL-SEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
            Q ++NIAWA +K   +   L S + R+AE    +   FNSQ + N   AFAS+ H+   
Sbjct: 183 GQELANIAWAFAKADYKCERLFSALARMAERHAER---FNSQELTNTCWAFASVGHADAR 239

Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFA-----------SLYEPADPLLESLDN----- 385
           LF  LA+     +  F  Q L+   W FA           ++ + A   L+  +      
Sbjct: 240 LFKALARCVERRLGEFNTQGLSNTAWGFAKSGFVDVGLFRAMSQKAQERLDDFNAQDFSN 299

Query: 386 ---AFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYA 442
              AF  A Q+   L   L+               +E  LS+     N   L N  WS++
Sbjct: 300 LIYAFAKAGQYDAKLFTTLAK-------------HAERHLSA----LNAQGLTNTVWSFS 342

Query: 443 VLGQMDRIFFSDIWKTISRFEEQRISEQYREDI----------------MFASQVHLVNQ 486
             G +D   FS   ++I R      SE   +DI                +F S   L  +
Sbjct: 343 KCGHLDAELFSKFGRSIERRMTANASEFNSQDIANTAWAFGKACHHDEKLFTSLASLSER 402

Query: 487 CL 488
           CL
Sbjct: 403 CL 404



 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 49/188 (26%), Positives = 73/188 (38%), Gaps = 24/188 (12%)

Query: 286 ISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSE 345
           ++N+A   +K GG     + + +  E  L  +     Q +AN+A AFA   +    LFS 
Sbjct: 149 LANVAHGAAKGGGSEELFAALAKAIERHLGGIDR--GQELANIAWAFAKADYKCERLFSA 206

Query: 346 LAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCN 405
           LA+ A      F  QEL    WAFAS+      L ++L            C+ + L   N
Sbjct: 207 LARMAERHAERFNSQELTNTCWAFASVGHADARLFKALAR----------CVERRLGEFN 256

Query: 406 ENG------GVKSSGDADS------EGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFS 453
             G      G   SG  D              +  FN     N+ +++A  GQ D   F+
Sbjct: 257 TQGLSNTAWGFAKSGFVDVGLFRAMSQKAQERLDDFNAQDFSNLIYAFAKAGQYDAKLFT 316

Query: 454 DIWKTISR 461
            + K   R
Sbjct: 317 TLAKHAER 324



 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 48/96 (50%), Gaps = 6/96 (6%)

Query: 278 LPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASM 335
           L +  A  I+NI W   +     E L+++    +A  A  +   FN+Q++ NVA  FA+ 
Sbjct: 442 LNDLDAPNIANIVWTFDQASQLSEALFVA----LASAAEHQADNFNAQDLVNVAWTFANS 497

Query: 336 QHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFAS 371
                 LF+ LA+    ++  F ++EL  + WAF +
Sbjct: 498 GQVNDALFTALARSVKRLMDEFSDEELNNLEWAFTT 533


>gi|397639871|gb|EJK73811.1| hypothetical protein THAOC_04541, partial [Thalassiosira oceanica]
          Length = 292

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 84/298 (28%), Positives = 139/298 (46%), Gaps = 22/298 (7%)

Query: 328 VAGAFASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDN 385
           ++ AFA+ +    +LF ++A   +  D + +F  Q ++ + WAFA+       L E L  
Sbjct: 1   ISWAFATARVPHAELFEKIAYHIAGLDSLDSFTAQNVSNIAWAFATAKIYHSHLFEKLAE 60

Query: 386 AFKDATQFTCCLNKA--LSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
           A     +FT   N A  L  C   G       +     +SS +  F+  Q+ N++W+Y+V
Sbjct: 61  AAARKGRFTDTTNIATFLWACATVGYTIERLFSGFALIISSKLDEFSDQQISNVSWAYSV 120

Query: 444 LGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVL 503
              M    F+  +      +E+  S   +E +    Q  L  Q L  E   ++L LS  L
Sbjct: 121 ANVMSEGLFNKGYAGALASKEKHFS---KEGLTQLHQWQLWQQELGSE---IELPLS--L 172

Query: 504 EEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKV 559
            +K   A  +  +++   S  Q +V   + + GL+   E  +  GY +DAV+     KKV
Sbjct: 173 RKKCRHAFISTSYSE---SKLQNDVVGGVRAIGLDLDEEVLLGSGYRIDAVVKVGHGKKV 229

Query: 560 AFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLR 616
           A E+DGP+H+      P G T+LKRR +       VV++ + EW EL+ +  +  YLR
Sbjct: 230 AVEVDGPSHYIHRR--PTGSTILKRRQVTRLDLIEVVTVPYWEWGELKSTKMKQLYLR 285



 Score = 46.2 bits (108), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 59/113 (52%), Gaps = 8/113 (7%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEF-NSQNVANVAGAF 332
            + +L   +AQ +SNIAWA +    ++ +    +++AE A  K G F ++ N+A    A 
Sbjct: 25  GLDSLDSFTAQNVSNIAWAFAT--AKIYHSHLFEKLAEAAARK-GRFTDTTNIATFLWAC 81

Query: 333 ASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDN 385
           A++ ++   LFS  A   S  +  F +Q+++ V WA    Y  A+ + E L N
Sbjct: 82  ATVGYTIERLFSGFALIISSKLDEFSDQQISNVSWA----YSVANVMSEGLFN 130


>gi|255084111|ref|XP_002508630.1| predicted protein [Micromonas sp. RCC299]
 gi|226523907|gb|ACO69888.1| predicted protein [Micromonas sp. RCC299]
          Length = 1128

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 53/189 (28%), Positives = 83/189 (43%), Gaps = 19/189 (10%)

Query: 282 SAQGISNIAWALSKIGGELLYLSE--MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
           +AQG++N  W+ +K G    +L E      A     K+ +FNSQ++AN A AFA   H  
Sbjct: 299 NAQGLANTVWSFAKAG----HLDEGLFKGFASQVRRKLKDFNSQDLANTAWAFAKACHPD 354

Query: 340 PDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDA--------- 390
             LF+ ++      +  F  Q+L    WAFA L      L  ++   F+D+         
Sbjct: 355 ESLFASISGACVACLDDFNAQDLVNTAWAFAKLGHFDQSLFAAVARRFRDSGAMNDDQLG 414

Query: 391 TQFTCCLNKALSNCNENGGVKSSGDA----DSEGSLSSPVLSFNRDQLGNIAWSYAVLGQ 446
            QF   +  A S  +E G ++ +       D   +  + V  F    L N+AW++A   Q
Sbjct: 415 AQFIANVAWAFSKASEAGKLEQATSEELFRDLATAAEASVADFTAADLANVAWAFANANQ 474

Query: 447 MDRIFFSDI 455
           MD   F  +
Sbjct: 475 MDPTLFQSL 483



 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 51/206 (24%), Positives = 87/206 (42%), Gaps = 38/206 (18%)

Query: 280 ECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
           +C+AQ ++N++WA +K       ++  D +++  L K    NSQ + N+A AFA+   + 
Sbjct: 147 DCNAQELANVSWAFAK-ADHCADVALFDALSKATLAKASACNSQELTNLAWAFATAGRTQ 205

Query: 340 PD-LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLN 398
            + LF+ LAK     + +F  Q L+   WAFA +      L +++  A +          
Sbjct: 206 DEALFASLAKAVEHTLASFTSQGLSNTAWAFAKVGHLEATLFKAISLAAR---------- 255

Query: 399 KALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKT 458
                                    S + +FN     N AW++A LGQ D   F+ + K 
Sbjct: 256 -------------------------SKLKTFNAQDFANTAWAFAKLGQFDGELFTALAKD 290

Query: 459 ISRFEEQRISEQYREDIM-FASQVHL 483
            +R  E   ++     +  FA   HL
Sbjct: 291 AARHGEGHNAQGLANTVWSFAKAGHL 316



 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 56/185 (30%), Positives = 81/185 (43%), Gaps = 19/185 (10%)

Query: 281 CSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH 337
           C++Q ++N+AWA +  G    E L+ S    +A+     +  F SQ ++N A AFA + H
Sbjct: 186 CNSQELTNLAWAFATAGRTQDEALFAS----LAKAVEHTLASFTSQGLSNTAWAFAKVGH 241

Query: 338 SAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCL 397
               LF  ++  A   + TF  Q+ A   WAFA L +    L  +L    KDA +     
Sbjct: 242 LEATLFKAISLAARSKLKTFNAQDFANTAWAFAKLGQFDGELFTALA---KDAARHGEGH 298

Query: 398 NKALSNCNENGGVKSSGDADSEG---SLSSPVL----SFNRDQLGNIAWSYAVLGQMDRI 450
           N A    N       +G  D EG     +S V      FN   L N AW++A     D  
Sbjct: 299 N-AQGLANTVWSFAKAGHLD-EGLFKGFASQVRRKLKDFNSQDLANTAWAFAKACHPDES 356

Query: 451 FFSDI 455
            F+ I
Sbjct: 357 LFASI 361



 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 54/96 (56%), Gaps = 5/96 (5%)

Query: 280 ECSAQGISNIAWALSKI--GGELLYLS--EMDR-VAEVALTKVGEFNSQNVANVAGAFAS 334
           +  AQ I+N+AWA SK    G+L   +  E+ R +A  A   V +F + ++ANVA AFA+
Sbjct: 412 QLGAQFIANVAWAFSKASEAGKLEQATSEELFRDLATAAEASVADFTAADLANVAWAFAN 471

Query: 335 MQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
                P LF  LA RA + +  F ++EL    WAFA
Sbjct: 472 ANQMDPTLFQSLANRAENFLDDFNDEELDNAEWAFA 507



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 59/120 (49%), Gaps = 9/120 (7%)

Query: 275 MTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAF 332
           +  L + +AQ + N AWA +K+G   + L+ +   R  +       +  +Q +ANVA AF
Sbjct: 366 VACLDDFNAQDLVNTAWAFAKLGHFDQSLFAAVARRFRDSGAMNDDQLGAQFIANVAWAF 425

Query: 333 ASM-------QHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDN 385
           +         Q ++ +LF +LA  A   V  F   +LA V WAFA+  +    L +SL N
Sbjct: 426 SKASEAGKLEQATSEELFRDLATAAEASVADFTAADLANVAWAFANANQMDPTLFQSLAN 485


>gi|384244813|gb|EIE18310.1| hypothetical protein COCSUDRAFT_60280 [Coccomyxa subellipsoidea
            C-169]
          Length = 1075

 Score = 72.0 bits (175), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 95/391 (24%), Positives = 161/391 (41%), Gaps = 67/391 (17%)

Query: 275  MTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFAS 334
            M  + + + Q   N AWA++K+G + L    M+ + + A  +   F+ Q ++N+  A A+
Sbjct: 695  MCRMAQATPQHFGNAAWAMAKLGHDPLQGRFMNALIKQAFPQRSRFHRQELSNILWALAT 754

Query: 335  MQHSAP--------DLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNA 386
            +QH  P        D F+ LA           E+ LA + WA A L    +PL   L NA
Sbjct: 755  LQHELPENILRDVSDEFARLALAQLGSAEPGWERHLANMAWACARLR--VNPLGGGLLNA 812

Query: 387  -----FKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSL--------SSPVLSFNRDQ 433
                   +   F+    + ++N     G+       +   L        S+   +    +
Sbjct: 813  ACAELVTNPGNFSV---QNMANIVLAAGILQHPFPQAAVDLVLGELQQRSAGSRALPHQE 869

Query: 434  LGNIAWSYAVLGQMDRIFFSDIWKTIS------RFEEQRISEQYREDIMFASQVHLVNQC 487
              NI W  A L Q+ R     I   ++      +F +   ++  + D+M    V  + Q 
Sbjct: 870  ACNILWGLAALDQLTRAQLEHIAGQLAAAAAADKFTKAEANQLRQADLM----VRAMEQS 925

Query: 488  LKLEHPH-LQLALSSVLE-EKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV 545
               E P  L  AL  + + ++I S           TS  QK+V+  L   G+    E  +
Sbjct: 926  GGQEMPSCLPPALQQLADGDQIIS-----------TSRLQKDVSETLSELGVPHTVEGRI 974

Query: 546  DGYTVDAVLVD--------KKVAFEIDGPTHFSRNTGVP---LGHTMLKRRYIAAAGWNV 594
               +     VD          +A E+DGP+HF+     P   LGHT+L+ R + A G  V
Sbjct: 975  SHPSFGPATVDILIEVPGQPPMALEVDGPSHFA--ALAPHQNLGHTVLRNRLLEARGAKV 1032

Query: 595  VSLSHQ----EWEELQGSFE-QLDYLRVILK 620
            V +  +     W ++QG  + +++YL  IL+
Sbjct: 1033 VQIPFRIEGKRWADIQGDMDSRIEYLTGILE 1063



 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 56/191 (29%), Positives = 87/191 (45%), Gaps = 26/191 (13%)

Query: 270 LVAIAMTALPECSAQGISNIAWALSKIG----GELLYLSEMDRVAEVALTKVGEFNSQNV 325
           + A+A   LP+ S Q I+N+A+ L+ +      ELL       VAE AL ++   +S  V
Sbjct: 575 VFAVAPKVLPDASFQNIANLAYGLAILNHSAPPELLTA-----VAEAALLRMPSASSHGV 629

Query: 326 ANVAGAFASMQHS--APDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESL 383
           +N+  A+A M  S     LF      A + +  F  Q LA  LW+ A +   A P  E+L
Sbjct: 630 SNLLWAYAKMGTSPLGGQLFRSALAHARENLDKFSVQHLANTLWSLAVVQHEASP--EAL 687

Query: 384 DNAFKDATQFTCCLNKALSN--CNENGGVKSSGDADSEGSLSSPVLS--------FNRDQ 433
           D+ F +A  F C + +A      N    +   G    +G   + ++         F+R +
Sbjct: 688 DS-FAEA--FMCRMAQATPQHFGNAAWAMAKLGHDPLQGRFMNALIKQAFPQRSRFHRQE 744

Query: 434 LGNIAWSYAVL 444
           L NI W+ A L
Sbjct: 745 LSNILWALATL 755



 Score = 45.1 bits (105), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 38/127 (29%), Positives = 57/127 (44%), Gaps = 14/127 (11%)

Query: 269 MLVAIAMTAL---PECSAQGISNIAWALSKIG----GELLYLSEMDRVAEVALTKVGEFN 321
           +L A+A  AL   P  S+ G+SN+ WA +K+G    G  L+ S +    E     + +F+
Sbjct: 609 LLTAVAEAALLRMPSASSHGVSNLLWAYAKMGTSPLGGQLFRSALAHARE----NLDKFS 664

Query: 322 SQNVANVAGAFASMQHSA-PDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLL 380
            Q++AN   + A +QH A P+     A+     +     Q      WA A L    DPL 
Sbjct: 665 VQHLANTLWSLAVVQHEASPEALDSFAEAFMCRMAQATPQHFGNAAWAMAKLGH--DPLQ 722

Query: 381 ESLDNAF 387
               NA 
Sbjct: 723 GRFMNAL 729


>gi|384245914|gb|EIE19406.1| hypothetical protein COCSUDRAFT_48936 [Coccomyxa subellipsoidea
           C-169]
          Length = 516

 Score = 72.0 bits (175), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 82/332 (24%), Positives = 141/332 (42%), Gaps = 18/332 (5%)

Query: 302 YLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP---DLFSELAKRASDIVHTFQ 358
           Y   +D +    L    + ++  + ++  A A  +H +     L   +A  A D+V TF 
Sbjct: 195 YTMLLDAIVGQVLRSFKDLDASGLVSLTHALAETEHDSEGTGKLLKAIAAGALDLVPTFS 254

Query: 359 EQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADS 418
             +LA +L +F+ L    +P+   +    + A        +  S+      V    + + 
Sbjct: 255 PGQLASLLASFSHLRHYDEPMYRVISR--QAAPTVAALEPQQRSDLLHALAVVGHDEPEL 312

Query: 419 EGSLSSPVL----SFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYRED 474
             +L   +L      +   L ++ WS AVL Q+    F    +  +R E+  +     E+
Sbjct: 313 VAALRDHLLEDAGQLSGCALCDVLWSLAVLDQLSPDAFR---RMCARLEQLPLGAFEPEN 369

Query: 475 IMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVS 534
                QV  + Q    + P L + L + +    ASA + + F +   +  Q+ + R L  
Sbjct: 370 FQQLYQVQRMVQAAS-QDP-LTVQLPTWIWAYAASAWQDRLFAESNFTPLQQSICRTLAD 427

Query: 535 TGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRN-TGVPLGHTMLKRRYIAAAGWN 593
            G+ W  E  +   T  A+L+  KVA   +GPT +S +    PLG T+  RR +   GW 
Sbjct: 428 LGV-WHEEKFLQNMT-SAILLRDKVAIHPEGPTLYSSSWPRRPLGETLAVRRTLTRHGWT 485

Query: 594 VVSLSHQEWEELQGSFEQLDYLRVILKDYIGG 625
           VV L+  EW  L  S ++  YLR +L D   G
Sbjct: 486 VVPLAKHEWMAL-ASHKRAAYLRKLLDDAGAG 516


>gi|397575811|gb|EJK49902.1| hypothetical protein THAOC_31172, partial [Thalassiosira oceanica]
          Length = 363

 Score = 72.0 bits (175), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 90/353 (25%), Positives = 137/353 (38%), Gaps = 50/353 (14%)

Query: 277 ALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQ 336
           +L   S+Q +SN AWA +  G     L +        L  +  F  QN++N A AFA+  
Sbjct: 49  SLDSFSSQALSNTAWAFAAAGVSHPVLLKKIGNHIAGLDSLNSFKPQNLSNTAWAFATAG 108

Query: 337 HSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQF- 393
            S P LF ++    + +  + +F+ QEL+ V WA+A+       L E          +F 
Sbjct: 109 ASHPTLFKKIGDHVARLGSLDSFKPQELSNVAWAYATARRFDLGLFEKFTEVSARKGEFL 168

Query: 394 -TCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFF 452
            T  +   L  C   G             + S +   N   L NIAW+Y+V     +  F
Sbjct: 169 ETQHIANFLWACATVGHTDERLFGAFAPVIGSKLDECNEQVLANIAWAYSV-ANAPQDLF 227

Query: 453 SDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGK 512
           S+ + +     E+  S +                         QLA     +        
Sbjct: 228 SEGYVSAFALNEKEFSGE-------------------------QLAQLHQWQLWQQELES 262

Query: 513 TKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLV---DKKVAFEIDGPTHF 569
                Q      ++EV  LL S            GY +DA++     +KVA E+DGP HF
Sbjct: 263 GIELPQAAGFELEEEV--LLGS------------GYRIDALVKVGDGRKVAVEVDGPFHF 308

Query: 570 SRNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
                 P G T+LK R +       VVS+ + EW++L  S  +  YLR  L +
Sbjct: 309 IDRR--PAGRTILKHRQVVRLDRIKVVSVPYWEWDKLMSSETKQHYLRAKLSN 359


>gi|308809477|ref|XP_003082048.1| Kynurenine 3-monooxygenase and related flavoprotein monooxygenases
           (ISS) [Ostreococcus tauri]
 gi|116060515|emb|CAL55851.1| Kynurenine 3-monooxygenase and related flavoprotein monooxygenases
           (ISS) [Ostreococcus tauri]
          Length = 1077

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 61/221 (27%), Positives = 97/221 (43%), Gaps = 39/221 (17%)

Query: 267 MSMLVAIAMTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQN 324
            + L A A   L   S QG++N  WA +K G   + L+ +    +     T   +FNSQ+
Sbjct: 339 FTTLAAHADRHLSTLSTQGLTNAVWAFAKAGHLDDALFTAFAKSIERRMSTGASDFNSQD 398

Query: 325 VANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPL----- 379
           +AN A AFA   H   +LF+ LA+ A   +  F  Q+L    WAFA L +  + L     
Sbjct: 399 MANTAWAFAKACHLDDNLFTALARLAETCLDDFNTQDLVNTTWAFAKLGKYDEKLFIAAR 458

Query: 380 -------LESLDN--------AFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSS 424
                  L+ LD         +F  A+Q    L  AL+   E   V++            
Sbjct: 459 KSILNNRLDDLDAPNTANIAWSFDKASQLDKRLFDALARTAE---VRAD----------- 504

Query: 425 PVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQ 465
               F+   L N+AW++A  GQ++   F+ + +++ R  E+
Sbjct: 505 ---EFSAVDLANVAWTFANTGQVNDNLFTALARSVERLMEE 542



 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 64/263 (24%), Positives = 110/263 (41%), Gaps = 49/263 (18%)

Query: 283 AQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
           A+ ++N+A   +K G      + M+ +A     ++   N+Q +AN+A AFA  +H+   L
Sbjct: 168 ARELANVAHGAAKCGRGSTDATLMETLARAIEGELERCNAQELANIAWAFAKAEHADERL 227

Query: 343 FSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALS 402
           F  L K AS     F  QEL  + WAFA++                   Q    L KALS
Sbjct: 228 FLALEKMASTKAEQFNPQELTNMTWAFATV------------------GQGNARLFKALS 269

Query: 403 NCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRF 462
            C E    + + D  ++G             L N AW++A  G +D    + +++ +S+ 
Sbjct: 270 RCVE----RRAEDFSTQG-------------LSNTAWAFAKSGYVD----AGLFRALSQS 308

Query: 463 EEQRISEQYREDI-----MFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASA----GKT 513
            +QR+     +D       FA       +       H    LS++  + + +A     K 
Sbjct: 309 AQQRLDGFNAQDFSNLVWAFAKASQYDAKLFTTLAAHADRHLSTLSTQGLTNAVWAFAKA 368

Query: 514 KRFNQKVTSSFQKEVARLLVSTG 536
              +  + ++F K + R + STG
Sbjct: 369 GHLDDALFTAFAKSIERRM-STG 390



 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 51/188 (27%), Positives = 81/188 (43%), Gaps = 16/188 (8%)

Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           S QG+SN AWA +K G   +       +++ A  ++  FN+Q+ +N+  AFA        
Sbjct: 280 STQGLSNTAWAFAKSG--YVDAGLFRALSQSAQQRLDGFNAQDFSNLVWAFAKASQYDAK 337

Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFK-----DATQFTC- 395
           LF+ LA  A   + T   Q L   +WAFA      D L  +   + +      A+ F   
Sbjct: 338 LFTTLAAHADRHLSTLSTQGLTNAVWAFAKAGHLDDALFTAFAKSIERRMSTGASDFNSQ 397

Query: 396 -CLNKALS---NCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIF 451
              N A +    C+ +  + ++    +E  L      FN   L N  W++A LG+ D   
Sbjct: 398 DMANTAWAFAKACHLDDNLFTALARLAETCLD----DFNTQDLVNTTWAFAKLGKYDEKL 453

Query: 452 FSDIWKTI 459
           F    K+I
Sbjct: 454 FIAARKSI 461



 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 48/93 (51%), Gaps = 2/93 (2%)

Query: 278 LPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH 337
           L +  A   +NIAW+  K     L     D +A  A  +  EF++ ++ANVA  FA+   
Sbjct: 466 LDDLDAPNTANIAWSFDK--ASQLDKRLFDALARTAEVRADEFSAVDLANVAWTFANTGQ 523

Query: 338 SAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
              +LF+ LA+    ++  F ++EL  + WAFA
Sbjct: 524 VNDNLFTALARSVERLMEEFSDEELDNLEWAFA 556


>gi|307108730|gb|EFN56969.1| hypothetical protein CHLNCDRAFT_143558 [Chlorella variabilis]
          Length = 1244

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 77/329 (23%), Positives = 130/329 (39%), Gaps = 53/329 (16%)

Query: 304  SEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELA 363
            + +D  A   +      +   V  +  A+A+M+H  P LF  L +RA D+  +   + +A
Sbjct: 941  ATLDDAAARCIPLAPRMSGGEVGTLMWAYATMRHVHPGLFKALLERADDLAGSLTWRGIA 1000

Query: 364  QVLWAFASLYE-PADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSL 422
             V+WA A   + P  PL + L   +                                  L
Sbjct: 1001 IVMWACAVTRQAPPRPLADRLVERYM--------------------------------PL 1028

Query: 423  SSPVLSFNRD--QLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQ 480
              P ++   D   L N+AW   V   +    F+ +   +   +   +     + +  A+ 
Sbjct: 1029 FHPRMAQGVDLHSLANVAWGLTVFDYLTPDRFAQLTGMVPPHDAAAL-----DSVNTAAW 1083

Query: 481  VHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQ--KEVARLLVSTGLN 538
              L    L LE    Q   + +    +  A K  +     TS  Q   +VA +L S  + 
Sbjct: 1084 CQLFQCALYLEAKTGQHYSAFLPPHILPYAEKHWQARDTTTSRLQARNKVADVLHSLEVP 1143

Query: 539  WIREYA--VDGYTVDAVLVDK---KVAFEIDGPTHFSRNTG-VPLGHTMLKRRYIAAAGW 592
            +  EY+   + + +D  +      ++A E+DGP HFS N   +PL  T ++ + +A  GW
Sbjct: 1144 FAEEYSPRANFFGIDIAIQGSNGVRLAVEVDGPQHFSSNPPHMPLASTYMRNKLLAMHGW 1203

Query: 593  NVVSLSHQEWEELQGSFEQ-----LDYLR 616
             VVS+   EW  L G  E+     +DYLR
Sbjct: 1204 EVVSIPFNEWARLAGLQEKQARLAVDYLR 1232



 Score = 40.4 bits (93), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 27/118 (22%), Positives = 49/118 (41%), Gaps = 6/118 (5%)

Query: 264 QREMSMLVAIAMTALPECSAQGISNIAWALSKIGGEL----LYLSEMDRVAEVALTKVGE 319
           Q  M  L   A+  LPE     + ++ W+L K+G +L    ++   +  V   A  ++ +
Sbjct: 360 QAVMDHLSLAALAFLPEVEHTHLGSLVWSLGKLGTKLGAARIHTPVLHAVVATAWRRLHD 419

Query: 320 FNSQNVANVAGAFASMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEP 375
                + N    F  +  H   D     A R  +++     Q++   +W+FA L Y P
Sbjct: 420 LTPDALCNTLYGFGLLNFHPGSDFLDAAAARFKELLPYMSAQQVGNCVWSFARLEYSP 477


>gi|412994018|emb|CCO14529.1| predicted protein [Bathycoccus prasinos]
          Length = 1083

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 53/186 (28%), Positives = 74/186 (39%), Gaps = 41/186 (22%)

Query: 278 LPECSAQGISNIAWALSKIGGELLYLSE--MDRVAEVALTKVGEFNSQNVANVAGAFASM 335
           L EC+ Q I+NIAWA +K G    Y        +AE+A  ++  FNSQ + NV  AFA+ 
Sbjct: 201 LAECNGQEIANIAWAFAKSG----YFDPGMFANLAEMAEKQMDRFNSQEITNVFWAFATA 256

Query: 336 QHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTC 395
           +     LF  LAK     +H F  Q L+   WA + +              + DAT F  
Sbjct: 257 ECDNAKLFKALAKAIDGQLHGFNSQGLSNTAWALSKI-------------GYVDATLFRT 303

Query: 396 CLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDI 455
               A  N +                       FN     N+ W++A  GQ D   F+ +
Sbjct: 304 IAQTAQKNMDR----------------------FNAQDFSNLCWAFAKAGQYDAELFTTL 341

Query: 456 WKTISR 461
            K   R
Sbjct: 342 AKNAER 347



 Score = 65.9 bits (159), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 62/250 (24%), Positives = 106/250 (42%), Gaps = 35/250 (14%)

Query: 238 ATALHRIA----KNMEKVSMMTTHRL--AFTR--QREMSMLVAIAMTA---LPECSAQGI 286
           AT    IA    KNM++ +      L  AF +  Q +  +   +A  A   +   +AQG+
Sbjct: 298 ATLFRTIAQTAQKNMDRFNAQDFSNLCWAFAKAGQYDAELFTTLAKNAERHMGNLNAQGL 357

Query: 287 SNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
           SN  W+ +K G    EL      +   ++      +FN+Q++AN+A A+    H    LF
Sbjct: 358 SNSVWSFAKAGHLNAELFTTFGKNIERKMFANNGTDFNAQDIANIAWAYGKACHLDDALF 417

Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSN 403
           + LA+ A   +H F  Q++  + W+F+ L      LLE++             L   L +
Sbjct: 418 TVLARMAEKYLHDFNTQDIVNLTWSFSKLGRFDVELLEAVK---------VSLLKSRLDD 468

Query: 404 CNENGGVKSSGDADSEGSLSSPVLS------------FNRDQLGNIAWSYAVLGQMDRIF 451
            +       +   D  G L   ++S            F    + N+AW++A  G++D   
Sbjct: 469 LDAPNIANLAWTYDKAGKLDDNLVSSLARAAVKRVNEFTATDITNVAWTFANAGKVDDEL 528

Query: 452 FSDIWKTISR 461
           FS + K + R
Sbjct: 529 FSSMAKVVER 538



 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 53/195 (27%), Positives = 84/195 (43%), Gaps = 36/195 (18%)

Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           ++QG+SN AWALSKIG   +  +    +A+ A   +  FN+Q+ +N+  AFA       +
Sbjct: 279 NSQGLSNTAWALSKIG--YVDATLFRTIAQTAQKNMDRFNAQDFSNLCWAFAKAGQYDAE 336

Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
           LF+ LAK A   +     Q L+  +W+FA     A  L   L   F    +      K  
Sbjct: 337 LFTTLAKNAERHMGNLNAQGLSNSVWSFAK----AGHLNAELFTTFGKNIE-----RKMF 387

Query: 402 SNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISR 461
           +N   NG                    FN   + NIAW+Y     +D   F+     ++R
Sbjct: 388 AN---NG------------------TDFNAQDIANIAWAYGKACHLDDALFT----VLAR 422

Query: 462 FEEQRISEQYREDIM 476
             E+ + +   +DI+
Sbjct: 423 MAEKYLHDFNTQDIV 437



 Score = 52.4 bits (124), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 53/99 (53%), Gaps = 2/99 (2%)

Query: 271 VAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAG 330
           V++  + L +  A  I+N+AW   K G   L  + +  +A  A+ +V EF + ++ NVA 
Sbjct: 459 VSLLKSRLDDLDAPNIANLAWTYDKAGK--LDDNLVSSLARAAVKRVNEFTATDITNVAW 516

Query: 331 AFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAF 369
            FA+      +LFS +AK    I+  F E++L  + WAF
Sbjct: 517 TFANAGKVDDELFSSMAKVVERIMDDFGEEDLDNLEWAF 555


>gi|145355912|ref|XP_001422190.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582430|gb|ABP00507.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 967

 Score = 70.1 bits (170), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 89/357 (24%), Positives = 153/357 (42%), Gaps = 57/357 (15%)

Query: 287 SNIAWALSKI----GGELLYLSEMDRVAEVALTKVGEFNS---QNVANVAGAFASM---- 335
           SN+ W+ + +    G E+L      +VAE+ L +VG+ +      V+N   A+A+     
Sbjct: 433 SNLLWSYASLRFNPGNEVL-----TQVAELYL-RVGQHDEVALTQVSNTLWAWANFGWLP 486

Query: 336 -QHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESL---------- 383
              S  +   ++A +        Q Q LA +LW+ A+L + P D  L++           
Sbjct: 487 EDPSIVECVLQVAIKHFKSDPDLQTQSLANILWSLATLRFVPGDEFLQAFRERALIELRE 546

Query: 384 DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
           D  F D  Q  C    A      N G +   +  S+  L + V +F    + N   ++A 
Sbjct: 547 DERFSD--QGLCNTVWAYGQLGVNPGTELMSEIASQ--LGARVTNFPTQGVTNSILAFAT 602

Query: 444 LGQMDRIFFSDIWKTISRFEEQRISEQYREDI--MFASQVHLVNQCLKLEHPHLQLALSS 501
           LG     F+ D W  +  +  + +   Y   I  +  +Q    N   +   P+  L    
Sbjct: 603 LG-----FWPDEW-VVDNYRAKIVEMYYSTTISDIDLTQFFQANYLFEKCSPYGPLVTDP 656

Query: 502 VLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DG-YTVDAVLVDKKV 559
            + E + SA K +  ++ V S F +EV+  L + G+    EY   DG +++D  L  KK+
Sbjct: 657 QMIEDMLSAWK-RGSSKVVISQFHREVSDTLTNMGVPHEIEYITEDGLFSLDIALKGKKL 715

Query: 560 AFEIDGPTHFSRNT-----------GVPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL 605
           A E+DGP+HF+RN            G   G   ++  Y+   GW  V +   +W+++
Sbjct: 716 AIEVDGPSHFARNIQNRRMSGKRPDGT--GTYNIRYHYLDTNGWTTVFIPWYDWKQV 770



 Score = 42.4 bits (98), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 76/295 (25%), Positives = 118/295 (40%), Gaps = 36/295 (12%)

Query: 274 AMTALPEC-SAQGISNIAWALSKIGGELLYLSE-----MDRVAEVALTKVGEFNSQNVAN 327
           ++ A+P   S+Q +SN  WA++ + GE   L       ++ +      K   F  Q +AN
Sbjct: 337 SIKAVPNMWSSQSVSNTLWAIATLDGEPHKLRARHGDYLNTLCMYVERKANAFVCQGLAN 396

Query: 328 VAGAFASMQHSAPDLFSELAK-RASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDN 385
              A A+++++      E A  R S +       E + +LW++ASL + P + +L  +  
Sbjct: 397 TLWALATLEYTPSMKMLEAATARWSALATDVYISECSNLLWSYASLRFNPGNEVLTQVAE 456

Query: 386 AFKDATQFTCCLNKALSN---CNENGGVKSSGDADSEGSLSSPVLSFNRD------QLGN 436
            +    Q        +SN      N G      +  E  L   +  F  D       L N
Sbjct: 457 LYLRVGQHDEVALTQVSNTLWAWANFGWLPEDPSIVECVLQVAIKHFKSDPDLQTQSLAN 516

Query: 437 IAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQ 496
           I WS A L    R    D  + +  F E+ + E  RED  F+ Q  L N      +  L 
Sbjct: 517 ILWSLATL----RFVPGD--EFLQAFRERALIE-LREDERFSDQ-GLCNTVWA--YGQLG 566

Query: 497 LALSSVLEEKIAS---AGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGY 548
           +   + L  +IAS   A  T    Q VT+S       L  +T   W  E+ VD Y
Sbjct: 567 VNPGTELMSEIASQLGARVTNFPTQGVTNSI------LAFATLGFWPDEWVVDNY 615



 Score = 38.9 bits (89), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 26/97 (26%), Positives = 47/97 (48%), Gaps = 7/97 (7%)

Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQ-HSAP 340
           + QGISN  WA + + G  L    + + ++    ++ +F S   +NV  A A+M+ H  P
Sbjct: 265 APQGISNSLWAFATL-GYTLKPETIAKFSQAIRRQLKDFKSMEFSNVVWALATMKTHLDP 323

Query: 341 -----DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
                +L  E+      + + +  Q ++  LWA A+L
Sbjct: 324 LEVFDELLDEMHASIKAVPNMWSSQSVSNTLWAIATL 360


>gi|428165102|gb|EKX34106.1| hypothetical protein GUITHDRAFT_147455 [Guillardia theta CCMP2712]
          Length = 1225

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 96/395 (24%), Positives = 158/395 (40%), Gaps = 78/395 (19%)

Query: 260  AFTRQREMSMLVAIAMTALPE--CSAQGISNIAWALSKIGGE-LLYLSEMDRVAEVALTK 316
            A+T  + +   +A  +T L E   SAQG++ I  A +K+G +     + + RVA+    +
Sbjct: 713  AYTSLKTLFRRLARIVTGLSEQQFSAQGVALIVNAFAKLGMQDSCMFAHLSRVAQ----Q 768

Query: 317  VGEFN------SQNVANVAGAFASMQHSAPDLFSELAKRASDI-VHTFQEQELAQVLWAF 369
            +G+ N       Q+V N+  AFA       +LF  ++     +  H F+  ++  +L A+
Sbjct: 769  MGQRNFEIPCSPQDVVNIVNAFAKAHVHDAELFGHMSLLLQAMSAHQFEASKIGILLNAY 828

Query: 370  ASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVL-- 427
            A L     PLL  L     D +  +     A++N      V +  D++  G +++ +L  
Sbjct: 829  AQLRIRDLPLLRRLSKVAMDMSP-SAFDAHAIANIFHALAVLNVEDSELLGHVATTLLRS 887

Query: 428  --------SFNRDQLGNIAWSYAVL----GQMDRIFFSDIWKTISRFEEQRISEQYREDI 475
                     FN   L NIAWS AVL      ++R   S     IS  +   +S+ ++   
Sbjct: 888  RQTMMQAKDFNAQALSNIAWSIAVLKISDPTLNRWICSSCLSQISSMDGNALSQLHQ--- 944

Query: 476  MFASQVHLVNQCLKLEHPHL-----------------QLALSSVLEEKIASAGKTKRFNQ 518
             +   + +     K E P L                 Q AL S    K+A    T     
Sbjct: 945  -YILAIEVEGLVPKKELPELDSLLQHRKRIEQAWHATQRALLS--SSKLAGMQGTMTLTD 1001

Query: 519  K------------VTSSFQKEVARLLVSTGLNWIREYAVD--------------GYTVDA 552
            K              S  Q+ VA  L         +  VD               Y++D 
Sbjct: 1002 KGAAATSDVLLPDAMSGLQRNVADTLRVVWKELQEQRVVDQSWTLEEETIERVTSYSLDI 1061

Query: 553  VLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYI 587
             +     A E+DGP+HF+R + +PLG T++KRR +
Sbjct: 1062 SIAAASFAIEVDGPSHFARGSKIPLGRTLMKRRQL 1096


>gi|294887545|ref|XP_002772159.1| hypothetical protein Pmar_PMAR024842 [Perkinsus marinus ATCC 50983]
 gi|239876105|gb|EER03975.1| hypothetical protein Pmar_PMAR024842 [Perkinsus marinus ATCC 50983]
          Length = 1094

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 66/128 (51%), Gaps = 1/128 (0%)

Query: 496 QLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLV 555
           ++  SSV++ +I          + +    ++EV++     GL    E  V  Y++D +LV
Sbjct: 778 EMIFSSVIQLQIFDLWARLLAPKSIMEKMEREVSKFFTMVGLRHRNEVVVGPYSID-ILV 836

Query: 556 DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
            +  AFE+DGP HF R+T +    ++LK R + A G+ V+ + +QEW +     ++L Y+
Sbjct: 837 GESFAFEVDGPHHFYRDTSMRTASSLLKHRILEALGFTVIRVPYQEWSQCGTREKRLRYV 896

Query: 616 RVILKDYI 623
               K  I
Sbjct: 897 GSFWKQLI 904


>gi|224010429|ref|XP_002294172.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220970189|gb|EED88527.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 382

 Score = 69.3 bits (168), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 85/341 (24%), Positives = 138/341 (40%), Gaps = 78/341 (22%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTK--VGEFNSQNVANVAGAFASMQHSAPD 341
           Q  +NI WA +    E  +    ++VA    +   +  F  Q+ AN+  A+A+ + S P 
Sbjct: 74  QDYANIVWAYAT--AEASHPQLFEKVANHIESSRDLSSFIPQDYANIVWAYATAELSHPV 131

Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFAS-------LYEPADPLLESLDNAFKDATQFT 394
           LFS +A  A      F  Q++  +LWAFAS       LY    P      +A K  +Q+T
Sbjct: 132 LFSNVADSAIQRQSEFNSQDITNLLWAFASNGDIERNLYTKVAP------SAAKLTSQYT 185

Query: 395 CCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSD 454
           C                                     QL NIAW+YAV        F++
Sbjct: 186 C------------------------------------QQLTNIAWAYAVADVDAPTLFNE 209

Query: 455 IW-----KTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIAS 509
           I+     K +  F  + + + Y+  +  A +             H +  L  +L EK  +
Sbjct: 210 IFNEKCNKKMDAFSVESLMQLYQWHLWRAKE-------------HSEEGLPQMLHEKCYN 256

Query: 510 AGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV--DKKVAFEIDGP 566
              +   +    S+ Q +V   L + GL+   E  +  GY +DA++    +    E+DGP
Sbjct: 257 VFVSASAS---PSALQDDVVVELRAIGLHPEEEVLLQSGYRIDALVQVNGENFVIEVDGP 313

Query: 567 THFSRNTGVPLGHTMLKRRYIAAA-GWNVVSLSHQEWEELQ 606
           +HF        G T LK R ++   G  +VS+ + EW +L+
Sbjct: 314 SHFIGKIRDLKGSTKLKHRQVSTIDGIPIVSVPYWEWNKLR 354


>gi|397595468|gb|EJK56490.1| hypothetical protein THAOC_23613 [Thalassiosira oceanica]
          Length = 695

 Score = 68.6 bits (166), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 85/315 (26%), Positives = 130/315 (41%), Gaps = 67/315 (21%)

Query: 320 FNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPL 379
           F+ Q++AN+  ++A+ +   PDLF  L   A D    FQ QE+A +LWA A+L +    L
Sbjct: 431 FSVQSIANIIWSYATAREWCPDLFIGLISAAVDRRDEFQPQEMANLLWACATLGQTNADL 490

Query: 380 LESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAW 439
                NAF    +                                 +  F    L N AW
Sbjct: 491 F----NAFVPVVKMK-------------------------------IEDFTAQGLSNAAW 515

Query: 440 SYAVLGQMDRIFFSDIWK--TISRFEEQRISEQYREDIMFASQVHL--VNQCLKLEHPHL 495
           ++AV         +DI      +RF E  I  + R  +   SQ+H   + Q  ++    L
Sbjct: 516 AFAV---------ADIQNDDLNNRFLEAFIKNEDRFSVEGLSQLHQWQLWQIERVSPVQL 566

Query: 496 QLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGL-NWIREYAV--DGYTVDA 552
             +LS    +   S G          S  Q +V  +L      + + EY     GYT+DA
Sbjct: 567 PASLSERCRDAFVSQGTGY-------SKLQDQVVSVLSRMDFYDVLEEYRTRNTGYTLDA 619

Query: 553 VLV---DKKVAFEIDGPTHF--SRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQ 606
           ++      K+  EI+GP H+   RN     G T LK R +++     VVS+ H EWE+L 
Sbjct: 620 LVSLNDTVKIGIEINGPYHYIGGRNLN---GGTRLKLRQVSSIECVRVVSVPHYEWEQLD 676

Query: 607 GSFEQLDYLRVILKD 621
           G   + +YL   L++
Sbjct: 677 GDEGRREYLLSALRE 691



 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/129 (26%), Positives = 67/129 (51%), Gaps = 13/129 (10%)

Query: 267 MSMLVAIAMTALPECSAQGISNIAWALSKIG------GELLYLSEMDRVAEVALTKVGEF 320
            + + + A  ++ +  A+G+SN  ++ + IG      G   +L   + VA+  L ++  F
Sbjct: 219 FNFIASAAARSVHKFDARGLSNTIYSFALIGYPPNVQGSRPFL---EIVADECLHQLNHF 275

Query: 321 NSQNVANVAGAFASMQHSAPDLFSELAKRASDIV----HTFQEQELAQVLWAFASLYEPA 376
           N Q ++N+  ++A + HS P+LF  +A R  ++      TF  Q ++ +LW+F +L E  
Sbjct: 276 NMQELSNLVWSYAKLNHSHPELFGAVASRILELNPKADTTFNPQVISNILWSFTTLDEAN 335

Query: 377 DPLLESLDN 385
           + L   + N
Sbjct: 336 EDLFRYIFN 344


>gi|159473869|ref|XP_001695056.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158276435|gb|EDP02208.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 347

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/94 (36%), Positives = 55/94 (58%), Gaps = 1/94 (1%)

Query: 513 TKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRN 572
           T    ++V S +Q+++A  L +     + E    GY++D  L   ++A E DGPTH SR 
Sbjct: 226 TSGLRRRVQSGYQRQMANALTAMRHMHLLEDNSAGYSIDITLPALRIALEADGPTHTSRT 285

Query: 573 T-GVPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL 605
             G  LG T +KRR++   GW VV++++ EW++L
Sbjct: 286 PGGAMLGATAMKRRHLQRLGWQVVNVTYTEWDKL 319


>gi|307111480|gb|EFN59714.1| hypothetical protein CHLNCDRAFT_133279 [Chlorella variabilis]
          Length = 1273

 Score = 66.6 bits (161), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 51/187 (27%), Positives = 84/187 (44%), Gaps = 45/187 (24%)

Query: 280 ECSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASM- 335
           E ++Q ++N  WA + IG   G+ L    +   A VA+ K+ EF+ QN++N+  A+A + 
Sbjct: 451 EYNSQNLANSVWAYANIGVNPGDSL----LQDFARVAIAKMPEFSPQNISNLLWAYAKLG 506

Query: 336 -QHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFT 394
            QH+  +LF+E  + A+ I+HTF  Q +A + WA+A+L            +   DAT   
Sbjct: 507 VQHA--ELFAEAGRHAARIMHTFTPQSVANMAWAYATL------------DQCPDATLLH 552

Query: 395 CCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSD 454
             +  A     E                      F+   L N AW+ A L + +    S 
Sbjct: 553 ALVGHAARMLPE----------------------FSPQNLSNTAWALATLKECEPGLLSG 590

Query: 455 IWKTISR 461
           I   ++R
Sbjct: 591 ISMEVTR 597



 Score = 63.2 bits (152), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 74/304 (24%), Positives = 120/304 (39%), Gaps = 79/304 (25%)

Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQ----NVANVAGAFASMQH 337
           S Q ++N+ WAL+ +  +    S M  +AE  +T+    N Q    N++N+A A++ + H
Sbjct: 610 SRQHLANLVWALATLEFDPGKRSLMC-MAEALVTRADLCNPQEVQQNLSNLAWAYSKLAH 668

Query: 338 SAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCL 397
               L + +A RA  ++H    Q    + WA++SL                     T  L
Sbjct: 669 MDEALMTAIADRAESMIHDLSLQHCTNLTWAYSSL------------------KWTTPTL 710

Query: 398 NKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWK 457
             AL              A+S+  L+      N  QL N+ WS  +    DR  F     
Sbjct: 711 MPALV-------------AESKARLAD--TQLNVQQLCNLLWSLGISEACDREVFQAYML 755

Query: 458 TISRFEEQRISEQYREDIMFASQVHLVNQCLKL-EHPHLQLALSSVLEEKIASAGKTKRF 516
            ++   +Q              Q  +  + L L +H  +Q+                   
Sbjct: 756 MLAESPDQ--------------QWPIPGELLALAQHAWVQV------------------- 782

Query: 517 NQKVTSSFQKEVARLLVSTGLNWIREYAVDG--YTVDAVLVDKKVAFEIDGPTHFSRNTG 574
                S F  EV+R+L + G     E+  D   ++VD  L  +++A E+DGP HF+ NT 
Sbjct: 783 -----SEFHSEVSRMLSALGQPHTIEHLTDDHLFSVDIALPGERIALEVDGPHHFTANTF 837

Query: 575 VPLG 578
            PLG
Sbjct: 838 RPLG 841



 Score = 47.8 bits (112), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 70/294 (23%), Positives = 125/294 (42%), Gaps = 41/294 (13%)

Query: 197 INLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIA--KNMEKVSMM 254
           I+ NK I  A  AQE+  +   +   V             +ATA+H++A  +    +   
Sbjct: 259 ISCNKRITAATYAQEIFNIEHAVFDTV------------CLATAMHKLANLRGAPNLHAE 306

Query: 255 TTHRLAFTRQREM---SMLVAIA---MTALPECSAQGISNIAWALSKIG---GELLYLSE 305
                 F + +++     L  +A        E +AQ ++N+ W+ + +G   G+ +    
Sbjct: 307 IVQAPEFFKLKQLIRDEFLAEVAEEVKGKAREGNAQNVANMLWSFATLGYHPGDEV---- 362

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-LFSELAKRASDIVHTFQEQELAQ 364
           M  +A     K+ +F SQN++N   +FA ++    D L   LA  A   + TF  Q L+ 
Sbjct: 363 MHALAVAVQQKLADFTSQNMSNAVLSFAKLEFDPGDELLEGLAAEALRKIATFSPQALSN 422

Query: 365 VLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSN---CNENGGVKSSGDA---DS 418
            LW  + L   A  L+E +  A +   Q     ++ L+N      N GV + GD+   D 
Sbjct: 423 TLWGLSKLGINAPELMEGIGQAAR--FQLYEYNSQNLANSVWAYANIGV-NPGDSLLQDF 479

Query: 419 EGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISR----FEEQRIS 468
                + +  F+   + N+ W+YA LG      F++  +  +R    F  Q ++
Sbjct: 480 ARVAIAKMPEFSPQNISNLLWAYAKLGVQHAELFAEAGRHAARIMHTFTPQSVA 533



 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 50/217 (23%), Positives = 94/217 (43%), Gaps = 23/217 (10%)

Query: 267 MSMLVAIAMTALPECSAQGISNIAWALSKI----GGELLYLSEMDRVAEVALTKVGEFNS 322
           M  L       L + ++Q +SN   + +K+    G ELL     + +A  AL K+  F+ 
Sbjct: 363 MHALAVAVQQKLADFTSQNMSNAVLSFAKLEFDPGDELL-----EGLAAEALRKIATFSP 417

Query: 323 QNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLE 381
           Q ++N     + +  +AP+L   + + A   ++ +  Q LA  +WA+A++   P D LL+
Sbjct: 418 QALSNTLWGLSKLGINAPELMEGIGQAARFQLYEYNSQNLANSVWAYANIGVNPGDSLLQ 477

Query: 382 SLDN-AFKDATQFTCCLNKALSN---CNENGGVKSSGDADSEGSLSSPVL-SFNRDQLGN 436
                A     +F+    + +SN        GV+ +      G  ++ ++ +F    + N
Sbjct: 478 DFARVAIAKMPEFS---PQNISNLLWAYAKLGVQHAELFAEAGRHAARIMHTFTPQSVAN 534

Query: 437 IAWSYAVLGQMD-----RIFFSDIWKTISRFEEQRIS 468
           +AW+YA L Q               + +  F  Q +S
Sbjct: 535 MAWAYATLDQCPDATLLHALVGHAARMLPEFSPQNLS 571


>gi|302780627|ref|XP_002972088.1| hypothetical protein SELMODRAFT_412580 [Selaginella moellendorffii]
 gi|300160387|gb|EFJ27005.1| hypothetical protein SELMODRAFT_412580 [Selaginella moellendorffii]
          Length = 205

 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 32/63 (50%), Positives = 42/63 (66%), Gaps = 1/63 (1%)

Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQL-DYLR 616
           KV  E++ P+HF+RNTG  LGHT+LK R + AA W ++S S+ EWE LQG    L  Y R
Sbjct: 138 KVVIEVNRPSHFARNTGDLLGHTVLKHRLVEAAEWKIISASYAEWENLQGESGHLTSYKR 197

Query: 617 VIL 619
           + L
Sbjct: 198 LWL 200


>gi|397648138|gb|EJK78006.1| hypothetical protein THAOC_00119 [Thalassiosira oceanica]
          Length = 1158

 Score = 65.9 bits (159), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 76/302 (25%), Positives = 122/302 (40%), Gaps = 58/302 (19%)

Query: 284  QGISNIAWALSKIGGEL--LYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
            Q +SN  WA +  G     L+    D +A   L  +G F  Q+ +N A AFA+ +   P 
Sbjct: 748  QALSNTPWAFATAGASHPELFKKIGDHIA--VLDSLGSFKPQDFSNTAWAFATARVFHPR 805

Query: 342  LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
            LF +L   A      F +QE++  LWA A++        + L +AF              
Sbjct: 806  LFEKLTTEAVASKDHFDDQEVSNFLWACATVGH----TDQRLFSAFAPV----------- 850

Query: 402  SNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISR 461
                                ++S +  FN+  L NIAW+Y+V        F+  + +   
Sbjct: 851  --------------------IASRLGKFNKQHLANIAWAYSVANLPRHDLFNKGYVSALA 890

Query: 462  FEEQRISEQYREDIMFASQVHLVNQCLKLEHP-HLQLALSSVLEEKIASAGKTKRFNQKV 520
              E+  S +     + A          +LE    +  +L +       SAG ++      
Sbjct: 891  SNEKEFSVE-----LLAQLHQWQLWQQELESGIEVPQSLRAKCRNAFTSAGYSE------ 939

Query: 521  TSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHF--SRNTG 574
             S  Q +V   L + GL+   E  +  GY +DA++    ++KVA E+DGP+HF   R  G
Sbjct: 940  -SRLQNDVVDELKAAGLDLEEEVLLGSGYRIDALVKVGDERKVAVEVDGPSHFIDRRPVG 998

Query: 575  VP 576
             P
Sbjct: 999  KP 1000



 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 39/159 (24%), Positives = 66/159 (41%), Gaps = 40/159 (25%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQ 360
            D +A  A+  +  F+++ ++N+  +F  ++ + PD     LF+   + A  I+HTF+ Q
Sbjct: 536 FDSIASSAVGMLNGFDARCLSNLIYSFGLVERN-PDIGGETLFNVFGEAAGKILHTFKSQ 594

Query: 361 ELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEG 420
           +L+ +LWAF  +      L +                        + GGV S  D D   
Sbjct: 595 DLSNMLWAFVKVDAKNSRLFQ------------------------DTGGVISGMDLD--- 627

Query: 421 SLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTI 459
                  SF    L NI WS+A  G+ +   F  +   I
Sbjct: 628 -------SFQPQHLANILWSFAKSGKANPELFQALGNHI 659


>gi|428166881|gb|EKX35849.1| hypothetical protein GUITHDRAFT_79396, partial [Guillardia theta
           CCMP2712]
          Length = 124

 Score = 65.9 bits (159), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 42/124 (33%), Positives = 61/124 (49%), Gaps = 21/124 (16%)

Query: 518 QKVTSSFQKEVARLLVSTGLNWIREYA--VDGYTVD-------------------AVLVD 556
           Q  +S  QKEV  +L+S G     E+     GYT+D                   +    
Sbjct: 1   QLRSSKLQKEVMSVLLSIGFECEEEHQDPRTGYTIDIYCPPSSSSSSSSSSSSSSSSSSS 60

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
             VA E+DGP+HF   T    G T+LKRR++ A G+  +S+ + EW+ LQG+ EQ  ++R
Sbjct: 61  SPVAIEVDGPSHFLHGTREASGSTVLKRRHLEAVGYRFISIPYWEWDALQGAEEQEKFMR 120

Query: 617 VILK 620
             LK
Sbjct: 121 EKLK 124


>gi|196000024|ref|XP_002109880.1| hypothetical protein TRIADDRAFT_53223 [Trichoplax adhaerens]
 gi|190588004|gb|EDV28046.1| hypothetical protein TRIADDRAFT_53223 [Trichoplax adhaerens]
          Length = 639

 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 84/361 (23%), Positives = 142/361 (39%), Gaps = 83/361 (22%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASD--IVHTFQEQELA 363
            ++++   +  +   ++ +V+ +A A +S+++   DL   +    +D   ++ F  Q + 
Sbjct: 323 FEKISNYVIKNINNMSTYSVSQIARALSSLRYYNKDLADAIGLHLTDKGALYEFSIQSIG 382

Query: 364 QVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLS 423
            +L+ FA      +P L                L K +S       +KS+ +      + 
Sbjct: 383 DILYVFARWNHLPEPAL----------------LRKLISKIEY--YIKSTPNM-----II 419

Query: 424 SPVLSFNRDQLGNIAWSYAVLGQ-----MDRIFFSDIWKTISRFEEQRISEQYREDIMFA 478
            P+++          WS  +L       ++ +F   I   I  F    I  Q     MF 
Sbjct: 420 PPIVT--------SIWSLIILDTFPHRAINALFNEKIVSEIHSFGTGAIQVQ-----MFQ 466

Query: 479 SQVHLVNQCLKLEHPHLQL-ALSSVLEEKIASAGKTKRFNQKVTSSFQKEVAR----LLV 533
                ++   KLE P LQL  LS     +       K+F+ K  S FQ  V R    L  
Sbjct: 467 -----IDLAAKLERPELQLQGLSH--SHRNHFLKPLKKFSTK-GSVFQHNVQRTLEYLFD 518

Query: 534 STGLNWIREYAVDGYTVD-AVLVD----------------------KKVAFEIDGPTHFS 570
            +   W       GY+VD A++ D                      K++A E+DGP HF 
Sbjct: 519 GSHYYWKEFKTAYGYSVDLAIMTDLNNVLQEPKVNVLRSKNKPTHYKRIAIEVDGPYHFL 578

Query: 571 RNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIGGEGSSN 630
            N+   +G + +K R +   GW VV + + +WEEL    E+  Y    +K  I G+G  N
Sbjct: 579 HNSTKLIGESKMKHRQLRLLGWTVVQVPYFDWEELNTDDERKQY----MKRKIFGDGPMN 634

Query: 631 I 631
           I
Sbjct: 635 I 635


>gi|219125971|ref|XP_002183242.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405517|gb|EEC45460.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1123

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 101/461 (21%), Positives = 173/461 (37%), Gaps = 141/461 (30%)

Query: 277  ALPECSAQGISNIAWALSK-----------IGGELLYLSEMDR----------------- 308
             L E S QG+ N AWA ++           +GG  L  S   R                 
Sbjct: 690  GLTEFSPQGLGNTAWAFARQAQLSEEAANRLGGASLLPSSNGRLAIYTACYFDIGEELIH 749

Query: 309  -----VAEVALTK---VGEFNSQNVANVAGAFA--SMQHSA--PDLFSELAKRASDI--- 353
                 +AE  +TK   +  F  Q+++N A  FA   ++H+A       EL +R S     
Sbjct: 750  RLFAAIAEAGITKHVNLTSFKPQDLSNTAWTFAVLGLRHTAFMEVAMHELERRLSLFLKG 809

Query: 354  ----VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGG 409
                + TF+ QELA +LWA A+L    +  LE +    ++      C             
Sbjct: 810  ERTSITTFKGQELANLLWALATLNIRVENSLEIVTPYLQE-----VCF------------ 852

Query: 410  VKSSGDADSEGSLSSPVLS----FNRDQLGNIAWSYAVLGQ----MDRIFFSDIWKTISR 461
                     EG    PV +    F R +L N+AWS AV G+    + ++ ++ +      
Sbjct: 853  ---------EGRTGMPVQAIAQIFKRQELANVAWSCAVFGKYPTALMQLLYAGLIGLDKE 903

Query: 462  FEEQRISEQYREDIM----FASQVHLVNQCLKLEHPHLQLALS---------------SV 502
             + +++S  Y +  +      S +++     +     L L  +                +
Sbjct: 904  CDAEKLSNVYGDKGLQSQALMSLIYVQASMDRAGKSTLGLPPNFPDAWRQSTPSEDGQRM 963

Query: 503  LEEKIASAGKTKRFNQKVTSSFQK-----------EVARLLVSTGLNWIREYAVDGYTVD 551
             E  I  +  T +  + V+++F +            +  ++V  G+N+  +  +D  ++D
Sbjct: 964  TETNIELSLSTSKIQRDVSAAFNRIGFKHIEEHTISMQEMVVEYGVNFAPQ-QLDILSID 1022

Query: 552  AVLVDKKVAFEIDGPTHFSR-----------NTGVPLGH-----------------TMLK 583
               V +K+A E+DGP HF             +T  P G                  T LK
Sbjct: 1023 IANVPEKIAIEVDGPAHFINLIDNVDENDYGSTKAPNGKLEYQFQWTGDRQMMNGSTSLK 1082

Query: 584  RRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIG 624
             R + + GW V+ +   EW ++    EQ +Y R  L D +G
Sbjct: 1083 HRLLESLGWRVIHIPFWEWYQMGSDEEQGEYCRDAL-DTLG 1122



 Score = 40.4 bits (93), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 50/194 (25%), Positives = 91/194 (46%), Gaps = 15/194 (7%)

Query: 199 LNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHR 258
           LN+ +V  ++A EVL ++     ++ +  S   ++ +N +T++HR+ ++           
Sbjct: 141 LNQLLVACESASEVLTLLQNTKGSLTQKASGGTMNSVNFSTSIHRLCRHSLNQRDTRAAT 200

Query: 259 LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIG-----GELLYLSEMDRVAEVA 313
           LA  R   +    A AM  +P  S + +SNI WAL+K+        + +    D   + A
Sbjct: 201 LADPRFALLLASTAEAMVTMPFQSRE-LSNIGWALAKLKIVPPLTAMPFEQSDDEALKAA 259

Query: 314 LTKV--GEFNSQNVANVAGAFASMQHSA-PDLFSELAKRAS-DIVHT----FQEQELAQV 365
              V  G F +      +G  +    +A   L  ++  R S ++V T    F+ QE A +
Sbjct: 260 AQTVRDGVFKAAKERQESGTPSKAWITALSQLAGQILDRISQNVVSTQTDGFRLQEWANL 319

Query: 366 LWAFASLYEPADPL 379
           +WA+A+  E ADP+
Sbjct: 320 MWAWAT-AERADPV 332


>gi|397628210|gb|EJK68790.1| hypothetical protein THAOC_10004, partial [Thalassiosira oceanica]
          Length = 2539

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 80/354 (22%), Positives = 138/354 (38%), Gaps = 93/354 (26%)

Query: 320  FNSQNVANVAGAFASMQ--HSAPDLF----SELAKRASDIVHTFQEQELAQVLWAFASLY 373
            +++Q+++N   +FA++   HSA  LF    +E+  R  +    F+ QE++ +LW+FA++ 
Sbjct: 823  YSNQDLSNTVWSFATLGLLHSA--LFKSVENEVKSRLMNNRTKFRGQEISNLLWSFATVN 880

Query: 374  EPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQ 433
               DP       +F DA                  GV +  D   E SL+S  L   R +
Sbjct: 881  AQPDP-------SFIDAMSHYIA------------GVCTGRDGIREQSLTS--LFTQRQE 919

Query: 434  LGNIAWSYAVLGQMDR----IFFSDIWKTISRFEEQRISEQYREDIMFASQV---HLVNQ 486
            L N+AW  AV+GQ  +    I ++ +  T +  +  R    + +D +  S +   + V  
Sbjct: 920  LANLAWGCAVVGQYPKDLMNILYAGLLGTNNDPDHMR--RVFNDDGLEKSSIMTLYYVQI 977

Query: 487  CLKLEHPHLQLALSSVLEEKIASAGKTKRFNQK----------------VTSSFQKEVAR 530
               +E P L+LAL              +R   K                  S  Q+ V  
Sbjct: 978  AADIEAPELKLALPEGFPNGWGVMDGQQRTRSKDGDDLAQQSSSILLTLTVSKLQRHVGS 1037

Query: 531  LLVSTGLNWIREYAVDG--------------------YTVDAVLVDKKVAFEIDGPTHFS 570
               + G +   EY +D                      ++D   V+K++  E+DGP HF 
Sbjct: 1038 AFDAIGFDHELEYVIDTNQIRDELPNEIVLTQSPMEFLSIDLANVEKRIGVEVDGPGHFV 1097

Query: 571  RNTGVPL-------------------GHTMLKRRYIAAAGWNVVSLSHQEWEEL 605
                 P                    G T LK R ++   W+++ L + E+++L
Sbjct: 1098 HLLDKPPRRRESEIIILDDMGDNRFNGPTTLKHRLLSHLDWDIIHLPYWEFQKL 1151



 Score = 39.7 bits (91), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 53/225 (23%), Positives = 89/225 (39%), Gaps = 42/225 (18%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDRV----------------------AEVAL---TKVG 318
           Q +SN  WA++  G + LY    D                        A VAL    +  
Sbjct: 648 QEMSNSIWAMATAGFKPLYTRAFDTTLVPRNMRPTKKQLAEDTFGESYAAVALETMRRPH 707

Query: 319 EFNSQNVANVAGAFASMQHSAPDLFSELAK----RASDIVHTFQEQELAQVLWAFASLYE 374
           EF  Q + +V  +F+ +    P LF   A+    R    + +F  Q L  +LW++A   +
Sbjct: 708 EFKDQELKDVMWSFSRVGIRHPALFKSTAEHVIGREGRGLSSFSSQGLGNLLWSYAKQAQ 767

Query: 375 PADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSG---DADSEGSLSSPVLSFNR 431
            +  ++E+L +     T  T  L    ++C +NG     G   +A    + +  V S++ 
Sbjct: 768 LSLEVIEALGDDVNLVT--TGRLAVYETSCLDNGEANIKGLFVEAARAVASAGAVASYSN 825

Query: 432 DQLGNIAWSYAVLGQMDRIFFSDIWKTI--------SRFEEQRIS 468
             L N  WS+A LG +    F  +   +        ++F  Q IS
Sbjct: 826 QDLSNTVWSFATLGLLHSALFKSVENEVKSRLMNNRTKFRGQEIS 870


>gi|397596760|gb|EJK56844.1| hypothetical protein THAOC_23187 [Thalassiosira oceanica]
          Length = 1026

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 59/106 (55%), Gaps = 5/106 (4%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLY----LSEMDRVAEVALTKVGEFNSQNVANVA 329
           A+  LP   A+ I+N+  + +K     +Y     +  D +A+ AL+K  +   QN+AN+ 
Sbjct: 374 AVPILPTFDARNIANLVHSFAKAEVVPIYEPGKCTLFDMLADSALSKDHDMQPQNIANIL 433

Query: 330 GAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEP 375
            AFA M+H +P LF EL+  AS  +H F  Q+LA + W+  S Y P
Sbjct: 434 WAFAKMKHPSPKLFEELSTDASRRMHDFSAQQLATLAWSL-SKYPP 478



 Score = 62.8 bits (151), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 103/406 (25%), Positives = 164/406 (40%), Gaps = 86/406 (21%)

Query: 274  AMTALPECSAQGISNIAWALSKIGGELLYLSE----MDRVAEVALTKVGEFNSQNVANVA 329
            A   L E SA  + N++ +  K G     LS     MD +A   + +   F+ + +  +A
Sbjct: 642  AAYQLRELSALALFNLSVSYGKSG-----LSPNDEWMDLLAREIVRRPSSFSPKMIVGIA 696

Query: 330  GAFASMQHSAPDLFSELA-------------KRASDIVHTF------------------- 357
             A+++M +  P LF+ LA             K  + +V +F                   
Sbjct: 697  FAYSTMNYQKPRLFTFLAEQVKSQCQESLEPKELASLVWSFVNIGFLDRGLLAEIAEVLN 756

Query: 358  ------QEQELAQVLWAFASLYEPADPLLESLDNAFKDATQ-FTCCLNKALSNCNENGGV 410
                    Q LA V WA++   E    L + +  A K   + FT     A    N     
Sbjct: 757  GKWSELDTQSLANVAWAYSKAQEDRPALYKGISAAAKAGREGFT-----AQGVSNLLWAF 811

Query: 411  KSSGDADSE-----GSLSSPVL-SFNRDQLGNIAWSYAVLGQMD-RIFFSDIWKTISRFE 463
             ++G+ D +       +S+ +L  F    + N+AW+YAV    D  +F +D   + +   
Sbjct: 812  SAAGEVDDDLFEFFAPVSTSLLDEFQPQGIANLAWAYAVANVDDGSLFNADFIGSCTM-- 869

Query: 464  EQRISEQYRE-DIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAG-KTKRFNQKVT 521
                    RE D +   Q+HL N      H   +  L + + E   +A    K+  Q   
Sbjct: 870  ------NLREFDAVGLCQLHLWNM---WRHEARREGLPAGMAETCKNAFVHQKKIRQ--- 917

Query: 522  SSFQKEVARLLVSTGLNWIREYAVD-GYTVDAVLV--DKKVAFEIDGPTHF--SRNTGVP 576
            S  Q  V   L ++G++ I E  V+ GY +D +L    KK+  EIDGP HF   R  G  
Sbjct: 918  SKLQNTVVGHLRNSGMDVIEEVQVESGYLLDVLLTINGKKIGVEIDGPFHFVGRRQNGA- 976

Query: 577  LGHTMLKRRYIAAAG-WNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
               T+LKRR ++      ++SL + E   L    E   YL  +L+D
Sbjct: 977  ---TILKRRLVSNVDKIPIISLPYWELNGLDSDVEWASYLNRVLED 1019



 Score = 44.7 bits (104), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 35/151 (23%), Positives = 77/151 (50%), Gaps = 16/151 (10%)

Query: 244 IAKNMEKVS----MMTTHRLA---FTRQREM-SMLVAIAMTALPECSAQGISNIAWALSK 295
           +A+ +EKVS    +M  H  A    T+  E  S++V  A++          + ++W+L+ 
Sbjct: 492 VARGLEKVSSQGLVMLAHAFATIGHTQNEEFWSLIVDAAISRASNLWPIECAQLSWSLAT 551

Query: 296 I---GGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASD 352
           +     EL     M+ + +  L ++  +  Q +A+VA +F+++ +  P+L+  LAKR+  
Sbjct: 552 VRRKSDEL-----MNGIEKQVLRRIDGYTPQGLASVAWSFSTLGYDVPNLYDALAKRSLQ 606

Query: 353 IVHTFQEQELAQVLWAFASLYEPADPLLESL 383
           ++  F   +   ++ A+++   P   LL+++
Sbjct: 607 LMEDFSPTDKVLLVLAYSNHTHPHPNLLDAV 637



 Score = 44.3 bits (103), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 25/81 (30%), Positives = 40/81 (49%), Gaps = 6/81 (7%)

Query: 309 VAEVALTKVGEFNSQNVANVAGAFAS------MQHSAPDLFSELAKRASDIVHTFQEQEL 362
           + + A+  +  F+++N+AN+  +FA        +     LF  LA  A    H  Q Q +
Sbjct: 370 IGDAAVPILPTFDARNIANLVHSFAKAEVVPIYEPGKCTLFDMLADSALSKDHDMQPQNI 429

Query: 363 AQVLWAFASLYEPADPLLESL 383
           A +LWAFA +  P+  L E L
Sbjct: 430 ANILWAFAKMKHPSPKLFEEL 450


>gi|294886889|ref|XP_002771904.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239875704|gb|EER03720.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 1157

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/118 (28%), Positives = 56/118 (47%), Gaps = 15/118 (12%)

Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDA---------------VLVDKKVAFEIDG 565
           TS   ++V++     GL    E  V  Y++D                +LV +  AFE+DG
Sbjct: 834 TSQLHRQVSKFFTMVGLRHRNEVVVGPYSIDVSGLGRGLEQAVISVKILVGESFAFEVDG 893

Query: 566 PTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYI 623
           P HF R+T +    ++LK R + A G+ V+ + +QEW +     ++L Y+    K  I
Sbjct: 894 PHHFYRDTSMRTASSLLKHRILEALGFTVIRVPYQEWSQCGTREKRLRYVGSFWKQLI 951


>gi|401408949|ref|XP_003883923.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325118340|emb|CBZ53891.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 515

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/95 (36%), Positives = 47/95 (49%), Gaps = 2/95 (2%)

Query: 510 AGKTKRFNQKVTSSFQKEVARLLVSTGL--NWIREYAVDGYTVDAVLVDKKVAFEIDGPT 567
           A + K  N    S  QK V RLL   GL      EY +  Y +D  +  +K+  E+DG  
Sbjct: 394 AREKKLLNLVHVSQVQKRVGRLLFDEGLMSEICVEYPLGPYVLDFAIPSRKLVVEVDGEA 453

Query: 568 HFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
           HF   T VP   T +KR  +AA GW+VV +  + W
Sbjct: 454 HFFFGTTVPTAQTRMKRELLAAMGWHVVVVPQELW 488


>gi|428175207|gb|EKX44098.1| hypothetical protein GUITHDRAFT_139952 [Guillardia theta CCMP2712]
          Length = 1108

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 39/116 (33%), Positives = 60/116 (51%), Gaps = 9/116 (7%)

Query: 521  TSSFQKEVARLLVSTGLNWIREY--AVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLG 578
            +S+ Q  VA  + +  L  I E    V GY +D  L  ++   E+DGP HF+  T  PLG
Sbjct: 987  SSNLQNSVALAIAALDLEMIEEMKDTVSGYRLDIFLPAQQKVVEVDGPRHFAFETRRPLG 1046

Query: 579  HTMLKRRYIAAAGWNVVSLSHQEWEE-------LQGSFEQLDYLRVILKDYIGGEG 627
             T+LKRR +    +  V++ + EW+E          + EQL+YLR  + D+  G+ 
Sbjct: 1047 PTVLKRRILELLRYKPVTIPYWEWDERGGGAGGGGFTREQLEYLRSKIFDHTMGDA 1102


>gi|412993830|emb|CCO14341.1| predicted protein [Bathycoccus prasinos]
          Length = 676

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 90/369 (24%), Positives = 157/369 (42%), Gaps = 51/369 (13%)

Query: 282 SAQGISNIAWALSKI---GGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHS 338
           S Q I N  W+   +    GE +    MD   ++      +F +Q ++N+A A A +QH 
Sbjct: 306 STQAIGNAMWSCGTLRCHPGEKI----MDAYLKLTTEYHEKFKTQEISNIAWASAMLQHH 361

Query: 339 APDLF-----SELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDNAFKDATQ 392
             D F       LAKR  +       Q ++  L   A+  Y+  + +L++L    + A +
Sbjct: 362 PGDAFLSVVSETLAKRLEECA----SQAVSNSLLGLATFGYKMDEEMLKALGGK-RHARR 416

Query: 393 FTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLG----NIAWSYAVLGQ-- 446
              C ++ L N         + +A+  G L + V S + D+      N+ +   V+ Q  
Sbjct: 417 ---CNSQDLCNSIWALAAVDAFEAEVYGDLWARVSSMHHDEFAPEGLNMLYHACVMHQDH 473

Query: 447 -MDR-------IFFSDIWKTISRFEEQRISEQYREDIM-------FASQVHLVNQCLKLE 491
            MD+       +   ++   ++   ++R S               F+S +H     + + 
Sbjct: 474 WMDQHAVGNDDVVDEEVLDDVTNTSKKRKSTNQSTTTTKSTKAKGFSSSLH----GMGVR 529

Query: 492 HPHLQLALSSVLEEKIASAGKTKRFNQKVT-SSFQKEVA-RLLVSTGLNWIREYAV-DG- 547
              LQ   + +  + IA      +    VT S+F K V+ R+      N   EY   DG 
Sbjct: 530 EVALQRHDTPIWLDTIAKKSYDDQTIHSVTLSAFHKHVSTRIRAGFIKNVADEYLTEDGV 589

Query: 548 YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGH-TMLKRRYIAAAGWNVVSLSHQEWEELQ 606
            ++D  L+D K+A E DGP+HF +N    + H T+++ R +   GW VVS+ + EW+E  
Sbjct: 590 MSIDIALLDHKIAIECDGPSHFEKNMEKSMTHKTIIRNRGLERRGWRVVSIPYFEWQEAN 649

Query: 607 GSFEQLDYL 615
            +     YL
Sbjct: 650 ANETHRKYL 658


>gi|294942284|ref|XP_002783468.1| hypothetical protein Pmar_PMAR006996 [Perkinsus marinus ATCC 50983]
 gi|239895923|gb|EER15264.1| hypothetical protein Pmar_PMAR006996 [Perkinsus marinus ATCC 50983]
          Length = 389

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 88/187 (47%), Gaps = 30/187 (16%)

Query: 196 EINLNKDIVDAQTAQEV---LEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVS 252
           E  + K I+ A  A+ V   LE++   +T          L+ +N++T +HR+A      S
Sbjct: 169 EFEIQKSILVAANARSVKGLLEIVDTHVTQ---------LNSVNVSTLIHRLA------S 213

Query: 253 MMTTH---RLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRV 309
           +   H   + A TR   M  ++  A+      S Q +SNI+WA+ K     L LS+   V
Sbjct: 214 ITQNHEQSQKALTRDHRMKKVLRRAVELARISSCQSLSNISWAVGK-----LQLSDEKEV 268

Query: 310 AEV----ALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQV 365
            E     A T++  F  QN +N+    + + H    L   +AKR    +H F+ QE++ +
Sbjct: 269 VEAIVGAAKTRLEHFRPQNFSNMLYGLSRVNHYDKALMEMVAKRVLGTIHNFKPQEVSNL 328

Query: 366 LWAFASL 372
           L+A+  L
Sbjct: 329 LYAYGRL 335


>gi|237839529|ref|XP_002369062.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
 gi|211966726|gb|EEB01922.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
          Length = 1448

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 72/137 (52%), Gaps = 4/137 (2%)

Query: 472  REDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQ---KVTSSFQKEV 528
            R DI   +++ +V+  L+L  P    +L   L+  +A A +     Q    ++S   ++V
Sbjct: 1018 RLDIGSVTRLQIVDLYLRLLRPPAFASLPFDLKAFLARARRVDLAQQDCFSLSSKLHRDV 1077

Query: 529  ARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIA 588
            +   +  GL    E  +  +++D VL D+ +A EIDGP+HF R T + +  + LK+R + 
Sbjct: 1078 SSAFLRIGLVHRSEVQLGPFSLDIVLGDR-LAVEIDGPSHFYRETCMRVASSRLKQRLLR 1136

Query: 589  AAGWNVVSLSHQEWEEL 605
              GW V+ +S  EW +L
Sbjct: 1137 EMGWTVLPVSFFEWRQL 1153



 Score = 43.1 bits (100), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 47/89 (52%), Gaps = 2/89 (2%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
           Q  SN+  A  ++  E+L L   +  A      + ++N Q+++N+A A++ +  S P+LF
Sbjct: 476 QDFSNLLNAFGRL--EILDLELFNLAAPEISAGIRDYNPQHLSNIAHAYSKVSVSQPELF 533

Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASL 372
             +A+     +  F  +ELA +  AFA +
Sbjct: 534 FRIAEMTRRSIQNFSNRELANLALAFAKM 562


>gi|221507781|gb|EEE33368.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 1444

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 72/137 (52%), Gaps = 4/137 (2%)

Query: 472  REDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQ---KVTSSFQKEV 528
            R DI   +++ +V+  L+L  P    +L   L+  +A A +     Q    ++S   ++V
Sbjct: 1018 RLDIGSVTRLQIVDLYLRLLRPPAFASLPFDLKAFLARARRVDLAQQDCFSLSSKLHRDV 1077

Query: 529  ARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIA 588
            +   +  GL    E  +  +++D VL D+ +A EIDGP+HF R T + +  + LK+R + 
Sbjct: 1078 SSAFLRIGLVHRSEVQLGPFSLDIVLGDR-LAVEIDGPSHFYRETCMRVASSRLKQRLLR 1136

Query: 589  AAGWNVVSLSHQEWEEL 605
              GW V+ +S  EW +L
Sbjct: 1137 EMGWTVLPVSFFEWRQL 1153



 Score = 43.1 bits (100), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 47/89 (52%), Gaps = 2/89 (2%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
           Q  SN+  A  ++  E+L L   +  A      + ++N Q+++N+A A++ +  S P+LF
Sbjct: 476 QDFSNLLNAFGRL--EILDLELFNLAAPEISAGIRDYNPQHLSNIAHAYSKVSVSQPELF 533

Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASL 372
             +A+     +  F  +ELA +  AFA +
Sbjct: 534 FRIAEMTRRSIQNFSNKELANLALAFAKM 562


>gi|221483292|gb|EEE21611.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 1449

 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 72/137 (52%), Gaps = 4/137 (2%)

Query: 472  REDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQ---KVTSSFQKEV 528
            R DI   +++ +V+  L+L  P    +L   L+  +A A +     Q    ++S   ++V
Sbjct: 1018 RLDIGSVTRLQIVDLYLRLLRPPAFASLPFDLKAFLARARRVDLAQQDCFSLSSKLHRDV 1077

Query: 529  ARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIA 588
            +   +  GL    E  +  +++D VL D+ +A EIDGP+HF R T + +  + LK+R + 
Sbjct: 1078 SSAFLRIGLVHRSEVQLGPFSLDIVLGDR-LAVEIDGPSHFYRETCMRVASSRLKQRLLR 1136

Query: 589  AAGWNVVSLSHQEWEEL 605
              GW V+ +S  EW +L
Sbjct: 1137 EMGWTVLPVSFFEWRQL 1153



 Score = 43.1 bits (100), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 47/89 (52%), Gaps = 2/89 (2%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
           Q  SN+  A  ++  E+L L   +  A      + ++N Q+++N+A A++ +  S P+LF
Sbjct: 476 QDFSNLLNAFGRL--EILDLELFNLAAPEISAGIRDYNPQHLSNIAHAYSKVSVSQPELF 533

Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASL 372
             +A+     +  F  +ELA +  AFA +
Sbjct: 534 FRIAEMTRRSIQNFSNKELANLALAFAKM 562


>gi|159481474|ref|XP_001698804.1| predicted protein of CLR family [Chlamydomonas reinhardtii]
 gi|158273515|gb|EDO99304.1| predicted protein of CLR family [Chlamydomonas reinhardtii]
          Length = 1235

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 91/348 (26%), Positives = 158/348 (45%), Gaps = 52/348 (14%)

Query: 265 REMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQN 324
           R +S   A  +  LP    QG+SN AWA +++G     L     +A  AL K+  F +Q 
Sbjct: 642 RMLSAWAAQTLEKLPSFEPQGLSNTAWAFARLGFHSPQL--FQALAAAALHKIDGFTAQG 699

Query: 325 VANVAGAFASMQHSAPDLFSELAKRASDIVHT--FQEQELAQVLWAFASL--YEPA--DP 378
           ++N+A A A+  H+ P LF  LA++A+ +  T  F  Q  +  LWA ASL  Y+ A  D 
Sbjct: 700 LSNLAWAMATAGHAQPRLFEALARQAAALAPTGAFNAQNCSVTLWAAASLRHYDQALFDA 759

Query: 379 LLESLDNAFKD-ATQFTCCLNKALSNCN-ENGGVKSSGDADSEGSLSSPVLSF----NRD 432
           +L  L  A ++   +   C  + ++N       +  S  A++   L   V++     ++ 
Sbjct: 760 MLRRLVAALEEGGAEADGCEPQNVANALWAVARMGHSLPAEAAAPLLRHVVALMPRMSQQ 819

Query: 433 QLGNIAWSYAVLGQMDRIFFSDIWKTISRFEE---QRISEQYREDIMFASQVHLVNQC-- 487
           +L N  W+ AV+ +MD   ++     ++R  +   + + + Y   +MF S  HL      
Sbjct: 820 ELCNSMWAVAVMDRMDEGLWAAFCACLTRLPDISPEGMHQAYHAQLMFHS--HLARAAGM 877

Query: 488 --LKLEH--------------PHLQLALSSVLEEKIASAGK---TKRFNQKVTSSFQKEV 528
              KL+               P L   L +V     A++ +     RF+Q+V+ +    +
Sbjct: 878 PLSKLQALAAADPAAGSRSLLPCLPEPLHTVAASMWAASARDVHVSRFHQEVSGA----L 933

Query: 529 ARLLVSTGLNWI---REYAVD-GYTVDAVLVDKKVAFEIDGPTHFSRN 572
           A   V   L W+   + ++VD G  V+A    +  A E++G  H++ N
Sbjct: 934 AVAGVPHALEWMTDDQHFSVDIGLQVNA----RPTAVEVNGSHHYASN 977



 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 65/244 (26%), Positives = 104/244 (42%), Gaps = 37/244 (15%)

Query: 270 LVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDR-----VAEVALTKVGEFNSQN 324
           L A+ +  +    A+G++N AWA     G+L Y+          +A  AL ++GEF+ QN
Sbjct: 383 LAALMINQINSFDARGLANSAWAF----GKLKYVPAAGTSLPTVIAAAALRRMGEFSPQN 438

Query: 325 VANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESL 383
           ++N+  +F  M H    L +  A+     V  F+ QELA ++WAFASL Y       E +
Sbjct: 439 LSNLVWSFVYMHHVDEALLAAAARYVVARVGEFKPQELANIVWAFASLGYRD-----EHM 493

Query: 384 DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVL-------SFNRDQLGN 436
            +      Q    L K     N    +   G       L S ++       +F    + N
Sbjct: 494 LHVVASQAQRIAPLFKEQELSNVLWALGKMGLRHRPDVLESLMVETRTKLPAFLPQGISN 553

Query: 437 IAWSYAVLGQMDRIFFSDIWKTISR----FEEQRI--------SEQYREDIMFASQVHLV 484
           +AW+ A +G +D +F   +     R    F+ Q +        S  Y +    A+   LV
Sbjct: 554 VAWALAAVGHVDELFLDRVVAQCGRQLGAFDVQALANLVWAMASLGYYQPPFLAA---LV 610

Query: 485 NQCL 488
           N+CL
Sbjct: 611 NECL 614



 Score = 58.9 bits (141), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 65/256 (25%), Positives = 110/256 (42%), Gaps = 33/256 (12%)

Query: 268 SMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRV-----AEVALTKVGEFNS 322
           +++ A A+  + E S Q +SN+ W+        +Y+  +D       A   + +VGEF  
Sbjct: 421 TVIAAAALRRMGEFSPQNLSNLVWSF-------VYMHHVDEALLAAAARYVVARVGEFKP 473

Query: 323 QNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADP-LLE 381
           Q +AN+  AFAS+ +    +   +A +A  I   F+EQEL+ VLWA   +     P +LE
Sbjct: 474 QELANIVWAFASLGYRDEHMLHVVASQAQRIAPLFKEQELSNVLWALGKMGLRHRPDVLE 533

Query: 382 SLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADS------EGSLSSPVLSFNRDQLG 435
           SL    +  T+    L + +SN      + + G  D              + +F+   L 
Sbjct: 534 SL--MVETRTKLPAFLPQGISNVAW--ALAAVGHVDELFLDRVVAQCGRQLGAFDVQALA 589

Query: 436 NIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEH--P 493
           N+ W+ A LG     F + +          R+S Q   +I++         C  L H  P
Sbjct: 590 NLVWAMASLGYYQPPFLAALVNECLARGLDRLSPQNLSNILWG--------CATLGHRDP 641

Query: 494 HLQLALSSVLEEKIAS 509
            +  A ++   EK+ S
Sbjct: 642 RMLSAWAAQTLEKLPS 657



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 52/180 (28%), Positives = 76/180 (42%), Gaps = 34/180 (18%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVAL----TKVGEFNSQNVANVAGAFASMQHSA 339
           Q +SN+ WAL K+G     L     V E  +    TK+  F  Q ++NVA A A++ H  
Sbjct: 511 QELSNVLWALGKMG-----LRHRPDVLESLMVETRTKLPAFLPQGISNVAWALAAVGHVD 565

Query: 340 PDLFSELAKRASDIVHTFQEQELAQVLWAFASL--YEPADPLLESLDNAFKDATQFTCCL 397
                 +  +    +  F  Q LA ++WA ASL  Y+P  P L +L N          CL
Sbjct: 566 ELFLDRVVAQCGRQLGAFDVQALANLVWAMASLGYYQP--PFLAALVNE---------CL 614

Query: 398 NKALSNCNENG------GVKSSGDADSE--GSLSSPVL----SFNRDQLGNIAWSYAVLG 445
            + L   +         G  + G  D     + ++  L    SF    L N AW++A LG
Sbjct: 615 ARGLDRLSPQNLSNILWGCATLGHRDPRMLSAWAAQTLEKLPSFEPQGLSNTAWAFARLG 674


>gi|397576023|gb|EJK50024.1| hypothetical protein THAOC_31047, partial [Thalassiosira oceanica]
          Length = 292

 Score = 62.8 bits (151), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 76/286 (26%), Positives = 119/286 (41%), Gaps = 54/286 (18%)

Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVL 366
           D VA   L  +  FN QN++N+  AFA+   S   LF +L+  A+      + Q +A  L
Sbjct: 13  DHVA--GLGSLNSFNPQNLSNITWAFATAGVSHTKLFEKLSDAAARKGEFIETQHIANFL 70

Query: 367 WAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPV 426
           WA A++              + D   F+     AL+                   ++S +
Sbjct: 71  WACATV-------------GYTDERLFS-----ALTPV-----------------IASKL 95

Query: 427 LSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQ 486
             FN   L NIAW+Y+V     +  F+  +       E+  S +       A        
Sbjct: 96  DKFNLQNLANIAWAYSVANTPRQDLFNKGYAGALASIEKDFSAE-----GLAQLHQWQLW 150

Query: 487 CLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV- 545
             +LE     + L   L+ K  +A  ++ F++   S  Q +V   L +TGL    E  + 
Sbjct: 151 QQELES---GIELPRSLQAKCRNAFTSQGFSE---SKLQNDVVDELKATGLVLDEEVLLG 204

Query: 546 DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIA 588
            GY +DA++      KVA E+DGP+HF      P G T+LK R +A
Sbjct: 205 SGYRIDALVKIGDGGKVAVEVDGPSHFIDRR--PTGSTILKHRQVA 248



 Score = 43.1 bits (100), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 54/108 (50%), Gaps = 4/108 (3%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEF-NSQNVANVAGAF 332
            + +L   + Q +SNI WA +  G  + +    +++++ A  K GEF  +Q++AN   A 
Sbjct: 17  GLGSLNSFNPQNLSNITWAFATAG--VSHTKLFEKLSDAAARK-GEFIETQHIANFLWAC 73

Query: 333 ASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLL 380
           A++ ++   LFS L    +  +  F  Q LA + WA++    P   L 
Sbjct: 74  ATVGYTDERLFSALTPVIASKLDKFNLQNLANIAWAYSVANTPRQDLF 121


>gi|397563361|gb|EJK43767.1| hypothetical protein THAOC_37756, partial [Thalassiosira oceanica]
          Length = 1452

 Score = 62.8 bits (151), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 77/318 (24%), Positives = 132/318 (41%), Gaps = 30/318 (9%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
            + +L     Q +SN AWA +     + +    +++ EV + K   F+ + ++N   A A
Sbjct: 582 GLGSLDSFKPQNLSNTAWAYAT--ARVFHSRLFEKLTEV-VAKKDHFDERAISNFLWACA 638

Query: 334 SMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPAD--PLLESLDNAFKDAT 391
           ++ ++   LFS  A      +H   EQ+LA + WA++    P    P+ +      +   
Sbjct: 639 TVGYTDERLFSAFAPVIESKLHECNEQDLANIAWAYSVANIPKQDLPVRKGEFIEIQHIA 698

Query: 392 QFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIF 451
            F       L  C   G       +     ++S +   N   L NIAW+Y+V      +F
Sbjct: 699 NF-------LWACVTVGHTDERLLSAFAPVIASKLDECNDQDLANIAWAYSVANAPQDVF 751

Query: 452 FSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAG 511
                  ++ +E++   EQ       A          +LE     + L   L  K  +  
Sbjct: 752 NKGYVVALALYEKEFSGEQ------LAQLHQWQLWQQELES---GIELPRSLRAKCRNTF 802

Query: 512 KTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPT 567
            ++ F++   S  Q  V   L   GL+   E  +  GY +DA++    ++KVA E+DGP+
Sbjct: 803 TSQGFSE---SKLQNNVVDELRIAGLDLGEEVLLGSGYRIDALVKVGDERKVAVEVDGPS 859

Query: 568 HFSRNTGVPLGHTMLKRR 585
           HF +    P G T LK R
Sbjct: 860 HFIQRR--PAGSTTLKHR 875



 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 47/166 (28%), Positives = 68/166 (40%), Gaps = 9/166 (5%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
           Q  SN AWA +  G   L L          L  +  FN Q ++N A AFAS   S P LF
Sbjct: 514 QDFSNTAWAFATAGASHLELFNKIGNHIAGLGSLDSFNPQALSNTAWAFASAGESHPKLF 573

Query: 344 SELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
            ++    + +  + +F+ Q L+   WA+A+       L E L         F     +A+
Sbjct: 574 KKIGDHIAGLGSLDSFKPQNLSNTAWAYATARVFHSRLFEKLTEVVAKKDHFD---ERAI 630

Query: 402 SN----CNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
           SN    C   G       +     + S +   N   L NIAW+Y+V
Sbjct: 631 SNFLWACATVGYTDERLFSAFAPVIESKLHECNEQDLANIAWAYSV 676



 Score = 55.5 bits (132), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 52/204 (25%), Positives = 86/204 (42%), Gaps = 24/204 (11%)

Query: 273 IAMTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAG 330
           I M +L +   Q +SNI WA +  G     L+    D VA   L  +  F  Q+++N+A 
Sbjct: 386 IVMRSLNDFWPQDVSNIVWAYAAAGVSHPELFKKIGDHVA--GLDSLDSFEPQHLSNIAW 443

Query: 331 AFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFK 388
           +FA++  S P LF ++    + +  + +F+ Q L+ + WA A   E    L + + +   
Sbjct: 444 SFATVGESNPKLFKKIGDHVAGLGSLGSFKPQALSNISWACAKAGESNPKLFKKIGDHIA 503

Query: 389 DATQFTCCLNKALSNCNENGGVKSSGDADSE------------GSLSSPVLSFNRDQLGN 436
             +       +  SN        ++G +  E            GSL     SFN   L N
Sbjct: 504 GPSSLGSFYPQDFSNTAW--AFATAGASHLELFNKIGNHIAGLGSLD----SFNPQALSN 557

Query: 437 IAWSYAVLGQMDRIFFSDIWKTIS 460
            AW++A  G+     F  I   I+
Sbjct: 558 TAWAFASAGESHPKLFKKIGDHIA 581



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 54/208 (25%), Positives = 86/208 (41%), Gaps = 35/208 (16%)

Query: 284 QGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           Q +SNIAW+ + +G     L+    D VA   L  +G F  Q ++N++ A A    S P 
Sbjct: 436 QHLSNIAWSFATVGESNPKLFKKIGDHVA--GLGSLGSFKPQALSNISWACAKAGESNPK 493

Query: 342 LFSELAKR--ASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNK 399
           LF ++         + +F  Q+ +   WAFA+       L   + N             +
Sbjct: 494 LFKKIGDHIAGPSSLGSFYPQDFSNTAWAFATAGASHLELFNKIGNHIAGLGSLDSFNPQ 553

Query: 400 ALSNCNENGGVKSSGDADSE------------GSLSSPVLSFNRDQLGNIAWSYAVLGQM 447
           ALSN        S+G++  +            GSL S    F    L N AW+YA     
Sbjct: 554 ALSNTAW--AFASAGESHPKLFKKIGDHIAGLGSLDS----FKPQNLSNTAWAYATA--- 604

Query: 448 DRIFFSDIWKTIS-------RFEEQRIS 468
            R+F S +++ ++        F+E+ IS
Sbjct: 605 -RVFHSRLFEKLTEVVAKKDHFDERAIS 631



 Score = 45.8 bits (107), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 32/120 (26%), Positives = 56/120 (46%), Gaps = 11/120 (9%)

Query: 259 LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVG 318
           + +T +R  S    +  + L EC+ Q ++NIAWA S     +  + + D        + G
Sbjct: 640 VGYTDERLFSAFAPVIESKLHECNEQDLANIAWAYS-----VANIPKQD-----LPVRKG 689

Query: 319 EF-NSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPAD 377
           EF   Q++AN   A  ++ H+   L S  A   +  +    +Q+LA + WA++    P D
Sbjct: 690 EFIEIQHIANFLWACVTVGHTDERLLSAFAPVIASKLDECNDQDLANIAWAYSVANAPQD 749



 Score = 41.2 bits (95), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 39/158 (24%), Positives = 64/158 (40%), Gaps = 38/158 (24%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHS----APDLFSELAKRASDIVHTFQEQE 361
            D +A  A+  + EF++++++N+  +F  ++ +       LF+   + A  I+HTF  Q 
Sbjct: 263 FDSIASSAVGMLNEFDARHLSNLIYSFGLVERNPYIGGETLFNVFREAAVKILHTFISQN 322

Query: 362 LAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGS 421
           L+ +LWAF  +               K++  F            E G V S  D D    
Sbjct: 323 LSNMLWAFVKVDA-------------KNSRLF-----------QETGRVISGMDLD---- 354

Query: 422 LSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTI 459
                 SF      NI WS+A  G+ D   F  +   I
Sbjct: 355 ------SFKPQDFANILWSFAKSGEADSKLFQALGNHI 386


>gi|397639734|gb|EJK73730.1| hypothetical protein THAOC_04631, partial [Thalassiosira oceanica]
          Length = 856

 Score = 62.4 bits (150), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 69/310 (22%), Positives = 128/310 (41%), Gaps = 51/310 (16%)

Query: 284 QGISNIAWALSK-----------IGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAF 332
           Q ++NI W+ SK           +G  +  +  +D            F+ Q ++N A AF
Sbjct: 574 QALANILWSFSKSSKADPEPFRLLGNHIANMGRLD-----------SFDPQALSNTAWAF 622

Query: 333 ASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFAS-------LYEPADPLLESL 383
           A+   S P+L  ++    +  D + +F  QEL+  +WA+A+       L+E     + + 
Sbjct: 623 ATAGQSNPELLKKIGDHVAGLDSLDSFNPQELSNTIWAYATARVLDLGLFEKLATEVAAR 682

Query: 384 DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
           +  F +       ++  L  C   G       +     + S +   N+  L NIAW+Y+V
Sbjct: 683 NGQFIETQH----MSNFLWACATVGYTDERMFSAFAPVIESKLDECNKQDLANIAWTYSV 738

Query: 444 LGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVL 503
                  F       ++ +E     E   +   +      +   ++L            L
Sbjct: 739 ANAPQDTFNKGYVSALAAYENAFSKEALSQLHQWQLLQQELESGVELPQ---------SL 789

Query: 504 EEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKV 559
           +EK  +A  +  +++   S  Q +V   L + GL+   E  +  GY +DA++    ++KV
Sbjct: 790 QEKCRNAFTSLGYSE---SKLQNDVVGELKAAGLDLDEEVLLGSGYRIDALVKIGDERKV 846

Query: 560 AFEIDGPTHF 569
           A E+DGP+HF
Sbjct: 847 AVEVDGPSHF 856



 Score = 45.1 bits (105), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 39/159 (24%), Positives = 74/159 (46%), Gaps = 27/159 (16%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHS----APDLFSELAKRASDIVHTFQEQE 361
            D +A  A   + +F++++++N+  +F  ++ +       LF+     A  I+HTF+ QE
Sbjct: 478 FDSIASSAAVVLNKFDARHLSNLIYSFGLVERNPEIRGKTLFNVFGTAAVKILHTFKPQE 537

Query: 362 LAQVLWAF-------ASLYEPADPLLESLD-NAFKDATQFTCCLNKALSNCNENGGVKSS 413
           L+ +LWAF       + L++    ++  +D  +FK          +AL+N   +    S 
Sbjct: 538 LSNMLWAFVKVDAKNSRLFQETCRVISGMDLGSFKP---------QALANILWSFSKSSK 588

Query: 414 GDADSEGSLSSPVL------SFNRDQLGNIAWSYAVLGQ 446
            D +    L + +       SF+   L N AW++A  GQ
Sbjct: 589 ADPEPFRLLGNHIANMGRLDSFDPQALSNTAWAFATAGQ 627



 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 45/198 (22%), Positives = 79/198 (39%), Gaps = 38/198 (19%)

Query: 284 QGISNIAWALSKIGGELLYL-SEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
           Q +SN+ WA  K+  +   L  E  RV  ++   +G F  Q +AN+  +F+    + P+ 
Sbjct: 536 QELSNMLWAFVKVDAKNSRLFQETCRV--ISGMDLGSFKPQALANILWSFSKSSKADPEP 593

Query: 343 FSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKA 400
           F  L    +++  + +F  Q L+   WAFA+  +    LL+ +                 
Sbjct: 594 FRLLGNHIANMGRLDSFDPQALSNTAWAFATAGQSNPELLKKI----------------- 636

Query: 401 LSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTIS 460
                           D    L S + SFN  +L N  W+YA    +D   F  +   ++
Sbjct: 637 ---------------GDHVAGLDS-LDSFNPQELSNTIWAYATARVLDLGLFEKLATEVA 680

Query: 461 RFEEQRISEQYREDIMFA 478
               Q I  Q+  + ++A
Sbjct: 681 ARNGQFIETQHMSNFLWA 698


>gi|237832727|ref|XP_002365661.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
 gi|211963325|gb|EEA98520.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
          Length = 861

 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 48/97 (49%), Gaps = 3/97 (3%)

Query: 509 SAGKTKRF-NQKVTSSFQKEVARLLVSTGL--NWIREYAVDGYTVDAVLVDKKVAFEIDG 565
           S  +TK+  N    S  QK V RLL   GL      EY +  Y +D  +  +K+  E+DG
Sbjct: 738 SLARTKKLLNLVHVSQVQKRVGRLLFDEGLMSEIDVEYPLGPYVLDFAIPSRKLVVEVDG 797

Query: 566 PTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
             HF   T VP   T +KR  +AA GW VV +  + W
Sbjct: 798 EAHFFFGTTVPTAQTRMKRELLAAMGWRVVVVPQELW 834


>gi|221488117|gb|EEE26331.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 862

 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 48/97 (49%), Gaps = 3/97 (3%)

Query: 509 SAGKTKRF-NQKVTSSFQKEVARLLVSTGL--NWIREYAVDGYTVDAVLVDKKVAFEIDG 565
           S  +TK+  N    S  QK V RLL   GL      EY +  Y +D  +  +K+  E+DG
Sbjct: 739 SLARTKKLLNLVHVSQVQKRVGRLLFDEGLMSEIDVEYPLGPYVLDFAIPSRKLVVEVDG 798

Query: 566 PTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
             HF   T VP   T +KR  +AA GW VV +  + W
Sbjct: 799 EAHFFFGTTVPTAQTRMKRELLAAMGWRVVVVPQELW 835


>gi|384254362|gb|EIE27836.1| hypothetical protein COCSUDRAFT_83456 [Coccomyxa subellipsoidea
           C-169]
          Length = 454

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 85/342 (24%), Positives = 139/342 (40%), Gaps = 37/342 (10%)

Query: 270 LVAIAMTALPECSAQGISNIAWALSKI----GGELLYLSEMDRVAEVALTKVGEFNSQNV 325
           + A A   L E   QGIS + W   K+     G+LL     D++A      V  +  Q V
Sbjct: 77  IAAAASARLHEFQPQGISMLTWGYGKLDHAPAGDLL-----DQIAHALELDVSVYRHQAV 131

Query: 326 ANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPAD---PLLE 381
           AN+  +FA +Q  +P L + +    +D    F  QEL  +LWAF    + P      LLE
Sbjct: 132 ANMFYSFARLQKDSPTLCAAVETHVTDHAEDFSPQELMNILWAFVKFRFVPKQFIAALLE 191

Query: 382 SL---DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIA 438
            +   D A    +     L   L++   +   +     +  G    P  S    +L N+ 
Sbjct: 192 YVLDEDRARTFRSSDWAALIWGLASLGVSVPAEPMAAINKAGLQHLP--SMTAPELCNVM 249

Query: 439 WSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLA 498
           W  ++L + ++  F +    +   ++Q + E             L+ Q L+         
Sbjct: 250 WGLSILDECNQPIFVESMSQLLENKQQTVLEP-----------RLLRQLLQASALAQAAD 298

Query: 499 LSSVLEEKI-ASAGKTKRFNQKVTSSFQKE-VARLLVSTGL-NWIREYAVDGY-TVDAVL 554
           +S  L E +  +A K  R       S   + V+R L + G+ + +  +  +G  TVD  L
Sbjct: 299 VSVSLPEPVHKAAAKWWRATANTVPSLTHDGVSRTLKNLGVKHRVLVFLQEGLPTVDIAL 358

Query: 555 V----DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW 592
                  KVA ++ GP   S NT   LG    + R ++A+GW
Sbjct: 359 EAWGDQPKVAIQVVGPHEVSTNTNTLLGRATAEARLLSASGW 400


>gi|221508635|gb|EEE34204.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 863

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 48/97 (49%), Gaps = 3/97 (3%)

Query: 509 SAGKTKRF-NQKVTSSFQKEVARLLVSTGL--NWIREYAVDGYTVDAVLVDKKVAFEIDG 565
           S  +TK+  N    S  QK V RLL   GL      EY +  Y +D  +  +K+  E+DG
Sbjct: 740 SLARTKKLLNLVHVSQVQKRVGRLLFDEGLMSEIDVEYPLGPYVLDFAIPSRKLVVEVDG 799

Query: 566 PTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
             HF   T VP   T +KR  +AA GW VV +  + W
Sbjct: 800 EAHFFFGTTVPTAQTRMKRELLAAMGWRVVVVPQELW 836


>gi|308813528|ref|XP_003084070.1| unnamed protein product [Ostreococcus tauri]
 gi|116055953|emb|CAL58486.1| unnamed protein product [Ostreococcus tauri]
          Length = 812

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 71/278 (25%), Positives = 120/278 (43%), Gaps = 49/278 (17%)

Query: 357 FQEQELAQVLWAFASL-YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGD 415
            Q Q L+ +LWA A L Y P D   E    AF + +              + G     G 
Sbjct: 395 LQTQSLSNILWALAILRYVPED---EDFLVAFSERSLIEL----------QQGRFSYQGL 441

Query: 416 ADSEGSLSSPVLSFNRDQ---------LGN-IAWSYAVLGQMDRIFFSDIWKTISRFEEQ 465
            ++  + S  VL  N  Q         +GN +A  ++  G  + +F    +  +  + E+
Sbjct: 442 TNTVWAFS--VLGINPGQTLLDEFAREIGNRLAGYFSSQGVSNSLF---AFAVLEYWPEK 496

Query: 466 RISEQYREDIM-------FA----SQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTK 514
            + + YR  ++       F+    +Q+   N   +   PH  L     +     +A K  
Sbjct: 497 WVVDAYRAKLVETEKTTGFSEIDWTQLFQANVVFERYSPHGALITDPKMLAAAEAAWKVG 556

Query: 515 RFNQKVTSSFQKEVARLLVSTGL-NWIREYAVDG-YTVDAVLVDKKVAFEIDGPTHFSRN 572
             ++ V S F +EV+  L   G+ + I +   DG +++D  L  KKVA E+DGP+HF+RN
Sbjct: 557 S-SKVVISQFHREVSETLTEMGVPHEIEKLVEDGLFSLDIALKGKKVAIEVDGPSHFARN 615

Query: 573 T------GVPLGHTMLKRRYIAAAGWNVVSLSHQEWEE 604
                  G   G T ++ R + ++GW++V +   EW E
Sbjct: 616 IRDRRLEGKDAGVTNMRTRCLTSSGWSIVHVPWFEWAE 653



 Score = 46.2 bits (108), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 52/102 (50%), Gaps = 8/102 (7%)

Query: 280 ECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASM--QH 337
           E +AQGISN  WA + +G +L    E+      A+ +V +F S   +N+  A  +M  + 
Sbjct: 152 EFAAQGISNSLWAFATLGYQL--RPELVSKFSQAIRRVKDFKSMEFSNMIWAVGTMKIEL 209

Query: 338 SAPDLFSELAKRA----SDIVHTFQEQELAQVLWAFASLYEP 375
             P+LF E+          + + +  Q ++ +LWA ASL +P
Sbjct: 210 DPPELFDEILDECLASMKALPNMWSSQSVSNILWAMASLNKP 251


>gi|399216298|emb|CCF72986.1| unnamed protein product [Babesia microti strain RI]
          Length = 838

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/127 (29%), Positives = 70/127 (55%), Gaps = 7/127 (5%)

Query: 486 QCLKLEHPHLQL----ALSSVLEEKIASAGKTK-RFNQKV--TSSFQKEVARLLVSTGLN 538
           Q ++L + +L L     LS  L+E +  A K +  FN+ +  +SS  +E++  L + G+N
Sbjct: 710 QTVQLYYKYLYLEGYNRLSDNLKELLEKAIKARISFNEYLPKSSSSHRELSTYLFAAGVN 769

Query: 539 WIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLS 598
            + E  +  Y++D V+ + K   E DGP+HF   T +    ++LK   + + G+N++ + 
Sbjct: 770 HLNEVRLGPYSLDIVISNTKTVIEYDGPSHFYCETTMRSPKSLLKHDILISMGYNLIHVP 829

Query: 599 HQEWEEL 605
             EWE+L
Sbjct: 830 FFEWEQL 836


>gi|302780213|ref|XP_002971881.1| hypothetical protein SELMODRAFT_412578 [Selaginella moellendorffii]
 gi|300160180|gb|EFJ26798.1| hypothetical protein SELMODRAFT_412578 [Selaginella moellendorffii]
          Length = 240

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 17/86 (19%)

Query: 538 NWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSL 597
            WI EY    Y++D                 F+   G  LGHT+LK R + AAGW ++S 
Sbjct: 161 QWIPEYVDADYSLD-----------------FAMKGGDLLGHTVLKHRLLEAAGWKIISA 203

Query: 598 SHQEWEELQGSFEQLDYLRVILKDYI 623
           S+ EWE LQG  E +D+++ ++  +I
Sbjct: 204 SYAEWENLQGESEHVDFIQKLVTPHI 229


>gi|399217569|emb|CCF74456.1| unnamed protein product [Babesia microti strain RI]
          Length = 368

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 51/97 (52%), Gaps = 4/97 (4%)

Query: 522 SSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTM 581
           S  Q+ V +L+   GL++  EY +  Y +D VL   ++A E++G +HF   T +    T 
Sbjct: 265 SKLQRTVTKLIGELGLDFAEEYPLGPYLIDLVLPKHRIAIEVNGFSHFYDQTILHTSKTR 324

Query: 582 LKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVI 618
           LK   +   GW +  ++H +W+ +  +    D LR++
Sbjct: 325 LKYSIVQRMGWKIAEINHHQWKNINRT----DRLRIL 357


>gi|428174671|gb|EKX43565.1| hypothetical protein GUITHDRAFT_140332 [Guillardia theta CCMP2712]
          Length = 1069

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 54/103 (52%), Gaps = 19/103 (18%)

Query: 521  TSSFQKEVARLLVSTGLNWIREYAVD---GYTVDAVLVDKKV---------------AFE 562
             S   K+V   +   GL  ++E  VD   GY++DA++   ++               A E
Sbjct: 903  VSPVTKQVVSCMKDLGLR-VQEEHVDSSTGYSIDALVEIPRMNKGGGAGGAGGEIFCAVE 961

Query: 563  IDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL 605
            +DGP+HF RN  VPLG T LKR+ +   G+ VVS+ + EW+ L
Sbjct: 962  VDGPSHFPRNDYVPLGGTALKRKQLRKIGYRVVSIPYWEWDAL 1004


>gi|302831782|ref|XP_002947456.1| hypothetical protein VOLCADRAFT_87583 [Volvox carteri f. nagariensis]
 gi|300267320|gb|EFJ51504.1| hypothetical protein VOLCADRAFT_87583 [Volvox carteri f. nagariensis]
          Length = 1333

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 23/46 (50%), Positives = 34/46 (73%)

Query: 558  KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWE 603
            +VA E+DGP  F+ NT  PLG T+ +RR++ A GW VVS+ ++EW+
Sbjct: 1222 RVAVEVDGPERFTANTWKPLGTTLYRRRWLTAHGWTVVSVPYREWQ 1267


>gi|397645982|gb|EJK77069.1| hypothetical protein THAOC_01121, partial [Thalassiosira oceanica]
          Length = 263

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 78/318 (24%), Positives = 132/318 (41%), Gaps = 72/318 (22%)

Query: 312 VALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASD--IVHTFQEQELAQVLWAF 369
           V    +G F  ++++N A AFA+   S P+LF ++    ++     +F+ QEL+  +WA 
Sbjct: 11  VGPGGLGSFKPRDLSNTAWAFATAGVSHPELFKKIGHHVAEQGCFDSFKPQELSNTVWAC 70

Query: 370 ASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSF 429
           A++    + L           + F   +   L  C+E                       
Sbjct: 71  ATVGYTDERLF----------SAFAPVIGSKLDECSEQ---------------------- 98

Query: 430 NRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLK 489
              +L NIAW+Y+V     +  F++ +       E+  S +    +     +    +   
Sbjct: 99  ---ELTNIAWAYSVANLPRQDLFNEGYVGALASNEKDFSVKELAQLHQWQLLQQELK-YG 154

Query: 490 LEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYT 549
           +E P LQ   + V+ E + +AG             ++EV  LL S            GY 
Sbjct: 155 VELPQLQ---NDVVGE-LRAAG----------VDLEEEV--LLGS------------GYR 186

Query: 550 VDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQEWEEL 605
           +DA++     ++VA E+DGP+HF      P G T LK R +A      VVS+ + EW+ L
Sbjct: 187 IDALVKFGGGRRVAVEVDGPSHFIDRR--PAGRTTLKHRQVATLDRIEVVSVPYWEWDVL 244

Query: 606 QGSFEQLDYLRVILKDYI 623
           + S  +  YLR + K  I
Sbjct: 245 ENSEMKQHYLRELSKGQI 262


>gi|307102859|gb|EFN51125.1| hypothetical protein CHLNCDRAFT_141313 [Chlorella variabilis]
          Length = 720

 Score = 59.3 bits (142), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 55/108 (50%), Gaps = 12/108 (11%)

Query: 520 VTSSFQKEVARLLVSTGLNWIREYAVDG-YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLG 578
           V + +  +V R +   G+  + E++  G Y++D  L + K+A E+DGP HF+ N+   +G
Sbjct: 419 VAACYPPKVHRTVCGLGVPCVLEHSEAGEYSIDVALPEHKIAVEVDGPVHFAANSRHLMG 478

Query: 579 HTMLKRRY------IAAA----GWNVVSLSHQEWEELQGSFEQLDYLR 616
            T LKRR       IAA     GW  V + + EW  L     +  Y+R
Sbjct: 479 GTALKRRLLETLFCIAAPMQRLGWRAVDVPYYEWWAL-APARRPSYMR 525



 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 30/113 (26%), Positives = 60/113 (53%), Gaps = 9/113 (7%)

Query: 233 SPLNIATALHRIA--KNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIA 290
           SP+    + +R+A  K M++  M + HR    R     +L A +    P  + Q +S++A
Sbjct: 223 SPVGAEASENRLALGKAMQRHIMASPHRRGVAR-----LLAAASRQLAPRLAPQALSSLA 277

Query: 291 WALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
            + + IGG    L  ++ +A  A+    + ++Q VAN+A A++++++  P L+
Sbjct: 278 HSFAAIGGCPWDL--LEELAARAVQLERQLDAQAVANLAWAYSTLRYDHPQLY 328


>gi|294909513|ref|XP_002777784.1| hypothetical protein Pmar_PMAR008719 [Perkinsus marinus ATCC 50983]
 gi|239885746|gb|EER09579.1| hypothetical protein Pmar_PMAR008719 [Perkinsus marinus ATCC 50983]
          Length = 222

 Score = 58.5 bits (140), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 55/106 (51%), Gaps = 1/106 (0%)

Query: 516 FNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGV 575
            N+K+T   Q E  RL+           A +   V     D+ +A E+DGP+HF  N+  
Sbjct: 40  LNEKLTPEEQAEKQRLIKELTKKLAGPLADENGNVPTG-KDRPIAIEVDGPSHFYANSTK 98

Query: 576 PLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
              +T LK R +   G+ V+ + + EW +L+G+ E+ +Y+R  LK+
Sbjct: 99  YTAYTKLKHRLLTRMGYKVLHVPYFEWRKLRGAKEREEYMRTKLKE 144


>gi|258597101|ref|XP_001347524.2| conserved Plasmodium protein [Plasmodium falciparum 3D7]
 gi|254922454|gb|AAN35437.2| conserved Plasmodium protein [Plasmodium falciparum 3D7]
          Length = 433

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 80/363 (22%), Positives = 144/363 (39%), Gaps = 74/363 (20%)

Query: 258 RLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKV 317
           +L FT+    +  + I M   P+  ++ ++ I   L K       LS +D          
Sbjct: 108 KLKFTKYSLYNNFIKIIMNKKPKIDSRMLTQILIDLHK-------LSSLD---------- 150

Query: 318 GEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPAD 377
                    NV   F               K+ +D    F   +L+ +L+ F   Y    
Sbjct: 151 --------INVLTFFTQYY----------IKKETD---QFSLFDLSMILYIFNK-YNYNH 188

Query: 378 PLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNI 437
             +E++DN  K  +Q+       L   +++ GV ++        LS   L+ N     ++
Sbjct: 189 --IETVDNISKTISQY------FLPYIDQDKGVLTTI------LLSISTLNLNYQFYLDV 234

Query: 438 A-------WSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKL 490
                   + +  +  +  I +S + + ++   +  I      DIM+     L+N   KL
Sbjct: 235 MKKHVYKKYEHFEVKYLCNILYSILLRLVNTLHKDDILNIMLNDIMYI----LLNNINKL 290

Query: 491 EHPHL-QLALSSVL-----EEKIASAGKT---KRFNQKVTSS-FQKEVARLLVSTGLNWI 540
           ++  L QL +S        EEK   A K    K     VT+S  Q+++A+L    GLN  
Sbjct: 291 KNEELKQLHISLYYLKDMKEEKYEEARKIIEKKNIKDTVTTSKIQQQIAKLFKEIGLNVE 350

Query: 541 REYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQ 600
           +E+ +  Y +D  L  KK+  E++G TH+    G     T LK   +    W V+++ + 
Sbjct: 351 KEFLIGPYVLDFALKKKKICIEVNGFTHYYNFNGKINAKTTLKYYILNKLKWKVLTIEYM 410

Query: 601 EWE 603
           +W+
Sbjct: 411 DWK 413


>gi|255076950|ref|XP_002502137.1| predicted protein [Micromonas sp. RCC299]
 gi|226517402|gb|ACO63395.1| predicted protein [Micromonas sp. RCC299]
          Length = 1128

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 98/455 (21%), Positives = 160/455 (35%), Gaps = 145/455 (31%)

Query: 286 ISNIAWALSKIGGELLYLSEMDRVA----EVALTKVGEFNSQNVANVAGAFASMQ-HSAP 340
            +N+ WA +K     L  +  DR      E  + K+ +F++Q +AN   A+A++Q   A 
Sbjct: 533 FANLLWAFAK-----LNHTPGDRFQAEFEEAVIEKISKFDAQVLANTVYAYAALQLPGAR 587

Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDNAFK---------DA 390
           ++   +     D +H F+ +EL  VLWAF    Y+P    +   + A +         + 
Sbjct: 588 NVLPLIGLHFKDRLHEFKPRELLMVLWAFTRCSYDPGADAMARFERAMRPMTDNLAPDEV 647

Query: 391 TQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVL------ 444
           TQ+    +  L      G ++       E  +      F+   +    W+YA L      
Sbjct: 648 TQYLWA-SAVLKYRPTEGALRG-----FETRIVDCPSRFSGTPIALTLWAYATLNLPPPF 701

Query: 445 GQMDRIFFSDIWKTISRFEEQRISEQYREDIMF-------------ASQVHLVN------ 485
             MDR  F D        E  R  E Y +D+               A  V +VN      
Sbjct: 702 AVMDR--FGD------ELELSRADEFYPQDLSLGFWSAAVIMTQPKADDVPMVNALDTGA 753

Query: 486 --QCLKLEHPHL-QLALSSVLEEKIA--------------------------------SA 510
             + L+    HL  L  +SV  E ++                                S+
Sbjct: 754 RERVLRQMAKHLGSLGATSVSPEGLSAIYMAILAVEMHSPSLFAELKSNWGHLAAAAESS 813

Query: 511 GKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDK---KVAFEIDGPT 567
            +  +      S  QK V + L   G+ +  E  V G  +    V K   K+  E+DGP 
Sbjct: 814 WRATKGKGPTVSKLQKAVGKTLDELGVEYESEKLVRGGLIRPDFVVKGKAKIVVEVDGPY 873

Query: 568 HFSRNTGV-----------------------------------PLGHTMLKRRYIAAAGW 592
           HFS                                        PLG T+L+ + +++ GW
Sbjct: 874 HFSVEPSAASDAGEELEDWFGGGGGETPDALEKDRFGFGSVLRPLGGTILRNQLLSSWGW 933

Query: 593 NVVSLSHQEW-------------EELQGSFEQLDY 614
           NVV++S+++W             E L+G  +Q  Y
Sbjct: 934 NVVTVSYRDWVKADNDTSGGAKREYLKGLLDQAGY 968



 Score = 40.0 bits (92), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 66/157 (42%), Gaps = 37/157 (23%)

Query: 195 KEINLNKDIVDAQTAQEVLEVI-----AEMITAVGKGLSPSPLSPLNIATALHRIAKNME 249
           K I +N+D+  A    +V  V+     AE   AV            N+ATA  R+ +++ 
Sbjct: 126 KRIGVNQDLAKASKIDDVRFVVQKNGNAEAFNAV------------NVATAYSRLGRHVR 173

Query: 250 KVSMMTTH----RLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIG---GELLY 302
                T       LA  R+       A A+T  PE SA   S++ WAL + G   G   +
Sbjct: 174 DWERGTLDGAEWYLALERR-------ARALT--PEMSAWAASSVTWALGRTGRNPGAAFW 224

Query: 303 LSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
           +    ++  VA     E   Q VANV    A+++H A
Sbjct: 225 VDLEAKLCTVA----DELEPQGVANVLWGLAALEHRA 257


>gi|401410506|ref|XP_003884701.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325119119|emb|CBZ54671.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 1458

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/135 (27%), Positives = 70/135 (51%), Gaps = 4/135 (2%)

Query: 474  DIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQ---KVTSSFQKEVAR 530
            +I   +++ +V+  L+L  P L  +L   L+  ++   +     Q    ++S   ++V+ 
Sbjct: 1038 EIGSVTRLQIVDLYLRLLRPELFASLPFDLKAFLSRVRRVDLTQQDCFSLSSKMHRDVSA 1097

Query: 531  LLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAA 590
              +  GL    E     +++D VL D+ +A EIDGP+HF R T + +  + LK+R +   
Sbjct: 1098 AFLRIGLVHRSEVQFGPFSLDIVLGDR-LAVEIDGPSHFYRETCMRVASSRLKQRLLREM 1156

Query: 591  GWNVVSLSHQEWEEL 605
            GW ++ +S  EW +L
Sbjct: 1157 GWTLLPVSFFEWRQL 1171



 Score = 42.0 bits (97), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 47/89 (52%), Gaps = 2/89 (2%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
           Q ISN+  A  K+  E+L +   +R A      + ++N Q+++N+A A++ +     +LF
Sbjct: 487 QDISNLLNAFGKL--EILDVELFNRAAPKIADGIRDYNPQHLSNIAHAYSKVSVPQSELF 544

Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASL 372
             +A+     V  F  +ELA +  AFA +
Sbjct: 545 VRIAEMTRRSVQNFSTKELANLALAFAKM 573


>gi|195998900|ref|XP_002109318.1| hypothetical protein TRIADDRAFT_53222 [Trichoplax adhaerens]
 gi|190587442|gb|EDV27484.1| hypothetical protein TRIADDRAFT_53222 [Trichoplax adhaerens]
          Length = 650

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 39/60 (65%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           +++A EIDGP HF+  +   LGHT++K R+++  GW+V+ + + EW +L    E   YL+
Sbjct: 585 ERIAIEIDGPVHFAYKSNRYLGHTIMKTRHLSLLGWHVIRVPYYEWNKLNDLPEIDRYLK 644



 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 23/87 (26%), Positives = 44/87 (50%), Gaps = 3/87 (3%)

Query: 286 ISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSE 345
           I+N+ WALSK   + L +    ++ + A+  + +FN  +++ V  + A     +  L + 
Sbjct: 224 IANLMWALSK---DQLNIDIFQQLQQQAINNINKFNPISISMVCYSLALFGDRSEQLLTA 280

Query: 346 LAKRASDIVHTFQEQELAQVLWAFASL 372
           +  R   I++    Q +A + WAFA L
Sbjct: 281 IENRMLAIINLLDPQSIANIAWAFAKL 307



 Score = 40.4 bits (93), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 25/92 (27%), Positives = 49/92 (53%), Gaps = 8/92 (8%)

Query: 285 GISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
            IS + ++L+  G   E L  +  +R+  +    +   + Q++AN+A AFA +     ++
Sbjct: 259 SISMVCYSLALFGDRSEQLLTAIENRMLAI----INLLDPQSIANIAWAFAKLNWFNDEI 314

Query: 343 FSELAKRASDIV--HTFQEQELAQVLWAFASL 372
           F  + KR  D +   T + Q ++ ++WAFAS+
Sbjct: 315 FGFIQKRTLDNIGKRTLRPQSISNIIWAFASM 346


>gi|397582907|gb|EJK52455.1| hypothetical protein THAOC_28263, partial [Thalassiosira oceanica]
          Length = 408

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 81/327 (24%), Positives = 131/327 (40%), Gaps = 57/327 (17%)

Query: 284 QGISNIAWALS--KIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           Q  SNI WA +  +     L+    D VA   L  +G FN Q ++    A+A+ +     
Sbjct: 39  QDFSNIVWAYATARESHPELFNKIGDHVAR--LGSLGSFNPQELSITVWAYATARVFHSR 96

Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
           LF +L   A      F+ Q +A  LWA A++    + L  +          F   +   L
Sbjct: 97  LFEKLTTEAVAKKDHFESQHIANFLWACATVGHTDERLFAA----------FAPLVGSKL 146

Query: 402 SNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISR 461
             C+E                          +L NI+W+Y+V    +   F+    +   
Sbjct: 147 DECSEQ-------------------------ELANISWAYSVANAPNLDLFNVGHVSALA 181

Query: 462 FEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVT 521
             E+  S +       A          +LE     + L   L+ K  +A  ++ F++   
Sbjct: 182 SNEKEFSAE-----GLAQLHQWQLWQQELES---GIELPQSLQAKCRNAFMSQCFSE--- 230

Query: 522 SSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPL 577
           S  Q +V   L + GL+   E  +  GY +DA++     +KVA E+DGP+HF      P 
Sbjct: 231 SKLQNDVVGELRAAGLDLEEEVLLGSGYRIDALVKVGDGRKVAVEVDGPSHFIDRR--PA 288

Query: 578 GHTMLKRRYIAAAG-WNVVSLSHQEWE 603
           G  +LK R +A      VVS+ + EW+
Sbjct: 289 GRAILKHRQVATLDRIEVVSVPYWEWD 315



 Score = 39.3 bits (90), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 47/101 (46%), Gaps = 2/101 (1%)

Query: 275 MTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFAS 334
           + +L   + Q +S   WA +     + +    +++   A+ K   F SQ++AN   A A+
Sbjct: 69  LGSLGSFNPQELSITVWAYAT--ARVFHSRLFEKLTTEAVAKKDHFESQHIANFLWACAT 126

Query: 335 MQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEP 375
           + H+   LF+  A      +    EQELA + WA++    P
Sbjct: 127 VGHTDERLFAAFAPLVGSKLDECSEQELANISWAYSVANAP 167


>gi|323444921|gb|EGB01813.1| hypothetical protein AURANDRAFT_69470 [Aureococcus anophagefferens]
          Length = 206

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 38/99 (38%), Positives = 52/99 (52%), Gaps = 4/99 (4%)

Query: 522 SSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLVDKK--VAFEIDGPTHFSRNTG-VPL 577
           S  Q EVA  L   GL+   E  + DG +VD  L+  K  VA E DGP H+ RN   VP 
Sbjct: 31  SRAQVEVAERLEGMGLDVEHELVLPDGLSVDVALLPLKWRVAVEFDGPRHYFRNAKRVPT 90

Query: 578 GHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           G T  K R + A GW V+ + + +W +L     + +YL+
Sbjct: 91  GRTRFKMRLLRALGWRVLHVPYFDWAKLDDDAARTEYLK 129


>gi|303290512|ref|XP_003064543.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454141|gb|EEH51448.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 628

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 87/341 (25%), Positives = 137/341 (40%), Gaps = 45/341 (13%)

Query: 286 ISNIAWALSKI----GGELLYLSEMDRVAEVALTKVGE-FNSQNVANVAGAFASMQH--- 337
           ++NI WA   +    G E L +     V E  LT   E  + Q +AN+  +FA  +H   
Sbjct: 200 LANILWAFHVLKTYPGPECLAV-----VGERMLTLTDEDLHVQTLANMMYSFAQFEHLPG 254

Query: 338 -SAPDLFSELAKRA---SDIVH----TFQEQELAQVLWAFASL-YEPADPLLESLDNA-- 386
            +  D   +L  RA   +D+      T     L+ ++WAF  L Y+P++    + D    
Sbjct: 255 RATMDRVEDLCARAFRSADVGEPGSVTPASNSLSNLIWAFGVLKYKPSEEFFAAFDAVVS 314

Query: 387 -----FKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQ-LGNIAWS 440
                F D          A  N N N G +     D+     +  +S    Q + N  W+
Sbjct: 315 STLGDFNDQGVSNVLFTYA--NLNHNPGAQL---LDALARRCADFISVYAPQGVANTVWA 369

Query: 441 YAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEH--PHLQLA 498
           + VL   D   +      + R   +RIS+   ED     +V L    L L+    H    
Sbjct: 370 WVVL---DGAKYPP--PALLRLYAERISKTRDEDFSKIDRVQLFQSHLALKQFSNHDGEL 424

Query: 499 LSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DG-YTVDAVLVD 556
           LS  +      A           S+  ++V+  L   G+    E+   DG ++VD  L  
Sbjct: 425 LSGEMLRSCERAWMEVSAGNLTISAIHRDVSETLTRMGIPHEIEFLTSDGLFSVDIALRG 484

Query: 557 KKVAFEIDGPTHFSRNTGVP-LGHTMLKRRYIAAAGWNVVS 596
           +KVA E+DGP+HF  N     +G  +L+   + + GW V S
Sbjct: 485 RKVAIEVDGPSHFFANKRRERMGADLLRAALMQSKGWTVRS 525



 Score = 45.8 bits (107), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 55/115 (47%), Gaps = 12/115 (10%)

Query: 286 ISNIAWA-----LSKIGGEL---LYLSEMDRVAEVALTKVGEFNSQNVANV---AGAFAS 334
           +SNI WA     LS + G L   + ++  D +     +    F+SQ+VAN    AG    
Sbjct: 75  LSNIVWAIASMNLSGLSGGLPREVMVALDDAMCRSIASDPDTFSSQSVANTLWAAGNAPD 134

Query: 335 MQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA-SLYEPADPLLESLDNAFK 388
           +   +P L   LA  + D  HTF  Q +   +W FA + + P D L++ +  A+K
Sbjct: 135 VVTLSPRLMDALASVSCDKFHTFTPQGMTNTIWGFACNGHHPGDELMDKMREAWK 189



 Score = 44.3 bits (103), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 46/186 (24%), Positives = 74/186 (39%), Gaps = 29/186 (15%)

Query: 282 SAQGISNIAWALSKIGGELLYLSE--MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
           S+Q ++N  WA      +++ LS   MD +A V+  K   F  Q + N    FA   H  
Sbjct: 118 SSQSVANTLWAAGN-APDVVTLSPRLMDALASVSCDKFHTFTPQGMTNTIWGFACNGHHP 176

Query: 340 PD-LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADP------------------LL 380
            D L  ++ +      HT+   ELA +LWAF  L     P                   +
Sbjct: 177 GDELMDKMREAWKRSGHTYIVTELANILWAFHVLKTYPGPECLAVVGERMLTLTDEDLHV 236

Query: 381 ESLDNAFKDATQFTCCLNKALSNCNENGGVKS--SGDADSEGSLSSPVLSFNRDQLGNIA 438
           ++L N      QF     +A  +  E+   ++  S D    GS++        + L N+ 
Sbjct: 237 QTLANMMYSFAQFEHLPGRATMDRVEDLCARAFRSADVGEPGSVTPA-----SNSLSNLI 291

Query: 439 WSYAVL 444
           W++ VL
Sbjct: 292 WAFGVL 297



 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 39/81 (48%), Gaps = 5/81 (6%)

Query: 297 GGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA-----PDLFSELAKRAS 351
           G E L ++ + R+ E+   K+ EF  Q V+N    FAS+  +      PD  S       
Sbjct: 5   GDEYLPMAMLARLEELVRVKMDEFIPQGVSNCIWGFASLNKNKGLELRPDTVSRFGDGIV 64

Query: 352 DIVHTFQEQELAQVLWAFASL 372
            +   F+  EL+ ++WA AS+
Sbjct: 65  RLASGFKSMELSNIVWAIASM 85


>gi|156089331|ref|XP_001612072.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154799326|gb|EDO08504.1| hypothetical protein BBOV_III009480 [Babesia bovis]
          Length = 239

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 41/66 (62%)

Query: 556 DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
           D+ +A E+DGP+HF  N+     +T LK R +   G+ V+ + + EW  L+G+ E+ +Y+
Sbjct: 133 DRPIAIEVDGPSHFYANSTKYTAYTKLKHRLLTRMGYKVLHVPYFEWRRLRGAKEREEYM 192

Query: 616 RVILKD 621
           R  LK+
Sbjct: 193 REKLKE 198


>gi|294946233|ref|XP_002784988.1| hypothetical protein Pmar_PMAR016478 [Perkinsus marinus ATCC 50983]
 gi|239898352|gb|EER16784.1| hypothetical protein Pmar_PMAR016478 [Perkinsus marinus ATCC 50983]
          Length = 132

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 34/105 (32%), Positives = 55/105 (52%), Gaps = 1/105 (0%)

Query: 516 FNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGV 575
            N+K+T   Q E  RL+           A +   V A   D+ +A E+DGP+HF  N+  
Sbjct: 29  LNEKLTPEEQAEKQRLIKELTKKLAGPLADENGNVPAG-KDRPIAIEVDGPSHFYANSTK 87

Query: 576 PLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILK 620
              +T LK R +   G+ V+ + + EW +L+G+ E+ +Y+R  LK
Sbjct: 88  YTAYTKLKHRLLTRMGYKVLHVPYFEWRKLRGAKEREEYMRTKLK 132


>gi|302834273|ref|XP_002948699.1| hypothetical protein VOLCADRAFT_104026 [Volvox carteri f.
            nagariensis]
 gi|300265890|gb|EFJ50079.1| hypothetical protein VOLCADRAFT_104026 [Volvox carteri f.
            nagariensis]
          Length = 3304

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 26/58 (44%), Positives = 34/58 (58%)

Query: 558  KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
            +VA E+DGP HF+ NT  PL  T  +RR + A GW VVS+ H  W E +    + D L
Sbjct: 3155 RVAVEVDGPAHFTANTKQPLSMTTYRRRCLEARGWVVVSVPHWRWFEFRSGQPERDVL 3212



 Score = 46.6 bits (109), Expect = 0.043,   Method: Composition-based stats.
 Identities = 40/146 (27%), Positives = 64/146 (43%), Gaps = 12/146 (8%)

Query: 232  LSPLNIATALHRIAK-NMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIA 290
              P+N+A ALHR+    +   S      +A  + +E+  + ++    L + + Q + N  
Sbjct: 2328 FEPVNVAAALHRLGSCGLAPGSTAVRQLMADPQFKELERMASV---TLGQFTPQHVGNAL 2384

Query: 291  WALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHS-APDLFSEL 346
            WA   +G   GE L      R+ EV    + E   QN++N     A +  S  P +  +L
Sbjct: 2385 WAFGTLGYHPGEPLLQGLTTRLLEV----LPEALPQNISNGLLGLAKLGWSPGPHVLDQL 2440

Query: 347  AKRASDIVHTFQEQELAQVLWAFASL 372
            A+ +   V  F  Q L   LWA A L
Sbjct: 2441 ARGSVGKVPEFNAQALVNTLWAMAHL 2466



 Score = 44.7 bits (104), Expect = 0.14,   Method: Composition-based stats.
 Identities = 46/154 (29%), Positives = 62/154 (40%), Gaps = 22/154 (14%)

Query: 330  GAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDNAFK 388
            G+ A  Q  A   F EL + AS  +  F  Q +   LWAF +L Y P +PLL+ L     
Sbjct: 2348 GSTAVRQLMADPQFKELERMASVTLGQFTPQHVGNALWAFGTLGYHPGEPLLQGL----- 2402

Query: 389  DATQFTCCLNKALSNCNENG--GVKSSG--------DADSEGSLSSPVLSFNRDQLGNIA 438
              T+    L +AL     NG  G+   G        D  + GS+   V  FN   L N  
Sbjct: 2403 -TTRLLEVLPEALPQNISNGLLGLAKLGWSPGPHVLDQLARGSVGK-VPEFNAQALVNTL 2460

Query: 439  WSYAVLGQ----MDRIFFSDIWKTISRFEEQRIS 468
            W+ A L      +    F    K I  F  Q ++
Sbjct: 2461 WAMAHLNYVHEGLQTAMFEQALKRILEFNPQNVA 2494



 Score = 40.0 bits (92), Expect = 3.4,   Method: Composition-based stats.
 Identities = 21/57 (36%), Positives = 31/57 (54%)

Query: 316  KVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
            +V     Q++AN+  A+ +++  AP LFS L       V  F EQEL+  +WA A L
Sbjct: 2637 RVWALRPQHIANLLWAYGTLEQPAPVLFSALLPTLLRRVAEFSEQELSNSVWAAARL 2693


>gi|429329946|gb|AFZ81705.1| hypothetical protein BEWA_011230 [Babesia equi]
          Length = 1089

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 78/386 (20%), Positives = 143/386 (37%), Gaps = 90/386 (23%)

Query: 195 KEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMM 254
           K +N+N + +  Q      ++ +++++A+      + L+P+N ATALHR+AK +   +  
Sbjct: 205 KWLNMNPNHIIIQQTIIKSKIPSQILSAITD--KHNQLNPINSATALHRLAKQIHPYN-- 260

Query: 255 TTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEM-------- 306
              R      +    L+++    +PE  +QG++NI W++ +I     +LS++        
Sbjct: 261 ---RHTILNHKSFGKLISVIEVHIPEFDSQGLTNILWSIVRIKITPTWLSQLLTQIDKNL 317

Query: 307 -----------------------------DRVAEVALTKVGEFNSQ-NVANVAGAFASMQ 336
                                         ++  +  T++  F +   +  V+   A   
Sbjct: 318 MVFNANELSSCLLSLSKVGIKNNESLELRSKLVALIRTRINGFKTPLELTCVSTGLARFN 377

Query: 337 HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCC 396
              P LF  ++++  D +  F   EL  V W+FA L              F D   F   
Sbjct: 378 VRDPILFGHISRQIIDSLDKFTMNELRGVAWSFAYL-------------GFNDRLLFANI 424

Query: 397 LNKALSNCNENG---------GVKSSGDADSEGSL--SSPVLSFNRDQL-----GNIAWS 440
            N   +N NE            +    +ADSE  L   SP++  N   L       IAW+
Sbjct: 425 RNFIENNANETNVKNVIRLAWALSKLKEADSELFLFTISPLIRSNISNLTCKDISTIAWA 484

Query: 441 Y----------------AVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLV 484
           +                A+  QM+ +   DI   ++ F     S +   + M    V + 
Sbjct: 485 FLNAEIEDCDLFNDLATALQHQMEEMTTHDITSCVATFSHIEASHRVLFNKMKTRAVEIS 544

Query: 485 NQCLKLEHPHLQLALSSVLEEKIASA 510
           N+   L+   +    S   +EK  S 
Sbjct: 545 NEFTPLQLAKIIRGFSYFSDEKFYSV 570


>gi|403221415|dbj|BAM39548.1| uncharacterized protein TOT_010001003 [Theileria orientalis strain
           Shintoku]
          Length = 418

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 55/104 (52%), Gaps = 1/104 (0%)

Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHT 580
           TS  Q ++++LL    L +  EY +  Y +D V+    VA E++G THF  N+      T
Sbjct: 315 TSKMQLKLSKLLDEIKLKYKSEYQLGPYRLDYVVPKLNVAIEVNGYTHFFHNSRELNALT 374

Query: 581 MLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIG 624
            LK + +   GWNVV +++  W+  +   ++L+YL   L  YI 
Sbjct: 375 QLKYKILKDMGWNVVGVNYYNWKN-RNKQDRLEYLIKELSPYIN 417


>gi|428177039|gb|EKX45921.1| hypothetical protein GUITHDRAFT_163172 [Guillardia theta CCMP2712]
          Length = 976

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 54/197 (27%), Positives = 90/197 (45%), Gaps = 28/197 (14%)

Query: 275 MTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFAS 334
           +  L E   Q +SNI W+ + +G     + ++    E+    + EF  Q+VAN   A+ +
Sbjct: 393 VPGLQEFKPQEVSNILWSYATVGFSSPTVFKL-LAFEILRRGLREFVPQDVANSVWAYVT 451

Query: 335 MQHSAPDLF----SELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESL--DNAFK 388
           +  S  +L     S+  +R    +  F+ QELA ++WAFA    P D LL  +  D A +
Sbjct: 452 VGQSTKELLHVVESDAERRG---LSAFKNQELANLIWAFAKADYPMDLLLRLVEQDIASR 508

Query: 389 DATQFTCCLNKALSNC----------NENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIA 438
           D + F   + + LSN           +E+  +K + +  S G     +  F   ++ N A
Sbjct: 509 DLSLF---MPQELSNLVWAFATAGHRSEHLFLKIASEISSRG-----LADFKPQEIANTA 560

Query: 439 WSYAVLGQMDRIFFSDI 455
           W+YA +G  D   F  I
Sbjct: 561 WAYAKIGVQDEKLFHRI 577



 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 75/183 (40%), Gaps = 24/183 (13%)

Query: 284 QGISNIAWALSKIGGELLYL-SEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
           Q +SN  WA +  G    +L  E++R  E     +  F+ Q+++N+  AFA   H AP L
Sbjct: 631 QELSNTVWAHASNGLTFPFLFGEVER--EAVRRGLRLFSPQDISNMLWAFAKADHVAPSL 688

Query: 343 FSELAKRASDI------VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFT-- 394
           + +L     ++      +  F+ QEL+ +LWA A     A  L  + +   K   ++   
Sbjct: 689 YEQLRANLEELRVADPGLTMFKAQELSNLLWAAAKTQHTARCLFSAAEEQVKQILKYAES 748

Query: 395 ------CCLNKALSNCNENGGVKSSGDAD-------SEGSLSSPVLSFNRDQLGNIAWSY 441
                  C    L   ++     S G           E  L+  +++F    L NIAW+ 
Sbjct: 749 REERDETCAVVPLEVTDDMWRFASVGQTAEELFATLEEQVLTRDLMTFTTLHLANIAWAI 808

Query: 442 AVL 444
             L
Sbjct: 809 VFL 811



 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 11/104 (10%)

Query: 284 QGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           Q +SN+ WA +  G   E L+L      +E++   + +F  Q +AN A A+A +      
Sbjct: 516 QELSNLVWAFATAGHRSEHLFL---KIASEISSRGLADFKPQEIANTAWAYAKIGVQDEK 572

Query: 342 LFSELAKRASDIVH----TFQEQELAQVLWAFASLYEPADPLLE 381
           LF  +      I+H     F  QEL+ +LW+FA     +D L +
Sbjct: 573 LFHRIEMEL--ILHRSLRPFIPQELSNILWSFAKFNIASDKLFQ 614



 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 51/210 (24%), Positives = 80/210 (38%), Gaps = 48/210 (22%)

Query: 273 IAMTALPECSAQGISNIAWALSKIGGE---LLYLSEMDRVAEVALTKVGEFNSQNVANVA 329
           I+   L +   Q I+N AWA +KIG +   L +  EM+ +   +L     F  Q ++N+ 
Sbjct: 543 ISSRGLADFKPQEIANTAWAYAKIGVQDEKLFHRIEMELILHRSLRP---FIPQELSNIL 599

Query: 330 GAFASMQHSAPDLF----SELAKRASDIVHTFQEQELAQVLWAFAS-------------- 371
            +FA    ++  LF     E+  R    +  F+ QEL+  +WA AS              
Sbjct: 600 WSFAKFNIASDKLFQVIGQEMLVRG---LQGFKPQELSNTVWAHASNGLTFPFLFGEVER 656

Query: 372 --------LYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLS 423
                   L+ P D  + ++  AF  A      L + L    E   V   G         
Sbjct: 657 EAVRRGLRLFSPQD--ISNMLWAFAKADHVAPSLYEQLRANLEELRVADPG--------- 705

Query: 424 SPVLSFNRDQLGNIAWSYAVLGQMDRIFFS 453
             +  F   +L N+ W+ A      R  FS
Sbjct: 706 --LTMFKAQELSNLLWAAAKTQHTARCLFS 733


>gi|307111199|gb|EFN59434.1| hypothetical protein CHLNCDRAFT_49989 [Chlorella variabilis]
          Length = 1328

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 51/88 (57%), Gaps = 3/88 (3%)

Query: 287 SNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH--SAPDLFS 344
           SN+ WAL+  G E L    +DR+A     ++  F  Q++AN+A A+A++ H  +AP    
Sbjct: 777 SNVLWALASEG-EALPGEALDRIAANLAPRLKSFGPQSLANIAWAYATLGHHPAAPHFLR 835

Query: 345 ELAKRASDIVHTFQEQELAQVLWAFASL 372
           +LA  A   +  F+ Q L+ ++W+ ASL
Sbjct: 836 QLAHAAQRCLPVFEPQGLSLLVWSLASL 863



 Score = 43.1 bits (100), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 44/90 (48%), Gaps = 7/90 (7%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVAL---TKVGEFNSQNVANVAGAFASMQHSAP 340
           Q ++N+ W + ++G    Y      +  VAL    +V +   Q + N+  AFA + +   
Sbjct: 891 QHLANLVWGMCRVG----YCPAQRFLEAVALEVQLRVCDLKPQELFNIVWAFAQLGYHPA 946

Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFA 370
            LF  +A  A+    +F  QEL+ +LWA A
Sbjct: 947 CLFDAVALEAAPQAVSFSPQELSGMLWALA 976



 Score = 39.7 bits (91), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 43/184 (23%), Positives = 69/184 (37%), Gaps = 42/184 (22%)

Query: 291 WALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRA 350
           W+ +++G     L  ++  A  A  ++  F    +A VA + A ++  AP +      + 
Sbjct: 707 WSFARMGTSSRKL--LETAAACAEQQLAAFTPAQLAKVAWSLAKLRWPAPRVLRHAGAQL 764

Query: 351 SDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGV 410
           ++    F ++E + VLWA AS  E A P                    +AL     N   
Sbjct: 765 AERTAAFNDKEASNVLWALASEGE-ALP-------------------GEALDRIAAN--- 801

Query: 411 KSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMD------RIFFSDIWKTISRFEE 464
                      L+  + SF    L NIAW+YA LG         R       + +  FE 
Sbjct: 802 -----------LAPRLKSFGPQSLANIAWAYATLGHHPAAPHFLRQLAHAAQRCLPVFEP 850

Query: 465 QRIS 468
           Q +S
Sbjct: 851 QGLS 854


>gi|308798807|ref|XP_003074183.1| unnamed protein product [Ostreococcus tauri]
 gi|116000355|emb|CAL50035.1| unnamed protein product [Ostreococcus tauri]
          Length = 525

 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 64/269 (23%), Positives = 117/269 (43%), Gaps = 38/269 (14%)

Query: 198 NLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAK-NMEKVSMMTT 256
           +L  D++DA   + +L  + E      K         +N +TALHR+A+   ++V   T 
Sbjct: 140 DLQGDLMDASDVEVILTTVEEQEEVFNK---------VNASTALHRVARLATQRVPGQTK 190

Query: 257 H---RLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVA 313
               R A         L+ +      E S QG+SN+ WAL+++   +   + +D ++  A
Sbjct: 191 PSLDRAALLGDERFQTLMNMVDRMAGEMSMQGVSNVLWALARLEYPVQE-TLLDALSARA 249

Query: 314 LTKVGEFNSQNVANVAGAFASMQHSA-PDLFSELAKRASDIVHTFQEQELAQVLWAFA-- 370
            T+      +N++    A A++ H     L   +A +A  +V  F+  ++  +LWA+A  
Sbjct: 250 ATQASSAEPKNLSTTLWALAALGHKPRSKLLKAIADQALIVVDDFRAPDVVNMLWAYARW 309

Query: 371 SLYEPAD----PLLES-LDNAFKDATQFT------CCLNKALSNCNENGGVKSSGDADSE 419
           S Y P      P++++ LD A      +T       C + A+ +C  +  V        E
Sbjct: 310 SRYLPPSDRPMPVVQAMLDQAVHTMQSYTPYQLANLCWSLAMLDCPPSPRVL-------E 362

Query: 420 GSLSSPVL---SFNRDQLGNIAWSYAVLG 445
             L +  L     +   L ++ W+Y V+G
Sbjct: 363 YILQTVALEPGKLDGTALTHVLWAYGVMG 391


>gi|397612109|gb|EJK61607.1| hypothetical protein THAOC_17877, partial [Thalassiosira oceanica]
          Length = 728

 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 73/246 (29%), Positives = 114/246 (46%), Gaps = 35/246 (14%)

Query: 235 LNIATALHRIAK-NMEKVSMMTTH--RLAFTRQREMSMLV-AIAMTA---LPECSAQGIS 287
           L IA  + ++++ N +  +    H  R  F ++ + S +  +IA +A   L E  A+ +S
Sbjct: 491 LGIAKTISQVSRGNQQYRADDPRHVIRRLFVKESQCSPIFDSIASSAVGMLNEFEARHLS 550

Query: 288 NIAWALS------KIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           N+ ++         IGGE L+    +   E A+  +  FNSQ+++N+  AF  +      
Sbjct: 551 NLIYSFGLVERNPDIGGETLF----NVFGEAAVKILHTFNSQDISNMLWAFVKVDAKNSR 606

Query: 342 LFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPADP-LLESLDNAFKDATQFTCCLNK 399
           LF E     S + + +F+ QELA +LW+FA   E ADP L   L N    A +      +
Sbjct: 607 LFQETGGVISGMDLDSFKPQELANILWSFAKSGE-ADPELFRVLGNHIV-ARRLNDFQPQ 664

Query: 400 ALSNCN---ENGGV------KSSGDADSE-GSLSSPVLSFNRDQLGNIAWSYAVLGQMDR 449
            LSN        GV      K  GD  +  GSL+S    F    L NIAW++A  G++  
Sbjct: 665 HLSNIAWAFATAGVSHPILFKKIGDHIAGLGSLNS----FEPQALSNIAWAFASAGKLHP 720

Query: 450 IFFSDI 455
             F  I
Sbjct: 721 KLFKKI 726


>gi|71029704|ref|XP_764495.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68351449|gb|EAN32212.1| hypothetical protein TP04_0858 [Theileria parva]
          Length = 234

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 39/65 (60%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           + +A E+DGP+HF  NT     +T LK R +   G+ V+ +   EW  L+G+ E+ +Y+R
Sbjct: 133 RPIAIEVDGPSHFYSNTTKYTAYTKLKHRLLTRMGYKVLHVPFFEWRRLRGAREREEYMR 192

Query: 617 VILKD 621
             LK+
Sbjct: 193 AKLKE 197


>gi|84997531|ref|XP_953487.1| hypothetical protein [Theileria annulata strain Ankara]
 gi|65304483|emb|CAI76862.1| hypothetical protein, conserved [Theileria annulata]
          Length = 235

 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 39/65 (60%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           + +A E+DGP+HF  NT     +T LK R +   G+ V+ +   EW  L+G+ E+ +Y+R
Sbjct: 134 RPIAIEVDGPSHFYSNTTKYTAYTKLKHRLLTRMGYKVLHVPFFEWRRLRGAREREEYMR 193

Query: 617 VILKD 621
             LK+
Sbjct: 194 AKLKE 198


>gi|302830696|ref|XP_002946914.1| hypothetical protein VOLCADRAFT_86990 [Volvox carteri f.
           nagariensis]
 gi|300267958|gb|EFJ52140.1| hypothetical protein VOLCADRAFT_86990 [Volvox carteri f.
           nagariensis]
          Length = 1130

 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 50/194 (25%), Positives = 96/194 (49%), Gaps = 20/194 (10%)

Query: 190 PSNRRKEIN--LNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKN 247
           PS R + ++      I+ AQ+ QE LE +A + +        S  + ++++  + R+ K 
Sbjct: 291 PSTRDRALSHFFTATIMGAQSWQE-LEALARVHS--------SSFNHVHVSALVCRLPKV 341

Query: 248 MEKVSMMTTHRLAFTR-QREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEM 306
           +  V +  + +  F+R  R++S LV I ++A      + I+N+ W +SK+G        +
Sbjct: 342 VNPVELSKSEKTQFSRFLRDVSDLVTIRLSAF---DPRAIANVLWGVSKLGYSPAP-PTL 397

Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASM----QHSAPDLFSELAKRASDIVHTFQEQEL 362
           ++    A  ++ +FN+Q +AN+A A A++        P    +    A   V   + QEL
Sbjct: 398 NKFLFEAYVRMYDFNAQELANLAWALATLASLGNRPVPMWLRKYTLAAVPRVLDLKPQEL 457

Query: 363 AQVLWAFASLYEPA 376
           A ++WA + L+ PA
Sbjct: 458 AHMVWALSKLFPPA 471


>gi|399218609|emb|CCF75496.1| unnamed protein product [Babesia microti strain RI]
          Length = 263

 Score = 55.5 bits (132), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 38/63 (60%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           + +A E+DGP+HF  N+     +T LK R +   G+ V+ + + EW  L+G+ E+ DY+R
Sbjct: 151 RPIAIEVDGPSHFYANSTNYTAYTKLKHRLLTRMGYKVLHVPYFEWRRLRGAREREDYMR 210

Query: 617 VIL 619
             L
Sbjct: 211 AKL 213


>gi|294956189|ref|XP_002788845.1| hypothetical protein Pmar_PMAR004305 [Perkinsus marinus ATCC 50983]
 gi|239904457|gb|EER20641.1| hypothetical protein Pmar_PMAR004305 [Perkinsus marinus ATCC 50983]
          Length = 1040

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/145 (25%), Positives = 80/145 (55%), Gaps = 12/145 (8%)

Query: 232 LSPLNIATALHRIA---KNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISN 288
           L+ +N++T +HR+A   +N E+      ++ A  +   +  ++  A+  +   S Q +SN
Sbjct: 578 LNSVNVSTLIHRLASLTQNQEQ------NQRALAKDARVKQVLRRAIELVSTSSCQSLSN 631

Query: 289 IAWALSKIGGELLYLSEMDR-VAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELA 347
           I WA+ K+  +++  +E+ R + E A T++  F  QN +N+    + + +   +L   +A
Sbjct: 632 ICWAIGKL--QMVEETEVVRAIVEAAKTRLHHFRPQNFSNMLYGLSRVGYCDRELMDLVA 689

Query: 348 KRASDIVHTFQEQELAQVLWAFASL 372
           K  ++ + TF+ QE++ +L+A+  L
Sbjct: 690 KEVANSLATFKPQEVSNLLYAYGRL 714


>gi|397618909|gb|EJK65091.1| hypothetical protein THAOC_14102, partial [Thalassiosira oceanica]
          Length = 235

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 68/267 (25%), Positives = 111/267 (41%), Gaps = 57/267 (21%)

Query: 357 FQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDA 416
           F+ QE+A  LWA A++      L           + F   +   L   NE G        
Sbjct: 12  FKAQEVANFLWACATVGHTDQRLF----------SAFAPVIASKLDKLNEQG-------- 53

Query: 417 DSEGSLSSPVLSFNRDQLGNIAWSYAV--LGQMDRIFFSDIWKTISRFEEQRISEQYRED 474
                            L NI W+Y+V  L + D +F       ++  E+    E+  + 
Sbjct: 54  -----------------LSNITWAYSVANLPRQD-LFNKGYVGALASNEKVFSGEELAQL 95

Query: 475 IMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVS 534
             +      +   ++L+ P         L+ K  +A  ++ +++   S  Q +V   L +
Sbjct: 96  HQWQLWQQELESGIELQGP---------LQAKCRNAFTSREYSE---SKLQNDVVDELKA 143

Query: 535 TGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAA 590
            GL    E  +  GY +DA++     +KVA E+DGP+HF      P G T+LK R +A  
Sbjct: 144 AGLVLDEEVLLGSGYRIDALVEFSDGRKVAVEVDGPSHFIDRR--PAGSTILKHRQVAKM 201

Query: 591 GW-NVVSLSHQEWEELQGSFEQLDYLR 616
               VVS+ + EW+EL+ S  +  YLR
Sbjct: 202 DHIKVVSVPYWEWDELKNSEMKQRYLR 228


>gi|294933217|ref|XP_002780656.1| hypothetical protein Pmar_PMAR001249 [Perkinsus marinus ATCC 50983]
 gi|239890590|gb|EER12451.1| hypothetical protein Pmar_PMAR001249 [Perkinsus marinus ATCC 50983]
          Length = 401

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 51/197 (25%), Positives = 86/197 (43%), Gaps = 36/197 (18%)

Query: 199 LNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHR 258
           +NK I+ ++T +E+L+VIAE +            + +NI TAL+++A             
Sbjct: 78  INKQILQSETLEELLDVIAEALNW---------FNIVNIGTALYKLASLALADQSQAAKS 128

Query: 259 LAFTRQ--REMSML--VAIAMTALPE--------------------C-SAQGISNIAWAL 293
            AF R+  R +  L  +A  ++ + E                    C S + ++NI WA+
Sbjct: 129 KAFLRKDNRYIGFLDEIANVLSYVDEPAGIESGNGSGKLVRDVKSACFSPKELANIVWAV 188

Query: 294 SKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI 353
           + IG     L E+  VA   +  +  F+S N++     FA M    P+LF   A    D+
Sbjct: 189 THIGLPHRRLYEL--VARHIIWYIDHFDSVNLSLALWGFAKMDVCCPELFRAAASVIIDM 246

Query: 354 VHTFQEQELAQVLWAFA 370
           +  F+   L    WAF+
Sbjct: 247 IDAFEPHRLCNTAWAFS 263


>gi|145347161|ref|XP_001418044.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578272|gb|ABO96337.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 753

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 78/372 (20%), Positives = 151/372 (40%), Gaps = 32/372 (8%)

Query: 274 AMTALPECSAQGISNIAWALS--KIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGA 331
           A+  + E SA+ +SN+ +      + G  ++   M  V++    K+ EF    +  V  A
Sbjct: 389 AIDKIEEASAKNLSNLLYGFGTLNLAGLGVFTHAMFCVSQ----KLEEFTPVGIFMVCSA 444

Query: 332 FASMQH-SAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLE-------- 381
            AS  +   P +  +   +     H F+ Q+  + L  FA L Y  AD   +        
Sbjct: 445 LASSNYDPGPQMMLQFENKLMKSAHAFESQDFTEFLRVFARLRYMLADETFDFIGVSSAK 504

Query: 382 SLDNAFKDATQFTCCLNKALSNCNE-NGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWS 440
           +LD    D+ + +  L    + C + +  + +  + +  GS S     F         WS
Sbjct: 505 TLDRF--DSYRISMTLWSHATLCAQPHDALLARIEDEIRGSASQ----FKPQNFVLALWS 558

Query: 441 YAVLGQMD--RIFFSDIWKTISRFEEQRI-SEQYREDIMFASQVHLVNQCLKLEHPHLQL 497
             +LG ++  R     +   + + +   + S +  ED    S        +      L L
Sbjct: 559 LVLLGSLEDARDSVVRVLHALVKLQGGALTSSEDLEDAQLCSLYMARLTSMGKPFEELIL 618

Query: 498 ALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGL-NWIREYAVDGYTV--DAVL 554
            ++  + ++   A    +      S  Q  +  +L   G  ++  E  V+G  +  D V 
Sbjct: 619 GVTDGVADECERAWLRAKAQDPTISKVQHHIGEVLREIGAQDFEVEALVEGGKIRSDIVF 678

Query: 555 VDKKVAFEIDGPTHFSRNTG---VPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQ 611
            + ++  E+DGP H+SR+       LG T+++   + + GW VV + + +W ++    E+
Sbjct: 679 PNSRIVVEVDGPHHYSRDASGRLRELGQTVMRNNLLKSWGWRVVIVPYADWGDMLTIEEK 738

Query: 612 LDYLRVILKDYI 623
             YLR +L D +
Sbjct: 739 ASYLRSLLGDEV 750


>gi|429329938|gb|AFZ81697.1| RAP domain-containing protein [Babesia equi]
          Length = 237

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 39/65 (60%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           + +A E+DGP+HF  N+     +T LK R +   G+ V+ +   EW  L+G+ E+ +Y+R
Sbjct: 136 RPIAIEVDGPSHFYSNSTKYTAYTKLKHRILTRMGYKVLHVPFFEWRRLRGAKEREEYMR 195

Query: 617 VILKD 621
             LK+
Sbjct: 196 AKLKE 200


>gi|323447941|gb|EGB03846.1| hypothetical protein AURANDRAFT_72645 [Aureococcus anophagefferens]
          Length = 5282

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 32/85 (37%), Positives = 44/85 (51%), Gaps = 3/85 (3%)

Query: 522  SSFQKEVARLLVSTGLNWIREYAVDG--YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGH 579
            S  Q+ V+++L   G     E  +DG   T DA  VD +VA E DGP H+  +     G 
Sbjct: 3823 SRAQESVSQVLRECGFAHEMEVDLDGTGLTADAADVDARVAVEYDGPQHYLADR-TQTGR 3881

Query: 580  TMLKRRYIAAAGWNVVSLSHQEWEE 604
            T  K R + A GW +V +SH  WE+
Sbjct: 3882 TRFKHRLVRALGWRLVVVSHYGWEQ 3906


>gi|397580099|gb|EJK51452.1| hypothetical protein THAOC_29372 [Thalassiosira oceanica]
          Length = 221

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 54/193 (27%), Positives = 87/193 (45%), Gaps = 20/193 (10%)

Query: 430 NRDQLGNIAWSYAVLGQMDRIFFSDIW-KTISRFEEQRISEQYREDIMFASQVHLVNQCL 488
           N+  L  IAWSYAV     +  F+ ++   ++ +E    +E   +   +      +   +
Sbjct: 34  NKQGLATIAWSYAVANVPRQDLFNQVFIGALAAYENVFSTEDLFQLHQWQLWQQEIGSGM 93

Query: 489 KLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DG 547
           +L       +L         SA  ++       S  Q +V   L + GL+   +  +  G
Sbjct: 94  ELPQ-----SLGGKCRNAFTSASYSE-------SKLQNDVVDELKAAGLDLDEKVLLGSG 141

Query: 548 YTVDAVL-VD--KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQEWE 603
           Y VDA++ VD  K VA E+DGP HF +    P+G T LK R +       VVS+ + EW 
Sbjct: 142 YRVDALVKVDDGKSVAIEVDGPFHFIQRR--PMGSTTLKHRQVGKLDRIEVVSVPYWEWN 199

Query: 604 ELQGSFEQLDYLR 616
           EL+ S  + +YL 
Sbjct: 200 ELKNSLTKQNYLH 212


>gi|255075859|ref|XP_002501604.1| predicted protein [Micromonas sp. RCC299]
 gi|226516868|gb|ACO62862.1| predicted protein [Micromonas sp. RCC299]
          Length = 953

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 49/164 (29%), Positives = 80/164 (48%), Gaps = 28/164 (17%)

Query: 234 PLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWAL 293
           P++ ATA+HRIA + +  +     R + T     + L+ +    L   +AQG++N+AWA 
Sbjct: 439 PIHTATAIHRIATHTKGDAT----RESVTSSPSFAALMDLVRANLGGMNAQGLANVAWAC 494

Query: 294 SKI----GGELL--YLSEMDR--VAEVALTKVG------EFNSQNVANVAGAFASMQHSA 339
           +++    G +LL    + ++R   A+   TK G      E   Q V+N+  A  S++H  
Sbjct: 495 ARLDHSPGADLLDDITAGLERELTAKPPATKGGRAAKAREVKPQAVSNMVWALGSLRHRP 554

Query: 340 PD-----LFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPAD 377
            D     +FS +A R  D    F+ QEL  V+   A + Y P D
Sbjct: 555 SDECLASIFSAVAPRLRD----FRAQELTNVVLGAAHMEYVPGD 594


>gi|397598419|gb|EJK57213.1| hypothetical protein THAOC_22770, partial [Thalassiosira oceanica]
          Length = 998

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 48/214 (22%), Positives = 86/214 (40%), Gaps = 41/214 (19%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
           Q I+N+ W+ +K G  +  L +        L  +G F  QN++N A AFA+       LF
Sbjct: 794 QHIANVLWSFAKSGEVVPELFQALGNHISGLDSLGSFKPQNLSNTAWAFATAGELHTKLF 853

Query: 344 SELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
           +++    +  D +++F++Q L+ + WAFA+  E    L + +            CL+   
Sbjct: 854 NKIGDHVTGLDSLNSFEQQSLSNIAWAFAAAGESNPGLFKKIGGHVAG----LMCLD--- 906

Query: 402 SNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISR 461
                                     SFN   L  + W+++  G+      SD++K +  
Sbjct: 907 --------------------------SFNPQNLSLLVWAFSTAGES----HSDLFKRVGD 936

Query: 462 FEEQRISEQYREDIMFASQVHLVNQCLKLEHPHL 495
               RISE +R   +  +         ++ HP L
Sbjct: 937 HIVARISEDFRPQTL--ANTAWAFATAEVSHPEL 968



 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 55/207 (26%), Positives = 91/207 (43%), Gaps = 26/207 (12%)

Query: 268 SMLVAIAMTA---LPECSAQGISNIAWALS------KIGGELLYLSEMDRVAEVALTKVG 318
           S+  +IA +A   L E  A+ +SN+ ++         IG + L+    +   E A+  + 
Sbjct: 696 SIFDSIASSAAGMLNEFEARHLSNLIYSFGLVERNPDIGEKTLF----NVFGEAAVKILN 751

Query: 319 EFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPAD 377
            FNSQ+++N+  AF  +      LF E     S + + +F+ Q +A VLW+FA   E   
Sbjct: 752 TFNSQDISNMLWAFVKVDAKNSRLFHETGGVISGMDLDSFEPQHIANVLWSFAKSGEVVP 811

Query: 378 PLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGD---------ADSEGSLSSPVLS 428
            L ++L N             + LSN        ++G+          D    L S + S
Sbjct: 812 ELFQALGNHISGLDSLGSFKPQNLSNTAW--AFATAGELHTKLFNKIGDHVTGLDS-LNS 868

Query: 429 FNRDQLGNIAWSYAVLGQMDRIFFSDI 455
           F +  L NIAW++A  G+ +   F  I
Sbjct: 869 FEQQSLSNIAWAFAAAGESNPGLFKKI 895



 Score = 42.0 bits (97), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 5/89 (5%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGE-FNSQNVANVAGAFASMQHSAPDL 342
           Q +S + WA S  G     L    RV +  + ++ E F  Q +AN A AFA+ + S P+L
Sbjct: 911 QNLSLLVWAFSTAGESHSDL--FKRVGDHIVARISEDFRPQTLANTAWAFATAEVSHPEL 968

Query: 343 FSELAKRASDI--VHTFQEQELAQVLWAF 369
           F+++    + +  + +F  Q L+   WAF
Sbjct: 969 FNKIGGHIAGLSTLGSFDPQALSISAWAF 997


>gi|403223561|dbj|BAM41691.1| conserved hypothetical protein [Theileria orientalis strain
           Shintoku]
          Length = 1133

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 66/314 (21%), Positives = 124/314 (39%), Gaps = 73/314 (23%)

Query: 196 EINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMT 255
            I + +D++ ++ + ++L  I + +           ++ +N++TA+HR+AK         
Sbjct: 258 HILIQQDLLKSKNSTQILSTIGDKL---------GQMNAVNVSTAIHRLAKYSSPY---- 304

Query: 256 THRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKI------------------- 296
            +R A         LV++    + +   QG++NI W+++K+                   
Sbjct: 305 -NRYAVCNHESFGKLVSLVGDHMLQFDPQGLTNIFWSITKLRITPNWISCLLEQINIHAN 363

Query: 297 -------GGELLYLSEMDRVAEVALT-----------KVGEFNSQ-NVANVAGAFASMQH 337
                     L  +S++ R  +V+L            K+ +F    ++  V+ A A +  
Sbjct: 364 SLNANELANCLFCISKLTRADDVSLELRFKILSLVQDKITQFRRPLDLTCVSTALARLNV 423

Query: 338 SAPDLFSELAKRASDIVHTFQEQELAQVLWAFAS-------LYEPADPLLESLDNAFKDA 390
             P LF  ++ +    +  F+ QE+  V WA+AS       L+      +ES  NA    
Sbjct: 424 RNPVLFGHISSQVLSSLEEFKIQEICGVAWAYASLGFTDRILFGKIKQFIES--NADSSN 481

Query: 391 TQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVL-----SFNRDQLGNIAWSYAVLG 445
                 L  ALS   +        D D      SP++     S +   +  IAW+Y   G
Sbjct: 482 IGNIVHLAWALSKIKQ-------ADTDFFLYTISPLVRGHLQSLSCKHMTTIAWAYVNAG 534

Query: 446 QMDRIFFSDIWKTI 459
             D+  F+DI  T+
Sbjct: 535 IEDQDLFNDIANTL 548


>gi|302828620|ref|XP_002945877.1| hypothetical protein VOLCADRAFT_86282 [Volvox carteri f.
           nagariensis]
 gi|300268692|gb|EFJ52872.1| hypothetical protein VOLCADRAFT_86282 [Volvox carteri f.
           nagariensis]
          Length = 1644

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 41/122 (33%), Positives = 59/122 (48%), Gaps = 12/122 (9%)

Query: 265 REMSMLVAIAMTALPEC---SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFN 321
           ++ SM+ A   TALP+    +A G+SN+ WA +        L +    A +      E N
Sbjct: 596 QDRSMISAAVQTALPQLRRFNASGLSNLLWACATAQCHCEELFD-GAAAALMALPPHEMN 654

Query: 322 SQNVANVAGAFASMQHSAPDLFSELAK--------RASDIVHTFQEQELAQVLWAFASLY 373
            Q+VAN A A A +QH+ P+L + LA+          +  +     QELA  LWAFA L 
Sbjct: 655 CQDVANTAWACAKLQHNHPELMAHLARLVLAAAEAPGATGLRGANTQELANTLWAFAVLP 714

Query: 374 EP 375
            P
Sbjct: 715 LP 716



 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 31/130 (23%), Positives = 62/130 (47%), Gaps = 5/130 (3%)

Query: 246 KNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGG----ELL 301
           +  ++ +   + R A +    ++ LV+  +T LP  +A+  +N+ WAL  +G     ELL
Sbjct: 465 RQGQRTAASASPRTAQSSAALLADLVSGFLTQLPHYTARQYANVVWALGSMGSREHTELL 524

Query: 302 YLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQE 361
           + + +   A+    K+     Q ++N+A   A + +    L+  +       +H F+ QE
Sbjct: 525 HAAAVQLQAQGG-AKLFAAPPQELSNLALGLAKLGYREVSLWGAIIAAGKARLHEFKPQE 583

Query: 362 LAQVLWAFAS 371
           L  + WA A+
Sbjct: 584 LHNMAWAVAA 593


>gi|403223568|dbj|BAM41698.1| uncharacterized protein TOT_040000079 [Theileria orientalis strain
           Shintoku]
          Length = 229

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 39/65 (60%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           + +A E+DGP+HF  N+     +T LK R +   G+ V+ +   EW  L+G+ E+ +Y+R
Sbjct: 128 RPIAIEVDGPSHFYSNSTKYTAYTKLKHRLLTRMGYKVLHVPFFEWRRLRGAREREEYMR 187

Query: 617 VILKD 621
             LK+
Sbjct: 188 EKLKE 192


>gi|156089343|ref|XP_001612078.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154799332|gb|EDO08510.1| hypothetical protein BBOV_III009540 [Babesia bovis]
          Length = 1171

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/215 (21%), Positives = 88/215 (40%), Gaps = 52/215 (24%)

Query: 196 EINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMT 255
            I L + I+  +++ +VL  I + +T          L+ +N ATALHRIA++    S   
Sbjct: 317 HIVLQQSILKCKSSSQVLAAIQDKVTK---------LNAVNAATALHRIARHTTSYS--- 364

Query: 256 THRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWAL---------------------- 293
             R   T     + L++     +     QG++N+ W++                      
Sbjct: 365 --RYTLTGNNTFAQLLSAVEAHIATLDPQGVTNVLWSIVKLRIHPQWMDSLLVTMQKHVK 422

Query: 294 ----SKIGGELLYLSEMDRVAEVAL-----------TKVGEFNSQ-NVANVAGAFASMQH 337
               S++   L  +S++  ++   +            KV  F +  ++  VA A A +  
Sbjct: 423 ELGTSELASSLFAVSKLATMSTAGIDLRDMLLGTVQEKVTHFRTPLDITCVATALARLNV 482

Query: 338 SAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
             P +FS+L+     ++  F  Q+L  + WA+ASL
Sbjct: 483 RNPVIFSQLSAAVLAVIDDFAMQQLCGIAWAYASL 517


>gi|124810335|ref|XP_001348847.1| RAP protein, putative [Plasmodium falciparum 3D7]
 gi|23497748|gb|AAN37286.1| RAP protein, putative [Plasmodium falciparum 3D7]
          Length = 532

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 40/65 (61%)

Query: 556 DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
           D+ +A E+DGP+HF  N+     +T LK R +   G+NV+ +S+ +W +L+   E+ +++
Sbjct: 430 DRPIAIEVDGPSHFYANSNRYTTYTKLKHRILTKLGYNVIHISYIDWRKLRNKSEREEFI 489

Query: 616 RVILK 620
              LK
Sbjct: 490 LKKLK 494


>gi|397638616|gb|EJK73140.1| hypothetical protein THAOC_05252, partial [Thalassiosira oceanica]
          Length = 643

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 47/165 (28%), Positives = 75/165 (45%), Gaps = 40/165 (24%)

Query: 303 LSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTF 357
           L   D +A  A+  + EF++++++N+  +F  ++++ PD     LF+   + A  I+HTF
Sbjct: 408 LPIFDSIARSAVDMLNEFDARHLSNLVYSFGLVEYN-PDIGGETLFNVFGEAAGKILHTF 466

Query: 358 QEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDAD 417
           + QEL+ +LWAF  +                DA       N  L   +E GGV S  D D
Sbjct: 467 KPQELSNMLWAFVKV----------------DAD------NSRL--FHETGGVISGMDLD 502

Query: 418 SEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRF 462
                     SF   +L NI WS+A  G+     F  +   I+R 
Sbjct: 503 ----------SFKPQELANIIWSFAKSGESGPELFQALGNHIARL 537



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 50/93 (53%), Gaps = 6/93 (6%)

Query: 283 AQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
            Q ++NI W+ +K G  G  L+ +  + +A   L  +  F  Q+++N A AFA+   S P
Sbjct: 506 PQELANIIWSFAKSGESGPELFQALGNHIAR--LNSLDPFKPQDLSNTAWAFATAGVSHP 563

Query: 341 DLFSELAKRAS--DIVHTFQEQELAQVLWAFAS 371
           +LF ++    +  D   +F+ Q L+   WAFA+
Sbjct: 564 ELFKKIGNHGAGQDRFDSFKPQNLSNTAWAFAT 596



 Score = 43.5 bits (101), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 31/123 (25%), Positives = 54/123 (43%), Gaps = 3/123 (2%)

Query: 283 AQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
            Q +SN+ WA  K+  +   L   +    ++   +  F  Q +AN+  +FA    S P+L
Sbjct: 468 PQELSNMLWAFVKVDADNSRLFH-ETGGVISGMDLDSFKPQELANIIWSFAKSGESGPEL 526

Query: 343 FSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKA 400
           F  L    + +  +  F+ Q+L+   WAFA+       L + + N      +F     + 
Sbjct: 527 FQALGNHIARLNSLDPFKPQDLSNTAWAFATAGVSHPELFKKIGNHGAGQDRFDSFKPQN 586

Query: 401 LSN 403
           LSN
Sbjct: 587 LSN 589


>gi|221059023|ref|XP_002260157.1| RAP protein [Plasmodium knowlesi strain H]
 gi|193810230|emb|CAQ41424.1| RAP protein, putative [Plasmodium knowlesi strain H]
          Length = 424

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 44/72 (61%), Gaps = 1/72 (1%)

Query: 551 DAVLVDKK-VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSF 609
           D++  D + +A E+DGP+HF  N+     +T LK R +   G+NV+ +S+ +W +L+   
Sbjct: 316 DSIFADNRPIAIEVDGPSHFYANSNRYTTYTKLKHRILTKLGYNVIHISYIDWRKLRNKS 375

Query: 610 EQLDYLRVILKD 621
           E+ +++   LK+
Sbjct: 376 EREEFILKKLKE 387


>gi|223999221|ref|XP_002289283.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974491|gb|EED92820.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 837

 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 107/454 (23%), Positives = 168/454 (37%), Gaps = 112/454 (24%)

Query: 249 EKVSMMTTHRLAFTRQREMSMLVAIAMTA-----LPECSAQGISNIAWAL-------SKI 296
           E  +MMT      TR+ + SM     +T        +   + +  IAWAL       + +
Sbjct: 391 ESDAMMTFLAKEATRRIKFSMEAPPTLTGGKRNQFCKLLPRDVVQIAWALGTMESDNASV 450

Query: 297 GGELLYLSEMDRVAEVALTKVGEFN-------SQNVANVAGAFASMQHSAPD-------L 342
           G  L+YL  +D V E  +      N       S   A++     ++ H   D       +
Sbjct: 451 GDALVYL--VDAVNEYWIADSNSSNERHRQIKSWKCADLVQMATALSHGRLDNQSVLTAI 508

Query: 343 FSELAKR-ASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
           + E  +R  S     F   E++ +LWA A LY     L     + F+   +FT    + L
Sbjct: 509 YEESLERIQSSSPGKFSTSEISILLWAQARLY-----LTSKYGSVFQ---EFTGAAARTL 560

Query: 402 SNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISR 461
                 G      D       + P +     +  N+AWS  VLG  D    SD+   +  
Sbjct: 561 MQ-QMKGKANQHSDERLLPPATLPKMGLRSQEQANLAWSLTVLGHYD----SDVVALL-- 613

Query: 462 FEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGK--------- 512
                      ++I+ A+     +  ++LEH H       +L E   +A +         
Sbjct: 614 -----------QNIVHAASSS-GDGVIQLEHAHQLWQSYFLLSEDCPAAVEFVPAEFSQF 661

Query: 513 -TKRFNQKVTSSFQKEVARLLVSTGLNWIR-----EYAVDGYTVDAVLVDK--------- 557
             K++N +     Q       +S  L  +R     EY  D   VD  +V +         
Sbjct: 662 LEKKWNIEKNRGKQSSSRHRTISQTLELMRVAHRNEYDED---VDVAIVLQEDSSWTHTA 718

Query: 558 -----------KVAFEIDGPTHFS--RNTG------------VP--LGHTMLKRRYIAAA 590
                      KVA E DGP HF+   +TG             P  LGHT+LK R +   
Sbjct: 719 QKDLDNQEGRVKVAVEFDGPFHFTVMASTGKDLTMIENGVKIAPRVLGHTVLKYRLLKKK 778

Query: 591 GWNVVSLSHQEWEELQ--GSFEQLDYLRVILKDY 622
           GW VV + + EW+++    S E+  YL+  LK +
Sbjct: 779 GWAVVRIPYYEWDKIPSFASMERQRYLQRALKTH 812


>gi|156089469|ref|XP_001612141.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154799395|gb|EDO08573.1| hypothetical protein BBOV_III010170 [Babesia bovis]
          Length = 260

 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 40/70 (57%), Gaps = 3/70 (4%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           +K+A E DGPTHF   T +    ++LK   +   GW V+ + +QEW +L    ++   L+
Sbjct: 127 RKIAIEYDGPTHFYAETTMRTAKSILKHEILENTGWQVLHIPYQEWLQLPLKRKRQHLLK 186

Query: 617 V---ILKDYI 623
           V   ILK+YI
Sbjct: 187 VNEEILKEYI 196


>gi|70951793|ref|XP_745109.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56525327|emb|CAH77447.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 350

 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 43/72 (59%), Gaps = 1/72 (1%)

Query: 551 DAVLVDKK-VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSF 609
           D +  D + +A E+DGP+HF  N+     +T LK R +   G+NV+ +S+ +W +L+   
Sbjct: 242 DFIFADNRPIAIEVDGPSHFYANSNRYTTYTKLKHRILTKLGYNVIHISYFDWRKLRNKS 301

Query: 610 EQLDYLRVILKD 621
           E+ +++   LK+
Sbjct: 302 EREEFILKKLKE 313


>gi|68073089|ref|XP_678459.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56498935|emb|CAH96559.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 319

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 43/72 (59%), Gaps = 1/72 (1%)

Query: 551 DAVLVDKK-VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSF 609
           D +  D + +A E+DGP+HF  N+     +T LK R +   G+NV+ +S+ +W +L+   
Sbjct: 211 DFIFADNRPIAIEVDGPSHFYANSNRYTTYTKLKHRILTKLGYNVIHISYFDWRKLRNKS 270

Query: 610 EQLDYLRVILKD 621
           E+ +++   LK+
Sbjct: 271 EREEFILKKLKE 282


>gi|397624180|gb|EJK67299.1| hypothetical protein THAOC_11691, partial [Thalassiosira oceanica]
          Length = 538

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 43/122 (35%), Positives = 63/122 (51%), Gaps = 10/122 (8%)

Query: 499 LSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV-- 555
           L   L+EK  +A  +  F++   S  Q +V   L + GL+   E  +  GY VDA++   
Sbjct: 28  LPQSLQEKCRNAFTSASFSE---SKLQNDVVYELRAAGLDLDEEVLLGSGYRVDALVKFS 84

Query: 556 -DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLD 613
             +KVA E+DGP+HF      P G + LK R +A      VVS+ + EW EL+ S  +  
Sbjct: 85  NGRKVAVEVDGPSHFIDRR--PTGSSTLKHRQVARLDRIEVVSVPYWEWNELKNSETKQR 142

Query: 614 YL 615
           YL
Sbjct: 143 YL 144


>gi|403221392|dbj|BAM39525.1| uncharacterized protein TOT_010000980 [Theileria orientalis strain
           Shintoku]
          Length = 571

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 42/161 (26%), Positives = 78/161 (48%), Gaps = 26/161 (16%)

Query: 477 FASQVHLVNQCLKLEHPHLQLALSSV-LEEKIASAGKTKRFNQKV---TSSFQKEVARLL 532
           F SQ++L+N+  +LE   L+   + + L E +    + +    ++   TS+   +V  +L
Sbjct: 410 FISQLNLLNRSAELERHGLKRLFTQMGLREFLTGLEQVRPVFSQIDHNTSNTHVQVDSVL 469

Query: 533 VSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGV----------PLG---- 578
            S     + E+ +  Y VD  +  K    E+DGP H++  TG+          PLG    
Sbjct: 470 KSFNYETLLEHFISPYLVDIFVPSKNAIIEVDGPYHYA--TGMNERVNAIMKRPLGRFPC 527

Query: 579 ----HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
               ++ LKRR ++ +G+   ++ +QEW   Q + EQ+ Y+
Sbjct: 528 QYSLNSRLKRRLLSKSGYKFFNIPYQEWP--QSTNEQIYYI 566


>gi|401409740|ref|XP_003884318.1| hypothetical protein NCLIV_047190 [Neospora caninum Liverpool]
 gi|325118736|emb|CBZ54287.1| hypothetical protein NCLIV_047190 [Neospora caninum Liverpool]
          Length = 929

 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 48/89 (53%)

Query: 527 EVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRY 586
           EVA +L   G+++ R   ++G  +D +L +KKV     GP HF  ++     ++ L++R 
Sbjct: 738 EVAWMLQEMGISFQRRLYINGCRIDILLPEKKVVIMCAGPHHFYLDSTRRTAYSRLQQRL 797

Query: 587 IAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
           +   G+ V  L + EW EL+   E+  +L
Sbjct: 798 LELQGYAVCVLPYYEWSELKSPEEKQRFL 826


>gi|397564390|gb|EJK44191.1| hypothetical protein THAOC_37291, partial [Thalassiosira oceanica]
          Length = 134

 Score = 52.8 bits (125), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 32/73 (43%), Positives = 43/73 (58%), Gaps = 5/73 (6%)

Query: 547 GYTVDAVLV--DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWE 603
           GY VDA++   D+ VA E+DGP+HF +    P G T LK R +A      VVS+ + EW 
Sbjct: 57  GYRVDALVKVGDRGVAIEVDGPSHFIQRR--PTGSTTLKHRQVATLECIEVVSVPYWEWN 114

Query: 604 ELQGSFEQLDYLR 616
           EL+ S  +  YLR
Sbjct: 115 ELKNSVTKQQYLR 127


>gi|389585147|dbj|GAB67878.1| RAP protein [Plasmodium cynomolgi strain B]
          Length = 378

 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 44/72 (61%), Gaps = 1/72 (1%)

Query: 551 DAVLVDKK-VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSF 609
           D++  D + +A E+DGP+HF  N+     +T LK R +   G+NV+ +S+ +W +L+   
Sbjct: 270 DSIFADNRPIAIEVDGPSHFYANSNRYTTYTKLKHRILTKLGYNVIHISYIDWRKLRNKS 329

Query: 610 EQLDYLRVILKD 621
           E+ +++   LK+
Sbjct: 330 EREEFILKKLKE 341


>gi|397643122|gb|EJK75666.1| hypothetical protein THAOC_02605, partial [Thalassiosira oceanica]
          Length = 599

 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 45/181 (24%), Positives = 72/181 (39%), Gaps = 39/181 (21%)

Query: 284 QGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           Q +SNIAWA +  G    +L+    D VA  AL  +  F  Q ++N++ AF++   S  +
Sbjct: 447 QELSNIAWAFATAGESHPVLFEKIGDYVA--ALGSLNSFKPQELSNISWAFSAAGVSHAE 504

Query: 342 LFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNK 399
           LF ++A   +  D + +F+ QELA  + AF +   P   L + + +       F      
Sbjct: 505 LFEKIAYHIAGLDCLDSFKPQELANTVHAFCNAVRPHPALFDKIGHYIAGLCSFNL---- 560

Query: 400 ALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTI 459
                                        F    L NIAW++A  G+     F  I   I
Sbjct: 561 -----------------------------FQPQNLSNIAWAFATAGESHPALFEKIGDYI 591

Query: 460 S 460
           +
Sbjct: 592 A 592



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 49/112 (43%), Gaps = 2/112 (1%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
           A+ +L     Q +SNI+WA S  G     L E        L  +  F  Q +AN   AF 
Sbjct: 476 ALGSLNSFKPQELSNISWAFSAAGVSHAELFEKIAYHIAGLDCLDSFKPQELANTVHAFC 535

Query: 334 SMQHSAPDLFSELAKRASDIV--HTFQEQELAQVLWAFASLYEPADPLLESL 383
           +     P LF ++    + +   + FQ Q L+ + WAFA+  E    L E +
Sbjct: 536 NAVRPHPALFDKIGHYIAGLCSFNLFQPQNLSNIAWAFATAGESHPALFEKI 587



 Score = 40.0 bits (92), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 41/171 (23%), Positives = 72/171 (42%), Gaps = 48/171 (28%)

Query: 283 AQGISNIAWALSKIGGELLYLSEMDRVA------EVALTKVGEFNSQNVANVAGAFASMQ 336
            Q ++NI W+ SK G       E DR         +    + +F  Q ++ +  A+A+ +
Sbjct: 369 GQALANIVWSFSKSG-------EADREMFNHIGDHIVARSLYDFLPQEMSIIVWAYANGR 421

Query: 337 HSAPDLFSELAKRASDIV--HTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFT 394
            S   LF  +    + +V  ++F+ QEL+ + WAFA+  E + P+L      F+    + 
Sbjct: 422 VSHHALFDRVGFHVTRLVSSYSFKPQELSNIAWAFATAGE-SHPVL------FEKIGDYV 474

Query: 395 CCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLG 445
             L                      GSL+    SF   +L NI+W+++  G
Sbjct: 475 AAL----------------------GSLN----SFKPQELSNISWAFSAAG 499


>gi|149732832|ref|XP_001501739.1| PREDICTED: FAST kinase domain-containing protein 3-like [Equus
           caballus]
          Length = 660

 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 48/174 (27%), Positives = 79/174 (45%), Gaps = 16/174 (9%)

Query: 444 LGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHP-HLQLALSSV 502
           L Q+ ++F + I +    ++  ++  +Y+           +  C  LE P   QL   SV
Sbjct: 490 LAQLTQLFLTSILEC-PFYKGPKLLPKYQVK-------SFLTPCCSLETPVDFQLY-KSV 540

Query: 503 LEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFE 562
           +   I   G    F  KV + +     R  V   +    E  V  +TVD   V K+VA  
Sbjct: 541 MTGLIDLLGARLYFASKVLTPY-----RYTVDVEIKLDEEGFVLPFTVDED-VHKRVALC 594

Query: 563 IDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           IDGP  F  N+   LG   +K+R++   G++VV + + E E L+   E ++YL+
Sbjct: 595 IDGPKRFCLNSKHLLGKEAMKQRHLRLLGYHVVQIPYYEIEMLKSRLELVEYLQ 648


>gi|156099636|ref|XP_001615683.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148804557|gb|EDL45956.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 443

 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 44/72 (61%), Gaps = 1/72 (1%)

Query: 551 DAVLVDKK-VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSF 609
           D++  D + +A E+DGP+HF  N+     +T LK R +   G+NV+ +S+ +W +L+   
Sbjct: 335 DSIFADNRPIAIEVDGPSHFYANSNRYTTYTKLKHRILTKLGYNVIHISYIDWRKLRNKT 394

Query: 610 EQLDYLRVILKD 621
           E+ +++   LK+
Sbjct: 395 EREEFILKKLKE 406


>gi|84998036|ref|XP_953739.1| hypothetical protein [Theileria annulata]
 gi|65304736|emb|CAI73061.1| hypothetical protein TA16950 [Theileria annulata]
          Length = 574

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 49/205 (23%), Positives = 91/205 (44%), Gaps = 32/205 (15%)

Query: 436 NIAWSYAVLG-QMDRIFFSDIWKTISRFEEQRISEQYREDIM----FASQVHLVNQCLKL 490
           N  +SY+    ++D + +S     I ++     S +  E+I+    F SQ++L+ + + L
Sbjct: 372 NCHYSYSQFNLKLDTLIYS-----ILKYVYNIFSGENMEEIIKFPNFVSQLNLLRKSINL 426

Query: 491 EHPHLQLALS----SVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVD 546
           E  HL+  +     S   + +     T   N+  TS+   +V  +L S     + E+ V 
Sbjct: 427 ERVHLKKLIEGSEISCFLDSLEHIKPTFAPNEFKTSNIHSQVDTILKSFNYETLLEHYVC 486

Query: 547 GYTVDAVLVDKKVAFEIDGPTHFSRNT-------------GVPLGHTM---LKRRYIAAA 590
            Y VD  +  K V  E+DGP H+S                   LG+T+   LK R +  +
Sbjct: 487 PYIVDIFVPSKNVIIEVDGPYHYSTTINPRINKILKREVDNYRLGYTLNSKLKSRILTKS 546

Query: 591 GWNVVSLSHQEWEELQGSFEQLDYL 615
           G+  +++   +W   Q + EQ+ ++
Sbjct: 547 GFKFINIPFYQWP--QTTNEQVYFI 569


>gi|399218303|emb|CCF75190.1| unnamed protein product [Babesia microti strain RI]
          Length = 472

 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 51/94 (54%), Gaps = 13/94 (13%)

Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSR--------- 571
           T+ FQ++V+ LL   G +   E  +  Y VD +LVD KV  E++GP H++          
Sbjct: 362 TTVFQQQVSNLLKEMGYDIDCEVHIYPYIVD-ILVDNKVIIEVNGPCHYTYHCSDKNDYG 420

Query: 572 --NTGVPLG-HTMLKRRYIAAAGWNVVSLSHQEW 602
             N+ + L  +T+LK + +   G+ V+ +S+ +W
Sbjct: 421 VINSALKLNKNTILKEKLLNGCGYKVIHVSYADW 454


>gi|401404312|ref|XP_003881694.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325116107|emb|CBZ51661.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 538

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 37/63 (58%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           + +A E+DGPTHF  N+     +T LK R +   G+ V+ + + EW  L+G  E+ +Y+R
Sbjct: 201 RPIAIEVDGPTHFYANSTRYTAYTKLKHRLLTRMGYKVLHVPYFEWRRLRGQKEREEYMR 260

Query: 617 VIL 619
             L
Sbjct: 261 RKL 263


>gi|428166758|gb|EKX35728.1| hypothetical protein GUITHDRAFT_118113 [Guillardia theta CCMP2712]
          Length = 560

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 49/97 (50%), Gaps = 14/97 (14%)

Query: 283 AQGISNIAWALSKIG------GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA-SM 335
           AQ +SNI WA +++G      G LL      RV+ +       FN QNVAN   AFA S 
Sbjct: 372 AQELSNILWAHARLGLTFGEEGLLLLTRRASRVSHL-------FNGQNVANALWAFAKSG 424

Query: 336 QHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
           +   P L+ +L  RA  +    + QE + +LW+ A L
Sbjct: 425 RTPCPQLYRQLKDRALQLEEELRPQEASSMLWSLAKL 461



 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 48/94 (51%), Gaps = 3/94 (3%)

Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEV---ALTKVGEFNSQNVANVAGAFASMQHS 338
           SAQGI+N+ WA+  +      ++E + V  V   A     +FN Q VAN   + A +  +
Sbjct: 292 SAQGIANVLWAMGTLSSRTGRMAEEEMVRAVCARACEVCEQFNGQAVANSFWSLAKLGAA 351

Query: 339 APDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
              L   L +R  ++  + + QEL+ +LWA A L
Sbjct: 352 NQQLVVGLTRRMMEVADSLKAQELSNILWAHARL 385



 Score = 43.5 bits (101), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 56/118 (47%), Gaps = 18/118 (15%)

Query: 263 RQREMSMLVAIAMTALPEC---SAQGISNIAWALSKIGG--ELLYLSEMDRVAEVALTKV 317
           R  E  M+ A+   A   C   + Q ++N  W+L+K+G   + L +    R+ EVA    
Sbjct: 312 RMAEEEMVRAVCARACEVCEQFNGQAVANSFWSLAKLGAANQQLVVGLTRRMMEVA---- 367

Query: 318 GEFNSQNVANVAGAFASMQHSAPDLFSE-----LAKRASDIVHTFQEQELAQVLWAFA 370
               +Q ++N+  A A +  +    F E     L +RAS + H F  Q +A  LWAFA
Sbjct: 368 DSLKAQELSNILWAHARLGLT----FGEEGLLLLTRRASRVSHLFNGQNVANALWAFA 421



 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 59/131 (45%), Gaps = 13/131 (9%)

Query: 251 VSMMTTHRLAFTRQRE-MSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSE---M 306
            SM    ++ F    E M  LV  A   +   +AQ +SN  WA +K+G    Y+ E   M
Sbjct: 222 TSMWAMAKVGFDPGEEVMRTLVGHANEIVASFNAQDVSNFLWASAKLG----YVPEEATM 277

Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASM-----QHSAPDLFSELAKRASDIVHTFQEQE 361
            ++        G+F++Q +ANV  A  ++     + +  ++   +  RA ++   F  Q 
Sbjct: 278 VKLRRRTSKIAGDFSAQGIANVLWAMGTLSSRTGRMAEEEMVRAVCARACEVCEQFNGQA 337

Query: 362 LAQVLWAFASL 372
           +A   W+ A L
Sbjct: 338 VANSFWSLAKL 348


>gi|237839849|ref|XP_002369222.1| hypothetical protein TGME49_085840 [Toxoplasma gondii ME49]
 gi|211966886|gb|EEB02082.1| hypothetical protein TGME49_085840 [Toxoplasma gondii ME49]
          Length = 571

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 37/63 (58%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           + +A E+DGPTHF  N+     +T LK R +   G+ V+ + + EW  L+G  E+ +Y+R
Sbjct: 198 RPIAIEVDGPTHFYANSTRYTAYTKLKHRLLTRMGYKVLHVPYFEWRRLRGQKEREEYMR 257

Query: 617 VIL 619
             L
Sbjct: 258 RKL 260


>gi|294880367|ref|XP_002768980.1| hypothetical protein Pmar_PMAR008162 [Perkinsus marinus ATCC 50983]
 gi|239872053|gb|EER01698.1| hypothetical protein Pmar_PMAR008162 [Perkinsus marinus ATCC 50983]
          Length = 772

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 70/281 (24%), Positives = 115/281 (40%), Gaps = 71/281 (25%)

Query: 235 LNIATALHRIAKNMEKV------------SMMTTHRLAFTRQREMSMLVAIAMTALPECS 282
           ++ +TALHR+A  + K             S+M T+    T       LV  A   LP  +
Sbjct: 68  IHTSTALHRLATAITKTGGGRPTEGATNASVMATY---VTSDARFVRLVERARVLLPGAT 124

Query: 283 AQGISNIAWALSKIGGELLYLSE--MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
            + +SNI WALSK+     Y  E  +D V E  L  +  F++Q V+N   AF  ++ S+ 
Sbjct: 125 TRAVSNITWALSKLN----YTDEGILDIVTEYMLANLEAFDTQGVSNCLYAFGLLRCSSG 180

Query: 341 DLFSELAKRASDIV----HTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCC 396
           D    L  R  + +    + F+ QE++  ++A A L    D  L S+      A+    C
Sbjct: 181 DRRRLLLDRLCEHIPPRLNEFKPQEISNCVYALARLGHRDDSFLASV------ASYIPGC 234

Query: 397 LNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSD-- 454
           +N                             +F   ++ N+A+S A+L       F    
Sbjct: 235 IN-----------------------------NFKAQEMSNVAYSCALLSYKSDPLFQSVA 265

Query: 455 ---IWKTISRFEEQRISEQYREDIMFA-SQVHLVNQCLKLE 491
              I + +SR   Q IS     + ++A ++VH   + L +E
Sbjct: 266 DEMIARGMSRCRSQDIS-----NTLYAFAKVHFKCEALCVE 301



 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 50/180 (27%), Positives = 81/180 (45%), Gaps = 12/180 (6%)

Query: 275 MTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAE--VALTKVGEFNS-QNVANVA 329
           +T L E + QGISN  +AL  +G   E    +  D V     +L +  ++++ Q+ AN  
Sbjct: 307 ITRLHEFNMQGISNTMFALGGLGYRHEAFLNAIADHVVGRLCSLDQFSQYSTPQDFANTL 366

Query: 330 GAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDNAFK 388
            AFA +      L           +H F+ QELA V+ A+A+L Y      +E ++    
Sbjct: 367 VAFAKLSLRHDPLLDAFGSIMCHRLHAFKSQELASVVHAYATLGYVHTAFFIEVVNGILS 426

Query: 389 DATQFTCCLNKALSNC-NENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIA---WSYAVL 444
             T   C  NK +S+  +E     S G   S   ++S V      + G+IA   +S+ +L
Sbjct: 427 SPT--LCGYNKLVSSSYSEASPTMSIGQRSSNAFVASSVPRLRDFKPGDIALIVYSFGLL 484


>gi|145340621|ref|XP_001415420.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144575643|gb|ABO93712.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 417

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 62/277 (22%), Positives = 119/277 (42%), Gaps = 54/277 (19%)

Query: 198 NLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAK----------- 246
           +L  D++DA   + +L ++ E      K         +N +TALHR+A+           
Sbjct: 46  DLQGDLMDASDVEFILTMVEEQEEVFNK---------VNASTALHRVARLTTQRLPGQLR 96

Query: 247 -NMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSE 305
             ME+ ++    R     Q  MSM+  +A     E S QG+SN+ WAL+++     Y ++
Sbjct: 97  PTMERSTLFGDERF----QTLMSMVDRMAG----EMSMQGVSNVLWALARLD----YPTD 144

Query: 306 MDRVAEVAL---TKVGEFNSQNVANVAGAFASMQHSA-PDLFSELAKRASDIVHTFQEQE 361
              +  +A    ++      +N++    A A + H     L   +++RA  + H F+  +
Sbjct: 145 EALLEALAARAGSQAASAEPKNLSTTLWALAVLGHKPRSKLLKSISERALAVAHDFRSPD 204

Query: 362 LAQVLWAFA---SLYEPAD---PLLES-LDNAFKDATQFT------CCLNKALSNCNENG 408
           +  +LWA+A       P+D   P++++ LD A      +T         + A+ +C    
Sbjct: 205 VVNMLWAYARWVRYLPPSDRPTPVVQAMLDQAVSTMQSYTPYQLANLSWSLAMLDCPPAP 264

Query: 409 GVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLG 445
            V          +++S     +   L ++ W+Y V+G
Sbjct: 265 RVLEY----VLQTVASEPSKLDGTALTHVLWAYGVMG 297


>gi|221504797|gb|EEE30462.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 571

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 37/63 (58%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           + +A E+DGPTHF  N+     +T LK R +   G+ V+ + + EW  L+G  E+ +Y+R
Sbjct: 198 RPIAIEVDGPTHFYANSTRYTAYTKLKHRLLTRMGYKVLHVPYFEWRRLRGQKEREEYMR 257

Query: 617 VIL 619
             L
Sbjct: 258 RKL 260


>gi|397568565|gb|EJK46207.1| hypothetical protein THAOC_35135 [Thalassiosira oceanica]
          Length = 698

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/112 (27%), Positives = 58/112 (51%), Gaps = 6/112 (5%)

Query: 273 IAMTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAG 330
           I    L +   Q +SNIAWA +  G    +L+    D +A     ++  FN QN++N+  
Sbjct: 490 IVARRLNDFQPQHLSNIAWAFATAGVSHPILFKKIRDHIA--GQDRLNLFNPQNLSNITW 547

Query: 331 AFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLL 380
           AFA+   S P++F ++    + +  + +F+ Q L+ + WA++    P++ L 
Sbjct: 548 AFATAGDSHPEVFKKIGDHIAGLNSLDSFKAQALSNIAWAYSVANVPSEGLF 599



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 47/175 (26%), Positives = 82/175 (46%), Gaps = 32/175 (18%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQ 360
            DR+A  A   + EF +++++N+  +F  ++ + PD     LF+   K A  I+ TF+ Q
Sbjct: 367 FDRIASSAAVVLNEFEARHLSNLIYSFGLVELN-PDIGGETLFNVFGKTAVRILQTFKPQ 425

Query: 361 ELAQVLWAF-------ASLYEPADPLLESLD-NAFKDATQFTCCLNKALSNCNENGGVKS 412
           EL+ +LWAF       + L++    ++  +D ++FK   Q     + A            
Sbjct: 426 ELSNMLWAFVKVDAKNSRLFQETGGVISGMDLDSFKPQEQSNILWSFA-----------K 474

Query: 413 SGDADSE--GSLSSPVLS-----FNRDQLGNIAWSYAVLGQMDRIFFSDIWKTIS 460
           SG+A+ E    L + +++     F    L NIAW++A  G    I F  I   I+
Sbjct: 475 SGEANPELFRVLGNHIVARRLNDFQPQHLSNIAWAFATAGVSHPILFKKIRDHIA 529



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 70/275 (25%), Positives = 114/275 (41%), Gaps = 56/275 (20%)

Query: 287 SNIAWALSKIGGELLYLSEMDRV--AEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFS 344
           SNI W+ +K G       E+ RV    +   ++ +F  Q+++N+A AFA+   S P LF 
Sbjct: 466 SNILWSFAKSG---EANPELFRVLGNHIVARRLNDFQPQHLSNIAWAFATAGVSHPILFK 522

Query: 345 ELAKR--ASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALS 402
           ++       D ++ F  Q L+ + WAFA+  +       S    FK        LN    
Sbjct: 523 KIRDHIAGQDRLNLFNPQNLSNITWAFATAGD-------SHPEVFKKIGDHIAGLNS--- 572

Query: 403 NCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRF 462
                                  + SF    L NIAW+Y+V        F++ +      
Sbjct: 573 -----------------------LDSFKAQALSNIAWAYSVANVPSEGLFNECFAGACSS 609

Query: 463 EEQRISEQYREDIMFASQVHLVNQCLK--LEHPHLQLALSSVLEEKIASAGKTKRFNQKV 520
           +E+   E   E++    Q  L  Q LK  +E PH        L+EK  +A  +  +++  
Sbjct: 610 KEETFPE---EELRQLHQWQLWQQELKSGMELPH-------SLKEKCRNAFISSSYSE-- 657

Query: 521 TSSFQKEVARLLVSTGLNWIREYAVD-GYTVDAVL 554
            S  Q +V   L + GL+   E  ++ GY VDA++
Sbjct: 658 -SKLQNDVVDELKAIGLDLEVEVLLESGYRVDALV 691


>gi|221484602|gb|EEE22896.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 558

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 37/63 (58%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           + +A E+DGPTHF  N+     +T LK R +   G+ V+ + + EW  L+G  E+ +Y+R
Sbjct: 193 RPIAIEVDGPTHFYANSTRYTAYTKLKHRLLTRMGYKVLHVPYFEWRRLRGQKEREEYMR 252

Query: 617 VIL 619
             L
Sbjct: 253 RKL 255


>gi|428180195|gb|EKX49063.1| hypothetical protein GUITHDRAFT_136245 [Guillardia theta CCMP2712]
          Length = 371

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 50/185 (27%), Positives = 90/185 (48%), Gaps = 23/185 (12%)

Query: 286 ISNIAWALSKIG--GELLYLSEMDRVAE-VALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
           +S++ W ++ +G   E L+    ++V+E V  T +  FN+  ++ +A +FA  +  A DL
Sbjct: 74  VSSMIWGMAALGHTNERLF----EKVSEHVMSTGLEGFNAPKISIIAWSFARARFQAEDL 129

Query: 343 FSELAKRASDI-VHTFQEQELAQVLWAFASLYEPADPLLESLD--------NAFKDATQF 393
           FS + +   +  + +F  Q +A +LWAFA      D LL   +        + F D +  
Sbjct: 130 FSLIEEFVVEKGMSSFNSQNIACILWAFAVFGRMTDDLLACAEEQIWSVGFSGFSDQSFV 189

Query: 394 TCCLNKALSNCNENGGVKSSGDADSEGSLS----SPVLSFNRDQLGNIAWSYAVLGQM-D 448
              L  A +  +  G    SG+   + + +      + SF+  QL  +AW++A LGQ  D
Sbjct: 190 D--LLWAFAASDLTGTCTHSGEDTVKLAAAYLRKRSIRSFSPKQLSTMAWAFARLGQFHD 247

Query: 449 RIFFS 453
           + F+S
Sbjct: 248 QAFYS 252


>gi|209363966|ref|YP_001424481.2| hypothetical membrane associated protein [Coxiella burnetii Dugway
           5J108-111]
 gi|207081899|gb|ABS76574.2| hypothetical membrane associated protein [Coxiella burnetii Dugway
           5J108-111]
          Length = 558

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 66/158 (41%), Gaps = 18/158 (11%)

Query: 264 QREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGE---- 319
           QR    ++ I    +   + QGI+N  WA + +G    YL E  R++   L  V      
Sbjct: 250 QRLSECMLVIVQRTVERFNPQGIANTLWAFATMGVRWRYLEE-QRLSSCLLVAVRHNAER 308

Query: 320 FNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQELAQVLWAFASL-- 372
           FNSQ++AN   AFA+      D     L   L       +  F  QE+A  LWA A++  
Sbjct: 309 FNSQDIANTLWAFATTGVRWQDREMQKLSERLLAAVRHNIEQFNPQEIANTLWALATMEV 368

Query: 373 ---YEPADPLLESLDNAF-KDATQFTC--CLNKALSNC 404
              Y     L   L +   ++A+QF+   C     S C
Sbjct: 369 EWQYLEDQGLSHLLTDVIDRNASQFSLENCTQITWSTC 406



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 41/147 (27%), Positives = 64/147 (43%), Gaps = 10/147 (6%)

Query: 235 LNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALS 294
           L +  A+  +A  +   + M        R+R  + L +     + + + QGI+N  WAL+
Sbjct: 11  LTMLGAIFYVANTLWAFATMGVAWQYLKRERLSARLFSAIRHNVGQFNPQGIANALWALA 70

Query: 295 KIGGELLYLSEMDRVAEVALTKVGE----FNSQNVANVAGAFASM-----QHSAPDLFSE 345
            +G    YL E  R++E  L  +      FNSQ++AN   A A+M           L   
Sbjct: 71  TMGMGWRYLKE-QRLSERLLVAIRHTLEGFNSQDIANTFWALATMGVRWRYLERQSLSER 129

Query: 346 LAKRASDIVHTFQEQELAQVLWAFASL 372
           L       V  F  QE+A  LWA A++
Sbjct: 130 LLTAVRRNVEQFNAQEIANALWALATM 156



 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 70/260 (26%), Positives = 108/260 (41%), Gaps = 44/260 (16%)

Query: 232 LSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAW 291
            +P  IA AL  +A        +   RL+        +LVAI  T L   ++Q I+N  W
Sbjct: 57  FNPQGIANALWALATMGMGWRYLKEQRLS------ERLLVAIRHT-LEGFNSQDIANTFW 109

Query: 292 ALSKIGGELLYL---SEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAK 348
           AL+ +G    YL   S  +R+       V +FN+Q +AN   A A+M+     L  +   
Sbjct: 110 ALATMGVRWRYLERQSLSERLLTAVRRNVEQFNAQEIANALWALATMEVRWRYLEEQ--- 166

Query: 349 RASD-----IVHT---FQEQELAQVLWAFASL-YEPADPLLESLDNAFKDATQ--FTCCL 397
           RAS+     I HT   F  Q++A  LWA A++  +  D  ++ L      A +    C  
Sbjct: 167 RASERLLVAIRHTIESFNSQDIANTLWALATIGVKWQDREIQRLSGRLVVAVRRNIECFN 226

Query: 398 NKALSNCNENGGVKSSG-DADSEGSLSSPVL--------SFNRDQLGNIAWSYAVLGQMD 448
           ++ ++N         +G     E  LS  +L         FN   + N  W++A +G   
Sbjct: 227 SQNVANTLWAFATMGAGWRYLQEQRLSECMLVIVQRTVERFNPQGIANTLWAFATMGVRW 286

Query: 449 RIFFSDIWKTISRFEEQRIS 468
           R             EEQR+S
Sbjct: 287 RY-----------LEEQRLS 295


>gi|219110565|ref|XP_002177034.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411569|gb|EEC51497.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 923

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 43/77 (55%), Gaps = 12/77 (15%)

Query: 558 KVAFEIDGPTHFSRN--------TGVP--LGHTMLKRRYIAAAGWNVVSLSHQEWEELQ- 606
           K+A E DGP HF+R           VP  LGHT+LK R +   GW VV + + E++++  
Sbjct: 822 KLAVEFDGPNHFTRQRKPSNGSKPDVPRALGHTVLKYRLLKKQGWTVVRVPYYEFDKIPY 881

Query: 607 -GSFEQLDYLRVILKDY 622
             S E+  YL+ +LK +
Sbjct: 882 WASMERQRYLQRLLKTH 898


>gi|397588981|gb|EJK54479.1| hypothetical protein THAOC_25889, partial [Thalassiosira oceanica]
          Length = 178

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 522 SSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPL 577
           S  Q +V   L + G++   E  +  GY +DA++     + VA E+DGP+HF      P 
Sbjct: 74  SKLQHDVVGELRAAGMDLGEEVLLGSGYRIDALVKFSDGRNVAVEVDGPSHFIDRR--PT 131

Query: 578 GHTMLKRRYIAAAG-WNVVSLSHQEWEELQGSFEQLDYLRVIL 619
           G T LK R +A      VVS+ + EW EL+ S  +  YLRV L
Sbjct: 132 GSTTLKHRQVARVDRIEVVSVPYWEWNELKNSEMKQHYLRVKL 174


>gi|221508215|gb|EEE33802.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 783

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 46/89 (51%)

Query: 527 EVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRY 586
           EVA +L   G+ + R    +G  +D +L +KK      GP HF  ++     ++ L++R 
Sbjct: 616 EVACMLQEMGILFQRRLYANGCRIDILLPEKKTVIMCAGPHHFYLDSTRRTAYSRLQQRL 675

Query: 587 IAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
           +   G++V  L + EW ELQ   E+  +L
Sbjct: 676 LELQGYSVCVLPYYEWSELQNPEEKQRFL 704


>gi|221486442|gb|EEE24703.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 783

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 46/89 (51%)

Query: 527 EVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRY 586
           EVA +L   G+ + R    +G  +D +L +KK      GP HF  ++     ++ L++R 
Sbjct: 616 EVACMLQEMGILFQRRLYANGCRIDILLPEKKTVIMCAGPHHFYLDSTRRTAYSRLQQRL 675

Query: 587 IAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
           +   G++V  L + EW ELQ   E+  +L
Sbjct: 676 LELQGYSVCVLPYYEWSELQNPEEKQRFL 704


>gi|384247944|gb|EIE21429.1| hypothetical protein COCSUDRAFT_56649 [Coccomyxa subellipsoidea
           C-169]
          Length = 994

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 61/117 (52%), Gaps = 9/117 (7%)

Query: 267 MSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEV-ALTKVGEFNSQNV 325
           M +L+ +A   L +C+AQ +SNI   L+      L    +   AE+ A + +  F+ Q V
Sbjct: 442 MHILMDLAEERLEQCNAQDLSNILCGLAACERPDLAKPSLLASAELHACSMMTAFSPQGV 501

Query: 326 ANVAGAFASMQHSAPDLF----SELAKRASDIVHTFQEQELAQVLWAFASLYEPADP 378
           +NV  AFA ++   P L     +E+ +RA +    F  +++A+VLWAFA L     P
Sbjct: 502 SNVLWAFAKLEARVPTLLEAAGAEVVRRAEE----FSARDMAEVLWAFAKLGHNGSP 554



 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 63/112 (56%), Gaps = 9/112 (8%)

Query: 275 MTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFAS 334
           MTA    S QG+SN+ WA +K+   +  L E    AEV + +  EF+++++A V  AFA 
Sbjct: 493 MTAF---SPQGVSNVLWAFAKLEARVPTLLEAAG-AEV-VRRAEEFSARDMAEVLWAFAK 547

Query: 335 MQHS-APDLFSELAKRASDIVHT---FQEQELAQVLWAFASLYEPADPLLES 382
           + H+ +PD    L  R   I+ +   +  ++LA ++W+ A L +PA   LE+
Sbjct: 548 LGHNGSPDAVEALIARMEYILRSGGPWVLRDLASMVWSLAVLEQPAPGFLEA 599



 Score = 43.9 bits (102), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 28/104 (26%), Positives = 51/104 (49%), Gaps = 6/104 (5%)

Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH---S 338
           + + +S I WAL   G      S M  + ++A  ++ + N+Q+++N+    A+ +    +
Sbjct: 421 TPRNLSTIVWALGSFG---YAPSRMHILMDLAEERLEQCNAQDLSNILCGLAACERPDLA 477

Query: 339 APDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLES 382
            P L +     A  ++  F  Q ++ VLWAFA L      LLE+
Sbjct: 478 KPSLLASAELHACSMMTAFSPQGVSNVLWAFAKLEARVPTLLEA 521


>gi|237833853|ref|XP_002366224.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
 gi|211963888|gb|EEA99083.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
          Length = 783

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 46/89 (51%)

Query: 527 EVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRY 586
           EVA +L   G+ + R    +G  +D +L +KK      GP HF  ++     ++ L++R 
Sbjct: 616 EVACMLQEMGILFQRRLYANGCRIDILLPEKKTVIMCAGPHHFYLDSTRRTAYSRLQQRL 675

Query: 587 IAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
           +   G++V  L + EW ELQ   E+  +L
Sbjct: 676 LELQGYSVCVLPYYEWSELQNPEEKQRFL 704


>gi|399218291|emb|CCF75178.1| unnamed protein product [Babesia microti strain RI]
          Length = 507

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 72/155 (46%), Gaps = 21/155 (13%)

Query: 474 DIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLV 533
           D   A+ ++   + L++++PHL   L  + E+       T R       S Q   ++ LV
Sbjct: 326 DTRHATMLYYSLRYLEIQYPHLINTLQPIYEQCTTLLKNTPRM-----KSIQPSKSQRLV 380

Query: 534 STGLNWIR-----EYAVDGY-----TVDAVLVDKKVAFEIDGPTHF--SRNTG--VPLGH 579
           S  LN  R     EY          ++++ L  +K+A E+DGP HF    NT   V  G 
Sbjct: 381 SDALNSWRIPHKFEYTTPKLVSIDISIESTLYGEKIAIEVDGPWHFLTFHNTQERVRTGP 440

Query: 580 TMLKRRYIAAAGWNVVSL--SHQEWEELQGSFEQL 612
           +  K   + + GWNV+SL  S++  ++LQ   ++ 
Sbjct: 441 SFFKHWLLESEGWNVISLQPSNRNLQDLQNDLQEF 475


>gi|308798919|ref|XP_003074239.1| tumor-related protein-like (ISS) [Ostreococcus tauri]
 gi|116000411|emb|CAL50091.1| tumor-related protein-like (ISS) [Ostreococcus tauri]
          Length = 797

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 68/136 (50%), Gaps = 9/136 (6%)

Query: 259 LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSE-MDRVAEVALTKV 317
           LA +  + M  L       + E SAQ ++  A A++K+G   +Y S+ M    E A  + 
Sbjct: 273 LAVSNHKIMQTLAKCMARKVEESSAQQMATSAHAMAKLG---VYNSQLMKAYRESAALRR 329

Query: 318 GEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVH-----TFQEQELAQVLWAFASL 372
            +F  +++A +  +FA ++  A ++F  L++   D+++     TF    L  VLW+FA L
Sbjct: 330 EQFQPRDIAFLTWSFAKLEVHASEMFKMLSEVICDMLYDVEFQTFTPHHLTMVLWSFAML 389

Query: 373 YEPADPLLESLDNAFK 388
            E    +L S+  A K
Sbjct: 390 KEDVTEILPSVTRAIK 405



 Score = 40.8 bits (94), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 48/174 (27%), Positives = 72/174 (41%), Gaps = 33/174 (18%)

Query: 231 PLSPLNIATALHRIAK-NMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNI 289
           P  P  ++ A   +AK N + + +    R    R R +S+L  +        +AQG+SN 
Sbjct: 79  PWKPQELSNAFWGLAKVNSDAIELF---RFLGERIR-VSLLTDVGTDHRTGWTAQGVSNA 134

Query: 290 AWAL--------------SKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASM 335
           AW+L              S +GGEL  + E+ R  E    ++  FN Q  AN     A  
Sbjct: 135 AWSLGALATETRIGMFEESALGGEL--VRELARAIE---ERIELFNPQECANTLSGLAKC 189

Query: 336 QHSAPD-------LFSELAKRASDIVH--TFQEQELAQVLWAFASLYEPADPLL 380
             SA +        F+   KR    +    FQ Q ++ V+WA A L    D +L
Sbjct: 190 AASASEDAPRGAKAFAGRLKRDRSWLSGGQFQCQHVSNVIWACAKLNMSDDAVL 243


>gi|68067688|ref|XP_675786.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56495167|emb|CAH98407.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 423

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 56/105 (53%), Gaps = 3/105 (2%)

Query: 518 QKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLVDKKVAFEIDGPTHFSRNTG-- 574
             V+SS  K+++  L    +    EY + D   VDA +    VA EIDGP+HF +  G  
Sbjct: 311 HHVSSSVHKKISADLKYLNVFHYNEYFILDSILVDAYIPHTMVAIEIDGPSHFIQRGGSI 370

Query: 575 VPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
           V   +T+ K+R + A G+ VVS+S  E   +  +   +++++ IL
Sbjct: 371 VYNPNTLFKKRLLRALGFVVVSISITEHTFIFSALTTINFVKRIL 415


>gi|307107871|gb|EFN56112.1| hypothetical protein CHLNCDRAFT_144712 [Chlorella variabilis]
          Length = 851

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 44/80 (55%), Gaps = 13/80 (16%)

Query: 549 TVDAVLVDKKVAFEIDGPTHFSRNTG-------------VPLGHTMLKRRYIAAAGWNVV 595
            VD  +  +++A E+DGPTHF RN G             +P+G T+LKRR +   GW V 
Sbjct: 745 CVDIAVPSRRLAIEVDGPTHFCRNNGGGGGGSASKQHLLLPMGSTLLKRRLLQRRGWAVA 804

Query: 596 SLSHQEWEELQGSFEQLDYL 615
           S+   +WE L+G+  +  +L
Sbjct: 805 SVCAADWERLRGAAPKRAFL 824


>gi|389582720|dbj|GAB65457.1| RAP protein [Plasmodium cynomolgi strain B]
          Length = 445

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/118 (28%), Positives = 54/118 (45%), Gaps = 4/118 (3%)

Query: 490 LEHPHLQLALSSVL----EEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV 545
           L+  H+ L L   L    E+ I    + K  N    S  QK++ +LL   GL   RE+ V
Sbjct: 308 LKQIHIVLYLLRELGGDYEQAINVIERKKIKNTLTVSKMQKQLEKLLKEMGLKADREFPV 367

Query: 546 DGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWE 603
             Y +D VL  K+   E++G TH+    G     + LK   +    W V+++ +  W+
Sbjct: 368 GPYVLDFVLQKKRTCIEVNGFTHYYTFGGELNAKSRLKYYILRRLNWKVLTVEYTSWK 425


>gi|84997545|ref|XP_953494.1| hypothetical protein [Theileria annulata]
 gi|65304490|emb|CAI76869.1| hypothetical protein TA11170 [Theileria annulata]
          Length = 1272

 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 67/323 (20%), Positives = 130/323 (40%), Gaps = 72/323 (22%)

Query: 191 SNRRKEINLNKDIVDAQTAQEVLEV--IAEMITAVGKGLSPSPLSPLNIATALHRIAKNM 248
           +N   E  LN D       Q++L+     ++++++G  L    ++ +N++TA+HR+AK  
Sbjct: 380 TNLEAETWLNMDPNHILIQQDLLKSKNTTQVLSSIGDKLKQ--MNAVNVSTAIHRLAKYT 437

Query: 249 EKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWA---------------- 292
                   +R           L+A+    + +   QG++NI W+                
Sbjct: 438 NPY-----NRYMVVNHESFGKLIALVEDHILKFDPQGLTNIFWSMIKLKITPKWLDCLLE 492

Query: 293 ----------LSKIGGELLYLSEMDRVAEVALT-----------KVGEFNSQ-NVANVAG 330
                     LS++   L  LS++ +  + +L            K+ +F    ++  V+ 
Sbjct: 493 QININANSLNLSELSNCLFCLSKLTKANDSSLELRFKILSLVQDKIKQFKRPLDLTCVST 552

Query: 331 AFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDA 390
           A A +    P +F  ++ +    +  F+ QE+  + W++ASL    D LL      F+  
Sbjct: 553 ALARLNVRNPVIFGHISSQVISSLEEFKIQEICGIAWSYASL-GFTDHLL------FRKI 605

Query: 391 TQFTCCLNKALSNCNENGGVK---------SSGDADSEGSLSSPVL-----SFNRDQLGN 436
            +F     ++ ++ N  G +             D D      SP++     S N  Q+  
Sbjct: 606 REFI----ESKADPNNIGNIVHLAWALSKIKEADPDFFLYTVSPLVRSHLSSLNCRQMTT 661

Query: 437 IAWSYAVLGQMDRIFFSDIWKTI 459
           I+W+Y   G  D+  F+DI  T+
Sbjct: 662 ISWAYVNAGVEDQDLFNDIASTL 684


>gi|221056993|ref|XP_002259634.1| RAP protein [Plasmodium knowlesi strain H]
 gi|193809706|emb|CAQ40408.1| RAP protein, putative [Plasmodium knowlesi strain H]
          Length = 1170

 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 67/124 (54%), Gaps = 4/124 (3%)

Query: 501  SVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLVDKKV 559
            S+ ++++A   + +  NQ V+SS  K+++  L    +    EY + D   VD  +   +V
Sbjct: 1041 SIWKKQLARNQRKEEKNQ-VSSSVHKKISNDLRHLNIFHHNEYFILDSLLVDVYVPSARV 1099

Query: 560  AFEIDGPTHFSRNTGVPL--GHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRV 617
            A EIDGP+HF +   + L   +++ K+R + A G++V+S+S  E   +  +    ++L+ 
Sbjct: 1100 AIEIDGPSHFLQKGKLILYNPNSLFKKRLLRALGFSVISISISEHTFMFSALNTFNFLKK 1159

Query: 618  ILKD 621
             L +
Sbjct: 1160 FLSN 1163


>gi|84997988|ref|XP_953715.1| hypothetical protein [Theileria annulata]
 gi|65304712|emb|CAI73037.1| hypothetical protein, conserved [Theileria annulata]
          Length = 450

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 45/89 (50%), Gaps = 1/89 (1%)

Query: 527 EVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRY 586
           ++ RLL    L    E  +  YT+D  +    VA E++G THF  N+      T LK + 
Sbjct: 319 QLGRLLDELKLKHKSELKIGPYTLDYAIPKINVAIEVNGYTHFFHNSKELNALTQLKYKI 378

Query: 587 IAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
           +   GWNVV +++  W+  +    +LDY+
Sbjct: 379 LKDMGWNVVGINYYNWKN-RNKQSRLDYI 406


>gi|71033831|ref|XP_766557.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68353514|gb|EAN34274.1| hypothetical protein TP01_1036 [Theileria parva]
          Length = 572

 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 44/184 (23%), Positives = 78/184 (42%), Gaps = 32/184 (17%)

Query: 441 YAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALS 500
           +++L  +  IF  D  K I++F              F SQ++L+ + + LE  HL+  +S
Sbjct: 387 HSILKYVYGIFSGDDMKEITKFPN------------FVSQLNLLRKSMILERIHLKGLIS 434

Query: 501 ----SVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVD 556
               S   + +     T   N+  TS+   +V  +L S     + E+ V  Y VD  +  
Sbjct: 435 GTEISSFLDSLEHIKPTFAPNEFKTSNIHSQVDTILKSFNYVTLLEHYVCPYIVDIFVPS 494

Query: 557 KKVAFEIDGPTHFSRNT-------------GVPLGHTM---LKRRYIAAAGWNVVSLSHQ 600
           K    E+DGP H+S                   LG+T+   LK + +  +G+  +++   
Sbjct: 495 KNAVIEVDGPYHYSTTLNPRINKILKREVENYQLGYTLNSKLKSKLLTKSGFKFINIPFY 554

Query: 601 EWEE 604
           +W E
Sbjct: 555 QWPE 558


>gi|397598840|gb|EJK57295.1| hypothetical protein THAOC_22677, partial [Thalassiosira oceanica]
          Length = 98

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/74 (40%), Positives = 42/74 (56%), Gaps = 6/74 (8%)

Query: 547 GYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQEW 602
           GY +DA +    ++KVA E+DGP+HF      P G T+LK R +       VVS+ + EW
Sbjct: 18  GYRIDAFVKISDERKVAVEVDGPSHFIDRR--PTGSTILKHRQVVPLDRIEVVSVPYWEW 75

Query: 603 EELQGSFEQLDYLR 616
           +EL  S  +  YLR
Sbjct: 76  DELMSSETKQHYLR 89


>gi|424513170|emb|CCO66754.1| predicted protein [Bathycoccus prasinos]
          Length = 1295

 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/106 (28%), Positives = 53/106 (50%), Gaps = 4/106 (3%)

Query: 522  SSFQKEVARLLVSTGLNWIREYAVDG--YTVDAVLVDKKVAFEIDGPTHFSRN-TGVPLG 578
            S F +EV+  L   G+    E+  +G  Y++D  L  +K+  E DGPTH+S N   V +G
Sbjct: 977  SGFHQEVSSTLSEMGVPHELEFLTEGGLYSLDIALKGRKICIEADGPTHYSINRPTVRIG 1036

Query: 579  HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIG 624
               L+   +   GW V+ +    W+      ++ +Y+  +L ++ G
Sbjct: 1037 GDNLREAILTKQGWTVIQIPWFTWQAAPER-DRREYIANLLYEHAG 1081


>gi|154706218|ref|YP_001424375.1| hypothetical membrane associated protein [Coxiella burnetii Dugway
           5J108-111]
 gi|154355504|gb|ABS76966.1| hypothetical membrane associated protein [Coxiella burnetii Dugway
           5J108-111]
          Length = 593

 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 52/186 (27%), Positives = 83/186 (44%), Gaps = 20/186 (10%)

Query: 280 ECSAQGISNIAWALSKIG--GELLYLSEM-DRVAEVALTKVGEFNSQNVANVAGAFASMQ 336
           + + QGI N  WAL+ +G   + L + E+ DR+ E     V  F +Q + N   AFA++ 
Sbjct: 180 QLNPQGIVNTLWALATMGMRWQELEVRELSDRLLEAVRYNVSRFKAQEITNALWAFATLS 239

Query: 337 HSAPDLFSE-LAKRASDIVHTFQE----QELAQVLWAFASL---------YEPADPLLES 382
                L ++ L  R  D VH   E    Q +   LWA A++          E  D LLE+
Sbjct: 240 VRWKKLETQGLNDRLLDAVHHNTEQLNPQGIVNTLWALATMGVRWRELEVRELTDRLLEA 299

Query: 383 LD-NAFKDATQFTCCLNKALSNCN-ENGGVKSSGDADS-EGSLSSPVLSFNRDQLGNIAW 439
           +  NA +  ++       AL+  +   G +++ G  D   G++   V  FN   + N  W
Sbjct: 300 VRYNASRFKSREIANTLWALATLSVRRGNMEAQGLRDRLLGAVHHNVERFNPQDIANALW 359

Query: 440 SYAVLG 445
             A +G
Sbjct: 360 GLATMG 365



 Score = 40.4 bits (93), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 37/126 (29%), Positives = 58/126 (46%), Gaps = 18/126 (14%)

Query: 286 ISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
           I+N  WAL+ +    G +      DR+       V  FN Q++AN     A+M    P+L
Sbjct: 312 IANTLWALATLSVRRGNMEAQGLRDRLLGAVHHNVERFNPQDIANALWGLATMGMKWPEL 371

Query: 343 FSE-LAKRASDIVHTFQE----QELAQVLWAFASLY--------EPADPLLESLDNAFKD 389
            ++ L+ R  + VH   E    Q++A  LWA A +         +  D LL +L N  ++
Sbjct: 372 EAQGLSDRLLEAVHRNAEQLNPQQIANTLWALAMMTVSWEYLQEQRLDQLLLNLIN--QN 429

Query: 390 ATQFTC 395
           A QF+ 
Sbjct: 430 ANQFSL 435


>gi|195996645|ref|XP_002108191.1| hypothetical protein TRIADDRAFT_52413 [Trichoplax adhaerens]
 gi|190588967|gb|EDV28989.1| hypothetical protein TRIADDRAFT_52413 [Trichoplax adhaerens]
          Length = 617

 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 35/58 (60%), Gaps = 1/58 (1%)

Query: 559 VAFEIDGPTHFSRNTG-VPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
           VA E DGP+HFS N   V LG T+LK+R++   G+    +++ EW  L    E++ YL
Sbjct: 552 VAIEADGPSHFSCNQPYVNLGQTVLKQRHLKQMGFAFAQIAYHEWMTLNNKDEKISYL 609



 Score = 40.4 bits (93), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 23/85 (27%), Positives = 39/85 (45%), Gaps = 2/85 (2%)

Query: 288 NIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELA 347
           N AWA +   G  L L   D ++     K  + N Q+++N+  AFA   +    L +++A
Sbjct: 353 NTAWAFAT--GGFLDLVCYDNISNKLFRKADKMNEQDISNITWAFALTGYRNEKLQNKVA 410

Query: 348 KRASDIVHTFQEQELAQVLWAFASL 372
                ++H      L+ + W FA L
Sbjct: 411 DTVIGLIHHINSSNLSTITWGFAIL 435


>gi|196000781|ref|XP_002110258.1| hypothetical protein TRIADDRAFT_54076 [Trichoplax adhaerens]
 gi|190586209|gb|EDV26262.1| hypothetical protein TRIADDRAFT_54076 [Trichoplax adhaerens]
          Length = 686

 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 21/66 (31%), Positives = 38/66 (57%)

Query: 562 EIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
           E+DG THF R   +  G ++LK+ ++   G+NV+ + H EW  +    ++++YLR  +  
Sbjct: 619 EVDGKTHFLRKYQLYTGPSILKKNHLKKFGYNVIQIPHFEWRIIDSFSDKVEYLRRKISH 678

Query: 622 YIGGEG 627
           Y  G+ 
Sbjct: 679 YDSGDS 684



 Score = 47.8 bits (112), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 36/149 (24%), Positives = 66/149 (44%), Gaps = 43/149 (28%)

Query: 283 AQGISNIAWALSKIGGE---------------------------------LLYLSE--MD 307
            +GI+N+ W+L+ IG +                                 L Y  +   D
Sbjct: 263 GKGIANVTWSLANIGNKDDAFLQILGNAAMERIKFMNPDSLAIFAWSLVSLDYFDDKLFD 322

Query: 308 RVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLW 367
            +A+ +L ++ +F++QN++N+  AFA   +  P LF ++A+     +H  + Q +A +  
Sbjct: 323 VIADESLVQMRKFSAQNLSNLLLAFAKSNYMIPKLFHDVAESTIKKLHNMEPQAMANIAL 382

Query: 368 AFA--SLYEPADPLLESLDNAFKDATQFT 394
           ++A  S YEP      +L  AF D   F+
Sbjct: 383 SYAKVSYYEP------NLVKAFTDKIIFS 405


>gi|428177978|gb|EKX46855.1| hypothetical protein GUITHDRAFT_70208, partial [Guillardia theta
           CCMP2712]
          Length = 88

 Score = 49.7 bits (117), Expect = 0.005,   Method: Composition-based stats.
 Identities = 24/70 (34%), Positives = 39/70 (55%)

Query: 547 GYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQ 606
           GY++D ++     A E+DGP HF  N+    G T +K R++   G+   ++   EW ++ 
Sbjct: 18  GYSLDILMPSLGCALEVDGPFHFLLNSYERSGSTKMKHRHLEQIGYKFHAIPFWEWPKVG 77

Query: 607 GSFEQLDYLR 616
            S E+L YLR
Sbjct: 78  PSEEKLAYLR 87


>gi|426246726|ref|XP_004017142.1| PREDICTED: FAST kinase domain-containing protein 3 [Ovis aries]
          Length = 660

 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 3/74 (4%)

Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
           DG+ +   +   V K+VA  IDGP  F  N+   LG    K+R++   G+ VV + + E 
Sbjct: 575 DGFVLPFTIDEDVHKRVALCIDGPKRFCLNSNHLLGKEATKQRHLRLLGYQVVQIPYYEI 634

Query: 603 EELQGSFEQLDYLR 616
           E L+   E +DYL+
Sbjct: 635 EMLKSRLELVDYLQ 648


>gi|399216319|emb|CCF73007.1| unnamed protein product [Babesia microti strain RI]
          Length = 527

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 33/119 (27%), Positives = 57/119 (47%), Gaps = 24/119 (20%)

Query: 521 TSSFQKEV--ARLLVSTGLN-------WIREYAV-----DGYTVDAVLVDKK-------- 558
           TS+FQK+V  A L +   LN       ++ +YA      D Y ++    + K        
Sbjct: 408 TSNFQKQVGEAALFIYYKLNTEVKIGPFMVDYATPMSVNDMYNINNYRTNDKDINPEINT 467

Query: 559 --VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
             V  E+DGP HF +N+     H+++K   +   G+ VV + + EW++L    ++ +YL
Sbjct: 468 NGVIIEVDGPRHFYKNSHTYTCHSIVKDEILKLMGYRVVHVKYFEWDKLPNLVDKQNYL 526


>gi|71029720|ref|XP_764503.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68351457|gb|EAN32220.1| hypothetical protein, conserved [Theileria parva]
          Length = 1135

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 67/325 (20%), Positives = 127/325 (39%), Gaps = 76/325 (23%)

Query: 191 SNRRKEINLNKDIVDAQTAQEVLEV--IAEMITAVGKGLSPSPLSPLNIATALHRIAKNM 248
           +N   E  LN D       Q++L+     ++++++G  L    ++ +N++TALHR+A+  
Sbjct: 240 TNLEPETWLNMDPNHILIQQDLLKSKNTTQVLSSIGDKLKQ--MNAVNVSTALHRLARYT 297

Query: 249 EKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWA---------------- 292
                   +R           L+++    + +   QG++NI W+                
Sbjct: 298 NPY-----NRYMVCNHESFGKLISLVEEHILKFDPQGLTNIFWSIIKLKITPKWLDCLLE 352

Query: 293 ----------LSKIGGELLYLSEMDRVAEVALT-----------KVGEFNSQ-NVANVAG 330
                     LS++   L  LS++ + ++ +L            K+ +F    ++  V+ 
Sbjct: 353 QINIHANSLNLSELSNCLFCLSKLTKSSDSSLELRFKILSLVQDKITQFKRPLDLTCVST 412

Query: 331 AFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDA 390
           A A +    P +F  ++ +    +  F+ QEL  + W++ASL              F D 
Sbjct: 413 ALARLNVRNPVIFGHISSQVISNLEEFKIQELCGIAWSYASL-------------GFTDH 459

Query: 391 TQFTCCLNKALSNCNENG---------GVKSSGDADSEGSLS--SPVL-----SFNRDQL 434
             F        S  ++N           +    +AD +  L   SP++     S N  Q+
Sbjct: 460 LLFMKIRRFIESKADQNNIGNIIHLAWALSKIKEADPDFFLYTVSPLVRSHLASLNCRQM 519

Query: 435 GNIAWSYAVLGQMDRIFFSDIWKTI 459
             IAW+Y   G  D   F+DI  T+
Sbjct: 520 TTIAWAYVNAGVEDLDLFNDIAATL 544


>gi|302839870|ref|XP_002951491.1| hypothetical protein VOLCADRAFT_92047 [Volvox carteri f. nagariensis]
 gi|300263100|gb|EFJ47302.1| hypothetical protein VOLCADRAFT_92047 [Volvox carteri f. nagariensis]
          Length = 2025

 Score = 49.3 bits (116), Expect = 0.006,   Method: Composition-based stats.
 Identities = 52/212 (24%), Positives = 84/212 (39%), Gaps = 45/212 (21%)

Query: 275  MTALPECSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGA 331
            +  LP+ S Q +SN    L+K+G   G  +    +D+VA  ++ K+GEFN+Q ++N+  +
Sbjct: 1268 LQVLPQASHQDVSNSLLGLAKLGWSPGPYV----LDQVARGSVAKIGEFNAQELSNMMWS 1323

Query: 332  FASMQHSAPDL------------------FSELAKRASD----IVHTFQEQELAQVLWAF 369
             A ++H    L                   +   +RA D        F  QEL+ +LW+ 
Sbjct: 1324 LAHVKHCNAKLQTAIFQQAGFYHRLLACWLASWYRRAHDGASAAAAHFTYQELSNLLWST 1383

Query: 370  AS---LYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDAD--------- 417
            A    L+EP              A        +     +ENG   SSG  +         
Sbjct: 1384 AKMGYLHEPLMRAAARQAARQLAAEVEEREGREEEQLEDENGRGDSSGGGEEDDLAAAEC 1443

Query: 418  ----SEGSLSSPVLSFNRDQLGNIAWSYAVLG 445
                S  S    V S++   + N  W++A LG
Sbjct: 1444 RAAASRPSARGCVRSWSSQAVSNTTWAFATLG 1475



 Score = 49.3 bits (116), Expect = 0.006,   Method: Composition-based stats.
 Identities = 32/111 (28%), Positives = 60/111 (54%), Gaps = 10/111 (9%)

Query: 265 REMSMLVAIAMTALPECSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFN 321
           R +SM +   +  L E   QGI++    L+K+G   G  +    +D+VA  ++ K+GEFN
Sbjct: 465 RNLSMRL---LGLLAEVPPQGIASSLLGLAKLGWSPGPYV----LDQVARGSVAKIGEFN 517

Query: 322 SQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
           +Q ++N   + A + +  P L   + ++A   +  F  Q ++ ++WA A+L
Sbjct: 518 AQALSNTMWSLARLGYYNPQLQDAMFRQALRRLSEFSPQGISNLIWAAATL 568



 Score = 48.1 bits (113), Expect = 0.015,   Method: Composition-based stats.
 Identities = 20/37 (54%), Positives = 26/37 (70%)

Query: 558  KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNV 594
            +VA E+DGPTHF+ NT  PL  T+ +RR + A GW V
Sbjct: 1026 RVAVEVDGPTHFTSNTRQPLSTTLYRRRCLEARGWVV 1062


>gi|351698645|gb|EHB01564.1| FAST kinase domain-containing protein 3 [Heterocephalus glaber]
          Length = 660

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 37/62 (59%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           V+K++A  IDGP  F  N+   LG   +K+R++   G+ VV + + E E L+   E ++Y
Sbjct: 587 VNKRIALCIDGPKRFCSNSSHLLGKEAIKQRHLRLLGYQVVQVPYHEMEMLKSRLELVEY 646

Query: 615 LR 616
           L+
Sbjct: 647 LQ 648


>gi|153209513|ref|ZP_01947418.1| conserved domain protein [Coxiella burnetii 'MSU Goat Q177']
 gi|212218771|ref|YP_002305558.1| hypothetical membrane-associated protein [Coxiella burnetii
           CbuK_Q154]
 gi|120575338|gb|EAX31962.1| conserved domain protein [Coxiella burnetii 'MSU Goat Q177']
 gi|212013033|gb|ACJ20413.1| hypothetical membrane-associated protein [Coxiella burnetii
           CbuK_Q154]
          Length = 435

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 66/301 (21%), Positives = 128/301 (42%), Gaps = 57/301 (18%)

Query: 281 CSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH 337
            + QGI+N  W L+ +     EL      DR+ +       +FNSQ++AN   A A+M  
Sbjct: 97  LNPQGIANTLWTLATMNVRRRELEVQGLSDRLLDAVYYNAEQFNSQDIANTLWALAAMGM 156

Query: 338 SAPDLFSE-LAKRASDIVH----TFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQ 392
              +L  + L+ R  D VH     F  Q +A  LWA A+                    +
Sbjct: 157 RWRELEEQGLSDRLLDAVHRNAQRFSPQGIANALWALAT-----------------TGMR 199

Query: 393 FTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFF 452
           +    N+ LSN   N  V+ S +  S   +++ + +     +  ++W Y    ++DR+  
Sbjct: 200 WRELENRELSNRLFN-AVQHSAERFSSQQIANTLWAL---AMMALSWGYLKEQRVDRLLL 255

Query: 453 SDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGK 512
           + I ++ ++F  +  ++     IM++++   +        P + L +S++         K
Sbjct: 256 NAIDQSANQFSLEESTQ-----IMWSTRWFDIRPP-----PEILLKISNM---------K 296

Query: 513 TKRFNQKVTSSFQKEVARLL---VSTGLNWIREYAV-DGYTVDAVLVDKKVAFEIDGPTH 568
             R     +S   + VA +L   ++  +    E+ + + + VD  +  K++  E+DGP H
Sbjct: 297 PPR-----SSDLHRHVASVLSAQINGEIPIENEFFIQNCFYVDICIPSKRLVIEVDGPYH 351

Query: 569 F 569
            
Sbjct: 352 I 352



 Score = 45.4 bits (106), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 52/200 (26%), Positives = 88/200 (44%), Gaps = 33/200 (16%)

Query: 205 DAQTAQEVLEV----------IAEMITAVGKGLSPSPLSPLNIATAL-HRIAKNMEKVSM 253
           D  T +E+LE           +A ++ A+    +   L P ++A  L   IAKN+E+++ 
Sbjct: 40  DYATIREILEARRHRRFNGQSVANLLLAIAYHHTQWRLLPRSLAAQLWDAIAKNVERLNP 99

Query: 254 MTTHRLAFT------RQREMS-------MLVAIAMTALPECSAQGISNIAWALSKIGGEL 300
                  +T      R+RE+        +L A+   A  + ++Q I+N  WAL+ +G   
Sbjct: 100 QGIANTLWTLATMNVRRRELEVQGLSDRLLDAVYYNA-EQFNSQDIANTLWALAAMGMRW 158

Query: 301 LYLSEM---DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFS-ELAKRASDIVH- 355
             L E    DR+ +        F+ Q +AN   A A+      +L + EL+ R  + V  
Sbjct: 159 RELEEQGLSDRLLDAVHRNAQRFSPQGIANALWALATTGMRWRELENRELSNRLFNAVQH 218

Query: 356 ---TFQEQELAQVLWAFASL 372
               F  Q++A  LWA A +
Sbjct: 219 SAERFSSQQIANTLWALAMM 238


>gi|83273444|ref|XP_729400.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23487122|gb|EAA20965.1| hypothetical protein [Plasmodium yoelii yoelii]
          Length = 1189

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 55/105 (52%), Gaps = 3/105 (2%)

Query: 518  QKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLVDKKVAFEIDGPTHF-SRNTGV 575
              V+SS  K+++  L    +    EY + D   VDA +    VA EIDGP+HF  R   +
Sbjct: 1077 HHVSSSVHKKISTDLKYLNVFHYNEYFILDSILVDAYIPHSMVAIEIDGPSHFIQRGESI 1136

Query: 576  PLG-HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
                +T+ K+R + A G+ VVS+S  E   +  +   +++++ IL
Sbjct: 1137 VYNPNTLFKKRLLRALGFVVVSISVTEHTFIFSALNTINFVKRIL 1181


>gi|70954340|ref|XP_746221.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56526762|emb|CAH76318.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 928

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 55/105 (52%), Gaps = 3/105 (2%)

Query: 518 QKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLVDKKVAFEIDGPTHF-SRNTGV 575
           Q ++SS  K+++  L    +    EY + D   VDA +     A EIDGP+HF  R   +
Sbjct: 816 QHISSSVHKKISNDLKYLNIFHYNEYFILDSILVDAYIPHAMTAIEIDGPSHFIQRGASI 875

Query: 576 PLG-HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
               +T+ K+R + A G+ VVS+S  +   +  +   +++++ IL
Sbjct: 876 VYNPNTLFKKRLLRALGFVVVSISITDHTFVFSALNTINFIKKIL 920


>gi|70937099|ref|XP_739403.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56516375|emb|CAH87459.1| hypothetical protein PC302475.00.0 [Plasmodium chabaudi chabaudi]
          Length = 226

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 23/97 (23%), Positives = 50/97 (51%)

Query: 519 KVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLG 578
           K ++ +  EV+R+L    +N +R   ++    D +L D  +     GP  +  N+ +   
Sbjct: 112 KYSARWITEVSRILTKINVNHLRNVYINNICADIMLPDSNIIIMCLGPYSYYVNSLLTTS 171

Query: 579 HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
            + LK+  +    +NV++L++ +W +L    EQ+++L
Sbjct: 172 ISDLKKNILEKKKYNVITLNYHDWNKLNDYEEQINFL 208


>gi|294877932|ref|XP_002768199.1| hypothetical protein Pmar_PMAR002989 [Perkinsus marinus ATCC 50983]
 gi|239870396|gb|EER00917.1| hypothetical protein Pmar_PMAR002989 [Perkinsus marinus ATCC 50983]
          Length = 400

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 45/95 (47%), Gaps = 2/95 (2%)

Query: 278 LPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH 337
           L EC+   +SN+   + K   E L  S +  + E  + +  E    +++ +A A A M  
Sbjct: 66  LRECTGDDLSNLCRCICK--AEYLCPSLLTSITEECMARSSELEPADISTIAWALAKMGF 123

Query: 338 SAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
            +  LF  LA+      H F    LA ++WAFAS+
Sbjct: 124 GSDVLFQRLARVVEVTTHLFSGAYLANLMWAFASV 158



 Score = 42.7 bits (99), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 53/109 (48%), Gaps = 21/109 (19%)

Query: 286 ISNIAWALSKIG-GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFS 344
           IS IAWAL+K+G G  +    + RV EV       F+   +AN+  AFAS+ + +  + +
Sbjct: 111 ISTIAWALAKMGFGSDVLFQRLARVVEVT---THLFSGAYLANLMWAFASVGYRSESMLA 167

Query: 345 ELAKRASDIVHTFQEQ-----------------ELAQVLWAFASLYEPA 376
            +A+R  +++    E                  E++ ++WA + L+ P+
Sbjct: 168 AVAERCQELMTVVLEPPGSTDVEVVDRMPLHPMEMSTLVWALSRLHAPS 216


>gi|212212260|ref|YP_002303196.1| hypothetical membrane-associated protein [Coxiella burnetii
           CbuG_Q212]
 gi|212010670|gb|ACJ18051.1| hypothetical membrane-associated protein [Coxiella burnetii
           CbuG_Q212]
          Length = 496

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 88/403 (21%), Positives = 161/403 (39%), Gaps = 66/403 (16%)

Query: 232 LSPLNIATALH-RIAKNMEKVSMMTTHRLAFT------RQREMS-------MLVAIAMTA 277
           L P ++A  L   IAKN+E+++        +T      R+RE+        +L A+   A
Sbjct: 12  LLPRSLAAQLWDAIAKNVERLNPQGIANTLWTLATMNVRRRELEVQGLSDRLLDAVYYNA 71

Query: 278 LPECSAQGISNIAWALSKIGGELLYLSEM---DRVAEVALTKVGEFNSQNVANVAGAFAS 334
             + ++Q I+N  WAL+ +G     L E    DR+ +        FN Q +AN   A  +
Sbjct: 72  -EQFNSQDIANTLWALAAMGMRWRELEEQGLSDRLLDAVRYDAERFNPQGIANTLWALVA 130

Query: 335 MQHSAPDL-FSELAKRASDIVHT----FQEQELAQVLWAFASL----YEPADPLLES--L 383
           M  +  +L   EL  R  D V +    F  Q++   LWA A++     E  D  L    L
Sbjct: 131 MGMTWGELEAQELNDRLLDAVGSNAPRFNSQDITNTLWALATMGMKWRELGDQRLRDRLL 190

Query: 384 DNAFKDATQFTC--CLNKALSNCNENGGVKSSGDADSE----GSLSSPVLSFNRDQLGNI 437
               ++A +F      N   +        +  GD        G++      FN   + N+
Sbjct: 191 GAVRRNAERFKPQGIANALWALATMGMKWRELGDQRLRDRLLGAVRRNAERFNPQGIANV 250

Query: 438 AWSYAVL----GQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHP 493
            W+ A +    G+++    ++      R+  +R S Q   + ++A  +  ++     E  
Sbjct: 251 LWALATMGMRWGELEAQRLNNCLLAAVRYNAERFSSQQIANTLWALAMMALSWGYLKEQR 310

Query: 494 HLQLALSSV--------LEEKIASAGKTKRFNQKV---------------TSSFQKEVAR 530
             +L L+++        LEE       T+ F+ +                +S   + VA 
Sbjct: 311 VDRLLLNAIDQSANQFSLEESTQIMWSTRWFDIRPPPEILLKISNMKPPRSSDLHRHVAS 370

Query: 531 LL---VSTGLNWIREYAV-DGYTVDAVLVDKKVAFEIDGPTHF 569
           +L   ++  +    E+ + + + VD  +  K++  E+DGP H 
Sbjct: 371 VLSAQINGEIPIENEFFIQNCFYVDICIPSKRLVIEVDGPYHI 413


>gi|440897891|gb|ELR49494.1| FAST kinase domain-containing protein 3 [Bos grunniens mutus]
          Length = 660

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 3/74 (4%)

Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
           DG+ +   +   V K+VA  IDGP  F  N+   LG    K+R++   G+ VV + + E 
Sbjct: 575 DGFVLPFTIDEDVHKRVALCIDGPKRFCLNSKHLLGKEATKQRHLRLLGYQVVQIPYYEI 634

Query: 603 EELQGSFEQLDYLR 616
           E L+   E +DYL+
Sbjct: 635 EMLKSRLELVDYLQ 648


>gi|397646275|gb|EJK77204.1| hypothetical protein THAOC_00981, partial [Thalassiosira oceanica]
          Length = 445

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 43/174 (24%), Positives = 83/174 (47%), Gaps = 32/174 (18%)

Query: 302 YLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHT 356
           ++   D +A   +  + +F++++++N+  +F  ++ + PD     LF+   K A  I+HT
Sbjct: 197 FMPIFDSIASSTVVMLDKFDARHLSNLIYSFGLVERN-PDIEGETLFNVFGKTAVKILHT 255

Query: 357 FQEQELAQVLWAF-------ASLYEPADPLLESLD-NAFKDATQFTCCLNKALSNCNENG 408
           F+ QEL+ +LWAF       + L++    ++  +D ++FK    F   L           
Sbjct: 256 FKPQELSNMLWAFVKVDAKNSRLFQETGGVISGMDLDSFK-PQDFAIIL----------W 304

Query: 409 GVKSSGDADSE--GSLSSPVLS-----FNRDQLGNIAWSYAVLGQMDRIFFSDI 455
               SG ADS+   +L + +++     F    + NI W+YA  G+     F +I
Sbjct: 305 SFAKSGKADSKLFQALGNHIVTRSLNDFWPQDVSNIVWAYATAGESHPELFKNI 358



 Score = 47.4 bits (111), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 273 IAMTALPECSAQGISNIAWALSKIGGEL--LYLSEMDRVAEVALTKVGEFNSQNVANVAG 330
           I   +L +   Q +SNI WA +  G     L+ +  +  AE+ +     FN QN++ +A 
Sbjct: 324 IVTRSLNDFWPQDVSNIVWAYATAGESHPELFKNIGNHAAELDMD---SFNPQNLSIIAW 380

Query: 331 AFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFAS 371
           AFAS     P+LF ++  R + +  +  F+ Q+L+   W+FA+
Sbjct: 381 AFASAGVPHPELFRKMGARVAGLKSLDLFKPQDLSNTAWSFAT 423



 Score = 42.7 bits (99), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 52/103 (50%), Gaps = 6/103 (5%)

Query: 284 QGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           Q  + I W+ +K G     L+ +  + +   +L    +F  Q+V+N+  A+A+   S P+
Sbjct: 297 QDFAIILWSFAKSGKADSKLFQALGNHIVTRSLN---DFWPQDVSNIVWAYATAGESHPE 353

Query: 342 LFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPADPLLESL 383
           LF  +   A+++ + +F  Q L+ + WAFAS   P   L   +
Sbjct: 354 LFKNIGNHAAELDMDSFNPQNLSIIAWAFASAGVPHPELFRKM 396


>gi|262205509|ref|NP_001019699.2| FAST kinase domain-containing protein 3 [Bos taurus]
 gi|145558912|sp|Q58CX2.2|FAKD3_BOVIN RecName: Full=FAST kinase domain-containing protein 3
 gi|296475672|tpg|DAA17787.1| TPA: FAST kinase domain-containing protein 3 [Bos taurus]
          Length = 660

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 3/74 (4%)

Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
           DG+ +   +   V K+VA  IDGP  F  N+   LG    K+R++   G+ VV + + E 
Sbjct: 575 DGFVLPFTIDEDVHKRVALCIDGPKRFCLNSKHLLGKEATKQRHLRLLGYQVVQIPYYEI 634

Query: 603 EELQGSFEQLDYLR 616
           E L+   E +DYL+
Sbjct: 635 EMLKSRLELVDYLQ 648


>gi|348511573|ref|XP_003443318.1| PREDICTED: FAST kinase domain-containing protein 3-like
           [Oreochromis niloticus]
          Length = 618

 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 82/350 (23%), Positives = 136/350 (38%), Gaps = 88/350 (25%)

Query: 339 APDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLN 398
           A  L   L+ RAS +   F++ E+ +VL A  +L +    L+ +++            L 
Sbjct: 254 AVSLVLRLSHRASRVFKAFRDDEIMKVLSALMTLGQHDGELVAAMEKH----------LT 303

Query: 399 KALSNCNEN--GGV-------KSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDR 449
             L  C+    G +       +   +   E    + V    +     IA     +G+++ 
Sbjct: 304 GRLEKCDPELIGAIMEYCLQMRCRSEPLFEAVAENFVRHAEKHTTLQIAKQIVAMGRLNY 363

Query: 450 I--FFSDIWKTISRFEEQRISE-QYRE--DIMFASQVHL----VNQCLKLEHPHL----- 495
           +    S ++K +     +R S+ Q R   D+M A  +HL    +N   K+  PH      
Sbjct: 364 LPQCSSQMFKKLESILSERFSQFQPRSLVDVMHAC-IHLERFPLNYMTKVFSPHFLQRLQ 422

Query: 496 ---------------QLALSSVLE---------------EKIASAGKTKRFNQKVTSSFQ 525
                          QL LS+ LE               ++ +SAG+   F   + S   
Sbjct: 423 AQGEPLDKNTLGQLTQLHLSTTLECTYYWGPRLPFFLHVKRFSSAGQA--FETPMESLLY 480

Query: 526 KEV----ARLLVSTGLNWIREYAVDGYTVDA-VLVD---------------KKVAFEIDG 565
           K+V    A LL   G  +       GYT+D  + +D               K+V   +DG
Sbjct: 481 KQVKGPLAHLL--GGTLYSTRMIHGGYTIDVEICLDEGGFVLPPSQWDHTYKRVVLCLDG 538

Query: 566 PTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
           P  F  NT   LG  + KRR++   G  +V + + E+E+LQ   EQ+ YL
Sbjct: 539 PNRFCTNTRHLLGKEVTKRRHLQRMGMELVEIPYFEFEKLQTEEEQIQYL 588


>gi|148705059|gb|EDL37006.1| FAST kinase domains 3, isoform CRA_a [Mus musculus]
          Length = 661

 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 39/74 (52%), Gaps = 3/74 (4%)

Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
           DG+ +   +   + K+VA  IDGP  F  ++   LG    K+R++   G+ VV L + E 
Sbjct: 576 DGFVLPCTVDEDIHKRVALCIDGPQRFCLDSKHLLGKEATKQRHLRLLGYQVVQLPYHEL 635

Query: 603 EELQGSFEQLDYLR 616
           E L    E +DYL+
Sbjct: 636 ELLTSRLELVDYLQ 649


>gi|338175904|ref|YP_004652714.1| hypothetical protein PUV_19100 [Parachlamydia acanthamoebae UV-7]
 gi|336480262|emb|CCB86860.1| putative uncharacterized protein [Parachlamydia acanthamoebae UV-7]
          Length = 565

 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 66/312 (21%), Positives = 119/312 (38%), Gaps = 51/312 (16%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQV 365
           ++++ EV L      N+  +  +A ++  +     DL  EL K     ++      L  +
Sbjct: 263 LEQLKEVFLKNATSLNADEIVRIAWSYHFLNCIHEDLLRELCKHLEPKINDLTNDGLINI 322

Query: 366 LWAFASL-------YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADS 418
              F SL              +E  D    +  QFT       SN +E       G   S
Sbjct: 323 TKIFISLNFIDKELLWKLLKKIE--DKVVDNPHQFTP------SNLSELTHAMLMGYCQS 374

Query: 419 EGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFA 478
           E                   ++  +L  +D IF  D     SR++  ++S          
Sbjct: 375 ED------------------YTTFILNMLDVIFQIDP----SRWKAHQLS---------- 402

Query: 479 SQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLN 538
            Q+H ++    L+    + A+   L+E+I    K  +  + ++S F   VA+ + +    
Sbjct: 403 -QIHTIHLIYTLKSKQ-EKAMPIPLQERIDIHLKGLKDKKPISSDFHLSVAKCIENILGK 460

Query: 539 WIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLS 598
             +E+ ++ Y VD     +K+  E+DGP HF +  G  L    +K   +   GW V+ +S
Sbjct: 461 SEKEFQIETYFVDIAYPARKLVIEVDGPAHFDQ-FGNYLQKNAVKEFVLKLLGWQVIRIS 519

Query: 599 HQEWEELQGSFE 610
            +EW   +  F 
Sbjct: 520 -KEWPGYEHIFH 530


>gi|397572795|gb|EJK48407.1| hypothetical protein THAOC_32795, partial [Thalassiosira oceanica]
          Length = 163

 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 42/156 (26%), Positives = 79/156 (50%), Gaps = 19/156 (12%)

Query: 304 SEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQ 358
           S  DR+A   +  + EF++++++N+  +F  ++ + PD     LF+   + A  I+HTF+
Sbjct: 13  SIFDRIASSTVGILNEFDARHLSNLIYSFGLVERN-PDIGGDTLFNVFGEAAVKILHTFK 71

Query: 359 EQELAQVLWAF-------ASLYEPADPLLESLD-NAFKDATQFTCCLNKALSNCNENGGV 410
            QEL+ +LWAF       + L++    ++  +D  +FK         + A S+  +    
Sbjct: 72  PQELSNMLWAFVKVDADNSRLFQETGRVISGMDLGSFKPQDFSNVLWSSAKSDEADPVLF 131

Query: 411 KSSGDADSE-GSLSSPVLSFNRDQLGNIAWSYAVLG 445
           ++ G+  +  GSL     SF   +L N AW++A  G
Sbjct: 132 QAIGNHIANMGSLD----SFKPQELSNTAWAFATAG 163


>gi|255070911|ref|XP_002507537.1| hypothetical protein MICPUN_55039 [Micromonas sp. RCC299]
 gi|226522812|gb|ACO68795.1| hypothetical protein MICPUN_55039 [Micromonas sp. RCC299]
          Length = 593

 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 61/259 (23%), Positives = 103/259 (39%), Gaps = 47/259 (18%)

Query: 198 NLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTH 257
           ++  D++DA + ++VL ++ +      K         +N +TALHRIA+        T  
Sbjct: 191 DIQGDLMDAASVEDVLLLVEKQGEIFNK---------VNTSTALHRIARIASTAPYATAG 241

Query: 258 R--------LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSE-MDR 308
                    L  TR      L+ +A     E S   +SN  WAL+++  ++  ++  +D 
Sbjct: 242 ANQQSPDAVLRITRDERFHHLLQLATALSKEMSIVSVSNTLWALARLRCDIHEMNTLLDD 301

Query: 309 VAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-LFSELAKRASDIVHTFQEQELAQVLW 367
           +A  A         +++A V  A A + H     L   +A R  D    F+  ++  +LW
Sbjct: 302 LAGRAAATAHNAQPKHLATVIWALAVLGHEPRSRLLRAVAMRVMDTAGDFRAPDVVNMLW 361

Query: 368 AFASLYEPADPLLESLDN--AFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSP 425
           A+A     A   L S D     KD  +   C+  AL+N  +                   
Sbjct: 362 AYARWTRLAP--LNSPDGLPGAKDVVKELSCV--ALANLTD------------------- 398

Query: 426 VLSFNRDQLGNIAWSYAVL 444
              F   Q  N++WS A+L
Sbjct: 399 ---FTPYQCANLSWSLAML 414


>gi|128485706|ref|NP_081399.3| FAST kinase domain-containing protein 3 [Mus musculus]
 gi|145558913|sp|Q8BSN9.2|FAKD3_MOUSE RecName: Full=FAST kinase domain-containing protein 3
 gi|26328905|dbj|BAC28191.1| unnamed protein product [Mus musculus]
          Length = 661

 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 39/74 (52%), Gaps = 3/74 (4%)

Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
           DG+ +   +   + K+VA  IDGP  F  ++   LG    K+R++   G+ VV L + E 
Sbjct: 576 DGFVLPCTVDEDIHKRVALCIDGPQRFCLDSKHLLGKEATKQRHLRLLGYQVVQLPYHEL 635

Query: 603 EELQGSFEQLDYLR 616
           E L    E +DYL+
Sbjct: 636 ELLTSRLELVDYLQ 649


>gi|282889813|ref|ZP_06298352.1| hypothetical protein pah_c004o212 [Parachlamydia acanthamoebae str.
           Hall's coccus]
 gi|281500387|gb|EFB42667.1| hypothetical protein pah_c004o212 [Parachlamydia acanthamoebae str.
           Hall's coccus]
          Length = 546

 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 66/310 (21%), Positives = 119/310 (38%), Gaps = 47/310 (15%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQV 365
           ++++ EV L      N+  +  +A ++  +     DL  EL K     ++      L  +
Sbjct: 262 LEQLKEVFLKNATSLNADEIVRIAWSYHFLNCIHEDLLRELCKHLEPKINDLTNDGLINI 321

Query: 366 LWAFASL-----YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEG 420
              F SL           L +  D    +  QFT       SN +E       G   SE 
Sbjct: 322 TKIFISLNFIDKKLLWKLLKKIEDKVVDNPHQFTP------SNLSELTHAMLMGYCQSED 375

Query: 421 SLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQ 480
                             ++  +L  +D IF  D     SR++  ++S           Q
Sbjct: 376 ------------------YTTFILNMLDVIFQIDP----SRWKAHQLS-----------Q 402

Query: 481 VHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWI 540
           +H ++    L+    + A+   L+E+I    K  +  + ++S F   VA+ + +      
Sbjct: 403 IHTIHLIYTLKSKQ-EKAMPIPLQERIDIHLKGLKDKKPISSDFHLSVAKCIENILGKSE 461

Query: 541 REYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQ 600
           +E+ ++ Y VD     +K+  E+DGP HF +  G  L    +K   +   GW V+ +S +
Sbjct: 462 KEFQIETYFVDIAYPARKLVIEVDGPAHFDQ-FGNYLQKNAVKEFVLKLLGWQVIRIS-K 519

Query: 601 EWEELQGSFE 610
           EW   +  F 
Sbjct: 520 EWPGYEHIFH 529


>gi|395833180|ref|XP_003789620.1| PREDICTED: LOW QUALITY PROTEIN: FAST kinase domain-containing
           protein 3 [Otolemur garnettii]
          Length = 679

 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 42/154 (27%), Positives = 68/154 (44%), Gaps = 33/154 (21%)

Query: 496 QLALSSVLE------EKIASAGKTKRFNQKVTS-------SFQKEV---------ARLLV 533
           QL L+S+LE       K+ S  + K F     S          K+V         ARL  
Sbjct: 492 QLYLTSILECPFYKGTKLLSKFQVKSFLTPCCSLETPMDFHLYKQVMFGLIDLLGARLYF 551

Query: 534 STGLNWIREYAVD--------GYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTML 582
           ++ +     Y +D        G+ + + +   V K+VA  IDGP  F  N+   LG   +
Sbjct: 552 ASKVLTPYCYTIDVEIKLDEEGFVLPSTVDEDVYKRVALCIDGPKRFCPNSNHLLGKEAI 611

Query: 583 KRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           K+R++   G+ VV + + E E L+   E ++YL+
Sbjct: 612 KQRHLQLIGYEVVQIPYHEVEMLKSRLELVEYLQ 645


>gi|189183794|ref|YP_001937579.1| repeat-containing protein A_04 [Orientia tsutsugamushi str. Ikeda]
 gi|189180565|dbj|BAG40345.1| repeat-containing protein A_04 [Orientia tsutsugamushi str. Ikeda]
          Length = 554

 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 79/391 (20%), Positives = 142/391 (36%), Gaps = 97/391 (24%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEV----ALTKVGEFNSQNVANVA 329
           A   +   + QG++N  WA  +     L +   D+  +     A   +  FN+Q +AN  
Sbjct: 126 ATKTIDNFNTQGLANSIWAFGR-----LEIHPSDQFIQAWIHHATKTIDNFNTQGLANSI 180

Query: 330 GAFASMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDNAF 387
            A   ++ H +          A+  +  F  Q LA  +WAF  L   P+D  +++     
Sbjct: 181 LALGQLEIHPSDQFIQAWIHHATKTIDNFNTQNLANSIWAFGQLEIHPSDQFIQAW---I 237

Query: 388 KDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQM 447
             AT       K + N                         FN   L N  W++  L   
Sbjct: 238 HHAT-------KTIDN-------------------------FNTQNLANSIWAFGQLEIH 265

Query: 448 DRIFFSDIW-----KTISRFEEQRISEQ----YREDIMFASQVHLVNQCLKLEHPHLQLA 498
               F   W     KTI  F  Q ++      +  +++  S++ +  Q +   + +++L 
Sbjct: 266 PSDQFIQAWIHHATKTIDNFSLQELANSIYGIFTLNVLCNSKIKVPQQFISAVNQNIEL- 324

Query: 499 LSSVLEEKIASAGKT------------------------KRFNQKVT----SSFQ----K 526
                +E I   G+                         K+F  K+T    S+ Q    K
Sbjct: 325 ----FDENIEDIGQILKAHYYFGKQGVGILTSQNRQLLEKKFKTKLTPCHTSNLQLNVLK 380

Query: 527 EVARLLVSTGLNWIREYAVDGYT--VDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKR 584
            V ++L    +    EY +   T  VD  + +K    ++DGP+HF  N   P   T L  
Sbjct: 381 VVKKVLAQHTVK--SEYHIKQITSSVDIFIKEKNTVIQVDGPSHFDDNNA-PNFSTRLNT 437

Query: 585 RYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
             + + G+ V  + +  W +L+ +  + +Y+
Sbjct: 438 ELLKSYGYIVHRIPYWVWNKLKTNIAKEEYI 468


>gi|156082057|ref|XP_001608521.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148801092|gb|EDL42497.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 446

 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 40/144 (27%), Positives = 65/144 (45%), Gaps = 3/144 (2%)

Query: 483 LVNQCLKLEHPHLQLA--LSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWI 540
           L N+ LK  H  L L   L    E+ I    + K  N    S  QK++ +LL   GL   
Sbjct: 304 LKNEELKQTHIALYLLRELGGDCEQAIDQIERKKIKNTLTVSKMQKQLEKLLKEMGLKAD 363

Query: 541 REYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQ 600
           RE+ V  Y +D  L  K+   E++G TH+    G     + LK   +    W V+++ + 
Sbjct: 364 REFPVGPYVLDFALQKKRTCIEVNGFTHYYTFGGELNAKSRLKYFILRRLHWKVLTVEYT 423

Query: 601 EWEELQGSFEQLDYLRVILKDYIG 624
            W+  +   ++++YL   +   IG
Sbjct: 424 SWKN-KSKEDKMEYLEETVLSRIG 446


>gi|221054031|ref|XP_002261763.1| RAP protein [Plasmodium knowlesi strain H]
 gi|193808223|emb|CAQ38926.1| RAP protein, putative [Plasmodium knowlesi strain H]
          Length = 449

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 57/124 (45%), Gaps = 8/124 (6%)

Query: 480 QVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNW 539
           QVH+V   L+      Q A++ + ++KI         N    S  QK++ +LL   GL  
Sbjct: 314 QVHIVLYLLRELGGDYQQAINMIEKKKIK--------NTLTVSKMQKQLEKLLKEMGLKA 365

Query: 540 IREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSH 599
            RE+ +  Y +D  L  K+   E++G TH+    G     + LK   +    W V+++ +
Sbjct: 366 EREFPMGPYVLDFALQKKRTCIEVNGFTHYYTFGGELNAKSRLKYYILRRLNWKVLTVEY 425

Query: 600 QEWE 603
             W+
Sbjct: 426 TSWK 429


>gi|124506281|ref|XP_001351738.1| RAP protein, putative [Plasmodium falciparum 3D7]
 gi|23504667|emb|CAD51545.1| RAP protein, putative [Plasmodium falciparum 3D7]
          Length = 1379

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 53/265 (20%), Positives = 119/265 (44%), Gaps = 36/265 (13%)

Query: 360  QELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSE 419
            Q +A +LW+ + L   +        N F+D   + C  NK    C         G   + 
Sbjct: 1137 QSIANILWSLSILNVYSR-------NVFEDGL-YEC--NKRFIKC---------GKKKNT 1177

Query: 420  GSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFAS 479
              + + +   ++ QL   A+SY        ++  +  K I++  + +  E Y+ DI+  +
Sbjct: 1178 TKVKNFISQLHQSQLYQAAFSYC-------LYLLNNQKHINKLLKNK--ENYKSDIIINN 1228

Query: 480  QVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQK--VTSSFQKEVARLLVSTGL 537
             +      +  ++  + + + ++ ++++A   + +R  QK  ++SS  K+++  L    +
Sbjct: 1229 DIKKKIHAIFEKYFKVSINVLNIWKKQLA---RNQRKEQKTHISSSVHKKISNDLRRLNI 1285

Query: 538  NWIREYAV-DGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPL--GHTMLKRRYIAAAGWNV 594
                EY + D   VD  +   K+  EIDGP HF +   +     +T+ K+R + A G+ V
Sbjct: 1286 FHYNEYFILDSILVDIFIPHSKIVIEIDGPNHFFQKGEMIFYKSNTLFKKRLLRALGYTV 1345

Query: 595  VSLSHQEWEELQGSFEQLDYLRVIL 619
            +S+   ++  +  + + + + + +L
Sbjct: 1346 ISVPISDYTFMFSALDTMHFTKRLL 1370



 Score = 39.7 bits (91), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 14/50 (28%), Positives = 34/50 (68%)

Query: 323 QNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
           ++VAN+A A + + +  PD++  + K+  + ++ F+ QE++ ++W+F S+
Sbjct: 535 KHVANIAWASSVLSNKDPDIWKYIKKQFYENINNFKAQEISIIIWSFGSI 584


>gi|354487325|ref|XP_003505824.1| PREDICTED: FAST kinase domain-containing protein 3-like [Cricetulus
           griseus]
 gi|344245962|gb|EGW02066.1| FAST kinase domain-containing protein 3 [Cricetulus griseus]
          Length = 660

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 46/177 (25%), Positives = 78/177 (44%), Gaps = 22/177 (12%)

Query: 444 LGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLAL-SSV 502
           L Q+ ++F + + +  + ++  ++  QY            +  C  LE P L L L  SV
Sbjct: 490 LAQVTQLFMTSVLEC-AFYKGPKLLPQYHVK-------SFLTPCCSLETP-LDLHLYKSV 540

Query: 503 LEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVL---VDKKV 559
           +   I   G    F  KV + +   +    V   L+       DG+ +   +   V K+V
Sbjct: 541 VTGLIDLLGSRLYFASKVLTPYCYTID---VEIKLDE------DGFVLPFTVEEDVHKRV 591

Query: 560 AFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           A  IDGP  F  +T   LG   +K+R++   G+ VV + + E E L    E ++YL+
Sbjct: 592 ALCIDGPQRFCADTKHLLGKEAIKQRHLRLLGYQVVQVPYHELELLTSRLELVEYLQ 648


>gi|159471540|ref|XP_001693914.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158277081|gb|EDP02850.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 702

 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 42/134 (31%), Positives = 63/134 (47%), Gaps = 26/134 (19%)

Query: 260 AFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGE----LLYLSEMDRVAEVALT 315
           A TRQR            L E SAQ +SN AWAL+++G      L     +  VAE +  
Sbjct: 67  ALTRQR------------LAEYSAQALSNTAWALARLGAAPPPGLRGGGWLGAVAEASQP 114

Query: 316 KVGEFNSQNVANVAGAFASMQHSAP-----DLFSELAKRASDIVHTFQEQELAQVLWAFA 370
            +  F++Q + N+  A A  +H  P          LA+RA  +    + Q+++ V W+ A
Sbjct: 115 LLPVFHTQELCNLLWAMAVCRHRPPARWLVAALGLLAERAEGL----EPQDVSNVCWSLA 170

Query: 371 SL-YEPADPLLESL 383
           +L   P  PLL+ L
Sbjct: 171 ALRVRPGVPLLQRL 184


>gi|397635539|gb|EJK71902.1| hypothetical protein THAOC_06615, partial [Thalassiosira oceanica]
          Length = 172

 Score = 48.1 bits (113), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 41/130 (31%), Positives = 64/130 (49%), Gaps = 10/130 (7%)

Query: 497 LALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV 555
           + L   L  K  +A  ++ F++   S  Q +V   L + G++   E  +  GY +DA++ 
Sbjct: 41  IELPESLRAKCRNAFTSQGFSE---SKLQNDVVGELRAAGVDLEEEVLLGSGYRIDALVK 97

Query: 556 ---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQEWEELQGSFEQ 611
               ++VA E+DGP HF      P G T+LK R +       VVS+ + EW+EL  S  +
Sbjct: 98  VGDGREVAVEVDGPFHFIDRR--PAGSTILKHRQVTRLDRIGVVSVPYWEWDELMNSEMK 155

Query: 612 LDYLRVILKD 621
             YL   L D
Sbjct: 156 QHYLLAKLPD 165


>gi|428171424|gb|EKX40341.1| hypothetical protein GUITHDRAFT_154162 [Guillardia theta CCMP2712]
          Length = 102

 Score = 47.8 bits (112), Expect = 0.017,   Method: Composition-based stats.
 Identities = 28/77 (36%), Positives = 45/77 (58%), Gaps = 8/77 (10%)

Query: 547 GYTVDAVL-----VDKK--VAFEIDGPTHFSRNTGVPL-GHTMLKRRYIAAAGWNVVSLS 598
           GY++D V+     VD++  +A E+DGP H+ R     L G T +K R++   GW VV++ 
Sbjct: 15  GYSIDIVIRSGEGVDEEHPIAVEVDGPGHYMRPGLRELVGGTKMKTRHLCRLGWKVVAIP 74

Query: 599 HQEWEELQGSFEQLDYL 615
           + EW E + + E+  YL
Sbjct: 75  YWEWNEARDAGEEERYL 91


>gi|403282217|ref|XP_003932552.1| PREDICTED: FAST kinase domain-containing protein 3 [Saimiri
           boliviensis boliviensis]
          Length = 659

 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 35/62 (56%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           V K++A  IDGP  F  N+   LG   +K+R++   G+ VV + + E E L    E ++Y
Sbjct: 587 VHKRIALCIDGPQRFCSNSKHLLGKEAIKQRHLRLLGYQVVQMPYHEMEMLTTRLEVVEY 646

Query: 615 LR 616
           L+
Sbjct: 647 LQ 648


>gi|148284481|ref|YP_001248571.1| RNA-binding protein [Orientia tsutsugamushi str. Boryong]
 gi|146739920|emb|CAM79915.1| putative RNA-binding protein [Orientia tsutsugamushi str. Boryong]
          Length = 540

 Score = 47.8 bits (112), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 81/393 (20%), Positives = 161/393 (40%), Gaps = 63/393 (16%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEV--ALTKVGEFNSQNVANVAGA 331
           A   +   + QG++N  WA  ++G   ++ S+    A +  A   +  FN+Q +AN   A
Sbjct: 73  ATKTIDNFNTQGLANSIWAFGRLG---IHPSDQFIKAWIHHATKTIDNFNTQGLANSIWA 129

Query: 332 FASMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLES-LDNAFK 388
              ++ H +          A+  +  F  Q LA  + AF  L   P+D  +++ + +A K
Sbjct: 130 LGRLEIHPSDQFIKAWIHHATKTIDNFNTQNLANSVLAFGRLEIHPSDQFIKAWIHHATK 189

Query: 389 DATQFTCCLNKALSNCNENGGVKSSGDADSE-----GSLSSPVLSFNRDQLGNIAWSYAV 443
               F     + L+N     G      +D          +  + +FN   L N  W+   
Sbjct: 190 TIDNFNT---QNLANSVLAFGRLEIHPSDQFIKAWIHHATKTIDNFNTQGLANSIWA--- 243

Query: 444 LGQMD--------RIFFSDIWKTISRFEEQRISEQ----YREDIMFASQVHLVNQCLKLE 491
           LGQ++        + +     KTI  F  Q ++      +  +++  S++ +  Q +   
Sbjct: 244 LGQLEIHPSDQFIKAWIHHATKTIDNFSLQELANSIYGIFTLNVLCNSKIKVPQQFISAV 303

Query: 492 HPHLQL------ALSSVLEEK----------IASAGKT---KRFNQKV----TSSFQ--- 525
           + +++L       +S +L+            + S  +    K+F  K+    TS+ Q   
Sbjct: 304 NQNIELFDENNECISQILKAHYYFGKQGVGILTSQNRQLLEKKFKTKLTPCHTSNLQLNV 363

Query: 526 -KEVARLLVSTGLNWIREYAVDGYT--VDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTML 582
            K V ++L    +    E+ +   T  VD  + +K +  ++DGP+HF  N   P   T L
Sbjct: 364 LKVVKKVLAQHTVK--SEHYIKQITSSVDIFIKEKNIVIQVDGPSHFDDNNA-PNFSTRL 420

Query: 583 KRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
               + + G+ V  + +  W +L+ +  + +Y+
Sbjct: 421 NTELLKSYGYIVHRIPYWVWNKLKTNIAKEEYI 453



 Score = 40.8 bits (94), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 28/109 (25%), Positives = 52/109 (47%), Gaps = 7/109 (6%)

Query: 278 LPECSAQGISNIAWALSKIGGELLYLSE--MDRVAEVALTKVGEFNSQNVANVAGAFASM 335
           + E + Q ++N  WAL ++    ++ S+  ++     A   +  FN+QN+AN   AF  +
Sbjct: 1   MDEFNPQELANSIWALGRLE---IHPSDQFINAWIHHATKTIDNFNTQNLANSIWAFGRL 57

Query: 336 Q-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLES 382
             H +    +     A+  +  F  Q LA  +WAF  L   P+D  +++
Sbjct: 58  GIHPSDQFINAWIHHATKTIDNFNTQGLANSIWAFGRLGIHPSDQFIKA 106


>gi|432104650|gb|ELK31262.1| FAST kinase domain-containing protein 3 [Myotis davidii]
          Length = 477

 Score = 47.4 bits (111), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 36/62 (58%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           V K+VA  IDGP  F  N+   LG   +K+R++   G+ VV + + + E L+   E ++Y
Sbjct: 404 VHKRVALCIDGPKRFCLNSKHLLGKEAIKQRHLRLLGYQVVQIPYYDIETLKSKLELVEY 463

Query: 615 LR 616
           L+
Sbjct: 464 LQ 465


>gi|428673456|gb|EKX74369.1| conserved hypothetical protein [Babesia equi]
          Length = 414

 Score = 47.4 bits (111), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 41/82 (50%)

Query: 522 SSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTM 581
           S  Q++V RLL    L    E  +  Y +D V+   KVA E++G THF   +      T 
Sbjct: 313 SKMQEKVGRLLDELKLKHESEVMLGPYRLDFVIPKLKVAIEVNGYTHFFHRSEQLNATTE 372

Query: 582 LKRRYIAAAGWNVVSLSHQEWE 603
           LK + I   GW V  L++ +W+
Sbjct: 373 LKYKIIEDLGWKVFGLNYYDWK 394


>gi|145340688|ref|XP_001415452.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144575675|gb|ABO93744.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 528

 Score = 47.0 bits (110), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 58/115 (50%), Gaps = 9/115 (7%)

Query: 280 ECSAQGISNIAWALSKIGGELLYLSE-MDRVAEVALTKVGEFNSQNVANVAGAFASMQHS 338
           E SAQ I+  A A++K+G   +Y S+ M    + A  +  EF  +++A +A +FA +   
Sbjct: 216 ESSAQQIATSAHAMAKLG---IYNSQIMKAYKDHAAARRDEFQPRDIAFLAWSFAKLDIK 272

Query: 339 APDLFSELAKRASDIV-----HTFQEQELAQVLWAFASLYEPADPLLESLDNAFK 388
           AP+LF   +    +++      TF    L  VLW+FA L E    +L  +  A K
Sbjct: 273 APELFEMFSAVVCEMLFDVEFQTFSPHHLTMVLWSFAMLNENTQEVLPYIVRAMK 327


>gi|294953994|ref|XP_002787986.1| hypothetical protein Pmar_PMAR012092 [Perkinsus marinus ATCC 50983]
 gi|239903121|gb|EER19782.1| hypothetical protein Pmar_PMAR012092 [Perkinsus marinus ATCC 50983]
          Length = 768

 Score = 47.0 bits (110), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 88/374 (23%), Positives = 151/374 (40%), Gaps = 67/374 (17%)

Query: 280 ECSAQGISNIAWALSKIGG-ELLYLSEMDRVAEVALTKVGE-FNSQNVANVAGAFASMQH 337
           E ++Q  + +AWAL ++ G E   + +M R+A+    + G+ F ++++  +  A A  + 
Sbjct: 438 EMTSQHAATVAWALWRMRGMEANSVHDMARIAD----QHGDAFANRHLITLTRAAAGAKF 493

Query: 338 SAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCL 397
             P L   + +R    V ++   +  Q+LW  A+       +LE      + A  +T   
Sbjct: 494 YHPSLLDAILRRP---VSSWTADQCGQLLWVLATWGVRNPRMLEYAMQCEEIARAYT--- 547

Query: 398 NKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWK 457
                         + G AD              D+L  I W+ A+L        S +W 
Sbjct: 548 --------------ADGGAD-----------LGMDKLTTIEWATALLDLPSPPRGSYLWD 582

Query: 458 -------------TISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLE 504
                        T+S+   QR+S+      M  +Q++     L+ +     L  S VL 
Sbjct: 583 KEREYIEGQAADLTVSQVLRQRLSDTQSFSDMGLTQLYWA-WVLRYDEGCGDLPPSWVL- 640

Query: 505 EKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVD--GYTVDAVLVDKKVAFE 562
            K+ S            SS QK V   L     +W +EY +   G ++D     +K+A E
Sbjct: 641 -KVRSWLSDAASYSLQPSSLQKTVHSHLPQG--DWRQEYLLPPWGISIDIASPSRKIAIE 697

Query: 563 IDGPTHFSRNTGVPLGHTM------LKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           +DG   F     V  G T+      +K+R +   GW V+ +S QE+  + G  +Q  +L 
Sbjct: 698 VDGKL-FHSVYDVATGQTLSDASATVKQRLLTRQGWRVLRVSEQEF--MAGDSDQRAHLA 754

Query: 617 VILKDYIGGEGSSN 630
             L   + G+G SN
Sbjct: 755 TALAR-MEGDGKSN 767


>gi|403374846|gb|EJY87385.1| hypothetical protein OXYTRI_03886 [Oxytricha trifallax]
          Length = 577

 Score = 47.0 bits (110), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 32/59 (54%)

Query: 536 GLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNV 594
           GL  ++EY V  Y +D  L + K+A EIDG  H+S N G        + R+I A G ++
Sbjct: 475 GLQILQEYEVGPYYLDIFLPELKLAIEIDGAHHYSNNKGDQFSKFKARDRFIKAHGLHI 533


>gi|429327420|gb|AFZ79180.1| hypothetical protein BEWA_020260 [Babesia equi]
          Length = 593

 Score = 47.0 bits (110), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 58/287 (20%), Positives = 122/287 (42%), Gaps = 19/287 (6%)

Query: 324 NVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESL 383
           +++ +  +FA ++       + +  + +  + +FQ+Q +AQ+++A   L      + ES+
Sbjct: 291 SISCLLHSFAKLKFRPKSDITSILSQITKSIFSFQDQNVAQIVYALGQLGLHCRDVFESI 350

Query: 384 DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEG-----SLSSPVLS-FNRDQLGNI 437
               +   ++    + A+       G    G  D E      + S  +L+ F   QL ++
Sbjct: 351 STFIQSRIEYQSPQHLAMFM----QGYARVGIYDKETVKVIMNHSMELLTGFTLSQLVSL 406

Query: 438 AWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQL 497
                +LG  ++  F+   + ++RF   R S+   + I+  +Q++ +  C++LEH     
Sbjct: 407 MDGALILGHFEQDKFT---RFLTRFTSIR-SDNIPDHIL--NQLNRIMYCIRLEHQSFVT 460

Query: 498 ALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDK 557
                ++  I        F  K   S+ + +   L  T   ++    +  Y VD VL+  
Sbjct: 461 TSEYFMQNLINQYQGA--FMIKPLQSYNQALYECLKETDSEYVLNKKIGLYNVD-VLLQN 517

Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEE 604
             + E+         TG  LG   LK+R+I   G+  + ++ +EW E
Sbjct: 518 NTSVELLSQGSVCPLTGSALGAVQLKKRHIELLGYKHIQINRREWFE 564


>gi|165924154|ref|ZP_02219986.1| conserved domain protein [Coxiella burnetii Q321]
 gi|165916403|gb|EDR35007.1| conserved domain protein [Coxiella burnetii Q321]
          Length = 435

 Score = 46.6 bits (109), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 65/301 (21%), Positives = 127/301 (42%), Gaps = 57/301 (18%)

Query: 281 CSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH 337
            + QGI+N  W L+ +     EL      DR+ +       +FNSQ++AN   A A+M  
Sbjct: 97  LNPQGIANTLWTLATMNVRRRELEVQGLSDRLLDAVYYNAEQFNSQDIANTLWALAAMGM 156

Query: 338 SAPDLFSE-LAKRASDIVH----TFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQ 392
              +L  + L+ R  D VH     F  Q +A  LWA A+                    +
Sbjct: 157 RWRELEEQGLSDRLLDAVHRNAQRFSPQGIANALWALAT-----------------TGMR 199

Query: 393 FTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFF 452
           +     + LSN   N  V+ S +  S   +++ + +     +  ++W Y    ++DR+  
Sbjct: 200 WRELETRELSNRLFN-AVQHSAERFSSQQIANTLWAL---AMMALSWGYLKEQRVDRLLL 255

Query: 453 SDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGK 512
           + I ++ ++F  +  ++     IM++++   +        P + L +S++         K
Sbjct: 256 NAIDQSANQFSLEESTQ-----IMWSTRWFDIRPP-----PEILLKISNM---------K 296

Query: 513 TKRFNQKVTSSFQKEVARLL---VSTGLNWIREYAV-DGYTVDAVLVDKKVAFEIDGPTH 568
             R     +S   + VA +L   ++  +    E+ + + + VD  +  K++  E+DGP H
Sbjct: 297 PPR-----SSDLHRHVASVLSAQINGEIPIENEFFIQNCFYVDICIPSKRLVIEVDGPYH 351

Query: 569 F 569
            
Sbjct: 352 I 352



 Score = 45.4 bits (106), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 52/200 (26%), Positives = 88/200 (44%), Gaps = 33/200 (16%)

Query: 205 DAQTAQEVLEV----------IAEMITAVGKGLSPSPLSPLNIATAL-HRIAKNMEKVSM 253
           D  T +E+LE           +A ++ A+    +   L P ++A  L   IAKN+E+++ 
Sbjct: 40  DYATIREILEARRHRRFNGQSVANLLLAIAYHHTQWRLLPRSLAAQLWDAIAKNVERLNP 99

Query: 254 MTTHRLAFT------RQREMS-------MLVAIAMTALPECSAQGISNIAWALSKIGGEL 300
                  +T      R+RE+        +L A+   A  + ++Q I+N  WAL+ +G   
Sbjct: 100 QGIANTLWTLATMNVRRRELEVQGLSDRLLDAVYYNA-EQFNSQDIANTLWALAAMGMRW 158

Query: 301 LYLSEM---DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFS-ELAKRASDIVH- 355
             L E    DR+ +        F+ Q +AN   A A+      +L + EL+ R  + V  
Sbjct: 159 RELEEQGLSDRLLDAVHRNAQRFSPQGIANALWALATTGMRWRELETRELSNRLFNAVQH 218

Query: 356 ---TFQEQELAQVLWAFASL 372
               F  Q++A  LWA A +
Sbjct: 219 SAERFSSQQIANTLWALAMM 238


>gi|71657249|ref|XP_817143.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70882315|gb|EAN95292.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 220

 Score = 46.6 bits (109), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 30/121 (24%), Positives = 57/121 (47%), Gaps = 15/121 (12%)

Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           + + I+N+ +A +K+G  L +     R+A+ A+   GEF   +VA +  A+A ++     
Sbjct: 33  TPKDITNVVYAYAKVG--LWHYKLFVRLADRAIQLRGEFRCDHVARLLEAYARVEMRYEK 90

Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
           LF E + R   I H     E+ +++ A+A +  P             D   F  C ++A+
Sbjct: 91  LFVEFSPRIQTIAHLLTAGEVTKIVSAYAKVRIP-------------DVGVFNACGDRAV 137

Query: 402 S 402
           +
Sbjct: 138 T 138


>gi|291411172|ref|XP_002721863.1| PREDICTED: FAST kinase domain-containing protein 3-like
           [Oryctolagus cuniculus]
          Length = 660

 Score = 46.6 bits (109), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 35/62 (56%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           V K+VA  IDGP  F  N    LG   +K+R++   G+ VV + + E E L+   E ++Y
Sbjct: 587 VYKRVALCIDGPQRFCSNGKHLLGKEAIKQRHLQLLGYQVVQVPYHEIEVLKSRLELVEY 646

Query: 615 LR 616
           L+
Sbjct: 647 LQ 648


>gi|68074247|ref|XP_679038.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56499680|emb|CAH93735.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 830

 Score = 46.6 bits (109), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 37/182 (20%), Positives = 76/182 (41%), Gaps = 36/182 (19%)

Query: 476 MFASQVHLVNQCLKLEH-PHLQLALSSVLEEKIASA-GKTKRFNQKVTSSFQKEVARLLV 533
           ++ +Q+ ++   L+ +H P++   + +   E +     K K     + S  QKEV  +L+
Sbjct: 586 IYLNQLKIIELSLRTQHVPNVYNKIDTECYEYMNYIKNKEKEIEYNIKSDLQKEVKHILL 645

Query: 534 STGLNWIREYAVDGYTVDAVLVDK----------------------------------KV 559
           +  L  + E ++  Y VD V  D+                                  K+
Sbjct: 646 TFNLTPLEEVSIGPYNVDFVEKDQTFQNICKNEIYYKDQSNNYTKIISSNKKINENIGKI 705

Query: 560 AFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
             E++G  HF RNT      + LK + ++  G+ V+++ + +W  L+    +  Y++ I+
Sbjct: 706 IIEVNGEHHFYRNTKSYTSFSKLKHKLLSDLGYIVINIPYFDWAILKTDLNKKSYIKKII 765

Query: 620 KD 621
            D
Sbjct: 766 ND 767


>gi|221061135|ref|XP_002262137.1| RAP protein [Plasmodium knowlesi strain H]
 gi|193811287|emb|CAQ42015.1| RAP protein, putative [Plasmodium knowlesi strain H]
          Length = 958

 Score = 46.6 bits (109), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 30/125 (24%), Positives = 57/125 (45%), Gaps = 3/125 (2%)

Query: 517 NQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVP 576
           N K  + +  E++++L    +N ++   ++    D +L D +V     GP  +  N+ V 
Sbjct: 728 NMKYGARWINELSKILARINVNHLKNIYINHICADIMLPDSQVIIMCLGPYSYYVNSLVT 787

Query: 577 LGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKD---YIGGEGSSNIAE 633
              + LKR  +    + V+ LS+ EW +L    E++ +L    +D   Y+       +AE
Sbjct: 788 TSTSDLKRFILEKKKYKVIPLSYHEWNKLNDYEEKIRFLYAFGRDAANYLFVNAKKGVAE 847

Query: 634 TLKMD 638
             K D
Sbjct: 848 GEKSD 852


>gi|154332667|ref|XP_001562150.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134059598|emb|CAM37182.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 442

 Score = 46.2 bits (108), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 51/95 (53%), Gaps = 2/95 (2%)

Query: 283 AQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
           A+G++NI  A SK G     L  +  +    L +VGEF + ++  +A AFA +++   ++
Sbjct: 110 AKGVTNIISAFSKTGINHEKLFRLLSMRVQTLARVGEFEAAHLVILANAFARLRYREQNV 169

Query: 343 FSELAKRASDIVHTFQEQELAQVLWAF--ASLYEP 375
           FS +A+RA  +       EL  ++ AF  A L +P
Sbjct: 170 FSAIARRAMSLRERVTVNELVPLINAFSKAGLKDP 204


>gi|397609733|gb|EJK60493.1| hypothetical protein THAOC_19142, partial [Thalassiosira oceanica]
          Length = 500

 Score = 46.2 bits (108), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 47/193 (24%), Positives = 84/193 (43%), Gaps = 20/193 (10%)

Query: 269 MLVAIAMTALP---ECSAQGISNIAWALSKIGGELLYLSE---MDRVAEVALTKVGEFNS 322
           +  ++ + ALP   E  A+ +SN+ ++   +     +  +    D +A  A+ K+  FN 
Sbjct: 311 LFGSVEIAALPILGEFDARYLSNLIYSFGLVKYNPTFEDKTKLFDALASTAIDKLAVFNG 370

Query: 323 QNVANVAGAFASMQHSAPDLFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPADP--- 378
           Q+++N+  AF  +      LF +  +    + +  F EQ LA +LW+FA   E ADP   
Sbjct: 371 QDISNMLLAFVYVDSKNSMLFQKTGEALLKLYLGDFTEQALANILWSFAKSGE-ADPELF 429

Query: 379 ------LLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRD 432
                 ++E + + F+         + A    +  G  K  GD  +      P   F+  
Sbjct: 430 QALGDHIVERILDDFRPQHLSNIVWSYATGGVSHPGLFKKIGDHVAGLKSLDP---FDPQ 486

Query: 433 QLGNIAWSYAVLG 445
            L N AW++A  G
Sbjct: 487 SLSNTAWAFATAG 499


>gi|189184538|ref|YP_001938323.1| repeat-containing protein A_05 [Orientia tsutsugamushi str. Ikeda]
 gi|189181309|dbj|BAG41089.1| repeat-containing protein A_05 [Orientia tsutsugamushi str. Ikeda]
          Length = 589

 Score = 46.2 bits (108), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 51/208 (24%), Positives = 89/208 (42%), Gaps = 17/208 (8%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
           A+  + E ++QG++N  WA  ++  +    S +D     A+  + EFNSQ+++N    F 
Sbjct: 88  AINLMDEFNSQGVTNSLWAFGRLKIQ-PQASFIDAWTNQAINLMDEFNSQDLSNSIWGFG 146

Query: 334 SMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDN-AFKDA 390
            ++             +A+  +  F  QELA  LWA   L   P    +++  N A K  
Sbjct: 147 WLEIQPQASFIDAWTNQATKTIGKFNPQELANSLWALGRLEIHPQALFIDAWTNQATKTI 206

Query: 391 TQFTCCLNKALSNCNEN-GGVKSSGDADSEGSLSSPVLS----FNRDQLGNIAWSYAVLG 445
            QF    ++ LSN     G ++    A    + ++  ++    FN   L N  W +  L 
Sbjct: 207 DQFN---HQNLSNSIWALGRLEIQPQASFIEAWTNQAINLMDEFNSQDLSNSIWGFGRLK 263

Query: 446 QMDRIFFSDIW-----KTISRFEEQRIS 468
              +  F + W     KTI +F  Q ++
Sbjct: 264 IQPQASFIEAWIHQATKTIDKFNSQDLA 291


>gi|363735806|ref|XP_421951.2| PREDICTED: FAST kinase domain-containing protein 2 [Gallus gallus]
          Length = 677

 Score = 46.2 bits (108), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 74/333 (22%), Positives = 138/333 (41%), Gaps = 46/333 (13%)

Query: 331 AFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESL------ 383
           A  S+Q+    LFS +A   + IV    ++++   L AF +L ++P++ L+  L      
Sbjct: 356 ACHSLQYRNIKLFSAVADYVNSIVCLLDKRQIILFLSAFETLGFQPSE-LMGVLAEKVTE 414

Query: 384 DNAFKDATQFTCCLNKALSNCNE-NGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYA 442
           D+ F D   F   L +  S  N    G            L+  +   +  +L    +S  
Sbjct: 415 DSEFLDLKSFLIVL-RVYSRLNYVPRGQHLLFYETLHSCLNKYLPQISNAELLKAVYSLC 473

Query: 443 VLGQMDRIFFSDIWKTISRFEEQRISEQYRE--DIMFASQVHLVNQCLKLEHPH------ 494
           +LG +  +  + + K  S FEE    + Y+E  ++M    +H V  C++L+ P       
Sbjct: 474 ILGYLPHLALNQLLKKDS-FEELMSGDLYKEKREMM----LHCVRTCMELDSPSFMKPAF 528

Query: 495 ---------LQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEV--ARLLVSTGLNWIREY 543
                    + + L    E  +   G    F Q V   ++  +     + S     +   
Sbjct: 529 VPTEIFSSLVSVTLRKAREALLELLGDENMFRQNVQLPYEYRIDFEIWMDSDTKKVLPIT 588

Query: 544 AVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWE 603
           A D Y   +V   +++AF    P+ F   T  P G   +K+R+++  G++V+ + +++++
Sbjct: 589 ATDSYADRSV---QRLAFLFVPPSAFCLGTTHPQGKLAMKKRHLSKLGYHVIPVLNKKFQ 645

Query: 604 EL--QGSFEQLDYLRVILKDYIGGEGSSNIAET 634
           EL  +G+ E        LK  I  E  S  +E 
Sbjct: 646 ELTNEGAIE-------FLKGKIYSENVSPFSEV 671


>gi|124512480|ref|XP_001349373.1| RAP protein, putative [Plasmodium falciparum 3D7]
 gi|23499142|emb|CAD51222.1| RAP protein, putative [Plasmodium falciparum 3D7]
          Length = 975

 Score = 45.8 bits (107), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 26/113 (23%), Positives = 55/113 (48%), Gaps = 6/113 (5%)

Query: 519 KVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLG 578
           K ++ +  E++R+L    ++ IR   ++    D +L    V  +  GP  +  N+ V   
Sbjct: 750 KYSARWINELSRILTKMNVDHIRNVYINNICTDIMLTSTNVIIKCLGPYSYYINSLVTTS 809

Query: 579 HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIGGEGSSNI 631
            + LK + + +  + V++LS+ +W +L    E++ +L      Y  G  ++NI
Sbjct: 810 ISDLKLKILESKKYKVINLSYHDWNKLNDYEEKIKFL------YSFGRHAANI 856


>gi|156088385|ref|XP_001611599.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154798853|gb|EDO08031.1| hypothetical protein BBOV_III004680 [Babesia bovis]
          Length = 371

 Score = 45.8 bits (107), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 52/95 (54%), Gaps = 18/95 (18%)

Query: 516 FNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVD-------KKVAFEIDGPTH 568
           +N    S+ Q+ V+ +LV  G+     + V+  T D + +D       +++A E+DGP H
Sbjct: 242 YNDSKMSTSQRYVSDVLVRLGI----PHKVELLTPDLLSIDIAIEGGGERIALEVDGPLH 297

Query: 569 FSR-----NTGVPL--GHTMLKRRYIAAAGWNVVS 596
           F+R     + G P+  G T +K  ++ ++GW+V+S
Sbjct: 298 FTRVCHGTHLGQPMLTGPTRMKHNFLRSSGWHVIS 332


>gi|51259555|gb|AAH79475.1| Fastkd3 protein [Rattus norvegicus]
          Length = 591

 Score = 45.8 bits (107), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 38/74 (51%), Gaps = 3/74 (4%)

Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
           DG+ +   +   V  +VA  IDGP  F   +   LG   +K+R++   G+ VV + + E 
Sbjct: 506 DGFVLPFTVDEDVHTRVALCIDGPQRFCLGSKHLLGKEAIKQRHLRLLGYQVVQVPYHEL 565

Query: 603 EELQGSFEQLDYLR 616
           E L    E +DYL+
Sbjct: 566 ELLTSRLELVDYLQ 579


>gi|401404784|ref|XP_003881842.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325116256|emb|CBZ51809.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 2454

 Score = 45.8 bits (107), Expect = 0.067,   Method: Composition-based stats.
 Identities = 29/78 (37%), Positives = 41/78 (52%), Gaps = 5/78 (6%)

Query: 548  YTVDAVLVDKKVAFEIDGPTHFSRN-TGVPL---GHTMLKRRYIAAAGWNVVSLSHQEWE 603
            YT+  V    ++AFE+    HF R+  G  +     T L+RR + A GW VV++ H EW 
Sbjct: 2217 YTLPLVDATHRIAFEVGASEHFFRDPEGAEIELTAWTSLRRRLLQAQGWRVVAVPHFEWT 2276

Query: 604  ELQGSFEQLDYL-RVILK 620
             L     +L YL R +LK
Sbjct: 2277 ALPDRLARLRYLQRQLLK 2294


>gi|294866651|ref|XP_002764794.1| hypothetical protein Pmar_PMAR004016 [Perkinsus marinus ATCC 50983]
 gi|239864541|gb|EEQ97511.1| hypothetical protein Pmar_PMAR004016 [Perkinsus marinus ATCC 50983]
          Length = 663

 Score = 45.8 bits (107), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 48/206 (23%), Positives = 86/206 (41%), Gaps = 20/206 (9%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDR--VAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           Q ++N +WA +K+    +    +DR  + E     +G+ +S+++++V  + AS Q+   D
Sbjct: 14  QLLANTSWAAAKLEAAKMSSDSIDRTDLNEKIYRFIGQMDSRHLSSVLWSIASAQNWPVD 73

Query: 342 --LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADP------------LLESLDNAF 387
             +FS + +   DI      QELA  LWA A   E   P             ++  D  F
Sbjct: 74  SEVFSRITRSLLDIPRPLHHQELANTLWALARAPERFRPESREVAIALMTKYVDRADPKF 133

Query: 388 KDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLG-- 445
           + + Q +  +  A++    +  +          S+      +    L   AWS A LG  
Sbjct: 134 RFSDQHSANILWAIAKLEIDPTMARGVIDICIASIMETCGEYRPHSLSLSAWSLATLGIH 193

Query: 446 --QMDRIFFSDIWKTISRFEEQRISE 469
              +DRI      + +  FE Q+I+ 
Sbjct: 194 PEVVDRIIVEASARRLRDFESQQIAH 219



 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 51/107 (47%), Gaps = 13/107 (12%)

Query: 283 AQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
           +Q I+++ WA    GG LL    +D + E     + +   QNVANV    A    S P L
Sbjct: 214 SQQIAHVVWA----GGTLLSAWSLDGLPERLAVTIDKAKPQNVANVMWGLA---RSGPPL 266

Query: 343 FSELAKRASDIVHT----FQEQELAQVLWAFASLYEPADPL--LESL 383
            S+L + A   + T    +   +L+ +LW+  ++    DP   LESL
Sbjct: 267 NSKLVRFAQAHMETSSKAYLPVDLSSMLWSLGTMTNRGDPSEGLESL 313


>gi|148705062|gb|EDL37009.1| FAST kinase domains 3, isoform CRA_d [Mus musculus]
          Length = 129

 Score = 45.8 bits (107), Expect = 0.073,   Method: Composition-based stats.
 Identities = 25/74 (33%), Positives = 39/74 (52%), Gaps = 3/74 (4%)

Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
           DG+ +   +   + K+VA  IDGP  F  ++   LG    K+R++   G+ VV L + E 
Sbjct: 44  DGFVLPCTVDEDIHKRVALCIDGPQRFCLDSKHLLGKEATKQRHLRLLGYQVVQLPYHEL 103

Query: 603 EELQGSFEQLDYLR 616
           E L    E +DYL+
Sbjct: 104 ELLTSRLELVDYLQ 117


>gi|128485527|ref|NP_001076043.1| FAST kinase domain-containing protein 3 precursor [Rattus
           norvegicus]
 gi|145558914|sp|Q68FN9.2|FAKD3_RAT RecName: Full=FAST kinase domain-containing protein 3
 gi|149032747|gb|EDL87602.1| similar to hypothetical protein MGC5297, isoform CRA_b [Rattus
           norvegicus]
          Length = 656

 Score = 45.8 bits (107), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 38/74 (51%), Gaps = 3/74 (4%)

Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
           DG+ +   +   V  +VA  IDGP  F   +   LG   +K+R++   G+ VV + + E 
Sbjct: 571 DGFVLPFTVDEDVHTRVALCIDGPQRFCLGSKHLLGKEAIKQRHLRLLGYQVVQVPYHEL 630

Query: 603 EELQGSFEQLDYLR 616
           E L    E +DYL+
Sbjct: 631 ELLTSRLELVDYLQ 644


>gi|294874532|ref|XP_002767003.1| hypothetical protein Pmar_PMAR010983 [Perkinsus marinus ATCC 50983]
 gi|239868378|gb|EEQ99720.1| hypothetical protein Pmar_PMAR010983 [Perkinsus marinus ATCC 50983]
          Length = 733

 Score = 45.4 bits (106), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 54/236 (22%), Positives = 100/236 (42%), Gaps = 59/236 (25%)

Query: 181 VHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATA 240
           V+R+S+ + P+   K +         Q+A  V ++ A    A+G  L+   +    + +A
Sbjct: 64  VYRMSKHAAPTEAVKAL---------QSALHVDQLTA----ALGTSLAKLGIRDETVFSA 110

Query: 241 L-HRIAKNMEKVSM--MTTHRLAFTR----QREMSMLVAIAMTA-LPECSAQGISNIAWA 292
           L  R++  M+   M  +     AF R     RE+   +  ++T    ECS + + ++ W+
Sbjct: 111 LGSRLSDKMDDFDMEDIAAVSWAFARAKFTDRELFRKIRESLTVRTTECSVKSLVSLTWS 170

Query: 293 LSKIG---GE----------------LLYLSE-------------------MDRVAEVAL 314
           LSK+G   GE                L Y  +                   M  +A   +
Sbjct: 171 LSKLGETGGEEDLFRYTLAPTIRSYMLEYTVQDLCALAWSFANANVHDVDFMSDIAHALM 230

Query: 315 TKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
            K  + N Q+V +   A AS+ +S  +LF  L +++  ++HTF   +L++ L+ F 
Sbjct: 231 PKTRDMNCQDVCSAVVALASLHYSHKELFEALKQQSFRLMHTFTPLQLSRTLYGFG 286


>gi|161831154|ref|YP_001597208.1| hypothetical protein COXBURSA331_A1522 [Coxiella burnetii RSA 331]
 gi|161763021|gb|ABX78663.1| conserved domain protein [Coxiella burnetii RSA 331]
          Length = 580

 Score = 45.4 bits (106), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 68/300 (22%), Positives = 122/300 (40%), Gaps = 57/300 (19%)

Query: 282 SAQGISNIAWALSKIGGELLYLSEMD---RVAEVALTKVGEFNSQNVANVAGAFASMQHS 338
           S QGI+N+ WAL+  G     L       R+ E        FN Q +AN   A A+M   
Sbjct: 243 SPQGIANVLWALATTGMRRRELENQGLSVRLFEAIRRNAERFNPQGIANALWALATMGMW 302

Query: 339 APDLFSE-LAKRASDIVH----TFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQF 393
             +L  + L+ R    VH     F  Q +A VLWA  ++          +     +A + 
Sbjct: 303 WEELEEQRLSDRLLGAVHRNAQRFSPQGIANVLWALTTM---------GMRWGELEAQRL 353

Query: 394 TCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFS 453
             CL  A+    E     S   A++  +L+   LS          W Y    ++DR+  +
Sbjct: 354 NNCLLAAVRYNAER--FSSQQIANTLWALAMMALS----------WGYLKEQRVDRLLLN 401

Query: 454 DIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKT 513
            I ++ ++F  +  ++     IM++++   +        P + L +S++         K 
Sbjct: 402 AIDQSANQFSLEESTQ-----IMWSTRWFDIR-----PPPEILLKISNM---------KP 442

Query: 514 KRFNQKVTSSFQKEVARLL---VSTGLNWIREYAV-DGYTVDAVLVDKKVAFEIDGPTHF 569
            R     +S   + VA +L   ++  +    E+ + + + VD  +  K++  E+DGP H 
Sbjct: 443 PR-----SSDLHRHVASVLSAQINGEIPIENEFFIQNCFYVDICIPSKRLVIEVDGPYHI 497



 Score = 43.5 bits (101), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 73/162 (45%), Gaps = 23/162 (14%)

Query: 232 LSPLNIATALH-RIAKNMEKVSMMTTHRLAFT------RQREMS-------MLVAIAMTA 277
           L P ++A  L   IAKN+E+++        +T      R+RE+        +L A+   A
Sbjct: 12  LLPRSLAAQLWDAIAKNVERLNPQGIANTLWTLATMNVRRRELEVQGLSDRLLDAVRYDA 71

Query: 278 LPECSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFAS 334
               + QGI+N  WAL  +G   GEL      DR+ +   +    FNSQ++ N   A A+
Sbjct: 72  -ERFNPQGIANTLWALVAMGMTWGELEAQELNDRLLDAVGSNAPRFNSQDITNTLWALAT 130

Query: 335 MQHSAPDLFSE-LAKRASDIVH----TFQEQELAQVLWAFAS 371
           M     +L  + L  R    V      F+ Q +A  LWA A+
Sbjct: 131 MGMKWRELGDQRLRDRLLGAVRRNAERFKPQGIANALWALAT 172



 Score = 40.0 bits (92), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 42/96 (43%), Gaps = 8/96 (8%)

Query: 284 QGISNIAWALSKIGGELLYLSEMD---RVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
           QGI+N  WAL+  G     L       R+ E        FN Q +AN   A A+M     
Sbjct: 161 QGIANALWALATTGMRRRELENQGLSVRLFEAIRRNAERFNPQGIANALWALATMGMWWE 220

Query: 341 DLFSE-LAKRASDIVH----TFQEQELAQVLWAFAS 371
           +L  + L+ R    VH     F  Q +A VLWA A+
Sbjct: 221 ELEEQRLSDRLLGAVHRNAQRFSPQGIANVLWALAT 256


>gi|303273894|ref|XP_003056299.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226462383|gb|EEH59675.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 769

 Score = 45.4 bits (106), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 41/183 (22%), Positives = 82/183 (44%), Gaps = 18/183 (9%)

Query: 198 NLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTH 257
           +L  D++DA   ++VL  + E+     K         +N +TALHR+A+     +   + 
Sbjct: 262 DLQGDLMDASDVEDVLLAVEELGDVFNK---------VNCSTALHRVARLCTTPAAAGSP 312

Query: 258 R---LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSE--MDRVAEV 312
           R    A         L+A+   +  E     ISN  WA +++    L  S+  +  +A  
Sbjct: 313 RPDVAAVAHDERFRALLAMVERSAHEMEIVSISNTLWAFARL---RLRPSDATVSTLASR 369

Query: 313 ALTKVGEFNSQNVANVAGAFASMQHSAPD-LFSELAKRASDIVHTFQEQELAQVLWAFAS 371
           A+ +  +   ++++ V  A A + H     L + +  RA ++  +F+  ++  +LWA+A 
Sbjct: 370 AVDQCADAEPRHLSTVMWALAVLGHEPRSRLLAAVGDRAGEVAASFRPPDVVNLLWAYAR 429

Query: 372 LYE 374
            + 
Sbjct: 430 WHR 432


>gi|428175295|gb|EKX44186.1| hypothetical protein GUITHDRAFT_109971 [Guillardia theta CCMP2712]
          Length = 1200

 Score = 45.4 bits (106), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 36/127 (28%), Positives = 55/127 (43%), Gaps = 26/127 (20%)

Query: 521  TSSFQKEVARLLVSTGLN----WIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGV- 575
             S  Q +V R L   G+     W+   +   Y VDA L    +A E+DGP H++ + G  
Sbjct: 1071 VSRLQSDVIRTLRGMGVEVEEEWMEPRS--RYVVDAWLPTFGIALEVDGPYHYAYSAGSA 1128

Query: 576  ------------------PLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRV 617
                              PLG T LK R++A     V+ + + EW E   + +Q  YL  
Sbjct: 1129 QETRPGSATVRPDGNGRHPLGSTKLKHRHLAELMIPVLVVPYWEWPEDSQASKQ-TYLSN 1187

Query: 618  ILKDYIG 624
            +L  ++G
Sbjct: 1188 LLFSHVG 1194


>gi|156095482|ref|XP_001613776.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148802650|gb|EDL44049.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 1193

 Score = 45.4 bits (106), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 32/133 (24%), Positives = 72/133 (54%), Gaps = 6/133 (4%)

Query: 494  HLQLALSS--VLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTV 550
            H +++L++  + ++++A   + +  NQ ++SS  K+++  L    +    EY + D   V
Sbjct: 1055 HFKVSLNTLNIWKKQLARNQRREEKNQ-ISSSVHKKISNDLRHLSIFHHNEYFILDSLLV 1113

Query: 551  DAVLVDKKVAFEIDGPTHFSRNTGVPL--GHTMLKRRYIAAAGWNVVSLSHQEWEELQGS 608
            D  +   +V  EIDGP+HF +   + L   +++ K+R + A G++V+S+S  +   +  +
Sbjct: 1114 DVYVPRSRVVIEIDGPSHFLQKGRLILYNPNSLFKKRLLRALGFSVISISISDHTFMFSA 1173

Query: 609  FEQLDYLRVILKD 621
               L +++  L +
Sbjct: 1174 LNTLSFVKQFLSN 1186


>gi|124809797|ref|XP_001348683.1| RAP protein, putative [Plasmodium falciparum 3D7]
 gi|23497581|gb|AAN37122.1| RAP protein, putative [Plasmodium falciparum 3D7]
          Length = 1725

 Score = 45.4 bits (106), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 33/143 (23%), Positives = 69/143 (48%), Gaps = 25/143 (17%)

Query: 488  LKLEHPHLQLALSSV-LEEKIASAGKTKRFNQKVT-----SSFQKEVARLLVSTGLNWIR 541
            +K +H +L L+ S + L+++I    + + F + +      S F  ++ ++L    + +  
Sbjct: 1563 IKYDHSNLHLSNSFIQLKDEIFLLLQKREFKRNMNKNDHISDFHVQICQILDDLNIRYHN 1622

Query: 542  EYAV-DGYTVDAVL----VDKKVAFEIDGPTHFS--------------RNTGVPLGHTML 582
            EY   D  +VD  L     ++K+A EIDGP+H                + T +  G T+ 
Sbjct: 1623 EYITKDLLSVDIKLERKCCEQKLAIEIDGPSHHFLVLNEMQKADPQRIKKTYIKCGTTIF 1682

Query: 583  KRRYIAAAGWNVVSLSHQEWEEL 605
            K   +  +GW++++++  EW ++
Sbjct: 1683 KHWLLQKSGWSIINVTSFEWNKI 1705


>gi|294867004|ref|XP_002764926.1| hypothetical protein Pmar_PMAR007493 [Perkinsus marinus ATCC 50983]
 gi|239864762|gb|EEQ97643.1| hypothetical protein Pmar_PMAR007493 [Perkinsus marinus ATCC 50983]
          Length = 795

 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 45/201 (22%), Positives = 90/201 (44%), Gaps = 25/201 (12%)

Query: 427 LSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQ 486
           +SF+   +  + W+ A     D+  F D+   ++    +  + + +  +   S+VH    
Sbjct: 572 ISFDVADVAIVLWAMAAADTYDQSVFRDLLSILASKSNELSAGERKASL---SKVHRAYL 628

Query: 487 CLKLEH-----PHLQLALSSVLEE-KIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWI 540
             +L +     P     ++ VLEE + AS         +V  +  K ++R   S+ ++ +
Sbjct: 629 WARLGYGFQHSPQNGHLIAEVLEEAQRASVDARGALQTEVCQTLNKALSRSPRSSSMHLL 688

Query: 541 REY----AVDGYTVDAVLVD-----KKVAFEIDGPTHFSRNTGVPL-------GHTMLKR 584
            E      + G +VDA +VD     +++  E+DGP H+    G          G ++LK+
Sbjct: 689 SEVDLAPELPGLSVDAAVVDGRTGSRRLLVEVDGPHHYVDVLGESAVTRRQYNGQSVLKQ 748

Query: 585 RYIAAAGWNVVSLSHQEWEEL 605
             IA AG+ ++S+  ++W  L
Sbjct: 749 HLIAQAGFRLLSVEDEKWRSL 769


>gi|302829348|ref|XP_002946241.1| hypothetical protein VOLCADRAFT_102845 [Volvox carteri f.
            nagariensis]
 gi|300269056|gb|EFJ53236.1| hypothetical protein VOLCADRAFT_102845 [Volvox carteri f.
            nagariensis]
          Length = 1387

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 48/86 (55%), Gaps = 3/86 (3%)

Query: 522  SSFQKEVARLLVSTGLNWIREYAVDGYTVDAV--LVDKKVAFEIDGPTHFSR-NTGVPLG 578
            S  Q++V R LV+ G     E  V  +TVD +  +  + VA E+DGPTHF+  +   PLG
Sbjct: 1234 SDLQRDVYRQLVALGYRPRMEERVGFWTVDILFRVGARPVAVEVDGPTHFTTCHHRQPLG 1293

Query: 579  HTMLKRRYIAAAGWNVVSLSHQEWEE 604
             ++ +   +   G  VV+LS +++ +
Sbjct: 1294 TSLARDECLRRLGLAVVALSFRDYRQ 1319


>gi|397568314|gb|EJK46072.1| hypothetical protein THAOC_35281, partial [Thalassiosira oceanica]
          Length = 441

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 43/75 (57%), Gaps = 6/75 (8%)

Query: 303 LSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTF 357
           LS  D +A   +  + EF +++++N+  +F  ++H+ PD     LF+     A  I+HTF
Sbjct: 368 LSIFDSIASSTVNMLNEFEARHLSNLIYSFGLIEHN-PDIGGETLFNVFGDAALKILHTF 426

Query: 358 QEQELAQVLWAFASL 372
           + Q L+ +LWAF  +
Sbjct: 427 ESQNLSNMLWAFVKV 441


>gi|118353796|ref|XP_001010163.1| hypothetical protein TTHERM_00560100 [Tetrahymena thermophila]
 gi|89291930|gb|EAR89918.1| hypothetical protein TTHERM_00560100 [Tetrahymena thermophila
           SB210]
          Length = 412

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 24/109 (22%), Positives = 56/109 (51%), Gaps = 6/109 (5%)

Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVL----VDKKVAFEIDGPTHFSR--NTG 574
            S  Q++   +L     N+  E  +D YTVD ++    +  ++  E++GP+H+    N  
Sbjct: 299 VSPIQEDCEIILKVLKWNFKSEVRIDPYTVDFLITLPSIKNQIVLEMNGPSHYPYFSNKD 358

Query: 575 VPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYI 623
           V      +K + +    +  V + HQ+W +++G   ++D+++ +++ +I
Sbjct: 359 VFSAKEQMKVKNLKIKNYIPVLIHHQDWSQIKGVTGKIDFIQNLVQKHI 407


>gi|344272348|ref|XP_003407994.1| PREDICTED: LOW QUALITY PROTEIN: FAST kinase domain-containing
           protein 3-like [Loxodonta africana]
          Length = 671

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 34/60 (56%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           K+VA  IDGP  F  N    LG   +K+R++   G+ VV + + E E L+   E ++YL+
Sbjct: 589 KRVALCIDGPKRFCFNGTNLLGKEAIKQRHLRLLGYEVVQIPYHETEMLKSRLELVEYLQ 648


>gi|294901002|ref|XP_002777205.1| hypothetical protein Pmar_PMAR007110 [Perkinsus marinus ATCC 50983]
 gi|239884697|gb|EER09021.1| hypothetical protein Pmar_PMAR007110 [Perkinsus marinus ATCC 50983]
          Length = 504

 Score = 45.1 bits (105), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 23/90 (25%), Positives = 47/90 (52%), Gaps = 2/90 (2%)

Query: 280 ECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
           E + Q +  +AW+ +     +  +  M  +A   + K  + N Q+V +   A AS+ +S 
Sbjct: 295 EYTVQDLCALAWSFA--NANVHDVDFMSDIAHALMPKTRDMNCQDVCSAVVALASLHYSH 352

Query: 340 PDLFSELAKRASDIVHTFQEQELAQVLWAF 369
            +LF  L +++  ++HTF   +L++ L+ F
Sbjct: 353 KELFEALKQQSFRLMHTFTPLQLSRTLYGF 382



 Score = 39.7 bits (91), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 65/272 (23%), Positives = 101/272 (37%), Gaps = 60/272 (22%)

Query: 235 LNIATALHRIAKNME--KVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWA 292
           +N+ATALHR+AK+ +  +VS + T           + LV      L      G+ N  WA
Sbjct: 60  INLATALHRVAKHSKSYQVSQVAT-------DPRYTALVDRLGAYLNSLDGVGLMNTLWA 112

Query: 293 LSKIG--------------------------GELLYL----------SEMDRVAEVAL-- 314
           L ++                           G+ LY           +E  +  + AL  
Sbjct: 113 LVRLNAAAPKWISELLDRCISSVDQLEPKQLGQGLYCVYRMSKHAAPTEAVKALQSALHG 172

Query: 315 ---TKVGEF-NSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
                +  F +S  + +V  + A +      +FS L  R SD +  F  +++A V WAFA
Sbjct: 173 QVRASLDHFSDSHELVSVCTSLAKLGIRDETVFSALGSRLSDKMDDFDMEDIAAVSWAFA 232

Query: 371 SLYEPADPLLESLDNAFKDATQFTCC-------LNKALSNCNENGGVKSSGDADSEGSLS 423
                   L   +  +    T  T C       L  +LS   E GG +         ++ 
Sbjct: 233 RAKFTDRELFRKIRESLTVRT--TECSVKSLVSLTWSLSKLGETGGEEDLFRYTLAPTIR 290

Query: 424 SPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDI 455
           S +L +    L  +AWS+A     D  F SDI
Sbjct: 291 SYMLEYTVQDLCALAWSFANANVHDVDFMSDI 322


>gi|237829857|ref|XP_002364226.1| hypothetical protein TGME49_109790 [Toxoplasma gondii ME49]
 gi|211961890|gb|EEA97085.1| hypothetical protein TGME49_109790 [Toxoplasma gondii ME49]
 gi|221507092|gb|EEE32696.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 309

 Score = 45.1 bits (105), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 58/230 (25%), Positives = 101/230 (43%), Gaps = 47/230 (20%)

Query: 206 AQTAQEVL-----------EVIAEMITAVGKGLSPSPLSPLN---------------IAT 239
           A+TAQE+L           E+ ++ + A    LSPS ++ +                +AT
Sbjct: 57  AETAQELLRGKETKRRAFWEIFSKRVKASAHMLSPSLMALIAKSFDVHDRDTGIYVALAT 116

Query: 240 ALHRIAKNMEKVSMMTTHRLAFTRQRE-------MSMLVAIAMTALPECSAQGISNIAWA 292
            L    K  +  S++T   + F+R+ +        S L      AL + + + +  I  +
Sbjct: 117 VLPEAVKRADGRSLLTLSDV-FSRRLKRDSNPHLFSTLARQLPNALYQLTGKDVLRILSS 175

Query: 293 LSKIGGELLYLSEMDRVAEVA---LTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKR 349
           L   G     L++M    +VA   L ++ E +S ++A+ +  FAS  +  P+L+S LA+R
Sbjct: 176 LDAAG-----LADMLACRQVARKLLAELDELDSVDLADASAVFASQGYRNPELYSALARR 230

Query: 350 ASDIVHTF----QEQELAQVLWAFASLYEPADPLLESLDNAFKDAT-QFT 394
           A D+  +F    Q   + ++L  F+      D LLES       +  QFT
Sbjct: 231 AVDVKDSFDSCSQAPTVFRLLSGFSQNAVACDELLESFSTLLVSSKDQFT 280


>gi|156101207|ref|XP_001616297.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805171|gb|EDL46570.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 1277

 Score = 45.1 bits (105), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 56/103 (54%), Gaps = 9/103 (8%)

Query: 521  TSSFQKEVARLLVSTGLNWIREYA--VDG-YTVDAVLVDKKVAFEIDGPTHFS-----RN 572
            +SSF +EV   L+S G   ++     +DG YTVD ++V+  V  EI+G  H+      + 
Sbjct: 1170 SSSFHREVLSTLLSLGEKNVQCEVPFMDGIYTVD-IVVNNSVCIEINGSNHYYYDSNLKR 1228

Query: 573  TGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
            +G  L    L + Y+ +  + ++ +S+ +W  L+ + E+ DYL
Sbjct: 1229 SGEKLDALNLVKYYLLSKKYKLILVSYLDWNNLKSAEEKRDYL 1271


>gi|322699135|gb|EFY90899.1| ATP-dependent DNA helicase mph1 [Metarhizium acridum CQMa 102]
          Length = 1070

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 50/180 (27%), Positives = 75/180 (41%), Gaps = 33/180 (18%)

Query: 1   MGFLPEN----SRWVYEEIISNIRIRRVTEDDEVDDSEEKESEDSVDWESEFLGELDPFG 56
           MG  PE+     R        +I +R   E D   +SEE ++ DSV              
Sbjct: 825 MGTEPESLVRQCRSTDTSRFQDIAVRPFVESD--GESEEDDTSDSVT------------- 869

Query: 57  YQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKSI------EARGLASSMEDLIKVKKK 110
               KKR  +++S   D  E      + RK++  SI      E    A +M    + + K
Sbjct: 870 ----KKRSTRQRSVGADHEESQPSRGKRRKISTTSIPGPSELEDDTEAPAMNGGARKRTK 925

Query: 111 KKKGKKKLEKIKKKNKVTDDDLDFDLEDDMKMDDIMGSGNGYDMNDL----RRTVSMMAG 166
           K K K+K  K K++  +  D+L  D E D  + +  GS +G D+ D     R+T S M G
Sbjct: 926 KPKSKRKGRKTKQRTGINSDELGDDCERDSDLIESSGSDDGADLLDFVVADRQTTSSMVG 985


>gi|125815393|ref|XP_698448.2| PREDICTED: FAST kinase domain-containing protein 5-like [Danio
           rerio]
          Length = 640

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 2/68 (2%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL--QGSFEQLDY 614
           K++A ++    H+   T   LG   LKRR++  AG+ VV L H EW  L  +   E+L Y
Sbjct: 570 KRLAVQVTNRNHYCYRTKQLLGLHALKRRHLTLAGYRVVELPHWEWFPLLRRSQAEKLAY 629

Query: 615 LRVILKDY 622
           L   + +Y
Sbjct: 630 LHCKIFNY 637


>gi|332228041|ref|XP_003263199.1| PREDICTED: LOW QUALITY PROTEIN: FAST kinase domain-containing
           protein 3 [Nomascus leucogenys]
          Length = 662

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 21/74 (28%), Positives = 41/74 (55%), Gaps = 3/74 (4%)

Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
           +G+ + + +   + K++A  IDGP  F  N+   LG   +K+R++   G+ VV + + E 
Sbjct: 575 EGFVLPSTVDEDIHKRIALCIDGPERFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEI 634

Query: 603 EELQGSFEQLDYLR 616
             L+   E ++YL+
Sbjct: 635 GMLKSRCELVEYLQ 648


>gi|389584538|dbj|GAB67270.1| hypothetical protein PCYB_112910 [Plasmodium cynomolgi strain B]
          Length = 1311

 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 56/103 (54%), Gaps = 9/103 (8%)

Query: 521  TSSFQKEVARLLVSTGLNWIREYA--VDG-YTVDAVLVDKKVAFEIDGPTHFS-----RN 572
            +SSF +EV   L+S G+  ++     +DG YTVD ++++     EI+G  H+      + 
Sbjct: 1204 SSSFHREVLSTLLSLGVKNVQCEVPFMDGIYTVD-IVINNSTCIEINGSNHYYYDNNLKR 1262

Query: 573  TGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
            +G  L    L + Y+ +  + ++ +S+ +W  L+ + E+ DYL
Sbjct: 1263 SGEKLDALNLIKYYLLSKKYKLILVSYLDWNNLKSAEEKKDYL 1305


>gi|209876299|ref|XP_002139592.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209555198|gb|EEA05243.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 587

 Score = 44.7 bits (104), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 26/99 (26%), Positives = 52/99 (52%), Gaps = 6/99 (6%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGEL--LYLSEMDRVAEVALTKVGEFNSQNVANVAGA 331
           A+  L    A  +S + W+ SK G +   L+++ + +V    L+++    SQ ++N+  +
Sbjct: 190 AVYQLDRFIAINLSMLLWSYSKSGKKYNYLFITAIPKV----LSELDNLQSQQISNIIWS 245

Query: 332 FASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
           +A +   +P LF  +AKR + I+  F    ++   +AFA
Sbjct: 246 YAKIGLISPHLFENIAKRCTSILSEFLPIHISMTAYAFA 284


>gi|156083971|ref|XP_001609469.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154796720|gb|EDO05901.1| hypothetical protein BBOV_IV003040 [Babesia bovis]
          Length = 217

 Score = 44.7 bits (104), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 42/83 (50%)

Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHT 580
           TS  Q ++A LL    L    E  +  Y +D V+    VA E++G +HF   +      T
Sbjct: 118 TSKMQYKLAPLLNHLKLQHRAEVQIGPYVMDYVIPRLNVAVEVNGHSHFYHQSTQFHALT 177

Query: 581 MLKRRYIAAAGWNVVSLSHQEWE 603
            LK   + + GW V+S+++ +W+
Sbjct: 178 KLKYSIVQSLGWQVLSVNYFDWK 200


>gi|294877802|ref|XP_002768134.1| hypothetical protein Pmar_PMAR002922 [Perkinsus marinus ATCC 50983]
 gi|239870331|gb|EER00852.1| hypothetical protein Pmar_PMAR002922 [Perkinsus marinus ATCC 50983]
          Length = 146

 Score = 44.7 bits (104), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 37/78 (47%), Gaps = 1/78 (1%)

Query: 522 SSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTM 581
           S FQ+ +  +L +  L +  E     Y VD   V   +A E DG THF   T        
Sbjct: 28  SKFQESIKAVLKACELEYHEEVIAGTYIVDYA-VGNSLALEADGFTHFYAGTENFTAKAK 86

Query: 582 LKRRYIAAAGWNVVSLSH 599
           LK R + + GWN+VSL +
Sbjct: 87  LKHRILRSLGWNIVSLPY 104


>gi|294956195|ref|XP_002788848.1| hypothetical protein Pmar_PMAR004308 [Perkinsus marinus ATCC 50983]
 gi|239904460|gb|EER20644.1| hypothetical protein Pmar_PMAR004308 [Perkinsus marinus ATCC 50983]
          Length = 299

 Score = 44.7 bits (104), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 61/268 (22%), Positives = 109/268 (40%), Gaps = 51/268 (19%)

Query: 183 RLSQFSGPSNRRK--EINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATA 240
           R+ + +G   R K  ++ L + ++DA T   VLE++      +G          +N A A
Sbjct: 14  RMYEVAGRLRRGKSGDLVLQRRLMDASTPAAVLEIVLPNANKLGS---------VNYACA 64

Query: 241 LHRIA-------------KNMEKVSMMT------------THRLAFTRQREMSMLVAIAM 275
           LHR A               + ++++ T            T  LA TR+ E  +  A   
Sbjct: 65  LHRCAVWFRSGKPTPSGLSQVPRLALQTVRDWRAREAATITWALAVTRELEHILEFARLS 124

Query: 276 TALPECSAQGISNIAWALSKIG-------GELLYLSEMDRVAEVALTKVGEFNSQNVANV 328
            +  E S   ++N+  +L+  G         L  +++  RV  + L+  G    + +A V
Sbjct: 125 MSCNEASGGDLANVVHSLTISGLNPRQCTATLAVVAK--RVTAMDLSHCGVIEPKQLAAV 182

Query: 329 AGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNA-- 386
              F  ++ +  D+ + L + A+  +  F  Q+L+ V WA A     + PLL + D A  
Sbjct: 183 FWGFVKLEFTDDDVMTYLVRSATTRMDEFNSQDLSMVSWALAK----SLPLLPTEDCAQG 238

Query: 387 FKDATQFTCCLNKALSNCNENGGVKSSG 414
               TQF    ++ L         ++SG
Sbjct: 239 IDRFTQFNTSCDEHLMGIGTMSASRTSG 266


>gi|82596883|ref|XP_726446.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23481859|gb|EAA18011.1| hypothetical protein [Plasmodium yoelii yoelii]
          Length = 1071

 Score = 44.7 bits (104), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 47/211 (22%), Positives = 89/211 (42%), Gaps = 43/211 (20%)

Query: 434  LGNIAWSYAVLGQM--DRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLE 491
            L    W  +++  +  D I F +I+     + E +I EQ   + M+   V  +   LK  
Sbjct: 850  LARYLWGVSIVNLINDDTINFINIY----NWNEIKIYEQ---NPMYLHMVFTLWLRLKYS 902

Query: 492  HPHLQLA---------LSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIRE 542
            + HL+L+         ++ +L++K    G     N+   S+F  +++++L    + +  E
Sbjct: 903  YSHLKLSKNFLNFIDQITHILKKKYIKNG----LNKDNLSTFHVQISKILDEFNVKYTNE 958

Query: 543  YAVDGYTVDAVL-----VDKKVAFEIDGPTH-------FSRNTGV-------PLGHTMLK 583
            Y      +  ++       +K+A EIDGP+H          NT +         G T  K
Sbjct: 959  YITKDLLIIDIIIILKECKEKIAIEIDGPSHHLLDLSDLHVNTSINDNKKYLQCGTTYFK 1018

Query: 584  RRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
               +   GW V+++   EW +++   E  DY
Sbjct: 1019 NFLLKKNGWKVINIPSYEWNKIKK--EDRDY 1047


>gi|407849431|gb|EKG04172.1| hypothetical protein TCSYLVIO_004775 [Trypanosoma cruzi]
          Length = 1005

 Score = 44.3 bits (103), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 26/94 (27%), Positives = 49/94 (52%), Gaps = 2/94 (2%)

Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           + + I+N+ +A +K+G  L +     R+A+ A+   GEF   +VA +  A+A ++     
Sbjct: 818 TPKDITNVVYAYAKVG--LWHYKLFVRLADRAIQLRGEFRCDHVARLLEAYARVEMRYEK 875

Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEP 375
           LF E + R   I H     E+ +++ A+A +  P
Sbjct: 876 LFVEFSPRIQTIAHLLTAGEVTKIVSAYAKVRIP 909


>gi|348503464|ref|XP_003439284.1| PREDICTED: FAST kinase domain-containing protein 3-like
           [Oreochromis niloticus]
          Length = 659

 Score = 44.3 bits (103), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 40/73 (54%), Gaps = 3/73 (4%)

Query: 546 DGYTVDAVLVD---KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
           +GY + A   D   K++A  IDG   F+ N    LG   +K+R++   G+ VV + + E+
Sbjct: 574 EGYVLPASQTDDVYKRIALCIDGQKRFTSNLRQLLGKEAIKQRHLRLLGYEVVQIPYFEY 633

Query: 603 EELQGSFEQLDYL 615
           E+LQ     ++YL
Sbjct: 634 EKLQSKNSMVEYL 646


>gi|395735635|ref|XP_002815460.2| PREDICTED: FAST kinase domain-containing protein 3 [Pongo abelii]
          Length = 658

 Score = 44.3 bits (103), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 21/74 (28%), Positives = 41/74 (55%), Gaps = 3/74 (4%)

Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
           +G+ + + +   + K++A  IDGP  F  N+   LG   +K+R++   G+ VV + + E 
Sbjct: 575 EGFVLPSTVNEDIHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEI 634

Query: 603 EELQGSFEQLDYLR 616
             L+   E ++YL+
Sbjct: 635 GMLKSRRELVEYLQ 648


>gi|428673296|gb|EKX74209.1| conserved hypothetical protein [Babesia equi]
          Length = 570

 Score = 44.3 bits (103), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 36/149 (24%), Positives = 67/149 (44%), Gaps = 25/149 (16%)

Query: 477 FASQVHLVNQ-CLKLEHP-HLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVS 534
           F +Q++L+N+ C+   H  H ++  +  L + I S   + RF++     F+     L V 
Sbjct: 408 FITQLNLLNKACIVERHRLHSKIMANQQLSDFINSIPNSTRFDE--AYDFKTSTTHLQVR 465

Query: 535 TGLNWIR-----EYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLG----------- 578
             L+        E  V  Y VD ++  K +  E+DGP H++      +G           
Sbjct: 466 NTLDMFNYETEVETKVYPYIVDILVKSKNLIIEVDGPYHYTTYINKSVGKILNRESSDDL 525

Query: 579 --HTM---LKRRYIAAAGWNVVSLSHQEW 602
             HT+   LK+R +  +G+  V++ + +W
Sbjct: 526 FQHTLNSRLKQRLLQKSGYKFVNIPYYKW 554


>gi|308807601|ref|XP_003081111.1| unnamed protein product [Ostreococcus tauri]
 gi|116059573|emb|CAL55280.1| unnamed protein product [Ostreococcus tauri]
          Length = 665

 Score = 44.3 bits (103), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 47/192 (24%), Positives = 79/192 (41%), Gaps = 37/192 (19%)

Query: 199 LNKDIVDAQTAQEVLEVI---AEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMT 255
           + +++ +A +A++ L V+    E+  AV            + ATALHR+AK     S + 
Sbjct: 55  IQRELANASSAEDALRVVERDLEVFDAV------------HAATALHRVAKFSSPSSRLD 102

Query: 256 THRL----AFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAE 311
                   A TR      L +     + E  A G++N+AW+ +KIG    Y    D +  
Sbjct: 103 ARDFDRVEAVTRDERFKALASTVGDRMNEFDAFGLANVAWSFAKIG----YTPSQDTLNA 158

Query: 312 VA-------LTKVGEFNSQNVANVAGAFASMQHSAP----DLFSELAKRASDIVHTFQEQ 360
           +A       L        Q+++N A AF  +++  P    +   E   R  D    F+  
Sbjct: 159 LASRLEREVLKHGASVKPQSLSNAAYAFGRLRYKPPKSTLEALCEATMRQMD---KFRTD 215

Query: 361 ELAQVLWAFASL 372
           E A ++   A L
Sbjct: 216 EFAGMMLGLAHL 227


>gi|308804243|ref|XP_003079434.1| unnamed protein product [Ostreococcus tauri]
 gi|116057889|emb|CAL54092.1| unnamed protein product, partial [Ostreococcus tauri]
          Length = 1182

 Score = 44.3 bits (103), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 49/89 (55%), Gaps = 6/89 (6%)

Query: 520 VTSSFQKEVARLLVSTGL-NWIREYAVDGYTV--DAVLVDKKVAFEIDGPTHFSRNT-GV 575
            TS+ Q+ VA  L   G+ ++  E AV+G  +  D V   +++  E+DGP H+S +  GV
Sbjct: 770 TTSNLQRAVADHLHDMGVGDFDVERAVEGGKMRPDIVFESRRLVIEVDGPHHYSVDADGV 829

Query: 576 --PLGHTMLKRRYIAAAGWNVVSLSHQEW 602
              LG T+++   + + GW V  + + EW
Sbjct: 830 RRELGQTIVRNELLRSWGWKVCVVPYHEW 858



 Score = 42.4 bits (98), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 56/224 (25%), Positives = 91/224 (40%), Gaps = 42/224 (18%)

Query: 197 INLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTT 256
           + +NK ++  +T +E+  V+         G   S +S +N +T   R+AK          
Sbjct: 154 LRMNKALMTCETVEELAAVV---------GGRASAMSDVNASTTYSRLAKFARG-----G 199

Query: 257 HRLAFTRQREMSMLV------AIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVA 310
            R      REMS         A ++  + +   +  + +AWA     G L      D  A
Sbjct: 200 RRAREEVVREMSRATWFKEVEARSIETMDKMQPRSAAQMAWAC----GHLSRSRRRDGDA 255

Query: 311 -----EVALTKVG-EFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQE 359
                E AL ++G +F  Q VANVA A+A ++   P        + L + A D    ++ 
Sbjct: 256 FWDALERALERLGTKFKPQGVANVAWAYAKLEMRMPQGIRNAFETHLERNAQD----YKP 311

Query: 360 QELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSN 403
            EL    WA   L +  D + E +  A +     TCC  + L+N
Sbjct: 312 YELTITFWA---LTKHGDAVREDVAIALERTLDLTCCKPQELAN 352


>gi|82594046|ref|XP_725261.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23480197|gb|EAA16826.1| hypothetical protein [Plasmodium yoelii yoelii]
          Length = 213

 Score = 44.3 bits (103), Expect = 0.21,   Method: Composition-based stats.
 Identities = 32/144 (22%), Positives = 58/144 (40%), Gaps = 34/144 (23%)

Query: 512 KTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDK-------------- 557
           K K     + S  QKEV  +L++  L  + E ++  Y VD +  DK              
Sbjct: 7   KEKEIEYNIKSDLQKEVKNILLTFNLTPLEEVSIGPYNVDFIEEDKTFQNISKNEIYYKK 66

Query: 558 --------------------KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSL 597
                               K+  E++G  HF RNT      + LK + ++  G+ V+++
Sbjct: 67  ESNNSTKIILSDKKNYENIGKIIIEVNGEHHFYRNTKSYTSFSKLKHKLLSDLGYIVINI 126

Query: 598 SHQEWEELQGSFEQLDYLRVILKD 621
            + +W  L+    +  Y++ I+ D
Sbjct: 127 PYFDWAILKTYLNKKSYIKKIIND 150


>gi|71421683|ref|XP_811868.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70876580|gb|EAN90017.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 1005

 Score = 44.3 bits (103), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 26/94 (27%), Positives = 49/94 (52%), Gaps = 2/94 (2%)

Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           + + I+N+ +A +K+G  L +     R+A+ A+   GEF   +VA +  A+A ++     
Sbjct: 818 TPKDITNVVYAYAKVG--LWHYKLFVRLADRAIQLRGEFRCDHVARLLEAYARVEMRYEK 875

Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEP 375
           LF E + R   I H     E+ +++ A+A +  P
Sbjct: 876 LFVEFSPRIQTIAHLLTAGEVTKIVSAYAKVRIP 909


>gi|48257152|gb|AAH01295.2| FASTKD3 protein [Homo sapiens]
          Length = 550

 Score = 44.3 bits (103), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 35/62 (56%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           + K++A  IDGP  F  N+   LG   +K+R++   G+ VV + + E   L+   E ++Y
Sbjct: 475 IHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEIGMLKSRRELVEY 534

Query: 615 LR 616
           L+
Sbjct: 535 LQ 536


>gi|429327253|gb|AFZ79013.1| hypothetical protein BEWA_018580 [Babesia equi]
          Length = 951

 Score = 44.3 bits (103), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 3/88 (3%)

Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVL-VDK--KVAFEIDGPTHFSRNTGVPL 577
           +S   +E++  L   G+    E     Y +D V  V+   KVA E DGP+HF   T +  
Sbjct: 745 SSPAHRELSHFLNLAGVLHKNEVQCGPYLIDIVPEVNPGIKVAIEYDGPSHFYAETVMRN 804

Query: 578 GHTMLKRRYIAAAGWNVVSLSHQEWEEL 605
             ++ K   + + GW V+ + +QEW +L
Sbjct: 805 IKSITKHEILESMGWEVIHVPYQEWIQL 832


>gi|156085826|ref|XP_001610322.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154797575|gb|EDO06754.1| hypothetical protein BBOV_IV003930 [Babesia bovis]
          Length = 651

 Score = 44.3 bits (103), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 46/215 (21%), Positives = 84/215 (39%), Gaps = 21/215 (9%)

Query: 419 EGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFA 478
           EG+ ++ +LSF+   +G +  + A         F +I K +    E    +    D++  
Sbjct: 425 EGNKTTLILSFSHIIMGTVKLNKAPKSTETMPVFYNILKYLLEHPELHDEDHIDPDVLQG 484

Query: 479 SQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQK---VTSSFQKEVARLLVST 535
           S  ++      + + HL+    S     I S     R +      TS   K+VA +L + 
Sbjct: 485 SLNNVRLLVTYIGYDHLKQWFRSTEISAIESLLAKARLDYCKDFRTSDLHKQVADVLSTL 544

Query: 536 GLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTM-------------- 581
           G+   +E  +  +  D VL  +++  EIDGP HF+      L   +              
Sbjct: 545 GIECDQEVTIGSHICDLVLKKRRIVIEIDGPYHFNTTLNSSLNSILNRHVDDYRLTYTYN 604

Query: 582 --LKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
             +K   +   G+ V+ + +  W    G  EQ+ Y
Sbjct: 605 SRIKMYMLRQGGYKVIHIPYFMWPS--GKQEQMVY 637


>gi|296194953|ref|XP_002806679.1| PREDICTED: LOW QUALITY PROTEIN: FAST kinase domain-containing
           protein 3 [Callithrix jacchus]
          Length = 670

 Score = 44.3 bits (103), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 48/99 (48%), Gaps = 12/99 (12%)

Query: 529 ARLLVSTGLNWIREYAVD--------GYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPL 577
           ARL  ++ +     Y +D        G+ +   +   V K++A  IDGP  F  N+   L
Sbjct: 550 ARLYFASKVLTPYYYTIDVEIKLDEEGFVLPCTVNEDVHKRIALCIDGPQRFCSNSKHLL 609

Query: 578 GHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           G   +K+R++   G+ VV + + E E L    E ++YL+
Sbjct: 610 GKEAIKQRHLQLLGYQVVQMPYHEIEML-TRLELVEYLQ 647


>gi|68063701|ref|XP_673847.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56491996|emb|CAI01743.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 608

 Score = 44.3 bits (103), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 15/59 (25%), Positives = 33/59 (55%)

Query: 546 DGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEE 604
           + Y +   L+ K +  E+DG +HF + +     ++++K   +   GWN++ + +QEW +
Sbjct: 428 EKYNIIEKLLTKNIVIEVDGISHFYKESYSRTLNSIIKNYILKKFGWNIIHIPYQEWNQ 486


>gi|302834581|ref|XP_002948853.1| hypothetical protein VOLCADRAFT_89149 [Volvox carteri f.
           nagariensis]
 gi|300266044|gb|EFJ50233.1| hypothetical protein VOLCADRAFT_89149 [Volvox carteri f.
           nagariensis]
          Length = 1137

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 51/98 (52%), Gaps = 3/98 (3%)

Query: 286 ISNIAWAL---SKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
           ISN+++AL    +      + + M  +A  A  ++ EF  Q+++N+  A+A    + P L
Sbjct: 465 ISNLSYALVVARQHRAHPAHEAVMRALAVAAEARLSEFCPQDISNMLWAYARCGMAQPAL 524

Query: 343 FSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLL 380
           FS  A  A  +   F +  L QV+WA+A++     PLL
Sbjct: 525 FSAAASIARMMAADFSQAGLVQVIWAYAAMRVYDAPLL 562


>gi|40068497|ref|NP_076996.2| FAST kinase domain-containing protein 3 [Homo sapiens]
 gi|294862434|sp|Q14CZ7.2|FAKD3_HUMAN RecName: Full=FAST kinase domain-containing protein 3
          Length = 662

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 35/62 (56%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           + K++A  IDGP  F  N+   LG   +K+R++   G+ VV + + E   L+   E ++Y
Sbjct: 587 IHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEIGMLKSRRELVEY 646

Query: 615 LR 616
           L+
Sbjct: 647 LQ 648


>gi|109730533|gb|AAI13564.1| Hypothetical protein MGC5297 [Homo sapiens]
 gi|119628499|gb|EAX08094.1| hypothetical protein MGC5297, isoform CRA_a [Homo sapiens]
 gi|313883202|gb|ADR83087.1| FAST kinase domains 3 [synthetic construct]
          Length = 662

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 35/62 (56%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           + K++A  IDGP  F  N+   LG   +K+R++   G+ VV + + E   L+   E ++Y
Sbjct: 587 IHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEIGMLKSRRELVEY 646

Query: 615 LR 616
           L+
Sbjct: 647 LQ 648


>gi|348536034|ref|XP_003455502.1| PREDICTED: FAST kinase domain-containing protein 5-like
           [Oreochromis niloticus]
          Length = 986

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 35/66 (53%), Gaps = 2/66 (3%)

Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL--QGSFEQLDYL 615
           K+A ++    HF   +   LG   +KRR +  AG+ VV LS+QEW  L  +   E+L YL
Sbjct: 919 KIAVQVSNRNHFCSQSQQLLGLHAMKRRQLKIAGYRVVELSYQEWFPLLRKSRAEKLAYL 978

Query: 616 RVILKD 621
              L D
Sbjct: 979 HCKLYD 984


>gi|407410016|gb|EKF32615.1| hypothetical protein MOQ_003529 [Trypanosoma cruzi marinkellei]
          Length = 1005

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 30/121 (24%), Positives = 58/121 (47%), Gaps = 15/121 (12%)

Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
           + + I+N+ +A +++G  L +     R+A+ A+   GEF   +VA +  A+A ++     
Sbjct: 818 TPKDITNVVYAYAQVG--LWHYKLFVRLADRAIQLRGEFRCDHVARLLEAYARVEMRYEK 875

Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
           LF E + R   I H     E+ +++ A+A +  P             DA  F  C ++A+
Sbjct: 876 LFLEFSPRIQTIAHLLTAGEVTKIVAAYAKVRIP-------------DAGVFNACGDRAV 922

Query: 402 S 402
           +
Sbjct: 923 A 923


>gi|397475721|ref|XP_003809274.1| PREDICTED: FAST kinase domain-containing protein 3 [Pan paniscus]
          Length = 662

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 35/62 (56%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           + K++A  IDGP  F  N+   LG   +K+R++   G+ VV + + E   L+   E ++Y
Sbjct: 587 IHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEIGMLKSRRELVEY 646

Query: 615 LR 616
           L+
Sbjct: 647 LQ 648


>gi|426385166|ref|XP_004059100.1| PREDICTED: FAST kinase domain-containing protein 3 [Gorilla gorilla
           gorilla]
          Length = 662

 Score = 43.9 bits (102), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 35/62 (56%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           + K++A  IDGP  F  N+   LG   +K+R++   G+ VV + + E   L+   E ++Y
Sbjct: 587 IHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEIGMLKSRRELVEY 646

Query: 615 LR 616
           L+
Sbjct: 647 LQ 648


>gi|345796308|ref|XP_545176.3| PREDICTED: LOW QUALITY PROTEIN: FAST kinase domain-containing
           protein 3 [Canis lupus familiaris]
          Length = 672

 Score = 43.9 bits (102), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 36/62 (58%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           V K+VA  ID P  FS N+   LG   +K+R++   G+ VV + + E E L+   E ++Y
Sbjct: 588 VHKRVALCIDDPKRFSLNSKHLLGKEAIKQRHLRLLGYQVVQIPYYEIEVLKSRGELVEY 647

Query: 615 LR 616
           L+
Sbjct: 648 LQ 649


>gi|332820899|ref|XP_517625.3| PREDICTED: FAST kinase domain-containing protein 3 [Pan
           troglodytes]
 gi|410217416|gb|JAA05927.1| FAST kinase domains 3 [Pan troglodytes]
 gi|410254100|gb|JAA15017.1| FAST kinase domains 3 [Pan troglodytes]
 gi|410288824|gb|JAA23012.1| FAST kinase domains 3 [Pan troglodytes]
 gi|410339215|gb|JAA38554.1| FAST kinase domains 3 [Pan troglodytes]
          Length = 662

 Score = 43.9 bits (102), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 35/62 (56%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           + K++A  IDGP  F  N+   LG   +K+R++   G+ VV + + E   L+   E ++Y
Sbjct: 587 IHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEIGMLKSRRELVEY 646

Query: 615 LR 616
           L+
Sbjct: 647 LQ 648


>gi|410951914|ref|XP_003982637.1| PREDICTED: protein TBRG4 isoform 2 [Felis catus]
          Length = 630

 Score = 43.9 bits (102), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 45/159 (28%), Positives = 68/159 (42%), Gaps = 20/159 (12%)

Query: 259 LAFTRQREMSMLVAIA--MTALPECSAQG-ISNIAWALSKIGGELLYLSEMDRVAEVALT 315
           LA   +R + +L A++  +   P    +G + ++A+A  K+G        + R+A   L 
Sbjct: 264 LAAQNRRSVPLLRAVSYHLVQKPFPLTKGMLLDLAYAYGKLGFH--QTQVLQRLAADLLP 321

Query: 316 KVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL--- 372
            V    S  VA  A +FA ++  +P LF  LA+   D  HT     L  VL AFA L   
Sbjct: 322 HVPSLTSGEVARGAKSFALLKWLSPPLFEALAQHVVDRAHTVTVPHLCNVLLAFAHLNFR 381

Query: 373 -----------YEPADPLLESLDNAFK-DATQFTCCLNK 399
                      +E   P L+SL  A + D     C L +
Sbjct: 382 PEREDKFFGLVHEKLGPKLQSLHPALQVDVVWALCVLQQ 420


>gi|351714996|gb|EHB17915.1| FAST kinase domain-containing protein 1 [Heterocephalus glaber]
          Length = 778

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 6/75 (8%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFE--QLDY 614
           +++AFE      F RN     G + +K+R++   G+ V+ + H EW  +  S +  ++DY
Sbjct: 708 ERIAFEFLDSKAFCRNIPHLKGKSAMKKRHLEILGYRVIQIPHFEWNSMALSTKDARMDY 767

Query: 615 LRVILKDYIGGEGSS 629
           LR     +I GEG+S
Sbjct: 768 LR----QHIFGEGTS 778


>gi|294872955|ref|XP_002766462.1| hypothetical protein Pmar_PMAR018296 [Perkinsus marinus ATCC 50983]
 gi|239867342|gb|EEQ99179.1| hypothetical protein Pmar_PMAR018296 [Perkinsus marinus ATCC 50983]
          Length = 1082

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 49/223 (21%), Positives = 95/223 (42%), Gaps = 41/223 (18%)

Query: 186 QFSGPSNRRK----EINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATAL 241
           +  G SN+R+    E  + + I+ A  ++     I+ ++  V K L    L+ +N++T +
Sbjct: 596 RVGGHSNQRQATANEFEIQRSILAAANSRS----ISSLLLIVEKHLDE--LNSVNVSTLI 649

Query: 242 HRIA---KNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGG 298
           HR+A   +N E+      ++        +  ++  A+   P  S Q +SNI WA+    G
Sbjct: 650 HRLASITQNQEQ------NQRVLANDPRVKEVLRRAIDLAPTSSCQSLSNICWAI----G 699

Query: 299 ELLYLSE------------------MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
           +L  + E                  MD VAE     +  F  Q V+N+  A+  +     
Sbjct: 700 KLQMVEEKDVVRAIVEAAKSQLEELMDLVAEKVANSLYTFKPQEVSNLLYAYGRLNCYNE 759

Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESL 383
            L  E+    + ++  +  Q +  V+ + A L  P   L++++
Sbjct: 760 KLLQEICACVATMMPRYDGQGVGNVICSLAKLKYPCIQLMDAI 802


>gi|410949837|ref|XP_003981623.1| PREDICTED: FAST kinase domain-containing protein 3 [Felis catus]
          Length = 718

 Score = 43.9 bits (102), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 545 VDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEE 604
           V  +T+D   V K++A  ID P  FS N+   LG   +K+R++   G+ VV + + E E 
Sbjct: 578 VLPFTIDED-VHKRLALCIDDPKRFSLNSRHLLGKEAIKQRHLRLLGYQVVQIPYYEIEM 636

Query: 605 LQGSFEQLDYLR 616
           L+   E ++YL+
Sbjct: 637 LKSRVELVEYLQ 648


>gi|149032746|gb|EDL87601.1| similar to hypothetical protein MGC5297, isoform CRA_a [Rattus
           norvegicus]
          Length = 168

 Score = 43.5 bits (101), Expect = 0.31,   Method: Composition-based stats.
 Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 1/69 (1%)

Query: 548 YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQG 607
           +TVD   V  +VA  IDGP  F   +   LG   +K+R++   G+ VV + + E E L  
Sbjct: 89  FTVDED-VHTRVALCIDGPQRFCLGSKHLLGKEAIKQRHLRLLGYQVVQVPYHELELLTS 147

Query: 608 SFEQLDYLR 616
             E +DYL+
Sbjct: 148 RLELVDYLQ 156


>gi|153206845|ref|ZP_01945686.1| hypothetical protein A35_A0967 [Coxiella burnetii 'MSU Goat Q177']
 gi|120577208|gb|EAX33832.1| hypothetical protein A35_A0967 [Coxiella burnetii 'MSU Goat Q177']
          Length = 438

 Score = 43.5 bits (101), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 50/184 (27%), Positives = 77/184 (41%), Gaps = 20/184 (10%)

Query: 282 SAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHS 338
           + QGI+N  WAL+ +G    EL      DR+         +FN QNV N    FA++   
Sbjct: 98  NPQGIANTLWALATMGVRRQELEAQGLNDRLMGAVHHNAEQFNPQNVTNTLWTFATLSVK 157

Query: 339 APDL-FSELAKRASDIVH----TFQEQELAQVLWAFASL---------YEPADPLLESLD 384
             +L   EL     + VH        Q +   LWA A++          E  D LLE++ 
Sbjct: 158 WEELEAQELNDCLLNAVHRNADQLNPQGIVNTLWALATMGVRWRELEVRELTDRLLEAVR 217

Query: 385 -NAFKDATQFTCCLNKALSNCN-ENGGVKSSGDADS-EGSLSSPVLSFNRDQLGNIAWSY 441
            NA +  ++       AL+  +   G +++ G  D   G++   V  FN   + N  W  
Sbjct: 218 YNASRFKSREIANTLWALATLSVRRGNMEAQGLRDRLLGAVHHNVERFNPQDIANALWGL 277

Query: 442 AVLG 445
           A +G
Sbjct: 278 ATMG 281


>gi|156102949|ref|XP_001617167.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148806041|gb|EDL47440.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 943

 Score = 43.1 bits (100), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 24/104 (23%), Positives = 50/104 (48%), Gaps = 6/104 (5%)

Query: 527 EVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRY 586
           E++++L    ++ ++   ++    D +L D ++     GP  +  N+ V    + LKR  
Sbjct: 738 ELSKILARINVSHLKSVYINHICADIMLPDSQIVIMCLGPYSYYVNSLVTTSTSDLKRSI 797

Query: 587 IAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIGGEGSSN 630
           +    + V+ LS+ EW +L    E++ +L      Y  G G++N
Sbjct: 798 LEKKKYKVIPLSYHEWNKLNDYEEKIRFL------YAFGRGAAN 835


>gi|302781714|ref|XP_002972631.1| hypothetical protein SELMODRAFT_413126 [Selaginella moellendorffii]
 gi|300160098|gb|EFJ26717.1| hypothetical protein SELMODRAFT_413126 [Selaginella moellendorffii]
          Length = 177

 Score = 43.1 bits (100), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 23/47 (48%), Positives = 29/47 (61%), Gaps = 1/47 (2%)

Query: 574 GVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQL-DYLRVIL 619
           G  LGHT+LK R + AA W ++S S+ EWE LQG    L  Y R+ L
Sbjct: 126 GDLLGHTVLKHRLVEAAEWKIISASYAEWENLQGESGHLTSYKRLWL 172


>gi|221057756|ref|XP_002261386.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
            knowlesi strain H]
 gi|194247391|emb|CAQ40791.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
            knowlesi strain H]
          Length = 1303

 Score = 43.1 bits (100), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 56/103 (54%), Gaps = 9/103 (8%)

Query: 521  TSSFQKEVARLLVSTGLNWIREYA--VDG-YTVDAVLVDKKVAFEIDGPTHFS-----RN 572
            +SSF +EV   L+S  +  ++     +DG YTVD ++++  V  EI+G  H+      + 
Sbjct: 1196 SSSFHREVLSTLLSLDVKNVQCEVPFMDGIYTVD-IVINNSVCIEINGSNHYYYDNNLKR 1254

Query: 573  TGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
            +G  L    L + Y+ +  + ++ +S+ +W  L+ + E+ DYL
Sbjct: 1255 SGEKLDALNLIKYYLLSKKYKLILVSYLDWNNLKSAEEKKDYL 1297


>gi|410909325|ref|XP_003968141.1| PREDICTED: FAST kinase domain-containing protein 3-like [Takifugu
           rubripes]
          Length = 665

 Score = 43.1 bits (100), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 73/331 (22%), Positives = 133/331 (40%), Gaps = 31/331 (9%)

Query: 313 ALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
           A+  V  F    +  V GA     HS       + K    +  T   + + QV+  F+S 
Sbjct: 327 AVRHVPHFTDDELTGVLGALMHFGHSDHYFVDAMEKYVPTMTFTSHPETVTQVIQFFSSR 386

Query: 373 YEPADPLLESLDNAF-KDATQF-TCCLNKALSNCNENGGVKSSGDA---DSEGSLSSPVL 427
              +  +L+++  +F   A  F T  + K +    + G +  +        E  L S   
Sbjct: 387 NILSPTVLDAVAESFVYRADDFSTTQVAKHIMALGKLGYLPPNAGTVFRKVENILHSHFS 446

Query: 428 SFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQC 487
            F    L N+  S  ++ +    F S ++K  S F +Q   +  R D    +Q+  +   
Sbjct: 447 HFQPQSLLNLLHSCTLVERFPVNFVSKVFK--SYFLQQLQEDGNRVDRYVLAQLTQLYMT 504

Query: 488 LKLEHPHLQ-----------------LALSSVLEEKIASAGKTKRFNQKVTSSFQKEVAR 530
           +KLE P  +                  +L + ++  + ++ KT   N  +  +     ++
Sbjct: 505 MKLECPFYEGPRLPPKYQVKSFLLPGRSLETPVDLHLYNSVKTGLVN--LLGARHYFGSK 562

Query: 531 LLVSTGLNWIREYAVD--GYTVDAVLVD---KKVAFEIDGPTHFSRNTGVPLGHTMLKRR 585
           +L S       E  +D  G+ + A  VD   K++A  IDG   F+ N    LG   +K+R
Sbjct: 563 VLTSNCYTLDVEIKLDEEGFVLPASHVDEVCKRIAVCIDGRKRFTVNKRQLLGKEAIKQR 622

Query: 586 YIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           ++   G+ VV +   E+E+LQ     ++YL 
Sbjct: 623 HLRLLGYEVVQIPFYEFEKLQNQASVVEYLH 653


>gi|221056200|ref|XP_002259238.1| RAP protein [Plasmodium knowlesi strain H]
 gi|193809309|emb|CAQ40011.1| RAP protein, putative [Plasmodium knowlesi strain H]
          Length = 740

 Score = 42.7 bits (99), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 23/108 (21%), Positives = 46/108 (42%)

Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHT 580
           TS   KE++ +L    +  +   A   + VD          E++ P  +   +       
Sbjct: 577 TSMLHKEISDILTQIKVEHLNSVACGPFIVDIYHPHSNCIIEVNAPFQYYLTSEKLTTLA 636

Query: 581 MLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIGGEGS 628
             + +++A  G+ ++ +SH+ W  L    +++DYL   L   + G GS
Sbjct: 637 EWRHKFLARMGFRIIHISHKVWSSLPTDKQKVDYLSRALPAAMFGRGS 684


>gi|397635941|gb|EJK72081.1| hypothetical protein THAOC_06426, partial [Thalassiosira oceanica]
          Length = 198

 Score = 42.7 bits (99), Expect = 0.53,   Method: Composition-based stats.
 Identities = 38/161 (23%), Positives = 64/161 (39%), Gaps = 37/161 (22%)

Query: 286 ISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSE 345
           +S  AWA +  G     L E      V    +G F  ++++N A AFA+   S P+LF +
Sbjct: 41  LSITAWAFATSGVSHSELFEKIGNHVVGPGGLGSFKPRDLSNTAWAFATAGVSHPELFKK 100

Query: 346 LAKRASD--IVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSN 403
           +    ++     +F+ QEL+  +WA A++    + L  +          F   +   L  
Sbjct: 101 IGHHVAEQGCFDSFKPQELSNTVWACATVGYTDERLFSA----------FAPVIGSKLDE 150

Query: 404 CNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVL 444
           C+E                          +L NIAW+Y+ L
Sbjct: 151 CSEQ-------------------------ELTNIAWAYSTL 166


>gi|412993943|emb|CCO14454.1| predicted protein [Bathycoccus prasinos]
          Length = 970

 Score = 42.7 bits (99), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 6/72 (8%)

Query: 317 VGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAF------A 370
           + EFN++++ANV  AFA    +   +   +AKRA++I+ TF  QEL + L A        
Sbjct: 599 IDEFNARDLANVTEAFAKRLDTPEKVLKTIAKRAAEILDTFNAQELLKFLGALERAGGDV 658

Query: 371 SLYEPADPLLES 382
             YE  + LL S
Sbjct: 659 HKYEKLNELLRS 670


>gi|70947243|ref|XP_743256.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56522666|emb|CAH77389.1| hypothetical protein PC000205.02.0 [Plasmodium chabaudi chabaudi]
          Length = 378

 Score = 42.7 bits (99), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 18/67 (26%), Positives = 35/67 (52%)

Query: 554 LVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLD 613
           L+ K +  E+DG +HF + +     ++++K   +   GWN++ + +QEW +      +L 
Sbjct: 174 LLTKNIVIEVDGISHFYKESYSRTLNSIIKNYILKKFGWNIIHIPYQEWNQCYNFKTKLL 233

Query: 614 YLRVILK 620
           Y   I K
Sbjct: 234 YAIHIFK 240


>gi|149704825|ref|XP_001497489.1| PREDICTED: protein TBRG4-like isoform 1 [Equus caballus]
          Length = 632

 Score = 42.7 bits (99), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 153/398 (38%), Gaps = 84/398 (21%)

Query: 259 LAFTRQREMSMLVAIA--MTALPECSAQG-ISNIAWALSKIGGELLYLSEMDRVAEVALT 315
           LA   +R + +L AI+  +   P    +G + ++A+A  K+G          R+A   L 
Sbjct: 266 LAAQNRRSVPLLRAISYHLVQKPFPLTKGMLLDLAYAYGKLGFH--QTQVFQRLAADLLP 323

Query: 316 KVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YE 374
                 S  VA  A +FA ++     LF   A+   +   +     L  +L AFA L + 
Sbjct: 324 HTPSLTSGEVARCAKSFAFLKWLNLPLFEAFAQHVLNRAQSTTVPHLCNMLLAFARLNFR 383

Query: 375 PADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQL 434
           P            +   QF   + + L +  E  G+  +   D                 
Sbjct: 384 P------------EREDQFFSLVREKLGS--ELAGLDPALQVD----------------- 412

Query: 435 GNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRIS-EQYREDIMFASQVHLVNQCLKLEHP 493
             + W+  VL Q+       + +    F  Q +  E  ++  +F   +H +N   +LEHP
Sbjct: 413 --VVWALCVLQQVREAELRAVLR--PEFHTQFLGGESPKDQSIFQKLLH-INATAQLEHP 467

Query: 494 HLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEV-------------ARLLVSTGLNWI 540
                 +S L    A   +    ++KVT   QKE+                +V+T   W+
Sbjct: 468 EY----TSPLLPVSALVPRLSALDKKVTP-LQKELQETLKGLLGSSDRGSFMVATQYGWV 522

Query: 541 --REYAVDGYTVDAVLVD-------------------KKVAF-EIDGPTHFSRNTGVPLG 578
              E  +D  +    L D                   K++AF   + P   SR+  + LG
Sbjct: 523 LDAEVLLDADSQFLPLRDFVAPHLAPPSGSQPLPPGAKRLAFLRWEFPNFNSRSKDL-LG 581

Query: 579 HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
             +L RR++ AAG+ VV + + EW EL+  +++  YL+
Sbjct: 582 RFVLARRHVLAAGFLVVDVPYYEWLELKSEWQKGAYLK 619


>gi|302851686|ref|XP_002957366.1| hypothetical protein VOLCADRAFT_98423 [Volvox carteri f.
           nagariensis]
 gi|300257325|gb|EFJ41575.1| hypothetical protein VOLCADRAFT_98423 [Volvox carteri f.
           nagariensis]
          Length = 1061

 Score = 42.7 bits (99), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 1/91 (1%)

Query: 280 ECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
           + S Q ++  AWA+ ++G +   L    RV   AL   G+   Q+++++  A A   H  
Sbjct: 298 DASPQALALTAWAVVQLGEQPPPLEWWRRVQGAALRLRGQLQPQDISHLVWATARSGHPP 357

Query: 340 P-DLFSELAKRASDIVHTFQEQELAQVLWAF 369
           P D  + +   A   +  F+ QE+  +LW  
Sbjct: 358 PPDWLAAMCTEAHGCLRGFRAQEVCNLLWGL 388


>gi|189183089|ref|YP_001936874.1| repeat-containing protein A_01 [Orientia tsutsugamushi str. Ikeda]
 gi|189179860|dbj|BAG39640.1| repeat-containing protein A_01 [Orientia tsutsugamushi str. Ikeda]
          Length = 631

 Score = 42.7 bits (99), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 80/397 (20%), Positives = 149/397 (37%), Gaps = 71/397 (17%)

Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEV----ALTKVGEFNSQNVANVA 329
           A+  +   + QG++N  WAL +     L +    +  E     A   +  F +Q+++N  
Sbjct: 164 AIKTIDHFTTQGLANSLWALGR-----LEIHPQAKFIEAWIHHATKTIDHFTTQDLSNSL 218

Query: 330 GAFASMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLES-LDNA 386
            A   ++ H   +      + A+  +  F  Q+L+  LW    L   P    +E+ + +A
Sbjct: 219 WALGRLEIHPQAEFIEAWIRHATKTIDHFTTQDLSNSLWGLGRLEIHPQAKFIEAWIHHA 278

Query: 387 FKDATQFTCCLNKALSNCNEN-GGVKSSGDADSEGSL----SSPVLSFNRDQLGNIAWSY 441
            K    FT    + LSN     G ++    A+   +     +  +  F    L N  W+ 
Sbjct: 279 TKTIDHFT---TQDLSNSLWGLGRLEIHPQAEFIEAWIRHATKTIDHFTTQDLSNSLWAL 335

Query: 442 AVLGQMDRIFFSDIW-----KTISRFEEQ--------------------RISEQYREDI- 475
             L    +  F + W     KTI  F  Q                    ++ +Q+   + 
Sbjct: 336 GQLEIHPQAEFIEAWIHHATKTIDHFTTQGLANSIYGIFILNVLCDSKIKVPQQFISAVN 395

Query: 476 ----MFASQVHLVNQCLKLEHPHLQLALSSV----------LEEKIAS---AGKTKRFNQ 518
               +F   +  ++Q LK    H       V          LE+K  +      T     
Sbjct: 396 KNIELFDENIEGISQILK---AHYYFGKQGVGILTSQNRQFLEKKFKTKLTPCHTSNLQL 452

Query: 519 KVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLG 578
            V    +K +A+ LV +  ++I++      +VD  + DK    ++DGP HF  N   P  
Sbjct: 453 NVLKVVKKVLAQHLVKSE-HYIKQITS---SVDIFIKDKNTVIQVDGPCHFDDNNA-PNI 507

Query: 579 HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
            T L    + + G+ V  + +  W +L+ + ++  Y+
Sbjct: 508 STRLNTELLKSYGYIVHRIPYWVWNKLRTNTDKEKYI 544


>gi|157864875|ref|XP_001681146.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68124440|emb|CAJ02301.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 442

 Score = 42.7 bits (99), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 2/95 (2%)

Query: 283 AQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
           A+G++N+  A SK G     L  +  +    L +VGEF + ++  +A AFA ++     +
Sbjct: 110 AKGVTNVISAFSKTGINHEKLFGLLSMRVQTLARVGEFEAAHLVILANAFARLRFREQHV 169

Query: 343 FSELAKRASDIVHTFQEQELAQVLWAF--ASLYEP 375
           FS +A+RA  +       EL  ++ AF  A L +P
Sbjct: 170 FSAIARRAMSLRERVTVNELVPLINAFSKAGLKDP 204


>gi|6841122|gb|AAF28914.1|AF161354_1 HSPC091 [Homo sapiens]
          Length = 193

 Score = 42.7 bits (99), Expect = 0.68,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 36/62 (58%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           + K++A  IDGP  F  N+   LG   +K+R++   G+ VV + ++E   L+   E ++Y
Sbjct: 118 IHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYREIGMLKSRRELVEY 177

Query: 615 LR 616
           L+
Sbjct: 178 LQ 179


>gi|302828418|ref|XP_002945776.1| hypothetical protein VOLCADRAFT_127357 [Volvox carteri f.
            nagariensis]
 gi|300268591|gb|EFJ52771.1| hypothetical protein VOLCADRAFT_127357 [Volvox carteri f.
            nagariensis]
          Length = 1323

 Score = 42.7 bits (99), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 5/52 (9%)

Query: 558  KVAFEIDGPTHFSRNTGVP---LGHTMLKRRYIAAAGWNVVSLSHQEWEELQ 606
            ++A E+DGP+HF  N  VP   LG T+ + R + A G  +V + H EW  LQ
Sbjct: 1153 RIAVEVDGPSHFCAN--VPNHALGATVARDRCLQALGLQLVVVPHFEWYLLQ 1202


>gi|156098671|ref|XP_001615351.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148804225|gb|EDL45624.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 735

 Score = 42.4 bits (98), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 24/115 (20%), Positives = 52/115 (45%), Gaps = 6/115 (5%)

Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHT 580
           TS+  KE++ +L    +  +   A   + VD          E + P  +  N+      +
Sbjct: 562 TSTLHKEISSILTLIKIEHLNSVACGPFIVDIYHPPSNYIIEANAPFQYYLNSERLTALS 621

Query: 581 MLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL------KDYIGGEGSS 629
             + +++A  G+ ++ +SH+ W  L    +++DYL  +L      + + GG+ S+
Sbjct: 622 EWRHKFLARMGFRLIHISHKVWNSLPTEKQRVDYLLRVLPAGMLGRAHPGGKDST 676


>gi|301789123|ref|XP_002929978.1| PREDICTED: FAST kinase domain-containing protein 3-like [Ailuropoda
           melanoleuca]
          Length = 667

 Score = 42.4 bits (98), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 37/62 (59%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           V K+VA  ID P  FS ++   LG   +K+R++   G++VV + + E + L+   E ++Y
Sbjct: 587 VHKRVALCIDDPKRFSLDSKHLLGKEAIKQRHLRLLGYHVVQIPYYEIKMLKSRVELVEY 646

Query: 615 LR 616
           L+
Sbjct: 647 LQ 648


>gi|294951459|ref|XP_002786991.1| hypothetical protein Pmar_PMAR006407 [Perkinsus marinus ATCC 50983]
 gi|239901581|gb|EER18787.1| hypothetical protein Pmar_PMAR006407 [Perkinsus marinus ATCC 50983]
          Length = 633

 Score = 42.4 bits (98), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 52/223 (23%), Positives = 95/223 (42%), Gaps = 41/223 (18%)

Query: 186 QFSGPSNRRK----EINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATAL 241
           +  G SN+R+    E  + + I+ A  ++     I+ ++  V K L    L+ +N++T +
Sbjct: 147 RVGGHSNQRQATANEFEIQRSILAAANSRS----ISSLLLIVEKHLDE--LNSVNVSTLI 200

Query: 242 HRIA---KNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGG 298
           HR+A   +N E+       R+     R   +L   A+   P  S Q +SNI WA+    G
Sbjct: 201 HRLASITQNQEQ-----NQRVLANDPRVKEVLRR-AIDLAPTSSCQSLSNICWAI----G 250

Query: 299 ELLYLSE------------------MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
           +L  + E                  MD VAE     +  F  Q V+N+  A+  +     
Sbjct: 251 KLQMVEEKDVVRAIVEAAKSQLEELMDLVAEKVANTLYTFKPQEVSNLLYAYGRLNCYNE 310

Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESL 383
            L  E+    + ++  +  Q +  V+ + A L  P   L++++
Sbjct: 311 KLLQEICACVATMMPRYDGQGVGNVICSLAKLKYPCIQLMDAI 353


>gi|397606496|gb|EJK59336.1| hypothetical protein THAOC_20457, partial [Thalassiosira oceanica]
          Length = 146

 Score = 42.4 bits (98), Expect = 0.77,   Method: Composition-based stats.
 Identities = 32/143 (22%), Positives = 56/143 (39%), Gaps = 35/143 (24%)

Query: 320 FNSQNVANVAGAFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPAD 377
           FN Q++AN+  +FA    + P+LF  +    + +  + +F+ Q+L+  +WAFA+      
Sbjct: 31  FNPQHLANILWSFAKSGEADPELFQAIGNHITGLGSLDSFKPQDLSNTIWAFATAGVSYP 90

Query: 378 PLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNI 437
            L E +            CLN                             SF +    N 
Sbjct: 91  ALFEKIGGHIVGLD----CLN-----------------------------SFKQQDFSNT 117

Query: 438 AWSYAVLGQMDRIFFSDIWKTIS 460
           AW++A +G+ +   F  I   I+
Sbjct: 118 AWAFAKVGESNPKLFKKIGDYIA 140


>gi|119628500|gb|EAX08095.1| hypothetical protein MGC5297, isoform CRA_b [Homo sapiens]
          Length = 194

 Score = 42.4 bits (98), Expect = 0.79,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 35/62 (56%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           + K++A  IDGP  F  N+   LG   +K+R++   G+ VV + + E   L+   E ++Y
Sbjct: 119 IHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEIGMLKSRRELVEY 178

Query: 615 LR 616
           L+
Sbjct: 179 LQ 180


>gi|428186081|gb|EKX54932.1| hypothetical protein GUITHDRAFT_99583 [Guillardia theta CCMP2712]
          Length = 824

 Score = 42.4 bits (98), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 43/175 (24%), Positives = 73/175 (41%), Gaps = 24/175 (13%)

Query: 230 SPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNI 289
           S L P  ++  L   A+       + T   +  RQR  S            CS Q ++N+
Sbjct: 471 SALKPAELSMTLWACARYHHPSKWLYTRFSSEMRQRGFS-----------NCSTQELANL 519

Query: 290 AWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSEL 346
            WAL++     G+L+Y    D   EV+   V   NS+++ N+    A  +     L SE+
Sbjct: 520 CWALTESSDEYGDLVY----DVAQEVSSRPVNPRNSKDMRNILCCIAKSRVPDCGLASEV 575

Query: 347 AKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLD-----NAFKDATQFTC 395
           A+       T   +      WAF+ + + P+  L +S +     +A  ++T F C
Sbjct: 576 ARELEASGSTTSVRAWILTFWAFSHIAFIPSSDLQQSFETKVQGDAISNSTTFLC 630


>gi|348561900|ref|XP_003466749.1| PREDICTED: FAST kinase domain-containing protein 3-like [Cavia
           porcellus]
          Length = 669

 Score = 42.4 bits (98), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 34/62 (54%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           V K++A  ID P  F  N    LG   +K+R++   G+ VV + + E E+L+   + + Y
Sbjct: 588 VHKRIALCIDDPNRFCSNGIHLLGKEAIKQRHLGLLGYEVVQVPYHEMEKLKSRHQLVKY 647

Query: 615 LR 616
           L+
Sbjct: 648 LQ 649


>gi|291394925|ref|XP_002713900.1| PREDICTED: transforming growth factor beta regulated gene 4-like
           [Oryctolagus cuniculus]
          Length = 632

 Score = 42.4 bits (98), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 48/185 (25%), Positives = 80/185 (43%), Gaps = 42/185 (22%)

Query: 477 FASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLL---- 532
           F   +H +N   +LEHP      S  L   +A A +    +QKVT   QKE+   L    
Sbjct: 452 FLKLLH-INATARLEHPEY----SGPLLPALAVAPRPPAPDQKVTP-LQKELQETLKGLL 505

Query: 533 ---------VSTGLNWI----------------REYAVDGYTVDA-----VLVDKKVAF- 561
                    V+T   W+                R++        A      L  K++AF 
Sbjct: 506 GSADRGSFEVATQYGWVLDAEVLLDADGQFLPLRDFVAPHLAQPAGGQPLPLGAKRLAFL 565

Query: 562 EIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
             + P++ SR+  + LG  +L RR++ AAG+ VV + + EW EL+  +++  YL+  ++ 
Sbjct: 566 RWEFPSYNSRSKDL-LGRFVLARRHLLAAGFLVVDVPYYEWLELKSEWQKGAYLKDKMRK 624

Query: 622 YIGGE 626
            +  E
Sbjct: 625 VVAEE 629


>gi|407395839|gb|EKF27268.1| hypothetical protein MOQ_009016 [Trypanosoma cruzi marinkellei]
          Length = 521

 Score = 42.0 bits (97), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 50/92 (54%), Gaps = 4/92 (4%)

Query: 283 AQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
           ++G++NI  A SK G   E L+     RV  +A  +VGEF + ++  +A AF+ +++   
Sbjct: 187 SKGVANIISAFSKTGINHEKLFGFLSKRVQTLA--RVGEFEAAHLVIIANAFSRLRYRDK 244

Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
            LF  +A+RA  +       EL  ++ AF+ +
Sbjct: 245 FLFGAIARRAMSLRERVTVNELVPLIVAFSKI 276


>gi|344268382|ref|XP_003406039.1| PREDICTED: FAST kinase domain-containing protein 1 [Loxodonta
           africana]
          Length = 839

 Score = 42.0 bits (97), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 6/75 (8%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFE--QLDY 614
           +++A E      F RN     G + +K+R++   G++V+ + H EW  +  S +  Q+DY
Sbjct: 769 ERIALEFLYSRAFCRNIPHLKGVSAMKKRHLEILGYHVIQIPHFEWNSMALSTKDAQMDY 828

Query: 615 LRVILKDYIGGEGSS 629
           LR    + I GEG S
Sbjct: 829 LR----ERIFGEGKS 839


>gi|417403497|gb|JAA48549.1| Putative fast kinase-like protein [Desmodus rotundus]
          Length = 632

 Score = 42.0 bits (97), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 79/349 (22%), Positives = 126/349 (36%), Gaps = 81/349 (23%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQV 365
           + R+A   L       S  VA  A +FA ++     LF   A+       +     L  +
Sbjct: 314 LQRLAADLLPHTPSLTSSEVARCAKSFAFLKWLNLPLFEAFAQHVLSRAQSITVPPLCNM 373

Query: 366 LWAFASL-YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSS 424
           L AFA L + P            +   +F   +++ L                 EG L+S
Sbjct: 374 LLAFARLNFHP------------EQEDEFFSLVHEKL-----------------EGQLAS 404

Query: 425 --PVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQY-REDIMFASQV 481
             P L  +      + W+  VLGQ        +     +F  Q + +Q  +    F   +
Sbjct: 405 LGPALQVD------VLWALCVLGQAQEAELRAV--LCPQFHTQLLGDQSPKGQSTFQKLL 456

Query: 482 HLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEV------------- 528
           H VN   +LEHP      S  L    A   +    ++KVT   QKE+             
Sbjct: 457 H-VNATAQLEHPEY----SGPLLPASALVPRPSALDRKVTP-LQKELQGALKGLLGSADR 510

Query: 529 ARLLVSTGLNWIR----------EYAVDGYTVDAVLVD-----------KKVAFEIDGPT 567
            R  V     W+           ++   G  V   L             K++AF     +
Sbjct: 511 GRFTVPMQYGWVLDAEVLLGAEGQFLPLGDFVAPHLAPPSEGQPLPPGAKRLAFLRWEFS 570

Query: 568 HFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           +F+  +   LG   L RR++ AAG+ VV + H EW EL+  +++  YL+
Sbjct: 571 NFNSRSKDLLGRFALARRHVLAAGFLVVDVPHYEWLELKSDWQKGAYLK 619


>gi|71420728|ref|XP_811585.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70876263|gb|EAN89734.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 521

 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 50/92 (54%), Gaps = 4/92 (4%)

Query: 283 AQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
           ++G++NI  A SK G   E L+     RV  +A  +VGEF + ++  +A AF+ +++   
Sbjct: 187 SKGVANIISAFSKTGINHEKLFGFLSKRVQTLA--RVGEFEAAHLVIIANAFSRLRYRDK 244

Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
            LF  +A+RA  +       EL  ++ AF+ +
Sbjct: 245 FLFGAIARRAMSLRERVTVNELVPLIVAFSKI 276


>gi|124806224|ref|XP_001350662.1| RAP protein, putative [Plasmodium falciparum 3D7]
 gi|23496788|gb|AAN36342.1| RAP protein, putative [Plasmodium falciparum 3D7]
          Length = 1505

 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 14/51 (27%), Positives = 31/51 (60%)

Query: 554  LVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEE 604
            +++K +  E+DG +HF + +     ++++K   +   GWNV+ + +QEW +
Sbjct: 1276 VLNKNIVIEVDGISHFYKESFSRTINSVIKDYILKKLGWNVIHIPYQEWNQ 1326


>gi|399217206|emb|CCF73893.1| unnamed protein product [Babesia microti strain RI]
          Length = 570

 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 24/109 (22%), Positives = 49/109 (44%)

Query: 516 FNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGV 575
           F +     F K+VAR+L    +  ++      +T+D    ++ +  E   P  F   TG 
Sbjct: 462 FGRSYHEDFVKDVARILTLLNIEAVKGVIAGPFTLDLYSSERNLVIECCPPYQFYTQTGS 521

Query: 576 PLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIG 624
                  + + I A G+++V + +++W  L    ++  +L  IL ++I 
Sbjct: 522 YTTCASWRHKLIRAMGFHLVLVPYKKWYSLPSDNDKGAFLTTILPNHIA 570


>gi|407832047|gb|EKF98310.1| hypothetical protein TCSYLVIO_010792 [Trypanosoma cruzi]
          Length = 521

 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 50/92 (54%), Gaps = 4/92 (4%)

Query: 283 AQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
           ++G++NI  A SK G   E L+     RV  +A  +VGEF + ++  +A AF+ +++   
Sbjct: 187 SKGVANIISAFSKTGINHEKLFGFLSKRVQTLA--RVGEFEAAHLVIIANAFSRLRYRDK 244

Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
            LF  +A+RA  +       EL  ++ AF+ +
Sbjct: 245 FLFGAIARRAMSLRERVTVNELVPLIVAFSKI 276


>gi|401416346|ref|XP_003872668.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322488892|emb|CBZ24142.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 442

 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 2/95 (2%)

Query: 283 AQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
           A+G++NI  A SK G     L  +  +    L +VGEF + ++  +A AFA ++     +
Sbjct: 110 AKGVTNIISAFSKTGINHEKLFGLLSMRVQTLARVGEFEAAHLVILANAFARLRFREQHV 169

Query: 343 FSELAKRASDIVHTFQEQELAQVLWAF--ASLYEP 375
           F  +A+RA  +       EL  ++ AF  A L +P
Sbjct: 170 FGAIARRAMSLRERVTVNELVPLINAFSKAGLKDP 204


>gi|146078054|ref|XP_001463441.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|398010941|ref|XP_003858667.1| hypothetical protein, conserved [Leishmania donovani]
 gi|134067526|emb|CAM65806.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|322496876|emb|CBZ31947.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 442

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 2/95 (2%)

Query: 283 AQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
           A+G++NI  A SK G     L  +  +    L +VGEF + ++  +A AFA ++     +
Sbjct: 110 AKGVTNIISAFSKTGINHEKLFGLLSMRVQTLARVGEFEAAHLVILANAFARLRFREQHV 169

Query: 343 FSELAKRASDIVHTFQEQELAQVLWAF--ASLYEP 375
           F  +A+RA  +       EL  ++ AF  A L +P
Sbjct: 170 FGAIARRAMSLRERVTVNELVPLINAFSKAGLKDP 204


>gi|432929105|ref|XP_004081183.1| PREDICTED: FAST kinase domain-containing protein 3-like [Oryzias
           latipes]
          Length = 662

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 38/72 (52%), Gaps = 3/72 (4%)

Query: 547 GYTVDAVLVD---KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWE 603
           GY + A   D   K+VA  IDG   F+ N+   LG   +K+R++   G+ V  + + E+E
Sbjct: 578 GYVLHASQTDDVCKRVALCIDGQRRFTSNSRQLLGKETMKQRHLRLLGYEVAQIPYYEFE 637

Query: 604 ELQGSFEQLDYL 615
           +L      ++YL
Sbjct: 638 KLHSKTSVVEYL 649


>gi|68073079|ref|XP_678454.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56498926|emb|CAH97349.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 1637

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 46/211 (21%), Positives = 88/211 (41%), Gaps = 43/211 (20%)

Query: 434  LGNIAWSYAVLGQM--DRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLE 491
            L    W  +++  +  D I F +I+     + E +I EQ   + M+   V  +   LK  
Sbjct: 1434 LARYLWGVSIVNLINDDTINFINIY----NWNEIKIYEQ---NPMYLHMVFTLWLRLKYY 1486

Query: 492  HPHLQLA---------LSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIRE 542
            + HL+L+         ++ +L++     G     N+   S+F  +++++L    + +  E
Sbjct: 1487 YAHLKLSKNFLNFIDKITHILKKIYIKNG----LNKDNLSTFHVQISKILDKFNVKYTNE 1542

Query: 543  YAVDGYTVDAVL-----VDKKVAFEIDGPTH-------FSRNTGV-------PLGHTMLK 583
            Y      +  ++       +K+A EIDGP+H          NT +         G T  K
Sbjct: 1543 YITKDLLIIDIIIILKECKEKIAIEIDGPSHHLLDLSDLHENTSINDNKKYLQCGTTYFK 1602

Query: 584  RRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
               +   GW V+++   EW +++   E  DY
Sbjct: 1603 NFLLKKNGWEVINIPSYEWNKIKK--EDRDY 1631


>gi|159465104|ref|XP_001690764.1| hypothetical protein CHLREDRAFT_180834 [Chlamydomonas reinhardtii]
 gi|158270346|gb|EDO96203.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 690

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 6/63 (9%)

Query: 320 FNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL------Y 373
           FN+Q+V+N   A A + ++  DL   LA+  + +  T   Q+L+ +LWA  +L      Y
Sbjct: 182 FNAQDVSNALWACAKLGYADADLLQRLAEAGAAVAKTMIPQDLSNILWALKALGCTGPAY 241

Query: 374 EPA 376
           +PA
Sbjct: 242 QPA 244


>gi|159490231|ref|XP_001703086.1| predicted protein of CLR family [Chlamydomonas reinhardtii]
 gi|158270832|gb|EDO96665.1| predicted protein of CLR family [Chlamydomonas reinhardtii]
          Length = 1337

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 64/143 (44%), Gaps = 10/143 (6%)

Query: 493  PHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQK--EVARL-LVSTGLNWIREYAVDGYT 549
            P L  A+   +  + A+   T R  ++V  + Q+  +  RL +VS     + E  +    
Sbjct: 1136 PDLLAAMEVAVVAERATGSTTSRLQKQVAEALQRLLQKGRLPIVSVQTEVVVEGVLGRVD 1195

Query: 550  VDAVLVD-KKVAFEIDGPTHFSRN----TGVPLGHTMLKRRYI--AAAGWNVVSLSHQEW 602
            + A   D ++VA E+DGP HF  N        +G T L+ R +  A     +V + + EW
Sbjct: 1196 IVADWSDGRRVAIEVDGPAHFPTNRKDDPSAVIGSTALRNRQLRRAFGEGGLVCVPYWEW 1255

Query: 603  EELQGSFEQLDYLRVILKDYIGG 625
              L+    Q  YL   L+D + G
Sbjct: 1256 YGLRTPTAQEAYLLQRLQDLLSG 1278


>gi|323449653|gb|EGB05539.1| hypothetical protein AURANDRAFT_66278 [Aureococcus anophagefferens]
          Length = 892

 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 48/90 (53%), Gaps = 3/90 (3%)

Query: 286 ISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
           + N+A AL+++G   G +        +   A  +   F+++ +AN A AFA+    AP+L
Sbjct: 501 LGNVAHALARLGAGKGHMDGERAFQSLGRAAAPRAAAFDARELANTAWAFATAGVDAPEL 560

Query: 343 FSELAKRASDIVHTFQEQELAQVLWAFASL 372
               A RA+D V  +  +ELA ++WA A L
Sbjct: 561 MRAFAARAADKVVDYDVRELANLVWALAKL 590


>gi|221501449|gb|EEE27225.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 236

 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 37/73 (50%)

Query: 548 YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQG 607
           + V+  L  K V  E+DGP HF R++      + LK R +A  G+ +  + + +W EL  
Sbjct: 6   FYVEHELDIKGVVLEVDGPQHFYRDSFHWTSASKLKHRLLAGLGFRIAHVPYFDWLELHT 65

Query: 608 SFEQLDYLRVILK 620
              +  YLR  L+
Sbjct: 66  EDVRRVYLRCALE 78


>gi|397632551|gb|EJK70608.1| hypothetical protein THAOC_08020 [Thalassiosira oceanica]
          Length = 701

 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 31/119 (26%), Positives = 57/119 (47%), Gaps = 3/119 (2%)

Query: 270 LVAIAMTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVAN 327
           + + A+  L E  A+ +SN+ ++   +G   E+   +  D   E A+  +  F  Q ++N
Sbjct: 553 IASSAVGMLDEFEARHLSNLIYSFGLVGYNPEIEAETLFDVFGEAAVRILHTFKPQALSN 612

Query: 328 VAGAFASMQHSAPDLFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPADPLLESLDN 385
           +  AF  +      LF E     S + + +F+ Q+ A +LW+FA   E    L ++L N
Sbjct: 613 ILWAFVKVDTKNSRLFQETGGVISGMDLDSFKPQDFANILWSFAKASEADSKLFQALGN 671



 Score = 40.0 bits (92), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 61/148 (41%), Gaps = 38/148 (25%)

Query: 309 VAEVALTKVGEFNSQNVANVAGAFASMQHS----APDLFSELAKRASDIVHTFQEQELAQ 364
           +A  A+  + EF +++++N+  +F  + ++    A  LF    + A  I+HTF+ Q L+ 
Sbjct: 553 IASSAVGMLDEFEARHLSNLIYSFGLVGYNPEIEAETLFDVFGEAAVRILHTFKPQALSN 612

Query: 365 VLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSS 424
           +LWAF  +               K++  F            E GGV S  D D       
Sbjct: 613 ILWAFVKV-------------DTKNSRLF-----------QETGGVISGMDLD------- 641

Query: 425 PVLSFNRDQLGNIAWSYAVLGQMDRIFF 452
              SF      NI WS+A   + D   F
Sbjct: 642 ---SFKPQDFANILWSFAKASEADSKLF 666


>gi|82596268|ref|XP_726191.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23481496|gb|EAA17756.1| hypothetical protein [Plasmodium yoelii yoelii]
          Length = 834

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 21/89 (23%), Positives = 46/89 (51%)

Query: 527 EVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRY 586
           EV+R+L    +N +R   ++    D +L D  +     GP  +  N+ +    + LK+  
Sbjct: 640 EVSRVLTKINVNHLRNVYINNICADIMLPDSNIIIMCLGPYSYYVNSLLTTSISDLKKNI 699

Query: 587 IAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
           +    +NV++L++ +W +L    +Q+++L
Sbjct: 700 LKKKKYNVITLNYHDWNKLNDYEDQINFL 728


>gi|221060957|ref|XP_002262048.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
 gi|193811198|emb|CAQ41926.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
          Length = 955

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 30/125 (24%), Positives = 53/125 (42%), Gaps = 20/125 (16%)

Query: 276 TALPECSAQGISNIAWALSKI--GGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
           T L  C+++ +SN+ +A S +  G   L+      +    + K  + + Q +A +A A+ 
Sbjct: 663 TFLNLCTSEDLSNLCYAYSLVRSGNRELH----SLIQSAIMKKQSDLSPQEIAKIAYAYG 718

Query: 334 SMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQ 392
           +M  +S+  L S L       +H F   E+  +LW +               N F DA  
Sbjct: 719 NMYFYSSYTLLSSLQYEILQRMHQFCHHEICDILWCYCI-------------NRFLDANF 765

Query: 393 FTCCL 397
           + C L
Sbjct: 766 WKCML 770


>gi|84995016|ref|XP_952230.1| hypothetical protein [Theileria annulata]
 gi|65302391|emb|CAI74498.1| hypothetical protein TA13450 [Theileria annulata]
          Length = 460

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 35/133 (26%), Positives = 66/133 (49%), Gaps = 19/133 (14%)

Query: 517 NQKVTSSFQKEVARLLVSTGL-NWIREYAVDGYTVDAVLV--DKKVAFEIDGPTHFSRNT 573
           N K+ S  QK V+  L+   + + +     D  +VD  +   D+K+  E+DGPTHF RN 
Sbjct: 336 NGKIISKSQKLVSDFLIRQNIPHQLEILTSDLSSVDIYICLNDEKIILEVDGPTHFIRNL 395

Query: 574 GVP-----LGHTMLKRRYIAAAGWNVVSLS--HQEWEELQGSFEQLD-YLRVILKDYIGG 625
             P     +G    K + +   G+  +S+   H + + ++    Q+D Y + +L++    
Sbjct: 396 DDPSETRKIGPCHFKEKLLKENGFVFISIPPIHSDTQNIK----QIDEYYKELLQN---- 447

Query: 626 EGSSNIAETLKMD 638
            GS+++ E +K +
Sbjct: 448 SGSAHLNEIMKYN 460


>gi|156120709|ref|NP_001095501.1| FAST kinase domain-containing protein 1 [Bos taurus]
 gi|151554767|gb|AAI50046.1| FASTKD1 protein [Bos taurus]
 gi|296490639|tpg|DAA32752.1| TPA: FAST kinase domains 1 [Bos taurus]
          Length = 832

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 39/75 (52%), Gaps = 6/75 (8%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFE--QLDY 614
           +K+A E      F RN     G + +K+R++   G++V+ + H EW  +  S    ++DY
Sbjct: 762 EKIALEFLDSRAFCRNIPHLKGKSAMKKRHLEILGYHVIQIPHFEWNSMALSTRDARMDY 821

Query: 615 LRVILKDYIGGEGSS 629
           LR    + I GEG S
Sbjct: 822 LR----ERIFGEGKS 832


>gi|83273693|ref|XP_729510.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23487521|gb|EAA21075.1| hypothetical protein [Plasmodium yoelii yoelii]
          Length = 689

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 25/103 (24%), Positives = 49/103 (47%), Gaps = 5/103 (4%)

Query: 311 EVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
           ++   K+ E + Q+++N+  A++ +  +   +++ L  +    +  F EQELA +L A++
Sbjct: 299 KIIYNKINELSYQSISNICNAYSKLNPNDTKIYNILINKIKKNIDKFNEQELANILSAYS 358

Query: 371 SL-YEPADPLLESLDNAFKDATQF----TCCLNKALSNCNENG 408
            L  +  D   +SL+  F     F       +  A S CN N 
Sbjct: 359 KLNIKDFDLFNKSLEYIFHKFYNFKPIEIVMITNAYSKCNINN 401


>gi|294944359|ref|XP_002784216.1| hypothetical protein Pmar_PMAR003475 [Perkinsus marinus ATCC 50983]
 gi|239897250|gb|EER16012.1| hypothetical protein Pmar_PMAR003475 [Perkinsus marinus ATCC 50983]
          Length = 319

 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 42/190 (22%), Positives = 83/190 (43%), Gaps = 24/190 (12%)

Query: 187 FSGPSNRRKEINLNKDIVDAQ-TAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIA 245
            S  + + + I  N+++ +   TAQ++L +  +           +  + +N AT  HR+A
Sbjct: 77  MSAAAFKSQHIAWNRELTNPNATAQQILALAKKHC---------AQFNSVNWATTFHRLA 127

Query: 246 KNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSE 305
           K          H        E+  L+     ++   + Q ++ +AWA++K+   ++    
Sbjct: 128 K-------FHLHEAKSEHSLEIQTLLG-KCDSVEGFAPQHLATLAWAMAKL--HIVDHDL 177

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRA----SDIVHTFQEQE 361
           + RV   +LT   +   Q++AN++ A A +     DL  E   +        +  F+  E
Sbjct: 178 LSRVVHKSLTLHSDLKPQDLANLSWALARLDCPESDLMYECVCQKIMYDRGCLSQFKPME 237

Query: 362 LAQVLWAFAS 371
           LA V+WA A+
Sbjct: 238 LASVMWAIAT 247


>gi|440912811|gb|ELR62346.1| FAST kinase domain-containing protein 1, partial [Bos grunniens
           mutus]
          Length = 849

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 39/75 (52%), Gaps = 6/75 (8%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFE--QLDY 614
           +K+A E      F RN     G + +K+R++   G++V+ + H EW  +  S    ++DY
Sbjct: 779 EKIALEFLDSRAFCRNIPHLKGKSAMKKRHLEILGYHVIQIPHFEWNSMALSTRDARMDY 838

Query: 615 LRVILKDYIGGEGSS 629
           LR    + I GEG S
Sbjct: 839 LR----ERIFGEGKS 849


>gi|303276196|ref|XP_003057392.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226461744|gb|EEH59037.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1039

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 31/108 (28%), Positives = 47/108 (43%), Gaps = 10/108 (9%)

Query: 235 LNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALS 294
           +N+ATA  R+ +++E        R      R    L   A   LPE      S++ WAL 
Sbjct: 193 VNVATAYSRLGRHVEDA-----ERGTLDDARWYLALETRAFALLPELGGWAASSLTWALG 247

Query: 295 KIGGE--LLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
           + G +    +   ++RV    L K  E   Q VAN+  AFA ++   P
Sbjct: 248 RTGRDPGAKFWEALERVL---LRKASELEPQGVANILWAFAVLERKHP 292


>gi|426220935|ref|XP_004004667.1| PREDICTED: FAST kinase domain-containing protein 1 [Ovis aries]
          Length = 832

 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 39/75 (52%), Gaps = 6/75 (8%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFE--QLDY 614
           +K+A E      F RN     G + +K+R++   G++V+ + H EW  +  S    ++DY
Sbjct: 762 EKIALEFLDSRAFCRNIPHLKGKSAMKKRHLEILGYHVIQIPHFEWNSMALSTRDARMDY 821

Query: 615 LRVILKDYIGGEGSS 629
           LR    + I GEG S
Sbjct: 822 LR----ERIFGEGKS 832


>gi|303273294|ref|XP_003056008.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226462092|gb|EEH59384.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 445

 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 52/111 (46%), Gaps = 9/111 (8%)

Query: 271 VAIAMTALP-----ECSAQGISNIAWALSKIG--GELL--YLSEMDRVAEVALTKVGEFN 321
           VA A+ + P     E   Q ++N+AWA +K+G   +L+  YLSE+     V   KV  ++
Sbjct: 144 VADAVISFPDPIKYELKPQDVANLAWAFAKLGRKKQLMFNYLSEVFAAQAVIDVKVTAYS 203

Query: 322 SQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
            + V+ +  AFA++      L +          H F  ++L    WA  SL
Sbjct: 204 PKQVSMILWAFATLDIQHQTLLTAAIPMIKARAHEFNPRDLTNTAWALDSL 254


>gi|317420047|emb|CBN82083.1| FAST kinase domain-containing protein 3 [Dicentrarchus labrax]
          Length = 669

 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 41/73 (56%), Gaps = 3/73 (4%)

Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
           DG+ + A     V K++A  IDG T F+      LG   +K+R++   G+ VV + + E+
Sbjct: 584 DGFMLPASHNKDVYKRMAVCIDGQTRFTTIKRQLLGKEAIKQRHLRLLGYEVVQIPYYEF 643

Query: 603 EELQGSFEQLDYL 615
           E+LQ   E ++YL
Sbjct: 644 EKLQTKSEVVEYL 656


>gi|355749811|gb|EHH54149.1| hypothetical protein EGM_14923 [Macaca fascicularis]
          Length = 657

 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 35/62 (56%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           + +++A  IDGP  F  N+   LG   +K+R++   G+ VV + + E   L+   E ++Y
Sbjct: 585 IHERIALCIDGPKRFCSNSKHLLGKEAIKQRHLRLLGYQVVQIPYYEIGMLKSRRELVEY 644

Query: 615 LR 616
           L+
Sbjct: 645 LQ 646


>gi|291222240|ref|XP_002731123.1| PREDICTED: protein TBRG4-like [Saccoglossus kowalevskii]
          Length = 627

 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 80/411 (19%), Positives = 161/411 (39%), Gaps = 85/411 (20%)

Query: 245 AKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTAL---PECSAQGISNIAWALSKIGGELL 301
           +++M +V++     L+ + +R + +L A+A   L    E   Q + N  +A +K+     
Sbjct: 252 SEDMSRVALA----LSKSNRRTLPLLRALAYHVLHRHKELGLQTMWNFTYAFAKLN---F 304

Query: 302 YLSE-MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQ 360
           Y S+ M+++    L KV +     +A  A AF+  ++    LF  +++     +  F++ 
Sbjct: 305 YHSQLMEKIQGELLQKVPDSTPYMIATFAWAFSYNKYLDKPLFDAMSQYIVSNISHFKQL 364

Query: 361 ELAQVLWAFASL-YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSE 419
            +  ++ ++A L Y+P+    E L   F  +                             
Sbjct: 365 RICSIIISYARLNYQPSGDFFEKLLTDFDFS----------------------------- 395

Query: 420 GSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFAS 479
            +LSS       D+L ++ WS  +L Q    F S +  +    E+      Y+  +    
Sbjct: 396 -ALSS-------DKLVDVVWSLVILQQASAEFISHVLAS-QHLEKLPDGTSYQIQMTRQK 446

Query: 480 QVHLVNQCLKLEHP-HLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLN 538
            +H +N   KLE P +    L     +   S     R N+ ++ S    +  L  + G +
Sbjct: 447 LLH-INTAAKLEQPDYTGPFLPDDFMKPADSLINPGRENESLSPSLNAVMQSLAKAIGGD 505

Query: 539 -WIRE--YAVDGYTVDA-VLVDKKV-------------------------AFEID----G 565
            +IR   +   GYT+DA  LVD K+                         A+ I      
Sbjct: 506 KYIRTNVFTPYGYTIDAEFLVDSKLTPLPINDYKTFYLPEDDTKQEVPEDAYRIAVINWE 565

Query: 566 PTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
              + +N+   LG   + +R++    +    + + EW +L+  +++  Y++
Sbjct: 566 YNKYCQNSKQLLGRYTMTKRHLRGXXFIYFQVPYYEWNDLKSDWQKTAYIK 616


>gi|355691207|gb|EHH26392.1| hypothetical protein EGK_16351 [Macaca mulatta]
          Length = 657

 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 35/62 (56%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           + +++A  IDGP  F  N+   LG   +K+R++   G+ VV + + E   L+   E ++Y
Sbjct: 585 IHERIALCIDGPKRFCSNSKHLLGKEAIKQRHLRLLGYQVVQIPYYEIGMLKSRRELVEY 644

Query: 615 LR 616
           L+
Sbjct: 645 LQ 646


>gi|327270690|ref|XP_003220122.1| PREDICTED: FAST kinase domain-containing protein 3-like [Anolis
           carolinensis]
          Length = 662

 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 35/62 (56%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           V +++A  ID    F  N+   LG   +K+R++   G+NVV +   E+++LQ   + L+Y
Sbjct: 589 VHQRIALCIDDQKRFCTNSHNLLGREAIKQRHLQLLGYNVVQIPFFEFQQLQNRGDILEY 648

Query: 615 LR 616
           L 
Sbjct: 649 LH 650


>gi|404216429|ref|YP_006670625.1| hypothetical protein KTR9_3834 [Gordonia sp. KTR9]
 gi|403647228|gb|AFR50468.1| hypothetical protein KTR9_3834 [Gordonia sp. KTR9]
          Length = 306

 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 41/91 (45%), Gaps = 6/91 (6%)

Query: 516 FNQKVTSSFQKEVARLLVSTGLN-WIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTG 574
             +   S  ++   RL  + GL  W+      GY +D    D KVA EIDG   F R+T 
Sbjct: 184 LGEGARSEAERMTVRLFTAGGLTGWVANMPAHGYVIDFAFPDVKVAIEIDGFA-FHRDTR 242

Query: 575 VPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL 605
                  +KR  + A GW V++ +   W +L
Sbjct: 243 T-FQRDRVKRNLLTAKGWTVLNFT---WADL 269


>gi|294895650|ref|XP_002775245.1| hypothetical protein Pmar_PMAR015474 [Perkinsus marinus ATCC 50983]
 gi|239881304|gb|EER07061.1| hypothetical protein Pmar_PMAR015474 [Perkinsus marinus ATCC 50983]
          Length = 984

 Score = 41.2 bits (95), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 17/45 (37%), Positives = 29/45 (64%)

Query: 571 RNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
           ++T V +G ++LK R++   GW VV +   EW  LQ + +++DYL
Sbjct: 913 KSTRVMIGSSLLKVRHLMTLGWKVVPIWISEWSSLQSTKDRVDYL 957


>gi|124513816|ref|XP_001350264.1| RAP protein, putative [Plasmodium falciparum 3D7]
 gi|23615681|emb|CAD52673.1| RAP protein, putative [Plasmodium falciparum 3D7]
          Length = 1017

 Score = 41.2 bits (95), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 16/66 (24%), Positives = 38/66 (57%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           KK+  E++G  HF +NT   +  +  K + ++  G+ V+++ + +W  L   F++  Y++
Sbjct: 890 KKLIIEVNGEHHFYKNTKSYISLSKFKHKLLSDLGYVVINIPYFDWAILNTDFDKKAYIK 949

Query: 617 VILKDY 622
            ++ D+
Sbjct: 950 KLIYDH 955


>gi|156100253|ref|XP_001615854.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148804728|gb|EDL46127.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 1615

 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 24/101 (23%), Positives = 49/101 (48%), Gaps = 15/101 (14%)

Query: 521  TSSFQKEVARLLVSTGLNWIREY-AVDGYTVDAVLVDK----KVAFEIDGPTHF------ 569
             S F ++V ++L   G+ +  E+ A +  ++D  + D+    ++A E+DGP+H       
Sbjct: 1485 VSDFHQQVCQVLDKFGVKYENEHMAQELLSIDLAIRDEAAGERIAVEVDGPSHHLVLLDE 1544

Query: 570  ----SRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQ 606
                ++    P G T  K   +   GW V+++   +W +L+
Sbjct: 1545 TDPRAKKMYAPCGTTHFKNWLLRKMGWTVINIEAHKWNKLR 1585


>gi|406885763|gb|EKD32892.1| hypothetical protein ACD_76C00122G0008 [uncultured bacterium]
          Length = 118

 Score = 40.8 bits (94), Expect = 2.0,   Method: Composition-based stats.
 Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 10/91 (10%)

Query: 518 QKVTSSFQKEVARLLVS-------TGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFS 570
           ++V   FQ +  +LL S        G  + R+Y +  Y VD    + ++A E+DGPTH  
Sbjct: 14  RRVLRLFQTKAEKLLWSKIKRKQLNGCKFRRQYGIGPYIVDFYCPEIRLAIEVDGPTH-- 71

Query: 571 RNTGVPLGHTMLKRRYIAAAGWNVVSLSHQE 601
            +  +   +   ++RYI + G  VV + ++E
Sbjct: 72  -DNHLAKEYDDFRQRYIESLGIRVVRVYNEE 101


>gi|403222079|dbj|BAM40211.1| conserved hypothetical protein [Theileria orientalis strain
           Shintoku]
          Length = 537

 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 44/191 (23%), Positives = 81/191 (42%), Gaps = 38/191 (19%)

Query: 434 LGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVH----------- 482
           L  I WS ++L       FS+  +TI++  E  I E   + +   +Q++           
Sbjct: 313 LIRILWSLSILKVRLAEVFSNALETIAKLLEDTIDELSLKRLAHINQLYSILKSLRHSIH 372

Query: 483 -------------LVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVA 529
                        + ++C +++   +++   ++L    A +GK    +QK+ S F   + 
Sbjct: 373 ADVRKGDGAVNDAVTDECDEIDRL-MEVCKENLLSHNYAQSGKIISKSQKLVSDF---LI 428

Query: 530 RLLVSTGLNWIREYAVDGYTVDA--VLVDKKVAFEIDGPTHFSRNTGVP-----LGHTML 582
           R  +   L  I     D  ++D   +L D+ +A E+DGPTHF RN   P      G    
Sbjct: 429 RANIPHQLEII---TPDLLSIDIRIILDDEMIALEVDGPTHFLRNIEDPEVVMETGPCSF 485

Query: 583 KRRYIAAAGWN 593
           K+  +  +G+N
Sbjct: 486 KKELLTRSGYN 496


>gi|294882591|ref|XP_002769754.1| hypothetical protein Pmar_PMAR004835 [Perkinsus marinus ATCC 50983]
 gi|239873503|gb|EER02472.1| hypothetical protein Pmar_PMAR004835 [Perkinsus marinus ATCC 50983]
          Length = 677

 Score = 40.8 bits (94), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 71/316 (22%), Positives = 131/316 (41%), Gaps = 61/316 (19%)

Query: 330 GAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESL-DNAF 387
           G+  S   +A  L S L  R + I  T    + A+++ AF++L Y P+  +L  L D A 
Sbjct: 399 GSLQSTLETAHALHSYLGSRLNAI--TPSAVDAARLVAAFSNLSYLPSHTILTKLMDIAL 456

Query: 388 KDATQFT----CCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
           + A+ F     C    AL+  ++ G      +            S   DQ  +I W+  V
Sbjct: 457 EGASSFYPNTYCMYAIALAQLHQTGHRLPPCNE-----------SLTIDQACSILWTGVV 505

Query: 444 LGQMDRI--FFSDIWKTISR--------FEEQRISEQYREDIMFASQVHLVNQCLKLEHP 493
           L  +D +      +   I++        F  Q I+  +   ++  +Q  L+       +P
Sbjct: 506 L-DIDGVESIMERVLACIAKEGDLPSLPFARQAIAGLWARGMVTEAQ-QLIG-----AYP 558

Query: 494 HLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIR-EYAVDG-YTVD 551
                 +  L E  AS            SS    +++ L S G   +R E  + G Y  D
Sbjct: 559 ------AGTLTENPAS------------SSLHTNISQTLRSMGYGNVRDEVEICGIYRAD 600

Query: 552 AVLVDKKVAFEIDGPTHF--SRNTG---VPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQ 606
            V+ D  +  E DG  H+  S ++G   + +G ++++ +    AGW V+ +S   W+  +
Sbjct: 601 VVIDDLGIVIECDGDVHYLYSPDSGCSDILIGSSVIRDKVFINAGWKVIRVSVAAWKNCK 660

Query: 607 GSFEQLDYLRVILKDY 622
            +  ++  LR ++ ++
Sbjct: 661 DAAGKVAMLRRLINNH 676


>gi|397583153|gb|EJK52533.1| hypothetical protein THAOC_28177, partial [Thalassiosira oceanica]
          Length = 376

 Score = 40.8 bits (94), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 53/237 (22%), Positives = 92/237 (38%), Gaps = 53/237 (22%)

Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
           LF +L+  A      F+ QE+A  LWA A++      L  +L +                
Sbjct: 3   LFEKLSTEAVVNKEHFKAQEVANFLWACATVGHTDQRLFSALTSV--------------- 47

Query: 402 SNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISR 461
                               ++S +  FN  +L NI W+Y+V     +  F + + +   
Sbjct: 48  --------------------IASKLDKFNEQELANITWTYSVANTPSQDLFGEGYVSALA 87

Query: 462 FEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVT 521
             E   S ++      A          +LE     + L   L+ K  +A  ++ +++   
Sbjct: 88  SNENEFSVEH-----LAQLHQWQLWQQELES---GMELPQSLQAKCRNAFTSRGYSE--- 136

Query: 522 SSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTG 574
           S  Q +V   L + GL+   E  +  GY +DA++     ++VA E+DGP    RN G
Sbjct: 137 SKLQNDVVDELKAVGLDLEEEVLLGSGYRIDALVKIGDGRRVAVEVDGP---RRNVG 190



 Score = 39.7 bits (91), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 31/61 (50%)

Query: 320 FNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPL 379
           F +Q VAN   A A++ H+   LFS L    +  +  F EQELA + W ++    P+  L
Sbjct: 18  FKAQEVANFLWACATVGHTDQRLFSALTSVIASKLDKFNEQELANITWTYSVANTPSQDL 77

Query: 380 L 380
            
Sbjct: 78  F 78


>gi|258511520|ref|YP_003184954.1| Superfamily I DNA and RNA helicase and helicase subunits-like protein
            [Alicyclobacillus acidocaldarius subsp. acidocaldarius
            DSM 446]
 gi|257478246|gb|ACV58565.1| Superfamily I DNA and RNA helicase and helicase subunits-like protein
            [Alicyclobacillus acidocaldarius subsp. acidocaldarius
            DSM 446]
          Length = 1403

 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 47/108 (43%), Gaps = 16/108 (14%)

Query: 519  KVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKK------VAFEIDGPTHFSRN 572
            +  S F++EVA  L   G     +    GY +D  +VD        +A E DG T+ S  
Sbjct: 1277 RYDSPFEEEVATELRKLGYTVNTQVGFSGYRIDLAIVDPDNPERYLLAVECDGATYHS-- 1334

Query: 573  TGVPLGHTMLKRRYIAAAGWNV--------VSLSHQEWEELQGSFEQL 612
            + V       ++R++   GWNV        +   H+E E++Q    QL
Sbjct: 1335 SKVARERDFYRQRFLEQHGWNVHRVWSRNWLKAKHKEIEKIQSRIRQL 1382


>gi|145348368|ref|XP_001418622.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578852|gb|ABO96915.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 586

 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 44/186 (23%), Positives = 78/186 (41%), Gaps = 24/186 (12%)

Query: 198 NLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMT-- 255
           ++ + + +A +A+  L V+   + A            ++ ATALHR+AK     S +   
Sbjct: 62  DIQRMLANADSAEAALRVVESDLDA---------FDAVHAATALHRVAKFSAPESRLERD 112

Query: 256 -THRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVA- 313
            +     T       L A   T + E  A G++N+AW+ +KIG    Y    + +  +A 
Sbjct: 113 FSRAEGLTNDGRFRALAASVATRVDEFDAFGLANVAWSFAKIG----YTPSQETLGALAA 168

Query: 314 -----LTKVG-EFNSQNVANVAGAFASMQHSAP-DLFSELAKRASDIVHTFQEQELAQVL 366
                ++K G     Q+++N   AF  M+   P      L    +  +  F+  EL+ +L
Sbjct: 169 RLEREVSKQGARLKPQSLSNATYAFGRMRFKPPRSTLEALCAATTREMGEFRADELSGML 228

Query: 367 WAFASL 372
              A L
Sbjct: 229 LGLAHL 234


>gi|40645472|dbj|BAD06581.1| arginine decarboxylase [Nicotiana tabacum]
          Length = 733

 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 41/85 (48%), Gaps = 9/85 (10%)

Query: 327 NVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNA 386
           + A    +MQH    +F  L  RA + VH   EQE  + L AFASL   A  L +S +N 
Sbjct: 630 SCADVLRAMQHEPELMFETLKHRAEEFVHNDDEQEEDKGL-AFASL---ASSLAQSFNNM 685

Query: 387 FKDATQFTCCLNKALSN-----CNE 406
               T  +CCL  A +N     CN+
Sbjct: 686 PYLVTNSSCCLTAAANNGGYYYCND 710


>gi|302832912|ref|XP_002948020.1| hypothetical protein VOLCADRAFT_103642 [Volvox carteri f.
           nagariensis]
 gi|300266822|gb|EFJ51008.1| hypothetical protein VOLCADRAFT_103642 [Volvox carteri f.
           nagariensis]
          Length = 1327

 Score = 40.8 bits (94), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 57/112 (50%), Gaps = 16/112 (14%)

Query: 279 PECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKV----GEFNSQNVANVAGAFAS 334
           P  ++Q ISN+ +A++ +G E+    E+   AE+ L  V    GE N+Q ++NV    A 
Sbjct: 666 PPFNSQEISNVLYAIASMGYEIDPEGEL---AELLLDAVHFRLGEANAQELSNVMWCLAV 722

Query: 335 MQ--HSAP---DLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLE 381
           +Q   S P   D F+    R    + TF+  +LAQ L+  A L  P  PL E
Sbjct: 723 LQIRPSQPWLDDYFTAAHSR----LPTFKPVDLAQSLYGVAKLRLPLQPLPE 770


>gi|422339502|ref|ZP_16420460.1| putative DNA helicase [Fusobacterium nucleatum subsp. polymorphum
            F0401]
 gi|355370932|gb|EHG18307.1| putative DNA helicase [Fusobacterium nucleatum subsp. polymorphum
            F0401]
          Length = 1230

 Score = 40.8 bits (94), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 25/87 (28%), Positives = 43/87 (49%), Gaps = 4/87 (4%)

Query: 513  TKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVD--AVLVDKKVAFEIDGPTHFS 570
            T+   +   S F++EV + LVS G +  +++ V  Y +D  A+  DKK+A E DG    S
Sbjct: 956  TEEIEKNSESIFEEEVVKYLVSEGYHIKQQWEVGAYRIDMVALFQDKKIAIECDGEKWHS 1015

Query: 571  RNTGVPLGHTMLKRRYIAAAGWNVVSL 597
              T   +   M ++  +   GW  + +
Sbjct: 1016 --TEEQIKQDMERQSILERCGWEFIRI 1040


>gi|115728540|ref|XP_785534.2| PREDICTED: protein TBRG4-like [Strongylocentrotus purpuratus]
          Length = 616

 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 75/373 (20%), Positives = 152/373 (40%), Gaps = 63/373 (16%)

Query: 259 LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVAL---T 315
           L  T  R + +L AI+   L + S   I  +   +S +G   L        +E A+    
Sbjct: 284 LVATNTRSLPILRAISYQLLEQRSQWEIPAMMDIMSAMGN--LGFHNAALFSEFAVHIQQ 341

Query: 316 KVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YE 374
           K+ E +   + ++A  +A ++  A +L   +    +  +      +L ++L+A++   Y+
Sbjct: 342 KLDECSLSLLCDIAKTYAVLRIQASNLLDSIHTVLAGALDELTILDLKRLLFAYSQFSYQ 401

Query: 375 PADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQL 434
           P D            +T +    NK  +N ++  G                     +DQ+
Sbjct: 402 PPDA-----------STFYVEVGNKLDANFDDYSG---------------------KDQI 429

Query: 435 GNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPH 494
            ++A S  VL Q+ +   + I K I   +E  +S   +  ++       +N   +L+ P 
Sbjct: 430 -DVAHSLTVLKQVPQKIVTKILKNIEESQEAPLSGTLKLKLL------QINAYSQLDFPD 482

Query: 495 LQLALSSVLEEKIASAGKTKRFNQKV-TSSFQKEVARLL-VSTG--LNWIREYAVD-GYT 549
            +          + S  K+   N K+ T++  + + ++L  S G  L  +     D GY 
Sbjct: 483 YE-------GPYLTSDLKSFPANHKIYTTTLHRSLFKVLQASLGDDLTMVENVKSDLGYV 535

Query: 550 VDAVLVDK------KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWE 603
           +DA +  K      K+A    GP  +  +T   +G   +   ++   G+ V+ + +Q+W 
Sbjct: 536 IDAEISSKLKGNGQKLAIMTFGPPSYLYSTTQLVGRLEMMLSHLELTGYQVLQIPYQDWY 595

Query: 604 ELQGSFEQLDYLR 616
            L+   +Q+ YL+
Sbjct: 596 PLRTPVQQVHYLK 608


>gi|124504899|ref|XP_001351192.1| conserved Plasmodium protein, unknown function [Plasmodium
           falciparum 3D7]
 gi|3764010|emb|CAA15603.1| conserved Plasmodium protein, unknown function [Plasmodium
           falciparum 3D7]
          Length = 768

 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 27/110 (24%), Positives = 55/110 (50%), Gaps = 15/110 (13%)

Query: 290 AWALSKIGGELLYLSEMDRVAEVALTK-----VGEFNSQNVANVAGAFASMQH--SAPDL 342
           ++ +S+I      L+ MD      L K     + + + Q+++N+  A++ + +  +  DL
Sbjct: 319 SFDISQIVNSYTRLNYMDDKLFSYLKKYIDQQIDDMSFQSISNICNAYSKLLNIENYEDL 378

Query: 343 FSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQ 392
           F +L  R  D +H F+ QE+A +L +++ LY        +++  FKD   
Sbjct: 379 FFKLRVRIRDNIHEFKPQEVANILNSYSKLY--------NINGIFKDVIH 420


>gi|221061533|ref|XP_002262336.1| RAP protein [Plasmodium knowlesi strain H]
 gi|193811486|emb|CAQ42214.1| RAP protein, putative [Plasmodium knowlesi strain H]
          Length = 1273

 Score = 40.4 bits (93), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 13/50 (26%), Positives = 30/50 (60%)

Query: 555  VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEE 604
            ++K +  E+DG +HF R +     ++++K   +   GW+++ + +QEW +
Sbjct: 1071 IEKNILVEVDGVSHFYRESHSRAINSIIKNFILEKCGWHIIHIPYQEWNQ 1120


>gi|156101285|ref|XP_001616336.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805210|gb|EDL46609.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 994

 Score = 40.4 bits (93), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 21/96 (21%), Positives = 48/96 (50%), Gaps = 17/96 (17%)

Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRV 617
           K+  E++G  HF +N+      + LK + ++  G+ VV++ + EW +L+ + ++  Y++ 
Sbjct: 868 KLIIEVNGEHHFYKNSKSYTALSKLKHKLLSDLGYTVVNIPYFEWGQLKSNLDRKAYIKK 927

Query: 618 ILKDY-----------------IGGEGSSNIAETLK 636
           ++ D                  +GGE  + +A T++
Sbjct: 928 LISDSLTFEVVNVLPLNQKSEPLGGEEMAKVASTIR 963


>gi|291461161|ref|ZP_06027290.2| DNA helicase [Fusobacterium periodonticum ATCC 33693]
 gi|291378403|gb|EFE85921.1| DNA helicase [Fusobacterium periodonticum ATCC 33693]
          Length = 1621

 Score = 40.4 bits (93), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 25/87 (28%), Positives = 43/87 (49%), Gaps = 4/87 (4%)

Query: 513  TKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVD--AVLVDKKVAFEIDGPTHFS 570
            T+   +   S F++EV + LVS G +  +++ V  Y +D  A+  DKK+A E DG    S
Sbjct: 1356 TEEIEKNSESIFEEEVVKYLVSEGYHIKQQWEVGAYRIDMVALFQDKKIAIECDGEKWHS 1415

Query: 571  RNTGVPLGHTMLKRRYIAAAGWNVVSL 597
              T   +   M ++  +   GW  + +
Sbjct: 1416 --TEEQIKQDMERQSILERCGWEFIRI 1440


>gi|221057670|ref|XP_002261343.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
 gi|194247348|emb|CAQ40748.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
          Length = 944

 Score = 40.4 bits (93), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 16/69 (23%), Positives = 40/69 (57%)

Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRV 617
           K+  E++G  HF +N+      + LK + ++  G+ V+++ + EW +L+ + ++  Y++ 
Sbjct: 818 KLIIEVNGEHHFYKNSKSYTALSKLKHKLLSDLGYTVINIPYFEWGQLKTNLDKKAYIKK 877

Query: 618 ILKDYIGGE 626
           ++ D +  E
Sbjct: 878 LISDSLNFE 886


>gi|156103321|ref|XP_001617353.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148806227|gb|EDL47626.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 1234

 Score = 40.0 bits (92), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 20/78 (25%), Positives = 39/78 (50%), Gaps = 4/78 (5%)

Query: 547  GYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL- 605
             Y V    V K +  E+DG +HF + +     ++++K+  +   GW+++ + +QEW +  
Sbjct: 1017 AYPVLQKRVKKNILVEVDGVSHFYKESHSRTINSIIKKFILQKCGWHIIHIPYQEWNQCV 1076

Query: 606  ---QGSFEQLDYLRVILK 620
               +     L  LR IL+
Sbjct: 1077 DFRRKVLYALQVLRQILR 1094


>gi|66359096|ref|XP_626726.1| hypothetical protein [Cryptosporidium parvum Iowa II]
 gi|46228239|gb|EAK89138.1| hypothetical protein with transmembrane or GPI anchor sequence at
           carboxy terminus [Cryptosporidium parvum Iowa II]
 gi|323509501|dbj|BAJ77643.1| cgd3_1520 [Cryptosporidium parvum]
          Length = 589

 Score = 40.0 bits (92), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 46/220 (20%), Positives = 94/220 (42%), Gaps = 35/220 (15%)

Query: 184 LSQFSGPSNRRK-----------------EINLNKDIVDAQTAQEVLEVIAEMITAVGKG 226
           + QFSGP  +R                   + +NK I  +++  E+L ++   I      
Sbjct: 44  IGQFSGPYEQRNITYNNGVLYSRDEHIVFNLKMNKIITASESFGELLGIVHCHIYY---- 99

Query: 227 LSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLV-AIAMTALPEC--SA 283
                L+ +N+ + LH++A     +S     +    R     +L+  I + +   C  S 
Sbjct: 100 -----LNEINMVSILHKLAV----LSQSNNFKGRIKRDERFRLLLDVIVLRSNFPCRFSP 150

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
           + +SNIAW+L K+G  L      D V   ++ ++  F S N++ +  +FA       +LF
Sbjct: 151 KELSNIAWSLVKLG--LNNHKIFDFVCNESIIQLERFISINLSIILWSFAKAGKFNKNLF 208

Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESL 383
                +    +   + Q+++ + W+++ +   +  L E+L
Sbjct: 209 VYAIPKILSELDNLEPQQISNIAWSYSKVGLVSPHLFENL 248


>gi|221055575|ref|XP_002258926.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
 gi|193808996|emb|CAQ39699.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
          Length = 613

 Score = 40.0 bits (92), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 46/90 (51%), Gaps = 4/90 (4%)

Query: 286 ISNIAWALSKIG-GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFS 344
           IS IA   +K+  G+      M+   E+   ++ E + Q+++N+  A++ +   +  LF 
Sbjct: 226 ISQIANCFAKLNYGDANLFKHME---ELICERIDELSCQSISNICNAYSKLSLGSETLFC 282

Query: 345 ELAKRASDIVHTFQEQELAQVLWAFASLYE 374
            L K     +  F EQE+A +L A++ L E
Sbjct: 283 LLIKAVKKKLDNFNEQEIANILNAYSKLGE 312


>gi|412994033|emb|CCO14544.1| predicted protein [Bathycoccus prasinos]
          Length = 790

 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 47/205 (22%), Positives = 92/205 (44%), Gaps = 27/205 (13%)

Query: 270 LVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEV--ALTKVGEF-----NS 322
           L+ +  T +PE S  G+SN++WAL++     L+  +  RV  +  A++K         ++
Sbjct: 431 LLEMCETKIPEMSPLGLSNVSWALAR-----LFPDDPTRVKSLLSAISKRSALQMKYADA 485

Query: 323 QNVANVAGAFASMQHSAPD-LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLE 381
           + ++ +  A A++       L +   +RA +I   F+  ++A  LWA+A        L  
Sbjct: 486 KCLSTILWALAALGFEPRSRLLASAQRRACEIEEEFRAPDVANALWAYAKWAR----LFS 541

Query: 382 SLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSL---SSPVL-SFNRDQLGNI 437
               A K++  ++        + +E       GD     SL   S  V+ +F+  Q  NI
Sbjct: 542 GGVGALKESVDYS-----EDESVDEGSSKSYGGDRAVITSLLRQSEAVMETFSAYQCANI 596

Query: 438 AWSYAVL-GQMDRIFFSDIWKTISR 461
            WS A L  ++   +  ++ + I++
Sbjct: 597 CWSSATLNAKLPETYLENLLERIAK 621


>gi|303288517|ref|XP_003063547.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455379|gb|EEH52683.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 807

 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 61/139 (43%), Gaps = 23/139 (16%)

Query: 255 TTHRLAFTRQREMSMLVAIAMTALP-----ECSAQGISNIAWALSKI---------GGEL 300
           T H +     R     ++ A+ ALP       +A  ++N+AWA +K          GG  
Sbjct: 30  TGHGVGDDGDRASFAAISEALLALPGGTFDALTAPQLANVAWAFAKANDAGGGSTRGGPS 89

Query: 301 L---------YLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRAS 351
                     + S    +A  A ++  +F++Q + +VA AFA+       LF+  A+RA 
Sbjct: 90  SSSISSISSPFASLFAALARSAASRANDFSAQELTDVAWAFANAGCVDGRLFAAFARRAE 149

Query: 352 DIVHTFQEQELAQVLWAFA 370
            +   F ++EL    WAFA
Sbjct: 150 TLADDFDDEELDNAEWAFA 168


>gi|159489962|ref|XP_001702960.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158270983|gb|EDO96813.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 1282

 Score = 40.0 bits (92), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 35/120 (29%), Positives = 57/120 (47%), Gaps = 20/120 (16%)

Query: 266 EMSMLVAIAMTALPECSAQGISNIAWALSKIG----GELL--YLSEMDRVAEVALTKVGE 319
           E+S LVA     LP    + ++N+ WA+ K+G      LL  +L E       A  ++ +
Sbjct: 549 EVSELVA---QRLPTFDPRAVANVLWAVCKLGYSPAPPLLNQFLFE-------AYVRMEK 598

Query: 320 FNSQNVANVAGAFASM----QHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEP 375
           FN+Q +AN++ A A++    +   P    +    A   V   + QELA + WA + L  P
Sbjct: 599 FNAQELANLSWALATLAAMGRQPVPAWLRKFISAAKLHVDELKPQELAHMAWALSRLCPP 658


>gi|72389356|ref|XP_844973.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62358897|gb|AAX79348.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70801507|gb|AAZ11414.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 516

 Score = 40.0 bits (92), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 49/92 (53%), Gaps = 4/92 (4%)

Query: 283 AQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
           A+ ++NI  A SK G   E L+     RV  +A  +VGEF + ++  +A AF+ +++   
Sbjct: 182 AKDVTNIISAFSKTGINHEKLFAFLSRRVQTLA--RVGEFEAAHLVILANAFSRLRYRDK 239

Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
            LF  +A+RA  +       EL  ++ AF+ +
Sbjct: 240 FLFGAIARRAMSLRERVTVNELVPLIVAFSKI 271


>gi|389584499|dbj|GAB67231.1| hypothetical protein PCYB_112520 [Plasmodium cynomolgi strain B]
          Length = 941

 Score = 40.0 bits (92), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 16/69 (23%), Positives = 39/69 (56%)

Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRV 617
           K+  E++G  HF +N+      + LK + +   G+ V+++ + EW +L+ + ++  Y++ 
Sbjct: 815 KLIIEVNGEHHFYKNSKSYTSLSKLKHKLLCDLGYTVINIPYFEWGQLRTNLDKKAYIKK 874

Query: 618 ILKDYIGGE 626
           ++ D +  E
Sbjct: 875 LISDSLSFE 883


>gi|261328305|emb|CBH11282.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 516

 Score = 40.0 bits (92), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 49/92 (53%), Gaps = 4/92 (4%)

Query: 283 AQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
           A+ ++NI  A SK G   E L+     RV  +A  +VGEF + ++  +A AF+ +++   
Sbjct: 182 AKDVTNIISAFSKTGINHEKLFAFLSRRVQTLA--RVGEFEAAHLVILANAFSRLRYRDK 239

Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
            LF  +A+RA  +       EL  ++ AF+ +
Sbjct: 240 FLFGAIARRAMSLRERVTVNELVPLIVAFSKI 271


>gi|397640805|gb|EJK74327.1| hypothetical protein THAOC_03999, partial [Thalassiosira oceanica]
          Length = 400

 Score = 40.0 bits (92), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 45/170 (26%), Positives = 80/170 (47%), Gaps = 29/170 (17%)

Query: 302 YLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHT 356
           YL   D +A   +  + EF++++++N+  +F  ++ + PD     LF+   K A  I+HT
Sbjct: 230 YLLIFDSIASSTVDMLNEFDARHMSNLIYSFGLVERN-PDIGGETLFNVFGKAAVKILHT 288

Query: 357 FQEQELAQVLWAF-------ASLYEPADPLLESLD-NAFKD-ATQFTCCL----NKALSN 403
           F  Q+++ +L AF       ++L++     L  LD   F + A     CL     +ALSN
Sbjct: 289 FNSQDISNMLLAFVYVDAKNSALFQKTGEELLGLDLGEFTEQALANILCLYDFWPQALSN 348

Query: 404 CNENGGVKSSGDADSEG--------SLSSPVLSFNRDQLGNIAWSYAVLG 445
                   ++G++  E         +L   + SF+   L N AW++A  G
Sbjct: 349 V--VWAYATAGESHPELFKKMGDHIALLERLDSFDPQALSNTAWAFATAG 396


>gi|294865634|ref|XP_002764450.1| hypothetical protein Pmar_PMAR026874 [Perkinsus marinus ATCC 50983]
 gi|239863879|gb|EEQ97167.1| hypothetical protein Pmar_PMAR026874 [Perkinsus marinus ATCC 50983]
          Length = 195

 Score = 40.0 bits (92), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 51/201 (25%), Positives = 89/201 (44%), Gaps = 36/201 (17%)

Query: 183 RLSQFSGPSNRRK--EINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATA 240
           R+ + +G   R K  ++ L + ++DA T   VLE++    T +G          +N A A
Sbjct: 14  RMYEVAGRLRRGKSGDLVLQRRLMDASTPAAVLEIVLPNATKLGS---------VNYACA 64

Query: 241 LHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGEL 300
           LHR A            R    R   +S +  +A+  + +  A+  + I WAL+ +  EL
Sbjct: 65  LHRCA---------VWFRSGKRRPSGLSQVPRLALQTVRDWRAREAATITWALA-VTREL 114

Query: 301 LYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA-----SMQHSAPDLFSELAKR--ASDI 353
            ++ E  R++        E +  ++ANV  +         Q +A    + +AKR  A D+
Sbjct: 115 DHILEFARLS----MSCDEASGGDLANVVHSLTISGLNPRQCTAT--LAVVAKRVTAMDL 168

Query: 354 VHT--FQEQELAQVLWAFASL 372
            H+   + ++LA V W F  L
Sbjct: 169 SHSGVIEPKQLAAVFWGFVKL 189


>gi|399218084|emb|CCF74971.1| unnamed protein product [Babesia microti strain RI]
          Length = 480

 Score = 40.0 bits (92), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 27/102 (26%), Positives = 48/102 (47%), Gaps = 3/102 (2%)

Query: 279 PECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHS 338
           P+ S+QG+S I  ++SK   ++   S   R + +   ++ EFN  +   VA A +   + 
Sbjct: 222 PKFSSQGLSLILNSISKYNDDI---SLFQRYSMIIQLRIDEFNIHSCCLVASAVSRANYK 278

Query: 339 APDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLL 380
              L   LA+R     +    Q +A + ++FA L     PL+
Sbjct: 279 EIKLLEVLAERVGKQSNELYPQAVATLAYSFAKLNHLHGPLM 320


>gi|302848319|ref|XP_002955692.1| hypothetical protein VOLCADRAFT_121443 [Volvox carteri f.
           nagariensis]
 gi|300259101|gb|EFJ43332.1| hypothetical protein VOLCADRAFT_121443 [Volvox carteri f.
           nagariensis]
          Length = 500

 Score = 39.7 bits (91), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 41/68 (60%), Gaps = 1/68 (1%)

Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-LFSELAKRASDIVHTFQEQELAQ 364
           MD VA+   +K+G+F +Q+++N   AFA +++   +  + +  ++    +    ++ELA 
Sbjct: 139 MDAVAQEIHSKLGQFRAQDLSNTLWAFAMLKYKPTEQWWQDFERQVFGALTDLTDRELAN 198

Query: 365 VLWAFASL 372
           +LWAFA L
Sbjct: 199 LLWAFAVL 206


>gi|343924360|ref|ZP_08763910.1| hypothetical protein GOALK_015_00060 [Gordonia alkanivorans NBRC
           16433]
 gi|343765692|dbj|GAA10836.1| hypothetical protein GOALK_015_00060 [Gordonia alkanivorans NBRC
           16433]
          Length = 298

 Score = 39.7 bits (91), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 38/83 (45%), Gaps = 5/83 (6%)

Query: 525 QKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKR 584
           +K +A L  +    W     V GY VD   +D+KVA EIDG    S        H   ++
Sbjct: 197 RKALALLRSAEITGWTANAKVCGYVVDIAFIDQKVAVEIDGFAFHS--DAASFQHDRTRQ 254

Query: 585 RYIAAAGWNVVSLSHQEWEELQG 607
             + A GW V+  +   W+++ G
Sbjct: 255 NVLIANGWTVLRFT---WQDITG 274


>gi|441516792|ref|ZP_20998536.1| hypothetical protein GOHSU_08_00250 [Gordonia hirsuta DSM 44140 =
           NBRC 16056]
 gi|441456258|dbj|GAC56497.1| hypothetical protein GOHSU_08_00250 [Gordonia hirsuta DSM 44140 =
           NBRC 16056]
          Length = 312

 Score = 39.7 bits (91), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 42/77 (54%), Gaps = 6/77 (7%)

Query: 530 RLLVSTGLN-WIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIA 588
           RLL   GL+ W++++   G+++D    D KVA EIDG   + R+    L  +  KR  +A
Sbjct: 215 RLLKDQGLDGWVQQHPFHGWSIDFAWPDLKVAVEIDG-WAYHRDHKAFLRDSR-KRNALA 272

Query: 589 AAGWNVVSLSHQEWEEL 605
            AGW  +S S   W +L
Sbjct: 273 LAGWITLSFS---WHDL 286


>gi|215919094|ref|YP_002332981.1| hypothetical protein CBU_1061a [Coxiella burnetii RSA 493]
 gi|206583979|gb|ACI15272.1| hypothetical membrane associated protein [Coxiella burnetii RSA
           493]
          Length = 368

 Score = 39.7 bits (91), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 39/153 (25%), Positives = 68/153 (44%), Gaps = 14/153 (9%)

Query: 228 SPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGIS 287
           +P PL P+ +       AK  +         L+   Q   ++   +   + P  +AQ I+
Sbjct: 5   NPIPLDPIPLIRDFFHTAKQQK------NRPLSLNPQDYQTIKSILDNQSHPAFNAQSIA 58

Query: 288 NIAWALS--KIGGELLYLSEMDRVAEVALTKVGE-FNSQNVANVAGAFASMQHSAPD--- 341
           N+  AL+  +     L   E+DR    A+ +  + FN Q++AN   A A+M  +  D   
Sbjct: 59  NLLLALAYRRTRWAALLNKELDRPLLHAIAQNADRFNPQDIANTLWALATMGINWRDIQE 118

Query: 342 --LFSELAKRASDIVHTFQEQELAQVLWAFASL 372
             L + L K  +   + F  Q++A  LWA A++
Sbjct: 119 KELDNSLLKAIAQNANRFNPQDIANTLWALATM 151


>gi|294867000|ref|XP_002764924.1| hypothetical protein Pmar_PMAR007491 [Perkinsus marinus ATCC 50983]
 gi|239864760|gb|EEQ97641.1| hypothetical protein Pmar_PMAR007491 [Perkinsus marinus ATCC 50983]
          Length = 805

 Score = 39.7 bits (91), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 67/301 (22%), Positives = 106/301 (35%), Gaps = 50/301 (16%)

Query: 357 FQEQELAQVLWAFASLYEPADPLLESLDNAFKD-------ATQFTCCLNKALSNCNENGG 409
           F++QELA + W+ A+L      L E   +  KD        +     L   L++      
Sbjct: 512 FKQQELALITWSLATLRISHQMLEEHCCHQAKDLLLTSGITSSHLSMLLWGLASNYHTSA 571

Query: 410 VKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISE 469
             S    +    + S  L F      ++AWS A     D      +    + FE      
Sbjct: 572 PASELIQEVVARVRSRELRFAAADSFHVAWSLAAFDVFDPQSLEVLLSAAATFE------ 625

Query: 470 QYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVA 529
                 +  + +  +NQ       H       ++E    +AG  +R    + S+FQ +V 
Sbjct: 626 ------LDGAALQKINQVSMWSSSHGYEPTPMIVELFHRAAGSAQRDASVIDSAFQDQVT 679

Query: 530 RLLVSTGLNWIREYAV------DGYTVDAVLVDKKVA-----------------FEIDGP 566
             L     N   EY V             V+VD  V                   E+DGP
Sbjct: 680 TCLRRAIGNSDYEYRVVSEMDLTNLGCPGVIVDLAVTRCESADECSRDEELPLIIEVDGP 739

Query: 567 THFSRNTGVPL-------GHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
            H+ R+ G  L       G  +L+R  +   G++V  +S  +W  L G  E+  Y+  IL
Sbjct: 740 WHYVRSIGTSLPPGQKLCGKAVLRRNALRRLGYDVEEISFAQWSRL-GREERQKYIESIL 798

Query: 620 K 620
           K
Sbjct: 799 K 799


>gi|221487299|gb|EEE25531.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 245

 Score = 39.7 bits (91), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 46/187 (24%), Positives = 84/187 (44%), Gaps = 42/187 (22%)

Query: 207 QTAQEVL-----------EVIAEMITAVGKGLSPSPLSPLN---------------IATA 240
           +TAQE+L           E+ ++ + A    LSPS ++ +                +AT 
Sbjct: 58  ETAQELLRGKETKRRAFWEIFSKRVKASAHMLSPSLMALIAKSFDVHDRDTGIYVALATV 117

Query: 241 LHRIAKNMEKVSMMTTHRLAFTRQRE-------MSMLVAIAMTALPECSAQGISNIAWAL 293
           L    K  +  S++T   + F+R+ +        S L      AL + + + +  I  +L
Sbjct: 118 LPEAVKRADGRSLLTLSDV-FSRRLKRDSNPHLFSTLARQLPNALYQLTGKDVLRILSSL 176

Query: 294 SKIGGELLYLSEMDRVAEVA---LTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRA 350
              G     L++M    +VA   L ++ E +S ++A+ +  FAS  +  P+L+S LA+RA
Sbjct: 177 DAAG-----LADMLACRQVARKLLAELDELDSVDLADASAVFASQGYRNPELYSALARRA 231

Query: 351 SDIVHTF 357
            D+  +F
Sbjct: 232 VDVKDSF 238


>gi|160872163|ref|ZP_02062295.1| RAP domain family [Rickettsiella grylli]
 gi|159120962|gb|EDP46300.1| RAP domain family [Rickettsiella grylli]
          Length = 941

 Score = 39.7 bits (91), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 24/99 (24%), Positives = 47/99 (47%), Gaps = 6/99 (6%)

Query: 521 TSSFQKEVARLLVSTG--LNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLG 578
           TS  Q EV + L++      ++ E+ ++   VD    +KK+  +++GP+H+    G  L 
Sbjct: 787 TSRLQNEVFQYLLACFPEFKFVEEHFLEFTYVDIACPEKKILMQVNGPSHY---VGKKLN 843

Query: 579 -HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
             +          GW+VV + + +W+ L     +  YL+
Sbjct: 844 VSSQFNNHLFEKLGWSVVIIPYFDWQALIKESARKKYLK 882


>gi|399218603|emb|CCF75490.1| unnamed protein product [Babesia microti strain RI]
          Length = 1215

 Score = 39.7 bits (91), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 59/305 (19%), Positives = 118/305 (38%), Gaps = 70/305 (22%)

Query: 206 AQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQR 265
           ++++ ++LE+  E  T +           +N  TALHRIAKN +        R   +   
Sbjct: 362 SRSSSDILEIYKENFTEINY---------VNAVTALHRIAKNSKN-----HERYTLSNDP 407

Query: 266 EMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEM--------DRVA----EVA 313
            M+ L+    + +P+   Q I+N  WAL+++     ++S +        +++      ++
Sbjct: 408 TMNKLLDHIYSFIPQMDQQSITNTLWALTRLEIRPNWISNLFLKLIPLANKLTPSELSMS 467

Query: 314 LTKVGEFNSQN----VAN---------------------------------VAGAFASMQ 336
           L  V +FNS +    V N                                 +A +FA + 
Sbjct: 468 LYCVAKFNSSSKKRLVTNQINKSTAYTIKDTLLTISRQRIEEFKMPIELTCIATSFARLN 527

Query: 337 HSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFT 394
                +F  +A ++  +  ++    + +  ++W+FA +      LL      F +     
Sbjct: 528 VRDSHVFRYIADKSLQLFEMNKLDVEHICSLIWSFARVNIVNTSLLGHF-CKFIEKNADK 586

Query: 395 CCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQ----LGNIAWSYAVLGQMDRI 450
           C L   ++ C     +  + +     ++S  + +F RD     +  IAWSY+  G  D  
Sbjct: 587 CALRDLVNLCWSLSKLNYTPNELFIYTMSPMLRTFIRDMNSRDVSIIAWSYSNAGIQDNE 646

Query: 451 FFSDI 455
            F D+
Sbjct: 647 LFKDL 651


>gi|422921513|ref|ZP_16954736.1| hypothetical protein VCBJG01_0254 [Vibrio cholerae BJG-01]
 gi|341648748|gb|EGS72784.1| hypothetical protein VCBJG01_0254 [Vibrio cholerae BJG-01]
          Length = 108

 Score = 39.7 bits (91), Expect = 5.1,   Method: Composition-based stats.
 Identities = 22/69 (31%), Positives = 35/69 (50%), Gaps = 3/69 (4%)

Query: 536 GLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVV 595
           G+ + R++ V  Y +D      K+A EIDG +HFS    +   H   +  Y+   G  VV
Sbjct: 18  GVKFRRQFGVGNYVLDFYCSTYKLAVEIDGDSHFSEGGKI---HDEQRTAYLTRHGIRVV 74

Query: 596 SLSHQEWEE 604
             ++QE E+
Sbjct: 75  RYTNQEVEQ 83


>gi|340053741|emb|CCC48034.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 514

 Score = 39.7 bits (91), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 49/92 (53%), Gaps = 4/92 (4%)

Query: 283 AQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
           A+ ++NI  A SK G   E L+     RV  +A  +VGEF + ++  +A AF+ +++   
Sbjct: 180 AKDVTNIISAFSKTGINHEKLFSFLSKRVQTLA--RVGEFEAAHLVILANAFSRLRYRDK 237

Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
            LF  +A+RA  +       EL  ++ AF+ +
Sbjct: 238 FLFGAIARRAMSLRERVTVNELVPLIVAFSKI 269


>gi|405373797|ref|ZP_11028456.1| Aspartokinase [Chondromyces apiculatus DSM 436]
 gi|397087311|gb|EJJ18361.1| Aspartokinase [Myxococcus sp. (contaminant ex DSM 436)]
          Length = 425

 Score = 39.7 bits (91), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 51/108 (47%), Gaps = 11/108 (10%)

Query: 124 KNKVTDDDLDFDLEDDMKMDDIMGSGNGYDMNDLRRTVSMM------AGGMFEEKREKTI 177
           K+  TDD      E+D  M+D++  G  YD N+ + TV  +      A  +F    EK I
Sbjct: 229 KSSFTDDPGTLVCEEDSSMEDVLVRGVAYDRNETKITVCGVPDIAGAAAKIFGPLDEKHI 288

Query: 178 EEFVHRLSQFSGPS-NRRKEINLNKDIVDAQTAQEVLEVIAEMITAVG 224
              V  + Q   PS + R ++       D QTAQ+V+  +AE I A G
Sbjct: 289 --VVDLIVQ--NPSRDGRTDVTFTVGKTDFQTAQDVVRKVAEEIGAAG 332


>gi|342181126|emb|CCC90604.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 517

 Score = 39.7 bits (91), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 49/92 (53%), Gaps = 4/92 (4%)

Query: 283 AQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
           A+ ++NI  A SK G   E L+     RV  +A  +VGEF + ++  +A AF+ +++   
Sbjct: 183 AKDVTNIISAFSKTGINHEKLFSFLSRRVQTLA--RVGEFEAAHLVILANAFSRLRYRDK 240

Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
            LF  +A+RA  +       EL  ++ AF+ +
Sbjct: 241 FLFGAIARRAMSLRERVTVNELVPLIVAFSKI 272


>gi|254286257|ref|ZP_04961216.1| protein of unknown function [Vibrio cholerae AM-19226]
 gi|150423672|gb|EDN15614.1| protein of unknown function [Vibrio cholerae AM-19226]
          Length = 126

 Score = 39.3 bits (90), Expect = 5.8,   Method: Composition-based stats.
 Identities = 27/98 (27%), Positives = 45/98 (45%), Gaps = 8/98 (8%)

Query: 512 KTKRFNQKVTSSFQKEVARLL-----VSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGP 566
           ++K F Q + ++      RL         G+ + R++ V  Y +D      K+A EIDG 
Sbjct: 7   RSKVFRQYLRNNMTHPEQRLWQHLRHFQLGVKFRRQFGVGNYVLDFYCSTYKLAVEIDGD 66

Query: 567 THFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEE 604
           +HFS    +   H   +  Y+   G  VV  ++QE E+
Sbjct: 67  SHFSEGGKI---HDEQRTAYLTRHGIRVVRYTNQEVEQ 101


>gi|189183514|ref|YP_001937299.1| repeat-containing protein A_03 [Orientia tsutsugamushi str. Ikeda]
 gi|189180285|dbj|BAG40065.1| repeat-containing protein A_03 [Orientia tsutsugamushi str. Ikeda]
          Length = 237

 Score = 39.3 bits (90), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 28/109 (25%), Positives = 49/109 (44%), Gaps = 11/109 (10%)

Query: 280 ECSAQGISNIAWALSK----IGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASM 335
           +  A+G++ I +  +K    IG E +     +     A+  + EFN Q +AN   AF  +
Sbjct: 56  QFDARGLATILYQFAKLNYVIGSEFI-----EAWTNKAINLMDEFNPQELANSIWAFGRL 110

Query: 336 Q-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLES 382
           + H +          A+  +  F  Q LA  +WAF  L   P+D  +++
Sbjct: 111 EIHPSDQFIQAWIHHATKTIDNFNTQGLANSIWAFGRLEIHPSDQFIQA 159


>gi|156094199|ref|XP_001613137.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148802011|gb|EDL43410.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 578

 Score = 39.3 bits (90), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 24/91 (26%), Positives = 47/91 (51%), Gaps = 6/91 (6%)

Query: 286 ISNIAWALSKI--GGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
           IS IA   +K+  G   L+     ++ E    ++ E + Q+++N+  A++ +   +  L+
Sbjct: 189 ISQIANCFAKLNYGDATLFRHMEQQICE----RIDELSCQSISNICNAYSKLSLGSTTLY 244

Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASLYE 374
             L K  +  +  F EQE+A +L A+A + E
Sbjct: 245 DHLIKAVTKNLQKFNEQEIANILNAYAKVGE 275


>gi|71030818|ref|XP_765051.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68352007|gb|EAN32768.1| hypothetical protein, conserved [Theileria parva]
          Length = 471

 Score = 39.3 bits (90), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 37/131 (28%), Positives = 63/131 (48%), Gaps = 19/131 (14%)

Query: 517 NQKVTSSFQKEVARLLVSTGL-NWIREYAVDGYTVDA--VLVDKKVAFEIDGPTHFSRNT 573
           N K+ S  QK V+  L+   + + +     D  +VD    L  +K+  E+DGPTHF RN 
Sbjct: 347 NGKIISKSQKLVSDFLIRQNIPHQLEILTSDLSSVDIYICLNGEKIILEVDGPTHFIRNL 406

Query: 574 GVP-----LGHTMLKRRYIAAAGWNVVSLS--HQEWEELQGSFEQLD-YLRVILKDYIGG 625
             P     +G    K + +   G+  +S+   H   + ++    Q+D Y + +LK+    
Sbjct: 407 NDPSETRKIGPCDFKEKMLKENGFVFISIPPIHSNTQNIK----QIDEYYKELLKN---- 458

Query: 626 EGSSNIAETLK 636
            GS+++ E LK
Sbjct: 459 SGSAHLNEILK 469


>gi|421341825|ref|ZP_15792234.1| hypothetical protein VCHC43B1_0345 [Vibrio cholerae HC-43B1]
 gi|395947002|gb|EJH57660.1| hypothetical protein VCHC43B1_0345 [Vibrio cholerae HC-43B1]
          Length = 117

 Score = 39.3 bits (90), Expect = 6.6,   Method: Composition-based stats.
 Identities = 22/69 (31%), Positives = 35/69 (50%), Gaps = 3/69 (4%)

Query: 536 GLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVV 595
           G+ + R++ V  Y +D      K+A EIDG +HFS    +   H   +  Y+   G  VV
Sbjct: 27  GVKFRRQFGVGNYVLDFYCSTYKLAVEIDGDSHFSEGGKI---HDEQRTAYLKRHGIRVV 83

Query: 596 SLSHQEWEE 604
             ++QE E+
Sbjct: 84  RYTNQEVEQ 92


>gi|114569092|ref|YP_755772.1| hypothetical protein Mmar10_0541 [Maricaulis maris MCS10]
 gi|114339554|gb|ABI64834.1| protein of unknown function DUF559 [Maricaulis maris MCS10]
          Length = 225

 Score = 39.3 bits (90), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 39/80 (48%), Gaps = 8/80 (10%)

Query: 536 GLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVP--LGHTMLKRRYIAAAGWN 593
           G  + R++ V  Y  D   V+ K+  E+DG TH     G P  L H   +  ++ AAGW 
Sbjct: 72  GFKFRRQHPVAPYIADFACVELKLIVELDGDTH-----GTPQELAHDRRRTGFLEAAGWT 126

Query: 594 VV-SLSHQEWEELQGSFEQL 612
           V+ + +   ++ L G   Q+
Sbjct: 127 VIRAFNIDVYQNLDGVLTQI 146


>gi|197245530|gb|AAI68451.1| Unknown (protein for MGC:136169) [Xenopus (Silurana) tropicalis]
          Length = 546

 Score = 39.3 bits (90), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 33/62 (53%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           V +++A  IDG   F  NT   LG   +K+R++   G+ V+ +   E++ L    E ++Y
Sbjct: 473 VHRRIALCIDGQKRFCSNTHKLLGKESIKQRHLRLLGYEVIQIPFYEFDNLSYKEEIVEY 532

Query: 615 LR 616
           L 
Sbjct: 533 LH 534


>gi|342184581|emb|CCC94063.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 1024

 Score = 39.3 bits (90), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 37/126 (29%), Positives = 62/126 (49%), Gaps = 12/126 (9%)

Query: 252 SMMTTHRLAFTRQREMS---MLVAIAMTALPECSAQGISNIAWALSKIGG--ELLYLSEM 306
           ++M+  R+ FT QR+M     L A+AM   P CS Q ++NIA A S  G   E L+    
Sbjct: 733 TLMSFARVGFT-QRDMVDSFTLRALAMA--PTCSLQALANIAIAFSISGCRHEELFSIIA 789

Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVL 366
           DR     + +  +  +  +A+V  AFAS+      LF E   R   +      +++  V+
Sbjct: 790 DRF----INQKMDIPAVTIASVLSAFASIGIRNDRLFIEAIPRVRHVGQYGTPKDITNVV 845

Query: 367 WAFASL 372
           +A++ +
Sbjct: 846 YAYSQV 851



 Score = 39.3 bits (90), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 22/87 (25%), Positives = 44/87 (50%), Gaps = 2/87 (2%)

Query: 286 ISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSE 345
           I+N+ +A S++G  L +     R+A+ A+   GEF   +VA +  A+A +      LF E
Sbjct: 841 ITNVVYAYSQVG--LWHYKLFVRLADRAIQLRGEFRCDHVAKLLEAYARVNMRYEKLFVE 898

Query: 346 LAKRASDIVHTFQEQELAQVLWAFASL 372
            + R   + H     E+  ++ ++ ++
Sbjct: 899 FSSRIQTLAHLMNAGEITSIVHSYVTV 925


>gi|317048838|ref|YP_004116486.1| hypothetical protein Pat9b_2630 [Pantoea sp. At-9b]
 gi|316950455|gb|ADU69930.1| protein of unknown function DUF559 [Pantoea sp. At-9b]
          Length = 117

 Score = 39.3 bits (90), Expect = 7.0,   Method: Composition-based stats.
 Identities = 21/79 (26%), Positives = 43/79 (54%), Gaps = 4/79 (5%)

Query: 535 TGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNV 594
           +G+ + R+YA+  Y VD   +++ +  E+DG  H  ++T   L +  ++  Y+   GW V
Sbjct: 32  SGVKFRRQYAIGRYIVDFACIERLLVIELDGGQHAEQST---LHYDEVRTAYLHRCGWRV 88

Query: 595 VSL-SHQEWEELQGSFEQL 612
           +   ++Q + EL    E++
Sbjct: 89  IRFWNNQVFCELDAVMEEI 107


>gi|422348130|ref|ZP_16429035.1| hypothetical protein HMPREF9476_03108 [Clostridium perfringens
           WAL-14572]
 gi|373222679|gb|EHP45041.1| hypothetical protein HMPREF9476_03108 [Clostridium perfringens
           WAL-14572]
          Length = 315

 Score = 38.9 bits (89), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 13/104 (12%)

Query: 486 QCLKLEHPHLQLALSSVL---EEKIASAGK---TKRFNQKVTSSFQKEVARLLVSTGLNW 539
           +CLKL++     AL S+    EE    +GK   +K +  KV  + +      L+ T  ++
Sbjct: 13  RCLKLKN-----ALESIKPKKEEFSTFSGKKPFSKEYELKVKYNLENPYQSTLIGTAFDY 67

Query: 540 IREYAVDGYTVDAVLVDKKVAFEIDGPTH--FSRNTGVPLGHTM 581
           +  + +  YT   V VD  +AF+I  P H      T   L H M
Sbjct: 68  LARFIISKYTFSYVSVDNLIAFKIAEPIHEIIDEETSSKLKHLM 111


>gi|291236686|ref|XP_002738268.1| PREDICTED: FAST kinase domain-containing protein 1-like
           [Saccoglossus kowalevskii]
          Length = 101

 Score = 38.9 bits (89), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 7/76 (9%)

Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW--EELQGSFEQLDY 614
           ++VA E      F  N+  PLG+  +KRR++   G+  V++ H EW   +L  S +  +Y
Sbjct: 15  ERVAIEFLSSKSFCTNSQHPLGYIDMKRRHLEIMGYRYVAIPHFEWFSMKLSSSDDYREY 74

Query: 615 LRVIL-----KDYIGG 625
           LR  L      DY+ G
Sbjct: 75  LREKLFAQKDPDYLEG 90


>gi|389583480|dbj|GAB66215.1| hypothetical protein PCYB_083760, partial [Plasmodium cynomolgi
           strain B]
          Length = 468

 Score = 38.9 bits (89), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 47/91 (51%), Gaps = 6/91 (6%)

Query: 286 ISNIAWALSKI--GGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
           IS IA   +K+  G + L+     ++ E    ++ E + Q+++N+  A++ +   +  LF
Sbjct: 84  ISQIANCFAKLNYGDDKLFKHMEQQICE----RIDELSCQSISNICNAYSKLSLGSETLF 139

Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASLYE 374
             L K     +  F EQE+A +L A++ L E
Sbjct: 140 CRLIKTVKKNLDNFNEQEIANILNAYSKLGE 170


>gi|351542151|ref|NP_001135619.2| FAST kinase domains 3 [Xenopus (Silurana) tropicalis]
          Length = 691

 Score = 38.9 bits (89), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 33/62 (53%)

Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
           V +++A  IDG   F  NT   LG   +K+R++   G+ V+ +   E++ L    E ++Y
Sbjct: 618 VHRRIALCIDGQKRFCSNTHKLLGKESIKQRHLRLLGYEVIQIPFYEFDNLSYKEEIVEY 677

Query: 615 LR 616
           L 
Sbjct: 678 LH 679


>gi|297581735|ref|ZP_06943657.1| DNA methyltransferase [Vibrio cholerae RC385]
 gi|421350131|ref|ZP_15800499.1| hypothetical protein VCHE25_1308 [Vibrio cholerae HE-25]
 gi|297534142|gb|EFH72981.1| DNA methyltransferase [Vibrio cholerae RC385]
 gi|395955238|gb|EJH65841.1| hypothetical protein VCHE25_1308 [Vibrio cholerae HE-25]
          Length = 126

 Score = 38.9 bits (89), Expect = 7.7,   Method: Composition-based stats.
 Identities = 27/98 (27%), Positives = 45/98 (45%), Gaps = 8/98 (8%)

Query: 512 KTKRFNQKVTSSFQKEVARLL-----VSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGP 566
           ++K F Q + ++      RL         G+ + R++ V  Y +D      K+A EIDG 
Sbjct: 7   RSKVFRQYLRNNMTHPEQRLWQHLRHFQLGVKFRRQFGVGNYVLDFYCSTYKLAVEIDGD 66

Query: 567 THFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEE 604
           +HFS    +   H   +  Y+   G  VV  ++QE E+
Sbjct: 67  SHFSEGGKI---HDEQRTAYLKRHGIRVVRYTNQEVEQ 101


>gi|255087452|ref|XP_002505649.1| predicted protein [Micromonas sp. RCC299]
 gi|226520919|gb|ACO66907.1| predicted protein [Micromonas sp. RCC299]
          Length = 629

 Score = 38.9 bits (89), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 46/177 (25%), Positives = 74/177 (41%), Gaps = 19/177 (10%)

Query: 202 DIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAF 261
           D   A  +QE  + +AE      + ++P  ++  N+  AL ++      VS     RLA 
Sbjct: 261 DAAAAAVSQEGWKRLAEAAEQQARDMNPQDIA--NVLNALSKLDAAAAAVSPEGWKRLAE 318

Query: 262 TRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSE--MDRVAEVALTKVGE 319
             +R+             E + QG +N+  ALSK+      +S     RV E    +  E
Sbjct: 319 AAERQAR-----------EMNPQGNANVLNALSKLDAAAAEVSPEGWKRVGEAVERQARE 367

Query: 320 FNSQNVANVAGAFASMQHSA----PDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
            N Q  ANV  A + +  +A    P+ +  LA+ A         Q++A VL A + L
Sbjct: 368 MNPQGNANVLNALSKLDAAAAAVSPEGWKRLAEAAERQARDMNPQDIANVLNALSKL 424


>gi|294865269|ref|XP_002764366.1| hypothetical protein Pmar_PMAR015373 [Perkinsus marinus ATCC 50983]
 gi|239863598|gb|EEQ97083.1| hypothetical protein Pmar_PMAR015373 [Perkinsus marinus ATCC 50983]
          Length = 810

 Score = 38.9 bits (89), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 75/347 (21%), Positives = 126/347 (36%), Gaps = 77/347 (22%)

Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP--- 340
           Q +  + WAL  +      L E   V    + K G  +S+++A V     S  H +P   
Sbjct: 519 QDVGLLVWALGTLRLSHYELEERCCVLARGMLKEGRIDSRHLAMVLWGITSNAHRSPSAI 578

Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKA 400
           DL  ++  R      + +  ++  V+W+ A     ++  L+ L  A   A          
Sbjct: 579 DLIQDVIHRVESSTLSPRPADVTIVIWSMAVFDLYSEKALQKLLEALVKA---------- 628

Query: 401 LSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTIS 460
                   G  S+    +E   S                       + R+  S +W  + 
Sbjct: 629 --------GPMSNAPPRTEQGAS-----------------------LIRLHRSLLWARLC 657

Query: 461 RFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKV 520
              +   SE+           HLV    +   P   L  SS L+ +I S  +        
Sbjct: 658 HGFQPSPSEE----------AHLVKIAQRQRAPGGGLVTSSTLQWEIRSELQRVLLEVAP 707

Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKK----VAFEIDGPTHFSR----N 572
           T+S + E                 ++G  VD  ++D K    +  E+DG +HFS+    N
Sbjct: 708 TASLRDEYEL-----------PAPLEGIFVDLAVIDAKEQVLLIIEVDGYSHFSKLISDN 756

Query: 573 TGVPL---GHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
           +   L   G+T L RR +  AG+ V+S+S  +W   Q    + +YLR
Sbjct: 757 SLAELQYNGNTELSRRILRKAGYEVLSISTVDWNNTQ-RHRRGEYLR 802


>gi|153801511|ref|ZP_01956097.1| DNA methyltransferase [Vibrio cholerae MZO-3]
 gi|153826738|ref|ZP_01979405.1| DNA methyltransferase [Vibrio cholerae MZO-2]
 gi|417819161|ref|ZP_12465780.1| hypothetical protein VCHE39_0600 [Vibrio cholerae HE39]
 gi|419835217|ref|ZP_14358665.1| hypothetical protein VCHC46B1_0339 [Vibrio cholerae HC-46B1]
 gi|423733571|ref|ZP_17706797.1| hypothetical protein VCHC41B1_0330 [Vibrio cholerae HC-41B1]
 gi|423944542|ref|ZP_17733223.1| hypothetical protein VCHE40_0267 [Vibrio cholerae HE-40]
 gi|423973991|ref|ZP_17736771.1| hypothetical protein VCHE46_0269 [Vibrio cholerae HE-46]
 gi|424007860|ref|ZP_17750816.1| hypothetical protein VCHC44C1_0325 [Vibrio cholerae HC-44C1]
 gi|124122916|gb|EAY41659.1| DNA methyltransferase [Vibrio cholerae MZO-3]
 gi|149739453|gb|EDM53691.1| DNA methyltransferase [Vibrio cholerae MZO-2]
 gi|340043051|gb|EGR04012.1| hypothetical protein VCHE39_0600 [Vibrio cholerae HE39]
 gi|408632129|gb|EKL04612.1| hypothetical protein VCHC41B1_0330 [Vibrio cholerae HC-41B1]
 gi|408662338|gb|EKL33288.1| hypothetical protein VCHE40_0267 [Vibrio cholerae HE-40]
 gi|408666350|gb|EKL37139.1| hypothetical protein VCHE46_0269 [Vibrio cholerae HE-46]
 gi|408859358|gb|EKL99019.1| hypothetical protein VCHC46B1_0339 [Vibrio cholerae HC-46B1]
 gi|408867417|gb|EKM06777.1| hypothetical protein VCHC44C1_0325 [Vibrio cholerae HC-44C1]
          Length = 126

 Score = 38.9 bits (89), Expect = 8.0,   Method: Composition-based stats.
 Identities = 22/69 (31%), Positives = 35/69 (50%), Gaps = 3/69 (4%)

Query: 536 GLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVV 595
           G+ + R++ V  Y +D      K+A EIDG +HFS    +   H   +  Y+   G  VV
Sbjct: 36  GVKFRRQFGVGNYVLDFYCSTYKLAVEIDGDSHFSEGGKI---HDEQRTAYLKRHGIRVV 92

Query: 596 SLSHQEWEE 604
             ++QE E+
Sbjct: 93  RYTNQEVEQ 101


>gi|71748328|ref|XP_823219.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70832887|gb|EAN78391.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 1024

 Score = 38.9 bits (89), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 30/115 (26%), Positives = 53/115 (46%), Gaps = 15/115 (13%)

Query: 286 ISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSE 345
           I+N+ +A S++G  L +     R+A+ A+   GEF    +A +  A+A +      LF E
Sbjct: 836 ITNVVYAYSQVG--LWHYKLFVRLADRAVQLRGEFRCDQLARLLEAYARVDMRYEKLFVE 893

Query: 346 LAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKA 400
            + R   + H     E++ V+ A+A         +  LD A      F  C+++A
Sbjct: 894 FSPRVQTVAHLLTAGEISTVVNAYAK--------VRVLDTAV-----FKACVDRA 935


>gi|302854443|ref|XP_002958729.1| hypothetical protein VOLCADRAFT_120035 [Volvox carteri f.
            nagariensis]
 gi|300255904|gb|EFJ40185.1| hypothetical protein VOLCADRAFT_120035 [Volvox carteri f.
            nagariensis]
          Length = 2274

 Score = 38.9 bits (89), Expect = 8.5,   Method: Composition-based stats.
 Identities = 30/108 (27%), Positives = 55/108 (50%), Gaps = 5/108 (4%)

Query: 268  SMLVAIAMTALPECSAQGISNIAWALSKIGGELLY--LSEMDRVAEVALTKVGEFNSQNV 325
            S+ V  A T LP+ + + ++ + W+L+K+G    +  LS +    +     +   + Q +
Sbjct: 1065 SLAVRFAQT-LPDATIREVATVLWSLAKLGRPAPHALLSHILAAQQRGFM-LRTASPQAI 1122

Query: 326  ANVAGAFASMQHSAPD-LFSELAKRASDIVHTFQEQELAQVLWAFASL 372
            AN+  A A+ +   P+ L S + ++    +  FQ Q+ A VLWA A L
Sbjct: 1123 ANMLWALATWRTREPEPLLSLVLEQCYRALPAFQPQDTANVLWALARL 1170


>gi|159469824|ref|XP_001693063.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158277865|gb|EDP03632.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 649

 Score = 38.9 bits (89), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 9/94 (9%)

Query: 304 SEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH--SAPDLFSELAKRASDIVHTFQEQE 361
           S +D VA+V L+++   +   VA     F + +H  + PD   ++A      + +F  Q 
Sbjct: 253 SLLDAVADVLLSRLDGLSHHEVATALWTFGTFRHRPAHPDFAKQVAAALYARMRSFSPQG 312

Query: 362 LAQVLWAFASLYEPADPLLESLD-------NAFK 388
           LA V+ A A L   ++PL+E L        NAFK
Sbjct: 313 LAMVVKALAQLQWRSEPLMEQLIAAAEAKLNAFK 346


>gi|261333127|emb|CBH16122.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 1024

 Score = 38.9 bits (89), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 30/115 (26%), Positives = 53/115 (46%), Gaps = 15/115 (13%)

Query: 286 ISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSE 345
           I+N+ +A S++G  L +     R+A+ A+   GEF    +A +  A+A +      LF E
Sbjct: 836 ITNVVYAYSQVG--LWHYKLFVRLADRAVQLRGEFRCDQLARLLEAYARVDMRYEKLFVE 893

Query: 346 LAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKA 400
            + R   + H     E++ V+ A+A         +  LD A      F  C+++A
Sbjct: 894 FSPRVQTVAHLLTAGEISTVVNAYAK--------VRVLDTAV-----FKACVDRA 935


>gi|260802957|ref|XP_002596358.1| hypothetical protein BRAFLDRAFT_121233 [Branchiostoma floridae]
 gi|229281613|gb|EEN52370.1| hypothetical protein BRAFLDRAFT_121233 [Branchiostoma floridae]
          Length = 831

 Score = 38.9 bits (89), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 15/54 (27%), Positives = 30/54 (55%)

Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQ 611
           +VA +      F RN+   LGH  +++R++   G+ V+ + H EW  ++ + E+
Sbjct: 762 RVAIDYQDARDFCRNSQHLLGHVAMRKRHLEILGYTVIQIPHFEWNSMKLATEE 815


>gi|156081959|ref|XP_001608472.1| Secretory protein [Plasmodium vivax Sal-1]
 gi|148801043|gb|EDL42448.1| Secretory protein, putative [Plasmodium vivax]
          Length = 441

 Score = 38.9 bits (89), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 23/90 (25%), Positives = 42/90 (46%), Gaps = 1/90 (1%)

Query: 522 SSFQKEVARLLVSTGLNWIREYAVDGYTVD-AVLVDKKVAFEIDGPTHFSRNTGVPLGHT 580
           S FQ EV+  L   G++    +    Y +D   + +K+  + +DGP  F  +T   +   
Sbjct: 293 SEFQWEVSNCLAKLGISHRNTFLWGSYYIDIGEMNEKRNCWFVDGPACFYTSTNQYIESV 352

Query: 581 MLKRRYIAAAGWNVVSLSHQEWEELQGSFE 610
            L+ R +   GWN+  +   +W +L   +E
Sbjct: 353 KLQHRILYNLGWNIRRIVWLDWLQLGDDWE 382


>gi|357020460|ref|ZP_09082691.1| hypothetical protein KEK_10638 [Mycobacterium thermoresistibile
           ATCC 19527]
 gi|356478208|gb|EHI11345.1| hypothetical protein KEK_10638 [Mycobacterium thermoresistibile
           ATCC 19527]
          Length = 287

 Score = 38.9 bits (89), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 522 SSFQKEVARLLVSTGLN-WIREYAVDGYTVDAVLVDKKVAFEIDGPTHFS 570
           S+ ++++ RLL   G++ W   YA+ GY VD      +VA E+DG  + S
Sbjct: 190 SAAERKLVRLLRGAGISGWTTNYAIGGYKVDVAFPAGRVAIEVDGLAYHS 239


>gi|386590754|ref|YP_006087154.1| Dipeptide-binding ABC transporter [Salmonella enterica subsp.
           enterica serovar Heidelberg str. B182]
 gi|383797798|gb|AFH44880.1| Dipeptide-binding ABC transporter [Salmonella enterica subsp.
           enterica serovar Heidelberg str. B182]
          Length = 512

 Score = 38.9 bits (89), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 20/70 (28%), Positives = 34/70 (48%)

Query: 134 FDLEDDMKMDDIMGSGNGYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNR 193
           F L+ DMK+ +++  G     + L  T+++  GG F++  +         L + S P N 
Sbjct: 63  FGLDKDMKVKNVLAKGYTVSDDGLTYTITLRQGGKFQDGADFDAAAVKANLDRASNPDNH 122

Query: 194 RKEINLNKDI 203
            K  NL K+I
Sbjct: 123 LKRYNLYKNI 132


>gi|159485166|ref|XP_001700618.1| predicted protein of CLR family [Chlamydomonas reinhardtii]
 gi|158272142|gb|EDO97947.1| predicted protein of CLR family [Chlamydomonas reinhardtii]
          Length = 584

 Score = 38.9 bits (89), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 29/53 (54%)

Query: 320 FNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
           FN Q ++NV  A A + H  PDL   LA  A+  V +   Q L+  LWA A+L
Sbjct: 191 FNQQELSNVLWACAKLGHRDPDLLQPLADAAAAAVASMTGQGLSNCLWALATL 243


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.130    0.369 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,632,927,709
Number of Sequences: 23463169
Number of extensions: 402818058
Number of successful extensions: 1976467
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 254
Number of HSP's successfully gapped in prelim test: 1435
Number of HSP's that attempted gapping in prelim test: 1959658
Number of HSP's gapped (non-prelim): 10985
length of query: 640
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 491
effective length of database: 8,863,183,186
effective search space: 4351822944326
effective search space used: 4351822944326
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 80 (35.4 bits)