BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 006558
(640 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225434251|ref|XP_002276208.1| PREDICTED: uncharacterized protein LOC100257808 [Vitis vinifera]
Length = 656
Score = 898 bits (2320), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 455/606 (75%), Positives = 519/606 (85%), Gaps = 9/606 (1%)
Query: 30 VDDSEEKESEDSVDWESEFLGELDPFGYQAPKKRKKQEK-SKVVDDNEGMDWCVRARKVA 88
VD ++++ESE +DWE EFLGELDP G+QAPKKRKK+E+ SK+++D +GMDWCV+ARK+A
Sbjct: 56 VDSNDKQESE--MDWELEFLGELDPLGFQAPKKRKKREQGSKLLEDTDGMDWCVKARKMA 113
Query: 89 LKSIEARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVTDDDLDFDLEDDMKMDDIMGS 148
LKSIEARGL +MEDLI VKKKK KK +K K K + + D ++D+++ +
Sbjct: 114 LKSIEARGLTRTMEDLITVKKKKNNKKKLGKKDKISKKSKVSEEEDDSDEDIELKGV--- 170
Query: 149 GNGYDMND-LRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQ 207
N D D LR+TVSM+AGGMFEEK+EKT++ FV RLSQFSGPS+RRKEINLNK IV+AQ
Sbjct: 171 -NPLDGADRLRKTVSMVAGGMFEEKKEKTMQAFVQRLSQFSGPSDRRKEINLNKAIVEAQ 229
Query: 208 TAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREM 267
TA+EVLEV AE I AVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMT+ RLAF RQ+EM
Sbjct: 230 TAEEVLEVAAETIMAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTSRRLAFARQKEM 289
Query: 268 SMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVAN 327
SMLV IAMTALPECSAQGISNI+WALSKIGGELLYLSEMDRVAEVALTKV +FNSQNVAN
Sbjct: 290 SMLVGIAMTALPECSAQGISNISWALSKIGGELLYLSEMDRVAEVALTKVEQFNSQNVAN 349
Query: 328 VAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAF 387
VAGAFASM+HSAPDLFSEL++RAS+IVH FQEQELAQVLWAFASL EPA PLLESLDN F
Sbjct: 350 VAGAFASMRHSAPDLFSELSERASNIVHNFQEQELAQVLWAFASLNEPAGPLLESLDNVF 409
Query: 388 KDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQM 447
D QF CCL++ NE V+++GD E SP L+F RDQLGNIAWSYAVLGQM
Sbjct: 410 NDENQFKCCLDQETLKYNEESVVENNGDLAMEEISGSPALNFKRDQLGNIAWSYAVLGQM 469
Query: 448 DRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKI 507
DR+FFS +WKT+S FEEQRISEQYREDIMFASQVHLVNQCLKLE+PHL+L+L S LEEK+
Sbjct: 470 DRVFFSHVWKTLSHFEEQRISEQYREDIMFASQVHLVNQCLKLEYPHLRLSLRSDLEEKV 529
Query: 508 ASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPT 567
A AGKTKRFNQK+TSSFQKEVA LLVSTGL+W+REY VDGYT+DAVLVD+KVA EIDGPT
Sbjct: 530 ARAGKTKRFNQKMTSSFQKEVAHLLVSTGLDWVREYVVDGYTLDAVLVDQKVALEIDGPT 589
Query: 568 HFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIGGEG 627
HFSRN+GVPLGHTMLKRRYI AAGW + S+SHQEWEELQG FEQLDYLR ILKD+I GEG
Sbjct: 590 HFSRNSGVPLGHTMLKRRYITAAGWKLASVSHQEWEELQGGFEQLDYLREILKDHI-GEG 648
Query: 628 SSNIAE 633
S+NI +
Sbjct: 649 SANIVQ 654
>gi|147853193|emb|CAN78554.1| hypothetical protein VITISV_042206 [Vitis vinifera]
Length = 676
Score = 885 bits (2288), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 455/626 (72%), Positives = 519/626 (82%), Gaps = 29/626 (4%)
Query: 30 VDDSEEKESEDSVDWESEFLGELDPFGYQAPKKRKKQEK-SKVVDDNEGMDWCVRARKVA 88
VD ++++ESE +DWE EFLGELDP G+QAPKKRKK+E+ SK+++D +GMDWCV+ARK+A
Sbjct: 56 VDSNDKQESE--MDWELEFLGELDPLGFQAPKKRKKREQGSKLLEDTDGMDWCVKARKMA 113
Query: 89 LKSIEARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVTDDDLDFDLEDDMKMDDIMGS 148
LKSIEARGL +MEDLI VKKKK KK +K K K + + D ++D+++ +
Sbjct: 114 LKSIEARGLTRTMEDLITVKKKKNNKKKLGKKDKISKKSKVSEEEDDSDEDIELKGV--- 170
Query: 149 GNGYDMND-LRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQ 207
N D D LR+TVSM+AGGMFEEK+EKT++ FV RLSQFSGPS+RRKEINLNK IV+AQ
Sbjct: 171 -NPLDGADRLRKTVSMVAGGMFEEKKEKTMQAFVQRLSQFSGPSDRRKEINLNKAIVEAQ 229
Query: 208 TAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREM 267
TA+EVLEV AE I AVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMT+ RLAF RQ+EM
Sbjct: 230 TAEEVLEVAAETIMAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTSRRLAFARQKEM 289
Query: 268 SMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVAN 327
SMLV IAMTALPECSAQGISNI+WALSKIGGELLYLSEMDRVAEVALTKV +FNSQNVAN
Sbjct: 290 SMLVGIAMTALPECSAQGISNISWALSKIGGELLYLSEMDRVAEVALTKVEQFNSQNVAN 349
Query: 328 VAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAF 387
VAGAFASM+HSAPDLFSEL++RAS+IVH FQEQELAQVLWAFASL EPA PLLESLDN F
Sbjct: 350 VAGAFASMRHSAPDLFSELSERASNIVHNFQEQELAQVLWAFASLNEPAGPLLESLDNVF 409
Query: 388 KDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQM 447
D QF CCL++ NE V+++GD E SP L+F RDQLGNIAWSYAVLGQM
Sbjct: 410 NDENQFKCCLDQETLKYNEESVVENNGDLAMEEISGSPALNFKRDQLGNIAWSYAVLGQM 469
Query: 448 DRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKI 507
DR+FFS +WKT+S FEEQRISEQYREDIMFASQVHLVNQCLKLE+PHL+L+L S LEEK+
Sbjct: 470 DRVFFSHVWKTLSHFEEQRISEQYREDIMFASQVHLVNQCLKLEYPHLRLSLRSDLEEKV 529
Query: 508 ASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPT 567
A AGKTKRFNQK+TSSFQKEVA LLVSTGL+W+REY VDGYT+DAVLVD+KVA EIDGPT
Sbjct: 530 ARAGKTKRFNQKMTSSFQKEVAHLLVSTGLDWVREYVVDGYTLDAVLVDQKVALEIDGPT 589
Query: 568 HFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQ--------------------EWEELQG 607
HFSRN+GVPLGHTMLKRRYI AAGW + S+SHQ EWEELQG
Sbjct: 590 HFSRNSGVPLGHTMLKRRYITAAGWKLASVSHQERHLLVVFICVSSRGFNTVVEWEELQG 649
Query: 608 SFEQLDYLRVILKDYIGGEGSSNIAE 633
FEQLDYLR ILKD+I GEGS+NI +
Sbjct: 650 GFEQLDYLREILKDHI-GEGSANIVQ 674
>gi|224117838|ref|XP_002331644.1| predicted protein [Populus trichocarpa]
gi|222874040|gb|EEF11171.1| predicted protein [Populus trichocarpa]
Length = 663
Score = 882 bits (2280), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 440/586 (75%), Positives = 499/586 (85%), Gaps = 6/586 (1%)
Query: 44 WESEFLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKSIEARGLASSMED 103
W+ EFLGELDP G QA KKRKKQ+ S ++ D +GMDWC+RARKVALKSIEARGL+ MED
Sbjct: 79 WKLEFLGELDPLGCQASKKRKKQQNSGLLKDTDGMDWCLRARKVALKSIEARGLSQRMED 138
Query: 104 LIKVKKKKKKGKKKLEKIKKKNKVTDDDLDFDLEDDMKMDDIMGSGNGYDMNDLRRTVSM 163
LI VKKKKKK KK K K ++ D D + D ++ G DL+R VSM
Sbjct: 139 LINVKKKKKKRNKKKLVGKVKKVKDFEEDDLDFDLDEGVELEEGDA------DLKRMVSM 192
Query: 164 MAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAV 223
+ GMF+E++EKT+EEF+ RLSQFSGPS+R+KEINLN+ IV+AQTA+EVLE+ AEMI AV
Sbjct: 193 LGDGMFQERKEKTMEEFLQRLSQFSGPSDRKKEINLNRAIVEAQTAEEVLEITAEMIMAV 252
Query: 224 GKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSA 283
GKGLSPSPLSPLNIATALHRIAKNMEKVSMM T RLAF RQ+E+SMLV IAMTALPECSA
Sbjct: 253 GKGLSPSPLSPLNIATALHRIAKNMEKVSMMNTRRLAFARQKEVSMLVGIAMTALPECSA 312
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
QGISNI+WALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGA ASMQHSAPDLF
Sbjct: 313 QGISNISWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGALASMQHSAPDLF 372
Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSN 403
S L+KR S+I+HTFQEQELAQVLWAFASLYEPAD LL++LD FK+A Q C L S
Sbjct: 373 SALSKRGSEIIHTFQEQELAQVLWAFASLYEPADSLLDALDTVFKNANQLECSLKTKTSY 432
Query: 404 CNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFE 463
+E + SGD D+EG L SPVLSFNRDQLGNIAWSYAV+GQ+DRIFFS++W+T+S FE
Sbjct: 433 SDEERSNEDSGDLDAEGPLRSPVLSFNRDQLGNIAWSYAVIGQLDRIFFSNVWRTLSHFE 492
Query: 464 EQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSS 523
EQR+SEQYREDIMFASQ HLVNQCLKLE+PHL+L+L LEEKIA AGKTKRFNQK TSS
Sbjct: 493 EQRLSEQYREDIMFASQAHLVNQCLKLEYPHLRLSLGDNLEEKIARAGKTKRFNQKTTSS 552
Query: 524 FQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLK 583
FQKEVARLLVSTGL+W+REY VDGYTVDAV+VDKK+A EIDGPTHFSRNTG+PLGHTMLK
Sbjct: 553 FQKEVARLLVSTGLDWVREYVVDGYTVDAVVVDKKIALEIDGPTHFSRNTGMPLGHTMLK 612
Query: 584 RRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIGGEGSS 629
RRYIAAAGWNVVSLSHQEWEE++GS+EQ +YLR ILK++IGG+ SS
Sbjct: 613 RRYIAAAGWNVVSLSHQEWEEIEGSYEQQEYLREILKEHIGGDSSS 658
>gi|255585295|ref|XP_002533346.1| conserved hypothetical protein [Ricinus communis]
gi|223526811|gb|EEF29031.1| conserved hypothetical protein [Ricinus communis]
Length = 666
Score = 850 bits (2195), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 462/613 (75%), Positives = 506/613 (82%), Gaps = 28/613 (4%)
Query: 31 DDSEEKESEDSVDWESEFLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALK 90
D+ EE E DWE EFLGELDP GYQAPKKRKKQ+KSK++++ +GMDWC+RARKVALK
Sbjct: 68 DNGEEVE-----DWELEFLGELDPLGYQAPKKRKKQKKSKLLEETDGMDWCLRARKVALK 122
Query: 91 SIEARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVTDDD------------LDFDLED 138
SIEARGL+ +MEDLI VKKKKKK KKKL K +K D ++F+
Sbjct: 123 SIEARGLSQNMEDLINVKKKKKKNKKKLVSKSKISKKNKDLEDDSDFDLDDEDVEFEDVA 182
Query: 139 DMKMDDIMGSGNGYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEIN 198
D+ DD + DLRRTVS MAGGMFEEK+EK +EEFV RLSQFSGPS+R+KE+N
Sbjct: 183 DLPGDDSI---------DLRRTVSSMAGGMFEEKKEKNMEEFVQRLSQFSGPSDRKKEVN 233
Query: 199 LNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHR 258
LN+ IV+AQTA+EVLEV A+MI AVGKGLSPSPLSPLNIATALHRIAKNMEKVSMM T R
Sbjct: 234 LNRAIVEAQTAEEVLEVTADMIIAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMKTRR 293
Query: 259 LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVG 318
LAF RQREMSMLV IAMTALPECSAQGISNI+WALSKIGGELLYLSEMDRVAEVALTKV
Sbjct: 294 LAFARQREMSMLVGIAMTALPECSAQGISNISWALSKIGGELLYLSEMDRVAEVALTKVD 353
Query: 319 EFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADP 378
EFNSQNVANVAGAFASMQHSA DLFS L+KRASDI+HTFQEQELAQVLWAFASLYEPAD
Sbjct: 354 EFNSQNVANVAGAFASMQHSASDLFSALSKRASDIIHTFQEQELAQVLWAFASLYEPADS 413
Query: 379 LLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIA 438
LLESLD FKD QF C N NE +K SGD D E PVL FNRDQLGNIA
Sbjct: 414 LLESLDIVFKDVNQFHCYTKAETLNYNEVDSMKGSGDLDREEVSGPPVLKFNRDQLGNIA 473
Query: 439 WSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLA 498
WSYAV GQ++R FFS+IW+T+ EEQRISEQYREDIMFASQ HLVNQCLKLEHPH QLA
Sbjct: 474 WSYAVFGQVNRTFFSNIWRTLRNSEEQRISEQYREDIMFASQAHLVNQCLKLEHPHYQLA 533
Query: 499 LSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKK 558
L LEEKIA AGKTKRFNQK+TSSFQKEVARLLVSTGL+W+REY VDGYT+DAV+VDKK
Sbjct: 534 LGGDLEEKIARAGKTKRFNQKITSSFQKEVARLLVSTGLDWVREYVVDGYTLDAVVVDKK 593
Query: 559 VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVI 618
+A EIDGPTHFSRNTGVPLGHTMLKRRYI+AAGW VVSLSHQEWEELQGSFEQLDYLR I
Sbjct: 594 IALEIDGPTHFSRNTGVPLGHTMLKRRYISAAGWKVVSLSHQEWEELQGSFEQLDYLREI 653
Query: 619 LKDYIGGEGSSNI 631
LK ++G S+NI
Sbjct: 654 LKVHLG--DSNNI 664
>gi|356506291|ref|XP_003521919.1| PREDICTED: uncharacterized protein LOC100805208 [Glycine max]
Length = 664
Score = 838 bits (2164), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/594 (71%), Positives = 493/594 (82%), Gaps = 11/594 (1%)
Query: 32 DSEEKESEDSVDWESEFLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKS 91
DS++K E S DWE EFLGELDPFGY+APKKR+K+++SK+++ +GMDWCVRARK AL+S
Sbjct: 70 DSDDKGEESSTDWELEFLGELDPFGYRAPKKREKEQRSKLLEATDGMDWCVRARKKALES 129
Query: 92 IEARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVTDD--DLDFDLEDDMKMDDIMGSG 149
IEARG+A +ED++ VKKKKKK KKKLE KK K + DLDF LE+D+ +
Sbjct: 130 IEARGMAHLVEDMVTVKKKKKKDKKKLESKKKVVKKIEKIEDLDFVLEEDL----LQPMK 185
Query: 150 NGYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTA 209
D+ DL+R VSM GMF EK+EKT E FV+RLSQFSGPS+ RKEINLNK I +A+TA
Sbjct: 186 PEIDVGDLKRRVSMFNDGMFIEKKEKTKEAFVNRLSQFSGPSDHRKEINLNKAITEARTA 245
Query: 210 QEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSM 269
+VLEV E I AV KGLSPSPLSPLNIATALHRIAKNMEKVSMM T RLAF RQREMSM
Sbjct: 246 DDVLEVTYETIVAVAKGLSPSPLSPLNIATALHRIAKNMEKVSMMRTRRLAFARQREMSM 305
Query: 270 LVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVA 329
LV+IAMTALPECSAQG+SNI+WALSKIGGELLYLSEMDR+AEVALTKVGEFNSQN+AN+A
Sbjct: 306 LVSIAMTALPECSAQGVSNISWALSKIGGELLYLSEMDRIAEVALTKVGEFNSQNIANIA 365
Query: 330 GAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKD 389
GAFA+MQHSAPDLFS L++RASDI+HTFQEQELAQ+LWAFASLYEPADP+ +SLD FKD
Sbjct: 366 GAFAAMQHSAPDLFSVLSERASDIIHTFQEQELAQLLWAFASLYEPADPIFDSLDIVFKD 425
Query: 390 ATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDR 449
+Q C + SN +E V SG ++ SPVL+ RDQLG IAWSYAV GQMDR
Sbjct: 426 HSQLRGCTGERTSNNHEQIRVDRSGASN-----GSPVLTLTRDQLGTIAWSYAVFGQMDR 480
Query: 450 IFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIAS 509
FFS +WKT+S +EE+RISE YREDIMFASQVHLVNQCLKLE PHLQL+L LE+K+A
Sbjct: 481 SFFSHVWKTLSHYEERRISELYREDIMFASQVHLVNQCLKLEFPHLQLSLCGDLEDKVAL 540
Query: 510 AGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHF 569
A KTKRFNQK+TSSFQKEV RLL+STGL W++EY VDGYT+DAV+VDKK+A EIDGPTHF
Sbjct: 541 ARKTKRFNQKITSSFQKEVGRLLLSTGLEWVKEYVVDGYTLDAVIVDKKLALEIDGPTHF 600
Query: 570 SRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYI 623
SRNTGVPLGHTMLKRRYI AAGW V S+S QEWEELQG+FEQ++YLR +LK+++
Sbjct: 601 SRNTGVPLGHTMLKRRYITAAGWKVASVSSQEWEELQGAFEQVEYLRNLLKNHL 654
>gi|356522646|ref|XP_003529957.1| PREDICTED: uncharacterized protein LOC100794144 [Glycine max]
Length = 669
Score = 830 bits (2145), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/593 (71%), Positives = 492/593 (82%), Gaps = 8/593 (1%)
Query: 33 SEEKESEDSVDWESEFLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKSI 92
S +K + DWESEFLGELDPFGY+APKKR+K+++S +++ +GMDWCVRARK ALKSI
Sbjct: 70 SNDKGEGSNTDWESEFLGELDPFGYRAPKKREKEKRSMLLEATDGMDWCVRARKEALKSI 129
Query: 93 EARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVTDD--DLDFDLEDDMKMDDIMGSGN 150
EARG+A ME+++ VKKKKKK KKKLE KK K + DLDF LE+D+
Sbjct: 130 EARGMAHLMENMVTVKKKKKKDKKKLESKKKIVKKIEKIEDLDFSLEEDLPQP----MET 185
Query: 151 GYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQ 210
D+ DL+R VS+ GMF EK+EKT EEFV+RLSQFSGPS+ RKEINLNK I +AQTA
Sbjct: 186 EIDVGDLKRRVSIFNDGMFIEKKEKTKEEFVNRLSQFSGPSDHRKEINLNKAITEAQTAD 245
Query: 211 EVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSML 270
+VLEV E I AV KGLSPSPLSPLNIATALHRIAKNMEKVSMM T RLAF RQREMSML
Sbjct: 246 DVLEVTYETIVAVAKGLSPSPLSPLNIATALHRIAKNMEKVSMMRTRRLAFARQREMSML 305
Query: 271 VAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAG 330
V+IAMTALPECSAQG+SNI+WALSKIGGELLYLSEMDR+AEVALTKVGEFNSQN+AN+AG
Sbjct: 306 VSIAMTALPECSAQGVSNISWALSKIGGELLYLSEMDRIAEVALTKVGEFNSQNIANIAG 365
Query: 331 AFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDA 390
AFA+MQHSAPDLFSE +KRASDI+HTFQEQELAQ+LWAFASLYEPADP+ +SLD FKD
Sbjct: 366 AFAAMQHSAPDLFSEFSKRASDIIHTFQEQELAQLLWAFASLYEPADPIFDSLDIVFKDH 425
Query: 391 TQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRI 450
+Q C+ + SN +E V SG S GSL SPVL+ RDQLG IAWSYAV GQM R
Sbjct: 426 SQLRGCIGEKTSNNHEQISVDRSG--ASNGSLGSPVLTLTRDQLGTIAWSYAVFGQMARS 483
Query: 451 FFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASA 510
FFS +WKT+S +EEQRISE YREDIMFASQVHLVNQCLKLE PHLQL+L LE+K+A +
Sbjct: 484 FFSHVWKTLSHYEEQRISELYREDIMFASQVHLVNQCLKLEFPHLQLSLCGELEDKVALS 543
Query: 511 GKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFS 570
GKTKRFNQK+TSSFQKEV LLVSTGL W++E+ VDGYT+DAV+VDKK+A EIDGPTHFS
Sbjct: 544 GKTKRFNQKITSSFQKEVGHLLVSTGLEWVKEFVVDGYTLDAVIVDKKLALEIDGPTHFS 603
Query: 571 RNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYI 623
RNTGVPLGHTMLKRRYI AAGW V S+S+Q+WEELQG+FEQ++YL +LK+++
Sbjct: 604 RNTGVPLGHTMLKRRYITAAGWKVASISYQKWEELQGAFEQVEYLSNLLKNHL 656
>gi|449505631|ref|XP_004162527.1| PREDICTED: uncharacterized protein LOC101223645 [Cucumis sativus]
Length = 633
Score = 799 bits (2064), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/605 (68%), Positives = 488/605 (80%), Gaps = 12/605 (1%)
Query: 29 EVDDSEEKESEDSVDWESEFLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVA 88
E+ DS + + D+++WE E L ELDP G+Q PKK+KKQ KSK++DD EGMDWC+RARKVA
Sbjct: 28 EIGDS--RGNGDNMEWEGELLQELDPLGFQPPKKKKKQMKSKLLDDTEGMDWCLRARKVA 85
Query: 89 LKSIEARGLASSMEDLIKVKKKKKKGK-------KKLEKIKKKNKVTDDDLDFDLEDDMK 141
L+SIE RGLAS+ EDL VKKK KK K K + K V ++ L+FD ++D++
Sbjct: 86 LRSIEGRGLASTEEDLFSVKKKNKKNKKKKKIMGSKDNGVNTKGDVIEESLEFDSDEDLE 145
Query: 142 MDDIMGSGNGYDMND---LRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEIN 198
+D + + +ND L ++VS+M GGMFE+++EKT+EEF+ RLS+FSGPS+R+KE+N
Sbjct: 146 LDMDLDLLDSLAINDSNHLSKSVSIMGGGMFEQRKEKTMEEFIQRLSKFSGPSDRKKEVN 205
Query: 199 LNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHR 258
LN+ I++AQTA E LEVI++MI AVGKGLSPSPLSPLNIATALHRIAKNM+KV MM +HR
Sbjct: 206 LNRAIIEAQTADEALEVISDMILAVGKGLSPSPLSPLNIATALHRIAKNMDKVLMMKSHR 265
Query: 259 LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVG 318
LAF R+REMSMLV IAMT LPECSAQGISNIAWALSKIGG+ LYLSEMDRVAEV LTK+
Sbjct: 266 LAFARRREMSMLVGIAMTTLPECSAQGISNIAWALSKIGGDQLYLSEMDRVAEVTLTKIE 325
Query: 319 EFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADP 378
E NSQNVAN+AGAFASMQHSA DLFS LAKRASDIV TF EQELAQVLWAFASL E AD
Sbjct: 326 ELNSQNVANIAGAFASMQHSASDLFSGLAKRASDIVDTFHEQELAQVLWAFASLNESADL 385
Query: 379 LLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIA 438
LLESLDN + DA+Q TC L++ N N+ V S D +S+G++ PVL FNR+QLGNIA
Sbjct: 386 LLESLDNVYNDASQITCYLSEQTVNRNQESTVGVSNDLESDGAVGFPVLKFNRNQLGNIA 445
Query: 439 WSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLA 498
WSYAV GQ+DR FFS IW+TIS FE++ ISEQ+R DI+FASQ+ LV+ CLK E+ HLQL+
Sbjct: 446 WSYAVFGQVDRSFFSHIWRTISYFEKESISEQHRNDIIFASQLWLVHYCLKREYSHLQLS 505
Query: 499 LSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKK 558
LS LEEK AGKTKRFNQK TSSFQKEVARLLVSTG W REY D YT+DAV+VDKK
Sbjct: 506 LSVDLEEKAILAGKTKRFNQKTTSSFQKEVARLLVSTGHEWTREYVFDAYTLDAVIVDKK 565
Query: 559 VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVI 618
V EIDGPTHFSRNTG+PLGHT+LKRRYI AAGW VVSLSHQEWEELQG EQL+YLR I
Sbjct: 566 VVLEIDGPTHFSRNTGIPLGHTVLKRRYITAAGWKVVSLSHQEWEELQGEVEQLNYLREI 625
Query: 619 LKDYI 623
LKD+I
Sbjct: 626 LKDHI 630
>gi|449442355|ref|XP_004138947.1| PREDICTED: uncharacterized protein LOC101211080 [Cucumis sativus]
Length = 671
Score = 799 bits (2064), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/605 (68%), Positives = 488/605 (80%), Gaps = 12/605 (1%)
Query: 29 EVDDSEEKESEDSVDWESEFLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVA 88
E+ DS + + D+++WE E L ELDP G+Q PKK+KKQ KSK++DD EGMDWC+RARKVA
Sbjct: 66 EIGDS--RGNGDNMEWEGELLQELDPLGFQPPKKKKKQMKSKLLDDTEGMDWCLRARKVA 123
Query: 89 LKSIEARGLASSMEDLIKVKKKKKKGK-------KKLEKIKKKNKVTDDDLDFDLEDDMK 141
L+SIE RGLAS+ EDL VKKK KK K K + K V ++ L+FD ++D++
Sbjct: 124 LRSIEGRGLASTEEDLFSVKKKNKKNKKKKKIMGSKDNGVNTKGDVIEESLEFDSDEDLE 183
Query: 142 MDDIMGSGNGYDMND---LRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEIN 198
+D + + +ND L ++VS+M GGMFE+++EKT+EEF+ RLS+FSGPS+R+KE+N
Sbjct: 184 LDMDLDLLDSLAINDSNHLSKSVSIMGGGMFEQRKEKTMEEFIQRLSKFSGPSDRKKEVN 243
Query: 199 LNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHR 258
LN+ I++AQTA E LEVI++MI AVGKGLSPSPLSPLNIATALHRIAKNM+KV MM +HR
Sbjct: 244 LNRAIIEAQTADEALEVISDMILAVGKGLSPSPLSPLNIATALHRIAKNMDKVLMMKSHR 303
Query: 259 LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVG 318
LAF R+REMSMLV IAMT LPECSAQGISNIAWALSKIGG+ LYLSEMDRVAEV LTK+
Sbjct: 304 LAFARRREMSMLVGIAMTTLPECSAQGISNIAWALSKIGGDQLYLSEMDRVAEVTLTKIE 363
Query: 319 EFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADP 378
E NSQNVAN+AGAFASMQHSA DLFS LAKRASDIV TF EQELAQVLWAFASL E AD
Sbjct: 364 ELNSQNVANIAGAFASMQHSASDLFSGLAKRASDIVDTFHEQELAQVLWAFASLNESADL 423
Query: 379 LLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIA 438
LLESLDN + DA+Q TC L++ N N+ V S D +S+G++ PVL FNR+QLGNIA
Sbjct: 424 LLESLDNVYNDASQITCYLSEQTVNRNQESTVGVSNDLESDGAVGFPVLKFNRNQLGNIA 483
Query: 439 WSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLA 498
WSYAV GQ+DR FFS IW+TIS FE++ ISEQ+R DI+FASQ+ LV+ CLK E+ HLQL+
Sbjct: 484 WSYAVFGQVDRSFFSHIWRTISYFEKESISEQHRNDIIFASQLWLVHYCLKREYSHLQLS 543
Query: 499 LSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKK 558
LS LEEK AGKTKRFNQK TSSFQKEVARLLVSTG W REY D YT+DAV+VDKK
Sbjct: 544 LSVDLEEKAILAGKTKRFNQKTTSSFQKEVARLLVSTGHEWTREYVFDAYTLDAVIVDKK 603
Query: 559 VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVI 618
V EIDGPTHFSRNTG+PLGHT+LKRRYI AAGW VVSLSHQEWEELQG EQL+YLR I
Sbjct: 604 VVLEIDGPTHFSRNTGIPLGHTVLKRRYITAAGWKVVSLSHQEWEELQGEVEQLNYLREI 663
Query: 619 LKDYI 623
LKD+I
Sbjct: 664 LKDHI 668
>gi|4887747|gb|AAD32283.1| hypothetical protein [Arabidopsis thaliana]
Length = 627
Score = 766 bits (1977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/583 (69%), Positives = 473/583 (81%), Gaps = 20/583 (3%)
Query: 48 FLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKSIEARGLASSMEDLIKV 107
FLGE+DP Q PKKRKKQ+ SK ++D EGMDWCVRARK+ALKSIEARGL+S M +++ +
Sbjct: 58 FLGEIDPLDIQPPKKRKKQKNSKALEDTEGMDWCVRARKIALKSIEARGLSSRMAEVMPL 117
Query: 108 KKKKKKGKKKLEKIKKKNKVTDDDLDFDLE-------DDMKMDDIMGSGNGYDMNDLRRT 160
KKKKKK KK+ K K K D +D ++D MG DLR+
Sbjct: 118 KKKKKKKSKKVIVKKDKVKSKSIPEDDFDTEDEDLDFEDGFVEDKMG--------DLRKR 169
Query: 161 VSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMI 220
VS +AGGMFEEK+EK E+ RLSQFSGPS+R KEINLNK I++AQTA+EVLEV AE I
Sbjct: 170 VSSLAGGMFEEKKEKMKEQLAQRLSQFSGPSDRMKEINLNKAIIEAQTAEEVLEVTAETI 229
Query: 221 TAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPE 280
AV KGLSPSPLSPLNIATALHRIAKNMEKVSMM T RLAF RQREMSMLVA+AMT LPE
Sbjct: 230 MAVAKGLSPSPLSPLNIATALHRIAKNMEKVSMMRTRRLAFARQREMSMLVALAMTCLPE 289
Query: 281 CSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
CSAQGISNI+WALSKIGGELLYL+EMDRVAEVA +KVGEFNSQNVAN+AGAFASM+HSAP
Sbjct: 290 CSAQGISNISWALSKIGGELLYLTEMDRVAEVATSKVGEFNSQNVANIAGAFASMRHSAP 349
Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKA 400
+LF+EL+KRAS I++TF+ QE+AQ+LW+FASLYEPADPLLESLD+AFK + QF C L K
Sbjct: 350 ELFAELSKRASTIINTFKGQEIAQLLWSFASLYEPADPLLESLDSAFKSSDQFKCYLTKE 409
Query: 401 LSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTIS 460
++N +E + S D SP LSFNRDQLGNIAWSYAVLGQ++R FF++IW T++
Sbjct: 410 ITNSDEVVDAEVSDDVS-----RSPALSFNRDQLGNIAWSYAVLGQVERPFFANIWNTLT 464
Query: 461 RFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKV 520
EEQR+SEQYRED+MFASQV+LVNQCLKLE PHLQL+L LEEKI+ AGKTKRFNQK+
Sbjct: 465 TLEEQRLSEQYREDVMFASQVYLVNQCLKLECPHLQLSLCQELEEKISRAGKTKRFNQKI 524
Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHT 580
TSSFQKEV RLL+STGL+W +E+ VDGYTVD LV+KKVA EIDGPTHFSRN+G+PLGHT
Sbjct: 525 TSSFQKEVGRLLISTGLDWAKEHDVDGYTVDVALVEKKVALEIDGPTHFSRNSGLPLGHT 584
Query: 581 MLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYI 623
MLKRRY+AAAGW VVSLS QEWEE +GS EQL+YLR IL I
Sbjct: 585 MLKRRYVAAAGWKVVSLSLQEWEEHEGSHEQLEYLREILTGCI 627
>gi|30685105|ref|NP_850176.1| protein RAP [Arabidopsis thaliana]
gi|18086393|gb|AAL57655.1| At2g31890/F20M17.7 [Arabidopsis thaliana]
gi|22136584|gb|AAM91078.1| At2g31890/F20M17.7 [Arabidopsis thaliana]
gi|330253506|gb|AEC08600.1| protein RAP [Arabidopsis thaliana]
Length = 671
Score = 763 bits (1971), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/583 (69%), Positives = 473/583 (81%), Gaps = 20/583 (3%)
Query: 48 FLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKSIEARGLASSMEDLIKV 107
FLGE+DP Q PKKRKKQ+ SK ++D EGMDWCVRARK+ALKSIEARGL+S M +++ +
Sbjct: 102 FLGEIDPLDIQPPKKRKKQKNSKALEDTEGMDWCVRARKIALKSIEARGLSSRMAEVMPL 161
Query: 108 KKKKKKGKKKLEKIKKKNKVTDDDLDFDLE-------DDMKMDDIMGSGNGYDMNDLRRT 160
KKKKKK KK+ K K K D +D ++D MG DLR+
Sbjct: 162 KKKKKKKSKKVIVKKDKVKSKSIPEDDFDTEDEDLDFEDGFVEDKMG--------DLRKR 213
Query: 161 VSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMI 220
VS +AGGMFEEK+EK E+ RLSQFSGPS+R KEINLNK I++AQTA+EVLEV AE I
Sbjct: 214 VSSLAGGMFEEKKEKMKEQLAQRLSQFSGPSDRMKEINLNKAIIEAQTAEEVLEVTAETI 273
Query: 221 TAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPE 280
AV KGLSPSPLSPLNIATALHRIAKNMEKVSMM T RLAF RQREMSMLVA+AMT LPE
Sbjct: 274 MAVAKGLSPSPLSPLNIATALHRIAKNMEKVSMMRTRRLAFARQREMSMLVALAMTCLPE 333
Query: 281 CSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
CSAQGISNI+WALSKIGGELLYL+EMDRVAEVA +KVGEFNSQNVAN+AGAFASM+HSAP
Sbjct: 334 CSAQGISNISWALSKIGGELLYLTEMDRVAEVATSKVGEFNSQNVANIAGAFASMRHSAP 393
Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKA 400
+LF+EL+KRAS I++TF+ QE+AQ+LW+FASLYEPADPLLESLD+AFK + QF C L K
Sbjct: 394 ELFAELSKRASTIINTFKGQEIAQLLWSFASLYEPADPLLESLDSAFKSSDQFKCYLTKE 453
Query: 401 LSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTIS 460
++N +E + S D SP LSFNRDQLGNIAWSYAVLGQ++R FF++IW T++
Sbjct: 454 ITNSDEVVDAEVSDDVS-----RSPALSFNRDQLGNIAWSYAVLGQVERPFFANIWNTLT 508
Query: 461 RFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKV 520
EEQR+SEQYRED+MFASQV+LVNQCLKLE PHLQL+L LEEKI+ AGKTKRFNQK+
Sbjct: 509 TLEEQRLSEQYREDVMFASQVYLVNQCLKLECPHLQLSLCQELEEKISRAGKTKRFNQKI 568
Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHT 580
TSSFQKEV RLL+STGL+W +E+ VDGYTVD LV+KKVA EIDGPTHFSRN+G+PLGHT
Sbjct: 569 TSSFQKEVGRLLISTGLDWAKEHDVDGYTVDVALVEKKVALEIDGPTHFSRNSGLPLGHT 628
Query: 581 MLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYI 623
MLKRRY+AAAGW VVSLS QEWEE +GS EQL+YLR IL I
Sbjct: 629 MLKRRYVAAAGWKVVSLSLQEWEEHEGSHEQLEYLREILTGCI 671
>gi|297826641|ref|XP_002881203.1| hypothetical protein ARALYDRAFT_902227 [Arabidopsis lyrata subsp.
lyrata]
gi|297327042|gb|EFH57462.1| hypothetical protein ARALYDRAFT_902227 [Arabidopsis lyrata subsp.
lyrata]
Length = 668
Score = 759 bits (1961), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/584 (70%), Positives = 477/584 (81%), Gaps = 22/584 (3%)
Query: 48 FLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKSIEARGLASSMEDLIKV 107
FLGE+DP Q PKKRKKQ+ SKV++D EGMDWCVRARK+ALKSIEARGL+S M +++ +
Sbjct: 99 FLGEIDPLDIQPPKKRKKQKNSKVLEDTEGMDWCVRARKIALKSIEARGLSSRMAEVMPL 158
Query: 108 KKKKKKGKKKLEKIKKKNKVTD--------DDLDFDLEDDMKMDDIMGSGNGYDMNDLRR 159
KKKKKK KK+ K+K K +D D D ED + ++D MG DLR+
Sbjct: 159 KKKKKKKSKKVIVKKEKVKTKSILEEDFDTEDEDLDFEDGL-VEDKMG--------DLRK 209
Query: 160 TVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEM 219
VS +AGGMFEEK+EK E+ RLSQFSGPS+R KEINLNK I++AQTA+EVLEV +E
Sbjct: 210 RVSSLAGGMFEEKKEKMKEQLAQRLSQFSGPSDRMKEINLNKAIIEAQTAEEVLEVTSET 269
Query: 220 ITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALP 279
I AV KGLSPSPLSPLNIATALHRIAKNMEKVSMM T RLAF RQREMSMLVA+AMT LP
Sbjct: 270 IMAVAKGLSPSPLSPLNIATALHRIAKNMEKVSMMRTRRLAFARQREMSMLVALAMTCLP 329
Query: 280 ECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
ECSAQGISNI+WALSKIGGELLYL+EMDRVAEVA +KVGEFNSQNVAN+AGAFASM+HSA
Sbjct: 330 ECSAQGISNISWALSKIGGELLYLTEMDRVAEVATSKVGEFNSQNVANIAGAFASMRHSA 389
Query: 340 PDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNK 399
P+LF+EL+KRAS I+ TF+ QE+AQ+LW+FASL EPADPLLESLD+AFK + QF C L K
Sbjct: 390 PELFAELSKRASTIIITFKGQEIAQLLWSFASLNEPADPLLESLDSAFKSSDQFKCYLTK 449
Query: 400 ALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTI 459
++N +E V+ S DA SP LSFNRDQLGNIAWSYAVLGQ++R FF++IW ++
Sbjct: 450 EITNSDEVVDVEVSDDAS-----GSPPLSFNRDQLGNIAWSYAVLGQVERPFFANIWNSL 504
Query: 460 SRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQK 519
+ EEQR+SEQYRED+MFASQV LVNQCLKLE PHLQL+L LEEKI AGKTKRFNQK
Sbjct: 505 TTLEEQRLSEQYREDVMFASQVFLVNQCLKLECPHLQLSLCHGLEEKITRAGKTKRFNQK 564
Query: 520 VTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGH 579
++SSFQKEV RLL+STGL+W +E+ VDGYTVD LVDKKVA EIDGPTHFSRN+G+PLGH
Sbjct: 565 ISSSFQKEVGRLLISTGLDWAKEHDVDGYTVDVALVDKKVALEIDGPTHFSRNSGIPLGH 624
Query: 580 TMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYI 623
TMLKRRY+AAAGW VVSLS QEWEE +GS EQL+YLR IL I
Sbjct: 625 TMLKRRYVAAAGWKVVSLSLQEWEEHEGSHEQLEYLREILNGCI 668
>gi|296084379|emb|CBI24767.3| unnamed protein product [Vitis vinifera]
Length = 439
Score = 756 bits (1951), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/466 (78%), Positives = 402/466 (86%), Gaps = 29/466 (6%)
Query: 168 MFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGL 227
MFEEK+EKT++ FV RLSQFSGPS+RRKEINLNK IV+AQTA+EVLEV AE I AVGKGL
Sbjct: 1 MFEEKKEKTMQAFVQRLSQFSGPSDRRKEINLNKAIVEAQTAEEVLEVAAETIMAVGKGL 60
Query: 228 SPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGIS 287
SPSPLSPLNIATALHRIAKNMEKVSMMT+ RLAF RQ+EMSMLV IAMTALPECSAQGIS
Sbjct: 61 SPSPLSPLNIATALHRIAKNMEKVSMMTSRRLAFARQKEMSMLVGIAMTALPECSAQGIS 120
Query: 288 NIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELA 347
NI+WALSKIGGELLYLSEMDRVAEVALTKV +FNSQNVANVAGAFASM+HSAPDLFSEL+
Sbjct: 121 NISWALSKIGGELLYLSEMDRVAEVALTKVEQFNSQNVANVAGAFASMRHSAPDLFSELS 180
Query: 348 KRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNEN 407
+RAS+IVH FQEQELAQVLWAFASL EPA PLLESLDN F D QF CCL++
Sbjct: 181 ERASNIVHNFQEQELAQVLWAFASLNEPAGPLLESLDNVFNDENQFKCCLDQETL----- 235
Query: 408 GGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRI 467
+DQLGNIAWSYAVLGQMDR+FFS +WKT+S FEEQRI
Sbjct: 236 -----------------------KDQLGNIAWSYAVLGQMDRVFFSHVWKTLSHFEEQRI 272
Query: 468 SEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKE 527
SEQYREDIMFASQVHLVNQCLKLE+PHL+L+L S LEEK+A AGKTKRFNQK+TSSFQKE
Sbjct: 273 SEQYREDIMFASQVHLVNQCLKLEYPHLRLSLRSDLEEKVARAGKTKRFNQKMTSSFQKE 332
Query: 528 VARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYI 587
VA LLVSTGL+W+REY VDGYT+DAVLVD+KVA EIDGPTHFSRN+GVPLGHTMLKRRYI
Sbjct: 333 VAHLLVSTGLDWVREYVVDGYTLDAVLVDQKVALEIDGPTHFSRNSGVPLGHTMLKRRYI 392
Query: 588 AAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIGGEGSSNIAE 633
AAGW + S+SHQEWEELQG FEQLDYLR ILKD+I GEGS+NI +
Sbjct: 393 TAAGWKLASVSHQEWEELQGGFEQLDYLREILKDHI-GEGSANIVQ 437
>gi|414875853|tpg|DAA52984.1| TPA: hypothetical protein ZEAMMB73_380323 [Zea mays]
Length = 641
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/591 (60%), Positives = 436/591 (73%), Gaps = 15/591 (2%)
Query: 38 SEDSVD----WESEFLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKSIE 93
SED D W+ +FLG AP ++E+ ++ E DWCVRAR+ AL+SIE
Sbjct: 52 SEDRTDSTPQWQLDFLGA----SAVAPDSPVEEEEEDLLP-AEATDWCVRARRSALRSIE 106
Query: 94 ARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVT----DDDLDFDLEDDMKMDDIMGSG 149
RGLA +++ ++ KK KK K +K KK L D+ DD
Sbjct: 107 ERGLAPALQRMVSPPKKTKKKKTAKKKELKKAAAELKRRTKQLADAEGDEDDDDDYDVVD 166
Query: 150 NGYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSG-PSNRRKEINLNKDIVDAQT 208
+ +M+DL V+ A GMF+EKR++ E FV LS+FS PSNR KE++LN+ IV AQT
Sbjct: 167 DLQNMDDLELRVAQFADGMFDEKRQRNRETFVQTLSRFSAAPSNRSKEVSLNRSIVQAQT 226
Query: 209 AQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMS 268
A EVL++ AE+ITAV KGLSPSPL+PLNIATALHRIA+NME VSMM THRLAF RQR+MS
Sbjct: 227 ANEVLDLTAEVITAVAKGLSPSPLTPLNIATALHRIARNMEAVSMMQTHRLAFARQRDMS 286
Query: 269 MLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANV 328
MLV +AM ALPECS QG+SNIAWALSKIGG+LLYL EMDR+A+VA+ KV +FN+QNVANV
Sbjct: 287 MLVGLAMVALPECSPQGVSNIAWALSKIGGDLLYLPEMDRIADVAMAKVQDFNAQNVANV 346
Query: 329 AGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFK 388
AGAFASM+ SAP LFS LA RA+ ++ TF+EQELAQ LW ASL E PLL++LD AF+
Sbjct: 347 AGAFASMRQSAPGLFSSLAMRAAQLLQTFKEQELAQFLWGCASLNECPHPLLDALDTAFQ 406
Query: 389 DATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMD 448
+ T F C + S+ + + + SG D S S+ L+FNRDQ+GNIAWSYAV+GQMD
Sbjct: 407 NDTSFQCHVTDIKSSAHWSSAEELSGGEDGSTS-SARTLNFNRDQVGNIAWSYAVIGQMD 465
Query: 449 RIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIA 508
R FFS +W+T+SRFEEQR+S+QYRED+MFASQV+L NQ LKLE+ +L L L S LEEKIA
Sbjct: 466 RPFFSHMWRTLSRFEEQRVSDQYREDMMFASQVYLANQSLKLEYRNLGLCLRSDLEEKIA 525
Query: 509 SAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTH 568
AGK+KRFNQK TSSFQKEV RLL STG W+REYA+DGYTVDAVLVD+K+AFEIDGPTH
Sbjct: 526 KAGKSKRFNQKTTSSFQKEVGRLLYSTGHEWVREYAIDGYTVDAVLVDEKLAFEIDGPTH 585
Query: 569 FSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
FSRN G PLGHT KRRYI A+GW +VSLS QEWE LQG FEQL+YLR IL
Sbjct: 586 FSRNLGTPLGHTAFKRRYITASGWKLVSLSLQEWENLQGEFEQLEYLRRIL 636
>gi|242056075|ref|XP_002457183.1| hypothetical protein SORBIDRAFT_03g002900 [Sorghum bicolor]
gi|241929158|gb|EES02303.1| hypothetical protein SORBIDRAFT_03g002900 [Sorghum bicolor]
Length = 640
Score = 672 bits (1734), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/553 (62%), Positives = 426/553 (77%), Gaps = 15/553 (2%)
Query: 76 EGMDWCVRARKVALKSIEARGLASSMEDLIK--------VKKKKKKGKKKLEKIKKKNKV 127
E DWCVRAR+ AL+SIE RGLA S++ ++ KKK+ KK ++K++NK
Sbjct: 89 EATDWCVRARRSALRSIEERGLAPSLQRMVSPPKKKKKKKTAKKKELKKAAAELKRRNKQ 148
Query: 128 TDDDLDFDLEDDMKMDDIMGSGNGYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQF 187
DD + +DD +DD+ +M+DL V+ A GMF+EKR++ E FV LS+F
Sbjct: 149 VDDAEGDEDDDDDVVDDLQ------NMDDLELRVAQFADGMFDEKRQRNRETFVQTLSRF 202
Query: 188 SG-PSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAK 246
S PSNR KE++LN+ IV AQTA EVL++ AE+ITAV KGLSPSPL+PLNIATALHRIA+
Sbjct: 203 SAAPSNRSKEVSLNRSIVQAQTANEVLDLTAEVITAVAKGLSPSPLTPLNIATALHRIAR 262
Query: 247 NMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEM 306
NME VSMM THRLAF RQR+MSMLV +AM ALPECS QG+SNIAWALSKIGG+LLYL EM
Sbjct: 263 NMEAVSMMQTHRLAFARQRDMSMLVGLAMVALPECSPQGVSNIAWALSKIGGDLLYLPEM 322
Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVL 366
DR+A+VA++KV +FN+QNVANVAGAFASM+ SAP LFS LA RA+ I+ TF+EQELAQ L
Sbjct: 323 DRIADVAMSKVQDFNAQNVANVAGAFASMRQSAPGLFSALALRAAQILQTFKEQELAQFL 382
Query: 367 WAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPV 426
W ASL E PLL++LD AF++ T F C ++ S+ +++ + + + S+
Sbjct: 383 WGCASLNECPHPLLDALDTAFQNDTSFQCHVSDLKSSAHQSSAEEELSGGEDGSTSSART 442
Query: 427 LSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQ 486
L+F+RDQ+GNIAWSYAV+GQMDR FFS +WKT+S+FEEQR+S+QYRED+MFASQV+L NQ
Sbjct: 443 LNFSRDQVGNIAWSYAVIGQMDRPFFSHMWKTLSQFEEQRVSDQYREDMMFASQVYLANQ 502
Query: 487 CLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVD 546
LKLE+ L L L S LEEK+ AGK+KRFNQK TSSFQKEV RLL STG W+REYA+D
Sbjct: 503 SLKLEYRDLGLCLRSDLEEKVTKAGKSKRFNQKTTSSFQKEVGRLLYSTGHEWVREYAID 562
Query: 547 GYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQ 606
GYTVDAVLVD+K+AFEIDGPTHFSRN G PLGHT KRRYI A+GW +VSLS QEWE+LQ
Sbjct: 563 GYTVDAVLVDEKLAFEIDGPTHFSRNLGTPLGHTAFKRRYITASGWKLVSLSLQEWEDLQ 622
Query: 607 GSFEQLDYLRVIL 619
G FEQL+YLR IL
Sbjct: 623 GEFEQLEYLRRIL 635
>gi|30089729|gb|AAP20833.1| expressed protein [Oryza sativa Japonica Group]
gi|108708908|gb|ABF96703.1| expressed protein [Oryza sativa Japonica Group]
gi|108708909|gb|ABF96704.1| expressed protein [Oryza sativa Japonica Group]
gi|108708910|gb|ABF96705.1| expressed protein [Oryza sativa Japonica Group]
gi|125586723|gb|EAZ27387.1| hypothetical protein OsJ_11335 [Oryza sativa Japonica Group]
Length = 640
Score = 665 bits (1715), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/552 (60%), Positives = 418/552 (75%), Gaps = 11/552 (1%)
Query: 76 EGMDWCVRARKVALKSIEARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVTDDDLDFD 135
E DWCVRAR+ AL+SIEARGL+ S++ ++ KKK K KK + K+ K + D
Sbjct: 87 ETNDWCVRARRSALRSIEARGLSPSLQRMVASPKKKNKKKKSKKTNLKQKKAAEPKPPRD 146
Query: 136 LEDDMKMDDIMGSG------NGYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFS- 188
+DD ++ G +++DL V+ A GMF+EKR++ E+F+ LS FS
Sbjct: 147 TDDDEDDEEEADDDLEALLAGGGELDDLELRVAQFADGMFDEKRQRNREQFIQTLSAFSP 206
Query: 189 -GPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKN 247
PSNR +E++LN+ IV+A+TA EVL + AE++ AV KGLSPSPL+PLNIATALHRIAKN
Sbjct: 207 AAPSNRSQEVSLNRSIVEARTADEVLALTAEVVAAVAKGLSPSPLTPLNIATALHRIAKN 266
Query: 248 MEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMD 307
ME VSM+ THRL F R R+MSMLV +AM ALPECS QG+SNI+WALSKIGG+LLYL EMD
Sbjct: 267 MEAVSMLQTHRLGFARSRDMSMLVGLAMVALPECSPQGVSNISWALSKIGGDLLYLPEMD 326
Query: 308 RVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLW 367
R+A+VA+TKV FN+QNVANVAG+FASM+HSAPDL S L +RA+++V+TF+EQELAQ LW
Sbjct: 327 RIAQVAITKVDSFNAQNVANVAGSFASMRHSAPDLISALTRRAAELVYTFKEQELAQFLW 386
Query: 368 AFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVL 427
ASL E PLL++LD A +DA F C L+ + ++ ++S +S + + L
Sbjct: 387 GCASLNECPYPLLDALDTACRDAPSFDCHLHDTVPGMWQSSDKEASSLKNSSNAYA---L 443
Query: 428 SFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQC 487
+F RDQ+GNIAWSYAVLGQMDR FFS IWKT+S+FEE++IS+QYRED+MF SQV+L NQ
Sbjct: 444 NFTRDQIGNIAWSYAVLGQMDRPFFSGIWKTLSQFEERKISDQYREDMMFVSQVYLANQS 503
Query: 488 LKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDG 547
LKLE+PHL + L LEE + G++KRFNQK+TSSFQKEV RLL STG W +EY +DG
Sbjct: 504 LKLEYPHLDMCLRGDLEENLTKTGRSKRFNQKMTSSFQKEVGRLLCSTGHEWNKEYTIDG 563
Query: 548 YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQG 607
YTVDAVLVD+K+AFEIDGP+HFSRN G PLGHT KRRYIAAAGWN+VSLSHQEWE L+G
Sbjct: 564 YTVDAVLVDEKLAFEIDGPSHFSRNLGTPLGHTAFKRRYIAAAGWNLVSLSHQEWENLEG 623
Query: 608 SFEQLDYLRVIL 619
FEQL+YLR IL
Sbjct: 624 EFEQLEYLRRIL 635
>gi|125544383|gb|EAY90522.1| hypothetical protein OsI_12123 [Oryza sativa Indica Group]
Length = 640
Score = 665 bits (1715), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/552 (60%), Positives = 418/552 (75%), Gaps = 11/552 (1%)
Query: 76 EGMDWCVRARKVALKSIEARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVTDDDLDFD 135
E DWCVRAR+ AL+SIEARGL+ S++ ++ KKK K KK + K+ K + D
Sbjct: 87 ETNDWCVRARRSALRSIEARGLSPSLQRMVASPKKKNKKKKSKKTNLKQKKAAEPKPPRD 146
Query: 136 LEDDMKMDDIMGSG------NGYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFS- 188
+DD ++ G +++DL V+ A GMF+EKR++ E+F+ LS FS
Sbjct: 147 TDDDEDDEEEADDDLEALLAGGGELDDLELRVAQFADGMFDEKRQRNREQFIQTLSAFSP 206
Query: 189 -GPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKN 247
PSNR +E++LN+ IV+A+TA EVL + AE++ AV KGLSPSPL+PLNIATALHRIAKN
Sbjct: 207 AAPSNRSQEVSLNRSIVEARTADEVLALTAEVVAAVAKGLSPSPLTPLNIATALHRIAKN 266
Query: 248 MEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMD 307
ME VSM+ THRL F R R+MSMLV +AM ALPECS QG+SNI+WALSKIGG+LLYL EMD
Sbjct: 267 MEAVSMLQTHRLGFARSRDMSMLVGLAMVALPECSPQGVSNISWALSKIGGDLLYLPEMD 326
Query: 308 RVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLW 367
R+A+VA+TKV FN+QNVANVAG+FASM+HSAPDL S L +RA+++V+TF+EQELAQ LW
Sbjct: 327 RIAQVAITKVDSFNAQNVANVAGSFASMRHSAPDLISALTRRAAELVYTFKEQELAQFLW 386
Query: 368 AFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVL 427
ASL E PLL++LD A +DA F C L+ + ++ ++S +S + + L
Sbjct: 387 GCASLNECPYPLLDALDTACRDAPSFDCHLHDTVPGMWQSSDKEASSLKNSSNAYA---L 443
Query: 428 SFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQC 487
+F RDQ+GNIAWSYAVLGQMDR FFS IWKT+S+FEE++IS+QYRED+MF SQV+L NQ
Sbjct: 444 NFTRDQIGNIAWSYAVLGQMDRPFFSGIWKTLSQFEERKISDQYREDMMFVSQVYLANQS 503
Query: 488 LKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDG 547
LKLE+PHL + L LEE + G++KRFNQK+TSSFQKEV RLL STG W +EY +DG
Sbjct: 504 LKLEYPHLDMCLRGDLEENLTKTGRSKRFNQKMTSSFQKEVGRLLCSTGHEWNKEYTIDG 563
Query: 548 YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQG 607
YTVDAVLVD+K+AFEIDGP+HFSRN G PLGHT KRRYIAAAGWN+VSLSHQEWE L+G
Sbjct: 564 YTVDAVLVDEKLAFEIDGPSHFSRNLGTPLGHTAFKRRYIAAAGWNLVSLSHQEWENLEG 623
Query: 608 SFEQLDYLRVIL 619
FEQL+YLR IL
Sbjct: 624 EFEQLEYLRRIL 635
>gi|357161383|ref|XP_003579073.1| PREDICTED: uncharacterized protein LOC100844423 [Brachypodium
distachyon]
Length = 614
Score = 664 bits (1714), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/588 (59%), Positives = 428/588 (72%), Gaps = 32/588 (5%)
Query: 36 KESEDSVDWESEFLGELDPFGYQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKSIEAR 95
KE + + W+ +FLG P+ R +E+ E DWCVRAR+ AL+SIEAR
Sbjct: 50 KEDDATPQWQLDFLGP-------HPQPRPDEEEDDDPLPAESTDWCVRARRSALRSIEAR 102
Query: 96 GLASSMEDLIKVKKK--KKKGKKKLEKIKKKNKVTDDDLDFDLEDDMKMDDIMGSGNGYD 153
GL+ S++ ++ KK K +KK +KI K K +D+L D ED+M D +
Sbjct: 103 GLSPSLQRMVSPPKKISNNKKRKKQKKILDKKKKKNDELT-DEEDEMDSDAVP------- 154
Query: 154 MNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSG--PSNRRKEINLNKDIVDAQTAQE 211
+DL V+ +A G+F+EKR++ E F+ LS FS PSNR KE++LN+DIV A+TA+E
Sbjct: 155 -DDLDHRVAQLADGVFDEKRQRNRELFIQTLSSFSAAQPSNRSKEVSLNRDIVQARTAEE 213
Query: 212 VLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLV 271
VL + AE++ AV KGLSPSPL+PLNIATALHRIAKNME VSM THRLAF RQR+MSMLV
Sbjct: 214 VLALTAEVMAAVAKGLSPSPLTPLNIATALHRIAKNMETVSMTQTHRLAFARQRDMSMLV 273
Query: 272 AIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGA 331
+AM +LPECS QG+SNI+WALSKIGG+LLYL EMDR+A+VA++KV +FN+QNVANVAGA
Sbjct: 274 GLAMLSLPECSPQGVSNISWALSKIGGDLLYLPEMDRIAKVAISKVDDFNAQNVANVAGA 333
Query: 332 FASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDAT 391
FASM+ SAP LF LA+RA+ +V+TF+EQELAQ LW ASL E PLL++LD AF+D
Sbjct: 334 FASMRQSAPALFLALAQRAAQLVYTFKEQELAQFLWGCASLNECPYPLLDALDAAFQDGL 393
Query: 392 QFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIF 451
+S+ + +S D + + LSF+RDQLGNIAWSY VLGQ+DR F
Sbjct: 394 ---------VSDMRQTSAKDASSGEDVSNAHA---LSFSRDQLGNIAWSYTVLGQIDRQF 441
Query: 452 FSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAG 511
FS IWKT+ ++EEQR+S+QYREDIMFASQV+L NQ +KLE+PHL AL LEEKI AG
Sbjct: 442 FSHIWKTLKQYEEQRVSDQYREDIMFASQVYLANQSVKLEYPHLDFALRGDLEEKITKAG 501
Query: 512 KTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSR 571
K+KRFNQK TSSFQKEV LL TG WIREY VDGYT+DAVLVD+KVA EIDG THFSR
Sbjct: 502 KSKRFNQKTTSSFQKEVGHLLYITGHEWIREYTVDGYTLDAVLVDEKVALEIDGTTHFSR 561
Query: 572 NTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
N G PLGHT LKRRYI AGW +VSLSHQEWEELQG EQ++YLR IL
Sbjct: 562 NLGTPLGHTALKRRYITTAGWKLVSLSHQEWEELQGESEQMEYLRRIL 609
>gi|115453599|ref|NP_001050400.1| Os03g0425000 [Oryza sativa Japonica Group]
gi|113548871|dbj|BAF12314.1| Os03g0425000, partial [Oryza sativa Japonica Group]
Length = 615
Score = 664 bits (1712), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/552 (60%), Positives = 418/552 (75%), Gaps = 11/552 (1%)
Query: 76 EGMDWCVRARKVALKSIEARGLASSMEDLIKVKKKKKKGKKKLEKIKKKNKVTDDDLDFD 135
E DWCVRAR+ AL+SIEARGL+ S++ ++ KKK K KK + K+ K + D
Sbjct: 62 ETNDWCVRARRSALRSIEARGLSPSLQRMVASPKKKNKKKKSKKTNLKQKKAAEPKPPRD 121
Query: 136 LEDDMKMDDIMGSG------NGYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFS- 188
+DD ++ G +++DL V+ A GMF+EKR++ E+F+ LS FS
Sbjct: 122 TDDDEDDEEEADDDLEALLAGGGELDDLELRVAQFADGMFDEKRQRNREQFIQTLSAFSP 181
Query: 189 -GPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKN 247
PSNR +E++LN+ IV+A+TA EVL + AE++ AV KGLSPSPL+PLNIATALHRIAKN
Sbjct: 182 AAPSNRSQEVSLNRSIVEARTADEVLALTAEVVAAVAKGLSPSPLTPLNIATALHRIAKN 241
Query: 248 MEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMD 307
ME VSM+ THRL F R R+MSMLV +AM ALPECS QG+SNI+WALSKIGG+LLYL EMD
Sbjct: 242 MEAVSMLQTHRLGFARSRDMSMLVGLAMVALPECSPQGVSNISWALSKIGGDLLYLPEMD 301
Query: 308 RVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLW 367
R+A+VA+TKV FN+QNVANVAG+FASM+HSAPDL S L +RA+++V+TF+EQELAQ LW
Sbjct: 302 RIAQVAITKVDSFNAQNVANVAGSFASMRHSAPDLISALTRRAAELVYTFKEQELAQFLW 361
Query: 368 AFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVL 427
ASL E PLL++LD A +DA F C L+ + ++ ++S +S + + L
Sbjct: 362 GCASLNECPYPLLDALDTACRDAPSFDCHLHDTVPGMWQSSDKEASSLKNSSNAYA---L 418
Query: 428 SFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQC 487
+F RDQ+GNIAWSYAVLGQMDR FFS IWKT+S+FEE++IS+QYRED+MF SQV+L NQ
Sbjct: 419 NFTRDQIGNIAWSYAVLGQMDRPFFSGIWKTLSQFEERKISDQYREDMMFVSQVYLANQS 478
Query: 488 LKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDG 547
LKLE+PHL + L LEE + G++KRFNQK+TSSFQKEV RLL STG W +EY +DG
Sbjct: 479 LKLEYPHLDMCLRGDLEENLTKTGRSKRFNQKMTSSFQKEVGRLLCSTGHEWNKEYTIDG 538
Query: 548 YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQG 607
YTVDAVLVD+K+AFEIDGP+HFSRN G PLGHT KRRYIAAAGWN+VSLSHQEWE L+G
Sbjct: 539 YTVDAVLVDEKLAFEIDGPSHFSRNLGTPLGHTAFKRRYIAAAGWNLVSLSHQEWENLEG 598
Query: 608 SFEQLDYLRVIL 619
FEQL+YLR IL
Sbjct: 599 EFEQLEYLRRIL 610
>gi|168040935|ref|XP_001772948.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675681|gb|EDQ62173.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 453
Score = 343 bits (881), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 193/456 (42%), Positives = 269/456 (58%), Gaps = 34/456 (7%)
Query: 173 REKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPS-- 230
++ +E + +++GP+ R+E LN+ IV+A A+ VL I E + G P
Sbjct: 27 QDTPVERVASKEKEWTGPNQYREERRLNRAIVEAPDAEYVLATIIEALNKPHWG-KPRKI 85
Query: 231 PLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIA 290
PLSPLN AT LHRIAK M++ SM + +L F R++EM + A+ A PECSAQG++NIA
Sbjct: 86 PLSPLNCATGLHRIAKRMDEASMWKSEKLTFARRQEMKAFLRAAVKAFPECSAQGLANIA 145
Query: 291 WALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRA 350
WALSKIG L+ EMD +A+ AL K+ EFN+QN+AN AGAFASM H+AP LF +A+RA
Sbjct: 146 WALSKIGSSALFEEEMDHLADAALDKLSEFNAQNLANTAGAFASMLHAAPALFDAIAQRA 205
Query: 351 SDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGV 410
++ +F+ EL Q+LWAFA L P DPL +SLD V
Sbjct: 206 VEVAGSFRPLELVQILWAFACLNHPLDPLFDSLD-------------------------V 240
Query: 411 KSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQ 470
+ + D+ + F++ QL ++AWS AVL Q +R +F +WK ++ SE
Sbjct: 241 QLVENPDAAAAT---FRGFSQQQLASMAWSCAVLQQQERPWFISLWKCVNSRATTWTSEA 297
Query: 471 YRED--IMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEV 528
R+ + Q++ N LKLE L L LE + A + ++ K++S +EV
Sbjct: 298 DRKPKGVQHMCQLYQANLALKLECADLALTTEKELEIMLEEAWEKEKAANKLSSGDHREV 357
Query: 529 ARLLVST-GLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYI 587
RLLVST G W+ EY Y++D LVD +VA EIDGPTHFSRNTG+ LGHT+LKRR +
Sbjct: 358 DRLLVSTTGRAWVSEYEGAPYSLDLALVDARVAIEIDGPTHFSRNTGILLGHTVLKRRLL 417
Query: 588 AAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYI 623
+AGW V + QEWEEL+G E+ +LR +L+ I
Sbjct: 418 RSAGWTVFPIPFQEWEELRGEQERALFLRTLLEGSI 453
>gi|295829058|gb|ADG38198.1| AT2G31890-like protein [Capsella grandiflora]
Length = 164
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 129/154 (83%), Positives = 140/154 (90%)
Query: 179 EFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIA 238
+ RLSQFSGPS+R KEINLNK I++AQTA+EVLEV AE I AV KGLSPSPLSPLNIA
Sbjct: 11 QLAQRLSQFSGPSDRMKEINLNKAIIEAQTAEEVLEVTAEXIMAVAKGLSPSPLSPLNIA 70
Query: 239 TALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGG 298
TALHRIAKNMEKVSMM T RLAF RQREMSMLVA+AMT LPECSAQGISNI+WALSKIGG
Sbjct: 71 TALHRIAKNMEKVSMMRTRRLAFARQREMSMLVALAMTCLPECSAQGISNISWALSKIGG 130
Query: 299 ELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAF 332
ELLYL+EMDRVAEVA +KVG+FNSQNVAN+AGAF
Sbjct: 131 ELLYLTEMDRVAEVAXSKVGDFNSQNVANIAGAF 164
>gi|295829052|gb|ADG38195.1| AT2G31890-like protein [Capsella grandiflora]
gi|295829054|gb|ADG38196.1| AT2G31890-like protein [Capsella grandiflora]
gi|295829056|gb|ADG38197.1| AT2G31890-like protein [Capsella grandiflora]
gi|295829060|gb|ADG38199.1| AT2G31890-like protein [Capsella grandiflora]
gi|295829062|gb|ADG38200.1| AT2G31890-like protein [Neslia paniculata]
gi|345289971|gb|AEN81477.1| AT2G31890-like protein, partial [Capsella rubella]
gi|345289973|gb|AEN81478.1| AT2G31890-like protein, partial [Capsella rubella]
gi|345289975|gb|AEN81479.1| AT2G31890-like protein, partial [Capsella rubella]
gi|345289977|gb|AEN81480.1| AT2G31890-like protein, partial [Capsella rubella]
gi|345289979|gb|AEN81481.1| AT2G31890-like protein, partial [Capsella rubella]
gi|345289981|gb|AEN81482.1| AT2G31890-like protein, partial [Capsella rubella]
gi|345289983|gb|AEN81483.1| AT2G31890-like protein, partial [Capsella rubella]
gi|345289985|gb|AEN81484.1| AT2G31890-like protein, partial [Capsella rubella]
Length = 164
Score = 268 bits (686), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 129/154 (83%), Positives = 140/154 (90%)
Query: 179 EFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIA 238
+ RLSQFSGPS+R KEINLNK I++AQTA+EVLEV AE I AV KGLSPSPLSPLNIA
Sbjct: 11 QLAQRLSQFSGPSDRMKEINLNKAIIEAQTAEEVLEVTAETIMAVAKGLSPSPLSPLNIA 70
Query: 239 TALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGG 298
TALHRIAKNMEKVSMM T RLAF RQREMSMLVA+AMT LPECSAQGISNI+WALSKIGG
Sbjct: 71 TALHRIAKNMEKVSMMRTRRLAFARQREMSMLVALAMTCLPECSAQGISNISWALSKIGG 130
Query: 299 ELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAF 332
ELLYL+EMDRVAEVA +KVG+FNSQNVAN+AGAF
Sbjct: 131 ELLYLTEMDRVAEVATSKVGDFNSQNVANIAGAF 164
>gi|302780623|ref|XP_002972086.1| hypothetical protein SELMODRAFT_412577 [Selaginella moellendorffii]
gi|300160385|gb|EFJ27003.1| hypothetical protein SELMODRAFT_412577 [Selaginella moellendorffii]
Length = 296
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 89/190 (46%), Positives = 115/190 (60%), Gaps = 17/190 (8%)
Query: 199 LNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHR 258
L D+VD++ + VLE I + KG LS +N+ATALH+I +SM R
Sbjct: 78 LTVDLVDSRDVEGVLETIERV-----KG--RFRLSSINVATALHKIVT----LSMSEARR 126
Query: 259 LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVG 318
L + Q +++ LVA AM LPEC+AQG+SNIAWA+SKIGG LLY EM+ +A A+ KV
Sbjct: 127 LKYAMQCDVAELVASAMELLPECNAQGVSNIAWAISKIGGHLLYHGEMEIIARAAVAKVD 186
Query: 319 EFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADP 378
EFN QN+ANVAG FASMQHS+P LF +L AS V + A + +P D
Sbjct: 187 EFNPQNIANVAGTFASMQHSSPALFEKLLDAASRGVSSTGTGP------ASLGMAQPLDS 240
Query: 379 LLESLDNAFK 388
LESLD A +
Sbjct: 241 FLESLDAALQ 250
>gi|412993721|emb|CCO14232.1| predicted protein [Bathycoccus prasinos]
Length = 590
Score = 112 bits (280), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 113/412 (27%), Positives = 179/412 (43%), Gaps = 37/412 (8%)
Query: 232 LSPLNIATALHRIAKN-MEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQ----GI 286
+SP NIA + +I N ++ V M R R + LV + + A + S + +
Sbjct: 172 VSP-NIAGKMLQILGNKVQSVKMDRFERAGIRRDPRFAHLVGLTVAAARQNSEEFKTSAV 230
Query: 287 SNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSEL 346
W L+ + GE +EM+ ++ A V E ++V NVA A AS +H+ LFS +
Sbjct: 231 CQAIWGLAVVSGEAANAAEMEVLSNRAARSVVEMKPKDVTNVAWALASCRHANEGLFSAI 290
Query: 347 AKRASDI-VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCN 405
+ A + F ++ + WA A L D +++ + K SN
Sbjct: 291 NEYAEQGGLKGFDSFKITTLCWATAHLQMDGDGIIKGV--------------AKWASN-- 334
Query: 406 ENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTI-SRFEE 464
+ G + E V QL ++WS L + D SDI KT+ S
Sbjct: 335 ------APGSNEGEDGTQQTVNKLKGAQLCTLSWSLVNL-RNDVGLNSDILKTVWSHVCS 387
Query: 465 QRISEQYREDIMFA----SQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKV 520
Q +++ ED +Q++ + + L L EK ++A +R V
Sbjct: 388 QEGIKKFMEDDSIRGRDLNQLYQTAMAISSSDTNKNATLPDALMEKCSNAWAEQR-RPPV 446
Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNT-GVPLGH 579
S FQ++VA +L G + E V GY VD +L V E+DGP+HF+RN LG
Sbjct: 447 ISWFQRDVAAILSYMGEKYEEEAIVAGYRVDVLLESIGVVLEVDGPSHFARNVKDHALGQ 506
Query: 580 TMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIGGEGSSNI 631
T LKR + AAG+ + ++ EW+ L ++ DY+R L GE +I
Sbjct: 507 TNLKRNLLKAAGYKIFPIAVTEWDLLFNVEDKSDYVRAGLDALANGEDIPDI 558
>gi|255075447|ref|XP_002501398.1| hypothetical protein MICPUN_100065 [Micromonas sp. RCC299]
gi|226516662|gb|ACO62656.1| hypothetical protein MICPUN_100065 [Micromonas sp. RCC299]
Length = 571
Score = 109 bits (272), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 104/378 (27%), Positives = 175/378 (46%), Gaps = 46/378 (12%)
Query: 267 MSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVA 326
+ M VA A S ++N AWA+ I E +EM+ +A A + + + +A
Sbjct: 204 LGMCVAAARRGSDALSPVSVANAAWAVGVISTERANSAEMEVLAARAAQVTEDISKRGIA 263
Query: 327 NVAGAFASMQHSAPDLFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPADPLLESLDN 385
++A A AS +H++ +LF ++ RA+ + F+ +++ +++AFA L AD LE LD
Sbjct: 264 DLAWALASCRHASEELFQQIGIRAAVTGLKGFKAFDISTLVYAFAHLGHGADGFLEGLDQ 323
Query: 386 AFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLG 445
F G + G ++ + + SF L N AWS AV+G
Sbjct: 324 WFA------------------GGAEEDVGKEAADANAAKMAASFTAHPLVNTAWSLAVIG 365
Query: 446 --QMDRIFFSDIWKTISRFEEQRISEQYRED-------IMFAS----QVHLVNQCL-KLE 491
+ F+ +W I E +E D I + S ++ +NQ + +E
Sbjct: 366 GDALRSRAFAALWGEICARGEAAAAEGATVDPSLDGDRIQYGSWKGKNLNQINQAIVAVE 425
Query: 492 HPHLQLALS-SVLEEKIASAGKTKRFNQK---VTSSFQKEVARLLVSTGLNWIREYAVDG 547
AL+ + + +A ++ Q+ V S +Q++VA +L G E G
Sbjct: 426 SAGGAEALALRPAPDSLTAAAESAWMAQRRPPVVSWYQRDVASILSYMGEKHEEEAVCGG 485
Query: 548 YTVDAVLVDK--------KVAFEIDGPTHFSRNTG-VPLGHTMLKRRYIAAAGWNVVSLS 598
Y VD ++ + +A E+DGP+HF+RN + LG T LK R + G +VVS+S
Sbjct: 486 YRVDLLVPNPVGVPQQSGGIAIEVDGPSHFARNDPELALGQTRLKHRQLRHLGMSVVSVS 545
Query: 599 HQEWEELQGSFEQLDYLR 616
EWE L+ + E+++YLR
Sbjct: 546 VAEWEYLESAEEKVEYLR 563
>gi|145349861|ref|XP_001419345.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579576|gb|ABO97638.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 554
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 125/504 (24%), Positives = 208/504 (41%), Gaps = 56/504 (11%)
Query: 156 DLRRTVSMMAGGMFEE----KREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQE 211
D RT ++ M EE K E ++EE + + P + ++ K+ A AQ
Sbjct: 40 DRARTAAIRGYEMDEEGNYIKPEPSVEELLRGTAWEMDPRQDATQFSMTKEEWKAVKAQA 99
Query: 212 VLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLV 271
+ + +SP A+ L IA+ + R ++ ++
Sbjct: 100 RTATYPHDAVHIFENAGLRRISPEMAASMLKLIAQKAQHSRCDREELAGLRRDPRVAHMI 159
Query: 272 AIAMTA-------LPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQN 324
+ A LP A+ ++ WAL I GE +E++ +A A + + +
Sbjct: 160 GTCVAAARAKSDTLP---AEEVAKCCWALGVIAGERANSAELEVLANRASELMKKLSPDE 216
Query: 325 VANVAGAFASMQHSAPDLFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPADPLLESL 383
+A+++ + A +HS+ F EL A+ FQ ++ V WAFA L L+ +
Sbjct: 217 IADISWSLAISRHSSERFFHELDVHAAMTGFKGFQAYQITTVAWAFAHLGHSHAGFLDGI 276
Query: 384 DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
D A NK L+ + + + V FN L ++AWS+ V
Sbjct: 277 DVWVARAP----ARNKDLT---------------PQQAAEAQVHRFNATILASLAWSFCV 317
Query: 444 L-GQMDRIFFSDIW-KTISRFEEQRISEQYREDIMFASQVHLVNQCL------KLEHPHL 495
+ +D +FF +W + I+R E +E+ + H +L H
Sbjct: 318 MEDALDSLFFRTLWAEIITRGEHDAQMVHEKENTAASMDEHHNTNVFGPWKGRQLNQLH- 376
Query: 496 QLALSSV------LEEKIASAGKTKRFNQK---VTSSFQKEVARLLVSTGLNWIREYAVD 546
Q A+++V L ++ +A T Q V S FQ++V +L G E V
Sbjct: 377 QAAITAVRAGFDPLPTELGAAADTAWNTQNRPPVVSWFQRDVGAILSYMGEKHEEEALVS 436
Query: 547 GYTVDAVLVDKK---VAFEIDGPTHFSRN-TGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
GY D +L D K V E+DGP+HF+RN + LG T LK+R + G+ V + EW
Sbjct: 437 GYRCDLLLPDAKPTGVVIEVDGPSHFARNDRKLALGQTRLKQRQLEGEGFAVFPIPIFEW 496
Query: 603 EELQGSFEQLDYLRVILKDYIGGE 626
+ L+ + ++ DYLR L GE
Sbjct: 497 DYLEDAQQKSDYLRAGLDAIERGE 520
>gi|303279190|ref|XP_003058888.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460048|gb|EEH57343.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 594
Score = 105 bits (262), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 126/464 (27%), Positives = 207/464 (44%), Gaps = 74/464 (15%)
Query: 195 KEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMM 254
KE+ N + D QTA + E A + +SP +IA + ++ ++ K +
Sbjct: 147 KEVKANAN--DPQTALQAFE------EAGLRRVSP------DIAAGMLKMIADVAKKART 192
Query: 255 TTHRLAFTRQRE-----MSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRV 309
LA R+ + VA A + +S AW+L+ I GE +EM+ +
Sbjct: 193 DREELAGLRRDSRVAHLLGTCVAAARRNSDALTPNKLSAAAWSLAIISGERANSAEMEVL 252
Query: 310 AEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKR-ASDIVHTFQEQELAQVLWA 368
AE A V E + A++A A AS +H++P F+ L R A++ + F+ +++ ++WA
Sbjct: 253 AERAALVVSEMKPRACADLAWALASCRHASPAFFNGLDVRFATEGLKKFKVFDVSTLVWA 312
Query: 369 FASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLS 428
FA L +D L + L++ F A+ +S DAD+ + S+
Sbjct: 313 FAHLGHGSDGLRDGLEDWFVGASS------------------ESVSDADAAAAASALAKK 354
Query: 429 FNRDQLGNIAWSYAVLG--QMDRIFFSDIWKTISRF----------EEQRISEQYREDIM 476
F L AWS +V+G M F +W I R ++ + + + I+
Sbjct: 355 FTPQALVTTAWSLSVIGAEAMRSRAFKALWGEIGRLGGEVNDADAVAKEALLAESGDKIV 414
Query: 477 FAS----QVHLVNQC-LKLEHPHLQLALS-SVLEEKIASAGKTKRFNQK---VTSSFQKE 527
F ++ +NQC + ++ AL + L E + A Q+ V S +Q++
Sbjct: 415 FGPWRGKHLNQINQCVVSVDACGGCDALGLAPLAEPLRVAASNAWMAQRRPPVVSWYQRD 474
Query: 528 VARLLVSTGLNWIREYAVDGYTVD-----AVLVD---------KKVAFEIDGPTHFSRNT 573
VA +L G E GY VD + +D VA E+DGP+HF+RN
Sbjct: 475 VASILSYMGEKHEEEAVCAGYRVDLHIPKPIGIDDATHKAAARAGVAVEVDGPSHFARND 534
Query: 574 G-VPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
LG T LK R + + G+ VVS+ EWE L+ S E+++YLR
Sbjct: 535 ATTSLGQTRLKHRQLRSLGFAVVSVPVSEWEYLETSEEKVEYLR 578
>gi|302832295|ref|XP_002947712.1| hypothetical protein VOLCADRAFT_87862 [Volvox carteri f. nagariensis]
gi|300267060|gb|EFJ51245.1| hypothetical protein VOLCADRAFT_87862 [Volvox carteri f. nagariensis]
Length = 1281
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 102/369 (27%), Positives = 162/369 (43%), Gaps = 30/369 (8%)
Query: 263 RQREMSMLVAIAMTALPECSA---QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGE 319
R E SML + A L + ++ QG+SN AWA +++G L ++ AL K+
Sbjct: 645 RSYEHSMLSSWAAQTLDKLASFEPQGVSNTAWAFARLGFHSPQL--FQALSAAALHKIEG 702
Query: 320 FNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPL 379
F +Q ++N+A A A+ H P LF LA+ A+ + +F Q + LWA A+L D L
Sbjct: 703 FTAQGLSNLAWAMATAGHVQPRLFEALARHATSLAPSFNAQNCSVTLWACATLRHHDDEL 762
Query: 380 LESLDNAFKDATQFTCC----LNKALSNCNENGGVKSSGDADSEGSLSSPVLS-FNRDQL 434
+L + + C + AL G A +S +L N+ +L
Sbjct: 763 FNALLE--RLVAEVDTCEPQNVANALWAVARMGHPLPRERAAPLVCHASRLLGRMNQQEL 820
Query: 435 GNIAWSYAVLGQMDRIFFSDIWKTISRFEE---QRISEQYREDIMFASQV------HLVN 485
N W+ A L MD I F+ + R + + + + Y +M+ S + L
Sbjct: 821 CNTMWAVACLDLMDEILFATFCSCLQRLADISPEGMHQAYHAQLMYHSSLARRAGMSLAQ 880
Query: 486 -QCLKLEHPHLQLALSSVLEEK---IASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIR 541
Q L +P L L L E +A++ S F +EV+ L G+
Sbjct: 881 LQQLAASNPPASLGLLPCLSEPLRTVAASMWAASARDVHVSRFHQEVSGALAGAGVPHAL 940
Query: 542 EYAVD--GYTVDAVLV--DKKVAFEIDGPTHFSRNTG-VPLGHTMLKRRYIAAAGWNVVS 596
E+ D ++VD L K VA E++G H++ N LG T ++RR + GW+VV
Sbjct: 941 EWMTDDQHFSVDIGLQVNSKPVAVEVNGSHHYASNAPHRALGDTAVRRRMLEDRGWHVVD 1000
Query: 597 LSHQEWEEL 605
+ EWE +
Sbjct: 1001 VGFAEWEAM 1009
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 85/184 (46%), Gaps = 20/184 (10%)
Query: 283 AQGISNIAWALSKIGGELLYLSE---MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
A+G++N AWA G+L Y+ +A+ AL ++ EF+ QN++N+ +F M H+
Sbjct: 363 ARGLANSAWAF----GKLKYVPSGGLPSVIAQAALRRMPEFSPQNLSNLVWSFVYMHHAD 418
Query: 340 PDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLN- 398
L S ++ V F+ QELA ++WAFASL D +L A K A +
Sbjct: 419 EVLLSAASRFVCARVGEFKPQELANIVWAFASLGHRDDQMLHV---AAKQAQRIAPLFKE 475
Query: 399 KALSNCNENGGVKSSGDADS-------EGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIF 451
+ LSN G S D E + P +F + N+AW+ A +G D F
Sbjct: 476 QELSNMLWALGKMSLRDQPQVLEALMEETRVKLP--AFLPQGISNVAWALASVGHPDMQF 533
Query: 452 FSDI 455
+
Sbjct: 534 LDQV 537
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 49/98 (50%), Gaps = 11/98 (11%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVAL----TKVGEFNSQNVANVAGAFASMQHSA 339
Q +SN+ WAL K+ L + +V E + K+ F Q ++NVA A AS+ H
Sbjct: 476 QELSNMLWALGKMS-----LRDQPQVLEALMEETRVKLPAFLPQGISNVAWALASVGHPD 530
Query: 340 PDLFSELAKRASDIVHTFQEQELAQVLWAFASL--YEP 375
++ + + + F Q LA ++WA ASL Y+P
Sbjct: 531 MQFLDQVVAQCGNQLAAFDVQALANLVWAMASLGYYKP 568
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 58/153 (37%), Gaps = 40/153 (26%)
Query: 258 RLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKV 317
RL F + L A A+ + +AQG+SN+AWA++ G L E +A A +
Sbjct: 680 RLGFHSPQLFQALSAAALHKIEGFTAQGLSNLAWAMATAGHVQPRLFEA--LARHATSLA 737
Query: 318 GEFNSQN-------------------------------------VANVAGAFASMQHSAP 340
FN+QN VAN A A M H P
Sbjct: 738 PSFNAQNCSVTLWACATLRHHDDELFNALLERLVAEVDTCEPQNVANALWAVARMGHPLP 797
Query: 341 -DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
+ + L AS ++ +QEL +WA A L
Sbjct: 798 RERAAPLVCHASRLLGRMNQQELCNTMWAVACL 830
>gi|384250651|gb|EIE24130.1| hypothetical protein COCSUDRAFT_47154 [Coccomyxa subellipsoidea
C-169]
Length = 1093
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 88/344 (25%), Positives = 158/344 (45%), Gaps = 47/344 (13%)
Query: 267 MSMLVAIAMTA-LPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNV 325
MS +VA AM C+ Q ISN WA +K+ + +D A A ++ EF+ QN+
Sbjct: 588 MSRVVANAMAERASNCNPQEISNTVWAYAKL--RFYDAAVLDTFANEATRRIEEFSQQNL 645
Query: 326 ANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDN 385
AN+A A + H L +A+ A+ +V Q ++ +LW +AS L ++ +
Sbjct: 646 ANLAWAMGKLSHFHEGLLDAIAEHATAMVQDLSLQHVSNILWTYASFLH----LKPAMTS 701
Query: 386 AFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLG 445
AF G ++ + ++ FN QL N+ WS +
Sbjct: 702 AFV-------------------GEIERRLNTEA----------FNPQQLSNLLWSLCI-- 730
Query: 446 QMDRIFFSDIWKTI-SRFEEQRISEQ-YREDIMFASQVHLVNQCLKLEHPHLQLALSSVL 503
+ +IWK I ++ E I+ + E+ + +Q++ ++++ P LQL + + L
Sbjct: 731 --AELCSEEIWKGIMAQIETLGIAAKDLPEEAL--TQIYQAYLLMRVDRPQLQLTMPAQL 786
Query: 504 EEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDG--YTVDAVLVDKKVAF 561
N ++ S+ ++VAR+L G+ E+ + ++VD L ++K+A
Sbjct: 787 LPAAHHTWLESCKNVRI-SALHRDVARVLTEHGIPHNIEHVTEDELFSVDIALPEEKIAI 845
Query: 562 EIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL 605
E+DGP HF+ NT G + +++ + A GW V+S+ W L
Sbjct: 846 EVDGPHHFTANTLAVTGEMLARQKLLKARGWAVISVPFFRWSGL 889
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 75/270 (27%), Positives = 131/270 (48%), Gaps = 31/270 (11%)
Query: 185 SQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRI 244
+ F+GP + +++NK I AQ+A+ V+ V+ + + + +ATALH +
Sbjct: 179 TNFAGPV--PECVHINKRITAAQSAEAVIGVVQQELDKFDA---------VCMATALHTL 227
Query: 245 AKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIG---GELL 301
A A + E+ L+ + T L + +A+ +SN WAL+K+G GE +
Sbjct: 228 ASMRASAQQYA----ALFERPEVLRLMHVIGTRLTDFTARNLSNSLWALAKMGHNPGEAM 283
Query: 302 YLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHS-APDLFSELAKRASDIVHTFQEQ 360
L+ M AEVA K+ N+QN+AN+A ++A++ H+ +L +A +A + F Q
Sbjct: 284 -LNAM--AAEVA-KKLDGCNAQNLANIAWSYATLSHTPGEELLEAIAVKAQKKLAEFSSQ 339
Query: 361 ELAQVLWAFASL-YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSE 419
++ +L+AFA L ++P+ L ++ A FT +ALSN + D +
Sbjct: 340 NISNLLYAFAKLEHKPSTFLEQASRAAMPILGSFT---PQALSNTVWALSKLDTLDEELF 396
Query: 420 GSLSSPVLS----FNRDQLGNIAWSYAVLG 445
++ VL FN + N W +A L
Sbjct: 397 IAIVQQVLGKLTRFNAQNVANTVWGFANLA 426
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 92/206 (44%), Gaps = 26/206 (12%)
Query: 282 SAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHS 338
+AQ ++N W + + G+ L+ D VA+ + + E++ QN+ANV ++A M
Sbjct: 411 NAQNVANTVWGFANLAFDPGQPLW----DAVAQNGIYTMHEYSPQNIANVLWSYAKMGKR 466
Query: 339 APDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESL-DNAFKDATQFT-- 394
L + + A+ + TFQ Q +A WA+A+L P+ L +L ++A QF+
Sbjct: 467 YEALLTAASAHAAHTMSTFQPQSVANFCWAYATLNVAPSSQCLTALAEHANHTLMQFSPQ 526
Query: 395 -------CCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVL--- 444
+ G V S A G+ +S +F+R L N+ W++A L
Sbjct: 527 NISNTAWALATLQFKHMGLMGNVASEVTARLSGAEAS---AFSRQHLANLIWAFATLELD 583
Query: 445 --GQMDRIFFSDIWKTISRFEEQRIS 468
M R+ + + + S Q IS
Sbjct: 584 PGAAMSRVVANAMAERASNCNPQEIS 609
>gi|308806908|ref|XP_003080765.1| unnamed protein product [Ostreococcus tauri]
gi|116059226|emb|CAL54933.1| unnamed protein product [Ostreococcus tauri]
Length = 652
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 113/493 (22%), Positives = 210/493 (42%), Gaps = 51/493 (10%)
Query: 156 DLRRTVSMMAGGMFE----EKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQE 211
D +RT ++ M E +K E +++E + + P+ + ++ D A A+
Sbjct: 142 DRQRTAAVRGYEMDEDGNWQKPEPSVDELLRGTAWEMDPTKDATQFSMTTDEWKAVKAEA 201
Query: 212 VLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLV 271
+ +V + ++P A+ L IA+ + + R ++ ++
Sbjct: 202 RTVMYPHDAVSVFEKAGLRRINPEMAASMLKVIAQKAQNSRVDREELAGLRRDPRVAHMI 261
Query: 272 AIAMTA-------LPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQN 324
+ ++A LP A+ ++ WAL I GE +E++ +++ A + +F+S
Sbjct: 262 GVCVSAARAKSDMLP---AEEVAKACWALGVIAGERANSAELEVLSDRAADLIVKFSSDE 318
Query: 325 VANVAGAFASMQHSAPDLFSEL-AKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESL 383
+A++ + AS + + L A +A + FQ +L V WAFA L +E L
Sbjct: 319 IADICWSLASSRQGSTFLRQYTHANQALTGLKGFQAYQLTTVAWAFAHLGHKHTGFVEGL 378
Query: 384 DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
D A ++ A + AD++ + FN L ++AWS+ V
Sbjct: 379 DIWVTRAPARAKTMSPAEA-------------ADAQ------IHRFNATILASLAWSFCV 419
Query: 444 L-GQMDRIFFSDIWKTI--------SRFEEQRIS--EQYREDIMFASQVHLVNQCLKLEH 492
+ +D +FF +W I + E+ S E + ++ + +NQ +
Sbjct: 420 MEDALDSLFFRTLWAEICARGVHDAAVVHEKDPSGDEHHHANVFGPWKGRQLNQLHQASL 479
Query: 493 PHLQLALSSVLEEKIASAGKT--KRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTV 550
+ + E A+A + + V S FQ++V +L G + E V GY
Sbjct: 480 TAVSAGFEPLPAELGAAADEAWNTQTRPPVISWFQRDVGAILSYMGEKYEEEALVGGYRC 539
Query: 551 DAVLVDKK---VAFEIDGPTHFSRN-TGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQ 606
D +L + K V E+DGP+HF+RN LG T LK+R + G+ V + +W+ L+
Sbjct: 540 DLLLPNAKPNGVVIEVDGPSHFARNDRKRALGQTRLKQRQLEGEGYAVFPIPIFDWDFLE 599
Query: 607 GSFEQLDYLRVIL 619
+ ++ DYLR L
Sbjct: 600 NAEQKSDYLRAGL 612
>gi|397587109|gb|EJK53812.1| hypothetical protein THAOC_26672 [Thalassiosira oceanica]
Length = 1144
Score = 95.5 bits (236), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 97/352 (27%), Positives = 160/352 (45%), Gaps = 22/352 (6%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
+ +L AQ +SN AWA + G L + L + F Q ++N A AFA
Sbjct: 397 GLCSLDSFKAQALSNTAWAFATAGVPHPELFKKIGRHVTGLGSLDSFKPQALSNTAWAFA 456
Query: 334 SMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLD-NAFKDA 390
+ + P+LF ++ + + + +F+ QEL+ WA+A+ L E L A +
Sbjct: 457 TAEIPHPELFKKIGDHIAGLGSLDSFKPQELSNTAWAYATARVFHSRLFERLSTGALVER 516
Query: 391 TQFTCC-LNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDR 449
F + L C G + + + + S + N L NI W+Y+V
Sbjct: 517 EHFYVQEVANFLWACATVGHTEETLFSAFAPLIESKLEKCNEQDLTNIGWAYSVTNDASE 576
Query: 450 IFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIAS 509
F++ + +E SE E++ Q L + L E L L L+EK +
Sbjct: 577 GLFNECFVGACASKECEFSE---ENLFQLHQWQLWQRELGSE-----LELPRSLKEKCRN 628
Query: 510 AGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVL-VD--KKVAFEIDG 565
+ + +++ S Q ++ L +TGL+ +E + GY +DA++ VD +KVA E+DG
Sbjct: 629 SFLSANYSE---SKLQNDIVGELKATGLDLEKEILLGSGYRIDALVKVDNGRKVAIEVDG 685
Query: 566 PTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLR 616
P+HF + P G T LK R +A V+S+ + EW EL+ S + YLR
Sbjct: 686 PSHFIQRR--PAGRTTLKHRQVATLDCIEVMSVPYWEWNELKNSAAKQHYLR 735
Score = 42.4 bits (98), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 42/190 (22%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTK-VGEFNSQNVANVAGAF 332
A+ L E A+ +SN+ ++ L+ + + V +T+ + F Q ++NV A+
Sbjct: 207 ALPILHEFDARSLSNLIYSFG-----LVKYNPTEAVGNHIVTRSLDNFWPQALSNVVWAY 261
Query: 333 ASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDA 390
A+ P+L ++ + + + F+ QEL+ + WAFA+ EP P+L FK
Sbjct: 262 ATAGVPHPELLRKIGDHVAGLKSLDPFKPQELSNIAWAFATAGEP-HPVL------FKRI 314
Query: 391 TQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRI 450
L D D SF L NIAW++ G +
Sbjct: 315 GDHVAGL-----------------DLD----------SFKSQSLSNIAWAFVTAGVLHPE 347
Query: 451 FFSDIWKTIS 460
F I I+
Sbjct: 348 LFKKIGDNIA 357
>gi|299472343|emb|CBN77531.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 695
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 89/369 (24%), Positives = 148/369 (40%), Gaps = 76/369 (20%)
Query: 282 SAQGISNIAWALSKIGGELLYLSE-----MDRVAEVALTKVGEFNSQNVANVAGAFASMQ 336
+ Q ++ ++W S + E L +D +A+ A VG F Q+V+ V+ A A M
Sbjct: 332 TPQDLAMLSWGFSSLSQECLPCQPAAYRALDVLAKAARECVGNFRPQDVSMVSLALARMS 391
Query: 337 HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCC 396
P L +A R ++ + F+ QEL+ WA+A L+ +D +F
Sbjct: 392 WDDPRLMKAMASRTTETLRAFKPQELSNTAWAYARLH-------------VRD-RRFWSA 437
Query: 397 LNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIW 456
L K + G+ + ++ N+AW+ AV+G+ D ++
Sbjct: 438 LQKQAKRMLDGPGMSA-------------------QEIANLAWALAVMGEADVELLEEL- 477
Query: 457 KTISRFEEQRISEQYREDIMF--ASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTK 514
R ++ R D + Q++ V + P L L +
Sbjct: 478 --------LRSAQAQRGDFTLIESHQLYQVYLLWGKDMPELWKELDGEFLMALKRRWTDN 529
Query: 515 RFNQKVTSSFQKEVARLL-------------------VSTGL---NW-IREYA--VDGYT 549
+ K +S EV++ L V GL +W R ++
Sbjct: 530 QQRTKRSSCSHLEVSQTLDLMQISHENESEHDIDIEVVGVGLASEDWDFRSFSAGTGPNP 589
Query: 550 VDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQ--G 607
D V K+A E+DGP HF++NT PLGH +LK R ++ GW VVS+ EW+ +
Sbjct: 590 ADPAEVRLKLALEVDGPAHFTKNTARPLGHMVLKHRTLSKMGWTVVSIPFLEWDPIPFWS 649
Query: 608 SFEQLDYLR 616
S E+ YL+
Sbjct: 650 SMEKKRYLQ 658
>gi|302781256|ref|XP_002972402.1| hypothetical protein SELMODRAFT_413123 [Selaginella moellendorffii]
gi|300159869|gb|EFJ26488.1| hypothetical protein SELMODRAFT_413123 [Selaginella moellendorffii]
Length = 609
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/296 (30%), Positives = 133/296 (44%), Gaps = 66/296 (22%)
Query: 202 DIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAF 261
D+VD++ +EVLE I E + + LS +N+ATALHRIAK+M +SM T RL +
Sbjct: 216 DLVDSRDVEEVLETI-ERVKGRFR------LSSINVATALHRIAKHMVTLSMSETRRLKY 268
Query: 262 TRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFN 321
RQ +++ LVA +A ++ +SKIGG LLY EM+ +A AL KV EFN
Sbjct: 269 ARQCDVAELVA--------WNATHRASPTLPISKIGGHLLYRGEMEIIARAALAKVDEFN 320
Query: 322 SQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLE 381
P EL S A P
Sbjct: 321 ------------------PRTLPELLLPCST-----------------------ARP--H 337
Query: 382 SLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSY 441
SL +++ +F ++ + N SG LS F++++L +I WSY
Sbjct: 338 SLRSSWTLRAEFP----RSFEHRNWPSFFGRSGAWLGLWILSWTHRLFSKNKLWSIVWSY 393
Query: 442 AVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQL 497
AVLGQ+ FF+ + K I FE+ Q++ + +Q++ V LK E LQL
Sbjct: 394 AVLGQLQGPFFAHVCKEIRAFEQL---GQHKHMLQL-TQLYQVVLALKREGKDLQL 445
>gi|384245272|gb|EIE18767.1| hypothetical protein COCSUDRAFT_49195 [Coccomyxa subellipsoidea
C-169]
Length = 845
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 99/377 (26%), Positives = 169/377 (44%), Gaps = 57/377 (15%)
Query: 278 LPECSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFAS 334
+P Q I+N WA + +G G +L +D A + + F Q ++N +++
Sbjct: 305 MPHFKPQEIANTLWAFATLGHDPGAIL----LDAAAGQMVDNIAHFRPQAISNSLWSYSK 360
Query: 335 MQHSAPD-LFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESL--------- 383
+ ++ + A+RA+ ++H + QE+A LWAFA+L + P +L++
Sbjct: 361 LAYNPGHRVLDVAARRAAGMLHQYTSQEIANTLWAFATLEHNPGSGMLDAAAVQIARRIE 420
Query: 384 DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLS----FNRDQLGNIAW 439
+ +D T C + A+ ++S L F +L N+ W
Sbjct: 421 QFSPQDTTNSVWCFARLFHYPG----------AELLQAISLYCLRHWHRFKAQELANMIW 470
Query: 440 SYAVLGQMDRIFFSDIWKTISRFEE-QRISEQYREDIMFASQVHLVNQC-LKLEHPHLQL 497
S A+L R D W ++ E+ ++E +D + +H + Q + L+ P L+L
Sbjct: 471 SLALL----RACSHDTW--VALLEKLNTVAEATFDD----ADLHQLYQAYVLLDPPGLRL 520
Query: 498 ALSSVLEEKIASAGKTKRFNQ---------KVTSSFQKEVARLLVSTGL-NWIREYAVDG 547
SS L EK G +R + TS Q++V+ +L S G+ + E DG
Sbjct: 521 P-SSSLSEKFPE-GLARRAERVWRAGVHPLARTSKLQEDVSAVLWSLGVAHKTNEVTADG 578
Query: 548 -YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQ 606
+ VD L KV E+DGPTHFS N+ PLG T+ ++ + A G V S+ + EW L
Sbjct: 579 LFCVDIALEGGKVVIEVDGPTHFSVNSRRPLGRTVARKLMVEARGHVVRSIPYYEWCALD 638
Query: 607 GSFEQLDYLRVILKDYI 623
+Q Y+ +L +
Sbjct: 639 SLEQQQAYVWRLLASAV 655
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 52/198 (26%), Positives = 90/198 (45%), Gaps = 26/198 (13%)
Query: 191 SNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEK 250
SN+ K I K + A Q++L+ +AE + + +N+ATALHR+AK
Sbjct: 117 SNQNKAIT--KRLASAGHYQQILDEVAEWVKVFDE---------VNVATALHRLAKLQPP 165
Query: 251 VSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIG----GELLYLSEM 306
+ + R +LV + +P AQ +SN WA + +G G+LL
Sbjct: 166 GTAGPQSPV--LRSASFQLLVEASQRLVPRFEAQAVSNTLWAFATLGYHPSGDLL----- 218
Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF--SELAKRASDIVHTFQEQELAQ 364
DR+ A V F Q +N A+A + + + F + + +D+ Q+++
Sbjct: 219 DRLGHHAAGIVRTFRPQATSNALWAYAKLAYVPCEPFLAAAALQLLTDLPRCV-PQDISN 277
Query: 365 VLWAFASL-YEPADPLLE 381
WAFA+L + P + L++
Sbjct: 278 ATWAFATLRHHPGNTLMD 295
>gi|397646149|gb|EJK77145.1| hypothetical protein THAOC_01042 [Thalassiosira oceanica]
Length = 635
Score = 90.1 bits (222), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 170/391 (43%), Gaps = 88/391 (22%)
Query: 273 IAMTALPECSAQGISNIAWALS--KIGGELLYLSEM--------DRVAEVALTKVGEFNS 322
I L + Q +SNIAWA + ++ +L S + D +A L + F
Sbjct: 277 IVARKLEDFQPQNLSNIAWAYANARVSHPILLESHIPSYSNKIGDHIA--GLISLDSFKP 334
Query: 323 QNVANVAGAFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLL 380
Q+++N A AFA+ S P+LF ++ + + + +F+ QEL+ V WAFA E ++P +
Sbjct: 335 QDLSNTAWAFATAGVSHPELFKKIGDHVAGLGSLDSFKPQELSNVAWAFAKAGE-SNPKV 393
Query: 381 ESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSE-GSLSSPVLSFNRDQLGNIAW 439
K GD +E G L S FN +L NIAW
Sbjct: 394 -----------------------------FKKIGDHAAELGCLDS----FNPQELSNIAW 420
Query: 440 SYAVLGQMDRIFFSDIWKTIS----RFEEQ---------RISEQYREDIM---------- 476
+ A +G D+ F + I+ F EQ ++ R+D+
Sbjct: 421 ACATVGYNDKRLFCAVAPMIASKLDEFIEQDLANIAWAYSVANTPRQDLFDEGYVSALAS 480
Query: 477 ----FASQVHLVNQCLKLEHPHLQ--LALSSVLEEKIASAGKTKRFNQKVTSSFQKEVAR 530
F+++ +L L+ + L L+E+ +A ++ F++ S Q +V
Sbjct: 481 NKKEFSAEGLAQLHQWQLWQQELESGIELPRSLQERCRNAFTSRGFSE---SKLQNDVVG 537
Query: 531 LLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRY 586
L + GL+ E + GY +DA++ +KVA E+DGP HF P G T LK+R
Sbjct: 538 ELKAAGLDLEEEVLLGSGYRIDALVKFGNGRKVAVEVDGPFHFIDRR--PAGRTTLKQRQ 595
Query: 587 IAAAG-WNVVSLSHQEWEELQGSFEQLDYLR 616
+A VVS+ + EW EL+ S + YLR
Sbjct: 596 VARLDRIEVVSVPYWEWNELKNSVTKQRYLR 626
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 55/202 (27%), Positives = 86/202 (42%), Gaps = 26/202 (12%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYL-SEMDRVAEVALTKVGEFNSQNVANVAGAF 332
A+ L ++Q +SN+ WA K+ + L E RV + +G F Q +AN+ +F
Sbjct: 202 AVKILHTFNSQNLSNVLWAFVKVDADNSRLFQETGRV--ITGMHLGSFKPQELANILWSF 259
Query: 333 ASMQHSAPDLFSELAKR-ASDIVHTFQEQELAQVLWAFASLYEPADPLLE----SLDNAF 387
+ + P++F + + + FQ Q L+ + WA+A+ LLE S N
Sbjct: 260 SKSSEADPEIFQAIGNHIVARKLEDFQPQNLSNIAWAYANARVSHPILLESHIPSYSNKI 319
Query: 388 KDATQFTCCLN----KALSNCN---ENGGV------KSSGD-ADSEGSLSSPVLSFNRDQ 433
D L+ + LSN GV K GD GSL SF +
Sbjct: 320 GDHIAGLISLDSFKPQDLSNTAWAFATAGVSHPELFKKIGDHVAGLGSLD----SFKPQE 375
Query: 434 LGNIAWSYAVLGQMDRIFFSDI 455
L N+AW++A G+ + F I
Sbjct: 376 LSNVAWAFAKAGESNPKVFKKI 397
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 66/147 (44%), Gaps = 12/147 (8%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQ 360
D +A A+ + EF +++++N+ +F ++ + PD LF+ + A I+HTF Q
Sbjct: 154 FDSIASSAVGMLNEFEARHLSNLIYSFGLVERN-PDIGGETLFNVFGEAAVKILHTFNSQ 212
Query: 361 ELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEG 420
L+ VLWAF + L + + L+N + S D +
Sbjct: 213 NLSNVLWAFVKVDADNSRLFQETGRVIT-GMHLGSFKPQELANILWSFSKSSEADPEIFQ 271
Query: 421 SLSSPVLS-----FNRDQLGNIAWSYA 442
++ + +++ F L NIAW+YA
Sbjct: 272 AIGNHIVARKLEDFQPQNLSNIAWAYA 298
>gi|384251748|gb|EIE25225.1| hypothetical protein COCSUDRAFT_61463 [Coccomyxa subellipsoidea
C-169]
Length = 937
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 91/359 (25%), Positives = 158/359 (44%), Gaps = 28/359 (7%)
Query: 278 LPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVAL---TKVGEFNSQNVANVAGAFAS 334
L + Q +SNI W G +L + D AL ++G FN Q ++N AFA
Sbjct: 324 LSHFATQAVSNILW-----GCAVLNFYDQDMFNAAALEIQHRIGSFNDQEISNSLLAFAK 378
Query: 335 MQH---SAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDAT 391
M+H S +F E +R V F Q L+ ++W+FA+L + +LE++
Sbjct: 379 MEHVDVSLLRVFEEDIRRPQR-VRDFTSQALSNMVWSFATLRWYPEKVLEAISAELLRRM 437
Query: 392 QFTCCLNKALS--NCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDR 449
+ ++S + G A+ + V FN N W +VL
Sbjct: 438 PYLSVQEISVSIWAMAKLGYHPGRSLAEFGRRIEELVPDFNSQACANTLWGLSVLQATQL 497
Query: 450 IFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLA--LSSVLEEKI 507
F + I R I + +++ Q+ +LE LA + ++ +
Sbjct: 498 PCFQML---IDRLGSNNID---KVEVLMLHQLFQSLMLARLEARRQNLADPIRTIPDHIY 551
Query: 508 ASAGKTKRFNQK--VTSSFQKEVARLLVSTGLNWIREYAV-DG-YTVDAVLVDKK--VAF 561
A + + K ++S F +V+++L G+ E+ DG +++D L + VA
Sbjct: 552 ALLRRVWKATVKNTLSSRFHIDVSKMLRELGVAHDFEFVTEDGLFSLDIALAGPRGPVAI 611
Query: 562 EIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILK 620
E+DGP HF+ NT PLG T+++RR + A GW V+S+ ++ L + ++ YL +L+
Sbjct: 612 EVDGPYHFTLNTRQPLGSTLIRRRLLHALGWTVLSVPFYDYYRLGSTAAKMQYLGQLLR 670
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/218 (25%), Positives = 96/218 (44%), Gaps = 29/218 (13%)
Query: 318 GEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPA 376
G+ ++ +AN AF + H A D+ L + + T+QEQE++ +WA A+L P
Sbjct: 212 GKMRARQLANTLWAFGKLGHDAEDVVDALLFQMHRTHIATWQEQEMSNAVWAMATLSRPD 271
Query: 377 DPLLESLDNAFKDATQ--FTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVL----SFN 430
+ LLE++ +DA + + + +A+SN V + +++ + F
Sbjct: 272 EGLLETMA---RDAMRRGMSAFVPQAISNLVWGFAVLEYNNNPFMLAVAEYFVMDLSHFA 328
Query: 431 RDQLGNIAWSYAVLGQMDRIFFS----DIWKTISRFEEQRISEQYREDIMFASQVHLVNQ 486
+ NI W AVL D+ F+ +I I F +Q IS + FA
Sbjct: 329 TQAVSNILWGCAVLNFYDQDMFNAAALEIQHRIGSFNDQEISNSL---LAFA-------- 377
Query: 487 CLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSF 524
K+E H+ ++L V EE I + + F + S+
Sbjct: 378 --KME--HVDVSLLRVFEEDIRRPQRVRDFTSQALSNM 411
Score = 47.8 bits (112), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 56/245 (22%), Positives = 104/245 (42%), Gaps = 43/245 (17%)
Query: 235 LNIATALHRIAK---------NMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQG 285
+NI+TA+HR+AK N+ + M H L +++ +S + A+
Sbjct: 169 INISTAMHRLAKVSYKNKVPLNVVQAHPMYPHLLTVLKKKVLSG----------KMRARQ 218
Query: 286 ISNIAWALSKIGGEL-----LYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
++N WA K+G + L +M R T + + Q ++N A A++
Sbjct: 219 LANTLWAFGKLGHDAEDVVDALLFQMHR------THIATWQEQEMSNAVWAMATLSRPDE 272
Query: 341 DLFSELAKRA-SDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAF-KDATQFTCCLN 398
L +A+ A + F Q ++ ++W FA L +P + ++ F D + F
Sbjct: 273 GLLETMARDAMRRGMSAFVPQAISNLVWGFAVLEYNNNPFMLAVAEYFVMDLSHFAT--- 329
Query: 399 KALSNCNENGGVKSSGDAD----SEGSLSSPVLSFNRDQLGNIAWSYAVLGQMD----RI 450
+A+SN V + D D + + + SFN ++ N ++A + +D R+
Sbjct: 330 QAVSNILWGCAVLNFYDQDMFNAAALEIQHRIGSFNDQEISNSLLAFAKMEHVDVSLLRV 389
Query: 451 FFSDI 455
F DI
Sbjct: 390 FEEDI 394
>gi|397565912|gb|EJK44819.1| hypothetical protein THAOC_36611, partial [Thalassiosira oceanica]
Length = 815
Score = 89.0 bits (219), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 100/347 (28%), Positives = 155/347 (44%), Gaps = 33/347 (9%)
Query: 284 QGISNIAWALSKIGGEL--LYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
Q +SN WA + G L+ D +A L + FNSQ+V++ A AFAS S P+
Sbjct: 30 QELSNTVWAFATAGASHPELFRKIGDHIA--GLDSLDSFNSQDVSSTAWAFASAGTSHPE 87
Query: 342 LFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDN---AFKDATQFTCC 396
LF ++ + D + +F+ Q + WA+A+ L E L A KD +
Sbjct: 88 LFRKIGDHVAGLDSLDSFKPQAFSNTAWAYATARVFHSRLFEKLVTEAVAKKDHFESQPI 147
Query: 397 LNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIW 456
N L C G + ++S + F L NIAW+Y+V +F
Sbjct: 148 AN-FLWACATVGYTDERSFSAFAPVIASKLDKFIEQDLANIAWTYSVANAPQDLFNEGYV 206
Query: 457 KTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQ--LALSSVLEEKIASAGKTK 514
++ E + EQ +Q+H +L H L+ + L L K +A ++
Sbjct: 207 GALASNENEFSGEQL-------AQLHQ----WQLWHQELESGIELPRSLRAKCRNAFTSQ 255
Query: 515 RFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFS 570
+++ S Q +V L + GL+ E + GY +DA++ +KVA E+DGP HF
Sbjct: 256 GYSE---SKLQNDVVGELKAAGLDLEEEVLLGSGYQIDALVKFGNGRKVAVEVDGPFHFI 312
Query: 571 RNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLR 616
P G T LK+R +A VVS+ + EW EL+ S + YLR
Sbjct: 313 DRR--PAGRTTLKQRQVARLDRIEVVSVPYWEWNELKNSVTKQRYLR 357
>gi|397601425|gb|EJK57903.1| hypothetical protein THAOC_22012 [Thalassiosira oceanica]
Length = 1126
Score = 89.0 bits (219), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 93/345 (26%), Positives = 149/345 (43%), Gaps = 57/345 (16%)
Query: 284 QGISNIAWALSKIGGE--LLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
Q +SN AWA +K G +L+ D +A L + F Q ++N A A+A+ +
Sbjct: 828 QDLSNTAWAFAKDGASHPVLFKKIGDHIAR--LGSLDSFKPQELSNTAWAYATARVFHSR 885
Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
LF +L A F EQ ++ +LWA A++ + L +L A L K
Sbjct: 886 LFEKLTTEAVAKKDHFDEQGVSNLLWACATVDYTDERLFSAL------APMIASKLGK-- 937
Query: 402 SNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISR 461
FN +L N AW+Y+V + + F + + +
Sbjct: 938 ---------------------------FNLQELANFAWAYSVANTLGQGLFDEGYVSALA 970
Query: 462 FEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVT 521
E+ S + A +LE + L L+EK ++ + +++
Sbjct: 971 SNEKEFSVE-----QLAQLHQWQLWQQELES---GIELPQSLQEKCRNSFTSASYSE--- 1019
Query: 522 SSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPL 577
S Q +V L +TGL+ E + GY +DA++ +KVA E+DGP+HF P+
Sbjct: 1020 SKLQNDVVDELKATGLDLEEEVLLASGYRIDALVKFNDGRKVAVEVDGPSHFIDRR--PV 1077
Query: 578 GHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLRVILKD 621
G T+LK R +A VVS+ + EW++L S + YLRV L D
Sbjct: 1078 GSTILKHRQVARLDRIEVVSVPYWEWDDLMNSVMKQHYLRVKLSD 1122
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 53/196 (27%), Positives = 84/196 (42%), Gaps = 23/196 (11%)
Query: 284 QGISNIAWALSKIGGEL--LYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
Q ++NI W+ +K G E L+ + + +AE+ + F QN++N+A AFA++ S P
Sbjct: 669 QALANIIWSFAKSGEEYSKLFQAIGNHIAELGC--LNSFGPQNLSNIAWAFATVGKSNPK 726
Query: 342 LFSELAKR--ASDIVHTFQEQELAQVLWAFASLYEPADPLLE-------------SLDNA 386
LF ++ D +++F+ Q+L+ WAFA+ LLE SLD+
Sbjct: 727 LFKKIGDHIAGQDSLNSFKPQDLSNTAWAFATAGVSHPELLEKDRRSRDHTAELDSLDSF 786
Query: 387 FKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQ 446
T + K G + SL SF L N AW++A G
Sbjct: 787 NPQTLSITAWAFATAGESHPELFKKIGGHIAGQDSLD----SFKPQDLSNTAWAFAKDGA 842
Query: 447 MDRIFFSDIWKTISRF 462
+ F I I+R
Sbjct: 843 SHPVLFKKIGDHIARL 858
Score = 46.6 bits (109), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 42/169 (24%), Positives = 82/169 (48%), Gaps = 19/169 (11%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQ 360
D +A A+ + EF++++++N+ +F ++ P+ LF K A I+HTF
Sbjct: 573 FDSIASSAVGMLNEFDARHLSNLIYSFGLVERK-PEIGRETLFDVFGKAALRILHTFNGH 631
Query: 361 ELAQVLWAF-------ASLYEPADPLLESLD-NAFKDATQFTCCLNKALSNCNENGGVKS 412
+++ +LWAF + L+E ++ ++ ++FK + A S + ++
Sbjct: 632 DISNMLWAFVKVDAKNSRLFEVTGGVISGMNLDSFKPQALANIIWSFAKSGEEYSKLFQA 691
Query: 413 SGDADSE-GSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTIS 460
G+ +E G L+ SF L NIAW++A +G+ + F I I+
Sbjct: 692 IGNHIAELGCLN----SFGPQNLSNIAWAFATVGKSNPKLFKKIGDHIA 736
Score = 44.3 bits (103), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 48/96 (50%), Gaps = 2/96 (2%)
Query: 275 MTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFAS 334
+ +L Q +SN AWA + + + +++ A+ K F+ Q V+N+ A A+
Sbjct: 858 LGSLDSFKPQELSNTAWAYAT--ARVFHSRLFEKLTTEAVAKKDHFDEQGVSNLLWACAT 915
Query: 335 MQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
+ ++ LFS LA + + F QELA WA++
Sbjct: 916 VDYTDERLFSALAPMIASKLGKFNLQELANFAWAYS 951
>gi|397622591|gb|EJK66728.1| hypothetical protein THAOC_12320 [Thalassiosira oceanica]
Length = 993
Score = 88.6 bits (218), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 100/354 (28%), Positives = 159/354 (44%), Gaps = 29/354 (8%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
+ +L Q +SN AWA +K GE + R E T + F Q ++N A+A
Sbjct: 647 GLDSLDSFKPQELSNTAWAFAK-AGEAVQEDWKSRSLE--QTSLDLFKPQELSNTMWAYA 703
Query: 334 SMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDN--AFKD 389
+ S P+L ++ + D + +F QEL+ +WA+A+ L E L A ++
Sbjct: 704 KAEVSHPELLRKIGDHIAGLDSLDSFNPQELSNTIWAYATARVLDLGLFEKLATEVAARN 763
Query: 390 ATQF--TCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQM 447
QF T ++ L C G + + S + N L NIAW+Y+V
Sbjct: 764 G-QFIETQHMSNFLWACATVGYTDERMFSAFAPVIESKLDECNEQDLTNIAWTYSVANAP 822
Query: 448 DRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKI 507
IF ++ E + EQ A +LE + L L+EK
Sbjct: 823 QDIFNKGYVGALTSKENEFSCEQ------LAQLHQWQLWQQELES---GIELPQSLQEKC 873
Query: 508 ASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEI 563
+A ++ +++ S Q +V L + GL+ E + GY +DA++ ++KVA E+
Sbjct: 874 RNAFTSRGYSE---SKLQNDVVGELKAAGLDLDEEVLLGSGYRIDALVKIGDERKVAVEV 930
Query: 564 DGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLR 616
DGP+HF + P G T LK R +A VVS+S+ EW+EL+ S + YLR
Sbjct: 931 DGPSHFMQRQ--PAGSTTLKHRQVARLDRIEVVSVSYWEWDELRNSETKQHYLR 982
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 88/212 (41%), Gaps = 29/212 (13%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
Q +SNI W+ +K L L + + + F+ Q ++N A AFA+ S P+LF
Sbjct: 579 QALSNIIWSFAKSDKADLELFQALGNHIANMGSLDSFDPQALSNTAWAFATAGESHPELF 638
Query: 344 SELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
+++ + D + +F+ QEL+ WAFA E + E + + T + L
Sbjct: 639 NKIGDHVAGLDSLDSFKPQELSNTAWAFAKAGE---AVQEDWKSRSLEQTSLDLFKPQEL 695
Query: 402 SNCNENGGVKSSGDADSEGSLSSPVL---------------SFNRDQLGNIAWSYAVLGQ 446
SN A ++ +S P L SFN +L N W+YA
Sbjct: 696 SNTMW---------AYAKAEVSHPELLRKIGDHIAGLDSLDSFNPQELSNTIWAYATARV 746
Query: 447 MDRIFFSDIWKTISRFEEQRISEQYREDIMFA 478
+D F + ++ Q I Q+ + ++A
Sbjct: 747 LDLGLFEKLATEVAARNGQFIETQHMSNFLWA 778
Score = 43.5 bits (101), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 41/169 (24%), Positives = 76/169 (44%), Gaps = 29/169 (17%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQ 360
D +A A+ + EF +++++N+ +F ++ + PD LF+ A I+HTF Q
Sbjct: 483 FDSIASSAVGMLNEFEARHLSNLIYSFGLVERN-PDIGGETLFNVFGIAAVKILHTFNSQ 541
Query: 361 ELAQVLWAF-------ASLYEPADPLLESLD-NAFKDATQFTCCLNKALSNCNENGGVKS 412
+++ +LWAF + L+ ++ +D FK +ALSN +
Sbjct: 542 DISNMLWAFVKVDADNSRLFHETGGVISGMDLGNFKP---------QALSNIIWSFAKSD 592
Query: 413 SGDADSEGSLSSPVL------SFNRDQLGNIAWSYAVLGQMDRIFFSDI 455
D + +L + + SF+ L N AW++A G+ F+ I
Sbjct: 593 KADLELFQALGNHIANMGSLDSFDPQALSNTAWAFATAGESHPELFNKI 641
>gi|323450957|gb|EGB06836.1| hypothetical protein AURANDRAFT_65363 [Aureococcus anophagefferens]
Length = 2492
Score = 88.6 bits (218), Expect = 9e-15, Method: Composition-based stats.
Identities = 94/389 (24%), Positives = 149/389 (38%), Gaps = 66/389 (16%)
Query: 255 TTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVAL 314
T HR+ F L A L + + QG+SN+AWA + G + + +
Sbjct: 2068 TKHRVLF------DALADSADHRLRDFNNQGLSNLAWAYASAGASDGNEALFEALGLQVS 2121
Query: 315 TKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKR------ASDIVHTFQEQELAQVLWA 368
+V EF Q +AN+ A+A+ + P++F +A + F QE+A +WA
Sbjct: 2122 LRVAEFRPQGLANLVWAYATAELYCPEVFEAVADEIARPSGGARRAFEFNPQEVANTVWA 2181
Query: 369 FASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLS 428
FA PA L ++ A G K GD + G
Sbjct: 2182 FAKAAVPAPGLYDAFAAAILKL------------------GAKHGGDLKAAG-------- 2215
Query: 429 FNRDQLGNIAWSYAVLGQMDRIFFSDIWKTI---------------SRF--EEQRISEQY 471
F +L N+AW+YA +D +W+ I SRF EE R +Q
Sbjct: 2216 FTPQELANLAWAYACADHVDGDLLLLLWRAIVKEARESPDPGALDGSRFNLEELRQLQQV 2275
Query: 472 REDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARL 531
+ ++ L E A +L +A + + + S +
Sbjct: 2276 VLHAKYGARRGTTMGGLVAEIARAPPAFVGLLRASLADVDASPSGPRSRSPSAWR----- 2330
Query: 532 LVSTGLNWIRE-YAVDG--YTVDAVLVDKKVAFEIDGPTHFSRNTG-VPLGHTMLKRRYI 587
+ G W Y G +T + + +VA E DGP H+ RN VP G T K R +
Sbjct: 2331 --AWGWTWSTNWYCPTGCPWTWLCLPLKWRVAVEFDGPRHYFRNAKRVPTGRTRFKMRLL 2388
Query: 588 AAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
A GW V+ + + +W +L + +YL+
Sbjct: 2389 RALGWRVLHVPYFDWAKLDDDAARTEYLK 2417
Score = 75.9 bits (185), Expect = 6e-11, Method: Composition-based stats.
Identities = 55/189 (29%), Positives = 88/189 (46%), Gaps = 30/189 (15%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
A+ + E +AQ + N AWA + G + + + D +A A+ +V F +QN+AN A+A
Sbjct: 1962 AVRRVDEFNAQELGNTAWAYATAGRD--HPALFDAIAASAMPRVDRFIAQNLANTVWAYA 2019
Query: 334 SMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPA------------DPLLE 381
+ H+ PDLF +A+ + F+ QELA WA+A+ ++ D L +
Sbjct: 2020 TAGHARPDLFDAVAREVARRADEFKPQELANTAWAYATAHKALPGDRPTKHRVLFDALAD 2079
Query: 382 SLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSL--------SSPVLSFNRDQ 433
S D+ +D N+ LSN S+G +D +L S V F
Sbjct: 2080 SADHRLRDFN------NQGLSNL--AWAYASAGASDGNEALFEALGLQVSLRVAEFRPQG 2131
Query: 434 LGNIAWSYA 442
L N+ W+YA
Sbjct: 2132 LANLVWAYA 2140
Score = 65.9 bits (159), Expect = 6e-08, Method: Composition-based stats.
Identities = 46/139 (33%), Positives = 67/139 (48%), Gaps = 11/139 (7%)
Query: 309 VAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWA 368
+A A + EF +Q +ANVA AFA+ P+LF+ LA A+ + F QELA WA
Sbjct: 1775 IANGARHRADEFKAQELANVAWAFATANLDEPELFAALAASATPRLSRFSAQELANTAWA 1834
Query: 369 FASLYEPADPLLESLDNAFKDATQFTCCLNKAL--SNCNENGGVKSSGDADSEGSLSSPV 426
FA PA + +A K+ C L +A+ C+E ++ G A G P+
Sbjct: 1835 FAKRLGPA------VGSAPKNGEDAACRLARAMFAELCDE-ACLRFGGGA--YGPDGEPL 1885
Query: 427 LSFNRDQLGNIAWSYAVLG 445
F +L N+ W+ A G
Sbjct: 1886 DGFKPQELANVCWAMATAG 1904
Score = 60.1 bits (144), Expect = 3e-06, Method: Composition-based stats.
Identities = 51/188 (27%), Positives = 81/188 (43%), Gaps = 24/188 (12%)
Query: 279 PECSAQGISNIAWALSKIGG------ELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAF 332
P Q ++NI WA +K G + L+ + V A+ +V EFN+Q + N A A+
Sbjct: 1926 PATQPQNLANICWAFAKSGCGSPDAVDALFAA----VGRSAVRRVDEFNAQELGNTAWAY 1981
Query: 333 ASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAF-KDAT 391
A+ P LF +A A V F Q LA +WA+A+ L +++ + A
Sbjct: 1982 ATAGRDHPALFDAIAASAMPRVDRFIAQNLANTVWAYATAGHARPDLFDAVAREVARRAD 2041
Query: 392 QFTC--CLNKALSNCNENGGVKSSGDADSEGSLSSPVLS---------FNRDQLGNIAWS 440
+F N A + + + GD ++ + L+ FN L N+AW+
Sbjct: 2042 EFKPQELANTAWAYATAHKAL--PGDRPTKHRVLFDALADSADHRLRDFNNQGLSNLAWA 2099
Query: 441 YAVLGQMD 448
YA G D
Sbjct: 2100 YASAGASD 2107
Score = 52.4 bits (124), Expect = 8e-04, Method: Composition-based stats.
Identities = 68/256 (26%), Positives = 99/256 (38%), Gaps = 58/256 (22%)
Query: 263 RQREMSMLVAIAMTA---LPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALT---K 316
R + L AIA A E AQ ++N+AWA + L E + A +A + +
Sbjct: 1765 RDTSTACLRAIANGARHRADEFKAQELANVAWAFATAN-----LDEPELFAALAASATPR 1819
Query: 317 VGEFNSQNVANVAGAFAS----MQHSAPD------------LFSELAKRA---------- 350
+ F++Q +AN A AFA SAP +F+EL A
Sbjct: 1820 LSRFSAQELANTAWAFAKRLGPAVGSAPKNGEDAACRLARAMFAELCDEACLRFGGGAYG 1879
Query: 351 --SDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQF------------TCC 396
+ + F+ QELA V WA A+ A P D A +A + C
Sbjct: 1880 PDGEPLDGFKPQELANVCWAMATAGFEATPRF--WDGAAAEAARIMDAPATQPQNLANIC 1937
Query: 397 LNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIW 456
A S C V + A ++ V FN +LGN AW+YA G+ F I
Sbjct: 1938 WAFAKSGCGSPDAVDALFAAVGRSAVRR-VDEFNAQELGNTAWAYATAGRDHPALFDAIA 1996
Query: 457 KT----ISRFEEQRIS 468
+ + RF Q ++
Sbjct: 1997 ASAMPRVDRFIAQNLA 2012
>gi|397611301|gb|EJK61272.1| hypothetical protein THAOC_18274, partial [Thalassiosira oceanica]
Length = 333
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 100/353 (28%), Positives = 151/353 (42%), Gaps = 57/353 (16%)
Query: 274 AMTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGA 331
+ +L Q +SNIAWA + G L+ VAE +G F Q+ +N+A A
Sbjct: 25 GLGSLDSFKPQNLSNIAWAFATAGVSHRELFKKIGCHVAEKG--SLGSFKPQDFSNIAWA 82
Query: 332 FASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDAT 391
FA+ S LF +L++ A+ + Q +A LWA A++ + L +L
Sbjct: 83 FATAGVSHMKLFEKLSEAAARKGEFIETQHIANFLWACATVGYTDERLFSAL-------- 134
Query: 392 QFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIF 451
T + L CNE QL NIAW+Y+V +
Sbjct: 135 --TSVIASKLDKCNEQ-------------------------QLANIAWTYSVANTPKQDL 167
Query: 452 FSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAG 511
F+ + + E+ S + A +LE + L L+ K +A
Sbjct: 168 FNKGYASALASIEKDFSAE-----GLAQLHQWQLWQQELES---GIELPRSLQAKCRNAF 219
Query: 512 KTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAV--LVD-KKVAFEIDGPT 567
++ F + S Q +V L +TGL E + GY +DA+ L D +KVA E+DGP+
Sbjct: 220 TSQGFFE---SKLQNDVVDELKATGLVLDEEVLLGSGYRIDALVKLSDGRKVAVEVDGPS 276
Query: 568 HFSRNTGVPLGHTMLKRRYIAAA-GWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
HF P G T+LK R + VVS+ + EW+EL+ S + YLRV L
Sbjct: 277 HFIDRR--PTGSTILKHRQVVKLDSIEVVSVPYWEWDELKNSEMKQHYLRVKL 327
>gi|397643193|gb|EJK75706.1| hypothetical protein THAOC_02564 [Thalassiosira oceanica]
Length = 1004
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 97/350 (27%), Positives = 153/350 (43%), Gaps = 32/350 (9%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAE--VALTKVGEFNSQNVANVAGAFASMQHSAPD 341
Q +SN AWA + G +L+ ++ L+ +G F Q ++N A AFA+ S P
Sbjct: 669 QELSNTAWAFATAG--VLHPELFKKIGGHVAGLSCLGSFKPQALSNTAWAFATTGDSNPK 726
Query: 342 LFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDN---AFKD--ATQFT 394
+F ++ D + +F QEL+ + WA+A+ L E L A KD Q T
Sbjct: 727 MFKKIRDHIVRLDNLDSFTPQELSNIAWAYATARRFDLGLFEKLVTGAVAKKDRFGEQAT 786
Query: 395 CCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSD 454
A + G+ S A ++S + + L NIAW+Y+V + F++
Sbjct: 787 SNFLWACATIGYTDGLLFSAFAPV---IASTLDKYGEQHLANIAWAYSVANAPRQDLFNE 843
Query: 455 IWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTK 514
+ IS D A +LE + L L+ K A ++
Sbjct: 844 GYVGSLALNRNHIS-----DKELAQLHQWQLWQQELES---GIELPRSLQAKCRYAFTSQ 895
Query: 515 RFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFS 570
+ S Q +V L + GL+ E+ + GY +DA++ +KVA E+DGP+HF
Sbjct: 896 GHQE---SKLQDDVVGELRAAGLDLEEEFLLGSGYRIDALVTFSDGRKVAVEVDGPSHFI 952
Query: 571 RNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQEWEELQGSFEQLDYLRVIL 619
P G +LK R + VVS+ H EW EL+ S + ++LRV L
Sbjct: 953 DRR--PTGSAVLKHRQVVRLDRIEVVSVPHWEWNELKNSEMKQNFLRVKL 1000
Score = 46.6 bits (109), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 52/242 (21%), Positives = 97/242 (40%), Gaps = 50/242 (20%)
Query: 284 QGISNIAWALSKIG----------GELLYLSEMDRVAEVALTKV---------------- 317
Q +SN+ WA K+G G ++ ++D AL +
Sbjct: 554 QALSNVMWAFVKVGAKNSRLFRETGGVISGMDLDSFKPQALANILWSFAKSGEADPELFQ 613
Query: 318 -----------GEFNSQNVANVAGAFASMQHSAPDLFSELAK--RASDIVHTFQEQELAQ 364
+F Q+++N+A A+A+ + P LF ++ D + +F+ QEL+
Sbjct: 614 VLGNHIVVRSLNDFWPQDISNIAWAYANGRVPHPILFKKIGDLVAGQDSLDSFKPQELSN 673
Query: 365 VLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSE--GSL 422
WAFA+ L + + + +ALSN ++GD++ + +
Sbjct: 674 TAWAFATAGVLHPELFKKIGGHVAGLSCLGSFKPQALSNTAW--AFATTGDSNPKMFKKI 731
Query: 423 SSPVL------SFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIM 476
++ SF +L NIAW+YA + D F + T + ++ R EQ + +
Sbjct: 732 RDHIVRLDNLDSFTPQELSNIAWAYATARRFDLGLFEKL-VTGAVAKKDRFGEQATSNFL 790
Query: 477 FA 478
+A
Sbjct: 791 WA 792
Score = 42.0 bits (97), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
+ Q +SNIAWA + L +++ A+ K F Q +N A A++ ++
Sbjct: 745 TPQELSNIAWAYAT--ARRFDLGLFEKLVTGAVAKKDRFGEQATSNFLWACATIGYTDGL 802
Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLL 380
LFS A + + + EQ LA + WA++ P L
Sbjct: 803 LFSAFAPVIASTLDKYGEQHLANIAWAYSVANAPRQDLF 841
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 38/151 (25%), Positives = 62/151 (41%), Gaps = 38/151 (25%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHS----APDLFSELAKRASDIVHTFQEQE 361
D +A + + +F +++++N+ +F ++ + LF+ K A I+ TF+ Q
Sbjct: 496 FDSIASSSAGMLDKFETRHLSNLIYSFGLVELNPEIGGDTLFNVFGKTAIKILRTFKPQA 555
Query: 362 LAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGS 421
L+ V+WAF + K++ F E GGV S D D
Sbjct: 556 LSNVMWAFVKV-------------GAKNSRLF-----------RETGGVISGMDLD---- 587
Query: 422 LSSPVLSFNRDQLGNIAWSYAVLGQMDRIFF 452
SF L NI WS+A G+ D F
Sbjct: 588 ------SFKPQALANILWSFAKSGEADPELF 612
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 49/172 (28%), Positives = 74/172 (43%), Gaps = 20/172 (11%)
Query: 295 KIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI- 353
+IGG+ L+ + + A+ + F Q ++NV AF + LF E S +
Sbjct: 530 EIGGDTLF----NVFGKTAIKILRTFKPQALSNVMWAFVKVGAKNSRLFRETGGVISGMD 585
Query: 354 VHTFQEQELAQVLWAFASLYEPADP-LLESLDN--AFKDATQFTCCLNKALSNCNENGGV 410
+ +F+ Q LA +LW+FA E ADP L + L N + F ++ NG V
Sbjct: 586 LDSFKPQALANILWSFAKSGE-ADPELFQVLGNHIVVRSLNDFWPQDISNIAWAYANGRV 644
Query: 411 ------KSSGD-ADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDI 455
K GD + SL SF +L N AW++A G + F I
Sbjct: 645 PHPILFKKIGDLVAGQDSLD----SFKPQELSNTAWAFATAGVLHPELFKKI 692
>gi|307109857|gb|EFN58094.1| hypothetical protein CHLNCDRAFT_142412 [Chlorella variabilis]
Length = 962
Score = 86.3 bits (212), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 77/302 (25%), Positives = 133/302 (44%), Gaps = 38/302 (12%)
Query: 311 EVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
E ++++ +F SQ +AN +FA ++ L + + +HT QE++ +W+FA
Sbjct: 268 ERRVSRLDDFTSQALANTLWSFAYLRWYPVRLLEPITRAVGRKMHTMSSQEISNSIWSFA 327
Query: 371 SL-YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSF 429
Y P + + + +F ++L+ S+ ++ L +
Sbjct: 328 KFAYHPGPVMAQYQVEVVRRVAEFD---GQSLTTTMWAMAALSATHCEAFVKLVERFVEL 384
Query: 430 NRDQLGNIAWSYAVLGQMDRIFFSDIWKTI--SRFEEQRISEQYREDIMFASQVHLVNQC 487
R G + ++ + + + ++FE+QR ++R DI + V+
Sbjct: 385 ERA------------GGFQDVQYNQVLQAVLLAQFEQQRRPGEFRADIDLPDDI--VDTA 430
Query: 488 LKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREY--AV 545
L+ Q + + K+ SSFQ EV+ L G+ EY AV
Sbjct: 431 LQAWQAQQQASAAGGWAAKL--------------SSFQLEVSEALGQLGIEHELEYLTAV 476
Query: 546 DGYTVDAVLV--DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWE 603
+ +VD +V KKVA E+DGP HFS NT PLG TM++RR + A GW V+S+ + W
Sbjct: 477 NLLSVDIAIVKGGKKVAVEVDGPFHFSVNTSSPLGQTMIRRRLLRAVGWTVISVPYHAWY 536
Query: 604 EL 605
L
Sbjct: 537 SL 538
Score = 39.3 bits (90), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 25/105 (23%), Positives = 50/105 (47%), Gaps = 4/105 (3%)
Query: 270 LVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALT--KVGEFNSQNVAN 327
LVA +P +G++N W L+ +G + +E+ R +A+ + ++ +Q ++N
Sbjct: 31 LVARVEALVPHYQPRGLANTMWGLAALGD--VQRAELARRLALAIVSHRTAQYRAQELSN 88
Query: 328 VAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
V A ++ P+ L + + F Q L+ ++WA A L
Sbjct: 89 VVWAMGTLGVLCPEALDPLLEGVVSQIDDFIPQGLSNMVWACAHL 133
Score = 38.9 bits (89), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 31/135 (22%), Positives = 58/135 (42%), Gaps = 31/135 (22%)
Query: 265 REMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQN 324
R +++ + TA + AQ +SN+ WA+ +G +L +D + E ++++ +F Q
Sbjct: 67 RRLALAIVSHRTA--QYRAQELSNVVWAMGTLG--VLCPEALDPLLEGVVSQIDDFIPQG 122
Query: 325 VANVAGAFASMQ---------------------------HSAPDLFSELAKRASDIVHTF 357
++N+ A A ++ H AP +A A+ + F
Sbjct: 123 LSNMVWACAHLRNGTRGCIGPTMGGNPPTHVPRELRPAWHPAPAFLEAVAAAATRKMPDF 182
Query: 358 QEQELAQVLWAFASL 372
Q Q L+ +LW F L
Sbjct: 183 QSQTLSNLLWGFCKL 197
>gi|302780209|ref|XP_002971879.1| hypothetical protein SELMODRAFT_412574 [Selaginella moellendorffii]
gi|300160178|gb|EFJ26796.1| hypothetical protein SELMODRAFT_412574 [Selaginella moellendorffii]
Length = 465
Score = 85.9 bits (211), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 47/96 (48%), Positives = 65/96 (67%), Gaps = 8/96 (8%)
Query: 196 EINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMT 255
EI LN+D+VD++ +EVLE I + G LS +N+ATALHRIAK+M +SM
Sbjct: 217 EIRLNQDLVDSRDVEEVLETIERVKGRFG-------LSAINVATALHRIAKHMVTLSMSE 269
Query: 256 THRLAFTRQREMSMLVAIAMTALPECSAQGISNIAW 291
RL + +Q ++ +LVA AM LPEC+AQG+SNIA+
Sbjct: 270 RRRLKYAKQCDV-LLVASAMELLPECNAQGVSNIAY 304
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 65/145 (44%), Gaps = 35/145 (24%)
Query: 479 SQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLN 538
+Q++ V K E LQL +E++ A A + +R ++K TS QK++ R LV TG
Sbjct: 335 TQLYQVVLASKREGKDLQLG---GIEKRAAGAWEKERSSRKSTSFLQKDIERFLVCTGRQ 391
Query: 539 WIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLS 598
WI EY Y+ + R + AAGW ++S S
Sbjct: 392 WILEYVDADYSHEG--------------------------------RLLGAAGWKIISAS 419
Query: 599 HQEWEELQGSFEQLDYLRVILKDYI 623
+ WE LQG E +D+L +L +I
Sbjct: 420 YAAWENLQGESEHVDFLHKLLAPHI 444
>gi|397617752|gb|EJK64587.1| hypothetical protein THAOC_14665, partial [Thalassiosira oceanica]
Length = 315
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/319 (26%), Positives = 145/319 (45%), Gaps = 25/319 (7%)
Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQ 364
D +A L + FNSQ ++N A A+A+ S P+LF ++ + + + + + QEL+
Sbjct: 4 DHIA--GLKSLDSFNSQALSNTAWAYATAGVSHPELFKKIGDHVAGLKSLDSLKPQELSN 61
Query: 365 VLWAFASLYEPADPLLESLDN-AFKDATQFTCC-LNKALSNCNENGGVKSSGDADSEGSL 422
WA+A+ L E + A + F C + L C G + +
Sbjct: 62 TAWAYATARRFDLRLFEKVSTEAVVNREHFGCQEVANFLWACATVGHTDERLFSAFVPVI 121
Query: 423 SSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVH 482
+S + FN +L NIAW+Y+V +F ++ +E E R+ +
Sbjct: 122 ASKLDEFNEQELANIAWAYSVANLKQDLFDEGYVSALAAYENVFPEESRRQLHQWQLWQQ 181
Query: 483 LVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIRE 542
+ ++L L+EK + + +++ S Q +V L + GL++ E
Sbjct: 182 EIESGIELPQS---------LQEKCRNTFISSSYSE---SKLQNDVVGELRAAGLDFDEE 229
Query: 543 YAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSL 597
+ GY +DA++ ++KVA E+DGP HF + P G T+LK R +A + VVS+
Sbjct: 230 VLLGSGYRIDALVKIREERKVAVEVDGPFHFIDSR--PAGRTILKHRQVARLDYIEVVSV 287
Query: 598 SHQEWEELQGSFEQLDYLR 616
+ EW+ L+ S + YL
Sbjct: 288 PYWEWDGLKNSVMKQHYLH 306
Score = 46.2 bits (108), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 45/97 (46%), Gaps = 2/97 (2%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
+ +L Q +SN AWA + L ++V+ A+ F Q VAN A A
Sbjct: 47 GLKSLDSLKPQELSNTAWAYAT--ARRFDLRLFEKVSTEAVVNREHFGCQEVANFLWACA 104
Query: 334 SMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
++ H+ LFS + + F EQELA + WA++
Sbjct: 105 TVGHTDERLFSAFVPVIASKLDEFNEQELANIAWAYS 141
>gi|145341433|ref|XP_001415814.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576037|gb|ABO94106.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 417
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 154/360 (42%), Gaps = 52/360 (14%)
Query: 280 ECSAQGISNIAWALSKIGGELLYLSEMDRVAE----VALTKV---------------GEF 320
E Q ++N WA + +L R+AE V L+K+ GEF
Sbjct: 51 EFYPQALTNTLWAYT-----VLKHPRAQRLAEILAPVILSKLPEPDKELMQAESATGGEF 105
Query: 321 NSQNVANVAGAFASMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADP 378
++Q V+N +S+ H +L LA F+ QEL+ +WAFA + P +
Sbjct: 106 STQTVSNALWTLSSLGVHPGYELLDRLAIFVVKSSQNFKAQELSNSVWAFAQFAHHPGNE 165
Query: 379 LLESLDNAFKDATQ-FTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLS-------FN 430
L + + + + + +T +AL+N V + D L + V N
Sbjct: 166 ALRTFERSLLERREEYT---TQALANTTIGLSVFGGSEDDGLNKLFNDVTPSWFRLSECN 222
Query: 431 RDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYR-EDIMFASQVHLVNQCLK 489
L NI W+ A +G F SD++K R +R S ++ E + L+
Sbjct: 223 SQDLSNITWAIASVGA----FQSDLYKAAVRELFRRDSMDFQLEGLKMLFHARLMQHDFD 278
Query: 490 LEHPHLQLALSSVLEEKIASAGKTKRFNQK---VTSSFQKEVARLLVSTGLN-WIREYAV 545
E + + V + +A G++ Q S+FQKEV + S G ++ E
Sbjct: 279 PERETVDV----VYPDWVAELGRSAWLQQTEDTRVSTFQKEVLETVKSLGHEPYMEELTD 334
Query: 546 DGY-TVDAVLVDKKVAFEIDGPTHFSRNTGVPLGH-TMLKRRYIAAAGWNVVSLSHQEWE 603
DG ++D L DK+VA E DGP+HF N L T L+ + +A GW VV++ + EW+
Sbjct: 335 DGLLSMDICLKDKRVAIECDGPSHFYTNLTEGLTQKTKLRDKALAVRGWKVVTVPYFEWQ 394
>gi|397607841|gb|EJK59823.1| hypothetical protein THAOC_19906 [Thalassiosira oceanica]
Length = 307
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/307 (26%), Positives = 144/307 (46%), Gaps = 24/307 (7%)
Query: 320 FNSQNVANVAGAFASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPAD 377
F QN++N A AFA+ S +LF ++ + D + +F Q L+ WA+A+
Sbjct: 6 FKPQNLSNTAWAFATAGESHSELFEKIGDHVAGRDSLDSFNPQNLSNTAWAYATARVFHS 65
Query: 378 PLLESLDNAFKDATQFTCCLNKA--LSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLG 435
L E L A +F +K+ L C G + + S + N +L
Sbjct: 66 RLFEKLSTADARKGEFIETQHKSNFLWACATVGYTDERLFSAFAPVMESKLDECNEQELA 125
Query: 436 NIAWSYAVLGQMDRIFFSDIW-KTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPH 494
NIAW+Y+V + F++ + ++ +E++ ++++R+ + + ++L
Sbjct: 126 NIAWAYSVANVPSKDLFNEGYVGALAAYEKEFSAKEFRQLHQWQLWQQELESGIELPRS- 184
Query: 495 LQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAV 553
L+EK +A ++ +++ S Q +V L +TGL+ E + GY +DA+
Sbjct: 185 --------LQEKCRNAFTSQGYSE---SKLQNDVVNELRATGLDLDEEVLLGSGYRIDAL 233
Query: 554 LV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSF 609
+ +VA E+DGP+HF + P+G T LK R +A VVS+ + W E++ S
Sbjct: 234 VKVGNGGRVAVEVDGPSHFIQRW--PVGSTTLKHRQVARLDCIEVVSVPYWVWNEMKNSV 291
Query: 610 EQLDYLR 616
+ YLR
Sbjct: 292 TKQHYLR 298
>gi|397587968|gb|EJK54094.1| hypothetical protein THAOC_26348, partial [Thalassiosira oceanica]
Length = 1003
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 160/360 (44%), Gaps = 33/360 (9%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGEL--LYLSEMDRVAEVALTKVGEFNSQNVANVAGA 331
+ +L Q +SN AWA + G L+ D VA L + F QN++N+A A
Sbjct: 650 GLMSLDSFDPQALSNTAWAFATTGASHPELFKKIGDHVA--GLGSLNSFKPQNLSNIAWA 707
Query: 332 FASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKD 389
FA+ S P+LF ++ + D + +F+ QE++ +WA+A+ L E L
Sbjct: 708 FATAGASHPELFMKIGDHVAGLDSLDSFKPQEISNTVWAYATARVFDLGLFEKLVTVAVI 767
Query: 390 ATQFTCCLNKALSN----CNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLG 445
++ +A++N C G + + S + FN +L NIAW+Y++
Sbjct: 768 KREYFD--GQAVANFLWACATVGHTDERLFSALAPLIGSELDKFNEQELANIAWAYSMAN 825
Query: 446 QMDRIFFSDIWKTISRFEEQRISEQY-REDIMFASQVHLVNQCLKLEHPHLQLALSSVLE 504
+F ++ E++ EQ + Q LV L + L L+
Sbjct: 826 VPQDLFNEGYVGALASNEKEFSGEQLSQLHQWQLWQQELV----------LGIELPGSLQ 875
Query: 505 EKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVA 560
K +A ++ +++ S+ Q +V L + L E + GY +DA + + VA
Sbjct: 876 AKCRNAFTSQGYSE---STLQNDVVGELKAARLVIDEEVLLGSGYRIDASVKFSDGRIVA 932
Query: 561 FEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLRVIL 619
E+DGP+HF P G T+LK R +A VVS+ EW EL+ S + YLRV L
Sbjct: 933 VEVDGPSHFIDRR--PTGSTILKHRQVARLDRIEVVSVPFWEWNELKNSEMKQHYLRVKL 990
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 86/189 (45%), Gaps = 24/189 (12%)
Query: 273 IAMTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAG 330
I L + Q +SNIAWA G +L+ D VA L ++ F+SQ ++N+A
Sbjct: 532 IVARRLNDFQPQALSNIAWAFDTAGVSHPVLFKKIGDHVA--GLVRLNSFDSQALSNIAW 589
Query: 331 AFASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFK 388
+FA+ S P+LF ++ + D + +F+ Q L+ + W+FA++ E L + N
Sbjct: 590 SFATAGDSHPELFKKVGYHVAGLDSLDSFEPQHLSNIAWSFATVGESHPKLFNKIGNHIA 649
Query: 389 DATQFTCCLNKALSNCNENGGVKSSGDADSE------------GSLSSPVLSFNRDQLGN 436
+ALSN ++G + E GSL+ SF L N
Sbjct: 650 GLMSLDSFDPQALSNTAW--AFATTGASHPELFKKIGDHVAGLGSLN----SFKPQNLSN 703
Query: 437 IAWSYAVLG 445
IAW++A G
Sbjct: 704 IAWAFATAG 712
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 71/167 (42%), Gaps = 15/167 (8%)
Query: 211 EVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSML 270
E+ + I + + +G S P + NIA A + ++ M +A
Sbjct: 678 ELFKKIGDHVAGLGSLNSFKPQNLSNIAWAFATAGASHPELFMKIGDHVA---------- 727
Query: 271 VAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAG 330
+ +L Q ISN WA + + L +++ VA+ K F+ Q VAN
Sbjct: 728 ---GLDSLDSFKPQEISNTVWAYAT--ARVFDLGLFEKLVTVAVIKREYFDGQAVANFLW 782
Query: 331 AFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPAD 377
A A++ H+ LFS LA + F EQELA + WA++ P D
Sbjct: 783 ACATVGHTDERLFSALAPLIGSELDKFNEQELANIAWAYSMANVPQD 829
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 93/203 (45%), Gaps = 26/203 (12%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
A+ L ++Q +SN+ WA K+ + L + + ++ +G F Q ++N+ +FA
Sbjct: 457 AVKILHTFNSQELSNMLWAFVKVDADNSRLFQ-ETGGVISGMDLGSFKPQALSNILWSFA 515
Query: 334 SMQHSAPDLFSEL-----AKRASDIVHTFQEQELAQVLWAF--ASLYEPADPLLESLDNA 386
+ P+LF L A+R +D FQ Q L+ + WAF A + P L + + +
Sbjct: 516 KSGKANPELFGVLGDHIVARRLND----FQPQALSNIAWAFDTAGVSHPV--LFKKIGDH 569
Query: 387 FKDATQFTCCLNKALSNCNENGGVKSSGDADSE---------GSLSSPVLSFNRDQLGNI 437
+ ++ALSN + ++GD+ E L S + SF L NI
Sbjct: 570 VAGLVRLNSFDSQALSNIAWS--FATAGDSHPELFKKVGYHVAGLDS-LDSFEPQHLSNI 626
Query: 438 AWSYAVLGQMDRIFFSDIWKTIS 460
AWS+A +G+ F+ I I+
Sbjct: 627 AWSFATVGESHPKLFNKIGNHIA 649
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 80/170 (47%), Gaps = 32/170 (18%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQ 360
DR+A A+ + EF +++++N+ +F ++ + PD LF+ A I+HTF Q
Sbjct: 409 FDRIARSAVGMLNEFEARHLSNLIYSFGLVERN-PDIGGETLFNVFGIAAVKILHTFNSQ 467
Query: 361 ELAQVLWAF-------ASLYEPADPLLESLD-NAFKDATQFTCCLNKALSNCNENGGVKS 412
EL+ +LWAF + L++ ++ +D +FK +ALSN +
Sbjct: 468 ELSNMLWAFVKVDADNSRLFQETGGVISGMDLGSFKP---------QALSNILWS--FAK 516
Query: 413 SGDADSE--GSLSSPVLS-----FNRDQLGNIAWSYAVLGQMDRIFFSDI 455
SG A+ E G L +++ F L NIAW++ G + F I
Sbjct: 517 SGKANPELFGVLGDHIVARRLNDFQPQALSNIAWAFDTAGVSHPVLFKKI 566
>gi|384250903|gb|EIE24382.1| hypothetical protein COCSUDRAFT_83686 [Coccomyxa subellipsoidea C-169]
Length = 463
Score = 83.6 bits (205), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 149/373 (39%), Gaps = 74/373 (19%)
Query: 266 EMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNV 325
+ S V +A ++ AQG++++ WAL+ GG + EM+ V EV + +F +
Sbjct: 120 DASGAVKLAGGSVDALGAQGLADLLWALAAFGGRSYFKDEMEAVLEVLDFQQQKFTMSGL 179
Query: 326 ANVAGAFASMQHSAPDLFSELAK--RASDIVHTFQEQ-ELAQVLWAFASLYEPADPLLES 382
+V A AS H P L ++LA R + T ++ + +LW+FA
Sbjct: 180 LDVTWALASAAHWTPKL-ADLAAAVRERGGLKTIKKNYQFTGLLWSFA------------ 226
Query: 383 LDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYA 442
QF + N G+ E V F QL WS
Sbjct: 227 ---------QF-----------DHNPGLFC------EVLPPKKVAEFETHQLITACWSLC 260
Query: 443 VLGQMDRIFFSDIWKTISRFEEQ-------------RISEQYR--EDIMFASQVHLVNQC 487
VL + F +W+ + E +I ++R ED++ + VH
Sbjct: 261 VLQETQSEVFKSLWRELGTRELPATPMKDAIACQLCQIKMEFRGKEDLLLGTDVH----- 315
Query: 488 LKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDG 547
+ +LE+ R S+ E R L GL I E G
Sbjct: 316 ------------AQILEKADRCWKHDLRTTDFHMSAQHAETCRALKGMGLEHIYEDVSTG 363
Query: 548 YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQG 607
Y VD + + ++A EIDGPTHF+RN LG +++K R + GW+V L+ ++WE +
Sbjct: 364 YAVDIAIPELRIAVEIDGPTHFARNAKRRLGPSIMKHRQLDDMGWHVFPLTAEDWESAES 423
Query: 608 SFEQLDYLRVILK 620
S L LR ++
Sbjct: 424 SAAALQKLRDFIR 436
>gi|397600696|gb|EJK57702.1| hypothetical protein THAOC_22226 [Thalassiosira oceanica]
Length = 877
Score = 83.6 bits (205), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 90/356 (25%), Positives = 151/356 (42%), Gaps = 57/356 (16%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
+ +L Q SN AWA + G L L M L + F +Q ++N A +FA
Sbjct: 567 GLDSLNSFKPQNFSNTAWAFASAGVSHLALFNMIGHHVAGLGSLDSFKAQALSNTAWSFA 626
Query: 334 SMQHSAPDLFSELAKRASDIVH--TFQEQELAQVLWAFASLYEPADPLLESLDNAFKDAT 391
+ S P+LF +++ +++ + +F+ QEL +WA AS+ + L +L
Sbjct: 627 TAGISCPELFRKISGHVAELGYLDSFKLQELLNTVWACASVGYTDERLFSAL-------- 678
Query: 392 QFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIF 451
+ L C+E L NIAW+Y+V +
Sbjct: 679 --APVIASKLDECSEQ-------------------------HLANIAWTYSVANTPRQDL 711
Query: 452 FSDIW-KTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASA 510
F+ + ++ E+ +E + + + ++L P L K +A
Sbjct: 712 FNVGYVGALASIEKVFSAEGLAQLHQWQLWQQELESGIQLPGP---------LGAKCLNA 762
Query: 511 GKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGP 566
++ F++ S Q +V L + GL+ E + GY +DA++ +KVA E+DGP
Sbjct: 763 FTSQGFSE---SKLQNDVVGELKAAGLDLDEEVLLGSGYRIDALVKFSDGRKVAVEVDGP 819
Query: 567 THFSRNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
+HF P G T+LK R + VVS+ + EW EL+ S + YLRV L +
Sbjct: 820 SHFIDRR--PTGSTILKHRQVTRLDRIEVVSVPYWEWNELKNSEMKQHYLRVKLSN 873
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 51/202 (25%), Positives = 81/202 (40%), Gaps = 34/202 (16%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
A+ L ++QGISN+ +A K+ + L E + ++ + F Q +AN+ +FA
Sbjct: 413 ALKILHTFNSQGISNMLFAFVKVDAKNSRLFE-ETCGVISGMDLDNFKPQALANILWSFA 471
Query: 334 SMQHSAPDLFSEL-----AKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFK 388
+ P+LF L A+R +D FQ Q L+ + WAFA+ L + N
Sbjct: 472 KSGEAEPELFQALGNHIVARRLND----FQPQHLSNIAWAFATAEVSHPELFNKIGNHIA 527
Query: 389 DATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVL---------------SFNRDQ 433
++ALSN + A + +S VL SF
Sbjct: 528 GPGSLDSFSSQALSN---------TAWAFAAAGVSHTVLMKKIGNHIAGLDSLNSFKPQN 578
Query: 434 LGNIAWSYAVLGQMDRIFFSDI 455
N AW++A G F+ I
Sbjct: 579 FSNTAWAFASAGVSHLALFNMI 600
Score = 43.9 bits (102), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 52/223 (23%), Positives = 87/223 (39%), Gaps = 50/223 (22%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQ 360
DR+A A+ + EF +++++N+ +F ++ + PD LF+ K A I+HTF Q
Sbjct: 365 FDRIASSAVGMLNEFEARHLSNLIYSFGLVERN-PDIGEETLFNVFGKAALKILHTFNSQ 423
Query: 361 ELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEG 420
++ +L+AF + L E TC GV S D D
Sbjct: 424 GISNMLFAFVKVDAKNSRLFEE-----------TC-------------GVISGMDLD--- 456
Query: 421 SLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQ 480
+F L NI WS+A G+ + F + I Q+ +I +A
Sbjct: 457 -------NFKPQALANILWSFAKSGEAEPELFQALGNHIVARRLNDFQPQHLSNIAWAFA 509
Query: 481 VHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSS 523
++ HP L + + IA G F+ + S+
Sbjct: 510 T------AEVSHPE----LFNKIGNHIAGPGSLDSFSSQALSN 542
>gi|397605334|gb|EJK58973.1| hypothetical protein THAOC_20863 [Thalassiosira oceanica]
Length = 1152
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 160/365 (43%), Gaps = 37/365 (10%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAE--VALTKVGEFNSQNVANVAGA 331
+ +L + Q +SN WA + G + Y +++ L + FNSQ ++N A
Sbjct: 804 GLDSLNSFNPQNLSNTIWAFATAG--VSYPELFNKIGNHIAGLGSLDSFNSQALSNTVWA 861
Query: 332 FASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKD 389
FA+ S P LF+++ + D + +F Q L+ WA+A+ L E L A
Sbjct: 862 FATAGESNPKLFNKIGDHVTRLDSIDSFNSQNLSNTAWAYATARVFHSRLFEKLTTAVAA 921
Query: 390 ATQF---TCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQ-----LGNIAWSY 441
T + L C G + + S +PV++ DQ + NIAW+Y
Sbjct: 922 RKAHFIETQHIANLLWACATVGYID-----ERLFSALAPVVASKLDQCNGQDIANIAWAY 976
Query: 442 AVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSS 501
+V + F++ + + E+ S E++ Q L Q LK + L
Sbjct: 977 SVANFPKQDLFNEGYVSALASNEKDFST---EELFQLHQWQLWQQELKS-----GIELPR 1028
Query: 502 VLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DK 557
L+EK + +++ S Q +V L + GL+ E + GY +DA++ +
Sbjct: 1029 SLQEKCRNVVTYASYSE---SKLQNDVVGELRAAGLDLDEEVLLGSGYRIDALVKFGGGR 1085
Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLR 616
KVA E+DGP HF P G +LK R +A VV + + EW+EL+ S + YLR
Sbjct: 1086 KVAVEVDGPFHFIDRR--PAGRAILKHRQVARLDRIEVVPVPYWEWDELKNSEMKQHYLR 1143
Query: 617 VILKD 621
V L +
Sbjct: 1144 VKLSN 1148
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 54/197 (27%), Positives = 87/197 (44%), Gaps = 18/197 (9%)
Query: 275 MTALPECSAQGISNIAWALS--KIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAF 332
M +L Q +SN AWA + + L+ D +A L + FN Q ++N A AF
Sbjct: 727 MGSLDSFKPQDLSNTAWAFATARESNPKLFKKIGDNIA--GLGSLDSFNPQELSNTAWAF 784
Query: 333 ASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDA 390
A+ S P LF+++ + D +++F Q L+ +WAFA+ L + N
Sbjct: 785 ATAGDSNPKLFNKIGHHVAGLDSLNSFNPQNLSNTIWAFATAGVSYPELFNKIGNHIAGL 844
Query: 391 TQFTCCLNKALSN-------CNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
++ALSN E+ + D L S + SFN L N AW+YA
Sbjct: 845 GSLDSFNSQALSNTVWAFATAGESNPKLFNKIGDHVTRLDS-IDSFNSQNLSNTAWAYAT 903
Query: 444 LGQMDRIFFSDIWKTIS 460
R+F S +++ ++
Sbjct: 904 A----RVFHSRLFEKLT 916
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 87/197 (44%), Gaps = 21/197 (10%)
Query: 274 AMTALPECSAQGISNIAWALS------KIGGELLYLSEMDRVAEVALTKVGEFNSQNVAN 327
A+ L E A+ +SN+ ++ IG E L+ + + A+ + FNSQ+++N
Sbjct: 608 AVEMLNEFDARTLSNLIYSFGLVERNPDIGEETLF----NVFGKAAVKILNTFNSQDISN 663
Query: 328 VAGAFASMQHSAPDLFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPADP-LLESLDN 385
+ AF + LF E S + + F+ Q LA +LW+FA E ADP L ++L N
Sbjct: 664 MLLAFVKVDAKNSRLFHETCGVISGMDLDNFKPQALANILWSFAKSGE-ADPELFQALGN 722
Query: 386 AFKDATQFTCCLNKALSN-------CNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIA 438
+ LSN E+ D+ L S + SFN +L N A
Sbjct: 723 HIAVMGSLDSFKPQDLSNTAWAFATARESNPKLFKKIGDNIAGLGS-LDSFNPQELSNTA 781
Query: 439 WSYAVLGQMDRIFFSDI 455
W++A G + F+ I
Sbjct: 782 WAFATAGDSNPKLFNKI 798
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 46/180 (25%), Positives = 81/180 (45%), Gaps = 41/180 (22%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQ 360
DR+A A+ + EF+++ ++N+ +F ++ + PD LF+ K A I++TF Q
Sbjct: 601 FDRIARSAVEMLNEFDARTLSNLIYSFGLVERN-PDIGEETLFNVFGKAAVKILNTFNSQ 659
Query: 361 ELAQVLWAF-------ASLYEPADPLLESLD-NAFKDATQFTCCLNKALSNCNENGGVKS 412
+++ +L AF + L+ ++ +D + FK +AL+N +
Sbjct: 660 DISNMLLAFVKVDAKNSRLFHETCGVISGMDLDNFKP---------QALANILWS--FAK 708
Query: 413 SGDADSE------------GSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTIS 460
SG+AD E GSL SF L N AW++A + + F I I+
Sbjct: 709 SGEADPELFQALGNHIAVMGSLD----SFKPQDLSNTAWAFATARESNPKLFKKIGDNIA 764
>gi|397636260|gb|EJK72207.1| hypothetical protein THAOC_06282, partial [Thalassiosira oceanica]
Length = 569
Score = 82.8 bits (203), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 102/399 (25%), Positives = 165/399 (41%), Gaps = 66/399 (16%)
Query: 274 AMTALPECSAQGISNIAWALSK--IGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGA 331
+ L Q +SN WA + + L+ D +A L + F+ QN++N+ A
Sbjct: 174 GLGCLESFKPQNLSNTVWAFATADMTHPELFKKIGDHIA--GLMSLDSFDPQNLSNIVWA 231
Query: 332 FASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPL---------- 379
FA+ + S P LF+++ + D +++F Q+L+ WAFA+ E L
Sbjct: 232 FATAKESHPQLFNKIGHHVAGLDSLNSFNSQDLSLTAWAFATAGESNPELFNKIGNHVAG 291
Query: 380 LESLDNAF-----------------------KDATQF---------TCCLNKALSNCNEN 407
L+SLD+ K AT+F T ++ L C
Sbjct: 292 LDSLDSFMPQDFSNTIWAYATARVFHSRLFEKLATEFVSRKGEFIKTQHMSNFLWACATV 351
Query: 408 GGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRI 467
G A + S + FN +L NIAW+Y+V + F++ + +E+
Sbjct: 352 GHTDERLFAALAPVIGSKLDKFNEQELANIAWAYSVANAPRQDLFNEGYVGALASKEKVF 411
Query: 468 SEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKE 527
S + + + + L L K +A ++ F++ S Q +
Sbjct: 412 SGKELAQLHQLQLWQQELES--------GIELPGSLRAKCRNAFTSQGFSE---SKLQND 460
Query: 528 VARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLK 583
V L + GL E + GY +DA++ +KVA E+DGP+HF P G T LK
Sbjct: 461 VVYELKAAGLVLDEEVLLGSGYRIDALVKFGDGRKVAVEVDGPSHFIDRR--PAGSTTLK 518
Query: 584 RRYIAAAG-WNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
R +A VVS+ + +W EL+ S + YLRV L D
Sbjct: 519 HRQVARLDRIQVVSVPYWQWNELKNSEMKQHYLRVKLPD 557
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 66/266 (24%), Positives = 112/266 (42%), Gaps = 35/266 (13%)
Query: 204 VDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTR 263
VDA+ + E + V G+ P ++ L AK+ E V +
Sbjct: 2 VDAKNPRPFQEA-----SGVIPGMDLGSFKPQELSNVLWSFAKSCESVPKLF-------- 48
Query: 264 QREMSMLVAIAMTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFN 321
R + +A M +L Q +SN AWA + G L+ D VA L + F+
Sbjct: 49 -RLLGNHIA-NMGSLDSFKTQELSNTAWAFATAGQSNPALFEKIGDHVA--GLESLNSFD 104
Query: 322 SQNVANVAGAFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFAS-------- 371
Q ++N+A A+A+ + S P+L ++ + + + +F+ Q L+ WAFA+
Sbjct: 105 PQALSNIAWAYATAEVSHPELLKKIGDHIAGLSSLESFKPQNLSNTAWAFATAGVSHPKL 164
Query: 372 LYEPADPL--LESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSF 429
LY+ D + L L+ +FK A ++ K GD + G +S + SF
Sbjct: 165 LYKIGDYIAGLGCLE-SFKPQNLSNTVWAFATADMTHPELFKKIGDHIA-GLMS--LDSF 220
Query: 430 NRDQLGNIAWSYAVLGQMDRIFFSDI 455
+ L NI W++A + F+ I
Sbjct: 221 DPQNLSNIVWAFATAKESHPQLFNKI 246
>gi|308799013|ref|XP_003074287.1| unnamed protein product [Ostreococcus tauri]
gi|116000458|emb|CAL50138.1| unnamed protein product, partial [Ostreococcus tauri]
Length = 478
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 118/484 (24%), Positives = 195/484 (40%), Gaps = 108/484 (22%)
Query: 199 LNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNM---------- 248
LN++I+DAQ +E+LE + + + +N ATA HR+ +
Sbjct: 6 LNREIMDAQYPEEILEHVR---------VRSHLYNRVNCATAWHRLGRTSRVNGRPRGWT 56
Query: 249 --EKVSMM--TTHRL--AFTRQREMSMLVAIAMTA------------------LPECSAQ 284
E+V+ + TT RL F Q ++ A A+ + E Q
Sbjct: 57 SDERVAELEATTRRLMSTFAVQNLTNIAWACAVLKYKPRDDLLGSIAARMGEMVAEFYPQ 116
Query: 285 GISNIAWALSKIGGELLY-LSEM--------------DRVAEVALTKVGEFNSQNVANVA 329
+SN WA + + + L+E D + + K G F++Q V+NV
Sbjct: 117 ALSNALWAYTVLKHPRAFALAEALKPAILATLPENPDDELKQAESAKDGVFSTQTVSNVL 176
Query: 330 GAFASMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDNAF 387
+A++ H +L LA +F+ QEL+ W++A + P D +L++ +
Sbjct: 177 WTYATLGVHPGVELLDRLAAFILKSAGSFKAQELSNSCWSYARFGHYPGDEVLQTFERCL 236
Query: 388 KDATQ-FTCCLNKALSNCNENGGVKSSGDADSEGSLSS-----PVLSF-----NRDQLGN 436
+ + +T +AL+N + G+ G EG L P F N + N
Sbjct: 237 LERREEYT---TQALANTSV--GLSYFG-GSGEGGLRKLFDDIPPSWFRLREGNSQDISN 290
Query: 437 IAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLK-LEHPHL 495
I W+ A +G F S ++K R E +R D+ ++ LK L H L
Sbjct: 291 IVWAIASVGA----FESQVYKAAVR-------ELFRRDV-----TDFQDEGLKALFHARL 334
Query: 496 QLA--------LSSVLEEKIASAGKTKRFNQ---KVTSSFQKEVARLLVSTGLNWIREYA 544
+ V + +A G Q S+FQ+ V + G E
Sbjct: 335 MQHDFAPDKDEVDVVYPDWVADKGLKPWLEQAEDTRVSTFQQNVTDAVKRAGYEPTMEAL 394
Query: 545 V-DGY-TVDAVLVDKKVAFEIDGPTHFSRNTGVPLGH-TMLKRRYIAAAGWNVVSLSHQE 601
DG ++D L DKK+A E DGPTHF N + T+++ R++ GW V+ + + E
Sbjct: 395 TEDGLLSMDICLNDKKLAIECDGPTHFYSNAPEKMTQKTLIRNRHLEVRGWKVIMIPYYE 454
Query: 602 WEEL 605
W E+
Sbjct: 455 WREV 458
>gi|397612272|gb|EJK61674.1| hypothetical protein THAOC_17795 [Thalassiosira oceanica]
Length = 314
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 93/349 (26%), Positives = 139/349 (39%), Gaps = 56/349 (16%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEM--DRVAEVALTKVGEFNSQNVANVAGA 331
+ +L + +SN AWA + G L E D VA + FN Q ++N A A
Sbjct: 7 GLKSLDSFNPHDLSNTAWAYATAGESHSELFEKIGDHVA--GRISLDSFNPQALSNTAWA 64
Query: 332 FASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDAT 391
+A+ + LF EL+ A F QE+A LWA A++ + L
Sbjct: 65 YATARRFHSRLFEELSTEAVVSREYFGGQEVANFLWACATVVYTGERLF----------L 114
Query: 392 QFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIF 451
F + L CNE G L NIAW+Y+V
Sbjct: 115 AFAPVVESKLDECNEQG-------------------------LANIAWAYSVANVASEDL 149
Query: 452 FSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAG 511
F++ + E+ S + V L L + + L L+EK A
Sbjct: 150 FNEGYVGAFALNEKDFSAE--------GLVQLHQWQLWQQEIESGIELPQSLQEKCRKAF 201
Query: 512 KTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAV--LVDKKVAFEIDGPTH 568
+ +++ + Q V R L + GL+ E + GY VDA+ + D+ VA E+DGP+H
Sbjct: 202 TSASYSESI---LQNGVVRELKAVGLDVDEEVLLGSGYRVDALVNVGDRGVAIEVDGPSH 258
Query: 569 FSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLR 616
F P G LK R +A VVS+ + EW+ L+ S + YL
Sbjct: 259 FIHRR--PTGSATLKHRQVATLDCIEVVSVPYWEWDGLKNSVMKQHYLH 305
>gi|428181830|gb|EKX50692.1| hypothetical protein GUITHDRAFT_85192 [Guillardia theta CCMP2712]
Length = 177
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 67/106 (63%), Gaps = 2/106 (1%)
Query: 516 FNQKVTSSFQKEVARLLVSTGLNWIREYAVD--GYTVDAVLVDKKVAFEIDGPTHFSRNT 573
Q S QK+VA +L + ++ E+ + GY++D +L DK+ A E+DGP+HF T
Sbjct: 63 MEQHKPSRLQKDVAAILSEMQIEFVEEFIDERSGYSLDLLLRDKRTAIEVDGPSHFIVGT 122
Query: 574 GVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
+PLG T++K R++ G+++ L + EW++L+G ++ +Y+R +L
Sbjct: 123 HIPLGKTVMKHRHMQQLGFDLRILPYWEWDQLKGKEQKKEYIRRLL 168
>gi|397618779|gb|EJK65038.1| hypothetical protein THAOC_14163, partial [Thalassiosira oceanica]
Length = 389
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 93/381 (24%), Positives = 161/381 (42%), Gaps = 56/381 (14%)
Query: 280 ECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
+C+ Q ++NI W+ +K G L E + + + F Q+++N+ A+A++ S
Sbjct: 10 DCTEQALANILWSFAKSGEASPELFEAIE-NHIVVRSLDGFRPQHLSNIVWAYATVGVSH 68
Query: 340 PDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCL 397
P+LF ++ + D + F Q L+ WA+A+ L + + N
Sbjct: 69 PELFKKIGDHVAGLDSLDWFTPQALSNTAWAYATAEASHSELFKKIGNHIAGMGSLDLFN 128
Query: 398 NKALSNCNENGGVKSSGDADSEGSLSSPVL----SFNRDQLGNIAWSYAVLGQMDRIFFS 453
++ SN + LS+ + F+ ++ N W+ A +G D FS
Sbjct: 129 SQDFSNTAWAYATARRFHSRLFEKLSTEAIVKGEYFDGQEVANFLWACATVGYSDERLFS 188
Query: 454 DIWKTI-SRFEEQRISEQYREDIMFASQV------HLVNQCL------------------ 488
I S+ +E +EQ+ +I +A V L N+C
Sbjct: 189 AFTPVIESKLDE--CNEQHLANIAWAYSVVNVPSQDLFNECYVGALASRENAFSEEDLSQ 246
Query: 489 ---------KLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNW 539
+LE ++ L L K +A ++ +++ S Q +VA L + GL+
Sbjct: 247 LHQWQLWQQELES---RIELPRSLRAKCRNAFTSRGYSE---SKLQNDVAGELRAAGLDL 300
Query: 540 IREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAG-WNV 594
E + GY +DA++ +KVA E+DGP+HF P G T+LK R + V
Sbjct: 301 DEEVLLGSGYRIDALVKVGDGRKVAVEVDGPSHFIDRR--PTGSTILKHRQVLRLDRIEV 358
Query: 595 VSLSHQEWEELQGSFEQLDYL 615
VS+ + EW EL+ S + YL
Sbjct: 359 VSVPYWEWNELKNSVTKQHYL 379
Score = 43.1 bits (100), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 56/124 (45%), Gaps = 10/124 (8%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
M +L ++Q SN AWA + + ++++ A+ K F+ Q VAN A A
Sbjct: 120 GMGSLDLFNSQDFSNTAWAYAT--ARRFHSRLFEKLSTEAIVKGEYFDGQEVANFLWACA 177
Query: 334 SMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPL--------LESLDN 385
++ +S LFS + EQ LA + WA++ + P+ L L S +N
Sbjct: 178 TVGYSDERLFSAFTPVIESKLDECNEQHLANIAWAYSVVNVPSQDLFNECYVGALASREN 237
Query: 386 AFKD 389
AF +
Sbjct: 238 AFSE 241
>gi|397612107|gb|EJK61605.1| hypothetical protein THAOC_17875 [Thalassiosira oceanica]
Length = 956
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 99/353 (28%), Positives = 160/353 (45%), Gaps = 39/353 (11%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTK-VGEFNSQNVANVAGAFASMQHSAPDL 342
Q +NI W+ +K G L + + +T+ V +F Q+V+N+ A+A+ + S P+L
Sbjct: 623 QDFANIIWSFAKSGKPDPELFQA--LGNHIVTRSVNDFWPQDVSNIVWAYAAAEVSHPEL 680
Query: 343 FSELAKR--ASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDN---AFKDATQFTCCL 397
F ++ D + +F Q L+ WA+A+ L E L A KD
Sbjct: 681 FKKIGDHIAGRDSLDSFNSQALSNTAWAYATAKVFHSRLFEKLATKVVARKDHFHGQAVA 740
Query: 398 NKALSNCNENGGVKSSGDADSE-GSLSSPVLSFNRDQ-----LGNIAWSYAVLGQMDRIF 451
N L C + G D S +PV++ D+ L NIAW+Y+V +
Sbjct: 741 N-FLWAC------ATVGHTDERLCSALAPVIASKLDECSEHDLANIAWAYSVANTPRQDL 793
Query: 452 FSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAG 511
F + + E+ ++ ++ Q L Q L + L L+EK +A
Sbjct: 794 FDEGYLCALASNEKDFPDK---ELFQLHQWQLWQQELGS-----GIELPRSLQEKSRNAF 845
Query: 512 KTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPT 567
++ +++ S Q +V L + GL+ E + GY +DA++ +KVA E+DGP+
Sbjct: 846 TSRGYSE---SKLQNDVVGELKAAGLDLEEEVLLGSGYRIDALVKFSDGRKVAIEVDGPS 902
Query: 568 HFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLRVIL 619
HF P G T LK R +A VVS+ + EW+EL+ S +L YLR L
Sbjct: 903 HFIDKR--PAGSTTLKHRQVAMLDRIEVVSVPYWEWDELKNSEMKLHYLRKKL 953
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 60/151 (39%), Gaps = 38/151 (25%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP----DLFSELAKRASDIVHTFQEQE 361
DR+A AL + EF +++++N+ +F ++ LF K A I+HTF+ QE
Sbjct: 527 FDRIARSALGMLNEFEARHLSNLIYSFGLVERKPEIGRETLFDVFGKAALRILHTFKPQE 586
Query: 362 LAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGS 421
L+ +LWAF + L E E GV S D D
Sbjct: 587 LSNMLWAFVKVDAKNSRLFE------------------------ETSGVISGMDLD---- 618
Query: 422 LSSPVLSFNRDQLGNIAWSYAVLGQMDRIFF 452
SF NI WS+A G+ D F
Sbjct: 619 ------SFKPQDFANIIWSFAKSGKPDPELF 643
>gi|224013862|ref|XP_002296595.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968947|gb|EED87291.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1014
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 160/361 (44%), Gaps = 61/361 (16%)
Query: 265 REMSMLVAIAMTALPECS---AQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFN 321
R ++ I+ ++P C+ Q ++++AW+ + + + ++ +A + + EF+
Sbjct: 700 RSPALFNYISDVSVPHCNDLKRQEVASLAWSFAALN--FFHRPLLEALAVSSEGRWEEFS 757
Query: 322 SQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLE 381
+QN+AN+A A+ + Q + L +A A F Q + +LWA+A+ P L
Sbjct: 758 AQNLANMAWAYTTAQETRHSLLRGIADAAIKKHDEFTHQGFSNLLWAYAAAGHPHQRLFS 817
Query: 382 SLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSY 441
+L + + + L CN L NIAW++
Sbjct: 818 ALAPS----------VAEVLDTCNGQS-------------------------LANIAWAF 842
Query: 442 AVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSS 501
AV D + FSD + + +I E E + Q+H N ++
Sbjct: 843 AVSNVNDELLFSDRFVDVC---SSKIDEFNSEGL---CQLHQWNIW------RAEIGSDK 890
Query: 502 VLEEKIASAGKTKRFNQKVT-SSFQKEVARLLVSTGLNWIREYAVD-GYTVDAVL-VD-K 557
VL IA T+ ++ + S+ Q + ++L S L+ I E + GY +D V+ VD +
Sbjct: 891 VLPPMIAKKCYTQFTSRPLQGSNLQSDAMKVLTSMDLHPIEEVQTESGYCLDFVVNVDGE 950
Query: 558 KVAFEIDGPTHF-SRNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQEWEELQGSFEQLDYL 615
++ E+DGP HF R+ P G T+LKRR++ ++SL + E EL+ ++ YL
Sbjct: 951 ELGIEVDGPHHFVGRD---PTGSTLLKRRHVENVDRIPIISLPYWELNELETLDDKQLYL 1007
Query: 616 R 616
R
Sbjct: 1008 R 1008
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 48/217 (22%), Positives = 97/217 (44%), Gaps = 17/217 (7%)
Query: 238 ATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIG 297
A L +IA + K + +R + ++ L A+ ++N+A+A +
Sbjct: 344 AVTLCQIANSFAKA--------GYNDERLFQSISDATISILTSFDARHLANMAYAFALAR 395
Query: 298 GELLY---LSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIV 354
Y L+ D +A + ++ +Q++AN+ A+A++ H+ PDLF +A+ A +
Sbjct: 396 VNPRYDDGLTLFDDIANEFIPRLHTATTQHLANITWAYATIGHANPDLFGAVAEEAMGRL 455
Query: 355 HTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGV---K 411
F Q L + WA + ++ +L+ + A C ++ ++ +
Sbjct: 456 KEFSPQHLENLSWALSKFPHSSNEILDRIAEEVV-ARGLQCSTSQGIAMLAHSFATLNHA 514
Query: 412 SSGD--ADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQ 446
++GD E + SS V SF + IAW++A +G+
Sbjct: 515 TNGDFWECIENTASSRVSSFGVIECIQIAWAFATIGR 551
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/225 (23%), Positives = 95/225 (42%), Gaps = 41/225 (18%)
Query: 151 GYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNRRKEINLNKDIVDAQTAQ 210
G+ DL V+ A G +E + +E LS+F SN
Sbjct: 437 GHANPDLFGAVAEEAMGRLKEFSPQHLENLSWALSKFPHSSN------------------ 478
Query: 211 EVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSML 270
E+L+ IAE + A G S S + ++M+
Sbjct: 479 EILDRIAEEVVARGLQCSTS------------------QGIAMLAHSFATLNHATNGDFW 520
Query: 271 VAIAMTALPECSAQGIS---NIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVAN 327
I TA S+ G+ IAWA + IG + L + V+++K+ +FN Q ++N
Sbjct: 521 ECIENTASSRVSSFGVIECIQIAWAFATIGRKADDL--FRGIERVSMSKMDQFNPQGLSN 578
Query: 328 VAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
+A AF+++++ +P LF+ +A+ + + F+ QE A ++ A + +
Sbjct: 579 LAWAFSTLEYDSPTLFNAIAECSERKLDQFKPQEKAMLVLALSRI 623
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/187 (26%), Positives = 81/187 (43%), Gaps = 45/187 (24%)
Query: 274 AMTALPECSAQGISNIAWALSKI---GGELLYLSEMDRVAEVALTKVGEFN-SQNVANVA 329
AM L E S Q + N++WALSK E+L DR+AE + + + + SQ +A +A
Sbjct: 451 AMGRLKEFSPQHLENLSWALSKFPHSSNEIL-----DRIAEEVVARGLQCSTSQGIAMLA 505
Query: 330 GAFASMQHSAP-DLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFK 388
+FA++ H+ D + + AS V +F E Q+ WAFA++ AD L ++
Sbjct: 506 HSFATLNHATNGDFWECIENTASSRVSSFGVIECIQIAWAFATIGRKADDLFRGIERV-- 563
Query: 389 DATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMD 448
++S ++ FN L N+AW+++ L
Sbjct: 564 -----------SMSKMDQ----------------------FNPQGLSNLAWAFSTLEYDS 590
Query: 449 RIFFSDI 455
F+ I
Sbjct: 591 PTLFNAI 597
Score = 43.5 bits (101), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 5/68 (7%)
Query: 263 RQREMSMLVAIAMTALPECSAQGISNIAW--ALSKIGGELLYLSEMDRVAEVALTKVGEF 320
QR S L L C+ Q ++NIAW A+S + ELL+ DR +V +K+ EF
Sbjct: 812 HQRLFSALAPSVAEVLDTCNGQSLANIAWAFAVSNVNDELLF---SDRFVDVCSSKIDEF 868
Query: 321 NSQNVANV 328
NS+ + +
Sbjct: 869 NSEGLCQL 876
>gi|307105016|gb|EFN53267.1| hypothetical protein CHLNCDRAFT_137207 [Chlorella variabilis]
Length = 1782
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 89/389 (22%), Positives = 168/389 (43%), Gaps = 68/389 (17%)
Query: 246 KNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLY-LS 304
+ + +++ H L Q M++L+ AM LP+ Q +SN+ W+++ + E +
Sbjct: 336 QTISNLALAYAH-LGRKPQLLMALLMKEAMPLLPQFKPQELSNLLWSMASM--EFWHGPG 392
Query: 305 EMDRVAEVALTKVGEFNSQNVANVAGAFASMQH-SAPDLFSELAKRASDIVHTFQEQELA 363
++ + + A Q +AN A+A+M+ ++ + A + F+ QEL
Sbjct: 393 AVESITQAACGVADRMKPQEIANCCWAWATMRFFPGAEVLDLMLAHAEAQLDRFKSQELG 452
Query: 364 QVLWAFASL-YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSL 422
+ WA A L Y PA L+ + A ++ N A+ +C
Sbjct: 453 MLTWAVARLAYMPAASLVRA---CLPLAAEWR---NPAVQDC------------------ 488
Query: 423 SSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWK---TISR--FEEQRISEQYREDIMF 477
GN+ W++ VLG + S + ++ R F ++ + Y+ +
Sbjct: 489 ------------GNLLWAFTVLGILTPEVMSVLGHKMLSLPREAFTQEAYIQLYQAKMSL 536
Query: 478 ASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVT---SSFQKEVARLLVS 534
+ VH + ++ + ++ + G+T+ Q S+ ++VA +
Sbjct: 537 SQAVHDI---------------AAHIPPELLARGETEWRQQAAVLKVSATHRDVAAAMAE 581
Query: 535 TGLNW-IREYAVDGY-TVDAVLVDKKVAFEIDGPTHFSRNTG-VPLGHTMLKRRYIAAAG 591
G+ I DG +VD L ++VA E+DG HF++N VPLG T+ + R +A+ G
Sbjct: 582 LGIEHDIERRIEDGLVSVDIALRSERVAVEVDGSAHFTQNEPFVPLGRTLWRWRLLASRG 641
Query: 592 WNVVSLSHQEWEELQGSFEQLDYLRVILK 620
W VVS+ + W L+ E+ YL +L+
Sbjct: 642 WRVVSVPYFRWGLLRSMDEKKRYLYQLLQ 670
Score = 45.8 bits (107), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 29/114 (25%), Positives = 55/114 (48%), Gaps = 9/114 (7%)
Query: 280 ECSAQGISNIAWALSKIGGELLY-----LSEMDRVAEVALTKV---GEFNSQNVANVAGA 331
E QG++NI W + K+G ++ + + + R + LT G F QNV+N
Sbjct: 197 EFKPQGLANILWGMGKLGVKVSHEVRQMVDALCREVQAQLTHSRHKGSFAPQNVSNTLHG 256
Query: 332 FASMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLD 384
++ +P+L S L + A ++ +F QEL ++W+ + ++ P +D
Sbjct: 257 IVNIGIVPSPELLSALVRAADGMLRSFGAQELTNLVWSLSQMHRCGVPFTPDVD 310
>gi|397605332|gb|EJK58971.1| hypothetical protein THAOC_20861 [Thalassiosira oceanica]
Length = 2083
Score = 79.0 bits (193), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 91/353 (25%), Positives = 153/353 (43%), Gaps = 23/353 (6%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
+ +L Q +SN AWA + G L + + FN Q+++N+A AFA
Sbjct: 741 GLASLDSFKPQALSNTAWAFATAGESHPELFKKIGGHIAGPGSLCSFNPQDLSNIAWAFA 800
Query: 334 SMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDN---AFK 388
+ S +LF+++ + D + +F+ Q L+ WA+A+ L E L A K
Sbjct: 801 TAGVSHRELFNKIGHHVAGLDSLDSFEPQALSNTAWAYATARVFHSRLFEKLAKEVAARK 860
Query: 389 DATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMD 448
T + L C G + ++S + FN L NI W+Y+V
Sbjct: 861 GELIETQHIANFLWACATVGYTDERSFSAFAPVIASKLDKFNEQGLSNITWAYSVANLPR 920
Query: 449 RIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIA 508
+ F+ + + E+ S + A ++E + L L+ K
Sbjct: 921 QDLFNKGYVSALASNEKVFSGE-----QLAQLHQWQLWQQEMES---GIELPQSLQAKCR 972
Query: 509 SAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDG-YTVDAVLV---DKKVAFEID 564
+A ++ +++ S Q +V L + GL E ++ Y +DA++ +KVA E+D
Sbjct: 973 NAFTSRGYSE---SKLQNDVVGELKAAGLVLDEEVLLESWYLIDALVEFSDGRKVAVEVD 1029
Query: 565 GPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLR 616
GP+HF P G T+LK R +A VVS+ + EW+EL+ S + YLR
Sbjct: 1030 GPSHFIDMR--PTGSTILKHRQVARMDHIEVVSVPYWEWDELKNSEMKQHYLR 1080
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 82/178 (46%), Gaps = 20/178 (11%)
Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQ 364
D VA L + FN Q ++N A AFA+ S P+LF ++ + + + +F+ Q L
Sbjct: 659 DHVA--GLMSLNSFNPQALSNTAWAFATAGVSYPELFKKIGGHVAGLGSLDSFKAQALTN 716
Query: 365 VLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSE----- 419
++W+FA+ E L + + + +ALSN ++G++ E
Sbjct: 717 IVWSFATAGESNPKLFKKIGDYIAGLASLDSFKPQALSNTAW--AFATAGESHPELFKKI 774
Query: 420 -GSLSSP--VLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTIS------RFEEQRIS 468
G ++ P + SFN L NIAW++A G R F+ I ++ FE Q +S
Sbjct: 775 GGHIAGPGSLCSFNPQDLSNIAWAFATAGVSHRELFNKIGHHVAGLDSLDSFEPQALS 832
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 81/200 (40%), Gaps = 47/200 (23%)
Query: 303 LSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTF 357
LS +DR+A A+ + EF+++ ++N+ +F +H+ PD LF+ A I+HTF
Sbjct: 479 LSIIDRIASSAVGMLNEFDARCLSNLIYSFGLFEHN-PDIEGETLFNVFGDAAGKILHTF 537
Query: 358 QEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDAD 417
+ Q L+ +LWAF + L + E G V S D D
Sbjct: 538 ESQNLSNMLWAFVKVDAKHSRLFQ------------------------ETGRVISGMDLD 573
Query: 418 SEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFF-----SDIWKTISR--FEEQRISEQ 470
SF L NI WS+ G+ D F S KT++ +E R S
Sbjct: 574 ----------SFKPQALANILWSFTKSGKADPELFQALGNSHCRKTVAPCAVQEDRRSHC 623
Query: 471 YREDIMFASQVHLVNQCLKL 490
+ + F V CL +
Sbjct: 624 WTGQLEFIQAAGPVQYCLGV 643
Score = 40.0 bits (92), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 72/166 (43%), Gaps = 16/166 (9%)
Query: 206 AQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQR 265
++ E+ + I I G S +P NIA A + +HR F +
Sbjct: 764 GESHPELFKKIGGHIAGPGSLCSFNPQDLSNIAWAF---------ATAGVSHRELFNK-- 812
Query: 266 EMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEF-NSQN 324
+ VA + +L Q +SN AWA + + + +++A+ + GE +Q+
Sbjct: 813 -IGHHVA-GLDSLDSFEPQALSNTAWAYAT--ARVFHSRLFEKLAKEVAARKGELIETQH 868
Query: 325 VANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
+AN A A++ ++ FS A + + F EQ L+ + WA++
Sbjct: 869 IANFLWACATVGYTDERSFSAFAPVIASKLDKFNEQGLSNITWAYS 914
>gi|158702076|gb|ABW77414.1| RAP domain protein [Arabidopsis thaliana]
Length = 49
Score = 79.0 bits (193), Expect = 8e-12, Method: Composition-based stats.
Identities = 36/45 (80%), Positives = 40/45 (88%)
Query: 575 VPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
+PLGHTMLKRRY+AAAGW VVSLS QEWEE +GS EQL+YLR IL
Sbjct: 1 LPLGHTMLKRRYVAAAGWKVVSLSLQEWEEHEGSHEQLEYLREIL 45
>gi|323450314|gb|EGB06196.1| hypothetical protein AURANDRAFT_65882 [Aureococcus anophagefferens]
Length = 1499
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 89/355 (25%), Positives = 154/355 (43%), Gaps = 43/355 (12%)
Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQ--HSA 339
++Q I+N AWA ++ G L D +A VA + F SQ +AN+A A+A + A
Sbjct: 831 NSQNIANCAWAYARAGSRDTAL--FDALARVAEPLLDGFKSQELANLAWAYAKLNLVERA 888
Query: 340 PDLFSELAKRASDIVHTFQEQELAQVLWAFAS-------LYEPAD----PLLESLDNAFK 388
LF +LA+ A + + Q++ LWAFAS L+E A P L +LD F
Sbjct: 889 QVLFLQLARVAQAKLGRYNAQDVTNTLWAFASNDLEHVALFEAAARHAAPRLRALDRGFA 948
Query: 389 DATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMD 448
+ Q L + + A + G++ +L + + N+AW++A +G+ D
Sbjct: 949 N-PQKVATLAWSYAKAAVYAPALMDALAAACGAIVDELLPVD---VANVAWAFAAVGETD 1004
Query: 449 RIFFSDIWKTISRFEEQRISEQYREDIMFA-SQVHLVNQCLKLEHPHLQLALSSVLEEKI 507
R + K + +S Q +++++ S + C +L L + + + +
Sbjct: 1005 RGGLFEALKDRALAVLDDLSSQELANLVWSFSNLDDAAPCRELWLVLLDRGWTPAIFDDV 1064
Query: 508 ASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREY------------------AVDGYT 549
A K++ + + VA + G W R A G +
Sbjct: 1065 A---KSQLQQAYLRLTLDGAVAAVPPLDG-EWARALQAALTTSDCALGSRTQLEARSGLS 1120
Query: 550 VDAVLVDKKVAFEIDGPTHFSRNTGVPL-GHTMLKRRYIAAAGWNVVSLSHQEWE 603
+D + KVA E DGP H+ N L G + LKRR + GW++V + +++W+
Sbjct: 1121 LDMAKPELKVAVEFDGPVHYFANAKWMLTGRSKLKRRLLDLVGWDIVYVDYRDWD 1175
Score = 55.5 bits (132), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 53/217 (24%), Positives = 90/217 (41%), Gaps = 45/217 (20%)
Query: 190 PSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNM- 248
PS R KE+N + +T ++L ++ P+ L+ N TALHR++K
Sbjct: 255 PSARDKEVN-TLLLRKCKTVADILALVERE--------GPARLNTFNQVTALHRLSKAGL 305
Query: 249 -----------------------EKVSMMTTHRLAFTRQREMSMLVA----------IAM 275
+ + TT L T M V
Sbjct: 306 RLGRGGGEPLVEALVASVAGKIGARPGVFTTRHLVNTAYSLGKMKVTDARAYAAIATACG 365
Query: 276 TALPECSAQGISNIAWALS--KIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
L E +AQ ++N++WA + ++ + L+ + R+ A ++G F SQ ++N AFA
Sbjct: 366 PRLGEFNAQDVANLSWAYATAEVSDDADCLATLRRLPGAAQRELGSFTSQGLSNTVWAFA 425
Query: 334 SMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
+M AP+L + +A + + QELA +WA+A
Sbjct: 426 TMGLRAPELMAHVAAEGERRLGEYNAQELANTVWAYA 462
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 84/340 (24%), Positives = 130/340 (38%), Gaps = 60/340 (17%)
Query: 220 ITAVGKGLSPSPL---SPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMT 276
I V G P L SP N+A+ LH + VS + F+R +
Sbjct: 601 ICDVAAGDGPCSLDGFSPQNLASLLHAL-----TVSGFDAPDV-FSRAPPRVAAL----- 649
Query: 277 ALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQ 336
LP C+AQ ISN W+ + L + VA F +QNV+NVA +FA +
Sbjct: 650 -LPACNAQDISNTVWSFASNDIRDARLFDAVDAFLVAEGVPETFGAQNVSNVAWSFAKVA 708
Query: 337 HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESL-------DNAFKD 389
+ LF L A+ I+ F Q + L+AFA D S+ + A+
Sbjct: 709 MGSDALFGVLGDFAASIIDQFSNQNCSNTLYAFALANRRHDAFFRSMCGEIVRQEAAWSP 768
Query: 390 ATQFTCCLNKALSNCNENGGVKSSGDADSE-----------GSLSSPVLS---------- 428
+ Q N A + V +GD S+ ++P +
Sbjct: 769 SGQDIA--NSAWALATIGLTVAPAGDDKSQVKRRLEDGPDADYFATPAFAALSRAAVRVC 826
Query: 429 ---FNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVN 485
FN + N AW+YA G D F + + + S++ ++++LV
Sbjct: 827 GRGFNSQNIANCAWAYARAGSRDTALFDALARVAEPLLDGFKSQELANLAWAYAKLNLVE 886
Query: 486 QCLKLEHPHLQLALSSVLEEKIASAGKTKRFN-QKVTSSF 524
+ L LQLA ++A A K R+N Q VT++
Sbjct: 887 RAQVL---FLQLA-------RVAQA-KLGRYNAQDVTNTL 915
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 53/98 (54%), Gaps = 7/98 (7%)
Query: 278 LPECSAQGISNIAWALSKIGGELLYLSE---MDRVAEVALTKVGEFNSQNVANVAGAFAS 334
L E +AQ ++N WA +K G E S+ + +A AL K+G+FN QN+ N A AFA+
Sbjct: 446 LGEYNAQELANTVWAYAKCGAE----SQEPFLRAIARAALAKLGDFNPQNLTNTAWAFAT 501
Query: 335 MQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
P+LF +A + + F Q L+ WAFA +
Sbjct: 502 AGVVVPELFDGVAAASVRQLDVFNPQNLSNTGWAFAKV 539
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 4/89 (4%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
Q ++N AWA + G ++ D VA ++ ++ FN QN++N AFA + + LF
Sbjct: 490 QNLTNTAWAFATAG--VVVPELFDGVAAASVRQLDVFNPQNLSNTGWAFAKVGYYDARLF 547
Query: 344 SELAKRAS--DIVHTFQEQELAQVLWAFA 370
+A R + D++ F Q L+ V W+ A
Sbjct: 548 RAIAARVARDDVIGVFNPQNLSNVAWSLA 576
Score = 44.3 bits (103), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 50/102 (49%), Gaps = 14/102 (13%)
Query: 284 QGISNIAWALSKIGGE----------LLYLSEMDRVAEVAL----TKVGEFNSQNVANVA 329
Q +SN+AW+L+K E + Y + ++ +VA + F+ QN+A++
Sbjct: 566 QNLSNVAWSLAKRLTEGPEVHDGDEKVAYFDCLRKICDVAAGDGPCSLDGFSPQNLASLL 625
Query: 330 GAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFAS 371
A APD+FS R + ++ Q+++ +W+FAS
Sbjct: 626 HALTVSGFDAPDVFSRAPPRVAALLPACNAQDISNTVWSFAS 667
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 51/110 (46%), Gaps = 7/110 (6%)
Query: 270 LVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKV-----GEFNSQN 324
L +A L +AQ ++N WA + +L +++ + A A ++ G N Q
Sbjct: 895 LARVAQAKLGRYNAQDVTNTLWAFAS--NDLEHVALFEAAARHAAPRLRALDRGFANPQK 952
Query: 325 VANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYE 374
VA +A ++A AP L LA IV ++A V WAFA++ E
Sbjct: 953 VATLAWSYAKAAVYAPALMDALAAACGAIVDELLPVDVANVAWAFAAVGE 1002
>gi|397586873|gb|EJK53743.1| hypothetical protein THAOC_26753, partial [Thalassiosira oceanica]
Length = 447
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 84/316 (26%), Positives = 138/316 (43%), Gaps = 30/316 (9%)
Query: 315 TKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASL 372
T + F Q+ +N A AFA+ S P+LF ++ + + +++F Q L+ W+FA+
Sbjct: 33 TSLNSFKPQDFSNTAWAFATAGASHPELFKKIGNHLAGLMSLNSFNPQALSNTAWSFATA 92
Query: 373 YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSE-GSLSSPVLSFNR 431
L + + + F + LSN + G D S +PV+
Sbjct: 93 GISYPELFRKIGDHVAELGCFDSFKPQELSNT--VWACATIGHTDERLFSAFAPVIRSKL 150
Query: 432 DQ-----LGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRIS-EQYREDIMFASQVHLVN 485
D+ L NIAW+Y+V F++ + E S E++R+ + +
Sbjct: 151 DECSEQDLANIAWAYSVANLPRHDLFNEGYAGALASNENEFSVEEFRQLHQWQLWQQELQ 210
Query: 486 QCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV 545
++ L L K +A ++ F++ S Q +V L GL+ E +
Sbjct: 211 SGIE---------LPRSLRAKCRNAFTSRGFSE---SKLQNDVVDELRIAGLDLEEEVLL 258
Query: 546 -DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQ 600
GY +DA++ +KVA E+DGP HF P G T LK R +A VVS+ +
Sbjct: 259 GSGYRIDALVKVGDGRKVAIEVDGPFHFIDRR--PAGRTTLKHRQVATLDRIEVVSVPYW 316
Query: 601 EWEELQGSFEQLDYLR 616
EW+EL+ S + YLR
Sbjct: 317 EWDELKNSEMKQHYLR 332
Score = 42.4 bits (98), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 46/99 (46%), Gaps = 4/99 (4%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGEL--LYLSEMDRVAEVALTKVGEFNSQNVANVAGA 331
+ +L + Q +SN AW+ + G L+ D VAE+ F Q ++N A
Sbjct: 70 GLMSLNSFNPQALSNTAWSFATAGISYPELFRKIGDHVAELGC--FDSFKPQELSNTVWA 127
Query: 332 FASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
A++ H+ LFS A + EQ+LA + WA++
Sbjct: 128 CATIGHTDERLFSAFAPVIRSKLDECSEQDLANIAWAYS 166
>gi|224004716|ref|XP_002296009.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|209586041|gb|ACI64726.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1278
Score = 76.3 bits (186), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 95/438 (21%), Positives = 164/438 (37%), Gaps = 113/438 (25%)
Query: 264 QREMSMLVAIAMTALPECSAQGISNIAWALSKIG-------------------------- 297
QR +V S+QG+ N W+ +K G
Sbjct: 846 QRIAEHIVGNNGRGFSSFSSQGLGNTLWSFAKQGQLSLDVIELLGDSAKAVSTGRLAVYE 905
Query: 298 ------GELLYLSEMDRVAEVALT-KVGEFNSQNVANVAGAFASMQ--HSA------PDL 342
GE L AE L+ + F +Q+++N A+A++ HS +
Sbjct: 906 TSCLDIGEKLLKQLFAMAAEAGLSMNLDRFKTQDISNTCWAYATLGLLHSGFFNNVESQV 965
Query: 343 FSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDNAFKDATQFTCCLNKAL 401
S + S F+ QE+A +LW+FA+L +P ++++L +
Sbjct: 966 ISRIGSVPSKSRQIFRGQEMANILWSFATLNAQPQPAMVDALASYIA------------- 1012
Query: 402 SNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTI-- 459
+ C G D S L F R +L NIAWS AVLG+ + + ++ I
Sbjct: 1013 AGCRGKNGP----DEHSVSRL------FKRQELANIAWSCAVLGRYPKELMNILYTGIVG 1062
Query: 460 SRFEEQRISEQYREDIMFASQV---HLVNQCLKLEHPHLQLALSSVLEE----------- 505
+R + Q + + + ++ + S + + V +E P L+L L +
Sbjct: 1063 TRNDPQEMKQIFDDEGLQKSSIMTLYYVQVAADVEAPQLKLKLPNGFPNGWCDDGEGHSV 1122
Query: 506 KIASAGKTKRFNQK-------VTSSFQKEVARLLVSTGLNWIREYAVDG----------- 547
I+S G Q S Q++V++ G E+ +D
Sbjct: 1123 GISSKGDESDLAQVSSSMLTLTVSKLQRDVSKTFDRLGFENEMEHVIDTNEIKDEYGIQL 1182
Query: 548 -------YTVDAVLVDKKVAFEIDGPTHF-------SRNTGVPLGHTMLKRRYIAAAGWN 593
++D V+++V E+DGP HF R G T+LK R + GW+
Sbjct: 1183 PKTPQEFLSIDIANVEQRVGIEVDGPGHFVRLIDSKDRGDNRVNGPTLLKHRLLTHLGWD 1242
Query: 594 VVSLSHQEWEELQGSFEQ 611
++ L + E++ L G E+
Sbjct: 1243 IIHLPYWEYQSLGGGEEE 1260
Score = 45.8 bits (107), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 46/196 (23%), Positives = 90/196 (45%), Gaps = 15/196 (7%)
Query: 270 LVAIAMTALPEC---SAQGISNIAWALSKIGGELLYLSEM-DRVAEVALTKVGEFNSQNV 325
L IA +ALP AQ ++N+AW +++G ++ + VA+ ++ +F Q+V
Sbjct: 666 LETIADSALPRLERFKAQELNNLAWGFARLGHRTEKAEKLFEGVAKQLKQRIHQFKPQDV 725
Query: 326 ANVAGAFASMQHSAPDLFSELAKRAS-DIVHTFQEQELAQVLWAFASL-YEPADPLLESL 383
+F++ ++ D F A R + + +F+ QE++ +WA A+ + P + +
Sbjct: 726 GTTLWSFSTAEYFDLDAFRTGASRLNFQHIRSFKPQEMSNTVWALATAGFTPK--YIHAF 783
Query: 384 DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
D ATQ L++ ++ + E ++ P L F +L +I WS++
Sbjct: 784 DTTLVPATQ-----RPPLNDIKKDPITECFAAVAGE-AMRRP-LDFKDQELKDILWSFSK 836
Query: 444 LGQMDRIFFSDIWKTI 459
+G F I + I
Sbjct: 837 IGVRHPALFQRIAEHI 852
Score = 45.8 bits (107), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 22/74 (29%), Positives = 41/74 (55%), Gaps = 3/74 (4%)
Query: 301 LYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH---SAPDLFSELAKRASDIVHTF 357
L ++ +A+ AL ++ F +Q + N+A FA + H A LF +AK+ +H F
Sbjct: 661 LVFETLETIADSALPRLERFKAQELNNLAWGFARLGHRTEKAEKLFEGVAKQLKQRIHQF 720
Query: 358 QEQELAQVLWAFAS 371
+ Q++ LW+F++
Sbjct: 721 KPQDVGTTLWSFST 734
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 46/203 (22%), Positives = 81/203 (39%), Gaps = 33/203 (16%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDR-------------------------VAEVALTKVG 318
Q +SN WAL+ G Y+ D VA A+ +
Sbjct: 761 QEMSNTVWALATAGFTPKYIHAFDTTLVPATQRPPLNDIKKDPITECFAAVAGEAMRRPL 820
Query: 319 EFNSQNVANVAGAFASMQHSAPDLFSELAKRA----SDIVHTFQEQELAQVLWAFASLYE 374
+F Q + ++ +F+ + P LF +A+ +F Q L LW+FA +
Sbjct: 821 DFKDQELKDILWSFSKIGVRHPALFQRIAEHIVGNNGRGFSSFSSQGLGNTLWSFAKQGQ 880
Query: 375 PADPLLESLDNAFKDATQFTCCLNKALSNCNENGG--VKSSGDADSEGSLSSPVLSFNRD 432
+ ++E L ++ K + T L ++C + G +K +E LS + F
Sbjct: 881 LSLDVIELLGDSAKAVS--TGRLAVYETSCLDIGEKLLKQLFAMAAEAGLSMNLDRFKTQ 938
Query: 433 QLGNIAWSYAVLGQMDRIFFSDI 455
+ N W+YA LG + FF+++
Sbjct: 939 DISNTCWAYATLGLLHSGFFNNV 961
>gi|397566229|gb|EJK44967.1| hypothetical protein THAOC_36452, partial [Thalassiosira oceanica]
Length = 366
Score = 75.9 bits (185), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 92/339 (27%), Positives = 153/339 (45%), Gaps = 34/339 (10%)
Query: 299 ELLYLSEMDRVAEVALT--KVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI--V 354
E LS R+A ++L + FNSQN++N AFA+ S P+LF+++ + + +
Sbjct: 42 ECRILSCSRRLAIMSLDCDSLDSFNSQNLSNTVWAFATAGESHPELFNKIGNHIAGLASL 101
Query: 355 HTFQEQELAQVLWAFASLYEPADPLLESLDN-AFKDATQFTCCLNKALSNCNENGGVKSS 413
+F Q L+ +WA+A+ L E L A F + +SN
Sbjct: 102 GSFNPQNLSITVWAYATARVFHSRLFEKLTTEAVAKKDHFD---EQGVSNLLWACATVDY 158
Query: 414 GDADSEGSLSSPVLSF-----NRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRIS 468
D +L +P++ N +L NIAW+Y+V + F++ + + E+ S
Sbjct: 159 IDERLFSAL-APMIGLKLDKCNEQELANIAWAYSVANTPRQDLFNEGYVSALASNEKDFS 217
Query: 469 EQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEV 528
+ A +L+ + L L+ K +A + F++ S FQ +V
Sbjct: 218 AE-----GLAQLHQWQLWQQELKS---GIELPQSLQAKCRNAFTSHGFSE---SKFQNDV 266
Query: 529 ARLLVSTGLNWIREYAV--DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLK 583
L + GL+ + E A+ GY +DA++ +KVA E+DGP+HF P G T LK
Sbjct: 267 VYELKAAGLD-LDEEALFGSGYRIDALVKVGDGRKVAVEVDGPSHFIDRR--PAGSTTLK 323
Query: 584 RRYIAAAG-WNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
R +A VV + + EW+ L+ S + YL + L D
Sbjct: 324 HRQVARLDRIQVVPVPYWEWDNLKNSEMKQHYLHLKLSD 362
>gi|397606443|gb|EJK59317.1| hypothetical protein THAOC_20479, partial [Thalassiosira oceanica]
Length = 472
Score = 75.9 bits (185), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 89/342 (26%), Positives = 145/342 (42%), Gaps = 59/342 (17%)
Query: 284 QGISNIAWALSKIGGELLYLSEM--DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
Q +SN WA + G L M VAE L + F +Q ++N A A A+ S P+
Sbjct: 117 QDLSNTIWAFATAGVLHPELFNMIGHHVAE--LGSLDSFKAQALSNTAWALATAGVSHPE 174
Query: 342 LFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNK 399
LF+++ + + + +F+ QEL+ LWA AS+ + L +L +
Sbjct: 175 LFNKIGNHIAGLGSLDSFKPQELSNTLWACASVCYTDERLFSAL----------APVIAS 224
Query: 400 ALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTI 459
L C+E L N+AW+Y+V + F + + +
Sbjct: 225 KLDKCSEQ-------------------------DLANVAWAYSVANTPRQDLFDEGYVSA 259
Query: 460 SRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQK 519
E S + A +LE ++ L + K +A ++ +++
Sbjct: 260 LASNENEFSGK-----ELAQLHQWQLWQQELES---RIELQGPFQAKCRNAFTSRGYSE- 310
Query: 520 VTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGV 575
S Q +V L + GL E + GY +DA++ +KVA E+DGP+HF
Sbjct: 311 --SKLQNDVVDELKAAGLVLDEEVLLGSGYLIDALVEFNDGRKVAVEVDGPSHFIDRR-- 366
Query: 576 PLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLR 616
P G T+LK R +A VVS+ + EW+EL+ S + YLR
Sbjct: 367 PAGRTILKHRQVAKMDHIKVVSVPYWEWDELKNSEMKQRYLR 408
Score = 42.7 bits (99), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 48/107 (44%)
Query: 275 MTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFAS 334
+ +L AQ +SN AWAL+ G L L + F Q ++N A AS
Sbjct: 147 LGSLDSFKAQALSNTAWALATAGVSHPELFNKIGNHIAGLGSLDSFKPQELSNTLWACAS 206
Query: 335 MQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLE 381
+ ++ LFS LA + + EQ+LA V WA++ P L +
Sbjct: 207 VCYTDERLFSALAPVIASKLDKCSEQDLANVAWAYSVANTPRQDLFD 253
Score = 38.9 bits (89), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 39/152 (25%), Positives = 64/152 (42%), Gaps = 18/152 (11%)
Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVH--TFQEQELAQ 364
D VA L + FN Q ++N A AFAS + P+L ++ + + +F+ Q L+
Sbjct: 4 DHVA--GLDSLNSFNPQTLSNTAWAFASAEVPHPELLRKIGDHIAGQMSLISFEPQNLSN 61
Query: 365 VLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGD-ADSEGSLS 423
WA+A+ + L+S D T + A + + + K G+ G L
Sbjct: 62 TAWAYAAAGD-----LDSFDPKVLSITAWAF----ATAGVSHDELFKKIGNHVTGPGGLG 112
Query: 424 SPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDI 455
SF L N W++A G + F+ I
Sbjct: 113 ----SFKPQDLSNTIWAFATAGVLHPELFNMI 140
>gi|397589068|gb|EJK54518.1| hypothetical protein THAOC_25847, partial [Thalassiosira oceanica]
Length = 342
Score = 75.5 bits (184), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 91/323 (28%), Positives = 140/323 (43%), Gaps = 32/323 (9%)
Query: 314 LTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFAS 371
L + F QN++N A AFA+ S P LF ++ + + + F+ EL+ WAFA
Sbjct: 12 LDSLDSFKQQNLSNTAWAFATAGESHPGLFRKIGGHVAGLMSLDLFKPLELSNTAWAFAK 71
Query: 372 LYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSE-GSLSSPVLSFN 430
+ L + + + +ALSN + G D S +PV+
Sbjct: 72 AGKSNPKLFKKICDYIAGLDSLDSFDPQALSNI--VWACATVGYTDERLFSAFAPVIESK 129
Query: 431 RDQ-----LGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQY--REDIMFASQVHL 483
D+ L NI+W+Y+V + F++ + E+ SE+ + Q L
Sbjct: 130 LDECSEQHLANISWAYSVANLPKQDLFNEGYAGALASNEKDFSEEVLCQLHQWQLWQQEL 189
Query: 484 VNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREY 543
V L +E P +L + SAG ++ S Q +V L + GL E
Sbjct: 190 V---LGIELPE---SLQAKCRNAFTSAGYSE-------SKLQNDVVGELRAAGLVLDEEV 236
Query: 544 AV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAG-WNVVSLS 598
+ GY +DA++ +KVA E+DGP HF P G T LK R +A VVS+
Sbjct: 237 LLGSGYRIDALVKFGDGRKVAVEVDGPFHFIDRR--PAGSTTLKHRQVARLDRIEVVSVP 294
Query: 599 HQEWEELQGSFEQLDYLRVILKD 621
+ EW+EL+ S + YL V L D
Sbjct: 295 YWEWDELKNSEMKQHYLLVKLPD 317
Score = 40.4 bits (93), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 43/87 (49%), Gaps = 4/87 (4%)
Query: 286 ISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
+SN AWA +K G L+ D +A L + F+ Q ++N+ A A++ ++ LF
Sbjct: 62 LSNTAWAFAKAGKSNPKLFKKICDYIA--GLDSLDSFDPQALSNIVWACATVGYTDERLF 119
Query: 344 SELAKRASDIVHTFQEQELAQVLWAFA 370
S A + EQ LA + WA++
Sbjct: 120 SAFAPVIESKLDECSEQHLANISWAYS 146
>gi|397606466|gb|EJK59321.1| hypothetical protein THAOC_20474 [Thalassiosira oceanica]
Length = 282
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 134/320 (41%), Gaps = 55/320 (17%)
Query: 309 VAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVL 366
+A +AL + F ++N A AFA S P LF ++ + D + +F Q L+ ++
Sbjct: 7 IARLALGSLDLFKPLELSNTAWAFAKAGKSNPKLFKKICDYIAGLDSMDSFDPQALSNIV 66
Query: 367 WAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPV 426
WA A++ + L + F + L C+E
Sbjct: 67 WACATVGHTDERLF----------SAFAPVIASKLDECSEQ------------------- 97
Query: 427 LSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQ 486
L NIAW+Y+V + F++ + + E+ SE+ +
Sbjct: 98 ------HLANIAWAYSVANTPRQDLFNEGFVSALASNEKDFSEE-----VLCQLHQWQLW 146
Query: 487 CLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV- 545
+LE + L L+EK +A + +++ S Q +V L + GL E +
Sbjct: 147 QQELES---GIELPGSLQEKCRNAFTSASYSE---SKLQNDVVGELKAAGLVLDEEVLLG 200
Query: 546 DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAA-GWNVVSLSHQE 601
GY +DA++ +K+A E+DGP+HF P G T+LK+R + VV + + E
Sbjct: 201 SGYRIDALVKISDGRKLAVEVDGPSHFIDRR--PAGRTILKQRQVTRLDSIEVVPVPYWE 258
Query: 602 WEELQGSFEQLDYLRVILKD 621
W EL S + YLRV L +
Sbjct: 259 WNELMNSVMKQHYLRVKLSN 278
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 46/97 (47%), Gaps = 4/97 (4%)
Query: 286 ISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
+SN AWA +K G L+ D +A L + F+ Q ++N+ A A++ H+ LF
Sbjct: 23 LSNTAWAFAKAGKSNPKLFKKICDYIA--GLDSMDSFDPQALSNIVWACATVGHTDERLF 80
Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLL 380
S A + + EQ LA + WA++ P L
Sbjct: 81 SAFAPVIASKLDECSEQHLANIAWAYSVANTPRQDLF 117
>gi|302849501|ref|XP_002956280.1| hypothetical protein VOLCADRAFT_97290 [Volvox carteri f. nagariensis]
gi|300258392|gb|EFJ42629.1| hypothetical protein VOLCADRAFT_97290 [Volvox carteri f. nagariensis]
Length = 1331
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 58/94 (61%), Gaps = 1/94 (1%)
Query: 513 TKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRN 572
T F ++V S +Q+++A L L + E GY++D L ++A E DGPTH SR
Sbjct: 998 TSGFRRRVQSGYQRQMANSLTGLRLMHLLEDNCTGYSIDITLPQLRIALEADGPTHTSRT 1057
Query: 573 T-GVPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL 605
G LG T +KRR++ GW+V++++++EW++L
Sbjct: 1058 PGGAVLGATAMKRRHLQKMGWHVINVTYKEWDKL 1091
Score = 42.0 bits (97), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 46/197 (23%), Positives = 80/197 (40%), Gaps = 28/197 (14%)
Query: 291 WALSKIGGELLYLSEMDRVAEVAL----------------TKVGEFNSQNVANVAGAFAS 334
W +S +GG + +E + + + G + V A +
Sbjct: 713 WGMSSLGGSPYFQAETEAAVTILVRCLAAVAAAAGGTAATAASGGLSGWQAGQVLWALGN 772
Query: 335 MQHSAPDLFS-ELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLE-SLDNAFKDAT 391
+H+ P L E + S + + Q ++L++VLW FASL Y P LL D ++++ T
Sbjct: 773 SRHATPRLMDLETSILRSGGLSSMQPRDLSRVLWGFASLGYRPERLLLTIRPDWSWRERT 832
Query: 392 QFTCCLN---------KALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYA 442
T ++ +A + GG + G + S V SF QL + W+ A
Sbjct: 833 TATAVVSEDGRTSPKARARGKRSSRGGGRGGGGRGRQVVQSGDVRSFTPQQLSGVVWALA 892
Query: 443 VLGQMDRIFFSDIWKTI 459
V+ Q+D + F W +
Sbjct: 893 VMEQVDTVPFRSAWTQL 909
>gi|397612992|gb|EJK61975.1| hypothetical protein THAOC_17440 [Thalassiosira oceanica]
Length = 348
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 90/334 (26%), Positives = 147/334 (44%), Gaps = 31/334 (9%)
Query: 302 YLSEMDRVAE--VALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRAS--DIVHTF 357
YLS + + + L + F+ QN++N A AFA+ S P LF + + D + +F
Sbjct: 21 YLSALPGIGDHIAGLDNLDSFDLQNLSNTAWAFATSGMSNPKLFRMIGGHVAGLDSLDSF 80
Query: 358 QEQELAQVLWAFAS--LYEPADPLLESLDN---AFKDATQFTCCLNKALSNCNENGGVKS 412
+ Q+ + WA+A+ L+ P L E L A KD N L C G
Sbjct: 81 KPQDASITAWAYATARLFNPR--LFEKLATEMPARKDHFHGQAVAN-FLWACATVGYTDE 137
Query: 413 SGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYR 472
A ++S + + L NIAW+Y+V +F I+ E+ +E+
Sbjct: 138 RLFAAFAPLIASKLDECSEQDLANIAWAYSVENAPQDLFNEGYASAIASKEKDFSAEELL 197
Query: 473 EDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLL 532
+ + + ++L L K +A ++ +++ S Q +V L
Sbjct: 198 QLHQWQLWQQELESGIELPRS---------LRAKCRNAFTSQGYSE---SKLQNDVVGEL 245
Query: 533 VSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIA 588
+ GL+ E + GY +DA++ + VA E+DGP+HF P G T+LK R +A
Sbjct: 246 KAAGLDLEEEVLLGSGYRIDALVKFSDGRIVAVEVDGPSHFIDRR--PTGSTILKHRQVA 303
Query: 589 AAG-WNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
VVS+ EWE+L+ S + YLRV L +
Sbjct: 304 RLDRIEVVSVPFWEWEKLKNSEMKQHYLRVKLSN 337
>gi|397579135|gb|EJK51101.1| hypothetical protein THAOC_29762, partial [Thalassiosira oceanica]
Length = 285
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 82/316 (25%), Positives = 138/316 (43%), Gaps = 55/316 (17%)
Query: 313 ALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFA 370
L + F Q +N AFA+ S P LF ++A A+ D + +F QEL+ ++WA A
Sbjct: 7 GLDSLDSFKPQAFSNTVWAFATAGESNPKLFKKIANHAAGLDSLDSFTPQELSNIVWACA 66
Query: 371 SLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFN 430
++ + D +F C + ++ S + F
Sbjct: 67 TV-------------GYID-ERFFCAVAPMIA---------------------SKLDEFI 91
Query: 431 RDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKL 490
L +IAW+Y+V F + + + E+ S + A +L
Sbjct: 92 EQDLSHIAWAYSVANTPRLDLFDEGYASALASNEKEFSAE-----GLAQLHQWQLWQQEL 146
Query: 491 EHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVD-GYT 549
E ++L LS L+ K +A ++ +++ S Q +V L + GL+ E ++ GY
Sbjct: 147 ES-GIELPLS--LQAKCRNAFTSRGYSE---SKLQNDVVGELKAAGLDLDEEVLLESGYR 200
Query: 550 VDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEEL 605
+DA++ +KVA E+DGP+HF P G T LK R + VVS+ + EW++L
Sbjct: 201 IDALVKISDGRKVAVEVDGPSHFIDRR--PTGSTTLKHRQVERLDHIEVVSVPYWEWDKL 258
Query: 606 QGSFEQLDYLRVILKD 621
+ S + YLRV L +
Sbjct: 259 KNSEMKQHYLRVKLSN 274
>gi|145352343|ref|XP_001420509.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580743|gb|ABO98802.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1070
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 65/223 (29%), Positives = 97/223 (43%), Gaps = 43/223 (19%)
Query: 267 MSMLVAIAMTALPECSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALT-KVGEFNS 322
+ L A L +AQG++N W+ SK G EL S+ R E +T EFNS
Sbjct: 315 FTTLAKHAERHLSALNAQGLTNTVWSFSKCGHLDAELF--SKFGRSIERRMTANASEFNS 372
Query: 323 QNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL--YEPA---- 376
Q++AN A AF H LF+ LA + + F Q+L WAFA L Y+
Sbjct: 373 QDIANTAWAFGKACHHDEKLFTSLASLSERCLADFNTQDLVNTTWAFAKLGRYDAKLFVA 432
Query: 377 ------DPLLESLDN--------AFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSL 422
D L LD F A+Q + L AL++ E+ AD
Sbjct: 433 ARKSILDHRLNDLDAPNIANIVWTFDQASQLSEALFVALASAAEH-------QAD----- 480
Query: 423 SSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQ 465
+FN L N+AW++A GQ++ F+ + +++ R ++
Sbjct: 481 -----NFNAQDLVNVAWTFANSGQVNDALFTALARSVKRLMDE 518
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/191 (26%), Positives = 85/191 (44%), Gaps = 14/191 (7%)
Query: 278 LPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH 337
L E + QG+SN AW +K G + + +++ A ++ +FN+Q+ +N+ AFA
Sbjct: 252 LGEFNTQGLSNTAWGFAKSG--FVDVGLFRAMSQKAQERLDDFNAQDFSNLIYAFAKAGQ 309
Query: 338 SAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFK-----DATQ 392
LF+ LAK A + Q L +W+F+ L + + +A++
Sbjct: 310 YDAKLFTTLAKHAERHLSALNAQGLTNTVWSFSKCGHLDAELFSKFGRSIERRMTANASE 369
Query: 393 FTCCLNKALSNCNENGGVKSSGDA---DSEGSLSSPVLS-FNRDQLGNIAWSYAVLGQMD 448
F ++ ++N G D S SLS L+ FN L N W++A LG+ D
Sbjct: 370 FN---SQDIANTAWAFGKACHHDEKLFTSLASLSERCLADFNTQDLVNTTWAFAKLGRYD 426
Query: 449 RIFFSDIWKTI 459
F K+I
Sbjct: 427 AKLFVAARKSI 437
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 63/242 (26%), Positives = 93/242 (38%), Gaps = 56/242 (23%)
Query: 283 AQGISNIAWALSKIGGELLYL-SEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
Q ++NIAWA +K + L S + R+AE + FNSQ + N AFAS+ H+
Sbjct: 183 GQELANIAWAFAKADYKCERLFSALARMAERHAER---FNSQELTNTCWAFASVGHADAR 239
Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFA-----------SLYEPADPLLESLDN----- 385
LF LA+ + F Q L+ W FA ++ + A L+ +
Sbjct: 240 LFKALARCVERRLGEFNTQGLSNTAWGFAKSGFVDVGLFRAMSQKAQERLDDFNAQDFSN 299
Query: 386 ---AFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYA 442
AF A Q+ L L+ +E LS+ N L N WS++
Sbjct: 300 LIYAFAKAGQYDAKLFTTLAK-------------HAERHLSA----LNAQGLTNTVWSFS 342
Query: 443 VLGQMDRIFFSDIWKTISRFEEQRISEQYREDI----------------MFASQVHLVNQ 486
G +D FS ++I R SE +DI +F S L +
Sbjct: 343 KCGHLDAELFSKFGRSIERRMTANASEFNSQDIANTAWAFGKACHHDEKLFTSLASLSER 402
Query: 487 CL 488
CL
Sbjct: 403 CL 404
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 49/188 (26%), Positives = 73/188 (38%), Gaps = 24/188 (12%)
Query: 286 ISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSE 345
++N+A +K GG + + + E L + Q +AN+A AFA + LFS
Sbjct: 149 LANVAHGAAKGGGSEELFAALAKAIERHLGGIDR--GQELANIAWAFAKADYKCERLFSA 206
Query: 346 LAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCN 405
LA+ A F QEL WAFAS+ L ++L C+ + L N
Sbjct: 207 LARMAERHAERFNSQELTNTCWAFASVGHADARLFKALAR----------CVERRLGEFN 256
Query: 406 ENG------GVKSSGDADS------EGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFS 453
G G SG D + FN N+ +++A GQ D F+
Sbjct: 257 TQGLSNTAWGFAKSGFVDVGLFRAMSQKAQERLDDFNAQDFSNLIYAFAKAGQYDAKLFT 316
Query: 454 DIWKTISR 461
+ K R
Sbjct: 317 TLAKHAER 324
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 48/96 (50%), Gaps = 6/96 (6%)
Query: 278 LPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASM 335
L + A I+NI W + E L+++ +A A + FN+Q++ NVA FA+
Sbjct: 442 LNDLDAPNIANIVWTFDQASQLSEALFVA----LASAAEHQADNFNAQDLVNVAWTFANS 497
Query: 336 QHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFAS 371
LF+ LA+ ++ F ++EL + WAF +
Sbjct: 498 GQVNDALFTALARSVKRLMDEFSDEELNNLEWAFTT 533
>gi|397639871|gb|EJK73811.1| hypothetical protein THAOC_04541, partial [Thalassiosira oceanica]
Length = 292
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 84/298 (28%), Positives = 139/298 (46%), Gaps = 22/298 (7%)
Query: 328 VAGAFASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDN 385
++ AFA+ + +LF ++A + D + +F Q ++ + WAFA+ L E L
Sbjct: 1 ISWAFATARVPHAELFEKIAYHIAGLDSLDSFTAQNVSNIAWAFATAKIYHSHLFEKLAE 60
Query: 386 AFKDATQFTCCLNKA--LSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
A +FT N A L C G + +SS + F+ Q+ N++W+Y+V
Sbjct: 61 AAARKGRFTDTTNIATFLWACATVGYTIERLFSGFALIISSKLDEFSDQQISNVSWAYSV 120
Query: 444 LGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVL 503
M F+ + +E+ S +E + Q L Q L E ++L LS L
Sbjct: 121 ANVMSEGLFNKGYAGALASKEKHFS---KEGLTQLHQWQLWQQELGSE---IELPLS--L 172
Query: 504 EEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKV 559
+K A + +++ S Q +V + + GL+ E + GY +DAV+ KKV
Sbjct: 173 RKKCRHAFISTSYSE---SKLQNDVVGGVRAIGLDLDEEVLLGSGYRIDAVVKVGHGKKV 229
Query: 560 AFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLDYLR 616
A E+DGP+H+ P G T+LKRR + VV++ + EW EL+ + + YLR
Sbjct: 230 AVEVDGPSHYIHRR--PTGSTILKRRQVTRLDLIEVVTVPYWEWGELKSTKMKQLYLR 285
Score = 46.2 bits (108), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 59/113 (52%), Gaps = 8/113 (7%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEF-NSQNVANVAGAF 332
+ +L +AQ +SNIAWA + ++ + +++AE A K G F ++ N+A A
Sbjct: 25 GLDSLDSFTAQNVSNIAWAFAT--AKIYHSHLFEKLAEAAARK-GRFTDTTNIATFLWAC 81
Query: 333 ASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDN 385
A++ ++ LFS A S + F +Q+++ V WA Y A+ + E L N
Sbjct: 82 ATVGYTIERLFSGFALIISSKLDEFSDQQISNVSWA----YSVANVMSEGLFN 130
>gi|255084111|ref|XP_002508630.1| predicted protein [Micromonas sp. RCC299]
gi|226523907|gb|ACO69888.1| predicted protein [Micromonas sp. RCC299]
Length = 1128
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 53/189 (28%), Positives = 83/189 (43%), Gaps = 19/189 (10%)
Query: 282 SAQGISNIAWALSKIGGELLYLSE--MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
+AQG++N W+ +K G +L E A K+ +FNSQ++AN A AFA H
Sbjct: 299 NAQGLANTVWSFAKAG----HLDEGLFKGFASQVRRKLKDFNSQDLANTAWAFAKACHPD 354
Query: 340 PDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDA--------- 390
LF+ ++ + F Q+L WAFA L L ++ F+D+
Sbjct: 355 ESLFASISGACVACLDDFNAQDLVNTAWAFAKLGHFDQSLFAAVARRFRDSGAMNDDQLG 414
Query: 391 TQFTCCLNKALSNCNENGGVKSSGDA----DSEGSLSSPVLSFNRDQLGNIAWSYAVLGQ 446
QF + A S +E G ++ + D + + V F L N+AW++A Q
Sbjct: 415 AQFIANVAWAFSKASEAGKLEQATSEELFRDLATAAEASVADFTAADLANVAWAFANANQ 474
Query: 447 MDRIFFSDI 455
MD F +
Sbjct: 475 MDPTLFQSL 483
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/206 (24%), Positives = 87/206 (42%), Gaps = 38/206 (18%)
Query: 280 ECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
+C+AQ ++N++WA +K ++ D +++ L K NSQ + N+A AFA+ +
Sbjct: 147 DCNAQELANVSWAFAK-ADHCADVALFDALSKATLAKASACNSQELTNLAWAFATAGRTQ 205
Query: 340 PD-LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLN 398
+ LF+ LAK + +F Q L+ WAFA + L +++ A +
Sbjct: 206 DEALFASLAKAVEHTLASFTSQGLSNTAWAFAKVGHLEATLFKAISLAAR---------- 255
Query: 399 KALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKT 458
S + +FN N AW++A LGQ D F+ + K
Sbjct: 256 -------------------------SKLKTFNAQDFANTAWAFAKLGQFDGELFTALAKD 290
Query: 459 ISRFEEQRISEQYREDIM-FASQVHL 483
+R E ++ + FA HL
Sbjct: 291 AARHGEGHNAQGLANTVWSFAKAGHL 316
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 81/185 (43%), Gaps = 19/185 (10%)
Query: 281 CSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH 337
C++Q ++N+AWA + G E L+ S +A+ + F SQ ++N A AFA + H
Sbjct: 186 CNSQELTNLAWAFATAGRTQDEALFAS----LAKAVEHTLASFTSQGLSNTAWAFAKVGH 241
Query: 338 SAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCL 397
LF ++ A + TF Q+ A WAFA L + L +L KDA +
Sbjct: 242 LEATLFKAISLAARSKLKTFNAQDFANTAWAFAKLGQFDGELFTALA---KDAARHGEGH 298
Query: 398 NKALSNCNENGGVKSSGDADSEG---SLSSPVL----SFNRDQLGNIAWSYAVLGQMDRI 450
N A N +G D EG +S V FN L N AW++A D
Sbjct: 299 N-AQGLANTVWSFAKAGHLD-EGLFKGFASQVRRKLKDFNSQDLANTAWAFAKACHPDES 356
Query: 451 FFSDI 455
F+ I
Sbjct: 357 LFASI 361
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 54/96 (56%), Gaps = 5/96 (5%)
Query: 280 ECSAQGISNIAWALSKI--GGELLYLS--EMDR-VAEVALTKVGEFNSQNVANVAGAFAS 334
+ AQ I+N+AWA SK G+L + E+ R +A A V +F + ++ANVA AFA+
Sbjct: 412 QLGAQFIANVAWAFSKASEAGKLEQATSEELFRDLATAAEASVADFTAADLANVAWAFAN 471
Query: 335 MQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
P LF LA RA + + F ++EL WAFA
Sbjct: 472 ANQMDPTLFQSLANRAENFLDDFNDEELDNAEWAFA 507
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 59/120 (49%), Gaps = 9/120 (7%)
Query: 275 MTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAF 332
+ L + +AQ + N AWA +K+G + L+ + R + + +Q +ANVA AF
Sbjct: 366 VACLDDFNAQDLVNTAWAFAKLGHFDQSLFAAVARRFRDSGAMNDDQLGAQFIANVAWAF 425
Query: 333 ASM-------QHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDN 385
+ Q ++ +LF +LA A V F +LA V WAFA+ + L +SL N
Sbjct: 426 SKASEAGKLEQATSEELFRDLATAAEASVADFTAADLANVAWAFANANQMDPTLFQSLAN 485
>gi|384244813|gb|EIE18310.1| hypothetical protein COCSUDRAFT_60280 [Coccomyxa subellipsoidea
C-169]
Length = 1075
Score = 72.0 bits (175), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 95/391 (24%), Positives = 161/391 (41%), Gaps = 67/391 (17%)
Query: 275 MTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFAS 334
M + + + Q N AWA++K+G + L M+ + + A + F+ Q ++N+ A A+
Sbjct: 695 MCRMAQATPQHFGNAAWAMAKLGHDPLQGRFMNALIKQAFPQRSRFHRQELSNILWALAT 754
Query: 335 MQHSAP--------DLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNA 386
+QH P D F+ LA E+ LA + WA A L +PL L NA
Sbjct: 755 LQHELPENILRDVSDEFARLALAQLGSAEPGWERHLANMAWACARLR--VNPLGGGLLNA 812
Query: 387 -----FKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSL--------SSPVLSFNRDQ 433
+ F+ + ++N G+ + L S+ + +
Sbjct: 813 ACAELVTNPGNFSV---QNMANIVLAAGILQHPFPQAAVDLVLGELQQRSAGSRALPHQE 869
Query: 434 LGNIAWSYAVLGQMDRIFFSDIWKTIS------RFEEQRISEQYREDIMFASQVHLVNQC 487
NI W A L Q+ R I ++ +F + ++ + D+M V + Q
Sbjct: 870 ACNILWGLAALDQLTRAQLEHIAGQLAAAAAADKFTKAEANQLRQADLM----VRAMEQS 925
Query: 488 LKLEHPH-LQLALSSVLE-EKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV 545
E P L AL + + ++I S TS QK+V+ L G+ E +
Sbjct: 926 GGQEMPSCLPPALQQLADGDQIIS-----------TSRLQKDVSETLSELGVPHTVEGRI 974
Query: 546 DGYTVDAVLVD--------KKVAFEIDGPTHFSRNTGVP---LGHTMLKRRYIAAAGWNV 594
+ VD +A E+DGP+HF+ P LGHT+L+ R + A G V
Sbjct: 975 SHPSFGPATVDILIEVPGQPPMALEVDGPSHFA--ALAPHQNLGHTVLRNRLLEARGAKV 1032
Query: 595 VSLSHQ----EWEELQGSFE-QLDYLRVILK 620
V + + W ++QG + +++YL IL+
Sbjct: 1033 VQIPFRIEGKRWADIQGDMDSRIEYLTGILE 1063
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 87/191 (45%), Gaps = 26/191 (13%)
Query: 270 LVAIAMTALPECSAQGISNIAWALSKIG----GELLYLSEMDRVAEVALTKVGEFNSQNV 325
+ A+A LP+ S Q I+N+A+ L+ + ELL VAE AL ++ +S V
Sbjct: 575 VFAVAPKVLPDASFQNIANLAYGLAILNHSAPPELLTA-----VAEAALLRMPSASSHGV 629
Query: 326 ANVAGAFASMQHS--APDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESL 383
+N+ A+A M S LF A + + F Q LA LW+ A + A P E+L
Sbjct: 630 SNLLWAYAKMGTSPLGGQLFRSALAHARENLDKFSVQHLANTLWSLAVVQHEASP--EAL 687
Query: 384 DNAFKDATQFTCCLNKALSN--CNENGGVKSSGDADSEGSLSSPVLS--------FNRDQ 433
D+ F +A F C + +A N + G +G + ++ F+R +
Sbjct: 688 DS-FAEA--FMCRMAQATPQHFGNAAWAMAKLGHDPLQGRFMNALIKQAFPQRSRFHRQE 744
Query: 434 LGNIAWSYAVL 444
L NI W+ A L
Sbjct: 745 LSNILWALATL 755
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 57/127 (44%), Gaps = 14/127 (11%)
Query: 269 MLVAIAMTAL---PECSAQGISNIAWALSKIG----GELLYLSEMDRVAEVALTKVGEFN 321
+L A+A AL P S+ G+SN+ WA +K+G G L+ S + E + +F+
Sbjct: 609 LLTAVAEAALLRMPSASSHGVSNLLWAYAKMGTSPLGGQLFRSALAHARE----NLDKFS 664
Query: 322 SQNVANVAGAFASMQHSA-PDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLL 380
Q++AN + A +QH A P+ A+ + Q WA A L DPL
Sbjct: 665 VQHLANTLWSLAVVQHEASPEALDSFAEAFMCRMAQATPQHFGNAAWAMAKLGH--DPLQ 722
Query: 381 ESLDNAF 387
NA
Sbjct: 723 GRFMNAL 729
>gi|384245914|gb|EIE19406.1| hypothetical protein COCSUDRAFT_48936 [Coccomyxa subellipsoidea
C-169]
Length = 516
Score = 72.0 bits (175), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 141/332 (42%), Gaps = 18/332 (5%)
Query: 302 YLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP---DLFSELAKRASDIVHTFQ 358
Y +D + L + ++ + ++ A A +H + L +A A D+V TF
Sbjct: 195 YTMLLDAIVGQVLRSFKDLDASGLVSLTHALAETEHDSEGTGKLLKAIAAGALDLVPTFS 254
Query: 359 EQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADS 418
+LA +L +F+ L +P+ + + A + S+ V + +
Sbjct: 255 PGQLASLLASFSHLRHYDEPMYRVISR--QAAPTVAALEPQQRSDLLHALAVVGHDEPEL 312
Query: 419 EGSLSSPVL----SFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYRED 474
+L +L + L ++ WS AVL Q+ F + +R E+ + E+
Sbjct: 313 VAALRDHLLEDAGQLSGCALCDVLWSLAVLDQLSPDAFR---RMCARLEQLPLGAFEPEN 369
Query: 475 IMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVS 534
QV + Q + P L + L + + ASA + + F + + Q+ + R L
Sbjct: 370 FQQLYQVQRMVQAAS-QDP-LTVQLPTWIWAYAASAWQDRLFAESNFTPLQQSICRTLAD 427
Query: 535 TGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRN-TGVPLGHTMLKRRYIAAAGWN 593
G+ W E + T A+L+ KVA +GPT +S + PLG T+ RR + GW
Sbjct: 428 LGV-WHEEKFLQNMT-SAILLRDKVAIHPEGPTLYSSSWPRRPLGETLAVRRTLTRHGWT 485
Query: 594 VVSLSHQEWEELQGSFEQLDYLRVILKDYIGG 625
VV L+ EW L S ++ YLR +L D G
Sbjct: 486 VVPLAKHEWMAL-ASHKRAAYLRKLLDDAGAG 516
>gi|397575811|gb|EJK49902.1| hypothetical protein THAOC_31172, partial [Thalassiosira oceanica]
Length = 363
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 90/353 (25%), Positives = 137/353 (38%), Gaps = 50/353 (14%)
Query: 277 ALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQ 336
+L S+Q +SN AWA + G L + L + F QN++N A AFA+
Sbjct: 49 SLDSFSSQALSNTAWAFAAAGVSHPVLLKKIGNHIAGLDSLNSFKPQNLSNTAWAFATAG 108
Query: 337 HSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQF- 393
S P LF ++ + + + +F+ QEL+ V WA+A+ L E +F
Sbjct: 109 ASHPTLFKKIGDHVARLGSLDSFKPQELSNVAWAYATARRFDLGLFEKFTEVSARKGEFL 168
Query: 394 -TCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFF 452
T + L C G + S + N L NIAW+Y+V + F
Sbjct: 169 ETQHIANFLWACATVGHTDERLFGAFAPVIGSKLDECNEQVLANIAWAYSV-ANAPQDLF 227
Query: 453 SDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGK 512
S+ + + E+ S + QLA +
Sbjct: 228 SEGYVSAFALNEKEFSGE-------------------------QLAQLHQWQLWQQELES 262
Query: 513 TKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLV---DKKVAFEIDGPTHF 569
Q ++EV LL S GY +DA++ +KVA E+DGP HF
Sbjct: 263 GIELPQAAGFELEEEV--LLGS------------GYRIDALVKVGDGRKVAVEVDGPFHF 308
Query: 570 SRNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
P G T+LK R + VVS+ + EW++L S + YLR L +
Sbjct: 309 IDRR--PAGRTILKHRQVVRLDRIKVVSVPYWEWDKLMSSETKQHYLRAKLSN 359
>gi|308809477|ref|XP_003082048.1| Kynurenine 3-monooxygenase and related flavoprotein monooxygenases
(ISS) [Ostreococcus tauri]
gi|116060515|emb|CAL55851.1| Kynurenine 3-monooxygenase and related flavoprotein monooxygenases
(ISS) [Ostreococcus tauri]
Length = 1077
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 97/221 (43%), Gaps = 39/221 (17%)
Query: 267 MSMLVAIAMTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQN 324
+ L A A L S QG++N WA +K G + L+ + + T +FNSQ+
Sbjct: 339 FTTLAAHADRHLSTLSTQGLTNAVWAFAKAGHLDDALFTAFAKSIERRMSTGASDFNSQD 398
Query: 325 VANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPL----- 379
+AN A AFA H +LF+ LA+ A + F Q+L WAFA L + + L
Sbjct: 399 MANTAWAFAKACHLDDNLFTALARLAETCLDDFNTQDLVNTTWAFAKLGKYDEKLFIAAR 458
Query: 380 -------LESLDN--------AFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSS 424
L+ LD +F A+Q L AL+ E V++
Sbjct: 459 KSILNNRLDDLDAPNTANIAWSFDKASQLDKRLFDALARTAE---VRAD----------- 504
Query: 425 PVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQ 465
F+ L N+AW++A GQ++ F+ + +++ R E+
Sbjct: 505 ---EFSAVDLANVAWTFANTGQVNDNLFTALARSVERLMEE 542
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/263 (24%), Positives = 110/263 (41%), Gaps = 49/263 (18%)
Query: 283 AQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
A+ ++N+A +K G + M+ +A ++ N+Q +AN+A AFA +H+ L
Sbjct: 168 ARELANVAHGAAKCGRGSTDATLMETLARAIEGELERCNAQELANIAWAFAKAEHADERL 227
Query: 343 FSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALS 402
F L K AS F QEL + WAFA++ Q L KALS
Sbjct: 228 FLALEKMASTKAEQFNPQELTNMTWAFATV------------------GQGNARLFKALS 269
Query: 403 NCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRF 462
C E + + D ++G L N AW++A G +D + +++ +S+
Sbjct: 270 RCVE----RRAEDFSTQG-------------LSNTAWAFAKSGYVD----AGLFRALSQS 308
Query: 463 EEQRISEQYREDI-----MFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASA----GKT 513
+QR+ +D FA + H LS++ + + +A K
Sbjct: 309 AQQRLDGFNAQDFSNLVWAFAKASQYDAKLFTTLAAHADRHLSTLSTQGLTNAVWAFAKA 368
Query: 514 KRFNQKVTSSFQKEVARLLVSTG 536
+ + ++F K + R + STG
Sbjct: 369 GHLDDALFTAFAKSIERRM-STG 390
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 51/188 (27%), Positives = 81/188 (43%), Gaps = 16/188 (8%)
Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
S QG+SN AWA +K G + +++ A ++ FN+Q+ +N+ AFA
Sbjct: 280 STQGLSNTAWAFAKSG--YVDAGLFRALSQSAQQRLDGFNAQDFSNLVWAFAKASQYDAK 337
Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFK-----DATQFTC- 395
LF+ LA A + T Q L +WAFA D L + + + A+ F
Sbjct: 338 LFTTLAAHADRHLSTLSTQGLTNAVWAFAKAGHLDDALFTAFAKSIERRMSTGASDFNSQ 397
Query: 396 -CLNKALS---NCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIF 451
N A + C+ + + ++ +E L FN L N W++A LG+ D
Sbjct: 398 DMANTAWAFAKACHLDDNLFTALARLAETCLD----DFNTQDLVNTTWAFAKLGKYDEKL 453
Query: 452 FSDIWKTI 459
F K+I
Sbjct: 454 FIAARKSI 461
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 48/93 (51%), Gaps = 2/93 (2%)
Query: 278 LPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH 337
L + A +NIAW+ K L D +A A + EF++ ++ANVA FA+
Sbjct: 466 LDDLDAPNTANIAWSFDK--ASQLDKRLFDALARTAEVRADEFSAVDLANVAWTFANTGQ 523
Query: 338 SAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
+LF+ LA+ ++ F ++EL + WAFA
Sbjct: 524 VNDNLFTALARSVERLMEEFSDEELDNLEWAFA 556
>gi|307108730|gb|EFN56969.1| hypothetical protein CHLNCDRAFT_143558 [Chlorella variabilis]
Length = 1244
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 130/329 (39%), Gaps = 53/329 (16%)
Query: 304 SEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELA 363
+ +D A + + V + A+A+M+H P LF L +RA D+ + + +A
Sbjct: 941 ATLDDAAARCIPLAPRMSGGEVGTLMWAYATMRHVHPGLFKALLERADDLAGSLTWRGIA 1000
Query: 364 QVLWAFASLYE-PADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSL 422
V+WA A + P PL + L + L
Sbjct: 1001 IVMWACAVTRQAPPRPLADRLVERYM--------------------------------PL 1028
Query: 423 SSPVLSFNRD--QLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQ 480
P ++ D L N+AW V + F+ + + + + + + A+
Sbjct: 1029 FHPRMAQGVDLHSLANVAWGLTVFDYLTPDRFAQLTGMVPPHDAAAL-----DSVNTAAW 1083
Query: 481 VHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQ--KEVARLLVSTGLN 538
L L LE Q + + + A K + TS Q +VA +L S +
Sbjct: 1084 CQLFQCALYLEAKTGQHYSAFLPPHILPYAEKHWQARDTTTSRLQARNKVADVLHSLEVP 1143
Query: 539 WIREYA--VDGYTVDAVLVDK---KVAFEIDGPTHFSRNTG-VPLGHTMLKRRYIAAAGW 592
+ EY+ + + +D + ++A E+DGP HFS N +PL T ++ + +A GW
Sbjct: 1144 FAEEYSPRANFFGIDIAIQGSNGVRLAVEVDGPQHFSSNPPHMPLASTYMRNKLLAMHGW 1203
Query: 593 NVVSLSHQEWEELQGSFEQ-----LDYLR 616
VVS+ EW L G E+ +DYLR
Sbjct: 1204 EVVSIPFNEWARLAGLQEKQARLAVDYLR 1232
Score = 40.4 bits (93), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 27/118 (22%), Positives = 49/118 (41%), Gaps = 6/118 (5%)
Query: 264 QREMSMLVAIAMTALPECSAQGISNIAWALSKIGGEL----LYLSEMDRVAEVALTKVGE 319
Q M L A+ LPE + ++ W+L K+G +L ++ + V A ++ +
Sbjct: 360 QAVMDHLSLAALAFLPEVEHTHLGSLVWSLGKLGTKLGAARIHTPVLHAVVATAWRRLHD 419
Query: 320 FNSQNVANVAGAFASMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEP 375
+ N F + H D A R +++ Q++ +W+FA L Y P
Sbjct: 420 LTPDALCNTLYGFGLLNFHPGSDFLDAAAARFKELLPYMSAQQVGNCVWSFARLEYSP 477
>gi|412994018|emb|CCO14529.1| predicted protein [Bathycoccus prasinos]
Length = 1083
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/186 (28%), Positives = 74/186 (39%), Gaps = 41/186 (22%)
Query: 278 LPECSAQGISNIAWALSKIGGELLYLSE--MDRVAEVALTKVGEFNSQNVANVAGAFASM 335
L EC+ Q I+NIAWA +K G Y +AE+A ++ FNSQ + NV AFA+
Sbjct: 201 LAECNGQEIANIAWAFAKSG----YFDPGMFANLAEMAEKQMDRFNSQEITNVFWAFATA 256
Query: 336 QHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTC 395
+ LF LAK +H F Q L+ WA + + + DAT F
Sbjct: 257 ECDNAKLFKALAKAIDGQLHGFNSQGLSNTAWALSKI-------------GYVDATLFRT 303
Query: 396 CLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDI 455
A N + FN N+ W++A GQ D F+ +
Sbjct: 304 IAQTAQKNMDR----------------------FNAQDFSNLCWAFAKAGQYDAELFTTL 341
Query: 456 WKTISR 461
K R
Sbjct: 342 AKNAER 347
Score = 65.9 bits (159), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 62/250 (24%), Positives = 106/250 (42%), Gaps = 35/250 (14%)
Query: 238 ATALHRIA----KNMEKVSMMTTHRL--AFTR--QREMSMLVAIAMTA---LPECSAQGI 286
AT IA KNM++ + L AF + Q + + +A A + +AQG+
Sbjct: 298 ATLFRTIAQTAQKNMDRFNAQDFSNLCWAFAKAGQYDAELFTTLAKNAERHMGNLNAQGL 357
Query: 287 SNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
SN W+ +K G EL + ++ +FN+Q++AN+A A+ H LF
Sbjct: 358 SNSVWSFAKAGHLNAELFTTFGKNIERKMFANNGTDFNAQDIANIAWAYGKACHLDDALF 417
Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSN 403
+ LA+ A +H F Q++ + W+F+ L LLE++ L L +
Sbjct: 418 TVLARMAEKYLHDFNTQDIVNLTWSFSKLGRFDVELLEAVK---------VSLLKSRLDD 468
Query: 404 CNENGGVKSSGDADSEGSLSSPVLS------------FNRDQLGNIAWSYAVLGQMDRIF 451
+ + D G L ++S F + N+AW++A G++D
Sbjct: 469 LDAPNIANLAWTYDKAGKLDDNLVSSLARAAVKRVNEFTATDITNVAWTFANAGKVDDEL 528
Query: 452 FSDIWKTISR 461
FS + K + R
Sbjct: 529 FSSMAKVVER 538
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 84/195 (43%), Gaps = 36/195 (18%)
Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
++QG+SN AWALSKIG + + +A+ A + FN+Q+ +N+ AFA +
Sbjct: 279 NSQGLSNTAWALSKIG--YVDATLFRTIAQTAQKNMDRFNAQDFSNLCWAFAKAGQYDAE 336
Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
LF+ LAK A + Q L+ +W+FA A L L F + K
Sbjct: 337 LFTTLAKNAERHMGNLNAQGLSNSVWSFAK----AGHLNAELFTTFGKNIE-----RKMF 387
Query: 402 SNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISR 461
+N NG FN + NIAW+Y +D F+ ++R
Sbjct: 388 AN---NG------------------TDFNAQDIANIAWAYGKACHLDDALFT----VLAR 422
Query: 462 FEEQRISEQYREDIM 476
E+ + + +DI+
Sbjct: 423 MAEKYLHDFNTQDIV 437
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 53/99 (53%), Gaps = 2/99 (2%)
Query: 271 VAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAG 330
V++ + L + A I+N+AW K G L + + +A A+ +V EF + ++ NVA
Sbjct: 459 VSLLKSRLDDLDAPNIANLAWTYDKAGK--LDDNLVSSLARAAVKRVNEFTATDITNVAW 516
Query: 331 AFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAF 369
FA+ +LFS +AK I+ F E++L + WAF
Sbjct: 517 TFANAGKVDDELFSSMAKVVERIMDDFGEEDLDNLEWAF 555
>gi|145355912|ref|XP_001422190.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582430|gb|ABP00507.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 967
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 89/357 (24%), Positives = 153/357 (42%), Gaps = 57/357 (15%)
Query: 287 SNIAWALSKI----GGELLYLSEMDRVAEVALTKVGEFNS---QNVANVAGAFASM---- 335
SN+ W+ + + G E+L +VAE+ L +VG+ + V+N A+A+
Sbjct: 433 SNLLWSYASLRFNPGNEVL-----TQVAELYL-RVGQHDEVALTQVSNTLWAWANFGWLP 486
Query: 336 -QHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESL---------- 383
S + ++A + Q Q LA +LW+ A+L + P D L++
Sbjct: 487 EDPSIVECVLQVAIKHFKSDPDLQTQSLANILWSLATLRFVPGDEFLQAFRERALIELRE 546
Query: 384 DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
D F D Q C A N G + + S+ L + V +F + N ++A
Sbjct: 547 DERFSD--QGLCNTVWAYGQLGVNPGTELMSEIASQ--LGARVTNFPTQGVTNSILAFAT 602
Query: 444 LGQMDRIFFSDIWKTISRFEEQRISEQYREDI--MFASQVHLVNQCLKLEHPHLQLALSS 501
LG F+ D W + + + + Y I + +Q N + P+ L
Sbjct: 603 LG-----FWPDEW-VVDNYRAKIVEMYYSTTISDIDLTQFFQANYLFEKCSPYGPLVTDP 656
Query: 502 VLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DG-YTVDAVLVDKKV 559
+ E + SA K + ++ V S F +EV+ L + G+ EY DG +++D L KK+
Sbjct: 657 QMIEDMLSAWK-RGSSKVVISQFHREVSDTLTNMGVPHEIEYITEDGLFSLDIALKGKKL 715
Query: 560 AFEIDGPTHFSRNT-----------GVPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL 605
A E+DGP+HF+RN G G ++ Y+ GW V + +W+++
Sbjct: 716 AIEVDGPSHFARNIQNRRMSGKRPDGT--GTYNIRYHYLDTNGWTTVFIPWYDWKQV 770
Score = 42.4 bits (98), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 76/295 (25%), Positives = 118/295 (40%), Gaps = 36/295 (12%)
Query: 274 AMTALPEC-SAQGISNIAWALSKIGGELLYLSE-----MDRVAEVALTKVGEFNSQNVAN 327
++ A+P S+Q +SN WA++ + GE L ++ + K F Q +AN
Sbjct: 337 SIKAVPNMWSSQSVSNTLWAIATLDGEPHKLRARHGDYLNTLCMYVERKANAFVCQGLAN 396
Query: 328 VAGAFASMQHSAPDLFSELAK-RASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDN 385
A A+++++ E A R S + E + +LW++ASL + P + +L +
Sbjct: 397 TLWALATLEYTPSMKMLEAATARWSALATDVYISECSNLLWSYASLRFNPGNEVLTQVAE 456
Query: 386 AFKDATQFTCCLNKALSN---CNENGGVKSSGDADSEGSLSSPVLSFNRD------QLGN 436
+ Q +SN N G + E L + F D L N
Sbjct: 457 LYLRVGQHDEVALTQVSNTLWAWANFGWLPEDPSIVECVLQVAIKHFKSDPDLQTQSLAN 516
Query: 437 IAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQ 496
I WS A L R D + + F E+ + E RED F+ Q L N + L
Sbjct: 517 ILWSLATL----RFVPGD--EFLQAFRERALIE-LREDERFSDQ-GLCNTVWA--YGQLG 566
Query: 497 LALSSVLEEKIAS---AGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGY 548
+ + L +IAS A T Q VT+S L +T W E+ VD Y
Sbjct: 567 VNPGTELMSEIASQLGARVTNFPTQGVTNSI------LAFATLGFWPDEWVVDNY 615
Score = 38.9 bits (89), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 47/97 (48%), Gaps = 7/97 (7%)
Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQ-HSAP 340
+ QGISN WA + + G L + + ++ ++ +F S +NV A A+M+ H P
Sbjct: 265 APQGISNSLWAFATL-GYTLKPETIAKFSQAIRRQLKDFKSMEFSNVVWALATMKTHLDP 323
Query: 341 -----DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
+L E+ + + + Q ++ LWA A+L
Sbjct: 324 LEVFDELLDEMHASIKAVPNMWSSQSVSNTLWAIATL 360
>gi|428165102|gb|EKX34106.1| hypothetical protein GUITHDRAFT_147455 [Guillardia theta CCMP2712]
Length = 1225
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 96/395 (24%), Positives = 158/395 (40%), Gaps = 78/395 (19%)
Query: 260 AFTRQREMSMLVAIAMTALPE--CSAQGISNIAWALSKIGGE-LLYLSEMDRVAEVALTK 316
A+T + + +A +T L E SAQG++ I A +K+G + + + RVA+ +
Sbjct: 713 AYTSLKTLFRRLARIVTGLSEQQFSAQGVALIVNAFAKLGMQDSCMFAHLSRVAQ----Q 768
Query: 317 VGEFN------SQNVANVAGAFASMQHSAPDLFSELAKRASDI-VHTFQEQELAQVLWAF 369
+G+ N Q+V N+ AFA +LF ++ + H F+ ++ +L A+
Sbjct: 769 MGQRNFEIPCSPQDVVNIVNAFAKAHVHDAELFGHMSLLLQAMSAHQFEASKIGILLNAY 828
Query: 370 ASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVL-- 427
A L PLL L D + + A++N V + D++ G +++ +L
Sbjct: 829 AQLRIRDLPLLRRLSKVAMDMSP-SAFDAHAIANIFHALAVLNVEDSELLGHVATTLLRS 887
Query: 428 --------SFNRDQLGNIAWSYAVL----GQMDRIFFSDIWKTISRFEEQRISEQYREDI 475
FN L NIAWS AVL ++R S IS + +S+ ++
Sbjct: 888 RQTMMQAKDFNAQALSNIAWSIAVLKISDPTLNRWICSSCLSQISSMDGNALSQLHQ--- 944
Query: 476 MFASQVHLVNQCLKLEHPHL-----------------QLALSSVLEEKIASAGKTKRFNQ 518
+ + + K E P L Q AL S K+A T
Sbjct: 945 -YILAIEVEGLVPKKELPELDSLLQHRKRIEQAWHATQRALLS--SSKLAGMQGTMTLTD 1001
Query: 519 K------------VTSSFQKEVARLLVSTGLNWIREYAVD--------------GYTVDA 552
K S Q+ VA L + VD Y++D
Sbjct: 1002 KGAAATSDVLLPDAMSGLQRNVADTLRVVWKELQEQRVVDQSWTLEEETIERVTSYSLDI 1061
Query: 553 VLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYI 587
+ A E+DGP+HF+R + +PLG T++KRR +
Sbjct: 1062 SIAAASFAIEVDGPSHFARGSKIPLGRTLMKRRQL 1096
>gi|294887545|ref|XP_002772159.1| hypothetical protein Pmar_PMAR024842 [Perkinsus marinus ATCC 50983]
gi|239876105|gb|EER03975.1| hypothetical protein Pmar_PMAR024842 [Perkinsus marinus ATCC 50983]
Length = 1094
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 66/128 (51%), Gaps = 1/128 (0%)
Query: 496 QLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLV 555
++ SSV++ +I + + ++EV++ GL E V Y++D +LV
Sbjct: 778 EMIFSSVIQLQIFDLWARLLAPKSIMEKMEREVSKFFTMVGLRHRNEVVVGPYSID-ILV 836
Query: 556 DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
+ AFE+DGP HF R+T + ++LK R + A G+ V+ + +QEW + ++L Y+
Sbjct: 837 GESFAFEVDGPHHFYRDTSMRTASSLLKHRILEALGFTVIRVPYQEWSQCGTREKRLRYV 896
Query: 616 RVILKDYI 623
K I
Sbjct: 897 GSFWKQLI 904
>gi|224010429|ref|XP_002294172.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220970189|gb|EED88527.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 382
Score = 69.3 bits (168), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 85/341 (24%), Positives = 138/341 (40%), Gaps = 78/341 (22%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTK--VGEFNSQNVANVAGAFASMQHSAPD 341
Q +NI WA + E + ++VA + + F Q+ AN+ A+A+ + S P
Sbjct: 74 QDYANIVWAYAT--AEASHPQLFEKVANHIESSRDLSSFIPQDYANIVWAYATAELSHPV 131
Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFAS-------LYEPADPLLESLDNAFKDATQFT 394
LFS +A A F Q++ +LWAFAS LY P +A K +Q+T
Sbjct: 132 LFSNVADSAIQRQSEFNSQDITNLLWAFASNGDIERNLYTKVAP------SAAKLTSQYT 185
Query: 395 CCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSD 454
C QL NIAW+YAV F++
Sbjct: 186 C------------------------------------QQLTNIAWAYAVADVDAPTLFNE 209
Query: 455 IW-----KTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIAS 509
I+ K + F + + + Y+ + A + H + L +L EK +
Sbjct: 210 IFNEKCNKKMDAFSVESLMQLYQWHLWRAKE-------------HSEEGLPQMLHEKCYN 256
Query: 510 AGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV--DKKVAFEIDGP 566
+ + S+ Q +V L + GL+ E + GY +DA++ + E+DGP
Sbjct: 257 VFVSASAS---PSALQDDVVVELRAIGLHPEEEVLLQSGYRIDALVQVNGENFVIEVDGP 313
Query: 567 THFSRNTGVPLGHTMLKRRYIAAA-GWNVVSLSHQEWEELQ 606
+HF G T LK R ++ G +VS+ + EW +L+
Sbjct: 314 SHFIGKIRDLKGSTKLKHRQVSTIDGIPIVSVPYWEWNKLR 354
>gi|397595468|gb|EJK56490.1| hypothetical protein THAOC_23613 [Thalassiosira oceanica]
Length = 695
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 85/315 (26%), Positives = 130/315 (41%), Gaps = 67/315 (21%)
Query: 320 FNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPL 379
F+ Q++AN+ ++A+ + PDLF L A D FQ QE+A +LWA A+L + L
Sbjct: 431 FSVQSIANIIWSYATAREWCPDLFIGLISAAVDRRDEFQPQEMANLLWACATLGQTNADL 490
Query: 380 LESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAW 439
NAF + + F L N AW
Sbjct: 491 F----NAFVPVVKMK-------------------------------IEDFTAQGLSNAAW 515
Query: 440 SYAVLGQMDRIFFSDIWK--TISRFEEQRISEQYREDIMFASQVHL--VNQCLKLEHPHL 495
++AV +DI +RF E I + R + SQ+H + Q ++ L
Sbjct: 516 AFAV---------ADIQNDDLNNRFLEAFIKNEDRFSVEGLSQLHQWQLWQIERVSPVQL 566
Query: 496 QLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGL-NWIREYAV--DGYTVDA 552
+LS + S G S Q +V +L + + EY GYT+DA
Sbjct: 567 PASLSERCRDAFVSQGTGY-------SKLQDQVVSVLSRMDFYDVLEEYRTRNTGYTLDA 619
Query: 553 VLV---DKKVAFEIDGPTHF--SRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQ 606
++ K+ EI+GP H+ RN G T LK R +++ VVS+ H EWE+L
Sbjct: 620 LVSLNDTVKIGIEINGPYHYIGGRNLN---GGTRLKLRQVSSIECVRVVSVPHYEWEQLD 676
Query: 607 GSFEQLDYLRVILKD 621
G + +YL L++
Sbjct: 677 GDEGRREYLLSALRE 691
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/129 (26%), Positives = 67/129 (51%), Gaps = 13/129 (10%)
Query: 267 MSMLVAIAMTALPECSAQGISNIAWALSKIG------GELLYLSEMDRVAEVALTKVGEF 320
+ + + A ++ + A+G+SN ++ + IG G +L + VA+ L ++ F
Sbjct: 219 FNFIASAAARSVHKFDARGLSNTIYSFALIGYPPNVQGSRPFL---EIVADECLHQLNHF 275
Query: 321 NSQNVANVAGAFASMQHSAPDLFSELAKRASDIV----HTFQEQELAQVLWAFASLYEPA 376
N Q ++N+ ++A + HS P+LF +A R ++ TF Q ++ +LW+F +L E
Sbjct: 276 NMQELSNLVWSYAKLNHSHPELFGAVASRILELNPKADTTFNPQVISNILWSFTTLDEAN 335
Query: 377 DPLLESLDN 385
+ L + N
Sbjct: 336 EDLFRYIFN 344
>gi|159473869|ref|XP_001695056.1| predicted protein [Chlamydomonas reinhardtii]
gi|158276435|gb|EDP02208.1| predicted protein [Chlamydomonas reinhardtii]
Length = 347
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 55/94 (58%), Gaps = 1/94 (1%)
Query: 513 TKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRN 572
T ++V S +Q+++A L + + E GY++D L ++A E DGPTH SR
Sbjct: 226 TSGLRRRVQSGYQRQMANALTAMRHMHLLEDNSAGYSIDITLPALRIALEADGPTHTSRT 285
Query: 573 T-GVPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL 605
G LG T +KRR++ GW VV++++ EW++L
Sbjct: 286 PGGAMLGATAMKRRHLQRLGWQVVNVTYTEWDKL 319
>gi|307111480|gb|EFN59714.1| hypothetical protein CHLNCDRAFT_133279 [Chlorella variabilis]
Length = 1273
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 84/187 (44%), Gaps = 45/187 (24%)
Query: 280 ECSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASM- 335
E ++Q ++N WA + IG G+ L + A VA+ K+ EF+ QN++N+ A+A +
Sbjct: 451 EYNSQNLANSVWAYANIGVNPGDSL----LQDFARVAIAKMPEFSPQNISNLLWAYAKLG 506
Query: 336 -QHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFT 394
QH+ +LF+E + A+ I+HTF Q +A + WA+A+L + DAT
Sbjct: 507 VQHA--ELFAEAGRHAARIMHTFTPQSVANMAWAYATL------------DQCPDATLLH 552
Query: 395 CCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSD 454
+ A E F+ L N AW+ A L + + S
Sbjct: 553 ALVGHAARMLPE----------------------FSPQNLSNTAWALATLKECEPGLLSG 590
Query: 455 IWKTISR 461
I ++R
Sbjct: 591 ISMEVTR 597
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 74/304 (24%), Positives = 120/304 (39%), Gaps = 79/304 (25%)
Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQ----NVANVAGAFASMQH 337
S Q ++N+ WAL+ + + S M +AE +T+ N Q N++N+A A++ + H
Sbjct: 610 SRQHLANLVWALATLEFDPGKRSLMC-MAEALVTRADLCNPQEVQQNLSNLAWAYSKLAH 668
Query: 338 SAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCL 397
L + +A RA ++H Q + WA++SL T L
Sbjct: 669 MDEALMTAIADRAESMIHDLSLQHCTNLTWAYSSL------------------KWTTPTL 710
Query: 398 NKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWK 457
AL A+S+ L+ N QL N+ WS + DR F
Sbjct: 711 MPALV-------------AESKARLAD--TQLNVQQLCNLLWSLGISEACDREVFQAYML 755
Query: 458 TISRFEEQRISEQYREDIMFASQVHLVNQCLKL-EHPHLQLALSSVLEEKIASAGKTKRF 516
++ +Q Q + + L L +H +Q+
Sbjct: 756 MLAESPDQ--------------QWPIPGELLALAQHAWVQV------------------- 782
Query: 517 NQKVTSSFQKEVARLLVSTGLNWIREYAVDG--YTVDAVLVDKKVAFEIDGPTHFSRNTG 574
S F EV+R+L + G E+ D ++VD L +++A E+DGP HF+ NT
Sbjct: 783 -----SEFHSEVSRMLSALGQPHTIEHLTDDHLFSVDIALPGERIALEVDGPHHFTANTF 837
Query: 575 VPLG 578
PLG
Sbjct: 838 RPLG 841
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 70/294 (23%), Positives = 125/294 (42%), Gaps = 41/294 (13%)
Query: 197 INLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIA--KNMEKVSMM 254
I+ NK I A AQE+ + + V +ATA+H++A + +
Sbjct: 259 ISCNKRITAATYAQEIFNIEHAVFDTV------------CLATAMHKLANLRGAPNLHAE 306
Query: 255 TTHRLAFTRQREM---SMLVAIA---MTALPECSAQGISNIAWALSKIG---GELLYLSE 305
F + +++ L +A E +AQ ++N+ W+ + +G G+ +
Sbjct: 307 IVQAPEFFKLKQLIRDEFLAEVAEEVKGKAREGNAQNVANMLWSFATLGYHPGDEV---- 362
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-LFSELAKRASDIVHTFQEQELAQ 364
M +A K+ +F SQN++N +FA ++ D L LA A + TF Q L+
Sbjct: 363 MHALAVAVQQKLADFTSQNMSNAVLSFAKLEFDPGDELLEGLAAEALRKIATFSPQALSN 422
Query: 365 VLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSN---CNENGGVKSSGDA---DS 418
LW + L A L+E + A + Q ++ L+N N GV + GD+ D
Sbjct: 423 TLWGLSKLGINAPELMEGIGQAAR--FQLYEYNSQNLANSVWAYANIGV-NPGDSLLQDF 479
Query: 419 EGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISR----FEEQRIS 468
+ + F+ + N+ W+YA LG F++ + +R F Q ++
Sbjct: 480 ARVAIAKMPEFSPQNISNLLWAYAKLGVQHAELFAEAGRHAARIMHTFTPQSVA 533
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 50/217 (23%), Positives = 94/217 (43%), Gaps = 23/217 (10%)
Query: 267 MSMLVAIAMTALPECSAQGISNIAWALSKI----GGELLYLSEMDRVAEVALTKVGEFNS 322
M L L + ++Q +SN + +K+ G ELL + +A AL K+ F+
Sbjct: 363 MHALAVAVQQKLADFTSQNMSNAVLSFAKLEFDPGDELL-----EGLAAEALRKIATFSP 417
Query: 323 QNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLE 381
Q ++N + + +AP+L + + A ++ + Q LA +WA+A++ P D LL+
Sbjct: 418 QALSNTLWGLSKLGINAPELMEGIGQAARFQLYEYNSQNLANSVWAYANIGVNPGDSLLQ 477
Query: 382 SLDN-AFKDATQFTCCLNKALSN---CNENGGVKSSGDADSEGSLSSPVL-SFNRDQLGN 436
A +F+ + +SN GV+ + G ++ ++ +F + N
Sbjct: 478 DFARVAIAKMPEFS---PQNISNLLWAYAKLGVQHAELFAEAGRHAARIMHTFTPQSVAN 534
Query: 437 IAWSYAVLGQMD-----RIFFSDIWKTISRFEEQRIS 468
+AW+YA L Q + + F Q +S
Sbjct: 535 MAWAYATLDQCPDATLLHALVGHAARMLPEFSPQNLS 571
>gi|302780627|ref|XP_002972088.1| hypothetical protein SELMODRAFT_412580 [Selaginella moellendorffii]
gi|300160387|gb|EFJ27005.1| hypothetical protein SELMODRAFT_412580 [Selaginella moellendorffii]
Length = 205
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/63 (50%), Positives = 42/63 (66%), Gaps = 1/63 (1%)
Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQL-DYLR 616
KV E++ P+HF+RNTG LGHT+LK R + AA W ++S S+ EWE LQG L Y R
Sbjct: 138 KVVIEVNRPSHFARNTGDLLGHTVLKHRLVEAAEWKIISASYAEWENLQGESGHLTSYKR 197
Query: 617 VIL 619
+ L
Sbjct: 198 LWL 200
>gi|397648138|gb|EJK78006.1| hypothetical protein THAOC_00119 [Thalassiosira oceanica]
Length = 1158
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 122/302 (40%), Gaps = 58/302 (19%)
Query: 284 QGISNIAWALSKIGGEL--LYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
Q +SN WA + G L+ D +A L +G F Q+ +N A AFA+ + P
Sbjct: 748 QALSNTPWAFATAGASHPELFKKIGDHIA--VLDSLGSFKPQDFSNTAWAFATARVFHPR 805
Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
LF +L A F +QE++ LWA A++ + L +AF
Sbjct: 806 LFEKLTTEAVASKDHFDDQEVSNFLWACATVGH----TDQRLFSAFAPV----------- 850
Query: 402 SNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISR 461
++S + FN+ L NIAW+Y+V F+ + +
Sbjct: 851 --------------------IASRLGKFNKQHLANIAWAYSVANLPRHDLFNKGYVSALA 890
Query: 462 FEEQRISEQYREDIMFASQVHLVNQCLKLEHP-HLQLALSSVLEEKIASAGKTKRFNQKV 520
E+ S + + A +LE + +L + SAG ++
Sbjct: 891 SNEKEFSVE-----LLAQLHQWQLWQQELESGIEVPQSLRAKCRNAFTSAGYSE------ 939
Query: 521 TSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHF--SRNTG 574
S Q +V L + GL+ E + GY +DA++ ++KVA E+DGP+HF R G
Sbjct: 940 -SRLQNDVVDELKAAGLDLEEEVLLGSGYRIDALVKVGDERKVAVEVDGPSHFIDRRPVG 998
Query: 575 VP 576
P
Sbjct: 999 KP 1000
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/159 (24%), Positives = 66/159 (41%), Gaps = 40/159 (25%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQ 360
D +A A+ + F+++ ++N+ +F ++ + PD LF+ + A I+HTF+ Q
Sbjct: 536 FDSIASSAVGMLNGFDARCLSNLIYSFGLVERN-PDIGGETLFNVFGEAAGKILHTFKSQ 594
Query: 361 ELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEG 420
+L+ +LWAF + L + + GGV S D D
Sbjct: 595 DLSNMLWAFVKVDAKNSRLFQ------------------------DTGGVISGMDLD--- 627
Query: 421 SLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTI 459
SF L NI WS+A G+ + F + I
Sbjct: 628 -------SFQPQHLANILWSFAKSGKANPELFQALGNHI 659
>gi|428166881|gb|EKX35849.1| hypothetical protein GUITHDRAFT_79396, partial [Guillardia theta CCMP2712]
Length = 124
Score = 65.9 bits (159), Expect = 7e-08, Method: Composition-based stats.
Identities = 42/124 (33%), Positives = 61/124 (49%), Gaps = 21/124 (16%)
Query: 518 QKVTSSFQKEVARLLVSTGLNWIREYA--VDGYTVD-------------------AVLVD 556
Q +S QKEV +L+S G E+ GYT+D +
Sbjct: 1 QLRSSKLQKEVMSVLLSIGFECEEEHQDPRTGYTIDIYCPPSSSSSSSSSSSSSSSSSSS 60
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
VA E+DGP+HF T G T+LKRR++ A G+ +S+ + EW+ LQG+ EQ ++R
Sbjct: 61 SPVAIEVDGPSHFLHGTREASGSTVLKRRHLEAVGYRFISIPYWEWDALQGAEEQEKFMR 120
Query: 617 VILK 620
LK
Sbjct: 121 EKLK 124
>gi|196000024|ref|XP_002109880.1| hypothetical protein TRIADDRAFT_53223 [Trichoplax adhaerens]
gi|190588004|gb|EDV28046.1| hypothetical protein TRIADDRAFT_53223 [Trichoplax adhaerens]
Length = 639
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 84/361 (23%), Positives = 142/361 (39%), Gaps = 83/361 (22%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASD--IVHTFQEQELA 363
++++ + + ++ +V+ +A A +S+++ DL + +D ++ F Q +
Sbjct: 323 FEKISNYVIKNINNMSTYSVSQIARALSSLRYYNKDLADAIGLHLTDKGALYEFSIQSIG 382
Query: 364 QVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLS 423
+L+ FA +P L L K +S +KS+ + +
Sbjct: 383 DILYVFARWNHLPEPAL----------------LRKLISKIEY--YIKSTPNM-----II 419
Query: 424 SPVLSFNRDQLGNIAWSYAVLGQ-----MDRIFFSDIWKTISRFEEQRISEQYREDIMFA 478
P+++ WS +L ++ +F I I F I Q MF
Sbjct: 420 PPIVT--------SIWSLIILDTFPHRAINALFNEKIVSEIHSFGTGAIQVQ-----MFQ 466
Query: 479 SQVHLVNQCLKLEHPHLQL-ALSSVLEEKIASAGKTKRFNQKVTSSFQKEVAR----LLV 533
++ KLE P LQL LS + K+F+ K S FQ V R L
Sbjct: 467 -----IDLAAKLERPELQLQGLSH--SHRNHFLKPLKKFSTK-GSVFQHNVQRTLEYLFD 518
Query: 534 STGLNWIREYAVDGYTVD-AVLVD----------------------KKVAFEIDGPTHFS 570
+ W GY+VD A++ D K++A E+DGP HF
Sbjct: 519 GSHYYWKEFKTAYGYSVDLAIMTDLNNVLQEPKVNVLRSKNKPTHYKRIAIEVDGPYHFL 578
Query: 571 RNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIGGEGSSN 630
N+ +G + +K R + GW VV + + +WEEL E+ Y +K I G+G N
Sbjct: 579 HNSTKLIGESKMKHRQLRLLGWTVVQVPYFDWEELNTDDERKQY----MKRKIFGDGPMN 634
Query: 631 I 631
I
Sbjct: 635 I 635
>gi|219125971|ref|XP_002183242.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405517|gb|EEC45460.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1123
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 101/461 (21%), Positives = 173/461 (37%), Gaps = 141/461 (30%)
Query: 277 ALPECSAQGISNIAWALSK-----------IGGELLYLSEMDR----------------- 308
L E S QG+ N AWA ++ +GG L S R
Sbjct: 690 GLTEFSPQGLGNTAWAFARQAQLSEEAANRLGGASLLPSSNGRLAIYTACYFDIGEELIH 749
Query: 309 -----VAEVALTK---VGEFNSQNVANVAGAFA--SMQHSA--PDLFSELAKRASDI--- 353
+AE +TK + F Q+++N A FA ++H+A EL +R S
Sbjct: 750 RLFAAIAEAGITKHVNLTSFKPQDLSNTAWTFAVLGLRHTAFMEVAMHELERRLSLFLKG 809
Query: 354 ----VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGG 409
+ TF+ QELA +LWA A+L + LE + ++ C
Sbjct: 810 ERTSITTFKGQELANLLWALATLNIRVENSLEIVTPYLQE-----VCF------------ 852
Query: 410 VKSSGDADSEGSLSSPVLS----FNRDQLGNIAWSYAVLGQ----MDRIFFSDIWKTISR 461
EG PV + F R +L N+AWS AV G+ + ++ ++ +
Sbjct: 853 ---------EGRTGMPVQAIAQIFKRQELANVAWSCAVFGKYPTALMQLLYAGLIGLDKE 903
Query: 462 FEEQRISEQYREDIM----FASQVHLVNQCLKLEHPHLQLALS---------------SV 502
+ +++S Y + + S +++ + L L + +
Sbjct: 904 CDAEKLSNVYGDKGLQSQALMSLIYVQASMDRAGKSTLGLPPNFPDAWRQSTPSEDGQRM 963
Query: 503 LEEKIASAGKTKRFNQKVTSSFQK-----------EVARLLVSTGLNWIREYAVDGYTVD 551
E I + T + + V+++F + + ++V G+N+ + +D ++D
Sbjct: 964 TETNIELSLSTSKIQRDVSAAFNRIGFKHIEEHTISMQEMVVEYGVNFAPQ-QLDILSID 1022
Query: 552 AVLVDKKVAFEIDGPTHFSR-----------NTGVPLGH-----------------TMLK 583
V +K+A E+DGP HF +T P G T LK
Sbjct: 1023 IANVPEKIAIEVDGPAHFINLIDNVDENDYGSTKAPNGKLEYQFQWTGDRQMMNGSTSLK 1082
Query: 584 RRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIG 624
R + + GW V+ + EW ++ EQ +Y R L D +G
Sbjct: 1083 HRLLESLGWRVIHIPFWEWYQMGSDEEQGEYCRDAL-DTLG 1122
Score = 40.4 bits (93), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 91/194 (46%), Gaps = 15/194 (7%)
Query: 199 LNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHR 258
LN+ +V ++A EVL ++ ++ + S ++ +N +T++HR+ ++
Sbjct: 141 LNQLLVACESASEVLTLLQNTKGSLTQKASGGTMNSVNFSTSIHRLCRHSLNQRDTRAAT 200
Query: 259 LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIG-----GELLYLSEMDRVAEVA 313
LA R + A AM +P S + +SNI WAL+K+ + + D + A
Sbjct: 201 LADPRFALLLASTAEAMVTMPFQSRE-LSNIGWALAKLKIVPPLTAMPFEQSDDEALKAA 259
Query: 314 LTKV--GEFNSQNVANVAGAFASMQHSA-PDLFSELAKRAS-DIVHT----FQEQELAQV 365
V G F + +G + +A L ++ R S ++V T F+ QE A +
Sbjct: 260 AQTVRDGVFKAAKERQESGTPSKAWITALSQLAGQILDRISQNVVSTQTDGFRLQEWANL 319
Query: 366 LWAFASLYEPADPL 379
+WA+A+ E ADP+
Sbjct: 320 MWAWAT-AERADPV 332
>gi|397628210|gb|EJK68790.1| hypothetical protein THAOC_10004, partial [Thalassiosira oceanica]
Length = 2539
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 80/354 (22%), Positives = 138/354 (38%), Gaps = 93/354 (26%)
Query: 320 FNSQNVANVAGAFASMQ--HSAPDLF----SELAKRASDIVHTFQEQELAQVLWAFASLY 373
+++Q+++N +FA++ HSA LF +E+ R + F+ QE++ +LW+FA++
Sbjct: 823 YSNQDLSNTVWSFATLGLLHSA--LFKSVENEVKSRLMNNRTKFRGQEISNLLWSFATVN 880
Query: 374 EPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQ 433
DP +F DA GV + D E SL+S L R +
Sbjct: 881 AQPDP-------SFIDAMSHYIA------------GVCTGRDGIREQSLTS--LFTQRQE 919
Query: 434 LGNIAWSYAVLGQMDR----IFFSDIWKTISRFEEQRISEQYREDIMFASQV---HLVNQ 486
L N+AW AV+GQ + I ++ + T + + R + +D + S + + V
Sbjct: 920 LANLAWGCAVVGQYPKDLMNILYAGLLGTNNDPDHMR--RVFNDDGLEKSSIMTLYYVQI 977
Query: 487 CLKLEHPHLQLALSSVLEEKIASAGKTKRFNQK----------------VTSSFQKEVAR 530
+E P L+LAL +R K S Q+ V
Sbjct: 978 AADIEAPELKLALPEGFPNGWGVMDGQQRTRSKDGDDLAQQSSSILLTLTVSKLQRHVGS 1037
Query: 531 LLVSTGLNWIREYAVDG--------------------YTVDAVLVDKKVAFEIDGPTHFS 570
+ G + EY +D ++D V+K++ E+DGP HF
Sbjct: 1038 AFDAIGFDHELEYVIDTNQIRDELPNEIVLTQSPMEFLSIDLANVEKRIGVEVDGPGHFV 1097
Query: 571 RNTGVPL-------------------GHTMLKRRYIAAAGWNVVSLSHQEWEEL 605
P G T LK R ++ W+++ L + E+++L
Sbjct: 1098 HLLDKPPRRRESEIIILDDMGDNRFNGPTTLKHRLLSHLDWDIIHLPYWEFQKL 1151
Score = 39.7 bits (91), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 53/225 (23%), Positives = 89/225 (39%), Gaps = 42/225 (18%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRV----------------------AEVAL---TKVG 318
Q +SN WA++ G + LY D A VAL +
Sbjct: 648 QEMSNSIWAMATAGFKPLYTRAFDTTLVPRNMRPTKKQLAEDTFGESYAAVALETMRRPH 707
Query: 319 EFNSQNVANVAGAFASMQHSAPDLFSELAK----RASDIVHTFQEQELAQVLWAFASLYE 374
EF Q + +V +F+ + P LF A+ R + +F Q L +LW++A +
Sbjct: 708 EFKDQELKDVMWSFSRVGIRHPALFKSTAEHVIGREGRGLSSFSSQGLGNLLWSYAKQAQ 767
Query: 375 PADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSG---DADSEGSLSSPVLSFNR 431
+ ++E+L + T T L ++C +NG G +A + + V S++
Sbjct: 768 LSLEVIEALGDDVNLVT--TGRLAVYETSCLDNGEANIKGLFVEAARAVASAGAVASYSN 825
Query: 432 DQLGNIAWSYAVLGQMDRIFFSDIWKTI--------SRFEEQRIS 468
L N WS+A LG + F + + ++F Q IS
Sbjct: 826 QDLSNTVWSFATLGLLHSALFKSVENEVKSRLMNNRTKFRGQEIS 870
>gi|397596760|gb|EJK56844.1| hypothetical protein THAOC_23187 [Thalassiosira oceanica]
Length = 1026
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 59/106 (55%), Gaps = 5/106 (4%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLY----LSEMDRVAEVALTKVGEFNSQNVANVA 329
A+ LP A+ I+N+ + +K +Y + D +A+ AL+K + QN+AN+
Sbjct: 374 AVPILPTFDARNIANLVHSFAKAEVVPIYEPGKCTLFDMLADSALSKDHDMQPQNIANIL 433
Query: 330 GAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEP 375
AFA M+H +P LF EL+ AS +H F Q+LA + W+ S Y P
Sbjct: 434 WAFAKMKHPSPKLFEELSTDASRRMHDFSAQQLATLAWSL-SKYPP 478
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 103/406 (25%), Positives = 164/406 (40%), Gaps = 86/406 (21%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSE----MDRVAEVALTKVGEFNSQNVANVA 329
A L E SA + N++ + K G LS MD +A + + F+ + + +A
Sbjct: 642 AAYQLRELSALALFNLSVSYGKSG-----LSPNDEWMDLLAREIVRRPSSFSPKMIVGIA 696
Query: 330 GAFASMQHSAPDLFSELA-------------KRASDIVHTF------------------- 357
A+++M + P LF+ LA K + +V +F
Sbjct: 697 FAYSTMNYQKPRLFTFLAEQVKSQCQESLEPKELASLVWSFVNIGFLDRGLLAEIAEVLN 756
Query: 358 ------QEQELAQVLWAFASLYEPADPLLESLDNAFKDATQ-FTCCLNKALSNCNENGGV 410
Q LA V WA++ E L + + A K + FT A N
Sbjct: 757 GKWSELDTQSLANVAWAYSKAQEDRPALYKGISAAAKAGREGFT-----AQGVSNLLWAF 811
Query: 411 KSSGDADSE-----GSLSSPVL-SFNRDQLGNIAWSYAVLGQMD-RIFFSDIWKTISRFE 463
++G+ D + +S+ +L F + N+AW+YAV D +F +D + +
Sbjct: 812 SAAGEVDDDLFEFFAPVSTSLLDEFQPQGIANLAWAYAVANVDDGSLFNADFIGSCTM-- 869
Query: 464 EQRISEQYRE-DIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAG-KTKRFNQKVT 521
RE D + Q+HL N H + L + + E +A K+ Q
Sbjct: 870 ------NLREFDAVGLCQLHLWNM---WRHEARREGLPAGMAETCKNAFVHQKKIRQ--- 917
Query: 522 SSFQKEVARLLVSTGLNWIREYAVD-GYTVDAVLV--DKKVAFEIDGPTHF--SRNTGVP 576
S Q V L ++G++ I E V+ GY +D +L KK+ EIDGP HF R G
Sbjct: 918 SKLQNTVVGHLRNSGMDVIEEVQVESGYLLDVLLTINGKKIGVEIDGPFHFVGRRQNGA- 976
Query: 577 LGHTMLKRRYIAAAG-WNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
T+LKRR ++ ++SL + E L E YL +L+D
Sbjct: 977 ---TILKRRLVSNVDKIPIISLPYWELNGLDSDVEWASYLNRVLED 1019
Score = 44.7 bits (104), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 35/151 (23%), Positives = 77/151 (50%), Gaps = 16/151 (10%)
Query: 244 IAKNMEKVS----MMTTHRLA---FTRQREM-SMLVAIAMTALPECSAQGISNIAWALSK 295
+A+ +EKVS +M H A T+ E S++V A++ + ++W+L+
Sbjct: 492 VARGLEKVSSQGLVMLAHAFATIGHTQNEEFWSLIVDAAISRASNLWPIECAQLSWSLAT 551
Query: 296 I---GGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASD 352
+ EL M+ + + L ++ + Q +A+VA +F+++ + P+L+ LAKR+
Sbjct: 552 VRRKSDEL-----MNGIEKQVLRRIDGYTPQGLASVAWSFSTLGYDVPNLYDALAKRSLQ 606
Query: 353 IVHTFQEQELAQVLWAFASLYEPADPLLESL 383
++ F + ++ A+++ P LL+++
Sbjct: 607 LMEDFSPTDKVLLVLAYSNHTHPHPNLLDAV 637
Score = 44.3 bits (103), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 40/81 (49%), Gaps = 6/81 (7%)
Query: 309 VAEVALTKVGEFNSQNVANVAGAFAS------MQHSAPDLFSELAKRASDIVHTFQEQEL 362
+ + A+ + F+++N+AN+ +FA + LF LA A H Q Q +
Sbjct: 370 IGDAAVPILPTFDARNIANLVHSFAKAEVVPIYEPGKCTLFDMLADSALSKDHDMQPQNI 429
Query: 363 AQVLWAFASLYEPADPLLESL 383
A +LWAFA + P+ L E L
Sbjct: 430 ANILWAFAKMKHPSPKLFEEL 450
>gi|294886889|ref|XP_002771904.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239875704|gb|EER03720.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 1157
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 56/118 (47%), Gaps = 15/118 (12%)
Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDA---------------VLVDKKVAFEIDG 565
TS ++V++ GL E V Y++D +LV + AFE+DG
Sbjct: 834 TSQLHRQVSKFFTMVGLRHRNEVVVGPYSIDVSGLGRGLEQAVISVKILVGESFAFEVDG 893
Query: 566 PTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYI 623
P HF R+T + ++LK R + A G+ V+ + +QEW + ++L Y+ K I
Sbjct: 894 PHHFYRDTSMRTASSLLKHRILEALGFTVIRVPYQEWSQCGTREKRLRYVGSFWKQLI 951
>gi|401408949|ref|XP_003883923.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325118340|emb|CBZ53891.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 515
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 47/95 (49%), Gaps = 2/95 (2%)
Query: 510 AGKTKRFNQKVTSSFQKEVARLLVSTGL--NWIREYAVDGYTVDAVLVDKKVAFEIDGPT 567
A + K N S QK V RLL GL EY + Y +D + +K+ E+DG
Sbjct: 394 AREKKLLNLVHVSQVQKRVGRLLFDEGLMSEICVEYPLGPYVLDFAIPSRKLVVEVDGEA 453
Query: 568 HFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
HF T VP T +KR +AA GW+VV + + W
Sbjct: 454 HFFFGTTVPTAQTRMKRELLAAMGWHVVVVPQELW 488
>gi|428175207|gb|EKX44098.1| hypothetical protein GUITHDRAFT_139952 [Guillardia theta CCMP2712]
Length = 1108
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 60/116 (51%), Gaps = 9/116 (7%)
Query: 521 TSSFQKEVARLLVSTGLNWIREY--AVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLG 578
+S+ Q VA + + L I E V GY +D L ++ E+DGP HF+ T PLG
Sbjct: 987 SSNLQNSVALAIAALDLEMIEEMKDTVSGYRLDIFLPAQQKVVEVDGPRHFAFETRRPLG 1046
Query: 579 HTMLKRRYIAAAGWNVVSLSHQEWEE-------LQGSFEQLDYLRVILKDYIGGEG 627
T+LKRR + + V++ + EW+E + EQL+YLR + D+ G+
Sbjct: 1047 PTVLKRRILELLRYKPVTIPYWEWDERGGGAGGGGFTREQLEYLRSKIFDHTMGDA 1102
>gi|412993830|emb|CCO14341.1| predicted protein [Bathycoccus prasinos]
Length = 676
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 90/369 (24%), Positives = 157/369 (42%), Gaps = 51/369 (13%)
Query: 282 SAQGISNIAWALSKI---GGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHS 338
S Q I N W+ + GE + MD ++ +F +Q ++N+A A A +QH
Sbjct: 306 STQAIGNAMWSCGTLRCHPGEKI----MDAYLKLTTEYHEKFKTQEISNIAWASAMLQHH 361
Query: 339 APDLF-----SELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDNAFKDATQ 392
D F LAKR + Q ++ L A+ Y+ + +L++L + A +
Sbjct: 362 PGDAFLSVVSETLAKRLEECA----SQAVSNSLLGLATFGYKMDEEMLKALGGK-RHARR 416
Query: 393 FTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLG----NIAWSYAVLGQ-- 446
C ++ L N + +A+ G L + V S + D+ N+ + V+ Q
Sbjct: 417 ---CNSQDLCNSIWALAAVDAFEAEVYGDLWARVSSMHHDEFAPEGLNMLYHACVMHQDH 473
Query: 447 -MDR-------IFFSDIWKTISRFEEQRISEQYREDIM-------FASQVHLVNQCLKLE 491
MD+ + ++ ++ ++R S F+S +H + +
Sbjct: 474 WMDQHAVGNDDVVDEEVLDDVTNTSKKRKSTNQSTTTTKSTKAKGFSSSLH----GMGVR 529
Query: 492 HPHLQLALSSVLEEKIASAGKTKRFNQKVT-SSFQKEVA-RLLVSTGLNWIREYAV-DG- 547
LQ + + + IA + VT S+F K V+ R+ N EY DG
Sbjct: 530 EVALQRHDTPIWLDTIAKKSYDDQTIHSVTLSAFHKHVSTRIRAGFIKNVADEYLTEDGV 589
Query: 548 YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGH-TMLKRRYIAAAGWNVVSLSHQEWEELQ 606
++D L+D K+A E DGP+HF +N + H T+++ R + GW VVS+ + EW+E
Sbjct: 590 MSIDIALLDHKIAIECDGPSHFEKNMEKSMTHKTIIRNRGLERRGWRVVSIPYFEWQEAN 649
Query: 607 GSFEQLDYL 615
+ YL
Sbjct: 650 ANETHRKYL 658
>gi|294942284|ref|XP_002783468.1| hypothetical protein Pmar_PMAR006996 [Perkinsus marinus ATCC 50983]
gi|239895923|gb|EER15264.1| hypothetical protein Pmar_PMAR006996 [Perkinsus marinus ATCC 50983]
Length = 389
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 88/187 (47%), Gaps = 30/187 (16%)
Query: 196 EINLNKDIVDAQTAQEV---LEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVS 252
E + K I+ A A+ V LE++ +T L+ +N++T +HR+A S
Sbjct: 169 EFEIQKSILVAANARSVKGLLEIVDTHVTQ---------LNSVNVSTLIHRLA------S 213
Query: 253 MMTTH---RLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRV 309
+ H + A TR M ++ A+ S Q +SNI+WA+ K L LS+ V
Sbjct: 214 ITQNHEQSQKALTRDHRMKKVLRRAVELARISSCQSLSNISWAVGK-----LQLSDEKEV 268
Query: 310 AEV----ALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQV 365
E A T++ F QN +N+ + + H L +AKR +H F+ QE++ +
Sbjct: 269 VEAIVGAAKTRLEHFRPQNFSNMLYGLSRVNHYDKALMEMVAKRVLGTIHNFKPQEVSNL 328
Query: 366 LWAFASL 372
L+A+ L
Sbjct: 329 LYAYGRL 335
>gi|237839529|ref|XP_002369062.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
gi|211966726|gb|EEB01922.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
Length = 1448
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 72/137 (52%), Gaps = 4/137 (2%)
Query: 472 REDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQ---KVTSSFQKEV 528
R DI +++ +V+ L+L P +L L+ +A A + Q ++S ++V
Sbjct: 1018 RLDIGSVTRLQIVDLYLRLLRPPAFASLPFDLKAFLARARRVDLAQQDCFSLSSKLHRDV 1077
Query: 529 ARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIA 588
+ + GL E + +++D VL D+ +A EIDGP+HF R T + + + LK+R +
Sbjct: 1078 SSAFLRIGLVHRSEVQLGPFSLDIVLGDR-LAVEIDGPSHFYRETCMRVASSRLKQRLLR 1136
Query: 589 AAGWNVVSLSHQEWEEL 605
GW V+ +S EW +L
Sbjct: 1137 EMGWTVLPVSFFEWRQL 1153
Score = 43.1 bits (100), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 47/89 (52%), Gaps = 2/89 (2%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
Q SN+ A ++ E+L L + A + ++N Q+++N+A A++ + S P+LF
Sbjct: 476 QDFSNLLNAFGRL--EILDLELFNLAAPEISAGIRDYNPQHLSNIAHAYSKVSVSQPELF 533
Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASL 372
+A+ + F +ELA + AFA +
Sbjct: 534 FRIAEMTRRSIQNFSNRELANLALAFAKM 562
>gi|221507781|gb|EEE33368.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 1444
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 72/137 (52%), Gaps = 4/137 (2%)
Query: 472 REDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQ---KVTSSFQKEV 528
R DI +++ +V+ L+L P +L L+ +A A + Q ++S ++V
Sbjct: 1018 RLDIGSVTRLQIVDLYLRLLRPPAFASLPFDLKAFLARARRVDLAQQDCFSLSSKLHRDV 1077
Query: 529 ARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIA 588
+ + GL E + +++D VL D+ +A EIDGP+HF R T + + + LK+R +
Sbjct: 1078 SSAFLRIGLVHRSEVQLGPFSLDIVLGDR-LAVEIDGPSHFYRETCMRVASSRLKQRLLR 1136
Query: 589 AAGWNVVSLSHQEWEEL 605
GW V+ +S EW +L
Sbjct: 1137 EMGWTVLPVSFFEWRQL 1153
Score = 43.1 bits (100), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 47/89 (52%), Gaps = 2/89 (2%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
Q SN+ A ++ E+L L + A + ++N Q+++N+A A++ + S P+LF
Sbjct: 476 QDFSNLLNAFGRL--EILDLELFNLAAPEISAGIRDYNPQHLSNIAHAYSKVSVSQPELF 533
Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASL 372
+A+ + F +ELA + AFA +
Sbjct: 534 FRIAEMTRRSIQNFSNKELANLALAFAKM 562
>gi|221483292|gb|EEE21611.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 1449
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 72/137 (52%), Gaps = 4/137 (2%)
Query: 472 REDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQ---KVTSSFQKEV 528
R DI +++ +V+ L+L P +L L+ +A A + Q ++S ++V
Sbjct: 1018 RLDIGSVTRLQIVDLYLRLLRPPAFASLPFDLKAFLARARRVDLAQQDCFSLSSKLHRDV 1077
Query: 529 ARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIA 588
+ + GL E + +++D VL D+ +A EIDGP+HF R T + + + LK+R +
Sbjct: 1078 SSAFLRIGLVHRSEVQLGPFSLDIVLGDR-LAVEIDGPSHFYRETCMRVASSRLKQRLLR 1136
Query: 589 AAGWNVVSLSHQEWEEL 605
GW V+ +S EW +L
Sbjct: 1137 EMGWTVLPVSFFEWRQL 1153
Score = 43.1 bits (100), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 47/89 (52%), Gaps = 2/89 (2%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
Q SN+ A ++ E+L L + A + ++N Q+++N+A A++ + S P+LF
Sbjct: 476 QDFSNLLNAFGRL--EILDLELFNLAAPEISAGIRDYNPQHLSNIAHAYSKVSVSQPELF 533
Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASL 372
+A+ + F +ELA + AFA +
Sbjct: 534 FRIAEMTRRSIQNFSNKELANLALAFAKM 562
>gi|159481474|ref|XP_001698804.1| predicted protein of CLR family [Chlamydomonas reinhardtii]
gi|158273515|gb|EDO99304.1| predicted protein of CLR family [Chlamydomonas reinhardtii]
Length = 1235
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 91/348 (26%), Positives = 158/348 (45%), Gaps = 52/348 (14%)
Query: 265 REMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQN 324
R +S A + LP QG+SN AWA +++G L +A AL K+ F +Q
Sbjct: 642 RMLSAWAAQTLEKLPSFEPQGLSNTAWAFARLGFHSPQL--FQALAAAALHKIDGFTAQG 699
Query: 325 VANVAGAFASMQHSAPDLFSELAKRASDIVHT--FQEQELAQVLWAFASL--YEPA--DP 378
++N+A A A+ H+ P LF LA++A+ + T F Q + LWA ASL Y+ A D
Sbjct: 700 LSNLAWAMATAGHAQPRLFEALARQAAALAPTGAFNAQNCSVTLWAAASLRHYDQALFDA 759
Query: 379 LLESLDNAFKD-ATQFTCCLNKALSNCN-ENGGVKSSGDADSEGSLSSPVLSF----NRD 432
+L L A ++ + C + ++N + S A++ L V++ ++
Sbjct: 760 MLRRLVAALEEGGAEADGCEPQNVANALWAVARMGHSLPAEAAAPLLRHVVALMPRMSQQ 819
Query: 433 QLGNIAWSYAVLGQMDRIFFSDIWKTISRFEE---QRISEQYREDIMFASQVHLVNQC-- 487
+L N W+ AV+ +MD ++ ++R + + + + Y +MF S HL
Sbjct: 820 ELCNSMWAVAVMDRMDEGLWAAFCACLTRLPDISPEGMHQAYHAQLMFHS--HLARAAGM 877
Query: 488 --LKLEH--------------PHLQLALSSVLEEKIASAGK---TKRFNQKVTSSFQKEV 528
KL+ P L L +V A++ + RF+Q+V+ + +
Sbjct: 878 PLSKLQALAAADPAAGSRSLLPCLPEPLHTVAASMWAASARDVHVSRFHQEVSGA----L 933
Query: 529 ARLLVSTGLNWI---REYAVD-GYTVDAVLVDKKVAFEIDGPTHFSRN 572
A V L W+ + ++VD G V+A + A E++G H++ N
Sbjct: 934 AVAGVPHALEWMTDDQHFSVDIGLQVNA----RPTAVEVNGSHHYASN 977
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 65/244 (26%), Positives = 104/244 (42%), Gaps = 37/244 (15%)
Query: 270 LVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDR-----VAEVALTKVGEFNSQN 324
L A+ + + A+G++N AWA G+L Y+ +A AL ++GEF+ QN
Sbjct: 383 LAALMINQINSFDARGLANSAWAF----GKLKYVPAAGTSLPTVIAAAALRRMGEFSPQN 438
Query: 325 VANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESL 383
++N+ +F M H L + A+ V F+ QELA ++WAFASL Y E +
Sbjct: 439 LSNLVWSFVYMHHVDEALLAAAARYVVARVGEFKPQELANIVWAFASLGYRD-----EHM 493
Query: 384 DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVL-------SFNRDQLGN 436
+ Q L K N + G L S ++ +F + N
Sbjct: 494 LHVVASQAQRIAPLFKEQELSNVLWALGKMGLRHRPDVLESLMVETRTKLPAFLPQGISN 553
Query: 437 IAWSYAVLGQMDRIFFSDIWKTISR----FEEQRI--------SEQYREDIMFASQVHLV 484
+AW+ A +G +D +F + R F+ Q + S Y + A+ LV
Sbjct: 554 VAWALAAVGHVDELFLDRVVAQCGRQLGAFDVQALANLVWAMASLGYYQPPFLAA---LV 610
Query: 485 NQCL 488
N+CL
Sbjct: 611 NECL 614
Score = 58.9 bits (141), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 65/256 (25%), Positives = 110/256 (42%), Gaps = 33/256 (12%)
Query: 268 SMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRV-----AEVALTKVGEFNS 322
+++ A A+ + E S Q +SN+ W+ +Y+ +D A + +VGEF
Sbjct: 421 TVIAAAALRRMGEFSPQNLSNLVWSF-------VYMHHVDEALLAAAARYVVARVGEFKP 473
Query: 323 QNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADP-LLE 381
Q +AN+ AFAS+ + + +A +A I F+EQEL+ VLWA + P +LE
Sbjct: 474 QELANIVWAFASLGYRDEHMLHVVASQAQRIAPLFKEQELSNVLWALGKMGLRHRPDVLE 533
Query: 382 SLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADS------EGSLSSPVLSFNRDQLG 435
SL + T+ L + +SN + + G D + +F+ L
Sbjct: 534 SL--MVETRTKLPAFLPQGISNVAW--ALAAVGHVDELFLDRVVAQCGRQLGAFDVQALA 589
Query: 436 NIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEH--P 493
N+ W+ A LG F + + R+S Q +I++ C L H P
Sbjct: 590 NLVWAMASLGYYQPPFLAALVNECLARGLDRLSPQNLSNILWG--------CATLGHRDP 641
Query: 494 HLQLALSSVLEEKIAS 509
+ A ++ EK+ S
Sbjct: 642 RMLSAWAAQTLEKLPS 657
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 76/180 (42%), Gaps = 34/180 (18%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVAL----TKVGEFNSQNVANVAGAFASMQHSA 339
Q +SN+ WAL K+G L V E + TK+ F Q ++NVA A A++ H
Sbjct: 511 QELSNVLWALGKMG-----LRHRPDVLESLMVETRTKLPAFLPQGISNVAWALAAVGHVD 565
Query: 340 PDLFSELAKRASDIVHTFQEQELAQVLWAFASL--YEPADPLLESLDNAFKDATQFTCCL 397
+ + + F Q LA ++WA ASL Y+P P L +L N CL
Sbjct: 566 ELFLDRVVAQCGRQLGAFDVQALANLVWAMASLGYYQP--PFLAALVNE---------CL 614
Query: 398 NKALSNCNENG------GVKSSGDADSE--GSLSSPVL----SFNRDQLGNIAWSYAVLG 445
+ L + G + G D + ++ L SF L N AW++A LG
Sbjct: 615 ARGLDRLSPQNLSNILWGCATLGHRDPRMLSAWAAQTLEKLPSFEPQGLSNTAWAFARLG 674
>gi|397576023|gb|EJK50024.1| hypothetical protein THAOC_31047, partial [Thalassiosira oceanica]
Length = 292
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 119/286 (41%), Gaps = 54/286 (18%)
Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVL 366
D VA L + FN QN++N+ AFA+ S LF +L+ A+ + Q +A L
Sbjct: 13 DHVA--GLGSLNSFNPQNLSNITWAFATAGVSHTKLFEKLSDAAARKGEFIETQHIANFL 70
Query: 367 WAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPV 426
WA A++ + D F+ AL+ ++S +
Sbjct: 71 WACATV-------------GYTDERLFS-----ALTPV-----------------IASKL 95
Query: 427 LSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQ 486
FN L NIAW+Y+V + F+ + E+ S + A
Sbjct: 96 DKFNLQNLANIAWAYSVANTPRQDLFNKGYAGALASIEKDFSAE-----GLAQLHQWQLW 150
Query: 487 CLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV- 545
+LE + L L+ K +A ++ F++ S Q +V L +TGL E +
Sbjct: 151 QQELES---GIELPRSLQAKCRNAFTSQGFSE---SKLQNDVVDELKATGLVLDEEVLLG 204
Query: 546 DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIA 588
GY +DA++ KVA E+DGP+HF P G T+LK R +A
Sbjct: 205 SGYRIDALVKIGDGGKVAVEVDGPSHFIDRR--PTGSTILKHRQVA 248
Score = 43.1 bits (100), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 54/108 (50%), Gaps = 4/108 (3%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEF-NSQNVANVAGAF 332
+ +L + Q +SNI WA + G + + +++++ A K GEF +Q++AN A
Sbjct: 17 GLGSLNSFNPQNLSNITWAFATAG--VSHTKLFEKLSDAAARK-GEFIETQHIANFLWAC 73
Query: 333 ASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLL 380
A++ ++ LFS L + + F Q LA + WA++ P L
Sbjct: 74 ATVGYTDERLFSALTPVIASKLDKFNLQNLANIAWAYSVANTPRQDLF 121
>gi|397563361|gb|EJK43767.1| hypothetical protein THAOC_37756, partial [Thalassiosira oceanica]
Length = 1452
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 132/318 (41%), Gaps = 30/318 (9%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
+ +L Q +SN AWA + + + +++ EV + K F+ + ++N A A
Sbjct: 582 GLGSLDSFKPQNLSNTAWAYAT--ARVFHSRLFEKLTEV-VAKKDHFDERAISNFLWACA 638
Query: 334 SMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPAD--PLLESLDNAFKDAT 391
++ ++ LFS A +H EQ+LA + WA++ P P+ + +
Sbjct: 639 TVGYTDERLFSAFAPVIESKLHECNEQDLANIAWAYSVANIPKQDLPVRKGEFIEIQHIA 698
Query: 392 QFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIF 451
F L C G + ++S + N L NIAW+Y+V +F
Sbjct: 699 NF-------LWACVTVGHTDERLLSAFAPVIASKLDECNDQDLANIAWAYSVANAPQDVF 751
Query: 452 FSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAG 511
++ +E++ EQ A +LE + L L K +
Sbjct: 752 NKGYVVALALYEKEFSGEQ------LAQLHQWQLWQQELES---GIELPRSLRAKCRNTF 802
Query: 512 KTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPT 567
++ F++ S Q V L GL+ E + GY +DA++ ++KVA E+DGP+
Sbjct: 803 TSQGFSE---SKLQNNVVDELRIAGLDLGEEVLLGSGYRIDALVKVGDERKVAVEVDGPS 859
Query: 568 HFSRNTGVPLGHTMLKRR 585
HF + P G T LK R
Sbjct: 860 HFIQRR--PAGSTTLKHR 875
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 47/166 (28%), Positives = 68/166 (40%), Gaps = 9/166 (5%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
Q SN AWA + G L L L + FN Q ++N A AFAS S P LF
Sbjct: 514 QDFSNTAWAFATAGASHLELFNKIGNHIAGLGSLDSFNPQALSNTAWAFASAGESHPKLF 573
Query: 344 SELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
++ + + + +F+ Q L+ WA+A+ L E L F +A+
Sbjct: 574 KKIGDHIAGLGSLDSFKPQNLSNTAWAYATARVFHSRLFEKLTEVVAKKDHFD---ERAI 630
Query: 402 SN----CNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
SN C G + + S + N L NIAW+Y+V
Sbjct: 631 SNFLWACATVGYTDERLFSAFAPVIESKLHECNEQDLANIAWAYSV 676
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 52/204 (25%), Positives = 86/204 (42%), Gaps = 24/204 (11%)
Query: 273 IAMTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAG 330
I M +L + Q +SNI WA + G L+ D VA L + F Q+++N+A
Sbjct: 386 IVMRSLNDFWPQDVSNIVWAYAAAGVSHPELFKKIGDHVA--GLDSLDSFEPQHLSNIAW 443
Query: 331 AFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFK 388
+FA++ S P LF ++ + + + +F+ Q L+ + WA A E L + + +
Sbjct: 444 SFATVGESNPKLFKKIGDHVAGLGSLGSFKPQALSNISWACAKAGESNPKLFKKIGDHIA 503
Query: 389 DATQFTCCLNKALSNCNENGGVKSSGDADSE------------GSLSSPVLSFNRDQLGN 436
+ + SN ++G + E GSL SFN L N
Sbjct: 504 GPSSLGSFYPQDFSNTAW--AFATAGASHLELFNKIGNHIAGLGSLD----SFNPQALSN 557
Query: 437 IAWSYAVLGQMDRIFFSDIWKTIS 460
AW++A G+ F I I+
Sbjct: 558 TAWAFASAGESHPKLFKKIGDHIA 581
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 86/208 (41%), Gaps = 35/208 (16%)
Query: 284 QGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
Q +SNIAW+ + +G L+ D VA L +G F Q ++N++ A A S P
Sbjct: 436 QHLSNIAWSFATVGESNPKLFKKIGDHVA--GLGSLGSFKPQALSNISWACAKAGESNPK 493
Query: 342 LFSELAKR--ASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNK 399
LF ++ + +F Q+ + WAFA+ L + N +
Sbjct: 494 LFKKIGDHIAGPSSLGSFYPQDFSNTAWAFATAGASHLELFNKIGNHIAGLGSLDSFNPQ 553
Query: 400 ALSNCNENGGVKSSGDADSE------------GSLSSPVLSFNRDQLGNIAWSYAVLGQM 447
ALSN S+G++ + GSL S F L N AW+YA
Sbjct: 554 ALSNTAW--AFASAGESHPKLFKKIGDHIAGLGSLDS----FKPQNLSNTAWAYATA--- 604
Query: 448 DRIFFSDIWKTIS-------RFEEQRIS 468
R+F S +++ ++ F+E+ IS
Sbjct: 605 -RVFHSRLFEKLTEVVAKKDHFDERAIS 631
Score = 45.8 bits (107), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 56/120 (46%), Gaps = 11/120 (9%)
Query: 259 LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVG 318
+ +T +R S + + L EC+ Q ++NIAWA S + + + D + G
Sbjct: 640 VGYTDERLFSAFAPVIESKLHECNEQDLANIAWAYS-----VANIPKQD-----LPVRKG 689
Query: 319 EF-NSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPAD 377
EF Q++AN A ++ H+ L S A + + +Q+LA + WA++ P D
Sbjct: 690 EFIEIQHIANFLWACVTVGHTDERLLSAFAPVIASKLDECNDQDLANIAWAYSVANAPQD 749
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 39/158 (24%), Positives = 64/158 (40%), Gaps = 38/158 (24%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHS----APDLFSELAKRASDIVHTFQEQE 361
D +A A+ + EF++++++N+ +F ++ + LF+ + A I+HTF Q
Sbjct: 263 FDSIASSAVGMLNEFDARHLSNLIYSFGLVERNPYIGGETLFNVFREAAVKILHTFISQN 322
Query: 362 LAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGS 421
L+ +LWAF + K++ F E G V S D D
Sbjct: 323 LSNMLWAFVKVDA-------------KNSRLF-----------QETGRVISGMDLD---- 354
Query: 422 LSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTI 459
SF NI WS+A G+ D F + I
Sbjct: 355 ------SFKPQDFANILWSFAKSGEADSKLFQALGNHI 386
>gi|397639734|gb|EJK73730.1| hypothetical protein THAOC_04631, partial [Thalassiosira oceanica]
Length = 856
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 69/310 (22%), Positives = 128/310 (41%), Gaps = 51/310 (16%)
Query: 284 QGISNIAWALSK-----------IGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAF 332
Q ++NI W+ SK +G + + +D F+ Q ++N A AF
Sbjct: 574 QALANILWSFSKSSKADPEPFRLLGNHIANMGRLD-----------SFDPQALSNTAWAF 622
Query: 333 ASMQHSAPDLFSELAKRAS--DIVHTFQEQELAQVLWAFAS-------LYEPADPLLESL 383
A+ S P+L ++ + D + +F QEL+ +WA+A+ L+E + +
Sbjct: 623 ATAGQSNPELLKKIGDHVAGLDSLDSFNPQELSNTIWAYATARVLDLGLFEKLATEVAAR 682
Query: 384 DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
+ F + ++ L C G + + S + N+ L NIAW+Y+V
Sbjct: 683 NGQFIETQH----MSNFLWACATVGYTDERMFSAFAPVIESKLDECNKQDLANIAWTYSV 738
Query: 444 LGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVL 503
F ++ +E E + + + ++L L
Sbjct: 739 ANAPQDTFNKGYVSALAAYENAFSKEALSQLHQWQLLQQELESGVELPQ---------SL 789
Query: 504 EEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKV 559
+EK +A + +++ S Q +V L + GL+ E + GY +DA++ ++KV
Sbjct: 790 QEKCRNAFTSLGYSE---SKLQNDVVGELKAAGLDLDEEVLLGSGYRIDALVKIGDERKV 846
Query: 560 AFEIDGPTHF 569
A E+DGP+HF
Sbjct: 847 AVEVDGPSHF 856
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 39/159 (24%), Positives = 74/159 (46%), Gaps = 27/159 (16%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHS----APDLFSELAKRASDIVHTFQEQE 361
D +A A + +F++++++N+ +F ++ + LF+ A I+HTF+ QE
Sbjct: 478 FDSIASSAAVVLNKFDARHLSNLIYSFGLVERNPEIRGKTLFNVFGTAAVKILHTFKPQE 537
Query: 362 LAQVLWAF-------ASLYEPADPLLESLD-NAFKDATQFTCCLNKALSNCNENGGVKSS 413
L+ +LWAF + L++ ++ +D +FK +AL+N + S
Sbjct: 538 LSNMLWAFVKVDAKNSRLFQETCRVISGMDLGSFKP---------QALANILWSFSKSSK 588
Query: 414 GDADSEGSLSSPVL------SFNRDQLGNIAWSYAVLGQ 446
D + L + + SF+ L N AW++A GQ
Sbjct: 589 ADPEPFRLLGNHIANMGRLDSFDPQALSNTAWAFATAGQ 627
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 45/198 (22%), Positives = 79/198 (39%), Gaps = 38/198 (19%)
Query: 284 QGISNIAWALSKIGGELLYL-SEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
Q +SN+ WA K+ + L E RV ++ +G F Q +AN+ +F+ + P+
Sbjct: 536 QELSNMLWAFVKVDAKNSRLFQETCRV--ISGMDLGSFKPQALANILWSFSKSSKADPEP 593
Query: 343 FSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKA 400
F L +++ + +F Q L+ WAFA+ + LL+ +
Sbjct: 594 FRLLGNHIANMGRLDSFDPQALSNTAWAFATAGQSNPELLKKI----------------- 636
Query: 401 LSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTIS 460
D L S + SFN +L N W+YA +D F + ++
Sbjct: 637 ---------------GDHVAGLDS-LDSFNPQELSNTIWAYATARVLDLGLFEKLATEVA 680
Query: 461 RFEEQRISEQYREDIMFA 478
Q I Q+ + ++A
Sbjct: 681 ARNGQFIETQHMSNFLWA 698
>gi|237832727|ref|XP_002365661.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
gi|211963325|gb|EEA98520.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
Length = 861
Score = 62.0 bits (149), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 48/97 (49%), Gaps = 3/97 (3%)
Query: 509 SAGKTKRF-NQKVTSSFQKEVARLLVSTGL--NWIREYAVDGYTVDAVLVDKKVAFEIDG 565
S +TK+ N S QK V RLL GL EY + Y +D + +K+ E+DG
Sbjct: 738 SLARTKKLLNLVHVSQVQKRVGRLLFDEGLMSEIDVEYPLGPYVLDFAIPSRKLVVEVDG 797
Query: 566 PTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
HF T VP T +KR +AA GW VV + + W
Sbjct: 798 EAHFFFGTTVPTAQTRMKRELLAAMGWRVVVVPQELW 834
>gi|221488117|gb|EEE26331.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 862
Score = 62.0 bits (149), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 48/97 (49%), Gaps = 3/97 (3%)
Query: 509 SAGKTKRF-NQKVTSSFQKEVARLLVSTGL--NWIREYAVDGYTVDAVLVDKKVAFEIDG 565
S +TK+ N S QK V RLL GL EY + Y +D + +K+ E+DG
Sbjct: 739 SLARTKKLLNLVHVSQVQKRVGRLLFDEGLMSEIDVEYPLGPYVLDFAIPSRKLVVEVDG 798
Query: 566 PTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
HF T VP T +KR +AA GW VV + + W
Sbjct: 799 EAHFFFGTTVPTAQTRMKRELLAAMGWRVVVVPQELW 835
>gi|384254362|gb|EIE27836.1| hypothetical protein COCSUDRAFT_83456 [Coccomyxa subellipsoidea
C-169]
Length = 454
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 85/342 (24%), Positives = 139/342 (40%), Gaps = 37/342 (10%)
Query: 270 LVAIAMTALPECSAQGISNIAWALSKI----GGELLYLSEMDRVAEVALTKVGEFNSQNV 325
+ A A L E QGIS + W K+ G+LL D++A V + Q V
Sbjct: 77 IAAAASARLHEFQPQGISMLTWGYGKLDHAPAGDLL-----DQIAHALELDVSVYRHQAV 131
Query: 326 ANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPAD---PLLE 381
AN+ +FA +Q +P L + + +D F QEL +LWAF + P LLE
Sbjct: 132 ANMFYSFARLQKDSPTLCAAVETHVTDHAEDFSPQELMNILWAFVKFRFVPKQFIAALLE 191
Query: 382 SL---DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIA 438
+ D A + L L++ + + + G P S +L N+
Sbjct: 192 YVLDEDRARTFRSSDWAALIWGLASLGVSVPAEPMAAINKAGLQHLP--SMTAPELCNVM 249
Query: 439 WSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLA 498
W ++L + ++ F + + ++Q + E L+ Q L+
Sbjct: 250 WGLSILDECNQPIFVESMSQLLENKQQTVLEP-----------RLLRQLLQASALAQAAD 298
Query: 499 LSSVLEEKI-ASAGKTKRFNQKVTSSFQKE-VARLLVSTGL-NWIREYAVDGY-TVDAVL 554
+S L E + +A K R S + V+R L + G+ + + + +G TVD L
Sbjct: 299 VSVSLPEPVHKAAAKWWRATANTVPSLTHDGVSRTLKNLGVKHRVLVFLQEGLPTVDIAL 358
Query: 555 V----DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW 592
KVA ++ GP S NT LG + R ++A+GW
Sbjct: 359 EAWGDQPKVAIQVVGPHEVSTNTNTLLGRATAEARLLSASGW 400
>gi|221508635|gb|EEE34204.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 863
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 48/97 (49%), Gaps = 3/97 (3%)
Query: 509 SAGKTKRF-NQKVTSSFQKEVARLLVSTGL--NWIREYAVDGYTVDAVLVDKKVAFEIDG 565
S +TK+ N S QK V RLL GL EY + Y +D + +K+ E+DG
Sbjct: 740 SLARTKKLLNLVHVSQVQKRVGRLLFDEGLMSEIDVEYPLGPYVLDFAIPSRKLVVEVDG 799
Query: 566 PTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
HF T VP T +KR +AA GW VV + + W
Sbjct: 800 EAHFFFGTTVPTAQTRMKRELLAAMGWRVVVVPQELW 836
>gi|308813528|ref|XP_003084070.1| unnamed protein product [Ostreococcus tauri]
gi|116055953|emb|CAL58486.1| unnamed protein product [Ostreococcus tauri]
Length = 812
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 71/278 (25%), Positives = 120/278 (43%), Gaps = 49/278 (17%)
Query: 357 FQEQELAQVLWAFASL-YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGD 415
Q Q L+ +LWA A L Y P D E AF + + + G G
Sbjct: 395 LQTQSLSNILWALAILRYVPED---EDFLVAFSERSLIEL----------QQGRFSYQGL 441
Query: 416 ADSEGSLSSPVLSFNRDQ---------LGN-IAWSYAVLGQMDRIFFSDIWKTISRFEEQ 465
++ + S VL N Q +GN +A ++ G + +F + + + E+
Sbjct: 442 TNTVWAFS--VLGINPGQTLLDEFAREIGNRLAGYFSSQGVSNSLF---AFAVLEYWPEK 496
Query: 466 RISEQYREDIM-------FA----SQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTK 514
+ + YR ++ F+ +Q+ N + PH L + +A K
Sbjct: 497 WVVDAYRAKLVETEKTTGFSEIDWTQLFQANVVFERYSPHGALITDPKMLAAAEAAWKVG 556
Query: 515 RFNQKVTSSFQKEVARLLVSTGL-NWIREYAVDG-YTVDAVLVDKKVAFEIDGPTHFSRN 572
++ V S F +EV+ L G+ + I + DG +++D L KKVA E+DGP+HF+RN
Sbjct: 557 S-SKVVISQFHREVSETLTEMGVPHEIEKLVEDGLFSLDIALKGKKVAIEVDGPSHFARN 615
Query: 573 T------GVPLGHTMLKRRYIAAAGWNVVSLSHQEWEE 604
G G T ++ R + ++GW++V + EW E
Sbjct: 616 IRDRRLEGKDAGVTNMRTRCLTSSGWSIVHVPWFEWAE 653
Score = 46.2 bits (108), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 52/102 (50%), Gaps = 8/102 (7%)
Query: 280 ECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASM--QH 337
E +AQGISN WA + +G +L E+ A+ +V +F S +N+ A +M +
Sbjct: 152 EFAAQGISNSLWAFATLGYQL--RPELVSKFSQAIRRVKDFKSMEFSNMIWAVGTMKIEL 209
Query: 338 SAPDLFSELAKRA----SDIVHTFQEQELAQVLWAFASLYEP 375
P+LF E+ + + + Q ++ +LWA ASL +P
Sbjct: 210 DPPELFDEILDECLASMKALPNMWSSQSVSNILWAMASLNKP 251
>gi|399216298|emb|CCF72986.1| unnamed protein product [Babesia microti strain RI]
Length = 838
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 70/127 (55%), Gaps = 7/127 (5%)
Query: 486 QCLKLEHPHLQL----ALSSVLEEKIASAGKTK-RFNQKV--TSSFQKEVARLLVSTGLN 538
Q ++L + +L L LS L+E + A K + FN+ + +SS +E++ L + G+N
Sbjct: 710 QTVQLYYKYLYLEGYNRLSDNLKELLEKAIKARISFNEYLPKSSSSHRELSTYLFAAGVN 769
Query: 539 WIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLS 598
+ E + Y++D V+ + K E DGP+HF T + ++LK + + G+N++ +
Sbjct: 770 HLNEVRLGPYSLDIVISNTKTVIEYDGPSHFYCETTMRSPKSLLKHDILISMGYNLIHVP 829
Query: 599 HQEWEEL 605
EWE+L
Sbjct: 830 FFEWEQL 836
>gi|302780213|ref|XP_002971881.1| hypothetical protein SELMODRAFT_412578 [Selaginella moellendorffii]
gi|300160180|gb|EFJ26798.1| hypothetical protein SELMODRAFT_412578 [Selaginella moellendorffii]
Length = 240
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 17/86 (19%)
Query: 538 NWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSL 597
WI EY Y++D F+ G LGHT+LK R + AAGW ++S
Sbjct: 161 QWIPEYVDADYSLD-----------------FAMKGGDLLGHTVLKHRLLEAAGWKIISA 203
Query: 598 SHQEWEELQGSFEQLDYLRVILKDYI 623
S+ EWE LQG E +D+++ ++ +I
Sbjct: 204 SYAEWENLQGESEHVDFIQKLVTPHI 229
>gi|399217569|emb|CCF74456.1| unnamed protein product [Babesia microti strain RI]
Length = 368
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 51/97 (52%), Gaps = 4/97 (4%)
Query: 522 SSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTM 581
S Q+ V +L+ GL++ EY + Y +D VL ++A E++G +HF T + T
Sbjct: 265 SKLQRTVTKLIGELGLDFAEEYPLGPYLIDLVLPKHRIAIEVNGFSHFYDQTILHTSKTR 324
Query: 582 LKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVI 618
LK + GW + ++H +W+ + + D LR++
Sbjct: 325 LKYSIVQRMGWKIAEINHHQWKNINRT----DRLRIL 357
>gi|428174671|gb|EKX43565.1| hypothetical protein GUITHDRAFT_140332 [Guillardia theta CCMP2712]
Length = 1069
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 54/103 (52%), Gaps = 19/103 (18%)
Query: 521 TSSFQKEVARLLVSTGLNWIREYAVD---GYTVDAVLVDKKV---------------AFE 562
S K+V + GL ++E VD GY++DA++ ++ A E
Sbjct: 903 VSPVTKQVVSCMKDLGLR-VQEEHVDSSTGYSIDALVEIPRMNKGGGAGGAGGEIFCAVE 961
Query: 563 IDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL 605
+DGP+HF RN VPLG T LKR+ + G+ VVS+ + EW+ L
Sbjct: 962 VDGPSHFPRNDYVPLGGTALKRKQLRKIGYRVVSIPYWEWDAL 1004
>gi|302831782|ref|XP_002947456.1| hypothetical protein VOLCADRAFT_87583 [Volvox carteri f. nagariensis]
gi|300267320|gb|EFJ51504.1| hypothetical protein VOLCADRAFT_87583 [Volvox carteri f. nagariensis]
Length = 1333
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 34/46 (73%)
Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWE 603
+VA E+DGP F+ NT PLG T+ +RR++ A GW VVS+ ++EW+
Sbjct: 1222 RVAVEVDGPERFTANTWKPLGTTLYRRRWLTAHGWTVVSVPYREWQ 1267
>gi|397645982|gb|EJK77069.1| hypothetical protein THAOC_01121, partial [Thalassiosira oceanica]
Length = 263
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 78/318 (24%), Positives = 132/318 (41%), Gaps = 72/318 (22%)
Query: 312 VALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASD--IVHTFQEQELAQVLWAF 369
V +G F ++++N A AFA+ S P+LF ++ ++ +F+ QEL+ +WA
Sbjct: 11 VGPGGLGSFKPRDLSNTAWAFATAGVSHPELFKKIGHHVAEQGCFDSFKPQELSNTVWAC 70
Query: 370 ASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSF 429
A++ + L + F + L C+E
Sbjct: 71 ATVGYTDERLF----------SAFAPVIGSKLDECSEQ---------------------- 98
Query: 430 NRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLK 489
+L NIAW+Y+V + F++ + E+ S + + + +
Sbjct: 99 ---ELTNIAWAYSVANLPRQDLFNEGYVGALASNEKDFSVKELAQLHQWQLLQQELK-YG 154
Query: 490 LEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYT 549
+E P LQ + V+ E + +AG ++EV LL S GY
Sbjct: 155 VELPQLQ---NDVVGE-LRAAG----------VDLEEEV--LLGS------------GYR 186
Query: 550 VDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQEWEEL 605
+DA++ ++VA E+DGP+HF P G T LK R +A VVS+ + EW+ L
Sbjct: 187 IDALVKFGGGRRVAVEVDGPSHFIDRR--PAGRTTLKHRQVATLDRIEVVSVPYWEWDVL 244
Query: 606 QGSFEQLDYLRVILKDYI 623
+ S + YLR + K I
Sbjct: 245 ENSEMKQHYLRELSKGQI 262
>gi|307102859|gb|EFN51125.1| hypothetical protein CHLNCDRAFT_141313 [Chlorella variabilis]
Length = 720
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 55/108 (50%), Gaps = 12/108 (11%)
Query: 520 VTSSFQKEVARLLVSTGLNWIREYAVDG-YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLG 578
V + + +V R + G+ + E++ G Y++D L + K+A E+DGP HF+ N+ +G
Sbjct: 419 VAACYPPKVHRTVCGLGVPCVLEHSEAGEYSIDVALPEHKIAVEVDGPVHFAANSRHLMG 478
Query: 579 HTMLKRRY------IAAA----GWNVVSLSHQEWEELQGSFEQLDYLR 616
T LKRR IAA GW V + + EW L + Y+R
Sbjct: 479 GTALKRRLLETLFCIAAPMQRLGWRAVDVPYYEWWAL-APARRPSYMR 525
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 60/113 (53%), Gaps = 9/113 (7%)
Query: 233 SPLNIATALHRIA--KNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIA 290
SP+ + +R+A K M++ M + HR R +L A + P + Q +S++A
Sbjct: 223 SPVGAEASENRLALGKAMQRHIMASPHRRGVAR-----LLAAASRQLAPRLAPQALSSLA 277
Query: 291 WALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
+ + IGG L ++ +A A+ + ++Q VAN+A A++++++ P L+
Sbjct: 278 HSFAAIGGCPWDL--LEELAARAVQLERQLDAQAVANLAWAYSTLRYDHPQLY 328
>gi|294909513|ref|XP_002777784.1| hypothetical protein Pmar_PMAR008719 [Perkinsus marinus ATCC 50983]
gi|239885746|gb|EER09579.1| hypothetical protein Pmar_PMAR008719 [Perkinsus marinus ATCC 50983]
Length = 222
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 55/106 (51%), Gaps = 1/106 (0%)
Query: 516 FNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGV 575
N+K+T Q E RL+ A + V D+ +A E+DGP+HF N+
Sbjct: 40 LNEKLTPEEQAEKQRLIKELTKKLAGPLADENGNVPTG-KDRPIAIEVDGPSHFYANSTK 98
Query: 576 PLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
+T LK R + G+ V+ + + EW +L+G+ E+ +Y+R LK+
Sbjct: 99 YTAYTKLKHRLLTRMGYKVLHVPYFEWRKLRGAKEREEYMRTKLKE 144
>gi|258597101|ref|XP_001347524.2| conserved Plasmodium protein [Plasmodium falciparum 3D7]
gi|254922454|gb|AAN35437.2| conserved Plasmodium protein [Plasmodium falciparum 3D7]
Length = 433
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 80/363 (22%), Positives = 144/363 (39%), Gaps = 74/363 (20%)
Query: 258 RLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKV 317
+L FT+ + + I M P+ ++ ++ I L K LS +D
Sbjct: 108 KLKFTKYSLYNNFIKIIMNKKPKIDSRMLTQILIDLHK-------LSSLD---------- 150
Query: 318 GEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPAD 377
NV F K+ +D F +L+ +L+ F Y
Sbjct: 151 --------INVLTFFTQYY----------IKKETD---QFSLFDLSMILYIFNK-YNYNH 188
Query: 378 PLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNI 437
+E++DN K +Q+ L +++ GV ++ LS L+ N ++
Sbjct: 189 --IETVDNISKTISQY------FLPYIDQDKGVLTTI------LLSISTLNLNYQFYLDV 234
Query: 438 A-------WSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKL 490
+ + + + I +S + + ++ + I DIM+ L+N KL
Sbjct: 235 MKKHVYKKYEHFEVKYLCNILYSILLRLVNTLHKDDILNIMLNDIMYI----LLNNINKL 290
Query: 491 EHPHL-QLALSSVL-----EEKIASAGKT---KRFNQKVTSS-FQKEVARLLVSTGLNWI 540
++ L QL +S EEK A K K VT+S Q+++A+L GLN
Sbjct: 291 KNEELKQLHISLYYLKDMKEEKYEEARKIIEKKNIKDTVTTSKIQQQIAKLFKEIGLNVE 350
Query: 541 REYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQ 600
+E+ + Y +D L KK+ E++G TH+ G T LK + W V+++ +
Sbjct: 351 KEFLIGPYVLDFALKKKKICIEVNGFTHYYNFNGKINAKTTLKYYILNKLKWKVLTIEYM 410
Query: 601 EWE 603
+W+
Sbjct: 411 DWK 413
>gi|255076950|ref|XP_002502137.1| predicted protein [Micromonas sp. RCC299]
gi|226517402|gb|ACO63395.1| predicted protein [Micromonas sp. RCC299]
Length = 1128
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 98/455 (21%), Positives = 160/455 (35%), Gaps = 145/455 (31%)
Query: 286 ISNIAWALSKIGGELLYLSEMDRVA----EVALTKVGEFNSQNVANVAGAFASMQ-HSAP 340
+N+ WA +K L + DR E + K+ +F++Q +AN A+A++Q A
Sbjct: 533 FANLLWAFAK-----LNHTPGDRFQAEFEEAVIEKISKFDAQVLANTVYAYAALQLPGAR 587
Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDNAFK---------DA 390
++ + D +H F+ +EL VLWAF Y+P + + A + +
Sbjct: 588 NVLPLIGLHFKDRLHEFKPRELLMVLWAFTRCSYDPGADAMARFERAMRPMTDNLAPDEV 647
Query: 391 TQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVL------ 444
TQ+ + L G ++ E + F+ + W+YA L
Sbjct: 648 TQYLWA-SAVLKYRPTEGALRG-----FETRIVDCPSRFSGTPIALTLWAYATLNLPPPF 701
Query: 445 GQMDRIFFSDIWKTISRFEEQRISEQYREDIMF-------------ASQVHLVN------ 485
MDR F D E R E Y +D+ A V +VN
Sbjct: 702 AVMDR--FGD------ELELSRADEFYPQDLSLGFWSAAVIMTQPKADDVPMVNALDTGA 753
Query: 486 --QCLKLEHPHL-QLALSSVLEEKIA--------------------------------SA 510
+ L+ HL L +SV E ++ S+
Sbjct: 754 RERVLRQMAKHLGSLGATSVSPEGLSAIYMAILAVEMHSPSLFAELKSNWGHLAAAAESS 813
Query: 511 GKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDK---KVAFEIDGPT 567
+ + S QK V + L G+ + E V G + V K K+ E+DGP
Sbjct: 814 WRATKGKGPTVSKLQKAVGKTLDELGVEYESEKLVRGGLIRPDFVVKGKAKIVVEVDGPY 873
Query: 568 HFSRNTGV-----------------------------------PLGHTMLKRRYIAAAGW 592
HFS PLG T+L+ + +++ GW
Sbjct: 874 HFSVEPSAASDAGEELEDWFGGGGGETPDALEKDRFGFGSVLRPLGGTILRNQLLSSWGW 933
Query: 593 NVVSLSHQEW-------------EELQGSFEQLDY 614
NVV++S+++W E L+G +Q Y
Sbjct: 934 NVVTVSYRDWVKADNDTSGGAKREYLKGLLDQAGY 968
Score = 40.0 bits (92), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 66/157 (42%), Gaps = 37/157 (23%)
Query: 195 KEINLNKDIVDAQTAQEVLEVI-----AEMITAVGKGLSPSPLSPLNIATALHRIAKNME 249
K I +N+D+ A +V V+ AE AV N+ATA R+ +++
Sbjct: 126 KRIGVNQDLAKASKIDDVRFVVQKNGNAEAFNAV------------NVATAYSRLGRHVR 173
Query: 250 KVSMMTTH----RLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIG---GELLY 302
T LA R+ A A+T PE SA S++ WAL + G G +
Sbjct: 174 DWERGTLDGAEWYLALERR-------ARALT--PEMSAWAASSVTWALGRTGRNPGAAFW 224
Query: 303 LSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
+ ++ VA E Q VANV A+++H A
Sbjct: 225 VDLEAKLCTVA----DELEPQGVANVLWGLAALEHRA 257
>gi|401410506|ref|XP_003884701.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325119119|emb|CBZ54671.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 1458
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/135 (27%), Positives = 70/135 (51%), Gaps = 4/135 (2%)
Query: 474 DIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQ---KVTSSFQKEVAR 530
+I +++ +V+ L+L P L +L L+ ++ + Q ++S ++V+
Sbjct: 1038 EIGSVTRLQIVDLYLRLLRPELFASLPFDLKAFLSRVRRVDLTQQDCFSLSSKMHRDVSA 1097
Query: 531 LLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAA 590
+ GL E +++D VL D+ +A EIDGP+HF R T + + + LK+R +
Sbjct: 1098 AFLRIGLVHRSEVQFGPFSLDIVLGDR-LAVEIDGPSHFYRETCMRVASSRLKQRLLREM 1156
Query: 591 GWNVVSLSHQEWEEL 605
GW ++ +S EW +L
Sbjct: 1157 GWTLLPVSFFEWRQL 1171
Score = 42.0 bits (97), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 47/89 (52%), Gaps = 2/89 (2%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
Q ISN+ A K+ E+L + +R A + ++N Q+++N+A A++ + +LF
Sbjct: 487 QDISNLLNAFGKL--EILDVELFNRAAPKIADGIRDYNPQHLSNIAHAYSKVSVPQSELF 544
Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASL 372
+A+ V F +ELA + AFA +
Sbjct: 545 VRIAEMTRRSVQNFSTKELANLALAFAKM 573
>gi|195998900|ref|XP_002109318.1| hypothetical protein TRIADDRAFT_53222 [Trichoplax adhaerens]
gi|190587442|gb|EDV27484.1| hypothetical protein TRIADDRAFT_53222 [Trichoplax adhaerens]
Length = 650
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 39/60 (65%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+++A EIDGP HF+ + LGHT++K R+++ GW+V+ + + EW +L E YL+
Sbjct: 585 ERIAIEIDGPVHFAYKSNRYLGHTIMKTRHLSLLGWHVIRVPYYEWNKLNDLPEIDRYLK 644
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/87 (26%), Positives = 44/87 (50%), Gaps = 3/87 (3%)
Query: 286 ISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSE 345
I+N+ WALSK + L + ++ + A+ + +FN +++ V + A + L +
Sbjct: 224 IANLMWALSK---DQLNIDIFQQLQQQAINNINKFNPISISMVCYSLALFGDRSEQLLTA 280
Query: 346 LAKRASDIVHTFQEQELAQVLWAFASL 372
+ R I++ Q +A + WAFA L
Sbjct: 281 IENRMLAIINLLDPQSIANIAWAFAKL 307
Score = 40.4 bits (93), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 25/92 (27%), Positives = 49/92 (53%), Gaps = 8/92 (8%)
Query: 285 GISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
IS + ++L+ G E L + +R+ + + + Q++AN+A AFA + ++
Sbjct: 259 SISMVCYSLALFGDRSEQLLTAIENRMLAI----INLLDPQSIANIAWAFAKLNWFNDEI 314
Query: 343 FSELAKRASDIV--HTFQEQELAQVLWAFASL 372
F + KR D + T + Q ++ ++WAFAS+
Sbjct: 315 FGFIQKRTLDNIGKRTLRPQSISNIIWAFASM 346
>gi|397582907|gb|EJK52455.1| hypothetical protein THAOC_28263, partial [Thalassiosira oceanica]
Length = 408
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 81/327 (24%), Positives = 131/327 (40%), Gaps = 57/327 (17%)
Query: 284 QGISNIAWALS--KIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
Q SNI WA + + L+ D VA L +G FN Q ++ A+A+ +
Sbjct: 39 QDFSNIVWAYATARESHPELFNKIGDHVAR--LGSLGSFNPQELSITVWAYATARVFHSR 96
Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
LF +L A F+ Q +A LWA A++ + L + F + L
Sbjct: 97 LFEKLTTEAVAKKDHFESQHIANFLWACATVGHTDERLFAA----------FAPLVGSKL 146
Query: 402 SNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISR 461
C+E +L NI+W+Y+V + F+ +
Sbjct: 147 DECSEQ-------------------------ELANISWAYSVANAPNLDLFNVGHVSALA 181
Query: 462 FEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVT 521
E+ S + A +LE + L L+ K +A ++ F++
Sbjct: 182 SNEKEFSAE-----GLAQLHQWQLWQQELES---GIELPQSLQAKCRNAFMSQCFSE--- 230
Query: 522 SSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPL 577
S Q +V L + GL+ E + GY +DA++ +KVA E+DGP+HF P
Sbjct: 231 SKLQNDVVGELRAAGLDLEEEVLLGSGYRIDALVKVGDGRKVAVEVDGPSHFIDRR--PA 288
Query: 578 GHTMLKRRYIAAAG-WNVVSLSHQEWE 603
G +LK R +A VVS+ + EW+
Sbjct: 289 GRAILKHRQVATLDRIEVVSVPYWEWD 315
Score = 39.3 bits (90), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 47/101 (46%), Gaps = 2/101 (1%)
Query: 275 MTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFAS 334
+ +L + Q +S WA + + + +++ A+ K F SQ++AN A A+
Sbjct: 69 LGSLGSFNPQELSITVWAYAT--ARVFHSRLFEKLTTEAVAKKDHFESQHIANFLWACAT 126
Query: 335 MQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEP 375
+ H+ LF+ A + EQELA + WA++ P
Sbjct: 127 VGHTDERLFAAFAPLVGSKLDECSEQELANISWAYSVANAP 167
>gi|323444921|gb|EGB01813.1| hypothetical protein AURANDRAFT_69470 [Aureococcus anophagefferens]
Length = 206
Score = 58.2 bits (139), Expect = 2e-05, Method: Composition-based stats.
Identities = 38/99 (38%), Positives = 52/99 (52%), Gaps = 4/99 (4%)
Query: 522 SSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLVDKK--VAFEIDGPTHFSRNTG-VPL 577
S Q EVA L GL+ E + DG +VD L+ K VA E DGP H+ RN VP
Sbjct: 31 SRAQVEVAERLEGMGLDVEHELVLPDGLSVDVALLPLKWRVAVEFDGPRHYFRNAKRVPT 90
Query: 578 GHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
G T K R + A GW V+ + + +W +L + +YL+
Sbjct: 91 GRTRFKMRLLRALGWRVLHVPYFDWAKLDDDAARTEYLK 129
>gi|303290512|ref|XP_003064543.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454141|gb|EEH51448.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 628
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 87/341 (25%), Positives = 137/341 (40%), Gaps = 45/341 (13%)
Query: 286 ISNIAWALSKI----GGELLYLSEMDRVAEVALTKVGE-FNSQNVANVAGAFASMQH--- 337
++NI WA + G E L + V E LT E + Q +AN+ +FA +H
Sbjct: 200 LANILWAFHVLKTYPGPECLAV-----VGERMLTLTDEDLHVQTLANMMYSFAQFEHLPG 254
Query: 338 -SAPDLFSELAKRA---SDIVH----TFQEQELAQVLWAFASL-YEPADPLLESLDNA-- 386
+ D +L RA +D+ T L+ ++WAF L Y+P++ + D
Sbjct: 255 RATMDRVEDLCARAFRSADVGEPGSVTPASNSLSNLIWAFGVLKYKPSEEFFAAFDAVVS 314
Query: 387 -----FKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQ-LGNIAWS 440
F D A N N N G + D+ + +S Q + N W+
Sbjct: 315 STLGDFNDQGVSNVLFTYA--NLNHNPGAQL---LDALARRCADFISVYAPQGVANTVWA 369
Query: 441 YAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEH--PHLQLA 498
+ VL D + + R +RIS+ ED +V L L L+ H
Sbjct: 370 WVVL---DGAKYPP--PALLRLYAERISKTRDEDFSKIDRVQLFQSHLALKQFSNHDGEL 424
Query: 499 LSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DG-YTVDAVLVD 556
LS + A S+ ++V+ L G+ E+ DG ++VD L
Sbjct: 425 LSGEMLRSCERAWMEVSAGNLTISAIHRDVSETLTRMGIPHEIEFLTSDGLFSVDIALRG 484
Query: 557 KKVAFEIDGPTHFSRNTGVP-LGHTMLKRRYIAAAGWNVVS 596
+KVA E+DGP+HF N +G +L+ + + GW V S
Sbjct: 485 RKVAIEVDGPSHFFANKRRERMGADLLRAALMQSKGWTVRS 525
Score = 45.8 bits (107), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 55/115 (47%), Gaps = 12/115 (10%)
Query: 286 ISNIAWA-----LSKIGGEL---LYLSEMDRVAEVALTKVGEFNSQNVANV---AGAFAS 334
+SNI WA LS + G L + ++ D + + F+SQ+VAN AG
Sbjct: 75 LSNIVWAIASMNLSGLSGGLPREVMVALDDAMCRSIASDPDTFSSQSVANTLWAAGNAPD 134
Query: 335 MQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA-SLYEPADPLLESLDNAFK 388
+ +P L LA + D HTF Q + +W FA + + P D L++ + A+K
Sbjct: 135 VVTLSPRLMDALASVSCDKFHTFTPQGMTNTIWGFACNGHHPGDELMDKMREAWK 189
Score = 44.3 bits (103), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 74/186 (39%), Gaps = 29/186 (15%)
Query: 282 SAQGISNIAWALSKIGGELLYLSE--MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
S+Q ++N WA +++ LS MD +A V+ K F Q + N FA H
Sbjct: 118 SSQSVANTLWAAGN-APDVVTLSPRLMDALASVSCDKFHTFTPQGMTNTIWGFACNGHHP 176
Query: 340 PD-LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADP------------------LL 380
D L ++ + HT+ ELA +LWAF L P +
Sbjct: 177 GDELMDKMREAWKRSGHTYIVTELANILWAFHVLKTYPGPECLAVVGERMLTLTDEDLHV 236
Query: 381 ESLDNAFKDATQFTCCLNKALSNCNENGGVKS--SGDADSEGSLSSPVLSFNRDQLGNIA 438
++L N QF +A + E+ ++ S D GS++ + L N+
Sbjct: 237 QTLANMMYSFAQFEHLPGRATMDRVEDLCARAFRSADVGEPGSVTPA-----SNSLSNLI 291
Query: 439 WSYAVL 444
W++ VL
Sbjct: 292 WAFGVL 297
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 39/81 (48%), Gaps = 5/81 (6%)
Query: 297 GGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA-----PDLFSELAKRAS 351
G E L ++ + R+ E+ K+ EF Q V+N FAS+ + PD S
Sbjct: 5 GDEYLPMAMLARLEELVRVKMDEFIPQGVSNCIWGFASLNKNKGLELRPDTVSRFGDGIV 64
Query: 352 DIVHTFQEQELAQVLWAFASL 372
+ F+ EL+ ++WA AS+
Sbjct: 65 RLASGFKSMELSNIVWAIASM 85
>gi|156089331|ref|XP_001612072.1| hypothetical protein [Babesia bovis T2Bo]
gi|154799326|gb|EDO08504.1| hypothetical protein BBOV_III009480 [Babesia bovis]
Length = 239
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 41/66 (62%)
Query: 556 DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
D+ +A E+DGP+HF N+ +T LK R + G+ V+ + + EW L+G+ E+ +Y+
Sbjct: 133 DRPIAIEVDGPSHFYANSTKYTAYTKLKHRLLTRMGYKVLHVPYFEWRRLRGAKEREEYM 192
Query: 616 RVILKD 621
R LK+
Sbjct: 193 REKLKE 198
>gi|294946233|ref|XP_002784988.1| hypothetical protein Pmar_PMAR016478 [Perkinsus marinus ATCC 50983]
gi|239898352|gb|EER16784.1| hypothetical protein Pmar_PMAR016478 [Perkinsus marinus ATCC 50983]
Length = 132
Score = 57.0 bits (136), Expect = 3e-05, Method: Composition-based stats.
Identities = 34/105 (32%), Positives = 55/105 (52%), Gaps = 1/105 (0%)
Query: 516 FNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGV 575
N+K+T Q E RL+ A + V A D+ +A E+DGP+HF N+
Sbjct: 29 LNEKLTPEEQAEKQRLIKELTKKLAGPLADENGNVPAG-KDRPIAIEVDGPSHFYANSTK 87
Query: 576 PLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILK 620
+T LK R + G+ V+ + + EW +L+G+ E+ +Y+R LK
Sbjct: 88 YTAYTKLKHRLLTRMGYKVLHVPYFEWRKLRGAKEREEYMRTKLK 132
>gi|302834273|ref|XP_002948699.1| hypothetical protein VOLCADRAFT_104026 [Volvox carteri f.
nagariensis]
gi|300265890|gb|EFJ50079.1| hypothetical protein VOLCADRAFT_104026 [Volvox carteri f.
nagariensis]
Length = 3304
Score = 57.0 bits (136), Expect = 3e-05, Method: Composition-based stats.
Identities = 26/58 (44%), Positives = 34/58 (58%)
Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
+VA E+DGP HF+ NT PL T +RR + A GW VVS+ H W E + + D L
Sbjct: 3155 RVAVEVDGPAHFTANTKQPLSMTTYRRRCLEARGWVVVSVPHWRWFEFRSGQPERDVL 3212
Score = 46.6 bits (109), Expect = 0.043, Method: Composition-based stats.
Identities = 40/146 (27%), Positives = 64/146 (43%), Gaps = 12/146 (8%)
Query: 232 LSPLNIATALHRIAK-NMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIA 290
P+N+A ALHR+ + S +A + +E+ + ++ L + + Q + N
Sbjct: 2328 FEPVNVAAALHRLGSCGLAPGSTAVRQLMADPQFKELERMASV---TLGQFTPQHVGNAL 2384
Query: 291 WALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHS-APDLFSEL 346
WA +G GE L R+ EV + E QN++N A + S P + +L
Sbjct: 2385 WAFGTLGYHPGEPLLQGLTTRLLEV----LPEALPQNISNGLLGLAKLGWSPGPHVLDQL 2440
Query: 347 AKRASDIVHTFQEQELAQVLWAFASL 372
A+ + V F Q L LWA A L
Sbjct: 2441 ARGSVGKVPEFNAQALVNTLWAMAHL 2466
Score = 44.7 bits (104), Expect = 0.14, Method: Composition-based stats.
Identities = 46/154 (29%), Positives = 62/154 (40%), Gaps = 22/154 (14%)
Query: 330 GAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDNAFK 388
G+ A Q A F EL + AS + F Q + LWAF +L Y P +PLL+ L
Sbjct: 2348 GSTAVRQLMADPQFKELERMASVTLGQFTPQHVGNALWAFGTLGYHPGEPLLQGL----- 2402
Query: 389 DATQFTCCLNKALSNCNENG--GVKSSG--------DADSEGSLSSPVLSFNRDQLGNIA 438
T+ L +AL NG G+ G D + GS+ V FN L N
Sbjct: 2403 -TTRLLEVLPEALPQNISNGLLGLAKLGWSPGPHVLDQLARGSVGK-VPEFNAQALVNTL 2460
Query: 439 WSYAVLGQ----MDRIFFSDIWKTISRFEEQRIS 468
W+ A L + F K I F Q ++
Sbjct: 2461 WAMAHLNYVHEGLQTAMFEQALKRILEFNPQNVA 2494
Score = 40.0 bits (92), Expect = 3.4, Method: Composition-based stats.
Identities = 21/57 (36%), Positives = 31/57 (54%)
Query: 316 KVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
+V Q++AN+ A+ +++ AP LFS L V F EQEL+ +WA A L
Sbjct: 2637 RVWALRPQHIANLLWAYGTLEQPAPVLFSALLPTLLRRVAEFSEQELSNSVWAAARL 2693
>gi|429329946|gb|AFZ81705.1| hypothetical protein BEWA_011230 [Babesia equi]
Length = 1089
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 78/386 (20%), Positives = 143/386 (37%), Gaps = 90/386 (23%)
Query: 195 KEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMM 254
K +N+N + + Q ++ +++++A+ + L+P+N ATALHR+AK + +
Sbjct: 205 KWLNMNPNHIIIQQTIIKSKIPSQILSAITD--KHNQLNPINSATALHRLAKQIHPYN-- 260
Query: 255 TTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEM-------- 306
R + L+++ +PE +QG++NI W++ +I +LS++
Sbjct: 261 ---RHTILNHKSFGKLISVIEVHIPEFDSQGLTNILWSIVRIKITPTWLSQLLTQIDKNL 317
Query: 307 -----------------------------DRVAEVALTKVGEFNSQ-NVANVAGAFASMQ 336
++ + T++ F + + V+ A
Sbjct: 318 MVFNANELSSCLLSLSKVGIKNNESLELRSKLVALIRTRINGFKTPLELTCVSTGLARFN 377
Query: 337 HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCC 396
P LF ++++ D + F EL V W+FA L F D F
Sbjct: 378 VRDPILFGHISRQIIDSLDKFTMNELRGVAWSFAYL-------------GFNDRLLFANI 424
Query: 397 LNKALSNCNENG---------GVKSSGDADSEGSL--SSPVLSFNRDQL-----GNIAWS 440
N +N NE + +ADSE L SP++ N L IAW+
Sbjct: 425 RNFIENNANETNVKNVIRLAWALSKLKEADSELFLFTISPLIRSNISNLTCKDISTIAWA 484
Query: 441 Y----------------AVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLV 484
+ A+ QM+ + DI ++ F S + + M V +
Sbjct: 485 FLNAEIEDCDLFNDLATALQHQMEEMTTHDITSCVATFSHIEASHRVLFNKMKTRAVEIS 544
Query: 485 NQCLKLEHPHLQLALSSVLEEKIASA 510
N+ L+ + S +EK S
Sbjct: 545 NEFTPLQLAKIIRGFSYFSDEKFYSV 570
>gi|403221415|dbj|BAM39548.1| uncharacterized protein TOT_010001003 [Theileria orientalis strain
Shintoku]
Length = 418
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 55/104 (52%), Gaps = 1/104 (0%)
Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHT 580
TS Q ++++LL L + EY + Y +D V+ VA E++G THF N+ T
Sbjct: 315 TSKMQLKLSKLLDEIKLKYKSEYQLGPYRLDYVVPKLNVAIEVNGYTHFFHNSRELNALT 374
Query: 581 MLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIG 624
LK + + GWNVV +++ W+ + ++L+YL L YI
Sbjct: 375 QLKYKILKDMGWNVVGVNYYNWKN-RNKQDRLEYLIKELSPYIN 417
>gi|428177039|gb|EKX45921.1| hypothetical protein GUITHDRAFT_163172 [Guillardia theta CCMP2712]
Length = 976
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 54/197 (27%), Positives = 90/197 (45%), Gaps = 28/197 (14%)
Query: 275 MTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFAS 334
+ L E Q +SNI W+ + +G + ++ E+ + EF Q+VAN A+ +
Sbjct: 393 VPGLQEFKPQEVSNILWSYATVGFSSPTVFKL-LAFEILRRGLREFVPQDVANSVWAYVT 451
Query: 335 MQHSAPDLF----SELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESL--DNAFK 388
+ S +L S+ +R + F+ QELA ++WAFA P D LL + D A +
Sbjct: 452 VGQSTKELLHVVESDAERRG---LSAFKNQELANLIWAFAKADYPMDLLLRLVEQDIASR 508
Query: 389 DATQFTCCLNKALSNC----------NENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIA 438
D + F + + LSN +E+ +K + + S G + F ++ N A
Sbjct: 509 DLSLF---MPQELSNLVWAFATAGHRSEHLFLKIASEISSRG-----LADFKPQEIANTA 560
Query: 439 WSYAVLGQMDRIFFSDI 455
W+YA +G D F I
Sbjct: 561 WAYAKIGVQDEKLFHRI 577
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 45/183 (24%), Positives = 75/183 (40%), Gaps = 24/183 (13%)
Query: 284 QGISNIAWALSKIGGELLYL-SEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
Q +SN WA + G +L E++R E + F+ Q+++N+ AFA H AP L
Sbjct: 631 QELSNTVWAHASNGLTFPFLFGEVER--EAVRRGLRLFSPQDISNMLWAFAKADHVAPSL 688
Query: 343 FSELAKRASDI------VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFT-- 394
+ +L ++ + F+ QEL+ +LWA A A L + + K ++
Sbjct: 689 YEQLRANLEELRVADPGLTMFKAQELSNLLWAAAKTQHTARCLFSAAEEQVKQILKYAES 748
Query: 395 ------CCLNKALSNCNENGGVKSSGDAD-------SEGSLSSPVLSFNRDQLGNIAWSY 441
C L ++ S G E L+ +++F L NIAW+
Sbjct: 749 REERDETCAVVPLEVTDDMWRFASVGQTAEELFATLEEQVLTRDLMTFTTLHLANIAWAI 808
Query: 442 AVL 444
L
Sbjct: 809 VFL 811
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 11/104 (10%)
Query: 284 QGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
Q +SN+ WA + G E L+L +E++ + +F Q +AN A A+A +
Sbjct: 516 QELSNLVWAFATAGHRSEHLFL---KIASEISSRGLADFKPQEIANTAWAYAKIGVQDEK 572
Query: 342 LFSELAKRASDIVH----TFQEQELAQVLWAFASLYEPADPLLE 381
LF + I+H F QEL+ +LW+FA +D L +
Sbjct: 573 LFHRIEMEL--ILHRSLRPFIPQELSNILWSFAKFNIASDKLFQ 614
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 51/210 (24%), Positives = 80/210 (38%), Gaps = 48/210 (22%)
Query: 273 IAMTALPECSAQGISNIAWALSKIGGE---LLYLSEMDRVAEVALTKVGEFNSQNVANVA 329
I+ L + Q I+N AWA +KIG + L + EM+ + +L F Q ++N+
Sbjct: 543 ISSRGLADFKPQEIANTAWAYAKIGVQDEKLFHRIEMELILHRSLRP---FIPQELSNIL 599
Query: 330 GAFASMQHSAPDLF----SELAKRASDIVHTFQEQELAQVLWAFAS-------------- 371
+FA ++ LF E+ R + F+ QEL+ +WA AS
Sbjct: 600 WSFAKFNIASDKLFQVIGQEMLVRG---LQGFKPQELSNTVWAHASNGLTFPFLFGEVER 656
Query: 372 --------LYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLS 423
L+ P D + ++ AF A L + L E V G
Sbjct: 657 EAVRRGLRLFSPQD--ISNMLWAFAKADHVAPSLYEQLRANLEELRVADPG--------- 705
Query: 424 SPVLSFNRDQLGNIAWSYAVLGQMDRIFFS 453
+ F +L N+ W+ A R FS
Sbjct: 706 --LTMFKAQELSNLLWAAAKTQHTARCLFS 733
>gi|307111199|gb|EFN59434.1| hypothetical protein CHLNCDRAFT_49989 [Chlorella variabilis]
Length = 1328
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 51/88 (57%), Gaps = 3/88 (3%)
Query: 287 SNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH--SAPDLFS 344
SN+ WAL+ G E L +DR+A ++ F Q++AN+A A+A++ H +AP
Sbjct: 777 SNVLWALASEG-EALPGEALDRIAANLAPRLKSFGPQSLANIAWAYATLGHHPAAPHFLR 835
Query: 345 ELAKRASDIVHTFQEQELAQVLWAFASL 372
+LA A + F+ Q L+ ++W+ ASL
Sbjct: 836 QLAHAAQRCLPVFEPQGLSLLVWSLASL 863
Score = 43.1 bits (100), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 44/90 (48%), Gaps = 7/90 (7%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVAL---TKVGEFNSQNVANVAGAFASMQHSAP 340
Q ++N+ W + ++G Y + VAL +V + Q + N+ AFA + +
Sbjct: 891 QHLANLVWGMCRVG----YCPAQRFLEAVALEVQLRVCDLKPQELFNIVWAFAQLGYHPA 946
Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFA 370
LF +A A+ +F QEL+ +LWA A
Sbjct: 947 CLFDAVALEAAPQAVSFSPQELSGMLWALA 976
Score = 39.7 bits (91), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 43/184 (23%), Positives = 69/184 (37%), Gaps = 42/184 (22%)
Query: 291 WALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRA 350
W+ +++G L ++ A A ++ F +A VA + A ++ AP + +
Sbjct: 707 WSFARMGTSSRKL--LETAAACAEQQLAAFTPAQLAKVAWSLAKLRWPAPRVLRHAGAQL 764
Query: 351 SDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGV 410
++ F ++E + VLWA AS E A P +AL N
Sbjct: 765 AERTAAFNDKEASNVLWALASEGE-ALP-------------------GEALDRIAAN--- 801
Query: 411 KSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMD------RIFFSDIWKTISRFEE 464
L+ + SF L NIAW+YA LG R + + FE
Sbjct: 802 -----------LAPRLKSFGPQSLANIAWAYATLGHHPAAPHFLRQLAHAAQRCLPVFEP 850
Query: 465 QRIS 468
Q +S
Sbjct: 851 QGLS 854
>gi|308798807|ref|XP_003074183.1| unnamed protein product [Ostreococcus tauri]
gi|116000355|emb|CAL50035.1| unnamed protein product [Ostreococcus tauri]
Length = 525
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 64/269 (23%), Positives = 117/269 (43%), Gaps = 38/269 (14%)
Query: 198 NLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAK-NMEKVSMMTT 256
+L D++DA + +L + E K +N +TALHR+A+ ++V T
Sbjct: 140 DLQGDLMDASDVEVILTTVEEQEEVFNK---------VNASTALHRVARLATQRVPGQTK 190
Query: 257 H---RLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVA 313
R A L+ + E S QG+SN+ WAL+++ + + +D ++ A
Sbjct: 191 PSLDRAALLGDERFQTLMNMVDRMAGEMSMQGVSNVLWALARLEYPVQE-TLLDALSARA 249
Query: 314 LTKVGEFNSQNVANVAGAFASMQHSA-PDLFSELAKRASDIVHTFQEQELAQVLWAFA-- 370
T+ +N++ A A++ H L +A +A +V F+ ++ +LWA+A
Sbjct: 250 ATQASSAEPKNLSTTLWALAALGHKPRSKLLKAIADQALIVVDDFRAPDVVNMLWAYARW 309
Query: 371 SLYEPAD----PLLES-LDNAFKDATQFT------CCLNKALSNCNENGGVKSSGDADSE 419
S Y P P++++ LD A +T C + A+ +C + V E
Sbjct: 310 SRYLPPSDRPMPVVQAMLDQAVHTMQSYTPYQLANLCWSLAMLDCPPSPRVL-------E 362
Query: 420 GSLSSPVL---SFNRDQLGNIAWSYAVLG 445
L + L + L ++ W+Y V+G
Sbjct: 363 YILQTVALEPGKLDGTALTHVLWAYGVMG 391
>gi|397612109|gb|EJK61607.1| hypothetical protein THAOC_17877, partial [Thalassiosira oceanica]
Length = 728
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 73/246 (29%), Positives = 114/246 (46%), Gaps = 35/246 (14%)
Query: 235 LNIATALHRIAK-NMEKVSMMTTH--RLAFTRQREMSMLV-AIAMTA---LPECSAQGIS 287
L IA + ++++ N + + H R F ++ + S + +IA +A L E A+ +S
Sbjct: 491 LGIAKTISQVSRGNQQYRADDPRHVIRRLFVKESQCSPIFDSIASSAVGMLNEFEARHLS 550
Query: 288 NIAWALS------KIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
N+ ++ IGGE L+ + E A+ + FNSQ+++N+ AF +
Sbjct: 551 NLIYSFGLVERNPDIGGETLF----NVFGEAAVKILHTFNSQDISNMLWAFVKVDAKNSR 606
Query: 342 LFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPADP-LLESLDNAFKDATQFTCCLNK 399
LF E S + + +F+ QELA +LW+FA E ADP L L N A + +
Sbjct: 607 LFQETGGVISGMDLDSFKPQELANILWSFAKSGE-ADPELFRVLGNHIV-ARRLNDFQPQ 664
Query: 400 ALSNCN---ENGGV------KSSGDADSE-GSLSSPVLSFNRDQLGNIAWSYAVLGQMDR 449
LSN GV K GD + GSL+S F L NIAW++A G++
Sbjct: 665 HLSNIAWAFATAGVSHPILFKKIGDHIAGLGSLNS----FEPQALSNIAWAFASAGKLHP 720
Query: 450 IFFSDI 455
F I
Sbjct: 721 KLFKKI 726
>gi|71029704|ref|XP_764495.1| hypothetical protein [Theileria parva strain Muguga]
gi|68351449|gb|EAN32212.1| hypothetical protein TP04_0858 [Theileria parva]
Length = 234
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 39/65 (60%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+ +A E+DGP+HF NT +T LK R + G+ V+ + EW L+G+ E+ +Y+R
Sbjct: 133 RPIAIEVDGPSHFYSNTTKYTAYTKLKHRLLTRMGYKVLHVPFFEWRRLRGAREREEYMR 192
Query: 617 VILKD 621
LK+
Sbjct: 193 AKLKE 197
>gi|84997531|ref|XP_953487.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65304483|emb|CAI76862.1| hypothetical protein, conserved [Theileria annulata]
Length = 235
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 39/65 (60%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+ +A E+DGP+HF NT +T LK R + G+ V+ + EW L+G+ E+ +Y+R
Sbjct: 134 RPIAIEVDGPSHFYSNTTKYTAYTKLKHRLLTRMGYKVLHVPFFEWRRLRGAREREEYMR 193
Query: 617 VILKD 621
LK+
Sbjct: 194 AKLKE 198
>gi|302830696|ref|XP_002946914.1| hypothetical protein VOLCADRAFT_86990 [Volvox carteri f.
nagariensis]
gi|300267958|gb|EFJ52140.1| hypothetical protein VOLCADRAFT_86990 [Volvox carteri f.
nagariensis]
Length = 1130
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 96/194 (49%), Gaps = 20/194 (10%)
Query: 190 PSNRRKEIN--LNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKN 247
PS R + ++ I+ AQ+ QE LE +A + + S + ++++ + R+ K
Sbjct: 291 PSTRDRALSHFFTATIMGAQSWQE-LEALARVHS--------SSFNHVHVSALVCRLPKV 341
Query: 248 MEKVSMMTTHRLAFTR-QREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEM 306
+ V + + + F+R R++S LV I ++A + I+N+ W +SK+G +
Sbjct: 342 VNPVELSKSEKTQFSRFLRDVSDLVTIRLSAF---DPRAIANVLWGVSKLGYSPAP-PTL 397
Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASM----QHSAPDLFSELAKRASDIVHTFQEQEL 362
++ A ++ +FN+Q +AN+A A A++ P + A V + QEL
Sbjct: 398 NKFLFEAYVRMYDFNAQELANLAWALATLASLGNRPVPMWLRKYTLAAVPRVLDLKPQEL 457
Query: 363 AQVLWAFASLYEPA 376
A ++WA + L+ PA
Sbjct: 458 AHMVWALSKLFPPA 471
>gi|399218609|emb|CCF75496.1| unnamed protein product [Babesia microti strain RI]
Length = 263
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 38/63 (60%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+ +A E+DGP+HF N+ +T LK R + G+ V+ + + EW L+G+ E+ DY+R
Sbjct: 151 RPIAIEVDGPSHFYANSTNYTAYTKLKHRLLTRMGYKVLHVPYFEWRRLRGAREREDYMR 210
Query: 617 VIL 619
L
Sbjct: 211 AKL 213
>gi|294956189|ref|XP_002788845.1| hypothetical protein Pmar_PMAR004305 [Perkinsus marinus ATCC 50983]
gi|239904457|gb|EER20641.1| hypothetical protein Pmar_PMAR004305 [Perkinsus marinus ATCC 50983]
Length = 1040
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 80/145 (55%), Gaps = 12/145 (8%)
Query: 232 LSPLNIATALHRIA---KNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISN 288
L+ +N++T +HR+A +N E+ ++ A + + ++ A+ + S Q +SN
Sbjct: 578 LNSVNVSTLIHRLASLTQNQEQ------NQRALAKDARVKQVLRRAIELVSTSSCQSLSN 631
Query: 289 IAWALSKIGGELLYLSEMDR-VAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELA 347
I WA+ K+ +++ +E+ R + E A T++ F QN +N+ + + + +L +A
Sbjct: 632 ICWAIGKL--QMVEETEVVRAIVEAAKTRLHHFRPQNFSNMLYGLSRVGYCDRELMDLVA 689
Query: 348 KRASDIVHTFQEQELAQVLWAFASL 372
K ++ + TF+ QE++ +L+A+ L
Sbjct: 690 KEVANSLATFKPQEVSNLLYAYGRL 714
>gi|397618909|gb|EJK65091.1| hypothetical protein THAOC_14102, partial [Thalassiosira oceanica]
Length = 235
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 111/267 (41%), Gaps = 57/267 (21%)
Query: 357 FQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDA 416
F+ QE+A LWA A++ L + F + L NE G
Sbjct: 12 FKAQEVANFLWACATVGHTDQRLF----------SAFAPVIASKLDKLNEQG-------- 53
Query: 417 DSEGSLSSPVLSFNRDQLGNIAWSYAV--LGQMDRIFFSDIWKTISRFEEQRISEQYRED 474
L NI W+Y+V L + D +F ++ E+ E+ +
Sbjct: 54 -----------------LSNITWAYSVANLPRQD-LFNKGYVGALASNEKVFSGEELAQL 95
Query: 475 IMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVS 534
+ + ++L+ P L+ K +A ++ +++ S Q +V L +
Sbjct: 96 HQWQLWQQELESGIELQGP---------LQAKCRNAFTSREYSE---SKLQNDVVDELKA 143
Query: 535 TGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAA 590
GL E + GY +DA++ +KVA E+DGP+HF P G T+LK R +A
Sbjct: 144 AGLVLDEEVLLGSGYRIDALVEFSDGRKVAVEVDGPSHFIDRR--PAGSTILKHRQVAKM 201
Query: 591 GW-NVVSLSHQEWEELQGSFEQLDYLR 616
VVS+ + EW+EL+ S + YLR
Sbjct: 202 DHIKVVSVPYWEWDELKNSEMKQRYLR 228
>gi|294933217|ref|XP_002780656.1| hypothetical protein Pmar_PMAR001249 [Perkinsus marinus ATCC 50983]
gi|239890590|gb|EER12451.1| hypothetical protein Pmar_PMAR001249 [Perkinsus marinus ATCC 50983]
Length = 401
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 86/197 (43%), Gaps = 36/197 (18%)
Query: 199 LNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHR 258
+NK I+ ++T +E+L+VIAE + + +NI TAL+++A
Sbjct: 78 INKQILQSETLEELLDVIAEALNW---------FNIVNIGTALYKLASLALADQSQAAKS 128
Query: 259 LAFTRQ--REMSML--VAIAMTALPE--------------------C-SAQGISNIAWAL 293
AF R+ R + L +A ++ + E C S + ++NI WA+
Sbjct: 129 KAFLRKDNRYIGFLDEIANVLSYVDEPAGIESGNGSGKLVRDVKSACFSPKELANIVWAV 188
Query: 294 SKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI 353
+ IG L E+ VA + + F+S N++ FA M P+LF A D+
Sbjct: 189 THIGLPHRRLYEL--VARHIIWYIDHFDSVNLSLALWGFAKMDVCCPELFRAAASVIIDM 246
Query: 354 VHTFQEQELAQVLWAFA 370
+ F+ L WAF+
Sbjct: 247 IDAFEPHRLCNTAWAFS 263
>gi|145347161|ref|XP_001418044.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578272|gb|ABO96337.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 753
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 78/372 (20%), Positives = 151/372 (40%), Gaps = 32/372 (8%)
Query: 274 AMTALPECSAQGISNIAWALS--KIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGA 331
A+ + E SA+ +SN+ + + G ++ M V++ K+ EF + V A
Sbjct: 389 AIDKIEEASAKNLSNLLYGFGTLNLAGLGVFTHAMFCVSQ----KLEEFTPVGIFMVCSA 444
Query: 332 FASMQH-SAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLE-------- 381
AS + P + + + H F+ Q+ + L FA L Y AD +
Sbjct: 445 LASSNYDPGPQMMLQFENKLMKSAHAFESQDFTEFLRVFARLRYMLADETFDFIGVSSAK 504
Query: 382 SLDNAFKDATQFTCCLNKALSNCNE-NGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWS 440
+LD D+ + + L + C + + + + + + GS S F WS
Sbjct: 505 TLDRF--DSYRISMTLWSHATLCAQPHDALLARIEDEIRGSASQ----FKPQNFVLALWS 558
Query: 441 YAVLGQMD--RIFFSDIWKTISRFEEQRI-SEQYREDIMFASQVHLVNQCLKLEHPHLQL 497
+LG ++ R + + + + + S + ED S + L L
Sbjct: 559 LVLLGSLEDARDSVVRVLHALVKLQGGALTSSEDLEDAQLCSLYMARLTSMGKPFEELIL 618
Query: 498 ALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGL-NWIREYAVDGYTV--DAVL 554
++ + ++ A + S Q + +L G ++ E V+G + D V
Sbjct: 619 GVTDGVADECERAWLRAKAQDPTISKVQHHIGEVLREIGAQDFEVEALVEGGKIRSDIVF 678
Query: 555 VDKKVAFEIDGPTHFSRNTG---VPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQ 611
+ ++ E+DGP H+SR+ LG T+++ + + GW VV + + +W ++ E+
Sbjct: 679 PNSRIVVEVDGPHHYSRDASGRLRELGQTVMRNNLLKSWGWRVVIVPYADWGDMLTIEEK 738
Query: 612 LDYLRVILKDYI 623
YLR +L D +
Sbjct: 739 ASYLRSLLGDEV 750
>gi|429329938|gb|AFZ81697.1| RAP domain-containing protein [Babesia equi]
Length = 237
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 39/65 (60%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+ +A E+DGP+HF N+ +T LK R + G+ V+ + EW L+G+ E+ +Y+R
Sbjct: 136 RPIAIEVDGPSHFYSNSTKYTAYTKLKHRILTRMGYKVLHVPFFEWRRLRGAKEREEYMR 195
Query: 617 VILKD 621
LK+
Sbjct: 196 AKLKE 200
>gi|323447941|gb|EGB03846.1| hypothetical protein AURANDRAFT_72645 [Aureococcus anophagefferens]
Length = 5282
Score = 54.3 bits (129), Expect = 2e-04, Method: Composition-based stats.
Identities = 32/85 (37%), Positives = 44/85 (51%), Gaps = 3/85 (3%)
Query: 522 SSFQKEVARLLVSTGLNWIREYAVDG--YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGH 579
S Q+ V+++L G E +DG T DA VD +VA E DGP H+ + G
Sbjct: 3823 SRAQESVSQVLRECGFAHEMEVDLDGTGLTADAADVDARVAVEYDGPQHYLADR-TQTGR 3881
Query: 580 TMLKRRYIAAAGWNVVSLSHQEWEE 604
T K R + A GW +V +SH WE+
Sbjct: 3882 TRFKHRLVRALGWRLVVVSHYGWEQ 3906
>gi|397580099|gb|EJK51452.1| hypothetical protein THAOC_29372 [Thalassiosira oceanica]
Length = 221
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/193 (27%), Positives = 87/193 (45%), Gaps = 20/193 (10%)
Query: 430 NRDQLGNIAWSYAVLGQMDRIFFSDIW-KTISRFEEQRISEQYREDIMFASQVHLVNQCL 488
N+ L IAWSYAV + F+ ++ ++ +E +E + + + +
Sbjct: 34 NKQGLATIAWSYAVANVPRQDLFNQVFIGALAAYENVFSTEDLFQLHQWQLWQQEIGSGM 93
Query: 489 KLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DG 547
+L +L SA ++ S Q +V L + GL+ + + G
Sbjct: 94 ELPQ-----SLGGKCRNAFTSASYSE-------SKLQNDVVDELKAAGLDLDEKVLLGSG 141
Query: 548 YTVDAVL-VD--KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQEWE 603
Y VDA++ VD K VA E+DGP HF + P+G T LK R + VVS+ + EW
Sbjct: 142 YRVDALVKVDDGKSVAIEVDGPFHFIQRR--PMGSTTLKHRQVGKLDRIEVVSVPYWEWN 199
Query: 604 ELQGSFEQLDYLR 616
EL+ S + +YL
Sbjct: 200 ELKNSLTKQNYLH 212
>gi|255075859|ref|XP_002501604.1| predicted protein [Micromonas sp. RCC299]
gi|226516868|gb|ACO62862.1| predicted protein [Micromonas sp. RCC299]
Length = 953
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 80/164 (48%), Gaps = 28/164 (17%)
Query: 234 PLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWAL 293
P++ ATA+HRIA + + + R + T + L+ + L +AQG++N+AWA
Sbjct: 439 PIHTATAIHRIATHTKGDAT----RESVTSSPSFAALMDLVRANLGGMNAQGLANVAWAC 494
Query: 294 SKI----GGELL--YLSEMDR--VAEVALTKVG------EFNSQNVANVAGAFASMQHSA 339
+++ G +LL + ++R A+ TK G E Q V+N+ A S++H
Sbjct: 495 ARLDHSPGADLLDDITAGLERELTAKPPATKGGRAAKAREVKPQAVSNMVWALGSLRHRP 554
Query: 340 PD-----LFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPAD 377
D +FS +A R D F+ QEL V+ A + Y P D
Sbjct: 555 SDECLASIFSAVAPRLRD----FRAQELTNVVLGAAHMEYVPGD 594
>gi|397598419|gb|EJK57213.1| hypothetical protein THAOC_22770, partial [Thalassiosira oceanica]
Length = 998
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 48/214 (22%), Positives = 86/214 (40%), Gaps = 41/214 (19%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
Q I+N+ W+ +K G + L + L +G F QN++N A AFA+ LF
Sbjct: 794 QHIANVLWSFAKSGEVVPELFQALGNHISGLDSLGSFKPQNLSNTAWAFATAGELHTKLF 853
Query: 344 SELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
+++ + D +++F++Q L+ + WAFA+ E L + + CL+
Sbjct: 854 NKIGDHVTGLDSLNSFEQQSLSNIAWAFAAAGESNPGLFKKIGGHVAG----LMCLD--- 906
Query: 402 SNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISR 461
SFN L + W+++ G+ SD++K +
Sbjct: 907 --------------------------SFNPQNLSLLVWAFSTAGES----HSDLFKRVGD 936
Query: 462 FEEQRISEQYREDIMFASQVHLVNQCLKLEHPHL 495
RISE +R + + ++ HP L
Sbjct: 937 HIVARISEDFRPQTL--ANTAWAFATAEVSHPEL 968
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 91/207 (43%), Gaps = 26/207 (12%)
Query: 268 SMLVAIAMTA---LPECSAQGISNIAWALS------KIGGELLYLSEMDRVAEVALTKVG 318
S+ +IA +A L E A+ +SN+ ++ IG + L+ + E A+ +
Sbjct: 696 SIFDSIASSAAGMLNEFEARHLSNLIYSFGLVERNPDIGEKTLF----NVFGEAAVKILN 751
Query: 319 EFNSQNVANVAGAFASMQHSAPDLFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPAD 377
FNSQ+++N+ AF + LF E S + + +F+ Q +A VLW+FA E
Sbjct: 752 TFNSQDISNMLWAFVKVDAKNSRLFHETGGVISGMDLDSFEPQHIANVLWSFAKSGEVVP 811
Query: 378 PLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGD---------ADSEGSLSSPVLS 428
L ++L N + LSN ++G+ D L S + S
Sbjct: 812 ELFQALGNHISGLDSLGSFKPQNLSNTAW--AFATAGELHTKLFNKIGDHVTGLDS-LNS 868
Query: 429 FNRDQLGNIAWSYAVLGQMDRIFFSDI 455
F + L NIAW++A G+ + F I
Sbjct: 869 FEQQSLSNIAWAFAAAGESNPGLFKKI 895
Score = 42.0 bits (97), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 5/89 (5%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGE-FNSQNVANVAGAFASMQHSAPDL 342
Q +S + WA S G L RV + + ++ E F Q +AN A AFA+ + S P+L
Sbjct: 911 QNLSLLVWAFSTAGESHSDL--FKRVGDHIVARISEDFRPQTLANTAWAFATAEVSHPEL 968
Query: 343 FSELAKRASDI--VHTFQEQELAQVLWAF 369
F+++ + + + +F Q L+ WAF
Sbjct: 969 FNKIGGHIAGLSTLGSFDPQALSISAWAF 997
>gi|403223561|dbj|BAM41691.1| conserved hypothetical protein [Theileria orientalis strain
Shintoku]
Length = 1133
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 66/314 (21%), Positives = 124/314 (39%), Gaps = 73/314 (23%)
Query: 196 EINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMT 255
I + +D++ ++ + ++L I + + ++ +N++TA+HR+AK
Sbjct: 258 HILIQQDLLKSKNSTQILSTIGDKL---------GQMNAVNVSTAIHRLAKYSSPY---- 304
Query: 256 THRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKI------------------- 296
+R A LV++ + + QG++NI W+++K+
Sbjct: 305 -NRYAVCNHESFGKLVSLVGDHMLQFDPQGLTNIFWSITKLRITPNWISCLLEQINIHAN 363
Query: 297 -------GGELLYLSEMDRVAEVALT-----------KVGEFNSQ-NVANVAGAFASMQH 337
L +S++ R +V+L K+ +F ++ V+ A A +
Sbjct: 364 SLNANELANCLFCISKLTRADDVSLELRFKILSLVQDKITQFRRPLDLTCVSTALARLNV 423
Query: 338 SAPDLFSELAKRASDIVHTFQEQELAQVLWAFAS-------LYEPADPLLESLDNAFKDA 390
P LF ++ + + F+ QE+ V WA+AS L+ +ES NA
Sbjct: 424 RNPVLFGHISSQVLSSLEEFKIQEICGVAWAYASLGFTDRILFGKIKQFIES--NADSSN 481
Query: 391 TQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVL-----SFNRDQLGNIAWSYAVLG 445
L ALS + D D SP++ S + + IAW+Y G
Sbjct: 482 IGNIVHLAWALSKIKQ-------ADTDFFLYTISPLVRGHLQSLSCKHMTTIAWAYVNAG 534
Query: 446 QMDRIFFSDIWKTI 459
D+ F+DI T+
Sbjct: 535 IEDQDLFNDIANTL 548
>gi|302828620|ref|XP_002945877.1| hypothetical protein VOLCADRAFT_86282 [Volvox carteri f.
nagariensis]
gi|300268692|gb|EFJ52872.1| hypothetical protein VOLCADRAFT_86282 [Volvox carteri f.
nagariensis]
Length = 1644
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 59/122 (48%), Gaps = 12/122 (9%)
Query: 265 REMSMLVAIAMTALPEC---SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFN 321
++ SM+ A TALP+ +A G+SN+ WA + L + A + E N
Sbjct: 596 QDRSMISAAVQTALPQLRRFNASGLSNLLWACATAQCHCEELFD-GAAAALMALPPHEMN 654
Query: 322 SQNVANVAGAFASMQHSAPDLFSELAK--------RASDIVHTFQEQELAQVLWAFASLY 373
Q+VAN A A A +QH+ P+L + LA+ + + QELA LWAFA L
Sbjct: 655 CQDVANTAWACAKLQHNHPELMAHLARLVLAAAEAPGATGLRGANTQELANTLWAFAVLP 714
Query: 374 EP 375
P
Sbjct: 715 LP 716
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 31/130 (23%), Positives = 62/130 (47%), Gaps = 5/130 (3%)
Query: 246 KNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGG----ELL 301
+ ++ + + R A + ++ LV+ +T LP +A+ +N+ WAL +G ELL
Sbjct: 465 RQGQRTAASASPRTAQSSAALLADLVSGFLTQLPHYTARQYANVVWALGSMGSREHTELL 524
Query: 302 YLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQE 361
+ + + A+ K+ Q ++N+A A + + L+ + +H F+ QE
Sbjct: 525 HAAAVQLQAQGG-AKLFAAPPQELSNLALGLAKLGYREVSLWGAIIAAGKARLHEFKPQE 583
Query: 362 LAQVLWAFAS 371
L + WA A+
Sbjct: 584 LHNMAWAVAA 593
>gi|403223568|dbj|BAM41698.1| uncharacterized protein TOT_040000079 [Theileria orientalis strain
Shintoku]
Length = 229
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 39/65 (60%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+ +A E+DGP+HF N+ +T LK R + G+ V+ + EW L+G+ E+ +Y+R
Sbjct: 128 RPIAIEVDGPSHFYSNSTKYTAYTKLKHRLLTRMGYKVLHVPFFEWRRLRGAREREEYMR 187
Query: 617 VILKD 621
LK+
Sbjct: 188 EKLKE 192
>gi|156089343|ref|XP_001612078.1| hypothetical protein [Babesia bovis T2Bo]
gi|154799332|gb|EDO08510.1| hypothetical protein BBOV_III009540 [Babesia bovis]
Length = 1171
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/215 (21%), Positives = 88/215 (40%), Gaps = 52/215 (24%)
Query: 196 EINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMT 255
I L + I+ +++ +VL I + +T L+ +N ATALHRIA++ S
Sbjct: 317 HIVLQQSILKCKSSSQVLAAIQDKVTK---------LNAVNAATALHRIARHTTSYS--- 364
Query: 256 THRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWAL---------------------- 293
R T + L++ + QG++N+ W++
Sbjct: 365 --RYTLTGNNTFAQLLSAVEAHIATLDPQGVTNVLWSIVKLRIHPQWMDSLLVTMQKHVK 422
Query: 294 ----SKIGGELLYLSEMDRVAEVAL-----------TKVGEFNSQ-NVANVAGAFASMQH 337
S++ L +S++ ++ + KV F + ++ VA A A +
Sbjct: 423 ELGTSELASSLFAVSKLATMSTAGIDLRDMLLGTVQEKVTHFRTPLDITCVATALARLNV 482
Query: 338 SAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
P +FS+L+ ++ F Q+L + WA+ASL
Sbjct: 483 RNPVIFSQLSAAVLAVIDDFAMQQLCGIAWAYASL 517
>gi|124810335|ref|XP_001348847.1| RAP protein, putative [Plasmodium falciparum 3D7]
gi|23497748|gb|AAN37286.1| RAP protein, putative [Plasmodium falciparum 3D7]
Length = 532
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 40/65 (61%)
Query: 556 DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
D+ +A E+DGP+HF N+ +T LK R + G+NV+ +S+ +W +L+ E+ +++
Sbjct: 430 DRPIAIEVDGPSHFYANSNRYTTYTKLKHRILTKLGYNVIHISYIDWRKLRNKSEREEFI 489
Query: 616 RVILK 620
LK
Sbjct: 490 LKKLK 494
>gi|397638616|gb|EJK73140.1| hypothetical protein THAOC_05252, partial [Thalassiosira oceanica]
Length = 643
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 75/165 (45%), Gaps = 40/165 (24%)
Query: 303 LSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTF 357
L D +A A+ + EF++++++N+ +F ++++ PD LF+ + A I+HTF
Sbjct: 408 LPIFDSIARSAVDMLNEFDARHLSNLVYSFGLVEYN-PDIGGETLFNVFGEAAGKILHTF 466
Query: 358 QEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDAD 417
+ QEL+ +LWAF + DA N L +E GGV S D D
Sbjct: 467 KPQELSNMLWAFVKV----------------DAD------NSRL--FHETGGVISGMDLD 502
Query: 418 SEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRF 462
SF +L NI WS+A G+ F + I+R
Sbjct: 503 ----------SFKPQELANIIWSFAKSGESGPELFQALGNHIARL 537
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 50/93 (53%), Gaps = 6/93 (6%)
Query: 283 AQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
Q ++NI W+ +K G G L+ + + +A L + F Q+++N A AFA+ S P
Sbjct: 506 PQELANIIWSFAKSGESGPELFQALGNHIAR--LNSLDPFKPQDLSNTAWAFATAGVSHP 563
Query: 341 DLFSELAKRAS--DIVHTFQEQELAQVLWAFAS 371
+LF ++ + D +F+ Q L+ WAFA+
Sbjct: 564 ELFKKIGNHGAGQDRFDSFKPQNLSNTAWAFAT 596
Score = 43.5 bits (101), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 54/123 (43%), Gaps = 3/123 (2%)
Query: 283 AQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
Q +SN+ WA K+ + L + ++ + F Q +AN+ +FA S P+L
Sbjct: 468 PQELSNMLWAFVKVDADNSRLFH-ETGGVISGMDLDSFKPQELANIIWSFAKSGESGPEL 526
Query: 343 FSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKA 400
F L + + + F+ Q+L+ WAFA+ L + + N +F +
Sbjct: 527 FQALGNHIARLNSLDPFKPQDLSNTAWAFATAGVSHPELFKKIGNHGAGQDRFDSFKPQN 586
Query: 401 LSN 403
LSN
Sbjct: 587 LSN 589
>gi|221059023|ref|XP_002260157.1| RAP protein [Plasmodium knowlesi strain H]
gi|193810230|emb|CAQ41424.1| RAP protein, putative [Plasmodium knowlesi strain H]
Length = 424
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 44/72 (61%), Gaps = 1/72 (1%)
Query: 551 DAVLVDKK-VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSF 609
D++ D + +A E+DGP+HF N+ +T LK R + G+NV+ +S+ +W +L+
Sbjct: 316 DSIFADNRPIAIEVDGPSHFYANSNRYTTYTKLKHRILTKLGYNVIHISYIDWRKLRNKS 375
Query: 610 EQLDYLRVILKD 621
E+ +++ LK+
Sbjct: 376 EREEFILKKLKE 387
>gi|223999221|ref|XP_002289283.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974491|gb|EED92820.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 837
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 107/454 (23%), Positives = 168/454 (37%), Gaps = 112/454 (24%)
Query: 249 EKVSMMTTHRLAFTRQREMSMLVAIAMTA-----LPECSAQGISNIAWAL-------SKI 296
E +MMT TR+ + SM +T + + + IAWAL + +
Sbjct: 391 ESDAMMTFLAKEATRRIKFSMEAPPTLTGGKRNQFCKLLPRDVVQIAWALGTMESDNASV 450
Query: 297 GGELLYLSEMDRVAEVALTKVGEFN-------SQNVANVAGAFASMQHSAPD-------L 342
G L+YL +D V E + N S A++ ++ H D +
Sbjct: 451 GDALVYL--VDAVNEYWIADSNSSNERHRQIKSWKCADLVQMATALSHGRLDNQSVLTAI 508
Query: 343 FSELAKR-ASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
+ E +R S F E++ +LWA A LY L + F+ +FT + L
Sbjct: 509 YEESLERIQSSSPGKFSTSEISILLWAQARLY-----LTSKYGSVFQ---EFTGAAARTL 560
Query: 402 SNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISR 461
G D + P + + N+AWS VLG D SD+ +
Sbjct: 561 MQ-QMKGKANQHSDERLLPPATLPKMGLRSQEQANLAWSLTVLGHYD----SDVVALL-- 613
Query: 462 FEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGK--------- 512
++I+ A+ + ++LEH H +L E +A +
Sbjct: 614 -----------QNIVHAASSS-GDGVIQLEHAHQLWQSYFLLSEDCPAAVEFVPAEFSQF 661
Query: 513 -TKRFNQKVTSSFQKEVARLLVSTGLNWIR-----EYAVDGYTVDAVLVDK--------- 557
K++N + Q +S L +R EY D VD +V +
Sbjct: 662 LEKKWNIEKNRGKQSSSRHRTISQTLELMRVAHRNEYDED---VDVAIVLQEDSSWTHTA 718
Query: 558 -----------KVAFEIDGPTHFS--RNTG------------VP--LGHTMLKRRYIAAA 590
KVA E DGP HF+ +TG P LGHT+LK R +
Sbjct: 719 QKDLDNQEGRVKVAVEFDGPFHFTVMASTGKDLTMIENGVKIAPRVLGHTVLKYRLLKKK 778
Query: 591 GWNVVSLSHQEWEELQ--GSFEQLDYLRVILKDY 622
GW VV + + EW+++ S E+ YL+ LK +
Sbjct: 779 GWAVVRIPYYEWDKIPSFASMERQRYLQRALKTH 812
>gi|156089469|ref|XP_001612141.1| hypothetical protein [Babesia bovis T2Bo]
gi|154799395|gb|EDO08573.1| hypothetical protein BBOV_III010170 [Babesia bovis]
Length = 260
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 40/70 (57%), Gaps = 3/70 (4%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+K+A E DGPTHF T + ++LK + GW V+ + +QEW +L ++ L+
Sbjct: 127 RKIAIEYDGPTHFYAETTMRTAKSILKHEILENTGWQVLHIPYQEWLQLPLKRKRQHLLK 186
Query: 617 V---ILKDYI 623
V ILK+YI
Sbjct: 187 VNEEILKEYI 196
>gi|70951793|ref|XP_745109.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56525327|emb|CAH77447.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 350
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 43/72 (59%), Gaps = 1/72 (1%)
Query: 551 DAVLVDKK-VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSF 609
D + D + +A E+DGP+HF N+ +T LK R + G+NV+ +S+ +W +L+
Sbjct: 242 DFIFADNRPIAIEVDGPSHFYANSNRYTTYTKLKHRILTKLGYNVIHISYFDWRKLRNKS 301
Query: 610 EQLDYLRVILKD 621
E+ +++ LK+
Sbjct: 302 EREEFILKKLKE 313
>gi|68073089|ref|XP_678459.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56498935|emb|CAH96559.1| conserved hypothetical protein [Plasmodium berghei]
Length = 319
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 43/72 (59%), Gaps = 1/72 (1%)
Query: 551 DAVLVDKK-VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSF 609
D + D + +A E+DGP+HF N+ +T LK R + G+NV+ +S+ +W +L+
Sbjct: 211 DFIFADNRPIAIEVDGPSHFYANSNRYTTYTKLKHRILTKLGYNVIHISYFDWRKLRNKS 270
Query: 610 EQLDYLRVILKD 621
E+ +++ LK+
Sbjct: 271 EREEFILKKLKE 282
>gi|397624180|gb|EJK67299.1| hypothetical protein THAOC_11691, partial [Thalassiosira oceanica]
Length = 538
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 43/122 (35%), Positives = 63/122 (51%), Gaps = 10/122 (8%)
Query: 499 LSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV-- 555
L L+EK +A + F++ S Q +V L + GL+ E + GY VDA++
Sbjct: 28 LPQSLQEKCRNAFTSASFSE---SKLQNDVVYELRAAGLDLDEEVLLGSGYRVDALVKFS 84
Query: 556 -DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWEELQGSFEQLD 613
+KVA E+DGP+HF P G + LK R +A VVS+ + EW EL+ S +
Sbjct: 85 NGRKVAVEVDGPSHFIDRR--PTGSSTLKHRQVARLDRIEVVSVPYWEWNELKNSETKQR 142
Query: 614 YL 615
YL
Sbjct: 143 YL 144
>gi|403221392|dbj|BAM39525.1| uncharacterized protein TOT_010000980 [Theileria orientalis strain
Shintoku]
Length = 571
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 78/161 (48%), Gaps = 26/161 (16%)
Query: 477 FASQVHLVNQCLKLEHPHLQLALSSV-LEEKIASAGKTKRFNQKV---TSSFQKEVARLL 532
F SQ++L+N+ +LE L+ + + L E + + + ++ TS+ +V +L
Sbjct: 410 FISQLNLLNRSAELERHGLKRLFTQMGLREFLTGLEQVRPVFSQIDHNTSNTHVQVDSVL 469
Query: 533 VSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGV----------PLG---- 578
S + E+ + Y VD + K E+DGP H++ TG+ PLG
Sbjct: 470 KSFNYETLLEHFISPYLVDIFVPSKNAIIEVDGPYHYA--TGMNERVNAIMKRPLGRFPC 527
Query: 579 ----HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
++ LKRR ++ +G+ ++ +QEW Q + EQ+ Y+
Sbjct: 528 QYSLNSRLKRRLLSKSGYKFFNIPYQEWP--QSTNEQIYYI 566
>gi|401409740|ref|XP_003884318.1| hypothetical protein NCLIV_047190 [Neospora caninum Liverpool]
gi|325118736|emb|CBZ54287.1| hypothetical protein NCLIV_047190 [Neospora caninum Liverpool]
Length = 929
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 48/89 (53%)
Query: 527 EVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRY 586
EVA +L G+++ R ++G +D +L +KKV GP HF ++ ++ L++R
Sbjct: 738 EVAWMLQEMGISFQRRLYINGCRIDILLPEKKVVIMCAGPHHFYLDSTRRTAYSRLQQRL 797
Query: 587 IAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
+ G+ V L + EW EL+ E+ +L
Sbjct: 798 LELQGYAVCVLPYYEWSELKSPEEKQRFL 826
>gi|397564390|gb|EJK44191.1| hypothetical protein THAOC_37291, partial [Thalassiosira oceanica]
Length = 134
Score = 52.8 bits (125), Expect = 6e-04, Method: Composition-based stats.
Identities = 32/73 (43%), Positives = 43/73 (58%), Gaps = 5/73 (6%)
Query: 547 GYTVDAVLV--DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGW-NVVSLSHQEWE 603
GY VDA++ D+ VA E+DGP+HF + P G T LK R +A VVS+ + EW
Sbjct: 57 GYRVDALVKVGDRGVAIEVDGPSHFIQRR--PTGSTTLKHRQVATLECIEVVSVPYWEWN 114
Query: 604 ELQGSFEQLDYLR 616
EL+ S + YLR
Sbjct: 115 ELKNSVTKQQYLR 127
>gi|389585147|dbj|GAB67878.1| RAP protein [Plasmodium cynomolgi strain B]
Length = 378
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 44/72 (61%), Gaps = 1/72 (1%)
Query: 551 DAVLVDKK-VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSF 609
D++ D + +A E+DGP+HF N+ +T LK R + G+NV+ +S+ +W +L+
Sbjct: 270 DSIFADNRPIAIEVDGPSHFYANSNRYTTYTKLKHRILTKLGYNVIHISYIDWRKLRNKS 329
Query: 610 EQLDYLRVILKD 621
E+ +++ LK+
Sbjct: 330 EREEFILKKLKE 341
>gi|397643122|gb|EJK75666.1| hypothetical protein THAOC_02605, partial [Thalassiosira oceanica]
Length = 599
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/181 (24%), Positives = 72/181 (39%), Gaps = 39/181 (21%)
Query: 284 QGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
Q +SNIAWA + G +L+ D VA AL + F Q ++N++ AF++ S +
Sbjct: 447 QELSNIAWAFATAGESHPVLFEKIGDYVA--ALGSLNSFKPQELSNISWAFSAAGVSHAE 504
Query: 342 LFSELAKRAS--DIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNK 399
LF ++A + D + +F+ QELA + AF + P L + + + F
Sbjct: 505 LFEKIAYHIAGLDCLDSFKPQELANTVHAFCNAVRPHPALFDKIGHYIAGLCSFNL---- 560
Query: 400 ALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTI 459
F L NIAW++A G+ F I I
Sbjct: 561 -----------------------------FQPQNLSNIAWAFATAGESHPALFEKIGDYI 591
Query: 460 S 460
+
Sbjct: 592 A 592
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 49/112 (43%), Gaps = 2/112 (1%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
A+ +L Q +SNI+WA S G L E L + F Q +AN AF
Sbjct: 476 ALGSLNSFKPQELSNISWAFSAAGVSHAELFEKIAYHIAGLDCLDSFKPQELANTVHAFC 535
Query: 334 SMQHSAPDLFSELAKRASDIV--HTFQEQELAQVLWAFASLYEPADPLLESL 383
+ P LF ++ + + + FQ Q L+ + WAFA+ E L E +
Sbjct: 536 NAVRPHPALFDKIGHYIAGLCSFNLFQPQNLSNIAWAFATAGESHPALFEKI 587
Score = 40.0 bits (92), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 41/171 (23%), Positives = 72/171 (42%), Gaps = 48/171 (28%)
Query: 283 AQGISNIAWALSKIGGELLYLSEMDRVA------EVALTKVGEFNSQNVANVAGAFASMQ 336
Q ++NI W+ SK G E DR + + +F Q ++ + A+A+ +
Sbjct: 369 GQALANIVWSFSKSG-------EADREMFNHIGDHIVARSLYDFLPQEMSIIVWAYANGR 421
Query: 337 HSAPDLFSELAKRASDIV--HTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFT 394
S LF + + +V ++F+ QEL+ + WAFA+ E + P+L F+ +
Sbjct: 422 VSHHALFDRVGFHVTRLVSSYSFKPQELSNIAWAFATAGE-SHPVL------FEKIGDYV 474
Query: 395 CCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLG 445
L GSL+ SF +L NI+W+++ G
Sbjct: 475 AAL----------------------GSLN----SFKPQELSNISWAFSAAG 499
>gi|149732832|ref|XP_001501739.1| PREDICTED: FAST kinase domain-containing protein 3-like [Equus
caballus]
Length = 660
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 79/174 (45%), Gaps = 16/174 (9%)
Query: 444 LGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHP-HLQLALSSV 502
L Q+ ++F + I + ++ ++ +Y+ + C LE P QL SV
Sbjct: 490 LAQLTQLFLTSILEC-PFYKGPKLLPKYQVK-------SFLTPCCSLETPVDFQLY-KSV 540
Query: 503 LEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFE 562
+ I G F KV + + R V + E V +TVD V K+VA
Sbjct: 541 MTGLIDLLGARLYFASKVLTPY-----RYTVDVEIKLDEEGFVLPFTVDED-VHKRVALC 594
Query: 563 IDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
IDGP F N+ LG +K+R++ G++VV + + E E L+ E ++YL+
Sbjct: 595 IDGPKRFCLNSKHLLGKEAMKQRHLRLLGYHVVQIPYYEIEMLKSRLELVEYLQ 648
>gi|156099636|ref|XP_001615683.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148804557|gb|EDL45956.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 443
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 44/72 (61%), Gaps = 1/72 (1%)
Query: 551 DAVLVDKK-VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSF 609
D++ D + +A E+DGP+HF N+ +T LK R + G+NV+ +S+ +W +L+
Sbjct: 335 DSIFADNRPIAIEVDGPSHFYANSNRYTTYTKLKHRILTKLGYNVIHISYIDWRKLRNKT 394
Query: 610 EQLDYLRVILKD 621
E+ +++ LK+
Sbjct: 395 EREEFILKKLKE 406
>gi|84998036|ref|XP_953739.1| hypothetical protein [Theileria annulata]
gi|65304736|emb|CAI73061.1| hypothetical protein TA16950 [Theileria annulata]
Length = 574
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 49/205 (23%), Positives = 91/205 (44%), Gaps = 32/205 (15%)
Query: 436 NIAWSYAVLG-QMDRIFFSDIWKTISRFEEQRISEQYREDIM----FASQVHLVNQCLKL 490
N +SY+ ++D + +S I ++ S + E+I+ F SQ++L+ + + L
Sbjct: 372 NCHYSYSQFNLKLDTLIYS-----ILKYVYNIFSGENMEEIIKFPNFVSQLNLLRKSINL 426
Query: 491 EHPHLQLALS----SVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVD 546
E HL+ + S + + T N+ TS+ +V +L S + E+ V
Sbjct: 427 ERVHLKKLIEGSEISCFLDSLEHIKPTFAPNEFKTSNIHSQVDTILKSFNYETLLEHYVC 486
Query: 547 GYTVDAVLVDKKVAFEIDGPTHFSRNT-------------GVPLGHTM---LKRRYIAAA 590
Y VD + K V E+DGP H+S LG+T+ LK R + +
Sbjct: 487 PYIVDIFVPSKNVIIEVDGPYHYSTTINPRINKILKREVDNYRLGYTLNSKLKSRILTKS 546
Query: 591 GWNVVSLSHQEWEELQGSFEQLDYL 615
G+ +++ +W Q + EQ+ ++
Sbjct: 547 GFKFINIPFYQWP--QTTNEQVYFI 569
>gi|399218303|emb|CCF75190.1| unnamed protein product [Babesia microti strain RI]
Length = 472
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 51/94 (54%), Gaps = 13/94 (13%)
Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSR--------- 571
T+ FQ++V+ LL G + E + Y VD +LVD KV E++GP H++
Sbjct: 362 TTVFQQQVSNLLKEMGYDIDCEVHIYPYIVD-ILVDNKVIIEVNGPCHYTYHCSDKNDYG 420
Query: 572 --NTGVPLG-HTMLKRRYIAAAGWNVVSLSHQEW 602
N+ + L +T+LK + + G+ V+ +S+ +W
Sbjct: 421 VINSALKLNKNTILKEKLLNGCGYKVIHVSYADW 454
>gi|401404312|ref|XP_003881694.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325116107|emb|CBZ51661.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 538
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 37/63 (58%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+ +A E+DGPTHF N+ +T LK R + G+ V+ + + EW L+G E+ +Y+R
Sbjct: 201 RPIAIEVDGPTHFYANSTRYTAYTKLKHRLLTRMGYKVLHVPYFEWRRLRGQKEREEYMR 260
Query: 617 VIL 619
L
Sbjct: 261 RKL 263
>gi|428166758|gb|EKX35728.1| hypothetical protein GUITHDRAFT_118113 [Guillardia theta CCMP2712]
Length = 560
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 49/97 (50%), Gaps = 14/97 (14%)
Query: 283 AQGISNIAWALSKIG------GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA-SM 335
AQ +SNI WA +++G G LL RV+ + FN QNVAN AFA S
Sbjct: 372 AQELSNILWAHARLGLTFGEEGLLLLTRRASRVSHL-------FNGQNVANALWAFAKSG 424
Query: 336 QHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
+ P L+ +L RA + + QE + +LW+ A L
Sbjct: 425 RTPCPQLYRQLKDRALQLEEELRPQEASSMLWSLAKL 461
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 48/94 (51%), Gaps = 3/94 (3%)
Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEV---ALTKVGEFNSQNVANVAGAFASMQHS 338
SAQGI+N+ WA+ + ++E + V V A +FN Q VAN + A + +
Sbjct: 292 SAQGIANVLWAMGTLSSRTGRMAEEEMVRAVCARACEVCEQFNGQAVANSFWSLAKLGAA 351
Query: 339 APDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
L L +R ++ + + QEL+ +LWA A L
Sbjct: 352 NQQLVVGLTRRMMEVADSLKAQELSNILWAHARL 385
Score = 43.5 bits (101), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 56/118 (47%), Gaps = 18/118 (15%)
Query: 263 RQREMSMLVAIAMTALPEC---SAQGISNIAWALSKIGG--ELLYLSEMDRVAEVALTKV 317
R E M+ A+ A C + Q ++N W+L+K+G + L + R+ EVA
Sbjct: 312 RMAEEEMVRAVCARACEVCEQFNGQAVANSFWSLAKLGAANQQLVVGLTRRMMEVA---- 367
Query: 318 GEFNSQNVANVAGAFASMQHSAPDLFSE-----LAKRASDIVHTFQEQELAQVLWAFA 370
+Q ++N+ A A + + F E L +RAS + H F Q +A LWAFA
Sbjct: 368 DSLKAQELSNILWAHARLGLT----FGEEGLLLLTRRASRVSHLFNGQNVANALWAFA 421
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 59/131 (45%), Gaps = 13/131 (9%)
Query: 251 VSMMTTHRLAFTRQRE-MSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSE---M 306
SM ++ F E M LV A + +AQ +SN WA +K+G Y+ E M
Sbjct: 222 TSMWAMAKVGFDPGEEVMRTLVGHANEIVASFNAQDVSNFLWASAKLG----YVPEEATM 277
Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASM-----QHSAPDLFSELAKRASDIVHTFQEQE 361
++ G+F++Q +ANV A ++ + + ++ + RA ++ F Q
Sbjct: 278 VKLRRRTSKIAGDFSAQGIANVLWAMGTLSSRTGRMAEEEMVRAVCARACEVCEQFNGQA 337
Query: 362 LAQVLWAFASL 372
+A W+ A L
Sbjct: 338 VANSFWSLAKL 348
>gi|237839849|ref|XP_002369222.1| hypothetical protein TGME49_085840 [Toxoplasma gondii ME49]
gi|211966886|gb|EEB02082.1| hypothetical protein TGME49_085840 [Toxoplasma gondii ME49]
Length = 571
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 37/63 (58%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+ +A E+DGPTHF N+ +T LK R + G+ V+ + + EW L+G E+ +Y+R
Sbjct: 198 RPIAIEVDGPTHFYANSTRYTAYTKLKHRLLTRMGYKVLHVPYFEWRRLRGQKEREEYMR 257
Query: 617 VIL 619
L
Sbjct: 258 RKL 260
>gi|294880367|ref|XP_002768980.1| hypothetical protein Pmar_PMAR008162 [Perkinsus marinus ATCC 50983]
gi|239872053|gb|EER01698.1| hypothetical protein Pmar_PMAR008162 [Perkinsus marinus ATCC 50983]
Length = 772
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 70/281 (24%), Positives = 115/281 (40%), Gaps = 71/281 (25%)
Query: 235 LNIATALHRIAKNMEKV------------SMMTTHRLAFTRQREMSMLVAIAMTALPECS 282
++ +TALHR+A + K S+M T+ T LV A LP +
Sbjct: 68 IHTSTALHRLATAITKTGGGRPTEGATNASVMATY---VTSDARFVRLVERARVLLPGAT 124
Query: 283 AQGISNIAWALSKIGGELLYLSE--MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
+ +SNI WALSK+ Y E +D V E L + F++Q V+N AF ++ S+
Sbjct: 125 TRAVSNITWALSKLN----YTDEGILDIVTEYMLANLEAFDTQGVSNCLYAFGLLRCSSG 180
Query: 341 DLFSELAKRASDIV----HTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCC 396
D L R + + + F+ QE++ ++A A L D L S+ A+ C
Sbjct: 181 DRRRLLLDRLCEHIPPRLNEFKPQEISNCVYALARLGHRDDSFLASV------ASYIPGC 234
Query: 397 LNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSD-- 454
+N +F ++ N+A+S A+L F
Sbjct: 235 IN-----------------------------NFKAQEMSNVAYSCALLSYKSDPLFQSVA 265
Query: 455 ---IWKTISRFEEQRISEQYREDIMFA-SQVHLVNQCLKLE 491
I + +SR Q IS + ++A ++VH + L +E
Sbjct: 266 DEMIARGMSRCRSQDIS-----NTLYAFAKVHFKCEALCVE 301
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 50/180 (27%), Positives = 81/180 (45%), Gaps = 12/180 (6%)
Query: 275 MTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAE--VALTKVGEFNS-QNVANVA 329
+T L E + QGISN +AL +G E + D V +L + ++++ Q+ AN
Sbjct: 307 ITRLHEFNMQGISNTMFALGGLGYRHEAFLNAIADHVVGRLCSLDQFSQYSTPQDFANTL 366
Query: 330 GAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDNAFK 388
AFA + L +H F+ QELA V+ A+A+L Y +E ++
Sbjct: 367 VAFAKLSLRHDPLLDAFGSIMCHRLHAFKSQELASVVHAYATLGYVHTAFFIEVVNGILS 426
Query: 389 DATQFTCCLNKALSNC-NENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIA---WSYAVL 444
T C NK +S+ +E S G S ++S V + G+IA +S+ +L
Sbjct: 427 SPT--LCGYNKLVSSSYSEASPTMSIGQRSSNAFVASSVPRLRDFKPGDIALIVYSFGLL 484
>gi|145340621|ref|XP_001415420.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575643|gb|ABO93712.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 417
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 62/277 (22%), Positives = 119/277 (42%), Gaps = 54/277 (19%)
Query: 198 NLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAK----------- 246
+L D++DA + +L ++ E K +N +TALHR+A+
Sbjct: 46 DLQGDLMDASDVEFILTMVEEQEEVFNK---------VNASTALHRVARLTTQRLPGQLR 96
Query: 247 -NMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSE 305
ME+ ++ R Q MSM+ +A E S QG+SN+ WAL+++ Y ++
Sbjct: 97 PTMERSTLFGDERF----QTLMSMVDRMAG----EMSMQGVSNVLWALARLD----YPTD 144
Query: 306 MDRVAEVAL---TKVGEFNSQNVANVAGAFASMQHSA-PDLFSELAKRASDIVHTFQEQE 361
+ +A ++ +N++ A A + H L +++RA + H F+ +
Sbjct: 145 EALLEALAARAGSQAASAEPKNLSTTLWALAVLGHKPRSKLLKSISERALAVAHDFRSPD 204
Query: 362 LAQVLWAFA---SLYEPAD---PLLES-LDNAFKDATQFT------CCLNKALSNCNENG 408
+ +LWA+A P+D P++++ LD A +T + A+ +C
Sbjct: 205 VVNMLWAYARWVRYLPPSDRPTPVVQAMLDQAVSTMQSYTPYQLANLSWSLAMLDCPPAP 264
Query: 409 GVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLG 445
V +++S + L ++ W+Y V+G
Sbjct: 265 RVLEY----VLQTVASEPSKLDGTALTHVLWAYGVMG 297
>gi|221504797|gb|EEE30462.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 571
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 37/63 (58%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+ +A E+DGPTHF N+ +T LK R + G+ V+ + + EW L+G E+ +Y+R
Sbjct: 198 RPIAIEVDGPTHFYANSTRYTAYTKLKHRLLTRMGYKVLHVPYFEWRRLRGQKEREEYMR 257
Query: 617 VIL 619
L
Sbjct: 258 RKL 260
>gi|397568565|gb|EJK46207.1| hypothetical protein THAOC_35135 [Thalassiosira oceanica]
Length = 698
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 58/112 (51%), Gaps = 6/112 (5%)
Query: 273 IAMTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAG 330
I L + Q +SNIAWA + G +L+ D +A ++ FN QN++N+
Sbjct: 490 IVARRLNDFQPQHLSNIAWAFATAGVSHPILFKKIRDHIA--GQDRLNLFNPQNLSNITW 547
Query: 331 AFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLL 380
AFA+ S P++F ++ + + + +F+ Q L+ + WA++ P++ L
Sbjct: 548 AFATAGDSHPEVFKKIGDHIAGLNSLDSFKAQALSNIAWAYSVANVPSEGLF 599
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 82/175 (46%), Gaps = 32/175 (18%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQ 360
DR+A A + EF +++++N+ +F ++ + PD LF+ K A I+ TF+ Q
Sbjct: 367 FDRIASSAAVVLNEFEARHLSNLIYSFGLVELN-PDIGGETLFNVFGKTAVRILQTFKPQ 425
Query: 361 ELAQVLWAF-------ASLYEPADPLLESLD-NAFKDATQFTCCLNKALSNCNENGGVKS 412
EL+ +LWAF + L++ ++ +D ++FK Q + A
Sbjct: 426 ELSNMLWAFVKVDAKNSRLFQETGGVISGMDLDSFKPQEQSNILWSFA-----------K 474
Query: 413 SGDADSE--GSLSSPVLS-----FNRDQLGNIAWSYAVLGQMDRIFFSDIWKTIS 460
SG+A+ E L + +++ F L NIAW++A G I F I I+
Sbjct: 475 SGEANPELFRVLGNHIVARRLNDFQPQHLSNIAWAFATAGVSHPILFKKIRDHIA 529
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 70/275 (25%), Positives = 114/275 (41%), Gaps = 56/275 (20%)
Query: 287 SNIAWALSKIGGELLYLSEMDRV--AEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFS 344
SNI W+ +K G E+ RV + ++ +F Q+++N+A AFA+ S P LF
Sbjct: 466 SNILWSFAKSG---EANPELFRVLGNHIVARRLNDFQPQHLSNIAWAFATAGVSHPILFK 522
Query: 345 ELAKR--ASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALS 402
++ D ++ F Q L+ + WAFA+ + S FK LN
Sbjct: 523 KIRDHIAGQDRLNLFNPQNLSNITWAFATAGD-------SHPEVFKKIGDHIAGLNS--- 572
Query: 403 NCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRF 462
+ SF L NIAW+Y+V F++ +
Sbjct: 573 -----------------------LDSFKAQALSNIAWAYSVANVPSEGLFNECFAGACSS 609
Query: 463 EEQRISEQYREDIMFASQVHLVNQCLK--LEHPHLQLALSSVLEEKIASAGKTKRFNQKV 520
+E+ E E++ Q L Q LK +E PH L+EK +A + +++
Sbjct: 610 KEETFPE---EELRQLHQWQLWQQELKSGMELPH-------SLKEKCRNAFISSSYSE-- 657
Query: 521 TSSFQKEVARLLVSTGLNWIREYAVD-GYTVDAVL 554
S Q +V L + GL+ E ++ GY VDA++
Sbjct: 658 -SKLQNDVVDELKAIGLDLEVEVLLESGYRVDALV 691
>gi|221484602|gb|EEE22896.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 558
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 37/63 (58%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+ +A E+DGPTHF N+ +T LK R + G+ V+ + + EW L+G E+ +Y+R
Sbjct: 193 RPIAIEVDGPTHFYANSTRYTAYTKLKHRLLTRMGYKVLHVPYFEWRRLRGQKEREEYMR 252
Query: 617 VIL 619
L
Sbjct: 253 RKL 255
>gi|428180195|gb|EKX49063.1| hypothetical protein GUITHDRAFT_136245 [Guillardia theta CCMP2712]
Length = 371
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 50/185 (27%), Positives = 90/185 (48%), Gaps = 23/185 (12%)
Query: 286 ISNIAWALSKIG--GELLYLSEMDRVAE-VALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
+S++ W ++ +G E L+ ++V+E V T + FN+ ++ +A +FA + A DL
Sbjct: 74 VSSMIWGMAALGHTNERLF----EKVSEHVMSTGLEGFNAPKISIIAWSFARARFQAEDL 129
Query: 343 FSELAKRASDI-VHTFQEQELAQVLWAFASLYEPADPLLESLD--------NAFKDATQF 393
FS + + + + +F Q +A +LWAFA D LL + + F D +
Sbjct: 130 FSLIEEFVVEKGMSSFNSQNIACILWAFAVFGRMTDDLLACAEEQIWSVGFSGFSDQSFV 189
Query: 394 TCCLNKALSNCNENGGVKSSGDADSEGSLS----SPVLSFNRDQLGNIAWSYAVLGQM-D 448
L A + + G SG+ + + + + SF+ QL +AW++A LGQ D
Sbjct: 190 D--LLWAFAASDLTGTCTHSGEDTVKLAAAYLRKRSIRSFSPKQLSTMAWAFARLGQFHD 247
Query: 449 RIFFS 453
+ F+S
Sbjct: 248 QAFYS 252
>gi|209363966|ref|YP_001424481.2| hypothetical membrane associated protein [Coxiella burnetii Dugway
5J108-111]
gi|207081899|gb|ABS76574.2| hypothetical membrane associated protein [Coxiella burnetii Dugway
5J108-111]
Length = 558
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 66/158 (41%), Gaps = 18/158 (11%)
Query: 264 QREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGE---- 319
QR ++ I + + QGI+N WA + +G YL E R++ L V
Sbjct: 250 QRLSECMLVIVQRTVERFNPQGIANTLWAFATMGVRWRYLEE-QRLSSCLLVAVRHNAER 308
Query: 320 FNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQEQELAQVLWAFASL-- 372
FNSQ++AN AFA+ D L L + F QE+A LWA A++
Sbjct: 309 FNSQDIANTLWAFATTGVRWQDREMQKLSERLLAAVRHNIEQFNPQEIANTLWALATMEV 368
Query: 373 ---YEPADPLLESLDNAF-KDATQFTC--CLNKALSNC 404
Y L L + ++A+QF+ C S C
Sbjct: 369 EWQYLEDQGLSHLLTDVIDRNASQFSLENCTQITWSTC 406
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 64/147 (43%), Gaps = 10/147 (6%)
Query: 235 LNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALS 294
L + A+ +A + + M R+R + L + + + + QGI+N WAL+
Sbjct: 11 LTMLGAIFYVANTLWAFATMGVAWQYLKRERLSARLFSAIRHNVGQFNPQGIANALWALA 70
Query: 295 KIGGELLYLSEMDRVAEVALTKVGE----FNSQNVANVAGAFASM-----QHSAPDLFSE 345
+G YL E R++E L + FNSQ++AN A A+M L
Sbjct: 71 TMGMGWRYLKE-QRLSERLLVAIRHTLEGFNSQDIANTFWALATMGVRWRYLERQSLSER 129
Query: 346 LAKRASDIVHTFQEQELAQVLWAFASL 372
L V F QE+A LWA A++
Sbjct: 130 LLTAVRRNVEQFNAQEIANALWALATM 156
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 70/260 (26%), Positives = 108/260 (41%), Gaps = 44/260 (16%)
Query: 232 LSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAW 291
+P IA AL +A + RL+ +LVAI T L ++Q I+N W
Sbjct: 57 FNPQGIANALWALATMGMGWRYLKEQRLS------ERLLVAIRHT-LEGFNSQDIANTFW 109
Query: 292 ALSKIGGELLYL---SEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAK 348
AL+ +G YL S +R+ V +FN+Q +AN A A+M+ L +
Sbjct: 110 ALATMGVRWRYLERQSLSERLLTAVRRNVEQFNAQEIANALWALATMEVRWRYLEEQ--- 166
Query: 349 RASD-----IVHT---FQEQELAQVLWAFASL-YEPADPLLESLDNAFKDATQ--FTCCL 397
RAS+ I HT F Q++A LWA A++ + D ++ L A + C
Sbjct: 167 RASERLLVAIRHTIESFNSQDIANTLWALATIGVKWQDREIQRLSGRLVVAVRRNIECFN 226
Query: 398 NKALSNCNENGGVKSSG-DADSEGSLSSPVL--------SFNRDQLGNIAWSYAVLGQMD 448
++ ++N +G E LS +L FN + N W++A +G
Sbjct: 227 SQNVANTLWAFATMGAGWRYLQEQRLSECMLVIVQRTVERFNPQGIANTLWAFATMGVRW 286
Query: 449 RIFFSDIWKTISRFEEQRIS 468
R EEQR+S
Sbjct: 287 RY-----------LEEQRLS 295
>gi|219110565|ref|XP_002177034.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411569|gb|EEC51497.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 923
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 43/77 (55%), Gaps = 12/77 (15%)
Query: 558 KVAFEIDGPTHFSRN--------TGVP--LGHTMLKRRYIAAAGWNVVSLSHQEWEELQ- 606
K+A E DGP HF+R VP LGHT+LK R + GW VV + + E++++
Sbjct: 822 KLAVEFDGPNHFTRQRKPSNGSKPDVPRALGHTVLKYRLLKKQGWTVVRVPYYEFDKIPY 881
Query: 607 -GSFEQLDYLRVILKDY 622
S E+ YL+ +LK +
Sbjct: 882 WASMERQRYLQRLLKTH 898
>gi|397588981|gb|EJK54479.1| hypothetical protein THAOC_25889, partial [Thalassiosira oceanica]
Length = 178
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 54/103 (52%), Gaps = 7/103 (6%)
Query: 522 SSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPL 577
S Q +V L + G++ E + GY +DA++ + VA E+DGP+HF P
Sbjct: 74 SKLQHDVVGELRAAGMDLGEEVLLGSGYRIDALVKFSDGRNVAVEVDGPSHFIDRR--PT 131
Query: 578 GHTMLKRRYIAAAG-WNVVSLSHQEWEELQGSFEQLDYLRVIL 619
G T LK R +A VVS+ + EW EL+ S + YLRV L
Sbjct: 132 GSTTLKHRQVARVDRIEVVSVPYWEWNELKNSEMKQHYLRVKL 174
>gi|221508215|gb|EEE33802.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 783
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 46/89 (51%)
Query: 527 EVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRY 586
EVA +L G+ + R +G +D +L +KK GP HF ++ ++ L++R
Sbjct: 616 EVACMLQEMGILFQRRLYANGCRIDILLPEKKTVIMCAGPHHFYLDSTRRTAYSRLQQRL 675
Query: 587 IAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
+ G++V L + EW ELQ E+ +L
Sbjct: 676 LELQGYSVCVLPYYEWSELQNPEEKQRFL 704
>gi|221486442|gb|EEE24703.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 783
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 46/89 (51%)
Query: 527 EVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRY 586
EVA +L G+ + R +G +D +L +KK GP HF ++ ++ L++R
Sbjct: 616 EVACMLQEMGILFQRRLYANGCRIDILLPEKKTVIMCAGPHHFYLDSTRRTAYSRLQQRL 675
Query: 587 IAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
+ G++V L + EW ELQ E+ +L
Sbjct: 676 LELQGYSVCVLPYYEWSELQNPEEKQRFL 704
>gi|384247944|gb|EIE21429.1| hypothetical protein COCSUDRAFT_56649 [Coccomyxa subellipsoidea
C-169]
Length = 994
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 61/117 (52%), Gaps = 9/117 (7%)
Query: 267 MSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEV-ALTKVGEFNSQNV 325
M +L+ +A L +C+AQ +SNI L+ L + AE+ A + + F+ Q V
Sbjct: 442 MHILMDLAEERLEQCNAQDLSNILCGLAACERPDLAKPSLLASAELHACSMMTAFSPQGV 501
Query: 326 ANVAGAFASMQHSAPDLF----SELAKRASDIVHTFQEQELAQVLWAFASLYEPADP 378
+NV AFA ++ P L +E+ +RA + F +++A+VLWAFA L P
Sbjct: 502 SNVLWAFAKLEARVPTLLEAAGAEVVRRAEE----FSARDMAEVLWAFAKLGHNGSP 554
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 63/112 (56%), Gaps = 9/112 (8%)
Query: 275 MTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFAS 334
MTA S QG+SN+ WA +K+ + L E AEV + + EF+++++A V AFA
Sbjct: 493 MTAF---SPQGVSNVLWAFAKLEARVPTLLEAAG-AEV-VRRAEEFSARDMAEVLWAFAK 547
Query: 335 MQHS-APDLFSELAKRASDIVHT---FQEQELAQVLWAFASLYEPADPLLES 382
+ H+ +PD L R I+ + + ++LA ++W+ A L +PA LE+
Sbjct: 548 LGHNGSPDAVEALIARMEYILRSGGPWVLRDLASMVWSLAVLEQPAPGFLEA 599
Score = 43.9 bits (102), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 51/104 (49%), Gaps = 6/104 (5%)
Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH---S 338
+ + +S I WAL G S M + ++A ++ + N+Q+++N+ A+ + +
Sbjct: 421 TPRNLSTIVWALGSFG---YAPSRMHILMDLAEERLEQCNAQDLSNILCGLAACERPDLA 477
Query: 339 APDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLES 382
P L + A ++ F Q ++ VLWAFA L LLE+
Sbjct: 478 KPSLLASAELHACSMMTAFSPQGVSNVLWAFAKLEARVPTLLEA 521
>gi|237833853|ref|XP_002366224.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
gi|211963888|gb|EEA99083.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
Length = 783
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 46/89 (51%)
Query: 527 EVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRY 586
EVA +L G+ + R +G +D +L +KK GP HF ++ ++ L++R
Sbjct: 616 EVACMLQEMGILFQRRLYANGCRIDILLPEKKTVIMCAGPHHFYLDSTRRTAYSRLQQRL 675
Query: 587 IAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
+ G++V L + EW ELQ E+ +L
Sbjct: 676 LELQGYSVCVLPYYEWSELQNPEEKQRFL 704
>gi|399218291|emb|CCF75178.1| unnamed protein product [Babesia microti strain RI]
Length = 507
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 72/155 (46%), Gaps = 21/155 (13%)
Query: 474 DIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLV 533
D A+ ++ + L++++PHL L + E+ T R S Q ++ LV
Sbjct: 326 DTRHATMLYYSLRYLEIQYPHLINTLQPIYEQCTTLLKNTPRM-----KSIQPSKSQRLV 380
Query: 534 STGLNWIR-----EYAVDGY-----TVDAVLVDKKVAFEIDGPTHF--SRNTG--VPLGH 579
S LN R EY ++++ L +K+A E+DGP HF NT V G
Sbjct: 381 SDALNSWRIPHKFEYTTPKLVSIDISIESTLYGEKIAIEVDGPWHFLTFHNTQERVRTGP 440
Query: 580 TMLKRRYIAAAGWNVVSL--SHQEWEELQGSFEQL 612
+ K + + GWNV+SL S++ ++LQ ++
Sbjct: 441 SFFKHWLLESEGWNVISLQPSNRNLQDLQNDLQEF 475
>gi|308798919|ref|XP_003074239.1| tumor-related protein-like (ISS) [Ostreococcus tauri]
gi|116000411|emb|CAL50091.1| tumor-related protein-like (ISS) [Ostreococcus tauri]
Length = 797
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 68/136 (50%), Gaps = 9/136 (6%)
Query: 259 LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSE-MDRVAEVALTKV 317
LA + + M L + E SAQ ++ A A++K+G +Y S+ M E A +
Sbjct: 273 LAVSNHKIMQTLAKCMARKVEESSAQQMATSAHAMAKLG---VYNSQLMKAYRESAALRR 329
Query: 318 GEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVH-----TFQEQELAQVLWAFASL 372
+F +++A + +FA ++ A ++F L++ D+++ TF L VLW+FA L
Sbjct: 330 EQFQPRDIAFLTWSFAKLEVHASEMFKMLSEVICDMLYDVEFQTFTPHHLTMVLWSFAML 389
Query: 373 YEPADPLLESLDNAFK 388
E +L S+ A K
Sbjct: 390 KEDVTEILPSVTRAIK 405
Score = 40.8 bits (94), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 72/174 (41%), Gaps = 33/174 (18%)
Query: 231 PLSPLNIATALHRIAK-NMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNI 289
P P ++ A +AK N + + + R R R +S+L + +AQG+SN
Sbjct: 79 PWKPQELSNAFWGLAKVNSDAIELF---RFLGERIR-VSLLTDVGTDHRTGWTAQGVSNA 134
Query: 290 AWAL--------------SKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASM 335
AW+L S +GGEL + E+ R E ++ FN Q AN A
Sbjct: 135 AWSLGALATETRIGMFEESALGGEL--VRELARAIE---ERIELFNPQECANTLSGLAKC 189
Query: 336 QHSAPD-------LFSELAKRASDIVH--TFQEQELAQVLWAFASLYEPADPLL 380
SA + F+ KR + FQ Q ++ V+WA A L D +L
Sbjct: 190 AASASEDAPRGAKAFAGRLKRDRSWLSGGQFQCQHVSNVIWACAKLNMSDDAVL 243
>gi|68067688|ref|XP_675786.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56495167|emb|CAH98407.1| conserved hypothetical protein [Plasmodium berghei]
Length = 423
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 56/105 (53%), Gaps = 3/105 (2%)
Query: 518 QKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLVDKKVAFEIDGPTHFSRNTG-- 574
V+SS K+++ L + EY + D VDA + VA EIDGP+HF + G
Sbjct: 311 HHVSSSVHKKISADLKYLNVFHYNEYFILDSILVDAYIPHTMVAIEIDGPSHFIQRGGSI 370
Query: 575 VPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
V +T+ K+R + A G+ VVS+S E + + +++++ IL
Sbjct: 371 VYNPNTLFKKRLLRALGFVVVSISITEHTFIFSALTTINFVKRIL 415
>gi|307107871|gb|EFN56112.1| hypothetical protein CHLNCDRAFT_144712 [Chlorella variabilis]
Length = 851
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 44/80 (55%), Gaps = 13/80 (16%)
Query: 549 TVDAVLVDKKVAFEIDGPTHFSRNTG-------------VPLGHTMLKRRYIAAAGWNVV 595
VD + +++A E+DGPTHF RN G +P+G T+LKRR + GW V
Sbjct: 745 CVDIAVPSRRLAIEVDGPTHFCRNNGGGGGGSASKQHLLLPMGSTLLKRRLLQRRGWAVA 804
Query: 596 SLSHQEWEELQGSFEQLDYL 615
S+ +WE L+G+ + +L
Sbjct: 805 SVCAADWERLRGAAPKRAFL 824
>gi|389582720|dbj|GAB65457.1| RAP protein [Plasmodium cynomolgi strain B]
Length = 445
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 54/118 (45%), Gaps = 4/118 (3%)
Query: 490 LEHPHLQLALSSVL----EEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV 545
L+ H+ L L L E+ I + K N S QK++ +LL GL RE+ V
Sbjct: 308 LKQIHIVLYLLRELGGDYEQAINVIERKKIKNTLTVSKMQKQLEKLLKEMGLKADREFPV 367
Query: 546 DGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWE 603
Y +D VL K+ E++G TH+ G + LK + W V+++ + W+
Sbjct: 368 GPYVLDFVLQKKRTCIEVNGFTHYYTFGGELNAKSRLKYYILRRLNWKVLTVEYTSWK 425
>gi|84997545|ref|XP_953494.1| hypothetical protein [Theileria annulata]
gi|65304490|emb|CAI76869.1| hypothetical protein TA11170 [Theileria annulata]
Length = 1272
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 67/323 (20%), Positives = 130/323 (40%), Gaps = 72/323 (22%)
Query: 191 SNRRKEINLNKDIVDAQTAQEVLEV--IAEMITAVGKGLSPSPLSPLNIATALHRIAKNM 248
+N E LN D Q++L+ ++++++G L ++ +N++TA+HR+AK
Sbjct: 380 TNLEAETWLNMDPNHILIQQDLLKSKNTTQVLSSIGDKLKQ--MNAVNVSTAIHRLAKYT 437
Query: 249 EKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWA---------------- 292
+R L+A+ + + QG++NI W+
Sbjct: 438 NPY-----NRYMVVNHESFGKLIALVEDHILKFDPQGLTNIFWSMIKLKITPKWLDCLLE 492
Query: 293 ----------LSKIGGELLYLSEMDRVAEVALT-----------KVGEFNSQ-NVANVAG 330
LS++ L LS++ + + +L K+ +F ++ V+
Sbjct: 493 QININANSLNLSELSNCLFCLSKLTKANDSSLELRFKILSLVQDKIKQFKRPLDLTCVST 552
Query: 331 AFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDA 390
A A + P +F ++ + + F+ QE+ + W++ASL D LL F+
Sbjct: 553 ALARLNVRNPVIFGHISSQVISSLEEFKIQEICGIAWSYASL-GFTDHLL------FRKI 605
Query: 391 TQFTCCLNKALSNCNENGGVK---------SSGDADSEGSLSSPVL-----SFNRDQLGN 436
+F ++ ++ N G + D D SP++ S N Q+
Sbjct: 606 REFI----ESKADPNNIGNIVHLAWALSKIKEADPDFFLYTVSPLVRSHLSSLNCRQMTT 661
Query: 437 IAWSYAVLGQMDRIFFSDIWKTI 459
I+W+Y G D+ F+DI T+
Sbjct: 662 ISWAYVNAGVEDQDLFNDIASTL 684
>gi|221056993|ref|XP_002259634.1| RAP protein [Plasmodium knowlesi strain H]
gi|193809706|emb|CAQ40408.1| RAP protein, putative [Plasmodium knowlesi strain H]
Length = 1170
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 67/124 (54%), Gaps = 4/124 (3%)
Query: 501 SVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLVDKKV 559
S+ ++++A + + NQ V+SS K+++ L + EY + D VD + +V
Sbjct: 1041 SIWKKQLARNQRKEEKNQ-VSSSVHKKISNDLRHLNIFHHNEYFILDSLLVDVYVPSARV 1099
Query: 560 AFEIDGPTHFSRNTGVPL--GHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRV 617
A EIDGP+HF + + L +++ K+R + A G++V+S+S E + + ++L+
Sbjct: 1100 AIEIDGPSHFLQKGKLILYNPNSLFKKRLLRALGFSVISISISEHTFMFSALNTFNFLKK 1159
Query: 618 ILKD 621
L +
Sbjct: 1160 FLSN 1163
>gi|84997988|ref|XP_953715.1| hypothetical protein [Theileria annulata]
gi|65304712|emb|CAI73037.1| hypothetical protein, conserved [Theileria annulata]
Length = 450
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 45/89 (50%), Gaps = 1/89 (1%)
Query: 527 EVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRY 586
++ RLL L E + YT+D + VA E++G THF N+ T LK +
Sbjct: 319 QLGRLLDELKLKHKSELKIGPYTLDYAIPKINVAIEVNGYTHFFHNSKELNALTQLKYKI 378
Query: 587 IAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
+ GWNVV +++ W+ + +LDY+
Sbjct: 379 LKDMGWNVVGINYYNWKN-RNKQSRLDYI 406
>gi|71033831|ref|XP_766557.1| hypothetical protein [Theileria parva strain Muguga]
gi|68353514|gb|EAN34274.1| hypothetical protein TP01_1036 [Theileria parva]
Length = 572
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/184 (23%), Positives = 78/184 (42%), Gaps = 32/184 (17%)
Query: 441 YAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALS 500
+++L + IF D K I++F F SQ++L+ + + LE HL+ +S
Sbjct: 387 HSILKYVYGIFSGDDMKEITKFPN------------FVSQLNLLRKSMILERIHLKGLIS 434
Query: 501 ----SVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVD 556
S + + T N+ TS+ +V +L S + E+ V Y VD +
Sbjct: 435 GTEISSFLDSLEHIKPTFAPNEFKTSNIHSQVDTILKSFNYVTLLEHYVCPYIVDIFVPS 494
Query: 557 KKVAFEIDGPTHFSRNT-------------GVPLGHTM---LKRRYIAAAGWNVVSLSHQ 600
K E+DGP H+S LG+T+ LK + + +G+ +++
Sbjct: 495 KNAVIEVDGPYHYSTTLNPRINKILKREVENYQLGYTLNSKLKSKLLTKSGFKFINIPFY 554
Query: 601 EWEE 604
+W E
Sbjct: 555 QWPE 558
>gi|397598840|gb|EJK57295.1| hypothetical protein THAOC_22677, partial [Thalassiosira oceanica]
Length = 98
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 42/74 (56%), Gaps = 6/74 (8%)
Query: 547 GYTVDAVLV---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQEW 602
GY +DA + ++KVA E+DGP+HF P G T+LK R + VVS+ + EW
Sbjct: 18 GYRIDAFVKISDERKVAVEVDGPSHFIDRR--PTGSTILKHRQVVPLDRIEVVSVPYWEW 75
Query: 603 EELQGSFEQLDYLR 616
+EL S + YLR
Sbjct: 76 DELMSSETKQHYLR 89
>gi|424513170|emb|CCO66754.1| predicted protein [Bathycoccus prasinos]
Length = 1295
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 53/106 (50%), Gaps = 4/106 (3%)
Query: 522 SSFQKEVARLLVSTGLNWIREYAVDG--YTVDAVLVDKKVAFEIDGPTHFSRN-TGVPLG 578
S F +EV+ L G+ E+ +G Y++D L +K+ E DGPTH+S N V +G
Sbjct: 977 SGFHQEVSSTLSEMGVPHELEFLTEGGLYSLDIALKGRKICIEADGPTHYSINRPTVRIG 1036
Query: 579 HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIG 624
L+ + GW V+ + W+ ++ +Y+ +L ++ G
Sbjct: 1037 GDNLREAILTKQGWTVIQIPWFTWQAAPER-DRREYIANLLYEHAG 1081
>gi|154706218|ref|YP_001424375.1| hypothetical membrane associated protein [Coxiella burnetii Dugway
5J108-111]
gi|154355504|gb|ABS76966.1| hypothetical membrane associated protein [Coxiella burnetii Dugway
5J108-111]
Length = 593
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 52/186 (27%), Positives = 83/186 (44%), Gaps = 20/186 (10%)
Query: 280 ECSAQGISNIAWALSKIG--GELLYLSEM-DRVAEVALTKVGEFNSQNVANVAGAFASMQ 336
+ + QGI N WAL+ +G + L + E+ DR+ E V F +Q + N AFA++
Sbjct: 180 QLNPQGIVNTLWALATMGMRWQELEVRELSDRLLEAVRYNVSRFKAQEITNALWAFATLS 239
Query: 337 HSAPDLFSE-LAKRASDIVHTFQE----QELAQVLWAFASL---------YEPADPLLES 382
L ++ L R D VH E Q + LWA A++ E D LLE+
Sbjct: 240 VRWKKLETQGLNDRLLDAVHHNTEQLNPQGIVNTLWALATMGVRWRELEVRELTDRLLEA 299
Query: 383 LD-NAFKDATQFTCCLNKALSNCN-ENGGVKSSGDADS-EGSLSSPVLSFNRDQLGNIAW 439
+ NA + ++ AL+ + G +++ G D G++ V FN + N W
Sbjct: 300 VRYNASRFKSREIANTLWALATLSVRRGNMEAQGLRDRLLGAVHHNVERFNPQDIANALW 359
Query: 440 SYAVLG 445
A +G
Sbjct: 360 GLATMG 365
Score = 40.4 bits (93), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 58/126 (46%), Gaps = 18/126 (14%)
Query: 286 ISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
I+N WAL+ + G + DR+ V FN Q++AN A+M P+L
Sbjct: 312 IANTLWALATLSVRRGNMEAQGLRDRLLGAVHHNVERFNPQDIANALWGLATMGMKWPEL 371
Query: 343 FSE-LAKRASDIVHTFQE----QELAQVLWAFASLY--------EPADPLLESLDNAFKD 389
++ L+ R + VH E Q++A LWA A + + D LL +L N ++
Sbjct: 372 EAQGLSDRLLEAVHRNAEQLNPQQIANTLWALAMMTVSWEYLQEQRLDQLLLNLIN--QN 429
Query: 390 ATQFTC 395
A QF+
Sbjct: 430 ANQFSL 435
>gi|195996645|ref|XP_002108191.1| hypothetical protein TRIADDRAFT_52413 [Trichoplax adhaerens]
gi|190588967|gb|EDV28989.1| hypothetical protein TRIADDRAFT_52413 [Trichoplax adhaerens]
Length = 617
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 35/58 (60%), Gaps = 1/58 (1%)
Query: 559 VAFEIDGPTHFSRNTG-VPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
VA E DGP+HFS N V LG T+LK+R++ G+ +++ EW L E++ YL
Sbjct: 552 VAIEADGPSHFSCNQPYVNLGQTVLKQRHLKQMGFAFAQIAYHEWMTLNNKDEKISYL 609
Score = 40.4 bits (93), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 23/85 (27%), Positives = 39/85 (45%), Gaps = 2/85 (2%)
Query: 288 NIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELA 347
N AWA + G L L D ++ K + N Q+++N+ AFA + L +++A
Sbjct: 353 NTAWAFAT--GGFLDLVCYDNISNKLFRKADKMNEQDISNITWAFALTGYRNEKLQNKVA 410
Query: 348 KRASDIVHTFQEQELAQVLWAFASL 372
++H L+ + W FA L
Sbjct: 411 DTVIGLIHHINSSNLSTITWGFAIL 435
>gi|196000781|ref|XP_002110258.1| hypothetical protein TRIADDRAFT_54076 [Trichoplax adhaerens]
gi|190586209|gb|EDV26262.1| hypothetical protein TRIADDRAFT_54076 [Trichoplax adhaerens]
Length = 686
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 38/66 (57%)
Query: 562 EIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
E+DG THF R + G ++LK+ ++ G+NV+ + H EW + ++++YLR +
Sbjct: 619 EVDGKTHFLRKYQLYTGPSILKKNHLKKFGYNVIQIPHFEWRIIDSFSDKVEYLRRKISH 678
Query: 622 YIGGEG 627
Y G+
Sbjct: 679 YDSGDS 684
Score = 47.8 bits (112), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 66/149 (44%), Gaps = 43/149 (28%)
Query: 283 AQGISNIAWALSKIGGE---------------------------------LLYLSE--MD 307
+GI+N+ W+L+ IG + L Y + D
Sbjct: 263 GKGIANVTWSLANIGNKDDAFLQILGNAAMERIKFMNPDSLAIFAWSLVSLDYFDDKLFD 322
Query: 308 RVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLW 367
+A+ +L ++ +F++QN++N+ AFA + P LF ++A+ +H + Q +A +
Sbjct: 323 VIADESLVQMRKFSAQNLSNLLLAFAKSNYMIPKLFHDVAESTIKKLHNMEPQAMANIAL 382
Query: 368 AFA--SLYEPADPLLESLDNAFKDATQFT 394
++A S YEP +L AF D F+
Sbjct: 383 SYAKVSYYEP------NLVKAFTDKIIFS 405
>gi|428177978|gb|EKX46855.1| hypothetical protein GUITHDRAFT_70208, partial [Guillardia theta
CCMP2712]
Length = 88
Score = 49.7 bits (117), Expect = 0.005, Method: Composition-based stats.
Identities = 24/70 (34%), Positives = 39/70 (55%)
Query: 547 GYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQ 606
GY++D ++ A E+DGP HF N+ G T +K R++ G+ ++ EW ++
Sbjct: 18 GYSLDILMPSLGCALEVDGPFHFLLNSYERSGSTKMKHRHLEQIGYKFHAIPFWEWPKVG 77
Query: 607 GSFEQLDYLR 616
S E+L YLR
Sbjct: 78 PSEEKLAYLR 87
>gi|426246726|ref|XP_004017142.1| PREDICTED: FAST kinase domain-containing protein 3 [Ovis aries]
Length = 660
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
DG+ + + V K+VA IDGP F N+ LG K+R++ G+ VV + + E
Sbjct: 575 DGFVLPFTIDEDVHKRVALCIDGPKRFCLNSNHLLGKEATKQRHLRLLGYQVVQIPYYEI 634
Query: 603 EELQGSFEQLDYLR 616
E L+ E +DYL+
Sbjct: 635 EMLKSRLELVDYLQ 648
>gi|399216319|emb|CCF73007.1| unnamed protein product [Babesia microti strain RI]
Length = 527
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 57/119 (47%), Gaps = 24/119 (20%)
Query: 521 TSSFQKEV--ARLLVSTGLN-------WIREYAV-----DGYTVDAVLVDKK-------- 558
TS+FQK+V A L + LN ++ +YA D Y ++ + K
Sbjct: 408 TSNFQKQVGEAALFIYYKLNTEVKIGPFMVDYATPMSVNDMYNINNYRTNDKDINPEINT 467
Query: 559 --VAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
V E+DGP HF +N+ H+++K + G+ VV + + EW++L ++ +YL
Sbjct: 468 NGVIIEVDGPRHFYKNSHTYTCHSIVKDEILKLMGYRVVHVKYFEWDKLPNLVDKQNYL 526
>gi|71029720|ref|XP_764503.1| hypothetical protein [Theileria parva strain Muguga]
gi|68351457|gb|EAN32220.1| hypothetical protein, conserved [Theileria parva]
Length = 1135
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 67/325 (20%), Positives = 127/325 (39%), Gaps = 76/325 (23%)
Query: 191 SNRRKEINLNKDIVDAQTAQEVLEV--IAEMITAVGKGLSPSPLSPLNIATALHRIAKNM 248
+N E LN D Q++L+ ++++++G L ++ +N++TALHR+A+
Sbjct: 240 TNLEPETWLNMDPNHILIQQDLLKSKNTTQVLSSIGDKLKQ--MNAVNVSTALHRLARYT 297
Query: 249 EKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWA---------------- 292
+R L+++ + + QG++NI W+
Sbjct: 298 NPY-----NRYMVCNHESFGKLISLVEEHILKFDPQGLTNIFWSIIKLKITPKWLDCLLE 352
Query: 293 ----------LSKIGGELLYLSEMDRVAEVALT-----------KVGEFNSQ-NVANVAG 330
LS++ L LS++ + ++ +L K+ +F ++ V+
Sbjct: 353 QINIHANSLNLSELSNCLFCLSKLTKSSDSSLELRFKILSLVQDKITQFKRPLDLTCVST 412
Query: 331 AFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDA 390
A A + P +F ++ + + F+ QEL + W++ASL F D
Sbjct: 413 ALARLNVRNPVIFGHISSQVISNLEEFKIQELCGIAWSYASL-------------GFTDH 459
Query: 391 TQFTCCLNKALSNCNENG---------GVKSSGDADSEGSLS--SPVL-----SFNRDQL 434
F S ++N + +AD + L SP++ S N Q+
Sbjct: 460 LLFMKIRRFIESKADQNNIGNIIHLAWALSKIKEADPDFFLYTVSPLVRSHLASLNCRQM 519
Query: 435 GNIAWSYAVLGQMDRIFFSDIWKTI 459
IAW+Y G D F+DI T+
Sbjct: 520 TTIAWAYVNAGVEDLDLFNDIAATL 544
>gi|302839870|ref|XP_002951491.1| hypothetical protein VOLCADRAFT_92047 [Volvox carteri f. nagariensis]
gi|300263100|gb|EFJ47302.1| hypothetical protein VOLCADRAFT_92047 [Volvox carteri f. nagariensis]
Length = 2025
Score = 49.3 bits (116), Expect = 0.006, Method: Composition-based stats.
Identities = 52/212 (24%), Positives = 84/212 (39%), Gaps = 45/212 (21%)
Query: 275 MTALPECSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGA 331
+ LP+ S Q +SN L+K+G G + +D+VA ++ K+GEFN+Q ++N+ +
Sbjct: 1268 LQVLPQASHQDVSNSLLGLAKLGWSPGPYV----LDQVARGSVAKIGEFNAQELSNMMWS 1323
Query: 332 FASMQHSAPDL------------------FSELAKRASD----IVHTFQEQELAQVLWAF 369
A ++H L + +RA D F QEL+ +LW+
Sbjct: 1324 LAHVKHCNAKLQTAIFQQAGFYHRLLACWLASWYRRAHDGASAAAAHFTYQELSNLLWST 1383
Query: 370 AS---LYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDAD--------- 417
A L+EP A + +ENG SSG +
Sbjct: 1384 AKMGYLHEPLMRAAARQAARQLAAEVEEREGREEEQLEDENGRGDSSGGGEEDDLAAAEC 1443
Query: 418 ----SEGSLSSPVLSFNRDQLGNIAWSYAVLG 445
S S V S++ + N W++A LG
Sbjct: 1444 RAAASRPSARGCVRSWSSQAVSNTTWAFATLG 1475
Score = 49.3 bits (116), Expect = 0.006, Method: Composition-based stats.
Identities = 32/111 (28%), Positives = 60/111 (54%), Gaps = 10/111 (9%)
Query: 265 REMSMLVAIAMTALPECSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFN 321
R +SM + + L E QGI++ L+K+G G + +D+VA ++ K+GEFN
Sbjct: 465 RNLSMRL---LGLLAEVPPQGIASSLLGLAKLGWSPGPYV----LDQVARGSVAKIGEFN 517
Query: 322 SQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
+Q ++N + A + + P L + ++A + F Q ++ ++WA A+L
Sbjct: 518 AQALSNTMWSLARLGYYNPQLQDAMFRQALRRLSEFSPQGISNLIWAAATL 568
Score = 48.1 bits (113), Expect = 0.015, Method: Composition-based stats.
Identities = 20/37 (54%), Positives = 26/37 (70%)
Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNV 594
+VA E+DGPTHF+ NT PL T+ +RR + A GW V
Sbjct: 1026 RVAVEVDGPTHFTSNTRQPLSTTLYRRRCLEARGWVV 1062
>gi|351698645|gb|EHB01564.1| FAST kinase domain-containing protein 3 [Heterocephalus glaber]
Length = 660
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 37/62 (59%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
V+K++A IDGP F N+ LG +K+R++ G+ VV + + E E L+ E ++Y
Sbjct: 587 VNKRIALCIDGPKRFCSNSSHLLGKEAIKQRHLRLLGYQVVQVPYHEMEMLKSRLELVEY 646
Query: 615 LR 616
L+
Sbjct: 647 LQ 648
>gi|153209513|ref|ZP_01947418.1| conserved domain protein [Coxiella burnetii 'MSU Goat Q177']
gi|212218771|ref|YP_002305558.1| hypothetical membrane-associated protein [Coxiella burnetii
CbuK_Q154]
gi|120575338|gb|EAX31962.1| conserved domain protein [Coxiella burnetii 'MSU Goat Q177']
gi|212013033|gb|ACJ20413.1| hypothetical membrane-associated protein [Coxiella burnetii
CbuK_Q154]
Length = 435
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 66/301 (21%), Positives = 128/301 (42%), Gaps = 57/301 (18%)
Query: 281 CSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH 337
+ QGI+N W L+ + EL DR+ + +FNSQ++AN A A+M
Sbjct: 97 LNPQGIANTLWTLATMNVRRRELEVQGLSDRLLDAVYYNAEQFNSQDIANTLWALAAMGM 156
Query: 338 SAPDLFSE-LAKRASDIVH----TFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQ 392
+L + L+ R D VH F Q +A LWA A+ +
Sbjct: 157 RWRELEEQGLSDRLLDAVHRNAQRFSPQGIANALWALAT-----------------TGMR 199
Query: 393 FTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFF 452
+ N+ LSN N V+ S + S +++ + + + ++W Y ++DR+
Sbjct: 200 WRELENRELSNRLFN-AVQHSAERFSSQQIANTLWAL---AMMALSWGYLKEQRVDRLLL 255
Query: 453 SDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGK 512
+ I ++ ++F + ++ IM++++ + P + L +S++ K
Sbjct: 256 NAIDQSANQFSLEESTQ-----IMWSTRWFDIRPP-----PEILLKISNM---------K 296
Query: 513 TKRFNQKVTSSFQKEVARLL---VSTGLNWIREYAV-DGYTVDAVLVDKKVAFEIDGPTH 568
R +S + VA +L ++ + E+ + + + VD + K++ E+DGP H
Sbjct: 297 PPR-----SSDLHRHVASVLSAQINGEIPIENEFFIQNCFYVDICIPSKRLVIEVDGPYH 351
Query: 569 F 569
Sbjct: 352 I 352
Score = 45.4 bits (106), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 52/200 (26%), Positives = 88/200 (44%), Gaps = 33/200 (16%)
Query: 205 DAQTAQEVLEV----------IAEMITAVGKGLSPSPLSPLNIATAL-HRIAKNMEKVSM 253
D T +E+LE +A ++ A+ + L P ++A L IAKN+E+++
Sbjct: 40 DYATIREILEARRHRRFNGQSVANLLLAIAYHHTQWRLLPRSLAAQLWDAIAKNVERLNP 99
Query: 254 MTTHRLAFT------RQREMS-------MLVAIAMTALPECSAQGISNIAWALSKIGGEL 300
+T R+RE+ +L A+ A + ++Q I+N WAL+ +G
Sbjct: 100 QGIANTLWTLATMNVRRRELEVQGLSDRLLDAVYYNA-EQFNSQDIANTLWALAAMGMRW 158
Query: 301 LYLSEM---DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFS-ELAKRASDIVH- 355
L E DR+ + F+ Q +AN A A+ +L + EL+ R + V
Sbjct: 159 RELEEQGLSDRLLDAVHRNAQRFSPQGIANALWALATTGMRWRELENRELSNRLFNAVQH 218
Query: 356 ---TFQEQELAQVLWAFASL 372
F Q++A LWA A +
Sbjct: 219 SAERFSSQQIANTLWALAMM 238
>gi|83273444|ref|XP_729400.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23487122|gb|EAA20965.1| hypothetical protein [Plasmodium yoelii yoelii]
Length = 1189
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 55/105 (52%), Gaps = 3/105 (2%)
Query: 518 QKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLVDKKVAFEIDGPTHF-SRNTGV 575
V+SS K+++ L + EY + D VDA + VA EIDGP+HF R +
Sbjct: 1077 HHVSSSVHKKISTDLKYLNVFHYNEYFILDSILVDAYIPHSMVAIEIDGPSHFIQRGESI 1136
Query: 576 PLG-HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
+T+ K+R + A G+ VVS+S E + + +++++ IL
Sbjct: 1137 VYNPNTLFKKRLLRALGFVVVSISVTEHTFIFSALNTINFVKRIL 1181
>gi|70954340|ref|XP_746221.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56526762|emb|CAH76318.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 928
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 55/105 (52%), Gaps = 3/105 (2%)
Query: 518 QKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLVDKKVAFEIDGPTHF-SRNTGV 575
Q ++SS K+++ L + EY + D VDA + A EIDGP+HF R +
Sbjct: 816 QHISSSVHKKISNDLKYLNIFHYNEYFILDSILVDAYIPHAMTAIEIDGPSHFIQRGASI 875
Query: 576 PLG-HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
+T+ K+R + A G+ VVS+S + + + +++++ IL
Sbjct: 876 VYNPNTLFKKRLLRALGFVVVSISITDHTFVFSALNTINFIKKIL 920
>gi|70937099|ref|XP_739403.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56516375|emb|CAH87459.1| hypothetical protein PC302475.00.0 [Plasmodium chabaudi chabaudi]
Length = 226
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 23/97 (23%), Positives = 50/97 (51%)
Query: 519 KVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLG 578
K ++ + EV+R+L +N +R ++ D +L D + GP + N+ +
Sbjct: 112 KYSARWITEVSRILTKINVNHLRNVYINNICADIMLPDSNIIIMCLGPYSYYVNSLLTTS 171
Query: 579 HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
+ LK+ + +NV++L++ +W +L EQ+++L
Sbjct: 172 ISDLKKNILEKKKYNVITLNYHDWNKLNDYEEQINFL 208
>gi|294877932|ref|XP_002768199.1| hypothetical protein Pmar_PMAR002989 [Perkinsus marinus ATCC 50983]
gi|239870396|gb|EER00917.1| hypothetical protein Pmar_PMAR002989 [Perkinsus marinus ATCC 50983]
Length = 400
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 45/95 (47%), Gaps = 2/95 (2%)
Query: 278 LPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH 337
L EC+ +SN+ + K E L S + + E + + E +++ +A A A M
Sbjct: 66 LRECTGDDLSNLCRCICK--AEYLCPSLLTSITEECMARSSELEPADISTIAWALAKMGF 123
Query: 338 SAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
+ LF LA+ H F LA ++WAFAS+
Sbjct: 124 GSDVLFQRLARVVEVTTHLFSGAYLANLMWAFASV 158
Score = 42.7 bits (99), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 53/109 (48%), Gaps = 21/109 (19%)
Query: 286 ISNIAWALSKIG-GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFS 344
IS IAWAL+K+G G + + RV EV F+ +AN+ AFAS+ + + + +
Sbjct: 111 ISTIAWALAKMGFGSDVLFQRLARVVEVT---THLFSGAYLANLMWAFASVGYRSESMLA 167
Query: 345 ELAKRASDIVHTFQEQ-----------------ELAQVLWAFASLYEPA 376
+A+R +++ E E++ ++WA + L+ P+
Sbjct: 168 AVAERCQELMTVVLEPPGSTDVEVVDRMPLHPMEMSTLVWALSRLHAPS 216
>gi|212212260|ref|YP_002303196.1| hypothetical membrane-associated protein [Coxiella burnetii
CbuG_Q212]
gi|212010670|gb|ACJ18051.1| hypothetical membrane-associated protein [Coxiella burnetii
CbuG_Q212]
Length = 496
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 88/403 (21%), Positives = 161/403 (39%), Gaps = 66/403 (16%)
Query: 232 LSPLNIATALH-RIAKNMEKVSMMTTHRLAFT------RQREMS-------MLVAIAMTA 277
L P ++A L IAKN+E+++ +T R+RE+ +L A+ A
Sbjct: 12 LLPRSLAAQLWDAIAKNVERLNPQGIANTLWTLATMNVRRRELEVQGLSDRLLDAVYYNA 71
Query: 278 LPECSAQGISNIAWALSKIGGELLYLSEM---DRVAEVALTKVGEFNSQNVANVAGAFAS 334
+ ++Q I+N WAL+ +G L E DR+ + FN Q +AN A +
Sbjct: 72 -EQFNSQDIANTLWALAAMGMRWRELEEQGLSDRLLDAVRYDAERFNPQGIANTLWALVA 130
Query: 335 MQHSAPDL-FSELAKRASDIVHT----FQEQELAQVLWAFASL----YEPADPLLES--L 383
M + +L EL R D V + F Q++ LWA A++ E D L L
Sbjct: 131 MGMTWGELEAQELNDRLLDAVGSNAPRFNSQDITNTLWALATMGMKWRELGDQRLRDRLL 190
Query: 384 DNAFKDATQFTC--CLNKALSNCNENGGVKSSGDADSE----GSLSSPVLSFNRDQLGNI 437
++A +F N + + GD G++ FN + N+
Sbjct: 191 GAVRRNAERFKPQGIANALWALATMGMKWRELGDQRLRDRLLGAVRRNAERFNPQGIANV 250
Query: 438 AWSYAVL----GQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHP 493
W+ A + G+++ ++ R+ +R S Q + ++A + ++ E
Sbjct: 251 LWALATMGMRWGELEAQRLNNCLLAAVRYNAERFSSQQIANTLWALAMMALSWGYLKEQR 310
Query: 494 HLQLALSSV--------LEEKIASAGKTKRFNQKV---------------TSSFQKEVAR 530
+L L+++ LEE T+ F+ + +S + VA
Sbjct: 311 VDRLLLNAIDQSANQFSLEESTQIMWSTRWFDIRPPPEILLKISNMKPPRSSDLHRHVAS 370
Query: 531 LL---VSTGLNWIREYAV-DGYTVDAVLVDKKVAFEIDGPTHF 569
+L ++ + E+ + + + VD + K++ E+DGP H
Sbjct: 371 VLSAQINGEIPIENEFFIQNCFYVDICIPSKRLVIEVDGPYHI 413
>gi|440897891|gb|ELR49494.1| FAST kinase domain-containing protein 3 [Bos grunniens mutus]
Length = 660
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
DG+ + + V K+VA IDGP F N+ LG K+R++ G+ VV + + E
Sbjct: 575 DGFVLPFTIDEDVHKRVALCIDGPKRFCLNSKHLLGKEATKQRHLRLLGYQVVQIPYYEI 634
Query: 603 EELQGSFEQLDYLR 616
E L+ E +DYL+
Sbjct: 635 EMLKSRLELVDYLQ 648
>gi|397646275|gb|EJK77204.1| hypothetical protein THAOC_00981, partial [Thalassiosira oceanica]
Length = 445
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 43/174 (24%), Positives = 83/174 (47%), Gaps = 32/174 (18%)
Query: 302 YLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHT 356
++ D +A + + +F++++++N+ +F ++ + PD LF+ K A I+HT
Sbjct: 197 FMPIFDSIASSTVVMLDKFDARHLSNLIYSFGLVERN-PDIEGETLFNVFGKTAVKILHT 255
Query: 357 FQEQELAQVLWAF-------ASLYEPADPLLESLD-NAFKDATQFTCCLNKALSNCNENG 408
F+ QEL+ +LWAF + L++ ++ +D ++FK F L
Sbjct: 256 FKPQELSNMLWAFVKVDAKNSRLFQETGGVISGMDLDSFK-PQDFAIIL----------W 304
Query: 409 GVKSSGDADSE--GSLSSPVLS-----FNRDQLGNIAWSYAVLGQMDRIFFSDI 455
SG ADS+ +L + +++ F + NI W+YA G+ F +I
Sbjct: 305 SFAKSGKADSKLFQALGNHIVTRSLNDFWPQDVSNIVWAYATAGESHPELFKNI 358
Score = 47.4 bits (111), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 54/103 (52%), Gaps = 7/103 (6%)
Query: 273 IAMTALPECSAQGISNIAWALSKIGGEL--LYLSEMDRVAEVALTKVGEFNSQNVANVAG 330
I +L + Q +SNI WA + G L+ + + AE+ + FN QN++ +A
Sbjct: 324 IVTRSLNDFWPQDVSNIVWAYATAGESHPELFKNIGNHAAELDMD---SFNPQNLSIIAW 380
Query: 331 AFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFAS 371
AFAS P+LF ++ R + + + F+ Q+L+ W+FA+
Sbjct: 381 AFASAGVPHPELFRKMGARVAGLKSLDLFKPQDLSNTAWSFAT 423
Score = 42.7 bits (99), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 52/103 (50%), Gaps = 6/103 (5%)
Query: 284 QGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
Q + I W+ +K G L+ + + + +L +F Q+V+N+ A+A+ S P+
Sbjct: 297 QDFAIILWSFAKSGKADSKLFQALGNHIVTRSLN---DFWPQDVSNIVWAYATAGESHPE 353
Query: 342 LFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPADPLLESL 383
LF + A+++ + +F Q L+ + WAFAS P L +
Sbjct: 354 LFKNIGNHAAELDMDSFNPQNLSIIAWAFASAGVPHPELFRKM 396
>gi|262205509|ref|NP_001019699.2| FAST kinase domain-containing protein 3 [Bos taurus]
gi|145558912|sp|Q58CX2.2|FAKD3_BOVIN RecName: Full=FAST kinase domain-containing protein 3
gi|296475672|tpg|DAA17787.1| TPA: FAST kinase domain-containing protein 3 [Bos taurus]
Length = 660
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
DG+ + + V K+VA IDGP F N+ LG K+R++ G+ VV + + E
Sbjct: 575 DGFVLPFTIDEDVHKRVALCIDGPKRFCLNSKHLLGKEATKQRHLRLLGYQVVQIPYYEI 634
Query: 603 EELQGSFEQLDYLR 616
E L+ E +DYL+
Sbjct: 635 EMLKSRLELVDYLQ 648
>gi|348511573|ref|XP_003443318.1| PREDICTED: FAST kinase domain-containing protein 3-like
[Oreochromis niloticus]
Length = 618
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 82/350 (23%), Positives = 136/350 (38%), Gaps = 88/350 (25%)
Query: 339 APDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLN 398
A L L+ RAS + F++ E+ +VL A +L + L+ +++ L
Sbjct: 254 AVSLVLRLSHRASRVFKAFRDDEIMKVLSALMTLGQHDGELVAAMEKH----------LT 303
Query: 399 KALSNCNEN--GGV-------KSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDR 449
L C+ G + + + E + V + IA +G+++
Sbjct: 304 GRLEKCDPELIGAIMEYCLQMRCRSEPLFEAVAENFVRHAEKHTTLQIAKQIVAMGRLNY 363
Query: 450 I--FFSDIWKTISRFEEQRISE-QYRE--DIMFASQVHL----VNQCLKLEHPHL----- 495
+ S ++K + +R S+ Q R D+M A +HL +N K+ PH
Sbjct: 364 LPQCSSQMFKKLESILSERFSQFQPRSLVDVMHAC-IHLERFPLNYMTKVFSPHFLQRLQ 422
Query: 496 ---------------QLALSSVLE---------------EKIASAGKTKRFNQKVTSSFQ 525
QL LS+ LE ++ +SAG+ F + S
Sbjct: 423 AQGEPLDKNTLGQLTQLHLSTTLECTYYWGPRLPFFLHVKRFSSAGQA--FETPMESLLY 480
Query: 526 KEV----ARLLVSTGLNWIREYAVDGYTVDA-VLVD---------------KKVAFEIDG 565
K+V A LL G + GYT+D + +D K+V +DG
Sbjct: 481 KQVKGPLAHLL--GGTLYSTRMIHGGYTIDVEICLDEGGFVLPPSQWDHTYKRVVLCLDG 538
Query: 566 PTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
P F NT LG + KRR++ G +V + + E+E+LQ EQ+ YL
Sbjct: 539 PNRFCTNTRHLLGKEVTKRRHLQRMGMELVEIPYFEFEKLQTEEEQIQYL 588
>gi|148705059|gb|EDL37006.1| FAST kinase domains 3, isoform CRA_a [Mus musculus]
Length = 661
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
DG+ + + + K+VA IDGP F ++ LG K+R++ G+ VV L + E
Sbjct: 576 DGFVLPCTVDEDIHKRVALCIDGPQRFCLDSKHLLGKEATKQRHLRLLGYQVVQLPYHEL 635
Query: 603 EELQGSFEQLDYLR 616
E L E +DYL+
Sbjct: 636 ELLTSRLELVDYLQ 649
>gi|338175904|ref|YP_004652714.1| hypothetical protein PUV_19100 [Parachlamydia acanthamoebae UV-7]
gi|336480262|emb|CCB86860.1| putative uncharacterized protein [Parachlamydia acanthamoebae UV-7]
Length = 565
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 66/312 (21%), Positives = 119/312 (38%), Gaps = 51/312 (16%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQV 365
++++ EV L N+ + +A ++ + DL EL K ++ L +
Sbjct: 263 LEQLKEVFLKNATSLNADEIVRIAWSYHFLNCIHEDLLRELCKHLEPKINDLTNDGLINI 322
Query: 366 LWAFASL-------YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADS 418
F SL +E D + QFT SN +E G S
Sbjct: 323 TKIFISLNFIDKELLWKLLKKIE--DKVVDNPHQFTP------SNLSELTHAMLMGYCQS 374
Query: 419 EGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFA 478
E ++ +L +D IF D SR++ ++S
Sbjct: 375 ED------------------YTTFILNMLDVIFQIDP----SRWKAHQLS---------- 402
Query: 479 SQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLN 538
Q+H ++ L+ + A+ L+E+I K + + ++S F VA+ + +
Sbjct: 403 -QIHTIHLIYTLKSKQ-EKAMPIPLQERIDIHLKGLKDKKPISSDFHLSVAKCIENILGK 460
Query: 539 WIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLS 598
+E+ ++ Y VD +K+ E+DGP HF + G L +K + GW V+ +S
Sbjct: 461 SEKEFQIETYFVDIAYPARKLVIEVDGPAHFDQ-FGNYLQKNAVKEFVLKLLGWQVIRIS 519
Query: 599 HQEWEELQGSFE 610
+EW + F
Sbjct: 520 -KEWPGYEHIFH 530
>gi|397572795|gb|EJK48407.1| hypothetical protein THAOC_32795, partial [Thalassiosira oceanica]
Length = 163
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 79/156 (50%), Gaps = 19/156 (12%)
Query: 304 SEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQ 358
S DR+A + + EF++++++N+ +F ++ + PD LF+ + A I+HTF+
Sbjct: 13 SIFDRIASSTVGILNEFDARHLSNLIYSFGLVERN-PDIGGDTLFNVFGEAAVKILHTFK 71
Query: 359 EQELAQVLWAF-------ASLYEPADPLLESLD-NAFKDATQFTCCLNKALSNCNENGGV 410
QEL+ +LWAF + L++ ++ +D +FK + A S+ +
Sbjct: 72 PQELSNMLWAFVKVDADNSRLFQETGRVISGMDLGSFKPQDFSNVLWSSAKSDEADPVLF 131
Query: 411 KSSGDADSE-GSLSSPVLSFNRDQLGNIAWSYAVLG 445
++ G+ + GSL SF +L N AW++A G
Sbjct: 132 QAIGNHIANMGSLD----SFKPQELSNTAWAFATAG 163
>gi|255070911|ref|XP_002507537.1| hypothetical protein MICPUN_55039 [Micromonas sp. RCC299]
gi|226522812|gb|ACO68795.1| hypothetical protein MICPUN_55039 [Micromonas sp. RCC299]
Length = 593
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 61/259 (23%), Positives = 103/259 (39%), Gaps = 47/259 (18%)
Query: 198 NLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTH 257
++ D++DA + ++VL ++ + K +N +TALHRIA+ T
Sbjct: 191 DIQGDLMDAASVEDVLLLVEKQGEIFNK---------VNTSTALHRIARIASTAPYATAG 241
Query: 258 R--------LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSE-MDR 308
L TR L+ +A E S +SN WAL+++ ++ ++ +D
Sbjct: 242 ANQQSPDAVLRITRDERFHHLLQLATALSKEMSIVSVSNTLWALARLRCDIHEMNTLLDD 301
Query: 309 VAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-LFSELAKRASDIVHTFQEQELAQVLW 367
+A A +++A V A A + H L +A R D F+ ++ +LW
Sbjct: 302 LAGRAAATAHNAQPKHLATVIWALAVLGHEPRSRLLRAVAMRVMDTAGDFRAPDVVNMLW 361
Query: 368 AFASLYEPADPLLESLDN--AFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSP 425
A+A A L S D KD + C+ AL+N +
Sbjct: 362 AYARWTRLAP--LNSPDGLPGAKDVVKELSCV--ALANLTD------------------- 398
Query: 426 VLSFNRDQLGNIAWSYAVL 444
F Q N++WS A+L
Sbjct: 399 ---FTPYQCANLSWSLAML 414
>gi|128485706|ref|NP_081399.3| FAST kinase domain-containing protein 3 [Mus musculus]
gi|145558913|sp|Q8BSN9.2|FAKD3_MOUSE RecName: Full=FAST kinase domain-containing protein 3
gi|26328905|dbj|BAC28191.1| unnamed protein product [Mus musculus]
Length = 661
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
DG+ + + + K+VA IDGP F ++ LG K+R++ G+ VV L + E
Sbjct: 576 DGFVLPCTVDEDIHKRVALCIDGPQRFCLDSKHLLGKEATKQRHLRLLGYQVVQLPYHEL 635
Query: 603 EELQGSFEQLDYLR 616
E L E +DYL+
Sbjct: 636 ELLTSRLELVDYLQ 649
>gi|282889813|ref|ZP_06298352.1| hypothetical protein pah_c004o212 [Parachlamydia acanthamoebae str.
Hall's coccus]
gi|281500387|gb|EFB42667.1| hypothetical protein pah_c004o212 [Parachlamydia acanthamoebae str.
Hall's coccus]
Length = 546
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 66/310 (21%), Positives = 119/310 (38%), Gaps = 47/310 (15%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQV 365
++++ EV L N+ + +A ++ + DL EL K ++ L +
Sbjct: 262 LEQLKEVFLKNATSLNADEIVRIAWSYHFLNCIHEDLLRELCKHLEPKINDLTNDGLINI 321
Query: 366 LWAFASL-----YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEG 420
F SL L + D + QFT SN +E G SE
Sbjct: 322 TKIFISLNFIDKKLLWKLLKKIEDKVVDNPHQFTP------SNLSELTHAMLMGYCQSED 375
Query: 421 SLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQ 480
++ +L +D IF D SR++ ++S Q
Sbjct: 376 ------------------YTTFILNMLDVIFQIDP----SRWKAHQLS-----------Q 402
Query: 481 VHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWI 540
+H ++ L+ + A+ L+E+I K + + ++S F VA+ + +
Sbjct: 403 IHTIHLIYTLKSKQ-EKAMPIPLQERIDIHLKGLKDKKPISSDFHLSVAKCIENILGKSE 461
Query: 541 REYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQ 600
+E+ ++ Y VD +K+ E+DGP HF + G L +K + GW V+ +S +
Sbjct: 462 KEFQIETYFVDIAYPARKLVIEVDGPAHFDQ-FGNYLQKNAVKEFVLKLLGWQVIRIS-K 519
Query: 601 EWEELQGSFE 610
EW + F
Sbjct: 520 EWPGYEHIFH 529
>gi|395833180|ref|XP_003789620.1| PREDICTED: LOW QUALITY PROTEIN: FAST kinase domain-containing
protein 3 [Otolemur garnettii]
Length = 679
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 68/154 (44%), Gaps = 33/154 (21%)
Query: 496 QLALSSVLE------EKIASAGKTKRFNQKVTS-------SFQKEV---------ARLLV 533
QL L+S+LE K+ S + K F S K+V ARL
Sbjct: 492 QLYLTSILECPFYKGTKLLSKFQVKSFLTPCCSLETPMDFHLYKQVMFGLIDLLGARLYF 551
Query: 534 STGLNWIREYAVD--------GYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTML 582
++ + Y +D G+ + + + V K+VA IDGP F N+ LG +
Sbjct: 552 ASKVLTPYCYTIDVEIKLDEEGFVLPSTVDEDVYKRVALCIDGPKRFCPNSNHLLGKEAI 611
Query: 583 KRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
K+R++ G+ VV + + E E L+ E ++YL+
Sbjct: 612 KQRHLQLIGYEVVQIPYHEVEMLKSRLELVEYLQ 645
>gi|189183794|ref|YP_001937579.1| repeat-containing protein A_04 [Orientia tsutsugamushi str. Ikeda]
gi|189180565|dbj|BAG40345.1| repeat-containing protein A_04 [Orientia tsutsugamushi str. Ikeda]
Length = 554
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 79/391 (20%), Positives = 142/391 (36%), Gaps = 97/391 (24%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEV----ALTKVGEFNSQNVANVA 329
A + + QG++N WA + L + D+ + A + FN+Q +AN
Sbjct: 126 ATKTIDNFNTQGLANSIWAFGR-----LEIHPSDQFIQAWIHHATKTIDNFNTQGLANSI 180
Query: 330 GAFASMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDNAF 387
A ++ H + A+ + F Q LA +WAF L P+D +++
Sbjct: 181 LALGQLEIHPSDQFIQAWIHHATKTIDNFNTQNLANSIWAFGQLEIHPSDQFIQAW---I 237
Query: 388 KDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQM 447
AT K + N FN L N W++ L
Sbjct: 238 HHAT-------KTIDN-------------------------FNTQNLANSIWAFGQLEIH 265
Query: 448 DRIFFSDIW-----KTISRFEEQRISEQ----YREDIMFASQVHLVNQCLKLEHPHLQLA 498
F W KTI F Q ++ + +++ S++ + Q + + +++L
Sbjct: 266 PSDQFIQAWIHHATKTIDNFSLQELANSIYGIFTLNVLCNSKIKVPQQFISAVNQNIEL- 324
Query: 499 LSSVLEEKIASAGKT------------------------KRFNQKVT----SSFQ----K 526
+E I G+ K+F K+T S+ Q K
Sbjct: 325 ----FDENIEDIGQILKAHYYFGKQGVGILTSQNRQLLEKKFKTKLTPCHTSNLQLNVLK 380
Query: 527 EVARLLVSTGLNWIREYAVDGYT--VDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKR 584
V ++L + EY + T VD + +K ++DGP+HF N P T L
Sbjct: 381 VVKKVLAQHTVK--SEYHIKQITSSVDIFIKEKNTVIQVDGPSHFDDNNA-PNFSTRLNT 437
Query: 585 RYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
+ + G+ V + + W +L+ + + +Y+
Sbjct: 438 ELLKSYGYIVHRIPYWVWNKLKTNIAKEEYI 468
>gi|156082057|ref|XP_001608521.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148801092|gb|EDL42497.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 446
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 65/144 (45%), Gaps = 3/144 (2%)
Query: 483 LVNQCLKLEHPHLQLA--LSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWI 540
L N+ LK H L L L E+ I + K N S QK++ +LL GL
Sbjct: 304 LKNEELKQTHIALYLLRELGGDCEQAIDQIERKKIKNTLTVSKMQKQLEKLLKEMGLKAD 363
Query: 541 REYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQ 600
RE+ V Y +D L K+ E++G TH+ G + LK + W V+++ +
Sbjct: 364 REFPVGPYVLDFALQKKRTCIEVNGFTHYYTFGGELNAKSRLKYFILRRLHWKVLTVEYT 423
Query: 601 EWEELQGSFEQLDYLRVILKDYIG 624
W+ + ++++YL + IG
Sbjct: 424 SWKN-KSKEDKMEYLEETVLSRIG 446
>gi|221054031|ref|XP_002261763.1| RAP protein [Plasmodium knowlesi strain H]
gi|193808223|emb|CAQ38926.1| RAP protein, putative [Plasmodium knowlesi strain H]
Length = 449
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 57/124 (45%), Gaps = 8/124 (6%)
Query: 480 QVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNW 539
QVH+V L+ Q A++ + ++KI N S QK++ +LL GL
Sbjct: 314 QVHIVLYLLRELGGDYQQAINMIEKKKIK--------NTLTVSKMQKQLEKLLKEMGLKA 365
Query: 540 IREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSH 599
RE+ + Y +D L K+ E++G TH+ G + LK + W V+++ +
Sbjct: 366 EREFPMGPYVLDFALQKKRTCIEVNGFTHYYTFGGELNAKSRLKYYILRRLNWKVLTVEY 425
Query: 600 QEWE 603
W+
Sbjct: 426 TSWK 429
>gi|124506281|ref|XP_001351738.1| RAP protein, putative [Plasmodium falciparum 3D7]
gi|23504667|emb|CAD51545.1| RAP protein, putative [Plasmodium falciparum 3D7]
Length = 1379
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 53/265 (20%), Positives = 119/265 (44%), Gaps = 36/265 (13%)
Query: 360 QELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSE 419
Q +A +LW+ + L + N F+D + C NK C G +
Sbjct: 1137 QSIANILWSLSILNVYSR-------NVFEDGL-YEC--NKRFIKC---------GKKKNT 1177
Query: 420 GSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFAS 479
+ + + ++ QL A+SY ++ + K I++ + + E Y+ DI+ +
Sbjct: 1178 TKVKNFISQLHQSQLYQAAFSYC-------LYLLNNQKHINKLLKNK--ENYKSDIIINN 1228
Query: 480 QVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQK--VTSSFQKEVARLLVSTGL 537
+ + ++ + + + ++ ++++A + +R QK ++SS K+++ L +
Sbjct: 1229 DIKKKIHAIFEKYFKVSINVLNIWKKQLA---RNQRKEQKTHISSSVHKKISNDLRRLNI 1285
Query: 538 NWIREYAV-DGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPL--GHTMLKRRYIAAAGWNV 594
EY + D VD + K+ EIDGP HF + + +T+ K+R + A G+ V
Sbjct: 1286 FHYNEYFILDSILVDIFIPHSKIVIEIDGPNHFFQKGEMIFYKSNTLFKKRLLRALGYTV 1345
Query: 595 VSLSHQEWEELQGSFEQLDYLRVIL 619
+S+ ++ + + + + + + +L
Sbjct: 1346 ISVPISDYTFMFSALDTMHFTKRLL 1370
Score = 39.7 bits (91), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 14/50 (28%), Positives = 34/50 (68%)
Query: 323 QNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
++VAN+A A + + + PD++ + K+ + ++ F+ QE++ ++W+F S+
Sbjct: 535 KHVANIAWASSVLSNKDPDIWKYIKKQFYENINNFKAQEISIIIWSFGSI 584
>gi|354487325|ref|XP_003505824.1| PREDICTED: FAST kinase domain-containing protein 3-like [Cricetulus
griseus]
gi|344245962|gb|EGW02066.1| FAST kinase domain-containing protein 3 [Cricetulus griseus]
Length = 660
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 46/177 (25%), Positives = 78/177 (44%), Gaps = 22/177 (12%)
Query: 444 LGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLAL-SSV 502
L Q+ ++F + + + + ++ ++ QY + C LE P L L L SV
Sbjct: 490 LAQVTQLFMTSVLEC-AFYKGPKLLPQYHVK-------SFLTPCCSLETP-LDLHLYKSV 540
Query: 503 LEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVL---VDKKV 559
+ I G F KV + + + V L+ DG+ + + V K+V
Sbjct: 541 VTGLIDLLGSRLYFASKVLTPYCYTID---VEIKLDE------DGFVLPFTVEEDVHKRV 591
Query: 560 AFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
A IDGP F +T LG +K+R++ G+ VV + + E E L E ++YL+
Sbjct: 592 ALCIDGPQRFCADTKHLLGKEAIKQRHLRLLGYQVVQVPYHELELLTSRLELVEYLQ 648
>gi|159471540|ref|XP_001693914.1| predicted protein [Chlamydomonas reinhardtii]
gi|158277081|gb|EDP02850.1| predicted protein [Chlamydomonas reinhardtii]
Length = 702
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 63/134 (47%), Gaps = 26/134 (19%)
Query: 260 AFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGE----LLYLSEMDRVAEVALT 315
A TRQR L E SAQ +SN AWAL+++G L + VAE +
Sbjct: 67 ALTRQR------------LAEYSAQALSNTAWALARLGAAPPPGLRGGGWLGAVAEASQP 114
Query: 316 KVGEFNSQNVANVAGAFASMQHSAP-----DLFSELAKRASDIVHTFQEQELAQVLWAFA 370
+ F++Q + N+ A A +H P LA+RA + + Q+++ V W+ A
Sbjct: 115 LLPVFHTQELCNLLWAMAVCRHRPPARWLVAALGLLAERAEGL----EPQDVSNVCWSLA 170
Query: 371 SL-YEPADPLLESL 383
+L P PLL+ L
Sbjct: 171 ALRVRPGVPLLQRL 184
>gi|397635539|gb|EJK71902.1| hypothetical protein THAOC_06615, partial [Thalassiosira oceanica]
Length = 172
Score = 48.1 bits (113), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 41/130 (31%), Positives = 64/130 (49%), Gaps = 10/130 (7%)
Query: 497 LALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV 555
+ L L K +A ++ F++ S Q +V L + G++ E + GY +DA++
Sbjct: 41 IELPESLRAKCRNAFTSQGFSE---SKLQNDVVGELRAAGVDLEEEVLLGSGYRIDALVK 97
Query: 556 ---DKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAG-WNVVSLSHQEWEELQGSFEQ 611
++VA E+DGP HF P G T+LK R + VVS+ + EW+EL S +
Sbjct: 98 VGDGREVAVEVDGPFHFIDRR--PAGSTILKHRQVTRLDRIGVVSVPYWEWDELMNSEMK 155
Query: 612 LDYLRVILKD 621
YL L D
Sbjct: 156 QHYLLAKLPD 165
>gi|428171424|gb|EKX40341.1| hypothetical protein GUITHDRAFT_154162 [Guillardia theta CCMP2712]
Length = 102
Score = 47.8 bits (112), Expect = 0.017, Method: Composition-based stats.
Identities = 28/77 (36%), Positives = 45/77 (58%), Gaps = 8/77 (10%)
Query: 547 GYTVDAVL-----VDKK--VAFEIDGPTHFSRNTGVPL-GHTMLKRRYIAAAGWNVVSLS 598
GY++D V+ VD++ +A E+DGP H+ R L G T +K R++ GW VV++
Sbjct: 15 GYSIDIVIRSGEGVDEEHPIAVEVDGPGHYMRPGLRELVGGTKMKTRHLCRLGWKVVAIP 74
Query: 599 HQEWEELQGSFEQLDYL 615
+ EW E + + E+ YL
Sbjct: 75 YWEWNEARDAGEEERYL 91
>gi|403282217|ref|XP_003932552.1| PREDICTED: FAST kinase domain-containing protein 3 [Saimiri
boliviensis boliviensis]
Length = 659
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 35/62 (56%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
V K++A IDGP F N+ LG +K+R++ G+ VV + + E E L E ++Y
Sbjct: 587 VHKRIALCIDGPQRFCSNSKHLLGKEAIKQRHLRLLGYQVVQMPYHEMEMLTTRLEVVEY 646
Query: 615 LR 616
L+
Sbjct: 647 LQ 648
>gi|148284481|ref|YP_001248571.1| RNA-binding protein [Orientia tsutsugamushi str. Boryong]
gi|146739920|emb|CAM79915.1| putative RNA-binding protein [Orientia tsutsugamushi str. Boryong]
Length = 540
Score = 47.8 bits (112), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 81/393 (20%), Positives = 161/393 (40%), Gaps = 63/393 (16%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEV--ALTKVGEFNSQNVANVAGA 331
A + + QG++N WA ++G ++ S+ A + A + FN+Q +AN A
Sbjct: 73 ATKTIDNFNTQGLANSIWAFGRLG---IHPSDQFIKAWIHHATKTIDNFNTQGLANSIWA 129
Query: 332 FASMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLES-LDNAFK 388
++ H + A+ + F Q LA + AF L P+D +++ + +A K
Sbjct: 130 LGRLEIHPSDQFIKAWIHHATKTIDNFNTQNLANSVLAFGRLEIHPSDQFIKAWIHHATK 189
Query: 389 DATQFTCCLNKALSNCNENGGVKSSGDADSE-----GSLSSPVLSFNRDQLGNIAWSYAV 443
F + L+N G +D + + +FN L N W+
Sbjct: 190 TIDNFNT---QNLANSVLAFGRLEIHPSDQFIKAWIHHATKTIDNFNTQGLANSIWA--- 243
Query: 444 LGQMD--------RIFFSDIWKTISRFEEQRISEQ----YREDIMFASQVHLVNQCLKLE 491
LGQ++ + + KTI F Q ++ + +++ S++ + Q +
Sbjct: 244 LGQLEIHPSDQFIKAWIHHATKTIDNFSLQELANSIYGIFTLNVLCNSKIKVPQQFISAV 303
Query: 492 HPHLQL------ALSSVLEEK----------IASAGKT---KRFNQKV----TSSFQ--- 525
+ +++L +S +L+ + S + K+F K+ TS+ Q
Sbjct: 304 NQNIELFDENNECISQILKAHYYFGKQGVGILTSQNRQLLEKKFKTKLTPCHTSNLQLNV 363
Query: 526 -KEVARLLVSTGLNWIREYAVDGYT--VDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTML 582
K V ++L + E+ + T VD + +K + ++DGP+HF N P T L
Sbjct: 364 LKVVKKVLAQHTVK--SEHYIKQITSSVDIFIKEKNIVIQVDGPSHFDDNNA-PNFSTRL 420
Query: 583 KRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
+ + G+ V + + W +L+ + + +Y+
Sbjct: 421 NTELLKSYGYIVHRIPYWVWNKLKTNIAKEEYI 453
Score = 40.8 bits (94), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 28/109 (25%), Positives = 52/109 (47%), Gaps = 7/109 (6%)
Query: 278 LPECSAQGISNIAWALSKIGGELLYLSE--MDRVAEVALTKVGEFNSQNVANVAGAFASM 335
+ E + Q ++N WAL ++ ++ S+ ++ A + FN+QN+AN AF +
Sbjct: 1 MDEFNPQELANSIWALGRLE---IHPSDQFINAWIHHATKTIDNFNTQNLANSIWAFGRL 57
Query: 336 Q-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLES 382
H + + A+ + F Q LA +WAF L P+D +++
Sbjct: 58 GIHPSDQFINAWIHHATKTIDNFNTQGLANSIWAFGRLGIHPSDQFIKA 106
>gi|432104650|gb|ELK31262.1| FAST kinase domain-containing protein 3 [Myotis davidii]
Length = 477
Score = 47.4 bits (111), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 36/62 (58%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
V K+VA IDGP F N+ LG +K+R++ G+ VV + + + E L+ E ++Y
Sbjct: 404 VHKRVALCIDGPKRFCLNSKHLLGKEAIKQRHLRLLGYQVVQIPYYDIETLKSKLELVEY 463
Query: 615 LR 616
L+
Sbjct: 464 LQ 465
>gi|428673456|gb|EKX74369.1| conserved hypothetical protein [Babesia equi]
Length = 414
Score = 47.4 bits (111), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 41/82 (50%)
Query: 522 SSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTM 581
S Q++V RLL L E + Y +D V+ KVA E++G THF + T
Sbjct: 313 SKMQEKVGRLLDELKLKHESEVMLGPYRLDFVIPKLKVAIEVNGYTHFFHRSEQLNATTE 372
Query: 582 LKRRYIAAAGWNVVSLSHQEWE 603
LK + I GW V L++ +W+
Sbjct: 373 LKYKIIEDLGWKVFGLNYYDWK 394
>gi|145340688|ref|XP_001415452.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575675|gb|ABO93744.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 528
Score = 47.0 bits (110), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 58/115 (50%), Gaps = 9/115 (7%)
Query: 280 ECSAQGISNIAWALSKIGGELLYLSE-MDRVAEVALTKVGEFNSQNVANVAGAFASMQHS 338
E SAQ I+ A A++K+G +Y S+ M + A + EF +++A +A +FA +
Sbjct: 216 ESSAQQIATSAHAMAKLG---IYNSQIMKAYKDHAAARRDEFQPRDIAFLAWSFAKLDIK 272
Query: 339 APDLFSELAKRASDIV-----HTFQEQELAQVLWAFASLYEPADPLLESLDNAFK 388
AP+LF + +++ TF L VLW+FA L E +L + A K
Sbjct: 273 APELFEMFSAVVCEMLFDVEFQTFSPHHLTMVLWSFAMLNENTQEVLPYIVRAMK 327
>gi|294953994|ref|XP_002787986.1| hypothetical protein Pmar_PMAR012092 [Perkinsus marinus ATCC 50983]
gi|239903121|gb|EER19782.1| hypothetical protein Pmar_PMAR012092 [Perkinsus marinus ATCC 50983]
Length = 768
Score = 47.0 bits (110), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 151/374 (40%), Gaps = 67/374 (17%)
Query: 280 ECSAQGISNIAWALSKIGG-ELLYLSEMDRVAEVALTKVGE-FNSQNVANVAGAFASMQH 337
E ++Q + +AWAL ++ G E + +M R+A+ + G+ F ++++ + A A +
Sbjct: 438 EMTSQHAATVAWALWRMRGMEANSVHDMARIAD----QHGDAFANRHLITLTRAAAGAKF 493
Query: 338 SAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCL 397
P L + +R V ++ + Q+LW A+ +LE + A +T
Sbjct: 494 YHPSLLDAILRRP---VSSWTADQCGQLLWVLATWGVRNPRMLEYAMQCEEIARAYT--- 547
Query: 398 NKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWK 457
+ G AD D+L I W+ A+L S +W
Sbjct: 548 --------------ADGGAD-----------LGMDKLTTIEWATALLDLPSPPRGSYLWD 582
Query: 458 -------------TISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLE 504
T+S+ QR+S+ M +Q++ L+ + L S VL
Sbjct: 583 KEREYIEGQAADLTVSQVLRQRLSDTQSFSDMGLTQLYWA-WVLRYDEGCGDLPPSWVL- 640
Query: 505 EKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVD--GYTVDAVLVDKKVAFE 562
K+ S SS QK V L +W +EY + G ++D +K+A E
Sbjct: 641 -KVRSWLSDAASYSLQPSSLQKTVHSHLPQG--DWRQEYLLPPWGISIDIASPSRKIAIE 697
Query: 563 IDGPTHFSRNTGVPLGHTM------LKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+DG F V G T+ +K+R + GW V+ +S QE+ + G +Q +L
Sbjct: 698 VDGKL-FHSVYDVATGQTLSDASATVKQRLLTRQGWRVLRVSEQEF--MAGDSDQRAHLA 754
Query: 617 VILKDYIGGEGSSN 630
L + G+G SN
Sbjct: 755 TALAR-MEGDGKSN 767
>gi|403374846|gb|EJY87385.1| hypothetical protein OXYTRI_03886 [Oxytricha trifallax]
Length = 577
Score = 47.0 bits (110), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 32/59 (54%)
Query: 536 GLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNV 594
GL ++EY V Y +D L + K+A EIDG H+S N G + R+I A G ++
Sbjct: 475 GLQILQEYEVGPYYLDIFLPELKLAIEIDGAHHYSNNKGDQFSKFKARDRFIKAHGLHI 533
>gi|429327420|gb|AFZ79180.1| hypothetical protein BEWA_020260 [Babesia equi]
Length = 593
Score = 47.0 bits (110), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 58/287 (20%), Positives = 122/287 (42%), Gaps = 19/287 (6%)
Query: 324 NVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESL 383
+++ + +FA ++ + + + + + +FQ+Q +AQ+++A L + ES+
Sbjct: 291 SISCLLHSFAKLKFRPKSDITSILSQITKSIFSFQDQNVAQIVYALGQLGLHCRDVFESI 350
Query: 384 DNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEG-----SLSSPVLS-FNRDQLGNI 437
+ ++ + A+ G G D E + S +L+ F QL ++
Sbjct: 351 STFIQSRIEYQSPQHLAMFM----QGYARVGIYDKETVKVIMNHSMELLTGFTLSQLVSL 406
Query: 438 AWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQL 497
+LG ++ F+ + ++RF R S+ + I+ +Q++ + C++LEH
Sbjct: 407 MDGALILGHFEQDKFT---RFLTRFTSIR-SDNIPDHIL--NQLNRIMYCIRLEHQSFVT 460
Query: 498 ALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDK 557
++ I F K S+ + + L T ++ + Y VD VL+
Sbjct: 461 TSEYFMQNLINQYQGA--FMIKPLQSYNQALYECLKETDSEYVLNKKIGLYNVD-VLLQN 517
Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEE 604
+ E+ TG LG LK+R+I G+ + ++ +EW E
Sbjct: 518 NTSVELLSQGSVCPLTGSALGAVQLKKRHIELLGYKHIQINRREWFE 564
>gi|165924154|ref|ZP_02219986.1| conserved domain protein [Coxiella burnetii Q321]
gi|165916403|gb|EDR35007.1| conserved domain protein [Coxiella burnetii Q321]
Length = 435
Score = 46.6 bits (109), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 65/301 (21%), Positives = 127/301 (42%), Gaps = 57/301 (18%)
Query: 281 CSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH 337
+ QGI+N W L+ + EL DR+ + +FNSQ++AN A A+M
Sbjct: 97 LNPQGIANTLWTLATMNVRRRELEVQGLSDRLLDAVYYNAEQFNSQDIANTLWALAAMGM 156
Query: 338 SAPDLFSE-LAKRASDIVH----TFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQ 392
+L + L+ R D VH F Q +A LWA A+ +
Sbjct: 157 RWRELEEQGLSDRLLDAVHRNAQRFSPQGIANALWALAT-----------------TGMR 199
Query: 393 FTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFF 452
+ + LSN N V+ S + S +++ + + + ++W Y ++DR+
Sbjct: 200 WRELETRELSNRLFN-AVQHSAERFSSQQIANTLWAL---AMMALSWGYLKEQRVDRLLL 255
Query: 453 SDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGK 512
+ I ++ ++F + ++ IM++++ + P + L +S++ K
Sbjct: 256 NAIDQSANQFSLEESTQ-----IMWSTRWFDIRPP-----PEILLKISNM---------K 296
Query: 513 TKRFNQKVTSSFQKEVARLL---VSTGLNWIREYAV-DGYTVDAVLVDKKVAFEIDGPTH 568
R +S + VA +L ++ + E+ + + + VD + K++ E+DGP H
Sbjct: 297 PPR-----SSDLHRHVASVLSAQINGEIPIENEFFIQNCFYVDICIPSKRLVIEVDGPYH 351
Query: 569 F 569
Sbjct: 352 I 352
Score = 45.4 bits (106), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 52/200 (26%), Positives = 88/200 (44%), Gaps = 33/200 (16%)
Query: 205 DAQTAQEVLEV----------IAEMITAVGKGLSPSPLSPLNIATAL-HRIAKNMEKVSM 253
D T +E+LE +A ++ A+ + L P ++A L IAKN+E+++
Sbjct: 40 DYATIREILEARRHRRFNGQSVANLLLAIAYHHTQWRLLPRSLAAQLWDAIAKNVERLNP 99
Query: 254 MTTHRLAFT------RQREMS-------MLVAIAMTALPECSAQGISNIAWALSKIGGEL 300
+T R+RE+ +L A+ A + ++Q I+N WAL+ +G
Sbjct: 100 QGIANTLWTLATMNVRRRELEVQGLSDRLLDAVYYNA-EQFNSQDIANTLWALAAMGMRW 158
Query: 301 LYLSEM---DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFS-ELAKRASDIVH- 355
L E DR+ + F+ Q +AN A A+ +L + EL+ R + V
Sbjct: 159 RELEEQGLSDRLLDAVHRNAQRFSPQGIANALWALATTGMRWRELETRELSNRLFNAVQH 218
Query: 356 ---TFQEQELAQVLWAFASL 372
F Q++A LWA A +
Sbjct: 219 SAERFSSQQIANTLWALAMM 238
>gi|71657249|ref|XP_817143.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70882315|gb|EAN95292.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 220
Score = 46.6 bits (109), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 30/121 (24%), Positives = 57/121 (47%), Gaps = 15/121 (12%)
Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
+ + I+N+ +A +K+G L + R+A+ A+ GEF +VA + A+A ++
Sbjct: 33 TPKDITNVVYAYAKVG--LWHYKLFVRLADRAIQLRGEFRCDHVARLLEAYARVEMRYEK 90
Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
LF E + R I H E+ +++ A+A + P D F C ++A+
Sbjct: 91 LFVEFSPRIQTIAHLLTAGEVTKIVSAYAKVRIP-------------DVGVFNACGDRAV 137
Query: 402 S 402
+
Sbjct: 138 T 138
>gi|291411172|ref|XP_002721863.1| PREDICTED: FAST kinase domain-containing protein 3-like
[Oryctolagus cuniculus]
Length = 660
Score = 46.6 bits (109), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 35/62 (56%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
V K+VA IDGP F N LG +K+R++ G+ VV + + E E L+ E ++Y
Sbjct: 587 VYKRVALCIDGPQRFCSNGKHLLGKEAIKQRHLQLLGYQVVQVPYHEIEVLKSRLELVEY 646
Query: 615 LR 616
L+
Sbjct: 647 LQ 648
>gi|68074247|ref|XP_679038.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56499680|emb|CAH93735.1| conserved hypothetical protein [Plasmodium berghei]
Length = 830
Score = 46.6 bits (109), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 37/182 (20%), Positives = 76/182 (41%), Gaps = 36/182 (19%)
Query: 476 MFASQVHLVNQCLKLEH-PHLQLALSSVLEEKIASA-GKTKRFNQKVTSSFQKEVARLLV 533
++ +Q+ ++ L+ +H P++ + + E + K K + S QKEV +L+
Sbjct: 586 IYLNQLKIIELSLRTQHVPNVYNKIDTECYEYMNYIKNKEKEIEYNIKSDLQKEVKHILL 645
Query: 534 STGLNWIREYAVDGYTVDAVLVDK----------------------------------KV 559
+ L + E ++ Y VD V D+ K+
Sbjct: 646 TFNLTPLEEVSIGPYNVDFVEKDQTFQNICKNEIYYKDQSNNYTKIISSNKKINENIGKI 705
Query: 560 AFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
E++G HF RNT + LK + ++ G+ V+++ + +W L+ + Y++ I+
Sbjct: 706 IIEVNGEHHFYRNTKSYTSFSKLKHKLLSDLGYIVINIPYFDWAILKTDLNKKSYIKKII 765
Query: 620 KD 621
D
Sbjct: 766 ND 767
>gi|221061135|ref|XP_002262137.1| RAP protein [Plasmodium knowlesi strain H]
gi|193811287|emb|CAQ42015.1| RAP protein, putative [Plasmodium knowlesi strain H]
Length = 958
Score = 46.6 bits (109), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 30/125 (24%), Positives = 57/125 (45%), Gaps = 3/125 (2%)
Query: 517 NQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVP 576
N K + + E++++L +N ++ ++ D +L D +V GP + N+ V
Sbjct: 728 NMKYGARWINELSKILARINVNHLKNIYINHICADIMLPDSQVIIMCLGPYSYYVNSLVT 787
Query: 577 LGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKD---YIGGEGSSNIAE 633
+ LKR + + V+ LS+ EW +L E++ +L +D Y+ +AE
Sbjct: 788 TSTSDLKRFILEKKKYKVIPLSYHEWNKLNDYEEKIRFLYAFGRDAANYLFVNAKKGVAE 847
Query: 634 TLKMD 638
K D
Sbjct: 848 GEKSD 852
>gi|154332667|ref|XP_001562150.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059598|emb|CAM37182.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 442
Score = 46.2 bits (108), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 51/95 (53%), Gaps = 2/95 (2%)
Query: 283 AQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
A+G++NI A SK G L + + L +VGEF + ++ +A AFA +++ ++
Sbjct: 110 AKGVTNIISAFSKTGINHEKLFRLLSMRVQTLARVGEFEAAHLVILANAFARLRYREQNV 169
Query: 343 FSELAKRASDIVHTFQEQELAQVLWAF--ASLYEP 375
FS +A+RA + EL ++ AF A L +P
Sbjct: 170 FSAIARRAMSLRERVTVNELVPLINAFSKAGLKDP 204
>gi|397609733|gb|EJK60493.1| hypothetical protein THAOC_19142, partial [Thalassiosira oceanica]
Length = 500
Score = 46.2 bits (108), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 84/193 (43%), Gaps = 20/193 (10%)
Query: 269 MLVAIAMTALP---ECSAQGISNIAWALSKIGGELLYLSE---MDRVAEVALTKVGEFNS 322
+ ++ + ALP E A+ +SN+ ++ + + + D +A A+ K+ FN
Sbjct: 311 LFGSVEIAALPILGEFDARYLSNLIYSFGLVKYNPTFEDKTKLFDALASTAIDKLAVFNG 370
Query: 323 QNVANVAGAFASMQHSAPDLFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPADP--- 378
Q+++N+ AF + LF + + + + F EQ LA +LW+FA E ADP
Sbjct: 371 QDISNMLLAFVYVDSKNSMLFQKTGEALLKLYLGDFTEQALANILWSFAKSGE-ADPELF 429
Query: 379 ------LLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRD 432
++E + + F+ + A + G K GD + P F+
Sbjct: 430 QALGDHIVERILDDFRPQHLSNIVWSYATGGVSHPGLFKKIGDHVAGLKSLDP---FDPQ 486
Query: 433 QLGNIAWSYAVLG 445
L N AW++A G
Sbjct: 487 SLSNTAWAFATAG 499
>gi|189184538|ref|YP_001938323.1| repeat-containing protein A_05 [Orientia tsutsugamushi str. Ikeda]
gi|189181309|dbj|BAG41089.1| repeat-containing protein A_05 [Orientia tsutsugamushi str. Ikeda]
Length = 589
Score = 46.2 bits (108), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 51/208 (24%), Positives = 89/208 (42%), Gaps = 17/208 (8%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
A+ + E ++QG++N WA ++ + S +D A+ + EFNSQ+++N F
Sbjct: 88 AINLMDEFNSQGVTNSLWAFGRLKIQ-PQASFIDAWTNQAINLMDEFNSQDLSNSIWGFG 146
Query: 334 SMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLDN-AFKDA 390
++ +A+ + F QELA LWA L P +++ N A K
Sbjct: 147 WLEIQPQASFIDAWTNQATKTIGKFNPQELANSLWALGRLEIHPQALFIDAWTNQATKTI 206
Query: 391 TQFTCCLNKALSNCNEN-GGVKSSGDADSEGSLSSPVLS----FNRDQLGNIAWSYAVLG 445
QF ++ LSN G ++ A + ++ ++ FN L N W + L
Sbjct: 207 DQFN---HQNLSNSIWALGRLEIQPQASFIEAWTNQAINLMDEFNSQDLSNSIWGFGRLK 263
Query: 446 QMDRIFFSDIW-----KTISRFEEQRIS 468
+ F + W KTI +F Q ++
Sbjct: 264 IQPQASFIEAWIHQATKTIDKFNSQDLA 291
>gi|363735806|ref|XP_421951.2| PREDICTED: FAST kinase domain-containing protein 2 [Gallus gallus]
Length = 677
Score = 46.2 bits (108), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 74/333 (22%), Positives = 138/333 (41%), Gaps = 46/333 (13%)
Query: 331 AFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESL------ 383
A S+Q+ LFS +A + IV ++++ L AF +L ++P++ L+ L
Sbjct: 356 ACHSLQYRNIKLFSAVADYVNSIVCLLDKRQIILFLSAFETLGFQPSE-LMGVLAEKVTE 414
Query: 384 DNAFKDATQFTCCLNKALSNCNE-NGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYA 442
D+ F D F L + S N G L+ + + +L +S
Sbjct: 415 DSEFLDLKSFLIVL-RVYSRLNYVPRGQHLLFYETLHSCLNKYLPQISNAELLKAVYSLC 473
Query: 443 VLGQMDRIFFSDIWKTISRFEEQRISEQYRE--DIMFASQVHLVNQCLKLEHPH------ 494
+LG + + + + K S FEE + Y+E ++M +H V C++L+ P
Sbjct: 474 ILGYLPHLALNQLLKKDS-FEELMSGDLYKEKREMM----LHCVRTCMELDSPSFMKPAF 528
Query: 495 ---------LQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEV--ARLLVSTGLNWIREY 543
+ + L E + G F Q V ++ + + S +
Sbjct: 529 VPTEIFSSLVSVTLRKAREALLELLGDENMFRQNVQLPYEYRIDFEIWMDSDTKKVLPIT 588
Query: 544 AVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWE 603
A D Y +V +++AF P+ F T P G +K+R+++ G++V+ + +++++
Sbjct: 589 ATDSYADRSV---QRLAFLFVPPSAFCLGTTHPQGKLAMKKRHLSKLGYHVIPVLNKKFQ 645
Query: 604 EL--QGSFEQLDYLRVILKDYIGGEGSSNIAET 634
EL +G+ E LK I E S +E
Sbjct: 646 ELTNEGAIE-------FLKGKIYSENVSPFSEV 671
>gi|124512480|ref|XP_001349373.1| RAP protein, putative [Plasmodium falciparum 3D7]
gi|23499142|emb|CAD51222.1| RAP protein, putative [Plasmodium falciparum 3D7]
Length = 975
Score = 45.8 bits (107), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 26/113 (23%), Positives = 55/113 (48%), Gaps = 6/113 (5%)
Query: 519 KVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLG 578
K ++ + E++R+L ++ IR ++ D +L V + GP + N+ V
Sbjct: 750 KYSARWINELSRILTKMNVDHIRNVYINNICTDIMLTSTNVIIKCLGPYSYYINSLVTTS 809
Query: 579 HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIGGEGSSNI 631
+ LK + + + + V++LS+ +W +L E++ +L Y G ++NI
Sbjct: 810 ISDLKLKILESKKYKVINLSYHDWNKLNDYEEKIKFL------YSFGRHAANI 856
>gi|156088385|ref|XP_001611599.1| hypothetical protein [Babesia bovis T2Bo]
gi|154798853|gb|EDO08031.1| hypothetical protein BBOV_III004680 [Babesia bovis]
Length = 371
Score = 45.8 bits (107), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 52/95 (54%), Gaps = 18/95 (18%)
Query: 516 FNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVD-------KKVAFEIDGPTH 568
+N S+ Q+ V+ +LV G+ + V+ T D + +D +++A E+DGP H
Sbjct: 242 YNDSKMSTSQRYVSDVLVRLGI----PHKVELLTPDLLSIDIAIEGGGERIALEVDGPLH 297
Query: 569 FSR-----NTGVPL--GHTMLKRRYIAAAGWNVVS 596
F+R + G P+ G T +K ++ ++GW+V+S
Sbjct: 298 FTRVCHGTHLGQPMLTGPTRMKHNFLRSSGWHVIS 332
>gi|51259555|gb|AAH79475.1| Fastkd3 protein [Rattus norvegicus]
Length = 591
Score = 45.8 bits (107), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 38/74 (51%), Gaps = 3/74 (4%)
Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
DG+ + + V +VA IDGP F + LG +K+R++ G+ VV + + E
Sbjct: 506 DGFVLPFTVDEDVHTRVALCIDGPQRFCLGSKHLLGKEAIKQRHLRLLGYQVVQVPYHEL 565
Query: 603 EELQGSFEQLDYLR 616
E L E +DYL+
Sbjct: 566 ELLTSRLELVDYLQ 579
>gi|401404784|ref|XP_003881842.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325116256|emb|CBZ51809.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 2454
Score = 45.8 bits (107), Expect = 0.067, Method: Composition-based stats.
Identities = 29/78 (37%), Positives = 41/78 (52%), Gaps = 5/78 (6%)
Query: 548 YTVDAVLVDKKVAFEIDGPTHFSRN-TGVPL---GHTMLKRRYIAAAGWNVVSLSHQEWE 603
YT+ V ++AFE+ HF R+ G + T L+RR + A GW VV++ H EW
Sbjct: 2217 YTLPLVDATHRIAFEVGASEHFFRDPEGAEIELTAWTSLRRRLLQAQGWRVVAVPHFEWT 2276
Query: 604 ELQGSFEQLDYL-RVILK 620
L +L YL R +LK
Sbjct: 2277 ALPDRLARLRYLQRQLLK 2294
>gi|294866651|ref|XP_002764794.1| hypothetical protein Pmar_PMAR004016 [Perkinsus marinus ATCC 50983]
gi|239864541|gb|EEQ97511.1| hypothetical protein Pmar_PMAR004016 [Perkinsus marinus ATCC 50983]
Length = 663
Score = 45.8 bits (107), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 48/206 (23%), Positives = 86/206 (41%), Gaps = 20/206 (9%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDR--VAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
Q ++N +WA +K+ + +DR + E +G+ +S+++++V + AS Q+ D
Sbjct: 14 QLLANTSWAAAKLEAAKMSSDSIDRTDLNEKIYRFIGQMDSRHLSSVLWSIASAQNWPVD 73
Query: 342 --LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADP------------LLESLDNAF 387
+FS + + DI QELA LWA A E P ++ D F
Sbjct: 74 SEVFSRITRSLLDIPRPLHHQELANTLWALARAPERFRPESREVAIALMTKYVDRADPKF 133
Query: 388 KDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLG-- 445
+ + Q + + A++ + + S+ + L AWS A LG
Sbjct: 134 RFSDQHSANILWAIAKLEIDPTMARGVIDICIASIMETCGEYRPHSLSLSAWSLATLGIH 193
Query: 446 --QMDRIFFSDIWKTISRFEEQRISE 469
+DRI + + FE Q+I+
Sbjct: 194 PEVVDRIIVEASARRLRDFESQQIAH 219
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 51/107 (47%), Gaps = 13/107 (12%)
Query: 283 AQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
+Q I+++ WA GG LL +D + E + + QNVANV A S P L
Sbjct: 214 SQQIAHVVWA----GGTLLSAWSLDGLPERLAVTIDKAKPQNVANVMWGLA---RSGPPL 266
Query: 343 FSELAKRASDIVHT----FQEQELAQVLWAFASLYEPADPL--LESL 383
S+L + A + T + +L+ +LW+ ++ DP LESL
Sbjct: 267 NSKLVRFAQAHMETSSKAYLPVDLSSMLWSLGTMTNRGDPSEGLESL 313
>gi|148705062|gb|EDL37009.1| FAST kinase domains 3, isoform CRA_d [Mus musculus]
Length = 129
Score = 45.8 bits (107), Expect = 0.073, Method: Composition-based stats.
Identities = 25/74 (33%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
DG+ + + + K+VA IDGP F ++ LG K+R++ G+ VV L + E
Sbjct: 44 DGFVLPCTVDEDIHKRVALCIDGPQRFCLDSKHLLGKEATKQRHLRLLGYQVVQLPYHEL 103
Query: 603 EELQGSFEQLDYLR 616
E L E +DYL+
Sbjct: 104 ELLTSRLELVDYLQ 117
>gi|128485527|ref|NP_001076043.1| FAST kinase domain-containing protein 3 precursor [Rattus
norvegicus]
gi|145558914|sp|Q68FN9.2|FAKD3_RAT RecName: Full=FAST kinase domain-containing protein 3
gi|149032747|gb|EDL87602.1| similar to hypothetical protein MGC5297, isoform CRA_b [Rattus
norvegicus]
Length = 656
Score = 45.8 bits (107), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 38/74 (51%), Gaps = 3/74 (4%)
Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
DG+ + + V +VA IDGP F + LG +K+R++ G+ VV + + E
Sbjct: 571 DGFVLPFTVDEDVHTRVALCIDGPQRFCLGSKHLLGKEAIKQRHLRLLGYQVVQVPYHEL 630
Query: 603 EELQGSFEQLDYLR 616
E L E +DYL+
Sbjct: 631 ELLTSRLELVDYLQ 644
>gi|294874532|ref|XP_002767003.1| hypothetical protein Pmar_PMAR010983 [Perkinsus marinus ATCC 50983]
gi|239868378|gb|EEQ99720.1| hypothetical protein Pmar_PMAR010983 [Perkinsus marinus ATCC 50983]
Length = 733
Score = 45.4 bits (106), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 54/236 (22%), Positives = 100/236 (42%), Gaps = 59/236 (25%)
Query: 181 VHRLSQFSGPSNRRKEINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATA 240
V+R+S+ + P+ K + Q+A V ++ A A+G L+ + + +A
Sbjct: 64 VYRMSKHAAPTEAVKAL---------QSALHVDQLTA----ALGTSLAKLGIRDETVFSA 110
Query: 241 L-HRIAKNMEKVSM--MTTHRLAFTR----QREMSMLVAIAMTA-LPECSAQGISNIAWA 292
L R++ M+ M + AF R RE+ + ++T ECS + + ++ W+
Sbjct: 111 LGSRLSDKMDDFDMEDIAAVSWAFARAKFTDRELFRKIRESLTVRTTECSVKSLVSLTWS 170
Query: 293 LSKIG---GE----------------LLYLSE-------------------MDRVAEVAL 314
LSK+G GE L Y + M +A +
Sbjct: 171 LSKLGETGGEEDLFRYTLAPTIRSYMLEYTVQDLCALAWSFANANVHDVDFMSDIAHALM 230
Query: 315 TKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
K + N Q+V + A AS+ +S +LF L +++ ++HTF +L++ L+ F
Sbjct: 231 PKTRDMNCQDVCSAVVALASLHYSHKELFEALKQQSFRLMHTFTPLQLSRTLYGFG 286
>gi|161831154|ref|YP_001597208.1| hypothetical protein COXBURSA331_A1522 [Coxiella burnetii RSA 331]
gi|161763021|gb|ABX78663.1| conserved domain protein [Coxiella burnetii RSA 331]
Length = 580
Score = 45.4 bits (106), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 68/300 (22%), Positives = 122/300 (40%), Gaps = 57/300 (19%)
Query: 282 SAQGISNIAWALSKIGGELLYLSEMD---RVAEVALTKVGEFNSQNVANVAGAFASMQHS 338
S QGI+N+ WAL+ G L R+ E FN Q +AN A A+M
Sbjct: 243 SPQGIANVLWALATTGMRRRELENQGLSVRLFEAIRRNAERFNPQGIANALWALATMGMW 302
Query: 339 APDLFSE-LAKRASDIVH----TFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQF 393
+L + L+ R VH F Q +A VLWA ++ + +A +
Sbjct: 303 WEELEEQRLSDRLLGAVHRNAQRFSPQGIANVLWALTTM---------GMRWGELEAQRL 353
Query: 394 TCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFS 453
CL A+ E S A++ +L+ LS W Y ++DR+ +
Sbjct: 354 NNCLLAAVRYNAER--FSSQQIANTLWALAMMALS----------WGYLKEQRVDRLLLN 401
Query: 454 DIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKT 513
I ++ ++F + ++ IM++++ + P + L +S++ K
Sbjct: 402 AIDQSANQFSLEESTQ-----IMWSTRWFDIR-----PPPEILLKISNM---------KP 442
Query: 514 KRFNQKVTSSFQKEVARLL---VSTGLNWIREYAV-DGYTVDAVLVDKKVAFEIDGPTHF 569
R +S + VA +L ++ + E+ + + + VD + K++ E+DGP H
Sbjct: 443 PR-----SSDLHRHVASVLSAQINGEIPIENEFFIQNCFYVDICIPSKRLVIEVDGPYHI 497
Score = 43.5 bits (101), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 73/162 (45%), Gaps = 23/162 (14%)
Query: 232 LSPLNIATALH-RIAKNMEKVSMMTTHRLAFT------RQREMS-------MLVAIAMTA 277
L P ++A L IAKN+E+++ +T R+RE+ +L A+ A
Sbjct: 12 LLPRSLAAQLWDAIAKNVERLNPQGIANTLWTLATMNVRRRELEVQGLSDRLLDAVRYDA 71
Query: 278 LPECSAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFAS 334
+ QGI+N WAL +G GEL DR+ + + FNSQ++ N A A+
Sbjct: 72 -ERFNPQGIANTLWALVAMGMTWGELEAQELNDRLLDAVGSNAPRFNSQDITNTLWALAT 130
Query: 335 MQHSAPDLFSE-LAKRASDIVH----TFQEQELAQVLWAFAS 371
M +L + L R V F+ Q +A LWA A+
Sbjct: 131 MGMKWRELGDQRLRDRLLGAVRRNAERFKPQGIANALWALAT 172
Score = 40.0 bits (92), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 42/96 (43%), Gaps = 8/96 (8%)
Query: 284 QGISNIAWALSKIGGELLYLSEMD---RVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
QGI+N WAL+ G L R+ E FN Q +AN A A+M
Sbjct: 161 QGIANALWALATTGMRRRELENQGLSVRLFEAIRRNAERFNPQGIANALWALATMGMWWE 220
Query: 341 DLFSE-LAKRASDIVH----TFQEQELAQVLWAFAS 371
+L + L+ R VH F Q +A VLWA A+
Sbjct: 221 ELEEQRLSDRLLGAVHRNAQRFSPQGIANVLWALAT 256
>gi|303273894|ref|XP_003056299.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462383|gb|EEH59675.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 769
Score = 45.4 bits (106), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 41/183 (22%), Positives = 82/183 (44%), Gaps = 18/183 (9%)
Query: 198 NLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTH 257
+L D++DA ++VL + E+ K +N +TALHR+A+ + +
Sbjct: 262 DLQGDLMDASDVEDVLLAVEELGDVFNK---------VNCSTALHRVARLCTTPAAAGSP 312
Query: 258 R---LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSE--MDRVAEV 312
R A L+A+ + E ISN WA +++ L S+ + +A
Sbjct: 313 RPDVAAVAHDERFRALLAMVERSAHEMEIVSISNTLWAFARL---RLRPSDATVSTLASR 369
Query: 313 ALTKVGEFNSQNVANVAGAFASMQHSAPD-LFSELAKRASDIVHTFQEQELAQVLWAFAS 371
A+ + + ++++ V A A + H L + + RA ++ +F+ ++ +LWA+A
Sbjct: 370 AVDQCADAEPRHLSTVMWALAVLGHEPRSRLLAAVGDRAGEVAASFRPPDVVNLLWAYAR 429
Query: 372 LYE 374
+
Sbjct: 430 WHR 432
>gi|428175295|gb|EKX44186.1| hypothetical protein GUITHDRAFT_109971 [Guillardia theta CCMP2712]
Length = 1200
Score = 45.4 bits (106), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 55/127 (43%), Gaps = 26/127 (20%)
Query: 521 TSSFQKEVARLLVSTGLN----WIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGV- 575
S Q +V R L G+ W+ + Y VDA L +A E+DGP H++ + G
Sbjct: 1071 VSRLQSDVIRTLRGMGVEVEEEWMEPRS--RYVVDAWLPTFGIALEVDGPYHYAYSAGSA 1128
Query: 576 ------------------PLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRV 617
PLG T LK R++A V+ + + EW E + +Q YL
Sbjct: 1129 QETRPGSATVRPDGNGRHPLGSTKLKHRHLAELMIPVLVVPYWEWPEDSQASKQ-TYLSN 1187
Query: 618 ILKDYIG 624
+L ++G
Sbjct: 1188 LLFSHVG 1194
>gi|156095482|ref|XP_001613776.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148802650|gb|EDL44049.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 1193
Score = 45.4 bits (106), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 32/133 (24%), Positives = 72/133 (54%), Gaps = 6/133 (4%)
Query: 494 HLQLALSS--VLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAV-DGYTV 550
H +++L++ + ++++A + + NQ ++SS K+++ L + EY + D V
Sbjct: 1055 HFKVSLNTLNIWKKQLARNQRREEKNQ-ISSSVHKKISNDLRHLSIFHHNEYFILDSLLV 1113
Query: 551 DAVLVDKKVAFEIDGPTHFSRNTGVPL--GHTMLKRRYIAAAGWNVVSLSHQEWEELQGS 608
D + +V EIDGP+HF + + L +++ K+R + A G++V+S+S + + +
Sbjct: 1114 DVYVPRSRVVIEIDGPSHFLQKGRLILYNPNSLFKKRLLRALGFSVISISISDHTFMFSA 1173
Query: 609 FEQLDYLRVILKD 621
L +++ L +
Sbjct: 1174 LNTLSFVKQFLSN 1186
>gi|124809797|ref|XP_001348683.1| RAP protein, putative [Plasmodium falciparum 3D7]
gi|23497581|gb|AAN37122.1| RAP protein, putative [Plasmodium falciparum 3D7]
Length = 1725
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 33/143 (23%), Positives = 69/143 (48%), Gaps = 25/143 (17%)
Query: 488 LKLEHPHLQLALSSV-LEEKIASAGKTKRFNQKVT-----SSFQKEVARLLVSTGLNWIR 541
+K +H +L L+ S + L+++I + + F + + S F ++ ++L + +
Sbjct: 1563 IKYDHSNLHLSNSFIQLKDEIFLLLQKREFKRNMNKNDHISDFHVQICQILDDLNIRYHN 1622
Query: 542 EYAV-DGYTVDAVL----VDKKVAFEIDGPTHFS--------------RNTGVPLGHTML 582
EY D +VD L ++K+A EIDGP+H + T + G T+
Sbjct: 1623 EYITKDLLSVDIKLERKCCEQKLAIEIDGPSHHFLVLNEMQKADPQRIKKTYIKCGTTIF 1682
Query: 583 KRRYIAAAGWNVVSLSHQEWEEL 605
K + +GW++++++ EW ++
Sbjct: 1683 KHWLLQKSGWSIINVTSFEWNKI 1705
>gi|294867004|ref|XP_002764926.1| hypothetical protein Pmar_PMAR007493 [Perkinsus marinus ATCC 50983]
gi|239864762|gb|EEQ97643.1| hypothetical protein Pmar_PMAR007493 [Perkinsus marinus ATCC 50983]
Length = 795
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 45/201 (22%), Positives = 90/201 (44%), Gaps = 25/201 (12%)
Query: 427 LSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQ 486
+SF+ + + W+ A D+ F D+ ++ + + + + + S+VH
Sbjct: 572 ISFDVADVAIVLWAMAAADTYDQSVFRDLLSILASKSNELSAGERKASL---SKVHRAYL 628
Query: 487 CLKLEH-----PHLQLALSSVLEE-KIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWI 540
+L + P ++ VLEE + AS +V + K ++R S+ ++ +
Sbjct: 629 WARLGYGFQHSPQNGHLIAEVLEEAQRASVDARGALQTEVCQTLNKALSRSPRSSSMHLL 688
Query: 541 REY----AVDGYTVDAVLVD-----KKVAFEIDGPTHFSRNTGVPL-------GHTMLKR 584
E + G +VDA +VD +++ E+DGP H+ G G ++LK+
Sbjct: 689 SEVDLAPELPGLSVDAAVVDGRTGSRRLLVEVDGPHHYVDVLGESAVTRRQYNGQSVLKQ 748
Query: 585 RYIAAAGWNVVSLSHQEWEEL 605
IA AG+ ++S+ ++W L
Sbjct: 749 HLIAQAGFRLLSVEDEKWRSL 769
>gi|302829348|ref|XP_002946241.1| hypothetical protein VOLCADRAFT_102845 [Volvox carteri f.
nagariensis]
gi|300269056|gb|EFJ53236.1| hypothetical protein VOLCADRAFT_102845 [Volvox carteri f.
nagariensis]
Length = 1387
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 48/86 (55%), Gaps = 3/86 (3%)
Query: 522 SSFQKEVARLLVSTGLNWIREYAVDGYTVDAV--LVDKKVAFEIDGPTHFSR-NTGVPLG 578
S Q++V R LV+ G E V +TVD + + + VA E+DGPTHF+ + PLG
Sbjct: 1234 SDLQRDVYRQLVALGYRPRMEERVGFWTVDILFRVGARPVAVEVDGPTHFTTCHHRQPLG 1293
Query: 579 HTMLKRRYIAAAGWNVVSLSHQEWEE 604
++ + + G VV+LS +++ +
Sbjct: 1294 TSLARDECLRRLGLAVVALSFRDYRQ 1319
>gi|397568314|gb|EJK46072.1| hypothetical protein THAOC_35281, partial [Thalassiosira oceanica]
Length = 441
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 43/75 (57%), Gaps = 6/75 (8%)
Query: 303 LSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTF 357
LS D +A + + EF +++++N+ +F ++H+ PD LF+ A I+HTF
Sbjct: 368 LSIFDSIASSTVNMLNEFEARHLSNLIYSFGLIEHN-PDIGGETLFNVFGDAALKILHTF 426
Query: 358 QEQELAQVLWAFASL 372
+ Q L+ +LWAF +
Sbjct: 427 ESQNLSNMLWAFVKV 441
>gi|118353796|ref|XP_001010163.1| hypothetical protein TTHERM_00560100 [Tetrahymena thermophila]
gi|89291930|gb|EAR89918.1| hypothetical protein TTHERM_00560100 [Tetrahymena thermophila
SB210]
Length = 412
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 24/109 (22%), Positives = 56/109 (51%), Gaps = 6/109 (5%)
Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVL----VDKKVAFEIDGPTHFSR--NTG 574
S Q++ +L N+ E +D YTVD ++ + ++ E++GP+H+ N
Sbjct: 299 VSPIQEDCEIILKVLKWNFKSEVRIDPYTVDFLITLPSIKNQIVLEMNGPSHYPYFSNKD 358
Query: 575 VPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYI 623
V +K + + + V + HQ+W +++G ++D+++ +++ +I
Sbjct: 359 VFSAKEQMKVKNLKIKNYIPVLIHHQDWSQIKGVTGKIDFIQNLVQKHI 407
>gi|344272348|ref|XP_003407994.1| PREDICTED: LOW QUALITY PROTEIN: FAST kinase domain-containing
protein 3-like [Loxodonta africana]
Length = 671
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 34/60 (56%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
K+VA IDGP F N LG +K+R++ G+ VV + + E E L+ E ++YL+
Sbjct: 589 KRVALCIDGPKRFCFNGTNLLGKEAIKQRHLRLLGYEVVQIPYHETEMLKSRLELVEYLQ 648
>gi|294901002|ref|XP_002777205.1| hypothetical protein Pmar_PMAR007110 [Perkinsus marinus ATCC 50983]
gi|239884697|gb|EER09021.1| hypothetical protein Pmar_PMAR007110 [Perkinsus marinus ATCC 50983]
Length = 504
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 23/90 (25%), Positives = 47/90 (52%), Gaps = 2/90 (2%)
Query: 280 ECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
E + Q + +AW+ + + + M +A + K + N Q+V + A AS+ +S
Sbjct: 295 EYTVQDLCALAWSFA--NANVHDVDFMSDIAHALMPKTRDMNCQDVCSAVVALASLHYSH 352
Query: 340 PDLFSELAKRASDIVHTFQEQELAQVLWAF 369
+LF L +++ ++HTF +L++ L+ F
Sbjct: 353 KELFEALKQQSFRLMHTFTPLQLSRTLYGF 382
Score = 39.7 bits (91), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 65/272 (23%), Positives = 101/272 (37%), Gaps = 60/272 (22%)
Query: 235 LNIATALHRIAKNME--KVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWA 292
+N+ATALHR+AK+ + +VS + T + LV L G+ N WA
Sbjct: 60 INLATALHRVAKHSKSYQVSQVAT-------DPRYTALVDRLGAYLNSLDGVGLMNTLWA 112
Query: 293 LSKIG--------------------------GELLYL----------SEMDRVAEVAL-- 314
L ++ G+ LY +E + + AL
Sbjct: 113 LVRLNAAAPKWISELLDRCISSVDQLEPKQLGQGLYCVYRMSKHAAPTEAVKALQSALHG 172
Query: 315 ---TKVGEF-NSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
+ F +S + +V + A + +FS L R SD + F +++A V WAFA
Sbjct: 173 QVRASLDHFSDSHELVSVCTSLAKLGIRDETVFSALGSRLSDKMDDFDMEDIAAVSWAFA 232
Query: 371 SLYEPADPLLESLDNAFKDATQFTCC-------LNKALSNCNENGGVKSSGDADSEGSLS 423
L + + T T C L +LS E GG + ++
Sbjct: 233 RAKFTDRELFRKIRESLTVRT--TECSVKSLVSLTWSLSKLGETGGEEDLFRYTLAPTIR 290
Query: 424 SPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDI 455
S +L + L +AWS+A D F SDI
Sbjct: 291 SYMLEYTVQDLCALAWSFANANVHDVDFMSDI 322
>gi|237829857|ref|XP_002364226.1| hypothetical protein TGME49_109790 [Toxoplasma gondii ME49]
gi|211961890|gb|EEA97085.1| hypothetical protein TGME49_109790 [Toxoplasma gondii ME49]
gi|221507092|gb|EEE32696.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 309
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 58/230 (25%), Positives = 101/230 (43%), Gaps = 47/230 (20%)
Query: 206 AQTAQEVL-----------EVIAEMITAVGKGLSPSPLSPLN---------------IAT 239
A+TAQE+L E+ ++ + A LSPS ++ + +AT
Sbjct: 57 AETAQELLRGKETKRRAFWEIFSKRVKASAHMLSPSLMALIAKSFDVHDRDTGIYVALAT 116
Query: 240 ALHRIAKNMEKVSMMTTHRLAFTRQRE-------MSMLVAIAMTALPECSAQGISNIAWA 292
L K + S++T + F+R+ + S L AL + + + + I +
Sbjct: 117 VLPEAVKRADGRSLLTLSDV-FSRRLKRDSNPHLFSTLARQLPNALYQLTGKDVLRILSS 175
Query: 293 LSKIGGELLYLSEMDRVAEVA---LTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKR 349
L G L++M +VA L ++ E +S ++A+ + FAS + P+L+S LA+R
Sbjct: 176 LDAAG-----LADMLACRQVARKLLAELDELDSVDLADASAVFASQGYRNPELYSALARR 230
Query: 350 ASDIVHTF----QEQELAQVLWAFASLYEPADPLLESLDNAFKDAT-QFT 394
A D+ +F Q + ++L F+ D LLES + QFT
Sbjct: 231 AVDVKDSFDSCSQAPTVFRLLSGFSQNAVACDELLESFSTLLVSSKDQFT 280
>gi|156101207|ref|XP_001616297.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148805171|gb|EDL46570.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 1277
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 56/103 (54%), Gaps = 9/103 (8%)
Query: 521 TSSFQKEVARLLVSTGLNWIREYA--VDG-YTVDAVLVDKKVAFEIDGPTHFS-----RN 572
+SSF +EV L+S G ++ +DG YTVD ++V+ V EI+G H+ +
Sbjct: 1170 SSSFHREVLSTLLSLGEKNVQCEVPFMDGIYTVD-IVVNNSVCIEINGSNHYYYDSNLKR 1228
Query: 573 TGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
+G L L + Y+ + + ++ +S+ +W L+ + E+ DYL
Sbjct: 1229 SGEKLDALNLVKYYLLSKKYKLILVSYLDWNNLKSAEEKRDYL 1271
>gi|322699135|gb|EFY90899.1| ATP-dependent DNA helicase mph1 [Metarhizium acridum CQMa 102]
Length = 1070
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 50/180 (27%), Positives = 75/180 (41%), Gaps = 33/180 (18%)
Query: 1 MGFLPEN----SRWVYEEIISNIRIRRVTEDDEVDDSEEKESEDSVDWESEFLGELDPFG 56
MG PE+ R +I +R E D +SEE ++ DSV
Sbjct: 825 MGTEPESLVRQCRSTDTSRFQDIAVRPFVESD--GESEEDDTSDSVT------------- 869
Query: 57 YQAPKKRKKQEKSKVVDDNEGMDWCVRARKVALKSI------EARGLASSMEDLIKVKKK 110
KKR +++S D E + RK++ SI E A +M + + K
Sbjct: 870 ----KKRSTRQRSVGADHEESQPSRGKRRKISTTSIPGPSELEDDTEAPAMNGGARKRTK 925
Query: 111 KKKGKKKLEKIKKKNKVTDDDLDFDLEDDMKMDDIMGSGNGYDMNDL----RRTVSMMAG 166
K K K+K K K++ + D+L D E D + + GS +G D+ D R+T S M G
Sbjct: 926 KPKSKRKGRKTKQRTGINSDELGDDCERDSDLIESSGSDDGADLLDFVVADRQTTSSMVG 985
>gi|125815393|ref|XP_698448.2| PREDICTED: FAST kinase domain-containing protein 5-like [Danio
rerio]
Length = 640
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 2/68 (2%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL--QGSFEQLDY 614
K++A ++ H+ T LG LKRR++ AG+ VV L H EW L + E+L Y
Sbjct: 570 KRLAVQVTNRNHYCYRTKQLLGLHALKRRHLTLAGYRVVELPHWEWFPLLRRSQAEKLAY 629
Query: 615 LRVILKDY 622
L + +Y
Sbjct: 630 LHCKIFNY 637
>gi|332228041|ref|XP_003263199.1| PREDICTED: LOW QUALITY PROTEIN: FAST kinase domain-containing
protein 3 [Nomascus leucogenys]
Length = 662
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 21/74 (28%), Positives = 41/74 (55%), Gaps = 3/74 (4%)
Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
+G+ + + + + K++A IDGP F N+ LG +K+R++ G+ VV + + E
Sbjct: 575 EGFVLPSTVDEDIHKRIALCIDGPERFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEI 634
Query: 603 EELQGSFEQLDYLR 616
L+ E ++YL+
Sbjct: 635 GMLKSRCELVEYLQ 648
>gi|389584538|dbj|GAB67270.1| hypothetical protein PCYB_112910 [Plasmodium cynomolgi strain B]
Length = 1311
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 56/103 (54%), Gaps = 9/103 (8%)
Query: 521 TSSFQKEVARLLVSTGLNWIREYA--VDG-YTVDAVLVDKKVAFEIDGPTHFS-----RN 572
+SSF +EV L+S G+ ++ +DG YTVD ++++ EI+G H+ +
Sbjct: 1204 SSSFHREVLSTLLSLGVKNVQCEVPFMDGIYTVD-IVINNSTCIEINGSNHYYYDNNLKR 1262
Query: 573 TGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
+G L L + Y+ + + ++ +S+ +W L+ + E+ DYL
Sbjct: 1263 SGEKLDALNLIKYYLLSKKYKLILVSYLDWNNLKSAEEKKDYL 1305
>gi|209876299|ref|XP_002139592.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209555198|gb|EEA05243.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 587
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 26/99 (26%), Positives = 52/99 (52%), Gaps = 6/99 (6%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGEL--LYLSEMDRVAEVALTKVGEFNSQNVANVAGA 331
A+ L A +S + W+ SK G + L+++ + +V L+++ SQ ++N+ +
Sbjct: 190 AVYQLDRFIAINLSMLLWSYSKSGKKYNYLFITAIPKV----LSELDNLQSQQISNIIWS 245
Query: 332 FASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
+A + +P LF +AKR + I+ F ++ +AFA
Sbjct: 246 YAKIGLISPHLFENIAKRCTSILSEFLPIHISMTAYAFA 284
>gi|156083971|ref|XP_001609469.1| hypothetical protein [Babesia bovis T2Bo]
gi|154796720|gb|EDO05901.1| hypothetical protein BBOV_IV003040 [Babesia bovis]
Length = 217
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 42/83 (50%)
Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHT 580
TS Q ++A LL L E + Y +D V+ VA E++G +HF + T
Sbjct: 118 TSKMQYKLAPLLNHLKLQHRAEVQIGPYVMDYVIPRLNVAVEVNGHSHFYHQSTQFHALT 177
Query: 581 MLKRRYIAAAGWNVVSLSHQEWE 603
LK + + GW V+S+++ +W+
Sbjct: 178 KLKYSIVQSLGWQVLSVNYFDWK 200
>gi|294877802|ref|XP_002768134.1| hypothetical protein Pmar_PMAR002922 [Perkinsus marinus ATCC 50983]
gi|239870331|gb|EER00852.1| hypothetical protein Pmar_PMAR002922 [Perkinsus marinus ATCC 50983]
Length = 146
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 522 SSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTM 581
S FQ+ + +L + L + E Y VD V +A E DG THF T
Sbjct: 28 SKFQESIKAVLKACELEYHEEVIAGTYIVDYA-VGNSLALEADGFTHFYAGTENFTAKAK 86
Query: 582 LKRRYIAAAGWNVVSLSH 599
LK R + + GWN+VSL +
Sbjct: 87 LKHRILRSLGWNIVSLPY 104
>gi|294956195|ref|XP_002788848.1| hypothetical protein Pmar_PMAR004308 [Perkinsus marinus ATCC 50983]
gi|239904460|gb|EER20644.1| hypothetical protein Pmar_PMAR004308 [Perkinsus marinus ATCC 50983]
Length = 299
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 61/268 (22%), Positives = 109/268 (40%), Gaps = 51/268 (19%)
Query: 183 RLSQFSGPSNRRK--EINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATA 240
R+ + +G R K ++ L + ++DA T VLE++ +G +N A A
Sbjct: 14 RMYEVAGRLRRGKSGDLVLQRRLMDASTPAAVLEIVLPNANKLGS---------VNYACA 64
Query: 241 LHRIA-------------KNMEKVSMMT------------THRLAFTRQREMSMLVAIAM 275
LHR A + ++++ T T LA TR+ E + A
Sbjct: 65 LHRCAVWFRSGKPTPSGLSQVPRLALQTVRDWRAREAATITWALAVTRELEHILEFARLS 124
Query: 276 TALPECSAQGISNIAWALSKIG-------GELLYLSEMDRVAEVALTKVGEFNSQNVANV 328
+ E S ++N+ +L+ G L +++ RV + L+ G + +A V
Sbjct: 125 MSCNEASGGDLANVVHSLTISGLNPRQCTATLAVVAK--RVTAMDLSHCGVIEPKQLAAV 182
Query: 329 AGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNA-- 386
F ++ + D+ + L + A+ + F Q+L+ V WA A + PLL + D A
Sbjct: 183 FWGFVKLEFTDDDVMTYLVRSATTRMDEFNSQDLSMVSWALAK----SLPLLPTEDCAQG 238
Query: 387 FKDATQFTCCLNKALSNCNENGGVKSSG 414
TQF ++ L ++SG
Sbjct: 239 IDRFTQFNTSCDEHLMGIGTMSASRTSG 266
>gi|82596883|ref|XP_726446.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23481859|gb|EAA18011.1| hypothetical protein [Plasmodium yoelii yoelii]
Length = 1071
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 47/211 (22%), Positives = 89/211 (42%), Gaps = 43/211 (20%)
Query: 434 LGNIAWSYAVLGQM--DRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLE 491
L W +++ + D I F +I+ + E +I EQ + M+ V + LK
Sbjct: 850 LARYLWGVSIVNLINDDTINFINIY----NWNEIKIYEQ---NPMYLHMVFTLWLRLKYS 902
Query: 492 HPHLQLA---------LSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIRE 542
+ HL+L+ ++ +L++K G N+ S+F +++++L + + E
Sbjct: 903 YSHLKLSKNFLNFIDQITHILKKKYIKNG----LNKDNLSTFHVQISKILDEFNVKYTNE 958
Query: 543 YAVDGYTVDAVL-----VDKKVAFEIDGPTH-------FSRNTGV-------PLGHTMLK 583
Y + ++ +K+A EIDGP+H NT + G T K
Sbjct: 959 YITKDLLIIDIIIILKECKEKIAIEIDGPSHHLLDLSDLHVNTSINDNKKYLQCGTTYFK 1018
Query: 584 RRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
+ GW V+++ EW +++ E DY
Sbjct: 1019 NFLLKKNGWKVINIPSYEWNKIKK--EDRDY 1047
>gi|407849431|gb|EKG04172.1| hypothetical protein TCSYLVIO_004775 [Trypanosoma cruzi]
Length = 1005
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 49/94 (52%), Gaps = 2/94 (2%)
Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
+ + I+N+ +A +K+G L + R+A+ A+ GEF +VA + A+A ++
Sbjct: 818 TPKDITNVVYAYAKVG--LWHYKLFVRLADRAIQLRGEFRCDHVARLLEAYARVEMRYEK 875
Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEP 375
LF E + R I H E+ +++ A+A + P
Sbjct: 876 LFVEFSPRIQTIAHLLTAGEVTKIVSAYAKVRIP 909
>gi|348503464|ref|XP_003439284.1| PREDICTED: FAST kinase domain-containing protein 3-like
[Oreochromis niloticus]
Length = 659
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 40/73 (54%), Gaps = 3/73 (4%)
Query: 546 DGYTVDAVLVD---KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
+GY + A D K++A IDG F+ N LG +K+R++ G+ VV + + E+
Sbjct: 574 EGYVLPASQTDDVYKRIALCIDGQKRFTSNLRQLLGKEAIKQRHLRLLGYEVVQIPYFEY 633
Query: 603 EELQGSFEQLDYL 615
E+LQ ++YL
Sbjct: 634 EKLQSKNSMVEYL 646
>gi|395735635|ref|XP_002815460.2| PREDICTED: FAST kinase domain-containing protein 3 [Pongo abelii]
Length = 658
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 21/74 (28%), Positives = 41/74 (55%), Gaps = 3/74 (4%)
Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
+G+ + + + + K++A IDGP F N+ LG +K+R++ G+ VV + + E
Sbjct: 575 EGFVLPSTVNEDIHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEI 634
Query: 603 EELQGSFEQLDYLR 616
L+ E ++YL+
Sbjct: 635 GMLKSRRELVEYLQ 648
>gi|428673296|gb|EKX74209.1| conserved hypothetical protein [Babesia equi]
Length = 570
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 67/149 (44%), Gaps = 25/149 (16%)
Query: 477 FASQVHLVNQ-CLKLEHP-HLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVS 534
F +Q++L+N+ C+ H H ++ + L + I S + RF++ F+ L V
Sbjct: 408 FITQLNLLNKACIVERHRLHSKIMANQQLSDFINSIPNSTRFDE--AYDFKTSTTHLQVR 465
Query: 535 TGLNWIR-----EYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLG----------- 578
L+ E V Y VD ++ K + E+DGP H++ +G
Sbjct: 466 NTLDMFNYETEVETKVYPYIVDILVKSKNLIIEVDGPYHYTTYINKSVGKILNRESSDDL 525
Query: 579 --HTM---LKRRYIAAAGWNVVSLSHQEW 602
HT+ LK+R + +G+ V++ + +W
Sbjct: 526 FQHTLNSRLKQRLLQKSGYKFVNIPYYKW 554
>gi|308807601|ref|XP_003081111.1| unnamed protein product [Ostreococcus tauri]
gi|116059573|emb|CAL55280.1| unnamed protein product [Ostreococcus tauri]
Length = 665
Score = 44.3 bits (103), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 79/192 (41%), Gaps = 37/192 (19%)
Query: 199 LNKDIVDAQTAQEVLEVI---AEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMT 255
+ +++ +A +A++ L V+ E+ AV + ATALHR+AK S +
Sbjct: 55 IQRELANASSAEDALRVVERDLEVFDAV------------HAATALHRVAKFSSPSSRLD 102
Query: 256 THRL----AFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAE 311
A TR L + + E A G++N+AW+ +KIG Y D +
Sbjct: 103 ARDFDRVEAVTRDERFKALASTVGDRMNEFDAFGLANVAWSFAKIG----YTPSQDTLNA 158
Query: 312 VA-------LTKVGEFNSQNVANVAGAFASMQHSAP----DLFSELAKRASDIVHTFQEQ 360
+A L Q+++N A AF +++ P + E R D F+
Sbjct: 159 LASRLEREVLKHGASVKPQSLSNAAYAFGRLRYKPPKSTLEALCEATMRQMD---KFRTD 215
Query: 361 ELAQVLWAFASL 372
E A ++ A L
Sbjct: 216 EFAGMMLGLAHL 227
>gi|308804243|ref|XP_003079434.1| unnamed protein product [Ostreococcus tauri]
gi|116057889|emb|CAL54092.1| unnamed protein product, partial [Ostreococcus tauri]
Length = 1182
Score = 44.3 bits (103), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 49/89 (55%), Gaps = 6/89 (6%)
Query: 520 VTSSFQKEVARLLVSTGL-NWIREYAVDGYTV--DAVLVDKKVAFEIDGPTHFSRNT-GV 575
TS+ Q+ VA L G+ ++ E AV+G + D V +++ E+DGP H+S + GV
Sbjct: 770 TTSNLQRAVADHLHDMGVGDFDVERAVEGGKMRPDIVFESRRLVIEVDGPHHYSVDADGV 829
Query: 576 --PLGHTMLKRRYIAAAGWNVVSLSHQEW 602
LG T+++ + + GW V + + EW
Sbjct: 830 RRELGQTIVRNELLRSWGWKVCVVPYHEW 858
Score = 42.4 bits (98), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 56/224 (25%), Positives = 91/224 (40%), Gaps = 42/224 (18%)
Query: 197 INLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTT 256
+ +NK ++ +T +E+ V+ G S +S +N +T R+AK
Sbjct: 154 LRMNKALMTCETVEELAAVV---------GGRASAMSDVNASTTYSRLAKFARG-----G 199
Query: 257 HRLAFTRQREMSMLV------AIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVA 310
R REMS A ++ + + + + +AWA G L D A
Sbjct: 200 RRAREEVVREMSRATWFKEVEARSIETMDKMQPRSAAQMAWAC----GHLSRSRRRDGDA 255
Query: 311 -----EVALTKVG-EFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHTFQE 359
E AL ++G +F Q VANVA A+A ++ P + L + A D ++
Sbjct: 256 FWDALERALERLGTKFKPQGVANVAWAYAKLEMRMPQGIRNAFETHLERNAQD----YKP 311
Query: 360 QELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSN 403
EL WA L + D + E + A + TCC + L+N
Sbjct: 312 YELTITFWA---LTKHGDAVREDVAIALERTLDLTCCKPQELAN 352
>gi|82594046|ref|XP_725261.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23480197|gb|EAA16826.1| hypothetical protein [Plasmodium yoelii yoelii]
Length = 213
Score = 44.3 bits (103), Expect = 0.21, Method: Composition-based stats.
Identities = 32/144 (22%), Positives = 58/144 (40%), Gaps = 34/144 (23%)
Query: 512 KTKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDK-------------- 557
K K + S QKEV +L++ L + E ++ Y VD + DK
Sbjct: 7 KEKEIEYNIKSDLQKEVKNILLTFNLTPLEEVSIGPYNVDFIEEDKTFQNISKNEIYYKK 66
Query: 558 --------------------KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSL 597
K+ E++G HF RNT + LK + ++ G+ V+++
Sbjct: 67 ESNNSTKIILSDKKNYENIGKIIIEVNGEHHFYRNTKSYTSFSKLKHKLLSDLGYIVINI 126
Query: 598 SHQEWEELQGSFEQLDYLRVILKD 621
+ +W L+ + Y++ I+ D
Sbjct: 127 PYFDWAILKTYLNKKSYIKKIIND 150
>gi|71421683|ref|XP_811868.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70876580|gb|EAN90017.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 1005
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 49/94 (52%), Gaps = 2/94 (2%)
Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
+ + I+N+ +A +K+G L + R+A+ A+ GEF +VA + A+A ++
Sbjct: 818 TPKDITNVVYAYAKVG--LWHYKLFVRLADRAIQLRGEFRCDHVARLLEAYARVEMRYEK 875
Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEP 375
LF E + R I H E+ +++ A+A + P
Sbjct: 876 LFVEFSPRIQTIAHLLTAGEVTKIVSAYAKVRIP 909
>gi|48257152|gb|AAH01295.2| FASTKD3 protein [Homo sapiens]
Length = 550
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 35/62 (56%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
+ K++A IDGP F N+ LG +K+R++ G+ VV + + E L+ E ++Y
Sbjct: 475 IHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEIGMLKSRRELVEY 534
Query: 615 LR 616
L+
Sbjct: 535 LQ 536
>gi|429327253|gb|AFZ79013.1| hypothetical protein BEWA_018580 [Babesia equi]
Length = 951
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 3/88 (3%)
Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVL-VDK--KVAFEIDGPTHFSRNTGVPL 577
+S +E++ L G+ E Y +D V V+ KVA E DGP+HF T +
Sbjct: 745 SSPAHRELSHFLNLAGVLHKNEVQCGPYLIDIVPEVNPGIKVAIEYDGPSHFYAETVMRN 804
Query: 578 GHTMLKRRYIAAAGWNVVSLSHQEWEEL 605
++ K + + GW V+ + +QEW +L
Sbjct: 805 IKSITKHEILESMGWEVIHVPYQEWIQL 832
>gi|156085826|ref|XP_001610322.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797575|gb|EDO06754.1| hypothetical protein BBOV_IV003930 [Babesia bovis]
Length = 651
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 46/215 (21%), Positives = 84/215 (39%), Gaps = 21/215 (9%)
Query: 419 EGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFA 478
EG+ ++ +LSF+ +G + + A F +I K + E + D++
Sbjct: 425 EGNKTTLILSFSHIIMGTVKLNKAPKSTETMPVFYNILKYLLEHPELHDEDHIDPDVLQG 484
Query: 479 SQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQK---VTSSFQKEVARLLVST 535
S ++ + + HL+ S I S R + TS K+VA +L +
Sbjct: 485 SLNNVRLLVTYIGYDHLKQWFRSTEISAIESLLAKARLDYCKDFRTSDLHKQVADVLSTL 544
Query: 536 GLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTM-------------- 581
G+ +E + + D VL +++ EIDGP HF+ L +
Sbjct: 545 GIECDQEVTIGSHICDLVLKKRRIVIEIDGPYHFNTTLNSSLNSILNRHVDDYRLTYTYN 604
Query: 582 --LKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
+K + G+ V+ + + W G EQ+ Y
Sbjct: 605 SRIKMYMLRQGGYKVIHIPYFMWPS--GKQEQMVY 637
>gi|296194953|ref|XP_002806679.1| PREDICTED: LOW QUALITY PROTEIN: FAST kinase domain-containing
protein 3 [Callithrix jacchus]
Length = 670
Score = 44.3 bits (103), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 48/99 (48%), Gaps = 12/99 (12%)
Query: 529 ARLLVSTGLNWIREYAVD--------GYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPL 577
ARL ++ + Y +D G+ + + V K++A IDGP F N+ L
Sbjct: 550 ARLYFASKVLTPYYYTIDVEIKLDEEGFVLPCTVNEDVHKRIALCIDGPQRFCSNSKHLL 609
Query: 578 GHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
G +K+R++ G+ VV + + E E L E ++YL+
Sbjct: 610 GKEAIKQRHLQLLGYQVVQMPYHEIEML-TRLELVEYLQ 647
>gi|68063701|ref|XP_673847.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56491996|emb|CAI01743.1| conserved hypothetical protein [Plasmodium berghei]
Length = 608
Score = 44.3 bits (103), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 15/59 (25%), Positives = 33/59 (55%)
Query: 546 DGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEE 604
+ Y + L+ K + E+DG +HF + + ++++K + GWN++ + +QEW +
Sbjct: 428 EKYNIIEKLLTKNIVIEVDGISHFYKESYSRTLNSIIKNYILKKFGWNIIHIPYQEWNQ 486
>gi|302834581|ref|XP_002948853.1| hypothetical protein VOLCADRAFT_89149 [Volvox carteri f.
nagariensis]
gi|300266044|gb|EFJ50233.1| hypothetical protein VOLCADRAFT_89149 [Volvox carteri f.
nagariensis]
Length = 1137
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 51/98 (52%), Gaps = 3/98 (3%)
Query: 286 ISNIAWAL---SKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
ISN+++AL + + + M +A A ++ EF Q+++N+ A+A + P L
Sbjct: 465 ISNLSYALVVARQHRAHPAHEAVMRALAVAAEARLSEFCPQDISNMLWAYARCGMAQPAL 524
Query: 343 FSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLL 380
FS A A + F + L QV+WA+A++ PLL
Sbjct: 525 FSAAASIARMMAADFSQAGLVQVIWAYAAMRVYDAPLL 562
>gi|40068497|ref|NP_076996.2| FAST kinase domain-containing protein 3 [Homo sapiens]
gi|294862434|sp|Q14CZ7.2|FAKD3_HUMAN RecName: Full=FAST kinase domain-containing protein 3
Length = 662
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 35/62 (56%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
+ K++A IDGP F N+ LG +K+R++ G+ VV + + E L+ E ++Y
Sbjct: 587 IHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEIGMLKSRRELVEY 646
Query: 615 LR 616
L+
Sbjct: 647 LQ 648
>gi|109730533|gb|AAI13564.1| Hypothetical protein MGC5297 [Homo sapiens]
gi|119628499|gb|EAX08094.1| hypothetical protein MGC5297, isoform CRA_a [Homo sapiens]
gi|313883202|gb|ADR83087.1| FAST kinase domains 3 [synthetic construct]
Length = 662
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 35/62 (56%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
+ K++A IDGP F N+ LG +K+R++ G+ VV + + E L+ E ++Y
Sbjct: 587 IHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEIGMLKSRRELVEY 646
Query: 615 LR 616
L+
Sbjct: 647 LQ 648
>gi|348536034|ref|XP_003455502.1| PREDICTED: FAST kinase domain-containing protein 5-like
[Oreochromis niloticus]
Length = 986
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 35/66 (53%), Gaps = 2/66 (3%)
Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL--QGSFEQLDYL 615
K+A ++ HF + LG +KRR + AG+ VV LS+QEW L + E+L YL
Sbjct: 919 KIAVQVSNRNHFCSQSQQLLGLHAMKRRQLKIAGYRVVELSYQEWFPLLRKSRAEKLAYL 978
Query: 616 RVILKD 621
L D
Sbjct: 979 HCKLYD 984
>gi|407410016|gb|EKF32615.1| hypothetical protein MOQ_003529 [Trypanosoma cruzi marinkellei]
Length = 1005
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 30/121 (24%), Positives = 58/121 (47%), Gaps = 15/121 (12%)
Query: 282 SAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD 341
+ + I+N+ +A +++G L + R+A+ A+ GEF +VA + A+A ++
Sbjct: 818 TPKDITNVVYAYAQVG--LWHYKLFVRLADRAIQLRGEFRCDHVARLLEAYARVEMRYEK 875
Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
LF E + R I H E+ +++ A+A + P DA F C ++A+
Sbjct: 876 LFLEFSPRIQTIAHLLTAGEVTKIVAAYAKVRIP-------------DAGVFNACGDRAV 922
Query: 402 S 402
+
Sbjct: 923 A 923
>gi|397475721|ref|XP_003809274.1| PREDICTED: FAST kinase domain-containing protein 3 [Pan paniscus]
Length = 662
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 35/62 (56%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
+ K++A IDGP F N+ LG +K+R++ G+ VV + + E L+ E ++Y
Sbjct: 587 IHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEIGMLKSRRELVEY 646
Query: 615 LR 616
L+
Sbjct: 647 LQ 648
>gi|426385166|ref|XP_004059100.1| PREDICTED: FAST kinase domain-containing protein 3 [Gorilla gorilla
gorilla]
Length = 662
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 35/62 (56%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
+ K++A IDGP F N+ LG +K+R++ G+ VV + + E L+ E ++Y
Sbjct: 587 IHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEIGMLKSRRELVEY 646
Query: 615 LR 616
L+
Sbjct: 647 LQ 648
>gi|345796308|ref|XP_545176.3| PREDICTED: LOW QUALITY PROTEIN: FAST kinase domain-containing
protein 3 [Canis lupus familiaris]
Length = 672
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 36/62 (58%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
V K+VA ID P FS N+ LG +K+R++ G+ VV + + E E L+ E ++Y
Sbjct: 588 VHKRVALCIDDPKRFSLNSKHLLGKEAIKQRHLRLLGYQVVQIPYYEIEVLKSRGELVEY 647
Query: 615 LR 616
L+
Sbjct: 648 LQ 649
>gi|332820899|ref|XP_517625.3| PREDICTED: FAST kinase domain-containing protein 3 [Pan
troglodytes]
gi|410217416|gb|JAA05927.1| FAST kinase domains 3 [Pan troglodytes]
gi|410254100|gb|JAA15017.1| FAST kinase domains 3 [Pan troglodytes]
gi|410288824|gb|JAA23012.1| FAST kinase domains 3 [Pan troglodytes]
gi|410339215|gb|JAA38554.1| FAST kinase domains 3 [Pan troglodytes]
Length = 662
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 35/62 (56%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
+ K++A IDGP F N+ LG +K+R++ G+ VV + + E L+ E ++Y
Sbjct: 587 IHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEIGMLKSRRELVEY 646
Query: 615 LR 616
L+
Sbjct: 647 LQ 648
>gi|410951914|ref|XP_003982637.1| PREDICTED: protein TBRG4 isoform 2 [Felis catus]
Length = 630
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 68/159 (42%), Gaps = 20/159 (12%)
Query: 259 LAFTRQREMSMLVAIA--MTALPECSAQG-ISNIAWALSKIGGELLYLSEMDRVAEVALT 315
LA +R + +L A++ + P +G + ++A+A K+G + R+A L
Sbjct: 264 LAAQNRRSVPLLRAVSYHLVQKPFPLTKGMLLDLAYAYGKLGFH--QTQVLQRLAADLLP 321
Query: 316 KVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL--- 372
V S VA A +FA ++ +P LF LA+ D HT L VL AFA L
Sbjct: 322 HVPSLTSGEVARGAKSFALLKWLSPPLFEALAQHVVDRAHTVTVPHLCNVLLAFAHLNFR 381
Query: 373 -----------YEPADPLLESLDNAFK-DATQFTCCLNK 399
+E P L+SL A + D C L +
Sbjct: 382 PEREDKFFGLVHEKLGPKLQSLHPALQVDVVWALCVLQQ 420
>gi|351714996|gb|EHB17915.1| FAST kinase domain-containing protein 1 [Heterocephalus glaber]
Length = 778
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 6/75 (8%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFE--QLDY 614
+++AFE F RN G + +K+R++ G+ V+ + H EW + S + ++DY
Sbjct: 708 ERIAFEFLDSKAFCRNIPHLKGKSAMKKRHLEILGYRVIQIPHFEWNSMALSTKDARMDY 767
Query: 615 LRVILKDYIGGEGSS 629
LR +I GEG+S
Sbjct: 768 LR----QHIFGEGTS 778
>gi|294872955|ref|XP_002766462.1| hypothetical protein Pmar_PMAR018296 [Perkinsus marinus ATCC 50983]
gi|239867342|gb|EEQ99179.1| hypothetical protein Pmar_PMAR018296 [Perkinsus marinus ATCC 50983]
Length = 1082
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 49/223 (21%), Positives = 95/223 (42%), Gaps = 41/223 (18%)
Query: 186 QFSGPSNRRK----EINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATAL 241
+ G SN+R+ E + + I+ A ++ I+ ++ V K L L+ +N++T +
Sbjct: 596 RVGGHSNQRQATANEFEIQRSILAAANSRS----ISSLLLIVEKHLDE--LNSVNVSTLI 649
Query: 242 HRIA---KNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGG 298
HR+A +N E+ ++ + ++ A+ P S Q +SNI WA+ G
Sbjct: 650 HRLASITQNQEQ------NQRVLANDPRVKEVLRRAIDLAPTSSCQSLSNICWAI----G 699
Query: 299 ELLYLSE------------------MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
+L + E MD VAE + F Q V+N+ A+ +
Sbjct: 700 KLQMVEEKDVVRAIVEAAKSQLEELMDLVAEKVANSLYTFKPQEVSNLLYAYGRLNCYNE 759
Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESL 383
L E+ + ++ + Q + V+ + A L P L++++
Sbjct: 760 KLLQEICACVATMMPRYDGQGVGNVICSLAKLKYPCIQLMDAI 802
>gi|410949837|ref|XP_003981623.1| PREDICTED: FAST kinase domain-containing protein 3 [Felis catus]
Length = 718
Score = 43.9 bits (102), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 545 VDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEE 604
V +T+D V K++A ID P FS N+ LG +K+R++ G+ VV + + E E
Sbjct: 578 VLPFTIDED-VHKRLALCIDDPKRFSLNSRHLLGKEAIKQRHLRLLGYQVVQIPYYEIEM 636
Query: 605 LQGSFEQLDYLR 616
L+ E ++YL+
Sbjct: 637 LKSRVELVEYLQ 648
>gi|149032746|gb|EDL87601.1| similar to hypothetical protein MGC5297, isoform CRA_a [Rattus
norvegicus]
Length = 168
Score = 43.5 bits (101), Expect = 0.31, Method: Composition-based stats.
Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 1/69 (1%)
Query: 548 YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQG 607
+TVD V +VA IDGP F + LG +K+R++ G+ VV + + E E L
Sbjct: 89 FTVDED-VHTRVALCIDGPQRFCLGSKHLLGKEAIKQRHLRLLGYQVVQVPYHELELLTS 147
Query: 608 SFEQLDYLR 616
E +DYL+
Sbjct: 148 RLELVDYLQ 156
>gi|153206845|ref|ZP_01945686.1| hypothetical protein A35_A0967 [Coxiella burnetii 'MSU Goat Q177']
gi|120577208|gb|EAX33832.1| hypothetical protein A35_A0967 [Coxiella burnetii 'MSU Goat Q177']
Length = 438
Score = 43.5 bits (101), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 50/184 (27%), Positives = 77/184 (41%), Gaps = 20/184 (10%)
Query: 282 SAQGISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHS 338
+ QGI+N WAL+ +G EL DR+ +FN QNV N FA++
Sbjct: 98 NPQGIANTLWALATMGVRRQELEAQGLNDRLMGAVHHNAEQFNPQNVTNTLWTFATLSVK 157
Query: 339 APDL-FSELAKRASDIVH----TFQEQELAQVLWAFASL---------YEPADPLLESLD 384
+L EL + VH Q + LWA A++ E D LLE++
Sbjct: 158 WEELEAQELNDCLLNAVHRNADQLNPQGIVNTLWALATMGVRWRELEVRELTDRLLEAVR 217
Query: 385 -NAFKDATQFTCCLNKALSNCN-ENGGVKSSGDADS-EGSLSSPVLSFNRDQLGNIAWSY 441
NA + ++ AL+ + G +++ G D G++ V FN + N W
Sbjct: 218 YNASRFKSREIANTLWALATLSVRRGNMEAQGLRDRLLGAVHHNVERFNPQDIANALWGL 277
Query: 442 AVLG 445
A +G
Sbjct: 278 ATMG 281
>gi|156102949|ref|XP_001617167.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148806041|gb|EDL47440.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 943
Score = 43.1 bits (100), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 24/104 (23%), Positives = 50/104 (48%), Gaps = 6/104 (5%)
Query: 527 EVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRY 586
E++++L ++ ++ ++ D +L D ++ GP + N+ V + LKR
Sbjct: 738 ELSKILARINVSHLKSVYINHICADIMLPDSQIVIMCLGPYSYYVNSLVTTSTSDLKRSI 797
Query: 587 IAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIGGEGSSN 630
+ + V+ LS+ EW +L E++ +L Y G G++N
Sbjct: 798 LEKKKYKVIPLSYHEWNKLNDYEEKIRFL------YAFGRGAAN 835
>gi|302781714|ref|XP_002972631.1| hypothetical protein SELMODRAFT_413126 [Selaginella moellendorffii]
gi|300160098|gb|EFJ26717.1| hypothetical protein SELMODRAFT_413126 [Selaginella moellendorffii]
Length = 177
Score = 43.1 bits (100), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 574 GVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQL-DYLRVIL 619
G LGHT+LK R + AA W ++S S+ EWE LQG L Y R+ L
Sbjct: 126 GDLLGHTVLKHRLVEAAEWKIISASYAEWENLQGESGHLTSYKRLWL 172
>gi|221057756|ref|XP_002261386.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|194247391|emb|CAQ40791.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 1303
Score = 43.1 bits (100), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 56/103 (54%), Gaps = 9/103 (8%)
Query: 521 TSSFQKEVARLLVSTGLNWIREYA--VDG-YTVDAVLVDKKVAFEIDGPTHFS-----RN 572
+SSF +EV L+S + ++ +DG YTVD ++++ V EI+G H+ +
Sbjct: 1196 SSSFHREVLSTLLSLDVKNVQCEVPFMDGIYTVD-IVINNSVCIEINGSNHYYYDNNLKR 1254
Query: 573 TGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
+G L L + Y+ + + ++ +S+ +W L+ + E+ DYL
Sbjct: 1255 SGEKLDALNLIKYYLLSKKYKLILVSYLDWNNLKSAEEKKDYL 1297
>gi|410909325|ref|XP_003968141.1| PREDICTED: FAST kinase domain-containing protein 3-like [Takifugu
rubripes]
Length = 665
Score = 43.1 bits (100), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 73/331 (22%), Positives = 133/331 (40%), Gaps = 31/331 (9%)
Query: 313 ALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
A+ V F + V GA HS + K + T + + QV+ F+S
Sbjct: 327 AVRHVPHFTDDELTGVLGALMHFGHSDHYFVDAMEKYVPTMTFTSHPETVTQVIQFFSSR 386
Query: 373 YEPADPLLESLDNAF-KDATQF-TCCLNKALSNCNENGGVKSSGDA---DSEGSLSSPVL 427
+ +L+++ +F A F T + K + + G + + E L S
Sbjct: 387 NILSPTVLDAVAESFVYRADDFSTTQVAKHIMALGKLGYLPPNAGTVFRKVENILHSHFS 446
Query: 428 SFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQC 487
F L N+ S ++ + F S ++K S F +Q + R D +Q+ +
Sbjct: 447 HFQPQSLLNLLHSCTLVERFPVNFVSKVFK--SYFLQQLQEDGNRVDRYVLAQLTQLYMT 504
Query: 488 LKLEHPHLQ-----------------LALSSVLEEKIASAGKTKRFNQKVTSSFQKEVAR 530
+KLE P + +L + ++ + ++ KT N + + ++
Sbjct: 505 MKLECPFYEGPRLPPKYQVKSFLLPGRSLETPVDLHLYNSVKTGLVN--LLGARHYFGSK 562
Query: 531 LLVSTGLNWIREYAVD--GYTVDAVLVD---KKVAFEIDGPTHFSRNTGVPLGHTMLKRR 585
+L S E +D G+ + A VD K++A IDG F+ N LG +K+R
Sbjct: 563 VLTSNCYTLDVEIKLDEEGFVLPASHVDEVCKRIAVCIDGRKRFTVNKRQLLGKEAIKQR 622
Query: 586 YIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
++ G+ VV + E+E+LQ ++YL
Sbjct: 623 HLRLLGYEVVQIPFYEFEKLQNQASVVEYLH 653
>gi|221056200|ref|XP_002259238.1| RAP protein [Plasmodium knowlesi strain H]
gi|193809309|emb|CAQ40011.1| RAP protein, putative [Plasmodium knowlesi strain H]
Length = 740
Score = 42.7 bits (99), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 23/108 (21%), Positives = 46/108 (42%)
Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHT 580
TS KE++ +L + + A + VD E++ P + +
Sbjct: 577 TSMLHKEISDILTQIKVEHLNSVACGPFIVDIYHPHSNCIIEVNAPFQYYLTSEKLTTLA 636
Query: 581 MLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIGGEGS 628
+ +++A G+ ++ +SH+ W L +++DYL L + G GS
Sbjct: 637 EWRHKFLARMGFRIIHISHKVWSSLPTDKQKVDYLSRALPAAMFGRGS 684
>gi|397635941|gb|EJK72081.1| hypothetical protein THAOC_06426, partial [Thalassiosira oceanica]
Length = 198
Score = 42.7 bits (99), Expect = 0.53, Method: Composition-based stats.
Identities = 38/161 (23%), Positives = 64/161 (39%), Gaps = 37/161 (22%)
Query: 286 ISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSE 345
+S AWA + G L E V +G F ++++N A AFA+ S P+LF +
Sbjct: 41 LSITAWAFATSGVSHSELFEKIGNHVVGPGGLGSFKPRDLSNTAWAFATAGVSHPELFKK 100
Query: 346 LAKRASD--IVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSN 403
+ ++ +F+ QEL+ +WA A++ + L + F + L
Sbjct: 101 IGHHVAEQGCFDSFKPQELSNTVWACATVGYTDERLFSA----------FAPVIGSKLDE 150
Query: 404 CNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVL 444
C+E +L NIAW+Y+ L
Sbjct: 151 CSEQ-------------------------ELTNIAWAYSTL 166
>gi|412993943|emb|CCO14454.1| predicted protein [Bathycoccus prasinos]
Length = 970
Score = 42.7 bits (99), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 6/72 (8%)
Query: 317 VGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAF------A 370
+ EFN++++ANV AFA + + +AKRA++I+ TF QEL + L A
Sbjct: 599 IDEFNARDLANVTEAFAKRLDTPEKVLKTIAKRAAEILDTFNAQELLKFLGALERAGGDV 658
Query: 371 SLYEPADPLLES 382
YE + LL S
Sbjct: 659 HKYEKLNELLRS 670
>gi|70947243|ref|XP_743256.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56522666|emb|CAH77389.1| hypothetical protein PC000205.02.0 [Plasmodium chabaudi chabaudi]
Length = 378
Score = 42.7 bits (99), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 18/67 (26%), Positives = 35/67 (52%)
Query: 554 LVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLD 613
L+ K + E+DG +HF + + ++++K + GWN++ + +QEW + +L
Sbjct: 174 LLTKNIVIEVDGISHFYKESYSRTLNSIIKNYILKKFGWNIIHIPYQEWNQCYNFKTKLL 233
Query: 614 YLRVILK 620
Y I K
Sbjct: 234 YAIHIFK 240
>gi|149704825|ref|XP_001497489.1| PREDICTED: protein TBRG4-like isoform 1 [Equus caballus]
Length = 632
Score = 42.7 bits (99), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 153/398 (38%), Gaps = 84/398 (21%)
Query: 259 LAFTRQREMSMLVAIA--MTALPECSAQG-ISNIAWALSKIGGELLYLSEMDRVAEVALT 315
LA +R + +L AI+ + P +G + ++A+A K+G R+A L
Sbjct: 266 LAAQNRRSVPLLRAISYHLVQKPFPLTKGMLLDLAYAYGKLGFH--QTQVFQRLAADLLP 323
Query: 316 KVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YE 374
S VA A +FA ++ LF A+ + + L +L AFA L +
Sbjct: 324 HTPSLTSGEVARCAKSFAFLKWLNLPLFEAFAQHVLNRAQSTTVPHLCNMLLAFARLNFR 383
Query: 375 PADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQL 434
P + QF + + L + E G+ + D
Sbjct: 384 P------------EREDQFFSLVREKLGS--ELAGLDPALQVD----------------- 412
Query: 435 GNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRIS-EQYREDIMFASQVHLVNQCLKLEHP 493
+ W+ VL Q+ + + F Q + E ++ +F +H +N +LEHP
Sbjct: 413 --VVWALCVLQQVREAELRAVLR--PEFHTQFLGGESPKDQSIFQKLLH-INATAQLEHP 467
Query: 494 HLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEV-------------ARLLVSTGLNWI 540
+S L A + ++KVT QKE+ +V+T W+
Sbjct: 468 EY----TSPLLPVSALVPRLSALDKKVTP-LQKELQETLKGLLGSSDRGSFMVATQYGWV 522
Query: 541 --REYAVDGYTVDAVLVD-------------------KKVAF-EIDGPTHFSRNTGVPLG 578
E +D + L D K++AF + P SR+ + LG
Sbjct: 523 LDAEVLLDADSQFLPLRDFVAPHLAPPSGSQPLPPGAKRLAFLRWEFPNFNSRSKDL-LG 581
Query: 579 HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+L RR++ AAG+ VV + + EW EL+ +++ YL+
Sbjct: 582 RFVLARRHVLAAGFLVVDVPYYEWLELKSEWQKGAYLK 619
>gi|302851686|ref|XP_002957366.1| hypothetical protein VOLCADRAFT_98423 [Volvox carteri f.
nagariensis]
gi|300257325|gb|EFJ41575.1| hypothetical protein VOLCADRAFT_98423 [Volvox carteri f.
nagariensis]
Length = 1061
Score = 42.7 bits (99), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 1/91 (1%)
Query: 280 ECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSA 339
+ S Q ++ AWA+ ++G + L RV AL G+ Q+++++ A A H
Sbjct: 298 DASPQALALTAWAVVQLGEQPPPLEWWRRVQGAALRLRGQLQPQDISHLVWATARSGHPP 357
Query: 340 P-DLFSELAKRASDIVHTFQEQELAQVLWAF 369
P D + + A + F+ QE+ +LW
Sbjct: 358 PPDWLAAMCTEAHGCLRGFRAQEVCNLLWGL 388
>gi|189183089|ref|YP_001936874.1| repeat-containing protein A_01 [Orientia tsutsugamushi str. Ikeda]
gi|189179860|dbj|BAG39640.1| repeat-containing protein A_01 [Orientia tsutsugamushi str. Ikeda]
Length = 631
Score = 42.7 bits (99), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 80/397 (20%), Positives = 149/397 (37%), Gaps = 71/397 (17%)
Query: 274 AMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEV----ALTKVGEFNSQNVANVA 329
A+ + + QG++N WAL + L + + E A + F +Q+++N
Sbjct: 164 AIKTIDHFTTQGLANSLWALGR-----LEIHPQAKFIEAWIHHATKTIDHFTTQDLSNSL 218
Query: 330 GAFASMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLES-LDNA 386
A ++ H + + A+ + F Q+L+ LW L P +E+ + +A
Sbjct: 219 WALGRLEIHPQAEFIEAWIRHATKTIDHFTTQDLSNSLWGLGRLEIHPQAKFIEAWIHHA 278
Query: 387 FKDATQFTCCLNKALSNCNEN-GGVKSSGDADSEGSL----SSPVLSFNRDQLGNIAWSY 441
K FT + LSN G ++ A+ + + + F L N W+
Sbjct: 279 TKTIDHFT---TQDLSNSLWGLGRLEIHPQAEFIEAWIRHATKTIDHFTTQDLSNSLWAL 335
Query: 442 AVLGQMDRIFFSDIW-----KTISRFEEQ--------------------RISEQYREDI- 475
L + F + W KTI F Q ++ +Q+ +
Sbjct: 336 GQLEIHPQAEFIEAWIHHATKTIDHFTTQGLANSIYGIFILNVLCDSKIKVPQQFISAVN 395
Query: 476 ----MFASQVHLVNQCLKLEHPHLQLALSSV----------LEEKIAS---AGKTKRFNQ 518
+F + ++Q LK H V LE+K + T
Sbjct: 396 KNIELFDENIEGISQILK---AHYYFGKQGVGILTSQNRQFLEKKFKTKLTPCHTSNLQL 452
Query: 519 KVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLG 578
V +K +A+ LV + ++I++ +VD + DK ++DGP HF N P
Sbjct: 453 NVLKVVKKVLAQHLVKSE-HYIKQITS---SVDIFIKDKNTVIQVDGPCHFDDNNA-PNI 507
Query: 579 HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
T L + + G+ V + + W +L+ + ++ Y+
Sbjct: 508 STRLNTELLKSYGYIVHRIPYWVWNKLRTNTDKEKYI 544
>gi|157864875|ref|XP_001681146.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68124440|emb|CAJ02301.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 442
Score = 42.7 bits (99), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 2/95 (2%)
Query: 283 AQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
A+G++N+ A SK G L + + L +VGEF + ++ +A AFA ++ +
Sbjct: 110 AKGVTNVISAFSKTGINHEKLFGLLSMRVQTLARVGEFEAAHLVILANAFARLRFREQHV 169
Query: 343 FSELAKRASDIVHTFQEQELAQVLWAF--ASLYEP 375
FS +A+RA + EL ++ AF A L +P
Sbjct: 170 FSAIARRAMSLRERVTVNELVPLINAFSKAGLKDP 204
>gi|6841122|gb|AAF28914.1|AF161354_1 HSPC091 [Homo sapiens]
Length = 193
Score = 42.7 bits (99), Expect = 0.68, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 36/62 (58%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
+ K++A IDGP F N+ LG +K+R++ G+ VV + ++E L+ E ++Y
Sbjct: 118 IHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYREIGMLKSRRELVEY 177
Query: 615 LR 616
L+
Sbjct: 178 LQ 179
>gi|302828418|ref|XP_002945776.1| hypothetical protein VOLCADRAFT_127357 [Volvox carteri f.
nagariensis]
gi|300268591|gb|EFJ52771.1| hypothetical protein VOLCADRAFT_127357 [Volvox carteri f.
nagariensis]
Length = 1323
Score = 42.7 bits (99), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 5/52 (9%)
Query: 558 KVAFEIDGPTHFSRNTGVP---LGHTMLKRRYIAAAGWNVVSLSHQEWEELQ 606
++A E+DGP+HF N VP LG T+ + R + A G +V + H EW LQ
Sbjct: 1153 RIAVEVDGPSHFCAN--VPNHALGATVARDRCLQALGLQLVVVPHFEWYLLQ 1202
>gi|156098671|ref|XP_001615351.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148804225|gb|EDL45624.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 735
Score = 42.4 bits (98), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 24/115 (20%), Positives = 52/115 (45%), Gaps = 6/115 (5%)
Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHT 580
TS+ KE++ +L + + A + VD E + P + N+ +
Sbjct: 562 TSTLHKEISSILTLIKIEHLNSVACGPFIVDIYHPPSNYIIEANAPFQYYLNSERLTALS 621
Query: 581 MLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL------KDYIGGEGSS 629
+ +++A G+ ++ +SH+ W L +++DYL +L + + GG+ S+
Sbjct: 622 EWRHKFLARMGFRLIHISHKVWNSLPTEKQRVDYLLRVLPAGMLGRAHPGGKDST 676
>gi|301789123|ref|XP_002929978.1| PREDICTED: FAST kinase domain-containing protein 3-like [Ailuropoda
melanoleuca]
Length = 667
Score = 42.4 bits (98), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 37/62 (59%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
V K+VA ID P FS ++ LG +K+R++ G++VV + + E + L+ E ++Y
Sbjct: 587 VHKRVALCIDDPKRFSLDSKHLLGKEAIKQRHLRLLGYHVVQIPYYEIKMLKSRVELVEY 646
Query: 615 LR 616
L+
Sbjct: 647 LQ 648
>gi|294951459|ref|XP_002786991.1| hypothetical protein Pmar_PMAR006407 [Perkinsus marinus ATCC 50983]
gi|239901581|gb|EER18787.1| hypothetical protein Pmar_PMAR006407 [Perkinsus marinus ATCC 50983]
Length = 633
Score = 42.4 bits (98), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 52/223 (23%), Positives = 95/223 (42%), Gaps = 41/223 (18%)
Query: 186 QFSGPSNRRK----EINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATAL 241
+ G SN+R+ E + + I+ A ++ I+ ++ V K L L+ +N++T +
Sbjct: 147 RVGGHSNQRQATANEFEIQRSILAAANSRS----ISSLLLIVEKHLDE--LNSVNVSTLI 200
Query: 242 HRIA---KNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGG 298
HR+A +N E+ R+ R +L A+ P S Q +SNI WA+ G
Sbjct: 201 HRLASITQNQEQ-----NQRVLANDPRVKEVLRR-AIDLAPTSSCQSLSNICWAI----G 250
Query: 299 ELLYLSE------------------MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
+L + E MD VAE + F Q V+N+ A+ +
Sbjct: 251 KLQMVEEKDVVRAIVEAAKSQLEELMDLVAEKVANTLYTFKPQEVSNLLYAYGRLNCYNE 310
Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESL 383
L E+ + ++ + Q + V+ + A L P L++++
Sbjct: 311 KLLQEICACVATMMPRYDGQGVGNVICSLAKLKYPCIQLMDAI 353
>gi|397606496|gb|EJK59336.1| hypothetical protein THAOC_20457, partial [Thalassiosira oceanica]
Length = 146
Score = 42.4 bits (98), Expect = 0.77, Method: Composition-based stats.
Identities = 32/143 (22%), Positives = 56/143 (39%), Gaps = 35/143 (24%)
Query: 320 FNSQNVANVAGAFASMQHSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPAD 377
FN Q++AN+ +FA + P+LF + + + + +F+ Q+L+ +WAFA+
Sbjct: 31 FNPQHLANILWSFAKSGEADPELFQAIGNHITGLGSLDSFKPQDLSNTIWAFATAGVSYP 90
Query: 378 PLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNI 437
L E + CLN SF + N
Sbjct: 91 ALFEKIGGHIVGLD----CLN-----------------------------SFKQQDFSNT 117
Query: 438 AWSYAVLGQMDRIFFSDIWKTIS 460
AW++A +G+ + F I I+
Sbjct: 118 AWAFAKVGESNPKLFKKIGDYIA 140
>gi|119628500|gb|EAX08095.1| hypothetical protein MGC5297, isoform CRA_b [Homo sapiens]
Length = 194
Score = 42.4 bits (98), Expect = 0.79, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 35/62 (56%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
+ K++A IDGP F N+ LG +K+R++ G+ VV + + E L+ E ++Y
Sbjct: 119 IHKRIALCIDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQVVQIPYHEIGMLKSRRELVEY 178
Query: 615 LR 616
L+
Sbjct: 179 LQ 180
>gi|428186081|gb|EKX54932.1| hypothetical protein GUITHDRAFT_99583 [Guillardia theta CCMP2712]
Length = 824
Score = 42.4 bits (98), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 43/175 (24%), Positives = 73/175 (41%), Gaps = 24/175 (13%)
Query: 230 SPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNI 289
S L P ++ L A+ + T + RQR S CS Q ++N+
Sbjct: 471 SALKPAELSMTLWACARYHHPSKWLYTRFSSEMRQRGFS-----------NCSTQELANL 519
Query: 290 AWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSEL 346
WAL++ G+L+Y D EV+ V NS+++ N+ A + L SE+
Sbjct: 520 CWALTESSDEYGDLVY----DVAQEVSSRPVNPRNSKDMRNILCCIAKSRVPDCGLASEV 575
Query: 347 AKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESLD-----NAFKDATQFTC 395
A+ T + WAF+ + + P+ L +S + +A ++T F C
Sbjct: 576 ARELEASGSTTSVRAWILTFWAFSHIAFIPSSDLQQSFETKVQGDAISNSTTFLC 630
>gi|348561900|ref|XP_003466749.1| PREDICTED: FAST kinase domain-containing protein 3-like [Cavia
porcellus]
Length = 669
Score = 42.4 bits (98), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 34/62 (54%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
V K++A ID P F N LG +K+R++ G+ VV + + E E+L+ + + Y
Sbjct: 588 VHKRIALCIDDPNRFCSNGIHLLGKEAIKQRHLGLLGYEVVQVPYHEMEKLKSRHQLVKY 647
Query: 615 LR 616
L+
Sbjct: 648 LQ 649
>gi|291394925|ref|XP_002713900.1| PREDICTED: transforming growth factor beta regulated gene 4-like
[Oryctolagus cuniculus]
Length = 632
Score = 42.4 bits (98), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 48/185 (25%), Positives = 80/185 (43%), Gaps = 42/185 (22%)
Query: 477 FASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLL---- 532
F +H +N +LEHP S L +A A + +QKVT QKE+ L
Sbjct: 452 FLKLLH-INATARLEHPEY----SGPLLPALAVAPRPPAPDQKVTP-LQKELQETLKGLL 505
Query: 533 ---------VSTGLNWI----------------REYAVDGYTVDA-----VLVDKKVAF- 561
V+T W+ R++ A L K++AF
Sbjct: 506 GSADRGSFEVATQYGWVLDAEVLLDADGQFLPLRDFVAPHLAQPAGGQPLPLGAKRLAFL 565
Query: 562 EIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKD 621
+ P++ SR+ + LG +L RR++ AAG+ VV + + EW EL+ +++ YL+ ++
Sbjct: 566 RWEFPSYNSRSKDL-LGRFVLARRHLLAAGFLVVDVPYYEWLELKSEWQKGAYLKDKMRK 624
Query: 622 YIGGE 626
+ E
Sbjct: 625 VVAEE 629
>gi|407395839|gb|EKF27268.1| hypothetical protein MOQ_009016 [Trypanosoma cruzi marinkellei]
Length = 521
Score = 42.0 bits (97), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 50/92 (54%), Gaps = 4/92 (4%)
Query: 283 AQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
++G++NI A SK G E L+ RV +A +VGEF + ++ +A AF+ +++
Sbjct: 187 SKGVANIISAFSKTGINHEKLFGFLSKRVQTLA--RVGEFEAAHLVIIANAFSRLRYRDK 244
Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
LF +A+RA + EL ++ AF+ +
Sbjct: 245 FLFGAIARRAMSLRERVTVNELVPLIVAFSKI 276
>gi|344268382|ref|XP_003406039.1| PREDICTED: FAST kinase domain-containing protein 1 [Loxodonta
africana]
Length = 839
Score = 42.0 bits (97), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 6/75 (8%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFE--QLDY 614
+++A E F RN G + +K+R++ G++V+ + H EW + S + Q+DY
Sbjct: 769 ERIALEFLYSRAFCRNIPHLKGVSAMKKRHLEILGYHVIQIPHFEWNSMALSTKDAQMDY 828
Query: 615 LRVILKDYIGGEGSS 629
LR + I GEG S
Sbjct: 829 LR----ERIFGEGKS 839
>gi|417403497|gb|JAA48549.1| Putative fast kinase-like protein [Desmodus rotundus]
Length = 632
Score = 42.0 bits (97), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 79/349 (22%), Positives = 126/349 (36%), Gaps = 81/349 (23%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQV 365
+ R+A L S VA A +FA ++ LF A+ + L +
Sbjct: 314 LQRLAADLLPHTPSLTSSEVARCAKSFAFLKWLNLPLFEAFAQHVLSRAQSITVPPLCNM 373
Query: 366 LWAFASL-YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSS 424
L AFA L + P + +F +++ L EG L+S
Sbjct: 374 LLAFARLNFHP------------EQEDEFFSLVHEKL-----------------EGQLAS 404
Query: 425 --PVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQY-REDIMFASQV 481
P L + + W+ VLGQ + +F Q + +Q + F +
Sbjct: 405 LGPALQVD------VLWALCVLGQAQEAELRAV--LCPQFHTQLLGDQSPKGQSTFQKLL 456
Query: 482 HLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEV------------- 528
H VN +LEHP S L A + ++KVT QKE+
Sbjct: 457 H-VNATAQLEHPEY----SGPLLPASALVPRPSALDRKVTP-LQKELQGALKGLLGSADR 510
Query: 529 ARLLVSTGLNWIR----------EYAVDGYTVDAVLVD-----------KKVAFEIDGPT 567
R V W+ ++ G V L K++AF +
Sbjct: 511 GRFTVPMQYGWVLDAEVLLGAEGQFLPLGDFVAPHLAPPSEGQPLPPGAKRLAFLRWEFS 570
Query: 568 HFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+F+ + LG L RR++ AAG+ VV + H EW EL+ +++ YL+
Sbjct: 571 NFNSRSKDLLGRFALARRHVLAAGFLVVDVPHYEWLELKSDWQKGAYLK 619
>gi|71420728|ref|XP_811585.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70876263|gb|EAN89734.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 521
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 50/92 (54%), Gaps = 4/92 (4%)
Query: 283 AQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
++G++NI A SK G E L+ RV +A +VGEF + ++ +A AF+ +++
Sbjct: 187 SKGVANIISAFSKTGINHEKLFGFLSKRVQTLA--RVGEFEAAHLVIIANAFSRLRYRDK 244
Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
LF +A+RA + EL ++ AF+ +
Sbjct: 245 FLFGAIARRAMSLRERVTVNELVPLIVAFSKI 276
>gi|124806224|ref|XP_001350662.1| RAP protein, putative [Plasmodium falciparum 3D7]
gi|23496788|gb|AAN36342.1| RAP protein, putative [Plasmodium falciparum 3D7]
Length = 1505
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 14/51 (27%), Positives = 31/51 (60%)
Query: 554 LVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEE 604
+++K + E+DG +HF + + ++++K + GWNV+ + +QEW +
Sbjct: 1276 VLNKNIVIEVDGISHFYKESFSRTINSVIKDYILKKLGWNVIHIPYQEWNQ 1326
>gi|399217206|emb|CCF73893.1| unnamed protein product [Babesia microti strain RI]
Length = 570
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/109 (22%), Positives = 49/109 (44%)
Query: 516 FNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGV 575
F + F K+VAR+L + ++ +T+D ++ + E P F TG
Sbjct: 462 FGRSYHEDFVKDVARILTLLNIEAVKGVIAGPFTLDLYSSERNLVIECCPPYQFYTQTGS 521
Query: 576 PLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVILKDYIG 624
+ + I A G+++V + +++W L ++ +L IL ++I
Sbjct: 522 YTTCASWRHKLIRAMGFHLVLVPYKKWYSLPSDNDKGAFLTTILPNHIA 570
>gi|407832047|gb|EKF98310.1| hypothetical protein TCSYLVIO_010792 [Trypanosoma cruzi]
Length = 521
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 50/92 (54%), Gaps = 4/92 (4%)
Query: 283 AQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
++G++NI A SK G E L+ RV +A +VGEF + ++ +A AF+ +++
Sbjct: 187 SKGVANIISAFSKTGINHEKLFGFLSKRVQTLA--RVGEFEAAHLVIIANAFSRLRYRDK 244
Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
LF +A+RA + EL ++ AF+ +
Sbjct: 245 FLFGAIARRAMSLRERVTVNELVPLIVAFSKI 276
>gi|401416346|ref|XP_003872668.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488892|emb|CBZ24142.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 442
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 2/95 (2%)
Query: 283 AQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
A+G++NI A SK G L + + L +VGEF + ++ +A AFA ++ +
Sbjct: 110 AKGVTNIISAFSKTGINHEKLFGLLSMRVQTLARVGEFEAAHLVILANAFARLRFREQHV 169
Query: 343 FSELAKRASDIVHTFQEQELAQVLWAF--ASLYEP 375
F +A+RA + EL ++ AF A L +P
Sbjct: 170 FGAIARRAMSLRERVTVNELVPLINAFSKAGLKDP 204
>gi|146078054|ref|XP_001463441.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398010941|ref|XP_003858667.1| hypothetical protein, conserved [Leishmania donovani]
gi|134067526|emb|CAM65806.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322496876|emb|CBZ31947.1| hypothetical protein, conserved [Leishmania donovani]
Length = 442
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 2/95 (2%)
Query: 283 AQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
A+G++NI A SK G L + + L +VGEF + ++ +A AFA ++ +
Sbjct: 110 AKGVTNIISAFSKTGINHEKLFGLLSMRVQTLARVGEFEAAHLVILANAFARLRFREQHV 169
Query: 343 FSELAKRASDIVHTFQEQELAQVLWAF--ASLYEP 375
F +A+RA + EL ++ AF A L +P
Sbjct: 170 FGAIARRAMSLRERVTVNELVPLINAFSKAGLKDP 204
>gi|432929105|ref|XP_004081183.1| PREDICTED: FAST kinase domain-containing protein 3-like [Oryzias
latipes]
Length = 662
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 38/72 (52%), Gaps = 3/72 (4%)
Query: 547 GYTVDAVLVD---KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWE 603
GY + A D K+VA IDG F+ N+ LG +K+R++ G+ V + + E+E
Sbjct: 578 GYVLHASQTDDVCKRVALCIDGQRRFTSNSRQLLGKETMKQRHLRLLGYEVAQIPYYEFE 637
Query: 604 ELQGSFEQLDYL 615
+L ++YL
Sbjct: 638 KLHSKTSVVEYL 649
>gi|68073079|ref|XP_678454.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56498926|emb|CAH97349.1| conserved hypothetical protein [Plasmodium berghei]
Length = 1637
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 46/211 (21%), Positives = 88/211 (41%), Gaps = 43/211 (20%)
Query: 434 LGNIAWSYAVLGQM--DRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLE 491
L W +++ + D I F +I+ + E +I EQ + M+ V + LK
Sbjct: 1434 LARYLWGVSIVNLINDDTINFINIY----NWNEIKIYEQ---NPMYLHMVFTLWLRLKYY 1486
Query: 492 HPHLQLA---------LSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIRE 542
+ HL+L+ ++ +L++ G N+ S+F +++++L + + E
Sbjct: 1487 YAHLKLSKNFLNFIDKITHILKKIYIKNG----LNKDNLSTFHVQISKILDKFNVKYTNE 1542
Query: 543 YAVDGYTVDAVL-----VDKKVAFEIDGPTH-------FSRNTGV-------PLGHTMLK 583
Y + ++ +K+A EIDGP+H NT + G T K
Sbjct: 1543 YITKDLLIIDIIIILKECKEKIAIEIDGPSHHLLDLSDLHENTSINDNKKYLQCGTTYFK 1602
Query: 584 RRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
+ GW V+++ EW +++ E DY
Sbjct: 1603 NFLLKKNGWEVINIPSYEWNKIKK--EDRDY 1631
>gi|159465104|ref|XP_001690764.1| hypothetical protein CHLREDRAFT_180834 [Chlamydomonas reinhardtii]
gi|158270346|gb|EDO96203.1| predicted protein [Chlamydomonas reinhardtii]
Length = 690
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 6/63 (9%)
Query: 320 FNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL------Y 373
FN+Q+V+N A A + ++ DL LA+ + + T Q+L+ +LWA +L Y
Sbjct: 182 FNAQDVSNALWACAKLGYADADLLQRLAEAGAAVAKTMIPQDLSNILWALKALGCTGPAY 241
Query: 374 EPA 376
+PA
Sbjct: 242 QPA 244
>gi|159490231|ref|XP_001703086.1| predicted protein of CLR family [Chlamydomonas reinhardtii]
gi|158270832|gb|EDO96665.1| predicted protein of CLR family [Chlamydomonas reinhardtii]
Length = 1337
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 64/143 (44%), Gaps = 10/143 (6%)
Query: 493 PHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQK--EVARL-LVSTGLNWIREYAVDGYT 549
P L A+ + + A+ T R ++V + Q+ + RL +VS + E +
Sbjct: 1136 PDLLAAMEVAVVAERATGSTTSRLQKQVAEALQRLLQKGRLPIVSVQTEVVVEGVLGRVD 1195
Query: 550 VDAVLVD-KKVAFEIDGPTHFSRN----TGVPLGHTMLKRRYI--AAAGWNVVSLSHQEW 602
+ A D ++VA E+DGP HF N +G T L+ R + A +V + + EW
Sbjct: 1196 IVADWSDGRRVAIEVDGPAHFPTNRKDDPSAVIGSTALRNRQLRRAFGEGGLVCVPYWEW 1255
Query: 603 EELQGSFEQLDYLRVILKDYIGG 625
L+ Q YL L+D + G
Sbjct: 1256 YGLRTPTAQEAYLLQRLQDLLSG 1278
>gi|323449653|gb|EGB05539.1| hypothetical protein AURANDRAFT_66278 [Aureococcus anophagefferens]
Length = 892
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 48/90 (53%), Gaps = 3/90 (3%)
Query: 286 ISNIAWALSKIG---GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDL 342
+ N+A AL+++G G + + A + F+++ +AN A AFA+ AP+L
Sbjct: 501 LGNVAHALARLGAGKGHMDGERAFQSLGRAAAPRAAAFDARELANTAWAFATAGVDAPEL 560
Query: 343 FSELAKRASDIVHTFQEQELAQVLWAFASL 372
A RA+D V + +ELA ++WA A L
Sbjct: 561 MRAFAARAADKVVDYDVRELANLVWALAKL 590
>gi|221501449|gb|EEE27225.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 236
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 37/73 (50%)
Query: 548 YTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQG 607
+ V+ L K V E+DGP HF R++ + LK R +A G+ + + + +W EL
Sbjct: 6 FYVEHELDIKGVVLEVDGPQHFYRDSFHWTSASKLKHRLLAGLGFRIAHVPYFDWLELHT 65
Query: 608 SFEQLDYLRVILK 620
+ YLR L+
Sbjct: 66 EDVRRVYLRCALE 78
>gi|397632551|gb|EJK70608.1| hypothetical protein THAOC_08020 [Thalassiosira oceanica]
Length = 701
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 31/119 (26%), Positives = 57/119 (47%), Gaps = 3/119 (2%)
Query: 270 LVAIAMTALPECSAQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVAN 327
+ + A+ L E A+ +SN+ ++ +G E+ + D E A+ + F Q ++N
Sbjct: 553 IASSAVGMLDEFEARHLSNLIYSFGLVGYNPEIEAETLFDVFGEAAVRILHTFKPQALSN 612
Query: 328 VAGAFASMQHSAPDLFSELAKRASDI-VHTFQEQELAQVLWAFASLYEPADPLLESLDN 385
+ AF + LF E S + + +F+ Q+ A +LW+FA E L ++L N
Sbjct: 613 ILWAFVKVDTKNSRLFQETGGVISGMDLDSFKPQDFANILWSFAKASEADSKLFQALGN 671
Score = 40.0 bits (92), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 61/148 (41%), Gaps = 38/148 (25%)
Query: 309 VAEVALTKVGEFNSQNVANVAGAFASMQHS----APDLFSELAKRASDIVHTFQEQELAQ 364
+A A+ + EF +++++N+ +F + ++ A LF + A I+HTF+ Q L+
Sbjct: 553 IASSAVGMLDEFEARHLSNLIYSFGLVGYNPEIEAETLFDVFGEAAVRILHTFKPQALSN 612
Query: 365 VLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSS 424
+LWAF + K++ F E GGV S D D
Sbjct: 613 ILWAFVKV-------------DTKNSRLF-----------QETGGVISGMDLD------- 641
Query: 425 PVLSFNRDQLGNIAWSYAVLGQMDRIFF 452
SF NI WS+A + D F
Sbjct: 642 ---SFKPQDFANILWSFAKASEADSKLF 666
>gi|82596268|ref|XP_726191.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23481496|gb|EAA17756.1| hypothetical protein [Plasmodium yoelii yoelii]
Length = 834
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 21/89 (23%), Positives = 46/89 (51%)
Query: 527 EVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRY 586
EV+R+L +N +R ++ D +L D + GP + N+ + + LK+
Sbjct: 640 EVSRVLTKINVNHLRNVYINNICADIMLPDSNIIIMCLGPYSYYVNSLLTTSISDLKKNI 699
Query: 587 IAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
+ +NV++L++ +W +L +Q+++L
Sbjct: 700 LKKKKYNVITLNYHDWNKLNDYEDQINFL 728
>gi|221060957|ref|XP_002262048.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193811198|emb|CAQ41926.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 955
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/125 (24%), Positives = 53/125 (42%), Gaps = 20/125 (16%)
Query: 276 TALPECSAQGISNIAWALSKI--GGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA 333
T L C+++ +SN+ +A S + G L+ + + K + + Q +A +A A+
Sbjct: 663 TFLNLCTSEDLSNLCYAYSLVRSGNRELH----SLIQSAIMKKQSDLSPQEIAKIAYAYG 718
Query: 334 SMQ-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQ 392
+M +S+ L S L +H F E+ +LW + N F DA
Sbjct: 719 NMYFYSSYTLLSSLQYEILQRMHQFCHHEICDILWCYCI-------------NRFLDANF 765
Query: 393 FTCCL 397
+ C L
Sbjct: 766 WKCML 770
>gi|84995016|ref|XP_952230.1| hypothetical protein [Theileria annulata]
gi|65302391|emb|CAI74498.1| hypothetical protein TA13450 [Theileria annulata]
Length = 460
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 66/133 (49%), Gaps = 19/133 (14%)
Query: 517 NQKVTSSFQKEVARLLVSTGL-NWIREYAVDGYTVDAVLV--DKKVAFEIDGPTHFSRNT 573
N K+ S QK V+ L+ + + + D +VD + D+K+ E+DGPTHF RN
Sbjct: 336 NGKIISKSQKLVSDFLIRQNIPHQLEILTSDLSSVDIYICLNDEKIILEVDGPTHFIRNL 395
Query: 574 GVP-----LGHTMLKRRYIAAAGWNVVSLS--HQEWEELQGSFEQLD-YLRVILKDYIGG 625
P +G K + + G+ +S+ H + + ++ Q+D Y + +L++
Sbjct: 396 DDPSETRKIGPCHFKEKLLKENGFVFISIPPIHSDTQNIK----QIDEYYKELLQN---- 447
Query: 626 EGSSNIAETLKMD 638
GS+++ E +K +
Sbjct: 448 SGSAHLNEIMKYN 460
>gi|156120709|ref|NP_001095501.1| FAST kinase domain-containing protein 1 [Bos taurus]
gi|151554767|gb|AAI50046.1| FASTKD1 protein [Bos taurus]
gi|296490639|tpg|DAA32752.1| TPA: FAST kinase domains 1 [Bos taurus]
Length = 832
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 39/75 (52%), Gaps = 6/75 (8%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFE--QLDY 614
+K+A E F RN G + +K+R++ G++V+ + H EW + S ++DY
Sbjct: 762 EKIALEFLDSRAFCRNIPHLKGKSAMKKRHLEILGYHVIQIPHFEWNSMALSTRDARMDY 821
Query: 615 LRVILKDYIGGEGSS 629
LR + I GEG S
Sbjct: 822 LR----ERIFGEGKS 832
>gi|83273693|ref|XP_729510.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23487521|gb|EAA21075.1| hypothetical protein [Plasmodium yoelii yoelii]
Length = 689
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 25/103 (24%), Positives = 49/103 (47%), Gaps = 5/103 (4%)
Query: 311 EVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFA 370
++ K+ E + Q+++N+ A++ + + +++ L + + F EQELA +L A++
Sbjct: 299 KIIYNKINELSYQSISNICNAYSKLNPNDTKIYNILINKIKKNIDKFNEQELANILSAYS 358
Query: 371 SL-YEPADPLLESLDNAFKDATQF----TCCLNKALSNCNENG 408
L + D +SL+ F F + A S CN N
Sbjct: 359 KLNIKDFDLFNKSLEYIFHKFYNFKPIEIVMITNAYSKCNINN 401
>gi|294944359|ref|XP_002784216.1| hypothetical protein Pmar_PMAR003475 [Perkinsus marinus ATCC 50983]
gi|239897250|gb|EER16012.1| hypothetical protein Pmar_PMAR003475 [Perkinsus marinus ATCC 50983]
Length = 319
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 42/190 (22%), Positives = 83/190 (43%), Gaps = 24/190 (12%)
Query: 187 FSGPSNRRKEINLNKDIVDAQ-TAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIA 245
S + + + I N+++ + TAQ++L + + + + +N AT HR+A
Sbjct: 77 MSAAAFKSQHIAWNRELTNPNATAQQILALAKKHC---------AQFNSVNWATTFHRLA 127
Query: 246 KNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSE 305
K H E+ L+ ++ + Q ++ +AWA++K+ ++
Sbjct: 128 K-------FHLHEAKSEHSLEIQTLLG-KCDSVEGFAPQHLATLAWAMAKL--HIVDHDL 177
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRA----SDIVHTFQEQE 361
+ RV +LT + Q++AN++ A A + DL E + + F+ E
Sbjct: 178 LSRVVHKSLTLHSDLKPQDLANLSWALARLDCPESDLMYECVCQKIMYDRGCLSQFKPME 237
Query: 362 LAQVLWAFAS 371
LA V+WA A+
Sbjct: 238 LASVMWAIAT 247
>gi|440912811|gb|ELR62346.1| FAST kinase domain-containing protein 1, partial [Bos grunniens
mutus]
Length = 849
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 39/75 (52%), Gaps = 6/75 (8%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFE--QLDY 614
+K+A E F RN G + +K+R++ G++V+ + H EW + S ++DY
Sbjct: 779 EKIALEFLDSRAFCRNIPHLKGKSAMKKRHLEILGYHVIQIPHFEWNSMALSTRDARMDY 838
Query: 615 LRVILKDYIGGEGSS 629
LR + I GEG S
Sbjct: 839 LR----ERIFGEGKS 849
>gi|303276196|ref|XP_003057392.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461744|gb|EEH59037.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1039
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 47/108 (43%), Gaps = 10/108 (9%)
Query: 235 LNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALS 294
+N+ATA R+ +++E R R L A LPE S++ WAL
Sbjct: 193 VNVATAYSRLGRHVEDA-----ERGTLDDARWYLALETRAFALLPELGGWAASSLTWALG 247
Query: 295 KIGGE--LLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
+ G + + ++RV L K E Q VAN+ AFA ++ P
Sbjct: 248 RTGRDPGAKFWEALERVL---LRKASELEPQGVANILWAFAVLERKHP 292
>gi|426220935|ref|XP_004004667.1| PREDICTED: FAST kinase domain-containing protein 1 [Ovis aries]
Length = 832
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 39/75 (52%), Gaps = 6/75 (8%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFE--QLDY 614
+K+A E F RN G + +K+R++ G++V+ + H EW + S ++DY
Sbjct: 762 EKIALEFLDSRAFCRNIPHLKGKSAMKKRHLEILGYHVIQIPHFEWNSMALSTRDARMDY 821
Query: 615 LRVILKDYIGGEGSS 629
LR + I GEG S
Sbjct: 822 LR----ERIFGEGKS 832
>gi|303273294|ref|XP_003056008.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462092|gb|EEH59384.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 445
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 52/111 (46%), Gaps = 9/111 (8%)
Query: 271 VAIAMTALP-----ECSAQGISNIAWALSKIG--GELL--YLSEMDRVAEVALTKVGEFN 321
VA A+ + P E Q ++N+AWA +K+G +L+ YLSE+ V KV ++
Sbjct: 144 VADAVISFPDPIKYELKPQDVANLAWAFAKLGRKKQLMFNYLSEVFAAQAVIDVKVTAYS 203
Query: 322 SQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
+ V+ + AFA++ L + H F ++L WA SL
Sbjct: 204 PKQVSMILWAFATLDIQHQTLLTAAIPMIKARAHEFNPRDLTNTAWALDSL 254
>gi|317420047|emb|CBN82083.1| FAST kinase domain-containing protein 3 [Dicentrarchus labrax]
Length = 669
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 41/73 (56%), Gaps = 3/73 (4%)
Query: 546 DGYTVDAVL---VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW 602
DG+ + A V K++A IDG T F+ LG +K+R++ G+ VV + + E+
Sbjct: 584 DGFMLPASHNKDVYKRMAVCIDGQTRFTTIKRQLLGKEAIKQRHLRLLGYEVVQIPYYEF 643
Query: 603 EELQGSFEQLDYL 615
E+LQ E ++YL
Sbjct: 644 EKLQTKSEVVEYL 656
>gi|355749811|gb|EHH54149.1| hypothetical protein EGM_14923 [Macaca fascicularis]
Length = 657
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 35/62 (56%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
+ +++A IDGP F N+ LG +K+R++ G+ VV + + E L+ E ++Y
Sbjct: 585 IHERIALCIDGPKRFCSNSKHLLGKEAIKQRHLRLLGYQVVQIPYYEIGMLKSRRELVEY 644
Query: 615 LR 616
L+
Sbjct: 645 LQ 646
>gi|291222240|ref|XP_002731123.1| PREDICTED: protein TBRG4-like [Saccoglossus kowalevskii]
Length = 627
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 80/411 (19%), Positives = 161/411 (39%), Gaps = 85/411 (20%)
Query: 245 AKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTAL---PECSAQGISNIAWALSKIGGELL 301
+++M +V++ L+ + +R + +L A+A L E Q + N +A +K+
Sbjct: 252 SEDMSRVALA----LSKSNRRTLPLLRALAYHVLHRHKELGLQTMWNFTYAFAKLN---F 304
Query: 302 YLSE-MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQ 360
Y S+ M+++ L KV + +A A AF+ ++ LF +++ + F++
Sbjct: 305 YHSQLMEKIQGELLQKVPDSTPYMIATFAWAFSYNKYLDKPLFDAMSQYIVSNISHFKQL 364
Query: 361 ELAQVLWAFASL-YEPADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSE 419
+ ++ ++A L Y+P+ E L F +
Sbjct: 365 RICSIIISYARLNYQPSGDFFEKLLTDFDFS----------------------------- 395
Query: 420 GSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFAS 479
+LSS D+L ++ WS +L Q F S + + E+ Y+ +
Sbjct: 396 -ALSS-------DKLVDVVWSLVILQQASAEFISHVLAS-QHLEKLPDGTSYQIQMTRQK 446
Query: 480 QVHLVNQCLKLEHP-HLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLN 538
+H +N KLE P + L + S R N+ ++ S + L + G +
Sbjct: 447 LLH-INTAAKLEQPDYTGPFLPDDFMKPADSLINPGRENESLSPSLNAVMQSLAKAIGGD 505
Query: 539 -WIRE--YAVDGYTVDA-VLVDKKV-------------------------AFEID----G 565
+IR + GYT+DA LVD K+ A+ I
Sbjct: 506 KYIRTNVFTPYGYTIDAEFLVDSKLTPLPINDYKTFYLPEDDTKQEVPEDAYRIAVINWE 565
Query: 566 PTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+ +N+ LG + +R++ + + + EW +L+ +++ Y++
Sbjct: 566 YNKYCQNSKQLLGRYTMTKRHLRGXXFIYFQVPYYEWNDLKSDWQKTAYIK 616
>gi|355691207|gb|EHH26392.1| hypothetical protein EGK_16351 [Macaca mulatta]
Length = 657
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 35/62 (56%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
+ +++A IDGP F N+ LG +K+R++ G+ VV + + E L+ E ++Y
Sbjct: 585 IHERIALCIDGPKRFCSNSKHLLGKEAIKQRHLRLLGYQVVQIPYYEIGMLKSRRELVEY 644
Query: 615 LR 616
L+
Sbjct: 645 LQ 646
>gi|327270690|ref|XP_003220122.1| PREDICTED: FAST kinase domain-containing protein 3-like [Anolis
carolinensis]
Length = 662
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 35/62 (56%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
V +++A ID F N+ LG +K+R++ G+NVV + E+++LQ + L+Y
Sbjct: 589 VHQRIALCIDDQKRFCTNSHNLLGREAIKQRHLQLLGYNVVQIPFFEFQQLQNRGDILEY 648
Query: 615 LR 616
L
Sbjct: 649 LH 650
>gi|404216429|ref|YP_006670625.1| hypothetical protein KTR9_3834 [Gordonia sp. KTR9]
gi|403647228|gb|AFR50468.1| hypothetical protein KTR9_3834 [Gordonia sp. KTR9]
Length = 306
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 41/91 (45%), Gaps = 6/91 (6%)
Query: 516 FNQKVTSSFQKEVARLLVSTGLN-WIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTG 574
+ S ++ RL + GL W+ GY +D D KVA EIDG F R+T
Sbjct: 184 LGEGARSEAERMTVRLFTAGGLTGWVANMPAHGYVIDFAFPDVKVAIEIDGFA-FHRDTR 242
Query: 575 VPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL 605
+KR + A GW V++ + W +L
Sbjct: 243 T-FQRDRVKRNLLTAKGWTVLNFT---WADL 269
>gi|294895650|ref|XP_002775245.1| hypothetical protein Pmar_PMAR015474 [Perkinsus marinus ATCC 50983]
gi|239881304|gb|EER07061.1| hypothetical protein Pmar_PMAR015474 [Perkinsus marinus ATCC 50983]
Length = 984
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 29/45 (64%)
Query: 571 RNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYL 615
++T V +G ++LK R++ GW VV + EW LQ + +++DYL
Sbjct: 913 KSTRVMIGSSLLKVRHLMTLGWKVVPIWISEWSSLQSTKDRVDYL 957
>gi|124513816|ref|XP_001350264.1| RAP protein, putative [Plasmodium falciparum 3D7]
gi|23615681|emb|CAD52673.1| RAP protein, putative [Plasmodium falciparum 3D7]
Length = 1017
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/66 (24%), Positives = 38/66 (57%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
KK+ E++G HF +NT + + K + ++ G+ V+++ + +W L F++ Y++
Sbjct: 890 KKLIIEVNGEHHFYKNTKSYISLSKFKHKLLSDLGYVVINIPYFDWAILNTDFDKKAYIK 949
Query: 617 VILKDY 622
++ D+
Sbjct: 950 KLIYDH 955
>gi|156100253|ref|XP_001615854.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148804728|gb|EDL46127.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 1615
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 24/101 (23%), Positives = 49/101 (48%), Gaps = 15/101 (14%)
Query: 521 TSSFQKEVARLLVSTGLNWIREY-AVDGYTVDAVLVDK----KVAFEIDGPTHF------ 569
S F ++V ++L G+ + E+ A + ++D + D+ ++A E+DGP+H
Sbjct: 1485 VSDFHQQVCQVLDKFGVKYENEHMAQELLSIDLAIRDEAAGERIAVEVDGPSHHLVLLDE 1544
Query: 570 ----SRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQ 606
++ P G T K + GW V+++ +W +L+
Sbjct: 1545 TDPRAKKMYAPCGTTHFKNWLLRKMGWTVINIEAHKWNKLR 1585
>gi|406885763|gb|EKD32892.1| hypothetical protein ACD_76C00122G0008 [uncultured bacterium]
Length = 118
Score = 40.8 bits (94), Expect = 2.0, Method: Composition-based stats.
Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 10/91 (10%)
Query: 518 QKVTSSFQKEVARLLVS-------TGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFS 570
++V FQ + +LL S G + R+Y + Y VD + ++A E+DGPTH
Sbjct: 14 RRVLRLFQTKAEKLLWSKIKRKQLNGCKFRRQYGIGPYIVDFYCPEIRLAIEVDGPTH-- 71
Query: 571 RNTGVPLGHTMLKRRYIAAAGWNVVSLSHQE 601
+ + + ++RYI + G VV + ++E
Sbjct: 72 -DNHLAKEYDDFRQRYIESLGIRVVRVYNEE 101
>gi|403222079|dbj|BAM40211.1| conserved hypothetical protein [Theileria orientalis strain
Shintoku]
Length = 537
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 44/191 (23%), Positives = 81/191 (42%), Gaps = 38/191 (19%)
Query: 434 LGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVH----------- 482
L I WS ++L FS+ +TI++ E I E + + +Q++
Sbjct: 313 LIRILWSLSILKVRLAEVFSNALETIAKLLEDTIDELSLKRLAHINQLYSILKSLRHSIH 372
Query: 483 -------------LVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVA 529
+ ++C +++ +++ ++L A +GK +QK+ S F +
Sbjct: 373 ADVRKGDGAVNDAVTDECDEIDRL-MEVCKENLLSHNYAQSGKIISKSQKLVSDF---LI 428
Query: 530 RLLVSTGLNWIREYAVDGYTVDA--VLVDKKVAFEIDGPTHFSRNTGVP-----LGHTML 582
R + L I D ++D +L D+ +A E+DGPTHF RN P G
Sbjct: 429 RANIPHQLEII---TPDLLSIDIRIILDDEMIALEVDGPTHFLRNIEDPEVVMETGPCSF 485
Query: 583 KRRYIAAAGWN 593
K+ + +G+N
Sbjct: 486 KKELLTRSGYN 496
>gi|294882591|ref|XP_002769754.1| hypothetical protein Pmar_PMAR004835 [Perkinsus marinus ATCC 50983]
gi|239873503|gb|EER02472.1| hypothetical protein Pmar_PMAR004835 [Perkinsus marinus ATCC 50983]
Length = 677
Score = 40.8 bits (94), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 71/316 (22%), Positives = 131/316 (41%), Gaps = 61/316 (19%)
Query: 330 GAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLESL-DNAF 387
G+ S +A L S L R + I T + A+++ AF++L Y P+ +L L D A
Sbjct: 399 GSLQSTLETAHALHSYLGSRLNAI--TPSAVDAARLVAAFSNLSYLPSHTILTKLMDIAL 456
Query: 388 KDATQFT----CCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAV 443
+ A+ F C AL+ ++ G + S DQ +I W+ V
Sbjct: 457 EGASSFYPNTYCMYAIALAQLHQTGHRLPPCNE-----------SLTIDQACSILWTGVV 505
Query: 444 LGQMDRI--FFSDIWKTISR--------FEEQRISEQYREDIMFASQVHLVNQCLKLEHP 493
L +D + + I++ F Q I+ + ++ +Q L+ +P
Sbjct: 506 L-DIDGVESIMERVLACIAKEGDLPSLPFARQAIAGLWARGMVTEAQ-QLIG-----AYP 558
Query: 494 HLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVARLLVSTGLNWIR-EYAVDG-YTVD 551
+ L E AS SS +++ L S G +R E + G Y D
Sbjct: 559 ------AGTLTENPAS------------SSLHTNISQTLRSMGYGNVRDEVEICGIYRAD 600
Query: 552 AVLVDKKVAFEIDGPTHF--SRNTG---VPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQ 606
V+ D + E DG H+ S ++G + +G ++++ + AGW V+ +S W+ +
Sbjct: 601 VVIDDLGIVIECDGDVHYLYSPDSGCSDILIGSSVIRDKVFINAGWKVIRVSVAAWKNCK 660
Query: 607 GSFEQLDYLRVILKDY 622
+ ++ LR ++ ++
Sbjct: 661 DAAGKVAMLRRLINNH 676
>gi|397583153|gb|EJK52533.1| hypothetical protein THAOC_28177, partial [Thalassiosira oceanica]
Length = 376
Score = 40.8 bits (94), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 53/237 (22%), Positives = 92/237 (38%), Gaps = 53/237 (22%)
Query: 342 LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKAL 401
LF +L+ A F+ QE+A LWA A++ L +L +
Sbjct: 3 LFEKLSTEAVVNKEHFKAQEVANFLWACATVGHTDQRLFSALTSV--------------- 47
Query: 402 SNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISR 461
++S + FN +L NI W+Y+V + F + + +
Sbjct: 48 --------------------IASKLDKFNEQELANITWTYSVANTPSQDLFGEGYVSALA 87
Query: 462 FEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVT 521
E S ++ A +LE + L L+ K +A ++ +++
Sbjct: 88 SNENEFSVEH-----LAQLHQWQLWQQELES---GMELPQSLQAKCRNAFTSRGYSE--- 136
Query: 522 SSFQKEVARLLVSTGLNWIREYAV-DGYTVDAVLV---DKKVAFEIDGPTHFSRNTG 574
S Q +V L + GL+ E + GY +DA++ ++VA E+DGP RN G
Sbjct: 137 SKLQNDVVDELKAVGLDLEEEVLLGSGYRIDALVKIGDGRRVAVEVDGP---RRNVG 190
Score = 39.7 bits (91), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 31/61 (50%)
Query: 320 FNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPL 379
F +Q VAN A A++ H+ LFS L + + F EQELA + W ++ P+ L
Sbjct: 18 FKAQEVANFLWACATVGHTDQRLFSALTSVIASKLDKFNEQELANITWTYSVANTPSQDL 77
Query: 380 L 380
Sbjct: 78 F 78
>gi|258511520|ref|YP_003184954.1| Superfamily I DNA and RNA helicase and helicase subunits-like protein
[Alicyclobacillus acidocaldarius subsp. acidocaldarius
DSM 446]
gi|257478246|gb|ACV58565.1| Superfamily I DNA and RNA helicase and helicase subunits-like protein
[Alicyclobacillus acidocaldarius subsp. acidocaldarius
DSM 446]
Length = 1403
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 47/108 (43%), Gaps = 16/108 (14%)
Query: 519 KVTSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKK------VAFEIDGPTHFSRN 572
+ S F++EVA L G + GY +D +VD +A E DG T+ S
Sbjct: 1277 RYDSPFEEEVATELRKLGYTVNTQVGFSGYRIDLAIVDPDNPERYLLAVECDGATYHS-- 1334
Query: 573 TGVPLGHTMLKRRYIAAAGWNV--------VSLSHQEWEELQGSFEQL 612
+ V ++R++ GWNV + H+E E++Q QL
Sbjct: 1335 SKVARERDFYRQRFLEQHGWNVHRVWSRNWLKAKHKEIEKIQSRIRQL 1382
>gi|145348368|ref|XP_001418622.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578852|gb|ABO96915.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 586
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 44/186 (23%), Positives = 78/186 (41%), Gaps = 24/186 (12%)
Query: 198 NLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMT-- 255
++ + + +A +A+ L V+ + A ++ ATALHR+AK S +
Sbjct: 62 DIQRMLANADSAEAALRVVESDLDA---------FDAVHAATALHRVAKFSAPESRLERD 112
Query: 256 -THRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVA- 313
+ T L A T + E A G++N+AW+ +KIG Y + + +A
Sbjct: 113 FSRAEGLTNDGRFRALAASVATRVDEFDAFGLANVAWSFAKIG----YTPSQETLGALAA 168
Query: 314 -----LTKVG-EFNSQNVANVAGAFASMQHSAP-DLFSELAKRASDIVHTFQEQELAQVL 366
++K G Q+++N AF M+ P L + + F+ EL+ +L
Sbjct: 169 RLEREVSKQGARLKPQSLSNATYAFGRMRFKPPRSTLEALCAATTREMGEFRADELSGML 228
Query: 367 WAFASL 372
A L
Sbjct: 229 LGLAHL 234
>gi|40645472|dbj|BAD06581.1| arginine decarboxylase [Nicotiana tabacum]
Length = 733
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 41/85 (48%), Gaps = 9/85 (10%)
Query: 327 NVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNA 386
+ A +MQH +F L RA + VH EQE + L AFASL A L +S +N
Sbjct: 630 SCADVLRAMQHEPELMFETLKHRAEEFVHNDDEQEEDKGL-AFASL---ASSLAQSFNNM 685
Query: 387 FKDATQFTCCLNKALSN-----CNE 406
T +CCL A +N CN+
Sbjct: 686 PYLVTNSSCCLTAAANNGGYYYCND 710
>gi|302832912|ref|XP_002948020.1| hypothetical protein VOLCADRAFT_103642 [Volvox carteri f.
nagariensis]
gi|300266822|gb|EFJ51008.1| hypothetical protein VOLCADRAFT_103642 [Volvox carteri f.
nagariensis]
Length = 1327
Score = 40.8 bits (94), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 57/112 (50%), Gaps = 16/112 (14%)
Query: 279 PECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKV----GEFNSQNVANVAGAFAS 334
P ++Q ISN+ +A++ +G E+ E+ AE+ L V GE N+Q ++NV A
Sbjct: 666 PPFNSQEISNVLYAIASMGYEIDPEGEL---AELLLDAVHFRLGEANAQELSNVMWCLAV 722
Query: 335 MQ--HSAP---DLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLE 381
+Q S P D F+ R + TF+ +LAQ L+ A L P PL E
Sbjct: 723 LQIRPSQPWLDDYFTAAHSR----LPTFKPVDLAQSLYGVAKLRLPLQPLPE 770
>gi|422339502|ref|ZP_16420460.1| putative DNA helicase [Fusobacterium nucleatum subsp. polymorphum
F0401]
gi|355370932|gb|EHG18307.1| putative DNA helicase [Fusobacterium nucleatum subsp. polymorphum
F0401]
Length = 1230
Score = 40.8 bits (94), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 43/87 (49%), Gaps = 4/87 (4%)
Query: 513 TKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVD--AVLVDKKVAFEIDGPTHFS 570
T+ + S F++EV + LVS G + +++ V Y +D A+ DKK+A E DG S
Sbjct: 956 TEEIEKNSESIFEEEVVKYLVSEGYHIKQQWEVGAYRIDMVALFQDKKIAIECDGEKWHS 1015
Query: 571 RNTGVPLGHTMLKRRYIAAAGWNVVSL 597
T + M ++ + GW + +
Sbjct: 1016 --TEEQIKQDMERQSILERCGWEFIRI 1040
>gi|115728540|ref|XP_785534.2| PREDICTED: protein TBRG4-like [Strongylocentrotus purpuratus]
Length = 616
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 75/373 (20%), Positives = 152/373 (40%), Gaps = 63/373 (16%)
Query: 259 LAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEVAL---T 315
L T R + +L AI+ L + S I + +S +G L +E A+
Sbjct: 284 LVATNTRSLPILRAISYQLLEQRSQWEIPAMMDIMSAMGN--LGFHNAALFSEFAVHIQQ 341
Query: 316 KVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YE 374
K+ E + + ++A +A ++ A +L + + + +L ++L+A++ Y+
Sbjct: 342 KLDECSLSLLCDIAKTYAVLRIQASNLLDSIHTVLAGALDELTILDLKRLLFAYSQFSYQ 401
Query: 375 PADPLLESLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQL 434
P D +T + NK +N ++ G +DQ+
Sbjct: 402 PPDA-----------STFYVEVGNKLDANFDDYSG---------------------KDQI 429
Query: 435 GNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISEQYREDIMFASQVHLVNQCLKLEHPH 494
++A S VL Q+ + + I K I +E +S + ++ +N +L+ P
Sbjct: 430 -DVAHSLTVLKQVPQKIVTKILKNIEESQEAPLSGTLKLKLL------QINAYSQLDFPD 482
Query: 495 LQLALSSVLEEKIASAGKTKRFNQKV-TSSFQKEVARLL-VSTG--LNWIREYAVD-GYT 549
+ + S K+ N K+ T++ + + ++L S G L + D GY
Sbjct: 483 YE-------GPYLTSDLKSFPANHKIYTTTLHRSLFKVLQASLGDDLTMVENVKSDLGYV 535
Query: 550 VDAVLVDK------KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWE 603
+DA + K K+A GP + +T +G + ++ G+ V+ + +Q+W
Sbjct: 536 IDAEISSKLKGNGQKLAIMTFGPPSYLYSTTQLVGRLEMMLSHLELTGYQVLQIPYQDWY 595
Query: 604 ELQGSFEQLDYLR 616
L+ +Q+ YL+
Sbjct: 596 PLRTPVQQVHYLK 608
>gi|124504899|ref|XP_001351192.1| conserved Plasmodium protein, unknown function [Plasmodium
falciparum 3D7]
gi|3764010|emb|CAA15603.1| conserved Plasmodium protein, unknown function [Plasmodium
falciparum 3D7]
Length = 768
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 27/110 (24%), Positives = 55/110 (50%), Gaps = 15/110 (13%)
Query: 290 AWALSKIGGELLYLSEMDRVAEVALTK-----VGEFNSQNVANVAGAFASMQH--SAPDL 342
++ +S+I L+ MD L K + + + Q+++N+ A++ + + + DL
Sbjct: 319 SFDISQIVNSYTRLNYMDDKLFSYLKKYIDQQIDDMSFQSISNICNAYSKLLNIENYEDL 378
Query: 343 FSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQ 392
F +L R D +H F+ QE+A +L +++ LY +++ FKD
Sbjct: 379 FFKLRVRIRDNIHEFKPQEVANILNSYSKLY--------NINGIFKDVIH 420
>gi|221061533|ref|XP_002262336.1| RAP protein [Plasmodium knowlesi strain H]
gi|193811486|emb|CAQ42214.1| RAP protein, putative [Plasmodium knowlesi strain H]
Length = 1273
Score = 40.4 bits (93), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 13/50 (26%), Positives = 30/50 (60%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEE 604
++K + E+DG +HF R + ++++K + GW+++ + +QEW +
Sbjct: 1071 IEKNILVEVDGVSHFYRESHSRAINSIIKNFILEKCGWHIIHIPYQEWNQ 1120
>gi|156101285|ref|XP_001616336.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148805210|gb|EDL46609.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 994
Score = 40.4 bits (93), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 21/96 (21%), Positives = 48/96 (50%), Gaps = 17/96 (17%)
Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRV 617
K+ E++G HF +N+ + LK + ++ G+ VV++ + EW +L+ + ++ Y++
Sbjct: 868 KLIIEVNGEHHFYKNSKSYTALSKLKHKLLSDLGYTVVNIPYFEWGQLKSNLDRKAYIKK 927
Query: 618 ILKDY-----------------IGGEGSSNIAETLK 636
++ D +GGE + +A T++
Sbjct: 928 LISDSLTFEVVNVLPLNQKSEPLGGEEMAKVASTIR 963
>gi|291461161|ref|ZP_06027290.2| DNA helicase [Fusobacterium periodonticum ATCC 33693]
gi|291378403|gb|EFE85921.1| DNA helicase [Fusobacterium periodonticum ATCC 33693]
Length = 1621
Score = 40.4 bits (93), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 43/87 (49%), Gaps = 4/87 (4%)
Query: 513 TKRFNQKVTSSFQKEVARLLVSTGLNWIREYAVDGYTVD--AVLVDKKVAFEIDGPTHFS 570
T+ + S F++EV + LVS G + +++ V Y +D A+ DKK+A E DG S
Sbjct: 1356 TEEIEKNSESIFEEEVVKYLVSEGYHIKQQWEVGAYRIDMVALFQDKKIAIECDGEKWHS 1415
Query: 571 RNTGVPLGHTMLKRRYIAAAGWNVVSL 597
T + M ++ + GW + +
Sbjct: 1416 --TEEQIKQDMERQSILERCGWEFIRI 1440
>gi|221057670|ref|XP_002261343.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|194247348|emb|CAQ40748.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 944
Score = 40.4 bits (93), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 16/69 (23%), Positives = 40/69 (57%)
Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRV 617
K+ E++G HF +N+ + LK + ++ G+ V+++ + EW +L+ + ++ Y++
Sbjct: 818 KLIIEVNGEHHFYKNSKSYTALSKLKHKLLSDLGYTVINIPYFEWGQLKTNLDKKAYIKK 877
Query: 618 ILKDYIGGE 626
++ D + E
Sbjct: 878 LISDSLNFE 886
>gi|156103321|ref|XP_001617353.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148806227|gb|EDL47626.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 1234
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 20/78 (25%), Positives = 39/78 (50%), Gaps = 4/78 (5%)
Query: 547 GYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEEL- 605
Y V V K + E+DG +HF + + ++++K+ + GW+++ + +QEW +
Sbjct: 1017 AYPVLQKRVKKNILVEVDGVSHFYKESHSRTINSIIKKFILQKCGWHIIHIPYQEWNQCV 1076
Query: 606 ---QGSFEQLDYLRVILK 620
+ L LR IL+
Sbjct: 1077 DFRRKVLYALQVLRQILR 1094
>gi|66359096|ref|XP_626726.1| hypothetical protein [Cryptosporidium parvum Iowa II]
gi|46228239|gb|EAK89138.1| hypothetical protein with transmembrane or GPI anchor sequence at
carboxy terminus [Cryptosporidium parvum Iowa II]
gi|323509501|dbj|BAJ77643.1| cgd3_1520 [Cryptosporidium parvum]
Length = 589
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 46/220 (20%), Positives = 94/220 (42%), Gaps = 35/220 (15%)
Query: 184 LSQFSGPSNRRK-----------------EINLNKDIVDAQTAQEVLEVIAEMITAVGKG 226
+ QFSGP +R + +NK I +++ E+L ++ I
Sbjct: 44 IGQFSGPYEQRNITYNNGVLYSRDEHIVFNLKMNKIITASESFGELLGIVHCHIYY---- 99
Query: 227 LSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLV-AIAMTALPEC--SA 283
L+ +N+ + LH++A +S + R +L+ I + + C S
Sbjct: 100 -----LNEINMVSILHKLAV----LSQSNNFKGRIKRDERFRLLLDVIVLRSNFPCRFSP 150
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
+ +SNIAW+L K+G L D V ++ ++ F S N++ + +FA +LF
Sbjct: 151 KELSNIAWSLVKLG--LNNHKIFDFVCNESIIQLERFISINLSIILWSFAKAGKFNKNLF 208
Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESL 383
+ + + Q+++ + W+++ + + L E+L
Sbjct: 209 VYAIPKILSELDNLEPQQISNIAWSYSKVGLVSPHLFENL 248
>gi|221055575|ref|XP_002258926.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193808996|emb|CAQ39699.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 613
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 46/90 (51%), Gaps = 4/90 (4%)
Query: 286 ISNIAWALSKIG-GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFS 344
IS IA +K+ G+ M+ E+ ++ E + Q+++N+ A++ + + LF
Sbjct: 226 ISQIANCFAKLNYGDANLFKHME---ELICERIDELSCQSISNICNAYSKLSLGSETLFC 282
Query: 345 ELAKRASDIVHTFQEQELAQVLWAFASLYE 374
L K + F EQE+A +L A++ L E
Sbjct: 283 LLIKAVKKKLDNFNEQEIANILNAYSKLGE 312
>gi|412994033|emb|CCO14544.1| predicted protein [Bathycoccus prasinos]
Length = 790
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 47/205 (22%), Positives = 92/205 (44%), Gaps = 27/205 (13%)
Query: 270 LVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEMDRVAEV--ALTKVGEF-----NS 322
L+ + T +PE S G+SN++WAL++ L+ + RV + A++K ++
Sbjct: 431 LLEMCETKIPEMSPLGLSNVSWALAR-----LFPDDPTRVKSLLSAISKRSALQMKYADA 485
Query: 323 QNVANVAGAFASMQHSAPD-LFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLE 381
+ ++ + A A++ L + +RA +I F+ ++A LWA+A L
Sbjct: 486 KCLSTILWALAALGFEPRSRLLASAQRRACEIEEEFRAPDVANALWAYAKWAR----LFS 541
Query: 382 SLDNAFKDATQFTCCLNKALSNCNENGGVKSSGDADSEGSL---SSPVL-SFNRDQLGNI 437
A K++ ++ + +E GD SL S V+ +F+ Q NI
Sbjct: 542 GGVGALKESVDYS-----EDESVDEGSSKSYGGDRAVITSLLRQSEAVMETFSAYQCANI 596
Query: 438 AWSYAVL-GQMDRIFFSDIWKTISR 461
WS A L ++ + ++ + I++
Sbjct: 597 CWSSATLNAKLPETYLENLLERIAK 621
>gi|303288517|ref|XP_003063547.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455379|gb|EEH52683.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 807
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 61/139 (43%), Gaps = 23/139 (16%)
Query: 255 TTHRLAFTRQREMSMLVAIAMTALP-----ECSAQGISNIAWALSKI---------GGEL 300
T H + R ++ A+ ALP +A ++N+AWA +K GG
Sbjct: 30 TGHGVGDDGDRASFAAISEALLALPGGTFDALTAPQLANVAWAFAKANDAGGGSTRGGPS 89
Query: 301 L---------YLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRAS 351
+ S +A A ++ +F++Q + +VA AFA+ LF+ A+RA
Sbjct: 90 SSSISSISSPFASLFAALARSAASRANDFSAQELTDVAWAFANAGCVDGRLFAAFARRAE 149
Query: 352 DIVHTFQEQELAQVLWAFA 370
+ F ++EL WAFA
Sbjct: 150 TLADDFDDEELDNAEWAFA 168
>gi|159489962|ref|XP_001702960.1| predicted protein [Chlamydomonas reinhardtii]
gi|158270983|gb|EDO96813.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1282
Score = 40.0 bits (92), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 57/120 (47%), Gaps = 20/120 (16%)
Query: 266 EMSMLVAIAMTALPECSAQGISNIAWALSKIG----GELL--YLSEMDRVAEVALTKVGE 319
E+S LVA LP + ++N+ WA+ K+G LL +L E A ++ +
Sbjct: 549 EVSELVA---QRLPTFDPRAVANVLWAVCKLGYSPAPPLLNQFLFE-------AYVRMEK 598
Query: 320 FNSQNVANVAGAFASM----QHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEP 375
FN+Q +AN++ A A++ + P + A V + QELA + WA + L P
Sbjct: 599 FNAQELANLSWALATLAAMGRQPVPAWLRKFISAAKLHVDELKPQELAHMAWALSRLCPP 658
>gi|72389356|ref|XP_844973.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62358897|gb|AAX79348.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70801507|gb|AAZ11414.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 516
Score = 40.0 bits (92), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 49/92 (53%), Gaps = 4/92 (4%)
Query: 283 AQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
A+ ++NI A SK G E L+ RV +A +VGEF + ++ +A AF+ +++
Sbjct: 182 AKDVTNIISAFSKTGINHEKLFAFLSRRVQTLA--RVGEFEAAHLVILANAFSRLRYRDK 239
Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
LF +A+RA + EL ++ AF+ +
Sbjct: 240 FLFGAIARRAMSLRERVTVNELVPLIVAFSKI 271
>gi|389584499|dbj|GAB67231.1| hypothetical protein PCYB_112520 [Plasmodium cynomolgi strain B]
Length = 941
Score = 40.0 bits (92), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 16/69 (23%), Positives = 39/69 (56%)
Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRV 617
K+ E++G HF +N+ + LK + + G+ V+++ + EW +L+ + ++ Y++
Sbjct: 815 KLIIEVNGEHHFYKNSKSYTSLSKLKHKLLCDLGYTVINIPYFEWGQLRTNLDKKAYIKK 874
Query: 618 ILKDYIGGE 626
++ D + E
Sbjct: 875 LISDSLSFE 883
>gi|261328305|emb|CBH11282.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 516
Score = 40.0 bits (92), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 49/92 (53%), Gaps = 4/92 (4%)
Query: 283 AQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
A+ ++NI A SK G E L+ RV +A +VGEF + ++ +A AF+ +++
Sbjct: 182 AKDVTNIISAFSKTGINHEKLFAFLSRRVQTLA--RVGEFEAAHLVILANAFSRLRYRDK 239
Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
LF +A+RA + EL ++ AF+ +
Sbjct: 240 FLFGAIARRAMSLRERVTVNELVPLIVAFSKI 271
>gi|397640805|gb|EJK74327.1| hypothetical protein THAOC_03999, partial [Thalassiosira oceanica]
Length = 400
Score = 40.0 bits (92), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 45/170 (26%), Positives = 80/170 (47%), Gaps = 29/170 (17%)
Query: 302 YLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-----LFSELAKRASDIVHT 356
YL D +A + + EF++++++N+ +F ++ + PD LF+ K A I+HT
Sbjct: 230 YLLIFDSIASSTVDMLNEFDARHMSNLIYSFGLVERN-PDIGGETLFNVFGKAAVKILHT 288
Query: 357 FQEQELAQVLWAF-------ASLYEPADPLLESLD-NAFKD-ATQFTCCL----NKALSN 403
F Q+++ +L AF ++L++ L LD F + A CL +ALSN
Sbjct: 289 FNSQDISNMLLAFVYVDAKNSALFQKTGEELLGLDLGEFTEQALANILCLYDFWPQALSN 348
Query: 404 CNENGGVKSSGDADSEG--------SLSSPVLSFNRDQLGNIAWSYAVLG 445
++G++ E +L + SF+ L N AW++A G
Sbjct: 349 V--VWAYATAGESHPELFKKMGDHIALLERLDSFDPQALSNTAWAFATAG 396
>gi|294865634|ref|XP_002764450.1| hypothetical protein Pmar_PMAR026874 [Perkinsus marinus ATCC 50983]
gi|239863879|gb|EEQ97167.1| hypothetical protein Pmar_PMAR026874 [Perkinsus marinus ATCC 50983]
Length = 195
Score = 40.0 bits (92), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 89/201 (44%), Gaps = 36/201 (17%)
Query: 183 RLSQFSGPSNRRK--EINLNKDIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATA 240
R+ + +G R K ++ L + ++DA T VLE++ T +G +N A A
Sbjct: 14 RMYEVAGRLRRGKSGDLVLQRRLMDASTPAAVLEIVLPNATKLGS---------VNYACA 64
Query: 241 LHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGEL 300
LHR A R R +S + +A+ + + A+ + I WAL+ + EL
Sbjct: 65 LHRCA---------VWFRSGKRRPSGLSQVPRLALQTVRDWRAREAATITWALA-VTREL 114
Query: 301 LYLSEMDRVAEVALTKVGEFNSQNVANVAGAFA-----SMQHSAPDLFSELAKR--ASDI 353
++ E R++ E + ++ANV + Q +A + +AKR A D+
Sbjct: 115 DHILEFARLS----MSCDEASGGDLANVVHSLTISGLNPRQCTAT--LAVVAKRVTAMDL 168
Query: 354 VHT--FQEQELAQVLWAFASL 372
H+ + ++LA V W F L
Sbjct: 169 SHSGVIEPKQLAAVFWGFVKL 189
>gi|399218084|emb|CCF74971.1| unnamed protein product [Babesia microti strain RI]
Length = 480
Score = 40.0 bits (92), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 48/102 (47%), Gaps = 3/102 (2%)
Query: 279 PECSAQGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHS 338
P+ S+QG+S I ++SK ++ S R + + ++ EFN + VA A + +
Sbjct: 222 PKFSSQGLSLILNSISKYNDDI---SLFQRYSMIIQLRIDEFNIHSCCLVASAVSRANYK 278
Query: 339 APDLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLL 380
L LA+R + Q +A + ++FA L PL+
Sbjct: 279 EIKLLEVLAERVGKQSNELYPQAVATLAYSFAKLNHLHGPLM 320
>gi|302848319|ref|XP_002955692.1| hypothetical protein VOLCADRAFT_121443 [Volvox carteri f.
nagariensis]
gi|300259101|gb|EFJ43332.1| hypothetical protein VOLCADRAFT_121443 [Volvox carteri f.
nagariensis]
Length = 500
Score = 39.7 bits (91), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 306 MDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPD-LFSELAKRASDIVHTFQEQELAQ 364
MD VA+ +K+G+F +Q+++N AFA +++ + + + ++ + ++ELA
Sbjct: 139 MDAVAQEIHSKLGQFRAQDLSNTLWAFAMLKYKPTEQWWQDFERQVFGALTDLTDRELAN 198
Query: 365 VLWAFASL 372
+LWAFA L
Sbjct: 199 LLWAFAVL 206
>gi|343924360|ref|ZP_08763910.1| hypothetical protein GOALK_015_00060 [Gordonia alkanivorans NBRC
16433]
gi|343765692|dbj|GAA10836.1| hypothetical protein GOALK_015_00060 [Gordonia alkanivorans NBRC
16433]
Length = 298
Score = 39.7 bits (91), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 38/83 (45%), Gaps = 5/83 (6%)
Query: 525 QKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKR 584
+K +A L + W V GY VD +D+KVA EIDG S H ++
Sbjct: 197 RKALALLRSAEITGWTANAKVCGYVVDIAFIDQKVAVEIDGFAFHS--DAASFQHDRTRQ 254
Query: 585 RYIAAAGWNVVSLSHQEWEELQG 607
+ A GW V+ + W+++ G
Sbjct: 255 NVLIANGWTVLRFT---WQDITG 274
>gi|441516792|ref|ZP_20998536.1| hypothetical protein GOHSU_08_00250 [Gordonia hirsuta DSM 44140 =
NBRC 16056]
gi|441456258|dbj|GAC56497.1| hypothetical protein GOHSU_08_00250 [Gordonia hirsuta DSM 44140 =
NBRC 16056]
Length = 312
Score = 39.7 bits (91), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 42/77 (54%), Gaps = 6/77 (7%)
Query: 530 RLLVSTGLN-WIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIA 588
RLL GL+ W++++ G+++D D KVA EIDG + R+ L + KR +A
Sbjct: 215 RLLKDQGLDGWVQQHPFHGWSIDFAWPDLKVAVEIDG-WAYHRDHKAFLRDSR-KRNALA 272
Query: 589 AAGWNVVSLSHQEWEEL 605
AGW +S S W +L
Sbjct: 273 LAGWITLSFS---WHDL 286
>gi|215919094|ref|YP_002332981.1| hypothetical protein CBU_1061a [Coxiella burnetii RSA 493]
gi|206583979|gb|ACI15272.1| hypothetical membrane associated protein [Coxiella burnetii RSA
493]
Length = 368
Score = 39.7 bits (91), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 39/153 (25%), Positives = 68/153 (44%), Gaps = 14/153 (9%)
Query: 228 SPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQREMSMLVAIAMTALPECSAQGIS 287
+P PL P+ + AK + L+ Q ++ + + P +AQ I+
Sbjct: 5 NPIPLDPIPLIRDFFHTAKQQK------NRPLSLNPQDYQTIKSILDNQSHPAFNAQSIA 58
Query: 288 NIAWALS--KIGGELLYLSEMDRVAEVALTKVGE-FNSQNVANVAGAFASMQHSAPD--- 341
N+ AL+ + L E+DR A+ + + FN Q++AN A A+M + D
Sbjct: 59 NLLLALAYRRTRWAALLNKELDRPLLHAIAQNADRFNPQDIANTLWALATMGINWRDIQE 118
Query: 342 --LFSELAKRASDIVHTFQEQELAQVLWAFASL 372
L + L K + + F Q++A LWA A++
Sbjct: 119 KELDNSLLKAIAQNANRFNPQDIANTLWALATM 151
>gi|294867000|ref|XP_002764924.1| hypothetical protein Pmar_PMAR007491 [Perkinsus marinus ATCC 50983]
gi|239864760|gb|EEQ97641.1| hypothetical protein Pmar_PMAR007491 [Perkinsus marinus ATCC 50983]
Length = 805
Score = 39.7 bits (91), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 67/301 (22%), Positives = 106/301 (35%), Gaps = 50/301 (16%)
Query: 357 FQEQELAQVLWAFASLYEPADPLLESLDNAFKD-------ATQFTCCLNKALSNCNENGG 409
F++QELA + W+ A+L L E + KD + L L++
Sbjct: 512 FKQQELALITWSLATLRISHQMLEEHCCHQAKDLLLTSGITSSHLSMLLWGLASNYHTSA 571
Query: 410 VKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTISRFEEQRISE 469
S + + S L F ++AWS A D + + FE
Sbjct: 572 PASELIQEVVARVRSRELRFAAADSFHVAWSLAAFDVFDPQSLEVLLSAAATFE------ 625
Query: 470 QYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKVTSSFQKEVA 529
+ + + +NQ H ++E +AG +R + S+FQ +V
Sbjct: 626 ------LDGAALQKINQVSMWSSSHGYEPTPMIVELFHRAAGSAQRDASVIDSAFQDQVT 679
Query: 530 RLLVSTGLNWIREYAV------DGYTVDAVLVDKKVA-----------------FEIDGP 566
L N EY V V+VD V E+DGP
Sbjct: 680 TCLRRAIGNSDYEYRVVSEMDLTNLGCPGVIVDLAVTRCESADECSRDEELPLIIEVDGP 739
Query: 567 THFSRNTGVPL-------GHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLRVIL 619
H+ R+ G L G +L+R + G++V +S +W L G E+ Y+ IL
Sbjct: 740 WHYVRSIGTSLPPGQKLCGKAVLRRNALRRLGYDVEEISFAQWSRL-GREERQKYIESIL 798
Query: 620 K 620
K
Sbjct: 799 K 799
>gi|221487299|gb|EEE25531.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 245
Score = 39.7 bits (91), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 46/187 (24%), Positives = 84/187 (44%), Gaps = 42/187 (22%)
Query: 207 QTAQEVL-----------EVIAEMITAVGKGLSPSPLSPLN---------------IATA 240
+TAQE+L E+ ++ + A LSPS ++ + +AT
Sbjct: 58 ETAQELLRGKETKRRAFWEIFSKRVKASAHMLSPSLMALIAKSFDVHDRDTGIYVALATV 117
Query: 241 LHRIAKNMEKVSMMTTHRLAFTRQRE-------MSMLVAIAMTALPECSAQGISNIAWAL 293
L K + S++T + F+R+ + S L AL + + + + I +L
Sbjct: 118 LPEAVKRADGRSLLTLSDV-FSRRLKRDSNPHLFSTLARQLPNALYQLTGKDVLRILSSL 176
Query: 294 SKIGGELLYLSEMDRVAEVA---LTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRA 350
G L++M +VA L ++ E +S ++A+ + FAS + P+L+S LA+RA
Sbjct: 177 DAAG-----LADMLACRQVARKLLAELDELDSVDLADASAVFASQGYRNPELYSALARRA 231
Query: 351 SDIVHTF 357
D+ +F
Sbjct: 232 VDVKDSF 238
>gi|160872163|ref|ZP_02062295.1| RAP domain family [Rickettsiella grylli]
gi|159120962|gb|EDP46300.1| RAP domain family [Rickettsiella grylli]
Length = 941
Score = 39.7 bits (91), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 24/99 (24%), Positives = 47/99 (47%), Gaps = 6/99 (6%)
Query: 521 TSSFQKEVARLLVSTG--LNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLG 578
TS Q EV + L++ ++ E+ ++ VD +KK+ +++GP+H+ G L
Sbjct: 787 TSRLQNEVFQYLLACFPEFKFVEEHFLEFTYVDIACPEKKILMQVNGPSHY---VGKKLN 843
Query: 579 -HTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+ GW+VV + + +W+ L + YL+
Sbjct: 844 VSSQFNNHLFEKLGWSVVIIPYFDWQALIKESARKKYLK 882
>gi|399218603|emb|CCF75490.1| unnamed protein product [Babesia microti strain RI]
Length = 1215
Score = 39.7 bits (91), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 59/305 (19%), Positives = 118/305 (38%), Gaps = 70/305 (22%)
Query: 206 AQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAFTRQR 265
++++ ++LE+ E T + +N TALHRIAKN + R +
Sbjct: 362 SRSSSDILEIYKENFTEINY---------VNAVTALHRIAKNSKN-----HERYTLSNDP 407
Query: 266 EMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSEM--------DRVA----EVA 313
M+ L+ + +P+ Q I+N WAL+++ ++S + +++ ++
Sbjct: 408 TMNKLLDHIYSFIPQMDQQSITNTLWALTRLEIRPNWISNLFLKLIPLANKLTPSELSMS 467
Query: 314 LTKVGEFNSQN----VAN---------------------------------VAGAFASMQ 336
L V +FNS + V N +A +FA +
Sbjct: 468 LYCVAKFNSSSKKRLVTNQINKSTAYTIKDTLLTISRQRIEEFKMPIELTCIATSFARLN 527
Query: 337 HSAPDLFSELAKRASDI--VHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFT 394
+F +A ++ + ++ + + ++W+FA + LL F +
Sbjct: 528 VRDSHVFRYIADKSLQLFEMNKLDVEHICSLIWSFARVNIVNTSLLGHF-CKFIEKNADK 586
Query: 395 CCLNKALSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQ----LGNIAWSYAVLGQMDRI 450
C L ++ C + + + ++S + +F RD + IAWSY+ G D
Sbjct: 587 CALRDLVNLCWSLSKLNYTPNELFIYTMSPMLRTFIRDMNSRDVSIIAWSYSNAGIQDNE 646
Query: 451 FFSDI 455
F D+
Sbjct: 647 LFKDL 651
>gi|422921513|ref|ZP_16954736.1| hypothetical protein VCBJG01_0254 [Vibrio cholerae BJG-01]
gi|341648748|gb|EGS72784.1| hypothetical protein VCBJG01_0254 [Vibrio cholerae BJG-01]
Length = 108
Score = 39.7 bits (91), Expect = 5.1, Method: Composition-based stats.
Identities = 22/69 (31%), Positives = 35/69 (50%), Gaps = 3/69 (4%)
Query: 536 GLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVV 595
G+ + R++ V Y +D K+A EIDG +HFS + H + Y+ G VV
Sbjct: 18 GVKFRRQFGVGNYVLDFYCSTYKLAVEIDGDSHFSEGGKI---HDEQRTAYLTRHGIRVV 74
Query: 596 SLSHQEWEE 604
++QE E+
Sbjct: 75 RYTNQEVEQ 83
>gi|340053741|emb|CCC48034.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 514
Score = 39.7 bits (91), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 49/92 (53%), Gaps = 4/92 (4%)
Query: 283 AQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
A+ ++NI A SK G E L+ RV +A +VGEF + ++ +A AF+ +++
Sbjct: 180 AKDVTNIISAFSKTGINHEKLFSFLSKRVQTLA--RVGEFEAAHLVILANAFSRLRYRDK 237
Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
LF +A+RA + EL ++ AF+ +
Sbjct: 238 FLFGAIARRAMSLRERVTVNELVPLIVAFSKI 269
>gi|405373797|ref|ZP_11028456.1| Aspartokinase [Chondromyces apiculatus DSM 436]
gi|397087311|gb|EJJ18361.1| Aspartokinase [Myxococcus sp. (contaminant ex DSM 436)]
Length = 425
Score = 39.7 bits (91), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 51/108 (47%), Gaps = 11/108 (10%)
Query: 124 KNKVTDDDLDFDLEDDMKMDDIMGSGNGYDMNDLRRTVSMM------AGGMFEEKREKTI 177
K+ TDD E+D M+D++ G YD N+ + TV + A +F EK I
Sbjct: 229 KSSFTDDPGTLVCEEDSSMEDVLVRGVAYDRNETKITVCGVPDIAGAAAKIFGPLDEKHI 288
Query: 178 EEFVHRLSQFSGPS-NRRKEINLNKDIVDAQTAQEVLEVIAEMITAVG 224
V + Q PS + R ++ D QTAQ+V+ +AE I A G
Sbjct: 289 --VVDLIVQ--NPSRDGRTDVTFTVGKTDFQTAQDVVRKVAEEIGAAG 332
>gi|342181126|emb|CCC90604.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 517
Score = 39.7 bits (91), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 49/92 (53%), Gaps = 4/92 (4%)
Query: 283 AQGISNIAWALSKIG--GELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP 340
A+ ++NI A SK G E L+ RV +A +VGEF + ++ +A AF+ +++
Sbjct: 183 AKDVTNIISAFSKTGINHEKLFSFLSRRVQTLA--RVGEFEAAHLVILANAFSRLRYRDK 240
Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
LF +A+RA + EL ++ AF+ +
Sbjct: 241 FLFGAIARRAMSLRERVTVNELVPLIVAFSKI 272
>gi|254286257|ref|ZP_04961216.1| protein of unknown function [Vibrio cholerae AM-19226]
gi|150423672|gb|EDN15614.1| protein of unknown function [Vibrio cholerae AM-19226]
Length = 126
Score = 39.3 bits (90), Expect = 5.8, Method: Composition-based stats.
Identities = 27/98 (27%), Positives = 45/98 (45%), Gaps = 8/98 (8%)
Query: 512 KTKRFNQKVTSSFQKEVARLL-----VSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGP 566
++K F Q + ++ RL G+ + R++ V Y +D K+A EIDG
Sbjct: 7 RSKVFRQYLRNNMTHPEQRLWQHLRHFQLGVKFRRQFGVGNYVLDFYCSTYKLAVEIDGD 66
Query: 567 THFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEE 604
+HFS + H + Y+ G VV ++QE E+
Sbjct: 67 SHFSEGGKI---HDEQRTAYLTRHGIRVVRYTNQEVEQ 101
>gi|189183514|ref|YP_001937299.1| repeat-containing protein A_03 [Orientia tsutsugamushi str. Ikeda]
gi|189180285|dbj|BAG40065.1| repeat-containing protein A_03 [Orientia tsutsugamushi str. Ikeda]
Length = 237
Score = 39.3 bits (90), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 28/109 (25%), Positives = 49/109 (44%), Gaps = 11/109 (10%)
Query: 280 ECSAQGISNIAWALSK----IGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASM 335
+ A+G++ I + +K IG E + + A+ + EFN Q +AN AF +
Sbjct: 56 QFDARGLATILYQFAKLNYVIGSEFI-----EAWTNKAINLMDEFNPQELANSIWAFGRL 110
Query: 336 Q-HSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL-YEPADPLLES 382
+ H + A+ + F Q LA +WAF L P+D +++
Sbjct: 111 EIHPSDQFIQAWIHHATKTIDNFNTQGLANSIWAFGRLEIHPSDQFIQA 159
>gi|156094199|ref|XP_001613137.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148802011|gb|EDL43410.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 578
Score = 39.3 bits (90), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 47/91 (51%), Gaps = 6/91 (6%)
Query: 286 ISNIAWALSKI--GGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
IS IA +K+ G L+ ++ E ++ E + Q+++N+ A++ + + L+
Sbjct: 189 ISQIANCFAKLNYGDATLFRHMEQQICE----RIDELSCQSISNICNAYSKLSLGSTTLY 244
Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASLYE 374
L K + + F EQE+A +L A+A + E
Sbjct: 245 DHLIKAVTKNLQKFNEQEIANILNAYAKVGE 275
>gi|71030818|ref|XP_765051.1| hypothetical protein [Theileria parva strain Muguga]
gi|68352007|gb|EAN32768.1| hypothetical protein, conserved [Theileria parva]
Length = 471
Score = 39.3 bits (90), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 63/131 (48%), Gaps = 19/131 (14%)
Query: 517 NQKVTSSFQKEVARLLVSTGL-NWIREYAVDGYTVDA--VLVDKKVAFEIDGPTHFSRNT 573
N K+ S QK V+ L+ + + + D +VD L +K+ E+DGPTHF RN
Sbjct: 347 NGKIISKSQKLVSDFLIRQNIPHQLEILTSDLSSVDIYICLNGEKIILEVDGPTHFIRNL 406
Query: 574 GVP-----LGHTMLKRRYIAAAGWNVVSLS--HQEWEELQGSFEQLD-YLRVILKDYIGG 625
P +G K + + G+ +S+ H + ++ Q+D Y + +LK+
Sbjct: 407 NDPSETRKIGPCDFKEKMLKENGFVFISIPPIHSNTQNIK----QIDEYYKELLKN---- 458
Query: 626 EGSSNIAETLK 636
GS+++ E LK
Sbjct: 459 SGSAHLNEILK 469
>gi|421341825|ref|ZP_15792234.1| hypothetical protein VCHC43B1_0345 [Vibrio cholerae HC-43B1]
gi|395947002|gb|EJH57660.1| hypothetical protein VCHC43B1_0345 [Vibrio cholerae HC-43B1]
Length = 117
Score = 39.3 bits (90), Expect = 6.6, Method: Composition-based stats.
Identities = 22/69 (31%), Positives = 35/69 (50%), Gaps = 3/69 (4%)
Query: 536 GLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVV 595
G+ + R++ V Y +D K+A EIDG +HFS + H + Y+ G VV
Sbjct: 27 GVKFRRQFGVGNYVLDFYCSTYKLAVEIDGDSHFSEGGKI---HDEQRTAYLKRHGIRVV 83
Query: 596 SLSHQEWEE 604
++QE E+
Sbjct: 84 RYTNQEVEQ 92
>gi|114569092|ref|YP_755772.1| hypothetical protein Mmar10_0541 [Maricaulis maris MCS10]
gi|114339554|gb|ABI64834.1| protein of unknown function DUF559 [Maricaulis maris MCS10]
Length = 225
Score = 39.3 bits (90), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 39/80 (48%), Gaps = 8/80 (10%)
Query: 536 GLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVP--LGHTMLKRRYIAAAGWN 593
G + R++ V Y D V+ K+ E+DG TH G P L H + ++ AAGW
Sbjct: 72 GFKFRRQHPVAPYIADFACVELKLIVELDGDTH-----GTPQELAHDRRRTGFLEAAGWT 126
Query: 594 VV-SLSHQEWEELQGSFEQL 612
V+ + + ++ L G Q+
Sbjct: 127 VIRAFNIDVYQNLDGVLTQI 146
>gi|197245530|gb|AAI68451.1| Unknown (protein for MGC:136169) [Xenopus (Silurana) tropicalis]
Length = 546
Score = 39.3 bits (90), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 33/62 (53%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
V +++A IDG F NT LG +K+R++ G+ V+ + E++ L E ++Y
Sbjct: 473 VHRRIALCIDGQKRFCSNTHKLLGKESIKQRHLRLLGYEVIQIPFYEFDNLSYKEEIVEY 532
Query: 615 LR 616
L
Sbjct: 533 LH 534
>gi|342184581|emb|CCC94063.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 1024
Score = 39.3 bits (90), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 62/126 (49%), Gaps = 12/126 (9%)
Query: 252 SMMTTHRLAFTRQREMS---MLVAIAMTALPECSAQGISNIAWALSKIGG--ELLYLSEM 306
++M+ R+ FT QR+M L A+AM P CS Q ++NIA A S G E L+
Sbjct: 733 TLMSFARVGFT-QRDMVDSFTLRALAMA--PTCSLQALANIAIAFSISGCRHEELFSIIA 789
Query: 307 DRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVL 366
DR + + + + +A+V AFAS+ LF E R + +++ V+
Sbjct: 790 DRF----INQKMDIPAVTIASVLSAFASIGIRNDRLFIEAIPRVRHVGQYGTPKDITNVV 845
Query: 367 WAFASL 372
+A++ +
Sbjct: 846 YAYSQV 851
Score = 39.3 bits (90), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 22/87 (25%), Positives = 44/87 (50%), Gaps = 2/87 (2%)
Query: 286 ISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSE 345
I+N+ +A S++G L + R+A+ A+ GEF +VA + A+A + LF E
Sbjct: 841 ITNVVYAYSQVG--LWHYKLFVRLADRAIQLRGEFRCDHVAKLLEAYARVNMRYEKLFVE 898
Query: 346 LAKRASDIVHTFQEQELAQVLWAFASL 372
+ R + H E+ ++ ++ ++
Sbjct: 899 FSSRIQTLAHLMNAGEITSIVHSYVTV 925
>gi|317048838|ref|YP_004116486.1| hypothetical protein Pat9b_2630 [Pantoea sp. At-9b]
gi|316950455|gb|ADU69930.1| protein of unknown function DUF559 [Pantoea sp. At-9b]
Length = 117
Score = 39.3 bits (90), Expect = 7.0, Method: Composition-based stats.
Identities = 21/79 (26%), Positives = 43/79 (54%), Gaps = 4/79 (5%)
Query: 535 TGLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNV 594
+G+ + R+YA+ Y VD +++ + E+DG H ++T L + ++ Y+ GW V
Sbjct: 32 SGVKFRRQYAIGRYIVDFACIERLLVIELDGGQHAEQST---LHYDEVRTAYLHRCGWRV 88
Query: 595 VSL-SHQEWEELQGSFEQL 612
+ ++Q + EL E++
Sbjct: 89 IRFWNNQVFCELDAVMEEI 107
>gi|422348130|ref|ZP_16429035.1| hypothetical protein HMPREF9476_03108 [Clostridium perfringens
WAL-14572]
gi|373222679|gb|EHP45041.1| hypothetical protein HMPREF9476_03108 [Clostridium perfringens
WAL-14572]
Length = 315
Score = 38.9 bits (89), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 13/104 (12%)
Query: 486 QCLKLEHPHLQLALSSVL---EEKIASAGK---TKRFNQKVTSSFQKEVARLLVSTGLNW 539
+CLKL++ AL S+ EE +GK +K + KV + + L+ T ++
Sbjct: 13 RCLKLKN-----ALESIKPKKEEFSTFSGKKPFSKEYELKVKYNLENPYQSTLIGTAFDY 67
Query: 540 IREYAVDGYTVDAVLVDKKVAFEIDGPTH--FSRNTGVPLGHTM 581
+ + + YT V VD +AF+I P H T L H M
Sbjct: 68 LARFIISKYTFSYVSVDNLIAFKIAEPIHEIIDEETSSKLKHLM 111
>gi|291236686|ref|XP_002738268.1| PREDICTED: FAST kinase domain-containing protein 1-like
[Saccoglossus kowalevskii]
Length = 101
Score = 38.9 bits (89), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 7/76 (9%)
Query: 557 KKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEW--EELQGSFEQLDY 614
++VA E F N+ PLG+ +KRR++ G+ V++ H EW +L S + +Y
Sbjct: 15 ERVAIEFLSSKSFCTNSQHPLGYIDMKRRHLEIMGYRYVAIPHFEWFSMKLSSSDDYREY 74
Query: 615 LRVIL-----KDYIGG 625
LR L DY+ G
Sbjct: 75 LREKLFAQKDPDYLEG 90
>gi|389583480|dbj|GAB66215.1| hypothetical protein PCYB_083760, partial [Plasmodium cynomolgi
strain B]
Length = 468
Score = 38.9 bits (89), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 47/91 (51%), Gaps = 6/91 (6%)
Query: 286 ISNIAWALSKI--GGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLF 343
IS IA +K+ G + L+ ++ E ++ E + Q+++N+ A++ + + LF
Sbjct: 84 ISQIANCFAKLNYGDDKLFKHMEQQICE----RIDELSCQSISNICNAYSKLSLGSETLF 139
Query: 344 SELAKRASDIVHTFQEQELAQVLWAFASLYE 374
L K + F EQE+A +L A++ L E
Sbjct: 140 CRLIKTVKKNLDNFNEQEIANILNAYSKLGE 170
>gi|351542151|ref|NP_001135619.2| FAST kinase domains 3 [Xenopus (Silurana) tropicalis]
Length = 691
Score = 38.9 bits (89), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 33/62 (53%)
Query: 555 VDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDY 614
V +++A IDG F NT LG +K+R++ G+ V+ + E++ L E ++Y
Sbjct: 618 VHRRIALCIDGQKRFCSNTHKLLGKESIKQRHLRLLGYEVIQIPFYEFDNLSYKEEIVEY 677
Query: 615 LR 616
L
Sbjct: 678 LH 679
>gi|297581735|ref|ZP_06943657.1| DNA methyltransferase [Vibrio cholerae RC385]
gi|421350131|ref|ZP_15800499.1| hypothetical protein VCHE25_1308 [Vibrio cholerae HE-25]
gi|297534142|gb|EFH72981.1| DNA methyltransferase [Vibrio cholerae RC385]
gi|395955238|gb|EJH65841.1| hypothetical protein VCHE25_1308 [Vibrio cholerae HE-25]
Length = 126
Score = 38.9 bits (89), Expect = 7.7, Method: Composition-based stats.
Identities = 27/98 (27%), Positives = 45/98 (45%), Gaps = 8/98 (8%)
Query: 512 KTKRFNQKVTSSFQKEVARLL-----VSTGLNWIREYAVDGYTVDAVLVDKKVAFEIDGP 566
++K F Q + ++ RL G+ + R++ V Y +D K+A EIDG
Sbjct: 7 RSKVFRQYLRNNMTHPEQRLWQHLRHFQLGVKFRRQFGVGNYVLDFYCSTYKLAVEIDGD 66
Query: 567 THFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEE 604
+HFS + H + Y+ G VV ++QE E+
Sbjct: 67 SHFSEGGKI---HDEQRTAYLKRHGIRVVRYTNQEVEQ 101
>gi|255087452|ref|XP_002505649.1| predicted protein [Micromonas sp. RCC299]
gi|226520919|gb|ACO66907.1| predicted protein [Micromonas sp. RCC299]
Length = 629
Score = 38.9 bits (89), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 46/177 (25%), Positives = 74/177 (41%), Gaps = 19/177 (10%)
Query: 202 DIVDAQTAQEVLEVIAEMITAVGKGLSPSPLSPLNIATALHRIAKNMEKVSMMTTHRLAF 261
D A +QE + +AE + ++P ++ N+ AL ++ VS RLA
Sbjct: 261 DAAAAAVSQEGWKRLAEAAEQQARDMNPQDIA--NVLNALSKLDAAAAAVSPEGWKRLAE 318
Query: 262 TRQREMSMLVAIAMTALPECSAQGISNIAWALSKIGGELLYLSE--MDRVAEVALTKVGE 319
+R+ E + QG +N+ ALSK+ +S RV E + E
Sbjct: 319 AAERQAR-----------EMNPQGNANVLNALSKLDAAAAEVSPEGWKRVGEAVERQARE 367
Query: 320 FNSQNVANVAGAFASMQHSA----PDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
N Q ANV A + + +A P+ + LA+ A Q++A VL A + L
Sbjct: 368 MNPQGNANVLNALSKLDAAAAAVSPEGWKRLAEAAERQARDMNPQDIANVLNALSKL 424
>gi|294865269|ref|XP_002764366.1| hypothetical protein Pmar_PMAR015373 [Perkinsus marinus ATCC 50983]
gi|239863598|gb|EEQ97083.1| hypothetical protein Pmar_PMAR015373 [Perkinsus marinus ATCC 50983]
Length = 810
Score = 38.9 bits (89), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 75/347 (21%), Positives = 126/347 (36%), Gaps = 77/347 (22%)
Query: 284 QGISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAP--- 340
Q + + WAL + L E V + K G +S+++A V S H +P
Sbjct: 519 QDVGLLVWALGTLRLSHYELEERCCVLARGMLKEGRIDSRHLAMVLWGITSNAHRSPSAI 578
Query: 341 DLFSELAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKA 400
DL ++ R + + ++ V+W+ A ++ L+ L A A
Sbjct: 579 DLIQDVIHRVESSTLSPRPADVTIVIWSMAVFDLYSEKALQKLLEALVKA---------- 628
Query: 401 LSNCNENGGVKSSGDADSEGSLSSPVLSFNRDQLGNIAWSYAVLGQMDRIFFSDIWKTIS 460
G S+ +E S + R+ S +W +
Sbjct: 629 --------GPMSNAPPRTEQGAS-----------------------LIRLHRSLLWARLC 657
Query: 461 RFEEQRISEQYREDIMFASQVHLVNQCLKLEHPHLQLALSSVLEEKIASAGKTKRFNQKV 520
+ SE+ HLV + P L SS L+ +I S +
Sbjct: 658 HGFQPSPSEE----------AHLVKIAQRQRAPGGGLVTSSTLQWEIRSELQRVLLEVAP 707
Query: 521 TSSFQKEVARLLVSTGLNWIREYAVDGYTVDAVLVDKK----VAFEIDGPTHFSR----N 572
T+S + E ++G VD ++D K + E+DG +HFS+ N
Sbjct: 708 TASLRDEYEL-----------PAPLEGIFVDLAVIDAKEQVLLIIEVDGYSHFSKLISDN 756
Query: 573 TGVPL---GHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQLDYLR 616
+ L G+T L RR + AG+ V+S+S +W Q + +YLR
Sbjct: 757 SLAELQYNGNTELSRRILRKAGYEVLSISTVDWNNTQ-RHRRGEYLR 802
>gi|153801511|ref|ZP_01956097.1| DNA methyltransferase [Vibrio cholerae MZO-3]
gi|153826738|ref|ZP_01979405.1| DNA methyltransferase [Vibrio cholerae MZO-2]
gi|417819161|ref|ZP_12465780.1| hypothetical protein VCHE39_0600 [Vibrio cholerae HE39]
gi|419835217|ref|ZP_14358665.1| hypothetical protein VCHC46B1_0339 [Vibrio cholerae HC-46B1]
gi|423733571|ref|ZP_17706797.1| hypothetical protein VCHC41B1_0330 [Vibrio cholerae HC-41B1]
gi|423944542|ref|ZP_17733223.1| hypothetical protein VCHE40_0267 [Vibrio cholerae HE-40]
gi|423973991|ref|ZP_17736771.1| hypothetical protein VCHE46_0269 [Vibrio cholerae HE-46]
gi|424007860|ref|ZP_17750816.1| hypothetical protein VCHC44C1_0325 [Vibrio cholerae HC-44C1]
gi|124122916|gb|EAY41659.1| DNA methyltransferase [Vibrio cholerae MZO-3]
gi|149739453|gb|EDM53691.1| DNA methyltransferase [Vibrio cholerae MZO-2]
gi|340043051|gb|EGR04012.1| hypothetical protein VCHE39_0600 [Vibrio cholerae HE39]
gi|408632129|gb|EKL04612.1| hypothetical protein VCHC41B1_0330 [Vibrio cholerae HC-41B1]
gi|408662338|gb|EKL33288.1| hypothetical protein VCHE40_0267 [Vibrio cholerae HE-40]
gi|408666350|gb|EKL37139.1| hypothetical protein VCHE46_0269 [Vibrio cholerae HE-46]
gi|408859358|gb|EKL99019.1| hypothetical protein VCHC46B1_0339 [Vibrio cholerae HC-46B1]
gi|408867417|gb|EKM06777.1| hypothetical protein VCHC44C1_0325 [Vibrio cholerae HC-44C1]
Length = 126
Score = 38.9 bits (89), Expect = 8.0, Method: Composition-based stats.
Identities = 22/69 (31%), Positives = 35/69 (50%), Gaps = 3/69 (4%)
Query: 536 GLNWIREYAVDGYTVDAVLVDKKVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVV 595
G+ + R++ V Y +D K+A EIDG +HFS + H + Y+ G VV
Sbjct: 36 GVKFRRQFGVGNYVLDFYCSTYKLAVEIDGDSHFSEGGKI---HDEQRTAYLKRHGIRVV 92
Query: 596 SLSHQEWEE 604
++QE E+
Sbjct: 93 RYTNQEVEQ 101
>gi|71748328|ref|XP_823219.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70832887|gb|EAN78391.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 1024
Score = 38.9 bits (89), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 53/115 (46%), Gaps = 15/115 (13%)
Query: 286 ISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSE 345
I+N+ +A S++G L + R+A+ A+ GEF +A + A+A + LF E
Sbjct: 836 ITNVVYAYSQVG--LWHYKLFVRLADRAVQLRGEFRCDQLARLLEAYARVDMRYEKLFVE 893
Query: 346 LAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKA 400
+ R + H E++ V+ A+A + LD A F C+++A
Sbjct: 894 FSPRVQTVAHLLTAGEISTVVNAYAK--------VRVLDTAV-----FKACVDRA 935
>gi|302854443|ref|XP_002958729.1| hypothetical protein VOLCADRAFT_120035 [Volvox carteri f.
nagariensis]
gi|300255904|gb|EFJ40185.1| hypothetical protein VOLCADRAFT_120035 [Volvox carteri f.
nagariensis]
Length = 2274
Score = 38.9 bits (89), Expect = 8.5, Method: Composition-based stats.
Identities = 30/108 (27%), Positives = 55/108 (50%), Gaps = 5/108 (4%)
Query: 268 SMLVAIAMTALPECSAQGISNIAWALSKIGGELLY--LSEMDRVAEVALTKVGEFNSQNV 325
S+ V A T LP+ + + ++ + W+L+K+G + LS + + + + Q +
Sbjct: 1065 SLAVRFAQT-LPDATIREVATVLWSLAKLGRPAPHALLSHILAAQQRGFM-LRTASPQAI 1122
Query: 326 ANVAGAFASMQHSAPD-LFSELAKRASDIVHTFQEQELAQVLWAFASL 372
AN+ A A+ + P+ L S + ++ + FQ Q+ A VLWA A L
Sbjct: 1123 ANMLWALATWRTREPEPLLSLVLEQCYRALPAFQPQDTANVLWALARL 1170
>gi|159469824|ref|XP_001693063.1| predicted protein [Chlamydomonas reinhardtii]
gi|158277865|gb|EDP03632.1| predicted protein [Chlamydomonas reinhardtii]
Length = 649
Score = 38.9 bits (89), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 9/94 (9%)
Query: 304 SEMDRVAEVALTKVGEFNSQNVANVAGAFASMQH--SAPDLFSELAKRASDIVHTFQEQE 361
S +D VA+V L+++ + VA F + +H + PD ++A + +F Q
Sbjct: 253 SLLDAVADVLLSRLDGLSHHEVATALWTFGTFRHRPAHPDFAKQVAAALYARMRSFSPQG 312
Query: 362 LAQVLWAFASLYEPADPLLESLD-------NAFK 388
LA V+ A A L ++PL+E L NAFK
Sbjct: 313 LAMVVKALAQLQWRSEPLMEQLIAAAEAKLNAFK 346
>gi|261333127|emb|CBH16122.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 1024
Score = 38.9 bits (89), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 53/115 (46%), Gaps = 15/115 (13%)
Query: 286 ISNIAWALSKIGGELLYLSEMDRVAEVALTKVGEFNSQNVANVAGAFASMQHSAPDLFSE 345
I+N+ +A S++G L + R+A+ A+ GEF +A + A+A + LF E
Sbjct: 836 ITNVVYAYSQVG--LWHYKLFVRLADRAVQLRGEFRCDQLARLLEAYARVDMRYEKLFVE 893
Query: 346 LAKRASDIVHTFQEQELAQVLWAFASLYEPADPLLESLDNAFKDATQFTCCLNKA 400
+ R + H E++ V+ A+A + LD A F C+++A
Sbjct: 894 FSPRVQTVAHLLTAGEISTVVNAYAK--------VRVLDTAV-----FKACVDRA 935
>gi|260802957|ref|XP_002596358.1| hypothetical protein BRAFLDRAFT_121233 [Branchiostoma floridae]
gi|229281613|gb|EEN52370.1| hypothetical protein BRAFLDRAFT_121233 [Branchiostoma floridae]
Length = 831
Score = 38.9 bits (89), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 15/54 (27%), Positives = 30/54 (55%)
Query: 558 KVAFEIDGPTHFSRNTGVPLGHTMLKRRYIAAAGWNVVSLSHQEWEELQGSFEQ 611
+VA + F RN+ LGH +++R++ G+ V+ + H EW ++ + E+
Sbjct: 762 RVAIDYQDARDFCRNSQHLLGHVAMRKRHLEILGYTVIQIPHFEWNSMKLATEE 815
>gi|156081959|ref|XP_001608472.1| Secretory protein [Plasmodium vivax Sal-1]
gi|148801043|gb|EDL42448.1| Secretory protein, putative [Plasmodium vivax]
Length = 441
Score = 38.9 bits (89), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 23/90 (25%), Positives = 42/90 (46%), Gaps = 1/90 (1%)
Query: 522 SSFQKEVARLLVSTGLNWIREYAVDGYTVD-AVLVDKKVAFEIDGPTHFSRNTGVPLGHT 580
S FQ EV+ L G++ + Y +D + +K+ + +DGP F +T +
Sbjct: 293 SEFQWEVSNCLAKLGISHRNTFLWGSYYIDIGEMNEKRNCWFVDGPACFYTSTNQYIESV 352
Query: 581 MLKRRYIAAAGWNVVSLSHQEWEELQGSFE 610
L+ R + GWN+ + +W +L +E
Sbjct: 353 KLQHRILYNLGWNIRRIVWLDWLQLGDDWE 382
>gi|357020460|ref|ZP_09082691.1| hypothetical protein KEK_10638 [Mycobacterium thermoresistibile
ATCC 19527]
gi|356478208|gb|EHI11345.1| hypothetical protein KEK_10638 [Mycobacterium thermoresistibile
ATCC 19527]
Length = 287
Score = 38.9 bits (89), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 522 SSFQKEVARLLVSTGLN-WIREYAVDGYTVDAVLVDKKVAFEIDGPTHFS 570
S+ ++++ RLL G++ W YA+ GY VD +VA E+DG + S
Sbjct: 190 SAAERKLVRLLRGAGISGWTTNYAIGGYKVDVAFPAGRVAIEVDGLAYHS 239
>gi|386590754|ref|YP_006087154.1| Dipeptide-binding ABC transporter [Salmonella enterica subsp.
enterica serovar Heidelberg str. B182]
gi|383797798|gb|AFH44880.1| Dipeptide-binding ABC transporter [Salmonella enterica subsp.
enterica serovar Heidelberg str. B182]
Length = 512
Score = 38.9 bits (89), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 20/70 (28%), Positives = 34/70 (48%)
Query: 134 FDLEDDMKMDDIMGSGNGYDMNDLRRTVSMMAGGMFEEKREKTIEEFVHRLSQFSGPSNR 193
F L+ DMK+ +++ G + L T+++ GG F++ + L + S P N
Sbjct: 63 FGLDKDMKVKNVLAKGYTVSDDGLTYTITLRQGGKFQDGADFDAAAVKANLDRASNPDNH 122
Query: 194 RKEINLNKDI 203
K NL K+I
Sbjct: 123 LKRYNLYKNI 132
>gi|159485166|ref|XP_001700618.1| predicted protein of CLR family [Chlamydomonas reinhardtii]
gi|158272142|gb|EDO97947.1| predicted protein of CLR family [Chlamydomonas reinhardtii]
Length = 584
Score = 38.9 bits (89), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 29/53 (54%)
Query: 320 FNSQNVANVAGAFASMQHSAPDLFSELAKRASDIVHTFQEQELAQVLWAFASL 372
FN Q ++NV A A + H PDL LA A+ V + Q L+ LWA A+L
Sbjct: 191 FNQQELSNVLWACAKLGHRDPDLLQPLADAAAAAVASMTGQGLSNCLWALATL 243
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.130 0.369
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,632,927,709
Number of Sequences: 23463169
Number of extensions: 402818058
Number of successful extensions: 1976467
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 254
Number of HSP's successfully gapped in prelim test: 1435
Number of HSP's that attempted gapping in prelim test: 1959658
Number of HSP's gapped (non-prelim): 10985
length of query: 640
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 491
effective length of database: 8,863,183,186
effective search space: 4351822944326
effective search space used: 4351822944326
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 80 (35.4 bits)
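Note on the statistics above: the bit scores and the effective search space in this report can be re-derived from the footer values. The sketch below is a hedged check, not part of the BLAST output. It applies the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, and assumes that the second "Lambda K H" row (0.267, 0.0410) holds the gapped parameters, since the footer does not label the two rows. The function and constant names are illustrative only. The E-value helper ignores the compositional adjustment, so it only approximates the reported Expect values.

```python
import math

# Values copied from the statistics footer of this report.
# Assumption: the second "Lambda K H" row is the gapped parameter set.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410

QUERY_LENGTH = 640
EFFECTIVE_HSP_LENGTH = 149
EFFECTIVE_DB_LENGTH = 8_863_183_186  # as printed in the footer


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score (Karlin-Altschul)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_search_space(query_len=QUERY_LENGTH,
                           hsp_len=EFFECTIVE_HSP_LENGTH,
                           db_len=EFFECTIVE_DB_LENGTH):
    """Effective query length multiplied by the effective database length."""
    return (query_len - hsp_len) * db_len


def approximate_evalue(raw_score):
    """Rough E-value: search space * 2^(-bits). Ignores composition adjustment."""
    return effective_search_space() * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    for raw in (92, 91, 90, 89):
        print(raw, round(bit_score(raw), 1))
    print(effective_search_space())
    print(approximate_evalue(92))
```

Under these assumptions, the raw scores 92, 91, 90 and 89 reproduce the bit scores printed in the hit headers, and the product of the effective lengths matches the "effective search space" line.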