BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 005433
         (697 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|359476651|ref|XP_002273723.2| PREDICTED: uncharacterized protein LOC100264247 [Vitis vinifera]
 gi|297735064|emb|CBI17426.3| unnamed protein product [Vitis vinifera]
          Length = 696

 Score = 1302 bits (3369), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 613/697 (87%), Positives = 659/697 (94%), Gaps = 1/697 (0%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
           MRKRDLAILMLSAFAIFFSLQHEGDFSF+EAWFHLS+EYPIK++A+RLPPP+VADLNGDG
Sbjct: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFKEAWFHLSDEYPIKYEAERLPPPLVADLNGDG 60

Query: 61  RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
           +KEVLVATHDAKIQVLEPHARRVDEGFSEARVL EVSLLPDKIRI+SGRRAVAMATGV+D
Sbjct: 61  KKEVLVATHDAKIQVLEPHARRVDEGFSEARVLVEVSLLPDKIRISSGRRAVAMATGVVD 120

Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
           R Y+QGQP KQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFP NAHHREIAISISNYTLK
Sbjct: 121 RHYKQGQPQKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPHNAHHREIAISISNYTLK 180

Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
           HGD GLVIVGGRMEM PH  MDPFE IG+ EKNAEQHRRSA+EKEASEN+GTVDLRHFAF
Sbjct: 181 HGDAGLVIVGGRMEMLPHIYMDPFEVIGMTEKNAEQHRRSANEKEASENAGTVDLRHFAF 240

Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
           YAFAGRSG +RW RKNENI+   +DASQLIPQHNYKLD HALN+RHPGEFECREFRES+L
Sbjct: 241 YAFAGRSGAVRWMRKNENIQTLSSDASQLIPQHNYKLDAHALNTRHPGEFECREFRESIL 300

Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISNL 360
           GVMPHHWDRREDTLLKL+HFRRHKRK LKK  GKST+YPFHKPEE+HPPGKD TKKISNL
Sbjct: 301 GVMPHHWDRREDTLLKLAHFRRHKRKTLKKTQGKSTNYPFHKPEENHPPGKDDTKKISNL 360

Query: 361 IGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHL 420
           IGKAA YA SAKSKKP+ Y+PTITNYTQLWWVPNVVVAHQ+EGIEAVHL +GRT+CKLHL
Sbjct: 361 IGKAAKYASSAKSKKPLPYVPTITNYTQLWWVPNVVVAHQREGIEAVHLPTGRTICKLHL 420

Query: 421 QEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASI 480
           QEGGLHADINGDGVLDHVQ VGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASI
Sbjct: 421 QEGGLHADINGDGVLDHVQVVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASI 480

Query: 481 CHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEV 540
           CHHSPFNLF HGEFSR+F RT D+ SLEVATPILIPR+DGHRHRKGSHGD++FLTNRGEV
Sbjct: 481 CHHSPFNLFQHGEFSRSFSRTPDLGSLEVATPILIPRNDGHRHRKGSHGDIIFLTNRGEV 540

Query: 541 TAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAG 600
           T+YSPGLHGHDAIWQWQLLT ATWSNLPSPSGM E S VVPTLKAFSLR HDN+++ILA 
Sbjct: 541 TSYSPGLHGHDAIWQWQLLTGATWSNLPSPSGMME-SMVVPTLKAFSLRAHDNRELILAA 599

Query: 601 GDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGA 660
           GDQEA+++SPGGS+LTS++LPA PTHAL+CEDFSNDGLTD+IL+TSNGVYGFVQTRQPGA
Sbjct: 600 GDQEAIMMSPGGSLLTSVELPAAPTHALICEDFSNDGLTDLILVTSNGVYGFVQTRQPGA 659

Query: 661 LFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGLR 697
           LFFSTLVGCLIVVMGVIFVTQ+LNS+K KPRASSG R
Sbjct: 660 LFFSTLVGCLIVVMGVIFVTQYLNSMKGKPRASSGPR 696


>gi|255585207|ref|XP_002533306.1| aldehyde dehydrogenase, putative [Ricinus communis]
 gi|223526871|gb|EEF29083.1| aldehyde dehydrogenase, putative [Ricinus communis]
          Length = 1050

 Score = 1301 bits (3368), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 610/676 (90%), Positives = 647/676 (95%), Gaps = 2/676 (0%)

Query: 20   LQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDGRKEVLVATHDAKIQVLEPH 79
            +QHEGDFSFREAWFHLS+EYPIK++ADRLPPPIVADLNGDG+KEVLVATHDAKIQVLEPH
Sbjct: 372  VQHEGDFSFREAWFHLSDEYPIKYEADRLPPPIVADLNGDGKKEVLVATHDAKIQVLEPH 431

Query: 80   ARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSG 139
            +RRVDEGFSEARVLAEVSLLPDKIR+ASGRRAVAMA GVIDRTY+QGQPLKQVLVV+TSG
Sbjct: 432  SRRVDEGFSEARVLAEVSLLPDKIRVASGRRAVAMAAGVIDRTYKQGQPLKQVLVVITSG 491

Query: 140  WSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLKHGDTGLVIVGGRMEMQPHT 199
            WSVMCFDHNL KLWEANLQEDFP NAHHREIAISISNYTL+HGDTGLV+VGGRMEMQPH 
Sbjct: 492  WSVMCFDHNLKKLWEANLQEDFPHNAHHREIAISISNYTLRHGDTGLVLVGGRMEMQPHV 551

Query: 200  IM--DPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAFYAFAGRSGLLRWSRKNE 257
             +  DPFEEIG AEKNAE HRRSASEKEA+ENSGTVDLRHFAFYAFAGR+G LRWSRKNE
Sbjct: 552  YLELDPFEEIGTAEKNAEFHRRSASEKEATENSGTVDLRHFAFYAFAGRTGALRWSRKNE 611

Query: 258  NIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKL 317
            NIEAQP+DASQLIPQHNYKLDVHALNSRHPGEFECREFRES+LGVMPHHWDRREDT LKL
Sbjct: 612  NIEAQPSDASQLIPQHNYKLDVHALNSRHPGEFECREFRESILGVMPHHWDRREDTQLKL 671

Query: 318  SHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPV 377
            SHFRRHKRK LKKV GK+ +YPFHKPEE+HPPGKDSTKKIS +IGKAA YAGSAKSKKP 
Sbjct: 672  SHFRRHKRKTLKKVPGKTINYPFHKPEENHPPGKDSTKKISKIIGKAANYAGSAKSKKPF 731

Query: 378  NYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDH 437
             YIPTITNYTQLWWVPNVVVAHQKEGIEAVHLA+GRT+CKLHL EGGLHADINGDGVLDH
Sbjct: 732  PYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLATGRTLCKLHLLEGGLHADINGDGVLDH 791

Query: 438  VQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRN 497
            VQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLF HGEFSRN
Sbjct: 792  VQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFQHGEFSRN 851

Query: 498  FGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQ 557
            FGRTSD +SLEVA+PILIPRSDGH+HRKGSHGDV+FLTNRGEVT+YSPGLHGHDAIWQWQ
Sbjct: 852  FGRTSDASSLEVASPILIPRSDGHKHRKGSHGDVIFLTNRGEVTSYSPGLHGHDAIWQWQ 911

Query: 558  LLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTS 617
            LLTDATWSNLPSPSGM E   VVPTLKAFSLR+HDNQQMILA GDQEAVVISPGGSI T+
Sbjct: 912  LLTDATWSNLPSPSGMMEGGMVVPTLKAFSLRMHDNQQMILAAGDQEAVVISPGGSIQTT 971

Query: 618  IDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVI 677
            IDLPAPPTHAL+CEDFS+DGLTD+I++TSNGVYGFVQTR PGALFFSTLVGCL++VMGVI
Sbjct: 972  IDLPAPPTHALICEDFSSDGLTDLIVVTSNGVYGFVQTRTPGALFFSTLVGCLLIVMGVI 1031

Query: 678  FVTQHLNSVKAKPRAS 693
            FVTQHLNS+K KPRAS
Sbjct: 1032 FVTQHLNSIKGKPRAS 1047


>gi|224134190|ref|XP_002327778.1| predicted protein [Populus trichocarpa]
 gi|222836863|gb|EEE75256.1| predicted protein [Populus trichocarpa]
          Length = 693

 Score = 1270 bits (3287), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 602/698 (86%), Positives = 648/698 (92%), Gaps = 6/698 (0%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
           MRKRDLAILMLSAF+IFFSLQHEGDFSFREAWFHL++EYPIK++ +RLPPPIV+DLNGDG
Sbjct: 1   MRKRDLAILMLSAFSIFFSLQHEGDFSFREAWFHLTDEYPIKYETERLPPPIVSDLNGDG 60

Query: 61  RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
           +KE+LVATHDAKIQVLEPH RRVDEGFSEAR+L E+SLLPDK R+A+GRRAVAMATGVID
Sbjct: 61  KKEILVATHDAKIQVLEPHLRRVDEGFSEARLLTELSLLPDKTRVATGRRAVAMATGVID 120

Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
           R Y++G PLKQVLVVVTSGWSVMCFDHNL KLWE NLQEDFP NAHHREIAISISNYTLK
Sbjct: 121 RRYKEGHPLKQVLVVVTSGWSVMCFDHNLKKLWETNLQEDFPHNAHHREIAISISNYTLK 180

Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
           HGD+GLVI+GGRMEMQPH   DPFEEIG+AEKNAEQHRRSASEKE SENSGTV+LRHFA 
Sbjct: 181 HGDSGLVIIGGRMEMQPHIYSDPFEEIGMAEKNAEQHRRSASEKEPSENSGTVNLRHFAL 240

Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
           YAFAGR+G LRWSRKNE+ +A    ASQLIPQHNYKLDVHALNSRHPGEFECREFRES+L
Sbjct: 241 YAFAGRTGALRWSRKNESSDA----ASQLIPQHNYKLDVHALNSRHPGEFECREFRESIL 296

Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISNL 360
           GVMPHHWDRREDT+L+LSHFRRHKRK  KK  GK+T+YPFHKPEE+HPPGKDS KKISNL
Sbjct: 297 GVMPHHWDRREDTVLQLSHFRRHKRKTSKKSNGKTTNYPFHKPEENHPPGKDSAKKISNL 356

Query: 361 IGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHL 420
           IG+AA YAGS KSKKP  YIPTITNYTQLWW+PNVVVAHQKEGIEAVHLASGRT+CKLHL
Sbjct: 357 IGEAAKYAGSTKSKKPFQYIPTITNYTQLWWLPNVVVAHQKEGIEAVHLASGRTLCKLHL 416

Query: 421 QEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASI 480
           QEGGLHADINGDGVLDHVQAVGGNGAEQTV+SGSMEVL+PCWAVATSGVPVREQLFNASI
Sbjct: 417 QEGGLHADINGDGVLDHVQAVGGNGAEQTVISGSMEVLQPCWAVATSGVPVREQLFNASI 476

Query: 481 C-HHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGE 539
           C HHSP NLF HG+F RNFGRT DV+SLEVATPILIPR DGHRHRKGSHGDVVFLTNRGE
Sbjct: 477 CHHHSPLNLFQHGDFGRNFGRT-DVSSLEVATPILIPRGDGHRHRKGSHGDVVFLTNRGE 535

Query: 540 VTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILA 599
           VT+YSPGLHGHDA+WQWQ+ T ATWSNLPSPSGM E   VVPTLKAFSLR  DNQQMILA
Sbjct: 536 VTSYSPGLHGHDAVWQWQISTGATWSNLPSPSGMMEGGMVVPTLKAFSLRARDNQQMILA 595

Query: 600 GGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPG 659
            GDQEA VISPGGSI TS+DLPAPPTHAL+CEDFSNDGLTD+I++TSNGVYGFVQTR PG
Sbjct: 596 AGDQEASVISPGGSIQTSVDLPAPPTHALICEDFSNDGLTDLIVVTSNGVYGFVQTRSPG 655

Query: 660 ALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGLR 697
           ALFFSTLVGCL++VMGVIFVTQHLNS+K KPRASS  R
Sbjct: 656 ALFFSTLVGCLLIVMGVIFVTQHLNSIKEKPRASSAAR 693


>gi|224094889|ref|XP_002310280.1| predicted protein [Populus trichocarpa]
 gi|222853183|gb|EEE90730.1| predicted protein [Populus trichocarpa]
          Length = 679

 Score = 1231 bits (3186), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 583/698 (83%), Positives = 637/698 (91%), Gaps = 20/698 (2%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
           MRKRDLAILMLSAF+IFFSLQHEGDFSFREAWFHL+++YPIK++ DRLPPPIV+DLNGDG
Sbjct: 1   MRKRDLAILMLSAFSIFFSLQHEGDFSFREAWFHLTDDYPIKYETDRLPPPIVSDLNGDG 60

Query: 61  RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
           +KE+LVATHDAKI VLEPH+RRVDEGFSEAR+L E+SLLPDK R+A+GRRAVAMATGVI+
Sbjct: 61  KKEILVATHDAKILVLEPHSRRVDEGFSEARLLTELSLLPDKTRVATGRRAVAMATGVIE 120

Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
           R Y++G PLKQVLVVVTSGWSVMCFDHNL KLWE N+QEDFP NAHHREIAISISNYTLK
Sbjct: 121 RRYKEGHPLKQVLVVVTSGWSVMCFDHNLKKLWETNVQEDFPHNAHHREIAISISNYTLK 180

Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
           HGD GLVI+GGRME+QPH  +DPFEEIG+AEKNAEQHRRSA EKE SENSGTV+LRHFA 
Sbjct: 181 HGDMGLVIIGGRMEVQPHNYLDPFEEIGMAEKNAEQHRRSAGEKEPSENSGTVNLRHFAL 240

Query: 241 YAFAGRSGLLRWSRKNENIEAQPTD-ASQLIPQHNYKLDVHALNSRHPGEFECREFRESV 299
           YAFAGR+G +RWSRKNENIEA+ +D ASQLIPQHNYKLDVHALNSRHPGE          
Sbjct: 241 YAFAGRTGTVRWSRKNENIEAESSDAASQLIPQHNYKLDVHALNSRHPGE---------- 290

Query: 300 LGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISN 359
                   DRREDT+L+LSHFRRHKRK  KK  GK+++YPFHKPEE+HPPGKD+TKKISN
Sbjct: 291 --------DRREDTVLQLSHFRRHKRKTSKKSNGKNSNYPFHKPEENHPPGKDTTKKISN 342

Query: 360 LIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLH 419
           LIGKAA YA S KSKKP  YIPTITNYTQLWWVPNVVVAHQKEGIEA+HLASGRT+CKLH
Sbjct: 343 LIGKAAKYASSTKSKKPSQYIPTITNYTQLWWVPNVVVAHQKEGIEAIHLASGRTLCKLH 402

Query: 420 LQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNAS 479
           LQEGGLHADINGDGVLDHVQAVGGNGAE+TVVSG+MEVL+PCWAVATSGVPVREQLFNAS
Sbjct: 403 LQEGGLHADINGDGVLDHVQAVGGNGAEKTVVSGAMEVLQPCWAVATSGVPVREQLFNAS 462

Query: 480 ICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGE 539
           ICHHSPFNLF HG+F RNFGRT DV+SLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGE
Sbjct: 463 ICHHSPFNLFQHGDFGRNFGRT-DVSSLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGE 521

Query: 540 VTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILA 599
           VT+YSPGLHGHDA+WQWQ+LT ATWSNLPSPSGM E   VVPTLKAFSLR HDNQQMILA
Sbjct: 522 VTSYSPGLHGHDAVWQWQILTGATWSNLPSPSGMMEGGMVVPTLKAFSLRAHDNQQMILA 581

Query: 600 GGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPG 659
            GDQEA VISPGGS+ TS DLPAPPTHAL+CEDF+NDGL D+I++TSNGVYGFVQTR PG
Sbjct: 582 AGDQEAAVISPGGSVQTSFDLPAPPTHALICEDFTNDGLPDLIVVTSNGVYGFVQTRSPG 641

Query: 660 ALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGLR 697
           ALFFSTLVGCL++VMGVIFVTQH+NS+K KPRASSGLR
Sbjct: 642 ALFFSTLVGCLLIVMGVIFVTQHINSIKGKPRASSGLR 679


>gi|356496701|ref|XP_003517204.1| PREDICTED: uncharacterized protein LOC100787497 [Glycine max]
          Length = 697

 Score = 1229 bits (3181), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 577/698 (82%), Positives = 637/698 (91%), Gaps = 2/698 (0%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
           MRKRDLAILMLSAFAIFF+LQ +G  SF++AW HL++EYPIK++A+RLPPP+VADLNGDG
Sbjct: 1   MRKRDLAILMLSAFAIFFTLQQDGGISFKDAWMHLTDEYPIKYEAERLPPPLVADLNGDG 60

Query: 61  RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
           +KEVLVATHDAKIQVLEPH+RRVDEGFSEARVLAEVSLLPDK+R+ +GRR VAMATG ID
Sbjct: 61  KKEVLVATHDAKIQVLEPHSRRVDEGFSEARVLAEVSLLPDKVRVMTGRRPVAMATGYID 120

Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
           R Y+ GQP KQVLVVVTSGWSVMCFD NL KLWE NLQEDFP NAHHRE+AISISNYTLK
Sbjct: 121 R-YKIGQPQKQVLVVVTSGWSVMCFDSNLQKLWENNLQEDFPHNAHHREVAISISNYTLK 179

Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
           HGDTGL+IVGGRMEMQPH  MDPFEE+G+  + AEQHRRSA+EKEASENSGTVDLRHFAF
Sbjct: 180 HGDTGLIIVGGRMEMQPHIFMDPFEEMGMGARFAEQHRRSAAEKEASENSGTVDLRHFAF 239

Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
           YAFAGRSG+ RWSRKNENIE   +DASQL+PQHNYKLDVHALN+R PGE+ECREFRES+L
Sbjct: 240 YAFAGRSGVERWSRKNENIEVHSSDASQLLPQHNYKLDVHALNTRQPGEYECREFRESIL 299

Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISNL 360
           GVMPH W RREDTLLKL+HFRRHKRK LKK  GK+ SYPFHKPEE+HPPGKDSTKKISN+
Sbjct: 300 GVMPHQWARREDTLLKLAHFRRHKRKTLKKTPGKAMSYPFHKPEENHPPGKDSTKKISNI 359

Query: 361 IGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHL 420
           IGKAA YAGSAKSKK + Y+PTITNYTQ+WWVPNVVVAHQKEGIEA+HLASGRT+CKLHL
Sbjct: 360 IGKAANYAGSAKSKKHLPYVPTITNYTQVWWVPNVVVAHQKEGIEALHLASGRTICKLHL 419

Query: 421 QEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASI 480
           QEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWA+ATSGVP+REQLFN SI
Sbjct: 420 QEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAIATSGVPIREQLFNVSI 479

Query: 481 CHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEV 540
           CH++ FNLF HGE  R++ + SD+ASLEVATPILIPRSDGHRHRKGSHGDV+FLTNRGE+
Sbjct: 480 CHYTHFNLFQHGELYRSYSQGSDIASLEVATPILIPRSDGHRHRKGSHGDVIFLTNRGEI 539

Query: 541 TAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTE-ASTVVPTLKAFSLRVHDNQQMILA 599
           T+YSPGLHGHDAIWQWQ  T  TWSNLPSPSG+ E    V+PTLK  SLR+HDNQ+MILA
Sbjct: 540 TSYSPGLHGHDAIWQWQQSTGVTWSNLPSPSGVMEGGGLVIPTLKPLSLRLHDNQEMILA 599

Query: 600 GGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPG 659
            G+QEAV+ISPGGS+L +I+LP PPTH L+ EDFSNDGLTD+IL+TSNGVYGFVQTRQPG
Sbjct: 600 AGEQEAVIISPGGSLLATIELPGPPTHVLIAEDFSNDGLTDLILVTSNGVYGFVQTRQPG 659

Query: 660 ALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGLR 697
           ALFFS LVGCLIVVMGVIFVTQHLNS K KPR SSG R
Sbjct: 660 ALFFSMLVGCLIVVMGVIFVTQHLNSTKGKPRPSSGSR 697


>gi|356538246|ref|XP_003537615.1| PREDICTED: uncharacterized protein LOC100789851 [Glycine max]
          Length = 693

 Score = 1216 bits (3145), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 573/697 (82%), Positives = 632/697 (90%), Gaps = 4/697 (0%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
           MRKRDLAILMLSAFAIFF+LQ +G  SF++AW HL++EYPIK++A+RLPPP+VADLNGDG
Sbjct: 1   MRKRDLAILMLSAFAIFFTLQQDGGISFKDAWMHLTDEYPIKYEAERLPPPLVADLNGDG 60

Query: 61  RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
           +KEVLVATHDAKIQVLEPH+RRVDEGFSEARVLAEVSLLPDK+R+ +GRR VAMATG ID
Sbjct: 61  KKEVLVATHDAKIQVLEPHSRRVDEGFSEARVLAEVSLLPDKVRVMTGRRPVAMATGYID 120

Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
           R Y+ GQP KQVLVVVTSGWSVMCFD NL KLWE NLQEDFP NAHHRE+AISISNYTLK
Sbjct: 121 R-YKIGQPQKQVLVVVTSGWSVMCFDSNLQKLWENNLQEDFPHNAHHREVAISISNYTLK 179

Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
           HGDTGL+IVGGRMEMQPH  MDPFEE+G+  + AEQH+RSA+EKEAS   GTVDLRHFAF
Sbjct: 180 HGDTGLIIVGGRMEMQPHIFMDPFEEMGMGARFAEQHQRSAAEKEAS---GTVDLRHFAF 236

Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
           YAFAGRSG  RWSRKNENIEA  +DASQL+PQHNYKLDVHALN+R PGEFECREFRES+L
Sbjct: 237 YAFAGRSGDERWSRKNENIEAHSSDASQLLPQHNYKLDVHALNTRQPGEFECREFRESIL 296

Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISNL 360
           GVMPH W RREDTL KL+HFRRHKRK LKK  GK+ SYPFHKPEE+HPPGKDSTKKISN+
Sbjct: 297 GVMPHQWARREDTLFKLAHFRRHKRKALKKTPGKAISYPFHKPEENHPPGKDSTKKISNI 356

Query: 361 IGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHL 420
           IGKAA+YAGSAKSKK + Y+PTITNYTQ+WWVPNVVV+HQKEGIEA+HLA+GRT+CK HL
Sbjct: 357 IGKAASYAGSAKSKKHLPYVPTITNYTQVWWVPNVVVSHQKEGIEALHLATGRTICKFHL 416

Query: 421 QEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASI 480
           QEGGLHAD+NGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFN SI
Sbjct: 417 QEGGLHADVNGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNVSI 476

Query: 481 CHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEV 540
           CH++ FNLF HGE  R++ + SD ASLEVATPILIPRSDGHRHRKGSHGDV+FLTNRGE+
Sbjct: 477 CHYTHFNLFQHGELYRSYSQGSDTASLEVATPILIPRSDGHRHRKGSHGDVIFLTNRGEI 536

Query: 541 TAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAG 600
           T+YSPGLHGHDAIWQWQ  T  TWSNLPSPSGM E   V+PTLK  SLR+HDNQ+MILA 
Sbjct: 537 TSYSPGLHGHDAIWQWQQSTGVTWSNLPSPSGMMEGGLVIPTLKPLSLRLHDNQEMILAA 596

Query: 601 GDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGA 660
           G+QEAV+ISPGGSIL +I+LP PPTH L+ EDFSNDGLTD+IL+TS+GVYGFVQTRQPGA
Sbjct: 597 GEQEAVIISPGGSILATIELPGPPTHVLITEDFSNDGLTDLILVTSHGVYGFVQTRQPGA 656

Query: 661 LFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGLR 697
           LFFS LVGCLIVVMGVIFVTQHLNS K KPR SSG R
Sbjct: 657 LFFSMLVGCLIVVMGVIFVTQHLNSTKGKPRPSSGPR 693


>gi|449464520|ref|XP_004149977.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101223217 [Cucumis sativus]
          Length = 686

 Score = 1215 bits (3144), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 576/694 (82%), Positives = 632/694 (91%), Gaps = 10/694 (1%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
           MRKRDLAILMLSAFAIFFSLQHEGDFSFREAW HL++EYPIK++ DRLPPP+VADLNGDG
Sbjct: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWMHLTDEYPIKYEGDRLPPPVVADLNGDG 60

Query: 61  RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
           +KEVLVATHDAKI VLEPH+RRVDEGFS ARV          +RI+SGRR VAMATGVID
Sbjct: 61  KKEVLVATHDAKILVLEPHSRRVDEGFSHARV---------XVRISSGRRPVAMATGVID 111

Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
           R  RQGQP+ QVLVVVTSGWSV+CFDHNLNKLWEANLQEDFP NAHHREIAISI+NYTLK
Sbjct: 112 RHPRQGQPVTQVLVVVTSGWSVLCFDHNLNKLWEANLQEDFPHNAHHREIAISITNYTLK 171

Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
           HGD+GL+IVGGRMEMQ H  MDPFEEIG+AEKNAEQHRRSA+EKEASENSG++DLRHFAF
Sbjct: 172 HGDSGLIIVGGRMEMQSHIFMDPFEEIGIAEKNAEQHRRSATEKEASENSGSIDLRHFAF 231

Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
           YAFAGRSGL RWSRKNENIEA  +DASQLIPQHNYKLDVH+LN+RHPGEFECREFRES+L
Sbjct: 232 YAFAGRSGLPRWSRKNENIEAHSSDASQLIPQHNYKLDVHSLNARHPGEFECREFRESIL 291

Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISNL 360
           GVMPHHWDRREDT+L+L+HFRRHKRK LKK  GKS +YPFHKPEE+HPPGKDS+K+I  +
Sbjct: 292 GVMPHHWDRREDTVLELAHFRRHKRKALKKTSGKSVNYPFHKPEENHPPGKDSSKRIPKI 351

Query: 361 IGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHL 420
           IG AA  AGSAK+KKP+ Y+PTITNYT+LWW+PNVVVAHQKEGIEA+HLASGRT+CKLHL
Sbjct: 352 IGTAANIAGSAKTKKPLPYVPTITNYTKLWWLPNVVVAHQKEGIEALHLASGRTICKLHL 411

Query: 421 QEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASI 480
           QEGGLHADINGDGVLDHVQAVGGNGAE+TVVSGSMEV++PCWAVATSGVPVREQLFNASI
Sbjct: 412 QEGGLHADINGDGVLDHVQAVGGNGAERTVVSGSMEVIQPCWAVATSGVPVREQLFNASI 471

Query: 481 CHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEV 540
           CH SPFN F HGE SR FGRT D+ASLEVATPILI R DGHRHRKGSHGDVVFLTNRGEV
Sbjct: 472 CHFSPFNYFQHGELSR-FGRTPDMASLEVATPILISRKDGHRHRKGSHGDVVFLTNRGEV 530

Query: 541 TAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAG 600
           T+YSPGLHGH A WQWQ+ T ATWSNLPSPSGM +A TV+PTLKA  LRV   Q+M+LA 
Sbjct: 531 TSYSPGLHGHGADWQWQITTGATWSNLPSPSGMMDAGTVIPTLKAIDLRVGATQEMVLAA 590

Query: 601 GDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGA 660
           G+QEAVVISPGGS+  SI+LPA PTHAL+ EDFSNDGLTD+IL+TS GVYGFVQTRQPGA
Sbjct: 591 GEQEAVVISPGGSVQASIELPASPTHALITEDFSNDGLTDIILVTSTGVYGFVQTRQPGA 650

Query: 661 LFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASS 694
           LFFSTLVGCLI+VMGVIFVTQHLNS+K KPR S+
Sbjct: 651 LFFSTLVGCLILVMGVIFVTQHLNSIKGKPRPSA 684


>gi|297816364|ref|XP_002876065.1| FG-GAP repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297321903|gb|EFH52324.1| FG-GAP repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 698

 Score = 1205 bits (3117), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 561/698 (80%), Positives = 634/698 (90%), Gaps = 2/698 (0%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
           MRKRDLAILMLS FAIFF+LQHEGDF+F+EAWFHL +EYP+K++ADRLPPPIVADLNGDG
Sbjct: 1   MRKRDLAILMLSGFAIFFTLQHEGDFAFKEAWFHLYDEYPVKYEADRLPPPIVADLNGDG 60

Query: 61  RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
           +KEVLVAT+DAKIQVLEPH+RRVDEGFSEARVLAE+ LLPDKIRIASGRRAVAMATGVID
Sbjct: 61  KKEVLVATNDAKIQVLEPHSRRVDEGFSEARVLAEIPLLPDKIRIASGRRAVAMATGVID 120

Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
           R Y+ G P KQV+VVVTSGWSV+CFDHNL KLWE NLQEDFP NAHHREIAISISNYTLK
Sbjct: 121 RYYKDGTPQKQVVVVVTSGWSVLCFDHNLKKLWETNLQEDFPHNAHHREIAISISNYTLK 180

Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
           HGDTGLVIVGGRMEMQP+  MDPFEE+G+  +NAEQHRRSA+E +ASE+SG ++LRHF+ 
Sbjct: 181 HGDTGLVIVGGRMEMQPYNHMDPFEELGMTAQNAEQHRRSATENQASEDSGAINLRHFSV 240

Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
           YAFAG++GLLRWS+K +++EA  +DASQLIPQHNYKLDVHALNSRHPGEFECREFRES+L
Sbjct: 241 YAFAGKTGLLRWSKKTDDVEAHTSDASQLIPQHNYKLDVHALNSRHPGEFECREFRESIL 300

Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVG-KSTSYPFHKPEEHHPPGKDSTKKISN 359
            VMPHHWDRREDTLLKL+HFRRHKRK LKK  G KST+YPFHKPEEH P GKD ++KI  
Sbjct: 301 SVMPHHWDRREDTLLKLAHFRRHKRKTLKKQAGSKSTAYPFHKPEEHTPAGKDLSRKIPK 360

Query: 360 LIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLH 419
           LIGKAA YAGSAK KK + YIPTITNYT+LWWVPNVVVAHQKEGIEA+HL +GRT+CKL 
Sbjct: 361 LIGKAARYAGSAKPKKGMQYIPTITNYTKLWWVPNVVVAHQKEGIEAIHLPTGRTLCKLS 420

Query: 420 LQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNAS 479
           L EGGLHADINGDGVLDHVQ VGGN  E+TVVSGSMEVL+PCWAVATSGVP+REQLFN S
Sbjct: 421 LLEGGLHADINGDGVLDHVQTVGGNVGERTVVSGSMEVLKPCWAVATSGVPIREQLFNVS 480

Query: 480 ICHHSPFNLFPH-GEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRG 538
           ICHHSPFN   + G++SR+F +  D ++LE+ATPILIPR DGH+HR+GSHGDV+FLTNRG
Sbjct: 481 ICHHSPFNFLHYGGDYSRHFAQARDTSTLEIATPILIPRDDGHKHRRGSHGDVIFLTNRG 540

Query: 539 EVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMIL 598
           EVT+Y+P +HGHDA+WQWQL T+ATWSNLPSPSG+TE+ TVVPTLK FSLR+HDNQ MIL
Sbjct: 541 EVTSYTPDVHGHDAVWQWQLQTEATWSNLPSPSGLTESGTVVPTLKPFSLRIHDNQPMIL 600

Query: 599 AGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQP 658
           AGGDQ AV+ISPGGSIL SI+LP+ PTHAL+ +DFSNDGLTDVI+MTSNGVYGFVQTRQP
Sbjct: 601 AGGDQAAVIISPGGSILASIELPSQPTHALITDDFSNDGLTDVIVMTSNGVYGFVQTRQP 660

Query: 659 GALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGL 696
           GALFFS+LVGCL+VVM VIFVTQHLNS++ KPR SS  
Sbjct: 661 GALFFSSLVGCLLVVMAVIFVTQHLNSIQGKPRPSSSF 698


>gi|22331734|ref|NP_190674.2| FG-GAP repeat-containing protein [Arabidopsis thaliana]
 gi|332645222|gb|AEE78743.1| FG-GAP repeat-containing protein [Arabidopsis thaliana]
          Length = 698

 Score = 1202 bits (3111), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 559/698 (80%), Positives = 634/698 (90%), Gaps = 2/698 (0%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
           MRKRDLAILMLS FAIFF+LQHEGDF+F+EAWFHL +EYP+K++ADRLPPPIVADLNGDG
Sbjct: 1   MRKRDLAILMLSGFAIFFTLQHEGDFAFKEAWFHLYDEYPVKYEADRLPPPIVADLNGDG 60

Query: 61  RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
           +KEVLVAT+DAKIQVLEPH+RRVDEGFSEARVLAE++LLPDKIR+ASGRRAVAMATGVID
Sbjct: 61  KKEVLVATNDAKIQVLEPHSRRVDEGFSEARVLAEITLLPDKIRVASGRRAVAMATGVID 120

Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
           R Y+ G P KQV+VVVTSGWSV+CFDHNL KLWE NLQEDFP NAHHREIAISISNYTLK
Sbjct: 121 RYYKNGTPQKQVVVVVTSGWSVLCFDHNLKKLWETNLQEDFPHNAHHREIAISISNYTLK 180

Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
           HGDTGLVIVGGRMEMQP+  MDPFEE+G+  +NA+QHRRSA+E +ASE+SG ++LRHF+ 
Sbjct: 181 HGDTGLVIVGGRMEMQPYNHMDPFEELGMTAQNADQHRRSATENQASEDSGAINLRHFSV 240

Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
           YAFAG++GLLRWS+K +++EA  +DASQLIPQHNYKLDVHALNSRHPGEFECREFRES+L
Sbjct: 241 YAFAGKTGLLRWSKKTDDVEAHTSDASQLIPQHNYKLDVHALNSRHPGEFECREFRESIL 300

Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVG-KSTSYPFHKPEEHHPPGKDSTKKISN 359
            VMPH WDRREDTLLKL+HFRRHKRK LKK  G KST+YPFHKPEEH P GKD ++KI  
Sbjct: 301 SVMPHRWDRREDTLLKLAHFRRHKRKTLKKQAGSKSTAYPFHKPEEHTPAGKDLSRKIPK 360

Query: 360 LIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLH 419
           LIGKAA YAGSAK KK + YIPTITNYT+LWWVPNVVVAHQKEGIEA+HL +GRT+CKL 
Sbjct: 361 LIGKAARYAGSAKPKKGMQYIPTITNYTKLWWVPNVVVAHQKEGIEAIHLPTGRTLCKLS 420

Query: 420 LQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNAS 479
           L EGGLHADINGDGVLDHVQ VGGN  E+TVVSGSMEVL+PCWAVATSGVP+REQLFN S
Sbjct: 421 LLEGGLHADINGDGVLDHVQTVGGNVGERTVVSGSMEVLKPCWAVATSGVPIREQLFNVS 480

Query: 480 ICHHSPFNLFPH-GEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRG 538
           ICHHSPFN   + G++SR+F +  D ++LE+ATPILIPR DGH+HRKGSHGDV+FLTNRG
Sbjct: 481 ICHHSPFNFLHYGGDYSRHFAQARDTSTLEIATPILIPRDDGHKHRKGSHGDVIFLTNRG 540

Query: 539 EVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMIL 598
           EVT+Y+P +HGHDA+WQWQL T+ATWSNLPSPSG+TE+ TVVPTLK FSLR+HDNQ MIL
Sbjct: 541 EVTSYTPDVHGHDAVWQWQLQTEATWSNLPSPSGLTESGTVVPTLKPFSLRIHDNQPMIL 600

Query: 599 AGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQP 658
           AGGDQ AV+ISPGGSIL SI+LP+ PTHAL+ +DFSNDGLTDVI+MTSNGVYGFVQTRQP
Sbjct: 601 AGGDQAAVIISPGGSILASIELPSQPTHALITDDFSNDGLTDVIVMTSNGVYGFVQTRQP 660

Query: 659 GALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGL 696
           GALFFS+LVGCL+VVM VIFVTQHLNS++ KPR SS  
Sbjct: 661 GALFFSSLVGCLLVVMAVIFVTQHLNSIQGKPRPSSSF 698


>gi|18087608|gb|AAL58934.1|AF462847_1 AT3g51050/F24M12_90 [Arabidopsis thaliana]
 gi|24797020|gb|AAN64522.1| At3g51050/F24M12_90 [Arabidopsis thaliana]
          Length = 698

 Score = 1200 bits (3104), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 558/698 (79%), Positives = 633/698 (90%), Gaps = 2/698 (0%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
           MRKRDLAILMLS FAIFF+LQHEGDF+F+EAWFHL +EYP+K++ADRLPPPIVADLNGDG
Sbjct: 1   MRKRDLAILMLSGFAIFFTLQHEGDFAFKEAWFHLYDEYPVKYEADRLPPPIVADLNGDG 60

Query: 61  RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
           +KEVLVAT+DAKIQVLEPH+RRVDEGFSEARVLAE++LLPDKIR+ASGRRAVAMATGVID
Sbjct: 61  KKEVLVATNDAKIQVLEPHSRRVDEGFSEARVLAEITLLPDKIRVASGRRAVAMATGVID 120

Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
           R Y+ G P KQV+VVVTSGWSV+CFDHNL KLWE NLQEDFP NA HREIAISISNYTLK
Sbjct: 121 RYYKNGTPQKQVVVVVTSGWSVLCFDHNLKKLWETNLQEDFPHNARHREIAISISNYTLK 180

Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
           HGDTGLVIVGGRMEMQP+  MDPFEE+G+  +NA+QHRRSA+E +ASE+SG ++LRHF+ 
Sbjct: 181 HGDTGLVIVGGRMEMQPYNHMDPFEELGMTAQNADQHRRSATENQASEDSGAINLRHFSV 240

Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
           YAFAG++GLLRWS+K +++EA  +DASQLIPQHNYKLDVHALNSRHPGEFECREFRES+L
Sbjct: 241 YAFAGKTGLLRWSKKTDDVEAHTSDASQLIPQHNYKLDVHALNSRHPGEFECREFRESIL 300

Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVG-KSTSYPFHKPEEHHPPGKDSTKKISN 359
            VMPH WDRREDTLLKL+HFRRHKRK LKK  G KST+YPFHKPEEH P GKD ++KI  
Sbjct: 301 SVMPHRWDRREDTLLKLAHFRRHKRKTLKKQAGSKSTAYPFHKPEEHTPAGKDLSRKIPK 360

Query: 360 LIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLH 419
           LIGKAA YAGSAK KK + YIPTITNYT+LWWVPNVVVAHQKEGIEA+HL +GRT+CKL 
Sbjct: 361 LIGKAARYAGSAKPKKGMQYIPTITNYTKLWWVPNVVVAHQKEGIEAIHLPTGRTLCKLS 420

Query: 420 LQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNAS 479
           L EGGLHADINGDGVLDHVQ VGGN  E+TVVSGSMEVL+PCWAVATSGVP+REQLFN S
Sbjct: 421 LLEGGLHADINGDGVLDHVQTVGGNVGERTVVSGSMEVLKPCWAVATSGVPIREQLFNVS 480

Query: 480 ICHHSPFNLFPH-GEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRG 538
           ICHHSPFN   + G++SR+F +  D ++LE+ATPILIPR DGH+HRKGSHGDV+FLTNRG
Sbjct: 481 ICHHSPFNFLHYGGDYSRHFAQARDTSTLEIATPILIPRDDGHKHRKGSHGDVIFLTNRG 540

Query: 539 EVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMIL 598
           EVT+Y+P +HGHDA+WQWQL T+ATWSNLPSPSG+TE+ TVVPTLK FSLR+HDNQ MIL
Sbjct: 541 EVTSYTPDVHGHDAVWQWQLQTEATWSNLPSPSGLTESGTVVPTLKPFSLRIHDNQPMIL 600

Query: 599 AGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQP 658
           AGGDQ AV+ISPGGSIL SI+LP+ PTHAL+ +DFSNDGLTDVI+MTSNGVYGFVQTRQP
Sbjct: 601 AGGDQAAVIISPGGSILASIELPSQPTHALITDDFSNDGLTDVIVMTSNGVYGFVQTRQP 660

Query: 659 GALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGL 696
           GALFFS+LVGCL+VVM VIFVTQHLNS++ KPR SS  
Sbjct: 661 GALFFSSLVGCLLVVMAVIFVTQHLNSIQGKPRPSSSF 698


>gi|449524524|ref|XP_004169272.1| PREDICTED: uncharacterized protein LOC101231345 [Cucumis sativus]
          Length = 664

 Score = 1166 bits (3016), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 551/663 (83%), Positives = 607/663 (91%), Gaps = 2/663 (0%)

Query: 33  FHLSEEYPIKFDADRLPPPIVADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARV 92
            HL++EYPIK++ DRLPPP+VADLNGDG+KEVLVATHDAKI VLEPH+RRVDEGFS ARV
Sbjct: 1   MHLTDEYPIKYEGDRLPPPVVADLNGDGKKEVLVATHDAKILVLEPHSRRVDEGFSHARV 60

Query: 93  LAEVSLLPDKIRIASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKL 152
           L E SLLP K+RI+SGRR VAMATGVIDR  RQGQP+ QVLVVVTSGWSV+CFDHNLNKL
Sbjct: 61  LTEASLLPAKVRISSGRRPVAMATGVIDRHPRQGQPVTQVLVVVTSGWSVLCFDHNLNKL 120

Query: 153 WEANLQEDFPPNAHHREIAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEK 212
           WEANLQEDFP NAHHREIAISI+NYTLKHGD+GL+IVGGRMEMQ H  MDPFEEIG+AEK
Sbjct: 121 WEANLQEDFPHNAHHREIAISITNYTLKHGDSGLIIVGGRMEMQSHIFMDPFEEIGIAEK 180

Query: 213 NAEQHRRSASEKEASENSGTVDLRHFAFYAFAGRSGLLRWSRKNE-NIEAQPTDASQLIP 271
           NAEQHRRSA+EKEASENSG++DLRHFAFYAFAGRSGL RWSRKNE NIEA  +DASQLIP
Sbjct: 181 NAEQHRRSATEKEASENSGSIDLRHFAFYAFAGRSGLPRWSRKNEVNIEAHSSDASQLIP 240

Query: 272 QHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKV 331
           QHNYKLDVH+LN+RHPGEFECREFRES+LGVMPHHWDRREDT+L+L+HFRRHKRK LKK 
Sbjct: 241 QHNYKLDVHSLNARHPGEFECREFRESILGVMPHHWDRREDTVLELAHFRRHKRKALKKT 300

Query: 332 VGKSTSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWW 391
            GKS +YPFHKPEE+HPPGKDS+K+I  +IG AA  AGSAK+KKP+ Y+PTITNYT+LWW
Sbjct: 301 SGKSVNYPFHKPEENHPPGKDSSKRIPKIIGTAANIAGSAKTKKPLPYVPTITNYTKLWW 360

Query: 392 VPNVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVV 451
           +PNVVVAHQKEGIEA+HLASGRT+CKLHLQEGGLHADINGDGVLDHVQAVGGNGAE+TVV
Sbjct: 361 LPNVVVAHQKEGIEALHLASGRTICKLHLQEGGLHADINGDGVLDHVQAVGGNGAERTVV 420

Query: 452 SGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVAT 511
           SGSMEV++PCWAVATSGVPVREQLFNASICH SPFN F HGE SR FGRT D+ASLEVAT
Sbjct: 421 SGSMEVIQPCWAVATSGVPVREQLFNASICHFSPFNYFQHGELSR-FGRTPDMASLEVAT 479

Query: 512 PILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPS 571
           PILI R DGHRHRKGSHGDVVFLTNRGEVT+YSPGLHGH A WQWQ+ T ATWSNLPSPS
Sbjct: 480 PILISRKDGHRHRKGSHGDVVFLTNRGEVTSYSPGLHGHGADWQWQITTGATWSNLPSPS 539

Query: 572 GMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCE 631
           GM +A TV+PTLKA  LRV   Q+M+LA G+QEAVVISPGGS+  SI+LPA PTHAL+ E
Sbjct: 540 GMMDAGTVIPTLKAIDLRVGATQEMVLAAGEQEAVVISPGGSVQASIELPASPTHALITE 599

Query: 632 DFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPR 691
           DFSNDGLTD+IL+TS GVYGFVQTRQPGALFFSTLVGCLI+VMGVIFVTQHLNS+K KPR
Sbjct: 600 DFSNDGLTDIILVTSTGVYGFVQTRQPGALFFSTLVGCLILVMGVIFVTQHLNSIKGKPR 659

Query: 692 ASS 694
            S+
Sbjct: 660 PSA 662


>gi|6562257|emb|CAB62627.1| putative protein [Arabidopsis thaliana]
          Length = 680

 Score = 1155 bits (2989), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 544/698 (77%), Positives = 618/698 (88%), Gaps = 20/698 (2%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
           MRKRDLAILMLS FAIFF+LQHEGDF+F+EAWFHL +EYP+K++ADRLPPPIVADLNGDG
Sbjct: 1   MRKRDLAILMLSGFAIFFTLQHEGDFAFKEAWFHLYDEYPVKYEADRLPPPIVADLNGDG 60

Query: 61  RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
           +KEVLVAT+DAKIQVLEPH+RRVDEGFSEARVLAE++LLPDKIR+ASGRRAVAMATGVID
Sbjct: 61  KKEVLVATNDAKIQVLEPHSRRVDEGFSEARVLAEITLLPDKIRVASGRRAVAMATGVID 120

Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
           R Y+ G P KQV+VVVTSGWSV+CFDHNL KLWE NLQEDFP NAHHREIAISISNYTLK
Sbjct: 121 RYYKNGTPQKQVVVVVTSGWSVLCFDHNLKKLWETNLQEDFPHNAHHREIAISISNYTLK 180

Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
           HGDTGLVIVGGRMEMQP+  MDPFEE+G+  +NA+QHRRSA+E +ASE+SG ++LRHF+ 
Sbjct: 181 HGDTGLVIVGGRMEMQPYNHMDPFEELGMTAQNADQHRRSATENQASEDSGAINLRHFSV 240

Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
           YAFAG++GLLRWS+K +++EA  +DASQLIPQHNYKLDVHALNSRHPGE           
Sbjct: 241 YAFAGKTGLLRWSKKTDDVEAHTSDASQLIPQHNYKLDVHALNSRHPGE----------- 289

Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVG-KSTSYPFHKPEEHHPPGKDSTKKISN 359
                  DRREDTLLKL+HFRRHKRK LKK  G KST+YPFHKPEEH P GKD ++KI  
Sbjct: 290 -------DRREDTLLKLAHFRRHKRKTLKKQAGSKSTAYPFHKPEEHTPAGKDLSRKIPK 342

Query: 360 LIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLH 419
           LIGKAA YAGSAK KK + YIPTITNYT+LWWVPNVVVAHQKEGIEA+HL +GRT+CKL 
Sbjct: 343 LIGKAARYAGSAKPKKGMQYIPTITNYTKLWWVPNVVVAHQKEGIEAIHLPTGRTLCKLS 402

Query: 420 LQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNAS 479
           L EGGLHADINGDGVLDHVQ VGGN  E+TVVSGSMEVL+PCWAVATSGVP+REQLFN S
Sbjct: 403 LLEGGLHADINGDGVLDHVQTVGGNVGERTVVSGSMEVLKPCWAVATSGVPIREQLFNVS 462

Query: 480 ICHHSPFNLFPH-GEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRG 538
           ICHHSPFN   + G++SR+F +  D ++LE+ATPILIPR DGH+HRKGSHGDV+FLTNRG
Sbjct: 463 ICHHSPFNFLHYGGDYSRHFAQARDTSTLEIATPILIPRDDGHKHRKGSHGDVIFLTNRG 522

Query: 539 EVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMIL 598
           EVT+Y+P +HGHDA+WQWQL T+ATWSNLPSPSG+TE+ TVVPTLK FSLR+HDNQ MIL
Sbjct: 523 EVTSYTPDVHGHDAVWQWQLQTEATWSNLPSPSGLTESGTVVPTLKPFSLRIHDNQPMIL 582

Query: 599 AGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQP 658
           AGGDQ AV+ISPGGSIL SI+LP+ PTHAL+ +DFSNDGLTDVI+MTSNGVYGFVQTRQP
Sbjct: 583 AGGDQAAVIISPGGSILASIELPSQPTHALITDDFSNDGLTDVIVMTSNGVYGFVQTRQP 642

Query: 659 GALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGL 696
           GALFFS+LVGCL+VVM VIFVTQHLNS++ KPR SS  
Sbjct: 643 GALFFSSLVGCLLVVMAVIFVTQHLNSIQGKPRPSSSF 680


>gi|326530844|dbj|BAK01220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 704

 Score = 1088 bits (2815), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 523/707 (73%), Positives = 611/707 (86%), Gaps = 13/707 (1%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEE-YPIKFDADRLPPPIVADLNGD 59
           MRKRDL IL+L+AFA+FFSLQHEGDFSFREAW+HLS+E YPIK DADRLPPP+VADLNGD
Sbjct: 1   MRKRDLGILLLAAFAVFFSLQHEGDFSFREAWYHLSDEGYPIKHDADRLPPPLVADLNGD 60

Query: 60  GRKEVLVATHDAKIQVLE--PHARRVD----EGFSEARVLAEVSLLPDKIRIASGRRAVA 113
           GR+EVL+ THDAKIQVL+   HAR       + F EARV+AE+SLLP  +R+A+GRR VA
Sbjct: 61  GRREVLLPTHDAKIQVLQLPAHARLAAADTLDDFHEARVMAEISLLPANVRVAAGRRPVA 120

Query: 114 MATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAIS 173
           MA G +DR+Y+Q    KQVLVVVTSGW+VMCFDHNL KLWE +L +DFP  AHHRE+A+S
Sbjct: 121 MAVGTVDRSYKQADVRKQVLVVVTSGWAVMCFDHNLKKLWEVSLGDDFPHTAHHREVAVS 180

Query: 174 ISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTV 233
           ++NYTLKHGD GLVIVGGRMEMQ H+  D F++   ++ ++E+HRRSA+EK+ASE +G V
Sbjct: 181 VTNYTLKHGDAGLVIVGGRMEMQHHS-ADLFDDFMTSQHSSEEHRRSATEKQASE-AGNV 238

Query: 234 DLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECR 293
           D+RHFA YAF+GR+G LRWSRKNENI+AQP+DAS +IPQHNYKLDVH+LN+RHPGE+ECR
Sbjct: 239 DVRHFALYAFSGRTGTLRWSRKNENIQAQPSDASAMIPQHNYKLDVHSLNNRHPGEYECR 298

Query: 294 EFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDS 353
           +FRES+LGVMPHHWDRREDT L+L+HFR+HKRK LKK  GKS     HKP EH+P GKD 
Sbjct: 299 QFRESILGVMPHHWDRREDTSLQLAHFRKHKRKELKKTQGKSVVNNVHKPIEHNPLGKDD 358

Query: 354 TKKISNLIGKAATYAGSAKSKKPVN--YIPTITNYTQLWWVPNVVVAHQKEGIEAVHLAS 411
           T +IS  IGKAA  AGSAK KK ++  YIPTITNYTQ+WWVPNVVVAH+KEGIEA+HLAS
Sbjct: 359 TNRISKAIGKAADLAGSAKGKKSLHTLYIPTITNYTQVWWVPNVVVAHEKEGIEAIHLAS 418

Query: 412 GRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPV 471
           GRT+CKLHL EGGLHADINGDGVLDHVQ VGGNGAEQTVVSGSMEVL+PCWAVATSGVPV
Sbjct: 419 GRTLCKLHLTEGGLHADINGDGVLDHVQVVGGNGAEQTVVSGSMEVLKPCWAVATSGVPV 478

Query: 472 REQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDV 531
           REQLFN SICH++  NLF HG+FSR+FGRT D + LEVATP+L+ R DGH+HR+GSHGD+
Sbjct: 479 REQLFNVSICHYNHLNLFHHGDFSRSFGRTFDPSGLEVATPVLLQRDDGHKHRRGSHGDI 538

Query: 532 VFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVH 591
           +FLT+RGEVT+Y+PGL GHDA+W+WQL T ATWSNLPSPSGM E + VVPTLK FSLR +
Sbjct: 539 IFLTSRGEVTSYTPGLLGHDAMWRWQLSTGATWSNLPSPSGMME-NVVVPTLKTFSLRAY 597

Query: 592 DNQQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYG 651
           D +Q+I+AGGDQEAVVISP GS+LTSIDLPAPPTHAL+ EDFS DGLTD+IL+TS GVYG
Sbjct: 598 DPKQVIIAGGDQEAVVISPDGSLLTSIDLPAPPTHALILEDFSGDGLTDIILVTSGGVYG 657

Query: 652 FVQTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVK-AKPRASSGLR 697
           FVQTRQPGALFFSTLVG LIVV+GVIFV+ HLNS    KPR+S+G R
Sbjct: 658 FVQTRQPGALFFSTLVGFLIVVIGVIFVSLHLNSSNGGKPRSSTGYR 704


>gi|357140045|ref|XP_003571583.1| PREDICTED: uncharacterized protein LOC100827170 [Brachypodium
           distachyon]
          Length = 701

 Score = 1087 bits (2812), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 524/704 (74%), Positives = 608/704 (86%), Gaps = 10/704 (1%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEE-YPIKFDADRLPPPIVADLNGD 59
           MRKRDL IL+L+AFAIFFSLQHEGDFSFREAW+HLS++ YPIK+DADRLPPP+VADLNGD
Sbjct: 1   MRKRDLGILLLAAFAIFFSLQHEGDFSFREAWYHLSDDGYPIKYDADRLPPPLVADLNGD 60

Query: 60  GRKEVLVATHDAKIQVLEP-HARRVD--EGFSEARVLAEVSLLPDKIRIASGRRAVAMAT 116
           GR E+L+ THDAKIQVL+P HAR     + F EAR++AE+SLLP  +R++SGRR VAMA 
Sbjct: 61  GRPEILLPTHDAKIQVLQPPHARPASGFDDFHEARLMAEISLLPTNVRVSSGRRPVAMAV 120

Query: 117 GVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISN 176
           G +DR+Y+     KQVLVVVTSGW+VMCFDHNLNKLWEANLQ+DFP  AHHRE+AISISN
Sbjct: 121 GAVDRSYKLADVRKQVLVVVTSGWAVMCFDHNLNKLWEANLQDDFPHAAHHREVAISISN 180

Query: 177 YTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLR 236
           YT+KHGD GLVIVGGRMEMQ H+  D F++   +E + E+HRRSASEK+ASE +G VD+R
Sbjct: 181 YTIKHGDAGLVIVGGRMEMQHHS-ADLFDDFMTSEHSREEHRRSASEKQASE-AGNVDVR 238

Query: 237 HFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFR 296
           HFA YAFAGR+G LRWSRKNENI++QP+DAS LIPQHNYKLDVH+LNSRHPG+FECREFR
Sbjct: 239 HFALYAFAGRTGALRWSRKNENIQSQPSDASALIPQHNYKLDVHSLNSRHPGQFECREFR 298

Query: 297 ESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKK 356
           ESVLGVMPHHWDRREDT L+L++FRRHKRK LKK  GK+     HKP EH+P GKD T +
Sbjct: 299 ESVLGVMPHHWDRREDTSLQLANFRRHKRKQLKKTPGKNAVNNVHKPSEHNPAGKDDTNR 358

Query: 357 ISNLIGKAATYAGSAKSKKPVN--YIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRT 414
           +S  IGKAA  AGSAK KK  +  ++PTITNYTQ+WWVPNVVVAH+KEGIEA+HLASGRT
Sbjct: 359 LSKAIGKAAELAGSAKGKKSQHTPFVPTITNYTQVWWVPNVVVAHEKEGIEAIHLASGRT 418

Query: 415 VCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQ 474
           +CKLHL EGGLHAD+NGDGVLDHVQ VG NG EQTVVSGSMEVL+PCWAVATSGVPVREQ
Sbjct: 419 ICKLHLTEGGLHADVNGDGVLDHVQVVGANGIEQTVVSGSMEVLKPCWAVATSGVPVREQ 478

Query: 475 LFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFL 534
           LFN SICH++ +NLF HG+FS++FGR  D   LEVATPIL+ R DGH+HR+GSHGD++FL
Sbjct: 479 LFNVSICHYNHYNLFHHGDFSKSFGRPFDPTGLEVATPILLQRDDGHKHRRGSHGDIIFL 538

Query: 535 TNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQ 594
           T+RGE T+YSPGL GHDA W+WQL T ATWSNLPSPSGM E + VVPTLKAFSLR +D +
Sbjct: 539 TSRGEATSYSPGLLGHDATWRWQLSTGATWSNLPSPSGMME-NIVVPTLKAFSLRAYDPK 597

Query: 595 QMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQ 654
           Q+I+AGGDQEAVVISP GS+L SI+LPAPPTHAL+ EDFS DGLTD+IL+TS GVYGFVQ
Sbjct: 598 QVIIAGGDQEAVVISPSGSLLASIELPAPPTHALILEDFSGDGLTDIILVTSGGVYGFVQ 657

Query: 655 TRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVKA-KPRASSGLR 697
           TRQPGALFFSTLVGCLIVV+GVIFV+ HLNS  + KPR+S+  R
Sbjct: 658 TRQPGALFFSTLVGCLIVVIGVIFVSLHLNSSNSGKPRSSTEYR 701


>gi|115443607|ref|NP_001045583.1| Os02g0100700 [Oryza sativa Japonica Group]
 gi|41053219|dbj|BAD08180.1| unknown protein [Oryza sativa Japonica Group]
 gi|113535114|dbj|BAF07497.1| Os02g0100700 [Oryza sativa Japonica Group]
 gi|215697180|dbj|BAG91174.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218189856|gb|EEC72283.1| hypothetical protein OsI_05449 [Oryza sativa Indica Group]
 gi|222621985|gb|EEE56117.1| hypothetical protein OsJ_04982 [Oryza sativa Japonica Group]
          Length = 701

 Score = 1082 bits (2797), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 525/704 (74%), Positives = 600/704 (85%), Gaps = 10/704 (1%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSE-EYPIKFDADRLPPPIVADLNGD 59
           MRKRDL IL+L+AFA+FFSLQH+GD SFREAW+HLS+ +YPIK DADRLP P+VADLNGD
Sbjct: 1   MRKRDLGILLLAAFAVFFSLQHDGDLSFREAWYHLSDADYPIKHDADRLPSPLVADLNGD 60

Query: 60  GRKEVLVATHDAKIQVLEPHARRV--DEGFSEARVLAEVSLLPDKIRIASGRRAVAMATG 117
           G+ EVL+ THDAKIQVL+PH R    D  F +AR++A+VSLLP  +R++SGRR VAMA G
Sbjct: 61  GKPEVLIPTHDAKIQVLQPHPRPSPDDASFHDARLMADVSLLPSNVRLSSGRRPVAMAVG 120

Query: 118 VIDRTYRQG-QPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISN 176
            +DR Y     P KQ+LVVVTSGWSVMCFDHNL KLWEANLQ+DFP  AHHRE+AISI+N
Sbjct: 121 TVDRHYAHAPSPSKQLLVVVTSGWSVMCFDHNLKKLWEANLQDDFPHAAHHREVAISITN 180

Query: 177 YTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLR 236
           YTLKHGD GLVIVGGRMEMQ H+  + F+E  ++E N E+HRRSASEK+ASE +G  DLR
Sbjct: 181 YTLKHGDAGLVIVGGRMEMQHHS-AELFDEFMVSEHNREEHRRSASEKQASE-TGNTDLR 238

Query: 237 HFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFR 296
           HFA YAFAGR+G LRWSRKNENI +QP+DAS LIPQHNYKLD HALNSRHPG+FECREFR
Sbjct: 239 HFALYAFAGRTGELRWSRKNENIPSQPSDASVLIPQHNYKLDAHALNSRHPGQFECREFR 298

Query: 297 ESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKK 356
           ESVLGVMPHHWDRREDT L+L+HFRRHKRK LKK  GK+     HKP EH+PPGKD + +
Sbjct: 299 ESVLGVMPHHWDRREDTFLQLAHFRRHKRKALKKTPGKAVVNNVHKPSEHNPPGKDVSNR 358

Query: 357 ISNLIGKAATYAGSAKSKKPVN--YIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRT 414
           ++N+IGKAA  A S K KK     Y+PTITNYTQ+WWVPNVVVAH+KEGIEAVHLASGRT
Sbjct: 359 LANVIGKAADMANSNKIKKSQRTLYVPTITNYTQVWWVPNVVVAHEKEGIEAVHLASGRT 418

Query: 415 VCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQ 474
           +CKLHL EGGLHADINGDGVLDHVQ VG NG EQTVVSGSMEVL+PCWAVATSGVPVREQ
Sbjct: 419 ICKLHLTEGGLHADINGDGVLDHVQVVGANGIEQTVVSGSMEVLKPCWAVATSGVPVREQ 478

Query: 475 LFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFL 534
           LFN SICH++ FNLF HG+FSR+FGRT D   LEVATPIL+ R DGH+HR+GSHGD++FL
Sbjct: 479 LFNVSICHYNNFNLFHHGDFSRSFGRTFDTTGLEVATPILLQRDDGHKHRRGSHGDIIFL 538

Query: 535 TNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQ 594
           T+RGEVT+YSPGL GHDAIW+WQL T ATWSNLPSPSGM E + VVPTLKAFSLR +D +
Sbjct: 539 TSRGEVTSYSPGLLGHDAIWRWQLSTGATWSNLPSPSGMME-NIVVPTLKAFSLRAYDPK 597

Query: 595 QMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQ 654
           Q+I+AGGD EAVVISP G +L SI+LPAPPTHALV EDF+ DGLTD+IL+TS GVYGFVQ
Sbjct: 598 QVIIAGGDLEAVVISPSGGLLASIELPAPPTHALVLEDFNGDGLTDIILVTSGGVYGFVQ 657

Query: 655 TRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVKA-KPRASSGLR 697
           TR PGALFFSTLVGCLIVV+GVIFV+ HLNS  + KPRAS+  R
Sbjct: 658 TRHPGALFFSTLVGCLIVVIGVIFVSLHLNSSNSGKPRASTDYR 701


>gi|293337243|ref|NP_001169290.1| uncharacterized protein LOC100383154 precursor [Zea mays]
 gi|224028447|gb|ACN33299.1| unknown [Zea mays]
          Length = 703

 Score = 1030 bits (2664), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 506/702 (72%), Positives = 600/702 (85%), Gaps = 11/702 (1%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEE-YPIKFDADRLPPPIVADLNGD 59
           MRKRDL IL+L+AFAIFFSL HEGDFSFRE+W+HL++E +PIK++ADRLPPP+VADLNGD
Sbjct: 1   MRKRDLGILLLAAFAIFFSLHHEGDFSFRESWYHLTDEDFPIKYEADRLPPPLVADLNGD 60

Query: 60  GRKEVLVATHDAKIQVLEP-HARRV--DEGFSEARVLAEVSLLPDKIRIASGRRAVAMAT 116
           G+ EVL+ THDAKIQVL+P HAR +  D  F EARV+A++SLLPD + +ASGRR +AMA 
Sbjct: 61  GKPEVLLPTHDAKIQVLQPPHARHLNDDSAFQEARVMADISLLPDNVLLASGRRPIAMAV 120

Query: 117 GVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISN 176
           G +DR+YR G+  KQVLVVVTSGWSVMCFDHNL KLWE NLQ+DFP  AHHRE+AISI+N
Sbjct: 121 GNVDRSYRPGEVRKQVLVVVTSGWSVMCFDHNLKKLWEHNLQDDFPHAAHHREVAISITN 180

Query: 177 YTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLR 236
           YTLKHGD GLVIVGGRMEMQ H+  D F+E  + E N +  RRSASEK+ SE +G  DLR
Sbjct: 181 YTLKHGDAGLVIVGGRMEMQHHS-ADLFDEFMIPEHNMDDRRRSASEKQGSE-AGNADLR 238

Query: 237 HFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFR 296
           HFA YAFAGRSG  RWSRKNENI++QP+DAS ++PQHNYKLDVHALNS  PG+FECREFR
Sbjct: 239 HFALYAFAGRSGDRRWSRKNENIQSQPSDASVMLPQHNYKLDVHALNSHQPGQFECREFR 298

Query: 297 ESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKK 356
           ES+LG+MPHHWDRREDT L+L+HFR+HKRK +K+  GK+     +KP EH+PPGKD++ +
Sbjct: 299 ESILGIMPHHWDRREDTTLQLAHFRKHKRKQVKRTPGKAVINSVNKPIEHNPPGKDASNR 358

Query: 357 ISNLIGKAATYAGSAKSKKP--VNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRT 414
           I+  +GKAA  A S K++K   + Y+PTITN+TQ+WWVPNVVV H+KEGIE VHLASGRT
Sbjct: 359 IARALGKAADMANSNKARKAQRMQYVPTITNHTQVWWVPNVVVVHEKEGIEVVHLASGRT 418

Query: 415 VCKLHLQEGGLHADINGDGVLDHVQAVGGNG-AEQTVVSGSMEVLRPCWAVATSGVPVRE 473
           +CKLHL EGGLHADINGDGVLDHVQ VGGNG  EQTVVSGSMEVL+PCWAVATSGVPVRE
Sbjct: 419 ICKLHLNEGGLHADINGDGVLDHVQVVGGNGIKEQTVVSGSMEVLKPCWAVATSGVPVRE 478

Query: 474 QLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVF 533
           QLFN SICH++ FNLF HG+FSR+FGRT D A LEVATPIL+   DGH+HR+GSHGD++F
Sbjct: 479 QLFNVSICHYNHFNLFHHGDFSRSFGRTFDTAGLEVATPILVQTDDGHKHRRGSHGDIIF 538

Query: 534 LTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDN 593
           LT+RGEVT+YSPGL GHDA+W+WQ+ + ATWSNLPSPSGM E + VVPTLKAFSLR +D 
Sbjct: 539 LTSRGEVTSYSPGLLGHDAVWRWQVSSGATWSNLPSPSGMME-NIVVPTLKAFSLRAYDP 597

Query: 594 QQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFV 653
           +++I+AGGDQEAVV+SP G IL  I+LPAPPTHA + EDFS DGLTD+I++TS G+YGFV
Sbjct: 598 KEVIIAGGDQEAVVLSPSGGILAMIELPAPPTHAHLLEDFSGDGLTDMIVVTSGGIYGFV 657

Query: 654 QTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVK-AKPRASS 694
           QTRQPGALFFSTLVGCLIVV+GV+FV+ HL+S   AKPRASS
Sbjct: 658 QTRQPGALFFSTLVGCLIVVIGVVFVSLHLSSSNAAKPRASS 699


>gi|167997045|ref|XP_001751229.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162697210|gb|EDQ83546.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 706

 Score =  806 bits (2081), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 402/700 (57%), Positives = 508/700 (72%), Gaps = 11/700 (1%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
           MRKRDL IL+LSA +IF+SLQ+EG  SFR AW H     PIK +A+RLPPP+VADLNGDG
Sbjct: 1   MRKRDLGILLLSALSIFYSLQNEGSLSFRPAWVHYKNSSPIKHEAERLPPPVVADLNGDG 60

Query: 61  RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
             EV+VA+ D  +QVL+P     + GFS+A VLAEVSLLPD++R+++GRR VA+A G + 
Sbjct: 61  HVEVVVASED-NLQVLDPRVSFAENGFSQAGVLAEVSLLPDRVRVSAGRRPVALAAGEVK 119

Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
           R YR     K+V+VVVT+GWS+MCFDHNL KLWE ++Q+DF   +HH+E+AISISNYTLK
Sbjct: 120 RFYRAQDLKKKVIVVVTAGWSIMCFDHNLKKLWEDDVQDDFLHGSHHKEVAISISNYTLK 179

Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
           H DTGL+IVGG ME+QP   +DPFEE  LAEK  E HRR+A  KE + ++G    RHF++
Sbjct: 180 HRDTGLIIVGGSMEVQPQMHLDPFEEEWLAEKMFESHRRAAGAKEVACSTGWQQFRHFSY 239

Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
           YA+AG SG  RW  ++E+ + + T+A  L PQHNYKLD  +L +RH GE ECREFRESVL
Sbjct: 240 YAYAGMSGTRRWVHRSEDFQ-RATNAGALQPQHNYKLDASSLATRHLGEVECREFRESVL 298

Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDS-TKKISN 359
            VMPH W+ REDT  +L+HFR+H+RK++KK+ GK  S P  KP +   PGKD+    I+ 
Sbjct: 299 DVMPHRWEHREDTRFELAHFRKHRRKLVKKMQGKGGSIPSEKPADKSAPGKDAHGNPIAK 358

Query: 360 LIGKAATYAGSAKSKK-PVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKL 418
           ++GKAA  A  AK KK    Y+P ITN+T  WWVPNVVVAH KEGIEAVHLA+GRTVCKL
Sbjct: 359 VVGKAADLAVGAKVKKQQFQYVPMITNHTSFWWVPNVVVAHLKEGIEAVHLATGRTVCKL 418

Query: 419 HLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNA 478
            L EGGLHAD+NGDGVLDHVQAVGG+G+ + V +G  E L+PCWA+ATSGVPVREQLFN 
Sbjct: 419 FLPEGGLHADVNGDGVLDHVQAVGGHGSARIVPTGMTEALKPCWAIATSGVPVREQLFNG 478

Query: 479 SICHHSPFNLFPHGEFSRNFGRTSDVAS----LEVATPILIPRSDGHRHRKGSHGDVVFL 534
           ++C HSPF +F + EFS  FGR          +EV  PI++P  DG RHRKGSHGDVVFL
Sbjct: 479 TVCRHSPFQIFRYHEFSGEFGRRPHPGDAQEFVEVLAPIILPHPDGQRHRKGSHGDVVFL 538

Query: 535 TNRGEVTAYS--PGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHD 592
            +RGEVT++S      G +A  +WQ+ T A W+  P   G+     V+ TL A  LR + 
Sbjct: 539 NSRGEVTSFSVLGPKRGQEAQHRWQVATTAFWTTSPDLEGVG-TDRVISTLTALPLRKNG 597

Query: 593 NQQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGF 652
               ILA G+ EAVV+SP GS +  + LP+  +  ++ +DFS D L D+IL+T +G++GF
Sbjct: 598 EVDAILAVGELEAVVLSPKGSHVAQVPLPSAGSAPIIYDDFSGDNLNDLILVTKDGIFGF 657

Query: 653 VQTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRA 692
           VQTRQPGA+ F TL+G LI+VMGVIFVTQHL S K K +A
Sbjct: 658 VQTRQPGAILFLTLMGVLILVMGVIFVTQHLGSGKGKSKA 697


>gi|302768899|ref|XP_002967869.1| hypothetical protein SELMODRAFT_88168 [Selaginella moellendorffii]
 gi|300164607|gb|EFJ31216.1| hypothetical protein SELMODRAFT_88168 [Selaginella moellendorffii]
          Length = 685

 Score =  785 bits (2027), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/697 (55%), Positives = 505/697 (72%), Gaps = 18/697 (2%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEE-YPIKFDADRLPPPIVADLNGD 59
           MRKRDL IL++SAFAIFFSLQ+EG+FSF+E+WFH+ +E YPIK +A+RLPPP+V DLN D
Sbjct: 1   MRKRDLGILIISAFAIFFSLQNEGNFSFKESWFHVVDEAYPIKHEAERLPPPLVTDLNAD 60

Query: 60  GRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVI 119
           GR EV+VATHD+K+ VL+PH++  ++GFSEAR + EVSLLP+K+RI++GRR VAMA GVI
Sbjct: 61  GRNEVIVATHDSKLLVLDPHSKLTEDGFSEARTITEVSLLPEKVRISAGRRPVAMAAGVI 120

Query: 120 DRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTL 179
            R+Y +G+P KQVLVVVT+GW +MCFDHNL KLWE+++Q+DFP  A H+E++ISISN+TL
Sbjct: 121 QRSYNEGKPRKQVLVVVTAGWVIMCFDHNLKKLWESSVQDDFPHGAFHKEVSISISNFTL 180

Query: 180 KHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFA 239
           +HGDTGL+IVGG ME Q    +DPFEE  LAEK  E+HR +A  KE SE+       HF+
Sbjct: 181 RHGDTGLIIVGGSMEAQSQVYLDPFEEELLAEKEEERHRHAADAKEGSEDVNAGKTHHFS 240

Query: 240 FYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESV 299
           +YAFAG +G  RW  K ++     + A++L PQHNY+LD  +L  R   E  CR++R+S+
Sbjct: 241 YYAFAGMTGERRWEHKMQDFHRDLSSAAELTPQHNYRLDAASLQGRQLDEVHCRDYRQSI 300

Query: 300 LGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTS-YPFHKPEEHHPPGKDSTKKIS 358
           L V+PH W  R+DT  +L HF +H+RK  KK  G +   YP  +P + H PGKD + K++
Sbjct: 301 LQVLPHRWQSRDDTRFQLVHFVKHERKHHKKQPGSNRERYPLKRPNDPHVPGKDPSNKVA 360

Query: 359 NLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKL 418
             +GKAA  A SA+ K    YIP ITN +  WW+PNVVVAH KEGIEAVHLASGRTVCKL
Sbjct: 361 QALGKAAKLATSARPKTRYAYIPVITNQSSYWWLPNVVVAHLKEGIEAVHLASGRTVCKL 420

Query: 419 HLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNA 478
            L EGGLHADINGDGVLDHVQAVGG+G    V +G+ME ++PCWAVA+SG  VR+QLFN 
Sbjct: 421 WLHEGGLHADINGDGVLDHVQAVGGSGTH--VAAGTMEAIKPCWAVASSGTLVRQQLFNG 478

Query: 479 SICHHSPFNLFPHGEF-SRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNR 537
           SIC +S +  F  GEF SR FGR  D + +EV  P+   R  G       +GDV+FLT+R
Sbjct: 479 SICRNSGYGAFQQGEFMSRTFGRNLDASPIEVVPPVFFLRPTG-------YGDVIFLTSR 531

Query: 538 GEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMI 597
           GEVT+Y+P   G+    +WQ++T A WS+ P  S        VPTL+  +LR +   ++I
Sbjct: 532 GEVTSYTPA--GNQ---RWQIVTGAIWSSKPMLSSFG-VDRAVPTLEVMALRKNGRGEVI 585

Query: 598 LAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQ 657
           LA G+QEAV+IS  G  + ++ LP+PP   +V  DFS DG  DVI++T++G+YGFVQT+ 
Sbjct: 586 LAAGEQEAVLISSSGHQIATLALPSPPARPIVVADFSGDGFNDVIVVTASGIYGFVQTQH 645

Query: 658 PGALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASS 694
            GA+FFS LVGCLI+ M VIFV QHL + K KP+  S
Sbjct: 646 QGAVFFSFLVGCLIIAMCVIFVMQHLQAPKGKPKKKS 682


>gi|302799784|ref|XP_002981650.1| hypothetical protein SELMODRAFT_115119 [Selaginella moellendorffii]
 gi|300150482|gb|EFJ17132.1| hypothetical protein SELMODRAFT_115119 [Selaginella moellendorffii]
          Length = 685

 Score =  785 bits (2027), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/697 (55%), Positives = 504/697 (72%), Gaps = 18/697 (2%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEE-YPIKFDADRLPPPIVADLNGD 59
           MRKRDL IL++SAFAIFFSLQ+EG+FSF+E+WFH+ +E YPIK +A+RLPPP+V DLN D
Sbjct: 1   MRKRDLGILIISAFAIFFSLQNEGNFSFKESWFHVVDEAYPIKHEAERLPPPLVTDLNAD 60

Query: 60  GRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVI 119
           GR EV+VATHD+K+ VL+PH++  ++GFSEAR + EVSLLP+K+RI++GRR VAMA GVI
Sbjct: 61  GRNEVIVATHDSKLLVLDPHSKLNEDGFSEARTVTEVSLLPEKVRISAGRRPVAMAAGVI 120

Query: 120 DRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTL 179
            R+Y +G+P KQVLVVVT+GW +MCFDHNL KLWE+++Q+DFP  A H+E++ISISN+TL
Sbjct: 121 QRSYNEGKPRKQVLVVVTAGWVIMCFDHNLKKLWESSVQDDFPHGAFHKEVSISISNFTL 180

Query: 180 KHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFA 239
           +HGDTGL+IVGG ME Q    +DPFEE  LAEK  E+HR +A  KE SE+       HF+
Sbjct: 181 RHGDTGLIIVGGSMEAQSQVYLDPFEEELLAEKEEERHRHAADAKEGSEDVNAGKTHHFS 240

Query: 240 FYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESV 299
           +YAFAG +G  RW  K ++     + A++L PQHNY+LD  +L  R   E  CR++R+S+
Sbjct: 241 YYAFAGMTGERRWEHKMQDFHRDLSSAAELTPQHNYRLDAASLQGRQLDEVHCRDYRQSI 300

Query: 300 LGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTS-YPFHKPEEHHPPGKDSTKKIS 358
           L V+PH W  R+DT  +L HF +H+RK  KK  G +   YP  +P + H PGKD + K++
Sbjct: 301 LQVLPHRWQSRDDTRFQLVHFVKHERKHHKKQPGSNRERYPLKRPNDPHVPGKDPSNKVA 360

Query: 359 NLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKL 418
             +GKAA  A SA+ K    YIP ITN +  WW+PNVVVAH KEGIEAVHLASGRTVCKL
Sbjct: 361 QALGKAAKLATSARPKTRYAYIPVITNQSSYWWLPNVVVAHLKEGIEAVHLASGRTVCKL 420

Query: 419 HLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNA 478
            LQEGGLHADINGDGVLDHVQAVGG+G    V  G+ME ++PCWAVA+SG  VR+QLFN 
Sbjct: 421 WLQEGGLHADINGDGVLDHVQAVGGSGTH--VAPGTMEAIKPCWAVASSGTLVRQQLFNG 478

Query: 479 SICHHSPFNLFPHGEF-SRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNR 537
           SIC +S +  F  GEF SR FGR  D + +EV  P+   R  G       +GDV+FLT+R
Sbjct: 479 SICRNSGYGAFQQGEFMSRTFGRNLDASPIEVVPPVFFLRPTG-------YGDVIFLTSR 531

Query: 538 GEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMI 597
           GEVT+Y+P  +      +WQ++T A WS+ P  S        VPTL+  +LR +   ++I
Sbjct: 532 GEVTSYTPAGNQ-----RWQIVTGAIWSSKPMLSSFG-VDRAVPTLEVMALRKNGRGEVI 585

Query: 598 LAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQ 657
           LA G+QEAV+IS  G  + ++ LP+PP   +V  DFS DG  DVI++T++G+YGFVQT+ 
Sbjct: 586 LAAGEQEAVLISSSGHQIATLALPSPPARPIVVADFSGDGFNDVIVVTASGIYGFVQTQH 645

Query: 658 PGALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASS 694
            GA+FFS LVGCLI+ M VIFV QHL + K KP+  S
Sbjct: 646 QGAVFFSFLVGCLIIAMCVIFVMQHLQAPKGKPKKKS 682


>gi|168002559|ref|XP_001753981.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694957|gb|EDQ81303.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 721

 Score =  740 bits (1910), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 388/718 (54%), Positives = 497/718 (69%), Gaps = 30/718 (4%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
           MRKRDL IL+LSA AIFFSLQHEG  SFR+AW H  +  PI+  A+RLPPP+VADLNGDG
Sbjct: 1   MRKRDLGILLLSALAIFFSLQHEGSLSFRQAWVHYKDGSPIQHVAERLPPPVVADLNGDG 60

Query: 61  RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
             EV+VA  D K+QV++P      +GF+EARVLAEVSLLPD++R+ +GRR VA+A G + 
Sbjct: 61  HVEVVVALED-KLQVIDPRVSFASDGFAEARVLAEVSLLPDRVRVGAGRRPVALAAGEVK 119

Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
           RTYR  +  KQV+VVVT+GW++MCFDHNL K+WE ++Q+DFP  +HH+E+AISISNYTLK
Sbjct: 120 RTYRSLERKKQVIVVVTAGWNIMCFDHNLKKIWEDDVQDDFPHGSHHKEVAISISNYTLK 179

Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENS----GTVDLR 236
           HGDTGLVIVGG ME+QP   +D FEE  LAEK  E H+ SA   E         G    R
Sbjct: 180 HGDTGLVIVGGSMEVQPQAHLDLFEEELLAEKATEVHQVSAVGIEVGCTGPLGRGRGCNR 239

Query: 237 HFAFYAFAGRSGLLR---WSR-----------KNENIEAQPTDASQLIPQHNYKLDVHAL 282
              ++   G S LL    W+            K+++ +  P + + L PQHNY+LD ++L
Sbjct: 240 EDGWWK--GASFLLLCICWNDWRPSLDSPKRGKDDDFQRAPGNGT-LQPQHNYRLDANSL 296

Query: 283 NSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHK 342
            +RH GE ECREFRESVLGVMPH W+ REDT  +LSHF++H RK++K++ GK  + P  K
Sbjct: 297 ATRHLGEVECREFRESVLGVMPHRWEHREDTRFELSHFQKHYRKLVKRIHGKEKNVPSQK 356

Query: 343 PEEHHPPGKDS-TKKISNLIGKAATYAGSAKSKKP-VNYIPTITNYTQLWWVPNVVVAHQ 400
           P + + PGK+S +  I+ ++G A   A   K +K    ++P ITN+T  WWVPNV+VAH 
Sbjct: 357 PADKNTPGKESLSNPITKVVGNAVQLAVGPKVRKERFQHVPMITNHTSHWWVPNVIVAHL 416

Query: 401 KEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRP 460
           KEGIEAVHLA+GRTVCKL+L EGGLHADINGDGVLDHVQAVGG+   + V +G    L+P
Sbjct: 417 KEGIEAVHLATGRTVCKLYLPEGGLHADINGDGVLDHVQAVGGHRGTRLVPTGMTTTLKP 476

Query: 461 CWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVAS---LEVATPILIPR 517
           CWA+ATSGV VREQLFN ++C HSPF  + H +FS    R  +      +EV  PI++P 
Sbjct: 477 CWAIATSGVHVREQLFNGTVCRHSPFQAYSHFDFSGEIDRRLNPEMGDFVEVLAPIILPH 536

Query: 518 SDGHRHRKGSHGDVVFLTNRGEVTAYS--PGLHGHDAIWQWQLLTDATWSNLPSPSGMTE 575
            DG RHRKGSHGD++FL +RGEVT++S        +A  +WQL+T A W+      G  E
Sbjct: 537 PDGKRHRKGSHGDIIFLNSRGEVTSFSVLGPKRDQEAEHRWQLVTGAYWATSSELQGY-E 595

Query: 576 ASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSN 635
           +  VVP+L A  LR +   + ILA G+ E VVISP GS +T   LPA  +  ++  DFS 
Sbjct: 596 SDRVVPSLTALPLRKNGEPEAILAIGEIEGVVISPKGSPVTEFPLPASASAPIIYADFSG 655

Query: 636 DGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRAS 693
           DGL D+IL+T +G+YGFVQ R+PGA+ FSTLVG LI+VMGVIFVTQHL + K K R +
Sbjct: 656 DGLNDLILVTHDGIYGFVQARRPGAILFSTLVGVLILVMGVIFVTQHLGTGKGKSRPA 713


>gi|357483603|ref|XP_003612088.1| hypothetical protein MTR_5g021150 [Medicago truncatula]
 gi|355513423|gb|AES95046.1| hypothetical protein MTR_5g021150 [Medicago truncatula]
          Length = 780

 Score =  646 bits (1667), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 308/376 (81%), Positives = 339/376 (90%), Gaps = 2/376 (0%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
           MRKRDLAILML AFAIFFSLQ +G  SF++AW HL++EYPIK++A+RLPPP+VADLNGDG
Sbjct: 1   MRKRDLAILMLCAFAIFFSLQQDGGVSFKDAWMHLTDEYPIKYEAERLPPPVVADLNGDG 60

Query: 61  RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
           +KEVLVATHDAKIQ+LEPH+RRVDEGFSEARVLAEVSLLPDK+R+ SGRR VAMATG ID
Sbjct: 61  KKEVLVATHDAKIQILEPHSRRVDEGFSEARVLAEVSLLPDKVRVMSGRRPVAMATGFID 120

Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
           R +R GQP KQVLVVVTSGW VMCFD NL KLWE NLQEDFP NAHHRE++ISISNYTLK
Sbjct: 121 R-HRIGQPHKQVLVVVTSGWFVMCFDSNLQKLWENNLQEDFPHNAHHREVSISISNYTLK 179

Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
           HGDTGL+IVGGRMEMQPH  MDPFEE+G+  + AEQHRRSA+EKEASEN+GTVDLRHFAF
Sbjct: 180 HGDTGLIIVGGRMEMQPHIFMDPFEEMGMGARFAEQHRRSATEKEASENTGTVDLRHFAF 239

Query: 241 YAFAGRSGLLRWSRKNENIEAQP-TDASQLIPQHNYKLDVHALNSRHPGEFECREFRESV 299
           YAFAGRSG+ RWSRK ENIEA   +DASQLIPQHNYKLDVHALN R PGEFECREFRES+
Sbjct: 240 YAFAGRSGVERWSRKTENIEAAASSDASQLIPQHNYKLDVHALNRRQPGEFECREFRESI 299

Query: 300 LGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISN 359
           LGVMPH WDRREDTLLKL HF RHKRK LKK  GK+ +YPF KPEE+HPPGKDSTKKISN
Sbjct: 300 LGVMPHQWDRREDTLLKLVHFNRHKRKTLKKTPGKTINYPFDKPEENHPPGKDSTKKISN 359

Query: 360 LIGKAATYAGSAKSKK 375
           +IGKAA +AGSAKSKK
Sbjct: 360 IIGKAANFAGSAKSKK 375



 Score =  554 bits (1427), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 256/319 (80%), Positives = 288/319 (90%)

Query: 379 YIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHV 438
           Y+PTITNYT++WWVPNVVVAH KEGIE +HLASGRT+CKLHLQEGGLHADINGDGVLDHV
Sbjct: 462 YVPTITNYTKVWWVPNVVVAHLKEGIEVLHLASGRTLCKLHLQEGGLHADINGDGVLDHV 521

Query: 439 QAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNF 498
           QAVGGNGAEQTVVSGSM+VLRPCWAVATSGVPVREQLFN SICH++ FNLF HGE  R F
Sbjct: 522 QAVGGNGAEQTVVSGSMDVLRPCWAVATSGVPVREQLFNVSICHYTHFNLFQHGELYRGF 581

Query: 499 GRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQL 558
            R SD++SLEVATPILIPRSDGH+HRKGSHGDV+FLTNRGE+T+++PGLHGHDA+WQWQ 
Sbjct: 582 NRGSDMSSLEVATPILIPRSDGHKHRKGSHGDVIFLTNRGEITSHTPGLHGHDAVWQWQQ 641

Query: 559 LTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSI 618
            T  TWSNLPSP+GM E   V+PTLK F LR+HDN +MILA G+QEAVVISPGGSIL +I
Sbjct: 642 STGVTWSNLPSPAGMMEGGLVIPTLKPFPLRLHDNHEMILAAGEQEAVVISPGGSILATI 701

Query: 619 DLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVIF 678
           +LP  PTH L+ EDFSNDGLTD+IL+TS+GVYGFVQTRQPGALFFS L+GCLIVVMG+IF
Sbjct: 702 ELPGSPTHVLIREDFSNDGLTDLILVTSSGVYGFVQTRQPGALFFSVLIGCLIVVMGIIF 761

Query: 679 VTQHLNSVKAKPRASSGLR 697
           VTQH+NS+K KPR SSG R
Sbjct: 762 VTQHINSMKGKPRPSSGPR 780


>gi|167997451|ref|XP_001751432.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162697413|gb|EDQ83749.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 549

 Score =  621 bits (1602), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 311/543 (57%), Positives = 393/543 (72%), Gaps = 9/543 (1%)

Query: 158 QEDFPPNAHHREIAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQH 217
           Q+DFP  +HH+E+AISISNYTLKHGDTGL+IVGG ME+QP   +DPFEE  +AE+  E H
Sbjct: 1   QDDFPHGSHHKEVAISISNYTLKHGDTGLIIVGGSMEVQPQVHLDPFEEELMAEQVIESH 60

Query: 218 RRSASEKE-ASENSGTVDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYK 276
           RRSA  KE  ++  G    RHF++YA+AG +G  RW  ++E+   + TDA  L PQHNY+
Sbjct: 61  RRSAGAKEDVTKKMGGGKERHFSYYAYAGMTGTRRWVHRSEDFR-RTTDAGALQPQHNYR 119

Query: 277 LDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKST 336
           LD  +L +RH GE ECREFRESVLGVMPH W+ REDT  +L+HFR+H+RK++KKV GK  
Sbjct: 120 LDASSLATRHLGEVECREFRESVLGVMPHRWEHREDTRFELAHFRKHRRKLVKKVPGKDN 179

Query: 337 SYPFHKPEEHHPPGKDST-KKISNLIGKAATYAGSAKSKKP-VNYIPTITNYTQLWWVPN 394
           S P  KP + + PG D+    I+ ++GKAA  A  AK +K    Y+P ITN+T  WWVPN
Sbjct: 180 SIPSEKPADRNTPGMDARGNPIAKVVGKAADLAVGAKVRKQQFQYVPVITNHTSHWWVPN 239

Query: 395 VVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGS 454
           VVVAH KEGIEAVHLA+GRTVCKL L EGGLHAD+NGDGVLDHVQAVGG+G+ + V +G 
Sbjct: 240 VVVAHLKEGIEAVHLATGRTVCKLFLPEGGLHADVNGDGVLDHVQAVGGHGSARIVPTGM 299

Query: 455 MEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRT--SDVASLEVATP 512
            E L+PCWAVATSGVPVR QLFN ++C HSPF +FPH EFS +FGR      A ++V  P
Sbjct: 300 TEALKPCWAVATSGVPVRAQLFNGTVCRHSPFQMFPHNEFSGDFGRRPHPGDALVQVVAP 359

Query: 513 ILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYS--PGLHGHDAIWQWQLLTDATWSNLPSP 570
           I++P  DG RHRKGSHGDVVFL +RGEVT++S      G +A  +WQ+ T A W+  P  
Sbjct: 360 IILPHPDGQRHRKGSHGDVVFLNSRGEVTSFSVLGPRRGQEAQHRWQVTTTAYWTTSPDL 419

Query: 571 SGMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVC 630
            G+  A  ++PTL A  LR +     ILA G+ EAVV+SP GS +T + LP P +  ++ 
Sbjct: 420 QGVGTAR-IIPTLIALPLRKNGEADAILAVGEAEAVVLSPKGSYVTQVFLPTPGSAPMIY 478

Query: 631 EDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVKAKP 690
           +DFS D L D+IL++ +G+YGFVQTRQPGA+ F +LVG LI+VMGVIFVTQH  + K K 
Sbjct: 479 DDFSGDKLNDLILVSEDGIYGFVQTRQPGAILFVSLVGVLILVMGVIFVTQHWGTGKGKS 538

Query: 691 RAS 693
           RA+
Sbjct: 539 RAA 541


>gi|384253534|gb|EIE27009.1| hypothetical protein COCSUDRAFT_83568 [Coccomyxa subellipsoidea
           C-169]
          Length = 697

 Score =  477 bits (1227), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 296/725 (40%), Positives = 399/725 (55%), Gaps = 71/725 (9%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
           M KRD A+L+LSAFAI+F++QHEG FS+R AW+ L ++  ++   D LPPP+ ADLNGDG
Sbjct: 1   MYKRDFAVLILSAFAIYFTIQHEGAFSYRRAWYTLHDQALLE-STDVLPPPVAADLNGDG 59

Query: 61  RKEVLVATHDAKIQVLEPH---ARRVDEGFSEARVLAEVSLLPDKIRIA-SGRRAVAMAT 116
           R EV+ ATHDAK+QV  P+    + + EGF++A +LAEVSL PD    A S  RAVA+A 
Sbjct: 60  RVEVITATHDAKLQVYTPYPPSGKSLWEGFAKAALLAEVSLAPDGPNEALSMHRAVALAA 119

Query: 117 GVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHH--REIAISI 174
           G +D    +      VLVVVT+ W V+CFDHNL   W A ++ED P  AH   +E+AI I
Sbjct: 120 GYLDPKRDE-----LVLVVVTANWDVICFDHNLRLQWTARVKEDVPHGAHMAIKEVAIHI 174

Query: 175 SNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGT-- 232
           SN+T + GD G+V++GG +++             LA +  E        +      G   
Sbjct: 175 SNHTARVGDRGIVVIGGSVDLG-----------DLAHQGDEGESDLIDTRPCLGMGGKHQ 223

Query: 233 -VDL-RHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEF 290
            VD+ RHF++YAF G  G LRW  +          + QL+P  ++++D  AL +RH GE 
Sbjct: 224 GVDISRHFSYYAFDGGFGALRWKHEANFHRDTEKLSQQLLPALDFRMDAAALEARHFGEV 283

Query: 291 ECREFRESVLGVMPHHWDRREDTLLKLSHFRRH-------KRKILKKVVGKSTSYPFHKP 343
            CR+FRESVL V+PH W RR DT L+L+HF  H       KR++  +        P   P
Sbjct: 284 ACRDFRESVLAVLPHRWARRSDTRLQLAHFVHHRPHGGDRKRRLADR------GDPSGGP 337

Query: 344 EEHHPPGKDSTKK-------ISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVV 396
            +   PG+ + +        ++  +GK    A  A              ++     PNV+
Sbjct: 338 PKPRRPGRQADRAGVHHSNPVAKALGKVCPVAERAVLGGGTGGAAGAAQHSVHGVAPNVL 397

Query: 397 VAHQKEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSME 456
           VAH +EGIEAVHL SGRT+CKLHL   GLHAD+NGDGVLDHVQ  GG+G+      G   
Sbjct: 398 VAHLEEGIEAVHLYSGRTLCKLHLPSPGLHADLNGDGVLDHVQVAGGHGSPDGGSPGHRH 457

Query: 457 VLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSR-------NFGRTSDVASLEV 509
               CWAVA SG+P RE LFNA+IC       FP G++         +FG    V  LEV
Sbjct: 458 T-PGCWAVAKSGIPPREPLFNATICR------FP-GQWGAESSARAFSFGTDGGVVPLEV 509

Query: 510 ATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPS 569
           A P  +P        +  HG   FL +RGEVTAY+   HG      WQ      WSN   
Sbjct: 510 APPAFLPVPGPRGLYQQGHGLAAFLNSRGEVTAYN--AHGERL---WQHAMGLAWSNR-R 563

Query: 570 PSGM---TEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSIDLPAPPTH 626
           P G+     A  V PTL+A  LR       ILA G   AVV+S  G+ L ++  PA P  
Sbjct: 564 PGGLGGGNAARRVAPTLEALPLRAGAIPAAILASGSWSAVVLSEHGNRLETLQFPALPAL 623

Query: 627 ALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSV 686
            L   DF+ DGL D++L++ +G+YG+ Q R PG + F+ L+ CLIV M V++ TQ     
Sbjct: 624 PLQILDFNGDGLNDIVLVSHDGLYGWAQVRHPGGVPFAVLIACLIVAMAVVYFTQQAGPE 683

Query: 687 KAKPR 691
           + + +
Sbjct: 684 RGRTK 688


>gi|118488663|gb|ABK96143.1| unknown [Populus trichocarpa]
          Length = 242

 Score =  452 bits (1163), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 212/243 (87%), Positives = 228/243 (93%), Gaps = 1/243 (0%)

Query: 455 MEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPIL 514
           MEVL+PCWAVATSGVPVREQLFNASICHHSPFNLF HG+F RNFGRT DV+SLEVATPIL
Sbjct: 1   MEVLQPCWAVATSGVPVREQLFNASICHHSPFNLFQHGDFGRNFGRT-DVSSLEVATPIL 59

Query: 515 IPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMT 574
           IPRSDGHRHRKGSHGDVVFLTNRGEVT+YSPGLHGHDA+WQWQ+LT ATWSNLPSPSGM 
Sbjct: 60  IPRSDGHRHRKGSHGDVVFLTNRGEVTSYSPGLHGHDAVWQWQILTGATWSNLPSPSGMM 119

Query: 575 EASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFS 634
           E   VVPTLKAFSLR HDNQQMILA GDQEA VISPGGS+ TS DLPAPPTHAL+CEDF+
Sbjct: 120 EGGMVVPTLKAFSLRAHDNQQMILAAGDQEAAVISPGGSVQTSFDLPAPPTHALICEDFT 179

Query: 635 NDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASS 694
           NDGLTD+I++TSNGVYGFVQTR PGALFFSTLVGCL++VMGVIFVTQH+NS+K KPRASS
Sbjct: 180 NDGLTDLIVVTSNGVYGFVQTRSPGALFFSTLVGCLLIVMGVIFVTQHINSIKGKPRASS 239

Query: 695 GLR 697
           GLR
Sbjct: 240 GLR 242


>gi|255074829|ref|XP_002501089.1| predicted protein [Micromonas sp. RCC299]
 gi|226516352|gb|ACO62347.1| predicted protein [Micromonas sp. RCC299]
          Length = 1195

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 269/753 (35%), Positives = 376/753 (49%), Gaps = 112/753 (14%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWF----HLSEEYPIKFDADRL-------- 48
           + KRDL +++L+   ++FSLQ+EG  SF+ AW      L E+    FDAD +        
Sbjct: 92  LAKRDLGVILLALLGVYFSLQNEGVVSFQRAWHVPGDALGED---DFDADPISHRAKDAH 148

Query: 49  -PPPIVADLNGDGRKEVLVATH-DAKIQVLEPHARRVD---------------------- 84
            P P+  DLNGDGR EV+VA+  + +I++  P A + +                      
Sbjct: 149 APRPVFFDLNGDGRNEVIVASSSEPEIRIASPPAGKREAPGVSSDRDSRLEGSGRSVRDD 208

Query: 85  --------EGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVIDRTYRQGQPLKQ----V 132
                   +G+  AR +A  SL+P  +R+A+GRRAVA+A G ID    +   +K+    V
Sbjct: 209 DDDEYAWRDGWIPARTVASASLMPSNVRVAAGRRAVALAAGHIDPPVAKDASVKKTNKGV 268

Query: 133 LVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLKHGDTGLVIVGGR 192
           +VVVT+ W V+CFDHNL  +WE +LQ +FP +A   E+A+ +SN T+  GD G V+VGGR
Sbjct: 269 VVVVTAAWHVLCFDHNLKLMWENSLQAEFPRHARVAEVAVLVSNATMFEGDRGSVVVGGR 328

Query: 193 MEMQP--HTIMDPFEEIGLAEKNAEQHR---RSASEKEASENS------------GTVDL 235
           +E+        DP EE    E     HR   R A +  A  N             G    
Sbjct: 329 VELGDLDSDDEDPLEEELAHEDMMLGHRGGRRVAPDDLADANEVLHGRGRKGKGVGVDRS 388

Query: 236 RHFAFYAFAGRSGLLRWSRKNENIEAQPTD-ASQLIPQHNYKLDVHALNSRHPGEFECRE 294
           RHF +YAF G +G  RW  ++E+        A +L PQHNY+L   A +  H GE  CR+
Sbjct: 389 RHFNYYAFDGATGSTRWKHESEDFHRDLDGLADRLTPQHNYRLTAQASSGHHYGEVACRD 448

Query: 295 FRESVL-GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDS 353
           FRESV+   +PH W  REDT L+L+HFR H+     +V    +             G + 
Sbjct: 449 FRESVVVNALPHAWREREDTALRLAHFRHHRTAKGARVSKVGSGRRPAVGAAGGAEGAEH 508

Query: 354 TKKISNLIGKAATYAGSAKSKKPV-------NYIPTITNYTQLWWVPNVVVAHQKEGIEA 406
           T  ++  +  A   A       P         + P      +    PNV+VAHQ+ G+E 
Sbjct: 509 TNPVARALAGAINAAWRGNGAPPAPDRRGGRGHSPDGQIGAKHRAPPNVIVAHQEGGVEV 568

Query: 407 VHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAE---QTVVSGSMEVLRPCWA 463
           +HL SG+T+CK+ +  GGLHADINGDGV+DHVQA G  G +     V     +    CWA
Sbjct: 569 IHLYSGKTLCKMAMPPGGLHADINGDGVVDHVQAHGSEGVDAGGHPVTGADGKPAPDCWA 628

Query: 464 VATSGVPVREQLFNASICHHSP---------FNLFPHGEFSRNFGRTSDVASLEVATPIL 514
            ATSGVPVRE LF  S+C  S          F   P G +  N        ++++  P  
Sbjct: 629 QATSGVPVREHLFGGSVCRGSAGVVRHGSGHFKGEPGGGYVSN-------RAVQIVAPAQ 681

Query: 515 IPRSDGH--RHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSG 572
           + R +    R  + +  DVVFL +RGEVT Y     GHD   +WQ  TDA+W+  P   G
Sbjct: 682 LRRGEEKIVRSLRRAVKDVVFLNSRGEVTCY-----GHDGTRRWQQRTDASWA--PGDPG 734

Query: 573 MTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCED 632
                 VV +L +F LRV    +++LAGG   A +++P G  L +  LP  P  AL+  D
Sbjct: 735 ------VVASLASFPLRVGGASEVVLAGGATHAALLTPSGYRLNAFKLPGKPVAALLVVD 788

Query: 633 FSNDGLTDVILMTSNG-VYGFVQTRQPGALFFS 664
              DGL DV+  T +G VY + Q   PG   F+
Sbjct: 789 VDGDGLNDVVARTKDGDVYAWRQRSHPGLAPFT 821


>gi|330802543|ref|XP_003289275.1| hypothetical protein DICPUDRAFT_98309 [Dictyostelium purpureum]
 gi|325080624|gb|EGC34171.1| hypothetical protein DICPUDRAFT_98309 [Dictyostelium purpureum]
          Length = 709

 Score =  258 bits (660), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 206/748 (27%), Positives = 349/748 (46%), Gaps = 125/748 (16%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPI---KFDA-------DRLPP 50
           MRKRD  +L+LS   IF SL  + D+        L  E PI    +D        + LP 
Sbjct: 1   MRKRDTWLLLLSLVVIFISLYRQKDYKIS-----LLLEIPIDTNNYDNSHYPKINEYLPL 55

Query: 51  PIVADLNGDGRKEVLVATHDAKIQVLEP------HARRVDEG-FSEARVLA-----EVSL 98
           PI+ D++GDG+ E++  T+D ++ VLE          R++E   SE  +L      +VSL
Sbjct: 56  PIITDIDGDGKNEIIYVTNDYRLIVLETVTYENIELFRLNENELSEKNLLQLPIKYQVSL 115

Query: 99  LPDKIRIASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQ 158
              ++ +A+GRR  ++ TG     Y  G P +Q++VVVT GWS++CF+H L KLWE+   
Sbjct: 116 -KSEVGLATGRRPTSIKTGFT-SPYVPGVPRQQIIVVVTDGWSILCFNHQLKKLWESYAS 173

Query: 159 EDFPPNAHHREIAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHR 218
           ++  PN +  EI+ISI    + + D G+VI+GGRME     +     +I + E   E + 
Sbjct: 174 DEVLPNHYQSEISISI----VPNSDKGMVIIGGRMEPMEGYVHKSHYKIPMYEGKIETNE 229

Query: 219 RSASEKEASENSGT-VDLRHFAFYAFAGRSGLLRWSRK-------NENIEAQPTDASQLI 270
             + E++      T  D  HF+++A+ G  G   WS +       N++ + +    S  +
Sbjct: 230 EHSHERDDDHEKETHRDESHFSYFAYDGEKGYKIWSHEENDFMPVNKHTDEEHRKESD-V 288

Query: 271 PQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKK 330
             H++K  +++    H GE + R +R+SVL  +PH W    DT L   HF +    +   
Sbjct: 289 SLHSFKQHIYS-QLDHLGEVDWRTYRDSVLAALPHKWSTGYDTKLDARHFSKKINTVRGN 347

Query: 331 VVGKSTSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGS---AKSKKPVNYIPTITNYT 387
           V   +T+            G ++ +  S LIG   +   S    KS+   N+   I +  
Sbjct: 348 VETSTTT------------GLNNQEWDSELIGVNPSNLESFSILKSQDKANFQSNIKD-- 393

Query: 388 QLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHA---------DINGDGVLDHV 438
                PNV+VAH K G+E V ++SG T+C+L L     H+         D+NGDGVLD +
Sbjct: 394 -----PNVIVAHNKNGLEVVQISSGNTLCRLVLDSSDFHSNGNYFISYIDLNGDGVLDQI 448

Query: 439 -------QAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPH 491
                  + +G +G ++TV          C A+  SG+P RE++F+  IC+   F     
Sbjct: 449 TTYAGSFKGLGEDGGKETV----------CKALGMSGIPAREKIFDLKICNEGMF----- 493

Query: 492 GEFSRNFGRTSDVASLEVATPILIPRSDG---------HRHRKGSHGDVVFLTNRGEVTA 542
            +F     + +D  +L         +++           R  +     +VFL N G +++
Sbjct: 494 -DFEYFMWKGTDKNNLNNKKKTKSTKNERLIQTIPPAFFRSSENDQIQMVFLVNTGLISS 552

Query: 543 YSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQM--ILAG 600
           ++           W++ T+  W+    P          P+L+ FS     N+ +  ILA 
Sbjct: 553 FNS-----KGSKMWKIETNIHWTRHIEP-------ITQPSLQVFSFESDTNRLVPFILAV 600

Query: 601 GDQEAVVISPGGSILTSIDLPAPPTHAL----VCEDFSNDGLTDVILMTSNGVYGFVQTR 656
           G++   ++S  G ++    L +  T+ +    +  DF+NDG+ D ++ T  G Y    T 
Sbjct: 601 GERMMAIVSEKGDMVLEQPLDSSETNPVMAPPIVGDFNNDGINDFLITTLKG-YSIYITE 659

Query: 657 QPGALFFSTLVGCLIVVMGVIFVTQHLN 684
           +  +     ++G L++ + ++F+    N
Sbjct: 660 KGTSSSLLPIIGMLLIGVLLVFLLSSSN 687


>gi|440793762|gb|ELR14937.1| FGGAP repeat domain containing protein [Acanthamoeba castellanii
           str. Neff]
          Length = 581

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 175/565 (30%), Positives = 268/565 (47%), Gaps = 94/565 (16%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDF-SFREAWFH-----LSEEYPIKFDADRLPPPIVA 54
           M+++D+AIL L+  AI  S+ ++G +   R  W H     L E        +RLPPP++ 
Sbjct: 1   MKQKDVAILFLATLAILLSVYYQGTYHGLRLVWVHPIDASLYENGHFPLQHERLPPPLIT 60

Query: 55  DLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEAR-----VLAEVSLLPDKIRIASGR 109
           D++GDG  EV+V T D++I++LE    +     S ++     V  E SLLP  + + +GR
Sbjct: 61  DVDGDGHNEVIVVTSDSRIKILEGTPSQSAGLGSPSQWHNLPVKEEASLLP-SVGVGAGR 119

Query: 110 RAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHRE 169
           R VA+++G +   Y++G    QV+VV+TSGW+V+ FD++L  LWE+ ++E  P + +H E
Sbjct: 120 RPVALSSGYLS-PYQEGLVRTQVIVVLTSGWTVLLFDNHLKLLWESTVRESLPAHLYHSE 178

Query: 170 IAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEAS-- 227
            A+ +    ++ GD G+VI   R+  +             A + A +H   A  ++AS  
Sbjct: 179 AALLVVPTPVRIGDAGVVIAAARLSRK-------------ANQQAAKHDHGAGGRKASWM 225

Query: 228 ------ENSGTVDLRHFAFYAFAGRSGLLRWSRKNENI-EAQPTDASQLIPQHNYKLDVH 280
                 E+   +   HF+++AF GR+G LRW  +  +  +   TDA   I +        
Sbjct: 226 ANGEEDEDDHDLGEEHFSYFAFDGRTGALRWKHEAGDFYDEYDTDAGDRIGEE------- 278

Query: 281 ALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPF 340
              + H GE    +FR SV+  +PH W RREDT L+++HF R  +               
Sbjct: 279 --RAMHLGEMSWNDFRYSVMQQLPHAWFRREDTGLRVAHFERGAK--------------- 321

Query: 341 HKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQ 400
                   P  D+T + S L G      G        +      +       PNV+VAH 
Sbjct: 322 --------PAADTTAQ-SQLSGLHVDVPGLVWGGHAPHSEDDFVHE------PNVIVAHF 366

Query: 401 KEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRP 460
            +G+E +HL +GR V ++ L+   +  DINGDGV++ V+         T+ +        
Sbjct: 367 ADGVEVIHLYTGRPVTRMGLERSVVWDDINGDGVVEAVRL-------STLPTAGQSSEDQ 419

Query: 461 CWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTS--DVASLEVATPILIPR- 517
           CWA+  SGVP +E LFN SIC   P         + N  R S  D   L    PI I R 
Sbjct: 420 CWALGQSGVPPQEDLFNVSICEPKPQGFLE--LLTSNPKRKSRGDTGVLRATNPISIARP 477

Query: 518 -----SDGHRHRKGSHGDVVFLTNR 537
                 +GH   K    D VFL  R
Sbjct: 478 RTWAEPNGH---KDMWADDVFLAAR 499


>gi|66819765|ref|XP_643541.1| hypothetical protein DDB_G0275747 [Dictyostelium discoideum AX4]
 gi|60471668|gb|EAL69624.1| hypothetical protein DDB_G0275747 [Dictyostelium discoideum AX4]
          Length = 799

 Score =  221 bits (564), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 162/553 (29%), Positives = 260/553 (47%), Gaps = 92/553 (16%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDA----------DRLPP 50
           MRKRD  +L++S   +F SL  + D+       +L  E PI  +           + LP 
Sbjct: 1   MRKRDTWLLLVSILVVFISLYRQKDYKL-----NLLLEIPIDTNNYENSHYPKINEFLPL 55

Query: 51  PIVADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFS---------------------- 88
           PI+ DLNGDG  E++  T+D KI++L+       E F                       
Sbjct: 56  PIITDLNGDGFNEIIYVTNDFKIRILDTVTPNNIEKFRFGVNVNENENENENDNDQNSLL 115

Query: 89  EARVLAEVSLLPDKIRIASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHN 148
           E  VL E SLL D + +A+GRR +A+ TG    +   G+  +Q++VVVT GW+V+CFD+ 
Sbjct: 116 ELPVLYESSLLSD-VGLATGRRPIAIKTGYAKHSI-PGERRQQIIVVVTDGWAVLCFDNK 173

Query: 149 LNKLWEANLQEDFPPNAHHREIAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIG 208
           L KLWE+   ++  P  +  EI+I+I   T+   D G+VI+GGRME  P  I  P   I 
Sbjct: 174 LKKLWESYASDEVLPGHYQSEISITILPTTIP-DDKGIVIIGGRMEPLPGVIHKPHYSIP 232

Query: 209 LAEKNAEQHRRSASEKEASENSGTV--DLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDA 266
           +  +  E  +    +++   +S T   +  HF+++++  + G  RWS +  + + +    
Sbjct: 233 MFGEKVETDKDHDHKRDDDHDSFTAHREENHFSYFSYDCKDGYKRWSHEENDFKPENPHT 292

Query: 267 SQL------IPQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHF 320
           ++       I  H++K  + +    H GE + + +R+S+L  +PH W    DT L+  HF
Sbjct: 293 TEEHKKDSDISLHSFKQHIFS-QLEHLGEVDWKTYRDSILASLPHKWSSNFDTKLEPRHF 351

Query: 321 RRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISNLIGKAAT--------YAGSAK 372
                K +  + G + +              D T  I+ LIG + T        Y+ +  
Sbjct: 352 ----EKKINTIRGTTNN--------------DGTNVINGLIGSSNTLNNEMIEFYSKNFD 393

Query: 373 SKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHA----- 427
           +   ++    I+  +Q    PNV+V H K GIE +H+ SG+T+CKL L     +      
Sbjct: 394 TFSILSKKDQISFKSQHIENPNVIVTHHKYGIEVIHIVSGKTLCKLVLDGTDFYTDSNRY 453

Query: 428 ----DINGDGVLDHVQAVGGN--------GAEQTVVSGSMEVLRPCWAVATSGVPVREQL 475
               D+NGDGVLD V++  GN          +   +         C  +A SG+P +E L
Sbjct: 454 IVYLDLNGDGVLDQVRSYAGNFKNSQLGRNKKNKNIGDDNNEDSYCVGLALSGLPPKETL 513

Query: 476 FNASICHHSPFNL 488
           F+  IC    FN+
Sbjct: 514 FDLKICGGQIFNM 526



 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/156 (28%), Positives = 73/156 (46%), Gaps = 27/156 (17%)

Query: 531 VVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRV 590
            +F+ N G ++ +     G   IW+  + T   WS    P          P+L+ FS  +
Sbjct: 626 TLFMVNTGLISCFDTASFGGKKIWR--VDTGIHWSKHLEPR-------TFPSLQVFSFDI 676

Query: 591 HDNQQMILAGGDQEAVVISPGGSILT---------SIDLPAPPTHALVCEDFSNDGLTDV 641
              Q  ILA G+Q  V+++  G IL          S  + APP    V  DF+NDG+ D 
Sbjct: 677 QSTQPFILAIGEQSMVILNENGEILVDQQIWNAGESDPITAPP----VVADFNNDGINDF 732

Query: 642 ILMTSNGVYGFVQTRQPGALFFSTLVGCL-IVVMGV 676
           ++ T +G + FV  +      +STL+  + +V++G+
Sbjct: 733 LITTLSGYHIFVTEKGT----YSTLLPMIGLVLIGM 764


>gi|328867416|gb|EGG15798.1| hypothetical protein DFA_09466 [Dictyostelium fasciculatum]
          Length = 802

 Score =  221 bits (563), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 198/760 (26%), Positives = 329/760 (43%), Gaps = 140/760 (18%)

Query: 15  AIFFSLQHEGDFSFREAWFHLSEEYPIKFDA----------DRLPPPIVADLNGDGRKEV 64
           A+  SL    D+S      HLS   PI  +           + LP PI+ DLNGD R ++
Sbjct: 54  AVLVSLYTHQDYSL-----HLSLILPIDSNNYENNHFPQPHELLPQPIITDLNGDQRNQI 108

Query: 65  LVATHDAKIQVLEP------HARRVDEGFSEAR--VLAEVSLLPDKIRIASGRRAVAMAT 116
           +  T+D KI++++P           + G    R  ++ E SL   K  I+SGRR VA+ T
Sbjct: 109 IYVTNDQKIRIVDPLYAINNMNTNNENGRIVGRNDIIYEASL-SSKFGISSGRRPVALKT 167

Query: 117 GVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISN 176
           G    +       +QV+VVV   WSV+C++  L  LWE+ + +D P N +  EIAI I  
Sbjct: 168 GYTTPSGTAAN-REQVIVVVLDDWSVLCYNSRLKPLWESFIIDDIPKNHYLSEIAIDIIP 226

Query: 177 YTLKHGDTGLVIVGGRME-------MQPHTI----MDP-------------FEEIGLAEK 212
             LK GD GL+++GGR+E        +PH +    +DP             F      E+
Sbjct: 227 INLKEGDQGLIVLGGRLEPVGNGVGHKPHVMPAIGIDPKKYMDNDYLSDGEFTGYDPNEE 286

Query: 213 NAEQHRRSASEKEASENSGT--VDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLI 270
           +           +  +  G    D  HF++YA   ++G  RWS +  + + +     +  
Sbjct: 287 DVGGGEHQGEGHDHKDEGGLHGRDESHFSYYALDAKTGAKRWSHEENDFKPKNVHLDEEY 346

Query: 271 ---PQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKI 327
               +H+YK  + +L   H GE   + F++++L  +PH W  R DT L+ +HF + K+  
Sbjct: 347 HGEKRHSYKQHIISLMD-HEGEVNWKLFKDNMLDQLPHSWSSRYDTKLQSNHFTKSKKS- 404

Query: 328 LKKVVGKSTSYPFHKPEEHHPPGKDST--KKISNLIGKAATYAGSAKSKKPVNYIPTITN 385
                G+S        +  + P    T  K+  NL G+ +    + +             
Sbjct: 405 -----GQSN-------QGGNTPSTSGTGGKEWENLFGQQSQSQTTQQRWNQEE------- 445

Query: 386 YTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLH-----------ADINGDGV 434
                   NV+V+H + GIE +H++SGRT+CKL L+    H            D++GDG+
Sbjct: 446 -------SNVIVSHHRHGIEVIHISSGRTLCKLLLEGTDAHRSLTSHHYIVYVDLDGDGM 498

Query: 435 LDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICH-HSPFNLFPHGE 493
           +D V +V G+                C  +A +G+P R+ LFN SIC   S  + F    
Sbjct: 499 VDQVHSVTGDPVGSVSSFSRSSRQDVCMGMALAGLPPRDHLFNKSICGWTSQLDFFWPAG 558

Query: 494 FSRNFGRTSD------VASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGL 547
           F R+ G + D        S +   P +    +     + +H D VFL N G++ + +   
Sbjct: 559 F-RSSGNSIDGLDKSKRVSYQTVRPAVF---NAVYSLRPTHKDTVFLVNSGKINSVNSRG 614

Query: 548 HGHDAIWQWQLLTDATWSNLPSPSGMTEASTVV--PTLKAFSLRV--HDNQQMILAGGDQ 603
           + +     W + + ATWS           + ++  P+++ F++    H   Q +LA G+ 
Sbjct: 615 YTN-----WNIDSPATWS---------RNNIIIHHPSIQPFNIATQPHQTDQQVLAIGES 660

Query: 604 EAVVISPGGSILTSIDLPAP----------------PTHALVCEDFSNDGLTDVILMTSN 647
             ++ +  GSIL    + +                 PT   +  D + DG+ D+I+ T +
Sbjct: 661 IVILSASDGSILLQKKIKSISFNNQNNNGNGNNNVLPTAPPIIGDLNQDGINDIIVPTLS 720

Query: 648 GVYGFVQTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVK 687
           G Y +  T+      FS     +I  + V  +     S+K
Sbjct: 721 GYYIYNLTKGYSTFLFSCFTVIIIATLLVTVILSKQQSLK 760


>gi|291230320|ref|XP_002735115.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 695

 Score =  208 bits (529), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 198/722 (27%), Positives = 312/722 (43%), Gaps = 118/722 (16%)

Query: 4   RDLAILMLSAFAIFFSLQHEGDFSFREAW-------FHLSEEYPIKFDADRLPPPIVADL 56
           RDL ++ + A A++  L+    +S +  W        + + EY  +++  ++P P V+D+
Sbjct: 14  RDLWLVAVCAIAVYL-LRVRESYSLQAKWRSDIDTSLYENNEYAKEWE--KIPLPFVSDI 70

Query: 57  NGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLP-----DKIRIASGRRA 111
             DG  EV+  T + KI++     ++              S+LP      ++ + S +  
Sbjct: 71  ESDGINEVIFVTKEPKIKIATVPTKKFGS-----------SVLPYLVTKHEVTLDSNKHP 119

Query: 112 VAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIA 171
           VA+ATG +       Q  KQV+V + S W+VMC+DH L  LW+  L          +E A
Sbjct: 120 VAIATGYLMEYQSMLQVRKQVVVALLSDWTVMCYDHKLQLLWQNKLSPILDERYFIKEAA 179

Query: 172 ISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSG 231
           I ++++ LK  D GL+I+GG +  + H       E  L     + HRR    K   + S 
Sbjct: 180 ILVTSHNLKKKDGGLIIIGGSIGDRHH------HEKKLTH---DHHRRIQDIKPIEDESD 230

Query: 232 TVDLR-----------HFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVH 280
             D R           HF+ YA +G+ G +RW     + +        L    ++KL + 
Sbjct: 231 EGDDRYQDKEKDTHVNHFSTYALSGKDGSIRWHHLPGDFKEHTNKDDALFSARHFKLGLK 290

Query: 281 ALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPF 340
              S H GE    ++ +++L  +PH W R  DT +     ++ K +I K        Y F
Sbjct: 291 KGLS-HEGESHWMQYNKAILKNLPHSWQRASDTRITFDRIQKSKGEIFK-------DYGF 342

Query: 341 HKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQ 400
                    G D    +    G    ++ S   K                  PN +V H 
Sbjct: 343 DDENVLDMIGLDEEHLVGYAFGGLRPHSASEHIKN-----------------PNAIVIHT 385

Query: 401 KEGIEAVHLASGRTVCKLHLQEG-GLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLR 459
            EGIE + LA+G+ +C+L LQ+  G++ DIN D +LDHV+            S S     
Sbjct: 386 HEGIEVLKLATGQPMCRLKLQQDKGVYMDINQDAILDHVKG---------HFSHSAHAKD 436

Query: 460 PCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEV---ATPILIP 516
            C AV  +G P    LFN SIC          G  S      +D   +E      P +I 
Sbjct: 437 NCLAVVKTGHPPHTLLFNGSICDSQSIL----GWLSFIDSADNDNGHIEDTHHTVPPVIV 492

Query: 517 RSDGHRHRKGSH-------------GDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDAT 563
           RS   R    +H              D +FL + G++T+Y P  HG    + WQ+ T A 
Sbjct: 493 RSVAERRGIWNHLLGETQLSASSKGFDSIFLISNGKLTSYGP--HGQ---FHWQVATPAK 547

Query: 564 WSNLPS---PSGM-----TEA--STVVPTLKAFSLRVHDNQQM-ILAGGDQEAVVISPGG 612
           WS+  S    +G+     TE   +T  P+++  +L+V+  + + +LAG D   +V    G
Sbjct: 548 WSDASSLVRRAGLLNREFTEKYRNTFQPSMQVMALQVYGKENIVVLAGWDSMVLVSLKDG 607

Query: 613 SILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIV 672
            IL    LP  P+ ++V  DF+NDG TD I+       GF   R  G L  + L+G  I+
Sbjct: 608 RILAEHTLPCQPSSSIVIGDFTNDGWTDFIVHCPTSYLGFSLNRTSGYL-STALIGAAII 666

Query: 673 VM 674
           V+
Sbjct: 667 VV 668


>gi|303290440|ref|XP_003064507.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454105|gb|EEH51412.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 566

 Score =  205 bits (521), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 175/548 (31%), Positives = 239/548 (43%), Gaps = 133/548 (24%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDAD-------------- 46
           + KRDLA++ LS FA++ SL  +G  SF  AW+  +EE      A               
Sbjct: 2   LAKRDLAVMFLSLFAVYHSLLRDGVVSFDRAWYVPAEESSDDLAASSARGDRAGPGTFAS 61

Query: 47  ---------------RLPPPIVADLNGDGRKEVLVATHD-AKIQVLEP------------ 78
                          R PPP+  DLNGDG  E++VA+   A+I+V+              
Sbjct: 62  SARFGGGGVAATRSSRAPPPVFFDLNGDGVNEMIVASRTLAEIRVVSVPSSASRRRRRDG 121

Query: 79  -----------------HARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID- 120
                            H    D  F+     A  SLLP   R+ +GR  +A+A G +  
Sbjct: 122 GGGAGGDGGGDHGDEGIHDDDADV-FAALTTTATASLLPANTRVVAGRTPIALAAGHLTP 180

Query: 121 --------RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAI 172
                   R     +  K V+VVVTSGW ++CFDHNL  LWE  L  +FP  A  RE+A+
Sbjct: 181 PRSSGSSSRANSNVKTRKAVVVVVTSGWHLLCFDHNLRLLWEVALSGEFPRRARIREVAV 240

Query: 173 SISNYTLKHGDTGLVIVGGRMEMQPH-----------TIMDPFEEIGLAEKNAEQHR--- 218
            +S +    GD G VIVGGR+E               ++ D FE     E     HR   
Sbjct: 241 VVSAHATYEGDVGAVIVGGRVETGTRDDDDEGGGGGDSLGDAFERQLDDEDVLRTHRGGA 300

Query: 219 -RSASEKEASENSGT-------------VDL-RHFAFYAFAGRSGLLRWSRKNENIEAQP 263
             +A E EA+E  G+             +D  RHF +YAF G++G  RW  ++E+   + 
Sbjct: 301 KATAREMEAAEKDGSNAEGDGTGAGKGRLDRSRHFNYYAFEGKTGARRWKHESEDFH-RD 359

Query: 264 TDA--SQLIPQHNYKLDVH------------ALNSRHPGEFECREFRESVLG-VMPHHWD 308
            DA   +L PQH+Y+LD              AL  RH GE  CREFRESV+   +PH W 
Sbjct: 360 VDALVDRLTPQHDYRLDAGAFTFTLVPIRPPALEGRHYGEVSCREFRESVVKRALPHQWR 419

Query: 309 RREDTLLKLSHFRRHK--RKILKKVVGKSTSYPFHKPEEHHPPGKD-STKKISNLIGK-- 363
            REDT L ++ F +HK  +      VG+  +            G +  T   +  +G   
Sbjct: 420 EREDTRLVVAAFEKHKPHKGARSNAVGRGGAATVDGSSSRGARGAEHGTNAFARALGTTA 479

Query: 364 ------AATYAGSAKSKKPVNYIPTITNYTQLWWV--------PNVVVAHQKEGIEAVHL 409
                  A+  G A S+ P     T T  T             PNVVVAH +EGIE +HL
Sbjct: 480 RAVATGGASNRGGASSRGPGGKTTTATRNTHRRVKRPSSGGDPPNVVVAHVEEGIEILHL 539

Query: 410 ASGRTVCK 417
            +GRT+CK
Sbjct: 540 HTGRTLCK 547


>gi|281209979|gb|EFA84147.1| hypothetical protein PPL_03221 [Polysphondylium pallidum PN500]
          Length = 748

 Score =  198 bits (504), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 206/748 (27%), Positives = 314/748 (41%), Gaps = 168/748 (22%)

Query: 29  REAWFHL-------SEEYPIKFDADRLPPPIVADLNGDGRKEVLVATHDAKIQVLEPHAR 81
           R+AW  L       +  YP   +   LP PI+ DL+GDG+        D KI++++P   
Sbjct: 4   RDAWMILVYTNQYANSHYPQINEV--LPNPIITDLDGDGKN-------DHKIRIVDPVLA 54

Query: 82  RVDEGFSEARVLAEVSL-----LPDKIRIASGRRAVAMATGVIDRTYRQGQPLKQVLVVV 136
                  ++  + + SL     L  K+ ++SGRR VA+ TG     Y       Q++VVV
Sbjct: 55  DDSVVGGKSTSMGKASLRYEVSLASKVGLSSGRRIVALQTGYA-TPYISRTSSTQLVVVV 113

Query: 137 TSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLKHGDTGLVIVGGRMEMQ 196
           T  W V+ FDH L  LWE  + ++ P N +H E+ I + +        GL+IVGGR+E  
Sbjct: 114 TEEWEVLVFDHQLKPLWERYVVDEIPKNHYHSEVTIQVVS-------NGLIIVGGRLEAI 166

Query: 197 PHTI----------MDP---FEEIGL-----------------AEKNAEQHRRSASEKEA 226
           P+ +          +DP     ++G                   E +A  H     E   
Sbjct: 167 PNELHRSHITPAIGIDPNALASKVGTLDEGDPTAATATGETASKEGDAHAHDHEKEEVHR 226

Query: 227 SENSGTVDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQL---IPQHNYKLDVHALN 283
            EN       HF+FYAF+  +G + W     +   + T A +      +H+YK  V++L 
Sbjct: 227 DEN-------HFSFYAFSAYNGQMHWHHDELSFLPENTHADEEHHDEKRHSYKQHVYSL- 278

Query: 284 SRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKP 343
             H GE   + +++S+L  MPH W  + DT  +L+HF +                  H P
Sbjct: 279 LEHIGEVSWKSYKQSMLAAMPHRWSSKFDTRFQLAHFEKQGAN-------------RHNP 325

Query: 344 EEHHPPGKDSTKKISNLIG-KAATYAG-SAKSKKPVNYIPTITNYTQLWWVPNVVVAHQK 401
            +    G +     + +IG   + + G +  +  P  Y    T++      PNVV+AH K
Sbjct: 326 TK----GNNEADWNTEVIGVHPSHFEGLTGGNSAPAQYPHAETDHID---EPNVVIAHTK 378

Query: 402 EGIEAVHLASGRTVCKLHLQEGG---------LHADINGDGVLDHVQAVGGN--GAEQTV 450
            G+E + L +GRT+CKL L              + D+N DGV+D V A+ GN  G+    
Sbjct: 379 AGMEVIQLQNGRTLCKLLLDSTNKAENSDHYITYTDLNNDGVVDQVHAIAGNLPGSSMFS 438

Query: 451 VSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASL--- 507
            +   E+   C A+  SG+P R+ LFN SICH          +F   F R+S  A +   
Sbjct: 439 RNRRQEI---CLALGMSGLPPRDYLFNRSICHEGGM------DFDYFFWRSSPAAEVSRG 489

Query: 508 -EVATPILIPRSDGHRHRKGSH--GDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATW 564
            +   P +I  ++      GS    DVVFL N G++T+       H     WQ  TDA W
Sbjct: 490 AQTVAPAIIASANT---VAGSAKLNDVVFLVNSGKITSVR-----HTGRANWQRDTDARW 541

Query: 565 SNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSIDL---- 620
                P          P+++AFSL V+  +  +LA  D   V++   G IL S  L    
Sbjct: 542 QKDGQPYPH-------PSIQAFSLEVYGPKTNLLAVADH-IVILDQQGEILISERLGKKG 593

Query: 621 --------PAPPTHAL----------------------VCEDFSNDGLTDVILMTSNGVY 650
                    A  +HA                       +  D +NDG  DVI+ T +G Y
Sbjct: 594 SQKLAGGMSASKSHAAGEQEHYQPTDAAALSVMPMGPPIIGDLNNDGYNDVIVPTVHGYY 653

Query: 651 GFVQTRQPGALFFSTLVGCLIVVMGVIF 678
            +   R       S  V  + +   V+ 
Sbjct: 654 VYQLERGHSTFILSLFVIIISLAFTVML 681


>gi|302852355|ref|XP_002957698.1| hypothetical protein VOLCADRAFT_98769 [Volvox carteri f.
           nagariensis]
 gi|300256992|gb|EFJ41247.1| hypothetical protein VOLCADRAFT_98769 [Volvox carteri f.
           nagariensis]
          Length = 748

 Score =  186 bits (473), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 127/313 (40%), Positives = 171/313 (54%), Gaps = 34/313 (10%)

Query: 394 NVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGN--------- 444
           N +VA  +EG+E +HL SGRTVCKLHL    LHADINGDGVLDHV    G+         
Sbjct: 423 NALVAFLEEGVEVLHLYSGRTVCKLHLPPRTLHADINGDGVLDHVSVYHGHVGGGVDGNE 482

Query: 445 ------GAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRN- 497
                 G   ++      V   C AV  SG+P  E LF   +CH + F +      SRN 
Sbjct: 483 DTSLLPGELPSISGKGHAVFGRCTAVVRSGIPPTETLFKVQVCHTNKFGV----SASRNV 538

Query: 498 FGRTSDVAS-LEVATPILIPRSD--GHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIW 554
           F R + VA   E+ATPI++P  D  G   ++  HG +VFLTN+GE+TA + G   H    
Sbjct: 539 FQRAAAVAQPTELATPIMLPIPDLKGFSSKRRQHGMLVFLTNKGEMTAVT-GTGSH---- 593

Query: 555 QWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSI 614
            WQ     +W     P        VVPTL A +L  H     +LA G + AVV+S  G I
Sbjct: 594 LWQEYLQVSW-----PPADENPRHVVPTLAAMALYTHAVPTTVLAAGTEHAVVVSEHGHI 648

Query: 615 LTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQ-PGALFFSTLVGCLIVV 673
           L  ++LP+PPT  LV  DF+ DGL D+IL+T+ G+YG+VQ     G +  S L+  L+V 
Sbjct: 649 LAELELPSPPTQPLVVADFNGDGLNDIILVTNRGIYGYVQVPHLAGGMSLSALLLTLVVA 708

Query: 674 MGVIFVTQHLNSV 686
           +G+++ TQH + +
Sbjct: 709 LGLVYFTQHYDPM 721



 Score =  181 bits (458), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 114/302 (37%), Positives = 170/302 (56%), Gaps = 31/302 (10%)

Query: 53  VADLNGDGRKEVLVATHDAKIQVLEP--HARRVDEGFSEARVLAEVSLLPDKIRIASGRR 110
           +ADLNGDG  E++VAT D K+QV++P  H  R  EGF+ A  L  +SLL  +  +A+G R
Sbjct: 1   MADLNGDGHLELVVATPDLKLQVIQPAPHGHR-GEGFARAAELNAISLLFKRALVAAGHR 59

Query: 111 AVAMATGVID-RTYRQGQPL-KQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHR 168
            VA+A G +D     + +PL K V+V+VT+ W VMCFDHNL   WE + +  FP +A  +
Sbjct: 60  PVALAVGYLDPLPAERVRPLRKAVIVIVTASWRVMCFDHNLVLKWEYDAKMHFPHHARIK 119

Query: 169 EIAISISNYTLKHGDTGLVIVGG---RMEMQPHTIMDPFEEIGLA------EKNAEQHRR 219
           E+A+ I+ + +   D G+VIVG    R ++     ++   E G A      +    + R 
Sbjct: 120 EVAVYIAPHQVHEADRGIVIVGASVLRGDLASGEGLESVAEGGPAGFFEDDDVLLSEMRE 179

Query: 220 SASEKEASENSGTV----DL------------RHFAFYAFAGRSGLLRWSRKNENIEAQP 263
            A  KE ++++G      DL            RHF++ A  G +G LRW   + +     
Sbjct: 180 DAERKEHAKSAGVTESLTDLDPNDPLAGLPGSRHFSYVALEGGNGTLRWHHGSGDFHKDL 239

Query: 264 TD-ASQLIPQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRR 322
           ++   +L PQ+NY+LD   L+ RH GE  CR++RESVL V+PH W+R  DT L  +HF +
Sbjct: 240 SELGGELTPQNNYRLDASKLDGRHFGEASCRDYRESVLHVLPHVWERPADTRLMEAHFIK 299

Query: 323 HK 324
           H+
Sbjct: 300 HR 301


>gi|405976894|gb|EKC41372.1| hypothetical protein CGI_10025644 [Crassostrea gigas]
          Length = 679

 Score =  186 bits (472), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 187/709 (26%), Positives = 300/709 (42%), Gaps = 105/709 (14%)

Query: 6   LAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDA-----DRLPPPIVADLNGDG 60
           + +L+ S  A  F  +    +  +  W   SE +  K        DRLPPPI+ DL+GDG
Sbjct: 17  IFVLLCSVIAYLF--RASDSYDLKPVWRKRSEPHHYKNKLYPTLDDRLPPPIITDLDGDG 74

Query: 61  RKEVLVATHDAKIQVLEPHARRVDEGFSEA--RVLAEVSLLPDKIRIASGRRAVAMATGV 118
             E+L+ THD K+ VL    R  D+        V+ + ++LP  I ++   R VAM TG 
Sbjct: 75  TNEILLITHDFKLNVLALPERAADDDDDTLPHVVVKQRAVLP--INVSEISRPVAMETGF 132

Query: 119 IDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHRE------IAI 172
                   Q  KQ++VV T  W V+C+DHNL  LW   L +     +H +E      + I
Sbjct: 133 TVPYSSMMQIRKQIVVVATDDWQVLCYDHNLELLWHKRLMD----VSHVKETYTVKAMGI 188

Query: 173 SISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGT 232
            I+ + +K  D G+VIVGG      HT  D    I        +H  + +E++   ++  
Sbjct: 189 LITPHNVKKKDQGMVIVGGSFTHLVHTPPDTTTTI--------KHSVNKTEEKQENSTDD 240

Query: 233 VDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFEC 292
             L HF+ +A +   G  RW     +     T+   +   H++KL +   +  H GE   
Sbjct: 241 NRLTHFSSFAVSALDGTSRWHHLPGDFGEIATNIKDIHGDHHWKLALKR-HRLHVGEAPW 299

Query: 293 REFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKD 352
             +++      PH W   +DT L L  FR+ +    K     S+S       +H      
Sbjct: 300 TMYKKEFSEFTPHLWVNLDDTKLTLGRFRKTEEGGEKYSSSSSSSSGMALTPDH------ 353

Query: 353 STKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASG 412
                  +IG A  Y G     +P +    + N       PN VV H   GIE ++L +G
Sbjct: 354 -------IIGYA--YGG----HRPHSNHEHVEN-------PNAVVIHTHNGIEVLNLLNG 393

Query: 413 RTVCKLHLQ-EGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPV 471
           + + +LHL  +GG++ DI+ DG +            + V+ G  +   PC+       PV
Sbjct: 394 QPITELHLPGDGGVYVDIDSDGEI------------EQVLWGLQDDYSPCYIEIWRINPV 441

Query: 472 REQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSD----------GH 521
           +E++    IC  +   LF    F+ ++    D  +L+   PI+I              GH
Sbjct: 442 KERIEQLPICRIT--RLF----FTSSWAYDED--NLKKLPPIIIKSVARKTGLIRYFMGH 493

Query: 522 RHRKGSHG-DVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNL---------PSPS 571
              K +H  D++     G V++++      D    WQ+ T A W+             P+
Sbjct: 494 HLPKFAHKHDIITFGGVGRVSSWN-----RDGNANWQIATPANWARAAIDRKKNKEADPA 548

Query: 572 GMTE-ASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVIS-PGGSILTSIDLPAPPTHALV 629
                     P+    S+ V+  Q +    G  E V++    G++L    LP  P+  L 
Sbjct: 549 AYQRFVEEFWPSHVLMSIPVYGQQNVAALSGFSEFVLVDLVNGNLLAEHSLPCSPSAPLT 608

Query: 630 CEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVIF 678
             DF NDG+ DVI+  S G  GF    +   L F+ L G  + ++ ++ 
Sbjct: 609 VGDFDNDGVNDVIVTCSLGYIGFSLHHKTNHL-FTALYGLTVFILILLL 656


>gi|156371409|ref|XP_001628756.1| predicted protein [Nematostella vectensis]
 gi|156215741|gb|EDO36693.1| predicted protein [Nematostella vectensis]
          Length = 645

 Score =  186 bits (471), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 194/683 (28%), Positives = 293/683 (42%), Gaps = 138/683 (20%)

Query: 46  DRLPPPIVADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARVL------AEVSLL 99
           D LP PI+ DL+ DG  E++VAT DAK++V+        E FS   +L      AE SLL
Sbjct: 55  DLLPKPIIVDLDNDGTNELVVATADAKVKVMVFPV----EDFSSENILPHLHIKAEYSLL 110

Query: 100 PDKIRIASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQ- 158
            +  +  SG   VAMA G  +    +     QV+VVVTS W+V C  + L  +W+ +LQ 
Sbjct: 111 SEVEQSNSGVLPVAMAAGCQEHVMERD--CNQVIVVVTSDWTVHCMSNFLKPIWKTSLQS 168

Query: 159 EDFPPNAHHREIAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHR 218
           +D P    H           ++H  T    V G          D  +E+     N  +  
Sbjct: 169 KDHPLQPEH-----------IRHDHTVETGVAG----------DLVQEL----VNKPRIH 203

Query: 219 RSASEKEASENSGTVDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLD 278
           R    K+A+ N       HF+ YA    +G +RW  +  +     T   +L+  +++KL 
Sbjct: 204 RPVQSKKAASNEAD----HFSTYALDSWTGQIRWRHEPGDFVTNQTAREELLSAYHFKLA 259

Query: 279 VHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSY 338
           +H+ N  H GE    +++ SVL  +P  W    DT +K++ FR+ K        G   S 
Sbjct: 260 LHS-NQYHAGEVRWEQYQASVLRSLPLRWAHPSDTNMKIADFRKGK-------TGSQIS- 310

Query: 339 PFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVA 398
                           KK S L           K  K         N TQ     N +V 
Sbjct: 311 --------------EVKKKSWLDLGFTDLTSDTKGTK---------NRTQ----GNAIVI 343

Query: 399 HQKEGIEAVHLASGRTVCK-LHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEV 457
               G+E +++ SG+ +C+ +H QE     D+NGD V+DHV       A Q   +     
Sbjct: 344 QSHTGLEVLNIQSGQPLCRYVHTQELTTAGDLNGDDVIDHVSM--HFSAHQLFPTD---- 397

Query: 458 LRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPR 517
           L  C A+A SG  V   LF   +C            F  + G   +V  L VA P+L+P 
Sbjct: 398 LPSCSAIARSGSRV---LFTGPVCASGSL-------FDWSSGEPHEVEPLLVA-PLLVPS 446

Query: 518 SD----------GH--RHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWS 565
                       GH  R R  +  D VFL + G++T+Y P  HG      WQ+  +  W+
Sbjct: 447 PPHRTGLFRHVMGHNLRQRSRAEMDAVFLVSTGKLTSYGP--HGEQ---NWQVTVEGAWA 501

Query: 566 NLPSPSGMTEAST------VVPTLKAFSLRVHDNQQ--MILAGGDQEAVVISPGGSILTS 617
               P    E ++        P+L++  +   + ++  ++++G    ++V    G+I+ S
Sbjct: 502 KHVRPDEHGEWTSENGDQRFFPSLQSMRVNADEEEESAVLVSGWYHISLVSLVDGTIMAS 561

Query: 618 IDLPAPPTHALVCEDFSNDGLTDVIL-----------MTSNGVY-GFVQTRQPGALF--- 662
             LP  P   +V  DF+NDGLTDV++            TSN  Y GF   R+PG L+   
Sbjct: 562 HSLPCQPVAPVVNGDFTNDGLTDVVVQCSQSQHCIYYQTSNFSYLGFALERRPGYLWTAI 621

Query: 663 --FSTLVGCLIVVMGVIFVTQHL 683
              S LV  L+ +  + FV + L
Sbjct: 622 WVVSGLVVALLTLGCMRFVEEEL 644


>gi|145355970|ref|XP_001422217.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582457|gb|ABP00534.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 484

 Score =  185 bits (470), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 143/444 (32%), Positives = 218/444 (49%), Gaps = 42/444 (9%)

Query: 238 FAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRE 297
           F +YAF  R+G  RW+  +   E +              LD   ++     E  CR+FRE
Sbjct: 17  FEYYAFDARTGARRWTETSGEDEHR---RRARARLSLRNLDDDEIDG--AVERACRDFRE 71

Query: 298 SVL-GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHP--PGKDST 354
           S++   +PH W    DT ++L+HFRRHK +       K+++    +        P   +T
Sbjct: 72  SIVQDGLPHLWRHPVDTKMRLAHFRRHKSRANAMARDKASAGKRKRGAAAAKAVPSNAAT 131

Query: 355 KK----ISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLA 410
           +     +  L G++        +  P  +  T     +    PNVVV+H  EGI+ +HL 
Sbjct: 132 RAFGRALDALKGESKRRLDDDDAHPP--HASTHARRVRKREPPNVVVSHHAEGIDVLHLY 189

Query: 411 SGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVP 470
           SG  VC++ L+  GLH D++GDGV+DHV+A G N         S E +  CWA  TSGVP
Sbjct: 190 SGVKVCEMKLKSPGLHVDLDGDGVVDHVEAHGRN-------LRSAEDIPHCWATVTSGVP 242

Query: 471 VREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPR-----SDGHRHRK 525
              +  +ASIC      L  H   ++  GR  D+ ++EV +PI + R     S+  +   
Sbjct: 243 DEGRTLSASICRGGS-GLAGH-RAAQAAGR--DIRAVEVVSPISLRRLPETSSELAKPLD 298

Query: 526 GSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKA 585
               D +FL +RGE+T Y+    G     +WQ+ T A W    +PS       V P+L +
Sbjct: 299 RMGRDAIFLNSRGEMTCYN-ARDGEQK--RWQIRTTADW----TPSD----DVVKPSLTS 347

Query: 586 FSLRVHDNQQMILAGGDQEAVVISPGGSILTS-IDLPAPPTHALVCEDFSNDGLTDVILM 644
           F +RV     + +A G    V+++  G   T+ I+LP+PP   LV  D   DG TD++L 
Sbjct: 348 FPIRVDGYLDLAIATGASSMVIVNSKGYRATAPIELPSPPAAPLVARDVDGDGFTDLVLH 407

Query: 645 TSNGVYGFVQTRQPGALFFSTLVG 668
           T +GVY ++QT + G L F+ L+G
Sbjct: 408 TKSGVYVWIQTPRAGNLPFTFLIG 431


>gi|290973099|ref|XP_002669287.1| FG-GAP repeat protein [Naegleria gruberi]
 gi|284082832|gb|EFC36543.1| FG-GAP repeat protein [Naegleria gruberi]
          Length = 683

 Score =  185 bits (470), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 164/737 (22%), Positives = 317/737 (43%), Gaps = 124/737 (16%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWF--HLSEEYPIKF---DADRLPPPIVAD 55
           MRKRD A+  L    +  +L  EG  +  +A+F    + + P+       D +P PI+ D
Sbjct: 1   MRKRDFAMFALVCLLLVVALNQEGSVTIEKAFFFSQTTNQQPLLLKTGQVDLVPKPIITD 60

Query: 56  LNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLP------------DKI 103
           ++GDG K+++ +  + +I ++        +  S + ++ +    P            DK 
Sbjct: 61  IDGDGSKDLITSNDNGRISIVSL------KTISSSLLVHQDMFNPITEEKSFDVNEKDKN 114

Query: 104 RIASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANL------ 157
           +       + +  G I    ++GQ   + +  V S W V+C D +LN  WE++L      
Sbjct: 115 KGTVHHSIIGLGHGHIKVKKKKGQRGTKRIAAVLSNWKVLCLDPDLNLQWESDLLQHVVS 174

Query: 158 --------QEDFPPNAHHREIAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGL 209
                    ED   +    E+ +++  + +   D G+VI+G R   +P  + D       
Sbjct: 175 EMKLTDVEMEDLRNDFKPFEVFVAVYPWRVHENDEGMVIIGFR---KPQEMFD------- 224

Query: 210 AEKNAEQHRRSASEKEASENSGTVDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQL 269
                     + S K   E +  +      ++   G +G LRW  ++    A+  D ++ 
Sbjct: 225 ----------NDSTKRTEETATQI-----TYFCLDGATGQLRWKHQSN---AKSWDDNEF 266

Query: 270 IPQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILK 329
             QH++KL+     S   GE   RE+++ ++  +PH +    DT + L +F+  K+  + 
Sbjct: 267 YEQHSFKLNAQQHLS-DSGEVIWREYKKKIMHHLPHVYRHPYDTSVILDNFQPTKKAKVS 325

Query: 330 KVVGKSTSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQL 389
           K+  ++    +           D  +K+ + I    T       K   N    I    + 
Sbjct: 326 KIQERAEKQDY----------GDIGEKVKSTIQTVTTKTKDTNEKLKKN----IELMKKF 371

Query: 390 WWVPNVVVAHQKEGIEAVHLASGRTVCKLH-LQEGGLHADINGDGVLDHVQAVGGNGAEQ 448
             +PNV+VAH  +GI+ +HL +GRT+ +   L+E   + DIN DG++D  +A        
Sbjct: 372 EKIPNVMVAHLSKGIQVIHLYTGRTLTQFSPLKENTYYQDINFDGMIDAAEA-------- 423

Query: 449 TVVSGSMEVLRPCWAVATSGVPV-REQLFNASICHHSPFNLFPHGEFSRNFGRTSD---- 503
                     + C A   SG P   + LF  S+C   P +     +FS  F  +++    
Sbjct: 424 ----------QVCKARVESGAPNPYDTLFRKSVCKLHPSSFIE--QFSIPFLMSNEQQER 471

Query: 504 ----------VASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYS----PGLHG 549
                        ++V TP +I      +  +    D+V+LT+ G VTA +      ++G
Sbjct: 472 QSFDDDEEYEDEQMDVVTPAIIEHYSAEQKPQHRARDLVYLTSTGLVTAITFDRKNSVNG 531

Query: 550 HDAI-WQWQLLTDATW--SNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAV 606
                 +WQ++T +++    +   SG  E  +  P + AF LR  +    ++A G++   
Sbjct: 532 EPKFRVKWQVMTPSSFRRERVLVESG-AETESFFPEIVAFPLRKFETHSFVVAVGEKRIT 590

Query: 607 VISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTL 666
           ++   G++   I+LP P    +   D + D + D+++ T  G+Y ++     G    + L
Sbjct: 591 IVDLFGNVQQIIELPHPSISPIQLVDINGDNILDIVVSTKQGIYAYITRVHTGMSVLTFL 650

Query: 667 VGCLIVVMGVIFVTQHL 683
           +  L V++G+++++ ++
Sbjct: 651 IVSLFVIVGLLYLSTYI 667


>gi|159484188|ref|XP_001700142.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158272638|gb|EDO98436.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 786

 Score =  182 bits (462), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 121/311 (38%), Positives = 169/311 (54%), Gaps = 34/311 (10%)

Query: 394 NVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQ------AVGGNGAE 447
           N +VA  +EG+E +HL SGRTVCKLHL    LHAD+NGDGVLDH+       A  G+G E
Sbjct: 454 NALVAFLEEGVEVLHLFSGRTVCKLHLPPRTLHADLNGDGVLDHISVYHGHTAQAGDGEE 513

Query: 448 ---------QTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRN- 497
                    ++V      V   C A   SG+P  E LF   +CH   F +      SRN 
Sbjct: 514 GDALLPGELRSVSGRGHAVSGRCTAHVRSGIPPSETLFTVQVCHTRKFGV----SASRNV 569

Query: 498 FGRTSDVAS-LEVATPILIPRSD--GHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIW 554
           F R +  A   E+ATP+++P  D  G+  R+  HG +VF+TN+GE+TA S G        
Sbjct: 570 FQRAAAAAQPTELATPVMLPIPDLHGYSSRRRQHGMLVFMTNKGEMTAVSSG-----GAH 624

Query: 555 QWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSI 614
            WQ     +W     P        VVPTL A +L  H     +LA G   AV++S  G +
Sbjct: 625 LWQEYLQVSW-----PPADQNPDHVVPTLAAMALYTHAVPTTVLAAGTDHAVIVSEHGHV 679

Query: 615 LTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQ-PGALFFSTLVGCLIVV 673
           L  ++LPA PT  LV  DF+ DGL D+I++T+ G+YG+VQ +   G +  S L+  L++ 
Sbjct: 680 LAELELPAAPTQPLVVSDFNGDGLNDIIVVTNKGLYGYVQVQHLAGGMSLSALLLTLLIA 739

Query: 674 MGVIFVTQHLN 684
           +G+++ TQH +
Sbjct: 740 LGLVYFTQHYD 750



 Score =  176 bits (447), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 110/303 (36%), Positives = 160/303 (52%), Gaps = 32/303 (10%)

Query: 54  ADLNGDGRKEVLVATHDAKIQVLEPHA-RRVDEGFSEARVLAEVSLLPDKIRIASGRRAV 112
           ADLNGDG  E++V T D K+QV++P       EGF+ A  +  +SLL  +  +A+G R V
Sbjct: 7   ADLNGDGHIELVVTTPDLKLQVVQPAPPGHHGEGFARAVEINSISLLFRRALVAAGHRPV 66

Query: 113 AMATGVID-RTYRQGQPL-KQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREI 170
           A+  G ID     + +PL KQV+VVVT+ W V+CFDHNL  LWE + +  FP ++H +E+
Sbjct: 67  ALQVGYIDPLPLERVRPLRKQVIVVVTASWQVLCFDHNLVMLWEYDARVHFPHHSHIKEV 126

Query: 171 AISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNA------------EQHR 218
           A+ IS + +   D GLV+VG  +        +  + +     N              + R
Sbjct: 127 AVYISPHQVHGADRGLVVVGASVMRGDVASGEGLQSVAGGGVNGTGSFFEDDDVLLSEMR 186

Query: 219 RSASEKEASENSGTV----DL------------RHFAFYAFAGRSGLLRWSRKNENIEAQ 262
             A  +  S+ +G      DL            RHF++ A  G  G +RW   + +    
Sbjct: 187 EDAERRRLSKGAGLAEQLSDLDKDDPLAGLPGSRHFSYVAVEGGKGAVRWQHGSGDFHKD 246

Query: 263 PTD-ASQLIPQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFR 321
             + +S L+PQH Y+LD   L  RH GE  CR++RESVL V+PH WDR  DT L  +HF 
Sbjct: 247 LDELSSGLVPQHQYRLDAEKLEGRHFGEASCRDYRESVLHVLPHVWDRPADTRLMEAHFI 306

Query: 322 RHK 324
           +H+
Sbjct: 307 KHR 309


>gi|390356446|ref|XP_003728789.1| PREDICTED: uncharacterized protein LOC100891930 [Strongylocentrotus
           purpuratus]
          Length = 624

 Score =  174 bits (441), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 155/588 (26%), Positives = 256/588 (43%), Gaps = 84/588 (14%)

Query: 4   RDLAILMLSAFAIFFSLQHEGDFSFREAWFH-----LSEEYPIKFDADRLPPPIVADLNG 58
           RD  IL++ +    + L+ +  F    +W +       EE        +LP P+VAD+  
Sbjct: 19  RDGVILVILSLMATYLLRVQHSFELVPSWRYEVSTGFYEESMNTRTLKKLPVPLVADIES 78

Query: 59  DGRKEVLVATHDAKIQVLEPHARRVD-EGFSEARVLAEVSLLPDKIRIASGRR-AVAMAT 116
           DGR E+L+ T D+ + +L+        +       L E SLL D+  IA  ++ AVAMAT
Sbjct: 79  DGRNEILLTTKDSSLLLLKTSKPSPGLKALPYLNTLDEFSLLSDEADIAGDKKQAVAMAT 138

Query: 117 GVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAH-HREIAISIS 175
           G ++      Q  KQV+VV+ + W+V+C + +   +W   L      + +   E AI I 
Sbjct: 139 GYLEPLTSDNQVRKQVIVVLCADWTVVCLNSDFKLIWSMKLPNTTWSHQYVMSEAAILIL 198

Query: 176 NYTLKHGDTGLVIVGGRM-EMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVD 234
            ++L+  D GLVI+GGR+ +   H  +         + + + H    + KE+  +    D
Sbjct: 199 PHSLQSDDGGLVIIGGRLADRLAHATL---------KHSHDGHGFDGANKESKSSH---D 246

Query: 235 LRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECRE 294
           + HF+ +A +GR G +RW     +   +     +L    ++KL++      H GE    +
Sbjct: 247 VGHFSTFALSGRHGDVRWHHLPGDFGEETNTEEKLFEGFHFKLNLKR-GWGHQGELGWNQ 305

Query: 295 FRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKS------TSYPFHKPEEHHP 348
           + +++L  MP  W    DT + ++ FR+     +KK   K       +SY     EEH  
Sbjct: 306 YNQALLKQMPFRWQSSMDTKIDIAEFRKDG---VKKSAEKDEHILSVSSYLTSSFEEH-- 360

Query: 349 PGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVH 408
                          A  + G         +I            PN V+ H ++GIE + 
Sbjct: 361 --------------IAGHHFGGLPPHSASEHIKN----------PNAVIIHSQQGIEVLE 396

Query: 409 LASGRTVCKLHLQEGGLHADINGDGVLDHVQAV-GGNGAEQTVVSGSMEVLRPCWAVATS 467
           L SGR + +L L     +ADI+ DGV+D  +A+   +G+E             C AV  +
Sbjct: 397 LKSGRPMTRLDLSPDATYADIDKDGVVDQAKALFTDDGSEGN-----------CQAVVKT 445

Query: 468 GVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGS 527
           G P   +L+N SIC+ S         ++   G +    + E++   LI +S   R    S
Sbjct: 446 GHPPHSELYNGSICYPSSLWAALSYPWAYASGSSEIKENQELSLRPLIVKSVAKRRGIIS 505

Query: 528 H----------GDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWS 565
           H           D +F  + G+VT+Y P        + WQ+ T A WS
Sbjct: 506 HLLGLSMSKAGMDTIFTVSTGQVTSYGP-----QGQFNWQVSTSALWS 548


>gi|424512936|emb|CCO66520.1| predicted protein [Bathycoccus prasinos]
          Length = 1026

 Score =  162 bits (410), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 193/762 (25%), Positives = 303/762 (39%), Gaps = 186/762 (24%)

Query: 1   MRKRDLAILMLSAFAI----FFSLQHE----GDFSFREAWFHLSEEYPIKFDADRLPPPI 52
           +R RD+ +L L  FA+    FF+  +        SF + W  L+ +        +   P+
Sbjct: 47  LRTRDVLVLSLFCFALLRCLFFNRNNSVGSLNGMSFVKQWEILTVDSSSSSSPPKPLLPV 106

Query: 53  VA-DLNGDGRKEVLVATHDA-KIQVLEPHAR--------RVDEGFSEA---------RVL 93
           +  DLNGDGR+E L+A  ++ KI + +  +R          + G +E          R+L
Sbjct: 107 LFFDLNGDGRQETLIAFDESNKIGIYDGSSRGGKGQKTKNNENGMNEDEDVGNENTLRLL 166

Query: 94  AEVSLLPDKIRIAS--------GRRAVAMA------------------TGVIDRTYRQGQ 127
             V L     +  +        G+R +A+A                  +G+  +  R+  
Sbjct: 167 KMVDLRETNEKSKTHRMRDEHGGKRVIALAAGRPEARRRGETKRRRSGSGIGSKQRRKTV 226

Query: 128 PL---KQVLVVVTSGWSVMCFDHNLNKLWEANLQED-FPPNAHH---------------- 167
            +   K + V VT    VM FDHN  K WE + ++  F   AHH                
Sbjct: 227 VMIKRKAIFVAVTEELQVMTFDHNAKKKWERSARDALFSGKAHHRHHLGGRGGRGDSMRV 286

Query: 168 REIAISISNYTLKHGDTGLVIVGGRMEMQPHTI------------MDPF----------- 204
            EIA+++ +   ++ D GLV+VGGR+E + +              M+ +           
Sbjct: 287 EEIAVTVVSAKGENED-GLVVVGGRVEYEKYQNEEGEEGEDYERGMEAYARELKDDEMLS 345

Query: 205 --------EEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAFYAFAGRSGLLRWSRKN 256
                   E I   E+  E      ++  AS+N        F + AF  RSG L W R  
Sbjct: 346 MHRGGRRDETIEFKEEGVEDIIEDDNDTFASQNGA------FVYLAFNARSGELVWRRVA 399

Query: 257 ENIEAQPTDA-SQLIPQHNYKLDVHALN-SRHPGEFE--CREFRESVLG-VMPHHWDRRE 311
            +  A P +      P H  +LD H      H  + +  CR +RES L   +PH W   E
Sbjct: 400 SDFVADPAELLRTTTPAHEVRLDSHLTKIEEHKSKTDGICRSYRESALAEALPHAWRDAE 459

Query: 312 DTLLKLSHFR------------RHKRKILKKVVG---KSTSYPFHKPEEHHP--PGKDST 354
           DT ++ + F             R KR    +  G    S +    K     P     D T
Sbjct: 460 DTRMRSAPFSKSQKQTSSSFGGRKKRARGGRDPGPLATSAATTNLKAASRDPNFAVADKT 519

Query: 355 KKISNLIGKAATYAGSAKS--------KKPVNYIPTITNYTQLWW-----VPNVVVAHQK 401
              S+L+ +  +Y    KS        ++ ++Y     ++  +       + N ++AH +
Sbjct: 520 NDFSSLLSRGFSYLNLGKSAVHHRDDEREKLDYHNRRFHHQHMMSSSNNNIKNAIIAHHR 579

Query: 402 EGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRP- 460
           +G+E +   +G  +C+L LQ  G HADINGDG +DH+ A    G    +   +M+   P 
Sbjct: 580 DGVEVLDAFTGELICQLALQHPGYHADINGDGAIDHLDA---RGHRAHIADVAMDNFSPG 636

Query: 461 CWAVATSGVPVREQLFNASIC--------HHSPFNLFPHGEFSRNFG----RTSDVASLE 508
           CWA  TSG P  + +F  SIC          +P       +  +N G    R  +   ++
Sbjct: 637 CWASVTSGAPDVQTVFEGSICKPTRKSSMKSTPRQRAKSNKKQQNSGDFYLRNKNKDDVD 696

Query: 509 VATPILIPRSDG---HRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWS 565
           V TPI++ R D     R  + +  D VFL +RGE+T Y+      D   +W + T A+W 
Sbjct: 697 VLTPIIVRRDDKSERDRSFRRATKDAVFLNSRGELTCYAK-----DGTRRWMVNTRASWK 751

Query: 566 ---NLPSPSGM-------------TEASTVVPTLKAFSLRVH 591
              N P   G+              E    VPTL  F  R H
Sbjct: 752 IDVNGPPKDGIYDLFSDVNNNNDSKEMDRFVPTLATFKFRTH 793


>gi|443696151|gb|ELT96931.1| hypothetical protein CAPTEDRAFT_224542 [Capitella teleta]
          Length = 688

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 177/726 (24%), Positives = 299/726 (41%), Gaps = 119/726 (16%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFH-------LSEEYPIKFDADRLPPPIV 53
           +R +D+ I  + A  ++  L  +       AW+H        +  YP+  D +R P PIV
Sbjct: 20  VRLQDVVITAVCALLVYL-LGAKNQIILTPAWYHHFDAKHFENRRYPV--DEERKPQPIV 76

Query: 54  ADLNGDGRKEVLVATHDAKIQVLEPHA--RRVDEGFSEARVLAEVSLLPDKIRIASGRRA 111
            D +GD + EV++ +++ ++QVL  +A  +   +   + +V   V L  +          
Sbjct: 77  TDFDGDNKMEVVLISNENQLQVLLYNASQKLASKRLQQLQVKHSVVLPLEVNEYGEQEYP 136

Query: 112 VAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHRE-- 169
           VAM TG ID          QV+VV T+ W V+C+   L  LW+  L     P  H R+  
Sbjct: 137 VAMETGYIDPYISPEHSRSQVIVVATNNWHVLCYSSTLQLLWQHQLM----PLNHSRDHL 192

Query: 170 ----IAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKE 225
                A+ +S   L   D G + VGG    + H          L ++ A +    A +  
Sbjct: 193 QMTASALLVSAVKLHKNDQGAIFVGGAYGHKDHHARMKM----LLDEQASRAMHQAGDLH 248

Query: 226 ASENSGTVDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSR 285
            + +  T    HF+ +A +G +G +RW     + E + + A   +  H++KL++   +  
Sbjct: 249 TAPDDPTT---HFSTFALSGSNGAIRWHHLPGDFETKRSKAR--VDSHHWKLNLRTHHEE 303

Query: 286 HPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPF----- 340
           H GE    ++ ES L ++PH W    DT    +  R+ K  I++  +  S++ P      
Sbjct: 304 HMGEVHWTKYGESFLQLLPHDWTSLGDTKFDFADLRK-KELIIR--IESSSALPLVDIIG 360

Query: 341 HKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQ 400
              +++H  G                Y G      P ++   I         PN +V H 
Sbjct: 361 ESLDDYHISG--------------VAYGGLG----PHSHHEHIRK-------PNTLVVHN 395

Query: 401 KEGIEAVHLASGRTVCKLHL-QEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLR 459
            +G+E + L  GR + +L + ++   + D+N DG L+  + V  +               
Sbjct: 396 HKGVEVLDLHVGRPLTRLKVARDKSAYIDVNEDGTLERARLVATDDG------------- 442

Query: 460 PCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLE-VATPILIPRS 518
            C+   ++  P    +F+ ++C  S    +  G     F R    A  E  + P    +S
Sbjct: 443 -CYGEVSTVHPRPLAIFSENVC--SALQWWGSGSV---FSRLPVAAEDEDCSVPPFFIKS 496

Query: 519 DGHR--------------HRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATW 564
              R               RKG   D +FLT++G +++++P     +    WQ+LT A W
Sbjct: 497 IAVRKGFVNHLLGYTLPSERKGY--DTLFLTSQGRLSSFAP-----NGDLNWQVLTPAQW 549

Query: 565 -----SNLPSPSGM---TEASTVVPTLKAFSLRVHDNQQM-ILAGGDQEAVVISPGGSIL 615
                 N   P  +          P+    S +V+  + + +L G D  A+V    G IL
Sbjct: 550 VEQTTRNWRDPGSIHADVYNHVFKPSATPMSPKVYGRKSVALLLGWDTIALVDLRDGQIL 609

Query: 616 TSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGF-VQTRQPGALFFSTLVGCLIVVM 674
               LP  P    V  DF+NDG  D I+  S+G   F V +  P      TLV  + VV+
Sbjct: 610 GEHSLPCQPIDKPVIADFTNDGQNDFIVTCSSGSITFSVSSEFP---IVHTLVIGVFVVL 666

Query: 675 GVIFVT 680
            V+ +T
Sbjct: 667 CVLLMT 672


>gi|407410236|gb|EKF32752.1| hypothetical protein MOQ_003389 [Trypanosoma cruzi marinkellei]
          Length = 711

 Score =  128 bits (322), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 173/744 (23%), Positives = 306/744 (41%), Gaps = 123/744 (16%)

Query: 1   MRKRDLAILMLSA-FAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGD 59
           MR RD+ I+ ++  F IF   Q  G   +R     +++    K+    +  P+V DL GD
Sbjct: 1   MRLRDILIVCVAIIFCIFGWSQEGGLIVWRNFKIPITDSEVSKY---VMSKPVVLDLMGD 57

Query: 60  GRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVI 119
           GR  +L +T    +++ + H  RV+   SE      VS+ P    + +  R VA+A G +
Sbjct: 58  GRPVLLASTKYGSLELFKTHLARVN---SEN---VFVSIRPSS-SLKTLFRIVAIAAGKL 110

Query: 120 DRTYRQGQPLKQVLVVVTSGWSVMCFD-HNLNKLWEANLQEDFPPNAHHREIAISISNYT 178
            RT ++       +VV++  + +     H+   +W+  L   +    H    ++S+    
Sbjct: 111 SRTSKE-----NAIVVISDDFRLRRISPHDFTAVWDVPLSASWIETYH---ASVSVVPER 162

Query: 179 LKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHF 238
           +   D G V+V  ++     T +  +     A+  A  H  S +E    E  G V     
Sbjct: 163 IHEQDEGTVLVAMQVAGPNGTELMLYAAFNGADGQARWHYTSDAENSIGEVLGEV----- 217

Query: 239 AFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRES 298
                            N+ +E  P D    + +++        N     E     +RE+
Sbjct: 218 ----------------CNDGVECLPPDNDTTVGRNSIFKSQGVKNQFRAQEKPWTFYREA 261

Query: 299 VLGVMPHHWDRREDTLLK---LSHFRRHKRKILK---KVVGKSTSYPFHKPEEHHPPGKD 352
           ++ ++PH +    D  +      H +  K++  +   +VV K         +E +    +
Sbjct: 262 IITLLPHRYSHPWDEHIHPHVFFHAKNRKKRSFRAGNRVVVKYKDRLIRMNKEDYGALGE 321

Query: 353 STKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASG 412
               I N+ G AAT     KS++P                 N ++ H K GIE +HL +G
Sbjct: 322 RL-GIMNMHGNAATPVN--KSRRP---------------AANAMIFHSKNGIEVIHLYTG 363

Query: 413 RTVCKLH-LQEGGLHA-DINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVP 470
             + +L  L+  G++  DIN D  +D +    G   E     G ++++  C  V  +G+P
Sbjct: 364 SVITRLGPLKSSGVYYHDINDDFQVDAIGTQIGPRMEMHSRHG-VDLIDDCLGVIHTGIP 422

Query: 471 VRE-QLFNASICHHSPF----NLFPHGEFSRNFGRTSD----VASLEV------------ 509
           V E QLFNA+IC    F    +L  H  F  N  R  D    + +LE+            
Sbjct: 423 VAEDQLFNATICDTEGFFGRLDLIHH--FIDNDIRGEDAPHALNTLELIGSRSVLSKNTR 480

Query: 510 -ATPILIP-RSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLL-------- 559
             TP+++   +   R         VF+ + G VT   P       IW+ Q          
Sbjct: 481 SVTPLVVQLHTTKGRDLFQVERHAVFMIDSGLVTCVDPS--RRRVIWRSQTAAGFYGLRE 538

Query: 560 ---TDATWSNLPSPSGMTEASTVVPTLKAFSL-RVHDNQQM--------------ILAGG 601
               DA  + + +   M  A +  P L A+S  R  D+  M              I+A G
Sbjct: 539 AAEADAGMAGMSAKERMHRAVS-FPHLAAYSFYRSQDDAVMSVSRSERYLRTDPFIIAVG 597

Query: 602 DQEAVVISP-GGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGA 660
           ++   ++S   G +L +I+L  PP   ++  D + DG+ D++++T  G+YGFV   Q  +
Sbjct: 598 ERYMCILSTRTGRVLRTIELEHPPVAPVILRDLNGDGINDIVVVTKEGIYGFVVGTQTSS 657

Query: 661 LFFSTLVGCLIVVMGVIFVTQHLN 684
              + L+  ++ ++ V+FV + ++
Sbjct: 658 DTVTALMILMVALLAVLFVVREMS 681


>gi|407848095|gb|EKG03575.1| hypothetical protein TCSYLVIO_005372 [Trypanosoma cruzi]
          Length = 711

 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 170/744 (22%), Positives = 304/744 (40%), Gaps = 123/744 (16%)

Query: 1   MRKRDLAILMLSA-FAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGD 59
           MR RD+ I+ ++  F IF   Q  G   +R     +++    K+    +  P+V DL GD
Sbjct: 1   MRLRDILIVCVAIIFCIFGWSQEGGLIVWRNFKIPVTDSEVSKY---VMSKPVVLDLMGD 57

Query: 60  GRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVI 119
           GR  +L +T    +++ + H  RV+   SE      VS+ P    + +  R VA+A G +
Sbjct: 58  GRPVLLASTKYGSLELFKTHLARVN---SED---VFVSIRPSS-SLETLFRIVAIAAGKL 110

Query: 120 DRTYRQGQPLKQVLVVVTSGWSVMCFD-HNLNKLWEANLQEDFPPNAHHREIAISISNYT 178
            RT ++       +VV++  + +     H+L ++W+  L   +    H    ++S+    
Sbjct: 111 SRTSKE-----NAIVVISDDFRLRRISPHDLTEVWDVPLSASWIETYH---ASVSVVPER 162

Query: 179 LKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHF 238
           +   D G V+V  ++     T +  +     A+  A  H  S +E    E  G       
Sbjct: 163 IHEQDEGTVLVAMQVAGPNGTELMLYAAFNGADGQARWHYTSDAENSIGEVLG------- 215

Query: 239 AFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRES 298
                            N+ +E  P D    + +++        N     E     +RE+
Sbjct: 216 --------------EECNDGVECLPPDNDTSLRRNSIFKSQEVKNQFRAQEKPWTFYREA 261

Query: 299 VLGVMPHHWDRREDTLLK---LSHFRRHKRKILK---KVVGKSTSYPFHKPEEHHPPGKD 352
           ++ ++PH +    D  +      H +  K++  +   +VV K         +E +    +
Sbjct: 262 IITLLPHRYSHPWDEHIHPHVFFHAKNRKKRSFRAGNRVVVKYKDRLIRMNKEDYGALGE 321

Query: 353 STKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASG 412
               + N+ G AAT    ++ +                   N ++ H K GIE +HL +G
Sbjct: 322 RL-GVMNMHGNAATPVNKSRRR-----------------TANAMIFHSKNGIEVIHLYTG 363

Query: 413 RTVCKLH-LQEGGLHA-DINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVP 470
             + +L  L+  G++  DIN D  +D +    G   E     G ++++  C  V  +G+P
Sbjct: 364 SVITRLGPLKSSGVYYHDINDDFQVDAIGTQIGPRMEMHSRHG-VDLIDDCLGVIHTGIP 422

Query: 471 VRE-QLFNASICHHSPF----NLFPHGEFSRNFGRTSD----VASLEV------------ 509
           V E QLFNA+IC    F    +L  H  F  N  R  D    + +LE+            
Sbjct: 423 VAEDQLFNATICDTEGFFGRLDLIHH--FIDNDIRGEDAPHALNTLELIGSRSVLSKNTR 480

Query: 510 -ATPILIP-RSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLL-------- 559
             TP+++   +   R         VF+ + G VT   P       IW+ Q          
Sbjct: 481 SVTPLVVQLHTTKGRDLFQVERHAVFMIDSGLVTCVDPS--RRRVIWRSQTAADFYGLRE 538

Query: 560 ---TDATWSNLPSPSGMTEASTVVPTLKAFSL-RVHDNQQM--------------ILAGG 601
               DA  + + +   M  A +  P L A+S  R  D+  M              I+A G
Sbjct: 539 AAEADAGMAGMSAKERMHRAVS-FPHLAAYSFYRGQDDSAMSVSGSERYLRTDPFIIAVG 597

Query: 602 DQEAVVISP-GGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGA 660
           ++   ++S   G +L +I L  PP   ++  D + DG+ D+I++T  G+YGFV   Q  +
Sbjct: 598 ERYMCILSTRTGRVLRTIVLEHPPVAPVIVRDLNGDGINDIIVVTKEGIYGFVVGTQTSS 657

Query: 661 LFFSTLVGCLIVVMGVIFVTQHLN 684
              + L+  ++ ++ V+FV + ++
Sbjct: 658 DTVTALMILMVALLAVLFVVREMS 681


>gi|71667530|ref|XP_820713.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70886069|gb|EAN98862.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 711

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 170/744 (22%), Positives = 303/744 (40%), Gaps = 123/744 (16%)

Query: 1   MRKRDLAILMLSA-FAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGD 59
           MR RD+ I+ ++  F IF   Q  G   +R     +++    K+    +  P+V DL GD
Sbjct: 1   MRLRDILIVCVAIIFCIFGWSQEGGLIVWRNFKIPVTDSEVSKY---VMSKPVVLDLMGD 57

Query: 60  GRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVI 119
           GR  +L +T    +++ + H  RV+   SE      VS+ P    + +  R VA+A G +
Sbjct: 58  GRPVLLASTKYGSLELFKTHLARVN---SED---VFVSIRPSS-SLKTLFRIVAIAAGKL 110

Query: 120 DRTYRQGQPLKQVLVVVTSGWSVMCFD-HNLNKLWEANLQEDFPPNAHHREIAISISNYT 178
            RT ++       +VV++  + +     H+L ++W+  L   +    H    ++S+    
Sbjct: 111 SRTSKE-----NAIVVISDDFRLRRISPHDLTEVWDVPLSASWIETYH---ASVSVVPER 162

Query: 179 LKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHF 238
           +   D G V+V  ++     T +  +     A+  A  H  S +E    E  G       
Sbjct: 163 IHEQDEGTVLVAMQVAGPNGTELMLYAAFNGADGQARWHYTSDAENSIGEVLG------- 215

Query: 239 AFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRES 298
                            N+ +E  P D    + +++        N     E     +RE+
Sbjct: 216 --------------EECNDGVECLPPDNDTSLRRNSIFKSQGVKNQFRAQEKPWTFYREA 261

Query: 299 VLGVMPHHWDRREDTLLK---LSHFRRHKRKILK---KVVGKSTSYPFHKPEEHHPPGKD 352
           ++ ++PH +    D  +      H +  K++  +   +VV K         +E +    +
Sbjct: 262 IITLLPHRYSHPWDEHIHPHVFFHAKNRKKRSFRAGNRVVVKYKDRLIRMNKEDYGALGE 321

Query: 353 STKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASG 412
               + N+ G AAT    ++ +                   N ++ H K GIE +HL +G
Sbjct: 322 RL-GVMNMHGNAATTVNKSRRR-----------------AANAMIFHSKNGIEVIHLYTG 363

Query: 413 RTVCKLH-LQEGGLHA-DINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVP 470
             + +L  L+  G++  DIN D  +D +    G   E     G ++++  C  V  +G+P
Sbjct: 364 SVITRLGPLKSSGVYYHDINDDFQVDAIGTQIGPRMEMHSRHG-VDLIDDCLGVIHTGIP 422

Query: 471 VRE-QLFNASICHHSPF----NLFPHGEFSRNFGRTSD----VASLEV------------ 509
           V E QLFNA+IC    F    +L  H  F  N  R  D    + +LE+            
Sbjct: 423 VAEDQLFNATICDTEGFFGRLDLIHH--FIDNDIRGEDAPHALNTLELIGSRSVLSKNTR 480

Query: 510 -ATPILIP-RSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLL-------- 559
             TP+++   +   R         VF+ + G VT   P       IW+ Q          
Sbjct: 481 SVTPLVVQLHTTKGRDLFQVERHAVFMIDSGLVTCVDPS--RRRVIWRSQTAAGFYGLRE 538

Query: 560 ---TDATWSNLPSPSGMTEASTVVPTLKAFSL-RVHDNQQM--------------ILAGG 601
               DA  + + +   M  A +  P L A+S  R  D   M              I+A G
Sbjct: 539 AAEADAGMAGMSAKERMHRAVS-FPHLAAYSFYRGQDESAMSVSGSERYLRTDPFIIAVG 597

Query: 602 DQEAVVISP-GGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGA 660
           ++   ++S   G +L +I L  PP   ++  D + DG+ D+I++T  G+YGFV   Q  +
Sbjct: 598 ERYMCILSTRTGRVLRTIALEHPPVAPVIVRDLNGDGINDIIVVTKKGIYGFVVGTQTSS 657

Query: 661 LFFSTLVGCLIVVMGVIFVTQHLN 684
              + L+  ++ ++ V+FV + ++
Sbjct: 658 DTVTALMILMVALLAVLFVVREMS 681


>gi|71422558|ref|XP_812172.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70876921|gb|EAN90321.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 711

 Score =  123 bits (309), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 171/751 (22%), Positives = 308/751 (41%), Gaps = 137/751 (18%)

Query: 1   MRKRDLAILMLSA-FAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGD 59
           MR RD+ I+ ++  F IF   Q  G   +R     +++    K+    +  P+V DL GD
Sbjct: 1   MRLRDILIVCVAIIFCIFGWSQEGGLIVWRNFKIPVTDSEVSKY---VMSKPVVLDLMGD 57

Query: 60  GRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVI 119
           GR  +L +T    +++ + H  RV+   SE      VS+ P   R  +  R VA+A G +
Sbjct: 58  GRPVLLASTKYGSLELFKTHLARVN---SEN---VFVSIRPSSSR-KTLFRIVAIAAGKL 110

Query: 120 DRTYRQGQPLKQVLVVVTSGWSVMCFD-HNLNKLWEANLQEDFPPNAHHREIAISISNYT 178
            RT ++       +VV++  + +     H+L ++W+  L   +    H    ++S+    
Sbjct: 111 SRTSKE-----NAIVVISDDFRLRRISPHDLTEVWDVPLSASWIETYH---ASVSVVPER 162

Query: 179 LKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHF 238
           +   D G V+V  ++     T +  +     A+  A  H  S  E    E  G       
Sbjct: 163 IHEQDEGTVLVAMQVAGPNGTELTLYAAFNGADGQARWHYTSDVENSIGEVLG------- 215

Query: 239 AFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRH--PGEFECRE-- 294
                            N+ +E  P       P+++  L  +++        +F  +E  
Sbjct: 216 --------------EECNDGVECLP-------PENDTSLRRNSIFKSQVVKNQFRAQEKP 254

Query: 295 ---FRESVLGVMPHHWDRREDTLLK---LSHFRRHKRKILK---KVVGKSTSYPFHKPEE 345
              +RE+++ ++PH +    D  +      H +  K++  +   +VV K         +E
Sbjct: 255 WTFYREAIITLLPHRYSHPWDEHIHPHVFFHAKNRKKRSFRAGNRVVVKYKDRLIRMNKE 314

Query: 346 HHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIE 405
            +    +    + N+ G AAT    ++ +                   N ++ H K GIE
Sbjct: 315 DYGALGERL-GVMNMHGNAATTVNKSRRR-----------------AANAMIFHSKNGIE 356

Query: 406 AVHLASGRTVCKLH-LQEGGLHA-DINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWA 463
            +HL +G  + +L  L+  G++  DIN D  +D +    G   E     G ++++  C  
Sbjct: 357 VIHLYTGSVITRLGPLKSSGVYYHDINDDFQVDAIGTQIGPRMEMHSRHG-VDLIDDCLG 415

Query: 464 VATSGVPVRE-QLFNASICHHSPF----NLFPHGEFSRNFGRTSD----VASLEV----- 509
           V  +G+PV E QLFNA+IC    F    +L  H  F  N  R  D    + +LE+     
Sbjct: 416 VIHTGIPVAEDQLFNATICDTEGFFGRLDLIHH--FIDNDIRGEDAPHALNTLELIGSRS 473

Query: 510 --------ATPILIP-RSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLL- 559
                    TP+++   +   R         VF+ + G VT   P       +W+ Q   
Sbjct: 474 VLSKNTRSVTPLVVQLHTTKGRDLFQVERHAVFMIDSGLVTCVDPS--RRRVVWRSQTAA 531

Query: 560 ----------TDATWSNLPSPSGMTEASTVVPTLKAFSL-RVHDNQQM------------ 596
                      DA  + + +   M  A +  P L A+S  R  D+  M            
Sbjct: 532 GFYGLREAAEADAGMAGMSAKERMHRAVS-FPHLAAYSFYRSQDDSVMSVSGSERYLRTD 590

Query: 597 --ILAGGDQEAVVISP-GGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFV 653
             I+A G++   ++S   G +L +I L  PP   ++  D + DG+ D+I++T  G+YGFV
Sbjct: 591 PFIIAVGERYMCILSTRTGRVLRTIALEHPPVAPVIVRDLNGDGINDIIVVTKEGIYGFV 650

Query: 654 QTRQPGALFFSTLVGCLIVVMGVIFVTQHLN 684
              Q  +   + L+  ++ ++ V+FV + ++
Sbjct: 651 VGTQTSSDTVTALMILMVALLAVLFVVREMS 681


>gi|303290438|ref|XP_003064506.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454104|gb|EEH51411.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 291

 Score =  122 bits (306), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 92/278 (33%), Positives = 132/278 (47%), Gaps = 29/278 (10%)

Query: 418 LHLQEGGLHADINGDGVLDHVQAVGGNGA----EQTVVSGSMEVLRPCWAVATSGVPVRE 473
           + L   GLHAD+NGDGV+DHV+  GG G         V      +  CWA  TSGVP RE
Sbjct: 1   MSLPSPGLHADVNGDGVVDHVEVFGGGGGGAGRSHAAVGADGRAVPSCWARVTSGVPARE 60

Query: 474 QLFNASICH-------HSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKG 526
            LF+ + C        H   N +     +  FG  +  A ++VA P  +        R+G
Sbjct: 61  ALFDGTACRGHAGVTRHGDRNSYGVDGAAGMFGGKN--ADVDVAPPAAL--------RRG 110

Query: 527 SHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATW---SNLPSPSGMTEASTVVPTL 583
              DV+   +RGEVT+Y P     D   +WQL TDA+W    ++               L
Sbjct: 111 EETDVILFNSRGEVTSYGP-----DGKRRWQLRTDASWQRRDDVYDAGYGGGGGGGGGHL 165

Query: 584 KAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVIL 643
           + F L V    ++++A G    VV+SPGG  + ++ LP+ P   ++  D + DGL D++ 
Sbjct: 166 ETFPLAVDGASEVVVALGAARGVVLSPGGYKIATLRLPSVPIAPVLIVDVNGDGLNDIVA 225

Query: 644 MTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVIFVTQ 681
            T+ G Y + Q    G      L+G LIV M V F +Q
Sbjct: 226 RTALGTYCWTQRGAGGGGPLLVLLGFLIVGMVVAFASQ 263


>gi|440292440|gb|ELP85645.1| hypothetical protein EIN_409540 [Entamoeba invadens IP1]
          Length = 625

 Score =  122 bits (306), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 149/692 (21%), Positives = 283/692 (40%), Gaps = 103/692 (14%)

Query: 8   ILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDGRKEVLVA 67
           +++++ F ++  L      +  E  F  SE+Y     +     PI++D++GDG  E+LVA
Sbjct: 7   VVLVACFCLWIFLLKGQSLTQMEQLF--SEKYAANLGSTLNVVPIISDIDGDGLNEILVA 64

Query: 68  THDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVIDRTYRQGQ 127
            +     ++  + +          V+ + + +P      S  + + M TG  D+ + + +
Sbjct: 65  PYGVGKLIMYKYEKG-------NLVIKQTATIP------SSSQPIYMTTGY-DKPFVRNE 110

Query: 128 PLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLKHGDTGLV 187
           P +Q ++VV   + V+CF+ +L   W  N+ ++     +  E++  +  Y ++    G V
Sbjct: 111 PRQQTVIVVLRNFDVLCFNSDLTLRWTNNVYKEHAL-TYVEEVSGLVVPYVIQTKVNGGV 169

Query: 188 IVGGRMEMQPHTIMDPFEE--IGLAE-------KNAEQHRRSASEKEASENSGTVDLRHF 238
           I+  R      T  D +    +G  E       +  E+ +      + + +  TVD  H 
Sbjct: 170 ILAFR------TTQDWYNNNNVGFEEDLRVTLNRTFEEQKEDLMYDDDNIDQETVD--HM 221

Query: 239 AFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLD-VHALNSRHPGEFECREFRE 297
            +Y  + + G + W  + ++ +              YKL  +  ++ +   E +  EF +
Sbjct: 222 NYYCLSIKDGQVVWQHEVDDDQDYLDLVKNTDVAGEYKLSKLFGMSYKGAEEIQWHEFHD 281

Query: 298 SVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKI 357
           S+   +PH W    DT + +SHF R        +     +Y   + E    PG     K 
Sbjct: 282 SIYKNLPHQWASFRDTQITVSHFSR-------DLTSNPETYDNSRLE---MPGMKVQTKT 331

Query: 358 SNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCK 417
                                 I  ITN       PNVVV HQK G+EAVHL SG+ +  
Sbjct: 332 K---------------------IDRITN-------PNVVVIHQKNGLEAVHLFSGKPLVH 363

Query: 418 LHLQE-----GGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVR 472
           L L+          ADINGD  L+ V  V     E  V +   ++      +A +     
Sbjct: 364 LSLRATSDAGSSAFADINGDDALEEV-FVNQYSHEVDVQNSGEDLTCMVQVIAANS---Y 419

Query: 473 EQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSH-GDV 531
           E LF+ + C        P   F  +  R  D   +EV  P+L+P  +  R   GS   ++
Sbjct: 420 EYLFSENACK-------PAMRF--DLIRKKDKHVIEVLPPLLVPAFE--RKSDGSELYNI 468

Query: 532 VFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLR-V 590
           V + + G +TA      G++  + W       +      +  +  +  + +L+    + V
Sbjct: 469 VVINSDGLLTAV-----GYNGKFLWNSKVSRLFLAKEDQTHKSNLALFLYSLENEDQKEV 523

Query: 591 HDNQQMILAGGDQEAVVISPGGSILT--SIDLPAPPTHALVCEDFSNDGLTDVILMTSNG 648
             N + I+  G +   V++  G ++   +I+    P    V  D SNDG  +++++T+  
Sbjct: 524 QPNNRYIIVQGSKNIAVVNLNGDLMVEHNIESQGEPVMGPVFGDLSNDGNNEILIVTATK 583

Query: 649 VYGF-VQTRQPGALFFSTLVGCLIVVMGVIFV 679
           +  + V+  Q  +     + G + ++   I++
Sbjct: 584 IAAYNVRVIQNVSFLPVCIAGIIALISYNIYI 615


>gi|67477875|ref|XP_654372.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56471415|gb|EAL48985.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449703469|gb|EMD43913.1| FGGAP repeat-containing protein [Entamoeba histolytica KU27]
          Length = 623

 Score =  118 bits (296), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 150/654 (22%), Positives = 260/654 (39%), Gaps = 122/654 (18%)

Query: 51  PIVADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDK---IRIAS 107
           PI+ D++GDG  ++++A   +K    +     V +G  E +   EV+L P+K   I I S
Sbjct: 48  PIITDIDGDGINDIVLAPFASK----KLSMYSVKKGSLELK--KEVNL-PEKSFPIYITS 100

Query: 108 GRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHH 167
           G           D+ Y   +   Q+++V+   + V+CF+ +L   W  N  +     A+ 
Sbjct: 101 GY----------DKKYDNTEKRSQIIIVIMRDFEVICFNPDLTIRWRQNYYKS-KAKAYV 149

Query: 168 REIAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEK--- 224
           +E++  +  Y ++    G VIV  R      T  D       A+K  E+  R   ++   
Sbjct: 150 QEVSAIVVPYDIQTKYQGAVIVAFR------TSYDEMLSGNRADKEFEEDFRVTLQQTFE 203

Query: 225 EASENSGTVDL-------RHFAFYAFAGRSGLLRW----SRKNENIEAQPTDASQLIPQH 273
           E  E+    D         H  +YAF+ + G + W      K+E +E    +  ++  + 
Sbjct: 204 EQKEDLINDDFDPEDMLKEHMNYYAFSTKDGQVIWQHETDEKDEYLELIEQNTDEI--EK 261

Query: 274 NYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVG 333
             K  + +  +   GE +  EF ++V   MPH W    DT +  SHF R        +V 
Sbjct: 262 YKKFKIFSSFTEEFGEIQWHEFHDAVFQNMPHQWTSYYDTKIIPSHFAR-------DMVN 314

Query: 334 KSTSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVP 393
             T          +P  KD                     K P N    I N       P
Sbjct: 315 NPTQLI-------NPKTKD--------------------IKYPYNSFDLIKN-------P 340

Query: 394 NVVVAHQKEGIEAVHLASGRTVCKLHL-------QEGGLHADINGDGVLDHVQAVGGNGA 446
           NV VAHQK GIE +HL +G+ +  LH+             ADI+GD  LD    V  +  
Sbjct: 341 NVFVAHQKNGIEVIHLFTGKPL--LHVSLSSTTMSSASAFADIDGDDSLDE---VFTHQF 395

Query: 447 EQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHS-PFNLFPHGEFSRNFGRTSDVA 505
             T+    ++    C   A +    +  LF+ + C  +  F+L           R  +  
Sbjct: 396 AHTLDYSRVDDDISCQVQAMTMGSFK-FLFSQNACTKARRFDL---------LRRKQENV 445

Query: 506 SLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWS 565
            + +  P+L+P  +   H      D++ L N G +T+ S     ++   +WQ    + W 
Sbjct: 446 EVIILPPLLVPNQEI-AHDGRPKFDIILLNNLGLMTSVS-----NNGALKWQAEVQSYW- 498

Query: 566 NLPSPSGMTEASTVVPTLKAFSLRVHDNQQ--MILAGGDQEAVVISPGGSILTS--IDLP 621
           +L   +    A T  P++ +F      N+    I++ G +   V+   G+++     D  
Sbjct: 499 DLEEKN----AKTFKPSIVSFKFTYGKNETDPFIVSQGSRFVNVVDLNGNVIAKKEFDHI 554

Query: 622 APPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMG 675
             P    V  D +ND   D++++T + +  +     P   F    +  LI+++ 
Sbjct: 555 CKPIMPPVFGDVNNDKKNDIVIVTDDHIMAYKMEVVPSVEFLPICIASLIILIS 608


>gi|340375465|ref|XP_003386255.1| PREDICTED: hypothetical protein LOC100636155 [Amphimedon
           queenslandica]
          Length = 803

 Score =  117 bits (292), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 85/334 (25%), Positives = 155/334 (46%), Gaps = 25/334 (7%)

Query: 4   RDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEE-----------YPIKFDADRLPPPI 52
           +++  L +S   + F L+ +    F+      +EE           +P++   ++LP PI
Sbjct: 472 QEILFLFISCSVLAFLLRSQDSLEFKLVLNVTTEERAARENYANQKFPLR--NEKLPLPI 529

Query: 53  VADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAV 112
           V DL  DG+ +V++ + D   +VL  ++R     +     +   +     + ++   R V
Sbjct: 530 VTDLESDGQTDVILVSAD---KVLNVYSRHYTPSYLGHHGIKHSN---KNVSLSKTSRVV 583

Query: 113 AMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAI 172
           AM TG +       Q  KQV+ V+ S W++ C++HNL  +W   L E   P +   E ++
Sbjct: 584 AMTTGFLQPYQSVVQVRKQVIAVLYSDWTLSCYNHNLKLMWSQKLNEKEIPIS--SEASL 641

Query: 173 SISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGT 232
            +S  ++K    G+++VGGR+        +     G ++K   +  R  +E+E  E+   
Sbjct: 642 LVSAISIKKSPAGVILVGGRINGNYDNNNNNDNIGGDSDK---RRMRRGAEQEKMEHQKL 698

Query: 233 VDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFEC 292
             L H+  YA +G++G L W  +  + E  P     L    ++KL  H     H GE   
Sbjct: 699 ESLGHYCTYAVSGKTGELLWKHEPGDFEVHPAYDIGLSSFTHFKLLFHKY-LLHEGEVSW 757

Query: 293 REFRESVLGVMPHHWDRREDTLLKLSHFRRHKRK 326
            ++  ++  +MPH W    DT L+L+ F + K+K
Sbjct: 758 HQYSSAIRQIMPHSWTDDFDTKLELAQFNKDKKK 791


>gi|407044533|gb|EKE42653.1| FG-GAP repeat-containing protein [Entamoeba nuttalli P19]
          Length = 623

 Score =  115 bits (287), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 147/657 (22%), Positives = 255/657 (38%), Gaps = 128/657 (19%)

Query: 51  PIVADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEAR---VLAEVSLLPDK---IR 104
           PI+ D++GDG  ++          VL P A +    +S  +   VL +   LP+K   I 
Sbjct: 48  PIITDIDGDGINDI----------VLAPFASKKLSMYSVKKGSLVLKKEVDLPEKSFPIY 97

Query: 105 IASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPN 164
           I SG           D+ Y   +   Q+++V+   + V+CF+ +L   W  N  +     
Sbjct: 98  ITSGY----------DKKYDDTEKRSQIIIVIMRDFEVICFNPDLTIRWRQNYYKS-KAK 146

Query: 165 AHHREIAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEK 224
           A+ +E++  +  Y ++    G VI   R      T  D       A+K  E+  R   ++
Sbjct: 147 AYVQEVSAIVVPYDIQTKYQGAVIAAFR------TSYDEMLSGNRADKEFEEDFRVTLQQ 200

Query: 225 ---EASENSGTVDL-------RHFAFYAFAGRSGLLRW----SRKNENIEAQPTDASQLI 270
              E  E+    D         H  +Y+F+ + G L W      K+E +E    +  ++ 
Sbjct: 201 TFDEQKEDLINDDFDPEDMLKEHMNYYSFSTKDGQLIWQHETDEKDEYLELIEQNTDEI- 259

Query: 271 PQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKK 330
            +   K  + +  +   GE +  EF ++V   MPH W    DT +  SHF R        
Sbjct: 260 -EKYKKFKIFSSFTEEFGEIQWHEFHDAVFQNMPHQWTSYYDTKIIPSHFAR-------D 311

Query: 331 VVGKSTSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLW 390
           +V   T          +P  KD                     K P N    I N     
Sbjct: 312 MVNNPTQLI-------NPKTKD--------------------IKYPYNSFDLIKN----- 339

Query: 391 WVPNVVVAHQKEGIEAVHLASGRTVCKLHL-------QEGGLHADINGDGVLDHVQAVGG 443
             PNV V HQK GIE +HL +G+ +  LH+             ADI+GD  LD    +  
Sbjct: 340 --PNVFVVHQKNGIEVIHLFTGKPL--LHVSLSSSTLSSASAFADIDGDDSLDE---IFT 392

Query: 444 NGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHS-PFNLFPHGEFSRNFGRTS 502
           +    T+    ++    C   A +    +  LF+ + C  +  F+L           R  
Sbjct: 393 HQFAHTLDYSRVDDDISCQVQAMTMGSFK-FLFSQNACTKARRFDL---------LRRKQ 442

Query: 503 DVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDA 562
           +   + +  P+L+P  +   H      D++ L N G +T+ S     ++   +WQ    +
Sbjct: 443 ENVEVIILPPLLVPNQEI-AHDGRPKFDIILLNNLGLMTSVS-----NNGALKWQAEVQS 496

Query: 563 TWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQ--MILAGGDQEAVVISPGGSILT--SI 618
            W +L   +    A T  P++ +F      N+    I++ G +   V+   G+++     
Sbjct: 497 YW-DLEEKN----AKTFKPSIVSFKFTYGKNETDPFIVSQGSRFVNVVDLNGNVIAKKEF 551

Query: 619 DLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMG 675
           D    P    V  D +ND   D++++T + +  +     P   F    +  LI+++ 
Sbjct: 552 DHICKPIMPPVFGDVNNDKKNDIVIVTDDHIMAYKMEVVPSVEFLPICIASLIILIS 608


>gi|413951379|gb|AFW84028.1| hypothetical protein ZEAMMB73_488017 [Zea mays]
          Length = 90

 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 53/88 (60%), Positives = 66/88 (75%), Gaps = 6/88 (6%)

Query: 455 MEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPIL 514
           MEVL+PCWAVA S +PV EQLFN SICH++ FN+F HG+FSR FG T D   LEV T   
Sbjct: 1   MEVLKPCWAVARSDMPVWEQLFNVSICHYNHFNMFHHGDFSRIFGITFDTIGLEVVT--- 57

Query: 515 IPRSDGHRHRKGSHGDVVFLTNRGEVTA 542
               DGH+HR+GSHGD++FLT+ GE++ 
Sbjct: 58  ---DDGHKHRRGSHGDIIFLTSPGELSV 82


>gi|361069131|gb|AEW08877.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
 gi|383168737|gb|AFG67480.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
 gi|383168739|gb|AFG67481.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
 gi|383168741|gb|AFG67482.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
 gi|383168743|gb|AFG67483.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
 gi|383168745|gb|AFG67484.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
 gi|383168747|gb|AFG67485.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
 gi|383168749|gb|AFG67486.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
 gi|383168751|gb|AFG67487.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
 gi|383168753|gb|AFG67488.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
 gi|383168755|gb|AFG67489.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
 gi|383168757|gb|AFG67490.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
 gi|383168759|gb|AFG67491.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
 gi|383168761|gb|AFG67492.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
 gi|383168763|gb|AFG67493.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
 gi|383168765|gb|AFG67494.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
 gi|383168767|gb|AFG67495.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
          Length = 68

 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 51/66 (77%), Positives = 60/66 (90%)

Query: 510 ATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPS 569
           ATPILIPR+DGH+HRKGSHGDV+FLT+RGEVT+Y PG+HG  AI +WQLLT ATWSNLPS
Sbjct: 1   ATPILIPRNDGHKHRKGSHGDVIFLTSRGEVTSYFPGIHGQGAIRRWQLLTGATWSNLPS 60

Query: 570 PSGMTE 575
           P+G+ E
Sbjct: 61  PAGIVE 66


>gi|167393089|ref|XP_001740420.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165895471|gb|EDR23151.1| hypothetical protein EDI_031320 [Entamoeba dispar SAW760]
          Length = 610

 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 148/656 (22%), Positives = 247/656 (37%), Gaps = 139/656 (21%)

Query: 51  PIVADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEAR---VLAEVSLLPDK---IR 104
           P+V D++GDG  ++          VL P A +    +S  +   VL +   LP+K   I 
Sbjct: 48  PLVTDIDGDGINDI----------VLTPFASKKLSIYSVKKGSLVLKKEVDLPEKSFPIY 97

Query: 105 IASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPN 164
           I SG           D+ Y   +   Q+++V+   + V+CF+ +L   W  N    + P 
Sbjct: 98  ITSGY----------DKQYNNTEKRSQIIIVIMRDFEVICFNPDLTIRWRQNY---YKPK 144

Query: 165 A--HHREIAISISNYTLKHGDTGLVIVGGRM----EMQPHTIMDPFEE---IGLAEKNAE 215
           A  + +E++  I  Y ++    G VI   R      +  + +   FEE   + L +   E
Sbjct: 145 AKAYVQEVSAIIVPYEIQTQYQGAVIAAFRTSYDENLSGNRVDKEFEEDFRVTLQQTFEE 204

Query: 216 QHRRSASEKEASENSGTVDLRHFAFYAFAGRSGLLRW----SRKNENIEAQPTDASQLIP 271
           Q     ++    E+       H  +YAF+ + G L W      K+E +E    +  ++  
Sbjct: 205 QKEDLVNDDFDPEDMLK---EHMNYYAFSTKDGQLIWQHETDEKDEYLELIEQNTDEIEK 261

Query: 272 QHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKV 331
              +K  + +  +   GE +  EF ++V   MPH W    DT                  
Sbjct: 262 YKKFK--IFSSFTEEFGEIQWHEFHDAVFQNMPHQWTSYYDT------------------ 301

Query: 332 VGKSTSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWW 391
             K+ S+ F                                 K P N    I N      
Sbjct: 302 --KNNSFSF---------------------------CTRYDIKYPYNSFDLIKN------ 326

Query: 392 VPNVVVAHQKEGIEAVHLASGRTVCKLHL-------QEGGLHADINGDGVLDHVQAVGGN 444
            PNV V HQK GIE +HL +G+ +  LH+             ADI+GD  LD    V  +
Sbjct: 327 -PNVFVVHQKNGIEVIHLFTGKPL--LHVSLSSSTMSSASAFADIDGDDSLDE---VFTH 380

Query: 445 GAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHS-PFNLFPHGEFSRNFGRTSD 503
               T+    ++    C   A + +   + LF+ + C  S  F+L           R  +
Sbjct: 381 QIAHTLDYSKVDDDISCQVQAMT-MGNFQFLFSQNACTKSRRFDL---------LRRREE 430

Query: 504 VASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDAT 563
              + +  P+LIP  +   H      D++ L N G +T+ S     ++   +WQ    + 
Sbjct: 431 STEVIILPPLLIPNQEI-AHDGRPKFDIILLNNLGLMTSVS-----NNGALKWQAEVQSY 484

Query: 564 WSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQ--MILAGGDQEAVVISPGGSILT--SID 619
           W           A T  P L +F      N+    I++ G +   V+   G+++     D
Sbjct: 485 WD-----IEEKNAKTFKPALVSFKFTYGKNETDPFIVSQGSRFVNVVDLNGNVIAKKEFD 539

Query: 620 LPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMG 675
               P    V  D +ND   D+I++T + +  +     P   F    +  LIV+M 
Sbjct: 540 HICKPIMPPVFGDVNNDKKNDIIIVTDDHIMAYKMEVVPSVEFLPICIASLIVLMS 595


>gi|167378134|ref|XP_001734686.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165903708|gb|EDR29152.1| hypothetical protein EDI_127180 [Entamoeba dispar SAW760]
          Length = 622

 Score =  110 bits (274), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 148/699 (21%), Positives = 269/699 (38%), Gaps = 129/699 (18%)

Query: 8   ILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIK--FDADRLPPPIVADLNGDGRKEVL 65
           I+++  F  ++ L      S  E  F    EYP +  + ++    P++ D++GDG+ +++
Sbjct: 7   IILILCFVGWYFLIKNQSISSLELLF----EYPYQGNWQSELNAVPLITDIDGDGKNDLV 62

Query: 66  VATH-DAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVIDRTYR 124
           VA     KI +           F+   ++     L  ++ + +   A+ + +G  D+ Y+
Sbjct: 63  VAPFMSQKISIY---------SFNHGNLI-----LKKEVSLPNNTNAIYLTSGY-DKKYK 107

Query: 125 QGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLKHGDT 184
           +    +QV++VV   + V C+  +L   W+  +        + +E++  +  Y ++    
Sbjct: 108 ENLRREQVIIVVLRNFEVRCYTSDLKLRWKQQVYSSIA-QPYVQEVSAVVIPYQIQTKYQ 166

Query: 185 GLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASE-NSGTVD--------L 235
           G VI   R       + +       A+   EQ  R +  K   E N  ++D        +
Sbjct: 167 GAVITAFRTSNDKAFVGNR------ADTAFEQDERYSFGKPIFEQNEISMDDGGFVPELI 220

Query: 236 RHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNY-------KLDVHALNSRHPG 288
            H  +YAF+ + G + W  + ++   +  +  QLI Q +Y       K+D   L+     
Sbjct: 221 EHMNYYAFSLKDGQVIWQHERDD---EDQNDLQLIHQEDYIEVYKNLKMDKSILHDFDTV 277

Query: 289 EFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHP 348
           ++   EF + +   +PH W    DT +  +HF R        ++   T Y          
Sbjct: 278 QW--HEFHDCIFQNLPHIWASYYDTHIVPAHFSR-------DIINNPTEYI--------- 319

Query: 349 PGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVH 408
                                     +P + I    N   L   PNV+V H   GIE +H
Sbjct: 320 --------------------------RPNHSIYNSFNQFDLIKNPNVLVIHHMNGIEVIH 353

Query: 409 LASGRTVCKLHLQEGGL-----HADINGDGVLDHVQA---VGGNGAEQTVVSGSMEVLRP 460
           L +G+ +  + L    +      ADI+GD  L+ V     +  N  E+T      E    
Sbjct: 354 LFTGKPLLHVTLLTSTISAASSFADIDGDDALNEVYTHTNLQYNELERT------EDDYS 407

Query: 461 CWAVATSGVPVREQLFNASICHHSP-FNLFPHGEFSRNFGRTSDVASLEVATPILIPRSD 519
           C+  AT  V   + LF+ ++C     FNLF            S+    ++  P+LIP SD
Sbjct: 408 CYVHATV-VNTNDDLFSFNLCEQQKRFNLFRK-------SLNSEKQFFQILPPLLIP-SD 458

Query: 520 GHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTV 579
           G  H      D+  L + G +TA    LH  D    W       W          + +T 
Sbjct: 459 GKAHDGRPTYDIYTLNSNGLLTA----LHN-DGTLLWYKFLPVNWE-----LNQLDLTTF 508

Query: 580 VPTLKAFSLRVHDN-QQMILAGGDQEAVVISPGGSILTSIDLPA--PPTHALVCEDFSND 636
            P+ + F           I   GD E  +    G+++ +  L     P    +  D +ND
Sbjct: 509 KPSFQLFPYSYKGQVSSFIYVQGDNEICISDLKGNLIGTEKLKTKMKPIMPPIFGDLTND 568

Query: 637 GLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMG 675
             TD++ +T + +  +     P   F    +  ++++M 
Sbjct: 569 MKTDILFITEHSLVAYRFELLPSIEFLPICISFIVLMMS 607


>gi|440298618|gb|ELP91249.1| hypothetical protein EIN_151760 [Entamoeba invadens IP1]
          Length = 630

 Score =  106 bits (264), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 131/650 (20%), Positives = 257/650 (39%), Gaps = 107/650 (16%)

Query: 51  PIVADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRR 110
           P++ D++GDG  E++VA++ +   ++  + +         R++    L      I     
Sbjct: 48  PLITDIDGDGENELVVASYTSHKVIMYKYQK--------GRLVVNKEL-----EIPGQMF 94

Query: 111 AVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREI 170
            + M++G  +  Y +G+  KQ++V+V   + V+C + +L   W  N  +     A  +E+
Sbjct: 95  PIGMSSGY-ETPYVEGERRKQIVVIVMRDFEVICLNSDLTIRWRQNAYKA-KAQAFVQEV 152

Query: 171 AISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNA--------EQHRRSAS 222
           A+S+  Y L++G  G VI   R         +  +E+   ++          EQ    A+
Sbjct: 153 AVSVIPYNLQNGYKGAVITAFRTSNDKGFARNRADEVFEEDERVTTRGVTFDEQRDDMAN 212

Query: 223 EKEASENSGTVDLR-HFAFYAFAGRSGLLRWSRKNENIEA-----QPTDASQLIPQHNYK 276
           E+  +E   + +L+ H  +YAF  + G + W  + +  E+     +  D  +L       
Sbjct: 213 EQNVNEVGDSPELKEHMNYYAFHVKDGQVIWQHEMDEEESTLDLVKNADEVELFKS---- 268

Query: 277 LDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKST 336
           + + A      GE +  EF +++   +PH W    DT ++ SH  R        +V   T
Sbjct: 269 VKMKAEKGEGLGEVQWHEFHDALFQHLPHSWSSFFDTKIEYSHLSR-------DIVNNPT 321

Query: 337 SYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVV 396
            Y                K I  L         S +   P N I  I N       PN  
Sbjct: 322 EY----------------KNIKEL---------SKRKNTPHNDIDLIKN-------PNTF 349

Query: 397 VAHQKEGIEAVHLASGRTVCKLHL-----QEGGLHADINGDGVLDHVQA-VGGNGAEQTV 450
           + H   G+E +HL +G+ V  + L            D++G+ VLD +         E T 
Sbjct: 350 LIHHMNGVEVIHLFTGKPVFHITLLPSTVSSASSFVDLDGNDVLDEIYTHAYLLDYETTT 409

Query: 451 VSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVA 510
           V         C+   T     R  +F+ ++C     + +      ++ GR S    +++ 
Sbjct: 410 VDNDFS----CYLHVTEANSYR-SVFSQNVCEDKKISFWA----PKSKGRVS--KKVQIL 458

Query: 511 TPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSP 570
            P+L+   +  +  +  + D+  + + G + A S     +  + +W        + +P+ 
Sbjct: 459 PPLLVESGEVTKEGRKIY-DIYSVNSNGLLVAIS-----NKGVKKWS-------AQIPTK 505

Query: 571 SGM--TEASTVVPTLKAFSLRVHDNQQM--ILAGGDQEAVVISPGGSILTSIDL-PAPPT 625
            G+   E +T  P+   F      N  +  +L  GD    V+   G+++   +     P 
Sbjct: 506 WGLNQVEMNTFKPSSIQFHYTYGQNLTVPYLLVQGDSSFSVLDLMGNVVAFKEFGKVSPI 565

Query: 626 HALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMG 675
              +  D   +  +DVI+++ + + GF     P A F    +  ++ ++ 
Sbjct: 566 MPPIVGDIDGNTRSDVIIVSESYIMGFALEIVPSAKFIPVCIAGILALIA 615


>gi|407034104|gb|EKE37060.1| FG-GAP repeat-containing protein [Entamoeba nuttalli P19]
          Length = 622

 Score =  105 bits (261), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 141/651 (21%), Positives = 253/651 (38%), Gaps = 117/651 (17%)

Query: 51  PIVADLNGDGRKEVLVATH-DAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGR 109
           P++ D++GDG+ +++VA     KI +           F++  ++     L  +  I +  
Sbjct: 48  PLITDIDGDGKNDLVVAPFMSQKISIY---------SFNQGELI-----LKKETSIPNNT 93

Query: 110 RAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHRE 169
             + + +G  D+ Y++    +QV++VV   + V C+  +L   W+  +        + +E
Sbjct: 94  NVIYLTSGY-DKEYKENLRREQVIIVVLRNFEVRCYTSDLKLRWKQQIYSSLA-QPYVQE 151

Query: 170 IAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEK---EA 226
           ++  +  Y ++    G VI   R       + +       A+   E   R    K   E 
Sbjct: 152 VSGVVIPYQIQTKYQGAVITAFRTSNDKAFVGNR------ADTAFEWDERDNFNKPIIEQ 205

Query: 227 SENS----GTVD--LRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVH 280
           +ENS    G V   + H  +YAF+ + G + W  + ++   +  +  QL+ Q +  L+V+
Sbjct: 206 NENSMDDGGFVPELIEHMNYYAFSIKDGQIIWQHERDD---EDQNDLQLVHQED-NLEVY 261

Query: 281 A---LNSRHPGEFEC---REFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGK 334
               +      +F+     EF + +   +PH W    DT +  +HF R        ++  
Sbjct: 262 KNLKMGKSILHDFDTIQWHEFHDCIFQNLPHTWSSYYDTHIVPAHFSR-------DIINN 314

Query: 335 STSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPN 394
            T Y        HP              K + Y          N    I N       PN
Sbjct: 315 PTEYI-------HP--------------KHSIYNS-------FNQFDLIKN-------PN 339

Query: 395 VVVAHQKEGIEAVHLASGRTVCKLHLQEGGL-----HADINGDGVLDHVQAVGGNGAEQT 449
           V+V H   GIE +HL +G+ +  + L    +      ADI+GD  L+ V         + 
Sbjct: 340 VLVVHHMNGIEVIHLFTGKPLLHVTLLTSTISAASSFADIDGDDELNEVYTHANLQYSEL 399

Query: 450 VVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSP-FNLFPHGEFSRNFGRTSDVASLE 508
             +G       C+  AT  V   + LF  ++C     FNLF            ++    +
Sbjct: 400 EQTGDD---YSCYVHATV-VNTNDDLFLFNLCEQQQRFNLFRK-------QSNTEKQFFQ 448

Query: 509 VATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLP 568
           V  P+LIP ++G  H      D+  L N G +TA    LH  D    W  +   TW    
Sbjct: 449 VLPPLLIP-NNGKAHDGRPTYDIYTLNNHGLLTA----LHN-DGTLLWNKMLPVTWE--- 499

Query: 569 SPSGMTEASTVVPTLKAFSLRVHDNQ--QMILAGGDQEAVVISPGGSILTSIDLPA--PP 624
                 +  T  P+   F    + NQ    I   GD E  ++   G+++ S +L     P
Sbjct: 500 --LNQVDLITFKPSFHLFQYS-YKNQTSSFIYIQGDNEICILDLNGNLIGSENLKTQMKP 556

Query: 625 THALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMG 675
               +  D +ND  TD++ +T + +  +     P   F    +  +++++ 
Sbjct: 557 IMPPIFGDLTNDMKTDILFITEHSLVAYHFELLPSIEFLPICISFIVLMIS 607


>gi|308813626|ref|XP_003084119.1| unnamed protein product [Ostreococcus tauri]
 gi|116056002|emb|CAL58535.1| unnamed protein product, partial [Ostreococcus tauri]
          Length = 205

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 68/204 (33%), Positives = 100/204 (49%), Gaps = 23/204 (11%)

Query: 289 EFECREFRESVL-GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHH 347
           E  C +FRES++   +PH W    DT + L+HF+RHK         ++ +    K  ++ 
Sbjct: 8   ERSCGDFRESIVEDALPHQWRHPSDTKMSLAHFKRHK--------SRANAMKRDKESKNR 59

Query: 348 PPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNY----TQLWWVPNVVVAHQKEG 403
              +D    ++N   +    A  A    P+             T    VPNV+V+H  EG
Sbjct: 60  RQRRDRDVVVANAATRTVGAAVDALRGTPMTRRAKRRARDARATSSDDVPNVIVSHHAEG 119

Query: 404 IEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRP-CW 462
           ++ +HL SG  +C L+L+  GLH DI+GDGV+DHV+A G          GS+    P CW
Sbjct: 120 VDILHLHSGDVICSLYLKSPGLHVDIDGDGVMDHVEAHG---------RGSVGSDLPACW 170

Query: 463 AVATSGVPVREQLFNASICHHSPF 486
           A  TSGVP  E+  +AS+     F
Sbjct: 171 ATVTSGVPSEERALSASMSRRKRF 194


>gi|67471534|ref|XP_651715.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56468487|gb|EAL46329.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449705497|gb|EMD45528.1| FGGAP repeat-containing protein [Entamoeba histolytica KU27]
          Length = 622

 Score =  102 bits (254), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 132/645 (20%), Positives = 253/645 (39%), Gaps = 105/645 (16%)

Query: 51  PIVADLNGDGRKEVLVATH-DAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGR 109
           P++ D++GDG+ +++VA     KI +           F++ ++      L  +  I +  
Sbjct: 48  PLITDIDGDGKNDLVVAPFMSQKISIY---------SFNQGKLT-----LKKETSIPNTT 93

Query: 110 RAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHRE 169
             + + +G  ++ Y++    +QV++VV   + V C+  +L   W+  +        + +E
Sbjct: 94  NVIYLTSGY-EKEYKENLRREQVIIVVLRNFEVRCYTSDLKLRWKQQIYSSLA-QPYVQE 151

Query: 170 IAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASEN 229
           ++  +  Y ++    G VI   R       + +  +     ++  + ++    + E S +
Sbjct: 152 VSGVVIPYQIQTKYQGAVITAFRTSNDKAFVGNRADTAFEWDERNDFNKPIIEQNENSMD 211

Query: 230 SGTVD---LRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHA---LN 283
            G      + H  +YAF+ + G + W  + ++   +  +  QL+ Q +  L+V+    + 
Sbjct: 212 DGGFVPELIEHMNYYAFSIKDGQVIWQHERDD---EDQNDLQLVHQED-NLEVYKNLKMG 267

Query: 284 SRHPGEFEC---REFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPF 340
                +F+     EF + +   +PH W    DT +  +HF R        ++   T Y  
Sbjct: 268 KSILHDFDTIQWHEFHDCIFQNLPHTWSSYYDTHIVPAHFSR-------DIINNPTEYI- 319

Query: 341 HKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQ 400
                 HP              K + Y          N+   I N       PNV+V H 
Sbjct: 320 ------HP--------------KHSIYNS-------FNHFDLIKN-------PNVLVVHH 345

Query: 401 KEGIEAVHLASGRTVCKLHLQEGGL-----HADINGDGVLDHVQAVGGNGAEQTVVSGSM 455
             GIE +HL +G+ +  + L    +      ADI+GD  L+ V         +   +G  
Sbjct: 346 MNGIEVIHLFTGKPLLHVTLLTSTISAASSFADIDGDDELNEVYTHANLQYSELEQTGDD 405

Query: 456 EVLRPCWAVATSGVPVREQLFNASICHHSP-FNLFPHGEFSRNFGRTSDVASLEVATPIL 514
                C+  AT  V   + LF  ++C     FNLF            ++    +V  P+L
Sbjct: 406 ---YSCYVHATV-VNTNDDLFLFNLCEQQQRFNLFRK-------QSNTEKQFFQVLPPLL 454

Query: 515 IPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMT 574
           IP ++G  H      D+  L+N G +TA    LH  D    W  +    W          
Sbjct: 455 IP-NNGKAHDGRPTYDIYTLSNNGLLTA----LHN-DGTLLWNKMLPVNWE-----LNQV 503

Query: 575 EASTVVPTLKAFSLRVHDNQ--QMILAGGDQEAVVISPGGSILTSIDLPA--PPTHALVC 630
           +  T  P+   F    + NQ    I   GD E  ++   G+++ S +L     P    + 
Sbjct: 504 DLITFKPSFHLFQYS-YKNQTSSFIYIQGDNEICILDLNGNLIGSENLKTKMKPIMPPIF 562

Query: 631 EDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMG 675
            D +ND  TD++ +T + +  +     P   F    +  +++++ 
Sbjct: 563 GDLTNDMKTDILFITEHSLVAYHFELLPSIEFLPICISFIVLMIS 607


>gi|219112351|ref|XP_002177927.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410812|gb|EEC50741.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 765

 Score = 99.4 bits (246), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 102/416 (24%), Positives = 177/416 (42%), Gaps = 85/416 (20%)

Query: 295 FRESVL--GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKD 352
           +R S+L  GV+PH +   +D  +   HF   K                  P E    GKD
Sbjct: 379 YRRSLLTTGVLPHSYWSNDDATVTAVHFTHQK-----------------MPAEAKLKGKD 421

Query: 353 STK-KISNLIGKAATYAGSAKSKKPVNYIPTITNYTQL---WWVPNVVVAHQKEGIEAVH 408
             K K+   + K  T     K       +PT     +    +  PNV+V H + GI    
Sbjct: 422 KLKPKMGGGLSKGITI--KPKQTLSAKLLPTKKRRRKRLPHFGRPNVLVLHNQHGIHVRS 479

Query: 409 LASGRTVCKLHLQEGGLHADINGDGVLDHVQA-VGGNG---------------------- 445
           + +G ++C + L +  L+AD+N DGVLD VQ  VGG+                       
Sbjct: 480 IKNGMSLCHVSLSDDTLYADLNHDGVLDSVQVLVGGHADNLEDSTDDGYKFVSQLVKRVG 539

Query: 446 ------AEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFG 499
                 A+  + S +    R C A+A SG+P +E+LF+ S+C             S    
Sbjct: 540 DLGQEDAKVEIASRNRVRSRLCHAMALSGIPTKEELFSVSLC-------------SNKDD 586

Query: 500 RTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLL 559
           R   +A    + P+ I   +G  +      D++   + G+++ +  G+ G   +WQ   +
Sbjct: 587 RDESIAG---SPPLAIESLNGKGY------DLIVALSNGQISRFR-GVSGS-RVWQ---I 632

Query: 560 TDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSID 619
           +   + + P+    T  S  +  ++A        + +++AG +  A++ S  G++L +  
Sbjct: 633 SGRRFEDFPTWDDTTLVS--IDRIEA-EFAQPATRPILIAGENSMALLASRKGTVLATAS 689

Query: 620 LPAPPTHALVCEDFSNDGLTDVILMTSNGVYGF-VQTRQPGALFFSTLVGCLIVVM 674
            P P     +  DF+ DG TDV++MT + ++G+ V  +   ++ F   VG L+V M
Sbjct: 690 FPQPSMRKPLLMDFNGDGTTDVMVMTQDAIWGYRVVVKTGTSILFRITVGLLLVAM 745


>gi|224005014|ref|XP_002296158.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|209586190|gb|ACI64875.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1072

 Score = 97.8 bits (242), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 114/448 (25%), Positives = 192/448 (42%), Gaps = 101/448 (22%)

Query: 291  EC-REFRESVL----GVMPH-HWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPE 344
            EC   FR SVL    G +PH +WD  E   + +  F R+K     K  G   + PF K  
Sbjct: 622  ECISHFRSSVLDEESGALPHEYWDDGEYGSISVGRFERNK-----KSSGGKRTKPFGK-- 674

Query: 345  EHHP------------------PGKDST----KKISNLIGKAATYAGSA-KSKKPVNYIP 381
            + +P                   G  +T    K+ S+++G      G + +S      +P
Sbjct: 675  KPNPLSTASGVLGGGARAGTTNAGSSATARQMKRSSDVVGSGGIAGGKSWQSDLFHRSVP 734

Query: 382  TITNYTQLWWV-------PNVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHADINGDGV 434
                  Q +         PNV++ H ++G+  + L +GR VC + L +  L+AD++ DG+
Sbjct: 735  QRLISQQRYNAFHPRSGKPNVIIFHGRDGLAVLSLKNGRPVCHISLMDHALYADLDKDGI 794

Query: 435  LDHVQ------AVGGNGAEQTVVS-------------------GSMEVLRPCWAVATSGV 469
            +D VQ      A+  +G  Q+++                      ++    C A+ TSG+
Sbjct: 795  IDMVQVVTSPEALSKSGGIQSLIERVTKSDNDNDEGRKTRERRTKLDAPVVCHALVTSGL 854

Query: 470  PVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHG 529
            P RE++F A +C   P         S +  R    A L  A P+L+  S G+ +      
Sbjct: 855  PPREEVFTAPLCLGGPL-------MSIDPKRPQ--AGLSAAPPLLVEGSLGYGN------ 899

Query: 530  DVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPS---GMTEASTVVPTLKAF 586
            DVVF  N G V  Y    +G + +W+   L D T S   S +   G  E   V       
Sbjct: 900  DVVFAMNNGVVVRYDS--NGRE-VWRKGGLKDGTPSWKASSNAFLGRIEFGAVKSHSSVS 956

Query: 587  SLRVHDNQQ-------MILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLT 639
            +    +  +       ++L+G D  A++ S  G++L+S+  P       +  D + DG  
Sbjct: 957  ASSHRNQHRPGSPVRPILLSGEDGAALISSASGNVLSSVVYPQSVVAQPMLSDLNGDGTD 1016

Query: 640  DVILMTSNGVYGF---VQTRQPGALFFS 664
            D+++++++G++G+   VQT + G  FFS
Sbjct: 1017 DLLVVSADGIWGYRVVVQTGRSG--FFS 1042


>gi|196003484|ref|XP_002111609.1| predicted protein [Trichoplax adhaerens]
 gi|190585508|gb|EDV25576.1| predicted protein [Trichoplax adhaerens]
          Length = 611

 Score = 94.0 bits (232), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 113/465 (24%), Positives = 191/465 (41%), Gaps = 82/465 (17%)

Query: 47  RLPPPIVADLNGDGRKEVLVATHDAKIQVLEPHARRVDEG-------FSEARVLAEVSLL 99
           +L  PI+ DL+GDG +++++AT D K++    ++ ++            E +   E++L 
Sbjct: 53  QLLDPIITDLDGDGNRDIIIATSDCKVRRSNSNSNKLLHATDLEQLPIPEIQFENEMALS 112

Query: 100 PDKIRIASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQE 159
            D          +A++ G    T      +K+V+ VVT+ W+V   DH LN LW++++ E
Sbjct: 113 DD-----VTCHPIALSVGY---TNNMVHAMKKVIAVVTNDWNVYLLDHQLNLLWKSSIDE 164

Query: 160 DFPPNAHHREIAIS-ISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHR 218
                A+   IA + IS + +K+ D G+VI+       P T     +E+ +   +     
Sbjct: 165 KLEA-ANDSIIATAMISPHPIKNQDHGVVIIARF----PLTNQFTNQELSVRHDHGWTIE 219

Query: 219 RSASEKEASENSGTVDL------------RHFAFYAFAGRSGLLRWSRK-NENIEAQPTD 265
              +E  +S  +G VD+             H   YA    +G LRW  +  +N+  +  D
Sbjct: 220 DVIAENVSSYRAG-VDVGSTHRKDTHGKSVHCTIYALDLTTGELRWKHQPGDNVGLEMDD 278

Query: 266 ASQLIPQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKR 325
            S L   H+YKL +      H     C  ++  ++  +PH W  R D  L +     H +
Sbjct: 279 LS-LFYGHHYKLGLRKDILCHRRSI-CGTYKMDIIRNLPHRWSSRRDAKLIVD---LHSK 333

Query: 326 KILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITN 385
           K               K    HP            I  A     S ++   +     I  
Sbjct: 334 K---------------KKGNKHP------------IFIAGFDVSSTRTNDDLAEWLRIVR 366

Query: 386 YTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHL--QEGGLHADINGDGVLDHVQAVGG 443
             +     N+++ H  + IE  HL+SG+ VC L L  Q+     DIN DG++D +     
Sbjct: 367 IAK---AANILIIHTHQSIEIYHLSSGKLVCWLPLENQQSSTLEDINNDGLIDSLSDTSE 423

Query: 444 -NGAEQTVVSGSMEVLRPCWAVATS---------GVPVREQLFNA 478
               ++   SG  ++L P    +TS         G+   EQ FN 
Sbjct: 424 LEDLDRVATSGDDDMLPPITFKSTSQNAPLINWHGLLDSEQGFNT 468


>gi|299116071|emb|CBN74487.1| aldehyde dehydrogenase, putative [Ectocarpus siliculosus]
          Length = 724

 Score = 94.0 bits (232), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 94/341 (27%), Positives = 137/341 (40%), Gaps = 88/341 (25%)

Query: 401 KEGIEAVHLASGRTVCKLHL-----QEGGLHADINGDGVLDHVQAVGGNGA--------- 446
           ++G+EAV L++G+ +  + L      + G++ D+NGDGV+DHVQAVG  G          
Sbjct: 343 RDGLEAVELSTGKPLSAVALPAAAGTDAGVYVDLNGDGVVDHVQAVGTRGGQGWGHLHEG 402

Query: 447 --------EQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSR-N 497
                   E T    +   L PC+A+A SG+P REQLFNAS+CH +          S+ N
Sbjct: 403 VAMGHLSEEPTPSQPARRFLPPCYALAVSGLPPREQLFNASLCHDAGALFEVQERLSKPN 462

Query: 498 FGRTSDVASLEV--ATPILIPRSDGHRHRKG----------------------------- 526
              +S    LEV  A+P +IPR                                      
Sbjct: 463 SRSSSRSLDLEVGAASPAIIPRDSAGVAAAAATGAATARSSSGAGRGARSGGGGGGGGVG 522

Query: 527 ------SHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSG--MTEAST 578
                 +  D VF  + G V++Y       +   +WQ      W+   +  G        
Sbjct: 523 GWGEGPARFDAVFAVSSGVVSSYD-----DEGRLKWQDRRGPKWTRKDTEGGDPSNGGGY 577

Query: 579 VVPTLKAFSLRVH--------------DNQQMILAGGDQEAVVISPGGSILTSIDLPAPP 624
           VVP    F+L V                 ++ IL  G  +  V   GG +  S  L APP
Sbjct: 578 VVP----FTLEVGWPGRGRGRIINPSPGAEERILVVGQDKMCVYDRGGRLAGSTPLAAPP 633

Query: 625 THALVCEDFSNDGLTDVILMT-SNGVYGFVQTRQPG--ALF 662
           +   V  DF  DG+ DV+++  S  V G+     PG  A+F
Sbjct: 634 SQRPVLGDFDGDGVADVLVVGWSGAVAGYALAPDPGVRAMF 674



 Score = 46.2 bits (108), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 66/252 (26%), Positives = 99/252 (39%), Gaps = 49/252 (19%)

Query: 48  LPPPIVADLNGDGRKEVLV-ATHDAKIQVLE-PHARRVDEGFSEARVLAEVSLLPDKIRI 105
           +P P+V DL+GDG+ EV+V A     I+VL  P A    E   +  ++    L P + R+
Sbjct: 15  VPLPLVTDLDGDGKNEVVVLADGGMLIRVLSVPVAS--GETLVDPWIIHSTELRPTR-RL 71

Query: 106 ASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGW-----------SVMCFDHNLNKLWE 154
               RAVAMA G ++       P       V               SV CF+H L +LW 
Sbjct: 72  GRDERAVAMAAGYLEPPVNARDPGSSRQQQVQQRQQQRIVVVGEDSSVTCFNHQLGRLWT 131

Query: 155 ANLQEDFPPNAHHR-----EIAISISNYTLKHGD---------TGLVIVGGRMEMQ---- 196
            +L  +   +         ++AI++++   K  D          GL++VG  M  +    
Sbjct: 132 TSLAHEGSSSGEGGSFVIDQVAITVAHSVRKLDDRGRPKPGTGEGLIVVGISMRHRDGRF 191

Query: 197 -------------PHTIMDPFEEIGLAEKNAEQHRRSA--SEKEASENSGTVDLRHFAFY 241
                        P    DP      A    E+  R     EK + E  G     HF+ +
Sbjct: 192 HHSRAGRDPSGGSPVDGADPLGVHVEAFVEGEEDSRDTVEEEKMSEEEKGRRAAEHFSLF 251

Query: 242 AFAGRSGLLRWS 253
           A  G +G +RWS
Sbjct: 252 ALDGETGEVRWS 263


>gi|307103153|gb|EFN51416.1| hypothetical protein CHLNCDRAFT_59245 [Chlorella variabilis]
          Length = 925

 Score = 91.7 bits (226), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 45/98 (45%), Positives = 65/98 (66%), Gaps = 7/98 (7%)

Query: 1  MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFH-LSEEYPIKFDADR----LPPPIVAD 55
          M KRD  IL+LS  A++ SLQHEG + F++AW+H  S+  P   + +     LPPP+VAD
Sbjct: 1  MYKRDFGILLLSVCALYLSLQHEGAYQFKKAWYHSFSDHEPALHELEEHPQPLPPPLVAD 60

Query: 56 LNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARVL 93
          LNGDG+ EV+V T   ++Q+L P  RR  +GF++A  +
Sbjct: 61 LNGDGKPEVVVVTPTGRVQLLAP--RRFGDGFAKAEAI 96



 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 33/59 (55%)

Query: 266 ASQLIPQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHK 324
           A  L+ QH Y +      +RH GE  CR++R SVL  +PH W    DT L  +HF +H+
Sbjct: 740 ADALVVQHAYHMSAEESETRHYGEASCRDYRASVLAALPHSWHSPLDTALVPAHFEKHR 798



 Score = 40.4 bits (93), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 18/22 (81%), Positives = 20/22 (90%)

Query: 393 PNVVVAHQKEGIEAVHLASGRT 414
           PNVVVAH +EGIEA+HL SGRT
Sbjct: 879 PNVVVAHLEEGIEAIHLFSGRT 900


>gi|356514228|ref|XP_003525808.1| PREDICTED: uncharacterized protein LOC100794105 [Glycine max]
          Length = 308

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 40/64 (62%), Positives = 44/64 (68%)

Query: 291 ECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPG 350
           E R         +   W +REDTL K +HFRRHKRK LKK  GKS SYPFHKPEE+HPPG
Sbjct: 240 EVRPLEAERFKALRIQWAQREDTLFKFAHFRRHKRKALKKTPGKSISYPFHKPEENHPPG 299

Query: 351 KDST 354
           KDST
Sbjct: 300 KDST 303


>gi|401427313|ref|XP_003878140.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322494387|emb|CBZ29688.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 727

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 170/738 (23%), Positives = 281/738 (38%), Gaps = 157/738 (21%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPI---KFDADRLPPPIVADLN 57
           MR RDL I+ +S          EG  +F+  +       PI   +     +P PI+ D +
Sbjct: 1   MRIRDLVIIGVSFVCSIICWSQEGSLNFQRGFM-----IPILDNELTEVVMPKPILVDPD 55

Query: 58  GDGRKEVLVATHDAKIQVLEPHA--RRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMA 115
           G G + +L +T    + +   +   RR+D  F      AE+++    + I +G       
Sbjct: 56  GSGSRVLLSSTTVGYLNIYNTYYTRRRIDGSFVRLNPSAELNVYTPIVGIGAG------- 108

Query: 116 TGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNL---NKLWEANL-QEDFPPNAHHREIA 171
                  Y +      +  VVT  + +M    +    + LW++ L    +     H  I+
Sbjct: 109 -------YTELNSSIMLCAVVTEDYQLMAVSIHTPSPSVLWKSQLVAPQYWSELTHASIS 161

Query: 172 ISISNYTLKHGDTGLVIVGGRM------EMQPHTIMDP---------FEEIGLAEKNAEQ 216
           I       +  D G++ V  ++      EM  ++  D          F + G      + 
Sbjct: 162 IVPERNWAE--DVGMIAVAAKVVDASGVEMMMYSAFDAKTGARRWVYFSDGG---NEMDD 216

Query: 217 HRRSASEKEASENSGTVDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYK 276
             R A       ++G  +LR                    E      TD      QH+Y 
Sbjct: 217 VVREADNNGTETDNGLTELRLL----------------DREGAAKGGTD------QHDYL 254

Query: 277 LDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHK-RKILKKVVGKS 335
           L           E     FRESV+  MPH +    D  L    F R K ++ ++   GK+
Sbjct: 255 LG-------RKYEQPWTTFRESVMASMPHRYAHVWDAQLYPHVFYRAKAKRKVRSSRGKA 307

Query: 336 ---TSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWV 392
              T++ ++    H       T    N+  K +  A S +      Y P   N T+    
Sbjct: 308 ERRTTFQYNDRVVHM-----ETDDEGNMGDKLSALALSLRQSTQA-YPPK--NATRRRLR 359

Query: 393 P-NVVVAHQKEGIEAVHLASGRTVCKLH-LQEGG-LHADINGDGVLDHVQAVGGNGAEQT 449
           P NV V H + G+E VH+ +G TV  +  L+ GG  + DIN D VL+ V    G    ++
Sbjct: 360 PSNVFVFHGEHGVEVVHMYTGSTVTSVMPLRAGGTCYDDINDDLVLESVSTQIG---PRS 416

Query: 450 VVSGS--MEVLRPCWAVATSGVPVR-EQLFNASICHHSPFNLFPHGEFSRNF-------- 498
           +V     +++   C  +  +G P   ++LFNA++C      LF + +   +F        
Sbjct: 417 IVYAKHGIDLAYDCLGIIEAGAPSSADELFNATVCDTQ--GLFGNLDLIHHFVDGDIRGE 474

Query: 499 ------------GRTSDVASLEVATPILIPRSDGHRHRKGSHGD--VVFLTNRGEVTAYS 544
                       G  + V+S+  ATP LI +   +        +    F+ + G VT   
Sbjct: 475 EAPRALNTLELLGSRNVVSSMTRATPPLILQVQQNMGAGLQQVERYAAFMIDTGLVTCID 534

Query: 545 PGLHGHDAIWQWQLLTDATWSNLPSPS-GMTEASTV----------VPTLKAFSLRVHDN 593
           P       +W+ Q  T A ++  PSPS  M EA              P +  +SL   + 
Sbjct: 535 PSR--RRVLWRAQ--TGARFA--PSPSDDMAEARKSRYHVNRDVKPFPQMVPYSLMQKNR 588

Query: 594 Q-----------------QMILAGGDQEAVVI-SPGGSILTSIDLPAPPTHALVCEDFSN 635
           +                   +LA GD E  VI +  G + +SI L  PP   ++  DF+ 
Sbjct: 589 ETDVTFVGGGKESFRRVDTYVLAVGDAELSVIKTKNGKVTSSISLTEPPVAPVLVVDFNG 648

Query: 636 DGLTDVILMTSNGVYGFV 653
           DG  D+I+++   ++GFV
Sbjct: 649 DGTNDLIIVSKYHIFGFV 666


>gi|157874257|ref|XP_001685616.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|5852121|emb|CAB55366.1| hypothetical protein L1648.04 [Leishmania major]
 gi|68128688|emb|CAJ08820.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 727

 Score = 87.0 bits (214), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 169/738 (22%), Positives = 282/738 (38%), Gaps = 157/738 (21%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPI---KFDADRLPPPIVADLN 57
           MR RD+ I+ ++          EG  +F+  +       PI   +     +P PI+ D +
Sbjct: 1   MRIRDIVIIGVAFVCSIICWSQEGSLNFQRGFM-----IPILDNELTEVIMPKPILVDPD 55

Query: 58  GDGRKEVLVATHDAKIQVLEPHA--RRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMA 115
           G G + +L +T    + +   +   RR+DE F      AE+++    + I +G       
Sbjct: 56  GSGSRVLLSSTTVGYLSIYNTYYSRRRIDESFVRLNPSAELNVYTPIVGIGAG------- 108

Query: 116 TGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNL---NKLWEANL-QEDFPPNAHHREIA 171
                  Y +      +  VVT  + ++    +      LW+  L    +     H  I+
Sbjct: 109 -------YTELNSSIMLCAVVTEDYQLIAVSIHTPSPTVLWKTQLVGPRYWSELTHASIS 161

Query: 172 ISISNYTLKHGDTGLVIVGGRM------EMQPHTIMDP--------FEEIGLAEKNAEQH 217
           I       +  D G++ V  ++      EM  ++  D         +   G  E N    
Sbjct: 162 IVPERNWAE--DVGMIAVAAKVVDTSGVEMMMYSAFDAKTGARRWAYFSDGGNEMN--DV 217

Query: 218 RRSASEKEASENSGTVDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKL 277
            R A+       +G  DLR         R G                 A   I QH+Y L
Sbjct: 218 VREANNNGTQTENGVTDLR------LLDRDGA----------------AKGGINQHDYLL 255

Query: 278 DVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHK-RKILKKVVGKS- 335
                      E     FRESV+  MPH +    D  L    F R K  +  +   GK+ 
Sbjct: 256 G-------RKYEQPWTTFRESVMASMPHRYAHVWDAQLYPHVFYRAKAHRKARSSRGKAE 308

Query: 336 --TSYPFHKPEEHHPP--GKDSTKKISNLIG--KAATYAGSAKSKKPVNYIPTITNYTQL 389
              ++ ++    H     G +  KK+S L    + +T A  AK+    +  P+       
Sbjct: 309 RRATFRYNDRVVHMETDDGGNLGKKLSALALSLRQSTQAYPAKNATRRHRRPS------- 361

Query: 390 WWVPNVVVAHQKEGIEAVHLASGRTVCKLH-LQEGG-LHADINGDGVLDHVQAVGGNGAE 447
               NV V H ++G+E VH+ +G TV  +  L+ GG  + DINGD VL+ V    G  + 
Sbjct: 362 ----NVFVFHGEQGVEVVHMYTGSTVTSVMPLKAGGACYDDINGDLVLESVSTQIGPRSV 417

Query: 448 QTVVSGSMEVLRPCWAVATSGVPVR-EQLFNASICHHSPFNLFPHGEFSRNF-------- 498
                G +++   C  +  +G P   ++LFNA++C+     LF + +   +F        
Sbjct: 418 VYATRG-IDLKYDCLGIIEAGAPSSADELFNATVCNTQ--GLFGNLDLIHHFVDGDIRGE 474

Query: 499 ------------GRTSDVASLEVATPILIPRSDGHRHRKGSHGD--VVFLTNRGEVTAYS 544
                       G  + V+S+  ATP LI +   +        +    F+ + G VT   
Sbjct: 475 EAPRALNTLELLGSRNVVSSMTRATPPLILQVQQNMGAGLQQVERYAAFMIDTGLVTCID 534

Query: 545 PGLHGHDAIWQWQLLTDATWSNLPSPS-GMTEASTV----------VPTLKAFSLRVHDN 593
           P       +W+ Q  T A ++  PSPS  M EA              P +  +SL   + 
Sbjct: 535 PSR--RRVLWRAQ--TGARFA--PSPSDDMAEARKSRYHVNRDVKPFPQMVPYSLMQKNR 588

Query: 594 Q-----------------QMILAGGDQE-AVVISPGGSILTSIDLPAPPTHALVCEDFSN 635
           +                   +LA GD E +++ +  G +  S+ L  PP   ++  DF+ 
Sbjct: 589 ETDMTFVGGGKESFRRVDTYVLAVGDAEFSIIKTKNGKVTRSVSLKEPPVAPVLVVDFNG 648

Query: 636 DGLTDVILMTSNGVYGFV 653
           DG  D+I+++   V+GFV
Sbjct: 649 DGTNDLIIVSKYHVFGFV 666


>gi|261335691|emb|CBH18685.1| FG-GAP repeat protein, putative [Trypanosoma brucei gambiense
           DAL972]
          Length = 703

 Score = 84.7 bits (208), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 153/724 (21%), Positives = 276/724 (38%), Gaps = 153/724 (21%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIK-FDADR--LPPPIVADLN 57
           MR RD+ I+ ++     F    E   +  + +     E PI   ++ R  +P P+V D  
Sbjct: 1   MRLRDVFIVGVAIIFCIFGWHQEDGLAVWKGF-----EIPITDLESQRYMIPKPVVLDPM 55

Query: 58  GDGRKEVLVATHDAKIQVLEPHARR-VDEGFSEARVLAEVSLLPDKIRIASGRRAVAMAT 116
           GDGR  ++  T    +++   H+     E ++    + + S       I +GR AV    
Sbjct: 56  GDGRPVLIATTSYGSLEMFRTHSTSGAAETYATPVSMYQRSFFSRITAIGAGRLAVGE-- 113

Query: 117 GVIDRTYRQGQPLKQVLVVVTSGWSVMCFD-HNLNKLWEANLQEDFPPNAHHREIAISIS 175
                           + VVTS + +     H+ +++W   +      + H    ++S+ 
Sbjct: 114 -------------DYTIFVVTSDFLLYRLSPHDFSEVWSVPIHNVLSESFH---TSVSVL 157

Query: 176 NYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDL 235
           +  +  GD G V++  ++    HT +                                  
Sbjct: 158 SERIYEGDEGTVVIATQVPGPNHTKL---------------------------------- 183

Query: 236 RHFAFYAFAGRSGLLRWSRKNENIEAQ------PTDASQLIPQHNYKLDVHALNSRHPGE 289
               F AF G  G LRW R   + E+       P DA  +         V  ++S+    
Sbjct: 184 --MLFAAFNGADGKLRW-RYTSDAESSVREVLDPEDAVGVGGSSPSGASV-VVSSQATES 239

Query: 290 FECRE-----FRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPE 344
           F   E     +RE+V  ++PH +    D  L+   F   K +   K    ST    +K  
Sbjct: 240 FRLYEKPWTFYREAVSTLLPHRYSHPWDECLRAHVFFHTKNRKKTKAQAGSTVVVKYKDR 299

Query: 345 EHHPPGKDSTKKIS--NLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKE 402
                 +D  +      L+ K      S   K+  +               NV+  H + 
Sbjct: 300 FVRMNSEDYGELAERLGLVSKPQKKGNSHGEKERKS--------------ANVLAFHGEH 345

Query: 403 GIEAVHLASGRTVCKLH-LQEGGLHA-DINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRP 460
           GIE VHL +G  + ++  L+  G++  DIN D  +D V  + G   E+      ++V   
Sbjct: 346 GIEVVHLYTGSLITRVAPLKSAGVYYHDINDDFEIDAVSTLIGRRMEKHS-HFDVDVTLD 404

Query: 461 CWAVATSGVPVRE-QLFNASICHHSPF-------NLFPHGEFSRNFGRTSDVASLEVA-- 510
           C  V ++G P  +  LF  SIC+           + F  G+ +R  G    +++LE+   
Sbjct: 405 CLGVISTGAPAADHSLFQTSICNTEGIFGRLELIHRFIDGD-TRGEGTPEVLSALELVGS 463

Query: 511 -----------TPILIPRSDGHRHRKGSHGDV----VFLTNRGEVTAYSPGLHGHDAIWQ 555
                      TP+++     H  R      V    VF+T+ G VT   P       +W+
Sbjct: 464 HNTLSHTTKSVTPLVVQL---HTLRGRGLFQVERLAVFMTDSGLVTCVDPSRR--RVVWR 518

Query: 556 -------WQLLTDATWSNLPSPSGMTE---ASTVVPTLKAFSL-RVHDN----------- 593
                  W+L ++       + S  TE    +   P L +++  +V+++           
Sbjct: 519 SQTESSFWRLRSEREADVGEAASSETEHKQRTFPFPHLASYNFYQVNEDTVGHVGGPGRY 578

Query: 594 ---QQMILAGGDQEAVVISP-GGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGV 649
                 I+A G+++  ++S   G ++  ++L  P    ++ +DF+ DG+ D+I++T  G+
Sbjct: 579 LRVDPYIVAVGERKLTILSSRTGRVMRVVELEEPAVAPVIVQDFNGDGINDIIVVTEGGI 638

Query: 650 YGFV 653
           YGFV
Sbjct: 639 YGFV 642


>gi|74026204|ref|XP_829668.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70835054|gb|EAN80556.1| FG-GAP repeat protein, putative [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 703

 Score = 82.8 bits (203), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 152/724 (20%), Positives = 277/724 (38%), Gaps = 153/724 (21%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIK-FDADR--LPPPIVADLN 57
           MR RD+ I+ ++     F    E   +  + +     E PI   ++ R  +P P+V D  
Sbjct: 1   MRLRDVFIVGVAIIFCIFGWHQEDGLAVWKGF-----EIPITDLESQRYMIPKPVVLDPM 55

Query: 58  GDGRKEVLVATHDAKIQVLEPHARR-VDEGFSEARVLAEVSLLPDKIRIASGRRAVAMAT 116
           GDGR  ++  T    +++ + H+     E ++    + + S       I +GR AV    
Sbjct: 56  GDGRPVLIATTSYGSLEMFKTHSTSGAAETYATPVSMYQRSFFSRITAIGAGRLAVGEDY 115

Query: 117 GVIDRTYRQGQPLKQVLVVVTSGWSVMCFD-HNLNKLWEANLQEDFPPNAHHREIAISIS 175
            +                VVTS + +     H+ +++W   +      + H    ++S+ 
Sbjct: 116 AIF---------------VVTSDFLLYRLSPHDFSEVWSVPIHNVLSESFH---TSVSVL 157

Query: 176 NYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDL 235
           +  +  GD G V++  ++    HT +                                  
Sbjct: 158 SERIYEGDEGTVVIATQVPGPNHTKL---------------------------------- 183

Query: 236 RHFAFYAFAGRSGLLRWSRKNENIEAQ------PTDASQLIPQHNYKLDVHALNSRHPGE 289
               F AF G  G LRW R   + E+       P DA  +         V  ++S+    
Sbjct: 184 --MLFAAFNGADGKLRW-RYTSDAESSVREVLDPEDAVGVGGSSPSGASV-VVSSQATES 239

Query: 290 FECRE-----FRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPE 344
           F   E     +RE+V  ++PH +    D  L+   F   K +   K    +T    +K  
Sbjct: 240 FRLYEKPWTFYREAVSTLLPHRYSHPWDECLRPHVFFHTKNRKKTKAQAGNTVVVKYKDR 299

Query: 345 EHHPPGKDSTKKIS--NLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKE 402
                 +D  +      L+ K      S   K+  +               NV+  H + 
Sbjct: 300 FVRMNSEDYGELAERLGLVSKPQKKGSSHGEKERKS--------------ANVLAFHGEH 345

Query: 403 GIEAVHLASGRTVCKLH-LQEGGLHA-DINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRP 460
           GIE VHL +G  + ++  L+  G++  DIN D  +D V  + G   E+      ++V   
Sbjct: 346 GIEVVHLYTGNLITRVAPLKSAGVYYHDINDDFEIDAVSTLIGRRMEKHS-HFDVDVTLD 404

Query: 461 CWAVATSGVPVRE-QLFNASICHHSPF-------NLFPHGEFSRNFGRTSDVASLEVA-- 510
           C  V ++G P  +  LF  SIC+           + F  G+ +R  G    +++LE+   
Sbjct: 405 CLGVISTGAPAADHSLFQTSICNTEGIFGRLELIHRFIDGD-TRGEGTPEVLSALELVGS 463

Query: 511 -----------TPILIPRSDGHRHRKGSHGDV----VFLTNRGEVTAYSPGLHGHDAIWQ 555
                      TP+++     H  R      V    VF+T+ G VT   P       +W+
Sbjct: 464 HNTLSHTTKSVTPLVVQL---HTLRGRGLFQVERLAVFMTDSGLVTCVDPSRR--RVVWR 518

Query: 556 -------WQLLTDATWSNLPSPSGMTE---ASTVVPTLKAFSL-RVHDN----------- 593
                  W+L ++       + S  TE    +   P L +++  +V+++           
Sbjct: 519 SQTESSFWRLRSEREADVGEAASSETEHKQRTFPFPHLASYNFYQVNEDTVGHVGGPGRY 578

Query: 594 ---QQMILAGGDQEAVVISP-GGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGV 649
                 I+A G+++  ++S   G ++  ++L  P    ++ +DF+ DG+ D+I++T  G+
Sbjct: 579 LRVDPYIVAVGERKLTILSSRTGRVMRVVELEEPAVAPVIVQDFNGDGINDIIVVTEGGI 638

Query: 650 YGFV 653
           YGFV
Sbjct: 639 YGFV 642


>gi|146096931|ref|XP_001467982.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|398021114|ref|XP_003863720.1| hypothetical protein, conserved [Leishmania donovani]
 gi|134072348|emb|CAM71055.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|322501953|emb|CBZ37036.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 727

 Score = 81.3 bits (199), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 163/742 (21%), Positives = 285/742 (38%), Gaps = 165/742 (22%)

Query: 1   MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPI---KFDADRLPPPIVADLN 57
           MR RD+ I+ ++          EG  +F+  +       PI   +     +P PI+ D +
Sbjct: 1   MRIRDIVIIGVAFVCSIICWSQEGSLNFQRGFM-----IPILDNELTEVIMPKPILVDPD 55

Query: 58  GDGRKEVLVATHDAKIQVLEPHA--RRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMA 115
           G G + +L +T    + +   +   RR+D+ F      AE+++    + I +G       
Sbjct: 56  GSGSRVLLSSTSVGYLNIYNTYYARRRIDDSFVRLNPSAELNVYTPIVGIGAG------- 108

Query: 116 TGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNL---NKLWEANLQEDFPPNAHHREIAI 172
                  Y +      +  VVT  + ++    +    + LW++ L     P         
Sbjct: 109 -------YTELNSSIMLCAVVTEDYQLIAVSIHTPSPSVLWKSQL---VAPRYWSELTHA 158

Query: 173 SISNYTLKH--GDTGLVIVGGRM------EMQPHTIMDP---------FEEIGLAEKNAE 215
           SIS    ++   D G++ V  ++      EM  ++  D          F + G      +
Sbjct: 159 SISIVPERNWAEDVGMIAVAAKVVDASGVEMMMYSAFDAKTGARRWAYFSDGG---NEMD 215

Query: 216 QHRRSASEKEASENSGTVDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNY 275
              R A+      ++G  +LR         R G                 A   I QH+Y
Sbjct: 216 DVVREANNNGTQTDNGVTELR------LLDREGA----------------AKGGINQHDY 253

Query: 276 KLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHK-RKILKKVVGK 334
            L           E     FRESV+  MPH +    D  L    F R K ++  +   GK
Sbjct: 254 LLG-------RKYEQPWTTFRESVMASMPHRYAHVWDAQLYPQVFYRAKGKRKARSSRGK 306

Query: 335 S---TSYPFHKPEEHHPP--GKDSTKKISNLIG--KAATYAGSAKSKKPVNYIPTITNYT 387
           +    ++ ++    H     G +   K+S L    + +T+A  AK+       P+     
Sbjct: 307 AERRATFRYNDRVVHMETDDGGNLGDKLSALALSLRQSTHAYPAKNATRRRRRPS----- 361

Query: 388 QLWWVPNVVVAHQKEGIEAVHLASGRTVCK-LHLQEGG-LHADINGDGVLDHVQAVGGNG 445
                 NV V H ++G+E VH+ +G TV   + L+ GG  + DIN D VL+ V    G  
Sbjct: 362 ------NVFVFHGEQGVEVVHMYTGSTVTSVMPLKAGGTCYDDINDDLVLESVSTQIG-- 413

Query: 446 AEQTVVSGS--MEVLRPCWAVATSGVPVR-EQLFNASICHHSPFNLFPHGEFSRNF---- 498
             ++VV     +++   C  +  +G P   ++LFNA++C+     LF + +   +F    
Sbjct: 414 -PRSVVYAKRGIDLTYDCLGIIEAGAPSSADELFNATVCNTQ--GLFGNLDLIHHFVDGD 470

Query: 499 ----------------GRTSDVASLEVATPILIPRSDGHRHRKGSHGD--VVFLTNRGEV 540
                           G  + V+S+  ATP LI +   +        +    F+ + G V
Sbjct: 471 IRGEEAPRALNTLELLGSRNVVSSMTRATPPLILQVQQNMGAGLQQVERYAAFMIDTGLV 530

Query: 541 TAYSPGLHGHDAIWQWQLLTDATWSNLPSPS-GMTEASTV----------VPTLKAFSLR 589
           T   P       +W+ Q  T A ++  PSPS  M E               P +  +SL 
Sbjct: 531 TCIDPSR--RRVLWRAQ--TGARFA--PSPSDDMAEIRKSRYHVNRDVKPFPQMVPYSLM 584

Query: 590 VHDNQ-----------------QMILAGGDQE-AVVISPGGSILTSIDLPAPPTHALVCE 631
             + +                   +LA GD E +++ +  G +  S+ L  PP   ++  
Sbjct: 585 QKNRETDVTFVGGGKEYFRRVDTYVLAVGDAEFSIIKTKNGKVTRSVSLTEPPVAPVLVV 644

Query: 632 DFSNDGLTDVILMTSNGVYGFV 653
           DF+ DG  D+I+++   ++G+V
Sbjct: 645 DFNGDGTNDLIIVSKYHIFGYV 666


>gi|154343447|ref|XP_001567669.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134065001|emb|CAM43112.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 727

 Score = 78.6 bits (192), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 103/420 (24%), Positives = 176/420 (41%), Gaps = 80/420 (19%)

Query: 295 FRESVLGVMPHHWDRREDTLLKLSHFRRHK-RKILKKVVGKS---TSYPFHKPEEHHPPG 350
           FRESV+  MPH +    D  L    F R K ++  +   GK+    ++ ++    H    
Sbjct: 266 FRESVMAAMPHRYAHVWDAQLYPHVFYRAKAKRKARPSRGKAKRRATFRYNDQVVHM--- 322

Query: 351 KDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLA 410
             +T+   NL  K +    S K     +     T   Q   + NV V H ++G+E VH+ 
Sbjct: 323 --ATEDDGNLGDKLSALRLSLKQPTQTSLAENATR--QRRRLSNVFVFHGEQGVEVVHMY 378

Query: 411 SGRTVCKLH--LQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSG-SMEVLRPCWAVATS 467
           +G  V  +     +G  + DIN D VL+ +      G   TV +   +++   C  +  +
Sbjct: 379 TGNIVTNVMPLRADGTYYDDINDDLVLESLST--QIGPRSTVFAKRGIDLSYDCLGMIET 436

Query: 468 GVPVR-EQLFNASICHHSPFNLFPHGEFSRNF--------------------GRTSDVAS 506
           GVP+  ++LFNA++C+     LF + +   +F                    G  S  ++
Sbjct: 437 GVPLSADELFNATVCNTQ--GLFGNLDLIHHFVDGDIRGEEAPRALNTLELLGSRSVASA 494

Query: 507 LEVATPILIPRSDGHRHRKGSHGDV----VFLTNRGEVTAYSPGLHGHDAIWQWQLLTDA 562
           +  ATP LI +    ++  G    V     F+ + G VT   P       +W+ Q  T  
Sbjct: 495 MTRATPPLILQV--QQNMGGGLQQVERYAAFMIDTGLVTCIDPSRR--RVLWRAQ--TGG 548

Query: 563 TWSNLPSPSGMTEASTV----------VPTLKAFSLRVHDNQQM---------------- 596
            ++  PS   M EA  +           P +  +SL +  N++                 
Sbjct: 549 RFAPTPS-DDMAEARKIRYQVNSDVRPFPQMVPYSL-LQKNRETDVTFVGGGGEPFRRVD 606

Query: 597 --ILAGGDQE-AVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFV 653
             +LA GD E +++ +  G +  SI LP PP   ++  DF+ DG  D+I+++   ++GFV
Sbjct: 607 TYVLAMGDTEFSIIKTKNGKVTRSISLPEPPVAPVLVVDFNGDGTNDLIVVSKYHIFGFV 666


>gi|414885126|tpg|DAA61140.1| TPA: hypothetical protein ZEAMMB73_444374 [Zea mays]
          Length = 486

 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/70 (54%), Positives = 53/70 (75%), Gaps = 2/70 (2%)

Query: 540 VTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILA 599
           VT+YS  L G++A+W+ Q+ +  TWS LPSPSG+ E   VVPTLKAFS+  +D +++ + 
Sbjct: 411 VTSYSSRLLGYNAVWR-QVSSRMTWSKLPSPSGLME-KIVVPTLKAFSVCAYDPKEVTIC 468

Query: 600 GGDQEAVVIS 609
           GGDQEAVV+S
Sbjct: 469 GGDQEAVVLS 478


>gi|340375802|ref|XP_003386423.1| PREDICTED: hypothetical protein LOC100636369 [Amphimedon
           queenslandica]
          Length = 617

 Score = 73.2 bits (178), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 95/200 (47%), Gaps = 22/200 (11%)

Query: 4   RDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEE-----------YPIKFDADRLPPPI 52
           +++  L +S   + F L+ +    F+      +EE           +P++   ++LP PI
Sbjct: 13  QEILFLFISCSVLAFLLRSQDSLEFKLVLNVTTEERAARENYANQKFPLR--NEKLPLPI 70

Query: 53  VADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAV 112
           V DL  DG+ ++LV+T     ++L  ++R     +     +   +     + ++     V
Sbjct: 71  VTDLESDGQTDILVSTD----KILNVYSRHYTPSYLGHHGIRHSN---KNVSLSKTSCVV 123

Query: 113 AMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAI 172
           AM TG +       Q  KQV+ V+ S W++ C++HNL  +W   L E   P +   E ++
Sbjct: 124 AMTTGFLQPYQSVVQVRKQVIAVLYSDWTLSCYNHNLKLMWSQKLNEKEIPISS--EASL 181

Query: 173 SISNYTLKHGDTGLVIVGGR 192
            +S  ++K    G+++VGGR
Sbjct: 182 LVSAISIKKSPAGVILVGGR 201


>gi|342186606|emb|CCC96093.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 483

 Score = 72.4 bits (176), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 134/314 (42%), Gaps = 62/314 (19%)

Query: 394 NVVVAHQKEGIEAVHLASGRTVCKLH-LQEGGLHA-DINGDGVLDHVQAVGGNGAEQTVV 451
           NV+  H + GIE +HL +G  + +L  L+   ++  DIN D  ++ +  + G   E    
Sbjct: 117 NVLAYHGEHGIEVIHLYTGNLITRLAPLKSKNIYYHDINDDFQIESISPLIGRRMESHA- 175

Query: 452 SGSMEVLRPCWAVATSGVPVREQ-LFNASICHHSPFNLFPHGEFSRNF--GRTSDVASLE 508
              +E+   C  V ++G+PV    LFNASIC+     LF   +  R+F  G   D  + E
Sbjct: 176 KFDIELAYDCLGVISTGLPVAHHPLFNASICNTE--GLFGRLDLVRDFVDGDMRDGGNTE 233

Query: 509 VATPILIPRSDGHRH--RKGSHGDV---------------------VFLTNRGEVTAYSP 545
           V + + +    G R+   K +H  V                     VF+T+ G VT   P
Sbjct: 234 VLSVLELI---GSRNTLSKTTHSTVPLVVQLHTVKGKDLVQIERYAVFITDSGLVTCVDP 290

Query: 546 GLHGHDAIWQWQ------LLTDATWSN----LPSPSGMTEASTVVPTLKAFSL------- 588
             H     W+ Q      LL D   ++    + +       +   P L  ++        
Sbjct: 291 SRH--RVAWRSQTESSFYLLRDEQEADAEAGINTKMERNHLAVPFPHLAQYNFYQKNEDS 348

Query: 589 --------RVHDNQQMILAGGDQEAVVISP-GGSILTSIDLPAPPTHALVCEDFSNDGLT 639
                   R       I+A G+++   +S   G ++  ++L  P    ++ +DF+ DG+ 
Sbjct: 349 VGYVGGIGRYLRTDPYIIAVGERKMTFLSSRTGRVMRVVELEEPAVAPVIVQDFNGDGIN 408

Query: 640 DVILMTSNGVYGFV 653
           D+I++T+ GVYG+V
Sbjct: 409 DIIVVTTGGVYGYV 422


>gi|340059814|emb|CCC54210.1| putative intergrin alpha chain protein [Trypanosoma vivax Y486]
          Length = 704

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 113/516 (21%), Positives = 199/516 (38%), Gaps = 100/516 (19%)

Query: 240 FYAFAGRSGLLRW---SRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECRE-- 294
           F AF G  G LRW   +  + ++    + +     +     D    ++R    F   E  
Sbjct: 188 FAAFNGADGELRWRYTTDADSSVREVLSPSVSTAAEEASSGDSTIASARTTDPFRLHEKP 247

Query: 295 ---FRESVLGVMPHH----WDRREDTLLKLSHFRRHKRKIL--KKVVGKSTSYPFHKPEE 345
              FRE+V+ ++PH     WD R    +      R KR     K+VV K         EE
Sbjct: 248 WSFFREAVVTLLPHRYAHPWDERIRPHVFFHTKNRKKRAAHSGKQVVVKYKDRFIRMNEE 307

Query: 346 HHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIE 405
           ++             +G+      + +  K    +P            NV+  H   G+E
Sbjct: 308 NYGE-----------LGEKLGLLNADEKHKDHTALPHQRGV-------NVLAFHGPHGVE 349

Query: 406 AVHLASGRTVCKLH--LQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWA 463
            +HL +G  + +L      G L+ D+  D  +D V    G    Q   S  +E+   C  
Sbjct: 350 VLHLYTGNLITRLAPLKSTGVLYHDLGDDFQIDVVGTRIGPRM-QVHSSHGLEITDECMG 408

Query: 464 VATSGVPV-REQLFNASICHHSPF-------NLFPHGEFSRNFGRTSDVASLEV------ 509
              +G+P+  E+LF+ SIC+   F         F  G+  R  G +S + +LE+      
Sbjct: 409 TIHTGIPLAEEKLFSTSICNTEGFLGRLDLIRDFVGGDI-RGEGTSSVIDALELMGSQNV 467

Query: 510 -------ATPILIPRSDGHRHRKGSHG------DVVFLTNRGEVTAYSPGLHGHDAIWQW 556
                   TP+++       H     G        VF+ + G +T   P   G   +W+ 
Sbjct: 468 LSKTTRSVTPLVV-----QLHTVKGRGLFQVERYAVFMIDSGLITCVDPS-RGR-VLWRS 520

Query: 557 QLLTDATWSNLPSP-------SGMTEASTV-----VPTLKAFSL---------------R 589
           Q  TDA++ N+           G++E          P L  ++                R
Sbjct: 521 Q--TDASFENVGETHDAESVMVGLSEMEVKYRTRPFPHLAPYNFEQKNEDSVGHVGGVGR 578

Query: 590 VHDNQQMILAGGDQEAVVISP-GGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNG 648
             +    I+A G+    ++S   G ++  + L   P   ++ +DF+ DG+ D+I++T  G
Sbjct: 579 YRNTDPYIIAVGESCLTLLSARTGRVMRVVLLDEVPVAPVIVQDFNGDGINDIIVVTEGG 638

Query: 649 VYGFVQTRQPGALFFSTLVGCLIVVMGVIFVTQHLN 684
           +YG+V   Q  +   + L+  +I ++ V+   + +N
Sbjct: 639 IYGYVMGAQTSSDTITALMILMIGLLAVLVAVREMN 674


>gi|390369515|ref|XP_003731654.1| PREDICTED: uncharacterized protein LOC100890980, partial
           [Strongylocentrotus purpuratus]
          Length = 184

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 44/162 (27%), Positives = 70/162 (43%), Gaps = 27/162 (16%)

Query: 415 VCKLHLQEGGLHADINGDGVLDHVQAV-GGNGAEQTVVSGSMEVLRPCWAVATSGVPVRE 473
           + +L L     +ADI+ DGV+D  +A+   +G+E             C AV  +G P   
Sbjct: 1   MTRLDLSPDATYADIDKDGVVDQAKALFTDDGSEGN-----------CQAVVKTGHPPHS 49

Query: 474 QLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSH----- 528
           +L+N SIC+ S         ++   G +    + E++   LI +S   R    SH     
Sbjct: 50  ELYNGSICYPSSLWAALSYPWAYASGSSEIKENQELSLRPLIVKSVAKRRGIISHLLGLS 109

Query: 529 -----GDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWS 565
                 D +F  + G+VT+Y P        + WQ+ T A WS
Sbjct: 110 MSKAGMDTIFTVSTGQVTSYGP-----QGQFNWQVSTSALWS 146


>gi|345004742|ref|YP_004807595.1| FG-GAP repeat-containing protein [halophilic archaeon DL31]
 gi|344320368|gb|AEN05222.1| FG-GAP repeat protein [halophilic archaeon DL31]
          Length = 426

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/71 (28%), Positives = 37/71 (52%), Gaps = 8/71 (11%)

Query: 7   AILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDGRKEVLV 66
           A+ + +     F+L  EG+  +R           ++ D    PPP+  DL+GDG +E++V
Sbjct: 311 AVYVTNKSGTVFALDEEGETVWRRE--------VVEEDTQMTPPPVAGDLDGDGSQELVV 362

Query: 67  ATHDAKIQVLE 77
           A +  ++ VL+
Sbjct: 363 AANTGEVTVLD 373


>gi|354611490|ref|ZP_09029446.1| FG-GAP repeat protein [Halobacterium sp. DL1]
 gi|353196310|gb|EHB61812.1| FG-GAP repeat protein [Halobacterium sp. DL1]
          Length = 378

 Score = 40.0 bits (92), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 14/40 (35%), Positives = 27/40 (67%)

Query: 44  DADRLPPPIVADLNGDGRKEVLVATHDAKIQVLEPHARRV 83
           D   +PPP++ D++GDG +E++  T+D  ++V+ P   +V
Sbjct: 297 DVQMMPPPVLGDVDGDGDRELVATTNDGMVKVVSPSDGQV 336


>gi|149920255|ref|ZP_01908726.1| cell surface protein [Plesiocystis pacifica SIR-1]
 gi|149818842|gb|EDM78282.1| cell surface protein [Plesiocystis pacifica SIR-1]
          Length = 538

 Score = 39.7 bits (91), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 34/126 (26%), Positives = 54/126 (42%), Gaps = 12/126 (9%)

Query: 15  AIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDGRKEVLVATH--DAK 72
           ++ +   H G   ++ AW H + + P        P   V DL+GDG  E+ V+ H  DA 
Sbjct: 84  SVLYVYDHAGALLWQTAWSHSASDSPEHGTVRMWPSAAVGDLDGDGDVEIAVSAHPDDAG 143

Query: 73  IQV-LEPHARRVDEGFSEARVLAEVSLLPDKIRIASG-------RRAVAMATGV--IDRT 122
           + V +  H   +  G+ +A   AEV  +        G       ++A   AT V  +D T
Sbjct: 144 LNVAVYDHGGELLPGWPQAYADAEVRSIAAADVDGDGAHEILITKQASGPATNVFELDGT 203

Query: 123 YRQGQP 128
           +  G P
Sbjct: 204 HASGWP 209


>gi|156400118|ref|XP_001638847.1| predicted protein [Nematostella vectensis]
 gi|156225971|gb|EDO46784.1| predicted protein [Nematostella vectensis]
          Length = 628

 Score = 39.3 bits (90), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 18/58 (31%), Positives = 31/58 (53%), Gaps = 6/58 (10%)

Query: 20 LQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDGRKEVLVATHDAKIQVLE 77
          L H+  +S  +AW   +   PI      +  P++ D+NGD  K+V V T D ++ V++
Sbjct: 13 LSHDCHYSLNQAWLSEAGAAPI------VSSPLIVDVNGDNIKDVAVTTFDGQVSVID 64


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.135    0.409 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,562,136,069
Number of Sequences: 23463169
Number of extensions: 506444641
Number of successful extensions: 1149468
Number of sequences better than 100.0: 84
Number of HSP's better than 100.0 without gapping: 67
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 1148991
Number of HSP's gapped (non-prelim): 204
length of query: 697
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 547
effective length of database: 8,839,720,017
effective search space: 4835326849299
effective search space used: 4835326849299
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)