BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 005433
(697 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359476651|ref|XP_002273723.2| PREDICTED: uncharacterized protein LOC100264247 [Vitis vinifera]
gi|297735064|emb|CBI17426.3| unnamed protein product [Vitis vinifera]
Length = 696
Score = 1302 bits (3369), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 613/697 (87%), Positives = 659/697 (94%), Gaps = 1/697 (0%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
MRKRDLAILMLSAFAIFFSLQHEGDFSF+EAWFHLS+EYPIK++A+RLPPP+VADLNGDG
Sbjct: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFKEAWFHLSDEYPIKYEAERLPPPLVADLNGDG 60
Query: 61 RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
+KEVLVATHDAKIQVLEPHARRVDEGFSEARVL EVSLLPDKIRI+SGRRAVAMATGV+D
Sbjct: 61 KKEVLVATHDAKIQVLEPHARRVDEGFSEARVLVEVSLLPDKIRISSGRRAVAMATGVVD 120
Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
R Y+QGQP KQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFP NAHHREIAISISNYTLK
Sbjct: 121 RHYKQGQPQKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPHNAHHREIAISISNYTLK 180
Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
HGD GLVIVGGRMEM PH MDPFE IG+ EKNAEQHRRSA+EKEASEN+GTVDLRHFAF
Sbjct: 181 HGDAGLVIVGGRMEMLPHIYMDPFEVIGMTEKNAEQHRRSANEKEASENAGTVDLRHFAF 240
Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
YAFAGRSG +RW RKNENI+ +DASQLIPQHNYKLD HALN+RHPGEFECREFRES+L
Sbjct: 241 YAFAGRSGAVRWMRKNENIQTLSSDASQLIPQHNYKLDAHALNTRHPGEFECREFRESIL 300
Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISNL 360
GVMPHHWDRREDTLLKL+HFRRHKRK LKK GKST+YPFHKPEE+HPPGKD TKKISNL
Sbjct: 301 GVMPHHWDRREDTLLKLAHFRRHKRKTLKKTQGKSTNYPFHKPEENHPPGKDDTKKISNL 360
Query: 361 IGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHL 420
IGKAA YA SAKSKKP+ Y+PTITNYTQLWWVPNVVVAHQ+EGIEAVHL +GRT+CKLHL
Sbjct: 361 IGKAAKYASSAKSKKPLPYVPTITNYTQLWWVPNVVVAHQREGIEAVHLPTGRTICKLHL 420
Query: 421 QEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASI 480
QEGGLHADINGDGVLDHVQ VGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASI
Sbjct: 421 QEGGLHADINGDGVLDHVQVVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASI 480
Query: 481 CHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEV 540
CHHSPFNLF HGEFSR+F RT D+ SLEVATPILIPR+DGHRHRKGSHGD++FLTNRGEV
Sbjct: 481 CHHSPFNLFQHGEFSRSFSRTPDLGSLEVATPILIPRNDGHRHRKGSHGDIIFLTNRGEV 540
Query: 541 TAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAG 600
T+YSPGLHGHDAIWQWQLLT ATWSNLPSPSGM E S VVPTLKAFSLR HDN+++ILA
Sbjct: 541 TSYSPGLHGHDAIWQWQLLTGATWSNLPSPSGMME-SMVVPTLKAFSLRAHDNRELILAA 599
Query: 601 GDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGA 660
GDQEA+++SPGGS+LTS++LPA PTHAL+CEDFSNDGLTD+IL+TSNGVYGFVQTRQPGA
Sbjct: 600 GDQEAIMMSPGGSLLTSVELPAAPTHALICEDFSNDGLTDLILVTSNGVYGFVQTRQPGA 659
Query: 661 LFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGLR 697
LFFSTLVGCLIVVMGVIFVTQ+LNS+K KPRASSG R
Sbjct: 660 LFFSTLVGCLIVVMGVIFVTQYLNSMKGKPRASSGPR 696
>gi|255585207|ref|XP_002533306.1| aldehyde dehydrogenase, putative [Ricinus communis]
gi|223526871|gb|EEF29083.1| aldehyde dehydrogenase, putative [Ricinus communis]
Length = 1050
Score = 1301 bits (3368), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 610/676 (90%), Positives = 647/676 (95%), Gaps = 2/676 (0%)
Query: 20 LQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDGRKEVLVATHDAKIQVLEPH 79
+QHEGDFSFREAWFHLS+EYPIK++ADRLPPPIVADLNGDG+KEVLVATHDAKIQVLEPH
Sbjct: 372 VQHEGDFSFREAWFHLSDEYPIKYEADRLPPPIVADLNGDGKKEVLVATHDAKIQVLEPH 431
Query: 80 ARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSG 139
+RRVDEGFSEARVLAEVSLLPDKIR+ASGRRAVAMA GVIDRTY+QGQPLKQVLVV+TSG
Sbjct: 432 SRRVDEGFSEARVLAEVSLLPDKIRVASGRRAVAMAAGVIDRTYKQGQPLKQVLVVITSG 491
Query: 140 WSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLKHGDTGLVIVGGRMEMQPHT 199
WSVMCFDHNL KLWEANLQEDFP NAHHREIAISISNYTL+HGDTGLV+VGGRMEMQPH
Sbjct: 492 WSVMCFDHNLKKLWEANLQEDFPHNAHHREIAISISNYTLRHGDTGLVLVGGRMEMQPHV 551
Query: 200 IM--DPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAFYAFAGRSGLLRWSRKNE 257
+ DPFEEIG AEKNAE HRRSASEKEA+ENSGTVDLRHFAFYAFAGR+G LRWSRKNE
Sbjct: 552 YLELDPFEEIGTAEKNAEFHRRSASEKEATENSGTVDLRHFAFYAFAGRTGALRWSRKNE 611
Query: 258 NIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKL 317
NIEAQP+DASQLIPQHNYKLDVHALNSRHPGEFECREFRES+LGVMPHHWDRREDT LKL
Sbjct: 612 NIEAQPSDASQLIPQHNYKLDVHALNSRHPGEFECREFRESILGVMPHHWDRREDTQLKL 671
Query: 318 SHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPV 377
SHFRRHKRK LKKV GK+ +YPFHKPEE+HPPGKDSTKKIS +IGKAA YAGSAKSKKP
Sbjct: 672 SHFRRHKRKTLKKVPGKTINYPFHKPEENHPPGKDSTKKISKIIGKAANYAGSAKSKKPF 731
Query: 378 NYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDH 437
YIPTITNYTQLWWVPNVVVAHQKEGIEAVHLA+GRT+CKLHL EGGLHADINGDGVLDH
Sbjct: 732 PYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLATGRTLCKLHLLEGGLHADINGDGVLDH 791
Query: 438 VQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRN 497
VQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLF HGEFSRN
Sbjct: 792 VQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFQHGEFSRN 851
Query: 498 FGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQ 557
FGRTSD +SLEVA+PILIPRSDGH+HRKGSHGDV+FLTNRGEVT+YSPGLHGHDAIWQWQ
Sbjct: 852 FGRTSDASSLEVASPILIPRSDGHKHRKGSHGDVIFLTNRGEVTSYSPGLHGHDAIWQWQ 911
Query: 558 LLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTS 617
LLTDATWSNLPSPSGM E VVPTLKAFSLR+HDNQQMILA GDQEAVVISPGGSI T+
Sbjct: 912 LLTDATWSNLPSPSGMMEGGMVVPTLKAFSLRMHDNQQMILAAGDQEAVVISPGGSIQTT 971
Query: 618 IDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVI 677
IDLPAPPTHAL+CEDFS+DGLTD+I++TSNGVYGFVQTR PGALFFSTLVGCL++VMGVI
Sbjct: 972 IDLPAPPTHALICEDFSSDGLTDLIVVTSNGVYGFVQTRTPGALFFSTLVGCLLIVMGVI 1031
Query: 678 FVTQHLNSVKAKPRAS 693
FVTQHLNS+K KPRAS
Sbjct: 1032 FVTQHLNSIKGKPRAS 1047
>gi|224134190|ref|XP_002327778.1| predicted protein [Populus trichocarpa]
gi|222836863|gb|EEE75256.1| predicted protein [Populus trichocarpa]
Length = 693
Score = 1270 bits (3287), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 602/698 (86%), Positives = 648/698 (92%), Gaps = 6/698 (0%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
MRKRDLAILMLSAF+IFFSLQHEGDFSFREAWFHL++EYPIK++ +RLPPPIV+DLNGDG
Sbjct: 1 MRKRDLAILMLSAFSIFFSLQHEGDFSFREAWFHLTDEYPIKYETERLPPPIVSDLNGDG 60
Query: 61 RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
+KE+LVATHDAKIQVLEPH RRVDEGFSEAR+L E+SLLPDK R+A+GRRAVAMATGVID
Sbjct: 61 KKEILVATHDAKIQVLEPHLRRVDEGFSEARLLTELSLLPDKTRVATGRRAVAMATGVID 120
Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
R Y++G PLKQVLVVVTSGWSVMCFDHNL KLWE NLQEDFP NAHHREIAISISNYTLK
Sbjct: 121 RRYKEGHPLKQVLVVVTSGWSVMCFDHNLKKLWETNLQEDFPHNAHHREIAISISNYTLK 180
Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
HGD+GLVI+GGRMEMQPH DPFEEIG+AEKNAEQHRRSASEKE SENSGTV+LRHFA
Sbjct: 181 HGDSGLVIIGGRMEMQPHIYSDPFEEIGMAEKNAEQHRRSASEKEPSENSGTVNLRHFAL 240
Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
YAFAGR+G LRWSRKNE+ +A ASQLIPQHNYKLDVHALNSRHPGEFECREFRES+L
Sbjct: 241 YAFAGRTGALRWSRKNESSDA----ASQLIPQHNYKLDVHALNSRHPGEFECREFRESIL 296
Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISNL 360
GVMPHHWDRREDT+L+LSHFRRHKRK KK GK+T+YPFHKPEE+HPPGKDS KKISNL
Sbjct: 297 GVMPHHWDRREDTVLQLSHFRRHKRKTSKKSNGKTTNYPFHKPEENHPPGKDSAKKISNL 356
Query: 361 IGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHL 420
IG+AA YAGS KSKKP YIPTITNYTQLWW+PNVVVAHQKEGIEAVHLASGRT+CKLHL
Sbjct: 357 IGEAAKYAGSTKSKKPFQYIPTITNYTQLWWLPNVVVAHQKEGIEAVHLASGRTLCKLHL 416
Query: 421 QEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASI 480
QEGGLHADINGDGVLDHVQAVGGNGAEQTV+SGSMEVL+PCWAVATSGVPVREQLFNASI
Sbjct: 417 QEGGLHADINGDGVLDHVQAVGGNGAEQTVISGSMEVLQPCWAVATSGVPVREQLFNASI 476
Query: 481 C-HHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGE 539
C HHSP NLF HG+F RNFGRT DV+SLEVATPILIPR DGHRHRKGSHGDVVFLTNRGE
Sbjct: 477 CHHHSPLNLFQHGDFGRNFGRT-DVSSLEVATPILIPRGDGHRHRKGSHGDVVFLTNRGE 535
Query: 540 VTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILA 599
VT+YSPGLHGHDA+WQWQ+ T ATWSNLPSPSGM E VVPTLKAFSLR DNQQMILA
Sbjct: 536 VTSYSPGLHGHDAVWQWQISTGATWSNLPSPSGMMEGGMVVPTLKAFSLRARDNQQMILA 595
Query: 600 GGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPG 659
GDQEA VISPGGSI TS+DLPAPPTHAL+CEDFSNDGLTD+I++TSNGVYGFVQTR PG
Sbjct: 596 AGDQEASVISPGGSIQTSVDLPAPPTHALICEDFSNDGLTDLIVVTSNGVYGFVQTRSPG 655
Query: 660 ALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGLR 697
ALFFSTLVGCL++VMGVIFVTQHLNS+K KPRASS R
Sbjct: 656 ALFFSTLVGCLLIVMGVIFVTQHLNSIKEKPRASSAAR 693
>gi|224094889|ref|XP_002310280.1| predicted protein [Populus trichocarpa]
gi|222853183|gb|EEE90730.1| predicted protein [Populus trichocarpa]
Length = 679
Score = 1231 bits (3186), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 583/698 (83%), Positives = 637/698 (91%), Gaps = 20/698 (2%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
MRKRDLAILMLSAF+IFFSLQHEGDFSFREAWFHL+++YPIK++ DRLPPPIV+DLNGDG
Sbjct: 1 MRKRDLAILMLSAFSIFFSLQHEGDFSFREAWFHLTDDYPIKYETDRLPPPIVSDLNGDG 60
Query: 61 RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
+KE+LVATHDAKI VLEPH+RRVDEGFSEAR+L E+SLLPDK R+A+GRRAVAMATGVI+
Sbjct: 61 KKEILVATHDAKILVLEPHSRRVDEGFSEARLLTELSLLPDKTRVATGRRAVAMATGVIE 120
Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
R Y++G PLKQVLVVVTSGWSVMCFDHNL KLWE N+QEDFP NAHHREIAISISNYTLK
Sbjct: 121 RRYKEGHPLKQVLVVVTSGWSVMCFDHNLKKLWETNVQEDFPHNAHHREIAISISNYTLK 180
Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
HGD GLVI+GGRME+QPH +DPFEEIG+AEKNAEQHRRSA EKE SENSGTV+LRHFA
Sbjct: 181 HGDMGLVIIGGRMEVQPHNYLDPFEEIGMAEKNAEQHRRSAGEKEPSENSGTVNLRHFAL 240
Query: 241 YAFAGRSGLLRWSRKNENIEAQPTD-ASQLIPQHNYKLDVHALNSRHPGEFECREFRESV 299
YAFAGR+G +RWSRKNENIEA+ +D ASQLIPQHNYKLDVHALNSRHPGE
Sbjct: 241 YAFAGRTGTVRWSRKNENIEAESSDAASQLIPQHNYKLDVHALNSRHPGE---------- 290
Query: 300 LGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISN 359
DRREDT+L+LSHFRRHKRK KK GK+++YPFHKPEE+HPPGKD+TKKISN
Sbjct: 291 --------DRREDTVLQLSHFRRHKRKTSKKSNGKNSNYPFHKPEENHPPGKDTTKKISN 342
Query: 360 LIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLH 419
LIGKAA YA S KSKKP YIPTITNYTQLWWVPNVVVAHQKEGIEA+HLASGRT+CKLH
Sbjct: 343 LIGKAAKYASSTKSKKPSQYIPTITNYTQLWWVPNVVVAHQKEGIEAIHLASGRTLCKLH 402
Query: 420 LQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNAS 479
LQEGGLHADINGDGVLDHVQAVGGNGAE+TVVSG+MEVL+PCWAVATSGVPVREQLFNAS
Sbjct: 403 LQEGGLHADINGDGVLDHVQAVGGNGAEKTVVSGAMEVLQPCWAVATSGVPVREQLFNAS 462
Query: 480 ICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGE 539
ICHHSPFNLF HG+F RNFGRT DV+SLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGE
Sbjct: 463 ICHHSPFNLFQHGDFGRNFGRT-DVSSLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGE 521
Query: 540 VTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILA 599
VT+YSPGLHGHDA+WQWQ+LT ATWSNLPSPSGM E VVPTLKAFSLR HDNQQMILA
Sbjct: 522 VTSYSPGLHGHDAVWQWQILTGATWSNLPSPSGMMEGGMVVPTLKAFSLRAHDNQQMILA 581
Query: 600 GGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPG 659
GDQEA VISPGGS+ TS DLPAPPTHAL+CEDF+NDGL D+I++TSNGVYGFVQTR PG
Sbjct: 582 AGDQEAAVISPGGSVQTSFDLPAPPTHALICEDFTNDGLPDLIVVTSNGVYGFVQTRSPG 641
Query: 660 ALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGLR 697
ALFFSTLVGCL++VMGVIFVTQH+NS+K KPRASSGLR
Sbjct: 642 ALFFSTLVGCLLIVMGVIFVTQHINSIKGKPRASSGLR 679
>gi|356496701|ref|XP_003517204.1| PREDICTED: uncharacterized protein LOC100787497 [Glycine max]
Length = 697
Score = 1229 bits (3181), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 577/698 (82%), Positives = 637/698 (91%), Gaps = 2/698 (0%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
MRKRDLAILMLSAFAIFF+LQ +G SF++AW HL++EYPIK++A+RLPPP+VADLNGDG
Sbjct: 1 MRKRDLAILMLSAFAIFFTLQQDGGISFKDAWMHLTDEYPIKYEAERLPPPLVADLNGDG 60
Query: 61 RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
+KEVLVATHDAKIQVLEPH+RRVDEGFSEARVLAEVSLLPDK+R+ +GRR VAMATG ID
Sbjct: 61 KKEVLVATHDAKIQVLEPHSRRVDEGFSEARVLAEVSLLPDKVRVMTGRRPVAMATGYID 120
Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
R Y+ GQP KQVLVVVTSGWSVMCFD NL KLWE NLQEDFP NAHHRE+AISISNYTLK
Sbjct: 121 R-YKIGQPQKQVLVVVTSGWSVMCFDSNLQKLWENNLQEDFPHNAHHREVAISISNYTLK 179
Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
HGDTGL+IVGGRMEMQPH MDPFEE+G+ + AEQHRRSA+EKEASENSGTVDLRHFAF
Sbjct: 180 HGDTGLIIVGGRMEMQPHIFMDPFEEMGMGARFAEQHRRSAAEKEASENSGTVDLRHFAF 239
Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
YAFAGRSG+ RWSRKNENIE +DASQL+PQHNYKLDVHALN+R PGE+ECREFRES+L
Sbjct: 240 YAFAGRSGVERWSRKNENIEVHSSDASQLLPQHNYKLDVHALNTRQPGEYECREFRESIL 299
Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISNL 360
GVMPH W RREDTLLKL+HFRRHKRK LKK GK+ SYPFHKPEE+HPPGKDSTKKISN+
Sbjct: 300 GVMPHQWARREDTLLKLAHFRRHKRKTLKKTPGKAMSYPFHKPEENHPPGKDSTKKISNI 359
Query: 361 IGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHL 420
IGKAA YAGSAKSKK + Y+PTITNYTQ+WWVPNVVVAHQKEGIEA+HLASGRT+CKLHL
Sbjct: 360 IGKAANYAGSAKSKKHLPYVPTITNYTQVWWVPNVVVAHQKEGIEALHLASGRTICKLHL 419
Query: 421 QEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASI 480
QEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWA+ATSGVP+REQLFN SI
Sbjct: 420 QEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAIATSGVPIREQLFNVSI 479
Query: 481 CHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEV 540
CH++ FNLF HGE R++ + SD+ASLEVATPILIPRSDGHRHRKGSHGDV+FLTNRGE+
Sbjct: 480 CHYTHFNLFQHGELYRSYSQGSDIASLEVATPILIPRSDGHRHRKGSHGDVIFLTNRGEI 539
Query: 541 TAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTE-ASTVVPTLKAFSLRVHDNQQMILA 599
T+YSPGLHGHDAIWQWQ T TWSNLPSPSG+ E V+PTLK SLR+HDNQ+MILA
Sbjct: 540 TSYSPGLHGHDAIWQWQQSTGVTWSNLPSPSGVMEGGGLVIPTLKPLSLRLHDNQEMILA 599
Query: 600 GGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPG 659
G+QEAV+ISPGGS+L +I+LP PPTH L+ EDFSNDGLTD+IL+TSNGVYGFVQTRQPG
Sbjct: 600 AGEQEAVIISPGGSLLATIELPGPPTHVLIAEDFSNDGLTDLILVTSNGVYGFVQTRQPG 659
Query: 660 ALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGLR 697
ALFFS LVGCLIVVMGVIFVTQHLNS K KPR SSG R
Sbjct: 660 ALFFSMLVGCLIVVMGVIFVTQHLNSTKGKPRPSSGSR 697
>gi|356538246|ref|XP_003537615.1| PREDICTED: uncharacterized protein LOC100789851 [Glycine max]
Length = 693
Score = 1216 bits (3145), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 573/697 (82%), Positives = 632/697 (90%), Gaps = 4/697 (0%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
MRKRDLAILMLSAFAIFF+LQ +G SF++AW HL++EYPIK++A+RLPPP+VADLNGDG
Sbjct: 1 MRKRDLAILMLSAFAIFFTLQQDGGISFKDAWMHLTDEYPIKYEAERLPPPLVADLNGDG 60
Query: 61 RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
+KEVLVATHDAKIQVLEPH+RRVDEGFSEARVLAEVSLLPDK+R+ +GRR VAMATG ID
Sbjct: 61 KKEVLVATHDAKIQVLEPHSRRVDEGFSEARVLAEVSLLPDKVRVMTGRRPVAMATGYID 120
Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
R Y+ GQP KQVLVVVTSGWSVMCFD NL KLWE NLQEDFP NAHHRE+AISISNYTLK
Sbjct: 121 R-YKIGQPQKQVLVVVTSGWSVMCFDSNLQKLWENNLQEDFPHNAHHREVAISISNYTLK 179
Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
HGDTGL+IVGGRMEMQPH MDPFEE+G+ + AEQH+RSA+EKEAS GTVDLRHFAF
Sbjct: 180 HGDTGLIIVGGRMEMQPHIFMDPFEEMGMGARFAEQHQRSAAEKEAS---GTVDLRHFAF 236
Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
YAFAGRSG RWSRKNENIEA +DASQL+PQHNYKLDVHALN+R PGEFECREFRES+L
Sbjct: 237 YAFAGRSGDERWSRKNENIEAHSSDASQLLPQHNYKLDVHALNTRQPGEFECREFRESIL 296
Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISNL 360
GVMPH W RREDTL KL+HFRRHKRK LKK GK+ SYPFHKPEE+HPPGKDSTKKISN+
Sbjct: 297 GVMPHQWARREDTLFKLAHFRRHKRKALKKTPGKAISYPFHKPEENHPPGKDSTKKISNI 356
Query: 361 IGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHL 420
IGKAA+YAGSAKSKK + Y+PTITNYTQ+WWVPNVVV+HQKEGIEA+HLA+GRT+CK HL
Sbjct: 357 IGKAASYAGSAKSKKHLPYVPTITNYTQVWWVPNVVVSHQKEGIEALHLATGRTICKFHL 416
Query: 421 QEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASI 480
QEGGLHAD+NGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFN SI
Sbjct: 417 QEGGLHADVNGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNVSI 476
Query: 481 CHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEV 540
CH++ FNLF HGE R++ + SD ASLEVATPILIPRSDGHRHRKGSHGDV+FLTNRGE+
Sbjct: 477 CHYTHFNLFQHGELYRSYSQGSDTASLEVATPILIPRSDGHRHRKGSHGDVIFLTNRGEI 536
Query: 541 TAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAG 600
T+YSPGLHGHDAIWQWQ T TWSNLPSPSGM E V+PTLK SLR+HDNQ+MILA
Sbjct: 537 TSYSPGLHGHDAIWQWQQSTGVTWSNLPSPSGMMEGGLVIPTLKPLSLRLHDNQEMILAA 596
Query: 601 GDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGA 660
G+QEAV+ISPGGSIL +I+LP PPTH L+ EDFSNDGLTD+IL+TS+GVYGFVQTRQPGA
Sbjct: 597 GEQEAVIISPGGSILATIELPGPPTHVLITEDFSNDGLTDLILVTSHGVYGFVQTRQPGA 656
Query: 661 LFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGLR 697
LFFS LVGCLIVVMGVIFVTQHLNS K KPR SSG R
Sbjct: 657 LFFSMLVGCLIVVMGVIFVTQHLNSTKGKPRPSSGPR 693
>gi|449464520|ref|XP_004149977.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101223217 [Cucumis sativus]
Length = 686
Score = 1215 bits (3144), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 576/694 (82%), Positives = 632/694 (91%), Gaps = 10/694 (1%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
MRKRDLAILMLSAFAIFFSLQHEGDFSFREAW HL++EYPIK++ DRLPPP+VADLNGDG
Sbjct: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWMHLTDEYPIKYEGDRLPPPVVADLNGDG 60
Query: 61 RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
+KEVLVATHDAKI VLEPH+RRVDEGFS ARV +RI+SGRR VAMATGVID
Sbjct: 61 KKEVLVATHDAKILVLEPHSRRVDEGFSHARV---------XVRISSGRRPVAMATGVID 111
Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
R RQGQP+ QVLVVVTSGWSV+CFDHNLNKLWEANLQEDFP NAHHREIAISI+NYTLK
Sbjct: 112 RHPRQGQPVTQVLVVVTSGWSVLCFDHNLNKLWEANLQEDFPHNAHHREIAISITNYTLK 171
Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
HGD+GL+IVGGRMEMQ H MDPFEEIG+AEKNAEQHRRSA+EKEASENSG++DLRHFAF
Sbjct: 172 HGDSGLIIVGGRMEMQSHIFMDPFEEIGIAEKNAEQHRRSATEKEASENSGSIDLRHFAF 231
Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
YAFAGRSGL RWSRKNENIEA +DASQLIPQHNYKLDVH+LN+RHPGEFECREFRES+L
Sbjct: 232 YAFAGRSGLPRWSRKNENIEAHSSDASQLIPQHNYKLDVHSLNARHPGEFECREFRESIL 291
Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISNL 360
GVMPHHWDRREDT+L+L+HFRRHKRK LKK GKS +YPFHKPEE+HPPGKDS+K+I +
Sbjct: 292 GVMPHHWDRREDTVLELAHFRRHKRKALKKTSGKSVNYPFHKPEENHPPGKDSSKRIPKI 351
Query: 361 IGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHL 420
IG AA AGSAK+KKP+ Y+PTITNYT+LWW+PNVVVAHQKEGIEA+HLASGRT+CKLHL
Sbjct: 352 IGTAANIAGSAKTKKPLPYVPTITNYTKLWWLPNVVVAHQKEGIEALHLASGRTICKLHL 411
Query: 421 QEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASI 480
QEGGLHADINGDGVLDHVQAVGGNGAE+TVVSGSMEV++PCWAVATSGVPVREQLFNASI
Sbjct: 412 QEGGLHADINGDGVLDHVQAVGGNGAERTVVSGSMEVIQPCWAVATSGVPVREQLFNASI 471
Query: 481 CHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEV 540
CH SPFN F HGE SR FGRT D+ASLEVATPILI R DGHRHRKGSHGDVVFLTNRGEV
Sbjct: 472 CHFSPFNYFQHGELSR-FGRTPDMASLEVATPILISRKDGHRHRKGSHGDVVFLTNRGEV 530
Query: 541 TAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAG 600
T+YSPGLHGH A WQWQ+ T ATWSNLPSPSGM +A TV+PTLKA LRV Q+M+LA
Sbjct: 531 TSYSPGLHGHGADWQWQITTGATWSNLPSPSGMMDAGTVIPTLKAIDLRVGATQEMVLAA 590
Query: 601 GDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGA 660
G+QEAVVISPGGS+ SI+LPA PTHAL+ EDFSNDGLTD+IL+TS GVYGFVQTRQPGA
Sbjct: 591 GEQEAVVISPGGSVQASIELPASPTHALITEDFSNDGLTDIILVTSTGVYGFVQTRQPGA 650
Query: 661 LFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASS 694
LFFSTLVGCLI+VMGVIFVTQHLNS+K KPR S+
Sbjct: 651 LFFSTLVGCLILVMGVIFVTQHLNSIKGKPRPSA 684
>gi|297816364|ref|XP_002876065.1| FG-GAP repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297321903|gb|EFH52324.1| FG-GAP repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 698
Score = 1205 bits (3117), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 561/698 (80%), Positives = 634/698 (90%), Gaps = 2/698 (0%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
MRKRDLAILMLS FAIFF+LQHEGDF+F+EAWFHL +EYP+K++ADRLPPPIVADLNGDG
Sbjct: 1 MRKRDLAILMLSGFAIFFTLQHEGDFAFKEAWFHLYDEYPVKYEADRLPPPIVADLNGDG 60
Query: 61 RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
+KEVLVAT+DAKIQVLEPH+RRVDEGFSEARVLAE+ LLPDKIRIASGRRAVAMATGVID
Sbjct: 61 KKEVLVATNDAKIQVLEPHSRRVDEGFSEARVLAEIPLLPDKIRIASGRRAVAMATGVID 120
Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
R Y+ G P KQV+VVVTSGWSV+CFDHNL KLWE NLQEDFP NAHHREIAISISNYTLK
Sbjct: 121 RYYKDGTPQKQVVVVVTSGWSVLCFDHNLKKLWETNLQEDFPHNAHHREIAISISNYTLK 180
Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
HGDTGLVIVGGRMEMQP+ MDPFEE+G+ +NAEQHRRSA+E +ASE+SG ++LRHF+
Sbjct: 181 HGDTGLVIVGGRMEMQPYNHMDPFEELGMTAQNAEQHRRSATENQASEDSGAINLRHFSV 240
Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
YAFAG++GLLRWS+K +++EA +DASQLIPQHNYKLDVHALNSRHPGEFECREFRES+L
Sbjct: 241 YAFAGKTGLLRWSKKTDDVEAHTSDASQLIPQHNYKLDVHALNSRHPGEFECREFRESIL 300
Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVG-KSTSYPFHKPEEHHPPGKDSTKKISN 359
VMPHHWDRREDTLLKL+HFRRHKRK LKK G KST+YPFHKPEEH P GKD ++KI
Sbjct: 301 SVMPHHWDRREDTLLKLAHFRRHKRKTLKKQAGSKSTAYPFHKPEEHTPAGKDLSRKIPK 360
Query: 360 LIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLH 419
LIGKAA YAGSAK KK + YIPTITNYT+LWWVPNVVVAHQKEGIEA+HL +GRT+CKL
Sbjct: 361 LIGKAARYAGSAKPKKGMQYIPTITNYTKLWWVPNVVVAHQKEGIEAIHLPTGRTLCKLS 420
Query: 420 LQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNAS 479
L EGGLHADINGDGVLDHVQ VGGN E+TVVSGSMEVL+PCWAVATSGVP+REQLFN S
Sbjct: 421 LLEGGLHADINGDGVLDHVQTVGGNVGERTVVSGSMEVLKPCWAVATSGVPIREQLFNVS 480
Query: 480 ICHHSPFNLFPH-GEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRG 538
ICHHSPFN + G++SR+F + D ++LE+ATPILIPR DGH+HR+GSHGDV+FLTNRG
Sbjct: 481 ICHHSPFNFLHYGGDYSRHFAQARDTSTLEIATPILIPRDDGHKHRRGSHGDVIFLTNRG 540
Query: 539 EVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMIL 598
EVT+Y+P +HGHDA+WQWQL T+ATWSNLPSPSG+TE+ TVVPTLK FSLR+HDNQ MIL
Sbjct: 541 EVTSYTPDVHGHDAVWQWQLQTEATWSNLPSPSGLTESGTVVPTLKPFSLRIHDNQPMIL 600
Query: 599 AGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQP 658
AGGDQ AV+ISPGGSIL SI+LP+ PTHAL+ +DFSNDGLTDVI+MTSNGVYGFVQTRQP
Sbjct: 601 AGGDQAAVIISPGGSILASIELPSQPTHALITDDFSNDGLTDVIVMTSNGVYGFVQTRQP 660
Query: 659 GALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGL 696
GALFFS+LVGCL+VVM VIFVTQHLNS++ KPR SS
Sbjct: 661 GALFFSSLVGCLLVVMAVIFVTQHLNSIQGKPRPSSSF 698
>gi|22331734|ref|NP_190674.2| FG-GAP repeat-containing protein [Arabidopsis thaliana]
gi|332645222|gb|AEE78743.1| FG-GAP repeat-containing protein [Arabidopsis thaliana]
Length = 698
Score = 1202 bits (3111), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 559/698 (80%), Positives = 634/698 (90%), Gaps = 2/698 (0%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
MRKRDLAILMLS FAIFF+LQHEGDF+F+EAWFHL +EYP+K++ADRLPPPIVADLNGDG
Sbjct: 1 MRKRDLAILMLSGFAIFFTLQHEGDFAFKEAWFHLYDEYPVKYEADRLPPPIVADLNGDG 60
Query: 61 RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
+KEVLVAT+DAKIQVLEPH+RRVDEGFSEARVLAE++LLPDKIR+ASGRRAVAMATGVID
Sbjct: 61 KKEVLVATNDAKIQVLEPHSRRVDEGFSEARVLAEITLLPDKIRVASGRRAVAMATGVID 120
Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
R Y+ G P KQV+VVVTSGWSV+CFDHNL KLWE NLQEDFP NAHHREIAISISNYTLK
Sbjct: 121 RYYKNGTPQKQVVVVVTSGWSVLCFDHNLKKLWETNLQEDFPHNAHHREIAISISNYTLK 180
Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
HGDTGLVIVGGRMEMQP+ MDPFEE+G+ +NA+QHRRSA+E +ASE+SG ++LRHF+
Sbjct: 181 HGDTGLVIVGGRMEMQPYNHMDPFEELGMTAQNADQHRRSATENQASEDSGAINLRHFSV 240
Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
YAFAG++GLLRWS+K +++EA +DASQLIPQHNYKLDVHALNSRHPGEFECREFRES+L
Sbjct: 241 YAFAGKTGLLRWSKKTDDVEAHTSDASQLIPQHNYKLDVHALNSRHPGEFECREFRESIL 300
Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVG-KSTSYPFHKPEEHHPPGKDSTKKISN 359
VMPH WDRREDTLLKL+HFRRHKRK LKK G KST+YPFHKPEEH P GKD ++KI
Sbjct: 301 SVMPHRWDRREDTLLKLAHFRRHKRKTLKKQAGSKSTAYPFHKPEEHTPAGKDLSRKIPK 360
Query: 360 LIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLH 419
LIGKAA YAGSAK KK + YIPTITNYT+LWWVPNVVVAHQKEGIEA+HL +GRT+CKL
Sbjct: 361 LIGKAARYAGSAKPKKGMQYIPTITNYTKLWWVPNVVVAHQKEGIEAIHLPTGRTLCKLS 420
Query: 420 LQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNAS 479
L EGGLHADINGDGVLDHVQ VGGN E+TVVSGSMEVL+PCWAVATSGVP+REQLFN S
Sbjct: 421 LLEGGLHADINGDGVLDHVQTVGGNVGERTVVSGSMEVLKPCWAVATSGVPIREQLFNVS 480
Query: 480 ICHHSPFNLFPH-GEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRG 538
ICHHSPFN + G++SR+F + D ++LE+ATPILIPR DGH+HRKGSHGDV+FLTNRG
Sbjct: 481 ICHHSPFNFLHYGGDYSRHFAQARDTSTLEIATPILIPRDDGHKHRKGSHGDVIFLTNRG 540
Query: 539 EVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMIL 598
EVT+Y+P +HGHDA+WQWQL T+ATWSNLPSPSG+TE+ TVVPTLK FSLR+HDNQ MIL
Sbjct: 541 EVTSYTPDVHGHDAVWQWQLQTEATWSNLPSPSGLTESGTVVPTLKPFSLRIHDNQPMIL 600
Query: 599 AGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQP 658
AGGDQ AV+ISPGGSIL SI+LP+ PTHAL+ +DFSNDGLTDVI+MTSNGVYGFVQTRQP
Sbjct: 601 AGGDQAAVIISPGGSILASIELPSQPTHALITDDFSNDGLTDVIVMTSNGVYGFVQTRQP 660
Query: 659 GALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGL 696
GALFFS+LVGCL+VVM VIFVTQHLNS++ KPR SS
Sbjct: 661 GALFFSSLVGCLLVVMAVIFVTQHLNSIQGKPRPSSSF 698
>gi|18087608|gb|AAL58934.1|AF462847_1 AT3g51050/F24M12_90 [Arabidopsis thaliana]
gi|24797020|gb|AAN64522.1| At3g51050/F24M12_90 [Arabidopsis thaliana]
Length = 698
Score = 1200 bits (3104), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 558/698 (79%), Positives = 633/698 (90%), Gaps = 2/698 (0%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
MRKRDLAILMLS FAIFF+LQHEGDF+F+EAWFHL +EYP+K++ADRLPPPIVADLNGDG
Sbjct: 1 MRKRDLAILMLSGFAIFFTLQHEGDFAFKEAWFHLYDEYPVKYEADRLPPPIVADLNGDG 60
Query: 61 RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
+KEVLVAT+DAKIQVLEPH+RRVDEGFSEARVLAE++LLPDKIR+ASGRRAVAMATGVID
Sbjct: 61 KKEVLVATNDAKIQVLEPHSRRVDEGFSEARVLAEITLLPDKIRVASGRRAVAMATGVID 120
Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
R Y+ G P KQV+VVVTSGWSV+CFDHNL KLWE NLQEDFP NA HREIAISISNYTLK
Sbjct: 121 RYYKNGTPQKQVVVVVTSGWSVLCFDHNLKKLWETNLQEDFPHNARHREIAISISNYTLK 180
Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
HGDTGLVIVGGRMEMQP+ MDPFEE+G+ +NA+QHRRSA+E +ASE+SG ++LRHF+
Sbjct: 181 HGDTGLVIVGGRMEMQPYNHMDPFEELGMTAQNADQHRRSATENQASEDSGAINLRHFSV 240
Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
YAFAG++GLLRWS+K +++EA +DASQLIPQHNYKLDVHALNSRHPGEFECREFRES+L
Sbjct: 241 YAFAGKTGLLRWSKKTDDVEAHTSDASQLIPQHNYKLDVHALNSRHPGEFECREFRESIL 300
Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVG-KSTSYPFHKPEEHHPPGKDSTKKISN 359
VMPH WDRREDTLLKL+HFRRHKRK LKK G KST+YPFHKPEEH P GKD ++KI
Sbjct: 301 SVMPHRWDRREDTLLKLAHFRRHKRKTLKKQAGSKSTAYPFHKPEEHTPAGKDLSRKIPK 360
Query: 360 LIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLH 419
LIGKAA YAGSAK KK + YIPTITNYT+LWWVPNVVVAHQKEGIEA+HL +GRT+CKL
Sbjct: 361 LIGKAARYAGSAKPKKGMQYIPTITNYTKLWWVPNVVVAHQKEGIEAIHLPTGRTLCKLS 420
Query: 420 LQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNAS 479
L EGGLHADINGDGVLDHVQ VGGN E+TVVSGSMEVL+PCWAVATSGVP+REQLFN S
Sbjct: 421 LLEGGLHADINGDGVLDHVQTVGGNVGERTVVSGSMEVLKPCWAVATSGVPIREQLFNVS 480
Query: 480 ICHHSPFNLFPH-GEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRG 538
ICHHSPFN + G++SR+F + D ++LE+ATPILIPR DGH+HRKGSHGDV+FLTNRG
Sbjct: 481 ICHHSPFNFLHYGGDYSRHFAQARDTSTLEIATPILIPRDDGHKHRKGSHGDVIFLTNRG 540
Query: 539 EVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMIL 598
EVT+Y+P +HGHDA+WQWQL T+ATWSNLPSPSG+TE+ TVVPTLK FSLR+HDNQ MIL
Sbjct: 541 EVTSYTPDVHGHDAVWQWQLQTEATWSNLPSPSGLTESGTVVPTLKPFSLRIHDNQPMIL 600
Query: 599 AGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQP 658
AGGDQ AV+ISPGGSIL SI+LP+ PTHAL+ +DFSNDGLTDVI+MTSNGVYGFVQTRQP
Sbjct: 601 AGGDQAAVIISPGGSILASIELPSQPTHALITDDFSNDGLTDVIVMTSNGVYGFVQTRQP 660
Query: 659 GALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGL 696
GALFFS+LVGCL+VVM VIFVTQHLNS++ KPR SS
Sbjct: 661 GALFFSSLVGCLLVVMAVIFVTQHLNSIQGKPRPSSSF 698
>gi|449524524|ref|XP_004169272.1| PREDICTED: uncharacterized protein LOC101231345 [Cucumis sativus]
Length = 664
Score = 1166 bits (3016), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 551/663 (83%), Positives = 607/663 (91%), Gaps = 2/663 (0%)
Query: 33 FHLSEEYPIKFDADRLPPPIVADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARV 92
HL++EYPIK++ DRLPPP+VADLNGDG+KEVLVATHDAKI VLEPH+RRVDEGFS ARV
Sbjct: 1 MHLTDEYPIKYEGDRLPPPVVADLNGDGKKEVLVATHDAKILVLEPHSRRVDEGFSHARV 60
Query: 93 LAEVSLLPDKIRIASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKL 152
L E SLLP K+RI+SGRR VAMATGVIDR RQGQP+ QVLVVVTSGWSV+CFDHNLNKL
Sbjct: 61 LTEASLLPAKVRISSGRRPVAMATGVIDRHPRQGQPVTQVLVVVTSGWSVLCFDHNLNKL 120
Query: 153 WEANLQEDFPPNAHHREIAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEK 212
WEANLQEDFP NAHHREIAISI+NYTLKHGD+GL+IVGGRMEMQ H MDPFEEIG+AEK
Sbjct: 121 WEANLQEDFPHNAHHREIAISITNYTLKHGDSGLIIVGGRMEMQSHIFMDPFEEIGIAEK 180
Query: 213 NAEQHRRSASEKEASENSGTVDLRHFAFYAFAGRSGLLRWSRKNE-NIEAQPTDASQLIP 271
NAEQHRRSA+EKEASENSG++DLRHFAFYAFAGRSGL RWSRKNE NIEA +DASQLIP
Sbjct: 181 NAEQHRRSATEKEASENSGSIDLRHFAFYAFAGRSGLPRWSRKNEVNIEAHSSDASQLIP 240
Query: 272 QHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKV 331
QHNYKLDVH+LN+RHPGEFECREFRES+LGVMPHHWDRREDT+L+L+HFRRHKRK LKK
Sbjct: 241 QHNYKLDVHSLNARHPGEFECREFRESILGVMPHHWDRREDTVLELAHFRRHKRKALKKT 300
Query: 332 VGKSTSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWW 391
GKS +YPFHKPEE+HPPGKDS+K+I +IG AA AGSAK+KKP+ Y+PTITNYT+LWW
Sbjct: 301 SGKSVNYPFHKPEENHPPGKDSSKRIPKIIGTAANIAGSAKTKKPLPYVPTITNYTKLWW 360
Query: 392 VPNVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVV 451
+PNVVVAHQKEGIEA+HLASGRT+CKLHLQEGGLHADINGDGVLDHVQAVGGNGAE+TVV
Sbjct: 361 LPNVVVAHQKEGIEALHLASGRTICKLHLQEGGLHADINGDGVLDHVQAVGGNGAERTVV 420
Query: 452 SGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVAT 511
SGSMEV++PCWAVATSGVPVREQLFNASICH SPFN F HGE SR FGRT D+ASLEVAT
Sbjct: 421 SGSMEVIQPCWAVATSGVPVREQLFNASICHFSPFNYFQHGELSR-FGRTPDMASLEVAT 479
Query: 512 PILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPS 571
PILI R DGHRHRKGSHGDVVFLTNRGEVT+YSPGLHGH A WQWQ+ T ATWSNLPSPS
Sbjct: 480 PILISRKDGHRHRKGSHGDVVFLTNRGEVTSYSPGLHGHGADWQWQITTGATWSNLPSPS 539
Query: 572 GMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCE 631
GM +A TV+PTLKA LRV Q+M+LA G+QEAVVISPGGS+ SI+LPA PTHAL+ E
Sbjct: 540 GMMDAGTVIPTLKAIDLRVGATQEMVLAAGEQEAVVISPGGSVQASIELPASPTHALITE 599
Query: 632 DFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPR 691
DFSNDGLTD+IL+TS GVYGFVQTRQPGALFFSTLVGCLI+VMGVIFVTQHLNS+K KPR
Sbjct: 600 DFSNDGLTDIILVTSTGVYGFVQTRQPGALFFSTLVGCLILVMGVIFVTQHLNSIKGKPR 659
Query: 692 ASS 694
S+
Sbjct: 660 PSA 662
>gi|6562257|emb|CAB62627.1| putative protein [Arabidopsis thaliana]
Length = 680
Score = 1155 bits (2989), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 544/698 (77%), Positives = 618/698 (88%), Gaps = 20/698 (2%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
MRKRDLAILMLS FAIFF+LQHEGDF+F+EAWFHL +EYP+K++ADRLPPPIVADLNGDG
Sbjct: 1 MRKRDLAILMLSGFAIFFTLQHEGDFAFKEAWFHLYDEYPVKYEADRLPPPIVADLNGDG 60
Query: 61 RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
+KEVLVAT+DAKIQVLEPH+RRVDEGFSEARVLAE++LLPDKIR+ASGRRAVAMATGVID
Sbjct: 61 KKEVLVATNDAKIQVLEPHSRRVDEGFSEARVLAEITLLPDKIRVASGRRAVAMATGVID 120
Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
R Y+ G P KQV+VVVTSGWSV+CFDHNL KLWE NLQEDFP NAHHREIAISISNYTLK
Sbjct: 121 RYYKNGTPQKQVVVVVTSGWSVLCFDHNLKKLWETNLQEDFPHNAHHREIAISISNYTLK 180
Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
HGDTGLVIVGGRMEMQP+ MDPFEE+G+ +NA+QHRRSA+E +ASE+SG ++LRHF+
Sbjct: 181 HGDTGLVIVGGRMEMQPYNHMDPFEELGMTAQNADQHRRSATENQASEDSGAINLRHFSV 240
Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
YAFAG++GLLRWS+K +++EA +DASQLIPQHNYKLDVHALNSRHPGE
Sbjct: 241 YAFAGKTGLLRWSKKTDDVEAHTSDASQLIPQHNYKLDVHALNSRHPGE----------- 289
Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVG-KSTSYPFHKPEEHHPPGKDSTKKISN 359
DRREDTLLKL+HFRRHKRK LKK G KST+YPFHKPEEH P GKD ++KI
Sbjct: 290 -------DRREDTLLKLAHFRRHKRKTLKKQAGSKSTAYPFHKPEEHTPAGKDLSRKIPK 342
Query: 360 LIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLH 419
LIGKAA YAGSAK KK + YIPTITNYT+LWWVPNVVVAHQKEGIEA+HL +GRT+CKL
Sbjct: 343 LIGKAARYAGSAKPKKGMQYIPTITNYTKLWWVPNVVVAHQKEGIEAIHLPTGRTLCKLS 402
Query: 420 LQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNAS 479
L EGGLHADINGDGVLDHVQ VGGN E+TVVSGSMEVL+PCWAVATSGVP+REQLFN S
Sbjct: 403 LLEGGLHADINGDGVLDHVQTVGGNVGERTVVSGSMEVLKPCWAVATSGVPIREQLFNVS 462
Query: 480 ICHHSPFNLFPH-GEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRG 538
ICHHSPFN + G++SR+F + D ++LE+ATPILIPR DGH+HRKGSHGDV+FLTNRG
Sbjct: 463 ICHHSPFNFLHYGGDYSRHFAQARDTSTLEIATPILIPRDDGHKHRKGSHGDVIFLTNRG 522
Query: 539 EVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMIL 598
EVT+Y+P +HGHDA+WQWQL T+ATWSNLPSPSG+TE+ TVVPTLK FSLR+HDNQ MIL
Sbjct: 523 EVTSYTPDVHGHDAVWQWQLQTEATWSNLPSPSGLTESGTVVPTLKPFSLRIHDNQPMIL 582
Query: 599 AGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQP 658
AGGDQ AV+ISPGGSIL SI+LP+ PTHAL+ +DFSNDGLTDVI+MTSNGVYGFVQTRQP
Sbjct: 583 AGGDQAAVIISPGGSILASIELPSQPTHALITDDFSNDGLTDVIVMTSNGVYGFVQTRQP 642
Query: 659 GALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASSGL 696
GALFFS+LVGCL+VVM VIFVTQHLNS++ KPR SS
Sbjct: 643 GALFFSSLVGCLLVVMAVIFVTQHLNSIQGKPRPSSSF 680
>gi|326530844|dbj|BAK01220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 704
Score = 1088 bits (2815), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 523/707 (73%), Positives = 611/707 (86%), Gaps = 13/707 (1%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEE-YPIKFDADRLPPPIVADLNGD 59
MRKRDL IL+L+AFA+FFSLQHEGDFSFREAW+HLS+E YPIK DADRLPPP+VADLNGD
Sbjct: 1 MRKRDLGILLLAAFAVFFSLQHEGDFSFREAWYHLSDEGYPIKHDADRLPPPLVADLNGD 60
Query: 60 GRKEVLVATHDAKIQVLE--PHARRVD----EGFSEARVLAEVSLLPDKIRIASGRRAVA 113
GR+EVL+ THDAKIQVL+ HAR + F EARV+AE+SLLP +R+A+GRR VA
Sbjct: 61 GRREVLLPTHDAKIQVLQLPAHARLAAADTLDDFHEARVMAEISLLPANVRVAAGRRPVA 120
Query: 114 MATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAIS 173
MA G +DR+Y+Q KQVLVVVTSGW+VMCFDHNL KLWE +L +DFP AHHRE+A+S
Sbjct: 121 MAVGTVDRSYKQADVRKQVLVVVTSGWAVMCFDHNLKKLWEVSLGDDFPHTAHHREVAVS 180
Query: 174 ISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTV 233
++NYTLKHGD GLVIVGGRMEMQ H+ D F++ ++ ++E+HRRSA+EK+ASE +G V
Sbjct: 181 VTNYTLKHGDAGLVIVGGRMEMQHHS-ADLFDDFMTSQHSSEEHRRSATEKQASE-AGNV 238
Query: 234 DLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECR 293
D+RHFA YAF+GR+G LRWSRKNENI+AQP+DAS +IPQHNYKLDVH+LN+RHPGE+ECR
Sbjct: 239 DVRHFALYAFSGRTGTLRWSRKNENIQAQPSDASAMIPQHNYKLDVHSLNNRHPGEYECR 298
Query: 294 EFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDS 353
+FRES+LGVMPHHWDRREDT L+L+HFR+HKRK LKK GKS HKP EH+P GKD
Sbjct: 299 QFRESILGVMPHHWDRREDTSLQLAHFRKHKRKELKKTQGKSVVNNVHKPIEHNPLGKDD 358
Query: 354 TKKISNLIGKAATYAGSAKSKKPVN--YIPTITNYTQLWWVPNVVVAHQKEGIEAVHLAS 411
T +IS IGKAA AGSAK KK ++ YIPTITNYTQ+WWVPNVVVAH+KEGIEA+HLAS
Sbjct: 359 TNRISKAIGKAADLAGSAKGKKSLHTLYIPTITNYTQVWWVPNVVVAHEKEGIEAIHLAS 418
Query: 412 GRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPV 471
GRT+CKLHL EGGLHADINGDGVLDHVQ VGGNGAEQTVVSGSMEVL+PCWAVATSGVPV
Sbjct: 419 GRTLCKLHLTEGGLHADINGDGVLDHVQVVGGNGAEQTVVSGSMEVLKPCWAVATSGVPV 478
Query: 472 REQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDV 531
REQLFN SICH++ NLF HG+FSR+FGRT D + LEVATP+L+ R DGH+HR+GSHGD+
Sbjct: 479 REQLFNVSICHYNHLNLFHHGDFSRSFGRTFDPSGLEVATPVLLQRDDGHKHRRGSHGDI 538
Query: 532 VFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVH 591
+FLT+RGEVT+Y+PGL GHDA+W+WQL T ATWSNLPSPSGM E + VVPTLK FSLR +
Sbjct: 539 IFLTSRGEVTSYTPGLLGHDAMWRWQLSTGATWSNLPSPSGMME-NVVVPTLKTFSLRAY 597
Query: 592 DNQQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYG 651
D +Q+I+AGGDQEAVVISP GS+LTSIDLPAPPTHAL+ EDFS DGLTD+IL+TS GVYG
Sbjct: 598 DPKQVIIAGGDQEAVVISPDGSLLTSIDLPAPPTHALILEDFSGDGLTDIILVTSGGVYG 657
Query: 652 FVQTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVK-AKPRASSGLR 697
FVQTRQPGALFFSTLVG LIVV+GVIFV+ HLNS KPR+S+G R
Sbjct: 658 FVQTRQPGALFFSTLVGFLIVVIGVIFVSLHLNSSNGGKPRSSTGYR 704
>gi|357140045|ref|XP_003571583.1| PREDICTED: uncharacterized protein LOC100827170 [Brachypodium
distachyon]
Length = 701
Score = 1087 bits (2812), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 524/704 (74%), Positives = 608/704 (86%), Gaps = 10/704 (1%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEE-YPIKFDADRLPPPIVADLNGD 59
MRKRDL IL+L+AFAIFFSLQHEGDFSFREAW+HLS++ YPIK+DADRLPPP+VADLNGD
Sbjct: 1 MRKRDLGILLLAAFAIFFSLQHEGDFSFREAWYHLSDDGYPIKYDADRLPPPLVADLNGD 60
Query: 60 GRKEVLVATHDAKIQVLEP-HARRVD--EGFSEARVLAEVSLLPDKIRIASGRRAVAMAT 116
GR E+L+ THDAKIQVL+P HAR + F EAR++AE+SLLP +R++SGRR VAMA
Sbjct: 61 GRPEILLPTHDAKIQVLQPPHARPASGFDDFHEARLMAEISLLPTNVRVSSGRRPVAMAV 120
Query: 117 GVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISN 176
G +DR+Y+ KQVLVVVTSGW+VMCFDHNLNKLWEANLQ+DFP AHHRE+AISISN
Sbjct: 121 GAVDRSYKLADVRKQVLVVVTSGWAVMCFDHNLNKLWEANLQDDFPHAAHHREVAISISN 180
Query: 177 YTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLR 236
YT+KHGD GLVIVGGRMEMQ H+ D F++ +E + E+HRRSASEK+ASE +G VD+R
Sbjct: 181 YTIKHGDAGLVIVGGRMEMQHHS-ADLFDDFMTSEHSREEHRRSASEKQASE-AGNVDVR 238
Query: 237 HFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFR 296
HFA YAFAGR+G LRWSRKNENI++QP+DAS LIPQHNYKLDVH+LNSRHPG+FECREFR
Sbjct: 239 HFALYAFAGRTGALRWSRKNENIQSQPSDASALIPQHNYKLDVHSLNSRHPGQFECREFR 298
Query: 297 ESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKK 356
ESVLGVMPHHWDRREDT L+L++FRRHKRK LKK GK+ HKP EH+P GKD T +
Sbjct: 299 ESVLGVMPHHWDRREDTSLQLANFRRHKRKQLKKTPGKNAVNNVHKPSEHNPAGKDDTNR 358
Query: 357 ISNLIGKAATYAGSAKSKKPVN--YIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRT 414
+S IGKAA AGSAK KK + ++PTITNYTQ+WWVPNVVVAH+KEGIEA+HLASGRT
Sbjct: 359 LSKAIGKAAELAGSAKGKKSQHTPFVPTITNYTQVWWVPNVVVAHEKEGIEAIHLASGRT 418
Query: 415 VCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQ 474
+CKLHL EGGLHAD+NGDGVLDHVQ VG NG EQTVVSGSMEVL+PCWAVATSGVPVREQ
Sbjct: 419 ICKLHLTEGGLHADVNGDGVLDHVQVVGANGIEQTVVSGSMEVLKPCWAVATSGVPVREQ 478
Query: 475 LFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFL 534
LFN SICH++ +NLF HG+FS++FGR D LEVATPIL+ R DGH+HR+GSHGD++FL
Sbjct: 479 LFNVSICHYNHYNLFHHGDFSKSFGRPFDPTGLEVATPILLQRDDGHKHRRGSHGDIIFL 538
Query: 535 TNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQ 594
T+RGE T+YSPGL GHDA W+WQL T ATWSNLPSPSGM E + VVPTLKAFSLR +D +
Sbjct: 539 TSRGEATSYSPGLLGHDATWRWQLSTGATWSNLPSPSGMME-NIVVPTLKAFSLRAYDPK 597
Query: 595 QMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQ 654
Q+I+AGGDQEAVVISP GS+L SI+LPAPPTHAL+ EDFS DGLTD+IL+TS GVYGFVQ
Sbjct: 598 QVIIAGGDQEAVVISPSGSLLASIELPAPPTHALILEDFSGDGLTDIILVTSGGVYGFVQ 657
Query: 655 TRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVKA-KPRASSGLR 697
TRQPGALFFSTLVGCLIVV+GVIFV+ HLNS + KPR+S+ R
Sbjct: 658 TRQPGALFFSTLVGCLIVVIGVIFVSLHLNSSNSGKPRSSTEYR 701
>gi|115443607|ref|NP_001045583.1| Os02g0100700 [Oryza sativa Japonica Group]
gi|41053219|dbj|BAD08180.1| unknown protein [Oryza sativa Japonica Group]
gi|113535114|dbj|BAF07497.1| Os02g0100700 [Oryza sativa Japonica Group]
gi|215697180|dbj|BAG91174.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218189856|gb|EEC72283.1| hypothetical protein OsI_05449 [Oryza sativa Indica Group]
gi|222621985|gb|EEE56117.1| hypothetical protein OsJ_04982 [Oryza sativa Japonica Group]
Length = 701
Score = 1082 bits (2797), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 525/704 (74%), Positives = 600/704 (85%), Gaps = 10/704 (1%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSE-EYPIKFDADRLPPPIVADLNGD 59
MRKRDL IL+L+AFA+FFSLQH+GD SFREAW+HLS+ +YPIK DADRLP P+VADLNGD
Sbjct: 1 MRKRDLGILLLAAFAVFFSLQHDGDLSFREAWYHLSDADYPIKHDADRLPSPLVADLNGD 60
Query: 60 GRKEVLVATHDAKIQVLEPHARRV--DEGFSEARVLAEVSLLPDKIRIASGRRAVAMATG 117
G+ EVL+ THDAKIQVL+PH R D F +AR++A+VSLLP +R++SGRR VAMA G
Sbjct: 61 GKPEVLIPTHDAKIQVLQPHPRPSPDDASFHDARLMADVSLLPSNVRLSSGRRPVAMAVG 120
Query: 118 VIDRTYRQG-QPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISN 176
+DR Y P KQ+LVVVTSGWSVMCFDHNL KLWEANLQ+DFP AHHRE+AISI+N
Sbjct: 121 TVDRHYAHAPSPSKQLLVVVTSGWSVMCFDHNLKKLWEANLQDDFPHAAHHREVAISITN 180
Query: 177 YTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLR 236
YTLKHGD GLVIVGGRMEMQ H+ + F+E ++E N E+HRRSASEK+ASE +G DLR
Sbjct: 181 YTLKHGDAGLVIVGGRMEMQHHS-AELFDEFMVSEHNREEHRRSASEKQASE-TGNTDLR 238
Query: 237 HFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFR 296
HFA YAFAGR+G LRWSRKNENI +QP+DAS LIPQHNYKLD HALNSRHPG+FECREFR
Sbjct: 239 HFALYAFAGRTGELRWSRKNENIPSQPSDASVLIPQHNYKLDAHALNSRHPGQFECREFR 298
Query: 297 ESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKK 356
ESVLGVMPHHWDRREDT L+L+HFRRHKRK LKK GK+ HKP EH+PPGKD + +
Sbjct: 299 ESVLGVMPHHWDRREDTFLQLAHFRRHKRKALKKTPGKAVVNNVHKPSEHNPPGKDVSNR 358
Query: 357 ISNLIGKAATYAGSAKSKKPVN--YIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRT 414
++N+IGKAA A S K KK Y+PTITNYTQ+WWVPNVVVAH+KEGIEAVHLASGRT
Sbjct: 359 LANVIGKAADMANSNKIKKSQRTLYVPTITNYTQVWWVPNVVVAHEKEGIEAVHLASGRT 418
Query: 415 VCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQ 474
+CKLHL EGGLHADINGDGVLDHVQ VG NG EQTVVSGSMEVL+PCWAVATSGVPVREQ
Sbjct: 419 ICKLHLTEGGLHADINGDGVLDHVQVVGANGIEQTVVSGSMEVLKPCWAVATSGVPVREQ 478
Query: 475 LFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFL 534
LFN SICH++ FNLF HG+FSR+FGRT D LEVATPIL+ R DGH+HR+GSHGD++FL
Sbjct: 479 LFNVSICHYNNFNLFHHGDFSRSFGRTFDTTGLEVATPILLQRDDGHKHRRGSHGDIIFL 538
Query: 535 TNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQ 594
T+RGEVT+YSPGL GHDAIW+WQL T ATWSNLPSPSGM E + VVPTLKAFSLR +D +
Sbjct: 539 TSRGEVTSYSPGLLGHDAIWRWQLSTGATWSNLPSPSGMME-NIVVPTLKAFSLRAYDPK 597
Query: 595 QMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQ 654
Q+I+AGGD EAVVISP G +L SI+LPAPPTHALV EDF+ DGLTD+IL+TS GVYGFVQ
Sbjct: 598 QVIIAGGDLEAVVISPSGGLLASIELPAPPTHALVLEDFNGDGLTDIILVTSGGVYGFVQ 657
Query: 655 TRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVKA-KPRASSGLR 697
TR PGALFFSTLVGCLIVV+GVIFV+ HLNS + KPRAS+ R
Sbjct: 658 TRHPGALFFSTLVGCLIVVIGVIFVSLHLNSSNSGKPRASTDYR 701
>gi|293337243|ref|NP_001169290.1| uncharacterized protein LOC100383154 precursor [Zea mays]
gi|224028447|gb|ACN33299.1| unknown [Zea mays]
Length = 703
Score = 1030 bits (2664), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 506/702 (72%), Positives = 600/702 (85%), Gaps = 11/702 (1%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEE-YPIKFDADRLPPPIVADLNGD 59
MRKRDL IL+L+AFAIFFSL HEGDFSFRE+W+HL++E +PIK++ADRLPPP+VADLNGD
Sbjct: 1 MRKRDLGILLLAAFAIFFSLHHEGDFSFRESWYHLTDEDFPIKYEADRLPPPLVADLNGD 60
Query: 60 GRKEVLVATHDAKIQVLEP-HARRV--DEGFSEARVLAEVSLLPDKIRIASGRRAVAMAT 116
G+ EVL+ THDAKIQVL+P HAR + D F EARV+A++SLLPD + +ASGRR +AMA
Sbjct: 61 GKPEVLLPTHDAKIQVLQPPHARHLNDDSAFQEARVMADISLLPDNVLLASGRRPIAMAV 120
Query: 117 GVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISN 176
G +DR+YR G+ KQVLVVVTSGWSVMCFDHNL KLWE NLQ+DFP AHHRE+AISI+N
Sbjct: 121 GNVDRSYRPGEVRKQVLVVVTSGWSVMCFDHNLKKLWEHNLQDDFPHAAHHREVAISITN 180
Query: 177 YTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLR 236
YTLKHGD GLVIVGGRMEMQ H+ D F+E + E N + RRSASEK+ SE +G DLR
Sbjct: 181 YTLKHGDAGLVIVGGRMEMQHHS-ADLFDEFMIPEHNMDDRRRSASEKQGSE-AGNADLR 238
Query: 237 HFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFR 296
HFA YAFAGRSG RWSRKNENI++QP+DAS ++PQHNYKLDVHALNS PG+FECREFR
Sbjct: 239 HFALYAFAGRSGDRRWSRKNENIQSQPSDASVMLPQHNYKLDVHALNSHQPGQFECREFR 298
Query: 297 ESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKK 356
ES+LG+MPHHWDRREDT L+L+HFR+HKRK +K+ GK+ +KP EH+PPGKD++ +
Sbjct: 299 ESILGIMPHHWDRREDTTLQLAHFRKHKRKQVKRTPGKAVINSVNKPIEHNPPGKDASNR 358
Query: 357 ISNLIGKAATYAGSAKSKKP--VNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRT 414
I+ +GKAA A S K++K + Y+PTITN+TQ+WWVPNVVV H+KEGIE VHLASGRT
Sbjct: 359 IARALGKAADMANSNKARKAQRMQYVPTITNHTQVWWVPNVVVVHEKEGIEVVHLASGRT 418
Query: 415 VCKLHLQEGGLHADINGDGVLDHVQAVGGNG-AEQTVVSGSMEVLRPCWAVATSGVPVRE 473
+CKLHL EGGLHADINGDGVLDHVQ VGGNG EQTVVSGSMEVL+PCWAVATSGVPVRE
Sbjct: 419 ICKLHLNEGGLHADINGDGVLDHVQVVGGNGIKEQTVVSGSMEVLKPCWAVATSGVPVRE 478
Query: 474 QLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVF 533
QLFN SICH++ FNLF HG+FSR+FGRT D A LEVATPIL+ DGH+HR+GSHGD++F
Sbjct: 479 QLFNVSICHYNHFNLFHHGDFSRSFGRTFDTAGLEVATPILVQTDDGHKHRRGSHGDIIF 538
Query: 534 LTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDN 593
LT+RGEVT+YSPGL GHDA+W+WQ+ + ATWSNLPSPSGM E + VVPTLKAFSLR +D
Sbjct: 539 LTSRGEVTSYSPGLLGHDAVWRWQVSSGATWSNLPSPSGMME-NIVVPTLKAFSLRAYDP 597
Query: 594 QQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFV 653
+++I+AGGDQEAVV+SP G IL I+LPAPPTHA + EDFS DGLTD+I++TS G+YGFV
Sbjct: 598 KEVIIAGGDQEAVVLSPSGGILAMIELPAPPTHAHLLEDFSGDGLTDMIVVTSGGIYGFV 657
Query: 654 QTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVK-AKPRASS 694
QTRQPGALFFSTLVGCLIVV+GV+FV+ HL+S AKPRASS
Sbjct: 658 QTRQPGALFFSTLVGCLIVVIGVVFVSLHLSSSNAAKPRASS 699
>gi|167997045|ref|XP_001751229.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162697210|gb|EDQ83546.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 706
Score = 806 bits (2081), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/700 (57%), Positives = 508/700 (72%), Gaps = 11/700 (1%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
MRKRDL IL+LSA +IF+SLQ+EG SFR AW H PIK +A+RLPPP+VADLNGDG
Sbjct: 1 MRKRDLGILLLSALSIFYSLQNEGSLSFRPAWVHYKNSSPIKHEAERLPPPVVADLNGDG 60
Query: 61 RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
EV+VA+ D +QVL+P + GFS+A VLAEVSLLPD++R+++GRR VA+A G +
Sbjct: 61 HVEVVVASED-NLQVLDPRVSFAENGFSQAGVLAEVSLLPDRVRVSAGRRPVALAAGEVK 119
Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
R YR K+V+VVVT+GWS+MCFDHNL KLWE ++Q+DF +HH+E+AISISNYTLK
Sbjct: 120 RFYRAQDLKKKVIVVVTAGWSIMCFDHNLKKLWEDDVQDDFLHGSHHKEVAISISNYTLK 179
Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
H DTGL+IVGG ME+QP +DPFEE LAEK E HRR+A KE + ++G RHF++
Sbjct: 180 HRDTGLIIVGGSMEVQPQMHLDPFEEEWLAEKMFESHRRAAGAKEVACSTGWQQFRHFSY 239
Query: 241 YAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESVL 300
YA+AG SG RW ++E+ + + T+A L PQHNYKLD +L +RH GE ECREFRESVL
Sbjct: 240 YAYAGMSGTRRWVHRSEDFQ-RATNAGALQPQHNYKLDASSLATRHLGEVECREFRESVL 298
Query: 301 GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDS-TKKISN 359
VMPH W+ REDT +L+HFR+H+RK++KK+ GK S P KP + PGKD+ I+
Sbjct: 299 DVMPHRWEHREDTRFELAHFRKHRRKLVKKMQGKGGSIPSEKPADKSAPGKDAHGNPIAK 358
Query: 360 LIGKAATYAGSAKSKK-PVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKL 418
++GKAA A AK KK Y+P ITN+T WWVPNVVVAH KEGIEAVHLA+GRTVCKL
Sbjct: 359 VVGKAADLAVGAKVKKQQFQYVPMITNHTSFWWVPNVVVAHLKEGIEAVHLATGRTVCKL 418
Query: 419 HLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNA 478
L EGGLHAD+NGDGVLDHVQAVGG+G+ + V +G E L+PCWA+ATSGVPVREQLFN
Sbjct: 419 FLPEGGLHADVNGDGVLDHVQAVGGHGSARIVPTGMTEALKPCWAIATSGVPVREQLFNG 478
Query: 479 SICHHSPFNLFPHGEFSRNFGRTSDVAS----LEVATPILIPRSDGHRHRKGSHGDVVFL 534
++C HSPF +F + EFS FGR +EV PI++P DG RHRKGSHGDVVFL
Sbjct: 479 TVCRHSPFQIFRYHEFSGEFGRRPHPGDAQEFVEVLAPIILPHPDGQRHRKGSHGDVVFL 538
Query: 535 TNRGEVTAYS--PGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHD 592
+RGEVT++S G +A +WQ+ T A W+ P G+ V+ TL A LR +
Sbjct: 539 NSRGEVTSFSVLGPKRGQEAQHRWQVATTAFWTTSPDLEGVG-TDRVISTLTALPLRKNG 597
Query: 593 NQQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGF 652
ILA G+ EAVV+SP GS + + LP+ + ++ +DFS D L D+IL+T +G++GF
Sbjct: 598 EVDAILAVGELEAVVLSPKGSHVAQVPLPSAGSAPIIYDDFSGDNLNDLILVTKDGIFGF 657
Query: 653 VQTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRA 692
VQTRQPGA+ F TL+G LI+VMGVIFVTQHL S K K +A
Sbjct: 658 VQTRQPGAILFLTLMGVLILVMGVIFVTQHLGSGKGKSKA 697
>gi|302768899|ref|XP_002967869.1| hypothetical protein SELMODRAFT_88168 [Selaginella moellendorffii]
gi|300164607|gb|EFJ31216.1| hypothetical protein SELMODRAFT_88168 [Selaginella moellendorffii]
Length = 685
Score = 785 bits (2027), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/697 (55%), Positives = 505/697 (72%), Gaps = 18/697 (2%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEE-YPIKFDADRLPPPIVADLNGD 59
MRKRDL IL++SAFAIFFSLQ+EG+FSF+E+WFH+ +E YPIK +A+RLPPP+V DLN D
Sbjct: 1 MRKRDLGILIISAFAIFFSLQNEGNFSFKESWFHVVDEAYPIKHEAERLPPPLVTDLNAD 60
Query: 60 GRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVI 119
GR EV+VATHD+K+ VL+PH++ ++GFSEAR + EVSLLP+K+RI++GRR VAMA GVI
Sbjct: 61 GRNEVIVATHDSKLLVLDPHSKLTEDGFSEARTITEVSLLPEKVRISAGRRPVAMAAGVI 120
Query: 120 DRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTL 179
R+Y +G+P KQVLVVVT+GW +MCFDHNL KLWE+++Q+DFP A H+E++ISISN+TL
Sbjct: 121 QRSYNEGKPRKQVLVVVTAGWVIMCFDHNLKKLWESSVQDDFPHGAFHKEVSISISNFTL 180
Query: 180 KHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFA 239
+HGDTGL+IVGG ME Q +DPFEE LAEK E+HR +A KE SE+ HF+
Sbjct: 181 RHGDTGLIIVGGSMEAQSQVYLDPFEEELLAEKEEERHRHAADAKEGSEDVNAGKTHHFS 240
Query: 240 FYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESV 299
+YAFAG +G RW K ++ + A++L PQHNY+LD +L R E CR++R+S+
Sbjct: 241 YYAFAGMTGERRWEHKMQDFHRDLSSAAELTPQHNYRLDAASLQGRQLDEVHCRDYRQSI 300
Query: 300 LGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTS-YPFHKPEEHHPPGKDSTKKIS 358
L V+PH W R+DT +L HF +H+RK KK G + YP +P + H PGKD + K++
Sbjct: 301 LQVLPHRWQSRDDTRFQLVHFVKHERKHHKKQPGSNRERYPLKRPNDPHVPGKDPSNKVA 360
Query: 359 NLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKL 418
+GKAA A SA+ K YIP ITN + WW+PNVVVAH KEGIEAVHLASGRTVCKL
Sbjct: 361 QALGKAAKLATSARPKTRYAYIPVITNQSSYWWLPNVVVAHLKEGIEAVHLASGRTVCKL 420
Query: 419 HLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNA 478
L EGGLHADINGDGVLDHVQAVGG+G V +G+ME ++PCWAVA+SG VR+QLFN
Sbjct: 421 WLHEGGLHADINGDGVLDHVQAVGGSGTH--VAAGTMEAIKPCWAVASSGTLVRQQLFNG 478
Query: 479 SICHHSPFNLFPHGEF-SRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNR 537
SIC +S + F GEF SR FGR D + +EV P+ R G +GDV+FLT+R
Sbjct: 479 SICRNSGYGAFQQGEFMSRTFGRNLDASPIEVVPPVFFLRPTG-------YGDVIFLTSR 531
Query: 538 GEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMI 597
GEVT+Y+P G+ +WQ++T A WS+ P S VPTL+ +LR + ++I
Sbjct: 532 GEVTSYTPA--GNQ---RWQIVTGAIWSSKPMLSSFG-VDRAVPTLEVMALRKNGRGEVI 585
Query: 598 LAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQ 657
LA G+QEAV+IS G + ++ LP+PP +V DFS DG DVI++T++G+YGFVQT+
Sbjct: 586 LAAGEQEAVLISSSGHQIATLALPSPPARPIVVADFSGDGFNDVIVVTASGIYGFVQTQH 645
Query: 658 PGALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASS 694
GA+FFS LVGCLI+ M VIFV QHL + K KP+ S
Sbjct: 646 QGAVFFSFLVGCLIIAMCVIFVMQHLQAPKGKPKKKS 682
>gi|302799784|ref|XP_002981650.1| hypothetical protein SELMODRAFT_115119 [Selaginella moellendorffii]
gi|300150482|gb|EFJ17132.1| hypothetical protein SELMODRAFT_115119 [Selaginella moellendorffii]
Length = 685
Score = 785 bits (2027), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/697 (55%), Positives = 504/697 (72%), Gaps = 18/697 (2%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEE-YPIKFDADRLPPPIVADLNGD 59
MRKRDL IL++SAFAIFFSLQ+EG+FSF+E+WFH+ +E YPIK +A+RLPPP+V DLN D
Sbjct: 1 MRKRDLGILIISAFAIFFSLQNEGNFSFKESWFHVVDEAYPIKHEAERLPPPLVTDLNAD 60
Query: 60 GRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVI 119
GR EV+VATHD+K+ VL+PH++ ++GFSEAR + EVSLLP+K+RI++GRR VAMA GVI
Sbjct: 61 GRNEVIVATHDSKLLVLDPHSKLNEDGFSEARTVTEVSLLPEKVRISAGRRPVAMAAGVI 120
Query: 120 DRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTL 179
R+Y +G+P KQVLVVVT+GW +MCFDHNL KLWE+++Q+DFP A H+E++ISISN+TL
Sbjct: 121 QRSYNEGKPRKQVLVVVTAGWVIMCFDHNLKKLWESSVQDDFPHGAFHKEVSISISNFTL 180
Query: 180 KHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFA 239
+HGDTGL+IVGG ME Q +DPFEE LAEK E+HR +A KE SE+ HF+
Sbjct: 181 RHGDTGLIIVGGSMEAQSQVYLDPFEEELLAEKEEERHRHAADAKEGSEDVNAGKTHHFS 240
Query: 240 FYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRESV 299
+YAFAG +G RW K ++ + A++L PQHNY+LD +L R E CR++R+S+
Sbjct: 241 YYAFAGMTGERRWEHKMQDFHRDLSSAAELTPQHNYRLDAASLQGRQLDEVHCRDYRQSI 300
Query: 300 LGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTS-YPFHKPEEHHPPGKDSTKKIS 358
L V+PH W R+DT +L HF +H+RK KK G + YP +P + H PGKD + K++
Sbjct: 301 LQVLPHRWQSRDDTRFQLVHFVKHERKHHKKQPGSNRERYPLKRPNDPHVPGKDPSNKVA 360
Query: 359 NLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKL 418
+GKAA A SA+ K YIP ITN + WW+PNVVVAH KEGIEAVHLASGRTVCKL
Sbjct: 361 QALGKAAKLATSARPKTRYAYIPVITNQSSYWWLPNVVVAHLKEGIEAVHLASGRTVCKL 420
Query: 419 HLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNA 478
LQEGGLHADINGDGVLDHVQAVGG+G V G+ME ++PCWAVA+SG VR+QLFN
Sbjct: 421 WLQEGGLHADINGDGVLDHVQAVGGSGTH--VAPGTMEAIKPCWAVASSGTLVRQQLFNG 478
Query: 479 SICHHSPFNLFPHGEF-SRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNR 537
SIC +S + F GEF SR FGR D + +EV P+ R G +GDV+FLT+R
Sbjct: 479 SICRNSGYGAFQQGEFMSRTFGRNLDASPIEVVPPVFFLRPTG-------YGDVIFLTSR 531
Query: 538 GEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMI 597
GEVT+Y+P + +WQ++T A WS+ P S VPTL+ +LR + ++I
Sbjct: 532 GEVTSYTPAGNQ-----RWQIVTGAIWSSKPMLSSFG-VDRAVPTLEVMALRKNGRGEVI 585
Query: 598 LAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQ 657
LA G+QEAV+IS G + ++ LP+PP +V DFS DG DVI++T++G+YGFVQT+
Sbjct: 586 LAAGEQEAVLISSSGHQIATLALPSPPARPIVVADFSGDGFNDVIVVTASGIYGFVQTQH 645
Query: 658 PGALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASS 694
GA+FFS LVGCLI+ M VIFV QHL + K KP+ S
Sbjct: 646 QGAVFFSFLVGCLIIAMCVIFVMQHLQAPKGKPKKKS 682
>gi|168002559|ref|XP_001753981.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694957|gb|EDQ81303.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 721
Score = 740 bits (1910), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/718 (54%), Positives = 497/718 (69%), Gaps = 30/718 (4%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
MRKRDL IL+LSA AIFFSLQHEG SFR+AW H + PI+ A+RLPPP+VADLNGDG
Sbjct: 1 MRKRDLGILLLSALAIFFSLQHEGSLSFRQAWVHYKDGSPIQHVAERLPPPVVADLNGDG 60
Query: 61 RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
EV+VA D K+QV++P +GF+EARVLAEVSLLPD++R+ +GRR VA+A G +
Sbjct: 61 HVEVVVALED-KLQVIDPRVSFASDGFAEARVLAEVSLLPDRVRVGAGRRPVALAAGEVK 119
Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
RTYR + KQV+VVVT+GW++MCFDHNL K+WE ++Q+DFP +HH+E+AISISNYTLK
Sbjct: 120 RTYRSLERKKQVIVVVTAGWNIMCFDHNLKKIWEDDVQDDFPHGSHHKEVAISISNYTLK 179
Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENS----GTVDLR 236
HGDTGLVIVGG ME+QP +D FEE LAEK E H+ SA E G R
Sbjct: 180 HGDTGLVIVGGSMEVQPQAHLDLFEEELLAEKATEVHQVSAVGIEVGCTGPLGRGRGCNR 239
Query: 237 HFAFYAFAGRSGLLR---WSR-----------KNENIEAQPTDASQLIPQHNYKLDVHAL 282
++ G S LL W+ K+++ + P + + L PQHNY+LD ++L
Sbjct: 240 EDGWWK--GASFLLLCICWNDWRPSLDSPKRGKDDDFQRAPGNGT-LQPQHNYRLDANSL 296
Query: 283 NSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHK 342
+RH GE ECREFRESVLGVMPH W+ REDT +LSHF++H RK++K++ GK + P K
Sbjct: 297 ATRHLGEVECREFRESVLGVMPHRWEHREDTRFELSHFQKHYRKLVKRIHGKEKNVPSQK 356
Query: 343 PEEHHPPGKDS-TKKISNLIGKAATYAGSAKSKKP-VNYIPTITNYTQLWWVPNVVVAHQ 400
P + + PGK+S + I+ ++G A A K +K ++P ITN+T WWVPNV+VAH
Sbjct: 357 PADKNTPGKESLSNPITKVVGNAVQLAVGPKVRKERFQHVPMITNHTSHWWVPNVIVAHL 416
Query: 401 KEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRP 460
KEGIEAVHLA+GRTVCKL+L EGGLHADINGDGVLDHVQAVGG+ + V +G L+P
Sbjct: 417 KEGIEAVHLATGRTVCKLYLPEGGLHADINGDGVLDHVQAVGGHRGTRLVPTGMTTTLKP 476
Query: 461 CWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVAS---LEVATPILIPR 517
CWA+ATSGV VREQLFN ++C HSPF + H +FS R + +EV PI++P
Sbjct: 477 CWAIATSGVHVREQLFNGTVCRHSPFQAYSHFDFSGEIDRRLNPEMGDFVEVLAPIILPH 536
Query: 518 SDGHRHRKGSHGDVVFLTNRGEVTAYS--PGLHGHDAIWQWQLLTDATWSNLPSPSGMTE 575
DG RHRKGSHGD++FL +RGEVT++S +A +WQL+T A W+ G E
Sbjct: 537 PDGKRHRKGSHGDIIFLNSRGEVTSFSVLGPKRDQEAEHRWQLVTGAYWATSSELQGY-E 595
Query: 576 ASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSN 635
+ VVP+L A LR + + ILA G+ E VVISP GS +T LPA + ++ DFS
Sbjct: 596 SDRVVPSLTALPLRKNGEPEAILAIGEIEGVVISPKGSPVTEFPLPASASAPIIYADFSG 655
Query: 636 DGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRAS 693
DGL D+IL+T +G+YGFVQ R+PGA+ FSTLVG LI+VMGVIFVTQHL + K K R +
Sbjct: 656 DGLNDLILVTHDGIYGFVQARRPGAILFSTLVGVLILVMGVIFVTQHLGTGKGKSRPA 713
>gi|357483603|ref|XP_003612088.1| hypothetical protein MTR_5g021150 [Medicago truncatula]
gi|355513423|gb|AES95046.1| hypothetical protein MTR_5g021150 [Medicago truncatula]
Length = 780
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/376 (81%), Positives = 339/376 (90%), Gaps = 2/376 (0%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
MRKRDLAILML AFAIFFSLQ +G SF++AW HL++EYPIK++A+RLPPP+VADLNGDG
Sbjct: 1 MRKRDLAILMLCAFAIFFSLQQDGGVSFKDAWMHLTDEYPIKYEAERLPPPVVADLNGDG 60
Query: 61 RKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID 120
+KEVLVATHDAKIQ+LEPH+RRVDEGFSEARVLAEVSLLPDK+R+ SGRR VAMATG ID
Sbjct: 61 KKEVLVATHDAKIQILEPHSRRVDEGFSEARVLAEVSLLPDKVRVMSGRRPVAMATGFID 120
Query: 121 RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLK 180
R +R GQP KQVLVVVTSGW VMCFD NL KLWE NLQEDFP NAHHRE++ISISNYTLK
Sbjct: 121 R-HRIGQPHKQVLVVVTSGWFVMCFDSNLQKLWENNLQEDFPHNAHHREVSISISNYTLK 179
Query: 181 HGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAF 240
HGDTGL+IVGGRMEMQPH MDPFEE+G+ + AEQHRRSA+EKEASEN+GTVDLRHFAF
Sbjct: 180 HGDTGLIIVGGRMEMQPHIFMDPFEEMGMGARFAEQHRRSATEKEASENTGTVDLRHFAF 239
Query: 241 YAFAGRSGLLRWSRKNENIEAQP-TDASQLIPQHNYKLDVHALNSRHPGEFECREFRESV 299
YAFAGRSG+ RWSRK ENIEA +DASQLIPQHNYKLDVHALN R PGEFECREFRES+
Sbjct: 240 YAFAGRSGVERWSRKTENIEAAASSDASQLIPQHNYKLDVHALNRRQPGEFECREFRESI 299
Query: 300 LGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISN 359
LGVMPH WDRREDTLLKL HF RHKRK LKK GK+ +YPF KPEE+HPPGKDSTKKISN
Sbjct: 300 LGVMPHQWDRREDTLLKLVHFNRHKRKTLKKTPGKTINYPFDKPEENHPPGKDSTKKISN 359
Query: 360 LIGKAATYAGSAKSKK 375
+IGKAA +AGSAKSKK
Sbjct: 360 IIGKAANFAGSAKSKK 375
Score = 554 bits (1427), Expect = e-155, Method: Compositional matrix adjust.
Identities = 256/319 (80%), Positives = 288/319 (90%)
Query: 379 YIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHV 438
Y+PTITNYT++WWVPNVVVAH KEGIE +HLASGRT+CKLHLQEGGLHADINGDGVLDHV
Sbjct: 462 YVPTITNYTKVWWVPNVVVAHLKEGIEVLHLASGRTLCKLHLQEGGLHADINGDGVLDHV 521
Query: 439 QAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNF 498
QAVGGNGAEQTVVSGSM+VLRPCWAVATSGVPVREQLFN SICH++ FNLF HGE R F
Sbjct: 522 QAVGGNGAEQTVVSGSMDVLRPCWAVATSGVPVREQLFNVSICHYTHFNLFQHGELYRGF 581
Query: 499 GRTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQL 558
R SD++SLEVATPILIPRSDGH+HRKGSHGDV+FLTNRGE+T+++PGLHGHDA+WQWQ
Sbjct: 582 NRGSDMSSLEVATPILIPRSDGHKHRKGSHGDVIFLTNRGEITSHTPGLHGHDAVWQWQQ 641
Query: 559 LTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSI 618
T TWSNLPSP+GM E V+PTLK F LR+HDN +MILA G+QEAVVISPGGSIL +I
Sbjct: 642 STGVTWSNLPSPAGMMEGGLVIPTLKPFPLRLHDNHEMILAAGEQEAVVISPGGSILATI 701
Query: 619 DLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVIF 678
+LP PTH L+ EDFSNDGLTD+IL+TS+GVYGFVQTRQPGALFFS L+GCLIVVMG+IF
Sbjct: 702 ELPGSPTHVLIREDFSNDGLTDLILVTSSGVYGFVQTRQPGALFFSVLIGCLIVVMGIIF 761
Query: 679 VTQHLNSVKAKPRASSGLR 697
VTQH+NS+K KPR SSG R
Sbjct: 762 VTQHINSMKGKPRPSSGPR 780
>gi|167997451|ref|XP_001751432.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162697413|gb|EDQ83749.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 549
Score = 621 bits (1602), Expect = e-175, Method: Compositional matrix adjust.
Identities = 311/543 (57%), Positives = 393/543 (72%), Gaps = 9/543 (1%)
Query: 158 QEDFPPNAHHREIAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQH 217
Q+DFP +HH+E+AISISNYTLKHGDTGL+IVGG ME+QP +DPFEE +AE+ E H
Sbjct: 1 QDDFPHGSHHKEVAISISNYTLKHGDTGLIIVGGSMEVQPQVHLDPFEEELMAEQVIESH 60
Query: 218 RRSASEKE-ASENSGTVDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYK 276
RRSA KE ++ G RHF++YA+AG +G RW ++E+ + TDA L PQHNY+
Sbjct: 61 RRSAGAKEDVTKKMGGGKERHFSYYAYAGMTGTRRWVHRSEDFR-RTTDAGALQPQHNYR 119
Query: 277 LDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKST 336
LD +L +RH GE ECREFRESVLGVMPH W+ REDT +L+HFR+H+RK++KKV GK
Sbjct: 120 LDASSLATRHLGEVECREFRESVLGVMPHRWEHREDTRFELAHFRKHRRKLVKKVPGKDN 179
Query: 337 SYPFHKPEEHHPPGKDST-KKISNLIGKAATYAGSAKSKKP-VNYIPTITNYTQLWWVPN 394
S P KP + + PG D+ I+ ++GKAA A AK +K Y+P ITN+T WWVPN
Sbjct: 180 SIPSEKPADRNTPGMDARGNPIAKVVGKAADLAVGAKVRKQQFQYVPVITNHTSHWWVPN 239
Query: 395 VVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGS 454
VVVAH KEGIEAVHLA+GRTVCKL L EGGLHAD+NGDGVLDHVQAVGG+G+ + V +G
Sbjct: 240 VVVAHLKEGIEAVHLATGRTVCKLFLPEGGLHADVNGDGVLDHVQAVGGHGSARIVPTGM 299
Query: 455 MEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRT--SDVASLEVATP 512
E L+PCWAVATSGVPVR QLFN ++C HSPF +FPH EFS +FGR A ++V P
Sbjct: 300 TEALKPCWAVATSGVPVRAQLFNGTVCRHSPFQMFPHNEFSGDFGRRPHPGDALVQVVAP 359
Query: 513 ILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYS--PGLHGHDAIWQWQLLTDATWSNLPSP 570
I++P DG RHRKGSHGDVVFL +RGEVT++S G +A +WQ+ T A W+ P
Sbjct: 360 IILPHPDGQRHRKGSHGDVVFLNSRGEVTSFSVLGPRRGQEAQHRWQVTTTAYWTTSPDL 419
Query: 571 SGMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVC 630
G+ A ++PTL A LR + ILA G+ EAVV+SP GS +T + LP P + ++
Sbjct: 420 QGVGTAR-IIPTLIALPLRKNGEADAILAVGEAEAVVLSPKGSYVTQVFLPTPGSAPMIY 478
Query: 631 EDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVKAKP 690
+DFS D L D+IL++ +G+YGFVQTRQPGA+ F +LVG LI+VMGVIFVTQH + K K
Sbjct: 479 DDFSGDKLNDLILVSEDGIYGFVQTRQPGAILFVSLVGVLILVMGVIFVTQHWGTGKGKS 538
Query: 691 RAS 693
RA+
Sbjct: 539 RAA 541
>gi|384253534|gb|EIE27009.1| hypothetical protein COCSUDRAFT_83568 [Coccomyxa subellipsoidea
C-169]
Length = 697
Score = 477 bits (1227), Expect = e-132, Method: Compositional matrix adjust.
Identities = 296/725 (40%), Positives = 399/725 (55%), Gaps = 71/725 (9%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDG 60
M KRD A+L+LSAFAI+F++QHEG FS+R AW+ L ++ ++ D LPPP+ ADLNGDG
Sbjct: 1 MYKRDFAVLILSAFAIYFTIQHEGAFSYRRAWYTLHDQALLE-STDVLPPPVAADLNGDG 59
Query: 61 RKEVLVATHDAKIQVLEPH---ARRVDEGFSEARVLAEVSLLPDKIRIA-SGRRAVAMAT 116
R EV+ ATHDAK+QV P+ + + EGF++A +LAEVSL PD A S RAVA+A
Sbjct: 60 RVEVITATHDAKLQVYTPYPPSGKSLWEGFAKAALLAEVSLAPDGPNEALSMHRAVALAA 119
Query: 117 GVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHH--REIAISI 174
G +D + VLVVVT+ W V+CFDHNL W A ++ED P AH +E+AI I
Sbjct: 120 GYLDPKRDE-----LVLVVVTANWDVICFDHNLRLQWTARVKEDVPHGAHMAIKEVAIHI 174
Query: 175 SNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGT-- 232
SN+T + GD G+V++GG +++ LA + E + G
Sbjct: 175 SNHTARVGDRGIVVIGGSVDLG-----------DLAHQGDEGESDLIDTRPCLGMGGKHQ 223
Query: 233 -VDL-RHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEF 290
VD+ RHF++YAF G G LRW + + QL+P ++++D AL +RH GE
Sbjct: 224 GVDISRHFSYYAFDGGFGALRWKHEANFHRDTEKLSQQLLPALDFRMDAAALEARHFGEV 283
Query: 291 ECREFRESVLGVMPHHWDRREDTLLKLSHFRRH-------KRKILKKVVGKSTSYPFHKP 343
CR+FRESVL V+PH W RR DT L+L+HF H KR++ + P P
Sbjct: 284 ACRDFRESVLAVLPHRWARRSDTRLQLAHFVHHRPHGGDRKRRLADR------GDPSGGP 337
Query: 344 EEHHPPGKDSTKK-------ISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVV 396
+ PG+ + + ++ +GK A A ++ PNV+
Sbjct: 338 PKPRRPGRQADRAGVHHSNPVAKALGKVCPVAERAVLGGGTGGAAGAAQHSVHGVAPNVL 397
Query: 397 VAHQKEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSME 456
VAH +EGIEAVHL SGRT+CKLHL GLHAD+NGDGVLDHVQ GG+G+ G
Sbjct: 398 VAHLEEGIEAVHLYSGRTLCKLHLPSPGLHADLNGDGVLDHVQVAGGHGSPDGGSPGHRH 457
Query: 457 VLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSR-------NFGRTSDVASLEV 509
CWAVA SG+P RE LFNA+IC FP G++ +FG V LEV
Sbjct: 458 T-PGCWAVAKSGIPPREPLFNATICR------FP-GQWGAESSARAFSFGTDGGVVPLEV 509
Query: 510 ATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPS 569
A P +P + HG FL +RGEVTAY+ HG WQ WSN
Sbjct: 510 APPAFLPVPGPRGLYQQGHGLAAFLNSRGEVTAYN--AHGERL---WQHAMGLAWSNR-R 563
Query: 570 PSGM---TEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSIDLPAPPTH 626
P G+ A V PTL+A LR ILA G AVV+S G+ L ++ PA P
Sbjct: 564 PGGLGGGNAARRVAPTLEALPLRAGAIPAAILASGSWSAVVLSEHGNRLETLQFPALPAL 623
Query: 627 ALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSV 686
L DF+ DGL D++L++ +G+YG+ Q R PG + F+ L+ CLIV M V++ TQ
Sbjct: 624 PLQILDFNGDGLNDIVLVSHDGLYGWAQVRHPGGVPFAVLIACLIVAMAVVYFTQQAGPE 683
Query: 687 KAKPR 691
+ + +
Sbjct: 684 RGRTK 688
>gi|118488663|gb|ABK96143.1| unknown [Populus trichocarpa]
Length = 242
Score = 452 bits (1163), Expect = e-124, Method: Compositional matrix adjust.
Identities = 212/243 (87%), Positives = 228/243 (93%), Gaps = 1/243 (0%)
Query: 455 MEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPIL 514
MEVL+PCWAVATSGVPVREQLFNASICHHSPFNLF HG+F RNFGRT DV+SLEVATPIL
Sbjct: 1 MEVLQPCWAVATSGVPVREQLFNASICHHSPFNLFQHGDFGRNFGRT-DVSSLEVATPIL 59
Query: 515 IPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMT 574
IPRSDGHRHRKGSHGDVVFLTNRGEVT+YSPGLHGHDA+WQWQ+LT ATWSNLPSPSGM
Sbjct: 60 IPRSDGHRHRKGSHGDVVFLTNRGEVTSYSPGLHGHDAVWQWQILTGATWSNLPSPSGMM 119
Query: 575 EASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFS 634
E VVPTLKAFSLR HDNQQMILA GDQEA VISPGGS+ TS DLPAPPTHAL+CEDF+
Sbjct: 120 EGGMVVPTLKAFSLRAHDNQQMILAAGDQEAAVISPGGSVQTSFDLPAPPTHALICEDFT 179
Query: 635 NDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVKAKPRASS 694
NDGLTD+I++TSNGVYGFVQTR PGALFFSTLVGCL++VMGVIFVTQH+NS+K KPRASS
Sbjct: 180 NDGLTDLIVVTSNGVYGFVQTRSPGALFFSTLVGCLLIVMGVIFVTQHINSIKGKPRASS 239
Query: 695 GLR 697
GLR
Sbjct: 240 GLR 242
>gi|255074829|ref|XP_002501089.1| predicted protein [Micromonas sp. RCC299]
gi|226516352|gb|ACO62347.1| predicted protein [Micromonas sp. RCC299]
Length = 1195
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 269/753 (35%), Positives = 376/753 (49%), Gaps = 112/753 (14%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWF----HLSEEYPIKFDADRL-------- 48
+ KRDL +++L+ ++FSLQ+EG SF+ AW L E+ FDAD +
Sbjct: 92 LAKRDLGVILLALLGVYFSLQNEGVVSFQRAWHVPGDALGED---DFDADPISHRAKDAH 148
Query: 49 -PPPIVADLNGDGRKEVLVATH-DAKIQVLEPHARRVD---------------------- 84
P P+ DLNGDGR EV+VA+ + +I++ P A + +
Sbjct: 149 APRPVFFDLNGDGRNEVIVASSSEPEIRIASPPAGKREAPGVSSDRDSRLEGSGRSVRDD 208
Query: 85 --------EGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVIDRTYRQGQPLKQ----V 132
+G+ AR +A SL+P +R+A+GRRAVA+A G ID + +K+ V
Sbjct: 209 DDDEYAWRDGWIPARTVASASLMPSNVRVAAGRRAVALAAGHIDPPVAKDASVKKTNKGV 268
Query: 133 LVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLKHGDTGLVIVGGR 192
+VVVT+ W V+CFDHNL +WE +LQ +FP +A E+A+ +SN T+ GD G V+VGGR
Sbjct: 269 VVVVTAAWHVLCFDHNLKLMWENSLQAEFPRHARVAEVAVLVSNATMFEGDRGSVVVGGR 328
Query: 193 MEMQP--HTIMDPFEEIGLAEKNAEQHR---RSASEKEASENS------------GTVDL 235
+E+ DP EE E HR R A + A N G
Sbjct: 329 VELGDLDSDDEDPLEEELAHEDMMLGHRGGRRVAPDDLADANEVLHGRGRKGKGVGVDRS 388
Query: 236 RHFAFYAFAGRSGLLRWSRKNENIEAQPTD-ASQLIPQHNYKLDVHALNSRHPGEFECRE 294
RHF +YAF G +G RW ++E+ A +L PQHNY+L A + H GE CR+
Sbjct: 389 RHFNYYAFDGATGSTRWKHESEDFHRDLDGLADRLTPQHNYRLTAQASSGHHYGEVACRD 448
Query: 295 FRESVL-GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDS 353
FRESV+ +PH W REDT L+L+HFR H+ +V + G +
Sbjct: 449 FRESVVVNALPHAWREREDTALRLAHFRHHRTAKGARVSKVGSGRRPAVGAAGGAEGAEH 508
Query: 354 TKKISNLIGKAATYAGSAKSKKPV-------NYIPTITNYTQLWWVPNVVVAHQKEGIEA 406
T ++ + A A P + P + PNV+VAHQ+ G+E
Sbjct: 509 TNPVARALAGAINAAWRGNGAPPAPDRRGGRGHSPDGQIGAKHRAPPNVIVAHQEGGVEV 568
Query: 407 VHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAE---QTVVSGSMEVLRPCWA 463
+HL SG+T+CK+ + GGLHADINGDGV+DHVQA G G + V + CWA
Sbjct: 569 IHLYSGKTLCKMAMPPGGLHADINGDGVVDHVQAHGSEGVDAGGHPVTGADGKPAPDCWA 628
Query: 464 VATSGVPVREQLFNASICHHSP---------FNLFPHGEFSRNFGRTSDVASLEVATPIL 514
ATSGVPVRE LF S+C S F P G + N ++++ P
Sbjct: 629 QATSGVPVREHLFGGSVCRGSAGVVRHGSGHFKGEPGGGYVSN-------RAVQIVAPAQ 681
Query: 515 IPRSDGH--RHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSG 572
+ R + R + + DVVFL +RGEVT Y GHD +WQ TDA+W+ P G
Sbjct: 682 LRRGEEKIVRSLRRAVKDVVFLNSRGEVTCY-----GHDGTRRWQQRTDASWA--PGDPG 734
Query: 573 MTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCED 632
VV +L +F LRV +++LAGG A +++P G L + LP P AL+ D
Sbjct: 735 ------VVASLASFPLRVGGASEVVLAGGATHAALLTPSGYRLNAFKLPGKPVAALLVVD 788
Query: 633 FSNDGLTDVILMTSNG-VYGFVQTRQPGALFFS 664
DGL DV+ T +G VY + Q PG F+
Sbjct: 789 VDGDGLNDVVARTKDGDVYAWRQRSHPGLAPFT 821
>gi|330802543|ref|XP_003289275.1| hypothetical protein DICPUDRAFT_98309 [Dictyostelium purpureum]
gi|325080624|gb|EGC34171.1| hypothetical protein DICPUDRAFT_98309 [Dictyostelium purpureum]
Length = 709
Score = 258 bits (660), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 206/748 (27%), Positives = 349/748 (46%), Gaps = 125/748 (16%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPI---KFDA-------DRLPP 50
MRKRD +L+LS IF SL + D+ L E PI +D + LP
Sbjct: 1 MRKRDTWLLLLSLVVIFISLYRQKDYKIS-----LLLEIPIDTNNYDNSHYPKINEYLPL 55
Query: 51 PIVADLNGDGRKEVLVATHDAKIQVLEP------HARRVDEG-FSEARVLA-----EVSL 98
PI+ D++GDG+ E++ T+D ++ VLE R++E SE +L +VSL
Sbjct: 56 PIITDIDGDGKNEIIYVTNDYRLIVLETVTYENIELFRLNENELSEKNLLQLPIKYQVSL 115
Query: 99 LPDKIRIASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQ 158
++ +A+GRR ++ TG Y G P +Q++VVVT GWS++CF+H L KLWE+
Sbjct: 116 -KSEVGLATGRRPTSIKTGFT-SPYVPGVPRQQIIVVVTDGWSILCFNHQLKKLWESYAS 173
Query: 159 EDFPPNAHHREIAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHR 218
++ PN + EI+ISI + + D G+VI+GGRME + +I + E E +
Sbjct: 174 DEVLPNHYQSEISISI----VPNSDKGMVIIGGRMEPMEGYVHKSHYKIPMYEGKIETNE 229
Query: 219 RSASEKEASENSGT-VDLRHFAFYAFAGRSGLLRWSRK-------NENIEAQPTDASQLI 270
+ E++ T D HF+++A+ G G WS + N++ + + S +
Sbjct: 230 EHSHERDDDHEKETHRDESHFSYFAYDGEKGYKIWSHEENDFMPVNKHTDEEHRKESD-V 288
Query: 271 PQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKK 330
H++K +++ H GE + R +R+SVL +PH W DT L HF + +
Sbjct: 289 SLHSFKQHIYS-QLDHLGEVDWRTYRDSVLAALPHKWSTGYDTKLDARHFSKKINTVRGN 347
Query: 331 VVGKSTSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGS---AKSKKPVNYIPTITNYT 387
V +T+ G ++ + S LIG + S KS+ N+ I +
Sbjct: 348 VETSTTT------------GLNNQEWDSELIGVNPSNLESFSILKSQDKANFQSNIKD-- 393
Query: 388 QLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHA---------DINGDGVLDHV 438
PNV+VAH K G+E V ++SG T+C+L L H+ D+NGDGVLD +
Sbjct: 394 -----PNVIVAHNKNGLEVVQISSGNTLCRLVLDSSDFHSNGNYFISYIDLNGDGVLDQI 448
Query: 439 -------QAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPH 491
+ +G +G ++TV C A+ SG+P RE++F+ IC+ F
Sbjct: 449 TTYAGSFKGLGEDGGKETV----------CKALGMSGIPAREKIFDLKICNEGMF----- 493
Query: 492 GEFSRNFGRTSDVASLEVATPILIPRSDG---------HRHRKGSHGDVVFLTNRGEVTA 542
+F + +D +L +++ R + +VFL N G +++
Sbjct: 494 -DFEYFMWKGTDKNNLNNKKKTKSTKNERLIQTIPPAFFRSSENDQIQMVFLVNTGLISS 552
Query: 543 YSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQM--ILAG 600
++ W++ T+ W+ P P+L+ FS N+ + ILA
Sbjct: 553 FNS-----KGSKMWKIETNIHWTRHIEP-------ITQPSLQVFSFESDTNRLVPFILAV 600
Query: 601 GDQEAVVISPGGSILTSIDLPAPPTHAL----VCEDFSNDGLTDVILMTSNGVYGFVQTR 656
G++ ++S G ++ L + T+ + + DF+NDG+ D ++ T G Y T
Sbjct: 601 GERMMAIVSEKGDMVLEQPLDSSETNPVMAPPIVGDFNNDGINDFLITTLKG-YSIYITE 659
Query: 657 QPGALFFSTLVGCLIVVMGVIFVTQHLN 684
+ + ++G L++ + ++F+ N
Sbjct: 660 KGTSSSLLPIIGMLLIGVLLVFLLSSSN 687
>gi|440793762|gb|ELR14937.1| FGGAP repeat domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 581
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 175/565 (30%), Positives = 268/565 (47%), Gaps = 94/565 (16%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDF-SFREAWFH-----LSEEYPIKFDADRLPPPIVA 54
M+++D+AIL L+ AI S+ ++G + R W H L E +RLPPP++
Sbjct: 1 MKQKDVAILFLATLAILLSVYYQGTYHGLRLVWVHPIDASLYENGHFPLQHERLPPPLIT 60
Query: 55 DLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEAR-----VLAEVSLLPDKIRIASGR 109
D++GDG EV+V T D++I++LE + S ++ V E SLLP + + +GR
Sbjct: 61 DVDGDGHNEVIVVTSDSRIKILEGTPSQSAGLGSPSQWHNLPVKEEASLLP-SVGVGAGR 119
Query: 110 RAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHRE 169
R VA+++G + Y++G QV+VV+TSGW+V+ FD++L LWE+ ++E P + +H E
Sbjct: 120 RPVALSSGYLS-PYQEGLVRTQVIVVLTSGWTVLLFDNHLKLLWESTVRESLPAHLYHSE 178
Query: 170 IAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEAS-- 227
A+ + ++ GD G+VI R+ + A + A +H A ++AS
Sbjct: 179 AALLVVPTPVRIGDAGVVIAAARLSRK-------------ANQQAAKHDHGAGGRKASWM 225
Query: 228 ------ENSGTVDLRHFAFYAFAGRSGLLRWSRKNENI-EAQPTDASQLIPQHNYKLDVH 280
E+ + HF+++AF GR+G LRW + + + TDA I +
Sbjct: 226 ANGEEDEDDHDLGEEHFSYFAFDGRTGALRWKHEAGDFYDEYDTDAGDRIGEE------- 278
Query: 281 ALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPF 340
+ H GE +FR SV+ +PH W RREDT L+++HF R +
Sbjct: 279 --RAMHLGEMSWNDFRYSVMQQLPHAWFRREDTGLRVAHFERGAK--------------- 321
Query: 341 HKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQ 400
P D+T + S L G G + + PNV+VAH
Sbjct: 322 --------PAADTTAQ-SQLSGLHVDVPGLVWGGHAPHSEDDFVHE------PNVIVAHF 366
Query: 401 KEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRP 460
+G+E +HL +GR V ++ L+ + DINGDGV++ V+ T+ +
Sbjct: 367 ADGVEVIHLYTGRPVTRMGLERSVVWDDINGDGVVEAVRL-------STLPTAGQSSEDQ 419
Query: 461 CWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTS--DVASLEVATPILIPR- 517
CWA+ SGVP +E LFN SIC P + N R S D L PI I R
Sbjct: 420 CWALGQSGVPPQEDLFNVSICEPKPQGFLE--LLTSNPKRKSRGDTGVLRATNPISIARP 477
Query: 518 -----SDGHRHRKGSHGDVVFLTNR 537
+GH K D VFL R
Sbjct: 478 RTWAEPNGH---KDMWADDVFLAAR 499
>gi|66819765|ref|XP_643541.1| hypothetical protein DDB_G0275747 [Dictyostelium discoideum AX4]
gi|60471668|gb|EAL69624.1| hypothetical protein DDB_G0275747 [Dictyostelium discoideum AX4]
Length = 799
Score = 221 bits (564), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 162/553 (29%), Positives = 260/553 (47%), Gaps = 92/553 (16%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDA----------DRLPP 50
MRKRD +L++S +F SL + D+ +L E PI + + LP
Sbjct: 1 MRKRDTWLLLVSILVVFISLYRQKDYKL-----NLLLEIPIDTNNYENSHYPKINEFLPL 55
Query: 51 PIVADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFS---------------------- 88
PI+ DLNGDG E++ T+D KI++L+ E F
Sbjct: 56 PIITDLNGDGFNEIIYVTNDFKIRILDTVTPNNIEKFRFGVNVNENENENENDNDQNSLL 115
Query: 89 EARVLAEVSLLPDKIRIASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHN 148
E VL E SLL D + +A+GRR +A+ TG + G+ +Q++VVVT GW+V+CFD+
Sbjct: 116 ELPVLYESSLLSD-VGLATGRRPIAIKTGYAKHSI-PGERRQQIIVVVTDGWAVLCFDNK 173
Query: 149 LNKLWEANLQEDFPPNAHHREIAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIG 208
L KLWE+ ++ P + EI+I+I T+ D G+VI+GGRME P I P I
Sbjct: 174 LKKLWESYASDEVLPGHYQSEISITILPTTIP-DDKGIVIIGGRMEPLPGVIHKPHYSIP 232
Query: 209 LAEKNAEQHRRSASEKEASENSGTV--DLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDA 266
+ + E + +++ +S T + HF+++++ + G RWS + + + +
Sbjct: 233 MFGEKVETDKDHDHKRDDDHDSFTAHREENHFSYFSYDCKDGYKRWSHEENDFKPENPHT 292
Query: 267 SQL------IPQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHF 320
++ I H++K + + H GE + + +R+S+L +PH W DT L+ HF
Sbjct: 293 TEEHKKDSDISLHSFKQHIFS-QLEHLGEVDWKTYRDSILASLPHKWSSNFDTKLEPRHF 351
Query: 321 RRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISNLIGKAAT--------YAGSAK 372
K + + G + + D T I+ LIG + T Y+ +
Sbjct: 352 ----EKKINTIRGTTNN--------------DGTNVINGLIGSSNTLNNEMIEFYSKNFD 393
Query: 373 SKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHA----- 427
+ ++ I+ +Q PNV+V H K GIE +H+ SG+T+CKL L +
Sbjct: 394 TFSILSKKDQISFKSQHIENPNVIVTHHKYGIEVIHIVSGKTLCKLVLDGTDFYTDSNRY 453
Query: 428 ----DINGDGVLDHVQAVGGN--------GAEQTVVSGSMEVLRPCWAVATSGVPVREQL 475
D+NGDGVLD V++ GN + + C +A SG+P +E L
Sbjct: 454 IVYLDLNGDGVLDQVRSYAGNFKNSQLGRNKKNKNIGDDNNEDSYCVGLALSGLPPKETL 513
Query: 476 FNASICHHSPFNL 488
F+ IC FN+
Sbjct: 514 FDLKICGGQIFNM 526
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 73/156 (46%), Gaps = 27/156 (17%)
Query: 531 VVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRV 590
+F+ N G ++ + G IW+ + T WS P P+L+ FS +
Sbjct: 626 TLFMVNTGLISCFDTASFGGKKIWR--VDTGIHWSKHLEPR-------TFPSLQVFSFDI 676
Query: 591 HDNQQMILAGGDQEAVVISPGGSILT---------SIDLPAPPTHALVCEDFSNDGLTDV 641
Q ILA G+Q V+++ G IL S + APP V DF+NDG+ D
Sbjct: 677 QSTQPFILAIGEQSMVILNENGEILVDQQIWNAGESDPITAPP----VVADFNNDGINDF 732
Query: 642 ILMTSNGVYGFVQTRQPGALFFSTLVGCL-IVVMGV 676
++ T +G + FV + +STL+ + +V++G+
Sbjct: 733 LITTLSGYHIFVTEKGT----YSTLLPMIGLVLIGM 764
>gi|328867416|gb|EGG15798.1| hypothetical protein DFA_09466 [Dictyostelium fasciculatum]
Length = 802
Score = 221 bits (563), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 198/760 (26%), Positives = 329/760 (43%), Gaps = 140/760 (18%)
Query: 15 AIFFSLQHEGDFSFREAWFHLSEEYPIKFDA----------DRLPPPIVADLNGDGRKEV 64
A+ SL D+S HLS PI + + LP PI+ DLNGD R ++
Sbjct: 54 AVLVSLYTHQDYSL-----HLSLILPIDSNNYENNHFPQPHELLPQPIITDLNGDQRNQI 108
Query: 65 LVATHDAKIQVLEP------HARRVDEGFSEAR--VLAEVSLLPDKIRIASGRRAVAMAT 116
+ T+D KI++++P + G R ++ E SL K I+SGRR VA+ T
Sbjct: 109 IYVTNDQKIRIVDPLYAINNMNTNNENGRIVGRNDIIYEASL-SSKFGISSGRRPVALKT 167
Query: 117 GVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISN 176
G + +QV+VVV WSV+C++ L LWE+ + +D P N + EIAI I
Sbjct: 168 GYTTPSGTAAN-REQVIVVVLDDWSVLCYNSRLKPLWESFIIDDIPKNHYLSEIAIDIIP 226
Query: 177 YTLKHGDTGLVIVGGRME-------MQPHTI----MDP-------------FEEIGLAEK 212
LK GD GL+++GGR+E +PH + +DP F E+
Sbjct: 227 INLKEGDQGLIVLGGRLEPVGNGVGHKPHVMPAIGIDPKKYMDNDYLSDGEFTGYDPNEE 286
Query: 213 NAEQHRRSASEKEASENSGT--VDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLI 270
+ + + G D HF++YA ++G RWS + + + + +
Sbjct: 287 DVGGGEHQGEGHDHKDEGGLHGRDESHFSYYALDAKTGAKRWSHEENDFKPKNVHLDEEY 346
Query: 271 ---PQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKI 327
+H+YK + +L H GE + F++++L +PH W R DT L+ +HF + K+
Sbjct: 347 HGEKRHSYKQHIISLMD-HEGEVNWKLFKDNMLDQLPHSWSSRYDTKLQSNHFTKSKKS- 404
Query: 328 LKKVVGKSTSYPFHKPEEHHPPGKDST--KKISNLIGKAATYAGSAKSKKPVNYIPTITN 385
G+S + + P T K+ NL G+ + + +
Sbjct: 405 -----GQSN-------QGGNTPSTSGTGGKEWENLFGQQSQSQTTQQRWNQEE------- 445
Query: 386 YTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLH-----------ADINGDGV 434
NV+V+H + GIE +H++SGRT+CKL L+ H D++GDG+
Sbjct: 446 -------SNVIVSHHRHGIEVIHISSGRTLCKLLLEGTDAHRSLTSHHYIVYVDLDGDGM 498
Query: 435 LDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICH-HSPFNLFPHGE 493
+D V +V G+ C +A +G+P R+ LFN SIC S + F
Sbjct: 499 VDQVHSVTGDPVGSVSSFSRSSRQDVCMGMALAGLPPRDHLFNKSICGWTSQLDFFWPAG 558
Query: 494 FSRNFGRTSD------VASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGL 547
F R+ G + D S + P + + + +H D VFL N G++ + +
Sbjct: 559 F-RSSGNSIDGLDKSKRVSYQTVRPAVF---NAVYSLRPTHKDTVFLVNSGKINSVNSRG 614
Query: 548 HGHDAIWQWQLLTDATWSNLPSPSGMTEASTVV--PTLKAFSLRV--HDNQQMILAGGDQ 603
+ + W + + ATWS + ++ P+++ F++ H Q +LA G+
Sbjct: 615 YTN-----WNIDSPATWS---------RNNIIIHHPSIQPFNIATQPHQTDQQVLAIGES 660
Query: 604 EAVVISPGGSILTSIDLPAP----------------PTHALVCEDFSNDGLTDVILMTSN 647
++ + GSIL + + PT + D + DG+ D+I+ T +
Sbjct: 661 IVILSASDGSILLQKKIKSISFNNQNNNGNGNNNVLPTAPPIIGDLNQDGINDIIVPTLS 720
Query: 648 GVYGFVQTRQPGALFFSTLVGCLIVVMGVIFVTQHLNSVK 687
G Y + T+ FS +I + V + S+K
Sbjct: 721 GYYIYNLTKGYSTFLFSCFTVIIIATLLVTVILSKQQSLK 760
>gi|291230320|ref|XP_002735115.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 695
Score = 208 bits (529), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 198/722 (27%), Positives = 312/722 (43%), Gaps = 118/722 (16%)
Query: 4 RDLAILMLSAFAIFFSLQHEGDFSFREAW-------FHLSEEYPIKFDADRLPPPIVADL 56
RDL ++ + A A++ L+ +S + W + + EY +++ ++P P V+D+
Sbjct: 14 RDLWLVAVCAIAVYL-LRVRESYSLQAKWRSDIDTSLYENNEYAKEWE--KIPLPFVSDI 70
Query: 57 NGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLP-----DKIRIASGRRA 111
DG EV+ T + KI++ ++ S+LP ++ + S +
Sbjct: 71 ESDGINEVIFVTKEPKIKIATVPTKKFGS-----------SVLPYLVTKHEVTLDSNKHP 119
Query: 112 VAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIA 171
VA+ATG + Q KQV+V + S W+VMC+DH L LW+ L +E A
Sbjct: 120 VAIATGYLMEYQSMLQVRKQVVVALLSDWTVMCYDHKLQLLWQNKLSPILDERYFIKEAA 179
Query: 172 ISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSG 231
I ++++ LK D GL+I+GG + + H E L + HRR K + S
Sbjct: 180 ILVTSHNLKKKDGGLIIIGGSIGDRHH------HEKKLTH---DHHRRIQDIKPIEDESD 230
Query: 232 TVDLR-----------HFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVH 280
D R HF+ YA +G+ G +RW + + L ++KL +
Sbjct: 231 EGDDRYQDKEKDTHVNHFSTYALSGKDGSIRWHHLPGDFKEHTNKDDALFSARHFKLGLK 290
Query: 281 ALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPF 340
S H GE ++ +++L +PH W R DT + ++ K +I K Y F
Sbjct: 291 KGLS-HEGESHWMQYNKAILKNLPHSWQRASDTRITFDRIQKSKGEIFK-------DYGF 342
Query: 341 HKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQ 400
G D + G ++ S K PN +V H
Sbjct: 343 DDENVLDMIGLDEEHLVGYAFGGLRPHSASEHIKN-----------------PNAIVIHT 385
Query: 401 KEGIEAVHLASGRTVCKLHLQEG-GLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLR 459
EGIE + LA+G+ +C+L LQ+ G++ DIN D +LDHV+ S S
Sbjct: 386 HEGIEVLKLATGQPMCRLKLQQDKGVYMDINQDAILDHVKG---------HFSHSAHAKD 436
Query: 460 PCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEV---ATPILIP 516
C AV +G P LFN SIC G S +D +E P +I
Sbjct: 437 NCLAVVKTGHPPHTLLFNGSICDSQSIL----GWLSFIDSADNDNGHIEDTHHTVPPVIV 492
Query: 517 RSDGHRHRKGSH-------------GDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDAT 563
RS R +H D +FL + G++T+Y P HG + WQ+ T A
Sbjct: 493 RSVAERRGIWNHLLGETQLSASSKGFDSIFLISNGKLTSYGP--HGQ---FHWQVATPAK 547
Query: 564 WSNLPS---PSGM-----TEA--STVVPTLKAFSLRVHDNQQM-ILAGGDQEAVVISPGG 612
WS+ S +G+ TE +T P+++ +L+V+ + + +LAG D +V G
Sbjct: 548 WSDASSLVRRAGLLNREFTEKYRNTFQPSMQVMALQVYGKENIVVLAGWDSMVLVSLKDG 607
Query: 613 SILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIV 672
IL LP P+ ++V DF+NDG TD I+ GF R G L + L+G I+
Sbjct: 608 RILAEHTLPCQPSSSIVIGDFTNDGWTDFIVHCPTSYLGFSLNRTSGYL-STALIGAAII 666
Query: 673 VM 674
V+
Sbjct: 667 VV 668
>gi|303290440|ref|XP_003064507.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454105|gb|EEH51412.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 566
Score = 205 bits (521), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 175/548 (31%), Positives = 239/548 (43%), Gaps = 133/548 (24%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDAD-------------- 46
+ KRDLA++ LS FA++ SL +G SF AW+ +EE A
Sbjct: 2 LAKRDLAVMFLSLFAVYHSLLRDGVVSFDRAWYVPAEESSDDLAASSARGDRAGPGTFAS 61
Query: 47 ---------------RLPPPIVADLNGDGRKEVLVATHD-AKIQVLEP------------ 78
R PPP+ DLNGDG E++VA+ A+I+V+
Sbjct: 62 SARFGGGGVAATRSSRAPPPVFFDLNGDGVNEMIVASRTLAEIRVVSVPSSASRRRRRDG 121
Query: 79 -----------------HARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVID- 120
H D F+ A SLLP R+ +GR +A+A G +
Sbjct: 122 GGGAGGDGGGDHGDEGIHDDDADV-FAALTTTATASLLPANTRVVAGRTPIALAAGHLTP 180
Query: 121 --------RTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAI 172
R + K V+VVVTSGW ++CFDHNL LWE L +FP A RE+A+
Sbjct: 181 PRSSGSSSRANSNVKTRKAVVVVVTSGWHLLCFDHNLRLLWEVALSGEFPRRARIREVAV 240
Query: 173 SISNYTLKHGDTGLVIVGGRMEMQPH-----------TIMDPFEEIGLAEKNAEQHR--- 218
+S + GD G VIVGGR+E ++ D FE E HR
Sbjct: 241 VVSAHATYEGDVGAVIVGGRVETGTRDDDDEGGGGGDSLGDAFERQLDDEDVLRTHRGGA 300
Query: 219 -RSASEKEASENSGT-------------VDL-RHFAFYAFAGRSGLLRWSRKNENIEAQP 263
+A E EA+E G+ +D RHF +YAF G++G RW ++E+ +
Sbjct: 301 KATAREMEAAEKDGSNAEGDGTGAGKGRLDRSRHFNYYAFEGKTGARRWKHESEDFH-RD 359
Query: 264 TDA--SQLIPQHNYKLDVH------------ALNSRHPGEFECREFRESVLG-VMPHHWD 308
DA +L PQH+Y+LD AL RH GE CREFRESV+ +PH W
Sbjct: 360 VDALVDRLTPQHDYRLDAGAFTFTLVPIRPPALEGRHYGEVSCREFRESVVKRALPHQWR 419
Query: 309 RREDTLLKLSHFRRHK--RKILKKVVGKSTSYPFHKPEEHHPPGKD-STKKISNLIGK-- 363
REDT L ++ F +HK + VG+ + G + T + +G
Sbjct: 420 EREDTRLVVAAFEKHKPHKGARSNAVGRGGAATVDGSSSRGARGAEHGTNAFARALGTTA 479
Query: 364 ------AATYAGSAKSKKPVNYIPTITNYTQLWWV--------PNVVVAHQKEGIEAVHL 409
A+ G A S+ P T T T PNVVVAH +EGIE +HL
Sbjct: 480 RAVATGGASNRGGASSRGPGGKTTTATRNTHRRVKRPSSGGDPPNVVVAHVEEGIEILHL 539
Query: 410 ASGRTVCK 417
+GRT+CK
Sbjct: 540 HTGRTLCK 547
>gi|281209979|gb|EFA84147.1| hypothetical protein PPL_03221 [Polysphondylium pallidum PN500]
Length = 748
Score = 198 bits (504), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 206/748 (27%), Positives = 314/748 (41%), Gaps = 168/748 (22%)
Query: 29 REAWFHL-------SEEYPIKFDADRLPPPIVADLNGDGRKEVLVATHDAKIQVLEPHAR 81
R+AW L + YP + LP PI+ DL+GDG+ D KI++++P
Sbjct: 4 RDAWMILVYTNQYANSHYPQINEV--LPNPIITDLDGDGKN-------DHKIRIVDPVLA 54
Query: 82 RVDEGFSEARVLAEVSL-----LPDKIRIASGRRAVAMATGVIDRTYRQGQPLKQVLVVV 136
++ + + SL L K+ ++SGRR VA+ TG Y Q++VVV
Sbjct: 55 DDSVVGGKSTSMGKASLRYEVSLASKVGLSSGRRIVALQTGYA-TPYISRTSSTQLVVVV 113
Query: 137 TSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLKHGDTGLVIVGGRMEMQ 196
T W V+ FDH L LWE + ++ P N +H E+ I + + GL+IVGGR+E
Sbjct: 114 TEEWEVLVFDHQLKPLWERYVVDEIPKNHYHSEVTIQVVS-------NGLIIVGGRLEAI 166
Query: 197 PHTI----------MDP---FEEIGL-----------------AEKNAEQHRRSASEKEA 226
P+ + +DP ++G E +A H E
Sbjct: 167 PNELHRSHITPAIGIDPNALASKVGTLDEGDPTAATATGETASKEGDAHAHDHEKEEVHR 226
Query: 227 SENSGTVDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQL---IPQHNYKLDVHALN 283
EN HF+FYAF+ +G + W + + T A + +H+YK V++L
Sbjct: 227 DEN-------HFSFYAFSAYNGQMHWHHDELSFLPENTHADEEHHDEKRHSYKQHVYSL- 278
Query: 284 SRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKP 343
H GE + +++S+L MPH W + DT +L+HF + H P
Sbjct: 279 LEHIGEVSWKSYKQSMLAAMPHRWSSKFDTRFQLAHFEKQGAN-------------RHNP 325
Query: 344 EEHHPPGKDSTKKISNLIG-KAATYAG-SAKSKKPVNYIPTITNYTQLWWVPNVVVAHQK 401
+ G + + +IG + + G + + P Y T++ PNVV+AH K
Sbjct: 326 TK----GNNEADWNTEVIGVHPSHFEGLTGGNSAPAQYPHAETDHID---EPNVVIAHTK 378
Query: 402 EGIEAVHLASGRTVCKLHLQEGG---------LHADINGDGVLDHVQAVGGN--GAEQTV 450
G+E + L +GRT+CKL L + D+N DGV+D V A+ GN G+
Sbjct: 379 AGMEVIQLQNGRTLCKLLLDSTNKAENSDHYITYTDLNNDGVVDQVHAIAGNLPGSSMFS 438
Query: 451 VSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASL--- 507
+ E+ C A+ SG+P R+ LFN SICH +F F R+S A +
Sbjct: 439 RNRRQEI---CLALGMSGLPPRDYLFNRSICHEGGM------DFDYFFWRSSPAAEVSRG 489
Query: 508 -EVATPILIPRSDGHRHRKGSH--GDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATW 564
+ P +I ++ GS DVVFL N G++T+ H WQ TDA W
Sbjct: 490 AQTVAPAIIASANT---VAGSAKLNDVVFLVNSGKITSVR-----HTGRANWQRDTDARW 541
Query: 565 SNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSIDL---- 620
P P+++AFSL V+ + +LA D V++ G IL S L
Sbjct: 542 QKDGQPYPH-------PSIQAFSLEVYGPKTNLLAVADH-IVILDQQGEILISERLGKKG 593
Query: 621 --------PAPPTHAL----------------------VCEDFSNDGLTDVILMTSNGVY 650
A +HA + D +NDG DVI+ T +G Y
Sbjct: 594 SQKLAGGMSASKSHAAGEQEHYQPTDAAALSVMPMGPPIIGDLNNDGYNDVIVPTVHGYY 653
Query: 651 GFVQTRQPGALFFSTLVGCLIVVMGVIF 678
+ R S V + + V+
Sbjct: 654 VYQLERGHSTFILSLFVIIISLAFTVML 681
>gi|302852355|ref|XP_002957698.1| hypothetical protein VOLCADRAFT_98769 [Volvox carteri f.
nagariensis]
gi|300256992|gb|EFJ41247.1| hypothetical protein VOLCADRAFT_98769 [Volvox carteri f.
nagariensis]
Length = 748
Score = 186 bits (473), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 171/313 (54%), Gaps = 34/313 (10%)
Query: 394 NVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGN--------- 444
N +VA +EG+E +HL SGRTVCKLHL LHADINGDGVLDHV G+
Sbjct: 423 NALVAFLEEGVEVLHLYSGRTVCKLHLPPRTLHADINGDGVLDHVSVYHGHVGGGVDGNE 482
Query: 445 ------GAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRN- 497
G ++ V C AV SG+P E LF +CH + F + SRN
Sbjct: 483 DTSLLPGELPSISGKGHAVFGRCTAVVRSGIPPTETLFKVQVCHTNKFGV----SASRNV 538
Query: 498 FGRTSDVAS-LEVATPILIPRSD--GHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIW 554
F R + VA E+ATPI++P D G ++ HG +VFLTN+GE+TA + G H
Sbjct: 539 FQRAAAVAQPTELATPIMLPIPDLKGFSSKRRQHGMLVFLTNKGEMTAVT-GTGSH---- 593
Query: 555 QWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSI 614
WQ +W P VVPTL A +L H +LA G + AVV+S G I
Sbjct: 594 LWQEYLQVSW-----PPADENPRHVVPTLAAMALYTHAVPTTVLAAGTEHAVVVSEHGHI 648
Query: 615 LTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQ-PGALFFSTLVGCLIVV 673
L ++LP+PPT LV DF+ DGL D+IL+T+ G+YG+VQ G + S L+ L+V
Sbjct: 649 LAELELPSPPTQPLVVADFNGDGLNDIILVTNRGIYGYVQVPHLAGGMSLSALLLTLVVA 708
Query: 674 MGVIFVTQHLNSV 686
+G+++ TQH + +
Sbjct: 709 LGLVYFTQHYDPM 721
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 114/302 (37%), Positives = 170/302 (56%), Gaps = 31/302 (10%)
Query: 53 VADLNGDGRKEVLVATHDAKIQVLEP--HARRVDEGFSEARVLAEVSLLPDKIRIASGRR 110
+ADLNGDG E++VAT D K+QV++P H R EGF+ A L +SLL + +A+G R
Sbjct: 1 MADLNGDGHLELVVATPDLKLQVIQPAPHGHR-GEGFARAAELNAISLLFKRALVAAGHR 59
Query: 111 AVAMATGVID-RTYRQGQPL-KQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHR 168
VA+A G +D + +PL K V+V+VT+ W VMCFDHNL WE + + FP +A +
Sbjct: 60 PVALAVGYLDPLPAERVRPLRKAVIVIVTASWRVMCFDHNLVLKWEYDAKMHFPHHARIK 119
Query: 169 EIAISISNYTLKHGDTGLVIVGG---RMEMQPHTIMDPFEEIGLA------EKNAEQHRR 219
E+A+ I+ + + D G+VIVG R ++ ++ E G A + + R
Sbjct: 120 EVAVYIAPHQVHEADRGIVIVGASVLRGDLASGEGLESVAEGGPAGFFEDDDVLLSEMRE 179
Query: 220 SASEKEASENSGTV----DL------------RHFAFYAFAGRSGLLRWSRKNENIEAQP 263
A KE ++++G DL RHF++ A G +G LRW + +
Sbjct: 180 DAERKEHAKSAGVTESLTDLDPNDPLAGLPGSRHFSYVALEGGNGTLRWHHGSGDFHKDL 239
Query: 264 TD-ASQLIPQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRR 322
++ +L PQ+NY+LD L+ RH GE CR++RESVL V+PH W+R DT L +HF +
Sbjct: 240 SELGGELTPQNNYRLDASKLDGRHFGEASCRDYRESVLHVLPHVWERPADTRLMEAHFIK 299
Query: 323 HK 324
H+
Sbjct: 300 HR 301
>gi|405976894|gb|EKC41372.1| hypothetical protein CGI_10025644 [Crassostrea gigas]
Length = 679
Score = 186 bits (472), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 187/709 (26%), Positives = 300/709 (42%), Gaps = 105/709 (14%)
Query: 6 LAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDA-----DRLPPPIVADLNGDG 60
+ +L+ S A F + + + W SE + K DRLPPPI+ DL+GDG
Sbjct: 17 IFVLLCSVIAYLF--RASDSYDLKPVWRKRSEPHHYKNKLYPTLDDRLPPPIITDLDGDG 74
Query: 61 RKEVLVATHDAKIQVLEPHARRVDEGFSEA--RVLAEVSLLPDKIRIASGRRAVAMATGV 118
E+L+ THD K+ VL R D+ V+ + ++LP I ++ R VAM TG
Sbjct: 75 TNEILLITHDFKLNVLALPERAADDDDDTLPHVVVKQRAVLP--INVSEISRPVAMETGF 132
Query: 119 IDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHRE------IAI 172
Q KQ++VV T W V+C+DHNL LW L + +H +E + I
Sbjct: 133 TVPYSSMMQIRKQIVVVATDDWQVLCYDHNLELLWHKRLMD----VSHVKETYTVKAMGI 188
Query: 173 SISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGT 232
I+ + +K D G+VIVGG HT D I +H + +E++ ++
Sbjct: 189 LITPHNVKKKDQGMVIVGGSFTHLVHTPPDTTTTI--------KHSVNKTEEKQENSTDD 240
Query: 233 VDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFEC 292
L HF+ +A + G RW + T+ + H++KL + + H GE
Sbjct: 241 NRLTHFSSFAVSALDGTSRWHHLPGDFGEIATNIKDIHGDHHWKLALKR-HRLHVGEAPW 299
Query: 293 REFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKD 352
+++ PH W +DT L L FR+ + K S+S +H
Sbjct: 300 TMYKKEFSEFTPHLWVNLDDTKLTLGRFRKTEEGGEKYSSSSSSSSGMALTPDH------ 353
Query: 353 STKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASG 412
+IG A Y G +P + + N PN VV H GIE ++L +G
Sbjct: 354 -------IIGYA--YGG----HRPHSNHEHVEN-------PNAVVIHTHNGIEVLNLLNG 393
Query: 413 RTVCKLHLQ-EGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPV 471
+ + +LHL +GG++ DI+ DG + + V+ G + PC+ PV
Sbjct: 394 QPITELHLPGDGGVYVDIDSDGEI------------EQVLWGLQDDYSPCYIEIWRINPV 441
Query: 472 REQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSD----------GH 521
+E++ IC + LF F+ ++ D +L+ PI+I GH
Sbjct: 442 KERIEQLPICRIT--RLF----FTSSWAYDED--NLKKLPPIIIKSVARKTGLIRYFMGH 493
Query: 522 RHRKGSHG-DVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNL---------PSPS 571
K +H D++ G V++++ D WQ+ T A W+ P+
Sbjct: 494 HLPKFAHKHDIITFGGVGRVSSWN-----RDGNANWQIATPANWARAAIDRKKNKEADPA 548
Query: 572 GMTE-ASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVIS-PGGSILTSIDLPAPPTHALV 629
P+ S+ V+ Q + G E V++ G++L LP P+ L
Sbjct: 549 AYQRFVEEFWPSHVLMSIPVYGQQNVAALSGFSEFVLVDLVNGNLLAEHSLPCSPSAPLT 608
Query: 630 CEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVIF 678
DF NDG+ DVI+ S G GF + L F+ L G + ++ ++
Sbjct: 609 VGDFDNDGVNDVIVTCSLGYIGFSLHHKTNHL-FTALYGLTVFILILLL 656
>gi|156371409|ref|XP_001628756.1| predicted protein [Nematostella vectensis]
gi|156215741|gb|EDO36693.1| predicted protein [Nematostella vectensis]
Length = 645
Score = 186 bits (471), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 194/683 (28%), Positives = 293/683 (42%), Gaps = 138/683 (20%)
Query: 46 DRLPPPIVADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARVL------AEVSLL 99
D LP PI+ DL+ DG E++VAT DAK++V+ E FS +L AE SLL
Sbjct: 55 DLLPKPIIVDLDNDGTNELVVATADAKVKVMVFPV----EDFSSENILPHLHIKAEYSLL 110
Query: 100 PDKIRIASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQ- 158
+ + SG VAMA G + + QV+VVVTS W+V C + L +W+ +LQ
Sbjct: 111 SEVEQSNSGVLPVAMAAGCQEHVMERD--CNQVIVVVTSDWTVHCMSNFLKPIWKTSLQS 168
Query: 159 EDFPPNAHHREIAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHR 218
+D P H ++H T V G D +E+ N +
Sbjct: 169 KDHPLQPEH-----------IRHDHTVETGVAG----------DLVQEL----VNKPRIH 203
Query: 219 RSASEKEASENSGTVDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLD 278
R K+A+ N HF+ YA +G +RW + + T +L+ +++KL
Sbjct: 204 RPVQSKKAASNEAD----HFSTYALDSWTGQIRWRHEPGDFVTNQTAREELLSAYHFKLA 259
Query: 279 VHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSY 338
+H+ N H GE +++ SVL +P W DT +K++ FR+ K G S
Sbjct: 260 LHS-NQYHAGEVRWEQYQASVLRSLPLRWAHPSDTNMKIADFRKGK-------TGSQIS- 310
Query: 339 PFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVA 398
KK S L K K N TQ N +V
Sbjct: 311 --------------EVKKKSWLDLGFTDLTSDTKGTK---------NRTQ----GNAIVI 343
Query: 399 HQKEGIEAVHLASGRTVCK-LHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEV 457
G+E +++ SG+ +C+ +H QE D+NGD V+DHV A Q +
Sbjct: 344 QSHTGLEVLNIQSGQPLCRYVHTQELTTAGDLNGDDVIDHVSM--HFSAHQLFPTD---- 397
Query: 458 LRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPR 517
L C A+A SG V LF +C F + G +V L VA P+L+P
Sbjct: 398 LPSCSAIARSGSRV---LFTGPVCASGSL-------FDWSSGEPHEVEPLLVA-PLLVPS 446
Query: 518 SD----------GH--RHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWS 565
GH R R + D VFL + G++T+Y P HG WQ+ + W+
Sbjct: 447 PPHRTGLFRHVMGHNLRQRSRAEMDAVFLVSTGKLTSYGP--HGEQ---NWQVTVEGAWA 501
Query: 566 NLPSPSGMTEAST------VVPTLKAFSLRVHDNQQ--MILAGGDQEAVVISPGGSILTS 617
P E ++ P+L++ + + ++ ++++G ++V G+I+ S
Sbjct: 502 KHVRPDEHGEWTSENGDQRFFPSLQSMRVNADEEEESAVLVSGWYHISLVSLVDGTIMAS 561
Query: 618 IDLPAPPTHALVCEDFSNDGLTDVIL-----------MTSNGVY-GFVQTRQPGALF--- 662
LP P +V DF+NDGLTDV++ TSN Y GF R+PG L+
Sbjct: 562 HSLPCQPVAPVVNGDFTNDGLTDVVVQCSQSQHCIYYQTSNFSYLGFALERRPGYLWTAI 621
Query: 663 --FSTLVGCLIVVMGVIFVTQHL 683
S LV L+ + + FV + L
Sbjct: 622 WVVSGLVVALLTLGCMRFVEEEL 644
>gi|145355970|ref|XP_001422217.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582457|gb|ABP00534.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 484
Score = 185 bits (470), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 143/444 (32%), Positives = 218/444 (49%), Gaps = 42/444 (9%)
Query: 238 FAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRE 297
F +YAF R+G RW+ + E + LD ++ E CR+FRE
Sbjct: 17 FEYYAFDARTGARRWTETSGEDEHR---RRARARLSLRNLDDDEIDG--AVERACRDFRE 71
Query: 298 SVL-GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHP--PGKDST 354
S++ +PH W DT ++L+HFRRHK + K+++ + P +T
Sbjct: 72 SIVQDGLPHLWRHPVDTKMRLAHFRRHKSRANAMARDKASAGKRKRGAAAAKAVPSNAAT 131
Query: 355 KK----ISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLA 410
+ + L G++ + P + T + PNVVV+H EGI+ +HL
Sbjct: 132 RAFGRALDALKGESKRRLDDDDAHPP--HASTHARRVRKREPPNVVVSHHAEGIDVLHLY 189
Query: 411 SGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVP 470
SG VC++ L+ GLH D++GDGV+DHV+A G N S E + CWA TSGVP
Sbjct: 190 SGVKVCEMKLKSPGLHVDLDGDGVVDHVEAHGRN-------LRSAEDIPHCWATVTSGVP 242
Query: 471 VREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPR-----SDGHRHRK 525
+ +ASIC L H ++ GR D+ ++EV +PI + R S+ +
Sbjct: 243 DEGRTLSASICRGGS-GLAGH-RAAQAAGR--DIRAVEVVSPISLRRLPETSSELAKPLD 298
Query: 526 GSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKA 585
D +FL +RGE+T Y+ G +WQ+ T A W +PS V P+L +
Sbjct: 299 RMGRDAIFLNSRGEMTCYN-ARDGEQK--RWQIRTTADW----TPSD----DVVKPSLTS 347
Query: 586 FSLRVHDNQQMILAGGDQEAVVISPGGSILTS-IDLPAPPTHALVCEDFSNDGLTDVILM 644
F +RV + +A G V+++ G T+ I+LP+PP LV D DG TD++L
Sbjct: 348 FPIRVDGYLDLAIATGASSMVIVNSKGYRATAPIELPSPPAAPLVARDVDGDGFTDLVLH 407
Query: 645 TSNGVYGFVQTRQPGALFFSTLVG 668
T +GVY ++QT + G L F+ L+G
Sbjct: 408 TKSGVYVWIQTPRAGNLPFTFLIG 431
>gi|290973099|ref|XP_002669287.1| FG-GAP repeat protein [Naegleria gruberi]
gi|284082832|gb|EFC36543.1| FG-GAP repeat protein [Naegleria gruberi]
Length = 683
Score = 185 bits (470), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 164/737 (22%), Positives = 317/737 (43%), Gaps = 124/737 (16%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWF--HLSEEYPIKF---DADRLPPPIVAD 55
MRKRD A+ L + +L EG + +A+F + + P+ D +P PI+ D
Sbjct: 1 MRKRDFAMFALVCLLLVVALNQEGSVTIEKAFFFSQTTNQQPLLLKTGQVDLVPKPIITD 60
Query: 56 LNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLP------------DKI 103
++GDG K+++ + + +I ++ + S + ++ + P DK
Sbjct: 61 IDGDGSKDLITSNDNGRISIVSL------KTISSSLLVHQDMFNPITEEKSFDVNEKDKN 114
Query: 104 RIASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANL------ 157
+ + + G I ++GQ + + V S W V+C D +LN WE++L
Sbjct: 115 KGTVHHSIIGLGHGHIKVKKKKGQRGTKRIAAVLSNWKVLCLDPDLNLQWESDLLQHVVS 174
Query: 158 --------QEDFPPNAHHREIAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGL 209
ED + E+ +++ + + D G+VI+G R +P + D
Sbjct: 175 EMKLTDVEMEDLRNDFKPFEVFVAVYPWRVHENDEGMVIIGFR---KPQEMFD------- 224
Query: 210 AEKNAEQHRRSASEKEASENSGTVDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQL 269
+ S K E + + ++ G +G LRW ++ A+ D ++
Sbjct: 225 ----------NDSTKRTEETATQI-----TYFCLDGATGQLRWKHQSN---AKSWDDNEF 266
Query: 270 IPQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILK 329
QH++KL+ S GE RE+++ ++ +PH + DT + L +F+ K+ +
Sbjct: 267 YEQHSFKLNAQQHLS-DSGEVIWREYKKKIMHHLPHVYRHPYDTSVILDNFQPTKKAKVS 325
Query: 330 KVVGKSTSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQL 389
K+ ++ + D +K+ + I T K N I +
Sbjct: 326 KIQERAEKQDY----------GDIGEKVKSTIQTVTTKTKDTNEKLKKN----IELMKKF 371
Query: 390 WWVPNVVVAHQKEGIEAVHLASGRTVCKLH-LQEGGLHADINGDGVLDHVQAVGGNGAEQ 448
+PNV+VAH +GI+ +HL +GRT+ + L+E + DIN DG++D +A
Sbjct: 372 EKIPNVMVAHLSKGIQVIHLYTGRTLTQFSPLKENTYYQDINFDGMIDAAEA-------- 423
Query: 449 TVVSGSMEVLRPCWAVATSGVPV-REQLFNASICHHSPFNLFPHGEFSRNFGRTSD---- 503
+ C A SG P + LF S+C P + +FS F +++
Sbjct: 424 ----------QVCKARVESGAPNPYDTLFRKSVCKLHPSSFIE--QFSIPFLMSNEQQER 471
Query: 504 ----------VASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYS----PGLHG 549
++V TP +I + + D+V+LT+ G VTA + ++G
Sbjct: 472 QSFDDDEEYEDEQMDVVTPAIIEHYSAEQKPQHRARDLVYLTSTGLVTAITFDRKNSVNG 531
Query: 550 HDAI-WQWQLLTDATW--SNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAV 606
+WQ++T +++ + SG E + P + AF LR + ++A G++
Sbjct: 532 EPKFRVKWQVMTPSSFRRERVLVESG-AETESFFPEIVAFPLRKFETHSFVVAVGEKRIT 590
Query: 607 VISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTL 666
++ G++ I+LP P + D + D + D+++ T G+Y ++ G + L
Sbjct: 591 IVDLFGNVQQIIELPHPSISPIQLVDINGDNILDIVVSTKQGIYAYITRVHTGMSVLTFL 650
Query: 667 VGCLIVVMGVIFVTQHL 683
+ L V++G+++++ ++
Sbjct: 651 IVSLFVIVGLLYLSTYI 667
>gi|159484188|ref|XP_001700142.1| predicted protein [Chlamydomonas reinhardtii]
gi|158272638|gb|EDO98436.1| predicted protein [Chlamydomonas reinhardtii]
Length = 786
Score = 182 bits (462), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 121/311 (38%), Positives = 169/311 (54%), Gaps = 34/311 (10%)
Query: 394 NVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQ------AVGGNGAE 447
N +VA +EG+E +HL SGRTVCKLHL LHAD+NGDGVLDH+ A G+G E
Sbjct: 454 NALVAFLEEGVEVLHLFSGRTVCKLHLPPRTLHADLNGDGVLDHISVYHGHTAQAGDGEE 513
Query: 448 ---------QTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRN- 497
++V V C A SG+P E LF +CH F + SRN
Sbjct: 514 GDALLPGELRSVSGRGHAVSGRCTAHVRSGIPPSETLFTVQVCHTRKFGV----SASRNV 569
Query: 498 FGRTSDVAS-LEVATPILIPRSD--GHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIW 554
F R + A E+ATP+++P D G+ R+ HG +VF+TN+GE+TA S G
Sbjct: 570 FQRAAAAAQPTELATPVMLPIPDLHGYSSRRRQHGMLVFMTNKGEMTAVSSG-----GAH 624
Query: 555 QWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSI 614
WQ +W P VVPTL A +L H +LA G AV++S G +
Sbjct: 625 LWQEYLQVSW-----PPADQNPDHVVPTLAAMALYTHAVPTTVLAAGTDHAVIVSEHGHV 679
Query: 615 LTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQ-PGALFFSTLVGCLIVV 673
L ++LPA PT LV DF+ DGL D+I++T+ G+YG+VQ + G + S L+ L++
Sbjct: 680 LAELELPAAPTQPLVVSDFNGDGLNDIIVVTNKGLYGYVQVQHLAGGMSLSALLLTLLIA 739
Query: 674 MGVIFVTQHLN 684
+G+++ TQH +
Sbjct: 740 LGLVYFTQHYD 750
Score = 176 bits (447), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 110/303 (36%), Positives = 160/303 (52%), Gaps = 32/303 (10%)
Query: 54 ADLNGDGRKEVLVATHDAKIQVLEPHA-RRVDEGFSEARVLAEVSLLPDKIRIASGRRAV 112
ADLNGDG E++V T D K+QV++P EGF+ A + +SLL + +A+G R V
Sbjct: 7 ADLNGDGHIELVVTTPDLKLQVVQPAPPGHHGEGFARAVEINSISLLFRRALVAAGHRPV 66
Query: 113 AMATGVID-RTYRQGQPL-KQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREI 170
A+ G ID + +PL KQV+VVVT+ W V+CFDHNL LWE + + FP ++H +E+
Sbjct: 67 ALQVGYIDPLPLERVRPLRKQVIVVVTASWQVLCFDHNLVMLWEYDARVHFPHHSHIKEV 126
Query: 171 AISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNA------------EQHR 218
A+ IS + + D GLV+VG + + + + N + R
Sbjct: 127 AVYISPHQVHGADRGLVVVGASVMRGDVASGEGLQSVAGGGVNGTGSFFEDDDVLLSEMR 186
Query: 219 RSASEKEASENSGTV----DL------------RHFAFYAFAGRSGLLRWSRKNENIEAQ 262
A + S+ +G DL RHF++ A G G +RW + +
Sbjct: 187 EDAERRRLSKGAGLAEQLSDLDKDDPLAGLPGSRHFSYVAVEGGKGAVRWQHGSGDFHKD 246
Query: 263 PTD-ASQLIPQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFR 321
+ +S L+PQH Y+LD L RH GE CR++RESVL V+PH WDR DT L +HF
Sbjct: 247 LDELSSGLVPQHQYRLDAEKLEGRHFGEASCRDYRESVLHVLPHVWDRPADTRLMEAHFI 306
Query: 322 RHK 324
+H+
Sbjct: 307 KHR 309
>gi|390356446|ref|XP_003728789.1| PREDICTED: uncharacterized protein LOC100891930 [Strongylocentrotus
purpuratus]
Length = 624
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 155/588 (26%), Positives = 256/588 (43%), Gaps = 84/588 (14%)
Query: 4 RDLAILMLSAFAIFFSLQHEGDFSFREAWFH-----LSEEYPIKFDADRLPPPIVADLNG 58
RD IL++ + + L+ + F +W + EE +LP P+VAD+
Sbjct: 19 RDGVILVILSLMATYLLRVQHSFELVPSWRYEVSTGFYEESMNTRTLKKLPVPLVADIES 78
Query: 59 DGRKEVLVATHDAKIQVLEPHARRVD-EGFSEARVLAEVSLLPDKIRIASGRR-AVAMAT 116
DGR E+L+ T D+ + +L+ + L E SLL D+ IA ++ AVAMAT
Sbjct: 79 DGRNEILLTTKDSSLLLLKTSKPSPGLKALPYLNTLDEFSLLSDEADIAGDKKQAVAMAT 138
Query: 117 GVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAH-HREIAISIS 175
G ++ Q KQV+VV+ + W+V+C + + +W L + + E AI I
Sbjct: 139 GYLEPLTSDNQVRKQVIVVLCADWTVVCLNSDFKLIWSMKLPNTTWSHQYVMSEAAILIL 198
Query: 176 NYTLKHGDTGLVIVGGRM-EMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVD 234
++L+ D GLVI+GGR+ + H + + + + H + KE+ + D
Sbjct: 199 PHSLQSDDGGLVIIGGRLADRLAHATL---------KHSHDGHGFDGANKESKSSH---D 246
Query: 235 LRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECRE 294
+ HF+ +A +GR G +RW + + +L ++KL++ H GE +
Sbjct: 247 VGHFSTFALSGRHGDVRWHHLPGDFGEETNTEEKLFEGFHFKLNLKR-GWGHQGELGWNQ 305
Query: 295 FRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKS------TSYPFHKPEEHHP 348
+ +++L MP W DT + ++ FR+ +KK K +SY EEH
Sbjct: 306 YNQALLKQMPFRWQSSMDTKIDIAEFRKDG---VKKSAEKDEHILSVSSYLTSSFEEH-- 360
Query: 349 PGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVH 408
A + G +I PN V+ H ++GIE +
Sbjct: 361 --------------IAGHHFGGLPPHSASEHIKN----------PNAVIIHSQQGIEVLE 396
Query: 409 LASGRTVCKLHLQEGGLHADINGDGVLDHVQAV-GGNGAEQTVVSGSMEVLRPCWAVATS 467
L SGR + +L L +ADI+ DGV+D +A+ +G+E C AV +
Sbjct: 397 LKSGRPMTRLDLSPDATYADIDKDGVVDQAKALFTDDGSEGN-----------CQAVVKT 445
Query: 468 GVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGS 527
G P +L+N SIC+ S ++ G + + E++ LI +S R S
Sbjct: 446 GHPPHSELYNGSICYPSSLWAALSYPWAYASGSSEIKENQELSLRPLIVKSVAKRRGIIS 505
Query: 528 H----------GDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWS 565
H D +F + G+VT+Y P + WQ+ T A WS
Sbjct: 506 HLLGLSMSKAGMDTIFTVSTGQVTSYGP-----QGQFNWQVSTSALWS 548
>gi|424512936|emb|CCO66520.1| predicted protein [Bathycoccus prasinos]
Length = 1026
Score = 162 bits (410), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 193/762 (25%), Positives = 303/762 (39%), Gaps = 186/762 (24%)
Query: 1 MRKRDLAILMLSAFAI----FFSLQHE----GDFSFREAWFHLSEEYPIKFDADRLPPPI 52
+R RD+ +L L FA+ FF+ + SF + W L+ + + P+
Sbjct: 47 LRTRDVLVLSLFCFALLRCLFFNRNNSVGSLNGMSFVKQWEILTVDSSSSSSPPKPLLPV 106
Query: 53 VA-DLNGDGRKEVLVATHDA-KIQVLEPHAR--------RVDEGFSEA---------RVL 93
+ DLNGDGR+E L+A ++ KI + + +R + G +E R+L
Sbjct: 107 LFFDLNGDGRQETLIAFDESNKIGIYDGSSRGGKGQKTKNNENGMNEDEDVGNENTLRLL 166
Query: 94 AEVSLLPDKIRIAS--------GRRAVAMA------------------TGVIDRTYRQGQ 127
V L + + G+R +A+A +G+ + R+
Sbjct: 167 KMVDLRETNEKSKTHRMRDEHGGKRVIALAAGRPEARRRGETKRRRSGSGIGSKQRRKTV 226
Query: 128 PL---KQVLVVVTSGWSVMCFDHNLNKLWEANLQED-FPPNAHH---------------- 167
+ K + V VT VM FDHN K WE + ++ F AHH
Sbjct: 227 VMIKRKAIFVAVTEELQVMTFDHNAKKKWERSARDALFSGKAHHRHHLGGRGGRGDSMRV 286
Query: 168 REIAISISNYTLKHGDTGLVIVGGRMEMQPHTI------------MDPF----------- 204
EIA+++ + ++ D GLV+VGGR+E + + M+ +
Sbjct: 287 EEIAVTVVSAKGENED-GLVVVGGRVEYEKYQNEEGEEGEDYERGMEAYARELKDDEMLS 345
Query: 205 --------EEIGLAEKNAEQHRRSASEKEASENSGTVDLRHFAFYAFAGRSGLLRWSRKN 256
E I E+ E ++ AS+N F + AF RSG L W R
Sbjct: 346 MHRGGRRDETIEFKEEGVEDIIEDDNDTFASQNGA------FVYLAFNARSGELVWRRVA 399
Query: 257 ENIEAQPTDA-SQLIPQHNYKLDVHALN-SRHPGEFE--CREFRESVLG-VMPHHWDRRE 311
+ A P + P H +LD H H + + CR +RES L +PH W E
Sbjct: 400 SDFVADPAELLRTTTPAHEVRLDSHLTKIEEHKSKTDGICRSYRESALAEALPHAWRDAE 459
Query: 312 DTLLKLSHFR------------RHKRKILKKVVG---KSTSYPFHKPEEHHP--PGKDST 354
DT ++ + F R KR + G S + K P D T
Sbjct: 460 DTRMRSAPFSKSQKQTSSSFGGRKKRARGGRDPGPLATSAATTNLKAASRDPNFAVADKT 519
Query: 355 KKISNLIGKAATYAGSAKS--------KKPVNYIPTITNYTQLWW-----VPNVVVAHQK 401
S+L+ + +Y KS ++ ++Y ++ + + N ++AH +
Sbjct: 520 NDFSSLLSRGFSYLNLGKSAVHHRDDEREKLDYHNRRFHHQHMMSSSNNNIKNAIIAHHR 579
Query: 402 EGIEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRP- 460
+G+E + +G +C+L LQ G HADINGDG +DH+ A G + +M+ P
Sbjct: 580 DGVEVLDAFTGELICQLALQHPGYHADINGDGAIDHLDA---RGHRAHIADVAMDNFSPG 636
Query: 461 CWAVATSGVPVREQLFNASIC--------HHSPFNLFPHGEFSRNFG----RTSDVASLE 508
CWA TSG P + +F SIC +P + +N G R + ++
Sbjct: 637 CWASVTSGAPDVQTVFEGSICKPTRKSSMKSTPRQRAKSNKKQQNSGDFYLRNKNKDDVD 696
Query: 509 VATPILIPRSDG---HRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWS 565
V TPI++ R D R + + D VFL +RGE+T Y+ D +W + T A+W
Sbjct: 697 VLTPIIVRRDDKSERDRSFRRATKDAVFLNSRGELTCYAK-----DGTRRWMVNTRASWK 751
Query: 566 ---NLPSPSGM-------------TEASTVVPTLKAFSLRVH 591
N P G+ E VPTL F R H
Sbjct: 752 IDVNGPPKDGIYDLFSDVNNNNDSKEMDRFVPTLATFKFRTH 793
>gi|443696151|gb|ELT96931.1| hypothetical protein CAPTEDRAFT_224542 [Capitella teleta]
Length = 688
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 177/726 (24%), Positives = 299/726 (41%), Gaps = 119/726 (16%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFH-------LSEEYPIKFDADRLPPPIV 53
+R +D+ I + A ++ L + AW+H + YP+ D +R P PIV
Sbjct: 20 VRLQDVVITAVCALLVYL-LGAKNQIILTPAWYHHFDAKHFENRRYPV--DEERKPQPIV 76
Query: 54 ADLNGDGRKEVLVATHDAKIQVLEPHA--RRVDEGFSEARVLAEVSLLPDKIRIASGRRA 111
D +GD + EV++ +++ ++QVL +A + + + +V V L +
Sbjct: 77 TDFDGDNKMEVVLISNENQLQVLLYNASQKLASKRLQQLQVKHSVVLPLEVNEYGEQEYP 136
Query: 112 VAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHRE-- 169
VAM TG ID QV+VV T+ W V+C+ L LW+ L P H R+
Sbjct: 137 VAMETGYIDPYISPEHSRSQVIVVATNNWHVLCYSSTLQLLWQHQLM----PLNHSRDHL 192
Query: 170 ----IAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKE 225
A+ +S L D G + VGG + H L ++ A + A +
Sbjct: 193 QMTASALLVSAVKLHKNDQGAIFVGGAYGHKDHHARMKM----LLDEQASRAMHQAGDLH 248
Query: 226 ASENSGTVDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSR 285
+ + T HF+ +A +G +G +RW + E + + A + H++KL++ +
Sbjct: 249 TAPDDPTT---HFSTFALSGSNGAIRWHHLPGDFETKRSKAR--VDSHHWKLNLRTHHEE 303
Query: 286 HPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPF----- 340
H GE ++ ES L ++PH W DT + R+ K I++ + S++ P
Sbjct: 304 HMGEVHWTKYGESFLQLLPHDWTSLGDTKFDFADLRK-KELIIR--IESSSALPLVDIIG 360
Query: 341 HKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQ 400
+++H G Y G P ++ I PN +V H
Sbjct: 361 ESLDDYHISG--------------VAYGGLG----PHSHHEHIRK-------PNTLVVHN 395
Query: 401 KEGIEAVHLASGRTVCKLHL-QEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLR 459
+G+E + L GR + +L + ++ + D+N DG L+ + V +
Sbjct: 396 HKGVEVLDLHVGRPLTRLKVARDKSAYIDVNEDGTLERARLVATDDG------------- 442
Query: 460 PCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLE-VATPILIPRS 518
C+ ++ P +F+ ++C S + G F R A E + P +S
Sbjct: 443 -CYGEVSTVHPRPLAIFSENVC--SALQWWGSGSV---FSRLPVAAEDEDCSVPPFFIKS 496
Query: 519 DGHR--------------HRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATW 564
R RKG D +FLT++G +++++P + WQ+LT A W
Sbjct: 497 IAVRKGFVNHLLGYTLPSERKGY--DTLFLTSQGRLSSFAP-----NGDLNWQVLTPAQW 549
Query: 565 -----SNLPSPSGM---TEASTVVPTLKAFSLRVHDNQQM-ILAGGDQEAVVISPGGSIL 615
N P + P+ S +V+ + + +L G D A+V G IL
Sbjct: 550 VEQTTRNWRDPGSIHADVYNHVFKPSATPMSPKVYGRKSVALLLGWDTIALVDLRDGQIL 609
Query: 616 TSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGF-VQTRQPGALFFSTLVGCLIVVM 674
LP P V DF+NDG D I+ S+G F V + P TLV + VV+
Sbjct: 610 GEHSLPCQPIDKPVIADFTNDGQNDFIVTCSSGSITFSVSSEFP---IVHTLVIGVFVVL 666
Query: 675 GVIFVT 680
V+ +T
Sbjct: 667 CVLLMT 672
>gi|407410236|gb|EKF32752.1| hypothetical protein MOQ_003389 [Trypanosoma cruzi marinkellei]
Length = 711
Score = 128 bits (322), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 173/744 (23%), Positives = 306/744 (41%), Gaps = 123/744 (16%)
Query: 1 MRKRDLAILMLSA-FAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGD 59
MR RD+ I+ ++ F IF Q G +R +++ K+ + P+V DL GD
Sbjct: 1 MRLRDILIVCVAIIFCIFGWSQEGGLIVWRNFKIPITDSEVSKY---VMSKPVVLDLMGD 57
Query: 60 GRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVI 119
GR +L +T +++ + H RV+ SE VS+ P + + R VA+A G +
Sbjct: 58 GRPVLLASTKYGSLELFKTHLARVN---SEN---VFVSIRPSS-SLKTLFRIVAIAAGKL 110
Query: 120 DRTYRQGQPLKQVLVVVTSGWSVMCFD-HNLNKLWEANLQEDFPPNAHHREIAISISNYT 178
RT ++ +VV++ + + H+ +W+ L + H ++S+
Sbjct: 111 SRTSKE-----NAIVVISDDFRLRRISPHDFTAVWDVPLSASWIETYH---ASVSVVPER 162
Query: 179 LKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHF 238
+ D G V+V ++ T + + A+ A H S +E E G V
Sbjct: 163 IHEQDEGTVLVAMQVAGPNGTELMLYAAFNGADGQARWHYTSDAENSIGEVLGEV----- 217
Query: 239 AFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRES 298
N+ +E P D + +++ N E +RE+
Sbjct: 218 ----------------CNDGVECLPPDNDTTVGRNSIFKSQGVKNQFRAQEKPWTFYREA 261
Query: 299 VLGVMPHHWDRREDTLLK---LSHFRRHKRKILK---KVVGKSTSYPFHKPEEHHPPGKD 352
++ ++PH + D + H + K++ + +VV K +E + +
Sbjct: 262 IITLLPHRYSHPWDEHIHPHVFFHAKNRKKRSFRAGNRVVVKYKDRLIRMNKEDYGALGE 321
Query: 353 STKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASG 412
I N+ G AAT KS++P N ++ H K GIE +HL +G
Sbjct: 322 RL-GIMNMHGNAATPVN--KSRRP---------------AANAMIFHSKNGIEVIHLYTG 363
Query: 413 RTVCKLH-LQEGGLHA-DINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVP 470
+ +L L+ G++ DIN D +D + G E G ++++ C V +G+P
Sbjct: 364 SVITRLGPLKSSGVYYHDINDDFQVDAIGTQIGPRMEMHSRHG-VDLIDDCLGVIHTGIP 422
Query: 471 VRE-QLFNASICHHSPF----NLFPHGEFSRNFGRTSD----VASLEV------------ 509
V E QLFNA+IC F +L H F N R D + +LE+
Sbjct: 423 VAEDQLFNATICDTEGFFGRLDLIHH--FIDNDIRGEDAPHALNTLELIGSRSVLSKNTR 480
Query: 510 -ATPILIP-RSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLL-------- 559
TP+++ + R VF+ + G VT P IW+ Q
Sbjct: 481 SVTPLVVQLHTTKGRDLFQVERHAVFMIDSGLVTCVDPS--RRRVIWRSQTAAGFYGLRE 538
Query: 560 ---TDATWSNLPSPSGMTEASTVVPTLKAFSL-RVHDNQQM--------------ILAGG 601
DA + + + M A + P L A+S R D+ M I+A G
Sbjct: 539 AAEADAGMAGMSAKERMHRAVS-FPHLAAYSFYRSQDDAVMSVSRSERYLRTDPFIIAVG 597
Query: 602 DQEAVVISP-GGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGA 660
++ ++S G +L +I+L PP ++ D + DG+ D++++T G+YGFV Q +
Sbjct: 598 ERYMCILSTRTGRVLRTIELEHPPVAPVILRDLNGDGINDIVVVTKEGIYGFVVGTQTSS 657
Query: 661 LFFSTLVGCLIVVMGVIFVTQHLN 684
+ L+ ++ ++ V+FV + ++
Sbjct: 658 DTVTALMILMVALLAVLFVVREMS 681
>gi|407848095|gb|EKG03575.1| hypothetical protein TCSYLVIO_005372 [Trypanosoma cruzi]
Length = 711
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 170/744 (22%), Positives = 304/744 (40%), Gaps = 123/744 (16%)
Query: 1 MRKRDLAILMLSA-FAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGD 59
MR RD+ I+ ++ F IF Q G +R +++ K+ + P+V DL GD
Sbjct: 1 MRLRDILIVCVAIIFCIFGWSQEGGLIVWRNFKIPVTDSEVSKY---VMSKPVVLDLMGD 57
Query: 60 GRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVI 119
GR +L +T +++ + H RV+ SE VS+ P + + R VA+A G +
Sbjct: 58 GRPVLLASTKYGSLELFKTHLARVN---SED---VFVSIRPSS-SLETLFRIVAIAAGKL 110
Query: 120 DRTYRQGQPLKQVLVVVTSGWSVMCFD-HNLNKLWEANLQEDFPPNAHHREIAISISNYT 178
RT ++ +VV++ + + H+L ++W+ L + H ++S+
Sbjct: 111 SRTSKE-----NAIVVISDDFRLRRISPHDLTEVWDVPLSASWIETYH---ASVSVVPER 162
Query: 179 LKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHF 238
+ D G V+V ++ T + + A+ A H S +E E G
Sbjct: 163 IHEQDEGTVLVAMQVAGPNGTELMLYAAFNGADGQARWHYTSDAENSIGEVLG------- 215
Query: 239 AFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRES 298
N+ +E P D + +++ N E +RE+
Sbjct: 216 --------------EECNDGVECLPPDNDTSLRRNSIFKSQEVKNQFRAQEKPWTFYREA 261
Query: 299 VLGVMPHHWDRREDTLLK---LSHFRRHKRKILK---KVVGKSTSYPFHKPEEHHPPGKD 352
++ ++PH + D + H + K++ + +VV K +E + +
Sbjct: 262 IITLLPHRYSHPWDEHIHPHVFFHAKNRKKRSFRAGNRVVVKYKDRLIRMNKEDYGALGE 321
Query: 353 STKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASG 412
+ N+ G AAT ++ + N ++ H K GIE +HL +G
Sbjct: 322 RL-GVMNMHGNAATPVNKSRRR-----------------TANAMIFHSKNGIEVIHLYTG 363
Query: 413 RTVCKLH-LQEGGLHA-DINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVP 470
+ +L L+ G++ DIN D +D + G E G ++++ C V +G+P
Sbjct: 364 SVITRLGPLKSSGVYYHDINDDFQVDAIGTQIGPRMEMHSRHG-VDLIDDCLGVIHTGIP 422
Query: 471 VRE-QLFNASICHHSPF----NLFPHGEFSRNFGRTSD----VASLEV------------ 509
V E QLFNA+IC F +L H F N R D + +LE+
Sbjct: 423 VAEDQLFNATICDTEGFFGRLDLIHH--FIDNDIRGEDAPHALNTLELIGSRSVLSKNTR 480
Query: 510 -ATPILIP-RSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLL-------- 559
TP+++ + R VF+ + G VT P IW+ Q
Sbjct: 481 SVTPLVVQLHTTKGRDLFQVERHAVFMIDSGLVTCVDPS--RRRVIWRSQTAADFYGLRE 538
Query: 560 ---TDATWSNLPSPSGMTEASTVVPTLKAFSL-RVHDNQQM--------------ILAGG 601
DA + + + M A + P L A+S R D+ M I+A G
Sbjct: 539 AAEADAGMAGMSAKERMHRAVS-FPHLAAYSFYRGQDDSAMSVSGSERYLRTDPFIIAVG 597
Query: 602 DQEAVVISP-GGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGA 660
++ ++S G +L +I L PP ++ D + DG+ D+I++T G+YGFV Q +
Sbjct: 598 ERYMCILSTRTGRVLRTIVLEHPPVAPVIVRDLNGDGINDIIVVTKEGIYGFVVGTQTSS 657
Query: 661 LFFSTLVGCLIVVMGVIFVTQHLN 684
+ L+ ++ ++ V+FV + ++
Sbjct: 658 DTVTALMILMVALLAVLFVVREMS 681
>gi|71667530|ref|XP_820713.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70886069|gb|EAN98862.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 711
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 170/744 (22%), Positives = 303/744 (40%), Gaps = 123/744 (16%)
Query: 1 MRKRDLAILMLSA-FAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGD 59
MR RD+ I+ ++ F IF Q G +R +++ K+ + P+V DL GD
Sbjct: 1 MRLRDILIVCVAIIFCIFGWSQEGGLIVWRNFKIPVTDSEVSKY---VMSKPVVLDLMGD 57
Query: 60 GRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVI 119
GR +L +T +++ + H RV+ SE VS+ P + + R VA+A G +
Sbjct: 58 GRPVLLASTKYGSLELFKTHLARVN---SED---VFVSIRPSS-SLKTLFRIVAIAAGKL 110
Query: 120 DRTYRQGQPLKQVLVVVTSGWSVMCFD-HNLNKLWEANLQEDFPPNAHHREIAISISNYT 178
RT ++ +VV++ + + H+L ++W+ L + H ++S+
Sbjct: 111 SRTSKE-----NAIVVISDDFRLRRISPHDLTEVWDVPLSASWIETYH---ASVSVVPER 162
Query: 179 LKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHF 238
+ D G V+V ++ T + + A+ A H S +E E G
Sbjct: 163 IHEQDEGTVLVAMQVAGPNGTELMLYAAFNGADGQARWHYTSDAENSIGEVLG------- 215
Query: 239 AFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECREFRES 298
N+ +E P D + +++ N E +RE+
Sbjct: 216 --------------EECNDGVECLPPDNDTSLRRNSIFKSQGVKNQFRAQEKPWTFYREA 261
Query: 299 VLGVMPHHWDRREDTLLK---LSHFRRHKRKILK---KVVGKSTSYPFHKPEEHHPPGKD 352
++ ++PH + D + H + K++ + +VV K +E + +
Sbjct: 262 IITLLPHRYSHPWDEHIHPHVFFHAKNRKKRSFRAGNRVVVKYKDRLIRMNKEDYGALGE 321
Query: 353 STKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASG 412
+ N+ G AAT ++ + N ++ H K GIE +HL +G
Sbjct: 322 RL-GVMNMHGNAATTVNKSRRR-----------------AANAMIFHSKNGIEVIHLYTG 363
Query: 413 RTVCKLH-LQEGGLHA-DINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVP 470
+ +L L+ G++ DIN D +D + G E G ++++ C V +G+P
Sbjct: 364 SVITRLGPLKSSGVYYHDINDDFQVDAIGTQIGPRMEMHSRHG-VDLIDDCLGVIHTGIP 422
Query: 471 VRE-QLFNASICHHSPF----NLFPHGEFSRNFGRTSD----VASLEV------------ 509
V E QLFNA+IC F +L H F N R D + +LE+
Sbjct: 423 VAEDQLFNATICDTEGFFGRLDLIHH--FIDNDIRGEDAPHALNTLELIGSRSVLSKNTR 480
Query: 510 -ATPILIP-RSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLL-------- 559
TP+++ + R VF+ + G VT P IW+ Q
Sbjct: 481 SVTPLVVQLHTTKGRDLFQVERHAVFMIDSGLVTCVDPS--RRRVIWRSQTAAGFYGLRE 538
Query: 560 ---TDATWSNLPSPSGMTEASTVVPTLKAFSL-RVHDNQQM--------------ILAGG 601
DA + + + M A + P L A+S R D M I+A G
Sbjct: 539 AAEADAGMAGMSAKERMHRAVS-FPHLAAYSFYRGQDESAMSVSGSERYLRTDPFIIAVG 597
Query: 602 DQEAVVISP-GGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGA 660
++ ++S G +L +I L PP ++ D + DG+ D+I++T G+YGFV Q +
Sbjct: 598 ERYMCILSTRTGRVLRTIALEHPPVAPVIVRDLNGDGINDIIVVTKKGIYGFVVGTQTSS 657
Query: 661 LFFSTLVGCLIVVMGVIFVTQHLN 684
+ L+ ++ ++ V+FV + ++
Sbjct: 658 DTVTALMILMVALLAVLFVVREMS 681
>gi|71422558|ref|XP_812172.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70876921|gb|EAN90321.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 711
Score = 123 bits (309), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 171/751 (22%), Positives = 308/751 (41%), Gaps = 137/751 (18%)
Query: 1 MRKRDLAILMLSA-FAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGD 59
MR RD+ I+ ++ F IF Q G +R +++ K+ + P+V DL GD
Sbjct: 1 MRLRDILIVCVAIIFCIFGWSQEGGLIVWRNFKIPVTDSEVSKY---VMSKPVVLDLMGD 57
Query: 60 GRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVI 119
GR +L +T +++ + H RV+ SE VS+ P R + R VA+A G +
Sbjct: 58 GRPVLLASTKYGSLELFKTHLARVN---SEN---VFVSIRPSSSR-KTLFRIVAIAAGKL 110
Query: 120 DRTYRQGQPLKQVLVVVTSGWSVMCFD-HNLNKLWEANLQEDFPPNAHHREIAISISNYT 178
RT ++ +VV++ + + H+L ++W+ L + H ++S+
Sbjct: 111 SRTSKE-----NAIVVISDDFRLRRISPHDLTEVWDVPLSASWIETYH---ASVSVVPER 162
Query: 179 LKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDLRHF 238
+ D G V+V ++ T + + A+ A H S E E G
Sbjct: 163 IHEQDEGTVLVAMQVAGPNGTELTLYAAFNGADGQARWHYTSDVENSIGEVLG------- 215
Query: 239 AFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRH--PGEFECRE-- 294
N+ +E P P+++ L +++ +F +E
Sbjct: 216 --------------EECNDGVECLP-------PENDTSLRRNSIFKSQVVKNQFRAQEKP 254
Query: 295 ---FRESVLGVMPHHWDRREDTLLK---LSHFRRHKRKILK---KVVGKSTSYPFHKPEE 345
+RE+++ ++PH + D + H + K++ + +VV K +E
Sbjct: 255 WTFYREAIITLLPHRYSHPWDEHIHPHVFFHAKNRKKRSFRAGNRVVVKYKDRLIRMNKE 314
Query: 346 HHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIE 405
+ + + N+ G AAT ++ + N ++ H K GIE
Sbjct: 315 DYGALGERL-GVMNMHGNAATTVNKSRRR-----------------AANAMIFHSKNGIE 356
Query: 406 AVHLASGRTVCKLH-LQEGGLHA-DINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWA 463
+HL +G + +L L+ G++ DIN D +D + G E G ++++ C
Sbjct: 357 VIHLYTGSVITRLGPLKSSGVYYHDINDDFQVDAIGTQIGPRMEMHSRHG-VDLIDDCLG 415
Query: 464 VATSGVPVRE-QLFNASICHHSPF----NLFPHGEFSRNFGRTSD----VASLEV----- 509
V +G+PV E QLFNA+IC F +L H F N R D + +LE+
Sbjct: 416 VIHTGIPVAEDQLFNATICDTEGFFGRLDLIHH--FIDNDIRGEDAPHALNTLELIGSRS 473
Query: 510 --------ATPILIP-RSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLL- 559
TP+++ + R VF+ + G VT P +W+ Q
Sbjct: 474 VLSKNTRSVTPLVVQLHTTKGRDLFQVERHAVFMIDSGLVTCVDPS--RRRVVWRSQTAA 531
Query: 560 ----------TDATWSNLPSPSGMTEASTVVPTLKAFSL-RVHDNQQM------------ 596
DA + + + M A + P L A+S R D+ M
Sbjct: 532 GFYGLREAAEADAGMAGMSAKERMHRAVS-FPHLAAYSFYRSQDDSVMSVSGSERYLRTD 590
Query: 597 --ILAGGDQEAVVISP-GGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFV 653
I+A G++ ++S G +L +I L PP ++ D + DG+ D+I++T G+YGFV
Sbjct: 591 PFIIAVGERYMCILSTRTGRVLRTIALEHPPVAPVIVRDLNGDGINDIIVVTKEGIYGFV 650
Query: 654 QTRQPGALFFSTLVGCLIVVMGVIFVTQHLN 684
Q + + L+ ++ ++ V+FV + ++
Sbjct: 651 VGTQTSSDTVTALMILMVALLAVLFVVREMS 681
>gi|303290438|ref|XP_003064506.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454104|gb|EEH51411.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 291
Score = 122 bits (306), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 92/278 (33%), Positives = 132/278 (47%), Gaps = 29/278 (10%)
Query: 418 LHLQEGGLHADINGDGVLDHVQAVGGNGA----EQTVVSGSMEVLRPCWAVATSGVPVRE 473
+ L GLHAD+NGDGV+DHV+ GG G V + CWA TSGVP RE
Sbjct: 1 MSLPSPGLHADVNGDGVVDHVEVFGGGGGGAGRSHAAVGADGRAVPSCWARVTSGVPARE 60
Query: 474 QLFNASICH-------HSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKG 526
LF+ + C H N + + FG + A ++VA P + R+G
Sbjct: 61 ALFDGTACRGHAGVTRHGDRNSYGVDGAAGMFGGKN--ADVDVAPPAAL--------RRG 110
Query: 527 SHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATW---SNLPSPSGMTEASTVVPTL 583
DV+ +RGEVT+Y P D +WQL TDA+W ++ L
Sbjct: 111 EETDVILFNSRGEVTSYGP-----DGKRRWQLRTDASWQRRDDVYDAGYGGGGGGGGGHL 165
Query: 584 KAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVIL 643
+ F L V ++++A G VV+SPGG + ++ LP+ P ++ D + DGL D++
Sbjct: 166 ETFPLAVDGASEVVVALGAARGVVLSPGGYKIATLRLPSVPIAPVLIVDVNGDGLNDIVA 225
Query: 644 MTSNGVYGFVQTRQPGALFFSTLVGCLIVVMGVIFVTQ 681
T+ G Y + Q G L+G LIV M V F +Q
Sbjct: 226 RTALGTYCWTQRGAGGGGPLLVLLGFLIVGMVVAFASQ 263
>gi|440292440|gb|ELP85645.1| hypothetical protein EIN_409540 [Entamoeba invadens IP1]
Length = 625
Score = 122 bits (306), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 149/692 (21%), Positives = 283/692 (40%), Gaps = 103/692 (14%)
Query: 8 ILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDGRKEVLVA 67
+++++ F ++ L + E F SE+Y + PI++D++GDG E+LVA
Sbjct: 7 VVLVACFCLWIFLLKGQSLTQMEQLF--SEKYAANLGSTLNVVPIISDIDGDGLNEILVA 64
Query: 68 THDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVIDRTYRQGQ 127
+ ++ + + V+ + + +P S + + M TG D+ + + +
Sbjct: 65 PYGVGKLIMYKYEKG-------NLVIKQTATIP------SSSQPIYMTTGY-DKPFVRNE 110
Query: 128 PLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLKHGDTGLV 187
P +Q ++VV + V+CF+ +L W N+ ++ + E++ + Y ++ G V
Sbjct: 111 PRQQTVIVVLRNFDVLCFNSDLTLRWTNNVYKEHAL-TYVEEVSGLVVPYVIQTKVNGGV 169
Query: 188 IVGGRMEMQPHTIMDPFEE--IGLAE-------KNAEQHRRSASEKEASENSGTVDLRHF 238
I+ R T D + +G E + E+ + + + + TVD H
Sbjct: 170 ILAFR------TTQDWYNNNNVGFEEDLRVTLNRTFEEQKEDLMYDDDNIDQETVD--HM 221
Query: 239 AFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLD-VHALNSRHPGEFECREFRE 297
+Y + + G + W + ++ + YKL + ++ + E + EF +
Sbjct: 222 NYYCLSIKDGQVVWQHEVDDDQDYLDLVKNTDVAGEYKLSKLFGMSYKGAEEIQWHEFHD 281
Query: 298 SVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKDSTKKI 357
S+ +PH W DT + +SHF R + +Y + E PG K
Sbjct: 282 SIYKNLPHQWASFRDTQITVSHFSR-------DLTSNPETYDNSRLE---MPGMKVQTKT 331
Query: 358 SNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLASGRTVCK 417
I ITN PNVVV HQK G+EAVHL SG+ +
Sbjct: 332 K---------------------IDRITN-------PNVVVIHQKNGLEAVHLFSGKPLVH 363
Query: 418 LHLQE-----GGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWAVATSGVPVR 472
L L+ ADINGD L+ V V E V + ++ +A +
Sbjct: 364 LSLRATSDAGSSAFADINGDDALEEV-FVNQYSHEVDVQNSGEDLTCMVQVIAANS---Y 419
Query: 473 EQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSH-GDV 531
E LF+ + C P F + R D +EV P+L+P + R GS ++
Sbjct: 420 EYLFSENACK-------PAMRF--DLIRKKDKHVIEVLPPLLVPAFE--RKSDGSELYNI 468
Query: 532 VFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLR-V 590
V + + G +TA G++ + W + + + + + +L+ + V
Sbjct: 469 VVINSDGLLTAV-----GYNGKFLWNSKVSRLFLAKEDQTHKSNLALFLYSLENEDQKEV 523
Query: 591 HDNQQMILAGGDQEAVVISPGGSILT--SIDLPAPPTHALVCEDFSNDGLTDVILMTSNG 648
N + I+ G + V++ G ++ +I+ P V D SNDG +++++T+
Sbjct: 524 QPNNRYIIVQGSKNIAVVNLNGDLMVEHNIESQGEPVMGPVFGDLSNDGNNEILIVTATK 583
Query: 649 VYGF-VQTRQPGALFFSTLVGCLIVVMGVIFV 679
+ + V+ Q + + G + ++ I++
Sbjct: 584 IAAYNVRVIQNVSFLPVCIAGIIALISYNIYI 615
>gi|67477875|ref|XP_654372.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56471415|gb|EAL48985.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449703469|gb|EMD43913.1| FGGAP repeat-containing protein [Entamoeba histolytica KU27]
Length = 623
Score = 118 bits (296), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 150/654 (22%), Positives = 260/654 (39%), Gaps = 122/654 (18%)
Query: 51 PIVADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDK---IRIAS 107
PI+ D++GDG ++++A +K + V +G E + EV+L P+K I I S
Sbjct: 48 PIITDIDGDGINDIVLAPFASK----KLSMYSVKKGSLELK--KEVNL-PEKSFPIYITS 100
Query: 108 GRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHH 167
G D+ Y + Q+++V+ + V+CF+ +L W N + A+
Sbjct: 101 GY----------DKKYDNTEKRSQIIIVIMRDFEVICFNPDLTIRWRQNYYKS-KAKAYV 149
Query: 168 REIAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEK--- 224
+E++ + Y ++ G VIV R T D A+K E+ R ++
Sbjct: 150 QEVSAIVVPYDIQTKYQGAVIVAFR------TSYDEMLSGNRADKEFEEDFRVTLQQTFE 203
Query: 225 EASENSGTVDL-------RHFAFYAFAGRSGLLRW----SRKNENIEAQPTDASQLIPQH 273
E E+ D H +YAF+ + G + W K+E +E + ++ +
Sbjct: 204 EQKEDLINDDFDPEDMLKEHMNYYAFSTKDGQVIWQHETDEKDEYLELIEQNTDEI--EK 261
Query: 274 NYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVG 333
K + + + GE + EF ++V MPH W DT + SHF R +V
Sbjct: 262 YKKFKIFSSFTEEFGEIQWHEFHDAVFQNMPHQWTSYYDTKIIPSHFAR-------DMVN 314
Query: 334 KSTSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVP 393
T +P KD K P N I N P
Sbjct: 315 NPTQLI-------NPKTKD--------------------IKYPYNSFDLIKN-------P 340
Query: 394 NVVVAHQKEGIEAVHLASGRTVCKLHL-------QEGGLHADINGDGVLDHVQAVGGNGA 446
NV VAHQK GIE +HL +G+ + LH+ ADI+GD LD V +
Sbjct: 341 NVFVAHQKNGIEVIHLFTGKPL--LHVSLSSTTMSSASAFADIDGDDSLDE---VFTHQF 395
Query: 447 EQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHS-PFNLFPHGEFSRNFGRTSDVA 505
T+ ++ C A + + LF+ + C + F+L R +
Sbjct: 396 AHTLDYSRVDDDISCQVQAMTMGSFK-FLFSQNACTKARRFDL---------LRRKQENV 445
Query: 506 SLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWS 565
+ + P+L+P + H D++ L N G +T+ S ++ +WQ + W
Sbjct: 446 EVIILPPLLVPNQEI-AHDGRPKFDIILLNNLGLMTSVS-----NNGALKWQAEVQSYW- 498
Query: 566 NLPSPSGMTEASTVVPTLKAFSLRVHDNQQ--MILAGGDQEAVVISPGGSILTS--IDLP 621
+L + A T P++ +F N+ I++ G + V+ G+++ D
Sbjct: 499 DLEEKN----AKTFKPSIVSFKFTYGKNETDPFIVSQGSRFVNVVDLNGNVIAKKEFDHI 554
Query: 622 APPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMG 675
P V D +ND D++++T + + + P F + LI+++
Sbjct: 555 CKPIMPPVFGDVNNDKKNDIVIVTDDHIMAYKMEVVPSVEFLPICIASLIILIS 608
>gi|340375465|ref|XP_003386255.1| PREDICTED: hypothetical protein LOC100636155 [Amphimedon
queenslandica]
Length = 803
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 85/334 (25%), Positives = 155/334 (46%), Gaps = 25/334 (7%)
Query: 4 RDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEE-----------YPIKFDADRLPPPI 52
+++ L +S + F L+ + F+ +EE +P++ ++LP PI
Sbjct: 472 QEILFLFISCSVLAFLLRSQDSLEFKLVLNVTTEERAARENYANQKFPLR--NEKLPLPI 529
Query: 53 VADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAV 112
V DL DG+ +V++ + D +VL ++R + + + + ++ R V
Sbjct: 530 VTDLESDGQTDVILVSAD---KVLNVYSRHYTPSYLGHHGIKHSN---KNVSLSKTSRVV 583
Query: 113 AMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAI 172
AM TG + Q KQV+ V+ S W++ C++HNL +W L E P + E ++
Sbjct: 584 AMTTGFLQPYQSVVQVRKQVIAVLYSDWTLSCYNHNLKLMWSQKLNEKEIPIS--SEASL 641
Query: 173 SISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGT 232
+S ++K G+++VGGR+ + G ++K + R +E+E E+
Sbjct: 642 LVSAISIKKSPAGVILVGGRINGNYDNNNNNDNIGGDSDK---RRMRRGAEQEKMEHQKL 698
Query: 233 VDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFEC 292
L H+ YA +G++G L W + + E P L ++KL H H GE
Sbjct: 699 ESLGHYCTYAVSGKTGELLWKHEPGDFEVHPAYDIGLSSFTHFKLLFHKY-LLHEGEVSW 757
Query: 293 REFRESVLGVMPHHWDRREDTLLKLSHFRRHKRK 326
++ ++ +MPH W DT L+L+ F + K+K
Sbjct: 758 HQYSSAIRQIMPHSWTDDFDTKLELAQFNKDKKK 791
>gi|407044533|gb|EKE42653.1| FG-GAP repeat-containing protein [Entamoeba nuttalli P19]
Length = 623
Score = 115 bits (287), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 147/657 (22%), Positives = 255/657 (38%), Gaps = 128/657 (19%)
Query: 51 PIVADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEAR---VLAEVSLLPDK---IR 104
PI+ D++GDG ++ VL P A + +S + VL + LP+K I
Sbjct: 48 PIITDIDGDGINDI----------VLAPFASKKLSMYSVKKGSLVLKKEVDLPEKSFPIY 97
Query: 105 IASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPN 164
I SG D+ Y + Q+++V+ + V+CF+ +L W N +
Sbjct: 98 ITSGY----------DKKYDDTEKRSQIIIVIMRDFEVICFNPDLTIRWRQNYYKS-KAK 146
Query: 165 AHHREIAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEK 224
A+ +E++ + Y ++ G VI R T D A+K E+ R ++
Sbjct: 147 AYVQEVSAIVVPYDIQTKYQGAVIAAFR------TSYDEMLSGNRADKEFEEDFRVTLQQ 200
Query: 225 ---EASENSGTVDL-------RHFAFYAFAGRSGLLRW----SRKNENIEAQPTDASQLI 270
E E+ D H +Y+F+ + G L W K+E +E + ++
Sbjct: 201 TFDEQKEDLINDDFDPEDMLKEHMNYYSFSTKDGQLIWQHETDEKDEYLELIEQNTDEI- 259
Query: 271 PQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKK 330
+ K + + + GE + EF ++V MPH W DT + SHF R
Sbjct: 260 -EKYKKFKIFSSFTEEFGEIQWHEFHDAVFQNMPHQWTSYYDTKIIPSHFAR-------D 311
Query: 331 VVGKSTSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLW 390
+V T +P KD K P N I N
Sbjct: 312 MVNNPTQLI-------NPKTKD--------------------IKYPYNSFDLIKN----- 339
Query: 391 WVPNVVVAHQKEGIEAVHLASGRTVCKLHL-------QEGGLHADINGDGVLDHVQAVGG 443
PNV V HQK GIE +HL +G+ + LH+ ADI+GD LD +
Sbjct: 340 --PNVFVVHQKNGIEVIHLFTGKPL--LHVSLSSSTLSSASAFADIDGDDSLDE---IFT 392
Query: 444 NGAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHS-PFNLFPHGEFSRNFGRTS 502
+ T+ ++ C A + + LF+ + C + F+L R
Sbjct: 393 HQFAHTLDYSRVDDDISCQVQAMTMGSFK-FLFSQNACTKARRFDL---------LRRKQ 442
Query: 503 DVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDA 562
+ + + P+L+P + H D++ L N G +T+ S ++ +WQ +
Sbjct: 443 ENVEVIILPPLLVPNQEI-AHDGRPKFDIILLNNLGLMTSVS-----NNGALKWQAEVQS 496
Query: 563 TWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQ--MILAGGDQEAVVISPGGSILT--SI 618
W +L + A T P++ +F N+ I++ G + V+ G+++
Sbjct: 497 YW-DLEEKN----AKTFKPSIVSFKFTYGKNETDPFIVSQGSRFVNVVDLNGNVIAKKEF 551
Query: 619 DLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMG 675
D P V D +ND D++++T + + + P F + LI+++
Sbjct: 552 DHICKPIMPPVFGDVNNDKKNDIVIVTDDHIMAYKMEVVPSVEFLPICIASLIILIS 608
>gi|413951379|gb|AFW84028.1| hypothetical protein ZEAMMB73_488017 [Zea mays]
Length = 90
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 53/88 (60%), Positives = 66/88 (75%), Gaps = 6/88 (6%)
Query: 455 MEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPIL 514
MEVL+PCWAVA S +PV EQLFN SICH++ FN+F HG+FSR FG T D LEV T
Sbjct: 1 MEVLKPCWAVARSDMPVWEQLFNVSICHYNHFNMFHHGDFSRIFGITFDTIGLEVVT--- 57
Query: 515 IPRSDGHRHRKGSHGDVVFLTNRGEVTA 542
DGH+HR+GSHGD++FLT+ GE++
Sbjct: 58 ---DDGHKHRRGSHGDIIFLTSPGELSV 82
>gi|361069131|gb|AEW08877.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
gi|383168737|gb|AFG67480.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
gi|383168739|gb|AFG67481.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
gi|383168741|gb|AFG67482.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
gi|383168743|gb|AFG67483.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
gi|383168745|gb|AFG67484.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
gi|383168747|gb|AFG67485.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
gi|383168749|gb|AFG67486.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
gi|383168751|gb|AFG67487.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
gi|383168753|gb|AFG67488.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
gi|383168755|gb|AFG67489.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
gi|383168757|gb|AFG67490.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
gi|383168759|gb|AFG67491.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
gi|383168761|gb|AFG67492.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
gi|383168763|gb|AFG67493.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
gi|383168765|gb|AFG67494.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
gi|383168767|gb|AFG67495.1| Pinus taeda anonymous locus CL2087Contig1_01 genomic sequence
Length = 68
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 51/66 (77%), Positives = 60/66 (90%)
Query: 510 ATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPS 569
ATPILIPR+DGH+HRKGSHGDV+FLT+RGEVT+Y PG+HG AI +WQLLT ATWSNLPS
Sbjct: 1 ATPILIPRNDGHKHRKGSHGDVIFLTSRGEVTSYFPGIHGQGAIRRWQLLTGATWSNLPS 60
Query: 570 PSGMTE 575
P+G+ E
Sbjct: 61 PAGIVE 66
>gi|167393089|ref|XP_001740420.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165895471|gb|EDR23151.1| hypothetical protein EDI_031320 [Entamoeba dispar SAW760]
Length = 610
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 148/656 (22%), Positives = 247/656 (37%), Gaps = 139/656 (21%)
Query: 51 PIVADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEAR---VLAEVSLLPDK---IR 104
P+V D++GDG ++ VL P A + +S + VL + LP+K I
Sbjct: 48 PLVTDIDGDGINDI----------VLTPFASKKLSIYSVKKGSLVLKKEVDLPEKSFPIY 97
Query: 105 IASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPN 164
I SG D+ Y + Q+++V+ + V+CF+ +L W N + P
Sbjct: 98 ITSGY----------DKQYNNTEKRSQIIIVIMRDFEVICFNPDLTIRWRQNY---YKPK 144
Query: 165 A--HHREIAISISNYTLKHGDTGLVIVGGRM----EMQPHTIMDPFEE---IGLAEKNAE 215
A + +E++ I Y ++ G VI R + + + FEE + L + E
Sbjct: 145 AKAYVQEVSAIIVPYEIQTQYQGAVIAAFRTSYDENLSGNRVDKEFEEDFRVTLQQTFEE 204
Query: 216 QHRRSASEKEASENSGTVDLRHFAFYAFAGRSGLLRW----SRKNENIEAQPTDASQLIP 271
Q ++ E+ H +YAF+ + G L W K+E +E + ++
Sbjct: 205 QKEDLVNDDFDPEDMLK---EHMNYYAFSTKDGQLIWQHETDEKDEYLELIEQNTDEIEK 261
Query: 272 QHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKV 331
+K + + + GE + EF ++V MPH W DT
Sbjct: 262 YKKFK--IFSSFTEEFGEIQWHEFHDAVFQNMPHQWTSYYDT------------------ 301
Query: 332 VGKSTSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWW 391
K+ S+ F K P N I N
Sbjct: 302 --KNNSFSF---------------------------CTRYDIKYPYNSFDLIKN------ 326
Query: 392 VPNVVVAHQKEGIEAVHLASGRTVCKLHL-------QEGGLHADINGDGVLDHVQAVGGN 444
PNV V HQK GIE +HL +G+ + LH+ ADI+GD LD V +
Sbjct: 327 -PNVFVVHQKNGIEVIHLFTGKPL--LHVSLSSSTMSSASAFADIDGDDSLDE---VFTH 380
Query: 445 GAEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHS-PFNLFPHGEFSRNFGRTSD 503
T+ ++ C A + + + LF+ + C S F+L R +
Sbjct: 381 QIAHTLDYSKVDDDISCQVQAMT-MGNFQFLFSQNACTKSRRFDL---------LRRREE 430
Query: 504 VASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDAT 563
+ + P+LIP + H D++ L N G +T+ S ++ +WQ +
Sbjct: 431 STEVIILPPLLIPNQEI-AHDGRPKFDIILLNNLGLMTSVS-----NNGALKWQAEVQSY 484
Query: 564 WSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQ--MILAGGDQEAVVISPGGSILT--SID 619
W A T P L +F N+ I++ G + V+ G+++ D
Sbjct: 485 WD-----IEEKNAKTFKPALVSFKFTYGKNETDPFIVSQGSRFVNVVDLNGNVIAKKEFD 539
Query: 620 LPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMG 675
P V D +ND D+I++T + + + P F + LIV+M
Sbjct: 540 HICKPIMPPVFGDVNNDKKNDIIIVTDDHIMAYKMEVVPSVEFLPICIASLIVLMS 595
>gi|167378134|ref|XP_001734686.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165903708|gb|EDR29152.1| hypothetical protein EDI_127180 [Entamoeba dispar SAW760]
Length = 622
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 148/699 (21%), Positives = 269/699 (38%), Gaps = 129/699 (18%)
Query: 8 ILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIK--FDADRLPPPIVADLNGDGRKEVL 65
I+++ F ++ L S E F EYP + + ++ P++ D++GDG+ +++
Sbjct: 7 IILILCFVGWYFLIKNQSISSLELLF----EYPYQGNWQSELNAVPLITDIDGDGKNDLV 62
Query: 66 VATH-DAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMATGVIDRTYR 124
VA KI + F+ ++ L ++ + + A+ + +G D+ Y+
Sbjct: 63 VAPFMSQKISIY---------SFNHGNLI-----LKKEVSLPNNTNAIYLTSGY-DKKYK 107
Query: 125 QGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAISISNYTLKHGDT 184
+ +QV++VV + V C+ +L W+ + + +E++ + Y ++
Sbjct: 108 ENLRREQVIIVVLRNFEVRCYTSDLKLRWKQQVYSSIA-QPYVQEVSAVVIPYQIQTKYQ 166
Query: 185 GLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASE-NSGTVD--------L 235
G VI R + + A+ EQ R + K E N ++D +
Sbjct: 167 GAVITAFRTSNDKAFVGNR------ADTAFEQDERYSFGKPIFEQNEISMDDGGFVPELI 220
Query: 236 RHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNY-------KLDVHALNSRHPG 288
H +YAF+ + G + W + ++ + + QLI Q +Y K+D L+
Sbjct: 221 EHMNYYAFSLKDGQVIWQHERDD---EDQNDLQLIHQEDYIEVYKNLKMDKSILHDFDTV 277
Query: 289 EFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHP 348
++ EF + + +PH W DT + +HF R ++ T Y
Sbjct: 278 QW--HEFHDCIFQNLPHIWASYYDTHIVPAHFSR-------DIINNPTEYI--------- 319
Query: 349 PGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVH 408
+P + I N L PNV+V H GIE +H
Sbjct: 320 --------------------------RPNHSIYNSFNQFDLIKNPNVLVIHHMNGIEVIH 353
Query: 409 LASGRTVCKLHLQEGGL-----HADINGDGVLDHVQA---VGGNGAEQTVVSGSMEVLRP 460
L +G+ + + L + ADI+GD L+ V + N E+T E
Sbjct: 354 LFTGKPLLHVTLLTSTISAASSFADIDGDDALNEVYTHTNLQYNELERT------EDDYS 407
Query: 461 CWAVATSGVPVREQLFNASICHHSP-FNLFPHGEFSRNFGRTSDVASLEVATPILIPRSD 519
C+ AT V + LF+ ++C FNLF S+ ++ P+LIP SD
Sbjct: 408 CYVHATV-VNTNDDLFSFNLCEQQKRFNLFRK-------SLNSEKQFFQILPPLLIP-SD 458
Query: 520 GHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTV 579
G H D+ L + G +TA LH D W W + +T
Sbjct: 459 GKAHDGRPTYDIYTLNSNGLLTA----LHN-DGTLLWYKFLPVNWE-----LNQLDLTTF 508
Query: 580 VPTLKAFSLRVHDN-QQMILAGGDQEAVVISPGGSILTSIDLPA--PPTHALVCEDFSND 636
P+ + F I GD E + G+++ + L P + D +ND
Sbjct: 509 KPSFQLFPYSYKGQVSSFIYVQGDNEICISDLKGNLIGTEKLKTKMKPIMPPIFGDLTND 568
Query: 637 GLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMG 675
TD++ +T + + + P F + ++++M
Sbjct: 569 MKTDILFITEHSLVAYRFELLPSIEFLPICISFIVLMMS 607
>gi|440298618|gb|ELP91249.1| hypothetical protein EIN_151760 [Entamoeba invadens IP1]
Length = 630
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 131/650 (20%), Positives = 257/650 (39%), Gaps = 107/650 (16%)
Query: 51 PIVADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRR 110
P++ D++GDG E++VA++ + ++ + + R++ L I
Sbjct: 48 PLITDIDGDGENELVVASYTSHKVIMYKYQK--------GRLVVNKEL-----EIPGQMF 94
Query: 111 AVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREI 170
+ M++G + Y +G+ KQ++V+V + V+C + +L W N + A +E+
Sbjct: 95 PIGMSSGY-ETPYVEGERRKQIVVIVMRDFEVICLNSDLTIRWRQNAYKA-KAQAFVQEV 152
Query: 171 AISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNA--------EQHRRSAS 222
A+S+ Y L++G G VI R + +E+ ++ EQ A+
Sbjct: 153 AVSVIPYNLQNGYKGAVITAFRTSNDKGFARNRADEVFEEDERVTTRGVTFDEQRDDMAN 212
Query: 223 EKEASENSGTVDLR-HFAFYAFAGRSGLLRWSRKNENIEA-----QPTDASQLIPQHNYK 276
E+ +E + +L+ H +YAF + G + W + + E+ + D +L
Sbjct: 213 EQNVNEVGDSPELKEHMNYYAFHVKDGQVIWQHEMDEEESTLDLVKNADEVELFKS---- 268
Query: 277 LDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKST 336
+ + A GE + EF +++ +PH W DT ++ SH R +V T
Sbjct: 269 VKMKAEKGEGLGEVQWHEFHDALFQHLPHSWSSFFDTKIEYSHLSR-------DIVNNPT 321
Query: 337 SYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVV 396
Y K I L S + P N I I N PN
Sbjct: 322 EY----------------KNIKEL---------SKRKNTPHNDIDLIKN-------PNTF 349
Query: 397 VAHQKEGIEAVHLASGRTVCKLHL-----QEGGLHADINGDGVLDHVQA-VGGNGAEQTV 450
+ H G+E +HL +G+ V + L D++G+ VLD + E T
Sbjct: 350 LIHHMNGVEVIHLFTGKPVFHITLLPSTVSSASSFVDLDGNDVLDEIYTHAYLLDYETTT 409
Query: 451 VSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVA 510
V C+ T R +F+ ++C + + ++ GR S +++
Sbjct: 410 VDNDFS----CYLHVTEANSYR-SVFSQNVCEDKKISFWA----PKSKGRVS--KKVQIL 458
Query: 511 TPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSP 570
P+L+ + + + + D+ + + G + A S + + +W + +P+
Sbjct: 459 PPLLVESGEVTKEGRKIY-DIYSVNSNGLLVAIS-----NKGVKKWS-------AQIPTK 505
Query: 571 SGM--TEASTVVPTLKAFSLRVHDNQQM--ILAGGDQEAVVISPGGSILTSIDL-PAPPT 625
G+ E +T P+ F N + +L GD V+ G+++ + P
Sbjct: 506 WGLNQVEMNTFKPSSIQFHYTYGQNLTVPYLLVQGDSSFSVLDLMGNVVAFKEFGKVSPI 565
Query: 626 HALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMG 675
+ D + +DVI+++ + + GF P A F + ++ ++
Sbjct: 566 MPPIVGDIDGNTRSDVIIVSESYIMGFALEIVPSAKFIPVCIAGILALIA 615
>gi|407034104|gb|EKE37060.1| FG-GAP repeat-containing protein [Entamoeba nuttalli P19]
Length = 622
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 141/651 (21%), Positives = 253/651 (38%), Gaps = 117/651 (17%)
Query: 51 PIVADLNGDGRKEVLVATH-DAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGR 109
P++ D++GDG+ +++VA KI + F++ ++ L + I +
Sbjct: 48 PLITDIDGDGKNDLVVAPFMSQKISIY---------SFNQGELI-----LKKETSIPNNT 93
Query: 110 RAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHRE 169
+ + +G D+ Y++ +QV++VV + V C+ +L W+ + + +E
Sbjct: 94 NVIYLTSGY-DKEYKENLRREQVIIVVLRNFEVRCYTSDLKLRWKQQIYSSLA-QPYVQE 151
Query: 170 IAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEK---EA 226
++ + Y ++ G VI R + + A+ E R K E
Sbjct: 152 VSGVVIPYQIQTKYQGAVITAFRTSNDKAFVGNR------ADTAFEWDERDNFNKPIIEQ 205
Query: 227 SENS----GTVD--LRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVH 280
+ENS G V + H +YAF+ + G + W + ++ + + QL+ Q + L+V+
Sbjct: 206 NENSMDDGGFVPELIEHMNYYAFSIKDGQIIWQHERDD---EDQNDLQLVHQED-NLEVY 261
Query: 281 A---LNSRHPGEFEC---REFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGK 334
+ +F+ EF + + +PH W DT + +HF R ++
Sbjct: 262 KNLKMGKSILHDFDTIQWHEFHDCIFQNLPHTWSSYYDTHIVPAHFSR-------DIINN 314
Query: 335 STSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPN 394
T Y HP K + Y N I N PN
Sbjct: 315 PTEYI-------HP--------------KHSIYNS-------FNQFDLIKN-------PN 339
Query: 395 VVVAHQKEGIEAVHLASGRTVCKLHLQEGGL-----HADINGDGVLDHVQAVGGNGAEQT 449
V+V H GIE +HL +G+ + + L + ADI+GD L+ V +
Sbjct: 340 VLVVHHMNGIEVIHLFTGKPLLHVTLLTSTISAASSFADIDGDDELNEVYTHANLQYSEL 399
Query: 450 VVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSP-FNLFPHGEFSRNFGRTSDVASLE 508
+G C+ AT V + LF ++C FNLF ++ +
Sbjct: 400 EQTGDD---YSCYVHATV-VNTNDDLFLFNLCEQQQRFNLFRK-------QSNTEKQFFQ 448
Query: 509 VATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLP 568
V P+LIP ++G H D+ L N G +TA LH D W + TW
Sbjct: 449 VLPPLLIP-NNGKAHDGRPTYDIYTLNNHGLLTA----LHN-DGTLLWNKMLPVTWE--- 499
Query: 569 SPSGMTEASTVVPTLKAFSLRVHDNQ--QMILAGGDQEAVVISPGGSILTSIDLPA--PP 624
+ T P+ F + NQ I GD E ++ G+++ S +L P
Sbjct: 500 --LNQVDLITFKPSFHLFQYS-YKNQTSSFIYIQGDNEICILDLNGNLIGSENLKTQMKP 556
Query: 625 THALVCEDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMG 675
+ D +ND TD++ +T + + + P F + +++++
Sbjct: 557 IMPPIFGDLTNDMKTDILFITEHSLVAYHFELLPSIEFLPICISFIVLMIS 607
>gi|308813626|ref|XP_003084119.1| unnamed protein product [Ostreococcus tauri]
gi|116056002|emb|CAL58535.1| unnamed protein product, partial [Ostreococcus tauri]
Length = 205
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 68/204 (33%), Positives = 100/204 (49%), Gaps = 23/204 (11%)
Query: 289 EFECREFRESVL-GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHH 347
E C +FRES++ +PH W DT + L+HF+RHK ++ + K ++
Sbjct: 8 ERSCGDFRESIVEDALPHQWRHPSDTKMSLAHFKRHK--------SRANAMKRDKESKNR 59
Query: 348 PPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNY----TQLWWVPNVVVAHQKEG 403
+D ++N + A A P+ T VPNV+V+H EG
Sbjct: 60 RQRRDRDVVVANAATRTVGAAVDALRGTPMTRRAKRRARDARATSSDDVPNVIVSHHAEG 119
Query: 404 IEAVHLASGRTVCKLHLQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRP-CW 462
++ +HL SG +C L+L+ GLH DI+GDGV+DHV+A G GS+ P CW
Sbjct: 120 VDILHLHSGDVICSLYLKSPGLHVDIDGDGVMDHVEAHG---------RGSVGSDLPACW 170
Query: 463 AVATSGVPVREQLFNASICHHSPF 486
A TSGVP E+ +AS+ F
Sbjct: 171 ATVTSGVPSEERALSASMSRRKRF 194
>gi|67471534|ref|XP_651715.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56468487|gb|EAL46329.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449705497|gb|EMD45528.1| FGGAP repeat-containing protein [Entamoeba histolytica KU27]
Length = 622
Score = 102 bits (254), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 132/645 (20%), Positives = 253/645 (39%), Gaps = 105/645 (16%)
Query: 51 PIVADLNGDGRKEVLVATH-DAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGR 109
P++ D++GDG+ +++VA KI + F++ ++ L + I +
Sbjct: 48 PLITDIDGDGKNDLVVAPFMSQKISIY---------SFNQGKLT-----LKKETSIPNTT 93
Query: 110 RAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHRE 169
+ + +G ++ Y++ +QV++VV + V C+ +L W+ + + +E
Sbjct: 94 NVIYLTSGY-EKEYKENLRREQVIIVVLRNFEVRCYTSDLKLRWKQQIYSSLA-QPYVQE 151
Query: 170 IAISISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASEN 229
++ + Y ++ G VI R + + + ++ + ++ + E S +
Sbjct: 152 VSGVVIPYQIQTKYQGAVITAFRTSNDKAFVGNRADTAFEWDERNDFNKPIIEQNENSMD 211
Query: 230 SGTVD---LRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKLDVHA---LN 283
G + H +YAF+ + G + W + ++ + + QL+ Q + L+V+ +
Sbjct: 212 DGGFVPELIEHMNYYAFSIKDGQVIWQHERDD---EDQNDLQLVHQED-NLEVYKNLKMG 267
Query: 284 SRHPGEFEC---REFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPF 340
+F+ EF + + +PH W DT + +HF R ++ T Y
Sbjct: 268 KSILHDFDTIQWHEFHDCIFQNLPHTWSSYYDTHIVPAHFSR-------DIINNPTEYI- 319
Query: 341 HKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQ 400
HP K + Y N+ I N PNV+V H
Sbjct: 320 ------HP--------------KHSIYNS-------FNHFDLIKN-------PNVLVVHH 345
Query: 401 KEGIEAVHLASGRTVCKLHLQEGGL-----HADINGDGVLDHVQAVGGNGAEQTVVSGSM 455
GIE +HL +G+ + + L + ADI+GD L+ V + +G
Sbjct: 346 MNGIEVIHLFTGKPLLHVTLLTSTISAASSFADIDGDDELNEVYTHANLQYSELEQTGDD 405
Query: 456 EVLRPCWAVATSGVPVREQLFNASICHHSP-FNLFPHGEFSRNFGRTSDVASLEVATPIL 514
C+ AT V + LF ++C FNLF ++ +V P+L
Sbjct: 406 ---YSCYVHATV-VNTNDDLFLFNLCEQQQRFNLFRK-------QSNTEKQFFQVLPPLL 454
Query: 515 IPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMT 574
IP ++G H D+ L+N G +TA LH D W + W
Sbjct: 455 IP-NNGKAHDGRPTYDIYTLSNNGLLTA----LHN-DGTLLWNKMLPVNWE-----LNQV 503
Query: 575 EASTVVPTLKAFSLRVHDNQ--QMILAGGDQEAVVISPGGSILTSIDLPA--PPTHALVC 630
+ T P+ F + NQ I GD E ++ G+++ S +L P +
Sbjct: 504 DLITFKPSFHLFQYS-YKNQTSSFIYIQGDNEICILDLNGNLIGSENLKTKMKPIMPPIF 562
Query: 631 EDFSNDGLTDVILMTSNGVYGFVQTRQPGALFFSTLVGCLIVVMG 675
D +ND TD++ +T + + + P F + +++++
Sbjct: 563 GDLTNDMKTDILFITEHSLVAYHFELLPSIEFLPICISFIVLMIS 607
>gi|219112351|ref|XP_002177927.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410812|gb|EEC50741.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 765
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 102/416 (24%), Positives = 177/416 (42%), Gaps = 85/416 (20%)
Query: 295 FRESVL--GVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPGKD 352
+R S+L GV+PH + +D + HF K P E GKD
Sbjct: 379 YRRSLLTTGVLPHSYWSNDDATVTAVHFTHQK-----------------MPAEAKLKGKD 421
Query: 353 STK-KISNLIGKAATYAGSAKSKKPVNYIPTITNYTQL---WWVPNVVVAHQKEGIEAVH 408
K K+ + K T K +PT + + PNV+V H + GI
Sbjct: 422 KLKPKMGGGLSKGITI--KPKQTLSAKLLPTKKRRRKRLPHFGRPNVLVLHNQHGIHVRS 479
Query: 409 LASGRTVCKLHLQEGGLHADINGDGVLDHVQA-VGGNG---------------------- 445
+ +G ++C + L + L+AD+N DGVLD VQ VGG+
Sbjct: 480 IKNGMSLCHVSLSDDTLYADLNHDGVLDSVQVLVGGHADNLEDSTDDGYKFVSQLVKRVG 539
Query: 446 ------AEQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSRNFG 499
A+ + S + R C A+A SG+P +E+LF+ S+C S
Sbjct: 540 DLGQEDAKVEIASRNRVRSRLCHAMALSGIPTKEELFSVSLC-------------SNKDD 586
Query: 500 RTSDVASLEVATPILIPRSDGHRHRKGSHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLL 559
R +A + P+ I +G + D++ + G+++ + G+ G +WQ +
Sbjct: 587 RDESIAG---SPPLAIESLNGKGY------DLIVALSNGQISRFR-GVSGS-RVWQ---I 632
Query: 560 TDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILAGGDQEAVVISPGGSILTSID 619
+ + + P+ T S + ++A + +++AG + A++ S G++L +
Sbjct: 633 SGRRFEDFPTWDDTTLVS--IDRIEA-EFAQPATRPILIAGENSMALLASRKGTVLATAS 689
Query: 620 LPAPPTHALVCEDFSNDGLTDVILMTSNGVYGF-VQTRQPGALFFSTLVGCLIVVM 674
P P + DF+ DG TDV++MT + ++G+ V + ++ F VG L+V M
Sbjct: 690 FPQPSMRKPLLMDFNGDGTTDVMVMTQDAIWGYRVVVKTGTSILFRITVGLLLVAM 745
>gi|224005014|ref|XP_002296158.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|209586190|gb|ACI64875.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1072
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 114/448 (25%), Positives = 192/448 (42%), Gaps = 101/448 (22%)
Query: 291 EC-REFRESVL----GVMPH-HWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPE 344
EC FR SVL G +PH +WD E + + F R+K K G + PF K
Sbjct: 622 ECISHFRSSVLDEESGALPHEYWDDGEYGSISVGRFERNK-----KSSGGKRTKPFGK-- 674
Query: 345 EHHP------------------PGKDST----KKISNLIGKAATYAGSA-KSKKPVNYIP 381
+ +P G +T K+ S+++G G + +S +P
Sbjct: 675 KPNPLSTASGVLGGGARAGTTNAGSSATARQMKRSSDVVGSGGIAGGKSWQSDLFHRSVP 734
Query: 382 TITNYTQLWWV-------PNVVVAHQKEGIEAVHLASGRTVCKLHLQEGGLHADINGDGV 434
Q + PNV++ H ++G+ + L +GR VC + L + L+AD++ DG+
Sbjct: 735 QRLISQQRYNAFHPRSGKPNVIIFHGRDGLAVLSLKNGRPVCHISLMDHALYADLDKDGI 794
Query: 435 LDHVQ------AVGGNGAEQTVVS-------------------GSMEVLRPCWAVATSGV 469
+D VQ A+ +G Q+++ ++ C A+ TSG+
Sbjct: 795 IDMVQVVTSPEALSKSGGIQSLIERVTKSDNDNDEGRKTRERRTKLDAPVVCHALVTSGL 854
Query: 470 PVREQLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSHG 529
P RE++F A +C P S + R A L A P+L+ S G+ +
Sbjct: 855 PPREEVFTAPLCLGGPL-------MSIDPKRPQ--AGLSAAPPLLVEGSLGYGN------ 899
Query: 530 DVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPS---GMTEASTVVPTLKAF 586
DVVF N G V Y +G + +W+ L D T S S + G E V
Sbjct: 900 DVVFAMNNGVVVRYDS--NGRE-VWRKGGLKDGTPSWKASSNAFLGRIEFGAVKSHSSVS 956
Query: 587 SLRVHDNQQ-------MILAGGDQEAVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLT 639
+ + + ++L+G D A++ S G++L+S+ P + D + DG
Sbjct: 957 ASSHRNQHRPGSPVRPILLSGEDGAALISSASGNVLSSVVYPQSVVAQPMLSDLNGDGTD 1016
Query: 640 DVILMTSNGVYGF---VQTRQPGALFFS 664
D+++++++G++G+ VQT + G FFS
Sbjct: 1017 DLLVVSADGIWGYRVVVQTGRSG--FFS 1042
>gi|196003484|ref|XP_002111609.1| predicted protein [Trichoplax adhaerens]
gi|190585508|gb|EDV25576.1| predicted protein [Trichoplax adhaerens]
Length = 611
Score = 94.0 bits (232), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 113/465 (24%), Positives = 191/465 (41%), Gaps = 82/465 (17%)
Query: 47 RLPPPIVADLNGDGRKEVLVATHDAKIQVLEPHARRVDEG-------FSEARVLAEVSLL 99
+L PI+ DL+GDG +++++AT D K++ ++ ++ E + E++L
Sbjct: 53 QLLDPIITDLDGDGNRDIIIATSDCKVRRSNSNSNKLLHATDLEQLPIPEIQFENEMALS 112
Query: 100 PDKIRIASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQE 159
D +A++ G T +K+V+ VVT+ W+V DH LN LW++++ E
Sbjct: 113 DD-----VTCHPIALSVGY---TNNMVHAMKKVIAVVTNDWNVYLLDHQLNLLWKSSIDE 164
Query: 160 DFPPNAHHREIAIS-ISNYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHR 218
A+ IA + IS + +K+ D G+VI+ P T +E+ + +
Sbjct: 165 KLEA-ANDSIIATAMISPHPIKNQDHGVVIIARF----PLTNQFTNQELSVRHDHGWTIE 219
Query: 219 RSASEKEASENSGTVDL------------RHFAFYAFAGRSGLLRWSRK-NENIEAQPTD 265
+E +S +G VD+ H YA +G LRW + +N+ + D
Sbjct: 220 DVIAENVSSYRAG-VDVGSTHRKDTHGKSVHCTIYALDLTTGELRWKHQPGDNVGLEMDD 278
Query: 266 ASQLIPQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKR 325
S L H+YKL + H C ++ ++ +PH W R D L + H +
Sbjct: 279 LS-LFYGHHYKLGLRKDILCHRRSI-CGTYKMDIIRNLPHRWSSRRDAKLIVD---LHSK 333
Query: 326 KILKKVVGKSTSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITN 385
K K HP I A S ++ + I
Sbjct: 334 K---------------KKGNKHP------------IFIAGFDVSSTRTNDDLAEWLRIVR 366
Query: 386 YTQLWWVPNVVVAHQKEGIEAVHLASGRTVCKLHL--QEGGLHADINGDGVLDHVQAVGG 443
+ N+++ H + IE HL+SG+ VC L L Q+ DIN DG++D +
Sbjct: 367 IAK---AANILIIHTHQSIEIYHLSSGKLVCWLPLENQQSSTLEDINNDGLIDSLSDTSE 423
Query: 444 -NGAEQTVVSGSMEVLRPCWAVATS---------GVPVREQLFNA 478
++ SG ++L P +TS G+ EQ FN
Sbjct: 424 LEDLDRVATSGDDDMLPPITFKSTSQNAPLINWHGLLDSEQGFNT 468
>gi|299116071|emb|CBN74487.1| aldehyde dehydrogenase, putative [Ectocarpus siliculosus]
Length = 724
Score = 94.0 bits (232), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 94/341 (27%), Positives = 137/341 (40%), Gaps = 88/341 (25%)
Query: 401 KEGIEAVHLASGRTVCKLHL-----QEGGLHADINGDGVLDHVQAVGGNGA--------- 446
++G+EAV L++G+ + + L + G++ D+NGDGV+DHVQAVG G
Sbjct: 343 RDGLEAVELSTGKPLSAVALPAAAGTDAGVYVDLNGDGVVDHVQAVGTRGGQGWGHLHEG 402
Query: 447 --------EQTVVSGSMEVLRPCWAVATSGVPVREQLFNASICHHSPFNLFPHGEFSR-N 497
E T + L PC+A+A SG+P REQLFNAS+CH + S+ N
Sbjct: 403 VAMGHLSEEPTPSQPARRFLPPCYALAVSGLPPREQLFNASLCHDAGALFEVQERLSKPN 462
Query: 498 FGRTSDVASLEV--ATPILIPRSDGHRHRKG----------------------------- 526
+S LEV A+P +IPR
Sbjct: 463 SRSSSRSLDLEVGAASPAIIPRDSAGVAAAAATGAATARSSSGAGRGARSGGGGGGGGVG 522
Query: 527 ------SHGDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSG--MTEAST 578
+ D VF + G V++Y + +WQ W+ + G
Sbjct: 523 GWGEGPARFDAVFAVSSGVVSSYD-----DEGRLKWQDRRGPKWTRKDTEGGDPSNGGGY 577
Query: 579 VVPTLKAFSLRVH--------------DNQQMILAGGDQEAVVISPGGSILTSIDLPAPP 624
VVP F+L V ++ IL G + V GG + S L APP
Sbjct: 578 VVP----FTLEVGWPGRGRGRIINPSPGAEERILVVGQDKMCVYDRGGRLAGSTPLAAPP 633
Query: 625 THALVCEDFSNDGLTDVILMT-SNGVYGFVQTRQPG--ALF 662
+ V DF DG+ DV+++ S V G+ PG A+F
Sbjct: 634 SQRPVLGDFDGDGVADVLVVGWSGAVAGYALAPDPGVRAMF 674
Score = 46.2 bits (108), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 66/252 (26%), Positives = 99/252 (39%), Gaps = 49/252 (19%)
Query: 48 LPPPIVADLNGDGRKEVLV-ATHDAKIQVLE-PHARRVDEGFSEARVLAEVSLLPDKIRI 105
+P P+V DL+GDG+ EV+V A I+VL P A E + ++ L P + R+
Sbjct: 15 VPLPLVTDLDGDGKNEVVVLADGGMLIRVLSVPVAS--GETLVDPWIIHSTELRPTR-RL 71
Query: 106 ASGRRAVAMATGVIDRTYRQGQPLKQVLVVVTSGW-----------SVMCFDHNLNKLWE 154
RAVAMA G ++ P V SV CF+H L +LW
Sbjct: 72 GRDERAVAMAAGYLEPPVNARDPGSSRQQQVQQRQQQRIVVVGEDSSVTCFNHQLGRLWT 131
Query: 155 ANLQEDFPPNAHHR-----EIAISISNYTLKHGD---------TGLVIVGGRMEMQ---- 196
+L + + ++AI++++ K D GL++VG M +
Sbjct: 132 TSLAHEGSSSGEGGSFVIDQVAITVAHSVRKLDDRGRPKPGTGEGLIVVGISMRHRDGRF 191
Query: 197 -------------PHTIMDPFEEIGLAEKNAEQHRRSA--SEKEASENSGTVDLRHFAFY 241
P DP A E+ R EK + E G HF+ +
Sbjct: 192 HHSRAGRDPSGGSPVDGADPLGVHVEAFVEGEEDSRDTVEEEKMSEEEKGRRAAEHFSLF 251
Query: 242 AFAGRSGLLRWS 253
A G +G +RWS
Sbjct: 252 ALDGETGEVRWS 263
>gi|307103153|gb|EFN51416.1| hypothetical protein CHLNCDRAFT_59245 [Chlorella variabilis]
Length = 925
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 65/98 (66%), Gaps = 7/98 (7%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFH-LSEEYPIKFDADR----LPPPIVAD 55
M KRD IL+LS A++ SLQHEG + F++AW+H S+ P + + LPPP+VAD
Sbjct: 1 MYKRDFGILLLSVCALYLSLQHEGAYQFKKAWYHSFSDHEPALHELEEHPQPLPPPLVAD 60
Query: 56 LNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARVL 93
LNGDG+ EV+V T ++Q+L P RR +GF++A +
Sbjct: 61 LNGDGKPEVVVVTPTGRVQLLAP--RRFGDGFAKAEAI 96
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 33/59 (55%)
Query: 266 ASQLIPQHNYKLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHK 324
A L+ QH Y + +RH GE CR++R SVL +PH W DT L +HF +H+
Sbjct: 740 ADALVVQHAYHMSAEESETRHYGEASCRDYRASVLAALPHSWHSPLDTALVPAHFEKHR 798
Score = 40.4 bits (93), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 18/22 (81%), Positives = 20/22 (90%)
Query: 393 PNVVVAHQKEGIEAVHLASGRT 414
PNVVVAH +EGIEA+HL SGRT
Sbjct: 879 PNVVVAHLEEGIEAIHLFSGRT 900
>gi|356514228|ref|XP_003525808.1| PREDICTED: uncharacterized protein LOC100794105 [Glycine max]
Length = 308
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/64 (62%), Positives = 44/64 (68%)
Query: 291 ECREFRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPEEHHPPG 350
E R + W +REDTL K +HFRRHKRK LKK GKS SYPFHKPEE+HPPG
Sbjct: 240 EVRPLEAERFKALRIQWAQREDTLFKFAHFRRHKRKALKKTPGKSISYPFHKPEENHPPG 299
Query: 351 KDST 354
KDST
Sbjct: 300 KDST 303
>gi|401427313|ref|XP_003878140.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494387|emb|CBZ29688.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 727
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 170/738 (23%), Positives = 281/738 (38%), Gaps = 157/738 (21%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPI---KFDADRLPPPIVADLN 57
MR RDL I+ +S EG +F+ + PI + +P PI+ D +
Sbjct: 1 MRIRDLVIIGVSFVCSIICWSQEGSLNFQRGFM-----IPILDNELTEVVMPKPILVDPD 55
Query: 58 GDGRKEVLVATHDAKIQVLEPHA--RRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMA 115
G G + +L +T + + + RR+D F AE+++ + I +G
Sbjct: 56 GSGSRVLLSSTTVGYLNIYNTYYTRRRIDGSFVRLNPSAELNVYTPIVGIGAG------- 108
Query: 116 TGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNL---NKLWEANL-QEDFPPNAHHREIA 171
Y + + VVT + +M + + LW++ L + H I+
Sbjct: 109 -------YTELNSSIMLCAVVTEDYQLMAVSIHTPSPSVLWKSQLVAPQYWSELTHASIS 161
Query: 172 ISISNYTLKHGDTGLVIVGGRM------EMQPHTIMDP---------FEEIGLAEKNAEQ 216
I + D G++ V ++ EM ++ D F + G +
Sbjct: 162 IVPERNWAE--DVGMIAVAAKVVDASGVEMMMYSAFDAKTGARRWVYFSDGG---NEMDD 216
Query: 217 HRRSASEKEASENSGTVDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYK 276
R A ++G +LR E TD QH+Y
Sbjct: 217 VVREADNNGTETDNGLTELRLL----------------DREGAAKGGTD------QHDYL 254
Query: 277 LDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHK-RKILKKVVGKS 335
L E FRESV+ MPH + D L F R K ++ ++ GK+
Sbjct: 255 LG-------RKYEQPWTTFRESVMASMPHRYAHVWDAQLYPHVFYRAKAKRKVRSSRGKA 307
Query: 336 ---TSYPFHKPEEHHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWV 392
T++ ++ H T N+ K + A S + Y P N T+
Sbjct: 308 ERRTTFQYNDRVVHM-----ETDDEGNMGDKLSALALSLRQSTQA-YPPK--NATRRRLR 359
Query: 393 P-NVVVAHQKEGIEAVHLASGRTVCKLH-LQEGG-LHADINGDGVLDHVQAVGGNGAEQT 449
P NV V H + G+E VH+ +G TV + L+ GG + DIN D VL+ V G ++
Sbjct: 360 PSNVFVFHGEHGVEVVHMYTGSTVTSVMPLRAGGTCYDDINDDLVLESVSTQIG---PRS 416
Query: 450 VVSGS--MEVLRPCWAVATSGVPVR-EQLFNASICHHSPFNLFPHGEFSRNF-------- 498
+V +++ C + +G P ++LFNA++C LF + + +F
Sbjct: 417 IVYAKHGIDLAYDCLGIIEAGAPSSADELFNATVCDTQ--GLFGNLDLIHHFVDGDIRGE 474
Query: 499 ------------GRTSDVASLEVATPILIPRSDGHRHRKGSHGD--VVFLTNRGEVTAYS 544
G + V+S+ ATP LI + + + F+ + G VT
Sbjct: 475 EAPRALNTLELLGSRNVVSSMTRATPPLILQVQQNMGAGLQQVERYAAFMIDTGLVTCID 534
Query: 545 PGLHGHDAIWQWQLLTDATWSNLPSPS-GMTEASTV----------VPTLKAFSLRVHDN 593
P +W+ Q T A ++ PSPS M EA P + +SL +
Sbjct: 535 PSR--RRVLWRAQ--TGARFA--PSPSDDMAEARKSRYHVNRDVKPFPQMVPYSLMQKNR 588
Query: 594 Q-----------------QMILAGGDQEAVVI-SPGGSILTSIDLPAPPTHALVCEDFSN 635
+ +LA GD E VI + G + +SI L PP ++ DF+
Sbjct: 589 ETDVTFVGGGKESFRRVDTYVLAVGDAELSVIKTKNGKVTSSISLTEPPVAPVLVVDFNG 648
Query: 636 DGLTDVILMTSNGVYGFV 653
DG D+I+++ ++GFV
Sbjct: 649 DGTNDLIIVSKYHIFGFV 666
>gi|157874257|ref|XP_001685616.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|5852121|emb|CAB55366.1| hypothetical protein L1648.04 [Leishmania major]
gi|68128688|emb|CAJ08820.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 727
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 169/738 (22%), Positives = 282/738 (38%), Gaps = 157/738 (21%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPI---KFDADRLPPPIVADLN 57
MR RD+ I+ ++ EG +F+ + PI + +P PI+ D +
Sbjct: 1 MRIRDIVIIGVAFVCSIICWSQEGSLNFQRGFM-----IPILDNELTEVIMPKPILVDPD 55
Query: 58 GDGRKEVLVATHDAKIQVLEPHA--RRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMA 115
G G + +L +T + + + RR+DE F AE+++ + I +G
Sbjct: 56 GSGSRVLLSSTTVGYLSIYNTYYSRRRIDESFVRLNPSAELNVYTPIVGIGAG------- 108
Query: 116 TGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNL---NKLWEANL-QEDFPPNAHHREIA 171
Y + + VVT + ++ + LW+ L + H I+
Sbjct: 109 -------YTELNSSIMLCAVVTEDYQLIAVSIHTPSPTVLWKTQLVGPRYWSELTHASIS 161
Query: 172 ISISNYTLKHGDTGLVIVGGRM------EMQPHTIMDP--------FEEIGLAEKNAEQH 217
I + D G++ V ++ EM ++ D + G E N
Sbjct: 162 IVPERNWAE--DVGMIAVAAKVVDTSGVEMMMYSAFDAKTGARRWAYFSDGGNEMN--DV 217
Query: 218 RRSASEKEASENSGTVDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNYKL 277
R A+ +G DLR R G A I QH+Y L
Sbjct: 218 VREANNNGTQTENGVTDLR------LLDRDGA----------------AKGGINQHDYLL 255
Query: 278 DVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHK-RKILKKVVGKS- 335
E FRESV+ MPH + D L F R K + + GK+
Sbjct: 256 G-------RKYEQPWTTFRESVMASMPHRYAHVWDAQLYPHVFYRAKAHRKARSSRGKAE 308
Query: 336 --TSYPFHKPEEHHPP--GKDSTKKISNLIG--KAATYAGSAKSKKPVNYIPTITNYTQL 389
++ ++ H G + KK+S L + +T A AK+ + P+
Sbjct: 309 RRATFRYNDRVVHMETDDGGNLGKKLSALALSLRQSTQAYPAKNATRRHRRPS------- 361
Query: 390 WWVPNVVVAHQKEGIEAVHLASGRTVCKLH-LQEGG-LHADINGDGVLDHVQAVGGNGAE 447
NV V H ++G+E VH+ +G TV + L+ GG + DINGD VL+ V G +
Sbjct: 362 ----NVFVFHGEQGVEVVHMYTGSTVTSVMPLKAGGACYDDINGDLVLESVSTQIGPRSV 417
Query: 448 QTVVSGSMEVLRPCWAVATSGVPVR-EQLFNASICHHSPFNLFPHGEFSRNF-------- 498
G +++ C + +G P ++LFNA++C+ LF + + +F
Sbjct: 418 VYATRG-IDLKYDCLGIIEAGAPSSADELFNATVCNTQ--GLFGNLDLIHHFVDGDIRGE 474
Query: 499 ------------GRTSDVASLEVATPILIPRSDGHRHRKGSHGD--VVFLTNRGEVTAYS 544
G + V+S+ ATP LI + + + F+ + G VT
Sbjct: 475 EAPRALNTLELLGSRNVVSSMTRATPPLILQVQQNMGAGLQQVERYAAFMIDTGLVTCID 534
Query: 545 PGLHGHDAIWQWQLLTDATWSNLPSPS-GMTEASTV----------VPTLKAFSLRVHDN 593
P +W+ Q T A ++ PSPS M EA P + +SL +
Sbjct: 535 PSR--RRVLWRAQ--TGARFA--PSPSDDMAEARKSRYHVNRDVKPFPQMVPYSLMQKNR 588
Query: 594 Q-----------------QMILAGGDQE-AVVISPGGSILTSIDLPAPPTHALVCEDFSN 635
+ +LA GD E +++ + G + S+ L PP ++ DF+
Sbjct: 589 ETDMTFVGGGKESFRRVDTYVLAVGDAEFSIIKTKNGKVTRSVSLKEPPVAPVLVVDFNG 648
Query: 636 DGLTDVILMTSNGVYGFV 653
DG D+I+++ V+GFV
Sbjct: 649 DGTNDLIIVSKYHVFGFV 666
>gi|261335691|emb|CBH18685.1| FG-GAP repeat protein, putative [Trypanosoma brucei gambiense
DAL972]
Length = 703
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 153/724 (21%), Positives = 276/724 (38%), Gaps = 153/724 (21%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIK-FDADR--LPPPIVADLN 57
MR RD+ I+ ++ F E + + + E PI ++ R +P P+V D
Sbjct: 1 MRLRDVFIVGVAIIFCIFGWHQEDGLAVWKGF-----EIPITDLESQRYMIPKPVVLDPM 55
Query: 58 GDGRKEVLVATHDAKIQVLEPHARR-VDEGFSEARVLAEVSLLPDKIRIASGRRAVAMAT 116
GDGR ++ T +++ H+ E ++ + + S I +GR AV
Sbjct: 56 GDGRPVLIATTSYGSLEMFRTHSTSGAAETYATPVSMYQRSFFSRITAIGAGRLAVGE-- 113
Query: 117 GVIDRTYRQGQPLKQVLVVVTSGWSVMCFD-HNLNKLWEANLQEDFPPNAHHREIAISIS 175
+ VVTS + + H+ +++W + + H ++S+
Sbjct: 114 -------------DYTIFVVTSDFLLYRLSPHDFSEVWSVPIHNVLSESFH---TSVSVL 157
Query: 176 NYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDL 235
+ + GD G V++ ++ HT +
Sbjct: 158 SERIYEGDEGTVVIATQVPGPNHTKL---------------------------------- 183
Query: 236 RHFAFYAFAGRSGLLRWSRKNENIEAQ------PTDASQLIPQHNYKLDVHALNSRHPGE 289
F AF G G LRW R + E+ P DA + V ++S+
Sbjct: 184 --MLFAAFNGADGKLRW-RYTSDAESSVREVLDPEDAVGVGGSSPSGASV-VVSSQATES 239
Query: 290 FECRE-----FRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPE 344
F E +RE+V ++PH + D L+ F K + K ST +K
Sbjct: 240 FRLYEKPWTFYREAVSTLLPHRYSHPWDECLRAHVFFHTKNRKKTKAQAGSTVVVKYKDR 299
Query: 345 EHHPPGKDSTKKIS--NLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKE 402
+D + L+ K S K+ + NV+ H +
Sbjct: 300 FVRMNSEDYGELAERLGLVSKPQKKGNSHGEKERKS--------------ANVLAFHGEH 345
Query: 403 GIEAVHLASGRTVCKLH-LQEGGLHA-DINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRP 460
GIE VHL +G + ++ L+ G++ DIN D +D V + G E+ ++V
Sbjct: 346 GIEVVHLYTGSLITRVAPLKSAGVYYHDINDDFEIDAVSTLIGRRMEKHS-HFDVDVTLD 404
Query: 461 CWAVATSGVPVRE-QLFNASICHHSPF-------NLFPHGEFSRNFGRTSDVASLEVA-- 510
C V ++G P + LF SIC+ + F G+ +R G +++LE+
Sbjct: 405 CLGVISTGAPAADHSLFQTSICNTEGIFGRLELIHRFIDGD-TRGEGTPEVLSALELVGS 463
Query: 511 -----------TPILIPRSDGHRHRKGSHGDV----VFLTNRGEVTAYSPGLHGHDAIWQ 555
TP+++ H R V VF+T+ G VT P +W+
Sbjct: 464 HNTLSHTTKSVTPLVVQL---HTLRGRGLFQVERLAVFMTDSGLVTCVDPSRR--RVVWR 518
Query: 556 -------WQLLTDATWSNLPSPSGMTE---ASTVVPTLKAFSL-RVHDN----------- 593
W+L ++ + S TE + P L +++ +V+++
Sbjct: 519 SQTESSFWRLRSEREADVGEAASSETEHKQRTFPFPHLASYNFYQVNEDTVGHVGGPGRY 578
Query: 594 ---QQMILAGGDQEAVVISP-GGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGV 649
I+A G+++ ++S G ++ ++L P ++ +DF+ DG+ D+I++T G+
Sbjct: 579 LRVDPYIVAVGERKLTILSSRTGRVMRVVELEEPAVAPVIVQDFNGDGINDIIVVTEGGI 638
Query: 650 YGFV 653
YGFV
Sbjct: 639 YGFV 642
>gi|74026204|ref|XP_829668.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70835054|gb|EAN80556.1| FG-GAP repeat protein, putative [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 703
Score = 82.8 bits (203), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 152/724 (20%), Positives = 277/724 (38%), Gaps = 153/724 (21%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIK-FDADR--LPPPIVADLN 57
MR RD+ I+ ++ F E + + + E PI ++ R +P P+V D
Sbjct: 1 MRLRDVFIVGVAIIFCIFGWHQEDGLAVWKGF-----EIPITDLESQRYMIPKPVVLDPM 55
Query: 58 GDGRKEVLVATHDAKIQVLEPHARR-VDEGFSEARVLAEVSLLPDKIRIASGRRAVAMAT 116
GDGR ++ T +++ + H+ E ++ + + S I +GR AV
Sbjct: 56 GDGRPVLIATTSYGSLEMFKTHSTSGAAETYATPVSMYQRSFFSRITAIGAGRLAVGEDY 115
Query: 117 GVIDRTYRQGQPLKQVLVVVTSGWSVMCFD-HNLNKLWEANLQEDFPPNAHHREIAISIS 175
+ VVTS + + H+ +++W + + H ++S+
Sbjct: 116 AIF---------------VVTSDFLLYRLSPHDFSEVWSVPIHNVLSESFH---TSVSVL 157
Query: 176 NYTLKHGDTGLVIVGGRMEMQPHTIMDPFEEIGLAEKNAEQHRRSASEKEASENSGTVDL 235
+ + GD G V++ ++ HT +
Sbjct: 158 SERIYEGDEGTVVIATQVPGPNHTKL---------------------------------- 183
Query: 236 RHFAFYAFAGRSGLLRWSRKNENIEAQ------PTDASQLIPQHNYKLDVHALNSRHPGE 289
F AF G G LRW R + E+ P DA + V ++S+
Sbjct: 184 --MLFAAFNGADGKLRW-RYTSDAESSVREVLDPEDAVGVGGSSPSGASV-VVSSQATES 239
Query: 290 FECRE-----FRESVLGVMPHHWDRREDTLLKLSHFRRHKRKILKKVVGKSTSYPFHKPE 344
F E +RE+V ++PH + D L+ F K + K +T +K
Sbjct: 240 FRLYEKPWTFYREAVSTLLPHRYSHPWDECLRPHVFFHTKNRKKTKAQAGNTVVVKYKDR 299
Query: 345 EHHPPGKDSTKKIS--NLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKE 402
+D + L+ K S K+ + NV+ H +
Sbjct: 300 FVRMNSEDYGELAERLGLVSKPQKKGSSHGEKERKS--------------ANVLAFHGEH 345
Query: 403 GIEAVHLASGRTVCKLH-LQEGGLHA-DINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRP 460
GIE VHL +G + ++ L+ G++ DIN D +D V + G E+ ++V
Sbjct: 346 GIEVVHLYTGNLITRVAPLKSAGVYYHDINDDFEIDAVSTLIGRRMEKHS-HFDVDVTLD 404
Query: 461 CWAVATSGVPVRE-QLFNASICHHSPF-------NLFPHGEFSRNFGRTSDVASLEVA-- 510
C V ++G P + LF SIC+ + F G+ +R G +++LE+
Sbjct: 405 CLGVISTGAPAADHSLFQTSICNTEGIFGRLELIHRFIDGD-TRGEGTPEVLSALELVGS 463
Query: 511 -----------TPILIPRSDGHRHRKGSHGDV----VFLTNRGEVTAYSPGLHGHDAIWQ 555
TP+++ H R V VF+T+ G VT P +W+
Sbjct: 464 HNTLSHTTKSVTPLVVQL---HTLRGRGLFQVERLAVFMTDSGLVTCVDPSRR--RVVWR 518
Query: 556 -------WQLLTDATWSNLPSPSGMTE---ASTVVPTLKAFSL-RVHDN----------- 593
W+L ++ + S TE + P L +++ +V+++
Sbjct: 519 SQTESSFWRLRSEREADVGEAASSETEHKQRTFPFPHLASYNFYQVNEDTVGHVGGPGRY 578
Query: 594 ---QQMILAGGDQEAVVISP-GGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGV 649
I+A G+++ ++S G ++ ++L P ++ +DF+ DG+ D+I++T G+
Sbjct: 579 LRVDPYIVAVGERKLTILSSRTGRVMRVVELEEPAVAPVIVQDFNGDGINDIIVVTEGGI 638
Query: 650 YGFV 653
YGFV
Sbjct: 639 YGFV 642
>gi|146096931|ref|XP_001467982.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398021114|ref|XP_003863720.1| hypothetical protein, conserved [Leishmania donovani]
gi|134072348|emb|CAM71055.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322501953|emb|CBZ37036.1| hypothetical protein, conserved [Leishmania donovani]
Length = 727
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 163/742 (21%), Positives = 285/742 (38%), Gaps = 165/742 (22%)
Query: 1 MRKRDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPI---KFDADRLPPPIVADLN 57
MR RD+ I+ ++ EG +F+ + PI + +P PI+ D +
Sbjct: 1 MRIRDIVIIGVAFVCSIICWSQEGSLNFQRGFM-----IPILDNELTEVIMPKPILVDPD 55
Query: 58 GDGRKEVLVATHDAKIQVLEPHA--RRVDEGFSEARVLAEVSLLPDKIRIASGRRAVAMA 115
G G + +L +T + + + RR+D+ F AE+++ + I +G
Sbjct: 56 GSGSRVLLSSTSVGYLNIYNTYYARRRIDDSFVRLNPSAELNVYTPIVGIGAG------- 108
Query: 116 TGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNL---NKLWEANLQEDFPPNAHHREIAI 172
Y + + VVT + ++ + + LW++ L P
Sbjct: 109 -------YTELNSSIMLCAVVTEDYQLIAVSIHTPSPSVLWKSQL---VAPRYWSELTHA 158
Query: 173 SISNYTLKH--GDTGLVIVGGRM------EMQPHTIMDP---------FEEIGLAEKNAE 215
SIS ++ D G++ V ++ EM ++ D F + G +
Sbjct: 159 SISIVPERNWAEDVGMIAVAAKVVDASGVEMMMYSAFDAKTGARRWAYFSDGG---NEMD 215
Query: 216 QHRRSASEKEASENSGTVDLRHFAFYAFAGRSGLLRWSRKNENIEAQPTDASQLIPQHNY 275
R A+ ++G +LR R G A I QH+Y
Sbjct: 216 DVVREANNNGTQTDNGVTELR------LLDREGA----------------AKGGINQHDY 253
Query: 276 KLDVHALNSRHPGEFECREFRESVLGVMPHHWDRREDTLLKLSHFRRHK-RKILKKVVGK 334
L E FRESV+ MPH + D L F R K ++ + GK
Sbjct: 254 LLG-------RKYEQPWTTFRESVMASMPHRYAHVWDAQLYPQVFYRAKGKRKARSSRGK 306
Query: 335 S---TSYPFHKPEEHHPP--GKDSTKKISNLIG--KAATYAGSAKSKKPVNYIPTITNYT 387
+ ++ ++ H G + K+S L + +T+A AK+ P+
Sbjct: 307 AERRATFRYNDRVVHMETDDGGNLGDKLSALALSLRQSTHAYPAKNATRRRRRPS----- 361
Query: 388 QLWWVPNVVVAHQKEGIEAVHLASGRTVCK-LHLQEGG-LHADINGDGVLDHVQAVGGNG 445
NV V H ++G+E VH+ +G TV + L+ GG + DIN D VL+ V G
Sbjct: 362 ------NVFVFHGEQGVEVVHMYTGSTVTSVMPLKAGGTCYDDINDDLVLESVSTQIG-- 413
Query: 446 AEQTVVSGS--MEVLRPCWAVATSGVPVR-EQLFNASICHHSPFNLFPHGEFSRNF---- 498
++VV +++ C + +G P ++LFNA++C+ LF + + +F
Sbjct: 414 -PRSVVYAKRGIDLTYDCLGIIEAGAPSSADELFNATVCNTQ--GLFGNLDLIHHFVDGD 470
Query: 499 ----------------GRTSDVASLEVATPILIPRSDGHRHRKGSHGD--VVFLTNRGEV 540
G + V+S+ ATP LI + + + F+ + G V
Sbjct: 471 IRGEEAPRALNTLELLGSRNVVSSMTRATPPLILQVQQNMGAGLQQVERYAAFMIDTGLV 530
Query: 541 TAYSPGLHGHDAIWQWQLLTDATWSNLPSPS-GMTEASTV----------VPTLKAFSLR 589
T P +W+ Q T A ++ PSPS M E P + +SL
Sbjct: 531 TCIDPSR--RRVLWRAQ--TGARFA--PSPSDDMAEIRKSRYHVNRDVKPFPQMVPYSLM 584
Query: 590 VHDNQ-----------------QMILAGGDQE-AVVISPGGSILTSIDLPAPPTHALVCE 631
+ + +LA GD E +++ + G + S+ L PP ++
Sbjct: 585 QKNRETDVTFVGGGKEYFRRVDTYVLAVGDAEFSIIKTKNGKVTRSVSLTEPPVAPVLVV 644
Query: 632 DFSNDGLTDVILMTSNGVYGFV 653
DF+ DG D+I+++ ++G+V
Sbjct: 645 DFNGDGTNDLIIVSKYHIFGYV 666
>gi|154343447|ref|XP_001567669.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134065001|emb|CAM43112.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 727
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 103/420 (24%), Positives = 176/420 (41%), Gaps = 80/420 (19%)
Query: 295 FRESVLGVMPHHWDRREDTLLKLSHFRRHK-RKILKKVVGKS---TSYPFHKPEEHHPPG 350
FRESV+ MPH + D L F R K ++ + GK+ ++ ++ H
Sbjct: 266 FRESVMAAMPHRYAHVWDAQLYPHVFYRAKAKRKARPSRGKAKRRATFRYNDQVVHM--- 322
Query: 351 KDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIEAVHLA 410
+T+ NL K + S K + T Q + NV V H ++G+E VH+
Sbjct: 323 --ATEDDGNLGDKLSALRLSLKQPTQTSLAENATR--QRRRLSNVFVFHGEQGVEVVHMY 378
Query: 411 SGRTVCKLH--LQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSG-SMEVLRPCWAVATS 467
+G V + +G + DIN D VL+ + G TV + +++ C + +
Sbjct: 379 TGNIVTNVMPLRADGTYYDDINDDLVLESLST--QIGPRSTVFAKRGIDLSYDCLGMIET 436
Query: 468 GVPVR-EQLFNASICHHSPFNLFPHGEFSRNF--------------------GRTSDVAS 506
GVP+ ++LFNA++C+ LF + + +F G S ++
Sbjct: 437 GVPLSADELFNATVCNTQ--GLFGNLDLIHHFVDGDIRGEEAPRALNTLELLGSRSVASA 494
Query: 507 LEVATPILIPRSDGHRHRKGSHGDV----VFLTNRGEVTAYSPGLHGHDAIWQWQLLTDA 562
+ ATP LI + ++ G V F+ + G VT P +W+ Q T
Sbjct: 495 MTRATPPLILQV--QQNMGGGLQQVERYAAFMIDTGLVTCIDPSRR--RVLWRAQ--TGG 548
Query: 563 TWSNLPSPSGMTEASTV----------VPTLKAFSLRVHDNQQM---------------- 596
++ PS M EA + P + +SL + N++
Sbjct: 549 RFAPTPS-DDMAEARKIRYQVNSDVRPFPQMVPYSL-LQKNRETDVTFVGGGGEPFRRVD 606
Query: 597 --ILAGGDQE-AVVISPGGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNGVYGFV 653
+LA GD E +++ + G + SI LP PP ++ DF+ DG D+I+++ ++GFV
Sbjct: 607 TYVLAMGDTEFSIIKTKNGKVTRSISLPEPPVAPVLVVDFNGDGTNDLIVVSKYHIFGFV 666
>gi|414885126|tpg|DAA61140.1| TPA: hypothetical protein ZEAMMB73_444374 [Zea mays]
Length = 486
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/70 (54%), Positives = 53/70 (75%), Gaps = 2/70 (2%)
Query: 540 VTAYSPGLHGHDAIWQWQLLTDATWSNLPSPSGMTEASTVVPTLKAFSLRVHDNQQMILA 599
VT+YS L G++A+W+ Q+ + TWS LPSPSG+ E VVPTLKAFS+ +D +++ +
Sbjct: 411 VTSYSSRLLGYNAVWR-QVSSRMTWSKLPSPSGLME-KIVVPTLKAFSVCAYDPKEVTIC 468
Query: 600 GGDQEAVVIS 609
GGDQEAVV+S
Sbjct: 469 GGDQEAVVLS 478
>gi|340375802|ref|XP_003386423.1| PREDICTED: hypothetical protein LOC100636369 [Amphimedon
queenslandica]
Length = 617
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 95/200 (47%), Gaps = 22/200 (11%)
Query: 4 RDLAILMLSAFAIFFSLQHEGDFSFREAWFHLSEE-----------YPIKFDADRLPPPI 52
+++ L +S + F L+ + F+ +EE +P++ ++LP PI
Sbjct: 13 QEILFLFISCSVLAFLLRSQDSLEFKLVLNVTTEERAARENYANQKFPLR--NEKLPLPI 70
Query: 53 VADLNGDGRKEVLVATHDAKIQVLEPHARRVDEGFSEARVLAEVSLLPDKIRIASGRRAV 112
V DL DG+ ++LV+T ++L ++R + + + + ++ V
Sbjct: 71 VTDLESDGQTDILVSTD----KILNVYSRHYTPSYLGHHGIRHSN---KNVSLSKTSCVV 123
Query: 113 AMATGVIDRTYRQGQPLKQVLVVVTSGWSVMCFDHNLNKLWEANLQEDFPPNAHHREIAI 172
AM TG + Q KQV+ V+ S W++ C++HNL +W L E P + E ++
Sbjct: 124 AMTTGFLQPYQSVVQVRKQVIAVLYSDWTLSCYNHNLKLMWSQKLNEKEIPISS--EASL 181
Query: 173 SISNYTLKHGDTGLVIVGGR 192
+S ++K G+++VGGR
Sbjct: 182 LVSAISIKKSPAGVILVGGR 201
>gi|342186606|emb|CCC96093.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 483
Score = 72.4 bits (176), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 78/314 (24%), Positives = 134/314 (42%), Gaps = 62/314 (19%)
Query: 394 NVVVAHQKEGIEAVHLASGRTVCKLH-LQEGGLHA-DINGDGVLDHVQAVGGNGAEQTVV 451
NV+ H + GIE +HL +G + +L L+ ++ DIN D ++ + + G E
Sbjct: 117 NVLAYHGEHGIEVIHLYTGNLITRLAPLKSKNIYYHDINDDFQIESISPLIGRRMESHA- 175
Query: 452 SGSMEVLRPCWAVATSGVPVREQ-LFNASICHHSPFNLFPHGEFSRNF--GRTSDVASLE 508
+E+ C V ++G+PV LFNASIC+ LF + R+F G D + E
Sbjct: 176 KFDIELAYDCLGVISTGLPVAHHPLFNASICNTE--GLFGRLDLVRDFVDGDMRDGGNTE 233
Query: 509 VATPILIPRSDGHRH--RKGSHGDV---------------------VFLTNRGEVTAYSP 545
V + + + G R+ K +H V VF+T+ G VT P
Sbjct: 234 VLSVLELI---GSRNTLSKTTHSTVPLVVQLHTVKGKDLVQIERYAVFITDSGLVTCVDP 290
Query: 546 GLHGHDAIWQWQ------LLTDATWSN----LPSPSGMTEASTVVPTLKAFSL------- 588
H W+ Q LL D ++ + + + P L ++
Sbjct: 291 SRH--RVAWRSQTESSFYLLRDEQEADAEAGINTKMERNHLAVPFPHLAQYNFYQKNEDS 348
Query: 589 --------RVHDNQQMILAGGDQEAVVISP-GGSILTSIDLPAPPTHALVCEDFSNDGLT 639
R I+A G+++ +S G ++ ++L P ++ +DF+ DG+
Sbjct: 349 VGYVGGIGRYLRTDPYIIAVGERKMTFLSSRTGRVMRVVELEEPAVAPVIVQDFNGDGIN 408
Query: 640 DVILMTSNGVYGFV 653
D+I++T+ GVYG+V
Sbjct: 409 DIIVVTTGGVYGYV 422
>gi|340059814|emb|CCC54210.1| putative intergrin alpha chain protein [Trypanosoma vivax Y486]
Length = 704
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 113/516 (21%), Positives = 199/516 (38%), Gaps = 100/516 (19%)
Query: 240 FYAFAGRSGLLRW---SRKNENIEAQPTDASQLIPQHNYKLDVHALNSRHPGEFECRE-- 294
F AF G G LRW + + ++ + + + D ++R F E
Sbjct: 188 FAAFNGADGELRWRYTTDADSSVREVLSPSVSTAAEEASSGDSTIASARTTDPFRLHEKP 247
Query: 295 ---FRESVLGVMPHH----WDRREDTLLKLSHFRRHKRKIL--KKVVGKSTSYPFHKPEE 345
FRE+V+ ++PH WD R + R KR K+VV K EE
Sbjct: 248 WSFFREAVVTLLPHRYAHPWDERIRPHVFFHTKNRKKRAAHSGKQVVVKYKDRFIRMNEE 307
Query: 346 HHPPGKDSTKKISNLIGKAATYAGSAKSKKPVNYIPTITNYTQLWWVPNVVVAHQKEGIE 405
++ +G+ + + K +P NV+ H G+E
Sbjct: 308 NYGE-----------LGEKLGLLNADEKHKDHTALPHQRGV-------NVLAFHGPHGVE 349
Query: 406 AVHLASGRTVCKLH--LQEGGLHADINGDGVLDHVQAVGGNGAEQTVVSGSMEVLRPCWA 463
+HL +G + +L G L+ D+ D +D V G Q S +E+ C
Sbjct: 350 VLHLYTGNLITRLAPLKSTGVLYHDLGDDFQIDVVGTRIGPRM-QVHSSHGLEITDECMG 408
Query: 464 VATSGVPV-REQLFNASICHHSPF-------NLFPHGEFSRNFGRTSDVASLEV------ 509
+G+P+ E+LF+ SIC+ F F G+ R G +S + +LE+
Sbjct: 409 TIHTGIPLAEEKLFSTSICNTEGFLGRLDLIRDFVGGDI-RGEGTSSVIDALELMGSQNV 467
Query: 510 -------ATPILIPRSDGHRHRKGSHG------DVVFLTNRGEVTAYSPGLHGHDAIWQW 556
TP+++ H G VF+ + G +T P G +W+
Sbjct: 468 LSKTTRSVTPLVV-----QLHTVKGRGLFQVERYAVFMIDSGLITCVDPS-RGR-VLWRS 520
Query: 557 QLLTDATWSNLPSP-------SGMTEASTV-----VPTLKAFSL---------------R 589
Q TDA++ N+ G++E P L ++ R
Sbjct: 521 Q--TDASFENVGETHDAESVMVGLSEMEVKYRTRPFPHLAPYNFEQKNEDSVGHVGGVGR 578
Query: 590 VHDNQQMILAGGDQEAVVISP-GGSILTSIDLPAPPTHALVCEDFSNDGLTDVILMTSNG 648
+ I+A G+ ++S G ++ + L P ++ +DF+ DG+ D+I++T G
Sbjct: 579 YRNTDPYIIAVGESCLTLLSARTGRVMRVVLLDEVPVAPVIVQDFNGDGINDIIVVTEGG 638
Query: 649 VYGFVQTRQPGALFFSTLVGCLIVVMGVIFVTQHLN 684
+YG+V Q + + L+ +I ++ V+ + +N
Sbjct: 639 IYGYVMGAQTSSDTITALMILMIGLLAVLVAVREMN 674
>gi|390369515|ref|XP_003731654.1| PREDICTED: uncharacterized protein LOC100890980, partial
[Strongylocentrotus purpuratus]
Length = 184
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 70/162 (43%), Gaps = 27/162 (16%)
Query: 415 VCKLHLQEGGLHADINGDGVLDHVQAV-GGNGAEQTVVSGSMEVLRPCWAVATSGVPVRE 473
+ +L L +ADI+ DGV+D +A+ +G+E C AV +G P
Sbjct: 1 MTRLDLSPDATYADIDKDGVVDQAKALFTDDGSEGN-----------CQAVVKTGHPPHS 49
Query: 474 QLFNASICHHSPFNLFPHGEFSRNFGRTSDVASLEVATPILIPRSDGHRHRKGSH----- 528
+L+N SIC+ S ++ G + + E++ LI +S R SH
Sbjct: 50 ELYNGSICYPSSLWAALSYPWAYASGSSEIKENQELSLRPLIVKSVAKRRGIISHLLGLS 109
Query: 529 -----GDVVFLTNRGEVTAYSPGLHGHDAIWQWQLLTDATWS 565
D +F + G+VT+Y P + WQ+ T A WS
Sbjct: 110 MSKAGMDTIFTVSTGQVTSYGP-----QGQFNWQVSTSALWS 146
>gi|345004742|ref|YP_004807595.1| FG-GAP repeat-containing protein [halophilic archaeon DL31]
gi|344320368|gb|AEN05222.1| FG-GAP repeat protein [halophilic archaeon DL31]
Length = 426
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/71 (28%), Positives = 37/71 (52%), Gaps = 8/71 (11%)
Query: 7 AILMLSAFAIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDGRKEVLV 66
A+ + + F+L EG+ +R ++ D PPP+ DL+GDG +E++V
Sbjct: 311 AVYVTNKSGTVFALDEEGETVWRRE--------VVEEDTQMTPPPVAGDLDGDGSQELVV 362
Query: 67 ATHDAKIQVLE 77
A + ++ VL+
Sbjct: 363 AANTGEVTVLD 373
>gi|354611490|ref|ZP_09029446.1| FG-GAP repeat protein [Halobacterium sp. DL1]
gi|353196310|gb|EHB61812.1| FG-GAP repeat protein [Halobacterium sp. DL1]
Length = 378
Score = 40.0 bits (92), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 14/40 (35%), Positives = 27/40 (67%)
Query: 44 DADRLPPPIVADLNGDGRKEVLVATHDAKIQVLEPHARRV 83
D +PPP++ D++GDG +E++ T+D ++V+ P +V
Sbjct: 297 DVQMMPPPVLGDVDGDGDRELVATTNDGMVKVVSPSDGQV 336
>gi|149920255|ref|ZP_01908726.1| cell surface protein [Plesiocystis pacifica SIR-1]
gi|149818842|gb|EDM78282.1| cell surface protein [Plesiocystis pacifica SIR-1]
Length = 538
Score = 39.7 bits (91), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 54/126 (42%), Gaps = 12/126 (9%)
Query: 15 AIFFSLQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDGRKEVLVATH--DAK 72
++ + H G ++ AW H + + P P V DL+GDG E+ V+ H DA
Sbjct: 84 SVLYVYDHAGALLWQTAWSHSASDSPEHGTVRMWPSAAVGDLDGDGDVEIAVSAHPDDAG 143
Query: 73 IQV-LEPHARRVDEGFSEARVLAEVSLLPDKIRIASG-------RRAVAMATGV--IDRT 122
+ V + H + G+ +A AEV + G ++A AT V +D T
Sbjct: 144 LNVAVYDHGGELLPGWPQAYADAEVRSIAAADVDGDGAHEILITKQASGPATNVFELDGT 203
Query: 123 YRQGQP 128
+ G P
Sbjct: 204 HASGWP 209
>gi|156400118|ref|XP_001638847.1| predicted protein [Nematostella vectensis]
gi|156225971|gb|EDO46784.1| predicted protein [Nematostella vectensis]
Length = 628
Score = 39.3 bits (90), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 31/58 (53%), Gaps = 6/58 (10%)
Query: 20 LQHEGDFSFREAWFHLSEEYPIKFDADRLPPPIVADLNGDGRKEVLVATHDAKIQVLE 77
L H+ +S +AW + PI + P++ D+NGD K+V V T D ++ V++
Sbjct: 13 LSHDCHYSLNQAWLSEAGAAPI------VSSPLIVDVNGDNIKDVAVTTFDGQVSVID 64
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.135 0.409
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,562,136,069
Number of Sequences: 23463169
Number of extensions: 506444641
Number of successful extensions: 1149468
Number of sequences better than 100.0: 84
Number of HSP's better than 100.0 without gapping: 67
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 1148991
Number of HSP's gapped (non-prelim): 204
length of query: 697
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 547
effective length of database: 8,839,720,017
effective search space: 4835326849299
effective search space used: 4835326849299
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)