BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 003314
         (831 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|359492622|ref|XP_002282350.2| PREDICTED: uncharacterized protein LOC100267859 [Vitis vinifera]
          Length = 1068

 Score = 1434 bits (3713), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 686/807 (85%), Positives = 753/807 (93%), Gaps = 2/807 (0%)

Query: 2   GSDKHSAGLLDTLRMERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEKL 61
           GSDK S GLL+TL+MERVRTILTH +PYPHEHSRHAIIAVVVGCLFFISSDNMHTLI+KL
Sbjct: 51  GSDKQSVGLLETLKMERVRTILTHRYPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIQKL 110

Query: 62  DNNIKWWSMYACLLGFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMGV 121
           DNNIKWWSMYACLLGFFYFFSSPFIGKTI PSYSNFSRWY+AWILVAA+YHLPSF SMGV
Sbjct: 111 DNNIKWWSMYACLLGFFYFFSSPFIGKTIKPSYSNFSRWYVAWILVAAIYHLPSFLSMGV 170

Query: 122 DLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVFC 181
           D+RMNLSLFLTI+++S+LFLLVFHI+FLGLWY+GLV+RVAGK+PEILTIIQNC V+S+ C
Sbjct: 171 DMRMNLSLFLTIYVSSILFLLVFHIMFLGLWYIGLVARVAGKKPEILTIIQNCAVLSIAC 230

Query: 182 CVFYSHCGNRAVLRHRPLERRNSSWFSLWKKEERNTWLAKFLRMNELKDQVCSSWFAPVG 241
           CVFYSHCGNRA+LR RP ERRNS WFS WKKEERNTWL+KF RMNELKDQVCSSWFAPVG
Sbjct: 231 CVFYSHCGNRAILRQRPFERRNSGWFSFWKKEERNTWLSKFTRMNELKDQVCSSWFAPVG 290

Query: 242 SASDYPLLSKWVIYGELGNDNG--GSSDEISPIYSLWATFIGLYIANYVVERSTGWALTH 299
           SASDYPLLSKWVIYGEL       GSSDEISPIYSLWATFIGLYIANYVVERS+GWALTH
Sbjct: 291 SASDYPLLSKWVIYGELACTGSCPGSSDEISPIYSLWATFIGLYIANYVVERSSGWALTH 350

Query: 300 PLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAM 359
           PLSV++YE++KKKQ+KP+FLDMVPWYSGTSADLFKT FDLLVSVTVFVGRFDMRMMQA+M
Sbjct: 351 PLSVKDYEELKKKQMKPDFLDMVPWYSGTSADLFKTAFDLLVSVTVFVGRFDMRMMQASM 410

Query: 360 NKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFT 419
           NK  +G  HGD+LYDH SEKEDLWFDFMADTGDGGNSSY+VARLLAQP IR+   DS   
Sbjct: 411 NKACDGVPHGDILYDHFSEKEDLWFDFMADTGDGGNSSYTVARLLAQPSIRLNTKDSFRV 470

Query: 420 LPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPEL 479
           LPRGD+LLIGGDLAYPNPSAFTYERRLF PFEYALQPPPWY+ +H+AVNKPEVP G+ EL
Sbjct: 471 LPRGDLLLIGGDLAYPNPSAFTYERRLFCPFEYALQPPPWYRVEHIAVNKPEVPCGLSEL 530

Query: 480 KQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLD 539
           KQY+GPQC++IPGNHDWFDGL+TFMR+ICHKSWLGGWFMPQKKSYFALQLPK WWVFGLD
Sbjct: 531 KQYEGPQCFVIPGNHDWFDGLHTFMRYICHKSWLGGWFMPQKKSYFALQLPKRWWVFGLD 590

Query: 540 LALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYL 599
           LALH DIDVYQF FF EL+K++VGE DSVIIMTHEPNWLLDWY+N+VSGKNV HLICDYL
Sbjct: 591 LALHADIDVYQFNFFVELIKDKVGENDSVIIMTHEPNWLLDWYWNDVSGKNVSHLICDYL 650

Query: 600 KGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYE 659
           KGRCKLR+AGD+HHYMRHS V SD PVYVQHLLVNGCGGAFLHPTHVFSNF + YG +Y+
Sbjct: 651 KGRCKLRMAGDLHHYMRHSSVSSDKPVYVQHLLVNGCGGAFLHPTHVFSNFNELYGASYK 710

Query: 660 SKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFS 719
           S+AAYPSFEDSSRIALGNILKFRKKNWQFDFIGGI+YFVLVFSMFPQC+L+HIL++DSFS
Sbjct: 711 SEAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIIYFVLVFSMFPQCKLDHILQDDSFS 770

Query: 720 GHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHL 779
           GHLRSFF T+W+AFMY+LEHSYVS AGA+LLL+ AI FVP KLSRKKR +IG+LHVSAHL
Sbjct: 771 GHLRSFFSTMWDAFMYMLEHSYVSLAGAMLLLMAAIIFVPPKLSRKKRVIIGILHVSAHL 830

Query: 780 AAALILMLLLELGVETCIQHKLLATSG 806
           AAAL+LMLLLELGVETCI+H+LLATSG
Sbjct: 831 AAALVLMLLLELGVETCIRHRLLATSG 857


>gi|255538398|ref|XP_002510264.1| hydrolase, putative [Ricinus communis]
 gi|223550965|gb|EEF52451.1| hydrolase, putative [Ricinus communis]
          Length = 1006

 Score = 1423 bits (3684), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 679/794 (85%), Positives = 744/794 (93%), Gaps = 4/794 (0%)

Query: 16  MERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEKLDNNIKWWSMYACLL 75
           MERVRTILTHT+PYPHEHSRHAIIAVVVGCLFFISSDNMHTL+EKLDNN+KWWSMYACLL
Sbjct: 1   MERVRTILTHTYPYPHEHSRHAIIAVVVGCLFFISSDNMHTLVEKLDNNVKWWSMYACLL 60

Query: 76  GFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMGVDLRMNLSLFLTIFL 135
           GFFYFFSSPF+ KTI PSYSNFSRWYIAWIL+AA+YHLPSFQSMG+DLRMNLSLFLTI++
Sbjct: 61  GFFYFFSSPFLEKTIKPSYSNFSRWYIAWILIAALYHLPSFQSMGLDLRMNLSLFLTIYV 120

Query: 136 ASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVFCCVFYSHCGNRAVLR 195
           +S+LFLLVFHIIF+GLWYVGLVSRVA K+PEILTI+QNC V+SV CCVFYSHCGNRA+LR
Sbjct: 121 SSILFLLVFHIIFVGLWYVGLVSRVAAKKPEILTILQNCAVLSVACCVFYSHCGNRAILR 180

Query: 196 HRPLERRNSSWFSLWKKEERNTWLAKFLRMNELKDQVCSSWFAPVGSASDYPLLSKWVIY 255
            RPL R+NSSWF+ WKKEERNTWLA  +RMNELKDQ CSSWFAPVGSASDYPLLSKWVIY
Sbjct: 181 DRPLARKNSSWFTFWKKEERNTWLANLIRMNELKDQFCSSWFAPVGSASDYPLLSKWVIY 240

Query: 256 GELG-NDNG--GSSDEISPIYSLWATFIGLYIANYVVERSTGWALTHPLSVEEYEKMKKK 312
           GELG N +G  GSSDEISPIYSLWATFIGLYIANYVVERSTGWAL+HPLSV+EYEK+K K
Sbjct: 241 GELGCNGSGCAGSSDEISPIYSLWATFIGLYIANYVVERSTGWALSHPLSVQEYEKLKAK 300

Query: 313 QLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLL 372
           Q+KP+FLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAM K ++GA+  DLL
Sbjct: 301 QMKPDFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMTKVEDGAEQRDLL 360

Query: 373 YDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDL 432
           YDH SEKEDLWFDFMADTGDGGNSSY+VARLLAQP I +TR +SV +LPRG +LLIGGDL
Sbjct: 361 YDHFSEKEDLWFDFMADTGDGGNSSYTVARLLAQPSI-LTRGESVRSLPRGKLLLIGGDL 419

Query: 433 AYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPG 492
           AYPNPSAFTYE+RLF PFEYALQPPPWYK++H+A NKPE+P GV ELKQYDGPQC+IIPG
Sbjct: 420 AYPNPSAFTYEKRLFCPFEYALQPPPWYKQEHIATNKPELPVGVSELKQYDGPQCFIIPG 479

Query: 493 NHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFK 552
           NHDWFDGL+TFMR+ICHKSWLGGWFMPQKKSYFALQLP  WWVFGLDLALH DIDVYQFK
Sbjct: 480 NHDWFDGLHTFMRYICHKSWLGGWFMPQKKSYFALQLPNRWWVFGLDLALHNDIDVYQFK 539

Query: 553 FFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKGRCKLRIAGDMH 612
           FF+EL+KE+VGE DSVIIMTHEPNWLLDWY++ VSGKNV HLIC YLKGRCKLRIAGD+H
Sbjct: 540 FFSELIKEKVGENDSVIIMTHEPNWLLDWYWDGVSGKNVSHLICTYLKGRCKLRIAGDLH 599

Query: 613 HYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSR 672
           HYMRHSYVPSDGPV+VQHLLVNGCGGAFLHPTHVFSNF++ YGT YE+KAAYPS EDSSR
Sbjct: 600 HYMRHSYVPSDGPVHVQHLLVNGCGGAFLHPTHVFSNFKELYGTKYETKAAYPSLEDSSR 659

Query: 673 IALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNA 732
           IALGNILKFRKKNWQFDFIGGI+YF+L FSMFPQC+LNHIL+ D+FSG LRSFFGT WN+
Sbjct: 660 IALGNILKFRKKNWQFDFIGGIIYFILSFSMFPQCKLNHILQADTFSGQLRSFFGTAWNS 719

Query: 733 FMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHLAAALILMLLLELG 792
           FMYVLEHSYVS AG ++LLIVAI FVP K+SRKK+A+IG+LHVSAHLA+ALILMLLLELG
Sbjct: 720 FMYVLEHSYVSLAGVVVLLIVAIAFVPPKVSRKKQAIIGILHVSAHLASALILMLLLELG 779

Query: 793 VETCIQHKLLATSG 806
           VE CI+H LLATSG
Sbjct: 780 VEMCIRHNLLATSG 793


>gi|356552184|ref|XP_003544449.1| PREDICTED: uncharacterized protein LOC100820584 [Glycine max]
          Length = 1021

 Score = 1414 bits (3659), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 677/808 (83%), Positives = 745/808 (92%), Gaps = 3/808 (0%)

Query: 1   MGSDKHSAGLLDTLRMERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEK 60
           MGS K SAG+LDTL+M+RVRTILTHT+PYPHEHSRHA+IAVVVGCLFFISSDN+HTL+EK
Sbjct: 1   MGSSKQSAGILDTLKMQRVRTILTHTYPYPHEHSRHAVIAVVVGCLFFISSDNIHTLVEK 60

Query: 61  LDNNIKWWSMYACLLGFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMG 120
           LDNN+KWWSMYACL GFFYFFSSPFIGKT  PSYSNFSRWYIAWILVAAVYHLPSFQSMG
Sbjct: 61  LDNNVKWWSMYACLFGFFYFFSSPFIGKTFKPSYSNFSRWYIAWILVAAVYHLPSFQSMG 120

Query: 121 VDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVF 180
           VD+RMNLSLFLTI+L+S+LFLLVFHIIFLGLWY+G VSRVAGKRPEILTI+QNC V+SV 
Sbjct: 121 VDMRMNLSLFLTIYLSSILFLLVFHIIFLGLWYIGFVSRVAGKRPEILTILQNCAVLSVA 180

Query: 181 CCVFYSHCGNRAVLRHRPLERRNSSWFSLWKKEERNTWLAKFLRMNELKDQVCSSWFAPV 240
           CCVFYSHCGNRA+LR RPL+RRNS+WFS WKKEERNTWLAKFLRMNELKDQVCSSWFAPV
Sbjct: 181 CCVFYSHCGNRAMLRERPLDRRNSNWFSFWKKEERNTWLAKFLRMNELKDQVCSSWFAPV 240

Query: 241 GSASDYPLLSKWVIYGELGNDNG--GSSDEISPIYSLWATFIGLYIANYVVERSTGWALT 298
           GSASDYPLLSKWVIYGE+  +    GSSDEISPIYSLWATFIGLYIANYVVERSTGWALT
Sbjct: 241 GSASDYPLLSKWVIYGEIACNGSCPGSSDEISPIYSLWATFIGLYIANYVVERSTGWALT 300

Query: 299 HPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAA 358
           HPLSV+EYEK+KKKQ+KP+FLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAA
Sbjct: 301 HPLSVKEYEKLKKKQMKPDFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAA 360

Query: 359 MNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVF 418
           M++  +G   GDLLYDH SEK+D WFDFMADTGDGGNSSY+VARLLA+P IR  +DDS  
Sbjct: 361 MSRVSDGNHQGDLLYDHFSEKDDFWFDFMADTGDGGNSSYAVARLLAKPFIRTLKDDSEL 420

Query: 419 TLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPE 478
           TLPRG++LLIGGDLAYPNPSAFTYERRLF PFEYALQPPPWYK + +AVNKPEVP G  +
Sbjct: 421 TLPRGNLLLIGGDLAYPNPSAFTYERRLFVPFEYALQPPPWYKAEQIAVNKPEVPFGA-Q 479

Query: 479 LKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGL 538
           LKQY+GPQC++IPGNHDWFDGL TFMR+ICH+SWLGGW MPQKKSYFALQLPK WWVFGL
Sbjct: 480 LKQYNGPQCFVIPGNHDWFDGLQTFMRYICHRSWLGGWLMPQKKSYFALQLPKRWWVFGL 539

Query: 539 DLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDY 598
           DLALH DIDVYQFKFF EL+ E+V E DSVII+THEPNWL DWY+N+V+GKN+ HLI DY
Sbjct: 540 DLALHGDIDVYQFKFFTELITEKVQEDDSVIIITHEPNWLTDWYWNDVTGKNISHLISDY 599

Query: 599 LKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTY 658
           L+GRCKLR+AGD+HHYMRHS+V SDGPV+V HLLVNGCGGAFLHPTHVFS F K    +Y
Sbjct: 600 LRGRCKLRMAGDLHHYMRHSHVKSDGPVHVHHLLVNGCGGAFLHPTHVFSKFNKLDEVSY 659

Query: 659 ESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSF 718
           E KAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGI+YFVLVFSMFPQC+LNHIL++D+F
Sbjct: 660 ECKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIIYFVLVFSMFPQCQLNHILQDDTF 719

Query: 719 SGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAH 778
           SGH+RSF GTVWN F+Y+L+HS VS  GA+LLLI A +FVP KLSRKKRA+IGVLHVSAH
Sbjct: 720 SGHIRSFLGTVWNGFIYILQHSCVSLVGAILLLIAAYSFVPPKLSRKKRAIIGVLHVSAH 779

Query: 779 LAAALILMLLLELGVETCIQHKLLATSG 806
           LAAALILMLLLE+G+E CIQHKLLATSG
Sbjct: 780 LAAALILMLLLEIGIEICIQHKLLATSG 807


>gi|356564208|ref|XP_003550348.1| PREDICTED: uncharacterized protein LOC100819940 [Glycine max]
          Length = 1021

 Score = 1409 bits (3647), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 675/808 (83%), Positives = 745/808 (92%), Gaps = 3/808 (0%)

Query: 1   MGSDKHSAGLLDTLRMERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEK 60
           MGS K SAG+LDTL+MERVRTILTHT+PYPHEHSRHA+IAVVVGCLFFISSDN+HTL+EK
Sbjct: 1   MGSSKQSAGILDTLKMERVRTILTHTYPYPHEHSRHAVIAVVVGCLFFISSDNIHTLVEK 60

Query: 61  LDNNIKWWSMYACLLGFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMG 120
           LD N+KWWSMYACL GFFYFFSSPFIGKT  PSYSNFSRWYIAWILVAAVYHLPSFQSMG
Sbjct: 61  LDKNVKWWSMYACLFGFFYFFSSPFIGKTFKPSYSNFSRWYIAWILVAAVYHLPSFQSMG 120

Query: 121 VDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVF 180
           VD+RMNLSLFLTI+L+S+LFLLVFHIIFLGLWY+G VSRVAGKRPEILTI+QNC V+SV 
Sbjct: 121 VDMRMNLSLFLTIYLSSILFLLVFHIIFLGLWYIGFVSRVAGKRPEILTILQNCAVLSVA 180

Query: 181 CCVFYSHCGNRAVLRHRPLERRNSSWFSLWKKEERNTWLAKFLRMNELKDQVCSSWFAPV 240
           CCVFYSHCGNRA+LR RPL+RRNS+WFS WKKEERNTWLAKFLRMNELKDQVCSSWFAPV
Sbjct: 181 CCVFYSHCGNRAMLRERPLDRRNSNWFSFWKKEERNTWLAKFLRMNELKDQVCSSWFAPV 240

Query: 241 GSASDYPLLSKWVIYGELGNDNG--GSSDEISPIYSLWATFIGLYIANYVVERSTGWALT 298
           GSASDYPLLSKWVIYGE+  +    GSSDEISPIYSLWATFIGLYIANYVVERSTGWALT
Sbjct: 241 GSASDYPLLSKWVIYGEIACNGSCPGSSDEISPIYSLWATFIGLYIANYVVERSTGWALT 300

Query: 299 HPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAA 358
           HPLSV+EYEK+KKKQ+KP+FLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAA
Sbjct: 301 HPLSVKEYEKLKKKQMKPDFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAA 360

Query: 359 MNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVF 418
           M++  +G    DLLYDH SEK+D WFDFMADTGDGGNSSY+VARLLA+P IR  +DDS  
Sbjct: 361 MSRVSDGNHQDDLLYDHFSEKDDFWFDFMADTGDGGNSSYAVARLLAKPFIRTLKDDSEL 420

Query: 419 TLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPE 478
           TLPRG++L+IGGDLAYPNPSAFTYERRLF PFEYALQPPPWYK + +AVNKPEVP G  +
Sbjct: 421 TLPRGNLLIIGGDLAYPNPSAFTYERRLFVPFEYALQPPPWYKAEQIAVNKPEVPFGA-Q 479

Query: 479 LKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGL 538
           LKQY+GPQC++IPGNHDWFDGL TFMR+ICH+SWLGGW MPQKKSYFALQLPK WWVFGL
Sbjct: 480 LKQYNGPQCFVIPGNHDWFDGLQTFMRYICHRSWLGGWLMPQKKSYFALQLPKRWWVFGL 539

Query: 539 DLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDY 598
           DLALH DIDVYQFKFF+EL+ E+V + DSVII+THEPNWL DWY+N+V+GKN+ HLI DY
Sbjct: 540 DLALHGDIDVYQFKFFSELITEKVQDDDSVIIITHEPNWLTDWYWNDVTGKNISHLISDY 599

Query: 599 LKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTY 658
           L+GRCKLR+AGD+HHYMRHS+V SDGPV++ HLLVNGCGGAFLHPTHVFS F K    +Y
Sbjct: 600 LRGRCKLRMAGDLHHYMRHSHVKSDGPVHIHHLLVNGCGGAFLHPTHVFSKFNKLDEVSY 659

Query: 659 ESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSF 718
           E KAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGI+YFVLVFSMFPQCELNHIL++D+F
Sbjct: 660 ECKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIIYFVLVFSMFPQCELNHILQDDTF 719

Query: 719 SGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAH 778
           SGH++SF GTVWN F+Y+L+HS VS AGA+LLLI A +FVP KLSRKKRA+IGVLHVSAH
Sbjct: 720 SGHIKSFLGTVWNGFIYILQHSCVSLAGAILLLIAAYSFVPPKLSRKKRAIIGVLHVSAH 779

Query: 779 LAAALILMLLLELGVETCIQHKLLATSG 806
           LAAALILMLLLE+GVE CIQHKLLATSG
Sbjct: 780 LAAALILMLLLEIGVEICIQHKLLATSG 807


>gi|302142362|emb|CBI19565.3| unnamed protein product [Vitis vinifera]
          Length = 1017

 Score = 1404 bits (3633), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 676/806 (83%), Positives = 741/806 (91%), Gaps = 15/806 (1%)

Query: 16  MERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEKLDNNIKWWSMYACLL 75
           MERVRTILTH +PYPHEHSRHAIIAVVVGCLFFISSDNMHTLI+KLDNNIKWWSMYACLL
Sbjct: 1   MERVRTILTHRYPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIQKLDNNIKWWSMYACLL 60

Query: 76  GFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMGVDLRMNLSLFLTIFL 135
           GFFYFFSSPFIGKTI PSYSNFSRWY+AWILVAA+YHLPSF SMGVD+RMNLSLFLTI++
Sbjct: 61  GFFYFFSSPFIGKTIKPSYSNFSRWYVAWILVAAIYHLPSFLSMGVDMRMNLSLFLTIYV 120

Query: 136 ASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVFCCVFYSHCGNRAVLR 195
           +S+LFLLVFHI+FLGLWY+GLV+RVAGK+PEILTIIQNC V+S+ CCVFYSHCGNRA+LR
Sbjct: 121 SSILFLLVFHIMFLGLWYIGLVARVAGKKPEILTIIQNCAVLSIACCVFYSHCGNRAILR 180

Query: 196 HRPLERRNSSWFSLWKKEERNTWLAKFLRMNELKDQVCSSWFAPVGSASDYPLLSKWVIY 255
            RP ERRNS WFS WKKEERNTWL+KF RMNELKDQVCSSWFAPVGSASDYPLLSKWVIY
Sbjct: 181 QRPFERRNSGWFSFWKKEERNTWLSKFTRMNELKDQVCSSWFAPVGSASDYPLLSKWVIY 240

Query: 256 GELGNDNG--GSSDEISPIYSLWATFIGLYIANYVVERSTGWALTHPLSVEEYEKMKKKQ 313
           GEL       GSSDEISPIYSLWATFIGLYIANYVVERS+GWALTHPLSV++YE++KKKQ
Sbjct: 241 GELACTGSCPGSSDEISPIYSLWATFIGLYIANYVVERSSGWALTHPLSVKDYEELKKKQ 300

Query: 314 LKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLY 373
           +KP+FLDMVPWYSGTSADLFKT FDLLVSVTVFVGRFDMRMMQA+MNK  +G  HGD+LY
Sbjct: 301 MKPDFLDMVPWYSGTSADLFKTAFDLLVSVTVFVGRFDMRMMQASMNKACDGVPHGDILY 360

Query: 374 DHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLA 433
           DH SEKEDLWFDFMADTGDGGNSSY+VARLLAQP IR+   DS   LPRGD+LLIGGDLA
Sbjct: 361 DHFSEKEDLWFDFMADTGDGGNSSYTVARLLAQPSIRLNTKDSFRVLPRGDLLLIGGDLA 420

Query: 434 YPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGN 493
           YPNPSAFTYERRLF PFEYALQPPPWY+ +H+AVNKPEVP G+ ELKQY+GPQC++IPGN
Sbjct: 421 YPNPSAFTYERRLFCPFEYALQPPPWYRVEHIAVNKPEVPCGLSELKQYEGPQCFVIPGN 480

Query: 494 HDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKF 553
           HDWFDGL+TFMR+ICHKSWLGGWFMPQKKSYFALQLPK WWVFGLDLALH DIDVYQF F
Sbjct: 481 HDWFDGLHTFMRYICHKSWLGGWFMPQKKSYFALQLPKRWWVFGLDLALHADIDVYQFNF 540

Query: 554 FAELVKEQ-------------VGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLK 600
           F EL+K++             VGE DSVIIMTHEPNWLLDWY+N+VSGKNV HLICDYLK
Sbjct: 541 FVELIKDKDLFLEYIEETMMNVGENDSVIIMTHEPNWLLDWYWNDVSGKNVSHLICDYLK 600

Query: 601 GRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYES 660
           GRCKLR+AGD+HHYMRHS V SD PVYVQHLLVNGCGGAFLHPTHVFSNF + YG +Y+S
Sbjct: 601 GRCKLRMAGDLHHYMRHSSVSSDKPVYVQHLLVNGCGGAFLHPTHVFSNFNELYGASYKS 660

Query: 661 KAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSG 720
           +AAYPSFEDSSRIALGNILKFRKKNWQFDFIGGI+YFVLVFSMFPQC+L+HIL++DSFSG
Sbjct: 661 EAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIIYFVLVFSMFPQCKLDHILQDDSFSG 720

Query: 721 HLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHLA 780
           HLRSFF T+W+AFMY+LEHSYVS AGA+LLL+ AI FVP KLSRKKR +IG+LHVSAHLA
Sbjct: 721 HLRSFFSTMWDAFMYMLEHSYVSLAGAMLLLMAAIIFVPPKLSRKKRVIIGILHVSAHLA 780

Query: 781 AALILMLLLELGVETCIQHKLLATSG 806
           AAL+LMLLLELGVETCI+H+LLATSG
Sbjct: 781 AALVLMLLLELGVETCIRHRLLATSG 806


>gi|449470047|ref|XP_004152730.1| PREDICTED: uncharacterized protein LOC101204257 [Cucumis sativus]
 gi|449496008|ref|XP_004160010.1| PREDICTED: uncharacterized LOC101204257 [Cucumis sativus]
          Length = 1025

 Score = 1389 bits (3594), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 671/808 (83%), Positives = 743/808 (91%), Gaps = 2/808 (0%)

Query: 1   MGSDKHSAGLLDTLRMERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEK 60
           M S+  SAGLLDT +M+RVRTI THT+PYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEK
Sbjct: 1   MVSENISAGLLDTFKMKRVRTIFTHTYPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEK 60

Query: 61  LDNNIKWWSMYACLLGFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMG 120
           LD NIKWWS+Y+CLLGFFYFFSSPFIGKTI PSYSNFSRWYIAWILVAAVYHLPSFQSMG
Sbjct: 61  LDQNIKWWSIYSCLLGFFYFFSSPFIGKTIKPSYSNFSRWYIAWILVAAVYHLPSFQSMG 120

Query: 121 VDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVF 180
           VD+RMNLS+F+TI+++S+LFL VFHI+F+GLWYVGLVSRVAGKRPEIL I QNC VIS+ 
Sbjct: 121 VDIRMNLSMFITIYISSILFLTVFHILFIGLWYVGLVSRVAGKRPEILAIFQNCAVISIA 180

Query: 181 CCVFYSHCGNRAVLRHRPLERRNSSWFSLWKKEERNTWLAKFLRMNELKDQVCSSWFAPV 240
           CCVFYSHCGN  VL+ R L+R+ S+WFS WKKEERNTWLAKFLR+NELKDQVCSSWFAPV
Sbjct: 181 CCVFYSHCGNHGVLKDRTLQRKTSNWFSFWKKEERNTWLAKFLRVNELKDQVCSSWFAPV 240

Query: 241 GSASDYPLLSKWVIYGELGNDNG--GSSDEISPIYSLWATFIGLYIANYVVERSTGWALT 298
           GSASDYPLLSKWVIY EL  +    G SD ISPIYSLWATFIGLYIANYVVERSTGWAL+
Sbjct: 241 GSASDYPLLSKWVIYSELACNGSCTGPSDGISPIYSLWATFIGLYIANYVVERSTGWALS 300

Query: 299 HPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAA 358
           HPLSV+EYEK+K+KQ+KP+FLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAA
Sbjct: 301 HPLSVKEYEKLKRKQMKPDFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAA 360

Query: 359 MNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVF 418
           M K ++GA+   LLYDH SE++DLWFDFMADTGDGGNSSYSVARLLAQP IR+  DDS++
Sbjct: 361 MRKLEDGARQDGLLYDHYSERDDLWFDFMADTGDGGNSSYSVARLLAQPSIRIVEDDSIY 420

Query: 419 TLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPE 478
            LPRGD+LLIGGDLAYPNPSAFTYERRLF PFEYALQPPPWYK DH+AV KPE+P  + E
Sbjct: 421 NLPRGDMLLIGGDLAYPNPSAFTYERRLFCPFEYALQPPPWYKSDHIAVKKPELPHWMSE 480

Query: 479 LKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGL 538
           LKQYDGPQCY+IPGNHDWFDGL+T+MR+ICHKSWLGGWFMPQKKSYFAL+LPK WWVFGL
Sbjct: 481 LKQYDGPQCYVIPGNHDWFDGLHTYMRYICHKSWLGGWFMPQKKSYFALKLPKRWWVFGL 540

Query: 539 DLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDY 598
           DLALH DIDVYQFKFF+ELV+E++G  DSVIIMTHEPNWLLD Y+ +VSGKNV HLICDY
Sbjct: 541 DLALHGDIDVYQFKFFSELVQEKMGADDSVIIMTHEPNWLLDCYWKDVSGKNVSHLICDY 600

Query: 599 LKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTY 658
           LKGRCKLRIAGD+HHYMRHS V SD  V V HLLVNGCGGAFLHPTHVFS+FRKF G+TY
Sbjct: 601 LKGRCKLRIAGDLHHYMRHSAVKSDESVNVHHLLVNGCGGAFLHPTHVFSSFRKFCGSTY 660

Query: 659 ESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSF 718
           E KAAYPSFEDS RIALGNILKFRKKNWQFDFIGGI+YF+LVFSMFPQC+L+HIL+EDSF
Sbjct: 661 ECKAAYPSFEDSGRIALGNILKFRKKNWQFDFIGGIIYFILVFSMFPQCKLDHILQEDSF 720

Query: 719 SGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAH 778
           SGHL+SFFGTVWNAF+Y+L  SYVS AGA++LLIVA+TF+PSK S+KKR +IG+LHVSAH
Sbjct: 721 SGHLKSFFGTVWNAFLYMLGESYVSLAGAIVLLIVAVTFIPSKASKKKRVIIGLLHVSAH 780

Query: 779 LAAALILMLLLELGVETCIQHKLLATSG 806
           LAAAL LMLLLELG+ETCI+H+LLATSG
Sbjct: 781 LAAALFLMLLLELGLETCIRHELLATSG 808


>gi|334186440|ref|NP_192917.3| calcineurin-like phosphoesterase domain-containing protein
           [Arabidopsis thaliana]
 gi|332657650|gb|AEE83050.1| calcineurin-like phosphoesterase domain-containing protein
           [Arabidopsis thaliana]
          Length = 1013

 Score = 1355 bits (3507), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 639/809 (78%), Positives = 727/809 (89%), Gaps = 5/809 (0%)

Query: 1   MGSDKHSAGLLDTLRMERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEK 60
           M S++HSA L ++L ME  RTILTHT+PYPHEHSRHAIIAV+ GCLFFISSDNM TLIEK
Sbjct: 1   MVSERHSARLYNSLPMESFRTILTHTYPYPHEHSRHAIIAVLFGCLFFISSDNMQTLIEK 60

Query: 61  LDNNIKWWSMYACLLGFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMG 120
              ++KWWSMYACLLGFFYFFSSPFI KTI P+YSNFSRWYIAWILVAA+YHLP+FQSMG
Sbjct: 61  F--SVKWWSMYACLLGFFYFFSSPFIQKTIRPNYSNFSRWYIAWILVAALYHLPNFQSMG 118

Query: 121 VDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVF 180
           +DLRMNLSLFLTI+++S+LFL+VFHIIFLGLWYVGLVSRVAG+RPEILTI+QNC V+S+ 
Sbjct: 119 LDLRMNLSLFLTIYISSILFLVVFHIIFLGLWYVGLVSRVAGRRPEILTILQNCAVLSMA 178

Query: 181 CCVFYSHCGNRAVLRHRPLERRNSSWFSLWKKEER-NTWLAKFLRMNELKDQVCSSWFAP 239
           CC+FYSHCGNRAVLR +PL R+ +SWFS WK+E R NTWLAKF+RMNELKDQVCSSWFAP
Sbjct: 179 CCIFYSHCGNRAVLRQKPLGRQYTSWFSFWKREHRHNTWLAKFIRMNELKDQVCSSWFAP 238

Query: 240 VGSASDYPLLSKWVIYGELGNDNG--GSSDEISPIYSLWATFIGLYIANYVVERSTGWAL 297
           VGSASDYPLLSKW IYGE+  +     S+DEISPIYSLWATFIGLYIANYVVERSTGWAL
Sbjct: 239 VGSASDYPLLSKWFIYGEIACNGSCPDSADEISPIYSLWATFIGLYIANYVVERSTGWAL 298

Query: 298 THPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQA 357
           THPLSV++YEK+K +QLKP+FLDMVPWYSGTSADLFKTVFDLLVSVTVF+GRFDMRM+QA
Sbjct: 299 THPLSVDKYEKLKNQQLKPDFLDMVPWYSGTSADLFKTVFDLLVSVTVFLGRFDMRMLQA 358

Query: 358 AMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSV 417
           AM K  + +   +LLYDHL+EK+D WFDFMADTGDGGNSSYSVA+LLAQP +RV   ++ 
Sbjct: 359 AMTKSGDASGRKELLYDHLAEKQDFWFDFMADTGDGGNSSYSVAKLLAQPSLRVPVANNF 418

Query: 418 FTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVP 477
            +LPRG+VLLIGGDLAYPNPS+FTYE+RLF PFEYALQPP WYK D +AV+KPE+P+GV 
Sbjct: 419 ISLPRGNVLLIGGDLAYPNPSSFTYEKRLFCPFEYALQPPRWYKNDSIAVDKPELPNGVS 478

Query: 478 ELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFG 537
           +LK Y+GPQC++IPGNHDWFDGLNTFMR+ICHKSWLGGW MPQKKSYFALQLPKGWWVFG
Sbjct: 479 DLKSYEGPQCFLIPGNHDWFDGLNTFMRYICHKSWLGGWLMPQKKSYFALQLPKGWWVFG 538

Query: 538 LDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICD 597
           LDLALH DIDV QFKFF+ELVK++VGE D+VII+THEPNWLLDWY++  +G+NV+HLICD
Sbjct: 539 LDLALHGDIDVDQFKFFSELVKDKVGESDAVIIITHEPNWLLDWYWSGDTGQNVRHLICD 598

Query: 598 YLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTT 657
            LK RCKLR+AGD+HHYMRHS   SDGP +VQHLLVNGCGGAFLHPTHVFS F KFYG +
Sbjct: 599 VLKYRCKLRMAGDLHHYMRHSCNQSDGPAHVQHLLVNGCGGAFLHPTHVFSKFSKFYGAS 658

Query: 658 YESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDS 717
           Y SK AYPSF+DSS+IALGNILKFRKKNWQFDFIGGI+YF+LVFS+FPQC+L H+LR DS
Sbjct: 659 YGSKVAYPSFDDSSKIALGNILKFRKKNWQFDFIGGIIYFILVFSLFPQCKLAHVLRGDS 718

Query: 718 FSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSA 777
           FSGHL SF GTVW+AF YV+E SYVSF G L+LLI AITFVPSK+S KKR +IGVLHV+A
Sbjct: 719 FSGHLESFLGTVWSAFAYVMEQSYVSFTGVLMLLITAITFVPSKVSLKKRVVIGVLHVAA 778

Query: 778 HLAAALILMLLLELGVETCIQHKLLATSG 806
           HL AALILML+LELG+E CIQH LLA SG
Sbjct: 779 HLMAALILMLMLELGIEICIQHNLLANSG 807


>gi|297813683|ref|XP_002874725.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297320562|gb|EFH50984.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 1027

 Score = 1353 bits (3501), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 644/827 (77%), Positives = 729/827 (88%), Gaps = 18/827 (2%)

Query: 1   MGSDKHSAGLLDTLRMERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEK 60
           M SD HSA L ++L MERVRTILTHT+PYPHEHSRHAIIAV+ GCLFFISSDNM TLIEK
Sbjct: 1   MVSDGHSARLYNSLPMERVRTILTHTYPYPHEHSRHAIIAVLFGCLFFISSDNMQTLIEK 60

Query: 61  LDNNIKWWSMYACLLGFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMG 120
              ++KWWSMYACLLGFFYFFSSPFI KTI P+YSNFSRWYIAWILVAA+YHLP+FQSMG
Sbjct: 61  F--SVKWWSMYACLLGFFYFFSSPFIQKTIRPNYSNFSRWYIAWILVAALYHLPNFQSMG 118

Query: 121 VDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVF 180
           +DLRMNLSLFLTI+++S+LFL+VFHIIFLGLWYVGLVSRVAG+RPEILTI+QNC V+S+ 
Sbjct: 119 LDLRMNLSLFLTIYISSILFLVVFHIIFLGLWYVGLVSRVAGRRPEILTILQNCAVLSMA 178

Query: 181 CCVFYSHCGNRAVLRHRPLERRNSSWFSLWKKEER-NTWLAKFLRMNELKDQVCSSWFAP 239
           CC+FYSHCGNRA+LR +PL R+ SSWFS WK+E R NTWLAKF+RMNELKDQVCSSWFAP
Sbjct: 179 CCIFYSHCGNRAILRQKPLGRQYSSWFSFWKREHRHNTWLAKFIRMNELKDQVCSSWFAP 238

Query: 240 VGSASDYPLLSKWVIYGELGNDNG--GSSDEISPIYSLWATFIGLYIANYVVERSTGWAL 297
           VGSASDYPLLSKW IYGE+  +     S+DEISPIYSLWATFIGLYIANYVVERSTGWAL
Sbjct: 239 VGSASDYPLLSKWFIYGEIACNGSCPDSADEISPIYSLWATFIGLYIANYVVERSTGWAL 298

Query: 298 THPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQA 357
           THPLSV++YEK+K +QLKP+FLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRM+QA
Sbjct: 299 THPLSVDKYEKLKNQQLKPDFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMLQA 358

Query: 358 AMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSV 417
           AM K  +     +LLYDHL++K+D WFDFMADTGDGGNSSYSVA+LLAQP +RV   D+ 
Sbjct: 359 AMTKSGDATGRKELLYDHLADKQDFWFDFMADTGDGGNSSYSVAKLLAQPSLRVPVADNF 418

Query: 418 FTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVP 477
            +LPRG+VLLIGGDLAYPNPSAFTYE+RLF PFEYALQPP WYK D +AV+KPE+P+GV 
Sbjct: 419 ISLPRGNVLLIGGDLAYPNPSAFTYEKRLFCPFEYALQPPRWYKNDSIAVDKPELPNGVS 478

Query: 478 ELKQYDGPQCYIIPGNH-------------DWFDGLNTFMRFICHKSWLGGWFMPQKKSY 524
           +LK Y+GPQC++IPGNH             +WFDGLNTFMR+ICHK+WLGGW MPQKKSY
Sbjct: 479 DLKSYEGPQCFLIPGNHGKFQVSASFLFQINWFDGLNTFMRYICHKNWLGGWLMPQKKSY 538

Query: 525 FALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFN 584
           FALQLPKGWWVFGLDLALH DIDV QFKFF+ELVK++VGE D+VII+THEPNWLLDWY++
Sbjct: 539 FALQLPKGWWVFGLDLALHGDIDVDQFKFFSELVKDKVGENDAVIIITHEPNWLLDWYWS 598

Query: 585 NVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPT 644
             +GKNV+HLICD LK RCKLR+AGD+HHYMRHS   SDGP +VQHLLVNGCGGAFLHPT
Sbjct: 599 GDTGKNVRHLICDVLKYRCKLRMAGDLHHYMRHSCNQSDGPAHVQHLLVNGCGGAFLHPT 658

Query: 645 HVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMF 704
           HVFSNF KFYG +Y SK AYPSF+DSS+IALGNILKFRKKNWQFDFIGGI+YF+LVFS+F
Sbjct: 659 HVFSNFSKFYGASYGSKVAYPSFDDSSKIALGNILKFRKKNWQFDFIGGIIYFILVFSLF 718

Query: 705 PQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSR 764
           PQC+L H+LR DSFSGHL SF GTVW+AF YV+E SYVSF G L+LLI AITFVPSK+S 
Sbjct: 719 PQCKLAHVLRGDSFSGHLESFLGTVWSAFAYVMEQSYVSFTGVLMLLITAITFVPSKVSP 778

Query: 765 KKRAMIGVLHVSAHLAAALILMLLLELGVETCIQHKLLATSGEFFIL 811
           KKR +IGVLHV+AHL AALILML+LELG+E CIQH LLA S  +  L
Sbjct: 779 KKRVVIGVLHVAAHLMAALILMLMLELGIEICIQHNLLANSAGYHTL 825


>gi|297803822|ref|XP_002869795.1| hypothetical protein ARALYDRAFT_914305 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297315631|gb|EFH46054.1| hypothetical protein ARALYDRAFT_914305 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 1015

 Score = 1340 bits (3467), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 644/809 (79%), Positives = 730/809 (90%), Gaps = 3/809 (0%)

Query: 1   MGSDKHSAGLLDTLRMERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEK 60
           MGSDKHSA  L  L+MERVRTILTHT+PYPHEHSRHA+IAV++GCLFFISS+NMH+L+EK
Sbjct: 1   MGSDKHSARFLHNLKMERVRTILTHTYPYPHEHSRHAMIAVILGCLFFISSENMHSLVEK 60

Query: 61  LDNNIKWWSMYACLLGFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMG 120
           LDNN KWWSMYACLLGFFYFFSSPFI KTI PSYS FSRWYIAWILVAA+YHLPSFQSMG
Sbjct: 61  LDNNFKWWSMYACLLGFFYFFSSPFIRKTIRPSYSTFSRWYIAWILVAALYHLPSFQSMG 120

Query: 121 VDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVF 180
           +DLRMNLSLFLTI+++S++FLLVFHI+FLGLWY+GLVSRVAG+RPEILTI+Q+C V+S+ 
Sbjct: 121 LDLRMNLSLFLTIYISSIVFLLVFHIVFLGLWYIGLVSRVAGRRPEILTILQSCAVLSIS 180

Query: 181 CCVFYSHCGNRAVLRHRPLERRNSSWFSLWKKEERN-TWLAKFLRMNELKDQVCSSWFAP 239
           CC+FYSHCGNRA  R  PLERR++S FSLWK E+ N TWL KF  ++EL+DQVCSSWFAP
Sbjct: 181 CCIFYSHCGNRAFQRQTPLERRHASRFSLWKGEDGNSTWLVKFTHIDELRDQVCSSWFAP 240

Query: 240 VGSASDYPLLSKWVIYGELGNDNG--GSSDEISPIYSLWATFIGLYIANYVVERSTGWAL 297
           VGSA DYPLLSKWVIYGEL  +     SSDEISPIYSLWATFIGLYIANYVVERSTGWAL
Sbjct: 241 VGSARDYPLLSKWVIYGELACNGSCPDSSDEISPIYSLWATFIGLYIANYVVERSTGWAL 300

Query: 298 THPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQA 357
            HPLSVE YEK+K++Q+KP FLDMVPWYSGTSADLFKTVFDLLVSVTVF+GRFDMRMMQA
Sbjct: 301 AHPLSVENYEKLKRQQMKPNFLDMVPWYSGTSADLFKTVFDLLVSVTVFLGRFDMRMMQA 360

Query: 358 AMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSV 417
           AM KD +G +  +LLYDH ++K D WFDFMADTGDGGNSSYSVA+LLAQP I+V   +  
Sbjct: 361 AMTKDCDGNKSKELLYDHFTDKTDFWFDFMADTGDGGNSSYSVAKLLAQPFIKVPLANDS 420

Query: 418 FTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVP 477
            +L RG++LLIGGDLAYPNPSAFTYE+RLF PFEYALQPP WYK D ++VNKPE+P GV 
Sbjct: 421 ISLERGNILLIGGDLAYPNPSAFTYEKRLFCPFEYALQPPHWYKTDSISVNKPELPDGVS 480

Query: 478 ELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFG 537
           +LK YDGPQC++IPGNHDWFDGLNTFMR++CHKSWLGGWFMPQKKSYFALQLPKGWWVFG
Sbjct: 481 DLKHYDGPQCFLIPGNHDWFDGLNTFMRYVCHKSWLGGWFMPQKKSYFALQLPKGWWVFG 540

Query: 538 LDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICD 597
           LDLALH DIDVYQF FF+ELVKE+VGE D+VII+THEPNWLLDWY+ + +GKN++HLI D
Sbjct: 541 LDLALHGDIDVYQFNFFSELVKEKVGENDAVIIITHEPNWLLDWYWKHDTGKNMRHLIYD 600

Query: 598 YLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTT 657
           +LKGRCKLR+AGD+HHYMRHS   SDGPV+V HLLVNGCGGAFLHPTHVF +F KFYG +
Sbjct: 601 FLKGRCKLRMAGDLHHYMRHSCTQSDGPVHVPHLLVNGCGGAFLHPTHVFRSFSKFYGAS 660

Query: 658 YESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDS 717
           YESK+AYPSF+DSSRIALGNILKFRKKNWQFDFIGGI+YF+LVFS+FPQC+L HILR DS
Sbjct: 661 YESKSAYPSFDDSSRIALGNILKFRKKNWQFDFIGGIIYFLLVFSLFPQCKLGHILRGDS 720

Query: 718 FSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSA 777
           FSGHL SFFGTVW++F+YV+E SYVSF G L+LLI AI FVPSK+SR+KR +IG+LHVSA
Sbjct: 721 FSGHLGSFFGTVWSSFVYVIEQSYVSFTGVLMLLITAIMFVPSKISRRKRLLIGILHVSA 780

Query: 778 HLAAALILMLLLELGVETCIQHKLLATSG 806
           HL AALILMLLLELG+E CIQHKLLATSG
Sbjct: 781 HLTAALILMLLLELGIEICIQHKLLATSG 809


>gi|240256041|ref|NP_194031.5| hydrolase/ protein serine/threonine phosphatase [Arabidopsis
           thaliana]
 gi|332659291|gb|AEE84691.1| hydrolase/ protein serine/threonine phosphatase [Arabidopsis
           thaliana]
          Length = 1015

 Score = 1339 bits (3466), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 646/809 (79%), Positives = 729/809 (90%), Gaps = 3/809 (0%)

Query: 1   MGSDKHSAGLLDTLRMERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEK 60
           MGSDK+SA  L  L+MERVRTILTHT+PYPHEHSRHA+IAVV+GC+FFISS+NMH+L+EK
Sbjct: 1   MGSDKNSARFLHNLKMERVRTILTHTYPYPHEHSRHAMIAVVLGCMFFISSENMHSLVEK 60

Query: 61  LDNNIKWWSMYACLLGFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMG 120
           LDNN KWWSMYACLLGFFYFFSSPFI KTI PSYS FSRWYIAWILVAA+YHLPSFQSMG
Sbjct: 61  LDNNFKWWSMYACLLGFFYFFSSPFIKKTIRPSYSTFSRWYIAWILVAALYHLPSFQSMG 120

Query: 121 VDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVF 180
           +DLRMNLSLFLTI+++S++FLLVFHIIFLGLWY+GLVSRVAG+RPEILTI+Q+C V+S+ 
Sbjct: 121 LDLRMNLSLFLTIYISSIVFLLVFHIIFLGLWYIGLVSRVAGRRPEILTILQSCAVLSIS 180

Query: 181 CCVFYSHCGNRAVLRHRPLERRNSSWFSLWKKEERN-TWLAKFLRMNELKDQVCSSWFAP 239
           CC+FYSHCGNRA  R  PLE+R+SS FSLWK E+ N TWLAKF  ++EL+DQVCSSWFAP
Sbjct: 181 CCIFYSHCGNRAFQRQTPLEKRHSSRFSLWKGEDGNSTWLAKFTHIDELRDQVCSSWFAP 240

Query: 240 VGSASDYPLLSKWVIYGELGNDNG--GSSDEISPIYSLWATFIGLYIANYVVERSTGWAL 297
           VGSA DYPLLSKWVIYGEL  +     SSDEISPIYSLWATFIGLYIANYVVERSTGWAL
Sbjct: 241 VGSARDYPLLSKWVIYGELACNGSCPDSSDEISPIYSLWATFIGLYIANYVVERSTGWAL 300

Query: 298 THPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQA 357
            HPLSVE YEK+K++Q+KP FLDMVPWYSGTSADLFKTVFDLLVSVTVF+GRFDMRMMQA
Sbjct: 301 AHPLSVENYEKLKRQQMKPNFLDMVPWYSGTSADLFKTVFDLLVSVTVFLGRFDMRMMQA 360

Query: 358 AMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSV 417
           AMNKD +G +  +LLYDH ++K D WFDFMADTGDGGNSSYSVA+LLAQP I V  D+  
Sbjct: 361 AMNKDCDGNKSKELLYDHFADKTDFWFDFMADTGDGGNSSYSVAKLLAQPFINVPLDNDS 420

Query: 418 FTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVP 477
            +L RG++LLIGGDLAYPNPSAFTYE+RLF PFEYALQPP WYK D ++VNKPE+P GV 
Sbjct: 421 ISLERGNILLIGGDLAYPNPSAFTYEKRLFCPFEYALQPPHWYKTDSISVNKPELPDGVS 480

Query: 478 ELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFG 537
           +LK YDGPQC++IPGNHDWFDGLNTFMR++CHKSWLGGWFMPQKKSYFALQLPKGWWVFG
Sbjct: 481 DLKHYDGPQCFLIPGNHDWFDGLNTFMRYVCHKSWLGGWFMPQKKSYFALQLPKGWWVFG 540

Query: 538 LDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICD 597
           LDLALH DIDVYQF FF++LVKE+VGE D+VII+THEPNWLLDWY+ + +GKN++HLI +
Sbjct: 541 LDLALHGDIDVYQFNFFSKLVKEKVGENDAVIIITHEPNWLLDWYWKDDTGKNMRHLIFE 600

Query: 598 YLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTT 657
           +LKGRCKLR+AGD+HHYMRHS   SDGPV+V HLLVNGCGGAFLHPTHVF  F KFYG +
Sbjct: 601 FLKGRCKLRMAGDLHHYMRHSCTQSDGPVHVPHLLVNGCGGAFLHPTHVFRCFSKFYGAS 660

Query: 658 YESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDS 717
           YESK+AYPSFEDSSRIALGNILKFRKKNWQFDFIGGI+YF+LVFS+FPQCEL HILR DS
Sbjct: 661 YESKSAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIIYFLLVFSLFPQCELGHILRGDS 720

Query: 718 FSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSA 777
           FSGHL SFFGTVW++F+YV E SYVSF G L+LLI AI FVPSK+SR+KR +IG+LHVSA
Sbjct: 721 FSGHLGSFFGTVWSSFVYVTEQSYVSFTGVLMLLITAIMFVPSKISRRKRLLIGILHVSA 780

Query: 778 HLAAALILMLLLELGVETCIQHKLLATSG 806
           HL AALILMLLLELG+E CIQHKLLA SG
Sbjct: 781 HLMAALILMLLLELGIEICIQHKLLANSG 809


>gi|5139333|emb|CAB45564.1| hypothetical protein [Arabidopsis thaliana]
 gi|7269147|emb|CAB79255.1| hypothetical protein [Arabidopsis thaliana]
          Length = 932

 Score = 1337 bits (3461), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 646/809 (79%), Positives = 729/809 (90%), Gaps = 3/809 (0%)

Query: 1   MGSDKHSAGLLDTLRMERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEK 60
           MGSDK+SA  L  L+MERVRTILTHT+PYPHEHSRHA+IAVV+GC+FFISS+NMH+L+EK
Sbjct: 27  MGSDKNSARFLHNLKMERVRTILTHTYPYPHEHSRHAMIAVVLGCMFFISSENMHSLVEK 86

Query: 61  LDNNIKWWSMYACLLGFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMG 120
           LDNN KWWSMYACLLGFFYFFSSPFI KTI PSYS FSRWYIAWILVAA+YHLPSFQSMG
Sbjct: 87  LDNNFKWWSMYACLLGFFYFFSSPFIKKTIRPSYSTFSRWYIAWILVAALYHLPSFQSMG 146

Query: 121 VDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVF 180
           +DLRMNLSLFLTI+++S++FLLVFHIIFLGLWY+GLVSRVAG+RPEILTI+Q+C V+S+ 
Sbjct: 147 LDLRMNLSLFLTIYISSIVFLLVFHIIFLGLWYIGLVSRVAGRRPEILTILQSCAVLSIS 206

Query: 181 CCVFYSHCGNRAVLRHRPLERRNSSWFSLWKKEERN-TWLAKFLRMNELKDQVCSSWFAP 239
           CC+FYSHCGNRA  R  PLE+R+SS FSLWK E+ N TWLAKF  ++EL+DQVCSSWFAP
Sbjct: 207 CCIFYSHCGNRAFQRQTPLEKRHSSRFSLWKGEDGNSTWLAKFTHIDELRDQVCSSWFAP 266

Query: 240 VGSASDYPLLSKWVIYGELGNDNG--GSSDEISPIYSLWATFIGLYIANYVVERSTGWAL 297
           VGSA DYPLLSKWVIYGEL  +     SSDEISPIYSLWATFIGLYIANYVVERSTGWAL
Sbjct: 267 VGSARDYPLLSKWVIYGELACNGSCPDSSDEISPIYSLWATFIGLYIANYVVERSTGWAL 326

Query: 298 THPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQA 357
            HPLSVE YEK+K++Q+KP FLDMVPWYSGTSADLFKTVFDLLVSVTVF+GRFDMRMMQA
Sbjct: 327 AHPLSVENYEKLKRQQMKPNFLDMVPWYSGTSADLFKTVFDLLVSVTVFLGRFDMRMMQA 386

Query: 358 AMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSV 417
           AMNKD +G +  +LLYDH ++K D WFDFMADTGDGGNSSYSVA+LLAQP I V  D+  
Sbjct: 387 AMNKDCDGNKSKELLYDHFADKTDFWFDFMADTGDGGNSSYSVAKLLAQPFINVPLDNDS 446

Query: 418 FTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVP 477
            +L RG++LLIGGDLAYPNPSAFTYE+RLF PFEYALQPP WYK D ++VNKPE+P GV 
Sbjct: 447 ISLERGNILLIGGDLAYPNPSAFTYEKRLFCPFEYALQPPHWYKTDSISVNKPELPDGVS 506

Query: 478 ELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFG 537
           +LK YDGPQC++IPGNHDWFDGLNTFMR++CHKSWLGGWFMPQKKSYFALQLPKGWWVFG
Sbjct: 507 DLKHYDGPQCFLIPGNHDWFDGLNTFMRYVCHKSWLGGWFMPQKKSYFALQLPKGWWVFG 566

Query: 538 LDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICD 597
           LDLALH DIDVYQF FF++LVKE+VGE D+VII+THEPNWLLDWY+ + +GKN++HLI +
Sbjct: 567 LDLALHGDIDVYQFNFFSKLVKEKVGENDAVIIITHEPNWLLDWYWKDDTGKNMRHLIFE 626

Query: 598 YLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTT 657
           +LKGRCKLR+AGD+HHYMRHS   SDGPV+V HLLVNGCGGAFLHPTHVF  F KFYG +
Sbjct: 627 FLKGRCKLRMAGDLHHYMRHSCTQSDGPVHVPHLLVNGCGGAFLHPTHVFRCFSKFYGAS 686

Query: 658 YESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDS 717
           YESK+AYPSFEDSSRIALGNILKFRKKNWQFDFIGGI+YF+LVFS+FPQCEL HILR DS
Sbjct: 687 YESKSAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIIYFLLVFSLFPQCELGHILRGDS 746

Query: 718 FSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSA 777
           FSGHL SFFGTVW++F+YV E SYVSF G L+LLI AI FVPSK+SR+KR +IG+LHVSA
Sbjct: 747 FSGHLGSFFGTVWSSFVYVTEQSYVSFTGVLMLLITAIMFVPSKISRRKRLLIGILHVSA 806

Query: 778 HLAAALILMLLLELGVETCIQHKLLATSG 806
           HL AALILMLLLELG+E CIQHKLLA SG
Sbjct: 807 HLMAALILMLLLELGIEICIQHKLLANSG 835


>gi|5002515|emb|CAB44318.1| putative protein [Arabidopsis thaliana]
 gi|7267880|emb|CAB78223.1| putative protein [Arabidopsis thaliana]
          Length = 1012

 Score = 1331 bits (3444), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 632/822 (76%), Positives = 720/822 (87%), Gaps = 18/822 (2%)

Query: 16  MERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEKLDNNIKWWSMYACLL 75
           ME  RTILTHT+PYPHEHSRHAIIAV+ GCLFFISSDNM TLIEK   ++KWWSMYACLL
Sbjct: 1   MESFRTILTHTYPYPHEHSRHAIIAVLFGCLFFISSDNMQTLIEKF--SVKWWSMYACLL 58

Query: 76  GFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMGVDLRMNLSLFLTIFL 135
           GFFYFFSSPFI KTI P+YSNFSRWYIAWILVAA+YHLP+FQSMG+DLRMNLSLFLTI++
Sbjct: 59  GFFYFFSSPFIQKTIRPNYSNFSRWYIAWILVAALYHLPNFQSMGLDLRMNLSLFLTIYI 118

Query: 136 ASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVFCCVFYSHCGNRAVLR 195
           +S+LFL+VFHIIFLGLWYVGLVSRVAG+RPEILTI+QNC V+S+ CC+FYSHCGNRAVLR
Sbjct: 119 SSILFLVVFHIIFLGLWYVGLVSRVAGRRPEILTILQNCAVLSMACCIFYSHCGNRAVLR 178

Query: 196 HRPLERRNSSWFSLWKKEER-NTWLAKFLRMNELKDQVCSSWFAPVGSASDYPLLSKWVI 254
            +PL R+ +SWFS WK+E R NTWLAKF+RMNELKDQVCSSWFAPVGSASDYPLLSKW I
Sbjct: 179 QKPLGRQYTSWFSFWKREHRHNTWLAKFIRMNELKDQVCSSWFAPVGSASDYPLLSKWFI 238

Query: 255 YGELGNDNG--GSSDEISPIYSLWATFIGLYIANYVVERSTGWALTHPLSVEEYEKMKKK 312
           YGE+  +     S+DEISPIYSLWATFIGLYIANYVVERSTGWALTHPLSV++YEK+K +
Sbjct: 239 YGEIACNGSCPDSADEISPIYSLWATFIGLYIANYVVERSTGWALTHPLSVDKYEKLKNQ 298

Query: 313 QLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLL 372
           QLKP+FLDMVPWYSGTSADLFKTVFDLLVSVTVF+GRFDMRM+QAAM K  + +   +LL
Sbjct: 299 QLKPDFLDMVPWYSGTSADLFKTVFDLLVSVTVFLGRFDMRMLQAAMTKSGDASGRKELL 358

Query: 373 YDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDL 432
           YDHL+EK+D WFDFMADTGDGGNSSYSVA+LLAQP +RV   ++  +LPRG+VLLIGGDL
Sbjct: 359 YDHLAEKQDFWFDFMADTGDGGNSSYSVAKLLAQPSLRVPVANNFISLPRGNVLLIGGDL 418

Query: 433 AYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPG 492
           AYPNPS+FTYE+RLF PFEYALQPP WYK D +AV+KPE+P+GV +LK Y+GPQC++IPG
Sbjct: 419 AYPNPSSFTYEKRLFCPFEYALQPPRWYKNDSIAVDKPELPNGVSDLKSYEGPQCFLIPG 478

Query: 493 NH-------------DWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLD 539
           NH             +WFDGLNTFMR+ICHKSWLGGW MPQKKSYFALQLPKGWWVFGLD
Sbjct: 479 NHGEFQVSAAFIFQINWFDGLNTFMRYICHKSWLGGWLMPQKKSYFALQLPKGWWVFGLD 538

Query: 540 LALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYL 599
           LALH DIDV QFKFF+ELVK++VGE D+VII+THEPNWLLDWY++  +G+NV+HLICD L
Sbjct: 539 LALHGDIDVDQFKFFSELVKDKVGESDAVIIITHEPNWLLDWYWSGDTGQNVRHLICDVL 598

Query: 600 KGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYE 659
           K RCKLR+AGD+HHYMRHS   SDGP +VQHLLVNGCGGAFLHPTHVFS F KFYG +Y 
Sbjct: 599 KYRCKLRMAGDLHHYMRHSCNQSDGPAHVQHLLVNGCGGAFLHPTHVFSKFSKFYGASYG 658

Query: 660 SKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFS 719
           SK AYPSF+DSS+IALGNILKFRKKNWQFDFIGGI+YF+LVFS+FPQC+L H+LR DSFS
Sbjct: 659 SKVAYPSFDDSSKIALGNILKFRKKNWQFDFIGGIIYFILVFSLFPQCKLAHVLRGDSFS 718

Query: 720 GHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHL 779
           GHL SF GTVW+AF YV+E SYVSF G L+LLI AITFVPSK+S KKR +IGVLHV+AHL
Sbjct: 719 GHLESFLGTVWSAFAYVMEQSYVSFTGVLMLLITAITFVPSKVSLKKRVVIGVLHVAAHL 778

Query: 780 AAALILMLLLELGVETCIQHKLLATSGEFFILVSFNSVTMND 821
            AALILML+LELG+E CIQH LLA S  +  L  +     N+
Sbjct: 779 MAALILMLMLELGIEICIQHNLLANSAGYHTLYEWYKSVENE 820


>gi|357154629|ref|XP_003576847.1| PREDICTED: uncharacterized protein LOC100842069 [Brachypodium
           distachyon]
          Length = 1019

 Score = 1242 bits (3213), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 576/803 (71%), Positives = 691/803 (86%), Gaps = 4/803 (0%)

Query: 7   SAGLLDTLRMERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEKLDNNIK 66
           S  L+ +L MERVRTILTH +PYPHEHSRH +IAV+ G LF ISSDN+  LI KLD N K
Sbjct: 8   SGCLIVSLEMERVRTILTHRYPYPHEHSRHFMIAVIAGWLFLISSDNLQNLIMKLDKNFK 67

Query: 67  WWSMYACLLGFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMGVDLRMN 126
           WWSMYACL+GFFYFFSSPFI KTI P+YSNFSRWYIAWI +AA+YHLPSFQSMG+DLRMN
Sbjct: 68  WWSMYACLIGFFYFFSSPFIRKTIKPNYSNFSRWYIAWIFLAALYHLPSFQSMGLDLRMN 127

Query: 127 LSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVFCCVFYS 186
           LSLFLTI+++S++FL+VFH+IFLGLWY+GLVSR+A K+PE+LTIIQNC VIS+ CCVFYS
Sbjct: 128 LSLFLTIYISSLIFLIVFHVIFLGLWYLGLVSRMAEKKPEMLTIIQNCAVISIACCVFYS 187

Query: 187 HCGNRAVLRHRPLERRNSSW--FSLWKKE-ERNTWLAKFLRMNELKDQVCSSWFAPVGSA 243
           HCGNR V R +  +RR +SW  FSLW+K+ + NT ++K LRM++ KDQ+CSSWFAPVGSA
Sbjct: 188 HCGNRTVSRDKSTDRRTASWVAFSLWRKQNDDNTLISKLLRMHKFKDQICSSWFAPVGSA 247

Query: 244 SDYPLLSKWVIYGELGNDNGGSSDEISPIYSLWATFIGLYIANYVVERSTGWALTHPLSV 303
           SDYPLLSKW IYGEL ++    S+ ISP+YSLWATFIGLY+ANYVVERSTGWALTHPL++
Sbjct: 248 SDYPLLSKWAIYGELASNGSEHSNIISPVYSLWATFIGLYMANYVVERSTGWALTHPLTI 307

Query: 304 EEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQ 363
            EYE++K+  LKPEF DMVPWYSGTS DLFKTVFDL+VSVT+FVGRFDMRMMQAAMNK  
Sbjct: 308 SEYERLKR-LLKPEFEDMVPWYSGTSTDLFKTVFDLMVSVTLFVGRFDMRMMQAAMNKTP 366

Query: 364 EGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRG 423
           + ++  DLLYDHL  K++LWFDF+ADTGDGGNS+Y++ARLLAQP + +  DDS  T PRG
Sbjct: 367 DESKSSDLLYDHLDGKDELWFDFIADTGDGGNSTYAIARLLAQPSLVIKSDDSRLTFPRG 426

Query: 424 DVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYD 483
           ++LLIGGDLAYPNPS+F+YERR F PFE AL+PP WYK +H+A+ KPE+P GV EL++Y 
Sbjct: 427 ELLLIGGDLAYPNPSSFSYERRFFSPFEDALKPPAWYKPEHIALEKPELPLGVSELRKYR 486

Query: 484 GPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALH 543
           GPQC++IPGNHDWFDGL+TFMR+ICHKSWLGGWF+PQK+SYFAL+LP GWWVFGLD ALH
Sbjct: 487 GPQCFLIPGNHDWFDGLHTFMRYICHKSWLGGWFLPQKRSYFALKLPNGWWVFGLDQALH 546

Query: 544 CDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKGRC 603
            DIDVYQFKFFAEL +E+VGE DSVI++THEPNWLLDWY+ + +GKNV +LIC+YLKGRC
Sbjct: 547 GDIDVYQFKFFAELCREKVGESDSVIVITHEPNWLLDWYWGDKTGKNVTYLICEYLKGRC 606

Query: 604 KLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAA 663
           KLR+AGD+HHYMRHS V S  PV+V HLLVNGCGGAFLHPTHVF NF++ YG  YE+KA 
Sbjct: 607 KLRMAGDLHHYMRHSCVESKEPVHVHHLLVNGCGGAFLHPTHVFENFKECYGNKYETKAT 666

Query: 664 YPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHLR 723
           YPS++DSS+IALGNILKFR+KNWQFD IGG VYFVLVFSMFPQC+   IL EDS+   + 
Sbjct: 667 YPSYDDSSKIALGNILKFRRKNWQFDVIGGFVYFVLVFSMFPQCDSFRILHEDSWGDRVS 726

Query: 724 SFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHLAAAL 783
           SFF  +WNA   +LE SYVS AG + LL+V+  FVP+KLSR++RA++G LH SAH+ +A+
Sbjct: 727 SFFIAMWNAVFEILERSYVSLAGVVTLLMVSFFFVPTKLSRRRRALLGFLHASAHITSAV 786

Query: 784 ILMLLLELGVETCIQHKLLATSG 806
           +LMLL+ELG+E CI++ LLATSG
Sbjct: 787 LLMLLMELGIEICIRNHLLATSG 809


>gi|115470207|ref|NP_001058702.1| Os07g0106000 [Oryza sativa Japonica Group]
 gi|33354215|dbj|BAC81181.1| unknown protein [Oryza sativa Japonica Group]
 gi|50508992|dbj|BAD31941.1| unknown protein [Oryza sativa Japonica Group]
 gi|113610238|dbj|BAF20616.1| Os07g0106000 [Oryza sativa Japonica Group]
 gi|215706427|dbj|BAG93283.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218198956|gb|EEC81383.1| hypothetical protein OsI_24595 [Oryza sativa Indica Group]
 gi|222636303|gb|EEE66435.1| hypothetical protein OsJ_22800 [Oryza sativa Japonica Group]
          Length = 1016

 Score = 1240 bits (3209), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 590/810 (72%), Positives = 693/810 (85%), Gaps = 5/810 (0%)

Query: 1   MGSDKHSAG-LLDTLRMERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIE 59
           MGSDK S   LL TL+M+ VRTILTHT+PYPHEHSRH + AV++ CLFFISSDNMHTLI 
Sbjct: 1   MGSDKQSGSPLLGTLKMKSVRTILTHTYPYPHEHSRHIMTAVIIACLFFISSDNMHTLIH 60

Query: 60  KLDNNIKWWSMYACLLGFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSM 119
           KLDNNIKWWSMY CL+GFFYFFSSPF+G+TI PSYSNF+RWY+AWI  A++YHLPSFQSM
Sbjct: 61  KLDNNIKWWSMYVCLIGFFYFFSSPFLGRTIQPSYSNFNRWYVAWICFASLYHLPSFQSM 120

Query: 120 GVDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISV 179
           GVD+RMNLSLFLTI+ +SVLF++ FHI+F+GLWY+GLV+R+AG RP I TI QNC VIS+
Sbjct: 121 GVDMRMNLSLFLTIYFSSVLFIIAFHIVFIGLWYIGLVARMAGTRPGIWTIFQNCTVISI 180

Query: 180 FCCVFYSHCGNRAVLRHRPLERR-NSSWFSLWKKEERNTWLAKFLRMNELKDQVCSSWFA 238
            CCVFYSHCGN AV + +   R  + +  +  + E+  TW++ FLRMNELKDQ+CSSWFA
Sbjct: 181 ACCVFYSHCGNLAVHKSKSFSRNSDPNLLAFLENEKGTTWISNFLRMNELKDQICSSWFA 240

Query: 239 PVGSASDYPLLSKWVIYGEL--GNDNGGSSDEISPIYSLWATFIGLYIANYVVERSTGWA 296
           PVGSASDYPLLSKWVIYGEL       G SDEISP+YSLWATF+GLYIAN+VVERSTGWA
Sbjct: 241 PVGSASDYPLLSKWVIYGELVCSGSCAGPSDEISPLYSLWATFVGLYIANFVVERSTGWA 300

Query: 297 LTHPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQ 356
           LTHP +V E EK+K+ Q+KP+FLDMVPWYSGTSADLFKT FDL+VSVT+FVGRFDMRMMQ
Sbjct: 301 LTHPSTVLEEEKLKR-QMKPDFLDMVPWYSGTSADLFKTAFDLMVSVTLFVGRFDMRMMQ 359

Query: 357 AAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDS 416
           AAM +  +  Q+ DLLYD+ +E+EDLWFDF+ADTGDGGNSSY+VARLLAQP I+     S
Sbjct: 360 AAMKRTTDETQNDDLLYDYFNEREDLWFDFVADTGDGGNSSYTVARLLAQPSIQTVIGGS 419

Query: 417 VFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGV 476
           + TLPRG++LLIGGDLAYPNPS+FTYE R F P+EYALQPPPWY+ +H+A++KPEVP G+
Sbjct: 420 MHTLPRGNLLLIGGDLAYPNPSSFTYEMRFFSPYEYALQPPPWYRAEHIALDKPEVPLGI 479

Query: 477 PELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVF 536
            ++K YDGPQC+IIPGNHDWFDGL+TFMR++CHKSWLGGWF+PQKKSYFAL+LP+GWWVF
Sbjct: 480 SKMKDYDGPQCFIIPGNHDWFDGLHTFMRYVCHKSWLGGWFLPQKKSYFALRLPQGWWVF 539

Query: 537 GLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLIC 596
           GLDLALH DIDVYQFKFFAEL + ++GE DSVI+MTHEPNWLLDWY+   +GKNV HLI 
Sbjct: 540 GLDLALHGDIDVYQFKFFAELCRNKIGENDSVIVMTHEPNWLLDWYWKETTGKNVSHLIQ 599

Query: 597 DYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGT 656
           DYL GRCKLR+AGD+HH+MRHS    D P  VQHLLVNGCGGAFLHPTHVF NF +F G 
Sbjct: 600 DYLNGRCKLRLAGDLHHFMRHSANQIDNPTSVQHLLVNGCGGAFLHPTHVFKNFEQFSGA 659

Query: 657 TYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILRED 716
           TYE KAAYPSF+DSS IALGNILKFRKKNWQFD IGG +YF+LVFSMFPQC L HIL E+
Sbjct: 660 TYECKAAYPSFDDSSGIALGNILKFRKKNWQFDTIGGFIYFILVFSMFPQCNLGHILNEE 719

Query: 717 SFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVS 776
           ++SG L SF  T+W+A +Y+ EHSYVS  G+L LL+ + +FVPSKLSR+KRA+IG LHV 
Sbjct: 720 TWSGRLGSFSNTIWSALLYIFEHSYVSSVGSLTLLLASYSFVPSKLSRRKRAIIGGLHVL 779

Query: 777 AHLAAALILMLLLELGVETCIQHKLLATSG 806
           AHL AAL+LMLLLELG+E CI++ LLATSG
Sbjct: 780 AHLTAALLLMLLLELGIEICIRNHLLATSG 809


>gi|414589082|tpg|DAA39653.1| TPA: hypothetical protein ZEAMMB73_888857 [Zea mays]
          Length = 1041

 Score = 1234 bits (3192), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 576/812 (70%), Positives = 692/812 (85%), Gaps = 7/812 (0%)

Query: 1   MGSDK----HSAGLLDTLRMERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHT 56
           MG +K     S  L+++L+MER+R ILTH +PYPHEHSRH IIAV    LFFISSDN+  
Sbjct: 1   MGKEKLHKQPSGRLIESLKMERMRNILTHRYPYPHEHSRHFIIAVFACWLFFISSDNLQN 60

Query: 57  LIEKLDNNIKWWSMYACLLGFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSF 116
           LI KLD N KWWSMYACL+GFFYFFSSPFI KTI P+YSNF+RWYIAWI +AA+YHLPSF
Sbjct: 61  LIMKLDKNFKWWSMYACLIGFFYFFSSPFIRKTIKPNYSNFNRWYIAWIFLAALYHLPSF 120

Query: 117 QSMGVDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVV 176
           QSMG+DLRMNLSLFLTI+++S++FL+VFHIIFLGLWY+G VSR+A K+PE+LTIIQNC V
Sbjct: 121 QSMGLDLRMNLSLFLTIYISSLIFLMVFHIIFLGLWYLGFVSRMAEKKPEMLTIIQNCAV 180

Query: 177 ISVFCCVFYSHCGNRAVLRHRPLERRNSSW--FSLWKKEERNTWLAKFLRMNELKDQVCS 234
           IS+ CCVFYSHCGNR V R + ++RR +SW  FSLW K + NT +++ LRM++ K+Q+CS
Sbjct: 181 ISIACCVFYSHCGNRTVSRDKSIDRRTASWIVFSLWTKHDDNTLISRLLRMHKFKEQICS 240

Query: 235 SWFAPVGSASDYPLLSKWVIYGELGNDNGGSSDEISPIYSLWATFIGLYIANYVVERSTG 294
           SWFAPVGSASDYPLLSKW IYGEL ++  GSS+EISP+YSLWATF+GLYIANYV+ERSTG
Sbjct: 241 SWFAPVGSASDYPLLSKWAIYGELSSNGSGSSNEISPVYSLWATFMGLYIANYVIERSTG 300

Query: 295 WALTHPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRM 354
           W LTHPL++ EYEK+KK QLKP+F DMVPWYSGTS DLFKTVFDL++SVT+FVGRFDMRM
Sbjct: 301 WVLTHPLTISEYEKLKK-QLKPDFEDMVPWYSGTSTDLFKTVFDLMISVTLFVGRFDMRM 359

Query: 355 MQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRD 414
           MQAAMNK  + A   DLLYDHL  K++LWFDF+ADTGDGGNS+Y+VARLLAQP + +  D
Sbjct: 360 MQAAMNKTPDEANSHDLLYDHLDGKDELWFDFIADTGDGGNSTYAVARLLAQPLLVINSD 419

Query: 415 DSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPS 474
           DS  T PRG +LL+GGDLAYPNPS+F+YERR F PFEYALQPP WYK +H+A+ KPE+P 
Sbjct: 420 DSRLTFPRGQLLLVGGDLAYPNPSSFSYERRFFCPFEYALQPPAWYKPEHIALEKPELPL 479

Query: 475 GVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWW 534
           GV EL++Y GPQC++IPGNHDWFDGLNTF+R+ICHKSW+GGWF+PQKKSYFAL+LP GWW
Sbjct: 480 GVSELRRYRGPQCFMIPGNHDWFDGLNTFIRYICHKSWVGGWFLPQKKSYFALKLPNGWW 539

Query: 535 VFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHL 594
           VFGLD ALH DIDVYQFKFFAEL +++VGE DSVII+THEPNWLLDWY+ + +G NV +L
Sbjct: 540 VFGLDQALHGDIDVYQFKFFAELCQQKVGESDSVIIITHEPNWLLDWYWGDSTGTNVAYL 599

Query: 595 ICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFY 654
           I +YL+GRCKLR+AGD+HHYMRHS + S  PV+VQHLLVNGCGGAFLHPTHVF NFR FY
Sbjct: 600 IREYLRGRCKLRMAGDLHHYMRHSCIESKEPVHVQHLLVNGCGGAFLHPTHVFENFRVFY 659

Query: 655 GTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILR 714
           G  YE+K+ YPS+ DSS+IALGNILKFR+KNWQFD IGG VYFVLVFSMFPQC+  HIL 
Sbjct: 660 GNKYETKSTYPSYHDSSKIALGNILKFRRKNWQFDVIGGFVYFVLVFSMFPQCDSFHILH 719

Query: 715 EDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLH 774
           EDS++G +  FF  +WNA   +LE SYVS  G + LL+V+  FVP+KLSR++R ++G LH
Sbjct: 720 EDSWAGRINGFFTAMWNAVFEILERSYVSLGGVVTLLMVSFFFVPTKLSRRRRVLLGFLH 779

Query: 775 VSAHLAAALILMLLLELGVETCIQHKLLATSG 806
            +AHL +A++LMLL+EL +E CI++ LLATSG
Sbjct: 780 AAAHLTSAVLLMLLMELAIEICIRNHLLATSG 811


>gi|357111797|ref|XP_003557697.1| PREDICTED: uncharacterized protein LOC100823404 [Brachypodium
           distachyon]
          Length = 1016

 Score = 1224 bits (3168), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 594/811 (73%), Positives = 689/811 (84%), Gaps = 7/811 (0%)

Query: 1   MGSDKHSAG-LLDTLRME-RVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLI 58
           MGSDK S   LL TL++  RVRTILTHT+PYPHEHSRH + AV++GCLFFISSDNMHTLI
Sbjct: 1   MGSDKQSGSPLLGTLKVGGRVRTILTHTYPYPHEHSRHIMTAVIIGCLFFISSDNMHTLI 60

Query: 59  EKLDNNIKWWSMYACLLGFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQS 118
            KLDNNIKWWSMY CL+GFFYFFSSPF+G+TI PSYSNF+RWY+AWI  A++YHLPSFQS
Sbjct: 61  HKLDNNIKWWSMYVCLIGFFYFFSSPFLGRTIQPSYSNFNRWYVAWICFASLYHLPSFQS 120

Query: 119 MGVDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVIS 178
           MGVD+RMNLSLFLTI+ +SVLF+L FHIIF+GLWY+GLV+R+AG RP I TIIQNC VIS
Sbjct: 121 MGVDMRMNLSLFLTIYFSSVLFILAFHIIFIGLWYIGLVARMAGTRPGIWTIIQNCTVIS 180

Query: 179 VFCCVFYSHCGNRAVLRHRPLERR-NSSWFSLWKKEERNTWLAKFLRMNELKDQVCSSWF 237
           + CCVFYSHCGN AV +     R  + S  +  K E   TW++ FL MNELKDQ+CSSWF
Sbjct: 181 IACCVFYSHCGNLAVHKSESFARNSDPSLLAFLKNENGTTWISNFLFMNELKDQICSSWF 240

Query: 238 APVGSASDYPLLSKWVIYGEL--GNDNGGSSDEISPIYSLWATFIGLYIANYVVERSTGW 295
           APVGSASDYPLLSKWVIYGEL       G SDEISP+YSLWATF+GLYIAN+VVERSTGW
Sbjct: 241 APVGSASDYPLLSKWVIYGELVCSGSCAGPSDEISPLYSLWATFVGLYIANFVVERSTGW 300

Query: 296 ALTHPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMM 355
           ALTH   V E EK+KK  +KP+FLDMVPWYSGTSADLFKT FDL+VSVT+FVGRFDMRMM
Sbjct: 301 ALTHLSPVSEEEKLKK-HMKPDFLDMVPWYSGTSADLFKTAFDLMVSVTLFVGRFDMRMM 359

Query: 356 QAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDD 415
           QAAM K+ +  Q+ DLLYD+   KEDLWFDF+ADTGDGGNSSY+VARLLAQP I+     
Sbjct: 360 QAAM-KNTDETQNEDLLYDYFHGKEDLWFDFVADTGDGGNSSYTVARLLAQPSIQTVIGG 418

Query: 416 SVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSG 475
           S+ TLPRG +L+IGGDLAYPNPS+FTYERR F PFEYA+QPP WYK +H+A++KPEVP G
Sbjct: 419 SMHTLPRGKLLVIGGDLAYPNPSSFTYERRFFCPFEYAMQPPHWYKAEHIALDKPEVPPG 478

Query: 476 VPELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWV 535
           V ++K+Y+GPQC+IIPGNHDWFDGL+TFMR+ICHKSWLGGW +PQKKSYFALQLPKGWW+
Sbjct: 479 VSKMKEYNGPQCFIIPGNHDWFDGLHTFMRYICHKSWLGGWILPQKKSYFALQLPKGWWI 538

Query: 536 FGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLI 595
           FGLDLALH DIDVYQFKFFAEL + +VGE DSVII+THEPNWLLDWY+   +GKNV HLI
Sbjct: 539 FGLDLALHGDIDVYQFKFFAELCQNKVGENDSVIIVTHEPNWLLDWYWKETTGKNVSHLI 598

Query: 596 CDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYG 655
            DYL GRCKLR+AGD+HH+MRHS   SD P +VQHLLVNGCGGAFLHPTHVF NF +F G
Sbjct: 599 QDYLHGRCKLRMAGDLHHFMRHSATQSDKPTFVQHLLVNGCGGAFLHPTHVFKNFERFSG 658

Query: 656 TTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILRE 715
            TYE KAAYPS+++SS IALGNILKFRKKNWQFD IGG +YF+LVFSMFPQC L HIL E
Sbjct: 659 ATYECKAAYPSYDESSGIALGNILKFRKKNWQFDIIGGFIYFILVFSMFPQCNLVHILNE 718

Query: 716 DSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHV 775
           +++ G L+SF  T+W+A +Y+ EHSYVS  G+L LL+ + +FVPSKL+RKKRA+IG LHV
Sbjct: 719 ETWYGRLQSFSSTIWSALLYIFEHSYVSSVGSLTLLMASYSFVPSKLTRKKRAIIGGLHV 778

Query: 776 SAHLAAALILMLLLELGVETCIQHKLLATSG 806
            AHL AAL+LMLL+ELG+E CI++ LLATSG
Sbjct: 779 LAHLTAALLLMLLMELGIEVCIRNHLLATSG 809


>gi|242047122|ref|XP_002461307.1| hypothetical protein SORBIDRAFT_02g000620 [Sorghum bicolor]
 gi|241924684|gb|EER97828.1| hypothetical protein SORBIDRAFT_02g000620 [Sorghum bicolor]
          Length = 1018

 Score = 1214 bits (3140), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 584/812 (71%), Positives = 696/812 (85%), Gaps = 7/812 (0%)

Query: 1   MGSDKHSAG---LLDTLRMERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTL 57
           MGSDK       LL TL+M RVRTILTHT+PYPHEHSRH + AV++ CLFFISSDNMHTL
Sbjct: 1   MGSDKQIGSPRPLLGTLKMGRVRTILTHTYPYPHEHSRHIMTAVIIACLFFISSDNMHTL 60

Query: 58  IEKLDNNIKWWSMYACLLGFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQ 117
           I KLDNNIKWWSMY CL+GFFYFFSSPF+G+TI PSYSNF+RWY+AWI  A++YHLPSFQ
Sbjct: 61  IHKLDNNIKWWSMYVCLIGFFYFFSSPFLGRTIQPSYSNFNRWYVAWICFASLYHLPSFQ 120

Query: 118 SMGVDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVI 177
           SMGVD+RMNLSLFLTI+ +SVLF++ FHIIF+GLWY+GLV+R+AG RP I TI+QNC VI
Sbjct: 121 SMGVDMRMNLSLFLTIYFSSVLFIIAFHIIFIGLWYIGLVARLAGTRPGIWTILQNCTVI 180

Query: 178 SVFCCVFYSHCGNRAVLRHRPL-ERRNSSWFSLWKKEERNTWLAKFLRMNELKDQVCSSW 236
           S+ CCVFYSHCGNRAV + +      + S  +  K E  +TW++ FLRMN+LKD++CSSW
Sbjct: 181 SIACCVFYSHCGNRAVHKSKSFGSSSDPSLLAFLKNENGSTWISNFLRMNQLKDEICSSW 240

Query: 237 FAPVGSASDYPLLSKWVIYGEL--GNDNGGSSDEISPIYSLWATFIGLYIANYVVERSTG 294
           FAPVGSASDYP+L+KWVIYGEL       G SDEISP+YSLWATF+GLYIAN+VVERSTG
Sbjct: 241 FAPVGSASDYPILAKWVIYGELVCSGSCAGPSDEISPLYSLWATFVGLYIANFVVERSTG 300

Query: 295 WALTHPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRM 354
           WALTHP +  E EK+K+  +KP+FLDMVPWYSGTSADLFKT FDL+VSVT+FVGRFDMRM
Sbjct: 301 WALTHPSTDLEDEKLKR-HMKPDFLDMVPWYSGTSADLFKTAFDLMVSVTLFVGRFDMRM 359

Query: 355 MQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRD 414
           MQAAM    +   + DLLYD+ +E+EDLWFDF+ADTGDGGNSSY+VARLLAQP IR    
Sbjct: 360 MQAAMKGPTDNTSNDDLLYDYFNEREDLWFDFVADTGDGGNSSYTVARLLAQPSIRTVIG 419

Query: 415 DSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPS 474
            S+ TLPRG++L+IGGDLAYPNPS+FTYERR FRPFEYALQPPPWY+ +H+A++KPE+P 
Sbjct: 420 GSMHTLPRGNLLIIGGDLAYPNPSSFTYERRFFRPFEYALQPPPWYRDEHIALDKPELPP 479

Query: 475 GVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWW 534
           GV ++ +YDGPQC+IIPGNHDWFDGL+TFMR+ICHKSWLGGWF+PQKKSYFAL LPKGWW
Sbjct: 480 GVSKMTEYDGPQCFIIPGNHDWFDGLHTFMRYICHKSWLGGWFLPQKKSYFALHLPKGWW 539

Query: 535 VFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHL 594
           +FGLDL+LH D+DVYQFKFFA++ + +VGE DSVI++THEPNWLLDWY+N  +GKNV HL
Sbjct: 540 IFGLDLSLHGDVDVYQFKFFADVCRNKVGENDSVIVVTHEPNWLLDWYWNETTGKNVSHL 599

Query: 595 ICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFY 654
           I +YLKGRCKLR+AGD+HH+MRHS   S+   +VQHLLVNGCGGAFLHPTHVF NF +F 
Sbjct: 600 IQEYLKGRCKLRMAGDLHHFMRHSATQSEKTNFVQHLLVNGCGGAFLHPTHVFRNFERFS 659

Query: 655 GTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILR 714
           GTTYE KAAYPS+++SS IALGNILKFRKKNWQFD IGG +YF+LVFSMFPQC L HIL 
Sbjct: 660 GTTYECKAAYPSYDESSGIALGNILKFRKKNWQFDIIGGFIYFILVFSMFPQCNLVHILN 719

Query: 715 EDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLH 774
           E+++SG L+SF GT+W+A +Y+ EHSYVS  G+L LL+ + +FVPSKLSR++RA+IG LH
Sbjct: 720 EETWSGRLKSFSGTIWSALLYIFEHSYVSSVGSLTLLMASYSFVPSKLSRRRRAIIGGLH 779

Query: 775 VSAHLAAALILMLLLELGVETCIQHKLLATSG 806
           V AHL AAL+LMLLLELG+E CI++ LLATSG
Sbjct: 780 VLAHLTAALLLMLLLELGIEICIRNHLLATSG 811


>gi|414883332|tpg|DAA59346.1| TPA: hypothetical protein ZEAMMB73_449975 [Zea mays]
          Length = 1018

 Score = 1206 bits (3119), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 580/812 (71%), Positives = 693/812 (85%), Gaps = 7/812 (0%)

Query: 1   MGSDKHSAG---LLDTLRMERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTL 57
           MGSDK       LL+TL+M RVRTILTHT+PYPHEHSRH + AV++ CLFFISSDNMHTL
Sbjct: 1   MGSDKQIGSPRQLLETLKMGRVRTILTHTYPYPHEHSRHIMTAVIIACLFFISSDNMHTL 60

Query: 58  IEKLDNNIKWWSMYACLLGFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQ 117
           I KLDNNIKWWSMY CL+GFFYFFSSPF+G+TI PSYSNF+RWY+AWI  A++YHLPSFQ
Sbjct: 61  IHKLDNNIKWWSMYVCLIGFFYFFSSPFLGRTIQPSYSNFNRWYVAWICFASLYHLPSFQ 120

Query: 118 SMGVDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVI 177
           SMGVD+RMNLSLFLTI+ +SVLF++ FHIIF+GLWY+GLV+R+AG RP + TI+QNC VI
Sbjct: 121 SMGVDMRMNLSLFLTIYFSSVLFIIAFHIIFIGLWYIGLVARLAGTRPGVWTILQNCTVI 180

Query: 178 SVFCCVFYSHCGNRAVLRHRPL-ERRNSSWFSLWKKEERNTWLAKFLRMNELKDQVCSSW 236
           S+ CCVFYSHCGN AV + +      + +  +  K E  +TW++ FLRMN+LKD++CSSW
Sbjct: 181 SIACCVFYSHCGNLAVHKSKSFGSSSDPNLLAFLKNENGSTWISNFLRMNQLKDEICSSW 240

Query: 237 FAPVGSASDYPLLSKWVIYGEL--GNDNGGSSDEISPIYSLWATFIGLYIANYVVERSTG 294
           FAPVGSASDYP+L+KWVIYGEL       G SDEISP+YSLWATF+GLYIAN+VVERSTG
Sbjct: 241 FAPVGSASDYPILAKWVIYGELVCSGSCAGPSDEISPLYSLWATFVGLYIANFVVERSTG 300

Query: 295 WALTHPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRM 354
           WALTHP +  E EK+K+  +KP+FLDMVPWYSGTSADLFKT FDL+VSVT+FVGRFDMRM
Sbjct: 301 WALTHPSTDLEDEKLKR-HMKPDFLDMVPWYSGTSADLFKTAFDLMVSVTLFVGRFDMRM 359

Query: 355 MQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRD 414
           MQAAM        + DLLYDH +E+EDLWFDF+ADTGDGGNSSY+VARLLAQP IR    
Sbjct: 360 MQAAMKGPTGNTPNDDLLYDHFNEREDLWFDFVADTGDGGNSSYTVARLLAQPSIRTVIG 419

Query: 415 DSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPS 474
            S+ TLPRG++L+IGGDLAYPNPS+FTYERR FRPFEYALQPPPWY+ +H+A++KPE+P 
Sbjct: 420 GSMHTLPRGNLLIIGGDLAYPNPSSFTYERRFFRPFEYALQPPPWYRDEHIALDKPELPP 479

Query: 475 GVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWW 534
           GV ++ +YDGPQC+IIPGNHDWFDGL+TFMR+ICHKSWLGGWF+PQKKSYFAL LPKGWW
Sbjct: 480 GVSKMTEYDGPQCFIIPGNHDWFDGLHTFMRYICHKSWLGGWFLPQKKSYFALHLPKGWW 539

Query: 535 VFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHL 594
           +FGLDL+LH D+DVYQFKFFA++ + +VGE DSVI++THEPNWLLDWY+N  +GKNV HL
Sbjct: 540 IFGLDLSLHGDVDVYQFKFFADVCQNKVGENDSVIVVTHEPNWLLDWYWNETTGKNVSHL 599

Query: 595 ICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFY 654
           I +YLKGRCKLR+AGD+HH+MRHS   S+   +VQHLLVNGCGGAFLHPTHVF NF +F 
Sbjct: 600 IQEYLKGRCKLRMAGDLHHFMRHSATRSEKNNFVQHLLVNGCGGAFLHPTHVFRNFERFS 659

Query: 655 GTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILR 714
           GTTYE KAAYPS+++S+ IALGNILKFRKKNWQFD IGG +YF+LVFSMFPQC L  IL 
Sbjct: 660 GTTYECKAAYPSYDESTGIALGNILKFRKKNWQFDIIGGFIYFILVFSMFPQCNLVRILN 719

Query: 715 EDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLH 774
           E+++SG L+SF GT+W+A +Y+ EHSYVS  G+L LL  + +FVPSKLSR++RA+IG LH
Sbjct: 720 EETWSGRLKSFSGTIWSALLYIFEHSYVSSVGSLTLLTASYSFVPSKLSRRRRAIIGGLH 779

Query: 775 VSAHLAAALILMLLLELGVETCIQHKLLATSG 806
           V AHL AAL+LMLLLELG+E CI++ LLATSG
Sbjct: 780 VLAHLTAALLLMLLLELGIEICIRNHLLATSG 811


>gi|218199046|gb|EEC81473.1| hypothetical protein OsI_24798 [Oryza sativa Indica Group]
          Length = 935

 Score = 1108 bits (2867), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 508/710 (71%), Positives = 616/710 (86%), Gaps = 3/710 (0%)

Query: 99  RWYIAWILVAAVYHLPSFQSMGVDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVS 158
           RWYIAWI +AA+YHLPSFQSMG+DLRMNLSLFLTI+++S++FL+VFH+IFLGLWY+GLVS
Sbjct: 18  RWYIAWIFLAALYHLPSFQSMGLDLRMNLSLFLTIYISSLIFLIVFHVIFLGLWYLGLVS 77

Query: 159 RVAGKRPEILTIIQNCVVISVFCCVFYSHCGNRAVLRHRPLERRNSSW--FSLWKKEERN 216
           R+A K+PE+LTIIQNC VIS+ CCV YSHCGN+ + R + ++RR +SW  FSLWKK + N
Sbjct: 78  RMAEKKPEMLTIIQNCAVISIACCVLYSHCGNKTITRDKSIDRRTASWVAFSLWKKHDDN 137

Query: 217 TWLAKFLRMNELKDQVCSSWFAPVGSASDYPLLSKWVIYGELGNDNGGSSDEISPIYSLW 276
           + ++K LRM++ K+Q+CSSWFAPVGSASDYPLLSKW IY EL ++  G S++ISP+YSLW
Sbjct: 138 SLISKLLRMHKFKEQICSSWFAPVGSASDYPLLSKWAIYEELASNGSGHSNDISPVYSLW 197

Query: 277 ATFIGLYIANYVVERSTGWALTHPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTV 336
           ATFIGLYIANYVVERSTGWALTHPL++ EYEK+KK QLKP+F DMVPWYSGTS DLFKTV
Sbjct: 198 ATFIGLYIANYVVERSTGWALTHPLTMSEYEKLKK-QLKPDFEDMVPWYSGTSTDLFKTV 256

Query: 337 FDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNS 396
           FDL+VSVT+FVGRFDMRMMQAAMNK  + ++  DL YDHL  K++LWFDF+ADTGDGGNS
Sbjct: 257 FDLMVSVTLFVGRFDMRMMQAAMNKTPDESKSSDLFYDHLDGKDELWFDFIADTGDGGNS 316

Query: 397 SYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQP 456
           +Y+VARLLAQP + +  D S  T PRG +LLIGGDLAYPNPS+F+YERR F PFEYALQP
Sbjct: 317 TYAVARLLAQPSLAIKSDGSRQTFPRGQLLLIGGDLAYPNPSSFSYERRFFCPFEYALQP 376

Query: 457 PPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGW 516
           P WYK +H+A+ KPE+P GV EL++Y GPQC++IPGNHDWFDGL+TFMR+ICHKSWLGGW
Sbjct: 377 PAWYKPEHIALEKPELPLGVSELRKYRGPQCFMIPGNHDWFDGLHTFMRYICHKSWLGGW 436

Query: 517 FMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPN 576
           F+PQK+SYFAL+LP GWWVFGLD ALH DIDVYQFKFFAEL +++VGE DSVI++THEPN
Sbjct: 437 FLPQKRSYFALKLPNGWWVFGLDQALHGDIDVYQFKFFAELCQQKVGESDSVILITHEPN 496

Query: 577 WLLDWYFNNVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGC 636
           WLLDWY+ + +G NV++LI +YLKGRCKLR+AGD+HHYMRHS++ S  PV+VQHLLVNGC
Sbjct: 497 WLLDWYWGDKTGTNVEYLIREYLKGRCKLRMAGDLHHYMRHSFIESKEPVHVQHLLVNGC 556

Query: 637 GGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVY 696
           GGAFLHPTHVF NFR+FYG  YE+K AYPS++DSS+IALGNILKFR+KNWQFD IGG VY
Sbjct: 557 GGAFLHPTHVFENFREFYGNKYETKIAYPSYDDSSKIALGNILKFRRKNWQFDVIGGFVY 616

Query: 697 FVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAIT 756
           FVLVFSMFPQC+   ILREDS++  + SFF  +WN    +LEHSYVS AG + LL+V+  
Sbjct: 617 FVLVFSMFPQCDSFRILREDSWADRVNSFFTAMWNVVFEILEHSYVSLAGVVTLLMVSFF 676

Query: 757 FVPSKLSRKKRAMIGVLHVSAHLAAALILMLLLELGVETCIQHKLLATSG 806
           FVP+KLSR++RA++G LH  AHL +A+ILMLL+EL +E CI++ LLATSG
Sbjct: 677 FVPTKLSRRRRALLGFLHAVAHLTSAVILMLLMELAIEICIRNNLLATSG 726


>gi|242048434|ref|XP_002461963.1| hypothetical protein SORBIDRAFT_02g011280 [Sorghum bicolor]
 gi|241925340|gb|EER98484.1| hypothetical protein SORBIDRAFT_02g011280 [Sorghum bicolor]
          Length = 936

 Score = 1085 bits (2806), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 520/793 (65%), Positives = 616/793 (77%), Gaps = 75/793 (9%)

Query: 16  MERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEKLDNNIKWWSMYACLL 75
           MER+R ILTH +PYPHEHSRH IIAV    LFFISSDN+  LI KLD N KWWSMYACL+
Sbjct: 1   MERMRNILTHRYPYPHEHSRHFIIAVFACWLFFISSDNLQNLIMKLDKNFKWWSMYACLI 60

Query: 76  GFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMGVDLRMNLSLFLTIFL 135
           GFFYFFSSPFI KTI P+YSNF+RWYIAWI +AA+YHLPSFQSMG+DLRMNLSLFLTIF+
Sbjct: 61  GFFYFFSSPFIRKTIKPNYSNFNRWYIAWIFLAALYHLPSFQSMGLDLRMNLSLFLTIFI 120

Query: 136 ASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVFCCVFYSHCGNRAVLR 195
           +S++FL+VFHIIFLGLWY+GLVSR+A K+PEILTIIQNC VIS+ CCVFYSHCGNR V R
Sbjct: 121 SSLIFLMVFHIIFLGLWYLGLVSRMAEKKPEILTIIQNCAVISIACCVFYSHCGNRTVSR 180

Query: 196 HRPLERRNSSW--FSLWKKEERNTWLAKFLRMNELKDQVCSSWFAPVGSASDYPLLSKWV 253
            + ++RR +SW  FSLW K + NT +++ LR                             
Sbjct: 181 DKSIDRRTASWIVFSLWTKHDDNTLISRLLR----------------------------- 211

Query: 254 IYGELGNDNGGSSDEISPIYSLWATFIGLYIANYVVERSTGWALTHPLSVEEYEKMKKKQ 313
                      SS+EISP+YSLWATF+GLYIANYVVERSTG                   
Sbjct: 212 -----------SSNEISPVYSLWATFMGLYIANYVVERSTG------------------- 241

Query: 314 LKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLY 373
                         TS DLFKTVFDL++SVT+FVGRFDMRMMQAAMNK  + A   DLLY
Sbjct: 242 --------------TSTDLFKTVFDLMISVTLFVGRFDMRMMQAAMNKTPDEANSHDLLY 287

Query: 374 DHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLA 433
           DHL  K++LWFDF+ADTGDGGNS+YSVARLLAQP + +  DDS  T PRG +LLIGGDLA
Sbjct: 288 DHLDGKDELWFDFIADTGDGGNSTYSVARLLAQPSLVIKSDDSRITFPRGQLLLIGGDLA 347

Query: 434 YPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGN 493
           YPNPS+F+YERR F  FEYALQPP WYK +H+A+ KPE+P GV EL++Y GPQC++IPGN
Sbjct: 348 YPNPSSFSYERRFFCSFEYALQPPAWYKPEHIALEKPELPLGVSELRRYRGPQCFMIPGN 407

Query: 494 HDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKF 553
           HDWFDGLNTF+R+ICHKSWLGGWF+PQKKSYFAL+LP GWWVFGLD ALH DIDVYQFKF
Sbjct: 408 HDWFDGLNTFIRYICHKSWLGGWFLPQKKSYFALKLPNGWWVFGLDQALHGDIDVYQFKF 467

Query: 554 FAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKGRCKLRIAGDMHH 613
           FAEL +++VGE DS+II+THEPNWLLDWY+ + +G NV +LI +YL+GRCKLR+AGD+HH
Sbjct: 468 FAELCQQKVGETDSIIIITHEPNWLLDWYWGDSTGTNVAYLIREYLRGRCKLRMAGDLHH 527

Query: 614 YMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRI 673
           YMRHS + S  PV+VQHLLVNGCGGAFLHPTHVF NFR FYG  YE+K+ YPS+ DSS+I
Sbjct: 528 YMRHSCIESKEPVHVQHLLVNGCGGAFLHPTHVFENFRVFYGNKYETKSTYPSYTDSSKI 587

Query: 674 ALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAF 733
           ALGNILKFR+KNWQFD IGG VYFVLVFSMFPQC+   IL EDS++G +  FF  +WNA 
Sbjct: 588 ALGNILKFRRKNWQFDVIGGFVYFVLVFSMFPQCDSFRILHEDSWAGRINGFFSAMWNAI 647

Query: 734 MYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHLAAALILMLLLELGV 793
             +LE SYVS  G + LL+V+  FVP+KLSR++R ++G LH +AHL +A++LMLL+EL +
Sbjct: 648 FEILERSYVSLGGVVTLLMVSFFFVPTKLSRRRRVLLGFLHAAAHLTSAVLLMLLMELAI 707

Query: 794 ETCIQHKLLATSG 806
           E CI++ LLATSG
Sbjct: 708 EICIRNHLLATSG 720


>gi|168024972|ref|XP_001765009.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683818|gb|EDQ70225.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1105

 Score = 1056 bits (2730), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 510/811 (62%), Positives = 632/811 (77%), Gaps = 31/811 (3%)

Query: 16  MERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEKLDNNIKWWSMYACLL 75
           + RVRT+L H +PYPHEHS+HA+IAVV   LFFI SDN+H ++ KLD NIKWWS+Y  LL
Sbjct: 9   VRRVRTMLQHEYPYPHEHSQHALIAVVAVALFFIFSDNLHIVLHKLDTNIKWWSIYGFLL 68

Query: 76  GFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMGVDLRMNLSLFLTIFL 135
           GFFYFFSSPF+G TI PSYSNFSRWY+ W+L+AAVYHLPSFQSMGVD+RMNLSLFLT+F+
Sbjct: 69  GFFYFFSSPFLGSTIQPSYSNFSRWYVGWLLIAAVYHLPSFQSMGVDIRMNLSLFLTLFV 128

Query: 136 ASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVFCCVFYSHCGNRAVLR 195
           ASVL L++FH+ FLGLWY+G  +R+AGKRPEILTI+QN  V+S+ CC FYSHCGN+A  +
Sbjct: 129 ASVLVLVLFHVAFLGLWYLGFAARLAGKRPEILTILQNSAVLSIACCAFYSHCGNQAPAQ 188

Query: 196 HRPLERRNSSWFSLWKKEERNTWLAKFLRMNELKDQVCSSWFAPVGSASDYPLLSKWVIY 255
               +R+ S +      E+   W A    M+  K+Q+C+ W  PVG+A+DYP+ SKW +Y
Sbjct: 189 PGIFQRQRSMF------EDLPKWFAGLQNMHHAKEQMCTEWLGPVGTAADYPVFSKWALY 242

Query: 256 GEL-GNDN--GGSSDEISPIYSLWATFIGLYIANYVVERSTGWALTHPLSVEEYEKMKKK 312
           GEL  ND   GG +D ISP+YSLWATFIGLY+ANYVVERSTGWALTHP    +    K  
Sbjct: 243 GELVCNDGCVGGPTDIISPVYSLWATFIGLYVANYVVERSTGWALTHPQPETKNGPQKTT 302

Query: 313 QLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNK--DQEGAQHGD 370
              PEFLDMVPWYSGTSADLFKT FDLLVSVT+F+GRFDMR MQ   ++  D+   +  D
Sbjct: 303 VTAPEFLDMVPWYSGTSADLFKTAFDLLVSVTLFLGRFDMRTMQVCSSRKYDEYLRKKDD 362

Query: 371 -LLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIG 429
              YDHLS+++ LWFDFMADTGDGGNS+YSVARLLAQP +++ + +    LPRGD+L+IG
Sbjct: 363 GFFYDHLSKRDGLWFDFMADTGDGGNSTYSVARLLAQPFLKMGKTE----LPRGDLLIIG 418

Query: 430 GDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYI 489
           GDLAYPNPS ++YERRLF PFEYA+QPP WYK +H+AV KPE+P GV  L+++  PQC+ 
Sbjct: 419 GDLAYPNPSTYSYERRLFLPFEYAMQPPVWYKPEHIAVTKPELPEGVHSLEEFRAPQCFA 478

Query: 490 IPGNH--------------DWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWV 535
           IPGNH              DWFDGL+TFMR+ICH+SWLGGW +PQ+KSYFAL+LP GWW+
Sbjct: 479 IPGNHGNLLTSTAWFHLCADWFDGLDTFMRYICHRSWLGGWLLPQQKSYFALRLPCGWWI 538

Query: 536 FGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLI 595
           FG D ALH DID++QFK+F E+VKE VGE DSVI++THEPNWLLDWY+++ +G NV H +
Sbjct: 539 FGFDQALHGDIDIFQFKYFTEIVKEHVGEDDSVILVTHEPNWLLDWYWDSTTGSNVAHFV 598

Query: 596 CDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYG 655
             YLKGRC+LRIAGD+H+YMRH  V S     V+HL+VNG GGAFLHPTHVF  F KF  
Sbjct: 599 EHYLKGRCRLRIAGDLHNYMRHKLV-SGSSTSVEHLIVNGSGGAFLHPTHVFGGFNKFQD 657

Query: 656 TTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILRE 715
             YE K AYPS ++S +IA GNILKFRKKNW+FD IGG+VYF+LVFSMFPQCEL+ +LR+
Sbjct: 658 GVYEKKFAYPSLKESEQIAWGNILKFRKKNWRFDVIGGVVYFILVFSMFPQCELDQVLRD 717

Query: 716 DSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHV 775
           D+  GH+  F  T+  AF+ +LEHSYVS  GAL+L+I++I FVP K+SR KR +IG+LH 
Sbjct: 718 DTMLGHVWEFGATMGRAFLDMLEHSYVSVTGALVLVILSIQFVPVKVSRSKRIVIGLLHF 777

Query: 776 SAHLAAALILMLLLELGVETCIQHKLLATSG 806
           +AHL +A+  M+LLE+G+E C++H LL TSG
Sbjct: 778 AAHLTSAIAYMILLEIGIEICVRHDLLGTSG 808


>gi|302785752|ref|XP_002974647.1| hypothetical protein SELMODRAFT_102099 [Selaginella moellendorffii]
 gi|300157542|gb|EFJ24167.1| hypothetical protein SELMODRAFT_102099 [Selaginella moellendorffii]
          Length = 999

 Score = 1046 bits (2705), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 512/807 (63%), Positives = 637/807 (78%), Gaps = 20/807 (2%)

Query: 16  MERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEKLDNNIKWWSMYACLL 75
           M+RV+ ILTH + YPHEHSRHA++AV+ GCLFFIS+DN HTLI + DNNIKWWS+YA LL
Sbjct: 1   MDRVKEILTHQYSYPHEHSRHAMLAVLAGCLFFISTDNFHTLIHRFDNNIKWWSIYAFLL 60

Query: 76  GFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMGVDLRMNLSLFLTIFL 135
           GFFYFFSSPF+ +TI PSYSNF+RWY+ W+LVAAVYHLPSFQSMGVD+RMNLSLFLT+F+
Sbjct: 61  GFFYFFSSPFLVRTIEPSYSNFNRWYVLWLLVAAVYHLPSFQSMGVDIRMNLSLFLTLFM 120

Query: 136 ASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVFCCVFYSHCGNRAVLR 195
           ASVL L++FHI +LGLWY+GLV+R+AG RPEILTI+QN  V+S+ CCVFYSHCGNR   +
Sbjct: 121 ASVLILVLFHISYLGLWYLGLVARLAGNRPEILTILQNSTVLSIACCVFYSHCGNRGP-K 179

Query: 196 HRPLERRNSS--WFSLWKKEERNTWLAKFLRMNELKDQVCSSWFAPVGSASDYPLLSKWV 253
            +PL R+ SS   F          W+  +  M+E+K+QVC +W  PVGSA+DYP+ SKWV
Sbjct: 180 GKPLGRQLSSTVLFENLLHSNLTKWINNWPPMHEVKEQVCHNWLGPVGSAADYPVFSKWV 239

Query: 254 IYGELGNDNG--GSSDEISPIYSLWATFIGLYIANYVVERSTGWALTHPLSVEEYEKMKK 311
           +YGE+   +    +S+ ISPI+SLWATFIGLYIANY+VERSTGWALTHP       + +K
Sbjct: 240 VYGEMVCHDACVKTSENISPIFSLWATFIGLYIANYIVERSTGWALTHP--QPSGARRRK 297

Query: 312 KQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDL 371
               PEFLDMVPWYSGTSADLFKT FDL++SVT+F+GRFD+R MQAA++K Q+  + G  
Sbjct: 298 IVTAPEFLDMVPWYSGTSADLFKTAFDLVISVTLFLGRFDLRTMQAAISKAQQTEKDG-F 356

Query: 372 LYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGD 431
           LY H  +KE++WFDFMADTGDGGNS+Y+VARLLAQP ++V        L R  +LL+GGD
Sbjct: 357 LYTHFHKKEEMWFDFMADTGDGGNSTYTVARLLAQPFLKVNVGGRETVLQRSSLLLLGGD 416

Query: 432 LAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIP 491
           LAYPNPS  +YE+RLFRPFEYA+QPP +Y+ +H+A  KPE+P  V +L+ Y GPQC+ IP
Sbjct: 417 LAYPNPSPTSYEQRLFRPFEYAMQPPKFYRPEHIAATKPELPDNVEKLQAYVGPQCFAIP 476

Query: 492 GNH-----------DWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDL 540
           GNH           DWFDGL TFMR+ICHKSWLGGW +PQ+KSYFALQLP GWWVFGLD 
Sbjct: 477 GNHGCSSYTCIIVSDWFDGLETFMRYICHKSWLGGWLLPQEKSYFALQLPHGWWVFGLDQ 536

Query: 541 ALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLK 600
           ALH DID++QFK+F+E+ KEQVG +DSVII+THEP WLLDWY+   +G NV HLI D+LK
Sbjct: 537 ALHGDIDLFQFKYFSEIAKEQVGSKDSVIIITHEPTWLLDWYWEGSTGNNVSHLIKDHLK 596

Query: 601 GRCKLRIAGDMHHYMRHSYVP-SDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYE 659
           GRC+LR++GD+H YMRH+  P ++ P  VQHL+VNGCGGAFLHPTHVF+ F +F GTTY+
Sbjct: 597 GRCRLRLSGDLHFYMRHTAGPAAENPASVQHLIVNGCGGAFLHPTHVFTKFSQFEGTTYQ 656

Query: 660 SKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFS 719
           +KA+YP   DS +IALGNILKFRKKNW+FD IGG++YFVLVFSMFPQC+L+   ++++F 
Sbjct: 657 NKASYPLPVDSRKIALGNILKFRKKNWRFDIIGGVLYFVLVFSMFPQCDLDKFFQKNTFF 716

Query: 720 GHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHL 779
           GH R FF TV     ++LE SYVS  G   +LI+A+ FVP K+SRKK+ +IG LH   HL
Sbjct: 717 GHTRQFFNTVGQVIYFLLEDSYVSLGGLFGMLILAVLFVPGKVSRKKKFLIGFLHCCVHL 776

Query: 780 AAALILMLLLELGVETCIQHKLLATSG 806
             A+ L+LLLELG+ETC++H LL  +G
Sbjct: 777 IVAIGLLLLLELGIETCVRHGLLGNAG 803


>gi|302759871|ref|XP_002963358.1| hypothetical protein SELMODRAFT_80477 [Selaginella moellendorffii]
 gi|300168626|gb|EFJ35229.1| hypothetical protein SELMODRAFT_80477 [Selaginella moellendorffii]
          Length = 999

 Score = 1046 bits (2705), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 512/807 (63%), Positives = 637/807 (78%), Gaps = 20/807 (2%)

Query: 16  MERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEKLDNNIKWWSMYACLL 75
           M+RV+ ILTH + YPHEHSRHA++AV+ GCLFFIS+DN HTLI + DNNIKWWS+YA LL
Sbjct: 1   MDRVKEILTHQYSYPHEHSRHAMLAVLAGCLFFISTDNFHTLIHRFDNNIKWWSIYAFLL 60

Query: 76  GFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMGVDLRMNLSLFLTIFL 135
           GFFYFFSSPF+ +TI PSYSNF+RWY+ W+LVAAVYHLPSFQSMGVD+RMNLSLFLT+F+
Sbjct: 61  GFFYFFSSPFLVRTIEPSYSNFNRWYVLWLLVAAVYHLPSFQSMGVDIRMNLSLFLTLFM 120

Query: 136 ASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVFCCVFYSHCGNRAVLR 195
           ASVL L++FHI +LGLWY+GLV+R+AG RPEILTI+QN  V+S+ CCVFYSHCGNR   +
Sbjct: 121 ASVLILVLFHISYLGLWYLGLVARLAGNRPEILTILQNSTVLSIACCVFYSHCGNRGP-K 179

Query: 196 HRPLERRNSS--WFSLWKKEERNTWLAKFLRMNELKDQVCSSWFAPVGSASDYPLLSKWV 253
            +PL R+ SS   F          W+  +  M+E+K+QVC +W  PVGSA+DYP+ SKWV
Sbjct: 180 GKPLGRQLSSTVLFENLLHSNLTKWINNWPPMHEVKEQVCHNWLGPVGSAADYPVFSKWV 239

Query: 254 IYGELGNDNG--GSSDEISPIYSLWATFIGLYIANYVVERSTGWALTHPLSVEEYEKMKK 311
           +YGE+   +    +S+ ISPI+SLWATFIGLYIANY+VERSTGWALTHP       + +K
Sbjct: 240 VYGEMVCHDACVKTSENISPIFSLWATFIGLYIANYIVERSTGWALTHP--QPSGARRRK 297

Query: 312 KQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDL 371
               PEFLDMVPWYSGTSADLFKT FDL++SVT+F+GRFD+R MQAA++K Q+  + G  
Sbjct: 298 IVTAPEFLDMVPWYSGTSADLFKTAFDLVISVTLFLGRFDLRTMQAAISKAQQTEKDG-F 356

Query: 372 LYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGD 431
           LY H  +KE++WFDFMADTGDGGNS+Y+VARLLAQP ++V        L R  +LL+GGD
Sbjct: 357 LYTHFHKKEEMWFDFMADTGDGGNSTYTVARLLAQPFLKVNVGGRETVLQRSSLLLLGGD 416

Query: 432 LAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIP 491
           LAYPNPS  +YE+RLFRPFEYA+QPP +Y+ +H+A  KPE+P  V +L+ Y GPQC+ IP
Sbjct: 417 LAYPNPSPTSYEQRLFRPFEYAMQPPKFYRPEHIAATKPELPDNVEKLQAYVGPQCFAIP 476

Query: 492 GNH-----------DWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDL 540
           GNH           DWFDGL TFMR+ICHKSWLGGW +PQ+KSYFALQLP GWWVFGLD 
Sbjct: 477 GNHGCSSYTCIFVSDWFDGLETFMRYICHKSWLGGWLLPQEKSYFALQLPHGWWVFGLDQ 536

Query: 541 ALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLK 600
           ALH DID++QFK+F+E+ KEQVG +DSVII+THEP WLLDWY+   +G NV HLI D+LK
Sbjct: 537 ALHGDIDLFQFKYFSEIAKEQVGSKDSVIIITHEPTWLLDWYWEGSTGNNVSHLIKDHLK 596

Query: 601 GRCKLRIAGDMHHYMRHSYVP-SDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYE 659
           GRC+LR++GD+H YMRH+  P ++ P  VQHL+VNGCGGAFLHPTHVF+ F +F GTTY+
Sbjct: 597 GRCRLRLSGDLHFYMRHTAGPAAENPASVQHLIVNGCGGAFLHPTHVFTKFSQFEGTTYQ 656

Query: 660 SKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFS 719
           +KA+YP   DS +IALGNILKFRKKNW+FD IGG++YFVLVFSMFPQC+L+   ++++F 
Sbjct: 657 NKASYPLPVDSRKIALGNILKFRKKNWRFDIIGGVLYFVLVFSMFPQCDLDKFFQKNTFF 716

Query: 720 GHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHL 779
           GH R FF TV     ++LE SYVS  G   +LI+A+ FVP K+SRKK+ +IG LH   HL
Sbjct: 717 GHTRQFFNTVGQVIYFLLEDSYVSLGGLFGMLILAVLFVPGKVSRKKKFLIGFLHCCVHL 776

Query: 780 AAALILMLLLELGVETCIQHKLLATSG 806
             A+ L+LLLELG+ETC++H LL  +G
Sbjct: 777 IVAIGLLLLLELGIETCVRHGLLGNAG 803


>gi|168039409|ref|XP_001772190.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162676521|gb|EDQ63003.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 945

 Score = 1046 bits (2704), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 503/805 (62%), Positives = 623/805 (77%), Gaps = 24/805 (2%)

Query: 16  MERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEKLDNNIKWWSMYACLL 75
           ++R+ T +T    YPHEHSRHA++AV+   +FFIS+DNMH ++ KLD N KWWS+YA LL
Sbjct: 1   LDRIATAMTDESLYPHEHSRHAMLAVLATTMFFISTDNMHIVLNKLDANSKWWSIYAILL 60

Query: 76  GFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMGVDLRMNLSLFLTIFL 135
           GFFYFFSSPF+G TI PSYSNFSRWY+ W+LVAA+YHLP+ +SMGVD+RMNLSLFL +FL
Sbjct: 61  GFFYFFSSPFLGNTIQPSYSNFSRWYVVWLLVAAMYHLPTIRSMGVDVRMNLSLFLILFL 120

Query: 136 ASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVFCCVFYSHCGNRAVLR 195
           AS++ LL+FH++F+ LWY+GL S+ A K P+ILTIIQN VV+S+ CCVFYSHCGN+ V R
Sbjct: 121 ASMMTLLLFHVVFMVLWYIGLASQDARKPPKILTIIQNSVVLSIACCVFYSHCGNQ-VPR 179

Query: 196 HRP----------LERRNSSWFSLWKKEERNTWLAKFLRMNELKDQVCSSWFAPVGSASD 245
            +P          L  ++  W +     +   W      M  +K QVC +W APVGSA D
Sbjct: 180 QKPTMWSTYTLNKLGEKSIKWVARPPDPKSQPW-----NMQHMKSQVCWTWLAPVGSAKD 234

Query: 246 YPLLSKWVIYGELGNDNG--GSSDEISPIYSLWATFIGLYIANYVVERSTGWALTHPLSV 303
           YP+ SKWV+YGEL  ++   G SD ISP+YSLWATFIG+YIAN+VVERST WAL+H   +
Sbjct: 235 YPVFSKWVVYGELVCNDACIGPSDNISPVYSLWATFIGIYIANFVVERSTEWALSH---L 291

Query: 304 EEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQ 363
               + +    KPEFLDMVPWYSGTSADLFKT FDLLVSVT+F+GRFDMR MQAAM + Q
Sbjct: 292 ASNAQQRNTLTKPEFLDMVPWYSGTSADLFKTAFDLLVSVTLFLGRFDMRTMQAAMTQAQ 351

Query: 364 EGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVT--RDDSVFTLP 421
           +  +   +++ HLS+K +LW DFMADTGDGGNS+YS+ARLLAQP + V   R      LP
Sbjct: 352 Q-QRSKSVVHSHLSQKSELWMDFMADTGDGGNSTYSIARLLAQPSLCVADERCCGFHNLP 410

Query: 422 RGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQ 481
           R  +LLIGGDLAYPNPSAF+YERR FRPFEYALQPPPWY+ +H+AV+KPE+P GV  L+ 
Sbjct: 411 RSQLLLIGGDLAYPNPSAFSYERRFFRPFEYALQPPPWYRPEHIAVSKPELPEGVKSLED 470

Query: 482 YDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLA 541
            + PQC+ IPGNHDWFDGL+TFMR+ICH+SWLGGW +PQ KSYFAL+LP GWW+FGLD A
Sbjct: 471 LEAPQCFAIPGNHDWFDGLDTFMRYICHRSWLGGWLLPQSKSYFALKLPHGWWIFGLDQA 530

Query: 542 LHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKG 601
           LH DID++QFK+F+ + K +VGE +S+IIMTHEPNWLLDWY+ N +GKNV HLI ++LKG
Sbjct: 531 LHGDIDIFQFKYFSNIAKNEVGENESIIIMTHEPNWLLDWYWENSTGKNVSHLIQEHLKG 590

Query: 602 RCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESK 661
           RC+LR+AGD+HHYMRHS         ++HLLVNG GGAFLHPTH+F+ F+ F G  YE  
Sbjct: 591 RCRLRLAGDLHHYMRHSISGGAQNNCIEHLLVNGQGGAFLHPTHIFAKFQDFQGNKYEKI 650

Query: 662 AAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGH 721
           AAYP+   S RIALGNILKFR+KNWQFD IGGIVYFVLVFSMFPQCELN I       G 
Sbjct: 651 AAYPTMRQSERIALGNILKFRRKNWQFDVIGGIVYFVLVFSMFPQCELNEIFNNGDIRGI 710

Query: 722 LRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHLAA 781
           L +F+ T+  AF+ ++EHSYVS  G L+LL +A++FVP K+SR KR +IG LHV+ HL +
Sbjct: 711 LIAFWRTMGQAFIQMVEHSYVSMTGTLVLLALAVSFVPVKVSRSKRIIIGFLHVTLHLVS 770

Query: 782 ALILMLLLELGVETCIQHKLLATSG 806
           A++LMLLLE+G+ETCI+H  L TSG
Sbjct: 771 AMVLMLLLEIGIETCIRHNHLGTSG 795


>gi|222636390|gb|EEE66522.1| hypothetical protein OsJ_22999 [Oryza sativa Japonica Group]
          Length = 821

 Score =  942 bits (2436), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 440/649 (67%), Positives = 528/649 (81%), Gaps = 39/649 (6%)

Query: 160 VAGKRPEILTIIQNCVVISVFCCVFYSHCGNRAVLRHRPLERRNSSW--FSLWKKEERNT 217
           +A K+PE+LTIIQNC VIS+ CCV YSHCGN+ + R + ++RR +SW  FSLWKK + N+
Sbjct: 1   MAEKKPEMLTIIQNCAVISIACCVLYSHCGNKTITRDKSIDRRTASWVAFSLWKKHDDNS 60

Query: 218 WLAKFLRMNELKDQVCSSWFAPVGSASDYPLLSKWVIYGELGNDNGGSSDEISPIYSLWA 277
            ++K LRM++ K+Q+CSSWFAPVGSASDYPLLSKW IY EL ++  G S++ISP+YSLWA
Sbjct: 61  LISKLLRMHKFKEQICSSWFAPVGSASDYPLLSKWAIYEELASNGSGHSNDISPVYSLWA 120

Query: 278 TFIGLYIANYVVERSTGWALTHPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVF 337
           TFIGLYIANYVVERSTGWALTHPL++ EYEK+KK QLKP+F DMVPWYSGTS DLFKTVF
Sbjct: 121 TFIGLYIANYVVERSTGWALTHPLTMSEYEKLKK-QLKPDFEDMVPWYSGTSTDLFKTVF 179

Query: 338 DLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSS 397
           DL+VSVT+FVGRFDMRMMQAAMNK  + ++  DL YDHL  K++LWFDF+ADTGDGGNS+
Sbjct: 180 DLMVSVTLFVGRFDMRMMQAAMNKTPDESKSSDLFYDHLDGKDELWFDFIADTGDGGNST 239

Query: 398 YSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPP 457
           Y+VARLLAQP + +  D S  T PRG +LLIGGDLAYPNPS+F+YERR F PFEYALQPP
Sbjct: 240 YAVARLLAQPSLAIKSDGSRQTFPRGQLLLIGGDLAYPNPSSFSYERRFFCPFEYALQPP 299

Query: 458 PWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWF 517
            WYK +H+A+ KPE+P GV EL++Y GPQC++IPGNHDWFDGL+TFMR+ICHKSWLGGWF
Sbjct: 300 AWYKPEHIALEKPELPLGVSELRKYRGPQCFMIPGNHDWFDGLHTFMRYICHKSWLGGWF 359

Query: 518 MPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNW 577
           +PQK+SYFAL+LP GWWVFGLD ALH DIDVYQFKFFAEL +++VGE DSVI++THEPNW
Sbjct: 360 LPQKRSYFALKLPNGWWVFGLDQALHGDIDVYQFKFFAELCQQKVGESDSVILITHEPNW 419

Query: 578 LLDWYFNNVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCG 637
           LLDWY+ + +G NV++LI +YLKGRCKLR+AGD+HHYMRHS++ S  PV+VQHLLVNGCG
Sbjct: 420 LLDWYWGDKTGTNVEYLIREYLKGRCKLRMAGDLHHYMRHSFIESKEPVHVQHLLVNGCG 479

Query: 638 GAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYF 697
           GAFLHPTHVF NFR+FYG  YE+K AYPS++DSS+IALGNILKFR+KNWQFD IGG VYF
Sbjct: 480 GAFLHPTHVFENFREFYGNKYETKIAYPSYDDSSKIALGNILKFRRKNWQFDVIGGFVYF 539

Query: 698 VLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITF 757
           VLVFSMFPQC+   ILREDS++  + SFF  +WN    +LEHSYVS A            
Sbjct: 540 VLVFSMFPQCDSFRILREDSWADRVNSFFTAMWNVVFEILEHSYVSLA------------ 587

Query: 758 VPSKLSRKKRAMIGVLHVSAHLAAALILMLLLELGVETCIQHKLLATSG 806
                                   A+ILMLL+EL +E CI++ LLATSG
Sbjct: 588 ------------------------AVILMLLMELAIEICIRNNLLATSG 612


>gi|224136686|ref|XP_002326920.1| predicted protein [Populus trichocarpa]
 gi|222835235|gb|EEE73670.1| predicted protein [Populus trichocarpa]
          Length = 499

 Score =  905 bits (2340), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 430/500 (86%), Positives = 468/500 (93%), Gaps = 4/500 (0%)

Query: 1   MGSDKHSAGLLDTLRMERVRTILTHTHPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEK 60
           MGSDK + GLL+TLRMERVRTILTHT+PYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEK
Sbjct: 1   MGSDKQTTGLLETLRMERVRTILTHTYPYPHEHSRHAIIAVVVGCLFFISSDNMHTLIEK 60

Query: 61  LDNNIKWWSMYACLLGFFYFFSSPFIGKTITPSYSNFSRWYIAWILVAAVYHLPSFQSMG 120
           LDNNIKWWSMYACLLGFFYFFSSPF+GKTI PSYSNFSRWYIAWILVA +YHLPSFQSMG
Sbjct: 61  LDNNIKWWSMYACLLGFFYFFSSPFLGKTIKPSYSNFSRWYIAWILVATLYHLPSFQSMG 120

Query: 121 VDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVF 180
           VD+RMNLSLFLTI ++S+LFLLVFHIIF+GLWY+GLVSRVAG+RP ILTI+QNC V+SV 
Sbjct: 121 VDMRMNLSLFLTISVSSILFLLVFHIIFIGLWYIGLVSRVAGRRPAILTILQNCAVLSVA 180

Query: 181 CCVFYSHCGNRAVLRHRPLERRNSSWFSLWKKEERNTWLAKFLRMNELKDQVCSSWFAPV 240
           CCVFYSHCGN A LR R  +R+ SSWFS WKKEER+TWLAKFLRMNELKDQVCSSWFAPV
Sbjct: 181 CCVFYSHCGNLANLRDRRSQRKYSSWFSFWKKEERSTWLAKFLRMNELKDQVCSSWFAPV 240

Query: 241 GSASDYPLLSKWVIYGELG-NDNG--GSSDEISPIYSLWATFIGLYIANYVVERSTGWAL 297
           GSASDYPLLSKWVIYGELG N +G  GSSDEISP+YSLWATFIGLYIANYVVERSTGWAL
Sbjct: 241 GSASDYPLLSKWVIYGELGCNGSGCAGSSDEISPLYSLWATFIGLYIANYVVERSTGWAL 300

Query: 298 THPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQA 357
           THPLSVEEYEK KKKQ+KP+FLDMVPWYSGTSADLFKT FDLLVSVTVFVGRFDMRMMQA
Sbjct: 301 THPLSVEEYEKSKKKQMKPDFLDMVPWYSGTSADLFKTAFDLLVSVTVFVGRFDMRMMQA 360

Query: 358 AMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSV 417
           AMN+ Q+GAQ G LLYDH ++K++LWFDFMADTGDGGNSSY+VARLLAQP I+VTR DSV
Sbjct: 361 AMNRAQDGAQQG-LLYDHFNDKDELWFDFMADTGDGGNSSYTVARLLAQPSIQVTRGDSV 419

Query: 418 FTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVP 477
            +LPRG++LLIGGDLAYPNPS+FTYERRLF PFEYALQPPPWYK+DH+AVNKPE+P GV 
Sbjct: 420 LSLPRGNLLLIGGDLAYPNPSSFTYERRLFCPFEYALQPPPWYKQDHIAVNKPELPDGVA 479

Query: 478 ELKQYDGPQCYIIPGNHDWF 497
           ELKQYDGPQC++IPGNH WF
Sbjct: 480 ELKQYDGPQCFLIPGNHGWF 499


>gi|414589083|tpg|DAA39654.1| TPA: hypothetical protein ZEAMMB73_888857 [Zea mays]
          Length = 756

 Score =  822 bits (2123), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/527 (71%), Positives = 451/527 (85%), Gaps = 1/527 (0%)

Query: 280 IGLYIANYVVERSTGWALTHPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDL 339
           +GLYIANYV+ERSTGW LTHPL++ EYEK+KK QLKP+F DMVPWYSGTS DLFKTVFDL
Sbjct: 1   MGLYIANYVIERSTGWVLTHPLTISEYEKLKK-QLKPDFEDMVPWYSGTSTDLFKTVFDL 59

Query: 340 LVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYS 399
           ++SVT+FVGRFDMRMMQAAMNK  + A   DLLYDHL  K++LWFDF+ADTGDGGNS+Y+
Sbjct: 60  MISVTLFVGRFDMRMMQAAMNKTPDEANSHDLLYDHLDGKDELWFDFIADTGDGGNSTYA 119

Query: 400 VARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPW 459
           VARLLAQP + +  DDS  T PRG +LL+GGDLAYPNPS+F+YERR F PFEYALQPP W
Sbjct: 120 VARLLAQPLLVINSDDSRLTFPRGQLLLVGGDLAYPNPSSFSYERRFFCPFEYALQPPAW 179

Query: 460 YKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMP 519
           YK +H+A+ KPE+P GV EL++Y GPQC++IPGNHDWFDGLNTF+R+ICHKSW+GGWF+P
Sbjct: 180 YKPEHIALEKPELPLGVSELRRYRGPQCFMIPGNHDWFDGLNTFIRYICHKSWVGGWFLP 239

Query: 520 QKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLL 579
           QKKSYFAL+LP GWWVFGLD ALH DIDVYQFKFFAEL +++VGE DSVII+THEPNWLL
Sbjct: 240 QKKSYFALKLPNGWWVFGLDQALHGDIDVYQFKFFAELCQQKVGESDSVIIITHEPNWLL 299

Query: 580 DWYFNNVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGA 639
           DWY+ + +G NV +LI +YL+GRCKLR+AGD+HHYMRHS + S  PV+VQHLLVNGCGGA
Sbjct: 300 DWYWGDSTGTNVAYLIREYLRGRCKLRMAGDLHHYMRHSCIESKEPVHVQHLLVNGCGGA 359

Query: 640 FLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVL 699
           FLHPTHVF NFR FYG  YE+K+ YPS+ DSS+IALGNILKFR+KNWQFD IGG VYFVL
Sbjct: 360 FLHPTHVFENFRVFYGNKYETKSTYPSYHDSSKIALGNILKFRRKNWQFDVIGGFVYFVL 419

Query: 700 VFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVP 759
           VFSMFPQC+  HIL EDS++G +  FF  +WNA   +LE SYVS  G + LL+V+  FVP
Sbjct: 420 VFSMFPQCDSFHILHEDSWAGRINGFFTAMWNAVFEILERSYVSLGGVVTLLMVSFFFVP 479

Query: 760 SKLSRKKRAMIGVLHVSAHLAAALILMLLLELGVETCIQHKLLATSG 806
           +KLSR++R ++G LH +AHL +A++LMLL+EL +E CI++ LLATSG
Sbjct: 480 TKLSRRRRVLLGFLHAAAHLTSAVLLMLLMELAIEICIRNHLLATSG 526


>gi|326492201|dbj|BAJ98325.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 534

 Score =  535 bits (1378), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 245/327 (74%), Positives = 285/327 (87%)

Query: 480 KQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLD 539
           K+YDGPQC+IIPGNHDWFDGL+TFMR+ICHKSWLGGWF+PQ+KSYFALQL KGWW+FGLD
Sbjct: 1   KKYDGPQCFIIPGNHDWFDGLHTFMRYICHKSWLGGWFLPQRKSYFALQLTKGWWIFGLD 60

Query: 540 LALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYL 599
           LALH DIDVYQFKFFAEL + +VGE DSVII+THEPNWLLDWY+   +GKNV HLI DYL
Sbjct: 61  LALHGDIDVYQFKFFAELCRNKVGENDSVIIVTHEPNWLLDWYWKETTGKNVSHLIQDYL 120

Query: 600 KGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYE 659
            GRCKLR+AGD+HH+MRHS   SD P +VQHLLVNGCGGAFLHPTHVF NF +  GTTYE
Sbjct: 121 NGRCKLRMAGDLHHFMRHSATRSDKPTFVQHLLVNGCGGAFLHPTHVFKNFERSSGTTYE 180

Query: 660 SKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFS 719
            KAAYPS+E+SS IALGNILKFRKKNWQFD IGG +YF+LVFSMFPQC L HIL E+++S
Sbjct: 181 CKAAYPSYEESSGIALGNILKFRKKNWQFDIIGGFIYFILVFSMFPQCNLVHILNEETWS 240

Query: 720 GHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHL 779
           G L+SF  T+W+A +Y+ EHSYVS  G+L LL+ + +FVPSKL+RKKRA+IG LHV AHL
Sbjct: 241 GRLQSFSSTIWSALLYIFEHSYVSSVGSLTLLMASYSFVPSKLTRKKRAIIGGLHVVAHL 300

Query: 780 AAALILMLLLELGVETCIQHKLLATSG 806
            AAL+LMLL+ELG+E CI++ LLATSG
Sbjct: 301 TAALVLMLLMELGIEICIRNHLLATSG 327


>gi|224136690|ref|XP_002326921.1| predicted protein [Populus trichocarpa]
 gi|222835236|gb|EEE73671.1| predicted protein [Populus trichocarpa]
          Length = 521

 Score =  525 bits (1353), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 261/313 (83%), Positives = 297/313 (94%)

Query: 495 DWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFF 554
           DWFDGL+TFMR+ICHKSWLGGWFMPQKKSYFALQLPK WWVFGLDLALH DIDVYQFKFF
Sbjct: 1   DWFDGLHTFMRYICHKSWLGGWFMPQKKSYFALQLPKRWWVFGLDLALHNDIDVYQFKFF 60

Query: 555 AELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKGRCKLRIAGDMHHY 614
           AEL++E+V + DSVI++THEPNWLLDWY+N+VSGKNV HLICDYLKGRCK+R+AGD+HHY
Sbjct: 61  AELIQEKVADNDSVILITHEPNWLLDWYWNDVSGKNVSHLICDYLKGRCKIRVAGDLHHY 120

Query: 615 MRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIA 674
           MRHS+VP+DGPV+VQHLLVNGCGGAFLHPTHVFSNF+K YGT+YE+KAAYPS EDSSRIA
Sbjct: 121 MRHSFVPADGPVHVQHLLVNGCGGAFLHPTHVFSNFKKLYGTSYENKAAYPSLEDSSRIA 180

Query: 675 LGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFM 734
           LGNILKFRKKNWQFD IGG +YFVL FSMFPQC+L+HIL++++FSGHL SFFGTVWN FM
Sbjct: 181 LGNILKFRKKNWQFDIIGGFIYFVLSFSMFPQCKLDHILQDNTFSGHLWSFFGTVWNVFM 240

Query: 735 YVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHLAAALILMLLLELGVE 794
           +VLEHSYVS  GA+LLLI+AI FVP K+SRKKRA+IG+LHVS+HLAAALILMLLLELG+E
Sbjct: 241 HVLEHSYVSMTGAILLLILAIAFVPPKVSRKKRAVIGILHVSSHLAAALILMLLLELGIE 300

Query: 795 TCIQHKLLATSGE 807
           TCI+HKLLATSG+
Sbjct: 301 TCIRHKLLATSGK 313


>gi|7486571|pir||T05132 hypothetical protein F7H19.190 - Arabidopsis thaliana
          Length = 443

 Score =  511 bits (1317), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 256/323 (79%), Positives = 289/323 (89%)

Query: 495 DWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFF 554
           DWFDGLNTFMR++CHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALH DIDVYQF FF
Sbjct: 21  DWFDGLNTFMRYVCHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHGDIDVYQFNFF 80

Query: 555 AELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKGRCKLRIAGDMHHY 614
           ++LVKE+VGE D+VII+THEPNWLLDWY+ + +GKN++HLI ++LKGRCKLR+AGD+HHY
Sbjct: 81  SKLVKEKVGENDAVIIITHEPNWLLDWYWKDDTGKNMRHLIFEFLKGRCKLRMAGDLHHY 140

Query: 615 MRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIA 674
           MRHS   SDGPV+V HLLVNGCGGAFLHPTHVF  F KFYG +YESK+AYPSFEDSSRIA
Sbjct: 141 MRHSCTQSDGPVHVPHLLVNGCGGAFLHPTHVFRCFSKFYGASYESKSAYPSFEDSSRIA 200

Query: 675 LGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFM 734
           LGNILKFRKKNWQFDFIGGI+YF+LVFS+FPQCEL HILR DSFSGHL SFFGTVW++F+
Sbjct: 201 LGNILKFRKKNWQFDFIGGIIYFLLVFSLFPQCELGHILRGDSFSGHLGSFFGTVWSSFV 260

Query: 735 YVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHLAAALILMLLLELGVE 794
           YV E SYVSF G L+LLI AI FVPSK+SR+KR +IG+LHVSAHL AALILMLLLELG+E
Sbjct: 261 YVTEQSYVSFTGVLMLLITAIMFVPSKISRRKRLLIGILHVSAHLMAALILMLLLELGIE 320

Query: 795 TCIQHKLLATSGEFFILVSFNSV 817
            CIQHKLLA SG   +   + SV
Sbjct: 321 ICIQHKLLANSGYHTLYQWYKSV 343


>gi|115470487|ref|NP_001058842.1| Os07g0134500 [Oryza sativa Japonica Group]
 gi|34394414|dbj|BAC83511.1| unknown protein [Oryza sativa Japonica Group]
 gi|113610378|dbj|BAF20756.1| Os07g0134500 [Oryza sativa Japonica Group]
          Length = 527

 Score =  502 bits (1292), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 228/318 (71%), Positives = 276/318 (86%)

Query: 489 IIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDV 548
           +IPGNHDWFDGL+TFMR+ICHKSWLGGWF+PQK+SYFAL+LP GWWVFGLD ALH DIDV
Sbjct: 1   MIPGNHDWFDGLHTFMRYICHKSWLGGWFLPQKRSYFALKLPNGWWVFGLDQALHGDIDV 60

Query: 549 YQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKGRCKLRIA 608
           YQFKFFAEL +++VGE DSVI++THEPNWLLDWY+ + +G NV++LI +YLKGRCKLR+A
Sbjct: 61  YQFKFFAELCQQKVGESDSVILITHEPNWLLDWYWGDKTGTNVEYLIREYLKGRCKLRMA 120

Query: 609 GDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFE 668
           GD+HHYMRHS++ S  PV+VQHLLVNGCGGAFLHPTHVF NFR+FYG  YE+K AYPS++
Sbjct: 121 GDLHHYMRHSFIESKEPVHVQHLLVNGCGGAFLHPTHVFENFREFYGNKYETKIAYPSYD 180

Query: 669 DSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSFFGT 728
           DSS+IALGNILKFR+KNWQFD IGG VYFVLVFSMFPQC+   ILREDS++  + SFF  
Sbjct: 181 DSSKIALGNILKFRRKNWQFDVIGGFVYFVLVFSMFPQCDSFRILREDSWADRVNSFFTA 240

Query: 729 VWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHLAAALILMLL 788
           +WN    +LEHSYVS AG + LL+V+  FVP+KLSR++RA++G LH  AHL +A+ILMLL
Sbjct: 241 MWNVVFEILEHSYVSLAGVVTLLMVSFFFVPTKLSRRRRALLGFLHAVAHLTSAVILMLL 300

Query: 789 LELGVETCIQHKLLATSG 806
           +EL +E CI++ LLATSG
Sbjct: 301 MELAIEICIRNNLLATSG 318


>gi|384253332|gb|EIE26807.1| hypothetical protein COCSUDRAFT_12255, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 884

 Score =  493 bits (1268), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 286/747 (38%), Positives = 407/747 (54%), Gaps = 84/747 (11%)

Query: 94  YSNFSRWYIAWILVAAVYHLPSFQSMGVDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWY 153
           Y NF   YI W+  A   HLPSF+++G D+R ++S+ LTIFL S L   +  I+  GL  
Sbjct: 1   YINFYSLYIGWLCSAVFLHLPSFKALGFDVRTDVSMLLTIFLLSCLVSRLVPIMPSGLLS 60

Query: 154 VGLVSRVAGKRPEILTIIQNCVVISVFCCVFYSHCGNRAVLRHRPLERRNSSWFSLWKKE 213
            G           +L I+  CV I V C  +YS CGN A               ++    
Sbjct: 61  QG----------HLLLILHLCVTILVACSTYYSFCGNAA---------------AVGADA 95

Query: 214 ERNTWLAKFLRMNELKDQVCSSWFAPVGSASDYPLLSKWVIYGELG-------------N 260
           +    +  F      +  VC+ W  P+    ++P  S WVIYG                 
Sbjct: 96  KGGGAVGGF------RAAVCTKWLRPI-DMREHPAFSSWVIYGGASPLHLSFLAFSTQMT 148

Query: 261 DNGGSSDEISPIYSLWATFIGLYIANYVVERSTGWALTHPLSVEEYEKMKKKQLKPEFLD 320
            +  +++ ISP+++LW T + +Y+  Y    S        L+ EE      +Q  P+FL 
Sbjct: 149 SSIPAAEIISPVFTLWITLVIMYMGAYHAPFSLK---IQQLNAEEEGASSDEQ--PDFLP 203

Query: 321 MVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAM------NKDQEGAQHGDLLYD 374
           M PWYSGTSAD+++T+F L+VS+ +F+GRFDMR MQAA              +     ++
Sbjct: 204 MFPWYSGTSADMYRTIFGLIVSLKLFLGRFDMRTMQAATGVGPPGGGSGPPREGDGFTFE 263

Query: 375 HLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFT----LPRGDVLLIGG 430
           H ++K +LW DF ADTGDGG+ +Y+VAR +A P +     D V+     LPRGDVL++GG
Sbjct: 264 HEADKRELWIDFCADTGDGGDPTYAVARAMAAPVLHAM--DMVYARGKLLPRGDVLILGG 321

Query: 431 DLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVP----SGVPELKQYDGPQ 486
           DLAYPNPS  TYE R   PFE AL PPP      + VNKP++P    +    L++YDGP 
Sbjct: 322 DLAYPNPSTETYELRFVGPFETALPPPPGVHPGRLVVNKPDLPMVACAASETLRRYDGPT 381

Query: 487 CYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDI 546
            + IPGNHDW DGL TF RFI H+ WLGGW +PQ+KSYFAL+LP GWW+FGLDLAL  DI
Sbjct: 382 AFAIPGNHDWIDGLETFQRFIHHRGWLGGWLLPQEKSYFALRLPHGWWLFGLDLALVDDI 441

Query: 547 DVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKGRCKLR 606
           D+ Q+++FA +  E++G  D  I++TH+P WL+DW+    +  N++ L+  +L+GR +L 
Sbjct: 442 DMCQYRYFARIADERMGPEDQAILVTHQPGWLVDWFHEEAAALNLRQLVRGHLRGRARLH 501

Query: 607 IAGD--MHHYM-----------RHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKF 653
           +AGD  + H +             + + +  P   +HL+VNG GGAFLHPTHVF+  R  
Sbjct: 502 LAGDSPLTHPLPALQAVGGSAATAATIANLHPFDPEHLIVNGLGGAFLHPTHVFAPARF- 560

Query: 654 YGTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQC-ELNHI 712
              +++  A YP+ E S ++   N+  FR KN +FD IGG  YF++V S+ P+C  +  I
Sbjct: 561 --ASFQCAAVYPTPEQSLQLGRQNLHLFRFKNTRFDVIGGAFYFLMVVSVLPRCSRVTDI 618

Query: 713 LREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGV 772
           L   +    L        +A   +   SY+S    L L ++       +       MI  
Sbjct: 619 LDAHTLLEALGYLVLAAADALAAIFTESYLSLIAVLCLYMLCWCMANGRTGGLSTQMIAA 678

Query: 773 LHVSAHLAAALIL-MLLLELGVETCIQ 798
           L  ++    A IL ++LLELGVETCI 
Sbjct: 679 LAHASAHLTAAILSLVLLELGVETCIS 705


>gi|326498529|dbj|BAJ98692.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 502

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 204/290 (70%), Positives = 246/290 (84%)

Query: 517 FMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPN 576
           F+PQK+SYFAL+LP GWWVFGLD ALH DIDVYQFKFFAEL +++VGE DSVI++THEPN
Sbjct: 3   FLPQKRSYFALKLPNGWWVFGLDQALHGDIDVYQFKFFAELCQQKVGEHDSVILITHEPN 62

Query: 577 WLLDWYFNNVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGC 636
           WLLDWY+++ +GKNV +LI +YLKGRCKLR+AGD+HHYMRHS   +  PV+VQHLLVNGC
Sbjct: 63  WLLDWYWSDKTGKNVTYLIREYLKGRCKLRMAGDLHHYMRHSCTETKEPVHVQHLLVNGC 122

Query: 637 GGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVY 696
           GGAFLHPTHVF NF++ YG  YE+KA YPS+EDSS+IALGNILKFR+KNWQFD IGG VY
Sbjct: 123 GGAFLHPTHVFENFKECYGNKYETKAVYPSYEDSSKIALGNILKFRRKNWQFDVIGGFVY 182

Query: 697 FVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAIT 756
           FVLVFSMFPQC+   IL EDS+ G + SFF   WNA   +LEHSYVS AG L LL V+  
Sbjct: 183 FVLVFSMFPQCDSFRILHEDSWDGRVNSFFNATWNAIFEILEHSYVSLAGVLALLTVSFF 242

Query: 757 FVPSKLSRKKRAMIGVLHVSAHLAAALILMLLLELGVETCIQHKLLATSG 806
           FVP+KLSRK+RA++G LH +AH+ +A++LMLL+ELG+E CI++ LLATSG
Sbjct: 243 FVPTKLSRKRRALLGFLHATAHITSAVLLMLLMELGIEICIRNHLLATSG 292


>gi|348675773|gb|EGZ15591.1| hypothetical protein PHYSODRAFT_315827 [Phytophthora sojae]
          Length = 1041

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 293/885 (33%), Positives = 422/885 (47%), Gaps = 172/885 (19%)

Query: 60  KLDNNIKWWSMYACLLGFFYFFSSPFIGK-----TITPSYSNFSRWYIAWILVAAVYHLP 114
           K ++ + W+++Y  +   FYFF+SPF+G      + TP Y  FS    AW+  AA  H P
Sbjct: 12  KAEDQLVWFTLYLVIYLAFYFFTSPFVGNRELLNSWTP-YVYFSMVLYAWLFSAAALHTP 70

Query: 115 --SFQSMGVDLRMNLSLFLTIFLASVLFLLVFHIIF----------LGL-WYVGLVSRVA 161
             +   +   ++  +SL    F AS +FL+V  ++           LGL W V  VS   
Sbjct: 71  VHALGLLDSTIKQPISLLFPFFSASFVFLIVLEVLMVLFLSRIAKRLGLSWTVAHVSLAR 130

Query: 162 GKRPEILTIIQNCVVISVFCCVFYSHCGNRAVLRHRPLERRNSSWFSLWKKEERNTWLAK 221
                 + +++N   ISV C     HC         P E R                   
Sbjct: 131 S----FMNVLRNSAAISVACVALIVHCDATYDCAASPDEPRT------------------ 168

Query: 222 FLRMNELKDQVCSSWFAPVGSASDYPLLSKWVIYGELGNDNGGSSDEISPIYSLWATFIG 281
                      C+  F+ V S     L+             GG+         +W   + 
Sbjct: 169 ------YNSHACAQIFSFVQSEDVETLVP---------GSYGGA-------ILIWCMGMF 206

Query: 282 LYIANYVVERSTGWAL-------THPLSVEEYEKMKK-----KQLKPE-FLDMVPWYSGT 328
           L I N+ +ER +G  +       ++ +   E +++ +     ++L PE  L MVPWYS  
Sbjct: 207 LAITNFFLERMSGMRMMVSFGWFSNQVEDNEVDELSEGEHGHERLPPEQNLPMVPWYSML 266

Query: 329 SADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMA 388
              LF TVFDLL+S+ +F+GRFDMR MQ A++ + E     D  +DHL++++++W DFMA
Sbjct: 267 ---LFDTVFDLLISLKIFLGRFDMRTMQRALHPNDE-----DYCFDHLADRDEVWLDFMA 318

Query: 389 DTGDGGNSSYSVARLLAQPHIRVT----RDDSVFT----------------LPRGDVLLI 428
           D GDG NSSY +ARLLAQP + V      D S  T                 PRGD L+I
Sbjct: 319 DCGDGFNSSYQIARLLAQPELEVDCEVPDDKSSDTEQESNEKTKTKTIKRVFPRGDALVI 378

Query: 429 GGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCY 488
           GGDLAYP+P + TYE RLFR FEYA++PPP Y    ++  K + P G   L+ Y+GP  +
Sbjct: 379 GGDLAYPHPDSKTYETRLFRCFEYAMKPPPSYHPSAISTRK-KAPDGCSSLRDYEGPSVF 437

Query: 489 IIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDV 548
            IPGNHDWFDGLNTF R+IC + WLGGW +PQK SYF+++LP GWW+FG+DLAL  D+D 
Sbjct: 438 AIPGNHDWFDGLNTFTRYICQRDWLGGWLLPQKTSYFSIKLPHGWWLFGVDLALENDVDT 497

Query: 549 YQFKFFAELVKEQVGERDSVIIMTHEPNWL---------LDWYF-----NNVSGKNVKHL 594
            QF FF  +V+ Q+G  D+VI++THEP WL         LD  F     N + G+    L
Sbjct: 498 EQFGFFERVVQTQMGPNDAVIVLTHEPRWLLDVYEDKSNLDAKFAYLIENVLKGRCAVRL 557

Query: 595 ICD-----------------------YLKGRCKLRIAG-------------DMHHYMRHS 618
             D                       +   R K+  A                 H  +H 
Sbjct: 558 AGDIHNYTRHSLVEETHTLKRPASMQFDTARPKISTANLPRRHSFSSPVHKHFPHMKQHD 617

Query: 619 YVPSDGPVYV----------------QHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKA 662
            +P++  + +                +HL+++G GGAFLHPTH+ S      G TYE K 
Sbjct: 618 DIPTEERIAMDAQAQSAPQPERERSAEHLIISGGGGAFLHPTHIPSPTLTSNGGTYEHKQ 677

Query: 663 AYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHL 722
            YP    S R A+ N+  FR+ NW+FD IGGI YF +VFSMFP+C +  I    ++    
Sbjct: 678 CYPPAHVSRRYAVLNVFGFRRINWRFDAIGGIGYFAMVFSMFPRCSVGSIYAAATYWEAA 737

Query: 723 RSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHLAAA 782
             F   + +    ++  SYVS   ++ +L+  I F     +  KR  +G+     H  AA
Sbjct: 738 AQFCQELVHILYDMVTTSYVSLLCSIGMLVGMIGFADCT-TLPKRCAMGMAVSLFHCIAA 796

Query: 783 LILMLLLELGVETCIQHKLLATSGEFFILVSFNSVTMNDCGISSF 827
             ++L+ E  +E       L   GE  + + F+S   +   I  +
Sbjct: 797 FTILLVYECLLEVASVRGSLGREGEHSLYLFFSSTLPDFSAIRQY 841


>gi|301123029|ref|XP_002909241.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262100003|gb|EEY58055.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1041

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 232/613 (37%), Positives = 332/613 (54%), Gaps = 104/613 (16%)

Query: 60  KLDNNIKWWSMYACLLGFFYFFSSPFIGK-----TITPSYSNFSRWYIAWILVAAVYHLP 114
           K ++ + W+++Y  +   FYFF+SPF+G      + TP Y   S    AW++ AA  H P
Sbjct: 11  KAEDQLVWFALYLVIYLAFYFFTSPFVGNRELLNSWTP-YVYVSMVLYAWLISAAALHTP 69

Query: 115 --SFQSMGVDLRMNLSLFLTIFLASVLFLLVFHII---FLG-----LWYVGLVSRVAGKR 164
             +   +   ++  +SL    F A+ +FL+V  ++   FL        +   V+ V+  R
Sbjct: 70  VHALGLLDSTIKQPISLLFPFFSATFVFLIVLEVLMALFLSRITKKFGFAWTVAHVSLAR 129

Query: 165 PEILTIIQNCVVISVFCCVFYSHCGNRAVLRHRPLERRNSSWFSLWKKEERNTWLAKFLR 224
              + +++N   ISV C     HC              ++++      +E  T+      
Sbjct: 130 -SFMNVLRNSAAISVACVAVIVHC--------------DATYDCAASADEPTTY------ 168

Query: 225 MNELKDQVCSSWFAPVGSASDYPLLSKWVIYGELGNDNGGSSDEISPIYSLWATFIGLYI 284
                   C+  F+ V S     L+             GG+         +W   + L I
Sbjct: 169 ----NSHACAQIFSFVQSDDVETLVPA---------SYGGA-------VLIWCMGMFLAI 208

Query: 285 ANYVVERSTGWALTHPLS------VEEYEKMKKKQ-----LKPE-FLDMVPWYSGTSADL 332
            N+ +ER +G  +            +E +++ + +     L PE  L MVPWYS     L
Sbjct: 209 TNFFLERFSGIRMIASFGWFSNRVEDEMDELSEGENGHELLPPEQNLPMVPWYSML---L 265

Query: 333 FKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGD 392
           F TVFDLL+S+ +F+GRFDMR MQ A++   E     D  +DHL+EK+++W DFMAD GD
Sbjct: 266 FDTVFDLLISLKIFLGRFDMRTMQRALHPKDE-----DYCFDHLAEKDEVWLDFMADCGD 320

Query: 393 GGNSSYSVARLLAQPHIRV----------------------TRDDSV-FTLPRGDVLLIG 429
           G NSSY +ARLLAQP + V                      T+  +V    PRGD L+IG
Sbjct: 321 GFNSSYQIARLLAQPQLEVDCQVPDDKSSDGDTLDQEQASTTKTKTVKRVFPRGDALVIG 380

Query: 430 GDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYI 489
           GDLAYP+P   TYE RLFR FEYA++PP  Y    ++  K + P G P L++Y+GP  + 
Sbjct: 381 GDLAYPHPDDKTYETRLFRCFEYAMKPPSSYHPSAISTQK-KTPDGCPSLREYEGPSVFA 439

Query: 490 IPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVY 549
           IPGNHDWFDGLNTF R+IC + WLGGW +PQK SYF+L+LP GWW+FG+DLAL  D+D  
Sbjct: 440 IPGNHDWFDGLNTFTRYICQRDWLGGWLIPQKTSYFSLKLPHGWWLFGVDLALENDVDTE 499

Query: 550 QFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVK--HLICDYLKGRCKLRI 607
           QF FF  +VK Q+G  D+VI++THEP WLL+ Y  + S  +VK  +LI + LKGR  +R+
Sbjct: 500 QFGFFERVVKTQMGPNDAVIVLTHEPRWLLNVY-EDRSNVDVKLSYLIENVLKGRVAVRL 558

Query: 608 AGDMHHYMRHSYV 620
           AGD+H+YMR+S V
Sbjct: 559 AGDIHNYMRYSLV 571



 Score =  116 bits (290), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 67/207 (32%), Positives = 107/207 (51%), Gaps = 1/207 (0%)

Query: 621 PSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILK 680
           P+D     +HL+++G GGAFLHPTH+ S+     G  Y+ K  YP    S R A+ N+  
Sbjct: 636 PADLERSAEHLIISGGGGAFLHPTHIPSSNLASNGGNYKQKLCYPPAHVSRRYAVLNVFG 695

Query: 681 FRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHS 740
           FR+ NW+FD IGG+ YF LVFSMFP+C +  I  + +       F+  + +    ++  S
Sbjct: 696 FRRLNWRFDAIGGVGYFALVFSMFPRCSVTSIYAQATRWDAAAQFYLELVHLLYDMVTTS 755

Query: 741 YVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHLAAALILMLLLELGVETCIQHK 800
           YVS   ++++L+  I F     +  KR  +G+     H  AA  ++L+ E  +E      
Sbjct: 756 YVSLTCSIVMLVGTIGFADCT-TLPKRCALGLAVAFFHYVAAFTILLVYECLLEVASVRG 814

Query: 801 LLATSGEFFILVSFNSVTMNDCGISSF 827
            L   GE  + + F+S   +  G+  +
Sbjct: 815 ALGRDGEHTLYLFFSSTLPDLSGVRKY 841


>gi|223998152|ref|XP_002288749.1| hypothetical protein THAPSDRAFT_261726 [Thalassiosira pseudonana
           CCMP1335]
 gi|220975857|gb|EED94185.1| hypothetical protein THAPSDRAFT_261726 [Thalassiosira pseudonana
           CCMP1335]
          Length = 866

 Score =  377 bits (968), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 212/550 (38%), Positives = 314/550 (57%), Gaps = 52/550 (9%)

Query: 295 WALTHPLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRM 354
           W+L+  L  ++  + K +Q K + L M PW+      L K+ FD+LVS+ +F+GRFD R 
Sbjct: 65  WSLSSTL--QQLTRAKAEQPK-DTLSMCPWFH---VMLIKSGFDILVSLHIFLGRFDARK 118

Query: 355 MQAAMNKDQEG-----------AQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARL 403
           MQ A+ + ++              H +   D   E    WFDFM+D GDG NSSY V+R 
Sbjct: 119 MQVALLESKDANAVFDFSNCLHQNHDNAAND--DEDNGFWFDFMSDCGDGFNSSYQVSRC 176

Query: 404 LAQPHIRVTR------DDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPP 457
           LAQP + VT          V  LPRG +L+IGGDLAYP+P+  TYERR FR FE A+ PP
Sbjct: 177 LAQPFLGVTAFSTAAPKRKVRQLPRGKLLVIGGDLAYPDPTPETYERRFFRTFEDAMPPP 236

Query: 458 PWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWF 517
           P ++K+H++++KP +P    ++  Y+GP  + IPGNHDWFDGL T+ R+I  + WLGGW 
Sbjct: 237 PSFRKEHISIHKPALPVKGNQISSYEGPIAFAIPGNHDWFDGLATYTRYILSRDWLGGWI 296

Query: 518 MPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNW 577
           MPQ+ SYFAL+LP GWW+ G DLAL  DI++ QF FFAEL +  +   D+VII+TH P W
Sbjct: 297 MPQQTSYFALKLPCGWWLLGFDLALDNDINIEQFAFFAELSETAMQPDDNVIIVTHIPFW 356

Query: 578 LLDWYFNN----VSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPS----------- 622
           +L  Y N+    +   N++ L+  +L+GR +LR+AGD+HHY RH+               
Sbjct: 357 VLSDYENHSDDALPETNLRELMRTHLRGRVRLRLAGDLHHYTRHTACDDGRNTSTNNTNS 416

Query: 623 -DGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYG--TTYESKAAYPSFEDSSRIALGNIL 679
            D PV    L+V+G GGAFLHPTH F +  +       Y  + AYPS + S+ ++  N+ 
Sbjct: 417 RDSPV----LIVSGGGGAFLHPTHCFRDTLQVGDDKQEYRRECAYPSTKVSAHLSWLNLW 472

Query: 680 KFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEH 739
           +FR +NW+ D I  + YF L  S FP C +     + + + +L        +  +Y++  
Sbjct: 473 QFRWRNWRLDVIWAVTYFGLCSSFFPLCGVYDDYLQYNPTHNLSGLVSWALSRMLYLISQ 532

Query: 740 SYVSFAGALLLLIVAITFV-----PSKLSRKKRAMIGVLHVSAHLAAALILMLLLELGVE 794
            +++   +LL  I  +  +     PS +      +  +LH  AH+++AL+ +L +E   E
Sbjct: 533 IFITGRVSLLFTIFVLGVLYSFTDPSGVKVSTHLVWSLLHSLAHISSALMCLLFVECLAE 592

Query: 795 TCIQHKLLAT 804
             +   L+ T
Sbjct: 593 FIVSEGLVDT 602


>gi|33354216|dbj|BAC81182.1| unknown protein [Oryza sativa Japonica Group]
 gi|50508993|dbj|BAD31942.1| unknown protein [Oryza sativa Japonica Group]
 gi|50510155|dbj|BAD31123.1| unknown protein [Oryza sativa Japonica Group]
          Length = 443

 Score =  333 bits (855), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 170/236 (72%), Positives = 196/236 (83%)

Query: 571 MTHEPNWLLDWYFNNVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQH 630
           MTHEPNWLLDWY+   +GKNV HLI DYL GRCKLR+AGD+HH+MRHS    D P  VQH
Sbjct: 1   MTHEPNWLLDWYWKETTGKNVSHLIQDYLNGRCKLRLAGDLHHFMRHSANQIDNPTSVQH 60

Query: 631 LLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDF 690
           LLVNGCGGAFLHPTHVF NF +F G TYE KAAYPSF+DSS IALGNILKFRKKNWQFD 
Sbjct: 61  LLVNGCGGAFLHPTHVFKNFEQFSGATYECKAAYPSFDDSSGIALGNILKFRKKNWQFDT 120

Query: 691 IGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLL 750
           IGG +YF+LVFSMFPQC L HIL E+++SG L SF  T+W+A +Y+ EHSYVS  G+L L
Sbjct: 121 IGGFIYFILVFSMFPQCNLGHILNEETWSGRLGSFSNTIWSALLYIFEHSYVSSVGSLTL 180

Query: 751 LIVAITFVPSKLSRKKRAMIGVLHVSAHLAAALILMLLLELGVETCIQHKLLATSG 806
           L+ + +FVPSKLSR+KRA+IG LHV AHL AAL+LMLLLELG+E CI++ LLATSG
Sbjct: 181 LLASYSFVPSKLSRRKRAIIGGLHVLAHLTAALLLMLLLELGIEICIRNHLLATSG 236


>gi|397602091|gb|EJK58073.1| hypothetical protein THAOC_21826 [Thalassiosira oceanica]
          Length = 731

 Score =  303 bits (776), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 158/365 (43%), Positives = 221/365 (60%), Gaps = 37/365 (10%)

Query: 308 KMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEG-- 365
           + K +Q   + L MV WYS     +F T FD+L++  VF+GRFD R MQ A+ +  +G  
Sbjct: 348 RQKDEQQPQDILPMVSWYSNM---IFATGFDMLLASKVFLGRFDERKMQYALLRGTKGPT 404

Query: 366 --AQHGDLLYDHLSEK------EDLWFDFMADTGDGGNSSYSVARLLAQPHIRV-TRDDS 416
              +    L+D   E+      +  WFD+M+D GDG +SSY VARLLAQP + V T    
Sbjct: 405 SETELARPLFDFSKERRSEGDSDGFWFDWMSDCGDGFHSSYQVARLLAQPSLDVVTPSHG 464

Query: 417 VFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVP--- 473
              LPRG +L+IGGDLAYP P+ F YE+R FR FE A+ PPP Y+K+H+++ KP +P   
Sbjct: 465 TRALPRGKLLVIGGDLAYPGPTPFNYEQRFFRTFEDAMSPPPSYRKEHISIKKPALPVKG 524

Query: 474 --------SGVPELKQYDGPQCYI-------IPGNHDWFDGLNTFMRFICHKSWLGGWFM 518
                        L+ Y+GP  +I       +PGNHDWFDGL  + R+I  + WLGGW +
Sbjct: 525 WDADAVAGDDGDALQNYEGPVTFIKTKLETSVPGNHDWFDGLAAYTRYILSRDWLGGWLI 584

Query: 519 PQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWL 578
           PQ+ SYFAL+LPK WW+ G DLAL  DI++ QF FFA L +  + + D+V++ +H P+W+
Sbjct: 585 PQRTSYFALKLPKNWWLLGFDLALDDDINIEQFHFFANLAENDMQKDDNVVVASHVPHWV 644

Query: 579 LDWYFN---NVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNG 635
           L+ Y     +    N++ LI  +LK R +LR+AGD+HHY RH  VP +       L+V+G
Sbjct: 645 LEDYEEFKHDEKETNLRELIKSFLKIRVRLRLAGDLHHYTRH--VPCEERSVQPQLVVSG 702

Query: 636 CGGAF 640
            GG F
Sbjct: 703 AGGHF 707


>gi|325191039|emb|CCA25522.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 837

 Score =  291 bits (745), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 153/358 (42%), Positives = 216/358 (60%), Gaps = 55/358 (15%)

Query: 309 MKKKQLKPEF-LDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQ 367
           MK+++L  E  L+MVPWYS     LF T F+L+VS+ +F+GRFD R +Q A++   +   
Sbjct: 1   MKQEKLPDEHHLEMVPWYSMF---LFNTAFELMVSLKLFLGRFDHRSLQRAIHSSDD--- 54

Query: 368 HGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRV-------------TRD 414
             + ++DHL+ ++++W DF+AD GDG +SSY VARLLAQP + V              + 
Sbjct: 55  --NYIFDHLASRDEVWLDFVADCGDGFDSSYQVARLLAQPDLEVECSAIGRKGEKKDQKR 112

Query: 415 DSVFTLPRGDVLLIGGDLAYPNPSA------------------FTYERRLFRPFEYALQP 456
           +     PRGDVL++GGDLAYP+P+A                  F Y+ R +R FEYA++P
Sbjct: 113 NEKRVYPRGDVLIVGGDLAYPHPNAVNYEVEGILDVISLAHTSFLYQSRFWRVFEYAMKP 172

Query: 457 PPWYKKDHVAV-------------NKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTF 503
           P +Y    V+              N   +PS  P L  Y+GP  + IPGNHDWFDGL+TF
Sbjct: 173 PSFYDPTAVSARKLGLAPYHKAKGNASAMPSA-PFLNNYEGPCAFAIPGNHDWFDGLSTF 231

Query: 504 MRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVG 563
            RF+C++ WLGGW +PQK S+F L+LPKGWW+  +DLAL  DI+  QF+ F  + ++ + 
Sbjct: 232 NRFVCNRDWLGGWHLPQKTSHFILKLPKGWWLIAVDLALEDDINTEQFELFQRVAEKSIQ 291

Query: 564 ERDSVIIMTHEPNWLLDWYF-NNVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYV 620
             D VII+THEP W+LD       S + + +LI     GR  +R+AGD+H+YMRHS V
Sbjct: 292 AEDGVIIVTHEPRWILDGIEGREKSEEKLTYLITRVFNGRVVVRLAGDIHNYMRHSLV 349



 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 65/201 (32%), Positives = 95/201 (47%), Gaps = 2/201 (0%)

Query: 625 PVYVQHLLVNGCGGAFLHPTHVFS-NFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRK 683
           P  ++H  V+G GGAFLHPTH    +  +  G+TY     YP    S R AL N+  FR+
Sbjct: 417 PGAIKHFFVSGGGGAFLHPTHAPECDTIQVNGSTYMQSNCYPPKHVSKRYALLNVFGFRR 476

Query: 684 KNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYVS 743
            NW+FD IGG+ YF L+ SMFP+C +  I +  S+   L+ F   V      ++  SYVS
Sbjct: 477 INWRFDVIGGLGYFCLIASMFPRCSVREIYKSGSWLAMLQLFILEVCAVQYEMMSTSYVS 536

Query: 744 FAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHLAAALILMLLLELGVETCIQHKLLA 803
                + ++V + F     S  K+ M G      H  AA   ++  E  V+  I    L 
Sbjct: 537 LT-TYVYMLVTLVFFADCTSFTKQVMTGAFMSWIHCIAASACLIFYECIVDFAILKGGLG 595

Query: 804 TSGEFFILVSFNSVTMNDCGI 824
             G   +   F +   +  G+
Sbjct: 596 QEGSHSLFQYFTTNVFSFSGL 616


>gi|302846678|ref|XP_002954875.1| hypothetical protein VOLCADRAFT_65342 [Volvox carteri f.
           nagariensis]
 gi|300259850|gb|EFJ44074.1| hypothetical protein VOLCADRAFT_65342 [Volvox carteri f.
           nagariensis]
          Length = 572

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 143/319 (44%), Positives = 198/319 (62%), Gaps = 18/319 (5%)

Query: 490 IPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVY 549
           IPGNHDW DGL TF R I H+ WLGGW +PQ+KSYFAL+LP GWW+FG DLAL  DID+ 
Sbjct: 1   IPGNHDWIDGLETFTREIQHQGWLGGWLLPQEKSYFALRLPAGWWLFGFDLALVQDIDMQ 60

Query: 550 QFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVS-GKNVKHLICDYLKGRCKLRIA 608
           Q+++FA +V++++G  D VI+MTHEP WLL+W++     G N++ L+  +L+GR ++ +A
Sbjct: 61  QYRYFANVVEQRMGPEDQVILMTHEPLWLLEWFWRRPHLGANLRQLVRGHLRGRARVHLA 120

Query: 609 GDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFE 668
           GD+H YMRHS+  S  P   QHL+VNG GGAFLHPTHV    +      Y   A YPS  
Sbjct: 121 GDLHFYMRHSWRWSRHPHDPQHLVVNGGGGAFLHPTHVSPGPQG----EYVCAACYPSPR 176

Query: 669 DSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCE-LNHILREDSFSGHLRSFFG 727
            S ++   N+  FR +N +FD IGG+ YF+LV S+ P+C  L  +L  DS +        
Sbjct: 177 TSLQLGRKNLHVFRLRNTRFDVIGGVFYFLLVVSVLPRCSHLAEVLEADSPA----KAVS 232

Query: 728 TVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHLAAALILML 787
            +W+A+   L     + AG   L   A+  +        R      H + H+  A++L+L
Sbjct: 233 LMWSAYTDTL----AAIAGRSYLSAGALVVLLLLSLGLARGA----HCATHVTFAIVLLL 284

Query: 788 LLELGVETCIQHKLLATSG 806
           LLELGVETCI+++ L   G
Sbjct: 285 LLELGVETCIKYERLGKDG 303


>gi|159474886|ref|XP_001695554.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158275565|gb|EDP01341.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 477

 Score =  256 bits (654), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 147/326 (45%), Positives = 201/326 (61%), Gaps = 9/326 (2%)

Query: 490 IPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVY 549
           IPGNHDW DGL TF R I HK W+GGW MPQ+KSYFAL+LP GWW+FG DLAL  DID+ 
Sbjct: 1   IPGNHDWIDGLETFTREIQHKGWMGGWLMPQEKSYFALRLPLGWWLFGFDLALVQDIDMQ 60

Query: 550 QFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKGRCKLRIAG 609
           Q+++FA +V++++   D VI+MTHEP WLL+WY++   G N++ L+  +L+GR +L +AG
Sbjct: 61  QYRYFANVVEQRMEPGDQVILMTHEPLWLLEWYWHRCLGANLRQLVRGHLRGRARLHLAG 120

Query: 610 DMHHYMRHSYVP---SDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGT-TYESKAAYP 665
           D+H YMRHS+     S  P   +HL+V+G GGAFLHPTHV     K  G  +Y  +AAYP
Sbjct: 121 DLHFYMRHSWARAAWSRHPHDPEHLVVSGGGGAFLHPTHVCVRAFKVCGHFSYVCRAAYP 180

Query: 666 SFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQC--ELNHILREDSFSGHLR 723
           S   S  +   N+  FR KN +FD IGG+VYF+LV S+ P+C   L  +L  +S +    
Sbjct: 181 SPRASLALGRKNLHLFRLKNTRFDVIGGVVYFLLVVSVLPRCSSHLAAVLEAESPAAAAL 240

Query: 724 SFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAM---IGVLHVSAHLA 780
                  +    ++  SY    G    +      VP  L R+ R +       H   H++
Sbjct: 241 HLLAAYLDTLAAIVGRSYGQGRGGAGAVASGRPHVPCCLVRRTRELSVAFAAAHTLTHVS 300

Query: 781 AALILMLLLELGVETCIQHKLLATSG 806
            A+ L+LLLELGVETCI+++ L   G
Sbjct: 301 LAVWLVLLLELGVETCIRYEGLGADG 326


>gi|421875681|ref|ZP_16307267.1| calcineurin-like phosphoesterase family protein [Brevibacillus
           laterosporus GI-9]
 gi|372455315|emb|CCF16816.1| calcineurin-like phosphoesterase family protein [Brevibacillus
           laterosporus GI-9]
          Length = 630

 Score =  225 bits (574), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 160/504 (31%), Positives = 241/504 (47%), Gaps = 80/504 (15%)

Query: 321 MVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQA-------------AMNKDQEGA- 366
           MV WY+     L  T    L+S T+F    D R+++A             A+ +D++G  
Sbjct: 31  MVNWYN--PKQLIVTGIKTLLS-TLFGLYSDFRLIEAYGQSAPRFEDYSKAIKQDEQGNY 87

Query: 367 ---QHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFT--LP 421
              + G    D   E+E++W D+++D GDG NS+Y++A  L++P + +   DS  +    
Sbjct: 88  VTDKQGFYCLDESREREEVWIDYVSDLGDGFNSTYTIAYYLSRPTLCLEDTDSNHSHITS 147

Query: 422 RGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQ 481
           RG++L+ GGD  YP  +   YE RL  P+  A++                         +
Sbjct: 148 RGNILVFGGDQVYPTANKKAYETRLITPYYLAMR-----------------------YSE 184

Query: 482 YDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLA 541
           +  PQ + IPGNHDW+DGL  F R  C + W  GW  PQ+KSYFA++LP  WW+ G D+ 
Sbjct: 185 HPHPQVFAIPGNHDWYDGLVAFTRLFCSRKWFNGWQAPQQKSYFAIKLPHHWWLLGTDVQ 244

Query: 542 LHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLD---WYFNNVSGKNVKHLICDY 598
           L+ DID  Q KFF E + EQ+ E D VI+   EP W+       +++   +N    +   
Sbjct: 245 LNSDIDGPQIKFF-ESIAEQMQEGDRVILCNAEPYWVSSSKYGEYDHTYNENNLFFLEKL 303

Query: 599 LKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQH-----LLVNGCGGAFLHPTHVFSN--FR 651
           LK + ++ IAGD HHY R  +VP D      H      +  G GGAFLHPTH F     +
Sbjct: 304 LKKQIQVFIAGDQHHYRRFEFVPEDSKGNPDHSKKVVKITAGGGGAFLHPTHDFKEQYIK 363

Query: 652 KFYGTTYESKAA--------YPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSM 703
           + +     SK          +P  + S ++  GN+L F  KN  F  +  I+Y +L F++
Sbjct: 364 EHFSEQITSKEKRMFKRMKDFPDQQKSRQLCRGNLL-FLAKNKTFGIVTSIIYLILAFAV 422

Query: 704 FPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITF--VPSK 761
           F           DS S    S FG      +Y   H   S      +LI+A  F      
Sbjct: 423 F---------GVDSPSPPDGS-FGDSLVITIYKATH---SIQAIFWMLIIAGGFWLFTDT 469

Query: 762 LSRKKRAMIGVLHVSAHLAAALIL 785
            SR+ + + G +H  +H+ AAL++
Sbjct: 470 HSRRYKWIAGCIHGLSHITAALLI 493


>gi|383452348|ref|YP_005366337.1| hypothetical protein COCOR_00329 [Corallococcus coralloides DSM
           2259]
 gi|380727416|gb|AFE03418.1| hypothetical protein COCOR_00329 [Corallococcus coralloides DSM
           2259]
          Length = 610

 Score =  224 bits (570), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 136/386 (35%), Positives = 196/386 (50%), Gaps = 52/386 (13%)

Query: 307 EKMKKKQLKPE-FLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEG 365
           E+      +P+   DMV W       + +T  D +V+ TVF  R D R+++A +   Q  
Sbjct: 34  ERGTATATRPQKHADMVRWLH--PGQVLRTGLDAVVA-TVFGARADHRLIEAVVRPQQP- 89

Query: 366 AQHGDLLYDHLSEKE---DLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDD--SVFTL 420
                  +D+  E     D W D+++D GDG +S+Y+VARLLA P +R+  +D  +    
Sbjct: 90  ------YFDYSEESGADGDFWLDYVSDIGDGWDSTYAVARLLALPELRLPEEDGKTAHIT 143

Query: 421 PRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELK 480
           PRG VL+ GGD  YP  S  TYE R  +P+E A++                         
Sbjct: 144 PRGRVLVFGGDEVYPGASRETYEERTVQPYEAAMR-----------------------RS 180

Query: 481 QYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDL 540
           Q   P  ++IPGNHDW+DGL+ FMR  C + WL G    Q +SYFAL+LPK WW+ G D+
Sbjct: 181 QAPHPDLFVIPGNHDWYDGLSAFMRLFCAQRWLAGRRTRQSRSYFALKLPKSWWLIGTDV 240

Query: 541 ALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSG----KNVKHLIC 596
            L+ DIDV Q ++F + V EQ+G  D VI+   EP W+L        G     N+++L  
Sbjct: 241 QLNSDIDVPQVEYFRQ-VAEQMGPDDRVILCNAEPAWVLAAAARRTKGSYLENNLEYLEE 299

Query: 597 DYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGT 656
             L  R  + +AGD+HHY RH           QH +  G GGAF+HPTH  +      G+
Sbjct: 300 KVLGRRIAVFLAGDLHHYRRHEDDTG------QHRITAGGGGAFMHPTHAPAAPVLRDGS 353

Query: 657 TYESKAAYPSFEDSSRIALGNILKFR 682
           T   + ++P    S ++A  N+   R
Sbjct: 354 TL--RKSFPDEATSRKLARKNLFLIR 377


>gi|401421230|ref|XP_003875104.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322491340|emb|CBZ26609.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 1213

 Score =  223 bits (569), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 149/464 (32%), Positives = 221/464 (47%), Gaps = 79/464 (17%)

Query: 381 DLWFDFMADTGDGGNSSYSVARLLAQPHIRVTR----------------DDSVFT----- 419
           D+WFD++AD GDG N +Y++ARLLA+P +++                  DDS  T     
Sbjct: 481 DIWFDWIADVGDGFNPTYAMARLLARPSLKIRSHRPPSKRVGLSFLSAFDDSTPTETPTF 540

Query: 420 ------LPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPP-----WYKKDHVAVN 468
                 LPRG  +L+GGDLAYP+P+  TY  RL  P+  A+         ++ +    V 
Sbjct: 541 DREPSVLPRGSFVLVGGDLAYPSPNDETYTTRLLEPYHDAMSSNARLQSVFHVEQRRVVV 600

Query: 469 KPEVPSGVPELKQYDG---------------------------PQCYIIPGNHDWFDGLN 501
                + V  +   D                            P  + IPGNHDWFDGL 
Sbjct: 601 ADASDADVAHMHMLDAETVSRMATGHAALRTGRATAEEALRSVPLLFAIPGNHDWFDGLT 660

Query: 502 TFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQ 561
           T+ ++I  ++WLGGW MPQ+ S+F LQLP  W+V   D     DIDV Q  +F +++++ 
Sbjct: 661 TYRKYILERTWLGGWLMPQRSSFFVLQLPHNWFVLCGDTGNVQDIDVAQRNYFLDVIEKH 720

Query: 562 VGERDSVIIMTHEPNWLLDWYF--NNVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSY 619
           +     VI+  HEP WL D     + ++   +   + + L  R +LR+AGD+HHY RH+ 
Sbjct: 721 MDAESCVILAAHEPGWLYDSMLCKSELTQPELAK-VSEALGTRLRLRLAGDIHHYSRHT- 778

Query: 620 VPSDGPVYVQHLLVNGCGGAFLH---PTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALG 676
            P D       L+V+G GGAFLH    T + S       T Y    A+P+  ++  I L 
Sbjct: 779 -PRDASSEAATLVVSGGGGAFLHGPRNTPIVSQL-----TAYRRACAFPA-HNTLPILLS 831

Query: 677 NILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSF--FGTVW--NA 732
            +L FR  NW+FD I G++ F++V S+ PQ   +  +R DS S  L +       W    
Sbjct: 832 RLLGFRVINWKFDLIIGVLSFLVVVSLLPQSIKD--VRRDSESTPLMTLPDAAAAWWERV 889

Query: 733 FMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVS 776
            +YV+       A  L  L+    F  + + R     + +LH S
Sbjct: 890 CVYVVTLFTKGIASVLATLVFFAAFAAAGVERNTPVWMRLLHSS 933


>gi|115376324|ref|ZP_01463563.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
 gi|310817971|ref|YP_003950329.1| hypothetical protein STAUR_0698 [Stigmatella aurantiaca DW4/3-1]
 gi|115366674|gb|EAU65670.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
 gi|309391043|gb|ADO68502.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
          Length = 591

 Score =  223 bits (569), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 139/368 (37%), Positives = 192/368 (52%), Gaps = 45/368 (12%)

Query: 320 DMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEK 379
           DMV W     A   +   D +V+  VF  R D R+++A M + QE        Y H+ + 
Sbjct: 16  DMVRWLH--PAQFIRASMDAIVAA-VFGTRADQRLVEA-MVRPQEPY----FDYSHIEDG 67

Query: 380 ED-LWFDFMADTGDGGNSSYSVARLLAQPHIRV-TRDDSVFTLPRGDVLLIGGDLAYPNP 437
           ED  W D++ADTGDG NS+Y +ARLLA P + + T D  + T  RG VL+ GGD  YP  
Sbjct: 68  EDSFWLDYVADTGDGWNSTYCIARLLALPEMELATADGPLHTTRRGSVLVFGGDTVYPGA 127

Query: 438 SAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWF 497
           S  TYE RL +P+E A++  P              PS          P  ++IPGNHDW+
Sbjct: 128 SRETYEERLIQPYESAMRRSP-------------TPS----------PDMFVIPGNHDWY 164

Query: 498 DGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAEL 557
           DGL  F+R  C + W+ G    Q +SYFAL+LP+ WW+ G D+ L+ DIDV Q ++F + 
Sbjct: 165 DGLAAFLRLFCARRWMAGRRTRQSRSYFALKLPRNWWLLGTDVQLNSDIDVPQVEYFRQ- 223

Query: 558 VKEQVGERDSVIIMTHEPNWLLDWYFNNVSG---KNVKHLICDYLKGRCKLRIAGDMHHY 614
           V  ++   D VI+   EP W+         G    N+++L    L  R  + +AGD+HHY
Sbjct: 224 VASRMDPEDRVILCNAEPAWIHAANTKRKRGYLENNLEYLQEKVLGKRISVFLAGDLHHY 283

Query: 615 MRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIA 674
            RH     DG    Q  +  G GGAFLHPTH         G  Y  + ++P  + S +IA
Sbjct: 284 RRHEN--PDG----QQKITAGGGGAFLHPTHAPKADVLLDG--YTVQKSFPDEKTSRKIA 335

Query: 675 LGNILKFR 682
            GN+L  R
Sbjct: 336 RGNLLLIR 343


>gi|146085103|ref|XP_001465175.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|134069272|emb|CAM67422.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 1213

 Score =  223 bits (567), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 138/409 (33%), Positives = 199/409 (48%), Gaps = 73/409 (17%)

Query: 380 EDLWFDFMADTGDGGNSSYSVARLLAQPHIRV----------------TRDDSV------ 417
            D+WFD++AD GDG N +Y++ARLLA+P +++                T DDS       
Sbjct: 480 RDIWFDWIADVGDGFNPTYAMARLLARPSLKIRWHRPPSKRVGLSFLPTFDDSTPTNTPT 539

Query: 418 -----FTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKK---------- 462
                F LPRG  +L+GGDLAYP P+  TY  RLF P+  A+      +           
Sbjct: 540 VDREPFVLPRGSFVLVGGDLAYPGPNDETYTTRLFEPYHDAMSSNVRLQSVFHAEQRRVV 599

Query: 463 --------------------DHVAVNKPEVPSG--VPELKQYDGPQCYIIPGNHDWFDGL 500
                                 +A  +  + +G    E      P  + IPGNHDWFDGL
Sbjct: 600 VADASDADVAHIHLLDAETVSRMATGRAALRTGRATAEEALRSVPLLFAIPGNHDWFDGL 659

Query: 501 NTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKE 560
            T+ ++I  ++W+GGW MPQ+ S+F L+LP  W+V   D     DIDV Q  +F +++++
Sbjct: 660 TTYRKYILERTWIGGWLMPQRSSFFVLRLPHNWFVLCGDTGNMQDIDVAQRNYFLDVIEK 719

Query: 561 QVGERDSVIIMTHEPNWLLDWYFNNVSGKNVK-HLICDYLKGRCKLRIAGDMHHYMRHSY 619
            +     VI+  HEP WL D           +   + + L  R +LR+AGD+HHY RH+ 
Sbjct: 720 CMDAESCVILAAHEPGWLYDSMLRKSELTQPELAKVSEALGTRLRLRLAGDIHHYSRHT- 778

Query: 620 VPSDGPVYVQHLLVNGCGGAFLH---PTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALG 676
            P D       L+V+G GGAFLH    T V S       T Y    A+P+  ++    L 
Sbjct: 779 -PRDASSEAATLVVSGGGGAFLHGPRNTPVVSQL-----TAYRRACAFPA-RNTLPTLLS 831

Query: 677 NILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSF 725
            +L FR  NW+FD I G+  F++V S+ PQ   +  +R DS S  L + 
Sbjct: 832 RLLGFRVINWKFDLIIGVFSFLVVVSLLPQSIKD--VRHDSESSPLMTL 878


>gi|294935292|ref|XP_002781370.1| hypothetical protein Pmar_PMAR020755 [Perkinsus marinus ATCC 50983]
 gi|239891951|gb|EER13165.1| hypothetical protein Pmar_PMAR020755 [Perkinsus marinus ATCC 50983]
          Length = 1048

 Score =  221 bits (562), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 138/437 (31%), Positives = 219/437 (50%), Gaps = 49/437 (11%)

Query: 420 LPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPEL 479
           LPR  V+  GGDLAYP PS   +  RL RP E+AL  PP+   + V   + +  + V   
Sbjct: 385 LPRASVVFHGGDLAYPVPSHKAFVNRLIRPLEWAL--PPY---EEVTTTESKSSAVV--- 436

Query: 480 KQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLD 539
              D PQ + IPGNHDW+DGL  ++ ++  +  L GW +PQK SYFA++L  GWW++GLD
Sbjct: 437 ---DQPQFFAIPGNHDWYDGLEVYLHWLVGQDHLAGWKLPQKCSYFAVKLSHGWWIWGLD 493

Query: 540 LALHCDIDVYQFKFFAELVKE-QVGERDSVIIMTHEPNWLL-----DWYFNNVSGKNVKH 593
           L+L  D+D  Q+++F  L+   +V   D V+++TH PNW +      W+  + +      
Sbjct: 494 LSLSYDLDRPQYEYFCGLLDSGKVSTEDRVVVITHRPNWEMGTIDHGWFMTSATPSQRTG 553

Query: 594 LICDYL-----KGRCKLRIAGDMHHYMRHSYVPS---------------------DGPVY 627
            +   L     + R  +R+AGD+HHY R  Y+PS                     DG   
Sbjct: 554 YLLSVLLDKIGEPRLAMRLAGDVHHYSR--YMPSVITSTFVLRRIRGGMISLSVQDGSKG 611

Query: 628 VQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKKNWQ 687
           V  L+ +G  GAFLHPTH     +      Y    +YP  + S R+   N ++FR++NW 
Sbjct: 612 VP-LVTSGGAGAFLHPTHFPP--KDILQKEYTRVESYPPEKVSRRLTWLNPIQFRRRNWG 668

Query: 688 FDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGA 747
            D + G+ Y  +  S  P C+   +L++ +    L  F   +  A+  +   SYVS  G 
Sbjct: 669 ADVVLGMWYLAMSISALPLCDAGRVLKQPNALLGLYEFVVLIIEAYDKIFRQSYVSLVGQ 728

Query: 748 LLLLIVAITFVPSKLSRKKRAMIGVLHVSAHLAAALILMLLLELGVETCIQHKLLATSGE 807
           ++ + + I     ++ + KR ++G++H   H  AA+  + L+E   E        A+ G+
Sbjct: 729 IIFIAMCIGCAEEQMGQAKRFIVGLIHGMCHSLAAVSAVCLVECFCEYLSNVTPTASGGD 788

Query: 808 FF-ILVSFNSVTMNDCG 823
              IL S  ++ ++  G
Sbjct: 789 ALEILDSTEALIVSALG 805



 Score = 84.7 bits (208), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 66/180 (36%), Positives = 95/180 (52%), Gaps = 28/180 (15%)

Query: 256 GELGNDNGGSSDEISPIYSLW----ATFIGLYIANYVVER---------STGWALT-HPL 301
           G LG  + G  D +    SL      T + LY  +++V+R         S G  L  HP 
Sbjct: 142 GVLGVTSNGYCDYLDSPISLQLPVVITMLLLYGISWLVQRILGRRHGFLSAGPVLAKHPQ 201

Query: 302 SVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVF--DLLVSVTVFVGRFDMRMMQAAM 359
           + EE EK +    K     MV WYS     LF T +   + V + VF+GRFD+R + AA+
Sbjct: 202 AEEEAEKRRGPHPKLR-KAMVSWYS-----LFMTTYIPQVTVMLKVFMGRFDVRTLLAAL 255

Query: 360 NKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPH--IRVTRDDSV 417
            ++ E      L ++  S+KE+ WFDF AD GDG +SSY+V RL+AQP+  + V +D +V
Sbjct: 256 TREPEDT----LTFEDQSDKEETWFDFFADGGDGFDSSYTVGRLIAQPYLGVDVPKDAAV 311


>gi|389603734|ref|XP_003723012.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|322504754|emb|CBZ14538.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 1212

 Score =  220 bits (561), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 138/439 (31%), Positives = 212/439 (48%), Gaps = 67/439 (15%)

Query: 380 EDLWFDFMADTGDGGNSSYSVARLLAQPHIRVT--------------------------- 412
           +D+WFD+++D GDG N +Y++ARLLA+P +++T                           
Sbjct: 479 KDIWFDWISDVGDGFNPTYAMARLLARPFLKLTFYHPPSRRVGLSFFPTYEEVTPAKTPS 538

Query: 413 RDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKK---------- 462
            + +   LPRG  +L+GGDLAYP+P+  TY  RLF P+  A+      +           
Sbjct: 539 PNSAPDVLPRGSFVLVGGDLAYPSPNDETYTTRLFEPYHDAMSGNVRLQSVFHAEQQRVI 598

Query: 463 --------------------DHVAVNKPEVPSG--VPELKQYDGPQCYIIPGNHDWFDGL 500
                                 +A  +  + +G    E      P  + IPGNHDWFDGL
Sbjct: 599 VADASDADVAHVHLLDAETVSRMATGRAALRTGRATAEEALRSVPLLFAIPGNHDWFDGL 658

Query: 501 NTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKE 560
            TF ++I  ++W+GGW MPQ+ S+F L+LP  W++   D     DIDV Q  +F +++++
Sbjct: 659 TTFHKYILERTWIGGWLMPQRSSFFVLRLPHNWFILCGDTGNMQDIDVAQRNYFLDVIEK 718

Query: 561 QVGERDSVIIMTHEPNWLLDWYFNNVS-GKNVKHLICDYLKGRCKLRIAGDMHHYMRHSY 619
            +     VI+  HEP W+ D   +     +     +C+ L  R +LR+AGD+HHY RH  
Sbjct: 719 YMDVESCVILAAHEPGWVYDSMLHKPKLAQPELSKVCEALGTRLRLRLAGDIHHYSRH-- 776

Query: 620 VPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNIL 679
           VP D       L+V+G GGAFLH     S   +  GT Y    A+P   ++    L  +L
Sbjct: 777 VPVDASSEAATLVVSGGGGAFLHGARDDSIISQ--GTKYVRACAFPE-HNTLPAMLNRLL 833

Query: 680 KFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEH 739
            FR  NW+FD I G+  F++V S+ PQ   +  +R D  SG L +    V   +  V  +
Sbjct: 834 GFRVINWKFDLIIGVCSFLVVVSLLPQSIKD--VRCDLESGPLMTLPDAVAAWWERVCVY 891

Query: 740 SYVSFAGALLLLIVAITFV 758
               F+  +  L+  + F+
Sbjct: 892 IVTLFSKGIASLLATLGFL 910


>gi|157868483|ref|XP_001682794.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68126250|emb|CAJ03651.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 1213

 Score =  219 bits (559), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 138/428 (32%), Positives = 206/428 (48%), Gaps = 86/428 (20%)

Query: 360 NKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRV-------- 411
           + +++ A+ GD          D+WFD++AD GDG N +Y++ARLLA+P +++        
Sbjct: 469 SPERKEAEKGD---------TDIWFDWIADVGDGFNPTYAMARLLARPFLKIRWHRPPSK 519

Query: 412 ---------------------TRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPF 450
                                 RD   F LPRG  +L+GGDLAYP+P+  TY  RLF P+
Sbjct: 520 RVGLSFLPTFDDPTPTDTPTIDRDP--FALPRGSFVLVGGDLAYPSPNDETYTTRLFEPY 577

Query: 451 EYALQPPPWYKK------------------------------DHVAVNKPEVPSG--VPE 478
             A+      +                                 +A  +  + +G    E
Sbjct: 578 HDAMSSNVRLQSLFHAEQRRVVVADASDADVAHMHLLDAETVSRMATGRAALRTGHATAE 637

Query: 479 LKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGL 538
                 P  + IPGNHDWFDGL T+ ++I  ++W+GGW MPQ+ S+F L+LP  W+V   
Sbjct: 638 EALRSVPLLFAIPGNHDWFDGLTTYRKYILERTWIGGWLMPQRSSFFVLRLPHNWFVLCG 697

Query: 539 DLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVK-HLICD 597
           D     DIDV Q  +F +++++ +     V++  HEP W+ D   +       +   +  
Sbjct: 698 DTGNMQDIDVAQRNYFLDVIEKYMDAESCVVLAAHEPGWIYDSMLHKSELTQPELAKVSK 757

Query: 598 YLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLH---PTHVFSNFRKFY 654
            L  R +LR+AGD+HHY RH+  P D       L+V+G GGAFLH    T V S      
Sbjct: 758 ALGTRLRLRLAGDIHHYSRHT--PRDASSEAATLVVSGGGGAFLHGPRNTPVVSQL---- 811

Query: 655 GTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILR 714
            T Y    A+P+  ++    L  +L FR  NW+FDFI G+  F++V S+ PQ   +  +R
Sbjct: 812 -TAYRRACAFPA-RNTLPTLLSRLLGFRVINWKFDFIIGVFSFLVVVSLLPQSIKD--VR 867

Query: 715 EDSFSGHL 722
            DS S  L
Sbjct: 868 RDSESSPL 875


>gi|294868612|ref|XP_002765607.1| hypothetical protein Pmar_PMAR013671 [Perkinsus marinus ATCC 50983]
 gi|239865686|gb|EEQ98324.1| hypothetical protein Pmar_PMAR013671 [Perkinsus marinus ATCC 50983]
          Length = 1068

 Score =  219 bits (558), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 147/476 (30%), Positives = 233/476 (48%), Gaps = 56/476 (11%)

Query: 387 MADTGDGGNSSYSVARLLAQPHI----RVTRDDSV----FTLPRGDVLLIGGDLAYPNPS 438
           M   G GGN   + + + ++ H+    RV  D         LPR  V+  GGDLAYP PS
Sbjct: 345 MPILGHGGNRVRTDS-VSSETHLTGLQRVPSDSGTGHRRIMLPRASVVFHGGDLAYPVPS 403

Query: 439 AFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFD 498
              +  RL RP E+AL  PP+   + V   + +  + V      D PQ + IPGNHDW+D
Sbjct: 404 HKAFVNRLIRPLEWAL--PPY---EEVTTTESKSSAIV------DRPQFFAIPGNHDWYD 452

Query: 499 GLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELV 558
           GL  ++ ++  +  L GW +PQK SYFA++L  GWW++GLDL+L  D+D  Q+++F  L+
Sbjct: 453 GLEVYLHWLVGQDHLAGWKLPQKCSYFAVKLSHGWWIWGLDLSLSYDLDRPQYEYFCGLL 512

Query: 559 KE-QVGERDSVIIMTHEPNWLL-----DWYFNNVSGKNVKHLICDYL-----KGRCKLRI 607
              +V   D V+++TH PNW +      W   + +       +   L     + R  +R+
Sbjct: 513 DSGKVSTEDRVVVITHRPNWEMGTIDHGWSMTSATPSQRTGYLLSVLLDKIGEPRLAMRL 572

Query: 608 AGDMHHYMRHSYVPS-------------------DGPVYVQHLLVNGCGGAFLHPTHVFS 648
           AGD+HHY R  Y+PS                   DG   V  L+ +G  GAFLHPTH   
Sbjct: 573 AGDVHHYSR--YMPSVITFGCRRKSGGMISLFVQDGSKGVP-LVTSGGAGAFLHPTHFPP 629

Query: 649 NFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCE 708
             +      Y    +YP  + S R+   N ++FR++NW  D + G+ Y  +  S  P C+
Sbjct: 630 --KDILQKEYTRVESYPPEKVSRRLTWLNPIQFRRRNWGADVVLGMWYLAMSISALPLCD 687

Query: 709 LNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRA 768
              +L++ +    L  F   +  A+  +   SYVS  G ++ + + I     ++ + KR 
Sbjct: 688 AGRVLKQPNALLGLYEFVVLIIEAYDKIFRQSYVSLVGQIIFIAMCIGCAEEQMGQAKRF 747

Query: 769 MIGVLHVSAHLAAALILMLLLELGVETCIQHKLLATSGEFF-ILVSFNSVTMNDCG 823
           ++G++H   H  AA+  + L+E   E        A+ G+   IL S  ++ ++  G
Sbjct: 748 IVGLIHGMCHSLAAVSAVCLVECFCEYLSNVTPTASGGDALEILDSTEALIVSALG 803



 Score = 85.5 bits (210), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 65/172 (37%), Positives = 91/172 (52%), Gaps = 26/172 (15%)

Query: 256 GELGNDNGGSSDEISPIYSLW----ATFIGLYIANYVVER---------STGWALT-HPL 301
           G LG  + G  D +    SL      T + LY  +++V+R         S G  L  HP 
Sbjct: 142 GVLGVTSNGYCDYLDSPISLQLPVVITMLLLYGISWLVQRILGRRHGFLSAGPVLAKHPQ 201

Query: 302 SVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVF--DLLVSVTVFVGRFDMRMMQAAM 359
           + EE EK +    K     MV WYS     LF T +   + V + VF+GRFD+R + AA+
Sbjct: 202 AEEEAEKRRGPHPKLR-KAMVSWYS-----LFMTTYIPQVTVMLKVFMGRFDVRTLLAAL 255

Query: 360 NKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRV 411
            K+ E      L ++  S+KE+ WFDF AD GDG +SSY+V RL+AQP++ V
Sbjct: 256 TKEPEDT----LTFEDQSDKEETWFDFFADGGDGFDSSYTVGRLIAQPYLGV 303


>gi|294911727|ref|XP_002778050.1| hypothetical protein Pmar_PMAR018487 [Perkinsus marinus ATCC 50983]
 gi|239886171|gb|EER09845.1| hypothetical protein Pmar_PMAR018487 [Perkinsus marinus ATCC 50983]
          Length = 403

 Score =  214 bits (546), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 130/379 (34%), Positives = 189/379 (49%), Gaps = 95/379 (25%)

Query: 336 VFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGN 395
           +FD+++ + VF+G+FD R++ +   K+        + +  LS +ED+W DF+AD GDG +
Sbjct: 5   LFDIILMMKVFLGKFDARVLHSVQAKE--------ITFSDLSSREDVWLDFLADGGDGFD 56

Query: 396 SSYSVARLL-----------------------------AQPHIRV--------------- 411
           S+Y+++RLL                             A+P I                 
Sbjct: 57  STYTISRLLAQPHLSVGVPNDRALLSQARDALVSADGVARPRIATEVDELVPLSRKTIRS 116

Query: 412 ------TRDDS------------------VFTLPRGDVLLIGGDLAYPNPSAFTYERRLF 447
                  RDDS                     LPR DV+  GGD+AYP+PS   Y  R  
Sbjct: 117 LPTFNSVRDDSPVSVKGRGDGRFTLAHRNTLMLPRADVVFHGGDVAYPSPSNEEYVSRFI 176

Query: 448 RPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFI 507
           RP E+AL  P    +D+V                 + PQ +IIPGNHDW DGL TF+ +I
Sbjct: 177 RPLEWAL--PRINSEDNVK-------------DSVEQPQMFIIPGNHDWHDGLETFLHWI 221

Query: 508 CHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKE-QVGERD 566
            +   + GW +PQK SYFA++L  GWWV+GLDL L  D+D  Q+++F  L++  +V   D
Sbjct: 222 VYNQKVAGWKLPQKHSYFAVKLSYGWWVWGLDLGLSYDLDRPQYEYFCALLETGKVEADD 281

Query: 567 SVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPV 626
            V+++TH PNW+ D      +G  +  L+    + R  +R+AGD+HHY R  Y+P DG  
Sbjct: 282 RVVVLTHRPNWVFDPAVAERTGYTLNVLLEKIGEPRLAMRLAGDLHHYTR--YMPGDGSS 339

Query: 627 YVQHLLVNGCGGAFLHPTH 645
               L+ +G  GAFLHPTH
Sbjct: 340 GPP-LVTSGGAGAFLHPTH 357


>gi|294942176|ref|XP_002783414.1| hypothetical protein Pmar_PMAR006940 [Perkinsus marinus ATCC 50983]
 gi|239895869|gb|EER15210.1| hypothetical protein Pmar_PMAR006940 [Perkinsus marinus ATCC 50983]
          Length = 562

 Score =  214 bits (546), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 147/428 (34%), Positives = 215/428 (50%), Gaps = 93/428 (21%)

Query: 283 YIANYVVERSTGWALTHPLSVE----EYEKMKKKQLKPEFLD-MVPWYSGTSADLFKTVF 337
           ++   ++ R  G+    P+  +    E E  K++   P+    MV WYS     LF T +
Sbjct: 116 WLLQRILGRRHGFVAAGPVLAKYHQAEEEAEKRRGPDPKLRKAMVSWYS-----LFMTTY 170

Query: 338 --DLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGN 395
              + V + VF+GRFD+R + AA+ ++ EG     + ++  S+KE+ WFDF AD GDG +
Sbjct: 171 IPQVTVMLKVFMGRFDVRTLLAALTREPEGT----VTFEDQSDKEETWFDFFADGGDGFD 226

Query: 396 SSYSVARLLAQPH--IRVTRDDSV--------------------------FTLPRGD--- 424
           SSY+V RL+AQP+  + V +DD V                           TL R +   
Sbjct: 227 SSYTVGRLIAQPYLGVDVPKDDHVAEKVSKLTISGMRSMASSGEGNSSESLTLQRHNSIP 286

Query: 425 -----------------VLLI--------GGDLAYPNPSAFTYERRLFRPFEYALQPPPW 459
                            VLL         GGDLAYP PS   +  RL RP E+AL P   
Sbjct: 287 LLGHVGNRALKEFRHPIVLLATDCHLVFHGGDLAYPVPSHEAFVDRLIRPPEWALPP--- 343

Query: 460 YKKDHVAVNKPEVPSGVPELKQY-DGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFM 518
           Y+         E P+GV +     D PQ + IPGNHDW+DGL  ++ ++  +  L GW +
Sbjct: 344 YR---------EAPTGVSKASAVADQPQFFAIPGNHDWYDGLEVYLHWLVGQDHLAGWKV 394

Query: 519 PQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKE-QVGERDSVIIMTHEPNW 577
           PQK +YFA++L  GWW++GLDL+L  D+D  Q+++F  L+   +V   D V+++TH PNW
Sbjct: 395 PQKSTYFAVKLSHGWWIWGLDLSLSYDLDRPQYEYFCGLLDSGKVDTEDRVVVITHRPNW 454

Query: 578 LLDWYFNNVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCG 637
                 +  +G  V  L+    + R  +R+AGD HHY R+       P  + H  + G  
Sbjct: 455 DPCGVPSQRTGYLVSVLLDKIGEPRLGMRLAGDTHHYSRYM------PAVLPHPTLGGA- 507

Query: 638 GAFLHPTH 645
           GAFLHPTH
Sbjct: 508 GAFLHPTH 515


>gi|294942172|ref|XP_002783412.1| hypothetical protein Pmar_PMAR006938 [Perkinsus marinus ATCC 50983]
 gi|239895867|gb|EER15208.1| hypothetical protein Pmar_PMAR006938 [Perkinsus marinus ATCC 50983]
          Length = 389

 Score =  212 bits (539), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 130/371 (35%), Positives = 188/371 (50%), Gaps = 97/371 (26%)

Query: 343 VTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVAR 402
           + VF+G+FD R++ +A  ++        + +  LS +ED+W DF+AD GDG +S+Y+++R
Sbjct: 2   MKVFLGKFDARVLHSAHARE--------ITFSDLSGREDVWLDFLADGGDGFDSTYTISR 53

Query: 403 LLAQPHIRV--------------------------------------------------T 412
           LLAQPH+ V                                                   
Sbjct: 54  LLAQPHLSVEVPNDRALRAQARDALVSADGVARPRIATEVDELVPLSRKTIRSLPSFDSV 113

Query: 413 RDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEV 472
           RDDS  ++   DV+  GGD+AYP+PS   Y  R  RP E+AL  P    +D+V       
Sbjct: 114 RDDSPVSVKGRDVVFHGGDVAYPSPSNEEYVSRFIRPLEWAL--PRINSEDNVK------ 165

Query: 473 PSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKG 532
                     + PQ +IIPGNHDW DGL TF+ +I +   + GW +PQK SYFA++L  G
Sbjct: 166 -------DSVEQPQMFIIPGNHDWHDGLETFLHWIVYNQKVAGWKLPQKHSYFAVKLSHG 218

Query: 533 WWVFGLDLALHCDIDVYQFKFFAELVKE-QVGERDSVIIMTHEPNWLLDWYFNNVS---- 587
           WWV+GLDL L  D+D  Q+++F  L++  +V   D V+++TH PNW+ D     VS    
Sbjct: 219 WWVWGLDLGLSYDLDRPQYEYFCALLETGKVEADDRVVVLTHRPNWVFDPAVAEVSIFPL 278

Query: 588 ----------GKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVP---SDGPVYVQHLLVN 634
                     G  +  L+    + R  +R+AGD+HHY R  Y+P   S+GP     L+ +
Sbjct: 279 VCELRSLQRTGYTLNVLLEKIGEPRLAMRLAGDLHHYTR--YMPGNGSNGPP----LVTS 332

Query: 635 GCGGAFLHPTH 645
           G  GAFLHPTH
Sbjct: 333 GGAGAFLHPTH 343


>gi|442324603|ref|YP_007364624.1| hypothetical protein MYSTI_07668 [Myxococcus stipitatus DSM 14675]
 gi|441492245|gb|AGC48940.1| hypothetical protein MYSTI_07668 [Myxococcus stipitatus DSM 14675]
          Length = 611

 Score =  210 bits (534), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 133/376 (35%), Positives = 188/376 (50%), Gaps = 60/376 (15%)

Query: 321 MVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSE-- 378
           MV W     A L +T  D LV+  VF  R D R+++A +          D  +D+  E  
Sbjct: 51  MVRWLH--PAQLLRTGLDALVAA-VFGARADHRLIEAVVRPQ-------DPYFDYSQETS 100

Query: 379 -KEDLWFDFMADTGDGGNSSYSVARLLAQPH----IRVTRDDSVFTLPRGDVLLIGGDLA 433
            + D W D++ADTGDG +S+Y+VARLLA P     +R       F   RG VL++GGD  
Sbjct: 101 PEGDFWLDYVADTGDGWDSTYTVARLLALPELKLPVRGQPRAGEFDTQRGRVLVMGGDGV 160

Query: 434 YPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGN 493
           YP  S   YE RL +P+E A++             +   P+          P  +IIPGN
Sbjct: 161 YPGASREVYEERLIQPYEAAMR-------------RSSTPN----------PDLFIIPGN 197

Query: 494 HDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKF 553
           HDW+DGL+ FMR  C   W+ G    Q +SYF+L+LP+GWW+ G D+ L+ DIDV Q ++
Sbjct: 198 HDWYDGLSAFMRLFCANRWIAGRRTRQSRSYFSLKLPQGWWLIGTDVQLNSDIDVPQVEY 257

Query: 554 FAELVKEQVGERDSVIIMTHEPNWLL-------DWYFNNVSGKNVKHLICDYLKGRCKLR 606
           F + V +Q+G  D VI+   EP W+L         Y  N    N+++L    L  R  + 
Sbjct: 258 FRQ-VADQMGPEDRVILCNAEPAWILAATQRRKGSYLEN----NLEYLQEKVLGRRISIF 312

Query: 607 IAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPS 666
           +AGD+HHY RH           +  +  G GGAF+HPTH         G  Y  + ++P 
Sbjct: 313 LAGDLHHYRRHEDAAG------RQKITAGGGGAFMHPTHAPKAHVLRDG--YMLQKSFPD 364

Query: 667 FEDSSRIALGNILKFR 682
              S  +A  N+   R
Sbjct: 365 ERTSRSLARKNLFLIR 380


>gi|342179818|emb|CCC89292.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
          Length = 743

 Score =  208 bits (529), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 145/466 (31%), Positives = 221/466 (47%), Gaps = 60/466 (12%)

Query: 385 DFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYER 444
           D       GG      AR L+      T D   F LPR   ++IGGDLAYPNP+  TY  
Sbjct: 57  DRFTSPAGGGKLQAMSARSLSLSGFVPTTD---FVLPRASFVIIGGDLAYPNPTNETYRT 113

Query: 445 RLFRPFEYALQ--PP--PWYKK--DHVAVNKPE----------VPSGVPELKQYDG---- 484
           RL  P+  A++   P     KK  +H+ V  PE              V  + Q++     
Sbjct: 114 RLLEPYNDAIRRCTPLCEMVKKYYNHLVVQNPENKDEVHIRMLAARKVEAMTQHNKLTDE 173

Query: 485 -----------PQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGW 533
                      P  + IPGNHDW DGL TF +FI  +SWLGGWFMPQK SYF +QLP  W
Sbjct: 174 KITEEGVLRSIPMLFAIPGNHDWLDGLVTFKKFIIDESWLGGWFMPQKSSYFVIQLPYNW 233

Query: 534 WVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKH 593
           ++  +D     DID  Q  +F +  ++ + E   V++++HEP W+ +    +++      
Sbjct: 234 FILCMDTGNEADIDPSQRNYFLDYTEKTLDELSCVVLVSHEPGWVYEAMRTDLTSTMQPE 293

Query: 594 L--ICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLH---PTHVFS 648
           L  + + L  R +LR+ GD+HHY RH+  P+D       L+V+G GGAFLH    T V S
Sbjct: 294 LNRVVNALGTRLRLRLCGDIHHYSRHT--PTDPFSEAPVLVVSGGGGAFLHGSRNTPVIS 351

Query: 649 NFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCE 708
                 GT Y+  AA+P     +   L  ++ FR  NW+FD I G + F L+ S  P   
Sbjct: 352 Q-----GTEYKRVAAFPRHNHVTSF-LARLVGFRLINWKFDIIAGFMSFGLIASSLPLNM 405

Query: 709 LNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLS----R 764
            +  L+E      +   +    +     +E  + +F   ++ L VA+ F  S  S    R
Sbjct: 406 QDEKLKE------IVDIYALASSTLFRTVELFFFTFDKGIVSLFVALCFFASFFSLGSGR 459

Query: 765 KK---RAMIGVLHVSAHLAAALILMLLLELGVETCIQHKLLATSGE 807
           K    R +   L     + A+  ++ +++  +   + H+L++++ E
Sbjct: 460 KGLCFRVVYATLWTVLVVLASTGMLSIIQTTLAYMMNHQLVSSTKE 505


>gi|444913519|ref|ZP_21233669.1| hypothetical protein D187_05839 [Cystobacter fuscus DSM 2262]
 gi|444715643|gb|ELW56507.1| hypothetical protein D187_05839 [Cystobacter fuscus DSM 2262]
          Length = 654

 Score =  206 bits (524), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 129/368 (35%), Positives = 184/368 (50%), Gaps = 46/368 (12%)

Query: 320 DMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEK 379
           DMV W       L ++  D LV+ ++F  R D R+++A +       Q     Y   +  
Sbjct: 81  DMVRWLH--PQHLIRSGLDALVA-SIFGVRADHRLIEAVVRP-----QAPYFDYSDEAAG 132

Query: 380 EDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDS-VFTLPRGDVLLIGGDLAYPNPS 438
           ED W D+++DTGDG NS+Y+VARLLA+P + +   +     L RG +L+ GGD  YP  S
Sbjct: 133 EDFWLDYVSDTGDGWNSTYAVARLLAKPELELKDPNGDTHALERGRILVFGGDQVYPGAS 192

Query: 439 AFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFD 498
             TYE+RL +P+  A+               PE PS          P  + IPGNHDW+D
Sbjct: 193 RDTYEQRLVQPYREAMS------------RSPE-PS----------PHLFAIPGNHDWYD 229

Query: 499 GLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELV 558
           GL+ FMR  C   W  G    Q +SYFAL+LP+GWW+ G D+ L+ DIDV Q ++F +  
Sbjct: 230 GLSAFMRLFCADRWFAGRRTRQSRSYFALKLPRGWWLIGTDVQLNSDIDVPQLEYFRQTA 289

Query: 559 KE-QVGERDSVIIMTHEPNWLLDWYFNNVSG---KNVKHLICDYLKGRCKLRIAGDMHHY 614
              Q G+R  +I+   EP W+         G    N+++L       R  L +AGD+HHY
Sbjct: 290 ASMQPGDR--IILCNAEPAWIHAATTPRPRGYMENNLEYLQEKVFGRRISLFLAGDLHHY 347

Query: 615 MRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIA 674
            RH           +  +  G GGAF+HPTH     R   G+  E K  +P  + S  + 
Sbjct: 348 KRHEDAAG------RQKITAGGGGAFMHPTHAPQAQRLRCGS--EQKKCFPDEKTSRELT 399

Query: 675 LGNILKFR 682
             N++  R
Sbjct: 400 RQNLMLIR 407


>gi|320162669|gb|EFW39568.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 926

 Score =  202 bits (513), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 142/419 (33%), Positives = 203/419 (48%), Gaps = 71/419 (16%)

Query: 384 FDFMADTGDGGNSSYSVARLLAQPHIRV--TRDDSVFTLPRGDVLLIGGDLAYPNPSAFT 441
            DF+ADTGDG NS++++A LLAQP + V   +      L RG VL++GGDL YP+PS   
Sbjct: 323 LDFVADTGDGWNSTFTIATLLAQPRLDVLDPKHGEYICLERGKVLVLGGDLCYPDPSPQN 382

Query: 442 YERRLFRPFEYALQPPPWYKKDHVA---VNKPEVPSGVPELKQYDGPQCYIIPGNHDWFD 498
           Y+ R   PF +AL    W +KD  +   + +  + S          P  + IPGNHDW D
Sbjct: 383 YKSRFVEPFRHAL----WPEKDFRSGFDIERRHITSST-------HPHAFAIPGNHDWLD 431

Query: 499 GLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELV 558
           GL  F R +C    LGGW +PQ++SYFALQLP GWW+FG+D  L  DID  QF++F   V
Sbjct: 432 GLVAFRRVMCSGHRLGGWVLPQRRSYFALQLPDGWWLFGIDDQLTYDIDDGQFRYFRN-V 490

Query: 559 KEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHS 618
              +   D VI+  H P W+LD +   V    +  L    L  + KL +AGD+H Y RH 
Sbjct: 491 AAHMAPTDRVIVAMHRPFWILDGHPREV----LSVLFDKALGDKLKLILAGDLHFYSRHE 546

Query: 619 YVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNI 678
               DG +   +++V G GG                   Y+ +  +PS   S ++ L  +
Sbjct: 547 R--QDGKI---NMIVAGGGG------------------KYDLRNVFPSAAISRKLRLQGL 583

Query: 679 LKFRKKNWQFDFIGGIVYFVLV-FSMFPQCELNHI---LREDSFSGHLRSFF-------- 726
           L F  KN+ F  +  ++Y  +V F  +   E  H+    +E S S  L            
Sbjct: 584 L-FLPKNYSFGLVTAVLYLFMVGFLPWTLWEPPHVPASAKEPSVSSQLPLHMPFSSPFTF 642

Query: 727 ------GTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHL 779
                 G +W++  + L   ++   G L   +          SR    +IG+LHVS HL
Sbjct: 643 FTFLLQGILWSSLSFSL---FLVLWGGLSFFVDV-----GHRSRFWSVLIGLLHVSVHL 693


>gi|261326024|emb|CBH08850.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 1118

 Score =  201 bits (512), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 115/321 (35%), Positives = 170/321 (52%), Gaps = 38/321 (11%)

Query: 418 FTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQ--------PPPWYKK------D 463
           F+LPR   +++GGDLAYPNP+  TY  RL  P+  AL+           WY +      D
Sbjct: 471 FSLPRASFVVVGGDLAYPNPTNETYRTRLLEPYNNALRCCAPLCKLVKKWYNRLVVPEED 530

Query: 464 HVAVNKPEV--PSGVPELKQ---------------YDGPQCYIIPGNHDWFDGLNTFMRF 506
           +  V +  +   S V E+ Q               +  P  + IPGNHDW DGL TF +F
Sbjct: 531 NKDVARIHMLSASKVSEMTQRRDIADMCLGEDEVVHSTPLLFAIPGNHDWLDGLVTFKKF 590

Query: 507 ICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERD 566
           I  +SW+GGWFMPQK S+F + LP  W++  +D     DID  Q  +F   ++E +    
Sbjct: 591 IIDESWMGGWFMPQKSSFFVINLPYNWFLLCVDTGSVTDIDPGQRNYFLNYIEEHLDVSS 650

Query: 567 SVIIMTHEPNWLLDWYFNNVSG--KNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDG 624
            VI+++HEP W+ +    N++   +   H + D L  R ++R+ GD+HHY RH+  P+D 
Sbjct: 651 CVILISHEPGWIYEAMNTNLTSTMQPELHRVVDALGTRLRMRLCGDIHHYSRHT--PTDA 708

Query: 625 PVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKK 684
                 L+V+G GGAFLH     +N   + GT Y+ +AA+P  ++     L  ++ FR  
Sbjct: 709 LSEAPVLVVSGGGGAFLHGAR--NNTVIYQGTEYKREAAFPR-DNHVTSFLTRLVGFRLI 765

Query: 685 NWQFDFIGGIVYFVLVFSMFP 705
           NW+FD I G + F L+ S  P
Sbjct: 766 NWKFDIIAGFMCFGLITSSLP 786



 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 24/40 (60%), Positives = 32/40 (80%)

Query: 380 EDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFT 419
           E++WFDF+AD GDG NS+Y +ARL+AQP +R+   DSV T
Sbjct: 374 ENVWFDFVADVGDGFNSTYEMARLMAQPFLRLASSDSVQT 413


>gi|115504031|ref|XP_001218808.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|83642290|emb|CAJ16040.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 1118

 Score =  201 bits (512), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 115/321 (35%), Positives = 170/321 (52%), Gaps = 38/321 (11%)

Query: 418 FTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQ--------PPPWYKK------D 463
           F+LPR   +++GGDLAYPNP+  TY  RL  P+  AL+           WY +      D
Sbjct: 471 FSLPRASFVVVGGDLAYPNPTNETYRTRLLEPYNNALRCCAPLCKLVKKWYNRLVVPEED 530

Query: 464 HVAVNKPEV--PSGVPELKQ---------------YDGPQCYIIPGNHDWFDGLNTFMRF 506
           +  V +  +   S V E+ Q               +  P  + IPGNHDW DGL TF +F
Sbjct: 531 NKDVARIHMLSASKVSEMTQRRDIADMCLGEDEVVHSTPLLFAIPGNHDWLDGLVTFKKF 590

Query: 507 ICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERD 566
           I  +SW+GGWFMPQK S+F + LP  W++  +D     DID  Q  +F   ++E +    
Sbjct: 591 IIDESWMGGWFMPQKSSFFVINLPYNWFLLCVDTGSVTDIDPGQRNYFLNYIEEHLDVSS 650

Query: 567 SVIIMTHEPNWLLDWYFNNVSG--KNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDG 624
            VI+++HEP W+ +    N++   +   H + D L  R ++R+ GD+HHY RH+  P+D 
Sbjct: 651 CVILISHEPGWIYEAMNTNLTSTMQPELHRVVDALGTRLRMRLCGDIHHYSRHT--PTDA 708

Query: 625 PVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKK 684
                 L+V+G GGAFLH     +N   + GT Y+ +AA+P  ++     L  ++ FR  
Sbjct: 709 LSEAPVLVVSGGGGAFLHGAR--NNTVIYQGTEYKREAAFPR-DNHVTSFLTRLVGFRLI 765

Query: 685 NWQFDFIGGIVYFVLVFSMFP 705
           NW+FD I G + F L+ S  P
Sbjct: 766 NWKFDIIAGFMCFGLITSSLP 786



 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 24/40 (60%), Positives = 32/40 (80%)

Query: 380 EDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFT 419
           E++WFDF+AD GDG NS+Y +ARL+AQP +R+   DSV T
Sbjct: 374 ENVWFDFVADVGDGFNSTYEMARLMAQPFLRLASSDSVQT 413


>gi|405351722|ref|ZP_11023140.1| Hypothetical protein A176_5608 [Chondromyces apiculatus DSM 436]
 gi|397093023|gb|EJJ23755.1| Hypothetical protein A176_5608 [Myxococcus sp. (contaminant ex DSM
           436)]
          Length = 607

 Score =  201 bits (511), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 137/409 (33%), Positives = 198/409 (48%), Gaps = 59/409 (14%)

Query: 290 ERSTGWALTHPLSVEEYE--KMKKKQLKPEFL---DMVPWYSGTSADLFKTVFDLLVSVT 344
           ER     +T P +V   E   +  ++L        DMV W     A L +T  D +V+  
Sbjct: 11  EREPEREVTRPYAVSSREGPPLSDQRLAARVAKRADMVRWLH--PAQLLRTGLDAVVAA- 67

Query: 345 VFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLL 404
           VF  R D R+++A +        + D   + + E    W D+++DTGDG NS+Y+VARLL
Sbjct: 68  VFGARADQRLIEAVVRPQ---VPYFDYSQEPMPEG-GFWLDYVSDTGDGWNSTYAVARLL 123

Query: 405 AQPHIRVTRDDSVFTLP----RGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWY 460
           A P + ++        P    RG +L+ GGD  YP  S   Y+ RL +P+E A+      
Sbjct: 124 ALPELALSVRGEPQAEPHDTRRGRLLIFGGDGVYPGASREIYDERLVQPYEAAM------ 177

Query: 461 KKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQ 520
           ++ H        PS          P  ++IPGNHDW+DGL  F+R  C   W+ G    Q
Sbjct: 178 RRSHA-------PS----------PDLFVIPGNHDWYDGLGAFLRLFCASRWIAGRRTRQ 220

Query: 521 KKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLL- 579
            +SYFAL+LP+ WW+ G D+ L+ DIDV Q ++F E V   +G  D +I+   EP W+L 
Sbjct: 221 SRSYFALKLPQRWWLIGTDVQLNSDIDVPQVEYFRE-VASHMGPEDRIILCNAEPAWILA 279

Query: 580 ------DWYFNNVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLV 633
                   Y  N    N+++L       R  + +AGD+HHY RH           +  + 
Sbjct: 280 ATERRKGSYLEN----NLEYLQEKVFGRRINVFLAGDLHHYRRHEDAGG------RQKIT 329

Query: 634 NGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFR 682
            G GGAFLHPTH  S      G T +   ++P    S R+A  N+L  R
Sbjct: 330 AGGGGAFLHPTHAPSAHVLRDGYTLQK--SFPDEWTSRRLARQNLLLIR 376


>gi|149276881|ref|ZP_01883024.1| hypothetical protein PBAL39_15914 [Pedobacter sp. BAL39]
 gi|149232550|gb|EDM37926.1| hypothetical protein PBAL39_15914 [Pedobacter sp. BAL39]
          Length = 586

 Score =  199 bits (507), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 133/399 (33%), Positives = 203/399 (50%), Gaps = 51/399 (12%)

Query: 321 MVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKE 380
           M  W+  +   L       L+S T F    D R +QAA++KD +      L    +++++
Sbjct: 11  MTNWFQPSM--LLNVALKSLISGT-FGNYADRRELQAALSKDSD-EDSCKLRKRLVTDRQ 66

Query: 381 DLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDD---SVFTLPRGDVLLIGGDLAYPNP 437
           ++W DF+ DTGDG +S+YSVA+  AQ  + V  +D      T  RGD+L++GGD  YP P
Sbjct: 67  EIWIDFICDTGDGFDSTYSVAKKAAQRQLVVQTEDRKSERVTTTRGDILILGGDEVYPFP 126

Query: 438 SAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWF 497
           +   Y  +   PFE              A +K EV      L + + P  + IPGNHDW+
Sbjct: 127 TLDNYTTKFKVPFE-------------AAGDKNEV------LSELNRPLLFAIPGNHDWY 167

Query: 498 DGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAEL 557
           DGL  FM+  C +  +G W   QK+SYFA+ LP  +W++  D+ L+ +ID  Q  +F  +
Sbjct: 168 DGLGNFMKLFCQQRSIGIWRTVQKRSYFAIPLPNNYWIWATDIQLNSNIDQPQLDYFTRM 227

Query: 558 VKEQVGERDSVIIMTHEPNWLL------DWYFNNVSGKNVKHLICDYL----KGRCKLRI 607
            +E++ + D +I++T EP W+       D  F+ +     +H+  ++     K +  + I
Sbjct: 228 AREEMQDGDKIILVTAEPAWVYKEIRKNDQSFDRLDFFISRHIRDEHQLIGKKFKLAINI 287

Query: 608 AGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTH-------VFSNFRKFYGTTYES 660
            GD+HHY R+S    +G  Y+     +G GGAFLH TH         +  R     T   
Sbjct: 288 TGDLHHYSRYSQ-KENGHQYI----TSGGGGAFLHLTHNLPPVLDAIAGDRAQEKIT--R 340

Query: 661 KAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVL 699
           KA +PS  DS R+ LGN+L F  KN  F  +  +VY  L
Sbjct: 341 KAIFPSVSDSKRLLLGNLL-FPFKNPAFISLACLVYVYL 378


>gi|162450904|ref|YP_001613271.1| calcineurin-like phosphoesterase [Sorangium cellulosum So ce56]
 gi|161161486|emb|CAN92791.1| Calcineurin-like phosphoesterase family protein [Sorangium
           cellulosum So ce56]
          Length = 591

 Score =  199 bits (506), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 134/393 (34%), Positives = 198/393 (50%), Gaps = 51/393 (12%)

Query: 321 MVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHG-DLLYDHLSEK 379
           M  WYS     L ++  D+ VS  +   R D R+++A       G Q   D    +   +
Sbjct: 18  MTCWYS--PFQLARSALDMTVS-RLMGARGDFRLLEAIA-----GPQPPFDYAVRNGVMR 69

Query: 380 EDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSA 439
           ++LW D++ADTGDG N +Y++A LLA+P + +   ++     RG+VL++GGD  YP+ + 
Sbjct: 70  DELWIDYVADTGDGFNPTYAIASLLARPVLSLGGRETR----RGEVLIMGGDQVYPSATR 125

Query: 440 FTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDG 499
             Y  RL  P+  AL                      P+    D P  + IPGNHDW+DG
Sbjct: 126 QAYWTRLVEPYAAAL----------------------PKADVTDPPHLFAIPGNHDWYDG 163

Query: 500 LNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVK 559
           L +FMR    K  LG W   Q +SYFAL+LP G+W+ G+D+ L  DID  Q ++F +L  
Sbjct: 164 LMSFMRLFGQKRTLGAWKTFQSRSYFALKLPHGFWLLGVDIQLESDIDQPQIEYFCKLAT 223

Query: 560 EQVGERDSVIIMTHEPNWLLDWYFNNVSGKNV---KHLICDYLKG-RCKLRIAGDMHHYM 615
             + + D VI+ T EP+W+    +N     N+   +  + D   G     RIAGD+HHY 
Sbjct: 224 HDMRDGDRVILCTAEPDWVKGAIYNPELQSNLAFFEKQLADARPGVEIVARIAGDLHHYR 283

Query: 616 RHSYVPSDGPVYVQHLLVNGCGGAFLHPTH--VFSNFRKFYGT---TYESKAAYPSFEDS 670
           RH    +DG    +  ++ G GGAFLHPTH    +  R   G     Y  +A++PS  +S
Sbjct: 284 RHES--ADG----RQNIIAGGGGAFLHPTHGEPVTVVRSGAGADQRPYMLRASFPSESES 337

Query: 671 SRIALGNILKFRKKNWQFDFIGGIVYFVLVFSM 703
            R+A  N L F + N  F      VY +   ++
Sbjct: 338 RRLAWRN-LSFARHNLGFWPAAAFVYMLSSLTL 369


>gi|343087017|ref|YP_004776312.1| metallophosphoesterase [Cyclobacterium marinum DSM 745]
 gi|342355551|gb|AEL28081.1| metallophosphoesterase [Cyclobacterium marinum DSM 745]
          Length = 578

 Score =  199 bits (505), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 130/364 (35%), Positives = 185/364 (50%), Gaps = 57/364 (15%)

Query: 314 LKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRF-DMRMMQAAMNKDQEGAQHGDLL 372
           +K E   MV WY       F +V  +L SV    G F D R +QAA+++D +        
Sbjct: 1   MKFERKPMVNWYDPKQL-AFTSVKTVLSSV---FGNFADRRELQAALDQDCKP------- 49

Query: 373 YDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDL 432
           +D+ S+KE LWFD+++D GDG NS+Y++A LLA+  + +        L RGDVL++GGD 
Sbjct: 50  FDY-SKKEALWFDYISDLGDGFNSTYTIASLLAREQLELRGK----ALKRGDVLIMGGDE 104

Query: 433 AYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPG 492
            YP P    Y+ RL  P+  A      + KD  A+ +P+V               + IPG
Sbjct: 105 VYPTPENIEYDNRLRGPYTAA------FPKDEKAIERPDV---------------FAIPG 143

Query: 493 NHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFK 552
           NHDW+DGL  F+R    K  LG W   Q +SYFAL+LP  +WV  +D+ L+ DID  Q  
Sbjct: 144 NHDWYDGLTNFLRLFTQKRSLGNWKTQQNRSYFALKLPYDYWVIAIDIQLNADIDFPQIC 203

Query: 553 FFAELVKEQVGERDSVIIMTHEPNWLLDWY-FNNVSGKNVKHLICDYLKGR--------- 602
           FF ++ KE       VI+ T EP+W+   +   N S   ++  I   L G+         
Sbjct: 204 FFKKIAKEHFNPNSKVILCTSEPSWVYKSFDTKNNSFDRLQFFIDRVLLGQGEKDYEEKN 263

Query: 603 ----CKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTY 658
                +  + GD+HHY R+  V  +      H +  G GGAF+HPTH  S   +  G T+
Sbjct: 264 KSVNIEAILTGDLHHYARYETVKDEAKPC--HFITAGGGGAFMHPTHTLS--EEIIG-TH 318

Query: 659 ESKA 662
           E KA
Sbjct: 319 ERKA 322


>gi|108760965|ref|YP_635183.1| hypothetical protein MXAN_7070 [Myxococcus xanthus DK 1622]
 gi|108464845|gb|ABF90030.1| conserved hypothetical protein [Myxococcus xanthus DK 1622]
          Length = 611

 Score =  199 bits (505), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 134/395 (33%), Positives = 194/395 (49%), Gaps = 61/395 (15%)

Query: 300 PLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAM 359
           P+S +      +KQ      DMV W     A L +T  D +V+  VF  R D R+++A +
Sbjct: 35  PVSDQSMAAHPRKQA-----DMVRWLH--PAQLLRTGLDAVVAA-VFGARADQRLIEAVV 86

Query: 360 NKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVT-----RD 414
                   + D   + + E    W D+++DTGDG NS+Y+VARLLA P + ++       
Sbjct: 87  RPQ---VPYFDYSQEPMPEG-GFWLDYVSDTGDGWNSTYAVARLLALPELTLSVQGQPHA 142

Query: 415 DSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPS 474
           DS  T  RG +L+ GGD  YP  S   Y+ RL +P+E A++             +   PS
Sbjct: 143 DSHATQ-RGRLLVHGGDGVYPGASREVYDERLVQPYEAAMR-------------RSHAPS 188

Query: 475 GVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWW 534
                     P  ++IPGNHDW+DGL  F+R  C   W+ G    Q +SYFAL+LP+ WW
Sbjct: 189 ----------PDLFVIPGNHDWYDGLGAFLRLFCANRWIAGRRTRQSRSYFALKLPQRWW 238

Query: 535 VFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLL-------DWYFNNVS 587
           + G D+ L+ DIDV Q ++F E V   +G  D +I+   EP W+L         Y  N  
Sbjct: 239 LIGTDVQLNSDIDVPQVEYFRE-VASHMGPDDRIILCNAEPAWILAATERRKGSYLEN-- 295

Query: 588 GKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVF 647
             N+++L       R  + +AGD+HHY RH           +  +  G GGAFLHPTH  
Sbjct: 296 --NLEYLQEKVFGRRINVFLAGDLHHYRRHEDAGG------RQKITAGGGGAFLHPTHAP 347

Query: 648 SNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFR 682
           S      G T +   ++P    S ++A  N+L  R
Sbjct: 348 SAHVLRDGYTLQK--SFPDERTSRKLARQNLLLIR 380


>gi|399002879|ref|ZP_10705556.1| hypothetical protein PMI21_04168 [Pseudomonas sp. GM18]
 gi|398123873|gb|EJM13404.1| hypothetical protein PMI21_04168 [Pseudomonas sp. GM18]
          Length = 658

 Score =  196 bits (499), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 144/441 (32%), Positives = 210/441 (47%), Gaps = 56/441 (12%)

Query: 381 DLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAF 440
           D+W D++ADTGDG +S+YS+A  ++Q  +++       TLPRGDVLL+GGD  YP P+  
Sbjct: 97  DVWIDYLADTGDGWDSTYSMALCVSQ-DVKLPE----LTLPRGDVLLLGGDQVYPTPAQS 151

Query: 441 TYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGL 500
            Y  R   PF  A   P       V  N+ + P     + Q   P     PGNHDW+DGL
Sbjct: 152 GYRTRFLDPFRAAFSAP----VPKVRPNEQDQP-----VPQPGAPWMVATPGNHDWYDGL 202

Query: 501 NTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAEL-VK 559
             F +  C +  +GGW   Q+ SY+ LQLP GWW++GLDL L   ID  Q ++F E+  K
Sbjct: 203 RGFSQLFCEQKPIGGWETRQRTSYYVLQLPNGWWIWGLDLQLESLIDRQQKQYFEEMRAK 262

Query: 560 EQVGERDSVIIMTHEPNWLLD-----------------WYFNNVSGKNVKHLICDYLKGR 602
            Q G+R  VI+ T EP+W+ +                       S + ++ L+ D+L   
Sbjct: 263 LQPGDR--VILCTPEPSWVDEAERLARVGSKTLPSIETQTLRFSSLREIEQLLGDHL--- 317

Query: 603 CKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRK---FYGTT-- 657
             + +AGD HHY R  Y P  G    Q +   G GGAFLH TH   +  +     GT   
Sbjct: 318 -AVVLAGDSHHYAR--YQPKAGSQAPQRITCGG-GGAFLHGTHQLPDPPEPINVGGTRQH 373

Query: 658 YESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDS 717
           YE  A YP  + S ++      +    N  F  +  I+Y +  + +    ++ H  R++ 
Sbjct: 374 YELAATYPDKKTSEQLR-NRAWRLPTHNLSFCGMLAILYLLFDWMVESASKMPHPARDNR 432

Query: 718 -----FSGHLRSF--FGTVWNAFMYVLEH--SYVSFAGALLLLIVAITFVPSKLSRKKRA 768
                 SG   S      VW   + VL H  S V  A  ++L    ++    K +RK   
Sbjct: 433 SLIEVLSGLEASIPNLREVWRQLVLVLAHSPSSVMLAVTIVLGCAVLSAAGVKRTRKLAY 492

Query: 769 MIGVLHVSAHLAAALILMLLL 789
            +G +H   HL  A+ L+ L+
Sbjct: 493 GVGAVHGLLHLGLAIGLLWLM 513


>gi|338531859|ref|YP_004665193.1| hypothetical protein LILAB_11020 [Myxococcus fulvus HW-1]
 gi|337257955|gb|AEI64115.1| hypothetical protein LILAB_11020 [Myxococcus fulvus HW-1]
          Length = 611

 Score =  194 bits (492), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 133/404 (32%), Positives = 191/404 (47%), Gaps = 71/404 (17%)

Query: 300 PLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAM 359
           PLS        +K+      DMV W     A L +T  D +V+  VF  R D R+++A +
Sbjct: 35  PLSDRSLAARPQKRA-----DMVRWLH--PAQLLRTGLDAVVAA-VFGARADQRLIEAVV 86

Query: 360 NKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVAR----------LLAQPHI 409
                   + D   + + E    W D+++DTGDG NS+Y+VAR          +  QPH 
Sbjct: 87  RPQ---VPYFDYSQEPMPEG-GFWLDYVSDTGDGWNSTYAVARLLALPELELAVQGQPHA 142

Query: 410 RVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNK 469
             + D       RG +L+ GGD  YP  S   Y+ RL +P+E A++             +
Sbjct: 143 E-SHDTQ-----RGRLLIFGGDGVYPGASREVYDERLVQPYEAAMR-------------R 183

Query: 470 PEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQL 529
            + PS          P  ++IPGNHDW+DGL  F+R  C   W+ G    Q +SYFAL+L
Sbjct: 184 SQAPS----------PDLFVIPGNHDWYDGLGAFLRLFCANRWIAGRRTRQSRSYFALKL 233

Query: 530 PKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLL-------DWY 582
           P+ WW+ G D+ L+ DIDV Q ++F E V   +G  D +I+   EP W+L         Y
Sbjct: 234 PQRWWLIGTDVQLNSDIDVPQVEYFRE-VASHMGPDDRIILCNAEPAWILAATERRKGTY 292

Query: 583 FNNVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLH 642
             N    N+++L       R  + +AGD+HHY RH           +  +  G GGAFLH
Sbjct: 293 LEN----NLEYLQEKVFGRRINVFLAGDLHHYRRHEDAGG------RQKITAGGGGAFLH 342

Query: 643 PTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKKNW 686
           PTH  S      G T +   ++P    S R+A  N+L  R   W
Sbjct: 343 PTHAPSAHVLRDGYTLQK--SFPDEWTSRRLARQNLLLIRHSPW 384


>gi|375011402|ref|YP_004988390.1| putative phosphohydrolase [Owenweeksia hongkongensis DSM 17368]
 gi|359347326|gb|AEV31745.1| putative phosphohydrolase [Owenweeksia hongkongensis DSM 17368]
          Length = 560

 Score =  193 bits (491), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 119/393 (30%), Positives = 197/393 (50%), Gaps = 47/393 (11%)

Query: 322 VPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKED 381
           V WY      L  T    ++S  +F    D R +QAA+ + ++   H     D+   ++D
Sbjct: 9   VEWYD--PKQLANTGIKAVIS-GIFGNFNDKREIQAALYQQEDSKAH-----DYSQGRDD 60

Query: 382 LWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFT 441
           +W D+++DTGDG ++++++A LLA+  + V        +PRG +L++GGD  YP  +   
Sbjct: 61  IWVDYISDTGDGFDATFTMATLLAKEELEVDGQ----KIPRGKLLIMGGDQVYPVATREE 116

Query: 442 YERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLN 501
           Y  RL  P+  AL                  P+   +      P  + IPGNHDW+DGL 
Sbjct: 117 YRNRLQGPYATAL------------------PADNTDNNGDRAPHLFAIPGNHDWYDGLT 158

Query: 502 TFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQ 561
           TF++  C + W+G W   QK+SYFAL+LP   W+FG+D+ L+ D+D  Q ++F  ++KE+
Sbjct: 159 TFIKVFCQQRWIGNWRTRQKRSYFALKLPHNMWLFGIDVQLNSDVDFNQIQYFENVLKEE 218

Query: 562 VGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHL---ICDYLKGRCK------LRIAGDMH 612
           V +   +I+ T EP W+      + +  N++     +C       +      L +AGD H
Sbjct: 219 VKQGGKIILCTAEPTWVYSTSKKSDANNNLEFFEKKLCAINDSTAQPYAKQILTLAGDWH 278

Query: 613 HYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVF-SNFRKFYGTTYESKAAYPSFEDSS 671
           HY R  Y   +G + +      G GGAFLHPT     +    +G     K+ +PS  +S 
Sbjct: 279 HYAR--YENENGGMKI----TAGGGGAFLHPTQNLPEHIGDIFGGDLILKSRFPSSGESK 332

Query: 672 RIALGNILKFRKKNWQFDFIGGIVYFVLVFSMF 704
           ++   N  KF   N++   + G +Y ++ + +F
Sbjct: 333 KLLFNN-FKFPFANFKMSLVLGAIYALVGWLLF 364


>gi|440747910|ref|ZP_20927165.1| hypothetical protein C943_4169 [Mariniradius saccharolyticus AK6]
 gi|436483652|gb|ELP39692.1| hypothetical protein C943_4169 [Mariniradius saccharolyticus AK6]
          Length = 573

 Score =  193 bits (490), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 152/499 (30%), Positives = 220/499 (44%), Gaps = 70/499 (14%)

Query: 314 LKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLY 373
           +K E   MV WY      L  T    +VS TVF    D R MQAA++        G   Y
Sbjct: 1   MKFERKPMVDWYD--PKQLAATGVKTVVS-TVFGNFADRREMQAALDP-------GKAYY 50

Query: 374 DHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLA 433
           D  S++ D W DF++D GDG N ++++A LLAQ  + V  +       RG++L++GGD  
Sbjct: 51  D-FSDRGDFWLDFISDLGDGFNPTFTLAHLLAQEKLVVDGN----VTKRGNILVMGGDQV 105

Query: 434 YPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGN 493
           YP P    Y  RL  P+             H A  K        +    D P  ++IPGN
Sbjct: 106 YPTPEMDEYRNRLQGPY-------------HAAFPK--------KSDDKDPPSLFVIPGN 144

Query: 494 HDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKF 553
           HDW+DGL  F++  C    LG W   Q +SYFA++LP  +W+ G+D+ L+ DIDV Q K+
Sbjct: 145 HDWYDGLTNFLKIFCQGRSLGNWRTEQTRSYFAIKLPHRYWLLGIDIQLNSDIDVPQLKY 204

Query: 554 FAELVKE-QVGERDSVIIMTHEPNWLLDWY-FNNVSGKNVKHLICDYLKG---------- 601
           F  +     +   D VI+ T EP W+ + +   N S K +K  +   ++G          
Sbjct: 205 FQNVAAHPDMQPGDKVILATAEPAWVYESFDEKNSSNKRLKFFVDRIIRGAKDDPDDPTG 264

Query: 602 ---------RCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFS-NFR 651
                    +    I GD+HHY R+     DG    + L+  G GGAF+H TH      +
Sbjct: 265 FYNGKNKDIQITTIITGDLHHYSRYLETLPDG--NERQLITAGGGGAFMHTTHSLKPEIK 322

Query: 652 KFYGTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNH 711
           K  G     KA++PS  DS      N+L F    W    +G +      F        N 
Sbjct: 323 KSEGFDARFKASFPSKSDSLNQNTKNLLFFWYGPWMVLILGLVHALTFYFLKVSPAGTNR 382

Query: 712 ILREDSFSGHLRSFFGTVWNAF-MYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMI 770
            L    F       F  +W+   M +     + F     + I     V S   +K  +++
Sbjct: 383 SLSPPGF------VFADLWDVIGMSITTPMVLIFHLLFAVGIWQFADVKSGNHKKLNSLV 436

Query: 771 GVLHVSAHLAAALILMLLL 789
           G+LH   H+   LI  LL+
Sbjct: 437 GLLHGLGHV---LIFNLLV 452


>gi|398906189|ref|ZP_10653322.1| hypothetical protein PMI30_05252 [Pseudomonas sp. GM50]
 gi|398173571|gb|EJM61403.1| hypothetical protein PMI30_05252 [Pseudomonas sp. GM50]
          Length = 654

 Score =  191 bits (485), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 144/439 (32%), Positives = 209/439 (47%), Gaps = 50/439 (11%)

Query: 381 DLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAF 440
           DLW D++ADTGDG +S+YS+A  ++Q    V      FTLPRGDVLL+GGD  YP P+  
Sbjct: 95  DLWIDYLADTGDGWDSTYSMALCVSQ---EVKLPQYPFTLPRGDVLLLGGDQVYPTPARS 151

Query: 441 TYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGL 500
            Y  R   PF  A  P P  K   V  N  + P     + Q   P     PGNHDW+DGL
Sbjct: 152 GYRTRFLDPFRAAF-PAPVPK---VRPNDQDQP-----VSQPGAPWIVATPGNHDWYDGL 202

Query: 501 NTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAEL-VK 559
             F +  C +  +GGW   Q+ SY+ LQLP GWWV+GLDL L   ID  Q ++F E+  K
Sbjct: 203 RGFSQLFCEQKPIGGWETRQRTSYYVLQLPNGWWVWGLDLQLESMIDREQKRYFKEMRAK 262

Query: 560 EQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHL---------------ICDYLKGRCK 604
            Q G+R  VI+   EP+W+ +     ++ +  K L               I + L     
Sbjct: 263 LQPGDR--VILCAPEPSWVDE--AERLAREESKALPSIETQTPRFRSLREIEELLGDHLA 318

Query: 605 LRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRK---FYGTT--YE 659
           + +AGD HHY R  Y P  G    Q +   G GGAFL+ TH   +  K     GT   Y+
Sbjct: 319 VVLAGDSHHYAR--YQPKAGTQAPQRITCGG-GGAFLNGTHQLPDPPKPINVGGTRQHYD 375

Query: 660 SKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILRED-SF 718
             A YP  + S ++      +   +N  F  +  I+Y +  + +    ++ H  R++ S 
Sbjct: 376 LAAVYPDKKTSEQLR-NRAWRLPTRNLSFCGMLAILYLLFDWMVQSASKVPHPARDNRSL 434

Query: 719 SGHLRSFFGTV------WNAFMYVLEHSYVSFAGALLLLIVAITFVPS--KLSRKKRAMI 770
              L     ++      W     V+ +S  S   A+++++    F  +  K +RK    +
Sbjct: 435 MEKLSDLEVSIPNLLEAWRQLFLVMAYSPSSVMLAVIIVLGCAVFSAAGVKRTRKLAYAV 494

Query: 771 GVLHVSAHLAAALILMLLL 789
           G  H   HL  A+ L+ L+
Sbjct: 495 GAAHGLLHLGLAIGLLWLM 513


>gi|108759444|ref|YP_633099.1| hypothetical protein MXAN_4943 [Myxococcus xanthus DK 1622]
 gi|108463324|gb|ABF88509.1| conserved hypothetical protein [Myxococcus xanthus DK 1622]
          Length = 596

 Score =  190 bits (483), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 153/493 (31%), Positives = 228/493 (46%), Gaps = 67/493 (13%)

Query: 315 KPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQ-HGDLLY 373
           KP    MV W+  ++  L KT     +S  V   + D R++ A  N  Q   Q + D   
Sbjct: 22  KPHGKPMVRWFHPST--LVKTGVKAFLSA-VIGKQADRRLLDALTNPKQPLEQPYFDCTV 78

Query: 374 DHLSEKED-LWFDFMADTGDGGNSSYSVARLLAQPHIRVTR-DDSVFTLPRGDVLLIGGD 431
           D  +   D LW D+++D GDG N++Y+VA  +  P + +   +        G+VL+ GGD
Sbjct: 79  DDENRPRDVLWLDYVSDLGDGWNATYAVATAVMNPALALEDPEGETHHTQGGEVLVFGGD 138

Query: 432 LAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIP 491
             YP  S   Y  R   P+E AL             ++  VPS             + +P
Sbjct: 139 EVYPTASVQEYYDRTVGPYEAALN------------HRRRVPS------------LFAVP 174

Query: 492 GNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQF 551
           GNHDW+DGL +F+R  C       W   Q++SYFA++LP GWW+ G D+ L  D+D  Q 
Sbjct: 175 GNHDWYDGLVSFVRLFCQGRTSEAWRTRQRRSYFAIKLPHGWWLLGTDMQLESDLDNTQV 234

Query: 552 KFFAELVK-EQVGERDSVIIMTHEPNWLLDWYFNNVSGK-----NVKHLICDYLKGRCKL 605
           +FF E+V   Q GER  VI+   EP W +      ++G+     N++ L    L  +  +
Sbjct: 235 EFFKEVVSYMQDGER--VILCNAEPEWAVS-KIKAIAGRKALEGNLQFLEQLVLGKKVSI 291

Query: 606 RIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYP 665
            ++GD+HHYMRHS    DG    +  ++ G GGA+L+PTH+    R+  G  +E +  +P
Sbjct: 292 FLSGDLHHYMRHSG--KDG----RQKIIAGGGGAYLYPTHLPGEKRETEG--FELRQCFP 343

Query: 666 SFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSF 725
             + S R+   N L F   N  F    G  Y  L + +    ++N     D  S      
Sbjct: 344 PSQKSRRLTWHN-LAFPFLNPWFGVFMGAFYLQLGWGL--ASDMNSTGVSDKLS------ 394

Query: 726 FGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHLAAALIL 785
                      L+ +   F G  L L+ AITF  ++     R + G LH S HL  A +L
Sbjct: 395 -----RVLHRALKGTGTLFLGG-LTLVSAITFADARRGASWRWLAGGLHGSTHLLTAGLL 448

Query: 786 -----MLLLELGV 793
                 L+ +LGV
Sbjct: 449 TRAAATLVKQLGV 461


>gi|398843188|ref|ZP_10600337.1| hypothetical protein PMI18_05773 [Pseudomonas sp. GM102]
 gi|398103805|gb|EJL93967.1| hypothetical protein PMI18_05773 [Pseudomonas sp. GM102]
          Length = 654

 Score =  189 bits (481), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 143/439 (32%), Positives = 208/439 (47%), Gaps = 50/439 (11%)

Query: 381 DLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAF 440
           DLW D++ADTGDG +S+YS+A  ++Q    V   +   TLPRGDVLL+GGD  YP P+  
Sbjct: 95  DLWVDYLADTGDGWDSTYSMALCVSQ---DVKLPEYPLTLPRGDVLLLGGDQVYPTPAQS 151

Query: 441 TYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGL 500
            Y  R   PF  A  P P  K   V  N  + P     + Q   P     PGNHDW+DGL
Sbjct: 152 GYRTRFLDPFRAAF-PAPVPK---VRPNDQDQP-----VPQPGAPWIVATPGNHDWYDGL 202

Query: 501 NTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAEL-VK 559
             F +  C +  +GGW   Q+ SY+ LQLP GWWV+GLDL L   ID  Q ++F E+  K
Sbjct: 203 RGFSQLFCEQKPIGGWETRQRTSYYVLQLPNGWWVWGLDLQLESMIDREQKRYFKEMHAK 262

Query: 560 EQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHL---------------ICDYLKGRCK 604
            Q G+R  VI+   EP+W+ +     ++ +  K L               I + L     
Sbjct: 263 LQPGDR--VILCAPEPSWVDE--AERLAREESKALPSIETQTPRFRSLREIEELLGDHLA 318

Query: 605 LRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRK---FYGTT--YE 659
           + +AGD HHY R  Y P  G    Q +   G GGAFL+ TH   +  K     GT   Y+
Sbjct: 319 VVLAGDSHHYAR--YQPKAGTQAPQRITCGG-GGAFLNGTHQLPDPPKPINVGGTRQHYD 375

Query: 660 SKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILRED-SF 718
             A YP  + S ++      +   +N  F  +  I+Y +  + +    ++ H  R++ S 
Sbjct: 376 LAAVYPDKKTSEQLR-NRAWRLPTRNLSFCGMLAILYLLFDWMVQSASKVPHPARDNRSL 434

Query: 719 SGHLRSFFGTV------WNAFMYVLEHSYVSFAGALLLLIVAITFVPS--KLSRKKRAMI 770
              L     ++      W     V+ +S  S   A+ +++    F  +  K +RK    +
Sbjct: 435 MEELSRLEASIPNLLEAWRQLFLVMAYSPSSVMLAVTIVLGCAVFSAAGVKRTRKLAYAV 494

Query: 771 GVLHVSAHLAAALILMLLL 789
           G  H   HL  A+ L+ L+
Sbjct: 495 GAAHGLLHLGLAIGLLWLM 513


>gi|336176547|ref|YP_004581922.1| hypothetical protein [Frankia symbiont of Datisca glomerata]
 gi|334857527|gb|AEH08001.1| hypothetical protein FsymDg_0452 [Frankia symbiont of Datisca
           glomerata]
          Length = 622

 Score =  189 bits (481), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 109/325 (33%), Positives = 166/325 (51%), Gaps = 44/325 (13%)

Query: 376 LSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYP 435
           LS++++LWFD+++D GDG +S+Y+VA LLA P +++    +    PRG +L++GGD  YP
Sbjct: 66  LSDQDELWFDYVSDLGDGFDSTYTVASLLAAPGLQLAGSPT----PRGQLLVMGGDQCYP 121

Query: 436 NPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDG----PQCYIIP 491
            PS   YE +L  P+  AL   P      VA   P  P G  E          P+ +++P
Sbjct: 122 TPSITGYENKLIGPYRAAL---PTVASPTVA--SPTAPPGTDEQPAQSARSAQPKLFVVP 176

Query: 492 GNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQF 551
           GNHDW+DGL  FMR  C +S +GGW   Q +SYFA++LP  WW+ G+D+     ID  Q 
Sbjct: 177 GNHDWYDGLTAFMRVFCQRSTIGGWRTEQTRSYFAVKLPHRWWLLGIDIQFDSYIDDPQR 236

Query: 552 KFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLK--------GRC 603
           ++F + V EQ+   D+VI+ + +P+W+     +   G      + DY +         + 
Sbjct: 237 RYFLD-VAEQMRPGDAVILCSAKPSWV-----STGEGNAEAFAVLDYFERTVIRKAGAQV 290

Query: 604 KLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNF------------R 651
           ++ +AGD+HHY R+  V  D           G GGA+L  TH                 R
Sbjct: 291 RVSLAGDLHHYARYRQVDGD-----TQKFTAGGGGAYLSATHHLPQHLTLPPPASRAPSR 345

Query: 652 KFYGTTYESKAAYPSFEDSSRIALG 676
                 Y+ + ++P    S R+A G
Sbjct: 346 TDPPAHYQLETSFPDARTSRRLAAG 370


>gi|310823863|ref|YP_003956221.1| hypothetical protein STAUR_6637 [Stigmatella aurantiaca DW4/3-1]
 gi|309396935|gb|ADO74394.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
          Length = 545

 Score =  189 bits (479), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 137/466 (29%), Positives = 220/466 (47%), Gaps = 70/466 (15%)

Query: 332 LFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLS-EKEDLWFDFMADT 390
           L KT    ++S T+   + D R++ A      E     D   D     + +LW D+++D 
Sbjct: 10  LAKTGVRAVISATIG-KQADRRLLDALAAPKVEPC---DFSVDAAGLPRRELWLDYVSDL 65

Query: 391 GDGGNSSYSVARLLAQPHIRVTRDDS--VFTLPRGDVLLIGGDLAYPNPSAFTYERRLFR 448
           GDG +S+Y+VA  ++QP + + RD++        G+VL+ GGD  YP  S   YE +  +
Sbjct: 66  GDGWDSTYAVALAVSQPTL-ILRDETGAAHETQGGEVLVFGGDEVYPAASVKAYEEKTVQ 124

Query: 449 PFEYAL--QPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRF 506
           P+  A+  Q PP +                           + +PGNHDW+DGL +FMR 
Sbjct: 125 PWSSAMRGQRPPAH--------------------------LFAVPGNHDWYDGLVSFMRL 158

Query: 507 ICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERD 566
            C    L GW   Q +SYFA++LP+GWW+ G D+ L  DID  Q  +F E+ K Q+G++D
Sbjct: 159 FCQGRSLAGWQTHQHRSYFAVKLPQGWWLLGTDMQLESDIDQPQVCYFQEIAK-QIGDKD 217

Query: 567 SVIIMTHEPNWLLDWYFNNVS----GKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPS 622
            +I+   EP WL      ++       N+  L    L  +  + +AGD+HHY RH+    
Sbjct: 218 RIILCLAEPAWLGAQVHASMGRSYLENNLDFLEDQVLGKKISVFLAGDLHHYRRHAN--D 275

Query: 623 DGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFR 682
           +G    +  +  G GGAFLHPTH+           +  + ++PS E+S R++  N L   
Sbjct: 276 EG----RQKITAGGGGAFLHPTHLPRRHPPAL-EDFTVRKSFPSAEESRRLSWRN-LWLV 329

Query: 683 KKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYV 742
             N +F  + G++Y +L +++        +                  NA   V+  +  
Sbjct: 330 SHNPRFGILMGLIYTLLAWALAVNIGTARLP-----------------NALENVMAMALS 372

Query: 743 SFAGALLLLIVA---ITFVPSKLSRKKRAMIGVLHVSAHLAAALIL 785
           +   AL+ L +    I F      R  R ++G++H  AHL AA ++
Sbjct: 373 TPGSALVCLAIILGLIAFADRSFGR-GRWLVGLVHSLAHLGAAFLI 417


>gi|115372653|ref|ZP_01459960.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
 gi|115370374|gb|EAU69302.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
          Length = 571

 Score =  188 bits (478), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 137/466 (29%), Positives = 220/466 (47%), Gaps = 70/466 (15%)

Query: 332 LFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLS-EKEDLWFDFMADT 390
           L KT    ++S T+   + D R++ A      E     D   D     + +LW D+++D 
Sbjct: 36  LAKTGVRAVISATIG-KQADRRLLDALAAPKVEPC---DFSVDAAGLPRRELWLDYVSDL 91

Query: 391 GDGGNSSYSVARLLAQPHIRVTRDDS--VFTLPRGDVLLIGGDLAYPNPSAFTYERRLFR 448
           GDG +S+Y+VA  ++QP + + RD++        G+VL+ GGD  YP  S   YE +  +
Sbjct: 92  GDGWDSTYAVALAVSQPTL-ILRDETGAAHETQGGEVLVFGGDEVYPAASVKAYEEKTVQ 150

Query: 449 PFEYAL--QPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRF 506
           P+  A+  Q PP +                           + +PGNHDW+DGL +FMR 
Sbjct: 151 PWSSAMRGQRPPAH--------------------------LFAVPGNHDWYDGLVSFMRL 184

Query: 507 ICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERD 566
            C    L GW   Q +SYFA++LP+GWW+ G D+ L  DID  Q  +F E+ K Q+G++D
Sbjct: 185 FCQGRSLAGWQTHQHRSYFAVKLPQGWWLLGTDMQLESDIDQPQVCYFQEIAK-QIGDKD 243

Query: 567 SVIIMTHEPNWLLDWYFNNVS----GKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPS 622
            +I+   EP WL      ++       N+  L    L  +  + +AGD+HHY RH+    
Sbjct: 244 RIILCLAEPAWLGAQVHASMGRSYLENNLDFLEDQVLGKKISVFLAGDLHHYRRHAN--D 301

Query: 623 DGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFR 682
           +G    +  +  G GGAFLHPTH+           +  + ++PS E+S R++  N L   
Sbjct: 302 EG----RQKITAGGGGAFLHPTHLPRRHPPAL-EDFTVRKSFPSAEESRRLSWRN-LWLV 355

Query: 683 KKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYV 742
             N +F  + G++Y +L +++        +                  NA   V+  +  
Sbjct: 356 SHNPRFGILMGLIYTLLAWALAVNIGTARLP-----------------NALENVMAMALS 398

Query: 743 SFAGALLLLIVA---ITFVPSKLSRKKRAMIGVLHVSAHLAAALIL 785
           +   AL+ L +    I F      R  R ++G++H  AHL AA ++
Sbjct: 399 TPGSALVCLAIILGLIAFADRSFGR-GRWLVGLVHSLAHLGAAFLI 443


>gi|383456297|ref|YP_005370286.1| hypothetical protein COCOR_04316 [Corallococcus coralloides DSM
           2259]
 gi|380729763|gb|AFE05765.1| hypothetical protein COCOR_04316 [Corallococcus coralloides DSM
           2259]
          Length = 606

 Score =  188 bits (477), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 129/394 (32%), Positives = 191/394 (48%), Gaps = 54/394 (13%)

Query: 316 PEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDH 375
           P    MV W+    A L K+    L+S T   GR   R +  A+++ Q       L +D+
Sbjct: 25  PPGRKMVSWFD--PAVLAKSGMKALLSGTF--GRQADRRLLDAVSRPQP------LCFDY 74

Query: 376 LSE-----KEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRG-DVLLIG 429
             +     +++LW D+++D GDG +S+Y+VA  +  P + +T +     + RG DVL+ G
Sbjct: 75  SVDDTGHPRDELWLDYVSDLGDGWDSTYAVASTVMAPELALTDESGKTHVTRGGDVLVFG 134

Query: 430 GDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYI 489
           GD  YP  S   Y+ R   P+E AL                         K+   P  + 
Sbjct: 135 GDEVYPTASVDEYQARTVVPYEDALN------------------------KRRHRPHLFA 170

Query: 490 IPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVY 549
           +PGNHDW+DGL +F R  C      GW   Q++SYFAL+LP GWW+ G D+ L  D+D  
Sbjct: 171 VPGNHDWYDGLVSFTRLFCQGRRSHGWRTQQQRSYFALRLPHGWWLLGTDMQLESDLDAP 230

Query: 550 QFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVS----GKNVKHLICDYLKGRCKL 605
           Q +FF ++  +  G  D VI+   EP W+               N+  L    L  +  +
Sbjct: 231 QVEFFQKVAGQMRGT-DRVILCNAEPAWVKQQVMPRAGRGFLDHNIDFLEQKVLGKKVSV 289

Query: 606 RIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYP 665
            +AGD+HHY RH    +DG    +  ++ G GGAFLHPTH+    +   G  +  KA+YP
Sbjct: 290 FLAGDLHHYRRHES--ADG----RQKIIAGGGGAFLHPTHLPRVDQSPGG--FVQKASYP 341

Query: 666 SFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVL 699
             + S R+A  N+L F   N  F    G+VY +L
Sbjct: 342 PQKTSQRLAWRNLL-FTALNPWFGVFMGVVYTLL 374


>gi|157868485|ref|XP_001682795.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68126251|emb|CAJ03655.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 1399

 Score =  187 bits (475), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 110/322 (34%), Positives = 165/322 (51%), Gaps = 41/322 (12%)

Query: 410  RVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKK------- 462
            +V  DD V TLPRG  +L+GGDLAYP+P+  TY  RLF P+  A+      +        
Sbjct: 728  KVGTDDFV-TLPRGSFVLVGGDLAYPSPNDETYTTRLFEPYHDAMSSNVRLQSLFHAEQR 786

Query: 463  -----------------------DHVAVNKPEVPSG--VPELKQYDGPQCYIIPGNHDWF 497
                                     +A  +  + +G    E      P  + IPGNHDWF
Sbjct: 787  RVVVADASDADVAHMHLLDAETVSRMATGRAALRTGHATAEEALRSVPLLFAIPGNHDWF 846

Query: 498  DGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAEL 557
            DGL T+ ++I  ++W+GGW MPQ+ S+F L+LP  W+V   D     DIDV Q  +F ++
Sbjct: 847  DGLTTYRKYILERTWIGGWLMPQRSSFFVLRLPHNWFVLCGDTGNMQDIDVAQRNYFLDV 906

Query: 558  VKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVK-HLICDYLKGRCKLRIAGDMHHYMR 616
            +++ +     VI+  HEP+W+LD    +   +  + H + + L  R +LR+AGD+HHY R
Sbjct: 907  IEKYMDAESCVILAAHEPDWVLDAMERDDKARQPELHRVVEALGTRLRLRLAGDIHHYSR 966

Query: 617  HSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALG 676
            H+  P D       L+V+G GGAFLH     ++     GT Y    A+P  E ++ + + 
Sbjct: 967  HT--PRDASSEAATLVVSGGGGAFLHGAR--NDVIISQGTRYVRACAFP--EQNTFMNMA 1020

Query: 677  NIL-KFRKKNWQFDFIGGIVYF 697
            + L  FR  NW+FD + G + F
Sbjct: 1021 SRLWGFRVINWKFDLVVGFLCF 1042



 Score = 45.4 bits (106), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 18/29 (62%), Positives = 25/29 (86%)

Query: 379 KEDLWFDFMADTGDGGNSSYSVARLLAQP 407
           + D+WFD++AD GDG N +Y++ARLLAQP
Sbjct: 534 EPDVWFDWIADVGDGFNPTYAMARLLAQP 562


>gi|398861007|ref|ZP_10616646.1| hypothetical protein PMI36_04611 [Pseudomonas sp. GM79]
 gi|398233895|gb|EJN19799.1| hypothetical protein PMI36_04611 [Pseudomonas sp. GM79]
          Length = 658

 Score =  186 bits (473), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 139/439 (31%), Positives = 205/439 (46%), Gaps = 50/439 (11%)

Query: 381 DLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAF 440
           D+W D++ADTGDG +S+YS+A  ++         +   +LPRGDVLL+GGD  YP P+  
Sbjct: 96  DVWVDYLADTGDGWDSTYSMALCVSH---AADLPEHQLSLPRGDVLLLGGDQVYPTPAGN 152

Query: 441 TYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGL 500
            Y  R   P+  A  P P  K+      +P        ++Q   P     PGNHDW+DGL
Sbjct: 153 GYRTRFLDPYRAAF-PAPVPKERPGDQAQP--------VRQPGAPWILATPGNHDWYDGL 203

Query: 501 NTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAEL-VK 559
             F +  C +  +G W   Q+  Y+ LQLP GWWV+GLDL     ID  Q ++F ++  K
Sbjct: 204 RGFSQLFCEQKPIGAWETRQRTGYYVLQLPNGWWVWGLDLQFESMIDRQQKQYFQQMHAK 263

Query: 560 EQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHL---------------ICDYLKGRCK 604
            Q G+R  VI+ T EP+W+ +     ++ +  K L               I + L     
Sbjct: 264 LQPGDR--VILCTPEPSWVDE--AERLAREESKALPSIETQTPRFRSLREIEELLGDHLA 319

Query: 605 LRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRK---FYGTT--YE 659
           + +AGD HHY R  Y P  G    Q +   G GGAFL+ TH   +  K     GT   YE
Sbjct: 320 VVLAGDSHHYAR--YQPKAGTQAPQRITCGG-GGAFLNATHHLPDPPKPINVGGTRQHYE 376

Query: 660 SKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILRE---- 715
             A YP+ + S ++      +   +N  F  +  I+Y +  + +     + H  R     
Sbjct: 377 LAAVYPAKKTSEQLR-NRAWRLPTRNLSFCAMLAILYLLFDWMVQSASTVPHPARGNRSL 435

Query: 716 -DSFSGHLRSF--FGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPS--KLSRKKRAMI 770
            D  S    S      VW   + VL HS  S   A+++++    F  +  K +RK    I
Sbjct: 436 MDVLSDLQASIPNISEVWRHLLLVLSHSPSSVMFAVIIVLGCAVFSAAGVKRTRKLAYAI 495

Query: 771 GVLHVSAHLAAALILMLLL 789
           G +H   HL  A+ L+ L+
Sbjct: 496 GAVHGGLHLGLAIGLLWLM 514


>gi|401421232|ref|XP_003875105.1| conserved hypothetical protein [Leishmania mexicana
            MHOM/GT/2001/U1103]
 gi|322491341|emb|CBZ26610.1| conserved hypothetical protein [Leishmania mexicana
            MHOM/GT/2001/U1103]
          Length = 1397

 Score =  186 bits (472), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 110/327 (33%), Positives = 163/327 (49%), Gaps = 39/327 (11%)

Query: 404  LAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPP----- 458
            L++  ++V +D  V TLPRG  +L+GGDLAYP+P+  TY  RLF P+  A+         
Sbjct: 720  LSRSLLKVEKDGFV-TLPRGSFVLVGGDLAYPSPNDETYTTRLFEPYHDAMSSNARLQSV 778

Query: 459  WYKKDHVAVNKPEVPSGVPELKQYDG---------------------------PQCYIIP 491
            ++ +    V      + V  +   D                            P  + IP
Sbjct: 779  FHAEQRRVVVADASDADVAHMHMLDAETVSRMATGHAALRTGRATAEEALRSVPLLFAIP 838

Query: 492  GNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQF 551
            GNHDWFDGL T+ ++I  ++WLGGW MPQ+ S+F LQLP  W+V   D     DIDV Q 
Sbjct: 839  GNHDWFDGLTTYRKYILERTWLGGWLMPQRSSFFVLQLPHNWFVLCGDTGNVQDIDVAQR 898

Query: 552  KFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVK-HLICDYLKGRCKLRIAGD 610
             +F +++++ +     VI+  HEP W+LD    +   +  +   + + L  R +LR+AGD
Sbjct: 899  NYFLDVIEKHMDAESCVILAAHEPGWVLDAMERDDRARQPELDRVVEALGTRLRLRLAGD 958

Query: 611  MHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDS 670
            +HHY RH+  P D       L+V+G GGAFLH     ++     GT Y    A+P     
Sbjct: 959  IHHYSRHT--PRDASSEAATLVVSGGGGAFLHGAR--NDVVISQGTRYVRACAFPERNTF 1014

Query: 671  SRIALGNILKFRKKNWQFDFIGGIVYF 697
              +A   +  FR  NW+FD + G + F
Sbjct: 1015 MNMA-SRLWGFRVINWKFDLVVGFLCF 1040



 Score = 47.8 bits (112), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 19/33 (57%), Positives = 28/33 (84%)

Query: 379 KEDLWFDFMADTGDGGNSSYSVARLLAQPHIRV 411
           + D+WFD++AD GDG N +Y++ARLLAQP +R+
Sbjct: 532 EPDVWFDWIADVGDGFNPTYAMARLLAQPMLRL 564


>gi|389603736|ref|XP_003723013.1| conserved hypothetical protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
 gi|322504755|emb|CBZ14539.1| conserved hypothetical protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
          Length = 1415

 Score =  186 bits (471), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 107/316 (33%), Positives = 157/316 (49%), Gaps = 38/316 (12%)

Query: 415  DSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKK------------ 462
            DS  TLPRG  +L+GGDLAYP+P+  TY  RLF P+  A+      +             
Sbjct: 748  DSFVTLPRGSFVLVGGDLAYPSPNDETYTTRLFEPYHDAMSGNVRLQSVFHAEQQRVIVA 807

Query: 463  ------------------DHVAVNKPEVPSG--VPELKQYDGPQCYIIPGNHDWFDGLNT 502
                                +A  +  + +G    E      P  + IPGNHDWFDGL T
Sbjct: 808  DASDADVAHVHLLDAETVSRMATGRAALRTGRATAEEALRSVPLLFAIPGNHDWFDGLTT 867

Query: 503  FMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQV 562
            F ++I  ++W+GGW MPQ+ S+F L+LP  W++   D     DIDV Q  +F +++++ +
Sbjct: 868  FHKYILERTWIGGWLMPQRSSFFVLRLPHNWFILCGDTGNMQDIDVAQRNYFLDVIEKYM 927

Query: 563  GERDSVIIMTHEPNWLLDWYFNNVSGKNVK-HLICDYLKGRCKLRIAGDMHHYMRHSYVP 621
                 VI+ +HEP W+ D    +   +  + + + + L  R +LR+AGD+HHY RH  VP
Sbjct: 928  DVESCVILASHEPGWVYDAMEKDEKARQPELNRVVEALGTRLRLRLAGDIHHYSRH--VP 985

Query: 622  SDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKF 681
             D       L+V+G GGAFLH     S   +  GT Y    A+P       +A   +  F
Sbjct: 986  VDASSEAATLVVSGGGGAFLHGARDDSIISQ--GTKYVRACAFPDRNTFMNMA-SRLWGF 1042

Query: 682  RKKNWQFDFIGGIVYF 697
            R  NW+FD + G + F
Sbjct: 1043 RVINWKFDLVVGFLCF 1058



 Score = 47.8 bits (112), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 19/33 (57%), Positives = 28/33 (84%)

Query: 379 KEDLWFDFMADTGDGGNSSYSVARLLAQPHIRV 411
           + D+WFD++AD GDG N +Y++ARLLAQP +R+
Sbjct: 549 EPDVWFDWIADVGDGFNPTYAMARLLAQPMLRL 581


>gi|113867665|ref|YP_726154.1| hypothetical protein H16_A1654 [Ralstonia eutropha H16]
 gi|113526441|emb|CAJ92786.1| hypothetical membrane spanning protein [Ralstonia eutropha H16]
          Length = 660

 Score =  184 bits (467), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 145/451 (32%), Positives = 213/451 (47%), Gaps = 59/451 (13%)

Query: 339 LLVSVTVFVGRF-DMRMMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSS 397
           + V+V + +G F D RM+QAA+               H      +W D++ADTGDG +S+
Sbjct: 60  MRVAVAMAIGAFADQRMVQAALTTLNPNPPLPVPADGH----GGVWVDYLADTGDGWDST 115

Query: 398 YSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPP 457
           YS+A  L   H    R+  V TLPR  VLL+GGD  YP P+   Y  R   PF  A   P
Sbjct: 116 YSMA--LCVSHAVTLREQGV-TLPRAKVLLLGGDQVYPTPAHEGYRTRFVDPFRAAFPAP 172

Query: 458 PWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWF 517
                  V + +P   S  P +K  D P     PGNHDW+DGL  F R  C    +GGW 
Sbjct: 173 -------VEIVRPAY-SDQP-IKLPDAPWMVATPGNHDWYDGLRGFARLFCSGKPVGGWE 223

Query: 518 MPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAEL-VKEQVGERDSVIIMTHEPN 576
             Q+  Y+ALQLP GWWV+GLDL L  +ID  Q ++F ++    Q G+R  V++   E +
Sbjct: 224 TRQRTGYYALQLPGGWWVWGLDLQLESEIDRPQREYFQKMHTALQPGDR--VVLCAPEAS 281

Query: 577 WLLDW---------YFNNVSGKNVKHLICDYLKG----RCKLRIAGDMHHYMRHSYVPSD 623
           W+ +             ++  K  +      ++G    R  L +AG+ HHY    YVP +
Sbjct: 282 WIDEMERVRRGERRAVPSIEAKTPRFRSLSEIEGMLGDRLALVLAGNSHHYAH--YVPRE 339

Query: 624 GPVYVQHLLVNGCGGAFLHPTHVFSN---------FRKFYGTTYESKAAYPSFEDSSRIA 674
           G     H +  G GGAFLH TH+  +          R+F    YE K+AYP  + +SR  
Sbjct: 340 G-TAGPHRITCGGGGAFLHGTHLLPDPPRPIVVGPARQF----YELKSAYPE-KSASRQL 393

Query: 675 LGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILRE-----DSFSGHLRSF--FG 727
                +   +N  F  +  I+Y +  + +     ++   R+     D  +G   +F   G
Sbjct: 394 RNRAWQLPARNLSFCALVAILYLLFDWIIQSASLVSVPGRDQRSFVDRLAGLEVTFGNIG 453

Query: 728 TVWNAFMYVLEHSYVS--FAGALLLLIVAIT 756
            V++    V+ HS  S  FA A++    A++
Sbjct: 454 EVFHQLFRVMAHSPASVVFAAAIVFTCAALS 484


>gi|146085100|ref|XP_001465174.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|134069271|emb|CAM67421.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 1398

 Score =  183 bits (465), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 106/317 (33%), Positives = 159/317 (50%), Gaps = 40/317 (12%)

Query: 415  DSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKK------------ 462
            D   TLPRG  +L+GGDLAYP P+  TY  RLF P+  A+      +             
Sbjct: 731  DGFVTLPRGSFVLVGGDLAYPGPNDETYTTRLFEPYHDAMSSNVRLQSVFHAEQRRVVVA 790

Query: 463  ------------------DHVAVNKPEVPSG--VPELKQYDGPQCYIIPGNHDWFDGLNT 502
                                +A  +  + +G    E      P  + IPGNHDWFDGL T
Sbjct: 791  DASDADVAHIHLLDAETVSRMATGRAALRTGRATAEEALRSVPLLFAIPGNHDWFDGLTT 850

Query: 503  FMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQV 562
            + ++I  ++W+GGW MPQ+ S+F L+LP  W+V   D     DIDV Q  +F +++++ +
Sbjct: 851  YRKYILERTWIGGWLMPQRSSFFVLRLPHNWFVLCGDTGNMQDIDVAQRNYFLDVIEKCM 910

Query: 563  GERDSVIIMTHEPNWLLDWYFNNVSGKNVK-HLICDYLKGRCKLRIAGDMHHYMRHSYVP 621
                 VI+  HEP W+LD    +   +  + + + + L  R +LR+AGD+HHY RH+  P
Sbjct: 911  DAESCVILAAHEPGWVLDAMERDDKARQPELNRVAEALGTRLRLRLAGDIHHYSRHT--P 968

Query: 622  SDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNIL-K 680
             D       L+V+G GGAFLH     +   +  GT Y    A+P  E ++ + + + L  
Sbjct: 969  RDASSEAATLVVSGGGGAFLHGARNDAIISQ--GTRYVRACAFP--EQNTFMNMASRLWG 1024

Query: 681  FRKKNWQFDFIGGIVYF 697
            FR  NW+FD + G + F
Sbjct: 1025 FRVINWKFDLVVGFLCF 1041



 Score = 47.0 bits (110), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 19/31 (61%), Positives = 27/31 (87%)

Query: 381 DLWFDFMADTGDGGNSSYSVARLLAQPHIRV 411
           D+WFD++AD GDG N +Y++ARLLAQP +R+
Sbjct: 536 DVWFDWIADVGDGFNPTYAMARLLAQPILRL 566


>gi|339325808|ref|YP_004685501.1| hypothetical protein CNE_1c16770 [Cupriavidus necator N-1]
 gi|338165965|gb|AEI77020.1| hypothetical membrane protein [Cupriavidus necator N-1]
          Length = 666

 Score =  183 bits (464), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 142/456 (31%), Positives = 208/456 (45%), Gaps = 68/456 (14%)

Query: 339 LLVSVTVFVGRF-DMRMMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSS 397
           + V+V + +G F D RM+QAA+              +H      +W D++ADTGDG NS+
Sbjct: 61  MRVAVAMAIGAFADQRMVQAALTTLNPNPPLPVPADEH----GGVWVDYLADTGDGWNST 116

Query: 398 YSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPP 457
           YS+A  L   H    R+  V TLPR  VLL+GGD  YP P+   Y  R   PF  A   P
Sbjct: 117 YSMA--LCVSHAVTLREQGV-TLPRAKVLLLGGDQVYPTPAHDGYRTRFVDPFRAAFPAP 173

Query: 458 -----PWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSW 512
                P Y    + +               D P     PGNHDW+DGL  F R  C    
Sbjct: 174 VEAVHPVYSDQTIKL-------------PLDAPWMVATPGNHDWYDGLRGFARLFCSGKP 220

Query: 513 LGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKE-QVGERDSVIIM 571
           +GGW   Q+  Y+ALQLP GWW++GLDL L  +ID  Q ++F +L    Q G+R  V++ 
Sbjct: 221 VGGWETRQRTGYYALQLPGGWWIWGLDLQLESEIDRPQREYFQKLCTALQPGDR--VVLC 278

Query: 572 THEPNWLLDW---------YFNNVSGKNVKHLICDYLKG----RCKLRIAGDMHHYMRHS 618
             E +W+ +             ++  K  +      ++G    R  L +AG+ HHY    
Sbjct: 279 APEASWIDEMERVRRGERRAMPSIETKTPRFRSLSEIEGMLGDRLALVLAGNSHHYAH-- 336

Query: 619 YVPSDGPVYVQHLLVNGCGGAFLHPTHVFSN---------FRKFYGTTYESKAAYPSFED 669
           YVP  G     H +  G GGAFLH TH+  +          R+F    YE K+AYP  + 
Sbjct: 337 YVPRAG-TAGPHRITCGGGGAFLHGTHLLPDPPRPIVVGPARQF----YELKSAYPE-KA 390

Query: 670 SSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILRED-SFSGHLRSF--- 725
           ++R       +   +N  F  +  I+Y +  +S+     ++   R+   F G L      
Sbjct: 391 TARKLRNRAWQLPARNLSFCALVAILYLLFDWSIQSASLVSVPGRDQRGFVGRLAGLEVT 450

Query: 726 ---FGTVWNAFMYVLEHSYVS--FAGALLLLIVAIT 756
               G +++    V+ HS  S  FA A++    A++
Sbjct: 451 FGNIGEMFHQLFRVMAHSPASVVFAAAIVFSCAALS 486


>gi|91978404|ref|YP_571063.1| hypothetical protein RPD_3941 [Rhodopseudomonas palustris BisB5]
 gi|91684860|gb|ABE41162.1| hypothetical protein RPD_3941 [Rhodopseudomonas palustris BisB5]
          Length = 579

 Score =  181 bits (458), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 129/408 (31%), Positives = 199/408 (48%), Gaps = 67/408 (16%)

Query: 332 LFKTVFDLLVSVTVFVGRFDMRMMQAAMN--KDQEGAQHGDLLYDHLSEKED--LWFDFM 387
           LFK + ++++S +VF    D R++ AA++    Q+  Q      + L    D  LW DF+
Sbjct: 24  LFKLLVNVVIS-SVFGSYADRRLLIAALDTTDTQKLLQRARETREMLQPGPDGALWLDFV 82

Query: 388 ADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLF 447
           AD GDG +S+YSVA LLAQ  +RV   D    LPRG  L++GGD  YP  +A  Y  +L+
Sbjct: 83  ADLGDGFDSTYSVATLLAQKQLRVGGRD----LPRGQALIMGGDEVYPKATADAYRYQLY 138

Query: 448 RPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFI 507
            P+ +A   P            P   +G P          + IPGNHDW+DGL+ F+ + 
Sbjct: 139 WPYAWASPDP-----------HPGEATGTP---------LFAIPGNHDWYDGLSLFLAWF 178

Query: 508 CHKSWL--GGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGER 565
           C    +  G W   Q++SYFA Q+   WW++ +D+ L  ++D  Q  +F + + E + E 
Sbjct: 179 CRAKPVRFGSWRTVQRRSYFANQITDTWWIWAIDIQLADNMDQPQADYF-KTIAENMPEN 237

Query: 566 DSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYL--------KG-RCKLRIAGDMHHYMR 616
             +I+ + EP WL    +   S ++    I +Y         KG    + ++GD HHY R
Sbjct: 238 SKIILCSAEPGWL----YVETSSESTSWEIVEYAIELAENAGKGLTVPVVLSGDTHHYNR 293

Query: 617 HSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNF---------RKFYGTTYESKAA---- 663
           ++ + +      Q  + +G GGAFLHPTH   +          +     +   K A    
Sbjct: 294 YTGLKN------QQYITSGGGGAFLHPTHQLEDVIPLRRCGVNQSLTLASASDKGAGPAV 347

Query: 664 YPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFV--LVFSMFPQCEL 709
           YP FE S  +   N L F   NW F  + G+VYF+  +  S+ P  ++
Sbjct: 348 YPGFELSKSLVWRN-LYFALTNWDFSLLMGMVYFLFGVAISLRPHWDM 394


>gi|383775510|ref|YP_005460076.1| hypothetical protein AMIS_3400 [Actinoplanes missouriensis 431]
 gi|381368742|dbj|BAL85560.1| hypothetical protein AMIS_3400 [Actinoplanes missouriensis 431]
          Length = 583

 Score =  180 bits (457), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 149/514 (28%), Positives = 227/514 (44%), Gaps = 102/514 (19%)

Query: 300 PLSVEEYEKMKKKQLKPEFLDMVPWYS-GTSADLFKTVFDLLVSVTVFVGRF-DMRMMQA 357
           P+  E+    +   L P+ L   P    G  A L      L   + +  G + D R +Q 
Sbjct: 9   PVESEQTVPERPGSLDPQELGFTPKGPIGWLAPLLLLSTGLRALLAILFGAYLDKRELQN 68

Query: 358 AMNKDQEGAQHGDLLYDHLSEKE-DLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDS 416
           +++ D          +DH S  + +LW D++AD GDG +++YSVA LLAQP + V  +  
Sbjct: 69  SLDDD---------FFDHSSTADGELWLDYVADLGDGFDATYSVAYLLAQPELEVDGE-- 117

Query: 417 VFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGV 476
              LPRG +LL+GGD  YP  S   YE R+  P+  AL               PE P+G 
Sbjct: 118 --RLPRGRLLLMGGDQVYPLASGDGYENRMKGPYRAAL---------------PEAPAGE 160

Query: 477 PELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHK--SWLGGWFMPQKKSYFALQLPKGWW 534
           P       P  + +PGNHDW+DGL  F+R    +    +GGW   Q++SYFA++LP  WW
Sbjct: 161 PR------PTLFALPGNHDWYDGLTAFLRLFARRKDGHIGGWRTEQRRSYFAVRLPANWW 214

Query: 535 VFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHL 594
           +F +D      ID  Q  +F E   E+VG  D VI+MT  P W+        + K  ++ 
Sbjct: 215 LFAVDEQFGAYIDDPQLLYF-ERAAEEVGPDDRVILMTPSPTWV------KAADKPEEYD 267

Query: 595 ICDYL--------KGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAF------ 640
             DY         +   ++ ++GD+HHY R  Y   D     + L+  G GGA+      
Sbjct: 268 AVDYFIRTILAPTRAHVRVLVSGDLHHYAR--YTGDD-----RELITCGGGGAYTLGTQN 320

Query: 641 ------LHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGI 694
                 + P    +   K    TY  + ++P  + S R   G   +  ++N  F  + GI
Sbjct: 321 LPGELTVPPKETLTR-SKSRSRTYGLEKSFPDPDLSRRWGRGVFHRLPRRNKGFATMLGI 379

Query: 695 VYFVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVA 754
           +  + + +M          RED          G++   F   L          +LL+++A
Sbjct: 380 IQTLTMLAMAGAA----ASRED----------GSILKLFTIPLV--------LMLLVVMA 417

Query: 755 ITFV---PSKLSRKKRA---MIGVLHVSAHLAAA 782
            T +   P      KR    ++GVLH  A +A A
Sbjct: 418 ATTLFAQPPPAPSPKRVRHWVLGVLHGFAQIALA 451


>gi|386847643|ref|YP_006265656.1| hypothetical protein ACPL_2693 [Actinoplanes sp. SE50/110]
 gi|359835147|gb|AEV83588.1| hypothetical protein ACPL_2693 [Actinoplanes sp. SE50/110]
          Length = 586

 Score =  180 bits (457), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 134/441 (30%), Positives = 194/441 (43%), Gaps = 90/441 (20%)

Query: 348 GRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQP 407
           G  D R +QA+   D         ++       + W DF+AD GDG +++YS+A LLAQP
Sbjct: 55  GYLDKRELQASFPND---------VHREAGPDGEAWIDFVADLGDGFHATYSIAYLLAQP 105

Query: 408 HIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAV 467
            ++V   D    LPRG  L++GGD  YP PSA  YE RL  P+  A+             
Sbjct: 106 SLKVGEHD----LPRGRALILGGDEVYPTPSAQGYEDRLVGPYHAAM------------- 148

Query: 468 NKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFIC--HKSWLGGWFMPQKKSYF 525
             P  P G        GP  Y +PGNHDW+DGL  F+R      ++ +GGW +PQ +SYF
Sbjct: 149 --PGTPPG-----DGAGPAMYALPGNHDWYDGLTAFLRLFTGTRRTGIGGWRLPQHRSYF 201

Query: 526 ALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNN 585
           A+QLP  WW+  LD      ID  Q  +F+  V    G +  VI+ T  P W        
Sbjct: 202 AVQLPGDWWLLALDDQDSTYIDDPQLAYFSR-VAANFGPQTRVIVATASPTW-------- 252

Query: 586 VSGKNVKHL----------ICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNG 635
           V G +V  +          + +    + +L ++GD HHY R+S          + L+  G
Sbjct: 253 VQGDDVPEVYASLDYFVRAVIEPTGAKIRLMVSGDWHHYARYSGA-------ERELITCG 305

Query: 636 CGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIV 695
            GGA+L+PTH          T     A  PS   S R+      +F  K      +    
Sbjct: 306 GGGAYLYPTHQLPE------TIEVPPADLPS--PSPRVKYSLRSRFPGK------LRSQA 351

Query: 696 YFVLVFSMFPQCELN--------HILREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGA 747
           Y   +F   P+   +        H +   + SG L+S FG+    F        ++    
Sbjct: 352 YAASIFGRLPKDNPSFIGMIGAVHTMMLLAASGVLKSGFGSPLQKFA-------LAPLVV 404

Query: 748 LLLLIVAITFVPSKLSRKKRA 768
           L+ L+VA ++  + LSR  R 
Sbjct: 405 LMALVVAGSYAFAHLSRSVRG 425


>gi|386845458|ref|YP_006263471.1| hypothetical protein ACPL_506 [Actinoplanes sp. SE50/110]
 gi|359832962|gb|AEV81403.1| hypothetical protein ACPL_506 [Actinoplanes sp. SE50/110]
          Length = 575

 Score =  179 bits (455), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 126/455 (27%), Positives = 205/455 (45%), Gaps = 83/455 (18%)

Query: 288 VVERSTGWALTHPLSVEEYEKMKKKQLKPEFLDMVPWYS-GTSADLFKTVFDLLVSVTVF 346
           VV    GW+       E     +   + P+ L   P    G  A L      L   + + 
Sbjct: 3   VVPPPAGWS-----GAEHTVPTRPTSMDPQELGFTPKQPIGWLAPLLLLSTGLRALLAIL 57

Query: 347 VGRF-DMRMMQAAMNKDQEGAQHGDLLYDHLSEKE-DLWFDFMADTGDGGNSSYSVARLL 404
            G + D R +Q A++ D          +DH +  + ++W D++AD GDG +++YS+A LL
Sbjct: 58  FGAYMDKRELQNALDAD---------FFDHSATADGEIWLDYVADLGDGFDATYSIAWLL 108

Query: 405 AQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDH 464
           AQP  R+T D +   LPRG +LL+GGD  YP  S   YE R+  P+  AL          
Sbjct: 109 AQP--RLTVDGAA--LPRGRLLLMGGDQVYPLASGDGYESRMKGPYRAAL---------- 154

Query: 465 VAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHK--SWLGGWFMPQKK 522
                PE P+G P       P  + +PGNHDW+DGL  F+R    +   ++GGW   Q++
Sbjct: 155 -----PEAPAGEPR------PTLFALPGNHDWYDGLTAFLRLFARRKDGFIGGWRTEQRR 203

Query: 523 SYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWL---- 578
           SYFA++LP  WW+F +D      ID  Q  +F +  +E V   D +I+MT  P W+    
Sbjct: 204 SYFAVKLPNNWWLFAVDEQFGAYIDDPQLLYFEKAARE-VTPDDRIILMTPSPTWVKARN 262

Query: 579 -------LDWYFNNVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHL 631
                  +D++   +             + + ++ ++GD+HHY R++          + L
Sbjct: 263 DPEAYDAVDYFLRTILAPT---------RAQVRVLVSGDLHHYARYTG-------EQREL 306

Query: 632 LVNGCGGAFLHPTHVFS-----------NFRKFYGTTYESKAAYPSFEDSSRIALGNILK 680
           +  G GGA+L  TH                 +     Y+  A +P+   S R+  G   +
Sbjct: 307 ITCGGGGAYLLGTHNLPEQLVVPPRETLTRSRSLSRVYDLAARFPAPAASRRLGWGAFRR 366

Query: 681 FRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILRE 715
             ++N  F  + GI++ + + +M        I++ 
Sbjct: 367 VPRRNAGFATMLGIIHTLTMLAMAGAASQGGIIQR 401


>gi|397611057|gb|EJK61150.1| hypothetical protein THAOC_18408 [Thalassiosira oceanica]
          Length = 525

 Score =  176 bits (446), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 90/214 (42%), Positives = 124/214 (57%), Gaps = 36/214 (16%)

Query: 454 LQPPPWYKKDHVAVNKPEVP-----------SGVPELKQYDGPQCYIIPGNHDWFDGLNT 502
           + PPP Y+K+H+++ KP +P                L+ Y+GP  +IIPGNHDWFDGL  
Sbjct: 1   MSPPPSYRKEHISIKKPALPVKGWDVDAVAGDDGDALQNYEGPVTFIIPGNHDWFDGLAA 60

Query: 503 FMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQ----FKFFAELV 558
           + R+I  + WLGGW +PQ+ SYFAL+LPK WW+ G DLAL  DI++ Q    F   A LV
Sbjct: 61  YTRYILSRDWLGGWLIPQRTSYFALKLPKNWWLLGFDLALDDDINIEQWFIDFHLTALLV 120

Query: 559 KEQVGERDSVIIMTHEPNWLLDWY---FNNVSGKNVKHLICDYLKGRCKLRIAGDMHHYM 615
                           P+W+L+ Y    ++    N++ LI  +LK R +LR+AGD+HHY 
Sbjct: 121 ----------------PHWVLEDYEEFKHDEKETNLRELINSFLKNRVRLRLAGDLHHYT 164

Query: 616 RHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSN 649
           RH  +P         L+V+G GGAFLHPTH F +
Sbjct: 165 RH--IPCAERSVQPQLVVSGGGGAFLHPTHTFGD 196


>gi|293337063|ref|NP_001170129.1| uncharacterized protein LOC100384054 [Zea mays]
 gi|224033729|gb|ACN35940.1| unknown [Zea mays]
          Length = 135

 Score =  175 bits (443), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 79/125 (63%), Positives = 102/125 (81%), Gaps = 2/125 (1%)

Query: 119 MGVDLRMNLSLFLTIFLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVIS 178
           MG+DLRMNLS+FLTI+++S++FL+VFH IFLGLWY+G VSR+A K+PE+LTIIQNC VIS
Sbjct: 1   MGLDLRMNLSIFLTIYISSIIFLMVFHTIFLGLWYLGFVSRMAEKKPEMLTIIQNCAVIS 60

Query: 179 VFCCVFYSHCGNRAVLRHRPLERRNSSW--FSLWKKEERNTWLAKFLRMNELKDQVCSSW 236
           + CCVFYSHCGNR V   + ++RR +SW  FSLW K + NT + + LRM++ K Q+CSSW
Sbjct: 61  IACCVFYSHCGNRTVSSDKSIDRRTASWIVFSLWTKHDDNTLIPRLLRMHKFKLQICSSW 120

Query: 237 FAPVG 241
           F PV 
Sbjct: 121 FPPVS 125


>gi|421597903|ref|ZP_16041425.1| hypothetical protein BCCGELA001_10647 [Bradyrhizobium sp.
           CCGE-LA001]
 gi|404269981|gb|EJZ34139.1| hypothetical protein BCCGELA001_10647 [Bradyrhizobium sp.
           CCGE-LA001]
          Length = 586

 Score =  174 bits (441), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 122/381 (32%), Positives = 186/381 (48%), Gaps = 70/381 (18%)

Query: 351 DMRMMQAAMN--KDQEGAQHGDLLYDHLSEKED--LWFDFMADTGDGGNSSYSVARLLAQ 406
           D R++ AA++    +E  +        L+  +D  +W D++AD GDG +S+Y+VA LLA+
Sbjct: 42  DRRLIIAALDTVSPEEHTKRAKDFRSRLTTDKDGAVWIDWVADLGDGFDSTYAVASLLAR 101

Query: 407 PHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVA 466
             +++        LPRG+ L++GGD  YP  S   Y  +L +P+        W   DH  
Sbjct: 102 KELKIGE----VALPRGEALIMGGDEVYPKASREAYMNQLRQPYA-------WAAPDH-- 148

Query: 467 VNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFIC-HKSW-LGGWFMPQKKSY 524
                      + K  +G     IPGNHDW+DGL  F+ + C  K W LG W   Q++SY
Sbjct: 149 -----------DRKDDNGRPVLAIPGNHDWYDGLVLFLAYFCKEKPWHLGAWRSYQRRSY 197

Query: 525 FALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFN 584
           FA+Q+ + WW++  D+ L  D+D  Q  +F ++ +  + E   +I+ + EP WL    + 
Sbjct: 198 FAVQITETWWLWATDIQLADDMDQPQADYFKQIAR-SMPENSKIILCSAEPGWL----YT 252

Query: 585 NVSGKNVKHLICDYLKG---------RCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNG 635
           + + K+ +  I +Y  G            + ++GD HHY R  YV  D   YV     +G
Sbjct: 253 DSNRKSWE--IMEYAAGIALNAGRGHTIPVLLSGDTHHYSR--YVGKDDRQYV----TSG 304

Query: 636 CGGAFLHPTHV--------FSNFRK---------FYGTTYESKAAYPSFEDSSRIALGNI 678
            GGAFLHPTH         F + R+                + AAYPSF  S  + L N+
Sbjct: 305 GGGAFLHPTHQLEQDVAVGFVDHREALKLGAISDHTDAKKSTAAAYPSFSTSKWLTLKNV 364

Query: 679 LKFRKKNWQFDFIGGIVYFVL 699
           L F   NW F  + G +YF+L
Sbjct: 365 L-FAFTNWDFSLLMGAIYFLL 384


>gi|238062276|ref|ZP_04606985.1| hypothetical protein MCAG_03242 [Micromonospora sp. ATCC 39149]
 gi|237884087|gb|EEP72915.1| hypothetical protein MCAG_03242 [Micromonospora sp. ATCC 39149]
          Length = 612

 Score =  174 bits (440), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 114/374 (30%), Positives = 172/374 (45%), Gaps = 68/374 (18%)

Query: 354 MMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTR 413
           +  A ++K +     GD ++  +     LW D++AD GDG N++YSVA LLAQP + V  
Sbjct: 94  LFGAYLDKRELQNAFGDGVFRQVGPDGGLWLDYVADLGDGFNATYSVAYLLAQPELAV-- 151

Query: 414 DDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVP 473
                 LPR   L++GGD  YP+ S   YE R   P++ AL            V  PE P
Sbjct: 152 --DGHRLPRAQTLVMGGDQVYPSASYSAYEDRCKGPYQAALP-----------VTPPERP 198

Query: 474 SGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFICHK--SWLGGWFMPQKKSYFALQLPK 531
           +             + +PGNHDW+DGL  F+R          GGW   Q +SYFA++LP 
Sbjct: 199 T------------LFAVPGNHDWYDGLTAFLRLFVRSRDRHFGGWGTGQSRSYFAVELPA 246

Query: 532 GWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWL-----------LD 580
            WW+ GLD      +D  Q  +F + V +++G R  VI+    P W+           +D
Sbjct: 247 DWWLLGLDDQSGSYLDDPQLTYF-DAVAKRLGPRSRVILAVPAPTWVKAADHPTAYDSID 305

Query: 581 WYFNNVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAF 640
           ++   +               R +L I+GD+HHY R+S     GP   + L+  G GGA+
Sbjct: 306 YFLRTIVAPT---------GARVRLLISGDLHHYARYS-----GP--DRQLITCGSGGAY 349

Query: 641 LHPTHVFS-----------NFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFD 689
           L+PTH              + R      Y+  A++P    S R   G   +  ++N  F 
Sbjct: 350 LYPTHHLPERIEVPPKDTLSRRASRSLPYDLAASFPDRARSRRYGWGVFARLPRRNPGFG 409

Query: 690 FIGGIVYFVLVFSM 703
            + G V+ +L+ +M
Sbjct: 410 TLLGTVHTLLMLAM 423


>gi|367472586|ref|ZP_09472167.1| conserved membrane hypothetical protein [Bradyrhizobium sp. ORS
           285]
 gi|365275198|emb|CCD84635.1| conserved membrane hypothetical protein [Bradyrhizobium sp. ORS
           285]
          Length = 612

 Score =  174 bits (440), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 128/397 (32%), Positives = 192/397 (48%), Gaps = 67/397 (16%)

Query: 332 LFKTVFDLLVSVTVFVGRFDMRMMQAAMNK--DQEGAQHGDLLYDHLS--EKEDLWFDFM 387
           L   + + +V+  +F    D R+M AA++     E  +        LS   K  +W DF+
Sbjct: 23  LLIKLLNNVVTSAMFGQYADRRLMVAALDTVTPDEHMRRATAFARSLSPTTKGPVWIDFV 82

Query: 388 ADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLF 447
           AD GDG +S+Y+VA LL +  + +  +    TLPRG++L++GGD  YP  +A TY  +L 
Sbjct: 83  ADLGDGFDSTYAVACLLGRQTLELGGE----TLPRGEMLIMGGDEVYPLANAQTYRNQLR 138

Query: 448 RPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFI 507
           +P++       W   DH   +   VP              + IPGNHDW+DGL  F+ + 
Sbjct: 139 KPYQ-------WASPDHDKTDDEGVP-------------LFAIPGNHDWYDGLVQFLAYF 178

Query: 508 CHKS--WLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVK-EQVGE 564
              +    G W   QK+SYFA+QL + WW++G+D+ L  ++D  Q  +F  + + +Q+  
Sbjct: 179 TRPTPTHFGSWRTQQKRSYFAIQLTESWWLWGMDIQLAENMDQPQADYFKLIAQSDQLKP 238

Query: 565 RDSVIIMTHEPNWLLDWYFNNVSGKNVKHL-ICDYLKGRCK---------LRIAGDMHHY 614
              +I+ T EP WL        +  N++   I DY+ G  K         L ++GD HHY
Sbjct: 239 GSKIILCTAEPGWLY-------TDTNLRSWEIVDYVLGIAKKADKALTIPLLLSGDTHHY 291

Query: 615 MRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSN--FRKFYGT----------TYESKA 662
            R  Y+  DG  +V     +G GGAFLHPTH        ++ GT            +  A
Sbjct: 292 SR--YIAEDGTQFV----TSGGGGAFLHPTHQLEKNVSVRWTGTKVPLTLADKENSKDPA 345

Query: 663 AYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVL 699
            YPS   +SR  L   L F   NW F     +VYFV+
Sbjct: 346 VYPSMH-ASRELLWRNLWFALTNWDFSIFMALVYFVV 381


>gi|365880544|ref|ZP_09419908.1| conserved membrane hypothetical protein [Bradyrhizobium sp. ORS
           375]
 gi|365291383|emb|CCD92439.1| conserved membrane hypothetical protein [Bradyrhizobium sp. ORS
           375]
          Length = 615

 Score =  173 bits (439), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 128/396 (32%), Positives = 192/396 (48%), Gaps = 65/396 (16%)

Query: 332 LFKTVFDLLVSVTVFVGRFDMRMMQAAMNK--DQEGAQHGDLLYDHLS--EKEDLWFDFM 387
           L   + + +V+  +F    D R+M AA++    +E           LS   K  +W DF+
Sbjct: 23  LLIKLLNNVVTSAMFGQYADRRLMVAALDTVPPEEHMARATAFARSLSPSTKGPVWIDFV 82

Query: 388 ADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLF 447
           AD GDG +S+Y+VA LL    +++   +    LPRG +L++GGD  YP  +A TY  +L 
Sbjct: 83  ADLGDGFDSTYAVACLLGSEALKLGGKE----LPRGQMLIMGGDEVYPLANAQTYRNQLR 138

Query: 448 RPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFI 507
           +P++       W   DH   +   VP              + IPGNHDW+DGL  F+ + 
Sbjct: 139 KPYQ-------WASPDHDKTDDDGVP-------------MFAIPGNHDWYDGLVQFLAYF 178

Query: 508 CHKS--WLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVK-EQVGE 564
              +    G W   QK+SYFALQL + WW++G+D+ L  ++D  Q  +F  + + +Q+  
Sbjct: 179 TRPTPTHFGSWRTQQKRSYFALQLTESWWLWGMDIQLADNMDQPQADYFKLIAQSDQLKP 238

Query: 565 RDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKGRCK---------LRIAGDMHHYM 615
              +I+ T EP WL    + + + ++ +  I DY+ G  K         L ++GD HHY 
Sbjct: 239 GSKIILCTAEPGWL----YTDTNMRSWE--IVDYVLGIAKKADKALTIPLLLSGDTHHYS 292

Query: 616 RHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSN--FRKFYGTTY----------ESKAA 663
           R  Y   DG  +V     +G GGAFLHPTH        ++ GT               AA
Sbjct: 293 R--YHAEDGTQFV----TSGGGGAFLHPTHQLEKNVSVRWTGTKLPLTLADTDEGNKPAA 346

Query: 664 YPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVL 699
           YPS + S  +   N L F   NW F    G+VYFV+
Sbjct: 347 YPSMQTSRGLVWRN-LWFALTNWDFSIFMGLVYFVV 381


>gi|159039913|ref|YP_001539166.1| hypothetical protein Sare_4398 [Salinispora arenicola CNS-205]
 gi|157918748|gb|ABW00176.1| conserved hypothetical protein [Salinispora arenicola CNS-205]
          Length = 608

 Score =  173 bits (438), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 165/573 (28%), Positives = 239/573 (41%), Gaps = 128/573 (22%)

Query: 245 DYPLLSKWVIYGELGNDNGGSSDEI-----SPIYSLWATFIGLYIANYVVERSTGWALTH 299
           D+P        G  G D GG  + +     +P  S            +  E + G     
Sbjct: 4   DHPAGPSGRHLGRHGPDGGGPGEAVPGSGPAPTSS--------PNGPHASEAAGGRPACR 55

Query: 300 PLSVEEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAM 359
           P S +  E +     KP     VPW +     L  T    L+++ +F    D R +Q A 
Sbjct: 56  PRSTDPAE-LGFTPRKP-----VPWLA--PFLLVSTGIRTLLAL-LFGAYLDKRELQTAF 106

Query: 360 NKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFT 419
           +           +   +     +W D++AD GDG +++YSVA LLAQ  + V        
Sbjct: 107 DAK---------ISRQVGPDGGVWLDYVADLGDGFDATYSVAYLLAQRELMV----EGHR 153

Query: 420 LPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPEL 479
           LPR  VL++GGD  YP+ +  TYE R   P++ AL            V  PE P+     
Sbjct: 154 LPRAQVLVMGGDQVYPSAAFDTYEDRCKGPYQAALP-----------VTPPEQPT----- 197

Query: 480 KQYDGPQCYIIPGNHDWFDGLNTFMRFICHKS--WLGGWFMPQKKSYFALQLPKGWWVFG 537
                   + IPGNHDW+DGL  F+R          GGW   Q +SYFA++LP  WW+FG
Sbjct: 198 -------LFAIPGNHDWYDGLTAFLRLFVRSRDRHFGGWNTEQSRSYFAVELPADWWLFG 250

Query: 538 LDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLIC- 596
           LD      +D  Q  +F + V E++G +  VI+    P W+          K  KH    
Sbjct: 251 LDDQSGSYLDDPQLTYFDD-VAERLGPQSRVILAVPMPTWV----------KATKHPTAY 299

Query: 597 ---DYL--------KGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTH 645
              DY           + +L I+GD+HHY R++     GP   + L+  G GGA+L+PTH
Sbjct: 300 DSIDYFIRTIVAPTGAQVRLLISGDLHHYARYA-----GP--DRQLITCGGGGAYLYPTH 352

Query: 646 VFSN----------FRKFYGT-TYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGI 694
           +              R+   T  YE    YP    S R A G  L+   +N  F  + G 
Sbjct: 353 LLPERIQVPPKETLARRASATQVYELAGRYPDVARSRRYAWGAFLRLPLRNPGFTTLLGA 412

Query: 695 VYFVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVA 754
           +Y +LV +M   C      R+D+    LR F                V  A  LL+ ++ 
Sbjct: 413 LYALLVLAMVGVC----TNRDDA---QLRLF---------------SVPLAAMLLVTLLG 450

Query: 755 ITFV--PSKLSRKKRA---MIGVLHVSAHLAAA 782
             F   P   + K+R    ++GV H  AH+A A
Sbjct: 451 AFFFAKPPGSAGKRRLRHWLLGVGHGLAHVALA 483


>gi|260221206|emb|CBA29538.1| hypothetical protein Csp_A12650 [Curvibacter putative symbiont of
           Hydra magnipapillata]
          Length = 609

 Score =  171 bits (433), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 117/364 (32%), Positives = 165/364 (45%), Gaps = 72/364 (19%)

Query: 374 DHLSE---KEDLWFDFMADTGDGGNSSYSVARLLAQPHI--RVTRDDSVFTLPRGDVLLI 428
           DH  E     + WFDF++DTGDGGN++++VAR L    +  + T        PRG +L +
Sbjct: 57  DHAHETGPDGNFWFDFVSDTGDGGNATFTVARALLAEELAPQETGASGTVVFPRGKILYL 116

Query: 429 GGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCY 488
           GGDL YP+ +   Y+ R    FE A                        +    DG   Y
Sbjct: 117 GGDLCYPSANVQEYQYRFLEMFEGARS---------------------AQGADPDGRSAY 155

Query: 489 IIPGNHDWFDGLNTFMRFICHKS--WLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDI 546
            IP NHDWFD ++TF R+  H++   + G   PQ +SYFA +LP  WWV GLD AL  DI
Sbjct: 156 AIPQNHDWFDSISTFKRYFVHRNNGEVCGLKTPQTRSYFATRLPHRWWVLGLDFALAGDI 215

Query: 547 DVYQFKFFAELVKEQVGERDS----------VIIMTHEPNWLLDWYFNNVSGKNVKHLIC 596
           D  Q++ F  L  + +G  D+          ++++  EP W      + + G   ++   
Sbjct: 216 DRGQYEAFRRLAGDAIGGEDNPSQQIQAGDQLVLIYPEPYWTRPIGDSALQGYPKRYQRL 275

Query: 597 DYL---KG-RCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTH------- 645
           + +   KG   ++R+AGD+HHY R S  P D       LL  G GGAFLHPTH       
Sbjct: 276 EAMLEAKGIHIRMRLAGDVHHYSRESSEPLDA-RQDDMLLTCGSGGAFLHPTHARALTQA 334

Query: 646 -----------VFSNFRKFY-----------GTTYESKAAYPSFEDSSRIALGNILKFRK 683
                      + S  R              G +Y  + AYP   DS  ++ GN+L   K
Sbjct: 335 KLKCQASDPDAISSELRTATHIGTVGSTGGAGASYVRQCAYPDPADSRALSRGNVLSLFK 394

Query: 684 KNWQ 687
             W 
Sbjct: 395 FAWN 398


>gi|146341618|ref|YP_001206666.1| hypothetical protein BRADO4720 [Bradyrhizobium sp. ORS 278]
 gi|146194424|emb|CAL78449.1| conserved hypothetical protein; putative membrane protein
           [Bradyrhizobium sp. ORS 278]
          Length = 613

 Score =  169 bits (429), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 129/397 (32%), Positives = 189/397 (47%), Gaps = 67/397 (16%)

Query: 332 LFKTVFDLLVSVTVFVGRFDMRMMQAAMN--KDQEGAQHGDLLYDHLS--EKEDLWFDFM 387
           L   + + +V+   F    D R+M AA++    +E  +        LS   K  +W DF+
Sbjct: 23  LLAKLLNNVVTSAQFGQYADRRLMVAALDTVSPEEHMRRATAFARSLSPTTKGPVWIDFV 82

Query: 388 ADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLF 447
           AD GDG +S+Y+VA LL     R + D     LPRG++L++GGD  YP  +A TY  +L 
Sbjct: 83  ADLGDGFDSTYAVACLLG----RESLDVGSGLLPRGEMLIMGGDEVYPLANAQTYRNQLR 138

Query: 448 RPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFI 507
           +P++       W   DH   +   VP              + IPGNHDW+DGL  F+ + 
Sbjct: 139 KPYQ-------WASPDHDKTDDDGVP-------------LFAIPGNHDWYDGLVQFLAYF 178

Query: 508 CHKS--WLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVK-EQVGE 564
              +    G W   QK+SYFA+QL + WW++ +D+ L  D+D  Q  +F  + + +Q+  
Sbjct: 179 TRPTPTHFGSWRTQQKRSYFAIQLTESWWLWAMDIQLADDMDQPQADYFKLIAQSDQLKP 238

Query: 565 RDSVIIMTHEPNWLLDWYFNNVSGKNVKHL-ICDYLKGRCK---------LRIAGDMHHY 614
              +I+ T EP WL        +  N++   I DY+    +         L ++GD HHY
Sbjct: 239 GSKIILCTAEPGWLY-------TDTNMRSWGIVDYVLSIARRADKALTIPLLLSGDTHHY 291

Query: 615 MRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSN--FRKFYGTTY----------ESKA 662
            R  Y   DG  +V     +G GGAFLHPTH        ++ GT            +  A
Sbjct: 292 SR--YHAEDGTQFV----TSGGGGAFLHPTHQLETNVSVRWTGTKVPLTLAGKDGGKEPA 345

Query: 663 AYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVL 699
           AYPS +  SR  L   L F   NW F    G+VYFV+
Sbjct: 346 AYPSMQ-ISRDLLWRNLWFALTNWDFSIFLGLVYFVV 381


>gi|145596528|ref|YP_001160825.1| hypothetical protein Strop_4017 [Salinispora tropica CNB-440]
 gi|145305865|gb|ABP56447.1| hypothetical protein Strop_4017 [Salinispora tropica CNB-440]
          Length = 605

 Score =  169 bits (428), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 134/429 (31%), Positives = 188/429 (43%), Gaps = 93/429 (21%)

Query: 382 LWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFT 441
           LW D++AD GDG +++YSVA LLAQ  + V        LPR  VL++GGD  YP+    T
Sbjct: 117 LWLDYVADVGDGFDATYSVAYLLAQRELTV----DGHRLPRAQVLVMGGDQVYPSADFET 172

Query: 442 YERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLN 501
           YE R   P++ AL            V  PE P+             + IPGNHDW DGL 
Sbjct: 173 YEDRCKGPYQAAL-----------PVTPPEQPT------------LFAIPGNHDWHDGLT 209

Query: 502 TFMRFICHKS--WLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVK 559
            F+R         LGGW   Q +SYFA++LP  WW+ GLD      +D  Q  +F + V 
Sbjct: 210 AFLRLFVRSRDRHLGGWNTEQSRSYFAVELPANWWLLGLDDQSGSYLDDPQLSYF-DSVA 268

Query: 560 EQVGERDSVIIMTHEPNWL-----------LDWYFNNVSGKNVKHLICDYLKGRCKLRIA 608
           +++G +  VI+    P W+           +D++   +               + +L I+
Sbjct: 269 QRLGPQSRVILAVPMPAWIKATKDPSAYDSIDYFIRTIIAPT---------GAQVRLLIS 319

Query: 609 GDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSN-----------FRKFYGTT 657
           GD HHY R++     GP   + L+  G GGA+L+PTH+               R     +
Sbjct: 320 GDQHHYARYA-----GP--DRQLITCGGGGAYLYPTHLLPERIQVPPAETLARRASAPQS 372

Query: 658 YESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNHILREDS 717
           YE    YP    S R A G   +   +N  F  + GI+Y +L+ SM   C  NH      
Sbjct: 373 YELAGCYPEATRSRRYAWGIFPRLPWRNRGFATLLGILYTLLILSMVGVCT-NHD----- 426

Query: 718 FSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRA----MIGVL 773
            S  LR F              S    A  L+ L  A+ F  +  S  KR     ++G+ 
Sbjct: 427 -SAQLRLF--------------SVPLVAMVLVTLTGAVLFAKAPGSGGKRRVRHWLLGLG 471

Query: 774 HVSAHLAAA 782
           H  AHL  A
Sbjct: 472 HGLAHLGLA 480


>gi|332669812|ref|YP_004452820.1| hypothetical protein Celf_1298 [Cellulomonas fimi ATCC 484]
 gi|332338850|gb|AEE45433.1| hypothetical protein Celf_1298 [Cellulomonas fimi ATCC 484]
          Length = 611

 Score =  169 bits (427), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 137/447 (30%), Positives = 197/447 (44%), Gaps = 70/447 (15%)

Query: 331 DLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADT 390
           +L +T   + ++ +VF    D R +QAA+         G L+        D+W DF+AD 
Sbjct: 28  ELARTALKVALA-SVFTAYGDRREVQAALP--------GGLVEPAHRPTGDVWLDFVADL 78

Query: 391 GDGGNSSYSVARLLAQPHIRVTRDDSV--FTLPRGDVLLIGGDLAYPNPSAFTYERRLFR 448
           GDG +++Y+VA LLAQP + V R        LPRG VL++GGD  YP  SA  YE R   
Sbjct: 79  GDGFDATYTVATLLAQPELEVARPGGGPDLALPRGHVLVMGGDEVYPAASARGYEHRTLG 138

Query: 449 PFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMR-FI 507
           P+  AL               P  P G P L     P    IPGNHDW+DGL  ++R F 
Sbjct: 139 PYGAAL---------------PAPPPGTP-LDATTEPTLLAIPGNHDWYDGLTAWLRVFT 182

Query: 508 CHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDS 567
           C  S +G W   Q++SYFA++L  GWW+ GLD  L   +D  Q  +F   V   +   D+
Sbjct: 183 CGAS-VGAWRTVQRRSYFAVRLAPGWWLLGLDSQLDEYVDGPQLDYFRTHVTAHLRPGDA 241

Query: 568 VIIMTHEPNWLLDW----YFNN---VSGKNVKHLIC------DYLKGRCKLRIAGDMHHY 614
           V++   EP W         FN    V  + V+H         +    R +L I+GD HHY
Sbjct: 242 VVVCAAEPAWAKAGSDPDAFNQLHFVEREVVRHRRVPGRREPEETGARVRLWISGDSHHY 301

Query: 615 MRHSYVP------SDGPVYVQHLLVNGCGGAFLH------------PTHVFSNFRKFYGT 656
            R++  P      + G       +  G GGA+L             P       +   GT
Sbjct: 302 SRYAERPPAGTATTPGDARAVQAVTCGLGGAYLGDTLHLPAAVELPPAASRMRTKASPGT 361

Query: 657 TYE-SKAAYPSFEDSSRI--ALGNILK---FRKKNWQFDFIGGIVYFVLVFSMFPQCELN 710
            ++ +   YP  ++S R+   + N        ++N       G+V  VLV  +     + 
Sbjct: 362 WFDRAGPTYPPQDESRRLRRRVANPASRWWAGRRNPGLLATAGVVQLVLVGCL---AGVL 418

Query: 711 HILREDSFSGHLRSFFGTVWNAFMYVL 737
            +LR  S +  LR   G    A + VL
Sbjct: 419 GVLRGGSPT-LLRGPAGAATEAALGVL 444


>gi|365887679|ref|ZP_09426504.1| conserved membrane hypothetical protein [Bradyrhizobium sp. STM
           3809]
 gi|365336713|emb|CCD99035.1| conserved membrane hypothetical protein [Bradyrhizobium sp. STM
           3809]
          Length = 609

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 126/399 (31%), Positives = 185/399 (46%), Gaps = 71/399 (17%)

Query: 332 LFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGA--QHGDLLYDHLSEKED--LWFDFM 387
           L   + + +++  +F    D R+M AA++        +    L D      D  +W DF+
Sbjct: 23  LLIKLLNNVITSAMFGHYADRRLMVAALDTVAPAVHLKRATALVDQFGPTRDGPVWLDFV 82

Query: 388 ADTGDGGNSSYSVARLLAQPHIRVTRDDSVFT---LPRGDVLLIGGDLAYPNPSAFTYER 444
           AD GDG +S+Y+VA LLA       RD   F    LPRG +L++GGD  YP  +A TY+ 
Sbjct: 83  ADLGDGFDSTYAVACLLA-------RDSLTFDGEPLPRGQILVMGGDEVYPYANAQTYKN 135

Query: 445 RLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFM 504
           +L RP++       W   DH   +   VP              + IPGNHDW+DGL+ F+
Sbjct: 136 QLRRPYQ-------WASPDHDKTDDRGVP-------------LFAIPGNHDWYDGLSQFL 175

Query: 505 RFICHKS--WLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVK-EQ 561
            +    +    G W   Q +SYFA+QL + WW++ +D+ L  D+D  Q  +F  + + +Q
Sbjct: 176 AYFTRPTPTHFGSWRTRQSRSYFAVQLTRTWWIWAMDIQLADDMDQPQADYFNLIAQSDQ 235

Query: 562 VGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKG---------RCKLRIAGDMH 612
           +     +I+ T EP WL    + + + K+ +  I DY  G            L ++GD H
Sbjct: 236 LLPGSRIILCTAEPGWL----YTDTNTKSWE--IVDYALGIAAKANKQLTIPLLLSGDTH 289

Query: 613 HYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTY------------ES 660
           HY R  Y   DG  +V     +G GGAFLHPTH   +      T              + 
Sbjct: 290 HYSR--YQAGDGTQFV----TSGGGGAFLHPTHQLKSDVSVRWTDRTVPLTLGRRNGGQE 343

Query: 661 KAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVL 699
            AAYP    +SR  L   L F   NW F    G+VY  +
Sbjct: 344 PAAYPPMP-TSRSLLWRNLWFALTNWDFSIFMGLVYVAV 381


>gi|330465266|ref|YP_004403009.1| metallophosphoesterase [Verrucosispora maris AB-18-032]
 gi|328808237|gb|AEB42409.1| metallophosphoesterase [Verrucosispora maris AB-18-032]
          Length = 606

 Score =  166 bits (421), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 119/397 (29%), Positives = 179/397 (45%), Gaps = 61/397 (15%)

Query: 322 VPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKED 381
           VPW +     L  T    L+++ +F    D R +Q A++ +         +   +     
Sbjct: 68  VPWLAPLL--LISTGLRTLLAM-LFGAYLDKRELQKALDSE---------IAKQVGPDGG 115

Query: 382 LWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFT 441
           LW D++AD GDG N++YSVA LLAQP + V        LPR   L++GGD  YP+ +   
Sbjct: 116 LWLDYVADLGDGFNATYSVAYLLAQPELTV----DGHRLPRAQTLVMGGDQVYPSAAYEA 171

Query: 442 YERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLN 501
           YE R   P++ AL                  P+  PE      P  + +PGNHDW+DGL 
Sbjct: 172 YEDRCKGPYQAAL------------------PTTPPEQ-----PSVFAVPGNHDWYDGLT 208

Query: 502 TFMRFI--CHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVK 559
            F+R          GGW   Q +SYFA +LP  WW+ GLD      +D  Q  +F + V 
Sbjct: 209 AFLRLFVRTRDRHFGGWRTGQSRSYFAAELPADWWLLGLDDQSGSYVDDPQLTYF-DTVA 267

Query: 560 EQVGERDSVIIMTHEPNWL--LDWYFNNVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRH 617
            ++G +  VI+    P W+  +D      S       I      R ++ I+GD+HHY R+
Sbjct: 268 RKLGPQSKVILAVPAPAWVKAVDSPSAYDSIDYFIRTIIAPTGARVRVLISGDLHHYARY 327

Query: 618 SYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSN-----------FRKFYGTTYESKAAYPS 666
           +  P D     + L+  G GGA+L+PTH                R     +Y   A YP 
Sbjct: 328 TG-PDD-----RQLITCGSGGAYLYPTHKLPEQIEVPPRDTLARRSSPSRSYALAARYPD 381

Query: 667 FEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSM 703
              S R   G   +  ++N  F  + G+++ +L+ S+
Sbjct: 382 AARSRRYGWGIFGRLPRRNPGFATLLGVLHTLLMLSI 418


>gi|171059864|ref|YP_001792213.1| hypothetical protein Lcho_3190 [Leptothrix cholodnii SP-6]
 gi|170777309|gb|ACB35448.1| conserved hypothetical membrane spanning protein [Leptothrix
           cholodnii SP-6]
          Length = 661

 Score =  165 bits (418), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 111/358 (31%), Positives = 154/358 (43%), Gaps = 59/358 (16%)

Query: 374 DHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLA 433
           + L   + +W D++ADTGDG +++Y+VA  LA+  I +        LPR D+LL+GGD  
Sbjct: 84  ERLRNADSVWIDYIADTGDGWDATYTVAHTLARDAIEIDGR----RLPRADLLLMGGDQV 139

Query: 434 YPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGN 493
           YP P+   Y  RL  PF  AL               P  P+G         P     PGN
Sbjct: 140 YPTPAGSAYRTRLVDPFGSAL---------------PGRPAGADPQLDPAAPLLLATPGN 184

Query: 494 HDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKF 553
           HDW+DGL  F    C    LGGW   Q+ SYFA+QLP GWW++GLDL L  ++D  Q  +
Sbjct: 185 HDWYDGLRGFNNLFCSGQNLGGWQSFQRGSYFAVQLPHGWWLWGLDLQLESELDGPQRDY 244

Query: 554 F-AELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHL------------------ 594
           F A+  K   G R  V+++  EP+W+ +  F   +  +   L                  
Sbjct: 245 FTAQATKLDAGAR--VLLLVPEPSWIDEGSFRASTQNDSDKLRTIEVQRARTRGWKEIES 302

Query: 595 ICDYLKGRCKLRIAGDMHHYMRH-------SYVPSDGPVYVQHLLVNGCGGAFLHPTHVF 647
           +     GR    +AGD+HHY R+           +  P+     +  G GGAF+  TH  
Sbjct: 303 MVATRGGRVAATLAGDLHHYARYAPAAGPTDAPAAAAPLAAPQRITCGGGGAFMLGTHHL 362

Query: 648 SN----FRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVF 701
                  R+         A YP        A+G     R + WQ  F        L F
Sbjct: 363 PGELQMGRRGQPAVQRLAATYP--------AIGESRSLRNRAWQLPFTNPTFGLTLGF 412


>gi|20804241|emb|CAD31267.1| HYPOTHETICAL PROTEIN [Mesorhizobium loti R7A]
          Length = 624

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 111/343 (32%), Positives = 167/343 (48%), Gaps = 56/343 (16%)

Query: 382 LWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFT 441
           +W DF+AD GDG +++Y++A L++Q  + V    +     RG +L++GGD  YP  ++ T
Sbjct: 117 VWVDFVADLGDGFDATYAIASLISQEKLTVGGQQTR----RGQLLVMGGDEVYPRAASET 172

Query: 442 YERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLN 501
           Y+++L  P+++A                   P   P L +  GP  Y +PGNHDW+DGL 
Sbjct: 173 YQKQLRDPYDWAF------------------PDPNPGLLK--GPPVYAVPGNHDWYDGLV 212

Query: 502 TFMRFICHKSW--LGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVK 559
            F+   C K    LGGW   Q++SYFA+QL   WW++ +D  L  D+D  Q ++F E+ K
Sbjct: 213 LFLALFCRKDHMHLGGWRTHQRRSYFAVQLTDQWWLWAIDAQLVDDVDQPQKEYFLEIAK 272

Query: 560 EQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKGRCKLRI----AGDMHHYM 615
             + +   +I+   EP WL      N + + + ++    LK R  L I    +GD HHY 
Sbjct: 273 A-MPDNAKIILCGPEPGWLYTGKAGNRALRVMSYIGSIALKQRRGLTIPLVLSGDTHHYS 331

Query: 616 RHSYVPSDGPVYVQHLLVNGCGGAFLHPTH----------VFSNFRKFYG---------T 656
           R  YV  DG       + +G GGAFLHPTH          V   +    G          
Sbjct: 332 R--YVADDGSA---QFVTSGGGGAFLHPTHQVEPSVNLDRVSDGYSWLSGQIKGLRLGAN 386

Query: 657 TYESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVL 699
               +A YPS  DS  +  GN + F   N  F  + G  Y+++
Sbjct: 387 EQGREAVYPSKADSLSMLRGNFV-FVAYNPAFALVLGSAYWLI 428


>gi|291303735|ref|YP_003515013.1| calcineurin-like phosphoesterase family protein [Stackebrandtia
           nassauensis DSM 44728]
 gi|290572955|gb|ADD45920.1| calcineurin-like phosphoesterase family protein [Stackebrandtia
           nassauensis DSM 44728]
          Length = 549

 Score =  159 bits (403), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 126/418 (30%), Positives = 192/418 (45%), Gaps = 66/418 (15%)

Query: 304 EEYEKMKKKQLKPEFLDMVPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQ 363
           E    +  KQL+ + L  V W S     +       LV   V     D R +QA      
Sbjct: 3   ERPTDLTPKQLRFQRLPAVRWLS---TRVLSGTAVRLVMARVLGAFLDKRELQAY----- 54

Query: 364 EGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDD----SVFT 419
             AQ G  ++DH S  E+LW D++AD GDG +++YSVA L +QP +   +         T
Sbjct: 55  --AQQG--VFDH-SAGEELWIDYVADAGDGFDATYSVAYLTSQPELTPQKSRHGHAPSQT 109

Query: 420 LPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPEL 479
           LPRG++L++GGD  YP  +   YE R   P++ AL                  P G    
Sbjct: 110 LPRGNILVMGGDQVYPAANWRDYENRCKGPYQAAL------------------PKG---- 147

Query: 480 KQYDGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLD 539
              DG   Y IPGNHDW+DGL  F+R    +  +GG    Q++SY+A +LP  WW+ G+D
Sbjct: 148 ---DG-SLYAIPGNHDWYDGLTAFLRLFGAERDIGGRSTYQRRSYWAARLPHRWWLVGID 203

Query: 540 LALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSG-KNVKHLICDY 598
                 +D  Q  +F E   EQ+   D VI+    P+W   W  ++        + I  +
Sbjct: 204 AQFDAYLDSPQLAYFTEAF-EQMEPGDPVILCVPRPSWT--WTDSDPRAFDRTDYFIRTF 260

Query: 599 LK---GRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKF-- 653
           ++   GR  L + GD HHY+ +  V  +     +HL+  G GGA+L  TH   +  +   
Sbjct: 261 IEPRGGRVPLILTGDRHHYVHYEEVDGE-----RHLITAGGGGAYLSGTHTMPDELQAPP 315

Query: 654 ------YGTT---YESKAAYPSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFS 702
                 +G+    Y+ K ++P    S + AL    +   +N  F  + GI++ + + S
Sbjct: 316 PESMARHGSETRLYKRKGSFPDRAKSWQQALKIYWRMPIRNPSFVALLGIIHMLGLLS 373


>gi|326521988|dbj|BAK04122.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 322

 Score =  159 bits (403), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 75/115 (65%), Positives = 96/115 (83%)

Query: 692 GGIVYFVLVFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLL 751
           GG +YF+LVFSMFPQC L HIL E+++SG L+SF  T+W+A +Y+ EHSYVS  G+L LL
Sbjct: 1   GGFIYFILVFSMFPQCNLVHILNEETWSGRLQSFSSTIWSALLYIFEHSYVSSVGSLTLL 60

Query: 752 IVAITFVPSKLSRKKRAMIGVLHVSAHLAAALILMLLLELGVETCIQHKLLATSG 806
           + + +FVPSKL+RKKRA+IG LHV AHL AAL+LMLL+ELG+E CI++ LLATSG
Sbjct: 61  MASYSFVPSKLTRKKRAIIGGLHVVAHLTAALVLMLLMELGIEICIRNHLLATSG 115


>gi|91976656|ref|YP_569315.1| hypothetical protein RPD_2179 [Rhodopseudomonas palustris BisB5]
 gi|91683112|gb|ABE39414.1| hypothetical protein RPD_2179 [Rhodopseudomonas palustris BisB5]
          Length = 590

 Score =  159 bits (402), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 122/396 (30%), Positives = 188/396 (47%), Gaps = 60/396 (15%)

Query: 332 LFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGA--QHGDLLYDHLS-EKEDLWFDFMA 388
           L K + +++VS ++F    D R++ AA++  +     Q  +     L  + +++W D++A
Sbjct: 24  LLKLINNVVVS-SIFGQYADRRLIIAALDTVEPAVHLQRAEAFRGKLDGDGDEIWIDWVA 82

Query: 389 DTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFR 448
           D GDG +S+Y+VA LL+   + V  +     LPRG  L++GGD  YP  S   Y  +L +
Sbjct: 83  DLGDGFDSTYAVASLLSAKTLIV--EGVTTPLPRGQALIMGGDEVYPLASRQAYYNQLRQ 140

Query: 449 PFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFIC 508
           P+        W   DH   +      G P L          IPGNHDW+DGL  F+   C
Sbjct: 141 PYA-------WASPDHDKSDD----KGRPVLA---------IPGNHDWYDGLVLFLALFC 180

Query: 509 -HKSW-LGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERD 566
             K W +G     Q++SYFA++L + WW++  D+ L  D+D  Q  +F   + + + E  
Sbjct: 181 KEKPWHIGSLRTCQRRSYFAVRLTEDWWLWATDVQLLDDMDKPQADYFTT-IAQGMPENS 239

Query: 567 SVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKGRCK-----LRIAGDMHHYMRHSYVP 621
           ++I+ + EP WL  +   N S   +        +   K     L ++GD HHY R  Y  
Sbjct: 240 NIILCSAEPGWL--YTDTNRSSWEIMEYAAGIARKAGKNHSIPLILSGDTHHYSR--YES 295

Query: 622 SDGPVYVQHLLVNGCGGAFLHPTH--------VFSNFRKFY---------GTTYESKAAY 664
             G    +  + +G GGAFLHPTH        VF + R+                ++AAY
Sbjct: 296 DRG----RQFITSGGGGAFLHPTHQLEKEVSVVFVDHRETLKLGVMTDRSDAKKTTEAAY 351

Query: 665 PSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLV 700
           PSF  S  +  GNI  F   NW F  + G +YF+  
Sbjct: 352 PSFSKSRALTFGNIF-FAFTNWDFSLLMGAIYFLFA 386


>gi|192290722|ref|YP_001991327.1| hypothetical protein Rpal_2336 [Rhodopseudomonas palustris TIE-1]
 gi|192284471|gb|ACF00852.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1]
          Length = 590

 Score =  158 bits (400), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 121/396 (30%), Positives = 188/396 (47%), Gaps = 60/396 (15%)

Query: 332 LFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGA--QHGDLLYDHLS-EKEDLWFDFMA 388
           L K + +++VS ++F    D R++ AA++  +     Q  +     L  + +++W D++A
Sbjct: 24  LLKLINNVVVS-SIFGQYADRRLIIAALDTVEPAVHLQRAEAFRGKLDGDGDEIWIDWVA 82

Query: 389 DTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFR 448
           D GDG +S+Y+VA LL+   + V  +     LPRG  L++GGD  YP  S   Y  +L +
Sbjct: 83  DLGDGFDSTYAVASLLSAKTLTV--EGVTTPLPRGQALIMGGDEVYPLASRQAYYNQLRQ 140

Query: 449 PFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFIC 508
           P+        W   DH   +      G P L          IPGNHDW+DGL  F+   C
Sbjct: 141 PYA-------WASPDHDKSDD----KGRPVLA---------IPGNHDWYDGLVLFLALFC 180

Query: 509 -HKSW-LGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERD 566
             K W +G     Q++SYFA++L + WW++  D+ L  D+D  Q  +F   + + + E  
Sbjct: 181 KEKPWHIGSLRTCQRRSYFAVRLTEDWWLWATDVQLLDDMDKPQADYFTT-IAQGMPENS 239

Query: 567 SVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKGRCK-----LRIAGDMHHYMRHSYVP 621
           ++I+ + EP WL  +   N S   +        +   K     L ++GD HHY R  Y  
Sbjct: 240 NIILCSAEPGWL--YTDTNRSSWEIMEYAAGIARKAGKNHSIPLILSGDTHHYSR--YES 295

Query: 622 SDGPVYVQHLLVNGCGGAFLHPTH--------VFSNFRKFY---------GTTYESKAAY 664
             G    +  + +G GGAFLHPTH        +F + R+                ++AAY
Sbjct: 296 DRG----RQFITSGGGGAFLHPTHQLEKEVSVLFVDHRETLKLGVMTDRSDAKRTTEAAY 351

Query: 665 PSFEDSSRIALGNILKFRKKNWQFDFIGGIVYFVLV 700
           PSF  S  +  GNI  F   NW F  + G +YF+  
Sbjct: 352 PSFSKSRALTFGNIF-FAFTNWDFSLLMGAIYFLFA 386


>gi|443288351|ref|ZP_21027445.1| Conserved hypothetical protein (Metallo-dependent phosphatases)
           [Micromonospora lupini str. Lupac 08]
 gi|385888681|emb|CCH15519.1| Conserved hypothetical protein (Metallo-dependent phosphatases)
           [Micromonospora lupini str. Lupac 08]
          Length = 578

 Score =  155 bits (392), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 114/372 (30%), Positives = 161/372 (43%), Gaps = 60/372 (16%)

Query: 332 LFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTG 391
           L  T    L+++ +F    D R +Q A          GD ++        LW D++AD G
Sbjct: 49  LISTGIRTLLAI-LFGAYLDKRELQNAF---------GDDIFRQAGPDGGLWLDYVADLG 98

Query: 392 DGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFE 451
           DG +++YSVA LLAQP + V        LPR   L++GGD  YP+ +   YE R   P++
Sbjct: 99  DGFHATYSVAYLLAQPELTV----DGHRLPRAQTLVMGGDQVYPSAAYAEYENRCKGPYQ 154

Query: 452 YALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFI--CH 509
            AL               P  P         D P  + +PGNHDW+DGL  F+R      
Sbjct: 155 AAL---------------PATPP--------DRPTLFAVPGNHDWYDGLTAFLRLFVRSR 191

Query: 510 KSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVI 569
                GW   Q +SYFA++LP GWW+ G+D      +D  Q  +F E+ +    E   VI
Sbjct: 192 DRNFAGWRTGQSRSYFAVELPAGWWLLGVDDQSGSYLDDPQLAYFDEVARRLTPE-SKVI 250

Query: 570 IMTHEPNWL--LDWYFNNVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVY 627
           I    P W+   D      S       I D    + +L ++GD+HHY R++     GP  
Sbjct: 251 IAAPSPTWVKAADDPTAYDSIDYFVRTIIDPTGAQVRLLLSGDLHHYARYA-----GP-- 303

Query: 628 VQHLLVNGCGGAFLHPTHVFS-----------NFRKFYGTTYESKAAYPSFEDSSRIALG 676
            + L+  G GGA+L+PTH                R      Y+  A YP    S R   G
Sbjct: 304 DRQLITCGGGGAYLYPTHKLPERIEVPPRDTLTRRASRTQPYDLVARYPDAARSRRYGWG 363

Query: 677 NILKFRKKNWQF 688
              +   +N  F
Sbjct: 364 IFARLPFRNPGF 375


>gi|433649674|ref|YP_007294676.1| hypothetical protein Mycsm_05062 [Mycobacterium smegmatis JS623]
 gi|433299451|gb|AGB25271.1| hypothetical protein Mycsm_05062 [Mycobacterium smegmatis JS623]
          Length = 611

 Score =  155 bits (392), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 136/479 (28%), Positives = 204/479 (42%), Gaps = 69/479 (14%)

Query: 322 VPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKED 381
           V W+S  +  L +    +++S   F    D R MQ+ +      A  G          +D
Sbjct: 25  VRWFSPPT--LARAAAKVVLSA-AFGDYLDKREMQSTLESRVLRAGAG---------SDD 72

Query: 382 LWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFT 441
           +W DF+ADT DG N +YSVA   +Q  I     D    LPR  +++ GGD  YP  +   
Sbjct: 73  VWIDFIADTADGFNQTYSVAWCASQREIAPIGLDR--KLPRAQLIVFGGDEVYPYATPKE 130

Query: 442 YERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLN 501
           YE R   P+  AL   PW + D          S  P+ K     +   IPGNHDW+DGL 
Sbjct: 131 YEDRFSGPYLAAL---PWTEPD----------SSRPQEKHA---RMLAIPGNHDWYDGLT 174

Query: 502 TFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQ 561
            FMR      W+GG  + Q +SYFA+ LP  +W++G+D+     +D  Q K+F +    Q
Sbjct: 175 GFMRLFAQADWIGGRELEQSRSYFAVDLPGPFWLWGIDIQSDNYVDALQIKYF-KTAATQ 233

Query: 562 VGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKG--RCKLRIAGDMHHYMRHSY 619
           +   D++I+ T +P+W  D      + +N+  +    +    R  L ++GD HHY R+  
Sbjct: 234 MTPDDALILCTAKPSW-TDVRDAKDAYRNLAFVERTMVPPGVRTILMLSGDKHHYARYEA 292

Query: 620 VPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRKFYGTTYESKAA------------YPSF 667
               GP   +  +  G GGAFL  T   ++         +   A            YPS 
Sbjct: 293 AKDVGPEGPRMRVTAGGGGAFLSTTQKLADPANVPRPIGDDNPATDDTEPFNLACRYPSL 352

Query: 668 EDSSRIALGNILKFRKKNWQFDFIGGIVYFVLVFSMFPQCELNH------ILRED-SFSG 720
             S R+  G+I     +N  F  I  I+Y +L  S   +    +      I RE   F  
Sbjct: 353 GQSRRLN-GHIFSLGLRNPWFMLIPAIMYVLLFVSSVSRLGQQNNEITLDIKREPFGFQD 411

Query: 721 HLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKL----SRKKRAMIGVLHV 775
            L S  G    A ++           A+  ++ A   VP ++    SR  R + GV   
Sbjct: 412 FLVSAMGVTTLAIVF-----------AVAAILSAFYIVPKRVNVGKSRMYRGLAGVTQT 459


>gi|302864965|ref|YP_003833602.1| metallophosphoesterase [Micromonospora aurantiaca ATCC 27029]
 gi|302567824|gb|ADL44026.1| metallophosphoesterase [Micromonospora aurantiaca ATCC 27029]
          Length = 576

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 120/409 (29%), Positives = 176/409 (43%), Gaps = 77/409 (18%)

Query: 305 EYEKMKKKQLKPEFLDM-----VPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAM 359
           E    + + + P  L       VPW +     L  T    L+++ +F    D R +Q A 
Sbjct: 17  ERATRRPRSIDPRELGFTPRRPVPWLAPLL--LISTGLRTLLAM-LFGAYLDKRELQNAF 73

Query: 360 NKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFT 419
           +         D  +  +     LW D++AD GDG +++YSVA LLAQP + V       T
Sbjct: 74  S---------DGTFRQVGPDGGLWLDYVADLGDGFDATYSVAYLLAQPELTV----DGHT 120

Query: 420 LPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPEL 479
           LPR   L++GGD  YP      YE R   P++ AL                  P   PE 
Sbjct: 121 LPRAQTLVMGGDQVYPAAGYEAYEDRCKGPYQAAL------------------PVAPPEQ 162

Query: 480 KQYDGPQCYIIPGNHDWFDGLNTFMRFI--CHKSWLGGWFMPQKKSYFALQLPKGWWVFG 537
                P+ + +PGNHDW+DGL  F+R          GGW   Q +SYFA++LP GWW+ G
Sbjct: 163 -----PKLFAVPGNHDWYDGLTAFLRLFVRTRDRHFGGWGTGQSRSYFAVELPAGWWLLG 217

Query: 538 LDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICD 597
           LD      +D  Q  +F E+ ++   E   VI+    P W+       V   N    I  
Sbjct: 218 LDDQSGSYLDDPQLTYFDEVARKLTPE-SKVILAVPAPTWV-----KAVDHPNAYDSIDY 271

Query: 598 YLK-------GRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSN- 649
           +++        + ++ ++GD+HHY R  Y  +D     + L+  G GGA+L+PTH     
Sbjct: 272 FIRTLVAPTGAQVRVLVSGDLHHYAR--YAGTD-----RQLITCGGGGAYLYPTHRLPEK 324

Query: 650 ---------FRKFYGT-TYESKAAYPSFEDSSRIALGNILKFRKKNWQF 688
                     R+   T  Y+  A YP    S R   G   +   +N  F
Sbjct: 325 LEVPPRDTLTRRASNTREYDLAARYPDKARSRRYGWGIFARLPFRNPGF 373


>gi|315501250|ref|YP_004080137.1| metallophosphoesterase [Micromonospora sp. L5]
 gi|315407869|gb|ADU05986.1| metallophosphoesterase [Micromonospora sp. L5]
          Length = 576

 Score =  152 bits (384), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 119/409 (29%), Positives = 175/409 (42%), Gaps = 77/409 (18%)

Query: 305 EYEKMKKKQLKPEFLDM-----VPWYSGTSADLFKTVFDLLVSVTVFVGRFDMRMMQAAM 359
           E    + + + P  L       VPW +     L  T    L+++ +F    D R +Q A 
Sbjct: 17  ERATRRPRSIDPRELGFTPRRPVPWLAPLL--LISTGLRTLLAM-LFGAYLDKRELQNAF 73

Query: 360 NKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFT 419
           +         D  +  +     LW D++AD GDG +++YSVA LLAQP + V       T
Sbjct: 74  S---------DGTFRQVGPDGGLWLDYVADLGDGFDATYSVAYLLAQPELTV----DGHT 120

Query: 420 LPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPEL 479
           LPR   L++GGD  YP      YE R   P++ AL                  P   PE 
Sbjct: 121 LPRAQTLVMGGDQVYPAAGYEAYEDRCKGPYQAAL------------------PVAPPEQ 162

Query: 480 KQYDGPQCYIIPGNHDWFDGLNTFMRFI--CHKSWLGGWFMPQKKSYFALQLPKGWWVFG 537
                P+ + +PGNHDW+DGL  F+R           GW   Q +SYFA++LP GWW+ G
Sbjct: 163 -----PKLFAVPGNHDWYDGLTAFLRLFVRTRDRHFAGWGTGQSRSYFAVELPAGWWLLG 217

Query: 538 LDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICD 597
           LD      +D  Q  +F E+ ++   E   VI+    P W+       V   N    I  
Sbjct: 218 LDDQSGSYLDDPQLTYFDEVARKLTPE-SKVILAVPAPTWV-----KAVDHPNAYDSIDY 271

Query: 598 YLK-------GRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSN- 649
           +++        + ++ ++GD+HHY R  Y  +D     + L+  G GGA+L+PTH     
Sbjct: 272 FIRTLVAPTGAQVRVLVSGDLHHYAR--YAGTD-----RQLITCGGGGAYLYPTHRLPEK 324

Query: 650 ---------FRKFYGT-TYESKAAYPSFEDSSRIALGNILKFRKKNWQF 688
                     R+   T  Y+  A YP    S R   G   +   +N  F
Sbjct: 325 LEVPPRDTLTRRASNTREYDLAARYPDKARSRRYGWGIFARLPFRNPGF 373


>gi|375107756|ref|ZP_09754017.1| hypothetical protein BurJ1DRAFT_4484 [Burkholderiales bacterium
           JOSHI_001]
 gi|374668487|gb|EHR73272.1| hypothetical protein BurJ1DRAFT_4484 [Burkholderiales bacterium
           JOSHI_001]
          Length = 630

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 99/281 (35%), Positives = 146/281 (51%), Gaps = 36/281 (12%)

Query: 383 WFDFMADTGDGGNSSYSVAR--LLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAF 440
           WFDF+ADTGDGGN+SY+VAR  L AQ    V  + +   LP   +L++GGDLAYP+ S  
Sbjct: 72  WFDFLADTGDGGNASYTVARGALAAQ----VQAEGAEHLLPEARLLVLGGDLAYPSASPE 127

Query: 441 TYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQC------YIIPGNH 494
            Y+ RL   FE             +A ++    + VP    +D P          IP NH
Sbjct: 128 LYQSRLVEMFE-------------LARDRASRFADVPRDLAHDAPIAPSHKLVAAIPQNH 174

Query: 495 DWFDGLNTFMRFICHKSWLG--GWFMPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFK 552
           DWFD  +TF R+  +    G  G   PQ+++YFA  LP  W++  LD AL  D+D  Q++
Sbjct: 175 DWFDSASTFCRYFVNDEKAGFVGARAPQRRTYFAFALPHDWFLLALDFALTGDLDRLQYE 234

Query: 553 FFAELV-KEQVGERDSVIIMTHEPNWLLDWYFNNVSG-----KNVKHLICDYLKGR-CKL 605
            F +L+ +  + E  +V+++  EP W      +  +      + ++HL+ D  +GR  +L
Sbjct: 235 AFIDLMDRGALPEGANVVLVYPEPWWTRPLGADTRTAYPRRYQRLEHLLAD--RGRQVRL 292

Query: 606 RIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHV 646
           R+AGD+HHY R     +        ++  G GGAF H TH 
Sbjct: 293 RLAGDLHHYTRERLDSAPPAGRSTDIVTCGSGGAFGHATHT 333


>gi|294911735|ref|XP_002778052.1| hypothetical protein Pmar_PMAR018489 [Perkinsus marinus ATCC 50983]
 gi|239886173|gb|EER09847.1| hypothetical protein Pmar_PMAR018489 [Perkinsus marinus ATCC 50983]
          Length = 458

 Score =  146 bits (368), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 87/227 (38%), Positives = 121/227 (53%), Gaps = 29/227 (12%)

Query: 420 LPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPEL 479
           LPR +V+  GGDLAYP PS   +  RL RP E+AL P              E P+GV + 
Sbjct: 213 LPRANVVFHGGDLAYPVPSHEAFVDRLIRPLEWALPP------------HQEAPTGVSKA 260

Query: 480 KQY-DGPQCYIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLPKGWWVFGL 538
               D PQ + IPGNHDW+DGL  ++ +   +  L GW +PQK +YFA++L  G      
Sbjct: 261 SAIADQPQFFAIPGNHDWYDGLEVYLHWFVGQDHLAGWKVPQKSTYFAVKLSHG------ 314

Query: 539 DLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDY 598
                  I V+    +   ++ +V   D V+++TH PNW  D    + +G  V  L+   
Sbjct: 315 ----PSTIRVF---LWPSGLRGKVDTEDRVVVITHRPNWECDVVERSRTGYLVSVLLDKI 367

Query: 599 LKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTH 645
            + R  +R+AGD HHY R  Y+P+DG   V  L+ +G  GAFLHPTH
Sbjct: 368 GEPRLGMRLAGDTHHYSR--YMPADGSKGVP-LVTSGGAGAFLHPTH 411



 Score = 82.8 bits (203), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 53/137 (38%), Positives = 78/137 (56%), Gaps = 19/137 (13%)

Query: 291 RSTGWAL---THPLSVEEYEKMKKKQLKPE---FLDMVPWYSGTSADLFKTVF--DLLVS 342
           R T W     T P    + E+  +K+  P+      MV WYS     LF T +   + V 
Sbjct: 8   RPTSWVCICRTGPGKYHQAEEEAEKRRGPDPKLRKSMVSWYS-----LFMTTYIPQVTVM 62

Query: 343 VTVFVGRFDMRMMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSYSVAR 402
           + VF+GRFD+R + AA+ ++ EG     + ++  S+KE+ WFDF AD GDG +SSY+V R
Sbjct: 63  LKVFMGRFDVRTLLAALTREPEGT----VTFEDQSDKEETWFDFFADGGDGFDSSYTVGR 118

Query: 403 LLAQPHIRVT--RDDSV 417
           L+AQP++ V   +DD V
Sbjct: 119 LIAQPYLGVDVFKDDHV 135


>gi|192362133|ref|YP_001981109.1| hypothetical protein CJA_0587 [Cellvibrio japonicus Ueda107]
 gi|190688298|gb|ACE85976.1| hypothetical protein CJA_0587 [Cellvibrio japonicus Ueda107]
          Length = 620

 Score =  139 bits (350), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 110/335 (32%), Positives = 160/335 (47%), Gaps = 61/335 (18%)

Query: 380 EDLWFDFMADTGDGGNSSYSVARLLAQPHI-RVTRD----DSVFTLPRGDVLLIGGDLAY 434
           +D W+DF++DTGDGGN++Y+VAR +  P + +  R+    D   TLP G++L++GGDLAY
Sbjct: 65  DDFWWDFVSDTGDGGNAAYAVARQMQIPLLNKKVREGLVGDIPDTLPMGELLVLGGDLAY 124

Query: 435 PNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPS--GVPELKQYDGPQCYIIPG 492
           P  S   Y+ RL   +               A  + E P+     +L+         IP 
Sbjct: 125 PGASVEEYQYRLAEMW--------------TASGQQERPAQEAAAQLRP-----SLAIPQ 165

Query: 493 NHDWFDGLNTFMRFIC---HKSWLGGWFMP----------------QKKSYFALQLPKGW 533
           NHDWFD +++F  +      K    G+ +                 QK+SYFA +LP  W
Sbjct: 166 NHDWFDNISSFNLYFVDRREKEPEQGFSIKSAETQVEVTPLSTRKLQKQSYFAARLPNNW 225

Query: 534 WVFGLDLALHCDIDVYQFKFFAEL------VKEQVGERDSVIIMTHEPNWLLDWYFNNVS 587
            + GLD AL  DID  Q   F  L       + Q+   D++I++  EP W  D   +   
Sbjct: 226 VILGLDFALVGDIDRKQHIAFRNLFNGSPGCEPQITPEDNIILLYPEPYWTRDLGDHARE 285

Query: 588 G-----KNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVN-GCGGAFL 641
           G     + ++  I D   G+ +LRIAGD+HHY R     S G      +L+  G GGAFL
Sbjct: 286 GYPKRYQRLEAFIRDK-NGKIRLRIAGDIHHYARE--FSSAGQSGADDMLITAGGGGAFL 342

Query: 642 HPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALG 676
           HPTH  +N +      +  +    S +  SRI LG
Sbjct: 343 HPTHT-NNTKADKVRCHREEPFAMSDDLKSRIRLG 376


>gi|116669815|ref|YP_830748.1| hypothetical protein Arth_1254 [Arthrobacter sp. FB24]
 gi|116609924|gb|ABK02648.1| hypothetical protein Arth_1254 [Arthrobacter sp. FB24]
          Length = 680

 Score =  135 bits (340), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 96/296 (32%), Positives = 133/296 (44%), Gaps = 58/296 (19%)

Query: 377 SEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPN 436
           S    LW DF AD GDG +++Y+VA LLA+  + V   +    L RG VL++GGD  YP 
Sbjct: 111 SGAATLWLDFTADLGDGFDATYTVASLLAEKSLLVDGHE----LSRGKVLVLGGDEVYPV 166

Query: 437 PSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDW 496
            +   YE R+  P+  AL       +D V                        +PGNHDW
Sbjct: 167 AAPAAYEDRMVGPYRTALPGGRSPGRDGV---------------------LLALPGNHDW 205

Query: 497 FDGLNTFMRFICHKSWLGGWFMPQKKSYFALQL-----PKGWWVFGLDLALHCDIDVYQF 551
           +DGL +F+R    +  +GGW   Q +SYFAL+L       GWW+ GLD  L   ID  Q 
Sbjct: 206 YDGLTSFIRLFTRQRNIGGWRTIQTRSYFALRLTGGDDSPGWWLVGLDSQLGQYIDEPQL 265

Query: 552 KFFAELVKEQVGERDSVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKGR--------- 602
            +F   V  ++   D++I+    P W+ +    N   + V     DYL+ R         
Sbjct: 266 DYFYNTVTTRLRPGDAIILCVAAPYWVRETENANAF-RQVHFFEQDYLRRRFNRRAGLFE 324

Query: 603 -----CKLRIAGDMHHYMRH------------SYVPSDGPVYVQHLLVNGCGGAFL 641
                 +L + GD+HHY R+               P D P   Q L+  G GGAFL
Sbjct: 325 ETGASVRLWLTGDLHHYSRYEEQAPEAAGERTGTRPGDDPRRTQ-LITCGLGGAFL 379


>gi|384221826|ref|YP_005612992.1| hypothetical protein BJ6T_81590 [Bradyrhizobium japonicum USDA 6]
 gi|354960725|dbj|BAL13404.1| hypothetical protein BJ6T_81590 [Bradyrhizobium japonicum USDA 6]
          Length = 321

 Score =  132 bits (332), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 92/300 (30%), Positives = 149/300 (49%), Gaps = 48/300 (16%)

Query: 344 TVFVGRFDMRMMQAAMNK--DQEGAQHGDLLYDHLSEKE--DLWFDFMADTGDGGNSSYS 399
           ++F    D R++ AA++    +E ++  +     L   +   +W D++AD GDG +S+Y+
Sbjct: 35  SIFGQYADRRLVIAALDTVPPEEHSKRAEDFRSRLKTDQHGGVWIDWVADLGDGFDSTYA 94

Query: 400 VARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPW 459
           VA LLA   +++        LPRG  L++GGD  YP  +   Y  +L +P+ +A      
Sbjct: 95  VASLLASKDLKIGE----TLLPRGQALIMGGDEVYPKATREAYANQLRQPYAWA------ 144

Query: 460 YKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTFMRFIC-HKSW-LGGWF 517
                     P+     P+ K  DG     IPGNHDW+DGL  F+   C  K W +G W 
Sbjct: 145 ---------APD-----PDRKNDDGRPLLAIPGNHDWYDGLVLFLALFCKEKPWHVGAWR 190

Query: 518 MPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNW 577
             Q++SYFA++L + WW++  D+ L  D+D  Q  +F ++    + E   +I+ + EP W
Sbjct: 191 SYQRRSYFAVRLTETWWLWATDIQLADDMDGPQADYFKQIAT-AMPENSRIILCSAEPGW 249

Query: 578 LLDWYFNNVSGKNVKHLICDYLKG---------RCKLRIAGDMHHYMRHSYVPSDGPVYV 628
           L    + + + K+ +  I +Y  G            + ++GD HHY R  YV  DG  YV
Sbjct: 250 L----YTDSNRKSWE--IMEYAAGIAINAGRGHTIPVLLSGDTHHYSR--YVGKDGRQYV 301


>gi|292491438|ref|YP_003526877.1| hypothetical protein Nhal_1339 [Nitrosococcus halophilus Nc4]
 gi|291580033|gb|ADE14490.1| conserved hypothetical protein [Nitrosococcus halophilus Nc4]
          Length = 828

 Score =  122 bits (306), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 121/402 (30%), Positives = 174/402 (43%), Gaps = 101/402 (25%)

Query: 341 VSVTVFVG-RFDMR-MMQAAMNKDQEGAQHGDLLYDHLSEKEDLWFDFMADTGDGGNSSY 398
           +++  F G + D R  M+A +N      +HG+       E E  WFD++AD+GDG  ++Y
Sbjct: 37  IAIYAFFGDKLDSRDWMRAEIND-----RHGEA-----PEGEAFWFDYLADSGDGQCATY 86

Query: 399 SVARLL----------AQPHI-RVTR-----DDS--VFTLPRGDVLLIGGDLAYPNPSAF 440
           ++A L           + P + + T+     DD+   F LPRG+ L +GGD AY      
Sbjct: 87  NIAYLCMHDLWLPNENSAPDLNKETKTVSLADDTGNAFKLPRGEFLFVGGDTAYHIADYT 146

Query: 441 TYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGL 500
           T   R  RPF +A +           +  PEV    PE ++      Y IPGNHD++D L
Sbjct: 147 TLAERFQRPFNWAYED----------IFGPEV---TPEARR----PIYGIPGNHDYYDAL 189

Query: 501 NTFMRFICH---------------KSWLGGWFMPQKKSYFALQLPKGWWVFGLDLALHCD 545
           + F R                   +  L G+   Q+ SY AL+LP GWW +GLD A    
Sbjct: 190 DGFNRQFRKPFNLENVDEVGSTQPQLKLKGFERTQEASYVALKLPYGWWFWGLD-AQGGS 248

Query: 546 IDVYQFKFFAELVKEQVGER---------DSVIIMTHEPNWLLDWYFNN----VSGKNVK 592
           ID  Q  FF+ +   Q+ E          D +I+ T EP      +       V      
Sbjct: 249 IDRRQATFFSSICNPQISEAKAESVPRVPDKLIVATPEPAIKFGKWAKEDEKIVETFEKL 308

Query: 593 HLICDYLKGR--------CKLRIAGDMHHYMRH----SYVPSDGPVYVQHLLVNGCGGAF 640
            L   +LK +        C+L I+GD+HHY R+    +   S+        +V G GGAF
Sbjct: 309 GLEPSFLKSKGGRLSPTQCRLDISGDIHHYARYWGQNAADHSENTRSNYASVVAGGGGAF 368

Query: 641 LHPTHVFSNFRKFYGTTYESKAA---YPSFEDSSRIALGNIL 679
           LHP+H          T  E  A    YPS  DS R+    +L
Sbjct: 369 LHPSH----------TDVEEVAENQLYPSRRDSHRLITKRLL 400


>gi|325982638|ref|YP_004295040.1| hypothetical protein NAL212_2042 [Nitrosomonas sp. AL212]
 gi|325532157|gb|ADZ26878.1| hypothetical protein NAL212_2042 [Nitrosomonas sp. AL212]
          Length = 824

 Score =  117 bits (293), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 108/383 (28%), Positives = 156/383 (40%), Gaps = 92/383 (24%)

Query: 379 KEDLWFDFMADTGDGGNSSYSVARL----LAQPHIRVTRDDSVF---------TLPRGDV 425
           +++ WFD++ADTGDG  + Y+VA L    L     ++TR   V           LPRG+ 
Sbjct: 56  EQEYWFDYIADTGDGSRAVYNVAYLCMSGLWLKDEKITRGHPVSLSRTGEFNQRLPRGEF 115

Query: 426 LLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGP 485
           L +GGD AY        + R   PF +A +         +AV   E         + D  
Sbjct: 116 LFVGGDTAYHAADIAALKERFQTPFNWAYE--------DIAVTTGE---------KIDQR 158

Query: 486 QCYIIPGNHDWFDGLNTFMRFICHKS-------------------WLGGWFMPQKKSYFA 526
             Y IP NHD++D L+ F R  CH                      L G+   QK SY  
Sbjct: 159 PIYGIPANHDYYDALDGFNRQFCHPIVQDIHPLVREQEDLTDPPLGLHGFRREQKSSYVW 218

Query: 527 LQLPKGWWVFGLD--------------LALHCDIDVYQFKFFAELVKEQVGER------D 566
           L LP GW ++GLD              +   CD        F E  KE+V E       D
Sbjct: 219 LNLPFGWRLWGLDSQASKMDKRQQAFFVTQFCDKLTRDGSLFDENKKEEVQETLRNAIPD 278

Query: 567 SVIIMTHEPNWLLDWYFNNVSGKNVKHLICDYLKG----------RCKLRIAGDMHHYMR 616
            +I+ T EP+ +      + +      L      G          +C+L I+GD+HHY R
Sbjct: 279 KLIVATPEPSTVFGKRATSHAAMTELFLRLGLEPGFLQDGRLDHSKCRLDISGDVHHYER 338

Query: 617 HSYVPSDGPVYVQHL-LVNGCGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIAL 675
           +    ++   Y  +  +V G GGAFLHP+H  +          + ++ YP+  DS R   
Sbjct: 339 YWGNANENGEYSNYASVVAGGGGAFLHPSHTDAG-------EIKKQSVYPAEMDSHREVT 391

Query: 676 GNILKFRKKNWQFDFIGGIVYFV 698
             IL      WQ  F+GG  + +
Sbjct: 392 QRIL----NPWQI-FLGGYAWLI 409


>gi|300113979|ref|YP_003760554.1| hypothetical protein Nwat_1308 [Nitrosococcus watsonii C-113]
 gi|299539916|gb|ADJ28233.1| conserved hypothetical protein [Nitrosococcus watsonii C-113]
          Length = 830

 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 100/329 (30%), Positives = 141/329 (42%), Gaps = 78/329 (23%)

Query: 378 EKEDLWFDFMADTGDGGNSSYSVARL---------------LAQPHIRVT---RDDSVFT 419
           E+E  WFD++AD+GDG  ++Y++A L               +AQ    V+      +VF 
Sbjct: 64  EEEAFWFDYLADSGDGQCATYNIAYLCQHDLWLPNENPSPDIAQESKTVSLIGDAGNVFK 123

Query: 420 LPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPEL 479
           LPRG+ L +GGD +Y      T   R  +PF +A +           +  PE     PE 
Sbjct: 124 LPRGEFLFVGGDTSYHIADYATLADRFQQPFNWAYED----------IFGPET---RPET 170

Query: 480 KQYDGPQCYIIPGNHDWFDGLNTFMRFICH---------------KSWLGGWFMPQKKSY 524
           ++      Y IPGNHD++D L+ F R                   +  L G+   Q+ SY
Sbjct: 171 RR----PIYGIPGNHDYYDALDGFNRQFLKPFNQEHEQDTEGAGPQLSLKGFERLQEASY 226

Query: 525 FALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGER-----------DSVIIMTH 573
            AL+LP GWW +GLD      ID  Q  FF  L   +V E            D +I+ T 
Sbjct: 227 VALKLPYGWWFWGLDTQA-GKIDRRQAAFFLSLCNPEVSEATAENKAARKAPDKLIVATP 285

Query: 574 EPNWLLDWYFNN----VSGKNVKHLICDYLK--------GRCKLRIAGDMHHYMRH---- 617
           EP      +       V       L   +LK         +C+L I+GD+HHY R+    
Sbjct: 286 EPTTQFGRWAREEEAIVETFKKLELAPSFLKSNAGNLPPSQCRLDISGDIHHYARYWGKG 345

Query: 618 SYVPSDGPVYVQHLLVNGCGGAFLHPTHV 646
           +   SD        +V G GGAFLHP+H 
Sbjct: 346 AVDKSDHTRTNYASVVAGGGGAFLHPSHT 374


>gi|77165278|ref|YP_343803.1| hypothetical protein Noc_1802 [Nitrosococcus oceani ATCC 19707]
 gi|76883592|gb|ABA58273.1| hypothetical protein Noc_1802 [Nitrosococcus oceani ATCC 19707]
          Length = 828

 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 96/329 (29%), Positives = 141/329 (42%), Gaps = 78/329 (23%)

Query: 378 EKEDLWFDFMADTGDGGNSSYSVARL------LAQPHIRVTRDD------------SVFT 419
           E+E  WFD++AD+GDG  ++Y++A L      L   +    RD             + F 
Sbjct: 64  EEEAFWFDYLADSGDGQCATYNIAYLCQHDLWLPNENSPPDRDQESKTVSLIGGGGNTFK 123

Query: 420 LPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPEL 479
           LPRG+ L +GGD AY      T   R  RPF +A +           +  PE     PE 
Sbjct: 124 LPRGEFLFVGGDTAYHIADYTTLAERFQRPFNWAYED----------IFGPET---RPET 170

Query: 480 KQYDGPQCYIIPGNHDWFDGLNTFMRFICH---------------KSWLGGWFMPQKKSY 524
           ++      Y IPGNHD++D L+ F R                   +  L G+   Q+ SY
Sbjct: 171 RR----PIYGIPGNHDYYDALDGFNRQFLKPFNQEHAQATEGAGPQLSLKGFERIQEASY 226

Query: 525 FALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGER-----------DSVIIMTH 573
            AL+LP  WW +GLD     +ID  Q  FF  L    V E            D +I+ T 
Sbjct: 227 VALKLPYDWWFWGLDTQA-GEIDRRQAAFFLSLCNPGVSEATAESKTARKAPDKLIVATP 285

Query: 574 EPNWLLDWYFN------------NVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRH-SYV 620
           EP      +              +++   +K    +  + +C+L I+GD+HHY R+    
Sbjct: 286 EPTTKFGQWAREEEAIVGTFKKLDLAPSFLKSNAGNLPRSQCRLDISGDIHHYARYWGKG 345

Query: 621 PSDGPVYVQ---HLLVNGCGGAFLHPTHV 646
            +D   + +     +V G GGAFLHP+H 
Sbjct: 346 AADKSAHTRANYASVVAGGGGAFLHPSHT 374


>gi|254434331|ref|ZP_05047839.1| hypothetical protein NOC27_1262 [Nitrosococcus oceani AFC27]
 gi|207090664|gb|EDZ67935.1| hypothetical protein NOC27_1262 [Nitrosococcus oceani AFC27]
          Length = 809

 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 96/329 (29%), Positives = 141/329 (42%), Gaps = 78/329 (23%)

Query: 378 EKEDLWFDFMADTGDGGNSSYSVARL------LAQPHIRVTRDD------------SVFT 419
           E+E  WFD++AD+GDG  ++Y++A L      L   +    RD             + F 
Sbjct: 45  EEEAFWFDYLADSGDGQCATYNIAYLCQHDLWLPNENSPPDRDQESKTVSLIGGGGNTFK 104

Query: 420 LPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPEL 479
           LPRG+ L +GGD AY      T   R  RPF +A +           +  PE     PE 
Sbjct: 105 LPRGEFLFVGGDTAYHIADYTTLAERFQRPFNWAYED----------IFGPET---RPET 151

Query: 480 KQYDGPQCYIIPGNHDWFDGLNTFMRFICH---------------KSWLGGWFMPQKKSY 524
           ++      Y IPGNHD++D L+ F R                   +  L G+   Q+ SY
Sbjct: 152 RR----PIYGIPGNHDYYDALDGFNRQFLKPFNQEHAQATEGAGPQLSLKGFERIQEASY 207

Query: 525 FALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGER-----------DSVIIMTH 573
            AL+LP  WW +GLD     +ID  Q  FF  L    V E            D +I+ T 
Sbjct: 208 VALKLPYDWWFWGLDTQA-GEIDRRQAAFFLSLCNPGVSEATAESKTARKAPDKLIVATP 266

Query: 574 EPNWLLDWYFN------------NVSGKNVKHLICDYLKGRCKLRIAGDMHHYMRH-SYV 620
           EP      +              +++   +K    +  + +C+L I+GD+HHY R+    
Sbjct: 267 EPTTKFGQWAREEEAIVGTFKKLDLAPSFLKSNAGNLPRSQCRLDISGDIHHYARYWGKG 326

Query: 621 PSDGPVYVQ---HLLVNGCGGAFLHPTHV 646
            +D   + +     +V G GGAFLHP+H 
Sbjct: 327 AADKSAHTRANYASVVAGGGGAFLHPSHT 355


>gi|307102217|gb|EFN50573.1| hypothetical protein CHLNCDRAFT_136265 [Chlorella variabilis]
          Length = 99

 Score =  108 bits (269), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 44/91 (48%), Positives = 69/91 (75%)

Query: 518 MPQKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNW 577
           MPQ+KSYFA++LP GWW+FGLDLAL  DID+ Q+ +FA + +E++G  D V+++ H P+W
Sbjct: 1   MPQEKSYFAIRLPHGWWLFGLDLALEDDIDMCQYSYFARIAEERLGPGDQVVLVQHCPSW 60

Query: 578 LLDWYFNNVSGKNVKHLICDYLKGRCKLRIA 608
           L+DW++    G N++ L+   L+GR +L++A
Sbjct: 61  LVDWFWGRCQGSNLRQLVRGPLRGRARLQLA 91


>gi|340052117|emb|CCC46388.1| conserved hypothetical protein, fragment [Trypanosoma vivax Y486]
          Length = 909

 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 86/294 (29%), Positives = 141/294 (47%), Gaps = 34/294 (11%)

Query: 520 QKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLL 579
           Q+ S+F LQLP  W++   D     DID  Q  +F E +++ + E   VI++ HEP W+ 
Sbjct: 379 QRSSFFILQLPYNWFMLCADTGSTTDIDTTQRNYFFEFIEKNLNEASCVILVCHEPAWVY 438

Query: 580 DWYFNNVSGKNVK---HLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNG- 635
           +   N  S + ++   + + + L  R +LR+ GD+HHY RH+  P+D       L+V+G 
Sbjct: 439 E-AMNKKSTRPMQPQLNQVIEVLGTRLRLRLCGDVHHYSRHT--PADALSEAPTLIVSGG 495

Query: 636 -----CGGAFLHPTHVFSNFRKFYGTTYESKAAYPSFEDSSRIALGNILKFRKKNWQFDF 690
                C G     T V S      GT Y   AA+P+  D + I L  ++ FR  NW+FD 
Sbjct: 496 KEVPFCMG---RSTPVISQ-----GTEYIRSAAFPAHNDVTSI-LARLVGFRLINWKFDI 546

Query: 691 IGGIVYFVLVFSMFPQCELNHILREDSFSGHLRSFFGT-VWNAFMYVLEHSYVSFAGALL 749
           I G++ F L+ S  P    +  L + +    L ++ G      F++ +  +  SF   +L
Sbjct: 547 IAGVMCFGLIISALPLSMEDSRLHQINDVFQLFTYIGIRTVELFIFTINEAITSF---IL 603

Query: 750 LLIVAITFVPSKLSRKKRAMIGV--LHVSAHLAAALILMLLLELGVETCIQHKL 801
            + V   F  +   +K  +  GV  LH +       IL++L+   V + +Q  L
Sbjct: 604 SICVFFIFFLAGGEKKSASFRGVYALHWT-------ILVVLVSTSVLSFVQATL 650


>gi|147792849|emb|CAN68801.1| hypothetical protein VITISV_008808 [Vitis vinifera]
          Length = 289

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 63/78 (80%), Positives = 73/78 (93%)

Query: 729 VWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLSRKKRAMIGVLHVSAHLAAALILMLL 788
           +W+AFMY+LEHSYVS AGA+LLL+ AI FVP KLSRKKR +IG+LHVSAHLAAAL+LMLL
Sbjct: 1   MWDAFMYMLEHSYVSLAGAMLLLMAAIIFVPPKLSRKKRVIIGILHVSAHLAAALVLMLL 60

Query: 789 LELGVETCIQHKLLATSG 806
           LELGVETCI+H+LLATSG
Sbjct: 61  LELGVETCIRHRLLATSG 78


>gi|226532072|ref|NP_001143466.1| uncharacterized protein LOC100276134 [Zea mays]
 gi|195621032|gb|ACG32346.1| hypothetical protein [Zea mays]
          Length = 130

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 59/103 (57%), Positives = 74/103 (71%)

Query: 704 FPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVPSKLS 763
            PQC L HIL E+++SG L+SF GT+W A   +   SYVS  G+L LL+ + +F PSKLS
Sbjct: 16  LPQCNLVHILNEETWSGRLKSFSGTIWCALPQIFWQSYVSSVGSLTLLMASYSFKPSKLS 75

Query: 764 RKKRAMIGVLHVSAHLAAALILMLLLELGVETCIQHKLLATSG 806
           R +RA+IGVLHV AH  A L+LMLLLELG E CI+  LL +SG
Sbjct: 76  RMRRAIIGVLHVLAHFTATLLLMLLLELGTEICIRDHLLTSSG 118


>gi|159474884|ref|XP_001695553.1| hypothetical protein CHLREDRAFT_149761 [Chlamydomonas reinhardtii]
 gi|158275564|gb|EDP01340.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 225

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 101/257 (39%), Gaps = 92/257 (35%)

Query: 77  FFYFFSSPFIGKTITPS---YSNFSRWYIAWILVAAVYHLPSFQSMGVDLRMNLSLFLTI 133
           F Y ++ PFI   +  +   + NFS  YI W+  A  YHLPS  S+G+D+R ++S  + +
Sbjct: 14  FAYMYARPFIRVGLGSAKRGFINFSTLYIVWLCSAVFYHLPSLASLGLDVRADVSFLIVV 73

Query: 134 FLASVLFLLVFHIIFLGLWYVGLVSRVAGKRPEILTIIQNCVVISVFCCVFYSHCGNRAV 193
           FLAS+ F                     G R   + ++ N  +I+V C  +Y+ CGN   
Sbjct: 74  FLASLAFF-------------------RGLRELWVVLLLNAAIIAVACSTYYTFCGN--- 111

Query: 194 LRHRPLERRNSSWFSLWKKEERNTWLAKFLRMNELKDQVCSSWFAPVGSASDYPLLSKWV 253
                                   ++    R   LK  VCS W  P+ + S+YP  S W+
Sbjct: 112 ----------------------GRFVVSPGRTPPLKAAVCSKWLHPILT-SEYPRFSSWM 148

Query: 254 IYGE---LGNDNGGSS-----------------------------------------DEI 269
           +YGE   LG D G ++                                         D +
Sbjct: 149 LYGEGSGLGLDMGNATAAGGGSSRNGSSSSSSSSGSGSFISVDFPLDGEGVKDVPAGDVL 208

Query: 270 SPIYSLWATFIGLYIAN 286
           SP++S+W T + +Y+  
Sbjct: 209 SPVFSMWVTLVAVYVGE 225


>gi|398014385|ref|XP_003860383.1| hypothetical protein, conserved, partial [Leishmania donovani]
 gi|322498604|emb|CBZ33676.1| hypothetical protein, conserved, partial [Leishmania donovani]
          Length = 657

 Score = 80.1 bits (196), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 55/178 (30%), Positives = 77/178 (43%), Gaps = 59/178 (33%)

Query: 380 EDLWFDFMADTGDGGNSSYSVARLLAQPHIRV----------------TRDDSV------ 417
            D+WFD++AD GDG N +Y++ARLLA+P +++                T DDS       
Sbjct: 480 RDIWFDWIADVGDGFNPTYAMARLLARPSLKIRWHRPPSKRVGLSFLPTFDDSTPTNTPT 539

Query: 418 -----FTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKK---------- 462
                F LPRG  +L+GGDLAYP+P+  TY  RLF P+  A+      +           
Sbjct: 540 VDREPFVLPRGSFVLVGGDLAYPSPNDETYTTRLFEPYHDAMSSNVRLQSVFHAEQRRVV 599

Query: 463 --------------------DHVAVNKPEVPSG--VPELKQYDGPQCYIIPGNHDWFD 498
                                 +A  +  + +G    E      P  + IPGNHDWFD
Sbjct: 600 VADASDADVAHIHLLDAETVSRMATGRAALRTGRATAEEALRSVPLLFAIPGNHDWFD 657


>gi|269126043|ref|YP_003299413.1| metallophosphoesterase [Thermomonospora curvata DSM 43183]
 gi|268311001|gb|ACY97375.1| metallophosphoesterase [Thermomonospora curvata DSM 43183]
          Length = 517

 Score = 77.4 bits (189), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 83/311 (26%), Positives = 116/311 (37%), Gaps = 91/311 (29%)

Query: 380 EDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSA 439
           E+  F  + DTG+G  S Y+V      P + V  D         D ++I  D+ YP   A
Sbjct: 104 EEFSFLLLGDTGEGDRSQYAVV----PPMLAVGADT--------DFMVICSDVIYPGGEA 151

Query: 440 FTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDG 499
             YE + FRP  YA  P P                             Y IPGNHDW+DG
Sbjct: 152 ADYEAKFFRP--YADYPAP----------------------------IYAIPGNHDWYDG 181

Query: 500 LNTFMRFIC------HKSWLGG--------WFMPQK----------------KSYFALQL 529
           L  FMR  C         W G         W  P                  +S  ALQ 
Sbjct: 182 LRGFMRVFCGLEGAHEPRWRGPLGPLARLLWRDPAPIDDGELAVARERWRGTRSQRALQ- 240

Query: 530 PKGWW--------VFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDW 581
           P  +W        + G+D  +   ID  Q  +  E+     G +   +++T +P  ++D 
Sbjct: 241 PGPYWAIDAPSLRIIGIDTGITGSIDRDQAAWLREV---SAGPKPK-LLLTGKP-LIVDD 295

Query: 582 YFNNVSGKNVKHLICDYLKG---RCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGG 638
                  +  +  + D +     R    I GD+H+Y R+S    DG       LV+G GG
Sbjct: 296 RIEPGPIEGEQATVADIVTDPAHRYVAVIGGDIHNYQRYSRTLEDG--RTIEYLVSGGGG 353

Query: 639 AFLHPTHVFSN 649
           AF+H TH    
Sbjct: 354 AFMHATHTIPK 364


>gi|271967936|ref|YP_003342132.1| hypothetical protein [Streptosporangium roseum DSM 43021]
 gi|270511111|gb|ACZ89389.1| hypothetical protein Sros_6679 [Streptosporangium roseum DSM 43021]
          Length = 502

 Score = 77.0 bits (188), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 76/307 (24%), Positives = 114/307 (37%), Gaps = 92/307 (29%)

Query: 384 FDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYE 443
           F  + DTG+G  S Y+V      P +     D+ F +       I  D+ YP  S   YE
Sbjct: 99  FLVLGDTGEGDASQYAVI-----PGLLKLGADTSFAV-------IASDVVYPTGSGNEYE 146

Query: 444 RRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLNTF 503
            + FRP+                             K Y  P  Y IPGNHDW+DGL  F
Sbjct: 147 DKFFRPY-----------------------------KDYRAP-IYAIPGNHDWYDGLGGF 176

Query: 504 MRFICHKSWLGG----------------WFMPQK-----------------------KSY 524
           MR  C    L                  W  P+K                         Y
Sbjct: 177 MRVFCDAPALKAERQGFRLTPSGLRGLLWRKPEKIDEARLARAREHRPLPVQRAVQPAPY 236

Query: 525 FALQLPKGWWVFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDWYFN 584
           +A++ P G  + G+D  +H  +D  Q ++  E+ ++    R  V+I T +P +  + Y  
Sbjct: 237 WAMETP-GLLIVGVDTGIHSTLDREQEQWLREVSRD---PRPKVLI-TGKPVYTRNEYKP 291

Query: 585 NV--SGKNVKHLICDYLKGRCKLRIAGDMHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLH 642
           +    G  +  ++ D         I GD+H+Y R       G   +Q+++  G  GAF+H
Sbjct: 292 SKLEGGGTIDDIVADPAHNYVAA-IGGDVHNYQRFPI--KAGGRTIQYIVAGGS-GAFMH 347

Query: 643 PTHVFSN 649
            TH    
Sbjct: 348 ATHTIPR 354


>gi|440697028|ref|ZP_20879472.1| hypothetical protein STRTUCAR8_09781 [Streptomyces turgidiscabies
           Car8]
 gi|440280719|gb|ELP68415.1| hypothetical protein STRTUCAR8_09781 [Streptomyces turgidiscabies
           Car8]
          Length = 522

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 78/316 (24%), Positives = 113/316 (35%), Gaps = 99/316 (31%)

Query: 384 FDFM--ADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFT 441
           F FM   DTG+G    Y+V      P       D+ FT+       +  D+ YP  +   
Sbjct: 92  FSFMVIGDTGEGDEPQYAVV-----PGFLRAGQDTAFTV-------LASDVIYPVGATDD 139

Query: 442 YERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYIIPGNHDWFDGLN 501
           Y  + FRP++                              Y  P  Y IPGNHDW++ LN
Sbjct: 140 YGTKFFRPYQ-----------------------------DYPAP-IYAIPGNHDWYEDLN 169

Query: 502 TFMRFICH------------------KSWLGGWFMPQK------------KSYFALQL-- 529
            FMR  C                   +S L  W  P              +S  A Q   
Sbjct: 170 GFMRVFCDAPPLPAEPAPHAFTPGRLRSLL--WHRPSAVDEQRLAEAAKLRSAPAQQAVQ 227

Query: 530 PKGWW--------VFGLDLALHCDIDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDW 581
           P  +W        V G+D  L   +D  Q ++  E+    V +    I++T  P ++   
Sbjct: 228 PGPYWAIDAGPIRVIGIDTGLLGTLDAEQGRWLREVSAGPVPK----ILVTGSPLYVDGE 283

Query: 582 YFNNV--SGKNVKHLICDYLKGRCKLRIAGDMHHYMRH------SYVPSDGPVYVQHLLV 633
           +       G  V  L+ D  +      I GD+H+Y R+      S   + GP      +V
Sbjct: 284 HHPCPIDGGGTVDDLVRDPERNFVAA-IGGDIHNYQRYPVTLPGSGDGTAGPARTVQYIV 342

Query: 634 NGCGGAFLHPTHVFSN 649
           +G GGAF+H TH    
Sbjct: 343 SGGGGAFMHATHTIKR 358


>gi|395774786|ref|ZP_10455301.1| hypothetical protein Saci8_33661 [Streptomyces acidiscabies 84-104]
          Length = 477

 Score = 64.3 bits (155), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 80/332 (24%), Positives = 122/332 (36%), Gaps = 96/332 (28%)

Query: 363 QEGAQHGDLLYDHLSEKEDL-WFDFM--ADTGDGGNSSYSVARLLAQPHIRVTRDDSVFT 419
           Q+ AQ  +   D +  ++D   F FM   DTG+G +  Y+V      P      + + F 
Sbjct: 42  QQKAQQ-EAPADKVIRRDDPDRFSFMVIGDTGEGDDPQYAVV-----PGFLKMSEGTRFA 95

Query: 420 LPRGDVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPEL 479
           +       +  D+ YP  SA  Y  + FRP  YA  P P                     
Sbjct: 96  V-------VASDVIYPVGSAADYGTKFFRP--YASYPAP--------------------- 125

Query: 480 KQYDGPQCYIIPGNHDWFDGLNTFMRFICH---------------KSWLGGWFMPQKKSY 524
                   Y +PGNHDW++GL  FMR  C                K+WL      + +  
Sbjct: 126 -------IYAVPGNHDWYEGLGAFMRVFCGDAPPLPAEPKPRAPTKAWLRSLLWHEARPG 178

Query: 525 FALQL----------------PKGWW--------VFGLDLALHCDIDVYQFKFFAELVKE 560
              +L                P  +W        + G+D  L   +D  Q  +  E+ + 
Sbjct: 179 DGQRLDKVRELRSAPGQQAVQPGPYWAIDAGPVRIIGIDTGLLGTLDAEQGAWLREVSR- 237

Query: 561 QVGERDSVIIMTHEPNWLLDWYFNNVS---GKNVKHLICDYLKGRCKLRIAGDMHHYMRH 617
             G R   I++T  P + +D   +  +   G  V  ++     G     I GD+H+Y R+
Sbjct: 238 --GPRPK-ILVTGSPLY-VDGRSDPCAIEGGGTVDEIVRAPEHGYVAA-IGGDIHNYQRY 292

Query: 618 SYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSN 649
               +DG       +V+G GGAF H TH    
Sbjct: 293 PVRCADG--RTLQYVVSGGGGAFTHATHTIPR 322


>gi|398014387|ref|XP_003860384.1| hypothetical protein, conserved, partial [Leishmania donovani]
 gi|322498605|emb|CBZ33677.1| hypothetical protein, conserved, partial [Leishmania donovani]
          Length = 766

 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/59 (45%), Positives = 39/59 (66%), Gaps = 5/59 (8%)

Query: 392 DGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVLLIGGDLAYPNPSAFTYERRLFRPF 450
           DG ++ ++  R L    ++V +D  V TLPRG  +L+GGDLAYP+P+  TY  RLF P+
Sbjct: 713 DGQSAKHTPFRSL----LKVGKDGFV-TLPRGSFVLVGGDLAYPSPNDETYTTRLFEPY 766



 Score = 47.0 bits (110), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 19/31 (61%), Positives = 27/31 (87%)

Query: 381 DLWFDFMADTGDGGNSSYSVARLLAQPHIRV 411
           D+WFD++AD GDG N +Y++ARLLAQP +R+
Sbjct: 536 DVWFDWIADVGDGFNPTYAMARLLAQPILRL 566


>gi|336119262|ref|YP_004574039.1| hypothetical protein MLP_36220 [Microlunatus phosphovorus NM-1]
 gi|334687051|dbj|BAK36636.1| hypothetical protein MLP_36220 [Microlunatus phosphovorus NM-1]
          Length = 994

 Score = 48.9 bits (115), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 67/309 (21%), Positives = 115/309 (37%), Gaps = 85/309 (27%)

Query: 424 DVLLIGGDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYD 483
           D ++I  D+ YP      Y   L+RP+                   PE P    ++K   
Sbjct: 141 DFVVIMSDVIYPAGDVDDYVDGLYRPYR-----------------TPE-PDDNSKVKFLV 182

Query: 484 GPQCYIIPGNHDWFDGLNTFM-RFI------------CHKSWLGG------WFMP----- 519
            P    +PGNHDW+DGL  FM  F+              ++W+        W  P     
Sbjct: 183 KPPIIALPGNHDWYDGLAGFMYHFVDQQPLPSAAYAPTRETWIWSRLIRVLWRRPRPARA 242

Query: 520 ----------------------QKKSYFALQLPKGWWVFGLDLALHCDIDVYQFKFFAEL 557
                                 Q   YFA+++P    V  +D  +  +ID  Q+ +  E 
Sbjct: 243 EARHRHESHPITQSFDKTYAINQPGPYFAIRMPHLLLVC-IDTGIGGNIDEQQWDWL-ER 300

Query: 558 VKEQVGERDSVIIMTHEP----NWLLDWYFNNVSGKNVKHLICDYLKGRCKLRIA---GD 610
           +    G +   +++T +P      ++  Y  +         + + +  +    IA   GD
Sbjct: 301 ISRLTGPK---VLLTGKPLVVNGKIVPCYIGHRRSARKADSVWNLVIDKDYEYIATLGGD 357

Query: 611 MHHYMRHSYVPSDGPVYVQHLLVNGCGGAFLHPTHVFSNFRK-------FYGTTYESKAA 663
            H+Y ++    + G  + Q+ LV+G GGAF H TH   +  +             + ++ 
Sbjct: 358 THNYQKYERKQTQG--FPQYHLVSGGGGAFTHATHPNVSLDRDARFQDDLQPDPAKPRSV 415

Query: 664 YPSFEDSSR 672
           +PS+E S R
Sbjct: 416 FPSYETSLR 424


>gi|294942178|ref|XP_002783415.1| hypothetical protein Pmar_PMAR006941 [Perkinsus marinus ATCC 50983]
 gi|239895870|gb|EER15211.1| hypothetical protein Pmar_PMAR006941 [Perkinsus marinus ATCC 50983]
          Length = 412

 Score = 45.8 bits (107), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 47/95 (49%)

Query: 700 VFSMFPQCELNHILREDSFSGHLRSFFGTVWNAFMYVLEHSYVSFAGALLLLIVAITFVP 759
           V+ + P   L  +LR+ +    L  F G +  A+  +   SYVS  G ++ + + I    
Sbjct: 10  VYLLSPYVMLGRVLRQPTAFLSLYEFIGLMVEAYDKIFRQSYVSLIGQIMYITMCIGCAE 69

Query: 760 SKLSRKKRAMIGVLHVSAHLAAALILMLLLELGVE 794
            ++   KR ++G++H   H  AA+  + L+EL  E
Sbjct: 70  EQMGEAKRFIVGLIHGLCHSLAAVSAVCLVELLCE 104


>gi|37522079|ref|NP_925456.1| hypothetical protein glr2510 [Gloeobacter violaceus PCC 7421]
 gi|35213078|dbj|BAC90451.1| glr2510 [Gloeobacter violaceus PCC 7421]
          Length = 531

 Score = 43.5 bits (101), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 36/129 (27%), Positives = 56/129 (43%), Gaps = 23/129 (17%)

Query: 371 LLYDHLSEKEDLWFDFMADTGDGGNSSYSVARLLAQPHIRVTRDDSVFTLPRGDVL-LIG 429
           +L D   E+ +  F  + D+G G +  ++  R +A+  +   R+D  F L  GDV+ L+G
Sbjct: 38  VLDDGQPEEPEFSFLVVGDSGSGPHRGHNPQRQIAE-QMLTQREDCRFVLHTGDVMYLVG 96

Query: 430 GDLAYPNPSAFTYERRLFRPFEYALQPPPWYKKDHVAVNKPEVPSGVPELKQYDGPQCYI 489
            D  YP      Y     R F    + P     D +  N+P +P                
Sbjct: 97  SDEYYPKNFIQPY-----REFLVGGERPEQIAYDRMVFNQPFLP---------------- 135

Query: 490 IPGNHDWFD 498
           IPGNHD++D
Sbjct: 136 IPGNHDYYD 144


>gi|167615070|ref|ZP_02383705.1| Ser/Thr protein phosphatase family protein family [Burkholderia
           thailandensis Bt4]
          Length = 540

 Score = 39.7 bits (91), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 5/104 (4%)

Query: 488 YIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLP-KGWWVFGLDLALHCD- 545
           + +PGNH++F G  +F+  +     + G    Q+ SYF L+    GW   GLD   H   
Sbjct: 230 FTVPGNHEYFTGAVSFLHALDSGELVDGPAQRQQASYFCLRTADDGWQFLGLDTGYHGHY 289

Query: 546 IDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDW--YFNNVS 587
           ++V      A L +  +G+ ++       P+W  D   YF + S
Sbjct: 290 MNVAASAQQATLERLHIGKVETAGEGA-SPHWPTDRNPYFRHAS 332


>gi|83718137|ref|YP_438585.1| Ser/Thr protein phosphatase [Burkholderia thailandensis E264]
 gi|257141644|ref|ZP_05589906.1| Ser/Thr protein phosphatase family protein family [Burkholderia
           thailandensis E264]
 gi|83651962|gb|ABC36026.1| Ser/Thr protein phosphatase family protein family [Burkholderia
           thailandensis E264]
          Length = 540

 Score = 39.7 bits (91), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 5/104 (4%)

Query: 488 YIIPGNHDWFDGLNTFMRFICHKSWLGGWFMPQKKSYFALQLP-KGWWVFGLDLALHCD- 545
           + +PGNH++F G  +F+  +     + G    Q+ SYF L+    GW   GLD   H   
Sbjct: 230 FTVPGNHEYFTGAVSFLHALDSGELVDGPAQRQQASYFCLRTADDGWQFLGLDTGYHGHY 289

Query: 546 IDVYQFKFFAELVKEQVGERDSVIIMTHEPNWLLDW--YFNNVS 587
           ++V      A L +  +G+ ++       P+W  D   YF + S
Sbjct: 290 MNVAASAQQATLERLHIGKVETAGEGA-SPHWPTDRNPYFRHAS 332


>gi|366997536|ref|XP_003678530.1| hypothetical protein NCAS_0J02140 [Naumovozyma castellii CBS 4309]
 gi|342304402|emb|CCC72193.1| hypothetical protein NCAS_0J02140 [Naumovozyma castellii CBS 4309]
          Length = 1350

 Score = 39.3 bits (90), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 33/150 (22%), Positives = 63/150 (42%), Gaps = 22/150 (14%)

Query: 218  WLAKFLRMNELKDQVCSSWFAPVGSASDYPL--LSKWVIYGELGNDNGGSSDEISPIYSL 275
            WL  + +MN + D+VC  W + +  A++Y L  ++ WV    +   NG S++  +P+++ 
Sbjct: 1111 WLKDWNKMNTILDEVCYDWDS-LQEATEYSLSMINSWVSVLCVTRVNGSSTNFRTPVFTT 1169

Query: 276  WATFIGLYIANYVVERSTGWALTHPLSVEEYEKMKKKQLKPEFLDMVPWYSGTS------ 329
               F+ + +    ++R   WA        +Y+ M +        D + W    S      
Sbjct: 1170 MCVFVTILVIAEYLKRIELWA-------SQYDPMDQMPTTLRISDRILWLKTESVLKRVQ 1222

Query: 330  ------ADLFKTVFDLLVSVTVFVGRFDMR 353
                   DL K+  + L       G FD++
Sbjct: 1223 KKLLPQGDLMKSYVEFLRLQNDTNGSFDIK 1252


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.326    0.141    0.458 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,100,141,906
Number of Sequences: 23463169
Number of extensions: 636762941
Number of successful extensions: 1680685
Number of sequences better than 100.0: 149
Number of HSP's better than 100.0 without gapping: 124
Number of HSP's successfully gapped in prelim test: 25
Number of HSP's that attempted gapping in prelim test: 1679898
Number of HSP's gapped (non-prelim): 260
length of query: 831
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 680
effective length of database: 8,816,256,848
effective search space: 5995054656640
effective search space used: 5995054656640
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 81 (35.8 bits)