BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 007238
         (611 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255582589|ref|XP_002532077.1| conserved hypothetical protein [Ricinus communis]
 gi|223528259|gb|EEF30311.1| conserved hypothetical protein [Ricinus communis]
          Length = 814

 Score = 1044 bits (2699), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 496/612 (81%), Positives = 551/612 (90%), Gaps = 10/612 (1%)

Query: 2   LVQDRTLPKSPKSQIRT-------SSHRFSDSKSLDFSTWVRDNLFKIVTVLLLIATIAA 54
           +VQ+R  PKSPKS   T       +++RFS SKSLDFSTW  +NL+KI+    LIAT+AA
Sbjct: 50  VVQERATPKSPKSPRTTLPTVNHHNNYRFSPSKSLDFSTWFTENLYKIIICFFLIATVAA 109

Query: 55  LSFLRNFTDTASLI--QSKSQEHSPNAIPLPVINWNSIQPIADKSSVYSRFRSEKWIVVS 112
           + F RN  DTA+ +  QSKSQ      +P P INWN I+PI D +S +  FR+E+WIV S
Sbjct: 110 VFFFRNTGDTAAFLYLQSKSQPIE-KTLPFPHINWNQIKPITDSASPFVNFRTERWIVAS 168

Query: 113 VDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDS 172
           V  YP+DSLKKLVKIKGWQ+LAIGNS+TPK W LKG I+LSL+ QA+LGFRV+DF+P+DS
Sbjct: 169 VSDYPSDSLKKLVKIKGWQLLAIGNSKTPKGWALKGCIYLSLEQQASLGFRVVDFVPFDS 228

Query: 173 YVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHENPN 232
           YVRKS GYLFAIQHGAKKIFDADDRG+VIGDDLGKHFDVELVGEGARQETILQYSHEN N
Sbjct: 229 YVRKSVGYLFAIQHGAKKIFDADDRGEVIGDDLGKHFDVELVGEGARQETILQYSHENEN 288

Query: 233 RTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFY 292
           RT+VNPY+HFGQRSVWPRGLPLENVGEI HEEFYT+VFGGKQFIQQGISNGLPDVDSVFY
Sbjct: 289 RTVVNPYIHFGQRSVWPRGLPLENVGEIGHEEFYTQVFGGKQFIQQGISNGLPDVDSVFY 348

Query: 293 FTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVL 352
           FTRK  LE+FDIRFD+  PKVALPQG+MVP+NSFNTIYQSSAFW LMLPVSVSTMASDVL
Sbjct: 349 FTRKSGLESFDIRFDEHAPKVALPQGIMVPLNSFNTIYQSSAFWGLMLPVSVSTMASDVL 408

Query: 353 RGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHR 412
           RG+WGQRLLWEIGGYVVVYPPTVHRYD+IEAYPFSEEKDLHVNVGRLIKFL++WRS KHR
Sbjct: 409 RGYWGQRLLWEIGGYVVVYPPTVHRYDRIEAYPFSEEKDLHVNVGRLIKFLIAWRSTKHR 468

Query: 413 FFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDR 472
            FEK+LELS++MAEEGFWTE+DVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDR
Sbjct: 469 LFEKILELSYAMAEEGFWTEQDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDR 528

Query: 473 KEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYG 532
           +EF+PRKLPSVHLGVEE GTV+YEIGNLIRWRKNFGN+VLIMFC+GPVERTALEWRLLYG
Sbjct: 529 REFIPRKLPSVHLGVEEIGTVNYEIGNLIRWRKNFGNIVLIMFCTGPVERTALEWRLLYG 588

Query: 533 RIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQ 592
           RIFKTV+ILS+QKNEDLAVE G LEQ+YRHLPKIF R+TSAEGFLFL+DDT+LNYWNLLQ
Sbjct: 589 RIFKTVVILSQQKNEDLAVEEGNLEQLYRHLPKIFDRFTSAEGFLFLKDDTVLNYWNLLQ 648

Query: 593 ADKNKLWITDKV 604
           ADK+KLWITDKV
Sbjct: 649 ADKSKLWITDKV 660


>gi|225441834|ref|XP_002284060.1| PREDICTED: uncharacterized protein LOC100264133 [Vitis vinifera]
          Length = 762

 Score = 1040 bits (2689), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 497/608 (81%), Positives = 552/608 (90%), Gaps = 5/608 (0%)

Query: 1   MLVQDRTLPKSPKSQIRT--SSH--RFSDSKSLDFSTWVRDNLFKIVTVLLLIATIAALS 56
           MLVQDR+ PKSPK+ IR   S H  RF++ K+LDFSTW  +NL+KIVT+ LLIAT+AAL 
Sbjct: 1   MLVQDRSTPKSPKTHIRALHSLHPDRFTEPKNLDFSTWFSENLYKIVTISLLIATVAALF 60

Query: 57  FLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQPIADKSSVYSRFRSEKWIVVSVDRY 116
           FLRN  DTA+L+  ++Q  S   I  P INWNS+  ++DKS  Y+ FRSE+WI+VSV  Y
Sbjct: 61  FLRNVADTAALVSYETQAKSLEKIEFPQINWNSVALVSDKSP-YANFRSERWILVSVSNY 119

Query: 117 PTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRK 176
           PTDSL+KLVKIKGWQVLAIGNS+TP +W+LKGAIFLSL+ QANLGFRV+D LPYDS+VRK
Sbjct: 120 PTDSLRKLVKIKGWQVLAIGNSKTPSDWSLKGAIFLSLEQQANLGFRVVDHLPYDSFVRK 179

Query: 177 SCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIV 236
           + GYLFAIQHGAKKIFDADDRGDVI +DLGKHFDVEL+GEGARQ+ ILQYSHENPNRTIV
Sbjct: 180 NVGYLFAIQHGAKKIFDADDRGDVIDNDLGKHFDVELIGEGARQDIILQYSHENPNRTIV 239

Query: 237 NPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRK 296
           NPY+HFGQRSVWPRGLPLENVGEI HEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRK
Sbjct: 240 NPYIHFGQRSVWPRGLPLENVGEIGHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRK 299

Query: 297 PSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFW 356
           P LEAFDIRFD+  PKVALPQG MVPVNSFNT+Y SSAFWALMLPVSVSTMASDVLRG+W
Sbjct: 300 PGLEAFDIRFDEHAPKVALPQGTMVPVNSFNTLYHSSAFWALMLPVSVSTMASDVLRGYW 359

Query: 357 GQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEK 416
           GQRLLWEIGGYVVVYPPTVHRYD+IE+YPFSEEKDLHVNVGRL+KFLVSWRS+KHR FEK
Sbjct: 360 GQRLLWEIGGYVVVYPPTVHRYDRIESYPFSEEKDLHVNVGRLLKFLVSWRSSKHRLFEK 419

Query: 417 VLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFV 476
           +LELS+ MAEEGFWTE+DVKFTAAWLQDL+AVGYQQPRLMSLELDRPRASIGHGDRKEF+
Sbjct: 420 ILELSYVMAEEGFWTEKDVKFTAAWLQDLLAVGYQQPRLMSLELDRPRASIGHGDRKEFI 479

Query: 477 PRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFK 536
           P+KLPSVHLGVEETG V+ EIG+LIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIF+
Sbjct: 480 PQKLPSVHLGVEETGVVNNEIGSLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFR 539

Query: 537 TVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKN 596
           TV+IL+EQKN DLAVE G+L+ VY+ L  IFSR+TSAEGFLFL D+TILNYWNLLQADK+
Sbjct: 540 TVVILAEQKNADLAVEEGRLDFVYKQLLNIFSRFTSAEGFLFLHDNTILNYWNLLQADKS 599

Query: 597 KLWITDKV 604
            LWITDKV
Sbjct: 600 NLWITDKV 607


>gi|147852317|emb|CAN82225.1| hypothetical protein VITISV_011873 [Vitis vinifera]
          Length = 762

 Score = 1038 bits (2685), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 496/608 (81%), Positives = 552/608 (90%), Gaps = 5/608 (0%)

Query: 1   MLVQDRTLPKSPKSQIRT--SSH--RFSDSKSLDFSTWVRDNLFKIVTVLLLIATIAALS 56
           MLVQDR+ PKSPK+ IR   S H  RF++ K+LDFSTW  +NL+KIVT+ LLIAT+AAL 
Sbjct: 1   MLVQDRSTPKSPKTHIRALHSLHPDRFTEPKNLDFSTWFSENLYKIVTISLLIATVAALF 60

Query: 57  FLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQPIADKSSVYSRFRSEKWIVVSVDRY 116
           FLRN  DTA+L+  ++Q  S   I  P INWNS+  ++DKS  Y+ FRSE+WI+VSV  Y
Sbjct: 61  FLRNVADTAALVSYETQAKSLEKIEFPQINWNSVALVSDKSP-YANFRSERWILVSVSNY 119

Query: 117 PTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRK 176
           PTDSL+KLVKIKGWQVLAIGNS+TP +W+LKGAIFLSL+ QANLGFRV+D LPYDS+VRK
Sbjct: 120 PTDSLRKLVKIKGWQVLAIGNSKTPSDWSLKGAIFLSLEQQANLGFRVVDHLPYDSFVRK 179

Query: 177 SCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIV 236
           + GYLFAIQHGAKKIFDADDRGDVI +DLGKHFDVEL+GEGARQ+ ILQYSHENPNRTIV
Sbjct: 180 NVGYLFAIQHGAKKIFDADDRGDVIDNDLGKHFDVELIGEGARQDIILQYSHENPNRTIV 239

Query: 237 NPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRK 296
           NPY+HFGQRSVWPRGLPLENVGEI HEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRK
Sbjct: 240 NPYIHFGQRSVWPRGLPLENVGEIGHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRK 299

Query: 297 PSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFW 356
           P LEAFDIRFD+  PKVALPQG MVPVN+FNT+Y SSAFWALMLPVSVSTMASDVLRG+W
Sbjct: 300 PGLEAFDIRFDEHAPKVALPQGTMVPVNTFNTLYHSSAFWALMLPVSVSTMASDVLRGYW 359

Query: 357 GQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEK 416
           GQRLLWEIGGYVVVYPPTVHRYD+IE+YPFSEEKDLHVNVGRL+KFLVSWRS+KHR FEK
Sbjct: 360 GQRLLWEIGGYVVVYPPTVHRYDRIESYPFSEEKDLHVNVGRLLKFLVSWRSSKHRLFEK 419

Query: 417 VLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFV 476
           +LELS+ MAEEGFWTE+DVKFTAAWLQDL+AVGYQQPRLMSLELDRPRASIGHGDRKEF+
Sbjct: 420 ILELSYVMAEEGFWTEKDVKFTAAWLQDLLAVGYQQPRLMSLELDRPRASIGHGDRKEFI 479

Query: 477 PRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFK 536
           P+KLPSVHLGVEETG V+ EIG+LIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIF+
Sbjct: 480 PQKLPSVHLGVEETGVVNNEIGSLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFR 539

Query: 537 TVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKN 596
           TV+IL+EQKN DLAVE G+L+ VY+ L  IFSR+TSAEGFLFL D+TILNYWNLLQADK+
Sbjct: 540 TVVILAEQKNADLAVEEGRLDFVYKQLLNIFSRFTSAEGFLFLHDNTILNYWNLLQADKS 599

Query: 597 KLWITDKV 604
            LWITDKV
Sbjct: 600 NLWITDKV 607


>gi|449437678|ref|XP_004136618.1| PREDICTED: uncharacterized protein LOC101214137 [Cucumis sativus]
 gi|449523175|ref|XP_004168600.1| PREDICTED: uncharacterized protein LOC101224948 [Cucumis sativus]
          Length = 762

 Score = 1035 bits (2676), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 487/608 (80%), Positives = 548/608 (90%), Gaps = 4/608 (0%)

Query: 1   MLVQDRTLPKSPKSQIRT----SSHRFSDSKSLDFSTWVRDNLFKIVTVLLLIATIAALS 56
           MLVQ+R+ PKSPK+QIRT     SHRFS+SKSLDFSTW+ DN++++VT+LLLI T+AAL 
Sbjct: 1   MLVQERSTPKSPKTQIRTLPTLHSHRFSESKSLDFSTWLSDNVYRVVTILLLIVTVAALF 60

Query: 57  FLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQPIADKSSVYSRFRSEKWIVVSVDRY 116
           FLRN  D+A+L+  +SQ  +   I  P I+WNSI  I   S++Y  FRSE+WIVVSV  Y
Sbjct: 61  FLRNVGDSAALLCFQSQTAALEKIQFPKIDWNSIASIPASSNLYPEFRSEQWIVVSVSNY 120

Query: 117 PTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRK 176
           P+DSL+KLVK+KGWQVLAIGNS TP +W LKGAI+LSLD Q+ LGFRV+++LPYDS+VRK
Sbjct: 121 PSDSLRKLVKMKGWQVLAIGNSLTPADWALKGAIYLSLDEQSKLGFRVVEYLPYDSFVRK 180

Query: 177 SCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIV 236
           + GYLFAIQHGAKKIFD DDRG+VI  DLGKHFDV+LVGEGARQE ILQYSHENPNRT+V
Sbjct: 181 TVGYLFAIQHGAKKIFDVDDRGEVIDGDLGKHFDVQLVGEGARQEIILQYSHENPNRTVV 240

Query: 237 NPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRK 296
           NPY+HFGQRSVWPRGLPLENVGE++HEEFYTE+FGGKQFIQQGISNGLPDVDSVFYFTRK
Sbjct: 241 NPYIHFGQRSVWPRGLPLENVGELAHEEFYTEIFGGKQFIQQGISNGLPDVDSVFYFTRK 300

Query: 297 PSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFW 356
             LEAFDIRFD+R PKVALPQGMMVP+NSFNT+Y +SAFWALMLPVS+STMASDVLRG+W
Sbjct: 301 SGLEAFDIRFDERAPKVALPQGMMVPINSFNTLYHTSAFWALMLPVSISTMASDVLRGYW 360

Query: 357 GQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEK 416
           GQRLLWEIGGYVVVYPPT+HRYDKIEAYPFSEE+DLHVNVGRL+KFL SWRS+KHR FEK
Sbjct: 361 GQRLLWEIGGYVVVYPPTIHRYDKIEAYPFSEERDLHVNVGRLVKFLNSWRSSKHRLFEK 420

Query: 417 VLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFV 476
           +LELS  MAEEGFWTE+DVKFTAAWLQDLIAVGYQQPRLMSLELDRPRA+IG GDRKEFV
Sbjct: 421 ILELSFVMAEEGFWTEKDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRATIGDGDRKEFV 480

Query: 477 PRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFK 536
           P+KLPS+HLGVEETGTVSYEIGNLIRWRK FGNVVLIMFC+ PVERTALEWRLLYGRIFK
Sbjct: 481 PQKLPSIHLGVEETGTVSYEIGNLIRWRKFFGNVVLIMFCNSPVERTALEWRLLYGRIFK 540

Query: 537 TVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKN 596
           TVIILSE KN DL VE G+L+  Y++LPK+F  Y+ AEGFLFLQDDTILNYWNLLQADK+
Sbjct: 541 TVIILSETKNADLVVEEGRLDHAYKYLPKVFDTYSGAEGFLFLQDDTILNYWNLLQADKS 600

Query: 597 KLWITDKV 604
           KLWITDKV
Sbjct: 601 KLWITDKV 608


>gi|224087016|ref|XP_002308029.1| predicted protein [Populus trichocarpa]
 gi|222854005|gb|EEE91552.1| predicted protein [Populus trichocarpa]
          Length = 771

 Score = 1033 bits (2672), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 501/618 (81%), Positives = 549/618 (88%), Gaps = 15/618 (2%)

Query: 1   MLVQDRTL----PKSPKSQIRTS--------SHRFSDSKSLDFSTWVRDNLFKIVTVLLL 48
           MLVQDR      PKSPKSQIR S         HRFS+SKSLDFSTWV +N  KIVT+ +L
Sbjct: 1   MLVQDRVTTNPNPKSPKSQIRASINSHHHDLHHRFSESKSLDFSTWVSENFCKIVTITVL 60

Query: 49  IATIAALSFLRNFTDTASL--IQSKSQEHSPNAIPLPVINWNSIQPIADKSSVYSRFRSE 106
           +AT+AA+ FL +  DTA+L  IQSK+Q       P P INWN+I  IADKSS Y+ FRSE
Sbjct: 61  VATVAAILFLLSTGDTAALSYIQSKAQPLDKAHHP-PRINWNNIPSIADKSSPYTNFRSE 119

Query: 107 KWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLD 166
           KWIVVSV  YP+DSLKKLV+IKGWQ+LAIGNSRTP +W+LKGAI+LSL+ QA LGFRV  
Sbjct: 120 KWIVVSVSHYPSDSLKKLVRIKGWQLLAIGNSRTPNDWSLKGAIYLSLEQQATLGFRVSG 179

Query: 167 FLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQY 226
           +LP+DSY+RKS GYLFAIQHGAKKIFDADDRG+VI  DLGKHFDVEL+GEGARQETILQY
Sbjct: 180 YLPFDSYLRKSVGYLFAIQHGAKKIFDADDRGEVIDGDLGKHFDVELIGEGARQETILQY 239

Query: 227 SHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPD 286
           SHEN NR++VNPYVHFGQR+VWPRGLPLENVGEI HEEFYTEVFGGKQFIQQGISNGLPD
Sbjct: 240 SHENENRSVVNPYVHFGQRTVWPRGLPLENVGEIGHEEFYTEVFGGKQFIQQGISNGLPD 299

Query: 287 VDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVST 346
           VDSVFY TRK  LEAFDIRFD+R PKVALPQG+M+PVNSFNTIY SSAFW LMLPVSVST
Sbjct: 300 VDSVFYHTRKTGLEAFDIRFDERAPKVALPQGVMMPVNSFNTIYHSSAFWGLMLPVSVST 359

Query: 347 MASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSW 406
           MASDVLRG+WGQRLLWEIGGYVVVYPPTVHRYD +  YPFSEEKDLHVNVGRLIKFLV+W
Sbjct: 360 MASDVLRGYWGQRLLWEIGGYVVVYPPTVHRYDTVGGYPFSEEKDLHVNVGRLIKFLVAW 419

Query: 407 RSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRAS 466
           RS+KH  FEK+LELS +MAEEGFW+E+DVKFTAAWLQDL+AVGYQQPRLMS ELDRPR +
Sbjct: 420 RSSKHELFEKILELSFAMAEEGFWSEQDVKFTAAWLQDLLAVGYQQPRLMSFELDRPRPN 479

Query: 467 IGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALE 526
           IGHGDRKEFVPRKLPSVHLGVEETGTV+YEIGNLIRWRKNFGNVVLIMFC+GPVERTALE
Sbjct: 480 IGHGDRKEFVPRKLPSVHLGVEETGTVNYEIGNLIRWRKNFGNVVLIMFCNGPVERTALE 539

Query: 527 WRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILN 586
           WRLLYGRIFKTVIILS QKNEDLA+EAG L+++Y+HLPKIF RY+SAEGFLFLQDDTILN
Sbjct: 540 WRLLYGRIFKTVIILSSQKNEDLAIEAGHLDRMYKHLPKIFDRYSSAEGFLFLQDDTILN 599

Query: 587 YWNLLQADKNKLWITDKV 604
           YWNLLQADK KLWITDKV
Sbjct: 600 YWNLLQADKTKLWITDKV 617


>gi|224139872|ref|XP_002323318.1| predicted protein [Populus trichocarpa]
 gi|222867948|gb|EEF05079.1| predicted protein [Populus trichocarpa]
          Length = 771

 Score = 1012 bits (2617), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 488/617 (79%), Positives = 545/617 (88%), Gaps = 13/617 (2%)

Query: 1   MLVQDRTL----PKSPKSQIR-TSSH-------RFSDSKSLDFSTWVRDNLFKIVTVLLL 48
           MLVQ R      PKSPKSQIR T +H       RFS+SKSLDFSTWV +N +KI+T+ +L
Sbjct: 1   MLVQGRVTTNPNPKSPKSQIRPTINHNHHDLHQRFSESKSLDFSTWVSENFYKIITITVL 60

Query: 49  IATIAALSFLRNFTDTASLIQSKSQEHSPNAIP-LPVINWNSIQPIADKSSVYSRFRSEK 107
           IAT+AA+ FLR+  DTA+ +  +SQ    +     P I+WN+I  I DKSS Y+ FRSEK
Sbjct: 61  IATVAAIFFLRSTGDTAAFLYLQSQAQPLDKTHHFPRIDWNNIPAITDKSSPYANFRSEK 120

Query: 108 WIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDF 167
           WIVVSV  YP+DSLKKLV+IKGWQ+LAIGNSRTP +W+LKGAI+LSL+ QA+LGFRVL +
Sbjct: 121 WIVVSVSHYPSDSLKKLVRIKGWQLLAIGNSRTPNDWSLKGAIYLSLEQQASLGFRVLGY 180

Query: 168 LPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYS 227
           +PYDSY+RKS GYLFAIQHGAKKIFDADDRG+VI  DLGKHFDVEL+GEGARQETILQYS
Sbjct: 181 VPYDSYLRKSVGYLFAIQHGAKKIFDADDRGEVIDGDLGKHFDVELIGEGARQETILQYS 240

Query: 228 HENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDV 287
           HEN NR++VNPYVHFGQR+VWPRGLPLENVGE+ HEEFYTEV+GGKQFIQQGISNGLPDV
Sbjct: 241 HENENRSVVNPYVHFGQRTVWPRGLPLENVGELGHEEFYTEVYGGKQFIQQGISNGLPDV 300

Query: 288 DSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTM 347
           DSVFY+TRK  LEAFDIRFD+R PKVALPQG+MVPVNSFNTIY SSAFW LMLPVSVS M
Sbjct: 301 DSVFYYTRKTGLEAFDIRFDERAPKVALPQGVMVPVNSFNTIYHSSAFWGLMLPVSVSNM 360

Query: 348 ASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWR 407
           ASDVLRG+WGQRLLWEIGGYVVVYPPTVHRYD +  YPFSEEKDLHVNVGRL+KFLV+WR
Sbjct: 361 ASDVLRGYWGQRLLWEIGGYVVVYPPTVHRYDTVGGYPFSEEKDLHVNVGRLVKFLVAWR 420

Query: 408 SNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASI 467
           S++HR FEK+LELS +MAE GFW+E+DVKFTAAWLQDL+AVGY+QPRLMS ELDRPR +I
Sbjct: 421 SSEHRLFEKILELSFAMAEGGFWSEQDVKFTAAWLQDLLAVGYRQPRLMSFELDRPRPTI 480

Query: 468 GHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEW 527
           GHGDRKEFVPRK PSVHLGVEETGTV+YEI NLIRWRKNFGNVVLIMFC+GPVERTALEW
Sbjct: 481 GHGDRKEFVPRKFPSVHLGVEETGTVNYEIANLIRWRKNFGNVVLIMFCNGPVERTALEW 540

Query: 528 RLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNY 587
           RLLYGRIFKTVIILS QKNEDLAVEAG L+ +Y+HLPKIF RY+SAEGFLFLQDDTILNY
Sbjct: 541 RLLYGRIFKTVIILSWQKNEDLAVEAGHLDHIYKHLPKIFDRYSSAEGFLFLQDDTILNY 600

Query: 588 WNLLQADKNKLWITDKV 604
           WNLLQA K KLWITDKV
Sbjct: 601 WNLLQASKAKLWITDKV 617


>gi|356500503|ref|XP_003519071.1| PREDICTED: uncharacterized protein LOC100786801 [Glycine max]
          Length = 759

 Score =  998 bits (2581), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 470/606 (77%), Positives = 539/606 (88%), Gaps = 4/606 (0%)

Query: 1   MLVQDRTLPKS--PKSQIRTSSHRFSDSKSLDFSTWVRDNLFKIVTVLLLIATIAALSFL 58
           M+VQ+R+LPKS  PK   RT++   + +KSLDFS WV DNL +IV VLLL+AT+AAL FL
Sbjct: 1   MMVQERSLPKSVNPKPHTRTAA--LASTKSLDFSAWVSDNLVRIVAVLLLVATVAALFFL 58

Query: 59  RNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQPIADKSSVYSRFRSEKWIVVSVDRYPT 118
           RN  DTA+L+  ++Q      I  P ++W++I PIADK+S +S FRSEKWIVVSV  YP+
Sbjct: 59  RNVGDTAALLCFENQARELERIAYPRVDWSAIAPIADKTSKFSSFRSEKWIVVSVSGYPS 118

Query: 119 DSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSC 178
           ++L++LVK+KGWQV+A+G S TP +W LKGAIFLSL+ Q NLGFRV+D+LPYDS+VRKS 
Sbjct: 119 EALRRLVKMKGWQVVAVGGSNTPSDWTLKGAIFLSLEEQVNLGFRVVDYLPYDSFVRKSV 178

Query: 179 GYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIVNP 238
           GYLFAIQHGAKKIFDADDRG+VI DDLGKHFDVELVGEGARQE +LQYSH+NPNRT+VNP
Sbjct: 179 GYLFAIQHGAKKIFDADDRGEVIDDDLGKHFDVELVGEGARQEVLLQYSHDNPNRTVVNP 238

Query: 239 YVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPS 298
           YVHFGQRSVWPRGLPLE VGEI HEEFYT+VFGG QFIQQGISNGLPDVDSVFYFTRK  
Sbjct: 239 YVHFGQRSVWPRGLPLEKVGEIGHEEFYTQVFGGMQFIQQGISNGLPDVDSVFYFTRKSV 298

Query: 299 LEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQ 358
           LE FDIRFD+  PKVALPQGMMVPVNSFNT+Y SSAFWALMLPVSVSTMASDVLRG+WGQ
Sbjct: 299 LETFDIRFDEHAPKVALPQGMMVPVNSFNTMYHSSAFWALMLPVSVSTMASDVLRGYWGQ 358

Query: 359 RLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVL 418
           RLLWE+GGYVVVYPPTVHRYD+IEAYPFSEEKDLHVNVGRLI +L+SWRS+KHR FEK+L
Sbjct: 359 RLLWEVGGYVVVYPPTVHRYDRIEAYPFSEEKDLHVNVGRLINYLISWRSDKHRLFEKIL 418

Query: 419 ELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVPR 478
           +LS +MAEEGFWTE+DVK TAAWLQDL+AVGYQQPRLMSLEL RPRA+IGHGD+KEFVP+
Sbjct: 419 DLSFAMAEEGFWTEKDVKLTAAWLQDLLAVGYQQPRLMSLELGRPRANIGHGDQKEFVPQ 478

Query: 479 KLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTV 538
           KLPSVHLGVEETGTV+YEI NLIRWRK FGNVVLIM C+GPVERTALEWRLLYGRIF++V
Sbjct: 479 KLPSVHLGVEETGTVNYEISNLIRWRKTFGNVVLIMHCNGPVERTALEWRLLYGRIFRSV 538

Query: 539 IILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKL 598
           +ILSE+K+ DL V  G L+  YR+LPKIF +++SAEGFLF+QD+TILNYWNLLQADK KL
Sbjct: 539 VILSEKKDVDLVVGEGHLDYAYRYLPKIFDQFSSAEGFLFVQDNTILNYWNLLQADKTKL 598

Query: 599 WITDKV 604
           WIT+KV
Sbjct: 599 WITNKV 604


>gi|297739659|emb|CBI29841.3| unnamed protein product [Vitis vinifera]
          Length = 726

 Score =  994 bits (2569), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 482/608 (79%), Positives = 530/608 (87%), Gaps = 41/608 (6%)

Query: 1   MLVQDRTLPKSPKSQIRT--SSH--RFSDSKSLDFSTWVRDNLFKIVTVLLLIATIAALS 56
           MLVQDR+ PKSPK+ IR   S H  RF++ K+LDFSTW  +NL+KIVT+ LLIAT+AAL 
Sbjct: 1   MLVQDRSTPKSPKTHIRALHSLHPDRFTEPKNLDFSTWFSENLYKIVTISLLIATVAALF 60

Query: 57  FLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQPIADKSSVYSRFRSEKWIVVSVDRY 116
           FLRN                                     S Y+ FRSE+WI+VSV  Y
Sbjct: 61  FLRN-------------------------------------SPYANFRSERWILVSVSNY 83

Query: 117 PTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRK 176
           PTDSL+KLVKIKGWQVLAIGNS+TP +W+LKGAIFLSL+ QANLGFRV+D LPYDS+VRK
Sbjct: 84  PTDSLRKLVKIKGWQVLAIGNSKTPSDWSLKGAIFLSLEQQANLGFRVVDHLPYDSFVRK 143

Query: 177 SCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIV 236
           + GYLFAIQHGAKKIFDADDRGDVI +DLGKHFDVEL+GEGARQ+ ILQYSHENPNRTIV
Sbjct: 144 NVGYLFAIQHGAKKIFDADDRGDVIDNDLGKHFDVELIGEGARQDIILQYSHENPNRTIV 203

Query: 237 NPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRK 296
           NPY+HFGQRSVWPRGLPLENVGEI HEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRK
Sbjct: 204 NPYIHFGQRSVWPRGLPLENVGEIGHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRK 263

Query: 297 PSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFW 356
           P LEAFDIRFD+  PKVALPQG MVPVNSFNT+Y SSAFWALMLPVSVSTMASDVLRG+W
Sbjct: 264 PGLEAFDIRFDEHAPKVALPQGTMVPVNSFNTLYHSSAFWALMLPVSVSTMASDVLRGYW 323

Query: 357 GQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEK 416
           GQRLLWEIGGYVVVYPPTVHRYD+IE+YPFSEEKDLHVNVGRL+KFLVSWRS+KHR FEK
Sbjct: 324 GQRLLWEIGGYVVVYPPTVHRYDRIESYPFSEEKDLHVNVGRLLKFLVSWRSSKHRLFEK 383

Query: 417 VLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFV 476
           +LELS+ MAEEGFWTE+DVKFTAAWLQDL+AVGYQQPRLMSLELDRPRASIGHGDRKEF+
Sbjct: 384 ILELSYVMAEEGFWTEKDVKFTAAWLQDLLAVGYQQPRLMSLELDRPRASIGHGDRKEFI 443

Query: 477 PRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFK 536
           P+KLPSVHLGVEETG V+ EIG+LIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIF+
Sbjct: 444 PQKLPSVHLGVEETGVVNNEIGSLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFR 503

Query: 537 TVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKN 596
           TV+IL+EQKN DLAVE G+L+ VY+ L  IFSR+TSAEGFLFL D+TILNYWNLLQADK+
Sbjct: 504 TVVILAEQKNADLAVEEGRLDFVYKQLLNIFSRFTSAEGFLFLHDNTILNYWNLLQADKS 563

Query: 597 KLWITDKV 604
            LWITDKV
Sbjct: 564 NLWITDKV 571


>gi|18405801|ref|NP_565960.1| uncharacterized protein [Arabidopsis thaliana]
 gi|2335100|gb|AAC02770.1| expressed protein [Arabidopsis thaliana]
 gi|15810461|gb|AAL07118.1| unknown protein [Arabidopsis thaliana]
 gi|330254936|gb|AEC10030.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 771

 Score =  992 bits (2565), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 477/614 (77%), Positives = 535/614 (87%), Gaps = 10/614 (1%)

Query: 1   MLVQDRTLP---KSPKSQIRT-SSH-----RFSDSKSLDFSTWVRDNLFKIVTVLLLIAT 51
           MLVQDR  P   K PKSQIR   +H     RFS+ K+LDFSTW  +NL +I    LLI T
Sbjct: 1   MLVQDRAAPSPAKPPKSQIRELPTHQQIRRRFSEPKNLDFSTWFSENLSRIAVFSLLIVT 60

Query: 52  IAALSFLRNFTDTASLIQSKSQEHS-PNAIPLPVINWNSIQPIADKSSVYSRFRSEKWIV 110
           I A  FL N TDTASL+  +SQ      ++  P I WNSI  + DK+S Y+ F++EKWIV
Sbjct: 61  IVAFFFLYNTTDTASLLCFQSQSTQFLQSLSRPQIKWNSIPVVPDKTSPYANFQTEKWIV 120

Query: 111 VSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPY 170
           VSV +YPT+ LK LVKI+GWQVLAIGNS TPK+W+LKG+IFLSLD QA LG+RVLD LPY
Sbjct: 121 VSVTKYPTEELKSLVKIRGWQVLAIGNSATPKDWSLKGSIFLSLDAQAELGYRVLDHLPY 180

Query: 171 DSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHEN 230
           DS+VRKS GYLFAIQHGAKKI+DADDRG+VI  DLGKHFDVELVG  ++QE ILQYSHEN
Sbjct: 181 DSFVRKSVGYLFAIQHGAKKIYDADDRGEVIDGDLGKHFDVELVGLDSKQEPILQYSHEN 240

Query: 231 PNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSV 290
           PNRT+VNPY+HFGQRSVWPRGLPLENVGEI+HEE+YTEVFGGKQFIQQGISNGLPDVDSV
Sbjct: 241 PNRTVVNPYIHFGQRSVWPRGLPLENVGEINHEEYYTEVFGGKQFIQQGISNGLPDVDSV 300

Query: 291 FYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASD 350
           FYFTRK +LEAFDIRFD+  PKVALPQG+MVPVNSFNT+Y SSAFW LMLPVSVS+MASD
Sbjct: 301 FYFTRKTTLEAFDIRFDEHSPKVALPQGVMVPVNSFNTLYHSSAFWGLMLPVSVSSMASD 360

Query: 351 VLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNK 410
           VLRG+WGQRLLWE+GGYV VYPPT HR+D+IEAYPF EEKDLHVNVGRLIKFL++WRS K
Sbjct: 361 VLRGYWGQRLLWELGGYVAVYPPTAHRFDRIEAYPFVEEKDLHVNVGRLIKFLLAWRSEK 420

Query: 411 HRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHG 470
           H FFE VL+LS +MAEEGFWTE+D+KFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHG
Sbjct: 421 HSFFETVLDLSFAMAEEGFWTEQDLKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHG 480

Query: 471 DRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLL 530
           DRKEFVPRKLPSVHLGVEETGTVS EIGNLIRWRKNFGNVVL+MFC+GPVERTALEWRLL
Sbjct: 481 DRKEFVPRKLPSVHLGVEETGTVSTEIGNLIRWRKNFGNVVLVMFCNGPVERTALEWRLL 540

Query: 531 YGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNL 590
           YGRIFKTV+ILS QKN DL VE  +L+ +Y+HLPKIF RY+SAEGFLF++DDT+LNYWNL
Sbjct: 541 YGRIFKTVVILSSQKNSDLYVEEAKLDHIYKHLPKIFDRYSSAEGFLFVEDDTVLNYWNL 600

Query: 591 LQADKNKLWITDKV 604
           LQADK+K+W TDKV
Sbjct: 601 LQADKSKIWTTDKV 614


>gi|15230300|ref|NP_191301.1| uncharacterized protein [Arabidopsis thaliana]
 gi|6706413|emb|CAB66099.1| putative protein [Arabidopsis thaliana]
 gi|53828547|gb|AAU94383.1| At3g57420 [Arabidopsis thaliana]
 gi|59958348|gb|AAX12884.1| At3g57420 [Arabidopsis thaliana]
 gi|110739068|dbj|BAF01451.1| hypothetical protein [Arabidopsis thaliana]
 gi|332646132|gb|AEE79653.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 765

 Score =  989 bits (2558), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 470/607 (77%), Positives = 533/607 (87%), Gaps = 3/607 (0%)

Query: 1   MLVQDRTLPKSPKSQIRT--SSHRFSDSKSLDFSTWVRDNLFKIVTVLLLIATIAALSFL 58
           MLVQDR  PK PKS+IR   S  RF++ K LDFS+WV DN+++IV + L I T+AA  FL
Sbjct: 1   MLVQDRVAPKPPKSRIRELPSRDRFAEPKILDFSSWVSDNVYRIVIIFLFIVTVAAFFFL 60

Query: 59  RNFTDTASLIQ-SKSQEHSPNAIPLPVINWNSIQPIADKSSVYSRFRSEKWIVVSVDRYP 117
            N TDTASL+        S  ++  P INWNSIQ ++DK+S Y+ FR+EKWIVVSV ++P
Sbjct: 61  YNTTDTASLLCFQSQSTQSLQSLTRPQINWNSIQIVSDKTSPYASFRTEKWIVVSVTKHP 120

Query: 118 TDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKS 177
           T+ LK LVKIKGWQVLAIGNS TPK+WNLKGAIFLSLD QA L +R+LD LPYDS+VRKS
Sbjct: 121 TEELKGLVKIKGWQVLAIGNSLTPKDWNLKGAIFLSLDAQAELNYRILDHLPYDSFVRKS 180

Query: 178 CGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIVN 237
            GYLFAIQHGAKKIFDADDRG+VI  DLGKHFDVELVGE ARQE ILQYSHENPNRT+VN
Sbjct: 181 VGYLFAIQHGAKKIFDADDRGEVIDGDLGKHFDVELVGEDARQEPILQYSHENPNRTVVN 240

Query: 238 PYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKP 297
           PY+HFGQRSVWPRGLPLENVGEI+HEE+YTEVFGGKQFIQQGISNGLPDVDSV+Y TRK 
Sbjct: 241 PYIHFGQRSVWPRGLPLENVGEINHEEYYTEVFGGKQFIQQGISNGLPDVDSVYYSTRKT 300

Query: 298 SLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWG 357
           + E FDIRFD+  PKVALPQGMMVPVNSFNT+Y SSAFW LMLPVSVS+MASDV+RG+WG
Sbjct: 301 TFEPFDIRFDEHSPKVALPQGMMVPVNSFNTLYHSSAFWGLMLPVSVSSMASDVIRGYWG 360

Query: 358 QRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKV 417
           QRLLWE+GGYV VYPPTVHRYD++EAYPFS+EKDLH+NVGRLIKFL++WRSNKHRFFE +
Sbjct: 361 QRLLWELGGYVAVYPPTVHRYDRVEAYPFSDEKDLHINVGRLIKFLLAWRSNKHRFFETI 420

Query: 418 LELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVP 477
           L+LS  MAE+GFWTE DVKFTAAWLQDL+ VGYQQPRLMSLELDRPRA+IGHGDRKEFVP
Sbjct: 421 LDLSFVMAEQGFWTELDVKFTAAWLQDLLMVGYQQPRLMSLELDRPRATIGHGDRKEFVP 480

Query: 478 RKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKT 537
           RKLPSVHLGVEE GTVS EIGNLI+WRKNFGNVVLIMFC+GPVERTALEWRLLYGRIFKT
Sbjct: 481 RKLPSVHLGVEEIGTVSSEIGNLIKWRKNFGNVVLIMFCNGPVERTALEWRLLYGRIFKT 540

Query: 538 VIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNK 597
           V+ILS +KN DL V+  +L+ +Y+ LPKIF RY+SA+GF+F++DDT+LNYWNLLQADK K
Sbjct: 541 VVILSSRKNSDLYVQEAKLDHIYKRLPKIFDRYSSADGFVFVEDDTVLNYWNLLQADKTK 600

Query: 598 LWITDKV 604
           LW TDKV
Sbjct: 601 LWTTDKV 607


>gi|297820532|ref|XP_002878149.1| hypothetical protein ARALYDRAFT_907203 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323987|gb|EFH54408.1| hypothetical protein ARALYDRAFT_907203 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 765

 Score =  986 bits (2549), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 471/607 (77%), Positives = 532/607 (87%), Gaps = 3/607 (0%)

Query: 1   MLVQDRTLPKSPKSQIRT--SSHRFSDSKSLDFSTWVRDNLFKIVTVLLLIATIAALSFL 58
           MLVQDR  PK PKS+IR   S  RF++ K+LDFS+WV DN+++IV   L I T+AA  FL
Sbjct: 1   MLVQDRVAPKPPKSRIRELPSRDRFAEPKNLDFSSWVSDNVYRIVIFFLFIVTVAAFFFL 60

Query: 59  RNFTDTASLIQ-SKSQEHSPNAIPLPVINWNSIQPIADKSSVYSRFRSEKWIVVSVDRYP 117
            N TDTASL+        S  ++  P INWNSIQ ++DK+S Y+ FR+EKWIVVSV +YP
Sbjct: 61  YNTTDTASLLCFQSQSTQSLQSLTRPQINWNSIQIVSDKTSPYASFRTEKWIVVSVTKYP 120

Query: 118 TDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKS 177
           T+ LK LVKIKGWQVLAIGNS TPK+W LKGAIFLSLD QA L +R+LD LPYDS+VRKS
Sbjct: 121 TEELKGLVKIKGWQVLAIGNSLTPKDWILKGAIFLSLDAQAELNYRILDHLPYDSFVRKS 180

Query: 178 CGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIVN 237
            GYLFAIQHGAKKI+DADDRG+VI  DLGKHFDVELVGE ARQE ILQYSHENPNRT+VN
Sbjct: 181 VGYLFAIQHGAKKIYDADDRGEVIDGDLGKHFDVELVGEDARQEPILQYSHENPNRTVVN 240

Query: 238 PYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKP 297
           PY+HFGQRSVWPRGLPLENVGEI+HEE+YTEVFGGKQFIQQGISNGLPDVDSV+Y TRK 
Sbjct: 241 PYIHFGQRSVWPRGLPLENVGEINHEEYYTEVFGGKQFIQQGISNGLPDVDSVYYSTRKT 300

Query: 298 SLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWG 357
           + E FDIRFD+  PKVALPQGMMVPVNSFNT+Y SSAFW LMLPVSVS+MASDV+RG+WG
Sbjct: 301 TFEPFDIRFDEHSPKVALPQGMMVPVNSFNTLYHSSAFWGLMLPVSVSSMASDVIRGYWG 360

Query: 358 QRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKV 417
           QRLLWE+GGYV VYPPTVHRYD++EAYPFS+EKDLHVNVGRLIKFL++WRSNKHRFFE +
Sbjct: 361 QRLLWELGGYVAVYPPTVHRYDRVEAYPFSDEKDLHVNVGRLIKFLLAWRSNKHRFFETI 420

Query: 418 LELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVP 477
           L+LS  MAE+GFWTE DVKFTAAWLQDL+ VGYQQPRLMSLELDRPRA+IGHGDRKEFVP
Sbjct: 421 LDLSFVMAEQGFWTELDVKFTAAWLQDLLMVGYQQPRLMSLELDRPRATIGHGDRKEFVP 480

Query: 478 RKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKT 537
           RKLPSVHLGVEE GTVS EIGNLI+WRKNFGNVVLIMFC+GPVERTALEWRLLYGRIFKT
Sbjct: 481 RKLPSVHLGVEEIGTVSSEIGNLIKWRKNFGNVVLIMFCNGPVERTALEWRLLYGRIFKT 540

Query: 538 VIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNK 597
           V+ILS +K+ DL V+  +L+ +Y+ LPKIF RY+SA+GFLF++DDTILNYWNLLQADK K
Sbjct: 541 VVILSSRKDSDLYVQEAKLDHIYKRLPKIFDRYSSADGFLFVEDDTILNYWNLLQADKTK 600

Query: 598 LWITDKV 604
           LW TDKV
Sbjct: 601 LWTTDKV 607


>gi|297827827|ref|XP_002881796.1| hypothetical protein ARALYDRAFT_483259 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327635|gb|EFH58055.1| hypothetical protein ARALYDRAFT_483259 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 771

 Score =  982 bits (2539), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 474/614 (77%), Positives = 533/614 (86%), Gaps = 10/614 (1%)

Query: 1   MLVQDRTLP---KSPKSQIRT-SSH-----RFSDSKSLDFSTWVRDNLFKIVTVLLLIAT 51
           MLVQDR  P   K PKSQIR   +H     RFS+ K+LDFSTW  +NL +I    LLI T
Sbjct: 1   MLVQDRAAPSPAKPPKSQIRELPTHQQIRRRFSEPKNLDFSTWFSENLSRIAVFSLLIVT 60

Query: 52  IAALSFLRNFTDTASLIQ-SKSQEHSPNAIPLPVINWNSIQPIADKSSVYSRFRSEKWIV 110
           I AL FL N TDTASL+        S  ++  P I WNSI+ + DK+S Y+ F +EKWIV
Sbjct: 61  IVALFFLYNTTDTASLLCFQSQSTQSLQSLSRPQIKWNSIRVVPDKTSPYANFLTEKWIV 120

Query: 111 VSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPY 170
           VSV +YPT+ LK LVKI+GWQVLAIGNS TPK+W+LKG+IFLSLD QA LG+RVLD LPY
Sbjct: 121 VSVTKYPTEELKSLVKIRGWQVLAIGNSVTPKDWSLKGSIFLSLDAQAELGYRVLDHLPY 180

Query: 171 DSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHEN 230
           DS+VRKS GYLFAIQHGAKKI+DADDRG+VI  DLGKHFDVELVG  ++QE ILQYSHEN
Sbjct: 181 DSFVRKSVGYLFAIQHGAKKIYDADDRGEVIDGDLGKHFDVELVGVDSKQEPILQYSHEN 240

Query: 231 PNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSV 290
           PNRT+VNPY+HFGQRSVWPRGLPLENVGEI+HEE+YTEVFGGKQFIQQGISNGLPDVDSV
Sbjct: 241 PNRTVVNPYIHFGQRSVWPRGLPLENVGEINHEEYYTEVFGGKQFIQQGISNGLPDVDSV 300

Query: 291 FYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASD 350
           FYFTRK +LEAFDIRFD+  PKVALPQG+MVPVNSFNT+Y SSAFW LMLPVSVS MASD
Sbjct: 301 FYFTRKTTLEAFDIRFDEHSPKVALPQGVMVPVNSFNTLYHSSAFWGLMLPVSVSCMASD 360

Query: 351 VLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNK 410
           VLRG+WGQRLLWE+GGYV VYPPT HR+D+IEAYPF EEKDLHVNVGRLIKFL++WRS K
Sbjct: 361 VLRGYWGQRLLWELGGYVAVYPPTAHRFDRIEAYPFVEEKDLHVNVGRLIKFLLAWRSEK 420

Query: 411 HRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHG 470
           H FFE +L+LS +MAEEGFWTE+D+KFTAAWLQDLIAVGYQQPRLMSLELDRPRA+IGHG
Sbjct: 421 HSFFETILDLSFAMAEEGFWTEQDLKFTAAWLQDLIAVGYQQPRLMSLELDRPRANIGHG 480

Query: 471 DRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLL 530
           DRKEFVPRKLPSVHLGVEETGTVS EIGNLIRWRKNFGNVVL+MFCSGPVERTALEWRLL
Sbjct: 481 DRKEFVPRKLPSVHLGVEETGTVSTEIGNLIRWRKNFGNVVLVMFCSGPVERTALEWRLL 540

Query: 531 YGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNL 590
           YGRIFKTV+ILS QKN DL ++  +L+ +Y+HLPKIF RY+SAEGFLF++DDT+LNYWNL
Sbjct: 541 YGRIFKTVVILSSQKNSDLYIKEAKLDHIYKHLPKIFDRYSSAEGFLFVEDDTVLNYWNL 600

Query: 591 LQADKNKLWITDKV 604
           LQADK+K+W TDKV
Sbjct: 601 LQADKSKIWTTDKV 614


>gi|356534762|ref|XP_003535921.1| PREDICTED: uncharacterized protein LOC100805551 [Glycine max]
          Length = 759

 Score =  974 bits (2517), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 464/604 (76%), Positives = 537/604 (88%)

Query: 1   MLVQDRTLPKSPKSQIRTSSHRFSDSKSLDFSTWVRDNLFKIVTVLLLIATIAALSFLRN 60
           M+VQ+R+LPKS  S+    +   + +KSLDFS WV DNL +IV V+LL+AT+AA+ FLRN
Sbjct: 1   MMVQERSLPKSVNSKPHARTAALASTKSLDFSAWVSDNLVRIVAVVLLVATVAAVFFLRN 60

Query: 61  FTDTASLIQSKSQEHSPNAIPLPVINWNSIQPIADKSSVYSRFRSEKWIVVSVDRYPTDS 120
             DTA+L+  ++Q      I  P ++W++I PIAD++S +S FRSEKWIVVSV  YP+D+
Sbjct: 61  AGDTAALLCFENQARELERIAYPRVDWSAIAPIADRTSKFSSFRSEKWIVVSVSGYPSDA 120

Query: 121 LKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGY 180
           L++LVK+KGWQV+A+G S TP +W LKGAIFLSL+ Q NLGFRV+D+LPYDS+VRKS GY
Sbjct: 121 LRRLVKMKGWQVVAVGGSNTPSDWTLKGAIFLSLEEQVNLGFRVVDYLPYDSFVRKSVGY 180

Query: 181 LFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPYV 240
           LFAIQHGAKKIFDADDRG+VI  DLGKHFDVELVGE ARQE +LQYSH+NPNRT+VNPYV
Sbjct: 181 LFAIQHGAKKIFDADDRGEVIDGDLGKHFDVELVGEAARQEVLLQYSHDNPNRTVVNPYV 240

Query: 241 HFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLE 300
           HFGQRSVWPRGLPLENVGEI HEEFYT+VFGGKQFIQQGISNGLPDVDSVFYFTRK  LE
Sbjct: 241 HFGQRSVWPRGLPLENVGEIGHEEFYTQVFGGKQFIQQGISNGLPDVDSVFYFTRKSGLE 300

Query: 301 AFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRL 360
           AFDI+FD+  PKVALPQGMMVPVNSFNT+Y S AFWALMLPVSVSTMASDVLRG+WGQRL
Sbjct: 301 AFDIQFDEHAPKVALPQGMMVPVNSFNTMYHSPAFWALMLPVSVSTMASDVLRGYWGQRL 360

Query: 361 LWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLEL 420
           LWE+GGYVVVYPPTVHRYD+IEAYPFSEEKDLHVNVGRLI +L+SWRS+KHR FEK+L+L
Sbjct: 361 LWEVGGYVVVYPPTVHRYDRIEAYPFSEEKDLHVNVGRLINYLISWRSDKHRLFEKILDL 420

Query: 421 SHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVPRKL 480
           S +MAEEGFWTE+DVK TAAWLQDL+AVGYQQPRLMSLEL RPRA+IGHGD+KEFVP+KL
Sbjct: 421 SFAMAEEGFWTEKDVKLTAAWLQDLLAVGYQQPRLMSLELGRPRANIGHGDQKEFVPQKL 480

Query: 481 PSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTVII 540
           PSVHLGVEETGTV+YEI NLI WRK FGNVVLIM+C+GPVERTALEWRLLYGRIF++V+I
Sbjct: 481 PSVHLGVEETGTVNYEIANLIWWRKTFGNVVLIMYCNGPVERTALEWRLLYGRIFRSVVI 540

Query: 541 LSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLWI 600
           LSE+K+ DL VE G L+  YR+LPKIF +++SAEGFLF+QD+TILNYWNLLQADK KLWI
Sbjct: 541 LSEKKDVDLVVEEGHLDYAYRYLPKIFDQFSSAEGFLFVQDNTILNYWNLLQADKTKLWI 600

Query: 601 TDKV 604
           T+KV
Sbjct: 601 TNKV 604


>gi|225450038|ref|XP_002273124.1| PREDICTED: uncharacterized protein LOC100256796 [Vitis vinifera]
          Length = 753

 Score =  905 bits (2338), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/605 (71%), Positives = 507/605 (83%), Gaps = 9/605 (1%)

Query: 1   MLVQDRTLPKSPKSQIRTSSHRFSDSKSLDFSTWVRDNLFKIVTVLLLIATIAALSFLRN 60
           MLVQDR + K  K+Q       F      +FSTWV  N  KI+ + LLI T+A + F+RN
Sbjct: 1   MLVQDRKIIKPSKTQSTKPQEHF------NFSTWVSSNFPKIIVISLLIVTVAVVFFVRN 54

Query: 61  FTDTASLIQS-KSQEHSPNAIPLPVINWNSIQPIADKSSVYSRFRSEKWIVVSVDRYPTD 119
             D  S++ S KS+  S   I  P I+++SI P +DKSS ++ FRSE+WIVVSV  YP+D
Sbjct: 55  --DAVSILYSGKSRSKSLKPIQFPKISFSSIPPNSDKSSPFATFRSERWIVVSVSNYPSD 112

Query: 120 SLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCG 179
           SL+ LVKIKGWQVLA+GNSRTP NW LKGAIFLSL+ QA L FR+L++LPYDSYVRKS G
Sbjct: 113 SLRSLVKIKGWQVLAVGNSRTPANWELKGAIFLSLEQQAKLEFRILEYLPYDSYVRKSVG 172

Query: 180 YLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPY 239
           YLFAIQHGAK IFDADDRG+VI  ++GK FD++L G  A QE ILQY+ ENPNRT+VNPY
Sbjct: 173 YLFAIQHGAKMIFDADDRGEVIDWEVGKRFDLDLFGVDAMQERILQYNRENPNRTVVNPY 232

Query: 240 VHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSL 299
           +HFGQRSVWPRGLPLENVGEI HEE+Y EVFGG QFIQQGISNGLPDVDSVFY TRK   
Sbjct: 233 IHFGQRSVWPRGLPLENVGEIVHEEYYNEVFGGMQFIQQGISNGLPDVDSVFYLTRKLDS 292

Query: 300 EAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQR 359
           EAFD+ FD+   KVALPQG+MVP+NSFNT++ S+AFW LMLPVSVS+MASDVLRG+W QR
Sbjct: 293 EAFDMSFDEHALKVALPQGVMVPLNSFNTLFHSNAFWGLMLPVSVSSMASDVLRGYWAQR 352

Query: 360 LLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLE 419
           LLWE+GG+VVVYPPT++R D+IEAYPFSEEKDLHVNVGRLIK+LVSWRS +HR FEK++E
Sbjct: 353 LLWEVGGFVVVYPPTIYRKDEIEAYPFSEEKDLHVNVGRLIKYLVSWRSGRHRLFEKIME 412

Query: 420 LSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVPRK 479
           LS+S+A+EGFWTERDVKFT AWLQDL+AVGYQQPRLM+LELDRPRAS G  DRKEF+PRK
Sbjct: 413 LSYSLAKEGFWTERDVKFTGAWLQDLLAVGYQQPRLMALELDRPRASSGDADRKEFIPRK 472

Query: 480 LPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTVI 539
           LPSVHL VEE+G V+YEIGNLIRWRK+F NVVLI+F SGPVERTALEWRLLYGRIFKTV+
Sbjct: 473 LPSVHLAVEESGAVNYEIGNLIRWRKSFSNVVLILFVSGPVERTALEWRLLYGRIFKTVV 532

Query: 540 ILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLW 599
           ILS + + DLAVE    +QVY++LPKIF R++SAEGFLFLQD+TILNYWNL+Q DK KLW
Sbjct: 533 ILSAKSDVDLAVEEAHPDQVYKYLPKIFERFSSAEGFLFLQDNTILNYWNLMQGDKTKLW 592

Query: 600 ITDKV 604
           ITDKV
Sbjct: 593 ITDKV 597


>gi|326525585|dbj|BAJ88839.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 777

 Score =  852 bits (2201), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 403/622 (64%), Positives = 489/622 (78%), Gaps = 16/622 (2%)

Query: 1   MLVQDRTLP------KSPKSQIRTSS----HRFSDSKSLDFSTWVRDNLFKIVTVLLLIA 50
           MLVQDR LP      KSPKS          H    +KSLDFS W  ++  +++ +L  +A
Sbjct: 1   MLVQDRVLPEHAGSNKSPKSPRAAPGSDRRHPRPFAKSLDFSNWASEHSSRLLLLLFAVA 60

Query: 51  TIAALSFLRNF-TDTASLI----QSKSQEHSPNAIPLPVINWNSIQPIADKSSV-YSRFR 104
           ++AA+  LR    D A+L+     S      P  +P P + W+ I PIA  S+  ++ FR
Sbjct: 61  SVAAVFLLRGAGPDAAALLCLDRSSSRSAAGPAKLPYPDVAWSKIPPIAIASAAPFASFR 120

Query: 105 SEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRV 164
           +E+WIVVSV   PT +L  L ++KGWQ+LA+GNS TP +W+LKGAIFLSLD+QA LG+R 
Sbjct: 121 AERWIVVSVSSPPTAALAALTRLKGWQLLAVGNSHTPSDWDLKGAIFLSLDLQAQLGYRS 180

Query: 165 LDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETIL 224
           +DFLPY S+VRK+ GYLFAIQHGAK IFDADDR +V G+DLGKHFDV+L    A    ++
Sbjct: 181 VDFLPYASHVRKTAGYLFAIQHGAKLIFDADDRAEVPGNDLGKHFDVDLGSGIANHPVLI 240

Query: 225 QYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGL 284
           QYSH +PNRT+VNPYVHFGQRSVWPRGLPL+ VGE++HE FYTE+F G+QFIQQG+S+GL
Sbjct: 241 QYSHADPNRTVVNPYVHFGQRSVWPRGLPLDKVGEVAHEAFYTEIFSGRQFIQQGLSDGL 300

Query: 285 PDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSV 344
           PDVD+VFYFTRKP    FD+RFD   PKVALPQGMM PVNSFNT++ + AFW LM+PVSV
Sbjct: 301 PDVDAVFYFTRKPPTAPFDLRFDPEAPKVALPQGMMAPVNSFNTLFHAQAFWGLMMPVSV 360

Query: 345 STMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLV 404
           S+MA+DV+RG+W QR+LWEIGGYV  YPPT++R D ++AYPF+EEKDLHVNVGRLIKFL 
Sbjct: 361 SSMAADVIRGYWAQRILWEIGGYVAFYPPTIYRKDHVQAYPFAEEKDLHVNVGRLIKFLN 420

Query: 405 SWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPR 464
            WRSNK   FEK+L+LS++MAEEGFW E+DV+ TAAWLQDL+A GY+QPRLMSLE+DR R
Sbjct: 421 EWRSNKQSLFEKILDLSYAMAEEGFWMEQDVRLTAAWLQDLLAAGYRQPRLMSLEIDRQR 480

Query: 465 ASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTA 524
           A+IG GD KEFVP+KLPSVHLGV+E GTV+YEIGNLI+WRKNFGNVVLIM  SGPV+R A
Sbjct: 481 ATIGEGDMKEFVPKKLPSVHLGVDEIGTVNYEIGNLIKWRKNFGNVVLIMHVSGPVDRVA 540

Query: 525 LEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTI 584
           LEWRLLYGRIFKTVIIL+EQ N +LAVE   L   Y++LPK+F RY  A+GFLFLQD  I
Sbjct: 541 LEWRLLYGRIFKTVIILAEQSNAELAVERCALSHAYKYLPKVFGRYGGADGFLFLQDHMI 600

Query: 585 LNYWNLLQADKNKLWITDKVLY 606
           LNYWNLLQADK KLWITDK+ +
Sbjct: 601 LNYWNLLQADKEKLWITDKIAH 622


>gi|413945237|gb|AFW77886.1| hypothetical protein ZEAMMB73_039824 [Zea mays]
          Length = 778

 Score =  828 bits (2139), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 398/623 (63%), Positives = 495/623 (79%), Gaps = 17/623 (2%)

Query: 1   MLVQDRTLPKSPKSQIRTSS-----------HRFSDSKSLDFSTWVRDNLFKIVTVLLLI 49
           MLVQDR  P +  +  + SS           H    +K+LDF+TW  ++  K++ +LL I
Sbjct: 1   MLVQDRASPHAAAAGQKPSSSPRGAPGADRRHPRPFAKNLDFATWASEHSSKLLLLLLAI 60

Query: 50  ATIAALSFLRNFT-DTASLI----QSKSQEHSPNAIPLPVINWNSIQPIA-DKSSVYSRF 103
           A+ AA+  LR    D A+L+     ++S+  +P  +P P + W+ + P+A    S ++ F
Sbjct: 61  ASAAAVFLLRGAAPDAAALLCLDRSARSRSGAPAKLPYPDVAWSKVPPLAIAAGSPFASF 120

Query: 104 RSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFR 163
           R+E+WIVV+V   PT +L  L ++KGWQ+LA+G+S TP  W LKGA+FLSL++QA LG+R
Sbjct: 121 RAERWIVVAVSSPPTAALAALARVKGWQLLAVGDSHTPAGWELKGAVFLSLELQAQLGYR 180

Query: 164 VLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETI 223
            +DFLPY S+VRK+ GYLFAIQHGAK IFDADDR +V G+DLGKHFDV+L         +
Sbjct: 181 SVDFLPYGSHVRKTAGYLFAIQHGAKVIFDADDRAEVPGNDLGKHFDVDLGSGVTNHPVL 240

Query: 224 LQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNG 283
           LQYSH +PNRT+VNPYVHFGQRSVWPRGLPL+ VGE++HE FYTEVF G+QFIQQG+S+G
Sbjct: 241 LQYSHADPNRTVVNPYVHFGQRSVWPRGLPLDKVGEVAHEVFYTEVFSGRQFIQQGLSDG 300

Query: 284 LPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVS 343
           LPDVD+VFYFTRKP   AFD+RFD   PKVALPQGMM PVNSFNT++QS AFW LM+PVS
Sbjct: 301 LPDVDAVFYFTRKPPTSAFDLRFDSEAPKVALPQGMMAPVNSFNTLFQSPAFWGLMMPVS 360

Query: 344 VSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFL 403
           VS+MA+DV+RG+W QR+LWEIGGYV  YPPT++R D I+AYPF+EEKDLHVNVGRLIKFL
Sbjct: 361 VSSMAADVIRGYWAQRILWEIGGYVAFYPPTIYRKDYIQAYPFAEEKDLHVNVGRLIKFL 420

Query: 404 VSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRP 463
             WRSNK   FEK+L+LS++MAEEGFWTE+DV+ TAAWLQDL+AVGY+QPRLMSLE+DR 
Sbjct: 421 NEWRSNKRTLFEKILDLSYAMAEEGFWTEQDVRLTAAWLQDLLAVGYRQPRLMSLEIDRQ 480

Query: 464 RASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERT 523
           RA+IG GD KEFVP+KLPSVHLGV+E GTV+YEIGNLI+WRKNFGNVV+IM  SGPV+RT
Sbjct: 481 RATIGEGDMKEFVPKKLPSVHLGVDEIGTVNYEIGNLIKWRKNFGNVVMIMHVSGPVDRT 540

Query: 524 ALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDT 583
           ALEWRLLYGRIFKTVIIL+EQ N +LAVE   L   Y++LPK+F RY+ A+GF+FLQD  
Sbjct: 541 ALEWRLLYGRIFKTVIILAEQSNAELAVERCTLSHAYKYLPKVFERYSGADGFVFLQDHM 600

Query: 584 ILNYWNLLQADKNKLWITDKVLY 606
           +LNYWNL+QADK KLWIT+K+ +
Sbjct: 601 VLNYWNLMQADKEKLWITNKIAH 623


>gi|357133852|ref|XP_003568536.1| PREDICTED: uncharacterized protein LOC100834910 [Brachypodium
           distachyon]
          Length = 783

 Score =  827 bits (2136), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 400/628 (63%), Positives = 490/628 (78%), Gaps = 22/628 (3%)

Query: 1   MLVQDRTLP-------KSPKSQIRTSS---------HRFSDSKSLDFSTWVRDNLFKIVT 44
           MLVQ R  P        + KS   TS          H    +KSLDF +W  ++  K++ 
Sbjct: 1   MLVQGRVFPDDAPGNNNNNKSAAPTSPRGAPGANRRHPRPFAKSLDFGSWASEHSSKLLL 60

Query: 45  VLLLIATIAALSFLRNF-TDTASLI----QSKSQEHSPNAIPLPVINWNSIQPIADKSSV 99
           +L  +A++AA+  LR    D A+L+     S S   +P  +P P + W+ I P+A  S+V
Sbjct: 61  LLFAVASVAAVFLLRGAGPDAAALLCLDRSSHSNNGAPARLPYPDVPWSKIPPLAVASAV 120

Query: 100 -YSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQA 158
            ++ FR+E+WIVVSV   PT +L  L ++KGWQ+L +GNS TP  W LKGAIFLSL++QA
Sbjct: 121 PFASFRAERWIVVSVSSAPTAALAALTRVKGWQLLVVGNSHTPSGWELKGAIFLSLELQA 180

Query: 159 NLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGA 218
            LG+R +DFLPY S+VRK+ GYLFAIQHGAK +FDADDR +V G+DLGKHFDV+L    A
Sbjct: 181 QLGYRSVDFLPYASHVRKTAGYLFAIQHGAKVVFDADDRAEVPGNDLGKHFDVDLGSGVA 240

Query: 219 RQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQ 278
               +LQYSH +PNRT+VNPYVHFGQRSVWPRGLPL+ VGE++HE FYTEVF G+QFIQQ
Sbjct: 241 NHPVLLQYSHADPNRTVVNPYVHFGQRSVWPRGLPLDKVGEVAHEVFYTEVFSGRQFIQQ 300

Query: 279 GISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWAL 338
           G+S+GLPDVD+VFYFTRKP    FD+RFD   PKVALPQGMM PVNSFNT++ + AFW L
Sbjct: 301 GLSDGLPDVDAVFYFTRKPPTAPFDLRFDGEAPKVALPQGMMAPVNSFNTLFHTQAFWGL 360

Query: 339 MLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGR 398
           MLPVSVS+MA+DV+RG+W QR+LWEIGGYV  YPPT++R D ++AYPF+EEKDLHVNVGR
Sbjct: 361 MLPVSVSSMAADVIRGYWAQRILWEIGGYVAFYPPTIYRKDHVQAYPFAEEKDLHVNVGR 420

Query: 399 LIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSL 458
           LIKFL  WRSNK   FE++L+LS++MAEEGFW E+DV+ TAAWLQDL+AVGY+QPRLMSL
Sbjct: 421 LIKFLNEWRSNKRTLFERILDLSYAMAEEGFWMEQDVRLTAAWLQDLLAVGYRQPRLMSL 480

Query: 459 ELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSG 518
           E+DR RA+IG GD KEFVP+KLPSVHLGV+E GTV+YEIGNLI+WRKNFGNVVLIM  SG
Sbjct: 481 EIDRQRATIGEGDMKEFVPKKLPSVHLGVDEIGTVNYEIGNLIKWRKNFGNVVLIMHVSG 540

Query: 519 PVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLF 578
           PV+RTALEWRLLYGRIFKTVIIL+EQ N +LAV+   L   Y++LPK+F RY+ A+GFLF
Sbjct: 541 PVDRTALEWRLLYGRIFKTVIILAEQSNVELAVDRCALSHAYKYLPKVFGRYSGADGFLF 600

Query: 579 LQDDTILNYWNLLQADKNKLWITDKVLY 606
           LQD  ILNYWNLLQADK KLWIT+K+ +
Sbjct: 601 LQDHMILNYWNLLQADKEKLWITNKIAH 628


>gi|242090429|ref|XP_002441047.1| hypothetical protein SORBIDRAFT_09g019340 [Sorghum bicolor]
 gi|241946332|gb|EES19477.1| hypothetical protein SORBIDRAFT_09g019340 [Sorghum bicolor]
          Length = 784

 Score =  824 bits (2129), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 399/629 (63%), Positives = 492/629 (78%), Gaps = 23/629 (3%)

Query: 1   MLVQDRTLP------------KSPKSQIRTSS-----HRFSDSKSLDFSTWVRDNLFKIV 43
           MLVQDR  P            + P S  R +      H    +K+LDF+TW  ++  K++
Sbjct: 1   MLVQDRVSPHAAAAAAAAGQNQKPSSSPRGAPGADRRHPRPFAKNLDFATWASEHSSKLL 60

Query: 44  TVLLLIATIAALSFLRNFT-DTASLI----QSKSQEHSPNAIPLPVINWNSIQPIA-DKS 97
            +L  +A+ AA+  LR    D A+L+     ++S    P  +P P + W+ + P+A    
Sbjct: 61  LLLFAVASAAAVFLLRGAAPDAAALLCLDRSARSGSGGPAKLPYPDVAWSKVPPLAIAAG 120

Query: 98  SVYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQ 157
           S ++ FR+E+WIVV+V   PT +L  L ++KGWQ+LA+G+SRTP  W LKGAIFLSL++Q
Sbjct: 121 SPFASFRAERWIVVAVSSPPTAALAALARVKGWQLLAVGDSRTPAGWELKGAIFLSLELQ 180

Query: 158 ANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEG 217
           A LG+R +DFLPY S+VRK+ GYLFAIQHGAK IFDADDR +V G+DLGKHFDV+L    
Sbjct: 181 AQLGYRSVDFLPYGSHVRKTAGYLFAIQHGAKVIFDADDRAEVPGNDLGKHFDVDLGSGV 240

Query: 218 ARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQ 277
                +LQYSH +PNRT+VNPYVHFGQRSVWPRGLPL+ VGE++HE FYTEVF G+QFIQ
Sbjct: 241 TNHPVLLQYSHADPNRTVVNPYVHFGQRSVWPRGLPLDKVGEVAHEVFYTEVFSGRQFIQ 300

Query: 278 QGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWA 337
           QG+S+GLPDVD+VFYFTRKP   AFD+RFD   PKVALPQGMM PVNSFNT++QS AFW 
Sbjct: 301 QGLSDGLPDVDAVFYFTRKPPTSAFDLRFDSEAPKVALPQGMMAPVNSFNTLFQSPAFWG 360

Query: 338 LMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVG 397
           LM+PVSVS+MA+DV+RG+W QR+LWEIGGYV  YPPT++R D I+AYPF+EEKDLHVNVG
Sbjct: 361 LMMPVSVSSMAADVIRGYWAQRILWEIGGYVAFYPPTIYRKDHIQAYPFAEEKDLHVNVG 420

Query: 398 RLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMS 457
           RLIKFL  WRSNK   FEK+L+LS++MAEEGFW E+DV+ TAAWLQDL+AVGY+QPRLMS
Sbjct: 421 RLIKFLNEWRSNKRTLFEKILDLSYAMAEEGFWMEQDVRLTAAWLQDLLAVGYRQPRLMS 480

Query: 458 LELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCS 517
           LE+DR RA+IG GD KEFVP+KLPSVHLGV+E GTV+YEIGNLI+WRKNFGNVV+IM  S
Sbjct: 481 LEIDRQRATIGEGDMKEFVPKKLPSVHLGVDEIGTVNYEIGNLIKWRKNFGNVVMIMHVS 540

Query: 518 GPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFL 577
           GPV+RTALEWRLLYGRIFKTVIIL+EQ N +LAVE   L   Y++LPK+F RY+ A+GF+
Sbjct: 541 GPVDRTALEWRLLYGRIFKTVIILAEQSNAELAVERCTLSHAYKYLPKVFERYSGADGFV 600

Query: 578 FLQDDTILNYWNLLQADKNKLWITDKVLY 606
           FLQD  ILNYWNL+QADK KLWIT+K+ +
Sbjct: 601 FLQDHMILNYWNLMQADKEKLWITNKIAH 629


>gi|449436327|ref|XP_004135944.1| PREDICTED: uncharacterized protein LOC101209752 [Cucumis sativus]
 gi|449488825|ref|XP_004158183.1| PREDICTED: uncharacterized protein LOC101229743 [Cucumis sativus]
          Length = 757

 Score =  818 bits (2112), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 394/605 (65%), Positives = 481/605 (79%), Gaps = 5/605 (0%)

Query: 1   MLVQDRTLPKSPKSQIRTSSHRFSDSKSLDFSTWVRDNLFKIVTVLLLIATIAALSFLRN 60
           MLVQDR   ++PK      ++ F +SK  DFS WV  NLFK+ T+  L  TIA+  FLR 
Sbjct: 1   MLVQDR---QNPKPHQIPLANPFPESKPFDFSNWVSLNLFKLATLFFLTLTIASFFFLRG 57

Query: 61  FTDTASLIQSKSQEHSPNAIPLPVINWNSIQPIADKSSVYSRFRSEKWIVVSVDRYPTDS 120
             D+A+ +   S+        LP+IN++SI P+ DKSS Y+ F S++WIVVSV  YP+DS
Sbjct: 58  APDSAAFLCFNSRPKPSQLTHLPIINFDSIHPLVDKSSSYASFSSDRWIVVSVSSYPSDS 117

Query: 121 LKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGY 180
           L+KL K +GWQVLA+GNSRTP +W+LKG I+LSL+ Q++LGFRV+DFL YDSY RK+ GY
Sbjct: 118 LRKLAKTRGWQVLAVGNSRTPSDWSLKGVIYLSLEEQSSLGFRVVDFLSYDSYARKTVGY 177

Query: 181 LFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPYV 240
           LFAIQHGAK IFDADDRG+VI  DLGKHFD++L      QE IL++  ENPN+T+VNPY+
Sbjct: 178 LFAIQHGAKMIFDADDRGEVIDGDLGKHFDLKLSNVDTLQERILEFDFENPNKTVVNPYI 237

Query: 241 HFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLE 300
           HFGQRSVWPRGLPLENVG++ +EE Y++VFGG QFIQQGISNGLPDVDSVFYFTRK S +
Sbjct: 238 HFGQRSVWPRGLPLENVGDVLYEEHYSQVFGGMQFIQQGISNGLPDVDSVFYFTRKTSSQ 297

Query: 301 AFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRL 360
           AFDIRFDD  PKVA+P G+MVP+NSFNT++ +SA WALMLPVSVSTMA D+LRG+W QRL
Sbjct: 298 AFDIRFDDHAPKVAIPHGVMVPLNSFNTLFHNSALWALMLPVSVSTMACDILRGYWAQRL 357

Query: 361 LWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLEL 420
           LWE+GG+V VYPPT+ RYD IE YPFSEEKDLHVNVGRL+KFL SW SNK  FFEKV+EL
Sbjct: 358 LWELGGFVAVYPPTMFRYDDIEGYPFSEEKDLHVNVGRLVKFLSSWTSNKATFFEKVMEL 417

Query: 421 SHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVPRKL 480
           S+SM EEGFW E DVK   AWLQDLI+VGY QPR+   E+ + R     GD + FVP+KL
Sbjct: 418 SNSMEEEGFWKENDVKLIGAWLQDLISVGYIQPRMKGFEMKKQRKR-RIGDGRSFVPKKL 476

Query: 481 PSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFC-SGPVERTALEWRLLYGRIFKTVI 539
           P  HLGVEE+ TV++EIG LIRWRK FGNVV+++F  +G VERTA++W+LLYGRIFKTV+
Sbjct: 477 PGFHLGVEESETVNFEIGKLIRWRKKFGNVVMVLFVENGDVERTAMKWKLLYGRIFKTVV 536

Query: 540 ILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLW 599
           +++E   EDL VE   LE +Y++LP +F R+ +AEGFLFLQD+TILNYWNLLQADK+KLW
Sbjct: 537 VVAEHGREDLGVEEASLEFIYKYLPMVFERFPNAEGFLFLQDNTILNYWNLLQADKDKLW 596

Query: 600 ITDKV 604
           IT KV
Sbjct: 597 ITYKV 601


>gi|218196734|gb|EEC79161.1| hypothetical protein OsI_19835 [Oryza sativa Indica Group]
          Length = 647

 Score =  768 bits (1983), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/488 (72%), Positives = 414/488 (84%)

Query: 119 DSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSC 178
           D +     + GWQ+LA+GNS TP  W LKGAIFLSL++QA LG+R +DFLPY S+VRK+ 
Sbjct: 5   DRVSPHAAVAGWQLLAVGNSHTPSGWELKGAIFLSLELQAQLGYRSVDFLPYASHVRKTA 64

Query: 179 GYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIVNP 238
           GYLFAIQHGAK IFDADDR +V G+DLGKHFDV+L         +LQYSH +PNRT+VNP
Sbjct: 65  GYLFAIQHGAKVIFDADDRAEVPGNDLGKHFDVDLGSGVTNHPVLLQYSHADPNRTVVNP 124

Query: 239 YVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPS 298
           YVHFGQRSVWPRGLPL+ VGE++HE FYTEVF G+Q+IQQG+S+GLPDVD+VFYFTRKP 
Sbjct: 125 YVHFGQRSVWPRGLPLDKVGEVAHEVFYTEVFSGQQYIQQGLSDGLPDVDAVFYFTRKPP 184

Query: 299 LEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQ 358
             AFD+RFD   PKVALPQG M PVNSFNT++ + AFW LM+PVSVS+MASDV+RG+W Q
Sbjct: 185 TAAFDLRFDAEAPKVALPQGTMAPVNSFNTLFHTPAFWGLMMPVSVSSMASDVIRGYWAQ 244

Query: 359 RLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVL 418
           R+LWEIGGYV  YPPT++R D I+AYPF+EEKDLHVNVGRLIKFL  WRSNK   FE++L
Sbjct: 245 RILWEIGGYVAFYPPTIYRKDHIQAYPFAEEKDLHVNVGRLIKFLNEWRSNKRTLFERIL 304

Query: 419 ELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVPR 478
           +LS++MAEEGFWTE+DV+ TAAWLQDL+AVGY+QPRLMSLE+DR RA+IG GD KEFVP+
Sbjct: 305 DLSYAMAEEGFWTEQDVRLTAAWLQDLLAVGYRQPRLMSLEIDRQRATIGEGDMKEFVPK 364

Query: 479 KLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTV 538
           KLPSVHLGV+E GTV+YEIGNLI+WRKNFGNVVLIM  SGPV+RTALEWRLLYGRIFKTV
Sbjct: 365 KLPSVHLGVDEIGTVNYEIGNLIKWRKNFGNVVLIMHVSGPVDRTALEWRLLYGRIFKTV 424

Query: 539 IILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKL 598
           IIL+EQ N +LAVE   L   Y+ LPK+F+RY  A+GFLFLQD  ILNYWNLLQADK KL
Sbjct: 425 IILAEQSNTELAVERCALSHAYKFLPKVFARYGGADGFLFLQDHMILNYWNLLQADKEKL 484

Query: 599 WITDKVLY 606
           WIT+K+ +
Sbjct: 485 WITNKIAH 492


>gi|147787473|emb|CAN62330.1| hypothetical protein VITISV_029810 [Vitis vinifera]
          Length = 690

 Score =  731 bits (1886), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/602 (61%), Positives = 437/602 (72%), Gaps = 76/602 (12%)

Query: 4   QDRTLPKSPKSQIRTSSHRFSDSKSLDFSTWVRDNLFKIVTVLLLIATIAALSFLRNFTD 63
           +DR + K  K+Q       F      +FSTWV  N  KI+ + LLI T+A + F+RN  D
Sbjct: 8   KDRKIIKPSKTQSTKPQEHF------NFSTWVSSNFPKIIVISLLIVTVAVVFFVRN--D 59

Query: 64  TASLIQS-KSQEHSPNAIPLPVINWNSIQPIADKSSVYSRFRSEKWIVVSVDRYPTDSLK 122
             S++ S KS+  S   I  P I+++SI P +DKSS ++ FRSE+WIVVSV  YP+DSL+
Sbjct: 60  AVSILYSGKSRSKSLKPIQFPKISFSSIPPNSDKSSPFATFRSERWIVVSVSNYPSDSLR 119

Query: 123 KLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLF 182
            LVKIKGWQVLA+GNSRTP NW LKGAIFLSL+ QA L FR+L++LPYDSYVRKS GYLF
Sbjct: 120 SLVKIKGWQVLAVGNSRTPANWELKGAIFLSLEQQAKLEFRILEYLPYDSYVRKSVGYLF 179

Query: 183 AIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPYVHF 242
           AIQHGAK IFDADDRG+VI  ++GK FD++L G  A QE ILQY+ ENPNRT+VNPY+HF
Sbjct: 180 AIQHGAKMIFDADDRGEVIDWEVGKRFDLDLFGVDAMQERILQYNRENPNRTVVNPYIHF 239

Query: 243 GQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLEAF 302
           GQRSVWPRGLPLENVGEI HEE+Y EVFGG QFIQQGISNGLPDVDSVFY TRK   EAF
Sbjct: 240 GQRSVWPRGLPLENVGEIVHEEYYNEVFGGMQFIQQGISNGLPDVDSVFYLTRKLDSEAF 299

Query: 303 DIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLW 362
           D+ FD+   KVALPQG+MVP+NSFNT++ S+AFW LMLPVSVS+MASDVLRG+W QRLLW
Sbjct: 300 DMSFDEHALKVALPQGVMVPLNSFNTLFHSNAFWGLMLPVSVSSMASDVLRGYWAQRLLW 359

Query: 363 EIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSH 422
           E+GG+VVVYPPT++R D+IEAYPFSEEKDLH                             
Sbjct: 360 EVGGFVVVYPPTIYRKDEIEAYPFSEEKDLH----------------------------- 390

Query: 423 SMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVPRKLPS 482
                                DL+AVGYQQPRLM+LELDRPRAS G  DRKEF+PRKLPS
Sbjct: 391 ---------------------DLLAVGYQQPRLMALELDRPRASSGDADRKEFIPRKLPS 429

Query: 483 VHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTVIILS 542
           VHL VEE+G V+YEIGNLI               SG       +WRLLYGRIFKTV+ILS
Sbjct: 430 VHLAVEESGAVNYEIGNLI---------------SG--THCLWKWRLLYGRIFKTVVILS 472

Query: 543 EQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLWITD 602
            + + DLAVE    +QVY++LPKIF R++SAEGFLFLQD+TILNYWNL+Q DK KLWITD
Sbjct: 473 AKSDVDLAVEEAHPDQVYKYLPKIFERFSSAEGFLFLQDNTILNYWNLMQGDKTKLWITD 532

Query: 603 KV 604
           KV
Sbjct: 533 KV 534


>gi|414887928|tpg|DAA63942.1| TPA: hypothetical protein ZEAMMB73_890297 [Zea mays]
          Length = 736

 Score =  700 bits (1807), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 329/566 (58%), Positives = 417/566 (73%), Gaps = 4/566 (0%)

Query: 41  KIVTVLLLIATIAALSFLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQPI-ADKSSV 99
           ++V VLL     A    L  +   +      +   +P  +P P + W+ + P+ A  +S 
Sbjct: 16  RVVYVLLAALATAPFLLLLLYGGASPSALCPTSYRTPRRLPYPSVLWSKVPPLPALSTSP 75

Query: 100 YSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQAN 159
           +   R  +WIV +   +     + L  + GWQVLA+ +  TP +W+  GA+ L+L  Q  
Sbjct: 76  HPDLRGSRWIVFTASPH-APRHRPLRAVPGWQVLAVADEATPADWSHPGAVLLTLTDQGR 134

Query: 160 LGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGAR 219
           LGFR + FLP     RK+  YLFA+Q GA+ I+DAD R  V+G +L +HFDV+L  +   
Sbjct: 135 LGFRSVAFLPARGPARKAAAYLFAVQRGARVIYDADARNAVLGGNLTRHFDVDL-DQRHG 193

Query: 220 QETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQG 279
              +LQYSH NPNRTIVNP+VHFGQ S+WPRGLPLE  GE+  EEFYTE+F G QF+QQG
Sbjct: 194 GAVLLQYSHANPNRTIVNPFVHFGQPSIWPRGLPLEKAGELDAEEFYTEIFSGGQFMQQG 253

Query: 280 ISNGLPDVDSVFYFTRKP-SLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWAL 338
           + NGLPDVDSVFYFTRK   +EAFD +FD   PKVALPQGMM PVNS NT++ S AFW L
Sbjct: 254 MCNGLPDVDSVFYFTRKSLEMEAFDFQFDADAPKVALPQGMMTPVNSVNTLFHSPAFWGL 313

Query: 339 MLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGR 398
            LPVSVS MASDV+RG+W QR+LWEIGGY+VVYPPTVHR D +  +PF EEKDLHVNVG+
Sbjct: 314 ALPVSVSPMASDVIRGYWAQRILWEIGGYLVVYPPTVHRIDNVRGHPFDEEKDLHVNVGK 373

Query: 399 LIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSL 458
           LIKFL+ WRS+K   FE++L+LS++M EEGFW+E+D+ F  AWLQDL+A+GY+QPRLMSL
Sbjct: 374 LIKFLMEWRSSKLTLFERILDLSYAMTEEGFWSEKDLHFMTAWLQDLVAIGYRQPRLMSL 433

Query: 459 ELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSG 518
           E+DRPRA+IGHGD+K F P+KLPSVHLGVEE G VS +IGNLI+WRK+FG++VLI+ C+ 
Sbjct: 434 EIDRPRATIGHGDKKGFGPKKLPSVHLGVEEIGEVSTDIGNLIKWRKHFGDIVLIVHCTE 493

Query: 519 PVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLF 578
            V+RTALEWRLLYGRIF+ V++LSEQ N DLAVE   L Q Y++LPK+F R+  A+GFLF
Sbjct: 494 LVDRTALEWRLLYGRIFRAVVVLSEQSNSDLAVEFSNLTQAYKYLPKVFDRFGGAQGFLF 553

Query: 579 LQDDTILNYWNLLQADKNKLWITDKV 604
           LQD  + NYWNLL ADK KLWIT++V
Sbjct: 554 LQDRMVFNYWNLLNADKAKLWITNQV 579


>gi|326497711|dbj|BAK05945.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 597

 Score =  698 bits (1801), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/536 (61%), Positives = 407/536 (75%), Gaps = 3/536 (0%)

Query: 76  SPNAIPLPVINWNSIQPIADKSSVYSR-FRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLA 134
           +P  IP P + W+ + P+    S      R+  W+V S   +     + L    GWQ+LA
Sbjct: 50  APRRIPYPSVLWSHVPPLPGLPSSPLPDLRASHWVVFSASPH-HQRHRPLAAAPGWQLLA 108

Query: 135 IGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDA 194
           + +  TP  W+  GA  L+L  QA LGFR ++FLP   + RK+  YLFA+Q GA+ ++ A
Sbjct: 109 VADEATPPGWSHPGAALLTLADQARLGFRSVEFLPARGHARKAAAYLFAVQRGARVVYGA 168

Query: 195 DDRGDVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPL 254
           D R  V G++L +HFDV+L         +LQYSH +PNRT+VNPYVHFGQ SVWPRGLPL
Sbjct: 169 DARNAVAGNNLTRHFDVDLDQRQGGGSVLLQYSHADPNRTVVNPYVHFGQPSVWPRGLPL 228

Query: 255 ENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKP-SLEAFDIRFDDRVPKV 313
           +  GE+  EEFYTEV+GG QFIQQG+ NGLPDVD+VFY TRK   +EAFD+ FD   PKV
Sbjct: 229 DKAGEVGAEEFYTEVYGGGQFIQQGLCNGLPDVDAVFYLTRKSLEMEAFDVHFDADAPKV 288

Query: 314 ALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPP 373
           ALPQG+M PVNS NT++ + AFW L LPVSVS MASDV+RG+W QR+LWEIGG +VVYPP
Sbjct: 289 ALPQGVMAPVNSLNTMFHAPAFWGLALPVSVSPMASDVIRGYWAQRILWEIGGQLVVYPP 348

Query: 374 TVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTER 433
           TVHR D + A+PF +EKD+HVNVGRLI FL+ WRS K   FE++L+LS++MAEEGFW E+
Sbjct: 349 TVHRTDNVHAHPFDDEKDIHVNVGRLINFLMEWRSTKPTLFERILDLSYAMAEEGFWWEK 408

Query: 434 DVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTV 493
           D+ F AAWLQDL+AVGY+QPRLMSLE+DRPRA+IGHGD++EFVP+KLPSVHLGVEE G V
Sbjct: 409 DLHFMAAWLQDLVAVGYRQPRLMSLEIDRPRAAIGHGDKQEFVPKKLPSVHLGVEEIGEV 468

Query: 494 SYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEA 553
           S EIGNLI+WRK+FG+VVLI+ C+GPV+RTALEWRLLYGRIF+ V+ILSEQ N DLAVE+
Sbjct: 469 STEIGNLIKWRKHFGDVVLIVHCTGPVDRTALEWRLLYGRIFRAVVILSEQGNSDLAVES 528

Query: 554 GQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLWITDKVLYLQL 609
                 Y++LPK+F R+  AEGF+FLQD  +LNYWNLL ADK+KLWIT+KV  L L
Sbjct: 529 SNFAHAYKYLPKVFDRFAGAEGFVFLQDYMVLNYWNLLDADKSKLWITNKVPTLLL 584


>gi|226500386|ref|NP_001143150.1| uncharacterized protein LOC100275631 [Zea mays]
 gi|195615080|gb|ACG29370.1| hypothetical protein [Zea mays]
          Length = 736

 Score =  697 bits (1800), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/566 (57%), Positives = 417/566 (73%), Gaps = 4/566 (0%)

Query: 41  KIVTVLLLIATIAALSFLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQPI-ADKSSV 99
           ++V V+L     A    L  ++  +      +   +P  +P P + W+ + P+ A  +S 
Sbjct: 16  RVVYVILAALATAPFLLLLLYSGASRSALCPTSYRAPRRLPYPSVLWSKVPPLPALSTSP 75

Query: 100 YSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQAN 159
           +   R  +WIV +   +     + L  + GWQVLA+ +  TP +W+  GA+ L+L  Q  
Sbjct: 76  HPDLRGSRWIVFTASPH-APRHRPLRAVPGWQVLAVADEATPADWSHPGAVLLTLTDQGR 134

Query: 160 LGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGAR 219
           LGF  + FLP     RK+  YLFA+Q GA+ I+DAD R  V+G +L +HFDV+L  +   
Sbjct: 135 LGFSSVAFLPARGPARKAAAYLFAVQRGARVIYDADARNAVLGGNLTRHFDVDL-DQRHG 193

Query: 220 QETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQG 279
              +LQYSH NPNRTIVNP+VHFGQ S+WPRGLPLE  GE+  EEFYTE+F G QF+QQG
Sbjct: 194 GAVLLQYSHANPNRTIVNPFVHFGQPSIWPRGLPLEKAGELDAEEFYTEIFSGGQFMQQG 253

Query: 280 ISNGLPDVDSVFYFTRKP-SLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWAL 338
           + NGLPDVDSVFYFTRK   +EAFD +FD   PKVALPQGMM PVNS NT++ S AFW L
Sbjct: 254 MCNGLPDVDSVFYFTRKSLEMEAFDFQFDADAPKVALPQGMMTPVNSVNTLFHSPAFWGL 313

Query: 339 MLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGR 398
            LPVSVS MASDV+RG+W QR+LWEIGGY+VVYPPTVHR D +  +PF EEKDLHVNVG+
Sbjct: 314 ALPVSVSPMASDVIRGYWAQRILWEIGGYLVVYPPTVHRIDNVRGHPFDEEKDLHVNVGK 373

Query: 399 LIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSL 458
           LIKFL+ WRS+K   FE++L+LS++M EEGFW+E+D+ F  AWLQDL+A+GY+QPRLMSL
Sbjct: 374 LIKFLMEWRSSKLTLFERILDLSYAMTEEGFWSEKDLHFMTAWLQDLVAIGYRQPRLMSL 433

Query: 459 ELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSG 518
           E+DRPRA+IGHGD+K F P+KLPSVHLGVEE G VS +IGNLI+WRK+FG++VLI+ C+ 
Sbjct: 434 EIDRPRATIGHGDKKGFGPKKLPSVHLGVEEIGEVSTDIGNLIKWRKHFGDIVLIVHCTE 493

Query: 519 PVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLF 578
            V+RTALEWRLLYGRIF+ V++LSEQ N DLAVE   L Q Y++LPK+F R+  A+GFLF
Sbjct: 494 LVDRTALEWRLLYGRIFRAVVVLSEQSNSDLAVEFSNLTQAYKYLPKVFDRFGGAQGFLF 553

Query: 579 LQDDTILNYWNLLQADKNKLWITDKV 604
           LQD  + NYWNLL ADK KLWIT++V
Sbjct: 554 LQDRMVFNYWNLLNADKAKLWITNQV 579


>gi|242046792|ref|XP_002461142.1| hypothetical protein SORBIDRAFT_02g041570 [Sorghum bicolor]
 gi|241924519|gb|EER97663.1| hypothetical protein SORBIDRAFT_02g041570 [Sorghum bicolor]
          Length = 736

 Score =  695 bits (1794), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 329/566 (58%), Positives = 416/566 (73%), Gaps = 4/566 (0%)

Query: 41  KIVTVLLLIATIAALSFLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQPIADK-SSV 99
           ++V VLL     A    L  +   +      +   +P  +P P + W+ + P+    SS 
Sbjct: 16  RVVYVLLAALATAPFLLLLLYGGASPSALCPAAYRAPRRLPYPSVLWSKVPPLPVLLSSP 75

Query: 100 YSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQAN 159
           +   R  +WIV     +     + L  + GWQ+LA+ +  TP +W+  GA+ L+L  Q +
Sbjct: 76  HPDLRGSRWIVFIASPH-APRHRPLRAVPGWQLLAVADEATPADWSHPGAVLLTLADQDH 134

Query: 160 LGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGAR 219
           LGFR + FLP     RK+  YLFA+Q GA+ I+DAD R  V+G +L  HFDV+L  +   
Sbjct: 135 LGFRSVAFLPARGPARKAAAYLFAVQRGARVIYDADVRNAVLGGNLTSHFDVDL-DQRQG 193

Query: 220 QETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQG 279
              +LQYSH +PNRT+VNP+VHFGQ SVWPRGLPLE  GE+  EEFYTE+F G QF+QQG
Sbjct: 194 GAVLLQYSHADPNRTVVNPFVHFGQPSVWPRGLPLEKAGELDAEEFYTEIFSGGQFMQQG 253

Query: 280 ISNGLPDVDSVFYFTRKP-SLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWAL 338
           + NGLPDVDSVFYFTRK   +EAFD RFD   PKVALPQG M PVNS NT++ S AFW L
Sbjct: 254 MCNGLPDVDSVFYFTRKSLEMEAFDFRFDADAPKVALPQGTMTPVNSVNTLFHSPAFWGL 313

Query: 339 MLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGR 398
            LPVSVS MASDV+RG+W QR+LWEIGGY+VVYPPTVHR D + A+ F EEKDLHVNVG+
Sbjct: 314 ALPVSVSPMASDVIRGYWAQRILWEIGGYLVVYPPTVHRIDNVRAHTFDEEKDLHVNVGK 373

Query: 399 LIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSL 458
           LIKFL+ WRS+K   FE++L+LS++M EEGFW E+D+ F  +WLQDL+A+GY+QPRLMSL
Sbjct: 374 LIKFLMEWRSSKRTLFERILDLSYAMTEEGFWGEKDLHFMTSWLQDLVAIGYRQPRLMSL 433

Query: 459 ELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSG 518
           E+DRPRA+IGHGD+KEF P+KLPSVHLGVEE G VS EIGNLI+WRK+FG++VLI+ C+ 
Sbjct: 434 EIDRPRATIGHGDKKEFAPKKLPSVHLGVEEIGEVSTEIGNLIKWRKHFGDIVLIVHCTE 493

Query: 519 PVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLF 578
            V+RTALEWRLLYGRIF+ V+ILSEQ N DLAVE   L Q Y++LPK+F+R+  A+GFLF
Sbjct: 494 LVDRTALEWRLLYGRIFRAVVILSEQSNSDLAVEFSNLAQAYKYLPKVFARFGGAQGFLF 553

Query: 579 LQDDTILNYWNLLQADKNKLWITDKV 604
           LQD  + NYWNLL ADK+KLWIT++V
Sbjct: 554 LQDHMVFNYWNLLNADKDKLWITNQV 579


>gi|357121675|ref|XP_003562543.1| PREDICTED: uncharacterized protein LOC100832736 [Brachypodium
           distachyon]
          Length = 735

 Score =  692 bits (1787), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 341/568 (60%), Positives = 415/568 (73%), Gaps = 8/568 (1%)

Query: 44  TVLLLIATIAALSFL--RNFTDTASLIQSK-SQEHSPNAIPLPVINWNSIQPIADKSSVY 100
            V LL+A    L FL        ++L +S  S   S   I  P + W+ + P+   S   
Sbjct: 14  AVYLLVAAAPFLLFLLYGGIASPSALCRSSGSALASGRRIAYPTVLWSRVPPLPPPSPSA 73

Query: 101 SRF--RSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQA 158
                R  +WIV S   +     + L    GW +LA+ +  TP  W+  GA  L+L  Q+
Sbjct: 74  PLPSLRGPRWIVFSASAHHARH-RPLAAAPGWNLLAVADEATPPGWSHPGAALLTLADQS 132

Query: 159 NLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGD-DLGKHFDVELVGEG 217
            LGFR + FLP     RK+  YLFA+Q GA+ I+DAD R  V GD +L +HFDV+L    
Sbjct: 133 LLGFRSVAFLPGRGPARKAAAYLFALQRGARVIYDADVRNAVAGDGNLTRHFDVDLDQRQ 192

Query: 218 ARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQ 277
                +LQYSH +PNRT+VNPYVHFGQ SVWPRG+PLE  GE+  EEFYTEVFGG QFIQ
Sbjct: 193 GGGSVLLQYSHADPNRTVVNPYVHFGQPSVWPRGMPLEKAGEVGAEEFYTEVFGGAQFIQ 252

Query: 278 QGISNGLPDVDSVFYFTRKPS-LEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFW 336
           QG+ NGLPDVD+VFYFTRK S +EAFD+RFD   PKVALPQG+M PVNS NT++ S AFW
Sbjct: 253 QGLCNGLPDVDAVFYFTRKSSGMEAFDVRFDADAPKVALPQGVMAPVNSLNTLFHSPAFW 312

Query: 337 ALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNV 396
            L LPVSVS MASDV+RG+W QR+LWEIGG +VVYPPTVHR D + A+PF +EKD+HVNV
Sbjct: 313 GLALPVSVSPMASDVIRGYWAQRILWEIGGQLVVYPPTVHRSDNVHAHPFDDEKDIHVNV 372

Query: 397 GRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLM 456
           GRLI FL+ WRS K   FE++L+LS+ MAEEGFW E+D++F AAWLQDL+AVGY+QPRLM
Sbjct: 373 GRLINFLMEWRSKKQTLFERILDLSYVMAEEGFWGEKDLQFMAAWLQDLVAVGYRQPRLM 432

Query: 457 SLELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFC 516
           SLE+DRPRA IGHGD++EFVP+KLPSVHLG EE G VS EIGNLI+WRK+FG+VVLI+ C
Sbjct: 433 SLEIDRPRAIIGHGDKQEFVPKKLPSVHLGAEEIGEVSTEIGNLIKWRKHFGDVVLIVHC 492

Query: 517 SGPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGF 576
           + PV+RTALEWRLLYGRIF+ V+ILSEQ N DLAVE   L Q Y++LPK+F R+  AEGF
Sbjct: 493 TEPVDRTALEWRLLYGRIFRAVVILSEQGNSDLAVEFSNLAQAYKYLPKVFDRFAGAEGF 552

Query: 577 LFLQDDTILNYWNLLQADKNKLWITDKV 604
           +FLQD  +LNYWNLL ADK+KLWIT KV
Sbjct: 553 VFLQDHMVLNYWNLLDADKSKLWITYKV 580


>gi|125559448|gb|EAZ04984.1| hypothetical protein OsI_27165 [Oryza sativa Indica Group]
          Length = 730

 Score =  684 bits (1764), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 318/506 (62%), Positives = 396/506 (78%), Gaps = 11/506 (2%)

Query: 104 RSEKWIV-VSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGF 162
           R+ +WI+  +   +P    + L  + GWQ+LA+ +  TP +W+  GA  L+L  QA LGF
Sbjct: 74  RASRWIIFAAAAHHPRH--RPLPAVPGWQLLAVADETTPPDWSHPGAALLTLADQARLGF 131

Query: 163 RVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVEL---VGEGAR 219
           R + FLP   + RK+  YLFA+Q GA+ I+DAD R  V+G +L KHFDV+L    G G  
Sbjct: 132 RSVAFLPARGHARKAAAYLFAVQRGARVIYDADARNAVLGSNLTKHFDVDLDHRQGGG-- 189

Query: 220 QETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQG 279
              +LQYSH +PNRT+VNPYVHFGQ SVWPRGLPL   GE+  EEFYT+VFGG QFIQQG
Sbjct: 190 --VLLQYSHADPNRTVVNPYVHFGQPSVWPRGLPLHKAGEVGVEEFYTQVFGGGQFIQQG 247

Query: 280 ISNGLPDVDSVFYFTRKPS-LEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWAL 338
           + NGLPDVD+VFYFTRK S +EAFD+RFD   PKVALPQGMM P+NS NT++ S AFW L
Sbjct: 248 LCNGLPDVDAVFYFTRKSSEMEAFDLRFDADAPKVALPQGMMAPINSVNTLFHSPAFWGL 307

Query: 339 MLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGR 398
            LPVSVS MA+DV+RG+W QR+LWEIGGY+VVYPPTVHR D + A+PF +EKD+HV+VGR
Sbjct: 308 ALPVSVSPMAADVIRGYWSQRILWEIGGYLVVYPPTVHRMDNVHAHPFDDEKDIHVSVGR 367

Query: 399 LIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSL 458
           LI FL+ WRS+K   FE++L+LS++M EEGFW E+D++F +AWLQDL++VGY+QPRLMSL
Sbjct: 368 LIDFLMEWRSHKQTLFERILDLSYAMTEEGFWGEKDLQFMSAWLQDLVSVGYRQPRLMSL 427

Query: 459 ELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSG 518
           E+DRPRA+IGHGD++ FVP+KLP+VHLGVEE G VS EIGNLI+WRK+FG+VVLI+ C+ 
Sbjct: 428 EIDRPRATIGHGDKQVFVPKKLPAVHLGVEEIGEVSTEIGNLIKWRKHFGDVVLIVHCTV 487

Query: 519 PVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLF 578
           PV+R ALEWRLLYGRIF+ V+ILSE+ N DLAVE   L Q Y+ LPK+F R+  A GF+F
Sbjct: 488 PVDRVALEWRLLYGRIFRAVVILSEKSNSDLAVEVSNLAQAYKFLPKVFDRFAGAGGFMF 547

Query: 579 LQDDTILNYWNLLQADKNKLWITDKV 604
           LQD  ILNYWNL   DK KLWIT+KV
Sbjct: 548 LQDHMILNYWNLYDFDKAKLWITNKV 573


>gi|125601360|gb|EAZ40936.1| hypothetical protein OsJ_25418 [Oryza sativa Japonica Group]
          Length = 697

 Score =  680 bits (1755), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/494 (63%), Positives = 389/494 (78%), Gaps = 8/494 (1%)

Query: 115 RYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYV 174
           R  T +  +  +  GWQ+LA+ +  TP +W+  GA  L+L  QA LGFR + FLP   + 
Sbjct: 51  RPTTRATARCPRSPGWQLLAVADETTPPDWSHPGAALLTLADQARLGFRSVAFLPARGHA 110

Query: 175 RKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVEL---VGEGARQETILQYSHENP 231
           RK+  YLFA+Q GA+ I+DAD R  V+G +L KHFDV+L    G G     +LQYSH +P
Sbjct: 111 RKAAAYLFAVQRGARVIYDADARNAVLGSNLTKHFDVDLDHRQGGG----VLLQYSHADP 166

Query: 232 NRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVF 291
           NRT+VNPYVHFGQ SVWPRGLPL   GE+  EEFYT+VFGG QFIQQG+ NGLPDVD+VF
Sbjct: 167 NRTVVNPYVHFGQPSVWPRGLPLHKAGEVGVEEFYTQVFGGGQFIQQGLCNGLPDVDAVF 226

Query: 292 YFTRKPS-LEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASD 350
           YFTRK S +EAFD+RFD   PKVALPQGMM P+NS NT++ S AFW L LPVSVS MA+D
Sbjct: 227 YFTRKSSEMEAFDLRFDADAPKVALPQGMMAPINSVNTLFHSPAFWGLALPVSVSPMAAD 286

Query: 351 VLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNK 410
           V+RG+W QR+LWEIGGY+VVYPPTVHR D + A+PF +EKD+HV+VGRLI FL+ WRS+K
Sbjct: 287 VIRGYWSQRILWEIGGYLVVYPPTVHRMDNVHAHPFDDEKDIHVSVGRLIDFLMEWRSHK 346

Query: 411 HRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHG 470
              FE++L+LS++M EEGFW E+D++F +AWLQDL++VGY+QPRLMSLE+DRPRA+IGHG
Sbjct: 347 QTLFERILDLSYAMTEEGFWGEKDLQFMSAWLQDLVSVGYRQPRLMSLEIDRPRATIGHG 406

Query: 471 DRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLL 530
           D++ FVP+KLP+VHLGVEE G VS EIGNLI+WRK+FG+VVLI+ C+ PV+R ALEWRLL
Sbjct: 407 DKQVFVPKKLPTVHLGVEEIGEVSTEIGNLIKWRKHFGDVVLIVHCTVPVDRVALEWRLL 466

Query: 531 YGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNL 590
           YGRIF+ V+ILSE+ N DLAVE   L Q Y+ LPK+F R+  A GF+FLQD  ILNYWNL
Sbjct: 467 YGRIFRAVVILSEKSNSDLAVEVSNLAQAYKFLPKVFDRFAGAGGFMFLQDHMILNYWNL 526

Query: 591 LQADKNKLWITDKV 604
              DK KLWIT+KV
Sbjct: 527 YDFDKAKLWITNKV 540


>gi|22831270|dbj|BAC16125.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|50510128|dbj|BAD31094.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 729

 Score =  678 bits (1749), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 318/506 (62%), Positives = 395/506 (78%), Gaps = 12/506 (2%)

Query: 104 RSEKWIV-VSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGF 162
           R+ +WI+  +   +P    + L  + GWQ+LA+ +  TP +W+  GA  L+L  QA LGF
Sbjct: 74  RASRWIIFAAAAHHPRH--RPLPAVPGWQLLAVADETTPPDWSHPGAALLTLADQARLGF 131

Query: 163 RVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVEL---VGEGAR 219
           R + FLP   + RK+  YLFA+Q GA+ I+DAD R  V+G +L KHFDV+L    G G  
Sbjct: 132 RSVAFLPARGHARKAAAYLFAVQRGARVIYDADARNAVLGSNLTKHFDVDLDHRQGGG-- 189

Query: 220 QETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQG 279
              +LQYSH +PNRT+VNPYVHFGQ SVWPRGLPL   GE+  EEFYT+VFGG QFIQQG
Sbjct: 190 --VLLQYSHADPNRTVVNPYVHFGQPSVWPRGLPLHKAGEVGVEEFYTQVFGGGQFIQQG 247

Query: 280 ISNGLPDVDSVFYFTRKPS-LEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWAL 338
           + NGLPDVD+VFYFTRK S +EAFD+RFD   PKVALPQGMM P+NS NT++ S AFW L
Sbjct: 248 LCNGLPDVDAVFYFTRKSSEMEAFDLRFDADAPKVALPQGMMAPINSVNTLFHSPAFWGL 307

Query: 339 MLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGR 398
            LPVSVS MA+DV+RG+W QR+LWEIGGY+VVYPPTVHR D + A+PF +EKD+HV+VGR
Sbjct: 308 ALPVSVSPMAADVIRGYWSQRILWEIGGYLVVYPPTVHRMDNVHAHPFDDEKDIHVSVGR 367

Query: 399 LIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSL 458
           LI FL+ WRS+K   FE++L+LS++M EEGFW E+D++F +AWLQDL++VGY+QPRLMSL
Sbjct: 368 LIDFLMEWRSHKQTLFERILDLSYAMTEEGFWGEKDLQFMSAWLQDLVSVGYRQPRLMSL 427

Query: 459 ELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSG 518
           E+DRPRA+IGHGD++ FVP+KLP+VHLGVEE G VS EIGNLI+WRK+FG+VVLI+ C+ 
Sbjct: 428 EIDRPRATIGHGDKQVFVPKKLPTVHLGVEEIGEVSTEIGNLIKWRKHFGDVVLIVHCTV 487

Query: 519 PVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLF 578
           PV+R ALEWRLLYGRIF+ V+ILSE+ N DLAVE   L Q Y  LPK+F R+  A GF+F
Sbjct: 488 PVDRVALEWRLLYGRIFRAVVILSEKSNSDLAVEVSNLAQAY-FLPKVFDRFAGAGGFMF 546

Query: 579 LQDDTILNYWNLLQADKNKLWITDKV 604
           LQD  ILNYWNL   DK KLWIT+KV
Sbjct: 547 LQDHMILNYWNLYDFDKAKLWITNKV 572


>gi|297607739|ref|NP_001060503.2| Os07g0656400 [Oryza sativa Japonica Group]
 gi|255678031|dbj|BAF22417.2| Os07g0656400 [Oryza sativa Japonica Group]
          Length = 530

 Score =  624 bits (1609), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 290/463 (62%), Positives = 364/463 (78%), Gaps = 11/463 (2%)

Query: 104 RSEKWIV-VSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGF 162
           R+ +WI+  +   +P    + L  + GWQ+LA+ +  TP +W+  GA  L+L  QA LGF
Sbjct: 74  RASRWIIFAAAAHHPRH--RPLPAVPGWQLLAVADETTPPDWSHPGAALLTLADQARLGF 131

Query: 163 RVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVEL---VGEGAR 219
           R + FLP   + RK+  YLFA+Q GA+ I+DAD R  V+G +L KHFDV+L    G G  
Sbjct: 132 RSVAFLPARGHARKAAAYLFAVQRGARVIYDADARNAVLGSNLTKHFDVDLDHRQGGG-- 189

Query: 220 QETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQG 279
              +LQYSH +PNRT+VNPYVHFGQ SVWPRGLPL   GE+  EEFYT+VFGG QFIQQG
Sbjct: 190 --VLLQYSHADPNRTVVNPYVHFGQPSVWPRGLPLHKAGEVGVEEFYTQVFGGGQFIQQG 247

Query: 280 ISNGLPDVDSVFYFTRKPS-LEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWAL 338
           + NGLPDVD+VFYFTRK S +EAFD+RFD   PKVALPQGMM P+NS NT++ S AFW L
Sbjct: 248 LCNGLPDVDAVFYFTRKSSEMEAFDLRFDADAPKVALPQGMMAPINSVNTLFHSPAFWGL 307

Query: 339 MLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGR 398
            LPVSVS MA+DV+RG+W QR+LWEIGGY+VVYPPTVHR D + A+PF +EKD+HV+VGR
Sbjct: 308 ALPVSVSPMAADVIRGYWSQRILWEIGGYLVVYPPTVHRMDNVHAHPFDDEKDIHVSVGR 367

Query: 399 LIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSL 458
           LI FL+ WRS+K   FE++L+LS++M EEGFW E+D++F +AWLQDL++VGY+QPRLMSL
Sbjct: 368 LIDFLMEWRSHKQTLFERILDLSYAMTEEGFWGEKDLQFMSAWLQDLVSVGYRQPRLMSL 427

Query: 459 ELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSG 518
           E+DRPRA+IGHGD++ FVP+KLP+VHLGVEE G VS EIGNLI+WRK+FG+VVLI+ C+ 
Sbjct: 428 EIDRPRATIGHGDKQVFVPKKLPTVHLGVEEIGEVSTEIGNLIKWRKHFGDVVLIVHCTV 487

Query: 519 PVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYR 561
           PV+R ALEWRLLYGRIF+ V+ILSE+ N DLAVE   L Q Y+
Sbjct: 488 PVDRVALEWRLLYGRIFRAVVILSEKSNSDLAVEVSNLAQAYK 530


>gi|302819526|ref|XP_002991433.1| hypothetical protein SELMODRAFT_133561 [Selaginella moellendorffii]
 gi|300140826|gb|EFJ07545.1| hypothetical protein SELMODRAFT_133561 [Selaginella moellendorffii]
          Length = 802

 Score =  572 bits (1474), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 277/571 (48%), Positives = 382/571 (66%), Gaps = 8/571 (1%)

Query: 33  TWVRDNLFKIVTVLLLIATIAALSFLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQP 92
           ++V +N  KIV  L +  +      +R+  D A L   +S       IP P ++  +   
Sbjct: 83  SFVVENFPKIVIGLFVFLSAIVFLMVRSRGDNAVLSCIESASRQREEIPYPRVDLEAASA 142

Query: 93  IADKSSVYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFL 152
             DK  +    +SEKWIVV+V   P++ +++L K+ GWQ+LA+GNS+TP  W + GAIFL
Sbjct: 143 KVDKGIL----KSEKWIVVAVSGAPSEEIQQLAKLDGWQLLALGNSQTPTKWEVPGAIFL 198

Query: 153 SLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVE 212
           S D QA L FRV   +  D   RK+ GYLFAIQHGA+KI+DAD++  V G +L K FDVE
Sbjct: 199 SKDAQAGLNFRVQSHIDPDGPARKNVGYLFAIQHGARKIYDADEKIIVRGGNLEKVFDVE 258

Query: 213 LVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGG 272
           L G   R+E + QY     NRTIVNPYVHFGQRS+WPRG P+  VGE S E  Y E+  G
Sbjct: 259 LSGTSGRREPLYQYRMVE-NRTIVNPYVHFGQRSMWPRGFPVRMVGETSLEVAYNEIAPG 317

Query: 273 KQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQS 332
           + FIQQG++NG  DVD++FY+TR+   EA  I FD + P VALPQG M PV+S NT++ S
Sbjct: 318 RHFIQQGLANGFADVDALFYYTRRSEREALSIEFDLQAPPVALPQGTMAPVSSVNTLFHS 377

Query: 333 SAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDL 392
            A W+LM+P  VS+ A+DV+RG+W QRLLWE+GG VVV+PPT HR D+++     +EKDL
Sbjct: 378 PALWSLMIPADVSSRAADVVRGYWAQRLLWEVGGMVVVFPPTAHRVDQLDPILLKDEKDL 437

Query: 393 HVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQ 452
           H  + RLI F+VSWRS+K   F+++L LSHSMAE G+W+ ++V  T AWLQDL++VGY+Q
Sbjct: 438 H-KMERLINFVVSWRSDKRSLFQRILHLSHSMAENGYWSAQNVDLTVAWLQDLVSVGYRQ 496

Query: 453 PRLMSLELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVL 512
           PR+++LEL R    + + +  +FVP  LPSV+LG+ E+  +  E+G+ ++WR+ FGN+VL
Sbjct: 497 PRMLALELGRVDPLLYNSEHVQFVPETLPSVYLGIHESSQLEKEMGDWLKWRRYFGNIVL 556

Query: 513 IMFCSGPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTS 572
           ++ CS     T L WR+ Y R+FK V I S + N  L VE G     Y+ LP++F RY  
Sbjct: 557 VLDCSPDANATVLAWRMFYSRLFKHVEIRSRESNAGLRVEGGNF--TYQSLPEVFDRYPH 614

Query: 573 AEGFLFLQDDTILNYWNLLQADKNKLWITDK 603
           A+G+L+L+DD + NYWN + ++KNKLW   K
Sbjct: 615 ADGYLYLKDDAVFNYWNFVTSNKNKLWSLQK 645


>gi|302813286|ref|XP_002988329.1| hypothetical protein SELMODRAFT_127776 [Selaginella moellendorffii]
 gi|300144061|gb|EFJ10748.1| hypothetical protein SELMODRAFT_127776 [Selaginella moellendorffii]
          Length = 802

 Score =  570 bits (1469), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 276/571 (48%), Positives = 381/571 (66%), Gaps = 8/571 (1%)

Query: 33  TWVRDNLFKIVTVLLLIATIAALSFLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQP 92
           ++V +N  KIV  L +  +      +R+  D A L   +S       IP P ++  +   
Sbjct: 83  SFVVENFPKIVIGLFVFLSAIVFLMVRSRGDNAVLSCIESASRQREEIPYPRVDLEAASA 142

Query: 93  IADKSSVYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFL 152
             DK  +    +SEKWIVV+V   P++ +++L K+ GWQ+LA+GNS+TP  W + GAIFL
Sbjct: 143 KVDKGIL----KSEKWIVVAVSGAPSEEIQQLAKLDGWQLLALGNSQTPTKWEVPGAIFL 198

Query: 153 SLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVE 212
           S D QA L FRV   +  D   RK+ GYLFAIQHGA+KI+DAD+   V G +L K FDVE
Sbjct: 199 SKDAQAGLNFRVQSHIDPDGPARKNVGYLFAIQHGARKIYDADETIIVRGGNLEKVFDVE 258

Query: 213 LVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGG 272
           L G   R+E + QY     NRTIVNPYVHFGQRS+WPRG P+  VGE S E  Y E+  G
Sbjct: 259 LSGTSGRREPLYQYRMVE-NRTIVNPYVHFGQRSMWPRGFPVRMVGETSLEVAYNEIAPG 317

Query: 273 KQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQS 332
           + FIQQG++NG  DVD++FY+TR+   EA  I FD + P VALPQG M PV+S NT++ S
Sbjct: 318 RHFIQQGLANGFADVDALFYYTRRSEREALSIEFDLQAPPVALPQGTMAPVSSVNTLFHS 377

Query: 333 SAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDL 392
            A W+LM+P  VS+ A+DV+RG+W QRLLWE+GG +VV+PPT HR D+++     +EKDL
Sbjct: 378 PALWSLMIPADVSSRAADVVRGYWAQRLLWEVGGMLVVFPPTAHRVDQLDPILLKDEKDL 437

Query: 393 HVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQ 452
           H  + RLI F+VSWRS+K   F+++L LSHSMAE G+W+ ++V  T AWLQDL++VGY+Q
Sbjct: 438 H-KMERLINFVVSWRSDKRSLFQRILHLSHSMAENGYWSAQNVDLTVAWLQDLVSVGYRQ 496

Query: 453 PRLMSLELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVL 512
           PR+++LEL R    + + +  +FVP  LPSV+LG+ E+  +  E+G+ ++WR+ FGN+VL
Sbjct: 497 PRMLALELGRVDPLLYNSEHVQFVPETLPSVYLGIHESSQLEKEMGDWLKWRRYFGNIVL 556

Query: 513 IMFCSGPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTS 572
           ++ CS     T L WR+ Y R+FK V I S + N  L VE G     Y+ LP++F RY  
Sbjct: 557 VLDCSPDANATVLAWRMFYSRLFKHVEIRSRESNAGLRVEGGNF--TYQSLPEVFDRYPH 614

Query: 573 AEGFLFLQDDTILNYWNLLQADKNKLWITDK 603
           A+G+L+L+DD + NYWN + ++KNKLW   K
Sbjct: 615 ADGYLYLKDDAVFNYWNFVTSNKNKLWSLQK 645


>gi|168023027|ref|XP_001764040.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162684779|gb|EDQ71179.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 732

 Score =  569 bits (1466), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 270/573 (47%), Positives = 383/573 (66%), Gaps = 9/573 (1%)

Query: 31  FSTWVRDNLFKIVTVLLLIATIAALSFLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSI 90
              W+ +NL K+V V+ +  +   L  + N+ + ++LI +++       IP P +N N I
Sbjct: 2   LGAWLMENLSKVVIVVFVFLSALVLIVMLNYGEQSALICAEAVAEELQRIPYPDLNLNHI 61

Query: 91  QPIADKSSVYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAI 150
            P   K   Y+  R++KWIVV+    PT  ++ L ++ GWQVLA+    TP +W + G I
Sbjct: 62  TPRVHKGR-YAAMRTDKWIVVAALGAPTSHIQALTRVSGWQVLAVAGEDTPADWKVAGVI 120

Query: 151 FLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFD 210
           FLS+D QA L +R+   LPY++Y+RK+ GYLFAIQHGAK I+DADD+  VIGDDL   FD
Sbjct: 121 FLSMDDQAALSYRISAHLPYNNYLRKNIGYLFAIQHGAKIIYDADDKESVIGDDLESKFD 180

Query: 211 VELVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVF 270
           V L G  AR+  ILQ+    PNRT+VNP+VHFGQ++VWPRG PLE V +I+ +  Y EVF
Sbjct: 181 VYLQGRRARRGPILQF-RTLPNRTMVNPFVHFGQKTVWPRGYPLEFVQQIAPDISYNEVF 239

Query: 271 GGKQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIY 330
            GKQFIQQG++NGLPDVDS+FY TR+      +I FD   P V+LP G M P N+FNT++
Sbjct: 240 PGKQFIQQGLANGLPDVDSIFYNTRRSHDGNININFDVNAPPVSLPHGTMAPCNAFNTLF 299

Query: 331 QSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEK 390
            S+AFW L+LPV++S   +D++RG+W QR++WE+GG +VVYPPTV R D      F +EK
Sbjct: 300 HSAAFWGLLLPVTLSPKTADIVRGYWAQRIVWEVGGMMVVYPPTVVREDSGMPLSFLDEK 359

Query: 391 DLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGY 450
           DLH    RL++FLV WRS+K   F++++ L+H+MA EGFW  +DV+  A WL+DL++VG+
Sbjct: 360 DLHAESRRLVEFLVKWRSSKTTLFDRIIHLTHTMAFEGFWGAQDVELAADWLKDLLSVGW 419

Query: 451 QQPRL----MSLELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKN 506
           +QPRL    + +++D    S+ H   K+FVP   P+VHLGVE+   ++ E  + + WRK 
Sbjct: 420 RQPRLVGSDLDVQIDDSTPSLAH---KQFVPLSYPTVHLGVEDCTALTEEFVDFLTWRKF 476

Query: 507 FGNVVLIMFCSGPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKI 566
           +GN+VL++ CS P+  T L WRLLYGR+FK V++LS++    L V A      Y   PKI
Sbjct: 477 YGNMVLVLECSWPLNHTVLAWRLLYGRLFKHVVVLSQENEPGLGVRASDWWLSYSMFPKI 536

Query: 567 FSRYTSAEGFLFLQDDTILNYWNLLQADKNKLW 599
           F +Y +A+GF+ +++  + NYWNL  A+K  LW
Sbjct: 537 FEKYPTADGFVVMREAVVFNYWNLASANKTNLW 569


>gi|168012400|ref|XP_001758890.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690027|gb|EDQ76396.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 706

 Score =  558 bits (1437), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 264/524 (50%), Positives = 360/524 (68%), Gaps = 9/524 (1%)

Query: 80  IPLPVINWNSIQPIADKSSVYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSR 139
           IP P +N   I    DK   Y+  R++KWIVV+    PT  ++ L ++ GWQVLA+    
Sbjct: 23  IPYPELNLKHIPAQVDKGR-YTAVRTDKWIVVAALGAPTAHIQALTRVSGWQVLAVAGEN 81

Query: 140 TPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGD 199
           TP +W + GAIFLS+D QA LG+R+   LP  +Y+RK+ GYLFAIQHGA+ IFDAD++  
Sbjct: 82  TPADWKVAGAIFLSMDDQAALGYRISAHLPDSNYLRKNIGYLFAIQHGAQVIFDADEKES 141

Query: 200 VIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGE 259
           VIG+DL   FDV L G  AR++ ILQ+    PNRT+VNP++HFGQ+SVWPRG PLE V E
Sbjct: 142 VIGEDLDSKFDVYLQGRRARRDPILQF-RTLPNRTVVNPFIHFGQKSVWPRGYPLEFVEE 200

Query: 260 ISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGM 319
           I+ +  Y+EVF GKQFIQQG++NGLPD+DS+FY TR+       I FD   P VALP G 
Sbjct: 201 IAPDISYSEVFPGKQFIQQGLANGLPDIDSIFYNTRRSRNGHISINFDTNAPPVALPHGT 260

Query: 320 MVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYD 379
           M P N+FNT++ S+AFW LMLPV++S  A+D++RG+W QR+LWE+GG +V+YPPTV R D
Sbjct: 261 MAPCNAFNTLFHSAAFWGLMLPVTLSPKAADIVRGYWAQRILWEVGGIMVIYPPTVVRED 320

Query: 380 KIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTA 439
                 F +EKDL+    RL++FLV WRS K   F++++ L+HSMAEEGFW   DVK T 
Sbjct: 321 SGMPLSFVDEKDLYAESRRLVEFLVKWRSTKPTLFDRIIHLTHSMAEEGFWGAIDVKLTV 380

Query: 440 AWLQDLIAVGYQQPRLMSLELDR----PRASIGHGDRKEFVPRKLPSVHLGVEETGTVSY 495
            WL DL++VG++QPRL+  +LD        S+ H   K+FVPR  P+VHLGVE+   ++ 
Sbjct: 381 DWLTDLLSVGWRQPRLVGSDLDALIDDSAPSLAH---KQFVPRSFPTVHLGVEDGTALTE 437

Query: 496 EIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQ 555
           E  + + WRK +GN+VL++ C+ P+  T L WRLLYGR+FK V++LS++    L V A  
Sbjct: 438 EFADFLTWRKFYGNMVLVLECAWPLNHTVLSWRLLYGRLFKHVVVLSQENEPGLGVHASD 497

Query: 556 LEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLW 599
           L   Y  LPKIF +Y +A+GF+ +++  + NYW +  A+K K+W
Sbjct: 498 LWISYSMLPKIFEKYPAADGFVVMKEAVVFNYWKIASANKTKIW 541


>gi|302793811|ref|XP_002978670.1| hypothetical protein SELMODRAFT_152766 [Selaginella moellendorffii]
 gi|300153479|gb|EFJ20117.1| hypothetical protein SELMODRAFT_152766 [Selaginella moellendorffii]
          Length = 624

 Score =  503 bits (1296), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 243/482 (50%), Positives = 333/482 (69%), Gaps = 15/482 (3%)

Query: 124 LVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFA 183
           +VK++GWQVLAIG++ TP +W + GAIFLS D+Q    +R+   LPY+SYVRKS GYLFA
Sbjct: 1   MVKLQGWQVLAIGDTDTPADWLVPGAIFLSTDLQTTFRYRITSLLPYNSYVRKSIGYLFA 60

Query: 184 IQHGAKKIFDADDRGDVI-GDDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPYVHF 242
           IQHGA +I+DAD     + G  LGK FD+EL    + ++T+LQY  +N  RT+VNP++HF
Sbjct: 61  IQHGAVRIYDADTHSTFLAGGHLGKSFDIEL----SPRKTLLQYKAKN--RTLVNPFIHF 114

Query: 243 GQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLEAF 302
           GQRSVWPRGL L +V +I+ E +Y EV GG QFIQQG  NGLPDVDS+FY TR+ + E  
Sbjct: 115 GQRSVWPRGLSLTSVPDIAPEFYYDEVSGGNQFIQQGTGNGLPDVDSIFYHTRRLAGEPI 174

Query: 303 DIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLW 362
           +I FD   P+VALP+G M PVNS NT++   AFWA+MLPV+V T  SDV+RG+W QRLLW
Sbjct: 175 NIEFDHLAPEVALPRGTMAPVNSLNTLFHEQAFWAMMLPVTVHTAVSDVIRGYWAQRLLW 234

Query: 363 EIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSH 422
           ++GG V  YPP+VHR D +E   F +E+DL  + G+LI FL SW S+K  FFE+VL+LS+
Sbjct: 235 DVGGIVAFYPPSVHRLDTLEGSTFGDEEDLLHDWGQLIDFLKSWHSSKSTFFERVLDLSY 294

Query: 423 SMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVPRKLPS 482
            MA+ GFW+ +D+  T AWLQDL++VGY+ P+L      +P  +   G R +F+P+ L  
Sbjct: 295 EMAKNGFWSGQDLALTVAWLQDLVSVGYKPPKL------QPVENTIKGSR-QFLPQILKP 347

Query: 483 VHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTVIILS 542
           V+ GV +  +V  E+G+L++WR +  N+VLI+ C  P       WR+LYGR+FK V+++S
Sbjct: 348 VYSGVTDAVSVEKEMGHLLKWRSSSANMVLILECDWPRRSNIPVWRMLYGRLFKHVVVIS 407

Query: 543 EQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLWITD 602
            + +  L ++ G   Q Y   P IF +Y  A GFL+L+D  + NYWN +Q +  KLW   
Sbjct: 408 TEADSTLGIDVGGGWQAYSSFPGIFDKYPEAAGFLYLKDHVVFNYWN-MQGNNKKLWTMH 466

Query: 603 KV 604
           +V
Sbjct: 467 EV 468


>gi|302805705|ref|XP_002984603.1| hypothetical protein SELMODRAFT_234595 [Selaginella moellendorffii]
 gi|300147585|gb|EFJ14248.1| hypothetical protein SELMODRAFT_234595 [Selaginella moellendorffii]
          Length = 624

 Score =  500 bits (1287), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 242/482 (50%), Positives = 332/482 (68%), Gaps = 15/482 (3%)

Query: 124 LVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFA 183
           +VK++GWQVLAIG++ TP +W + GAIFLS D+Q    +R+   LPY+SYVRKS GYLFA
Sbjct: 1   MVKLQGWQVLAIGDTDTPADWLVPGAIFLSTDLQTTFRYRITSLLPYNSYVRKSIGYLFA 60

Query: 184 IQHGAKKIFDADDRGDVI-GDDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPYVHF 242
           IQHGA +I+DAD     + G  LGK FD+EL    + ++T+LQY  +N  RT+VNP++HF
Sbjct: 61  IQHGAVRIYDADTHSTFLAGGHLGKSFDIEL----SPRKTLLQYKAKN--RTLVNPFIHF 114

Query: 243 GQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLEAF 302
           GQRSVWPRGL L +V +I+ E +Y EV GG QFIQQG  NGLPDVDS+FY TR+ + E  
Sbjct: 115 GQRSVWPRGLSLTSVPDIAPEFYYDEVSGGNQFIQQGTGNGLPDVDSIFYHTRRLAGEPI 174

Query: 303 DIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLW 362
           +I FD   P+VALP+G M PVNS NT++   AFWA+MLPV+V T  SDV+RG+W QRLLW
Sbjct: 175 NIEFDHLAPEVALPRGTMAPVNSLNTLFHEQAFWAMMLPVTVHTAVSDVIRGYWAQRLLW 234

Query: 363 EIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSH 422
           ++GG V  YPP+VHR D +E   F +E+DL  + G+LI FL SW S+K  FFE+VL+LS+
Sbjct: 235 DVGGIVAFYPPSVHRLDTLEGSTFGDEEDLLHDWGQLIDFLKSWHSSKSTFFERVLDLSY 294

Query: 423 SMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVPRKLPS 482
            MA+ GFW+ +D+  T AWLQDL++VGY+ P+L      +P  +   G R +F+P+ L  
Sbjct: 295 EMAKNGFWSGQDLALTVAWLQDLVSVGYKPPKL------QPVENTIKGSR-QFLPQILKP 347

Query: 483 VHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTVIILS 542
           V+ GV +  +V  E+G+L++WR +  N+VLI+ C          WR+LYGR+FK V+++S
Sbjct: 348 VYPGVTDAVSVEKEMGHLLKWRSSSANMVLILECDWARRSNIPVWRMLYGRLFKHVVVIS 407

Query: 543 EQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLWITD 602
            + +  L ++ G   Q Y   P IF +Y  A GFL+L+D  + NYWN +Q +  KLW   
Sbjct: 408 TEADSTLGIDVGGGWQAYSSFPGIFDKYPEAAGFLYLKDHVVFNYWN-MQGNNKKLWTMH 466

Query: 603 KV 604
           +V
Sbjct: 467 EV 468


>gi|297736303|emb|CBI24941.3| unnamed protein product [Vitis vinifera]
          Length = 441

 Score =  494 bits (1271), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 226/285 (79%), Positives = 261/285 (91%)

Query: 320 MVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYD 379
           MVP+NSFNT++ S+AFW LMLPVSVS+MASDVLRG+W QRLLWE+GG+VVVYPPT++R D
Sbjct: 1   MVPLNSFNTLFHSNAFWGLMLPVSVSSMASDVLRGYWAQRLLWEVGGFVVVYPPTIYRKD 60

Query: 380 KIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTA 439
           +IEAYPFSEEKDLHVNVGRLIK+LVSWRS +HR FEK++ELS+S+A+EGFWTERDVKFT 
Sbjct: 61  EIEAYPFSEEKDLHVNVGRLIKYLVSWRSGRHRLFEKIMELSYSLAKEGFWTERDVKFTG 120

Query: 440 AWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGN 499
           AWLQDL+AVGYQQPRLM+LELDRPRAS G  DRKEF+PRKLPSVHL VEE+G V+YEIGN
Sbjct: 121 AWLQDLLAVGYQQPRLMALELDRPRASSGDADRKEFIPRKLPSVHLAVEESGAVNYEIGN 180

Query: 500 LIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQV 559
           LIRWRK+F NVVLI+F SGPVERTALEWRLLYGRIFKTV+ILS + + DLAVE    +QV
Sbjct: 181 LIRWRKSFSNVVLILFVSGPVERTALEWRLLYGRIFKTVVILSAKSDVDLAVEEAHPDQV 240

Query: 560 YRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLWITDKV 604
           Y++LPKIF R++SAEGFLFLQD+TILNYWNL+Q DK KLWITDKV
Sbjct: 241 YKYLPKIFERFSSAEGFLFLQDNTILNYWNLMQGDKTKLWITDKV 285


>gi|297604448|ref|NP_001055443.2| Os05g0391200 [Oryza sativa Japonica Group]
 gi|54287510|gb|AAV31254.1| unknown protein [Oryza sativa Japonica Group]
 gi|222631476|gb|EEE63608.1| hypothetical protein OsJ_18425 [Oryza sativa Japonica Group]
 gi|255676335|dbj|BAF17357.2| Os05g0391200 [Oryza sativa Japonica Group]
          Length = 442

 Score =  467 bits (1201), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 214/287 (74%), Positives = 251/287 (87%)

Query: 320 MVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYD 379
           M PVNSFNT++ + AFW LM+PVSVS+MASDV+RG+W QR+LWEIGGYV  YPPT++R D
Sbjct: 1   MAPVNSFNTLFHTPAFWGLMMPVSVSSMASDVIRGYWAQRILWEIGGYVAFYPPTIYRKD 60

Query: 380 KIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTA 439
            I+AYPF+EEKDLHVNVGRLIKFL  WRSNK   FE++L+LS++MAEEGFWTE+DV+ TA
Sbjct: 61  HIQAYPFAEEKDLHVNVGRLIKFLNEWRSNKRTLFERILDLSYAMAEEGFWTEQDVRLTA 120

Query: 440 AWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGN 499
           AWLQDL+AVGY+QPRLMSLE+DR RA+IG GD KEFVP+KLPSVHLGV+E GTV+YEIGN
Sbjct: 121 AWLQDLLAVGYRQPRLMSLEIDRQRATIGEGDMKEFVPKKLPSVHLGVDEIGTVNYEIGN 180

Query: 500 LIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQV 559
           LI+WRKNFGNVVLIM  SGPV+RTALEWRLLYGRIFKTVIIL+EQ N +LAVE   L   
Sbjct: 181 LIKWRKNFGNVVLIMHVSGPVDRTALEWRLLYGRIFKTVIILAEQSNTELAVERCALSHA 240

Query: 560 YRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLWITDKVLY 606
           Y+ LPK+F+RY  A+GFLFLQD  ILNYWNLLQADK KLWIT+K+ +
Sbjct: 241 YKFLPKVFARYGGADGFLFLQDHMILNYWNLLQADKEKLWITNKIAH 287


>gi|326528461|dbj|BAJ93374.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 395

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 178/240 (74%), Positives = 206/240 (85%)

Query: 367 YVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAE 426
           YV  YPPT++R D ++AYPF+EEKDLHVNVGRLIKFL  WRSNK   FEK+L+LS++MAE
Sbjct: 1   YVAFYPPTIYRKDHVQAYPFAEEKDLHVNVGRLIKFLNEWRSNKQSLFEKILDLSYAMAE 60

Query: 427 EGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVPRKLPSVHLG 486
           EGFW E+DV+ TAAWLQDL+A GY+QPRLMSLE+DR RA+IG GD KEFVP+KLPSVHLG
Sbjct: 61  EGFWMEQDVRLTAAWLQDLLAAGYRQPRLMSLEIDRQRATIGEGDMKEFVPKKLPSVHLG 120

Query: 487 VEETGTVSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTVIILSEQKN 546
           V+E GTV+YEIGNLI+WRKNFGNVVLIM  SGPV+R ALEWRLLYGRIFKTVIIL+EQ N
Sbjct: 121 VDEIGTVNYEIGNLIKWRKNFGNVVLIMHVSGPVDRVALEWRLLYGRIFKTVIILAEQSN 180

Query: 547 EDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLWITDKVLY 606
            +LAVE   L   Y++LPK+F RY  A+GFLFLQD  ILNYWNLLQADK KLWITDK+ +
Sbjct: 181 AELAVERCALSHAYKYLPKVFGRYGGADGFLFLQDHMILNYWNLLQADKEKLWITDKIAH 240


>gi|297824119|ref|XP_002879942.1| hypothetical protein ARALYDRAFT_903497 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297325781|gb|EFH56201.1| hypothetical protein ARALYDRAFT_903497 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 231

 Score =  349 bits (895), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 173/252 (68%), Positives = 193/252 (76%), Gaps = 42/252 (16%)

Query: 339 MLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGR 398
           MLPVSVS+MASDVLRG WGQRLLWE+GGYV VYPPT HR+D+                  
Sbjct: 1   MLPVSVSSMASDVLRGCWGQRLLWELGGYVAVYPPTAHRFDR------------------ 42

Query: 399 LIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSL 458
             +   SW                      F TE+D+KFTAAWLQDLIAVGYQQPRLMSL
Sbjct: 43  --RGGSSW----------------------FSTEQDLKFTAAWLQDLIAVGYQQPRLMSL 78

Query: 459 ELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSG 518
           ELDRPRAS GHGDR+EFVPR LPSVHLGVEETGTVS EIGNLIRWRKNFGNV+L++FC+G
Sbjct: 79  ELDRPRASFGHGDRREFVPRNLPSVHLGVEETGTVSTEIGNLIRWRKNFGNVLLVVFCNG 138

Query: 519 PVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLF 578
           PVERTALEWRLLYGRIFKTV+ILS QKN DL VE  +L+ +Y+HLPKIF RY+SAEGFLF
Sbjct: 139 PVERTALEWRLLYGRIFKTVVILSSQKNSDLYVEEAKLDHIYKHLPKIFDRYSSAEGFLF 198

Query: 579 LQDDTILNYWNL 590
           ++DDTILNYWNL
Sbjct: 199 VEDDTILNYWNL 210


>gi|326499041|dbj|BAK06011.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 371

 Score =  339 bits (869), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 153/214 (71%), Positives = 186/214 (86%)

Query: 391 DLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGY 450
           D+HVNVGRLI FL+ WRS K   FE++L+LS++MAEEGFW E+D+ F AAWLQDL+AVGY
Sbjct: 1   DIHVNVGRLINFLMEWRSTKPTLFERILDLSYAMAEEGFWWEKDLHFMAAWLQDLVAVGY 60

Query: 451 QQPRLMSLELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNV 510
           +QPRLMSLE+DRPRA+IGHGD++EFVP+KLPSVHLGVEE G VS EIGNLI+WRK+FG+V
Sbjct: 61  RQPRLMSLEIDRPRAAIGHGDKQEFVPKKLPSVHLGVEEIGEVSTEIGNLIKWRKHFGDV 120

Query: 511 VLIMFCSGPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRY 570
           VLI+ C+GPV+RTALEWRLLYGRIF+ V+ILSEQ N DLAVE+      Y++LPK+F R+
Sbjct: 121 VLIVHCTGPVDRTALEWRLLYGRIFRAVVILSEQGNSDLAVESSNFAHAYKYLPKVFDRF 180

Query: 571 TSAEGFLFLQDDTILNYWNLLQADKNKLWITDKV 604
             AEGF+FLQD  +LNYWNLL ADK+KLWIT+KV
Sbjct: 181 AGAEGFVFLQDYMVLNYWNLLDADKSKLWITNKV 214


>gi|156367414|ref|XP_001627412.1| predicted protein [Nematostella vectensis]
 gi|156214321|gb|EDO35312.1| predicted protein [Nematostella vectensis]
          Length = 450

 Score =  319 bits (817), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 163/427 (38%), Positives = 241/427 (56%), Gaps = 23/427 (5%)

Query: 35  VRDNLFKIVTVLLLIATIAALSFLRNFTDTASLIQSKSQEHSPNAIPLPV--INWNSIQ- 91
           +R +L + V    ++  +    +    +   S   S +Q      IP     +NW S++ 
Sbjct: 6   MRLSLTRTVIFTAIVLQVFVFYYFYTCSKHVSSNDSNAQWKRVKRIPRQATEVNWESVKR 65

Query: 92  -PIADKSSVYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAI 150
                K  +Y     +KWI+++    PT+ +KKL  I+GW+V+ +G+++TP +W+L   +
Sbjct: 66  NQAPPKDEMY-----DKWIIITTINEPTEDVKKLASIEGWKVVVVGDTKTPSDWSLPNCV 120

Query: 151 FLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFD 210
           FLS++ Q  LG+R++D LPY SY RK+ GYL+AI HGAK I++ DD        +   F 
Sbjct: 121 FLSVEKQKTLGYRIVDLLPYKSYARKNLGYLYAIHHGAKYIYETDDDNSPTSGQIT--FY 178

Query: 211 VELVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVF 270
            +  GE     T         NR  VNPY HFGQ ++WPRG PLEN+  + +E  + +  
Sbjct: 179 EQTTGEFYVYAT---------NRLTVNPYAHFGQVTIWPRGYPLENIS-LPNENTFHKCN 228

Query: 271 GGKQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIY 330
             +  IQQG+ +G PDVD++F  TRK +    D++FD   P V LP   M P NS NT +
Sbjct: 229 NVEPTIQQGVVDGDPDVDAIFRLTRKDADVRIDVKFDSSAPAVLLPPHTMAPFNSQNTFF 288

Query: 331 QSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAY--PFSE 388
               FW L++PV+ +    D+ RG+W QRLLWE+ GY+  +PP   +Y     +   F E
Sbjct: 289 MHKGFWGLLIPVTPTFRVCDIWRGYWAQRLLWEVNGYLSFFPPNAKQYRSAHNFLLDFIE 348

Query: 389 EKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAV 448
           EK+L+   G+L+KFL  W+S K  FF ++L+LS +MAE  FW   D   T AWL DLI+V
Sbjct: 349 EKELYHKSGKLVKFLTDWKSEKDHFFSRILDLSIAMAEAEFWGTEDALLTEAWLHDLISV 408

Query: 449 GYQQPRL 455
           GY+ PRL
Sbjct: 409 GYEPPRL 415


>gi|156379603|ref|XP_001631546.1| predicted protein [Nematostella vectensis]
 gi|156218588|gb|EDO39483.1| predicted protein [Nematostella vectensis]
          Length = 475

 Score =  308 bits (789), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 150/375 (40%), Positives = 226/375 (60%), Gaps = 21/375 (5%)

Query: 85  INWNSIQ--PIADKSSVYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPK 142
           ++W+SI+  P  +KS +     ++KW+V++   YPTD +KKL K+ GW+V+ +G+++TP 
Sbjct: 81  LDWSSIKMKPKPEKSEM-----NDKWVVITTINYPTDDVKKLAKMDGWKVVVVGDTKTPS 135

Query: 143 NWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIG 202
           +W+    +FLS+  Q  LG+R+ D LPY SY RK+ GYL+AIQHGAK I+D DD      
Sbjct: 136 DWSHPNCVFLSVKRQKELGYRIADLLPYKSYARKNIGYLYAIQHGAKYIYDTDDDNHPTS 195

Query: 203 DDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISH 262
             L  H       +  + E  +  +  N    +VNPY +FGQR++WPRG PL+N+     
Sbjct: 196 GKLEFH-------DKEKGEYYIYKTSAN----VVNPYANFGQRTIWPRGYPLQNISAPMV 244

Query: 263 EEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVP 322
           + F  +    +  IQQG+ +G PDVD++F  TRK      D++FD + P + LP G M P
Sbjct: 245 KTF-VKCKNVQTSIQQGVVDGDPDVDAIFRLTRKDENVRLDVKFDPKAPPILLPPGTMAP 303

Query: 323 VNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIE 382
            NS NT +     WAL++P++      D+ RG+WGQRLLWEIGG++  +PP   +Y    
Sbjct: 304 FNSQNTFFLDKGLWALLIPITTKFRVCDIWRGYWGQRLLWEIGGHLSFFPPNAMQYRSAH 363

Query: 383 AY--PFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAA 440
            Y   F +E DL+ + GRL++FL  W+S +  FF + L+L+ SM +  F   +D   T A
Sbjct: 364 DYHLDFVDEVDLYNDAGRLVEFLREWKSPRKDFFSRALDLTVSMVDNRFMFPKDAILTEA 423

Query: 441 WLQDLIAVGYQQPRL 455
           WL DL+++GY+ P L
Sbjct: 424 WLYDLVSIGYKVPSL 438


>gi|443718134|gb|ELU08880.1| hypothetical protein CAPTEDRAFT_206067 [Capitella teleta]
          Length = 796

 Score =  287 bits (734), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 176/498 (35%), Positives = 264/498 (53%), Gaps = 48/498 (9%)

Query: 85  INWNSIQPIADKSSVY--SRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPK 142
           I+W+ + P A+ + V   +  + +KWI+V+  + PT ++KK+ +I  W +L + + +TPK
Sbjct: 89  IDWDFVAP-AEPAKVQRNAELKHDKWIIVTTVQKPTSAMKKMAQIPNWLLLVVADGKTPK 147

Query: 143 NWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIG 202
            W+L GAI L +  Q  L ++V  +LPYDSY RK  GYL+AI+HGAK I++ DD     G
Sbjct: 148 TWSLPGAILLDVKSQKELHYQVHSYLPYDSYTRKVIGYLYAIEHGAKYIYETDDDNFPEG 207

Query: 203 DDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISH 262
           D       +        Q  +L    +N   T  NP VHFGQ ++WPRG PL+ +G  S 
Sbjct: 208 D-------LTQFQTSMGQSELLLVETKN---TTYNPSVHFGQGTMWPRGFPLDEIGYPSS 257

Query: 263 EEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVP 322
            + Y+        IQQG+ NG PDVD++F  TRK  LE  D++FD+  P V LP G   P
Sbjct: 258 RD-YSLCQMNVPSIQQGLVNGDPDVDALFRLTRKHGLEDLDVKFDNAAPPVVLPHGTYSP 316

Query: 323 VNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPP-TVHRYDK- 380
            NS NT++ + AFWAL+LPVSVS  A D+ R +W Q L+W +G  V  Y P +V + +  
Sbjct: 317 FNSQNTLFTAKAFWALVLPVSVSMRACDIYRSYWAQTLMWTLGDNVGFYAPNSVQKRNPH 376

Query: 381 ---IEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKF 437
              ++AY   EE +L+ ++G  + F+  W+ +K  FF+ V +L+H + E GF+  RD   
Sbjct: 377 SHIMDAY---EETELYHHMGAYVYFMKKWKCDKVFFFDCVSQLTHDLVERGFFVRRDADL 433

Query: 438 TAAWLQDLIAVGYQQPRL----MSLELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTV 493
           T AW+ DL  +GY  P +     S   + P   +   D +E V   LP     +  T  V
Sbjct: 434 TDAWITDLATIGYAPPIMRSDTKSCHTNDPLHVVFFPDEQETV---LPHSSRKMIPTDLV 490

Query: 494 SYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWR--LLYGRIF-KTVIILSEQKNEDLA 550
           +++          + N  L   C       A  W   +  G  F + V+++S Q N +  
Sbjct: 491 NHQ----------YVNKYLTETCGFAY---AFHWHNIMHEGHKFDREVLVISLQNNPEDI 537

Query: 551 VEAGQLEQVYR-HLPKIF 567
           + +  LE  YR H P I 
Sbjct: 538 IPS--LEATYRPHFPHIL 553


>gi|443698351|gb|ELT98389.1| hypothetical protein CAPTEDRAFT_204971 [Capitella teleta]
          Length = 725

 Score =  277 bits (709), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 169/495 (34%), Positives = 260/495 (52%), Gaps = 42/495 (8%)

Query: 85  INWNSIQPIADKSSVY--SRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPK 142
           I+W  + P A+ + V      + + WI+V+  + PT +++K+V+I  WQ+L + + +TP+
Sbjct: 21  IDWGFVAP-AEPAKVQRNPELKHDNWIIVTTVQKPTSAMEKIVQIPNWQLLVVADKKTPE 79

Query: 143 NWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIG 202
            W+L GAIFL +  Q  L ++V  +LPY+SY RK  GYL+AI+HGAK I++ DD      
Sbjct: 80  TWSLPGAIFLDIRSQKELQYKVHSYLPYNSYSRKVMGYLYAIEHGAKYIYETDD------ 133

Query: 203 DDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISH 262
           D+  +    +      + E +L    E  N T  NP VHFGQ ++WPRG PL+ +G  S 
Sbjct: 134 DNFPEENLTQFQTSIGQSELLLV---ETKNAT-YNPLVHFGQGTMWPRGFPLDEIGYPSS 189

Query: 263 EEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVP 322
            + Y+        IQQG+ NG PDVD++F  TRK  LE  D++FD+  P V LP G   P
Sbjct: 190 RD-YSLCQMNVPSIQQGLVNGDPDVDALFRLTRKHGLEDLDVKFDNAAPPVVLPHGTYSP 248

Query: 323 VNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIE 382
            NS NT++ + AFWAL+LPVSV+    D+ R +W Q L+W +G  +  Y P   +     
Sbjct: 249 FNSQNTLFTAKAFWALVLPVSVTMRECDIYRSYWAQTLMWTLGDNLGFYAPNAVQRRNSH 308

Query: 383 AY--PFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAA 440
           +Y     EE +L+ ++G ++ F+  W+ +K  FF+ V +L+H + E GF+  RD     A
Sbjct: 309 SYIKDAIEETELYHHMGEIMYFMKEWKCDKVFFFDCVSQLTHGLVERGFFVRRDADLIDA 368

Query: 441 WLQDLIAVGYQQPRL----MSLELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYE 496
           W+ DL  +GY  P +     S   + P   +   D +E V   LP     +  T  V+++
Sbjct: 369 WITDLATIGYAPPIMRSDTKSCHTNDPLHVVFFPDEQETV---LPHSSRKMIPTDLVNHQ 425

Query: 497 IGNLIRWRKNFGNVVLIMFCSGPVERTALEWR--LLYGRIF-KTVIILSEQKNEDLAVEA 553
                     + N  L   C       A  W   +  G  F + V+++S Q N +  + +
Sbjct: 426 ----------YVNKYLTETCGFAY---AFHWHNIMHEGHKFDREVLVISLQNNPEDIIPS 472

Query: 554 GQLEQVYR-HLPKIF 567
             LE  YR H P I 
Sbjct: 473 --LEATYRPHFPHIL 485


>gi|156405128|ref|XP_001640584.1| predicted protein [Nematostella vectensis]
 gi|156227719|gb|EDO48521.1| predicted protein [Nematostella vectensis]
          Length = 463

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 140/354 (39%), Positives = 203/354 (57%), Gaps = 17/354 (4%)

Query: 104 RSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFR 163
           R ++W+V++    PT + K+L  ++GW+ + IG+ +TP +W+    I+L LD Q +LG+ 
Sbjct: 105 RHDRWVVLTTVHEPTIAAKRLAGLEGWRTVVIGDEKTPPDWSHSNVIYLDLDKQKSLGYE 164

Query: 164 VLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETI 223
           + + +P + Y RK+ GYL+AIQHGA  I+DAD    ++ + LG H D     E  R+  +
Sbjct: 165 ISNHIPKNHYSRKNIGYLYAIQHGANIIYDADTNTQLLRNKLGFHLD-----EDPRK--L 217

Query: 224 LQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNG 283
           L YS    N  I+NPY HFGQ +VWPRG PLE +G      F     G    IQQ +SNG
Sbjct: 218 LVYS---TNHNIINPYPHFGQSTVWPRGYPLEMIGAPPQHTFVL-CEGINPGIQQALSNG 273

Query: 284 LPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVS 343
             DVDS+F  TR     A +I F+ +  KVA+P G     N+ NT+Y     W L+LPVS
Sbjct: 274 ASDVDSIFKLTRNNHNTALNITFNGKSEKVAIPHGAFSVFNAQNTLYHHDVLWGLLLPVS 333

Query: 344 VSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHR----YDKIEAYPFSEEKDLHVNVGRL 399
           V +  +DV R +W QRL+W++G  +V +PP  HR    +D +      EE+ L+ N G  
Sbjct: 334 VQSRVTDVWRSYWAQRLIWQVGRSLVFHPPNSHRSQLPHDNLRT--LREEQMLYYNTGEY 391

Query: 400 IKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQP 453
           ++ L+ W S K   F++VL+L   +  +  W+ RD     AWL DLI VGY  P
Sbjct: 392 LESLLEWSSTKSAVFDQVLDLGIFLTRKKLWSVRDAHLLEAWLHDLIRVGYIPP 445


>gi|384244611|gb|EIE18111.1| hypothetical protein COCSUDRAFT_49414 [Coccomyxa subellipsoidea
           C-169]
          Length = 708

 Score =  241 bits (615), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 165/544 (30%), Positives = 262/544 (48%), Gaps = 78/544 (14%)

Query: 79  AIPLPVINWNSIQPIADKSSVYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNS 138
           A+ L V +  ++  I+    + +R +   W+V++   YPTD++K+L K  GW+V+ + + 
Sbjct: 11  AVLLTVADAKNLHVISSAPGL-ARDKHSNWVVITTINYPTDTVKRLAKAPGWRVVVVADQ 69

Query: 139 RTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRG 198
           +TP++W L     L L+ Q  L ++VL  LPY+ Y RK+ GYL+AIQHGA ++++ DD  
Sbjct: 70  KTPRDWQLYNVDILDLEKQKELDYKVLALLPYNHYGRKNLGYLWAIQHGATQVYETDDDN 129

Query: 199 DVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENV- 257
           ++  D            E      +  Y ++     + NPY  FGQ  +WPRG PLE++ 
Sbjct: 130 ELKLD------------EPPALSGLSYYVYDASGVEVCNPYAFFGQPQIWPRGYPLEHIK 177

Query: 258 GEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQ 317
           G  S   F  +    +  I QG+++  PDVD+++  T     +   I FD  VP V  P 
Sbjct: 178 GAPSCTNFTRQP--AQPLILQGLADMDPDVDAIYRLT-----QPLGIAFDSNVPLVVFPH 230

Query: 318 GMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHR 377
           G+M P NS NT++   A W L++PV+ +    D+ RG+W QRLLWEI G +   PPTV++
Sbjct: 231 GVMAPFNSQNTLFARDALWGLLIPVTTTFRVCDIWRGYWVQRLLWEIDGNLAFGPPTVNQ 290

Query: 378 YDKIEA--YPFSEEKDLHVNVGRLIKFLVSWR-SNKHRFFEKVLELSHSMAEEGFWTERD 434
           +       +  +EE DL+   G L+K L SWR ++  +  + + +L+  MA+ GFW    
Sbjct: 291 FRNPHNLLHDMAEEADLYAKAGDLVKLLSSWRGASVKKLPDLIGDLAQIMADSGFWE--- 347

Query: 435 VKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVS 494
                   QDL AVGY+ P+                 R++  P +  SV +    T    
Sbjct: 348 --------QDLKAVGYRFPK-----------------RQKRRPAEPASVEVARAHTPA-- 380

Query: 495 YEIGNLIRWRKNFGNVVLIMF---CSGPVERTALEWRLLYGRIFKTVIIL-----SEQKN 546
                   WR+    ++++ F     G ++   L  R  Y  IF  +I       SE   
Sbjct: 381 -------SWRRYDAIILVVNFNKAYDGMLKVLEL-LREAYQPIFSRIIFTGGTRPSEFPG 432

Query: 547 EDLAVEA----GQLEQVYRHLPKIFSRYTSAE--GFLFLQDDTILNYWNLLQADKNKLWI 600
           E+  VE     G + Q    L  +     +    G+L L DD I+++  L   D  K+W 
Sbjct: 433 EERWVECDGSGGSMMQ--SCLANVMQEVEAPHGGGYLMLGDDVIISHCQLAAFDPKKVWF 490

Query: 601 TDKV 604
              V
Sbjct: 491 QRAV 494


>gi|183178941|gb|ACC43950.1| unknown [Philodina roseola]
          Length = 664

 Score =  234 bits (597), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 139/427 (32%), Positives = 224/427 (52%), Gaps = 29/427 (6%)

Query: 46  LLLIATIAALSFLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQPIADKS--SVYSRF 103
            ++IAT+  LS         S++  KS     N  P P      ++P   K+        
Sbjct: 9   FIIIATMMILSLFVFMIFYHSVLMPKSFSRIINGSPSP------LRPAERKNLKPFSCPI 62

Query: 104 RSEKWIVVSVDRYPTDSLKKLVKIKG-WQVLAIGNSRTPKNW----NLKGA--IFLSLDM 156
           R ++WI+V+   YPT S+ K + +   W ++ + + +TPK+W    ++K +  IFLS++ 
Sbjct: 63  RGDRWIIVTSIFYPTPSIYKFLNLTTEWNLIVVADRKTPKDWLEYLSIKTSRLIFLSVEE 122

Query: 157 QANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGE 216
           Q  L FR++DFLP+ SY RK+ GYL AIQ GAK +F++DD      D+L +  D+  + +
Sbjct: 123 QKTLNFRIIDFLPFGSYARKNLGYLIAIQCGAKIVFESDD------DNLLETDDIFHLPK 176

Query: 217 GARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYT----EVFGG 272
             R   +   S        VN Y  FG   +WPRG P++ +  ++ + +++    ++   
Sbjct: 177 IVRPNDVPWISFHRQRSPFVNIYGSFGHSQIWPRGFPVDELRNVTEDGWHSVRRNDIEEM 236

Query: 273 KQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQS 332
             +IQQ +++  PDVD++F  T    L    + FD   P +A+ Q    P N+ NTI   
Sbjct: 237 PAYIQQYLADLDPDVDALFRLTH--PLSVGRVHFDRTQPPIAIDQSTFSPYNTQNTITHY 294

Query: 333 SAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAY--PFSEEK 390
            AFW L LPV+ +    D+ RGFW QRLLW+IGGY++    TV +     +Y     EE 
Sbjct: 295 EAFWGLYLPVTTTFRVCDIWRGFWVQRLLWDIGGYLIFGTATVRQIRNSHSYLKDMQEED 354

Query: 391 DLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGY 450
            L+   G  ++FL SW S +     ++  L+  +A+ GFW   ++    AWL DL++VGY
Sbjct: 355 QLYHQSGSFVRFLASWTSPERTLIRRIALLARDIAQAGFWHSNEINIIDAWLNDLLSVGY 414

Query: 451 QQPRLMS 457
           + P ++S
Sbjct: 415 KFPSIVS 421


>gi|183178953|gb|ACC43961.1| unknown [Philodina roseola]
          Length = 665

 Score =  232 bits (591), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 138/428 (32%), Positives = 224/428 (52%), Gaps = 30/428 (7%)

Query: 46  LLLIATIAALSFLRNFTDTASLIQSKSQEHSPNAIPLPVI---NWNSIQPIADKSSVYSR 102
            ++IAT+  LS         S++  KS     N  P P +      +++P +        
Sbjct: 9   FIIIATMMILSLFVFMIFYHSVLIPKSFSRIINGSPSPSLAEAERKNLKPFS------CP 62

Query: 103 FRSEKWIVVSVDRYPTDSLKKLVKIKG-WQVLAIGNSRTPKNW----NLKGA--IFLSLD 155
            R ++WI+V+   YPT S+ K + +   W ++ + + +TPK+W    ++K +  IFLS++
Sbjct: 63  IRGDRWIIVTSIFYPTPSIYKFLNLTTEWNLIVVADRKTPKDWLEHLSIKTSRLIFLSVE 122

Query: 156 MQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVG 215
            Q  L FR++DFLP+ SY RK+ GYL AIQ GA  +F++DD      D+L +  D+  + 
Sbjct: 123 EQKTLNFRIIDFLPFGSYARKNLGYLIAIQCGANIVFESDD------DNLLETDDIFHLP 176

Query: 216 EGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQ- 274
           +  R   +   S        VN Y  FG   +WPRG P++ +  ++ + +++     K+ 
Sbjct: 177 KIVRPNDVPWISFHRQRSPFVNIYGSFGHSQIWPRGFPVDELRNVTEDGWHSVRRNDKEE 236

Query: 275 ---FIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQ 331
              +IQQ +++  PDVD++F  T    L    + FD   P +A+ Q    P N+ NTI  
Sbjct: 237 MPAYIQQYLADLDPDVDALFRLTHP--LSVGRVHFDRTQPPIAIDQSTFSPYNTQNTITH 294

Query: 332 SSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAY--PFSEE 389
             AFW L LPV+ +    D+ RGFW QRLLW+IGGY++    TV +     +Y     EE
Sbjct: 295 YEAFWGLYLPVTTTFRVCDIWRGFWVQRLLWDIGGYLIFGTATVRQIRNSHSYLKDMQEE 354

Query: 390 KDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVG 449
             L+   G  ++FL SW S +     ++  L+  +A+ GFW   ++    AWL DL++VG
Sbjct: 355 DQLYHQSGSFVRFLASWTSPERTLIRRIALLARDIAQAGFWHSNEIDIIDAWLNDLLSVG 414

Query: 450 YQQPRLMS 457
           Y+ P ++S
Sbjct: 415 YKFPSIIS 422


>gi|358057238|dbj|GAA96847.1| hypothetical protein E5Q_03520 [Mixia osmundae IAM 14324]
          Length = 1148

 Score =  221 bits (563), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 120/356 (33%), Positives = 192/356 (53%), Gaps = 17/356 (4%)

Query: 108 WIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAI-FLSLDMQANLGFRVLD 166
           W+VV+    PT +++ L  +  W+V  + + +TP++W+   A  FLS + Q+ L FRV+ 
Sbjct: 485 WMVVTTVNLPTSTMEALCALDNWEVAVVADLKTPRSWSSGPACHFLSTNYQSRLPFRVVS 544

Query: 167 FLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQY 226
            +PY +Y RKS GYLFAI +GA+ I D DD      D+L     V    +     T L  
Sbjct: 545 RIPYKAYTRKSIGYLFAIANGAELIQDTDD------DNLPNEEIVLQDPDSPEFMTALPS 598

Query: 227 SHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFY-----TEVFGGKQFIQQGIS 281
            +   +R ++NPY HF +  +WPRG PLE     +   +      +E   G+  IQQG++
Sbjct: 599 GNLETSR-VINPYAHFARGDIWPRGFPLEEYDRNATMRYLKASEASENVQGRALIQQGLA 657

Query: 282 NGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLP 341
           +  PDVD++F    +  +    +RF   VP + + +G + P NS NT++   AFW L+LP
Sbjct: 658 DLDPDVDAIFRLLNREDIA--KVRFCKAVPSLKMARGALAPFNSQNTLFHHDAFWGLLLP 715

Query: 342 VSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAY--PFSEEKDLHVNVGRL 399
           ++V+  A+D++RG+W QRLLW++GG +    P+V +      Y    + E+ L+   G L
Sbjct: 716 ITVTFRATDIIRGYWAQRLLWDVGGTLAFREPSVDQIRNAHDYIQDMTSEEKLYTQSGDL 775

Query: 400 IKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRL 455
             FL  W  +      ++L L  +M  E F    DV+   AW+ DL ++GY+ P +
Sbjct: 776 TTFLQDWSDSSLDLPTRLLHLLRAMQSEKFIRGPDVELAKAWVADLRSIGYEFPEI 831


>gi|308510034|ref|XP_003117200.1| hypothetical protein CRE_02033 [Caenorhabditis remanei]
 gi|308242114|gb|EFO86066.1| hypothetical protein CRE_02033 [Caenorhabditis remanei]
          Length = 816

 Score =  221 bits (562), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 150/446 (33%), Positives = 227/446 (50%), Gaps = 37/446 (8%)

Query: 39  LFKIVTVLLLIATIAALSFLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQPIADKSS 98
           L K+  +LLLI   +++ F+  +      I+S     S         N + I P+AD   
Sbjct: 3   LMKLNKILLLIVCSSSV-FITIYWSATHGIRSSRNTRS---------NSDRINPVADVK- 51

Query: 99  VYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQA 158
                +  KWIVV+   YPT+ +K+L   + W ++ + +++TP +W L+   FLS+D Q 
Sbjct: 52  -----KGNKWIVVTSVNYPTEDVKRLSSFEEWNLVVVADTKTPVDWKLETVHFLSVDYQK 106

Query: 159 NLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLG-KHFDVELVGEG 217
            L F ++  LPY SY RK+ GYL+AI  GA+ I+D DD      D LG   FD E    G
Sbjct: 107 QLPFSIVSSLPYKSYTRKNIGYLYAISQGAEWIYDTDDDNKPY-DKLGLNQFDYEDTVSG 165

Query: 218 ARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISH-EEFYTEVFGGKQ-F 275
            R +  ++ S E   R + NPY  FG   +WPRG PLE + + ++ +E     +  K+  
Sbjct: 166 VRYQ--VKNSSEIIQR-LFNPYRFFGVDQMWPRGFPLEYIEKHTNGKENQVLCYKMKRSS 222

Query: 276 IQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAF 335
           +QQG+ +  PDVD+V+      S    D++F+   P +AL  G   P NS NT++  SAF
Sbjct: 223 VQQGLVHHDPDVDAVYRLLNADSNSGLDVKFNKFAPPIALSVGTFSPWNSQNTLFHKSAF 282

Query: 336 WALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSE 388
             L LP +VS   +D+ R F  Q++L  + G  V + PT        H Y K     F +
Sbjct: 283 HTLFLPTTVSFRTTDIWRSFISQKIL-HLSGLTVSFAPTNAIQFRNAHDYLK----DFKD 337

Query: 389 EKDLHVNVGRLIKFLVSWRSNKHRFFEKVL-ELSHSMAEEGFWTERDVKFTAAWLQDLIA 447
           EK ++ + G++I+FL  W+ +K    E  +  LS  + E   W E D K    +L DL  
Sbjct: 338 EKQVYEDSGKIIEFLNDWKCSKDINLEDCINNLSEDLVENNLWGEDDSKLMKLFLDDLKL 397

Query: 448 VGYQQPRLMSLELDRPR-ASIGHGDR 472
           +G++ P LM  E + P  AS    DR
Sbjct: 398 MGFKYPDLMGEEYEDPYIASDNETDR 423


>gi|449678106|ref|XP_004209003.1| PREDICTED: uncharacterized protein LOC100197693 [Hydra
           magnipapillata]
          Length = 373

 Score =  214 bits (546), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 111/318 (34%), Positives = 181/318 (56%), Gaps = 14/318 (4%)

Query: 146 LKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDL 205
           L G I++S++ Q  LG+  ++ L Y +Y RK+ GYL+AIQHGAK I+D DD  D + +  
Sbjct: 34  LDGVIYISVEDQKKLGYETVNLLKYRAYTRKNIGYLYAIQHGAKYIYDTDD--DNVPNTG 91

Query: 206 GKHFDVELVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEF 265
              FD+ L     +++ ++ +S    NRT  N + HFGQ ++WPRG PL  +G++     
Sbjct: 92  KIDFDMTL-----KRKYLVYHS----NRTFYNVFAHFGQSTLWPRGYPLSFIGDLPIRT- 141

Query: 266 YTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNS 325
           Y +    + ++QQG+ NG PD+D++   TRK S   F+I+FD++   V LP     P NS
Sbjct: 142 YRKCLNTEPYVQQGVVNGDPDLDAIQRLTRKDSNVKFNIKFDEKQEPVVLPHKSFTPYNS 201

Query: 326 FNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAY- 384
            NT +  +AFW L+LP + +   +D+ R +  QRLLW+IGG++  Y P  ++      Y 
Sbjct: 202 QNTFHSYNAFWGLLLPQTTAFRVTDIWRSYITQRLLWDIGGHLAYYGPNAYQDRTGHDYL 261

Query: 385 -PFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQ 443
             + +E  L+ +   L  FL+ W+SN +    +  EL   + ++     RDV    AW++
Sbjct: 262 LDYLDESALYNDCLTLTNFLLRWKSNNNSVLTRYFELIKDLYKQKILKIRDVHIAKAWVR 321

Query: 444 DLIAVGYQQPRLMSLELD 461
           DL++ GYQ P +   +++
Sbjct: 322 DLLSFGYQAPNITKTKME 339


>gi|297824117|ref|XP_002879941.1| hypothetical protein ARALYDRAFT_903496 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297325780|gb|EFH56200.1| hypothetical protein ARALYDRAFT_903496 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 122

 Score =  211 bits (536), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 97/118 (82%), Positives = 108/118 (91%)

Query: 166 DFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQ 225
           + LPYDS+VRKS GYLFAIQHGAKKI+DADDRG+VI  DLGKHFDVELVG  ++Q+ ILQ
Sbjct: 5   NHLPYDSFVRKSVGYLFAIQHGAKKIYDADDRGEVIDGDLGKHFDVELVGVDSKQQPILQ 64

Query: 226 YSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNG 283
           YSHEN NRT+VNPY+HFGQ SVWPRGLPLENVGEI+HEE+YTEVFGG QFIQQGISNG
Sbjct: 65  YSHENSNRTVVNPYIHFGQHSVWPRGLPLENVGEINHEEYYTEVFGGTQFIQQGISNG 122


>gi|268555818|ref|XP_002635898.1| Hypothetical protein CBG01120 [Caenorhabditis briggsae]
          Length = 670

 Score =  209 bits (533), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 127/373 (34%), Positives = 194/373 (52%), Gaps = 29/373 (7%)

Query: 107 KWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLD 166
           KWIVV+   YPT+ +K+L  I+ W ++ + +++TP++WNL+G  FLS++ Q NL F ++ 
Sbjct: 53  KWIVVTSVNYPTEDVKRLASIESWNLVVVADTKTPEDWNLEGVHFLSVEFQKNLPFSLIS 112

Query: 167 FLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLG-KHFDVELVGEGARQETILQ 225
            LPY SY RK+ GYL+AI  GA+ I+D DD      D LG   FD +    G R      
Sbjct: 113 SLPYKSYTRKNIGYLYAISQGAEWIYDTDDDNKPY-DKLGLDQFDYDDTVSGVR------ 165

Query: 226 YSHENPNRTI----VNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGK---QFIQQ 278
           Y+ EN    I     NPY + G + +WPRG PLE+    ++ +   ++   K     +QQ
Sbjct: 166 YTVENAKDGIRNRLFNPYRYGGIQQMWPRGFPLEHFENHTNGK-DNQILCQKMSRSAVQQ 224

Query: 279 GISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWAL 338
           G+ +  PDVD+++           D++F+   PK+ L  G   P NS NT++  SAF  L
Sbjct: 225 GLVHHDPDVDAIYRLLNADKSTGLDVKFNKFAPKIILSIGTYSPWNSQNTLFHKSAFHTL 284

Query: 339 MLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSEEKD 391
            LP +VS   +D+ R F  Q++L  + G  V + PT        H Y K     F +EK 
Sbjct: 285 FLPTTVSFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAVQFRNAHDYLK----DFKDEKQ 339

Query: 392 LHVNVGRLIKFLVSWRSNKHRFFEKVL-ELSHSMAEEGFWTERDVKFTAAWLQDLIAVGY 450
           +  + GR +KFL +W  +     E  + +LS  +  E  W E D K    +L D+  +G+
Sbjct: 340 VFEDSGRFLKFLHNWNCSNATVLEDCMKKLSEDLVLEKLWGEEDAKLMGMFLDDMKVMGF 399

Query: 451 QQPRLMSLELDRP 463
           + P L+      P
Sbjct: 400 EFPPLIGESYQDP 412


>gi|384244543|gb|EIE18044.1| hypothetical protein COCSUDRAFT_49421 [Coccomyxa subellipsoidea
           C-169]
          Length = 766

 Score =  206 bits (523), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 146/523 (27%), Positives = 242/523 (46%), Gaps = 52/523 (9%)

Query: 95  DKSSVYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSL 154
           DK    +      W+V++   YPT++++ L     WQV+ + +++TP +W L   + LS+
Sbjct: 43  DKDVKTAENERMNWVVITTINYPTETIRLLASAPDWQVVVVADNKTPVDWALDNVVLLSI 102

Query: 155 DMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELV 214
           + Q +L + ++  LP++ Y RK+ GYL+AI+HGA ++++ DD  ++I  +  K       
Sbjct: 103 EEQESLKYNIMTLLPFNHYGRKNIGYLYAIEHGATQVYETDDDNEIISTNPLK------- 155

Query: 215 GEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQ 274
                    L+Y   N    + NPY +FG  S+WPRGL       +     Y  V     
Sbjct: 156 ---VPSFRALEYFVYN-TTGVCNPYHYFGYPSIWPRGLLSNRYTCVLIVPTYPSVLA--- 208

Query: 275 FIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSA 334
            +QQG++N  PDVD+++  T     +   + F   +P V LP+  + P NS NT++   A
Sbjct: 209 -LQQGLANLDPDVDAIYRLT-----QPLGVHFRADLPAVVLPERTICPWNSQNTLFAKDA 262

Query: 335 FWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAY--PFSEEKDL 392
                  + V+T+     R +W QRLLWEIGG +   PPTV++          F EE  L
Sbjct: 263 LCGHTALLVVATVLFIQCR-YWVQRLLWEIGGNIAFGPPTVNQLRNAHNLMRDFEEENPL 321

Query: 393 HVNVGRLIKFLVSWRSNKHRFFEKVL-ELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQ 451
           +   G L++ L +W +        ++  L+  MA+E  W + DV   AAW+ DL  VGY 
Sbjct: 322 YNQAGALVELLNAWVAPPGSDLPTLMTSLAQKMADEKMWEQGDVDLMAAWVADLKEVGYV 381

Query: 452 QPRLMSLE---LDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFG 508
            PRL + +   +  P   +   D  E V    P V                 + WR+ + 
Sbjct: 382 FPRLRNADERSIQGPDGDLLREDGAEAVAPHDPFVAPRTP------------LHWRR-YD 428

Query: 509 NVVLIMFCSGPVERTALEWRLL---YGRIFKTVII--LSEQKNE-----DLAVEAGQLEQ 558
           N+VLI+  +         ++LL   Y  +F T++     E+  E     +    +G    
Sbjct: 429 NIVLIIMFNTKYPSWLETFQLLKEAYTPMFGTLVFTGFPERPEEVPMGDNFVTCSGTGHL 488

Query: 559 VYRHLPKIFSRYTSAE--GFLFLQDDTILNYWNLLQADKNKLW 599
            Y         + +    G+L L DDTI+N+  +   + +K+W
Sbjct: 489 QYICFANAMQEFAAPANGGYLILGDDTIINHCQMQHFNASKIW 531


>gi|341886795|gb|EGT42730.1| hypothetical protein CAEBREN_01149 [Caenorhabditis brenneri]
          Length = 782

 Score =  204 bits (519), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 141/495 (28%), Positives = 231/495 (46%), Gaps = 55/495 (11%)

Query: 36  RDNLFKIVTVLLLIATIAALSFLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQPIAD 95
           R  L K++  L    TI+ +S L N   +   I S    H+ ++ PL             
Sbjct: 4   RRQLLKVL--LFAFGTISIISLLHNGYSSHIRIVSI---HNNDSTPLK------------ 46

Query: 96  KSSVYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLD 155
                   +  KWIVV+    PT+ +K+L     W ++ + +++TP +W L+   FLS++
Sbjct: 47  --------KGNKWIVVTSISSPTNDVKRLASFDDWNLVVVADTKTPLDWKLENVHFLSVE 98

Query: 156 MQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVG 215
            Q  L F ++  LPY SY RK+ GYL+AI HGA+ I+D DD        L + F  E   
Sbjct: 99  YQNQLPFSLVSSLPYKSYTRKNIGYLYAISHGAEWIYDTDDDNKPFDKGLNQ-FQYEDTV 157

Query: 216 EGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEIS--HEEFYTEVFGGK 273
            G R    +  S +   R + NPY  FG   +WPRG PLE++ + +  H +  +     +
Sbjct: 158 SGVRYR--VNSSEDGILRRLFNPYQFFGVNQMWPRGFPLEHIEKHTNAHGQQVSCYKMKR 215

Query: 274 QFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSS 333
             +QQG+ +  PDVD+++      S    D++F++  P + L  G   P NS NT++  S
Sbjct: 216 AAVQQGLVHHDPDVDAIYRLLNADSKTGLDVKFNEFAPPITLSVGTYSPWNSQNTLFHKS 275

Query: 334 AFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPF 386
           AF  L LP +VS   +D+ R F  Q++L  + G  V + PT        H Y K     F
Sbjct: 276 AFHTLFLPTTVSFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAVQFRNAHDYLK----DF 330

Query: 387 SEEKDLHVNVGRLIKFLVSWRSNK---HRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQ 443
            +EK ++ + G++IKFL  W+ +    +     + EL + +  E  W ++D +    +L 
Sbjct: 331 KDEKQVYEDSGKMIKFLHEWKCSNAISNNLENCIYELMNELVVENLWGKKDSELMKMFLN 390

Query: 444 DLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVPRKL--------PSVHLGVEETGTVSY 495
           DL +VG++ P ++      P +   +   ++   R++        P  H    +   V  
Sbjct: 391 DLKSVGFEFPVMVGESYRDPYSPSTNETSRDVNCRRMNLEFELIDPKEHHRKNKKRAVQK 450

Query: 496 --EIGNLIRWRKNFG 508
               GNL+ W    G
Sbjct: 451 LNYFGNLVEWCNETG 465


>gi|341880723|gb|EGT36658.1| hypothetical protein CAEBREN_29663 [Caenorhabditis brenneri]
          Length = 730

 Score =  202 bits (513), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 142/450 (31%), Positives = 216/450 (48%), Gaps = 29/450 (6%)

Query: 42  IVTVLLLIATIAALSFLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQPIADKSSVYS 101
           I +  L+ A +A L  L        LI  +S     NA PL       I P A      S
Sbjct: 10  IRSFFLISAIVACLLLLYMNNMDDLLIMKRSVRLFVNA-PLET---EDIIPTA------S 59

Query: 102 RFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLG 161
             +  KWIVV+   YPT+ +K+L     W ++ + +++TP +W L    FLS++ Q  L 
Sbjct: 60  IKKGNKWIVVTSISYPTEDVKRLASFDDWNLVVVADTKTPLDWKLDNVHFLSVEYQEQLP 119

Query: 162 FRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQE 221
           F ++  LPY SY RK+ GYL+AI HGA+ I+D DD     G  L K FD E    G R  
Sbjct: 120 FSLVKSLPYKSYTRKNIGYLYAIYHGAEWIYDTDDDNKPYGLGL-KQFDYEDTVSGVRYR 178

Query: 222 TILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQ---FIQQ 278
            +   S E     + NPY  FG   +WPRG PLE + E        +V   K     +QQ
Sbjct: 179 -VQNESSEGILERLFNPYQFFGMDQMWPRGFPLEYL-EKHRNGKDQQVLCYKMKRAAVQQ 236

Query: 279 GISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWAL 338
           G+ +  PD+D+++      S    D++F+   P + L      P NS NT++  SAF  L
Sbjct: 237 GLVHHDPDLDAIYRLLHADSNSGLDVKFNKFAPPITLSIETYSPWNSQNTLFHKSAFHTL 296

Query: 339 MLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSEEKD 391
            LP +VS   +D+ R F  Q++L  + G  V + PT        H Y K     F +EK 
Sbjct: 297 FLPTTVSFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAVQFRNAHDYLK----DFKDEKQ 351

Query: 392 LHVNVGRLIKFLVSWRSNKHRFFEKVL-ELSHSMAEEGFWTERDVKFTAAWLQDLIAVGY 450
           ++ + GR+I+FL  W+ +     E+ + +L+  + +   W E+D +    +L DL  +G+
Sbjct: 352 VYEDSGRMIEFLHKWKCSDGNGLEECISQLTDDLVKNELWEEKDSELMKMFLDDLKFLGF 411

Query: 451 QQPRLMSLELDRPRASIGHGDRKEFVPRKL 480
           + P L+      P +   +   +E   RK+
Sbjct: 412 KFPNLIDDSYKDPYSPPENETLREVNCRKM 441


>gi|86564532|ref|NP_504993.3| Protein ZK105.3 [Caenorhabditis elegans]
 gi|351050146|emb|CCD64283.1| Protein ZK105.3 [Caenorhabditis elegans]
          Length = 802

 Score =  202 bits (513), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 120/372 (32%), Positives = 189/372 (50%), Gaps = 25/372 (6%)

Query: 107 KWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLD 166
           KWIVV+    PT+ +K+L     W ++ + +++TP +W L+   FLS++ Q  L F +  
Sbjct: 49  KWIVVTSVSAPTEDVKRLSSFPDWNLVVVADTKTPLDWKLENVHFLSVEYQKQLPFSISA 108

Query: 167 FLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQY 226
            LPY SY RK+ GYL+AI HGA+ I+D DD     G  L K FD +    G R    ++ 
Sbjct: 109 LLPYKSYTRKNIGYLYAISHGAEWIYDTDDDNKPYGQGL-KQFDFDDTISGVRYRPQMR- 166

Query: 227 SHENPNRTIVNPYVHFGQRSVWPRGLPLENV-----GEISHEEFYTEVFGGKQFIQQGIS 281
           S E   + + NPY  +G   +WPRG PLE++     G  S    Y      +  +QQG+ 
Sbjct: 167 SEERILKRLFNPYRFYGMDQMWPRGFPLEHIEKHTNGNDSQVLCYQM---KRAAVQQGLV 223

Query: 282 NGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLP 341
              PDVD+++      +    +++F+   P + L  G   P NS NT++  SAF  L LP
Sbjct: 224 RHDPDVDAIYRLLHADTKSGLNLKFNKFAPPITLSVGTYSPWNSQNTLFHKSAFHTLFLP 283

Query: 342 VSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSEEKDLHV 394
            +VS   +D+ R F  Q++L  + G  V + PT        H Y K     F +EK ++ 
Sbjct: 284 TTVSFRTTDIWRSFVSQKIL-HLSGLTVSFVPTNAVQFRNAHDYLK----DFKDEKQVYE 338

Query: 395 NVGRLIKFLVSWRS---NKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQ 451
           + G++I++L  W+    N       V +L++ + E   W + D   T  +L DL  VG++
Sbjct: 339 DSGKMIEYLHDWKCAPENSSDLERCVKQLANDLVEVKLWGKEDAMLTEMFLNDLKRVGFE 398

Query: 452 QPRLMSLELDRP 463
            PR++    + P
Sbjct: 399 FPRILDGNYEDP 410


>gi|308472668|ref|XP_003098561.1| hypothetical protein CRE_05085 [Caenorhabditis remanei]
 gi|308268827|gb|EFP12780.1| hypothetical protein CRE_05085 [Caenorhabditis remanei]
          Length = 864

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 138/488 (28%), Positives = 239/488 (48%), Gaps = 35/488 (7%)

Query: 47  LLIATIAALSFLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQPIADKSSVYSRFR-- 104
           ++I +I +   L  F  ++ ++   S      ++ LP  +  +++ I+  +S  S F+  
Sbjct: 25  MMILSIVSRILLLIFCASSIIMMYYSYNSDYGSVGLPTDH--NLKRISKNASAISEFKYV 82

Query: 105 --------SEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDM 156
                     KWIVV+   YPT+ +K+L  I+ W ++ + +++TP +W L    FL +  
Sbjct: 83  RPVARVKKGNKWIVVTSISYPTEDVKRLASIEDWNLVVVADTKTPIDWKLDDVHFLPVLY 142

Query: 157 QANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGE 216
           Q  L F +   LPY SY RK+ GYL+AI  GA+ I+D DD            FD +    
Sbjct: 143 QKTLPFSLSYSLPYKSYTRKNIGYLYAIAQGAEWIYDTDDDNKPYDKRGLDQFDYDETIS 202

Query: 217 GARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISH-EEFYTEVFGGKQ- 274
           G R +  ++ S+      + NPY  +G   +WPRG PLE++ + S+ +E     +  K+ 
Sbjct: 203 GVRFQ--VKNSNAGVLERLFNPYRFYGMDQMWPRGFPLEHIEKHSNGKEQQALCYKMKRS 260

Query: 275 FIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSA 334
            +QQG+ +  PDVD+V+      S    DI+F+   P + L  G   P NS NT++  SA
Sbjct: 261 AVQQGLVHHDPDVDAVYRLLHADSKSGLDIKFNMFTPPITLSVGTYSPWNSQNTLFHKSA 320

Query: 335 FWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFS 387
           F AL LP +VS   +D+ R F  Q++L  + G  V + PT        H Y K     F 
Sbjct: 321 FHALFLPTTVSFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAIQFRNAHDYLK----DFK 375

Query: 388 EEKDLHVNVGRLIKFLVSWRSNKHRFFEKVL-ELSHSMAEEGFWTERDVKFTAAWLQDLI 446
           +EK ++ + G++I+FL +W+ +     E  + +L   +     W + D K  + +L DL 
Sbjct: 376 DEKQVYEDSGKMIEFLSNWKCSNGNSLEGCINDLLKDLVTNNLWGKEDFKLMSFFLNDLK 435

Query: 447 AVGYQQPRLMSLELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKN 506
            +G++ P L+      P  +  + + +    R++   +L  +      Y+  N+I+  + 
Sbjct: 436 YMGFEFPELIGENYQDPYTASNNEEDRNVNCRRM---NLEFDLVDPREYQRQNIIKAEQK 492

Query: 507 ---FGNVV 511
              FG++V
Sbjct: 493 LNYFGDLV 500


>gi|392889020|ref|NP_493817.2| Protein F46F5.11 [Caenorhabditis elegans]
 gi|351062195|emb|CCD70109.1| Protein F46F5.11 [Caenorhabditis elegans]
          Length = 798

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 121/370 (32%), Positives = 189/370 (51%), Gaps = 21/370 (5%)

Query: 107 KWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLD 166
           KWIVV+    PT+ +K+L     W ++ + +++TP +W LK   FLS++ Q  L F +  
Sbjct: 48  KWIVVTSVSAPTEDVKRLASFPDWNLVVVADTKTPLDWKLKNVHFLSVEYQKKLPFSMSS 107

Query: 167 FLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQY 226
            LPY SY RK+ GYL+AI HGA+ I+D DD     G  L K F+ E    G R +  L  
Sbjct: 108 LLPYKSYTRKNIGYLYAISHGAEWIYDTDDDNKPFGQGL-KQFNFEESVSGVRYQPNLMS 166

Query: 227 SHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFG---GKQFIQQGISNG 283
           S E   R + NPY  +G   +WPRG PLE++ E       ++V      +  +QQG+ + 
Sbjct: 167 SQEISQR-LFNPYEFYGVDQMWPRGFPLEHI-EKHKNRNDSQVLCYEMKRAAVQQGLVHH 224

Query: 284 LPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVS 343
            PDVD+++      S    +++F+   P + L  G   P NS NT++  SAF  L LP +
Sbjct: 225 DPDVDAIYRLLHADSKNGLNLQFNKFAPPITLSVGSYSPWNSQNTLFHKSAFHTLFLPTT 284

Query: 344 VSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSEEKDLHVNV 396
           VS   +D+ R F  Q++L  + G  V + PT        H Y K     F +EK ++ + 
Sbjct: 285 VSFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAVQFRNAHNYLK----DFKDEKQVYEDS 339

Query: 397 GRLIKFLVSWRS---NKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQP 453
           GR+I +L +W+    N  +    + +L + + +   W E+D   T  +L DL  + ++ P
Sbjct: 340 GRMIDYLHNWKCSPENSKQIENCIKQLVNDLVKVKLWGEQDAVLTELFLADLKDMRFEFP 399

Query: 454 RLMSLELDRP 463
            L+      P
Sbjct: 400 SLVGDNFKEP 409


>gi|189313899|gb|ACD88939.1| DUF288 containing protein [Adineta vaga]
          Length = 680

 Score =  198 bits (503), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 118/386 (30%), Positives = 205/386 (53%), Gaps = 28/386 (7%)

Query: 87  WNSIQPIADKSSVYSRFRSEKWIVVSVDRYPTDSLKKLVKIKG-WQVLAIGNSRTPKNW- 144
           WN+ Q     S+     R +KWIV++   YPT ++ K + +   W ++ I + +TP +W 
Sbjct: 48  WNTKQ----SSTYVCPIRGDKWIVITTIHYPTQAIYKFLNLTTPWNLIIIADRKTPTHWL 103

Query: 145 --------NLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADD 196
                   +    + L    Q +L FR+L FLP  SY RK+ GYL AIQ GA+ IF++DD
Sbjct: 104 KHLNSHNTSRLLFLSLQQQQQHSLHFRILQFLPQGSYARKNLGYLIAIQCGAQIIFESDD 163

Query: 197 RGDVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLEN 256
                 D+L ++ D+ L+ +  + + +  ++        VN Y  FG   +WPRG P++ 
Sbjct: 164 ------DNLLENNDIYLLPKLLQPKHLPWFAFHRQRSLFVNIYASFGHPHIWPRGFPIDQ 217

Query: 257 VGEISHEEFYT----EVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPK 312
           +  ++ + +++    +      +IQQ +++  PDVD+++      ++    ++FD   P 
Sbjct: 218 LRNLTEDGWHSLRQNQQNITHAYIQQYLADLDPDVDAIYRLAHPMTIGR--VQFDRDQPP 275

Query: 313 VALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYP 372
           +AL      P N+ NT+    AFW L LPV+ +    D+ RG+W QRLLW+IGG+++   
Sbjct: 276 IALESFTFSPYNTQNTVTYYEAFWGLYLPVTTTFRVCDIWRGYWVQRLLWDIGGHLIFGR 335

Query: 373 PTVHRYDKIEAY--PFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFW 430
            TV +     +Y     +E  L+      ++FL SW S+      ++ EL+ ++++ GFW
Sbjct: 336 STVQQIRNSHSYIEDMDDEYQLYHQSASFVRFLASWSSSNPSLVGRIRELARAISQGGFW 395

Query: 431 TERDVKFTAAWLQDLIAVGYQQPRLM 456
             ++V+ T AWL DL +VGY+ P ++
Sbjct: 396 KWKEVEITDAWLDDLRSVGYKFPSIV 421


>gi|392887377|ref|NP_493108.4| Protein F56H6.7 [Caenorhabditis elegans]
 gi|262225525|emb|CAB04496.6| Protein F56H6.7 [Caenorhabditis elegans]
          Length = 800

 Score =  197 bits (500), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 119/359 (33%), Positives = 189/359 (52%), Gaps = 14/359 (3%)

Query: 107 KWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLD 166
           +WIVV+    PT+ +K+L  I+ W ++ +G+++TP +W L+   FLS+  Q  L F ++ 
Sbjct: 61  RWIVVTSVSPPTEDVKRLAAIEDWNLVVVGDTKTPLDWQLENVHFLSVVYQKQLPFSLVT 120

Query: 167 FLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLG-KHFDVELVGEGARQETILQ 225
            LPY SY RK+ GYL+AI  GA+ I+D DD      D LG K FD E    G R    L 
Sbjct: 121 ELPYKSYTRKNIGYLYAISKGAEWIYDTDDDNKPC-DKLGLKQFDYEDQVSGVR---FLP 176

Query: 226 YSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQ---FIQQGISN 282
            +    ++ I NPY  +G   +WPRG PLE+  + ++    T+V   K     +QQG+ +
Sbjct: 177 QNASEISQRIFNPYRFYGMDGMWPRGFPLEHFEKHTNGN-DTQVLCYKMKRAAVQQGLVH 235

Query: 283 GLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPV 342
             PDVD+++           ++ F+   P + L  G   P NS NT++  SAF  + LP 
Sbjct: 236 HDPDVDAIYRLLNADKNSGLNVEFNKFAPPITLSVGTYSPWNSQNTLFHKSAFHTMFLPT 295

Query: 343 SVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAY--PFSEEKDLHVNVGRLI 400
           +VS   +D+ R F  Q++L   G  V   P    ++     Y   F +EK ++ + G++I
Sbjct: 296 TVSFRTTDIWRSFISQKILHLSGLTVSFVPANAVQFRNAHDYLKDFKDEKQVYEDSGKMI 355

Query: 401 KFLVSWRS--NKHRFFEKVLE-LSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLM 456
           +FL +W    N     E  ++ L + + + GFW E D K    +L DL  +G++ P+L+
Sbjct: 356 EFLHNWNCTLNNSTVLEDCIDRLLYDLVKVGFWLEDDAKMMEMYLDDLKNMGFEFPKLI 414


>gi|406970120|gb|EKD94592.1| hypothetical protein ACD_26C00029G0002 [uncultured bacterium]
          Length = 366

 Score =  196 bits (498), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 119/366 (32%), Positives = 195/366 (53%), Gaps = 49/366 (13%)

Query: 101 SRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANL 160
           S F  +KWIV++  +YPT  +KKL +I+GW +L +G+ +TPK+W+L+   +LS + Q +L
Sbjct: 23  SLFSYDKWIVITSIQYPTAQVKKLAQIEGWHLLVVGDKKTPKDWSLENCEYLSPERQLSL 82

Query: 161 GFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGD--DLGKHFDVELVGEGA 218
           G+ +   LP++ Y RK+ GYL+AI+HGA  I+D DD  + +G+  +L K+  + ++    
Sbjct: 83  GYELAKLLPWNHYSRKNIGYLYAIEHGANIIYDTDDDNEPLGELKELSKNTVLPVIS--- 139

Query: 219 RQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQF--- 275
                       PN  I N Y +F +  VWPRG PLE +   SHE    E F        
Sbjct: 140 -----------GPNGCI-NIYSYFEKPDVWPRGYPLEYIKN-SHEFNLLEQFEESSLENS 186

Query: 276 -----IQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIY 330
                I+QG+ NG PD+D+++  TR     A +I F  +   V  P G+  P NS NT +
Sbjct: 187 NVEIGIEQGLVNGDPDIDAIYRLTR---FHAGNIIFTKKQACVLAP-GIYCPFNSQNTFF 242

Query: 331 QSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVY--PPTVHRYDKIEAYP-FS 387
              AF+ L +P SVS   SD+ RG++ Q+L+ ++ G  + +  P  V   +  +    F+
Sbjct: 243 HKKAFFTLYIPGSVSMRVSDIWRGYYAQKLI-QLSGLSLAFSGPSAVQERNNHDLLKDFA 301

Query: 388 EEKDLHVNVGRLIKFLVSWRS--------NKHRFFEKVLELSHSMAEEGFWTERDVKFTA 439
            E DL++  G+L++FL  W++        N H+ F+ ++       +  F   +++    
Sbjct: 302 LEDDLYIKSGKLVEFLSQWKALYTDNNLENMHKLFQDLI-------DNKFLKNKELDLLM 354

Query: 440 AWLQDL 445
           AW+ D 
Sbjct: 355 AWINDF 360


>gi|71988391|ref|NP_503859.2| Protein F02C9.2 [Caenorhabditis elegans]
 gi|351059014|emb|CCD66877.1| Protein F02C9.2 [Caenorhabditis elegans]
          Length = 806

 Score =  195 bits (495), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 141/493 (28%), Positives = 231/493 (46%), Gaps = 49/493 (9%)

Query: 35  VRDNLFKIVTVLLLIATIAALS---FLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQ 91
           ++ ++F     +LLIA +  +    F  N+       + + + HS            +IQ
Sbjct: 2   IQRSIFHFYLNILLIACVTVVGLTYFYSNYCSNNLNSRERYRLHS------------AIQ 49

Query: 92  PIADKSSVYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIF 151
           P+A+           KWIVV+    PT+ +K+L     W ++ + +++TP +W L+ A F
Sbjct: 50  PVAEIRP------GNKWIVVTSISLPTEDVKRLASFTDWNLVVVADTKTPLDWELENAHF 103

Query: 152 LSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLG-KHFD 210
           LS++ Q    F ++  L Y SY RK+ GYL+AI  GA+ I+D DD      D LG   F+
Sbjct: 104 LSVEFQKKSPFSLVSSLSYKSYTRKNIGYLYAISQGAEWIYDTDDDNKPY-DMLGLNQFN 162

Query: 211 VELVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLEN-VGEISHEEFYTEV 269
            +    G R       + E   R + NPY  +G   +WPRG PLE+ V   +  E     
Sbjct: 163 FKETTSGVRFRPANGTATEIQQR-LFNPYRFYGMDQMWPRGFPLEHFVKHTNGNETQVLC 221

Query: 270 FGGKQ-FIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNT 328
           +  K+  +QQG+ +  PDVD+++      S    +++F+   P + L  G   P NS NT
Sbjct: 222 YKMKRAAVQQGLVHHDPDVDAIYRLQHADSRSGLNVKFNKFAPPITLSVGTYSPWNSQNT 281

Query: 329 IYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKI 381
           ++  SAF  L LP +VS   +D+ R F  Q++L  + G  V + PT        H Y K 
Sbjct: 282 MFHKSAFHTLFLPTTVSFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAVHFRNAHNYLK- 339

Query: 382 EAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEK-VLELSHSMAEEGFWTERDVKFTAA 440
               F +E+ ++ + GR+I+FL +W        +  +++L++ + E     E D      
Sbjct: 340 ---DFKDEQQVYEDSGRIIEFLHNWNCKTGSSIQSCIVQLANDLVEVKLLGEEDESLMEM 396

Query: 441 WLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVPR------KLPSVHLGVEETGTVS 494
           +L DL A+G++ P L+      P A   +   +E   R      KL   +  V E    S
Sbjct: 397 FLNDLTALGFEFPSLIGDNYVDPYAPSANESSREVNCRRMYLEFKLVDPNTNVSEISRTS 456

Query: 495 YE----IGNLIRW 503
            E     G++I+W
Sbjct: 457 QEKLNYFGDIIKW 469


>gi|189313910|gb|ACD88950.1| DUF288 containing protein [Adineta vaga]
          Length = 671

 Score =  194 bits (494), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 116/377 (30%), Positives = 202/377 (53%), Gaps = 26/377 (6%)

Query: 97  SSVYSRFRSEKWIVVSVDRYPTDSLKKLVKIKG-WQVLAIGNSRTPKNW---------NL 146
           +S +   R +KWIV++   YPT ++ K + +   W ++ I + +TP +W         + 
Sbjct: 51  NSSHCPIRGDKWIVITTIHYPTQAIYKFLNLTTPWNLIIIADRKTPTHWLKHLNSHNTSR 110

Query: 147 KGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLG 206
              + L    Q +L FR+L FLP  SY RK+ GYL AIQ GA+ IF++DD      D+L 
Sbjct: 111 LLFLSLQQQQQHSLHFRILQFLPQGSYARKNLGYLIAIQCGAQIIFESDD------DNLL 164

Query: 207 KHFDVELVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFY 266
           ++ D+ L+ +  + + +  ++        VN Y  FG   +WPRG P++ +  ++ E+ +
Sbjct: 165 ENNDIYLLPKLLQPKHLPWFAFHRQRSLFVNIYASFGHPHIWPRGFPIDQLRNLT-EDGW 223

Query: 267 TEVFGGKQ-----FIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMV 321
             +   +Q     +IQQ +++  PDVD+++      ++    ++FD   P +AL      
Sbjct: 224 HSLRQNQQNITHAYIQQYLADLDPDVDAIYRLAHPMTIGR--VQFDRDQPPIALESFTFS 281

Query: 322 PVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKI 381
           P N+ NT+    AFW L LPV+ +    D+ RG+W QRLLW+IGG+++    TV +    
Sbjct: 282 PYNTQNTVTYYEAFWGLYLPVTTTFRVCDIWRGYWVQRLLWDIGGHLIFGRSTVQQIRNS 341

Query: 382 EAY--PFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTA 439
            +Y     +E  L+      ++FL SW S+      ++ EL+ ++++ GFW  ++V+   
Sbjct: 342 HSYIEDMDDEYQLYHQSASFVRFLASWSSSNPSLVGRIRELARAISQGGFWKWKEVEIID 401

Query: 440 AWLQDLIAVGYQQPRLM 456
           AWL DL +VGY+ P ++
Sbjct: 402 AWLDDLRSVGYKFPSIV 418


>gi|341886762|gb|EGT42697.1| hypothetical protein CAEBREN_32780 [Caenorhabditis brenneri]
          Length = 813

 Score =  192 bits (489), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 120/404 (29%), Positives = 200/404 (49%), Gaps = 18/404 (4%)

Query: 89  SIQPIADKSS--VYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNL 146
           SIQP   K S  V      ++WIVV+    PT+ +K+L     W ++ + +++TP +W L
Sbjct: 64  SIQPKTFKKSEAVAPVKEGKRWIVVTSISLPTEDVKRLASFADWNLVVVADTKTPLDWEL 123

Query: 147 KGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLG 206
           +   FLS++ Q  L F ++  LPY SY RK+ GYL+AI HGA+ I+D DD     G  L 
Sbjct: 124 ENVHFLSVEYQKLLPFSLVSLLPYKSYTRKNIGYLYAISHGAEWIYDTDDDNKPYGLGLD 183

Query: 207 KHFDVELVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEIS--HEE 264
           + F  E V  G R     +         + NPY  +G   +WPRG PLE++ + +  H +
Sbjct: 184 Q-FQYEDVVSGIRYRVNNESEVTGIIDRLFNPYRFYGLDQMWPRGFPLEHIEKHTNGHAK 242

Query: 265 FYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVN 324
             +     +  +QQG+ +  PDVD+++           DI+F+   P + L  G   P N
Sbjct: 243 QVSCYKMKRAAVQQGLVHHDPDVDAIYRLLHAERSSGLDIKFNKFAPPITLSVGTYSPWN 302

Query: 325 SFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHR 377
           S NT++  SA   L LP +VS   +D+ R F  Q++L  + G  V + PT        H 
Sbjct: 303 SQNTLFHKSAVHTLFLPTTVSFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAVQFRNAHD 361

Query: 378 YDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLE-LSHSMAEEGFWTERDVK 436
           Y K     F +EK ++ + G++I+FL +W        +  +  L+  +  +  W E+D  
Sbjct: 362 YLK----DFKDEKQVYEDSGKMIEFLHNWNCRDFTTIDDCMVLLAEDLVAQNLWGEQDSI 417

Query: 437 FTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRKEFVPRKL 480
               +L DL ++G++ P ++    + P +   +   ++   R++
Sbjct: 418 LLEMFLTDLKSIGFKFPEMVEENYEDPYSPSTNEKSRDVNCRRM 461


>gi|308808151|ref|XP_003081386.1| predicted CDS, putative cytoplasmic protein family member, with a
           coiled coil-4 domain, of ancient origin (ISS)
           [Ostreococcus tauri]
 gi|116059848|emb|CAL55555.1| predicted CDS, putative cytoplasmic protein family member, with a
           coiled coil-4 domain, of ancient origin (ISS)
           [Ostreococcus tauri]
          Length = 533

 Score =  192 bits (488), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 108/373 (28%), Positives = 192/373 (51%), Gaps = 32/373 (8%)

Query: 97  SSVYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQ----VLAIGNSRTPKNWNLKGAIFL 152
           S+     +  +W+VV+    PT  ++ +  +        ++ + +++TP +W+ +G  FL
Sbjct: 113 SAATGDAKPTRWVVVTSINAPTSDMRTMCGVAAKDPALGMVVVADTKTPTDWSAEGCDFL 172

Query: 153 SLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVE 212
           S++ Q  +G ++   LPY SY RK+ GYL+AI  GA+ I++ DD      D+L   F   
Sbjct: 173 SVEAQKKMGSKLAAALPYKSYARKNLGYLYAISKGAEMIYETDD------DNL-SDFTKV 225

Query: 213 LVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENV----GEISHEEFYTE 268
              E  + E       E+ +    N Y +FG+  +WPRG PL  +    G +  E+   +
Sbjct: 226 FTPERVQDEVCSARLVEDKDHAAQNVYAYFGRPDIWPRGFPLNEINNTGGNVLMEKAVQK 285

Query: 269 VFGGKQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNT 328
            +     I+  + NG PD D++F  TR  ++    ++ D  VP VAL  G++ P NS   
Sbjct: 286 HY---SPIKSLLVNGDPDTDAIFRLTRGEAIGK--VQLDGDVPPVALDHGVICPFNSQAV 340

Query: 329 IYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYV------VVYPPTVHRYDKIE 382
           ++   AF+ +++P +      D+ RG++ QRLLW++GG +      VV   T H Y    
Sbjct: 341 LWSKEAFFLMLIPATTPMRVCDIWRGYFSQRLLWDMGGRLLFDQADVVQVRTAHDY---- 396

Query: 383 AYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWL 442
              F  E +L+ + GR++K L+ W+       ++ ++L  ++ +  FWTE   K+  AW+
Sbjct: 397 LEDFEGELELYADAGRMVKALLEWKPKGDNMADRFVDLCRTLQDGKFWTE--TKYCEAWV 454

Query: 443 QDLIAVGYQQPRL 455
           +DL  +GY+ P++
Sbjct: 455 EDLRTMGYEFPKV 467


>gi|71983179|ref|NP_493147.2| Protein E03H4.4 [Caenorhabditis elegans]
 gi|62553984|emb|CAB04026.2| Protein E03H4.4 [Caenorhabditis elegans]
          Length = 805

 Score =  191 bits (486), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 129/429 (30%), Positives = 207/429 (48%), Gaps = 41/429 (9%)

Query: 39  LFKIVTVLLLIATIAALSFLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQPIADKSS 98
           +F ++ VL LI           F    S I S    ++PN   +  I      PI     
Sbjct: 14  VFGVLLVLFLI-----------FKLHESTITSPVISYTPNPRFVAAIKSIGFPPIK---- 58

Query: 99  VYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQA 158
                   +WIVV+   +PT+ +K+L  I+ W ++ + +++TP +W L+   FLS++ Q 
Sbjct: 59  -----AGNRWIVVTSVSHPTEDVKRLAAIEDWNLVVVADTKTPVDWWLENVHFLSVEYQK 113

Query: 159 NLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLG-KHFDVELVGEG 217
            L F ++  LPY SY RK+ GYL+AI  GA+ I+D DD      D LG K FD E    G
Sbjct: 114 QLPFSLVTKLPYKSYTRKNIGYLYAISKGAEWIYDTDDDNKPY-DKLGLKQFDYEDQVSG 172

Query: 218 ARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQ--- 274
           AR    L       ++ I NPY  +G   +WPRG PLE+  + ++    ++V   K    
Sbjct: 173 AR---FLPQDARELSQRIFNPYRFYGMDGMWPRGFPLEHFEKHTNGN-SSQVLCYKMERA 228

Query: 275 FIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSA 334
            +QQG+    PDVD+++           ++ F+   P + L  G   P NS NT++  SA
Sbjct: 229 AVQQGLVQHDPDVDAIYRLLNADKNSGLNVEFNKFAPPITLSVGTYSPWNSQNTLFHKSA 288

Query: 335 FWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFS 387
           F  + LP +VS   +D+ R F  Q++L  + G  V + PT        H Y K     F 
Sbjct: 289 FHTMFLPTTVSFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAIQFRNAHDYLK----DFR 343

Query: 388 EEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIA 447
           +EK ++ + G++I F  +W+ +     + + +L + +    FW   D +    +L DL  
Sbjct: 344 DEKRVYEDSGKMIDFFHNWKCDSKTLEDCIHKLLYDLVTADFWLRDDAEMMEMYLDDLKN 403

Query: 448 VGYQQPRLM 456
           +G+Q  +L+
Sbjct: 404 LGFQFSKLL 412


>gi|308472708|ref|XP_003098581.1| hypothetical protein CRE_05084 [Caenorhabditis remanei]
 gi|308268847|gb|EFP12800.1| hypothetical protein CRE_05084 [Caenorhabditis remanei]
          Length = 840

 Score =  191 bits (486), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 135/483 (27%), Positives = 231/483 (47%), Gaps = 35/483 (7%)

Query: 52  IAALSFLRNFTDTASLIQSKSQEHSPNAIPLPVINWNSIQPIADKSSVYSRFR------- 104
           I + + L  F  ++ ++   S       + LP  +  +++ I+  +S  S F+       
Sbjct: 31  IVSRTLLLIFCASSIIMMYYSYNSDYGTVGLPTDH--NLKRISKNASAISEFKYVRPVAR 88

Query: 105 ---SEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLG 161
                KWIVV+   YPT+ +K+L  I+ W ++ + +++TP +W L    FL +  Q  L 
Sbjct: 89  VKKGNKWIVVTSISYPTEDVKRLASIEDWNLVVVADTKTPVDWKLDDVHFLPVLYQKTLP 148

Query: 162 FRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQE 221
           F +   LPY SY RK+ GYL+AI  GA+ I+D DD            FD +    G R +
Sbjct: 149 FSLSYSLPYKSYTRKNIGYLYAIAQGAEWIYDTDDDNKPYDKRGLDQFDYDETISGVRFQ 208

Query: 222 TILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISH-EEFYTEVFGGKQ-FIQQG 279
             ++ S       + NPY  +G   +WPRG PLE++ + S+ +E     +  K+  +QQG
Sbjct: 209 --VKNSEAGVLERLFNPYRFYGIDQMWPRGFPLEHIEKHSNGKEHQVLCYKMKRSSVQQG 266

Query: 280 ISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALM 339
           + +  PDVD+V+           DI+F+   P + L  G   P NS NT++  SAF  L 
Sbjct: 267 LVHHDPDVDAVYRLLHADPKSGLDIKFNMFSPPITLSVGTYSPWNSQNTLFHKSAFHTLF 326

Query: 340 LPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSEEKDL 392
           LP +VS   +D+ R F  Q++L  + G  V + PT        H Y K     F +EK +
Sbjct: 327 LPTTVSFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAIQFRNAHDYLK----DFKDEKQV 381

Query: 393 HVNVGRLIKFLVSWRSNKHRFFEKVL-ELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQ 451
           + + G++I+FL +W+       E  + +L   +     W + D K  + +L DL  +G++
Sbjct: 382 YEDSGKMIEFLSNWKCLNGNSLEGCINDLLKDLVTNNLWGKEDFKLMSFFLNDLKYMGFE 441

Query: 452 QPRLMSLELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKN---FG 508
            P L+      P  +  + + +    R++   +L  +      Y+  N+I+  +    FG
Sbjct: 442 FPELIGENYQDPYTASNNEEDRNVNCRRM---NLEFDLVDPREYQRQNIIKAEQKLNYFG 498

Query: 509 NVV 511
           ++V
Sbjct: 499 DLV 501


>gi|71996148|ref|NP_503670.2| Protein F56A4.6 [Caenorhabditis elegans]
 gi|351019371|emb|CCD62316.1| Protein F56A4.6 [Caenorhabditis elegans]
          Length = 796

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 125/392 (31%), Positives = 196/392 (50%), Gaps = 27/392 (6%)

Query: 107 KWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLD 166
           KWIVV+    PTD +K L     W ++ + +++TP +WNL+   FLS++ Q  L F +  
Sbjct: 61  KWIVVTSISLPTDDVKVLASFVDWNLVVVADTKTPLDWNLENVHFLSVEYQKQLPFSLAF 120

Query: 167 FLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQY 226
            LPY SY RK+ GYL+AI  GA+ I+D DD      D L K F  +      R  ++L  
Sbjct: 121 SLPYKSYTRKNIGYLYAISQGAEWIYDTDDDNKPY-DKLPK-FPYQFDLRDMRDISVL-- 176

Query: 227 SHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISH-EEFYTEVFGGKQ-FIQQGISNGL 284
                 + + NPY  FG   +WPRG PLE+  + ++  E     +  K+  +QQG+ +  
Sbjct: 177 -----TQRLFNPYRIFGMEQMWPRGFPLEHFEKHTNGNESQVLCYKMKRAAVQQGLVHHD 231

Query: 285 PDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSV 344
           PDVD+++      S    DI F+   P + L  G   P NS NT++  SAF  L LP +V
Sbjct: 232 PDVDAIYRLLHADSSNGLDISFNKFAPPITLSVGTYSPWNSQNTMFHKSAFHTLFLPTTV 291

Query: 345 STMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSEEKDLHVNVG 397
           S   +D+ R F  Q++L  + G  V + PT        H Y K     F +EK ++ + G
Sbjct: 292 SFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAVQFRNAHNYLK----DFKDEKQVYEDSG 346

Query: 398 RLIKFLVSWRSNKHRFFEKVL-ELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLM 456
           ++I+FL SW  +     +  + EL + + +     ++D      +L DL A+G++ P L+
Sbjct: 347 KMIEFLHSWNCSTGNSTQSCMIELVNDLVKVKLLGKQDASLMEMFLNDLTAMGFEYPSLL 406

Query: 457 SLELDRPRASIGHGDRKEFVPRKLPSVHLGVE 488
             +   P A   +   +E   R++   HL  E
Sbjct: 407 GEDYIDPYAPSMNESTREVNCRRM---HLEFE 435


>gi|32567126|ref|NP_503697.2| Protein Y45G12C.11 [Caenorhabditis elegans]
 gi|351018360|emb|CCD62306.1| Protein Y45G12C.11 [Caenorhabditis elegans]
          Length = 779

 Score =  189 bits (480), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 122/392 (31%), Positives = 193/392 (49%), Gaps = 44/392 (11%)

Query: 107 KWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLD 166
           KWIVV+    PTD +K+L     W ++ + +++TP +WNL+   FLS++ Q  L F +  
Sbjct: 61  KWIVVTSISLPTDDVKRLASFVDWNLVVVADTKTPLDWNLENVHFLSVEYQKQLPFSLAF 120

Query: 167 FLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQY 226
            LPY SY RK+ GYL+AI  GA+ I+D DD          K +D                
Sbjct: 121 SLPYKSYTRKNIGYLYAISQGAEWIYDTDDDN--------KPYD---------------- 156

Query: 227 SHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISH-EEFYTEVFGGKQF-IQQGISNGL 284
             + P + + NPY  FG   +WPRG PLE+  + ++  E     +  K+  +QQG+ +  
Sbjct: 157 --KLPKQRLFNPYRIFGMEQMWPRGFPLEHFEKHTNGNESQVLCYKMKRAAVQQGLVHHD 214

Query: 285 PDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSV 344
           PDVD+++      S    DI F+   P + L  G   P NS NT++  SAF  L LP +V
Sbjct: 215 PDVDAIYRLLHADSSNGLDISFNKFAPPITLSVGTYSPWNSQNTMFHKSAFHTLFLPTTV 274

Query: 345 STMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSEEKDLHVNVG 397
           S   +D+ R F  Q++L  + G  V + PT        H Y K     F +EK ++ + G
Sbjct: 275 SFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAVQFRNAHNYLK----DFKDEKQVYEDSG 329

Query: 398 RLIKFLVSWRSNKHRFFEKVL-ELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLM 456
           ++I+FL SW  +     +  + EL + + +     ++D      +L DL A+G++ P L+
Sbjct: 330 KMIEFLHSWNCSTGNSTQSCMIELVNDLVKVKLLGKQDASLMEMFLNDLTAMGFEYPSLL 389

Query: 457 SLELDRPRASIGHGDRKEFVPRKLPSVHLGVE 488
             +   P A   +   +E   R++   HL  E
Sbjct: 390 GEDYIDPYAPSMNESTREVNCRRM---HLEFE 418


>gi|71987610|ref|NP_493110.2| Protein F56H6.9 [Caenorhabditis elegans]
 gi|62554003|emb|CAB04498.3| Protein F56H6.9 [Caenorhabditis elegans]
          Length = 803

 Score =  189 bits (480), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 116/367 (31%), Positives = 191/367 (52%), Gaps = 24/367 (6%)

Query: 104 RSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFR 163
           +  +WIVV+    PT+ +K+L  I+ W ++ + +++TP +W L+   FLS+  Q  L F 
Sbjct: 55  KGNRWIVVTSVSQPTEDVKRLAAIEDWNLVVVADTKTPLDWKLENVHFLSVAYQKQLPFT 114

Query: 164 VLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLG-KHFDVELVGEGARQET 222
           ++  LPY SY RK+ GYL+AI  GA+ I+D DD      D LG K FD E    G R   
Sbjct: 115 LVSELPYKSYTRKNIGYLYAISKGAEWIYDTDDDNKPY-DKLGLKQFDYEDQVSGVR--- 170

Query: 223 ILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQ---FIQQG 279
            L  +    ++ + NPY  +G   +WPRG PLE+  + ++    ++V   K     +QQG
Sbjct: 171 FLPQNASGISQRLFNPYRFYGMDGMWPRGFPLEHFEKHTNGN-NSQVLCYKMKRAAVQQG 229

Query: 280 ISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALM 339
           + +  PDVD+++           ++ F+   P + L  G   P NS NT++  SAF  + 
Sbjct: 230 LVHHDPDVDAIYRLLNADKNNGLNVEFNKFAPPITLSVGTYSPWNSQNTLFHKSAFHTMF 289

Query: 340 LPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSEEKDL 392
           LP +VS   +D+ R F  Q++L  + G  V +  T        H Y K     F  EK +
Sbjct: 290 LPTTVSFRTTDIWRSFISQKIL-HLSGLTVSFVSTNAVQFRNAHDYLK----DFKNEKQV 344

Query: 393 HVNVGRLIKFLVSW---RSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVG 449
           + + G++I+FL +W   R+N       + +L   +A+E  W   D +    +L+DL ++G
Sbjct: 345 YEDSGKMIEFLHNWNCTRNNSTVLENCINQLLVDLAKEKLWGSEDARLMGMYLEDLKSMG 404

Query: 450 YQQPRLM 456
           ++ P+L+
Sbjct: 405 FKFPKLV 411


>gi|392919341|ref|NP_504727.2| Protein T15B7.10 [Caenorhabditis elegans]
 gi|373254212|emb|CCD68171.1| Protein T15B7.10 [Caenorhabditis elegans]
          Length = 443

 Score =  179 bits (455), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 112/369 (30%), Positives = 180/369 (48%), Gaps = 44/369 (11%)

Query: 107 KWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLD 166
           KWIVV+    PT+ +K+L     W ++ + + +TP +W L+   FLS+  Q  L F ++ 
Sbjct: 103 KWIVVTTISLPTEDVKRLASFVDWNLVVVADIKTPLDWKLENVHFLSVQFQKQLPFSLVS 162

Query: 167 FLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQY 226
            LPY SY RK+ GYL+AI   A+ I+D DD          K +D                
Sbjct: 163 SLPYKSYKRKNIGYLYAISQEAEWIYDTDDAN--------KPYD---------------- 198

Query: 227 SHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISH-EEFYTEVFGGKQF-IQQGISNGL 284
                 + + NPY  +G   +WPRG PLE+  + ++  E  +  +  K+  +QQG+ +  
Sbjct: 199 -----KQRLFNPYRFYGMDQMWPRGFPLEHFEKHTNGNETLSSCYQMKRAAVQQGLVHHD 253

Query: 285 PDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSV 344
           PDVD+++      S    DI+F+   P + L  G   P NS NT++  SAF  L LP +V
Sbjct: 254 PDVDAIYRLIHADSKNGLDIKFNKFAPPITLSVGTYSPWNSQNTMFHKSAFHTLFLPTTV 313

Query: 345 STMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSEEKDLHVNVG 397
           S   +D+ R F  Q++L  + G  V + PT        H Y K       +EK ++ + G
Sbjct: 314 SFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAVQFRNAHNYLK----DLKDEKQVYEDSG 368

Query: 398 RLIKFLVSWR-SNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLM 456
           R+I+FL +W  S ++     ++E+++ +  E    + D K    +L DL  +G+  P L+
Sbjct: 369 RMIEFLHNWNCSTRNSTRSCIIEMTNDLVTEKLLGKEDAKLMEMFLNDLTEMGFTFPVLL 428

Query: 457 SLELDRPRA 465
                 P A
Sbjct: 429 EHNYLDPYA 437


>gi|453232384|ref|NP_504731.2| Protein T15B7.8 [Caenorhabditis elegans]
 gi|393793284|emb|CCD68162.2| Protein T15B7.8 [Caenorhabditis elegans]
          Length = 841

 Score =  178 bits (452), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 116/399 (29%), Positives = 196/399 (49%), Gaps = 48/399 (12%)

Query: 101 SRFRS-EKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQAN 159
           SR ++  KWIVV+    PT+ +K+L     W ++ + +++TP +W L+   FLS+  Q  
Sbjct: 95  SRIKAGNKWIVVTTISSPTEDIKRLASFVDWNLVVVADTKTPLDWKLENVHFLSVQYQRQ 154

Query: 160 LGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGAR 219
           L F ++  LPY SY RK+ GYL+AI  GA+ ++D DD          K +D         
Sbjct: 155 LPFSLVSSLPYKSYTRKNIGYLYAISQGAEWVYDTDDDN--------KPYD--------- 197

Query: 220 QETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISH-EEFYTEVFGGKQF-IQ 277
                        + + NPY  +G   + PRG PLE+  + ++  E     +  K+  +Q
Sbjct: 198 ------------KQRLFNPYRFYGMDRMCPRGFPLEHFDKHTNGNETLVLCYQMKRAAVQ 245

Query: 278 QGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWA 337
           QG+ +  PDVD+++      S    D RF+   P + L  G   P NS NT++  SAF  
Sbjct: 246 QGLVHHDPDVDAIYRLIHADSKNGLDNRFNKFAPAITLSVGTYSPWNSQNTMFHKSAFHT 305

Query: 338 LMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSEEK 390
           L LP +VS   +D+ R F  Q++L  + G  V + PT        H Y K       +EK
Sbjct: 306 LFLPTTVSFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAVQFRNAHNYLK----DLKDEK 360

Query: 391 DLHVNVGRLIKFLVSWR-SNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVG 449
            ++ + GR+I+FL +W+ S ++     ++E+++ + ++    + D K    +L DL  +G
Sbjct: 361 QVYEDSGRMIEFLHNWKCSTRNSSQNCIIEMTNDLVKKKLLGKEDAKLMEMFLNDLTEMG 420

Query: 450 YQQPRLMSLELDRPRASIGHGDRKEFVPRKLPSVHLGVE 488
           ++ P L+  +   P A   +   ++   R++   HL  E
Sbjct: 421 FKFPILIENDFLDPYAPSTNETSRDVNCRRM---HLEFE 456


>gi|294463263|gb|ADE77167.1| unknown [Picea sitchensis]
          Length = 269

 Score =  175 bits (444), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 79/112 (70%), Positives = 96/112 (85%)

Query: 493 VSYEIGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVE 552
           +++EIGNLIRWRK +GN+VLIM CSGPV  T L WR+LYGRIFK+V+++SEQ N DL VE
Sbjct: 1   MNFEIGNLIRWRKFYGNIVLIMHCSGPVNHTVLGWRMLYGRIFKSVVVVSEQSNPDLGVE 60

Query: 553 AGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLWITDKV 604
            G+  QVY+ LPKIF RYT+A+GF+FL+DDTILNYWNLLQADK +LWIT KV
Sbjct: 61  YGEWWQVYKVLPKIFERYTNADGFMFLKDDTILNYWNLLQADKTRLWITHKV 112


>gi|298706837|emb|CBJ25801.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 400

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 110/374 (29%), Positives = 176/374 (47%), Gaps = 31/374 (8%)

Query: 104 RSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFR 163
           + E+W V++    PTD++++L   + W V+ +G+   P  +N++G I+L+   Q  L +R
Sbjct: 27  KCERWAVLTSIFEPTDTVRQLGAAEDWCVVVVGDQNGPAEYNVEGVIYLTPQDQEQLPYR 86

Query: 164 VLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETI 223
           ++  LP++ + RK+ GYL+A+ HGA  I+D DD   +I  + G    +  V   +     
Sbjct: 87  IVPLLPWNHFGRKNIGYLYAVHHGATVIYDVDDDNALIHPEAGVPHALSPVTPASTTSFA 146

Query: 224 LQYSHENPNRTIVNPYVHF-GQRSVWPRGLPLENVGEI-----------SHEEFYTEVFG 271
           +      P   + NPY  F G  +VWPRG PL+++ +            S  E      G
Sbjct: 147 V-----GPEAFVHNPYGCFGGPGNVWPRGFPLDSINDADSNRCDEVAVDSAGESAAPEEG 201

Query: 272 GKQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVA-----LPQGMMVPVNSF 326
            +  + Q ++N  PDVD+V+  T  P    FD    D VP+       +P     P N+ 
Sbjct: 202 WRLGVVQALANHDPDVDAVYRLTYPPGGLPFDFEVPDPVPEGMSSLKIVPPAAFTPYNAQ 261

Query: 327 NTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAY-- 384
            T++   AFW ++LPV+V    SD+ R ++ Q LL   G      P  V +      Y  
Sbjct: 262 ATLHFPPAFWGMLLPVTVHGRVSDIWRSYFTQTLLTSTGAVTAFAPAWVEQIRNPHNYLA 321

Query: 385 PFSEEKDLHVNVGRLIKFL-------VSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKF 437
            F  E  L+   G L+ FL       V+         E++  L   M E G   + DV+ 
Sbjct: 322 DFQAELPLYEQSGALVAFLDGHRRQSVAASEAGVGLPERIDALMVEMYEYGVLEQADVQL 381

Query: 438 TAAWLQDLIAVGYQ 451
           + AWL+DL +VGY 
Sbjct: 382 SQAWLEDLYSVGYN 395


>gi|308506363|ref|XP_003115364.1| hypothetical protein CRE_18496 [Caenorhabditis remanei]
 gi|308255899|gb|EFO99851.1| hypothetical protein CRE_18496 [Caenorhabditis remanei]
          Length = 1251

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 104/320 (32%), Positives = 164/320 (51%), Gaps = 40/320 (12%)

Query: 104 RSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFR 163
           +  KWIVV+   +PT+ +K+L   + W ++ + +++TP +W L+   FLS++ Q  L F 
Sbjct: 16  KGNKWIVVTSVNHPTEDVKRLSSFRDWNLVVVADTKTPVDWELEDVHFLSVEYQKTLPFS 75

Query: 164 VLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETI 223
           ++  LPY SY RK+ GYL+AI  GA+ I+D DD     G  L + F  E V  G R +  
Sbjct: 76  LVSSLPYKSYTRKNIGYLYAISQGAEWIYDTDDDNKPYGLGLNQ-FQFEDVVSGVRYQ-- 132

Query: 224 LQYSHENPNRTIVNPYVHFGQRSVWPRGLPLE--NVGEISHEEFY-----TEVFGGKQFI 276
           ++ S E   + I NPY  +G   +WPRG PLE   V +I+HE F       ++ G  + +
Sbjct: 133 VKNSSEGILQRIFNPYRFYGIDQMWPRGFPLEYIEVIDITHERFQIYSRNIQMEGKTKLL 192

Query: 277 QQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFW 336
             G ++GL                  DI+F+   P + L  G   P NS N ++  +AF 
Sbjct: 193 HAGSTSGL------------------DIKFNKFAPPITLSVGTYSPWNSQNILFHKTAFH 234

Query: 337 ALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSEE 389
            L LP +V    +D+ R F  QR++  + G  V + PT        H Y K     F +E
Sbjct: 235 TLFLPTTVPFRTTDIWRSFISQRIV-HLSGLTVSFVPTNAVQFRNAHDYLK----DFKDE 289

Query: 390 KDLHVNVGRLIKFLVSWRSN 409
           K ++ + G++I+FL +W  +
Sbjct: 290 KQVYEDSGKIIEFLDNWNCS 309


>gi|299470238|emb|CBN79542.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 794

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 117/397 (29%), Positives = 187/397 (47%), Gaps = 37/397 (9%)

Query: 83  PVINWNSIQPIADKSSVY----SRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNS 138
           P +   +  P  + SS++    +    E+W V++    PTD++K+L ++  W V+ +G+ 
Sbjct: 65  PRMEIRTTPPSVEPSSLFPPPAAEDTCERWAVLASADEPTDAVKQLAELGEWCVVVVGDK 124

Query: 139 RTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRG 198
             P  +N+ G + L+   Q  L +R+ D +P++   RK+ GYL+AI HGAK I+D DD  
Sbjct: 125 DGPTEYNVVGVVLLTPSDQEALPYRITDLIPWNHVGRKNIGYLYAIHHGAKVIYDVDDAH 184

Query: 199 DVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPYVHFGQRS-VWPRGLPLENV 257
            ++  + G  F      E +  E  L +    P+  + NPY  FG    VWPRG P   +
Sbjct: 185 VLMRPEEGVPF-----AETSSAEHELSF-FSRPSTCVHNPYPCFGASGVVWPRGFPPAKI 238

Query: 258 GEISHEEFYTEVFGGKQFIQ-----QGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPK 312
            + S       + GG    Q     Q +++  PDVD+++  T  P      + F +  P 
Sbjct: 239 RDKSSSMCGVVMGGGGAGEQRVGVVQALADNNPDVDALYRMTCAP--RGSPLSFVEESPP 296

Query: 313 VA------LPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGG 366
           +       +P     P N+  T++   AFW ++LPV+V    SD+ R ++ Q LL   G 
Sbjct: 297 LPGSSLRLVPAWTFSPYNAKATLHFPVAFWGMLLPVTVHERVSDIWRSYFTQTLLPSAGA 356

Query: 367 YVVVYPPTVHR-YDKIEAY--PFSEEKDLHVNVGRLIKFLVSWR----------SNKHRF 413
            V   PP V R  +   +Y   F  E  L+   G L+ FL+ +R          ++    
Sbjct: 357 VVGFAPPWVTRELEGPNSYRDDFQAELPLYEQSGALVDFLLQYRHAVEDEASAQASPESQ 416

Query: 414 FEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGY 450
             ++  LS +M E G     DV  T AWL+DL  VGY
Sbjct: 417 ASRIEALSVTMYEHGIVEGDDVALTQAWLKDLRDVGY 453


>gi|428319704|ref|YP_007117586.1| Protein of unknown function DUF288 [Oscillatoria nigro-viridis PCC
           7112]
 gi|428243384|gb|AFZ09170.1| Protein of unknown function DUF288 [Oscillatoria nigro-viridis PCC
           7112]
          Length = 343

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 101/346 (29%), Positives = 179/346 (51%), Gaps = 23/346 (6%)

Query: 105 SEKWIVVSVDRYPTDSLKKLVKIK---GWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLG 161
           S K +V++    PT +L K   I    GW+++ +G+ +TP++++L GA + +++ Q    
Sbjct: 12  SVKSLVITTINKPTAALFKYRDILLDLGWKIIVVGDKKTPRDFDLPGAEYFNVEQQCEEF 71

Query: 162 FRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQE 221
             +   +P + Y RK+ GYL+A++ GA+ I + DD  ++  DD   +F   LV   A   
Sbjct: 72  GELASLIPMNHYSRKNLGYLYAMRMGAEAIAETDD-DNIPYDDKYPNFLPSLVKTPAVDV 130

Query: 222 TILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGIS 281
                      +  VN Y +F  + +WPRGLPL+ V     E   TE      ++QQG++
Sbjct: 131 -----------KGAVNVYSYFTSKKIWPRGLPLDKVNSFVDENLATEK-EVTCYVQQGLA 178

Query: 282 NGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLP 341
           +  PDVD+++  T        +I+F+    K++L  G   P N+ NT++   AF  ++LP
Sbjct: 179 DLDPDVDAIYRLTVGDE----NIKFEPH-KKLSLSPGCYSPFNTQNTLFDKQAFPLMLLP 233

Query: 342 VSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVH--RYDKIEAYPFSEEKDLHVNVGRL 399
           + VS+  +D+ + +  QRLLW +   V+   P+V+  R +      FSEE  L+  V  L
Sbjct: 234 IGVSSRVTDIWKSYIAQRLLWCMNSSVLFLSPSVYQLRNEHNLMKDFSEEIPLYTQVHNL 293

Query: 400 IKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDL 445
           I  L ++ S+     E ++E+   +   GF  E +V+    W++++
Sbjct: 294 IDLLENFTSDASDACELMIEMYAYLNRNGFLGEIEVRLCELWIEEV 339


>gi|300175503|emb|CBK20814.2| unnamed protein product [Blastocystis hominis]
          Length = 441

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 101/367 (27%), Positives = 181/367 (49%), Gaps = 38/367 (10%)

Query: 105 SEKWIVVSVDRYPTDSLKKLVKIK-GWQVLAIGNSRTPKNWNLKGA--IFLSLDMQANLG 161
            + W V++    PT  +++L + +    V+ + + ++P  +N+  A  ++L+++ Q  L 
Sbjct: 92  CKSWAVITSVNSPTVVVRQLAETEENLCVVVVADKKSPIEYNVTRAHLVYLTVEDQEKLD 151

Query: 162 FRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQE 221
           + ++  +P++ + RK+ G+L+AIQHGAK+IFD DD  ++I D         ++ +  R++
Sbjct: 152 YNIMKLVPWNHFARKNVGFLYAIQHGAKRIFDLDDDNELISDK-------NIMNQVFRKD 204

Query: 222 TILQYSHENPNRTIVNPYVHFGQRS---VWPRGLPLENVGEISHEEFYTE------VFGG 272
               +   N  + + NPY+ +  +    +WPRG PLE +       F  E          
Sbjct: 205 K-KTFKFVNTTQYVTNPYMIYLNKEGEYIWPRGYPLEAIKTPHDYSFIDENPSEKSSLVN 263

Query: 273 KQFIQQGISNGLPDVDSVFYFTRK-PSLEAFDIRFDDRVP-KVALPQGMMVPVNSFNTIY 330
           K  + Q + N  PD+D+++  T   PS       FD  +   + L +    P N+ +T++
Sbjct: 264 KIGVIQYLQNVNPDLDAIYRITSTIPST------FDPSITYCIILKKTSFSPWNAQSTVF 317

Query: 331 QSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTV------HRYDKIEAY 384
           +   FW ++LP++V    SD+ R ++ QR++WE   Y+   P  V      HR  K    
Sbjct: 318 EYETFWGMLLPMTVHGRVSDIWRSYFTQRVMWERDKYMAFCPSIVNHIRNQHRLIK---- 373

Query: 385 PFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQD 444
            F  E  L+     ++KFL  W        E + EL   M E G    RDV+F  AW++D
Sbjct: 374 DFDAEMPLYTQTEAMLKFLNEWTPKAQEVPEILEELYVEMYERGIVELRDVEFIQAWIRD 433

Query: 445 LIAVGYQ 451
           L+ +GY+
Sbjct: 434 LVQIGYR 440


>gi|422295611|gb|EKU22910.1| hypothetical protein NGA_0436100 [Nannochloropsis gaditana CCMP526]
          Length = 693

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 104/362 (28%), Positives = 178/362 (49%), Gaps = 30/362 (8%)

Query: 117 PTDSLKKLVKIKGWQVLAIGNSRTPKNWNL--KGAIFLSLDMQANLGFRVLDFLPYDSYV 174
           PT  +K+L  +K W V+ +G+ ++P  +++     +FLS + Q  L + ++  L ++ + 
Sbjct: 4   PTVLVKQLAGMKNWCVVVVGDKKSPPTYDIPSDNLVFLSPEEQEALPYHIIPLLRWNHFG 63

Query: 175 RKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVEL-VGEGARQETILQYSHENPNR 233
           RK+ G+L+A+ HGA+ I+D DD   +  D  G  F  +  +GE A  + +++    +   
Sbjct: 64  RKNIGFLYAMHHGAEMIYDTDDDNILKVDSEGNPFIPDFSLGELATSKDVVRPGQSH--- 120

Query: 234 TIVNPYVHFGQRSV--------WPRGLPLENVGEISH-------EEFYTEVFGGKQFIQQ 278
            + NPY  F   +V        WPRG P++ + + S        EE   E  GG   I Q
Sbjct: 121 -VYNPYPSFDSVNVKDGSPAFVWPRGFPVDLITDASTWNVSRGVEEGTHE--GGVITIVQ 177

Query: 279 GISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWAL 338
            +++  PDVD+++  T    L     R   R     +P G+M P N+  T++  +AFW +
Sbjct: 178 SLADHDPDVDALYRLTSHLPLS---FRSGGRARFEVIPPGVMTPFNAQATVFGKAAFWGM 234

Query: 339 MLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAY--PFSEEKDLHVNV 396
           +LP++V    SD+ R +   R++WE G  V    P V +     +Y   F  E DL+   
Sbjct: 235 LLPITVHGRVSDIWRSYITGRIMWEAGQRVAFASPFVTQCRNPHSYLADFDAESDLYERA 294

Query: 397 GRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTER-DVKFTAAWLQDLIAVGYQQPRL 455
           G L+ +L+ WR         + E++ +M E  F  +  DV    AW++DL  +G   P  
Sbjct: 295 GALVSWLLKWRPVSPYLEGMIEEMAVAMYEMDFLHDPLDVDLAIAWIEDLRGIGVAMPNT 354

Query: 456 MS 457
           +S
Sbjct: 355 LS 356


>gi|268565543|ref|XP_002647351.1| Hypothetical protein CBG06402 [Caenorhabditis briggsae]
          Length = 1108

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 120/433 (27%), Positives = 189/433 (43%), Gaps = 88/433 (20%)

Query: 107 KWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNL---KGAIFLSLDMQANLGFR 163
           KWIVV+   YPTD + +L  I  W ++ +G+++TPK+W L   K  IF  +  Q    F+
Sbjct: 59  KWIVVTSINYPTDDVMRLAAIPDWNLVVVGDTKTPKDWELPNKKLIIFRKILRQGLKQFQ 118

Query: 164 VLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETI 223
                 Y+  V         +++ AK   +A++   +I                      
Sbjct: 119 ------YEETVS-------GVRYQAKSFEEANNSTGII---------------------- 143

Query: 224 LQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQF-------- 275
                    + + NPY  +G   +WPRG PLEN+      E ++ V G +          
Sbjct: 144 ---------KRLFNPYQFYGVDQMWPRGFPLENI------EKHSNVLGQQTLCYQMPRPA 188

Query: 276 IQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAF 335
           +QQG+ +  PDVD+++           DI+F++  P + L  G   P NS NT++  SAF
Sbjct: 189 VQQGLVHHDPDVDAIYRLLHANPKTGLDIKFNEFAPPIILSVGTYSPWNSQNTLFHKSAF 248

Query: 336 WALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSE 388
             L LP +VS   +D+ R F  Q++L  + G  V + PT        H Y K     F +
Sbjct: 249 HTLFLPTTVSFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAVQFRNAHDYLK----DFKD 303

Query: 389 EKDLHVNVGRLIKFLVSWRSNKHRFFEKVL-ELSHSMAEEGFWTERDVKFTAAWLQDLIA 447
           EK ++ + GR ++FL SW        E  + +L+  + E  FW   D K    +L DL  
Sbjct: 304 EKSVYEDSGRFLEFLHSWNCKNGPVLENCMNQLAEDLVENNFWRNEDAKLMMMFLSDLKL 363

Query: 448 VGYQQPRLMSLELDRPR-ASIGHGDRK----------EFV-PRKLPSVHLGVEETGTVSY 495
           +G++ P ++  E   P  AS    +R           E V PR     +L  ++ G    
Sbjct: 364 LGFEFPEILKGEYVEPYLASANETERNVNCRRMNLEFELVDPRNYEQQNL--QKAGQKLQ 421

Query: 496 EIGNLIRWRKNFG 508
            IG+L+ W K  G
Sbjct: 422 YIGDLVDWCKETG 434


>gi|434394831|ref|YP_007129778.1| Protein of unknown function DUF288 [Gloeocapsa sp. PCC 7428]
 gi|428266672|gb|AFZ32618.1| Protein of unknown function DUF288 [Gloeocapsa sp. PCC 7428]
          Length = 326

 Score =  146 bits (368), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 97/342 (28%), Positives = 164/342 (47%), Gaps = 28/342 (8%)

Query: 107 KWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLD 166
            +IV++    PT++LKK   +  WQV+ + + +TPK+W L     LS++ Q  L F +L 
Sbjct: 4   NFIVITSINSPTEALKKFSLMPDWQVILVADLKTPKDWQLDNVKVLSVEEQKTLPFTILK 63

Query: 167 FLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQY 226
           +LP++ Y RK+ GYL+A+  GA+ I++ DD  ++  D       V+L  +     T    
Sbjct: 64  YLPWNHYARKNIGYLYAMLQGAELIYETDD-DNIPYDSWHGFHPVQLQAKAYTSST---- 118

Query: 227 SHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPD 286
                     N Y +F + ++WPRG PL  +   +  +   E       +QQG+++  PD
Sbjct: 119 -------KFFNAYSYFCEANIWPRGFPLTAIHSPTELQIANEFISAP--VQQGLADLDPD 169

Query: 287 VDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVST 346
           VD+++           +++F  R P V L  G   P NS NT++   AF  + LP  V  
Sbjct: 170 VDAIYRLAIGK-----EVKFSQREP-VFLAPGTYCPFNSQNTLWYPEAFQYMYLPAFVFN 223

Query: 347 MASDVLRGFWGQRLLWEIGGYVVVYPPTVHR---YDKIEAYPFSEEKDLHVNVGRLIKFL 403
             +D+ RG+  Q  L +    V+    +V++   Y K+  + F EE DL+     LI  L
Sbjct: 224 RLTDIWRGYIAQHFLHQKAQGVLFCNASVYQERNYHKL-LHDFIEEIDLYTRTEELINVL 282

Query: 404 VSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDL 445
             + S+   F      +   + +  F  + +V    +WL+DL
Sbjct: 283 NEYTSHSQDF----AGIMQHLHQHHFVKDEEVVLFDSWLEDL 320


>gi|149390757|gb|ABR25396.1| unknown [Oryza sativa Indica Group]
          Length = 247

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 65/90 (72%), Positives = 75/90 (83%)

Query: 517 SGPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGF 576
           SGPV+RTALEWRLLYGRIFKTVIIL+EQ N +LAVE   L   Y+ LPK+F+RY  A+GF
Sbjct: 3   SGPVDRTALEWRLLYGRIFKTVIILAEQSNTELAVERCALSHAYKFLPKVFARYGGADGF 62

Query: 577 LFLQDDTILNYWNLLQADKNKLWITDKVLY 606
           LFLQD  ILNYWNLLQADK KLWIT+K+ +
Sbjct: 63  LFLQDHMILNYWNLLQADKEKLWITNKIAH 92


>gi|209524116|ref|ZP_03272667.1| conserved hypothetical protein [Arthrospira maxima CS-328]
 gi|209495491|gb|EDZ95795.1| conserved hypothetical protein [Arthrospira maxima CS-328]
          Length = 340

 Score =  131 bits (330), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 99/317 (31%), Positives = 153/317 (48%), Gaps = 26/317 (8%)

Query: 135 IGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDA 194
           IG+  +P ++NL+G  F S+  Q  L        P   Y RK+ GYL AIQ GA+ I + 
Sbjct: 37  IGDEISPSDFNLEGCDFYSIARQEALDLSFPKICPKRHYARKNIGYLLAIQQGAEIIIET 96

Query: 195 DDRGDVIGDDLGKHFDVELVGEG-ARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLP 253
           DD           +F  E   E   R +T+       PN    N Y +F   ++WPRGLP
Sbjct: 97  DD----------DNFPYESFWEKRERYQTVSSI----PNLGWCNVYKYFTDANIWPRGLP 142

Query: 254 LENVGEISHEEFYT-EVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPK 312
           L+ V   S  ++ T E+      IQQG++N  PDVD+++     P  ++F+   + R  +
Sbjct: 143 LDEVNCQSLPDWDTLEITLANCPIQQGLANDNPDVDAIYRLIF-PLPQSFN---NHR--R 196

Query: 313 VALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYP 372
           +AL  G   P NS NT + + A+  L LP   S   +D+ R F  QR+ WE G  V+ + 
Sbjct: 197 IALASGSWCPFNSQNTTWWADAYPLLYLPAYCSFRMTDIWRSFIAQRIAWENGWSVLFHQ 256

Query: 373 PTVH--RYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNK--HRFFEKVLELSHSMAEEG 428
           PTV+  R +      F EE   +++   + K L + +     H+  E +L    ++   G
Sbjct: 257 PTVYQERNEHNLMRDFQEEIPGYIHNKAIAKTLENLKLTPGLHKLSENLLVCYEALVSMG 316

Query: 429 FWTERDVKFTAAWLQDL 445
           F  ++++    AWL DL
Sbjct: 317 FIDKQELNLAQAWLDDL 333


>gi|219117235|ref|XP_002179412.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409303|gb|EEC49235.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 842

 Score =  126 bits (317), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 102/368 (27%), Positives = 173/368 (47%), Gaps = 36/368 (9%)

Query: 95  DKSSVYSRFRSE-----KWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNW----- 144
           +KS++ S F        +W VV+    P +S+  + K++ W ++ IG+++TP        
Sbjct: 97  NKSTLGSSFSKNFKDCLQWAVVTTIFEPGESIYGVSKLRNWCLVIIGDTKTPDAAYADLN 156

Query: 145 NLKGAIFLSL-DMQANLGFRVL-DFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIG 202
           +L   I+LS  D    LG       LP+ S+ RK+ GYLFAI+HGA+ I+D DD   +  
Sbjct: 157 SLDNVIYLSARDQMLFLGKSPFGQILPFQSFARKNLGYLFAIRHGAQVIYDFDDDNVLQK 216

Query: 203 DDLGKHFDVELVGEGARQETILQYSHENPNRTI-VNPYVHFGQR--SVWPRGLPLENV-- 257
            + G+  +     +G + ++IL      PN  +  NP  + G    + WPRG PL+++  
Sbjct: 217 TENGESKEPFTYRQGMKSDSILVRFDPRPNLPLPFNPLPYMGPNVTNPWPRGFPLQDLTT 276

Query: 258 GEISHEEFYTEVFG----GKQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKV 313
                +   + VFG     +  + Q + +G PDVD+++  TR          F++   K+
Sbjct: 277 SNAGMQSDPSLVFGSIPVSRIGVIQSVCDGDPDVDAIWRMTRD-----LPFGFEEDSQKL 331

Query: 314 ALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEI------GGY 367
            +        N+  T++  ++FWA+ LP SV    +D+ R +  QRL  +I       G 
Sbjct: 332 LVASKTFASYNAQATVHLQNSFWAMFLPFSVPGRVTDIWRAYVAQRLFRDINLSLVYAGP 391

Query: 368 VVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEE 427
           +V +  T H Y       F  E+DL++    L+  L  W S+      K+  L  ++ E 
Sbjct: 392 LVTHTRTAHNY----LADFQAEQDLYMKTNPLLGLLDGWESDSTSLPGKLEALYVALYEH 447

Query: 428 GFWTERDV 435
           G+    DV
Sbjct: 448 GYVGLVDV 455


>gi|298711676|emb|CBJ32728.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 613

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 97/323 (30%), Positives = 143/323 (44%), Gaps = 27/323 (8%)

Query: 181 LFAIQHGAKKIFDADDRGDVIGDDLG-KHFDVELVGEGARQETILQYSHENPNRTIVNPY 239
           ++AI HGA+ I+D DDR  ++    G  H DV   G    +  + ++ H + +  + NPY
Sbjct: 1   MYAIHHGAEVIYDVDDRNALVDPQQGVPHSDVSSSG----KPDVFRF-HSDESAIVHNPY 55

Query: 240 VHFGQRSV-WPRGLPLENVGEISHEEFYTEVFGGKQFIQ--QGISNGLPDVDSVFYFTRK 296
             FG   V WPRG PL  V  +      +E     Q I   Q ++N  PDVD+++  T  
Sbjct: 56  PCFGAPGVVWPRGFPLNKVQLVDSSTCSSEGAMDSQVIGVVQALANHDPDVDAIYRMTYP 115

Query: 297 PSLEAFDIRFDDRVPKV-----ALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDV 351
           P    F    +D          A+P     P N+  T++   AFW L+LP +V    SD 
Sbjct: 116 PGGLPFSFVAEDSSKAETRNLRAVPASAFTPYNAQATLHFQVAFWGLLLPTTVDGRVSDT 175

Query: 352 LRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAY--PFSEEKDLHVNVGRLIKFLVSWR-- 407
            R ++ Q LL  +G      P  V +      Y   F  E  L+   G L++FL+ +R  
Sbjct: 176 WRSYFTQALLPAVGAVAAFSPGWVEQVGNPRNYLADFKAEFPLYQRSGALVEFLLQYRDL 235

Query: 408 -SNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRAS 466
            SN      ++  L+ +M E G   + DV    AWL+DL   GY  P     E D    S
Sbjct: 236 VSNASALPLEIEALAVAMYEYGIVEDEDVALMQAWLEDLRDAGYAFP-----EYDMQHQS 290

Query: 467 IGHG---DRKEFVPRKLPSVHLG 486
              G    +   V  KLP++ +G
Sbjct: 291 TAAGVARQQHTSVDEKLPALQIG 313


>gi|86748655|ref|YP_485151.1| hypothetical protein RPB_1530 [Rhodopseudomonas palustris HaA2]
 gi|86571683|gb|ABD06240.1| conserved Hypothetical protein ZK105.3 [Rhodopseudomonas palustris
           HaA2]
          Length = 381

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 102/383 (26%), Positives = 176/383 (45%), Gaps = 42/383 (10%)

Query: 82  LPVINWNSIQPIADKSSVYSRFRSEKWIVVSVDRYPTDSLKKLVK---IKGWQVLAIGNS 138
           LP I+ +S + +  +  ++      + I+V+    P   +K + K     G+  + +G++
Sbjct: 26  LPEISPDSQRSLHFQPLLHGSAEMNQAIIVTSINAPNPVMKAIAKDANPAGFDFIVVGDT 85

Query: 139 RTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRG 198
           +TP  + + G  FLS+D Q + G +     P  SY RK+ GYL AI  GA+ I + DD  
Sbjct: 86  KTPDGFAIDGCRFLSIDEQLSSGLKYARVAPMASYARKNVGYLSAISRGAQMIAETDD-- 143

Query: 199 DVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVG 258
               D+  +    E   E  R++T+   +        VN Y +F   ++WPRGLPL+++ 
Sbjct: 144 ----DNFPRPAFFE---ERRRRQTVPTVAGAG----WVNAYRYFSDSNIWPRGLPLDHIQ 192

Query: 259 EISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPK-VALPQ 317
               E     V      IQQG+++  PDVD+++         A  +  + R  + VA  +
Sbjct: 193 RAVPEWEALPVGDVDSPIQQGLADENPDVDAIYRL-------ALTLPQNFRTDRTVAFGE 245

Query: 318 GMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTV-- 375
           G   P NS NT +   AF  + LP + +   +D+ R    QR+ W+ G +++ + PTV  
Sbjct: 246 GAWCPFNSQNTSWWPDAFPLMYLPATCNFRVTDIWRSLIAQRIAWQNGWHILFHGPTVWQ 305

Query: 376 HRYDKIEAYPFSEEKDLHVNVGRLIKFL---------VSWRSNKHRFFEKVLELSHSMAE 426
            R +      F +E   ++N  R+   L          +   + HR +E +L L      
Sbjct: 306 DRNEHDLMADFEDEIPGYLNNHRIRLMLEQLPLQGGVANIAHDLHRCYEAMLGL------ 359

Query: 427 EGFWTERDVKFTAAWLQDLIAVG 449
            G  T  ++    AW++D+  VG
Sbjct: 360 -GLVTAAEMTLLEAWIEDIQRVG 381


>gi|323447189|gb|EGB03128.1| hypothetical protein AURANDRAFT_34450 [Aureococcus anophagefferens]
          Length = 467

 Score =  121 bits (304), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 101/378 (26%), Positives = 161/378 (42%), Gaps = 23/378 (6%)

Query: 105 SEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKN-------WNLKGAIFLSLDMQ 157
            E+  VV+    P++++ ++    GW ++ +G+ +T                 +LS   Q
Sbjct: 71  CERCGVVTTINPPSEAILRVGNASGWCLVVVGDRKTADGPYEALAAAAPATVAYLSAAAQ 130

Query: 158 ANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEG 217
             L + +    P+D + RK+ GYL+AI  GA  IFD DD  +V+   LG           
Sbjct: 131 ETLPYGLASATPWDHFARKNLGYLYAIHAGAATIFDFDD-DNVL---LGAPPASAGAAAR 186

Query: 218 ARQETILQYSHENPNRTIVNPYV-HFGQRSVWPRGLPLENV-GEISHEEFYTEVFGGKQF 275
           A    +      +      N Y   FG    WPRGLPL+ + G  +              
Sbjct: 187 ASDPRLAAPDAPDAGSAFFNAYAASFGAEKAWPRGLPLDAINGPAAAAAADDPRAADDVV 246

Query: 276 IQQGISNGLPDVDSVFYFTRKPSLEAFDIRFD---DRVPKVALPQGMMVPVNSFNTIYQS 332
           + Q ++N  PDVD+++    + +L    + FD        V L  G + P N+  T++  
Sbjct: 247 VAQLLANHDPDVDAIYRLGPRAALP---LPFDFPSSHGRGVVLDGGAVCPFNAQATLFDR 303

Query: 333 SAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVH--RYDKIEAYPFSEEK 390
           +AFWAL+LP SV    +D+ RGF  QR+L   G  +   PP V   R D      +  E+
Sbjct: 304 AAFWALLLPASVHGRVADIWRGFVAQRVLRAAGLRLAFLPPGVTQLRNDHDALADYMSER 363

Query: 391 DLHVNVGRLIKFL--VSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAV 448
            L+     +++ L  V+           V +   ++ E G     DV +   WL DL AV
Sbjct: 364 PLYEKADAVVRVLDGVAPHRPGGSVARAVEDAYVALYEHGLVALDDVAYAQLWLADLYAV 423

Query: 449 GYQQPRLMSLELDRPRAS 466
           G   P+L   +   P A+
Sbjct: 424 GLALPKLRRHKARTPGAT 441


>gi|323447328|gb|EGB03254.1| hypothetical protein AURANDRAFT_34288 [Aureococcus anophagefferens]
          Length = 445

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 99/367 (26%), Positives = 157/367 (42%), Gaps = 23/367 (6%)

Query: 105 SEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKN-------WNLKGAIFLSLDMQ 157
            E+  VV+    P++++ ++    GW ++ +G+ +T                 +LS   Q
Sbjct: 71  CERCGVVTTINPPSEAILRVGNASGWCLVVVGDRKTADGPYEALAAAAPATVAYLSAAAQ 130

Query: 158 ANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEG 217
             L + +    P+D + RK+ GYL+AI  GA  IFD DD  +V+   LG           
Sbjct: 131 ETLPYGLASATPWDHFARKNLGYLYAIHAGAATIFDFDD-DNVL---LGAPPASAGAAAR 186

Query: 218 ARQETILQYSHENPNRTIVNPYV-HFGQRSVWPRGLPLENV-GEISHEEFYTEVFGGKQF 275
           A    +      +      N Y   FG    WPRGLPL+ + G  +              
Sbjct: 187 ASDPRLAAPDAPDAGSAFFNAYAASFGAEKAWPRGLPLDAINGPAAAAAADDPRAADDVV 246

Query: 276 IQQGISNGLPDVDSVFYFTRKPSLEAFDIRFD---DRVPKVALPQGMMVPVNSFNTIYQS 332
           + Q ++N  PDVD+++    + +L    + FD        V L  G + P N+  T++  
Sbjct: 247 VAQLLANHDPDVDAIYRLGPRAALP---LPFDFPSSHGRGVVLDGGAVCPFNAQATLFDR 303

Query: 333 SAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVH--RYDKIEAYPFSEEK 390
           +AFWAL+LP SV    +D+ RGF  QR+L   G  +   PP V   R D      +  E+
Sbjct: 304 AAFWALLLPASVHGRVADIWRGFVAQRVLRAAGLRLAFLPPGVTQLRNDHDALADYMSER 363

Query: 391 DLHVNVGRLIKFL--VSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAV 448
            L+     +++ L  V+           V +   ++ E G     DV +   WL DL AV
Sbjct: 364 PLYEKADAVVRVLDGVAPHRPGGSVARAVEDAYVALYEHGLVALDDVAYAQLWLADLYAV 423

Query: 449 GYQQPRL 455
           G   P+L
Sbjct: 424 GLALPKL 430


>gi|323446419|gb|EGB02587.1| hypothetical protein AURANDRAFT_68744 [Aureococcus anophagefferens]
          Length = 448

 Score =  120 bits (302), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 99/367 (26%), Positives = 157/367 (42%), Gaps = 23/367 (6%)

Query: 105 SEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKN-------WNLKGAIFLSLDMQ 157
            E+  VV+    P++++ ++    GW ++ +G+ +T                 +LS   Q
Sbjct: 71  CERCGVVTTINPPSEAILRVGNASGWCLVVVGDRKTADGPYEALAAAAPATVAYLSAAAQ 130

Query: 158 ANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEG 217
             L + +    P+D + RK+ GYL+AI  GA  IFD DD  +V+   LG           
Sbjct: 131 ETLPYGLASATPWDHFARKNLGYLYAIHAGAATIFDFDD-DNVL---LGAPPASAGAAAR 186

Query: 218 ARQETILQYSHENPNRTIVNPYV-HFGQRSVWPRGLPLENV-GEISHEEFYTEVFGGKQF 275
           A    +      +      N Y   FG    WPRGLPL+ + G  +              
Sbjct: 187 ASDPRLAAPDAPDAGSAFFNAYAASFGAEKAWPRGLPLDAINGPAAAAAADDPRAADDVV 246

Query: 276 IQQGISNGLPDVDSVFYFTRKPSLEAFDIRFD---DRVPKVALPQGMMVPVNSFNTIYQS 332
           + Q ++N  PDVD+++    + +L    + FD        V L  G + P N+  T++  
Sbjct: 247 VAQLLANHDPDVDAIYRLGPRAALP---LPFDFPSSHGRGVVLDGGAVCPFNAQATLFDR 303

Query: 333 SAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVH--RYDKIEAYPFSEEK 390
           +AFWAL+LP SV    +D+ RGF  QR+L   G  +   PP V   R D      +  E+
Sbjct: 304 AAFWALLLPASVHGRVADIWRGFVAQRVLRAAGLRLAFLPPGVTQLRNDHDALADYMSER 363

Query: 391 DLHVNVGRLIKFL--VSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAV 448
            L+     +++ L  V+           V +   ++ E G     DV +   WL DL AV
Sbjct: 364 PLYEKADAVVRVLDGVAPHRPGGSVARAVEDAYVALYEHGLVALDDVAYAQLWLADLYAV 423

Query: 449 GYQQPRL 455
           G   P+L
Sbjct: 424 GLALPKL 430


>gi|384245084|gb|EIE18580.1| hypothetical protein COCSUDRAFT_45356 [Coccomyxa subellipsoidea
           C-169]
          Length = 837

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 99/376 (26%), Positives = 163/376 (43%), Gaps = 47/376 (12%)

Query: 100 YSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQAN 159
           Y++  S   +VV ++  P +       +  W+  A   S+ P N      + L    Q  
Sbjct: 14  YAKLDSWALLVVELEGMPQNWRPAFDSL--WEASAGAASQAPPN-----LVLLDRTTQQQ 66

Query: 160 LGFRVLDFLPYDSYVR-KSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGE-- 216
           LGF        DS  R K+ G LFAI  GA  I +A++  +           VE  G+  
Sbjct: 67  LGFASGGC--SDSKARSKNIGSLFAIMCGADVIIEAEEGVE----------HVEAAGQLP 114

Query: 217 --GARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHE-----EFYTEV 269
              A     LQ +  +P+  ++NPY  FG   +WP   P   V   + E     +   + 
Sbjct: 115 LQAAASGPFLQ-AFGDPSSRLINPYALFGHPEIWPAVFPPAAVSNATFEFRKVQQPPDQD 173

Query: 270 FGGKQFIQQGISNGLPDVDSVFYFT----RKPSLEAFDIRFDDRVPKVALPQGMMVPVNS 325
              +  IQ  + N  P  D+V   T    + P       RF  +   + +  G   P+  
Sbjct: 174 GSYRPLIQSALVNDYPATDAVLGLTLLAHKGPQ------RFYSKPAAIGVQPGYFAPLGL 227

Query: 326 FNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVH-----RYDK 380
            +T+Y S AFW L++  + +   +   R  W Q+LLW +GG +++  P+       R  +
Sbjct: 228 GSTVYGSDAFWGLVMQGASNQALAPAWRSLWVQKLLWGVGGQLLILAPSARQNRTVRLLQ 287

Query: 381 IEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAA 440
           +EA    +E + +   G L+ FL  W  N++    K+L+L+  +   GFW++ +V    A
Sbjct: 288 LEAQ--GQEMEGYSKTGTLVDFLHHWEGNENILDLKMLQLARDLRSAGFWSQAEVDSMGA 345

Query: 441 WLQDLIAVGYQQPRLM 456
           W+ DL AVGY  P ++
Sbjct: 346 WVADLRAVGYVFPDVL 361


>gi|406958959|gb|EKD86436.1| hypothetical protein ACD_37C00283G0002 [uncultured bacterium]
          Length = 323

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 86/340 (25%), Positives = 157/340 (46%), Gaps = 27/340 (7%)

Query: 107 KWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLD 166
           K IV++    PT ++    K+K + +   G+++TPK W+ K   +LS+  Q     ++  
Sbjct: 4   KAIVITSIYPPTKAVLLFSKLKSFAMFVSGDNKTPKGWSHKNVHYLSISDQHKKFPKLSK 63

Query: 167 FLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQY 226
            +  + Y RK+  YL AI  G + +++ DD      D+L  +F    +      E I   
Sbjct: 64  LVSQNHYARKNFAYLSAILSGIEFLYETDD------DNLPYNFFPNFIDSEKNMEEI--- 114

Query: 227 SHENPNRTI-VNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLP 285
                N  +  N Y  F ++ VWPRG+PL  +     +    +V      IQQ +++  P
Sbjct: 115 -----NAPLSFNIYSEFTKKRVWPRGIPLNLIDNKISKRKKNKVIP---LIQQSLADLDP 166

Query: 286 DVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVS 345
           DVD+++  T        D+    +   + L  G   P NS NT +    F  L LP +V 
Sbjct: 167 DVDAIYRLTNG------DVITFAKGKILCLATGTFAPFNSQNTYWSKKVFPLLYLPSTVD 220

Query: 346 TMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAY--PFSEEKDLHVNVGRLIKFL 403
           +   D+ RG+  QR+LWE+   ++   P+V++   +  Y   F +E +L+     L+  L
Sbjct: 221 SRVCDIWRGYIAQRILWELNSRLIFLSPSVYQKRNVHDYMKDFVQELELYTKTEDLLITL 280

Query: 404 VSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQ 443
              +  K      ++++   + E+GF+ ++++     WL+
Sbjct: 281 NKIKL-KGNIDVMLIDIYSLLIEKGFFKKKELSILREWLR 319


>gi|375255451|ref|YP_005014618.1| hypothetical protein BFO_1748 [Tannerella forsythia ATCC 43037]
 gi|363408574|gb|AEW22260.1| hypothetical protein BFO_1748 [Tannerella forsythia ATCC 43037]
          Length = 331

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 84/341 (24%), Positives = 153/341 (44%), Gaps = 28/341 (8%)

Query: 111 VSVDRYPTDS-LKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLP 169
           ++ D++P  S   +   +   + + IG+ ++P  ++L G  F S++ Q  + + +   LP
Sbjct: 11  IASDKHPVLSRFAQEAALHSVRFMVIGDKKSP-TFHLDGCDFFSIERQCVMPYTLARLLP 69

Query: 170 YDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHE 229
           +  Y RK+ GYL A +HGA+ I + DD             D        R+   +  +H 
Sbjct: 70  FGHYARKNLGYLEAARHGAEIIIETDD-------------DNYPETCFWRERNKMVTAHC 116

Query: 230 NPNRTIVNPYVHFGQRSVWPRGLPLENV-GEISHEEFYTEVFGGKQFIQQGISNGLPDVD 288
              +  VN Y ++ +  VWPRG  LE++  E+   E   ++      IQQG+++  PDVD
Sbjct: 117 LKEKGWVNMYGYYTRSIVWPRGFALEHIQSELPELEPLQKILAP---IQQGLADLNPDVD 173

Query: 289 SVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMA 348
           +++  T     +   + F     ++AL  G + P NS NT +   AF  + LP   S   
Sbjct: 174 AIYRLT-----QPLPVSFQKEPKRIALGHGSICPFNSQNTTWFREAFPLMYLPSYCSFRM 228

Query: 349 SDVLRGFWGQRLLWEIGGYVVVYPPTV--HRYDKIEAYPFSEEKDLHVNVGRLIKFLVS- 405
           +D+ R F  QR+ W  G  ++ +  TV   R +      F +E   + N   ++  L+  
Sbjct: 229 TDIWRSFVAQRIAWTCGWNILFHEATVWQERNEHAIIKDFKDEISGYCNNREIMDRLMQL 288

Query: 406 -WRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDL 445
             +       E ++    +  E     E+++    AW+ D+
Sbjct: 289 DLKEGVEAIPENLIRCYRAFVEMSLIEEKELTLLDAWITDI 329


>gi|436834157|ref|YP_007319373.1| hypothetical protein FAES_0769 [Fibrella aestuarina BUZ 2]
 gi|384065570|emb|CCG98780.1| hypothetical protein FAES_0769 [Fibrella aestuarina BUZ 2]
          Length = 334

 Score =  110 bits (274), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 91/330 (27%), Positives = 151/330 (45%), Gaps = 33/330 (10%)

Query: 129 GWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGA 188
           G + + +G++++P  ++L G  F S+D Q  L F +++ LP   Y RK+ GYL A+Q GA
Sbjct: 30  GVKFVVMGDTKSPTQFDLSGCDFWSIDRQLTLPFSLVENLPTRHYGRKNLGYLVAMQQGA 89

Query: 189 KKIFDADDRGDVIGDDLGKHFDVELVGEG---ARQETILQYSHENPNRTIVNPYVHFGQR 245
           + I + DD      D+  +        EG    RQ T  Q +H        N Y +F  +
Sbjct: 90  QVIIETDD------DNFPR--------EGFWTNRQRT--QPAHSLTQTGWTNVYKYFTDK 133

Query: 246 SVWPRGLPLENVGE-ISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLEAFDI 304
            +WPRG  LE++ + +      +EV      IQQG+++  PDVD+++  T         +
Sbjct: 134 HIWPRGYALEHLHDTLPDLPGLSEVVCP---IQQGLADENPDVDAIYRLTLP-----LPL 185

Query: 305 RFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEI 364
            F+ R   VAL  G     NS NT +   AF  L LP   S   +D+ R +  QR+ W  
Sbjct: 186 NFEQR-DSVALGDGAWCAFNSQNTTWFPEAFPLLYLPSHCSFRMTDIWRSYVAQRVAWTC 244

Query: 365 GGYVVVYPPTV--HRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWR--SNKHRFFEKVLEL 420
           G  ++ +  TV   R +      F +E   +    ++   L +           E +L  
Sbjct: 245 GWSILFHNATVWQERNEHNLMRDFEDEVSGYTQNRQICLDLAALDLPEGTEHIHENLLTC 304

Query: 421 SHSMAEEGFWTERDVKFTAAWLQDLIAVGY 450
              + E+G+  + ++    AW+ DL  +G+
Sbjct: 305 YRLLTEKGYVGKAEMPLVEAWVADLRKLGF 334


>gi|341897241|gb|EGT53176.1| hypothetical protein CAEBREN_15029 [Caenorhabditis brenneri]
          Length = 473

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 73/241 (30%), Positives = 119/241 (49%), Gaps = 24/241 (9%)

Query: 247 VWPRGLPLENVGEISHEEFYTEVFGGKQ---FIQQGISNGLPDVDSVFYFTRKPSLEAFD 303
           +WPRG PLE++ + ++E   ++V   K     +QQG+ +  PDVD+++      S    D
Sbjct: 4   MWPRGFPLEHIEKHTNEN-SSQVLCYKMKRAAVQQGLVHHDPDVDAIYRLLHADSNSGLD 62

Query: 304 IRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWE 363
           ++F+   P + L  G   P NS NT++  SAF  L LP +VS   +D+ R F  Q++L  
Sbjct: 63  VKFNKFTPLITLSVGTYSPWNSQNTLFHKSAFHTLFLPTTVSFRTTDIWRSFISQKIL-H 121

Query: 364 IGGYVVVYPPT-------VHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNK---HRF 413
           + G  V + PT        H Y K     F +E  ++ + G++I+FL  W+ +    +  
Sbjct: 122 LSGLTVSFVPTNAVQFRNAHDYLK----DFKDENQVYEDSGKMIEFLHKWKCSNESSNSL 177

Query: 414 FEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRL-MSLELDRPRASIGHGDR 472
            E + +LS  M   G W   D +    +L DL ++G    R+ +  EL  P+     G R
Sbjct: 178 EECINQLSDDMVINGLWGVEDSELMKMFLSDLKSMG----RINLEFELVDPKEDEEQGLR 233

Query: 473 K 473
           K
Sbjct: 234 K 234


>gi|117923426|ref|YP_864043.1| hypothetical protein Mmc1_0108 [Magnetococcus marinus MC-1]
 gi|117607182|gb|ABK42637.1| conserved hypothetical protein [Magnetococcus marinus MC-1]
          Length = 340

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 92/338 (27%), Positives = 150/338 (44%), Gaps = 20/338 (5%)

Query: 120 SLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCG 179
           +L +  +  G+  +  G+S++P ++ L G  FLSL+ Q   GFR+    P   Y RK+  
Sbjct: 19  ALAQGCQAAGYDFILAGDSKSPDSFALDGCHFLSLEQQRQSGFRLGLSSPIKHYARKNIA 78

Query: 180 YLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPY 239
           YL AI  G + I + DD      D+  +              T+ Q    N     + P 
Sbjct: 79  YLQAIAQGTQCILETDD------DNWPRAAFFAPRSRMVETVTVQQPGWLNVYGLFLQPD 132

Query: 240 VHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSL 299
            H     +WPRGLPL+ V +       T +      IQQG+++  PDVD+++  T  P  
Sbjct: 133 DH--ALPLWPRGLPLDAVRQSLPP--LTAMQSVDCPIQQGLADENPDVDAIYRLTL-PLP 187

Query: 300 EAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQR 359
             F     DR  ++AL +G+  P NS NT++   AF  L LP + S   +D+ R F  QR
Sbjct: 188 RNF---IADR--QIALGEGVWSPFNSQNTLWWRDAFPLLYLPATCSFRMTDIWRSFVAQR 242

Query: 360 LLWEIGGYVVVYPPTV--HRYDKIEAYPFSEEKDLHVNVGRLIKFL--VSWRSNKHRFFE 415
           L W  G  V+ + PTV   R +      F +E   +++   +   L  ++  +      +
Sbjct: 243 LAWSCGWRVLFFSPTVWQERNEHDLNRDFQDEVPGYLHNAAIAAGLAQLNLPTGTAHLLD 302

Query: 416 KVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQP 453
            +      + E+G     ++     W+ DL   G++ P
Sbjct: 303 NLHTCYAWLVEQGHMQPLELSLLQDWIFDLTQCGWKAP 340


>gi|308471344|ref|XP_003097903.1| hypothetical protein CRE_12969 [Caenorhabditis remanei]
 gi|308239208|gb|EFO83160.1| hypothetical protein CRE_12969 [Caenorhabditis remanei]
          Length = 582

 Score =  105 bits (261), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 67/206 (32%), Positives = 101/206 (49%), Gaps = 14/206 (6%)

Query: 276 IQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAF 335
           +QQG+ +  PDVD+V+      S    D++F+   P + L  G   P NS NT++  SAF
Sbjct: 6   VQQGLVHHDPDVDAVYRLLNADSNSGLDVKFNKFAPPITLSVGTYSPWNSQNTLFHKSAF 65

Query: 336 WALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSE 388
             L LP +VS   +D+ R F  Q++L  + G  V + PT        H Y K     F +
Sbjct: 66  HTLFLPTTVSFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAIQFRNAHDYLK----DFKD 120

Query: 389 EKDLHVNVGRLIKFLVSWRSNKHRFFEKVL-ELSHSMAEEGFWTERDVKFTAAWLQDLIA 447
           EK ++ + G++I FL  W   K    E  + EL   + E   W E D K    +L DL +
Sbjct: 121 EKQVYEDSGKIIDFLNGWNCLKVINLEDCINELLEDLVENNLWGEDDSKLMKLFLNDLKS 180

Query: 448 VGYQQPRLMSLELDRPR-ASIGHGDR 472
           +G++ P L+  + + P  AS    DR
Sbjct: 181 MGFKYPDLIGEKYEDPYIASDNETDR 206


>gi|341886636|gb|EGT42571.1| hypothetical protein CAEBREN_32781 [Caenorhabditis brenneri]
          Length = 556

 Score = 99.4 bits (246), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 87/371 (23%), Positives = 152/371 (40%), Gaps = 66/371 (17%)

Query: 276 IQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAF 335
           +QQG+ +  PDVD+++           D++F+     + L  G   P NS NT++  SAF
Sbjct: 6   VQQGLVHHDPDVDAIYRLLHADQNTGLDVKFNKFASPITLSVGTYSPWNSQNTLFHKSAF 65

Query: 336 WALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSE 388
             L LP +VS   +D+ R F  Q++L  + G  V + PT        H Y K     F +
Sbjct: 66  HTLFLPTTVSFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAVQFRNAHDYLK----DFKD 120

Query: 389 EKDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAV 448
           EK ++ + G++I+FL +W+       E + +L   +  E  W E D+K    +L DL  +
Sbjct: 121 EKQVYEDSGKMIEFLHNWKCLNGTLEECIYKLLTDLVAENLWGEEDLKLMRMFLSDLKTL 180

Query: 449 GYQQPRLMSLELDRPRASIGHGDRKEFVPRKLPSVHLGVEETGTVSYE------IGNLIR 502
           G+  P+++      P +   +   ++   R+   ++L  +      YE       G+L++
Sbjct: 181 GFIFPKIIKNRYIDPYSPSTNETTRDVNCRR---INLEFDLVDPREYEQQKLNYFGHLVK 237

Query: 503 WRKNFG--------------------------NVVLIMFCSGPVERTALEWRLLYGRIFK 536
           W    G                          N VLI+  + P +      + LY   F 
Sbjct: 238 WCNESGYPTKSFPSPEQLEEQHADTYVLQKDLNSVLILVNNYPWKYGMGLLQRLYQPYFA 297

Query: 537 TVIILS----EQ-KNEDLAVEAGQLEQVYRHLPKIFSRY--------------TSAEGFL 577
            VI       EQ +N++       +  ++ +  +    Y               + EG+ 
Sbjct: 298 AVIFCGPWYPEQFQNDNYTSLVHPVNYIHFNPAENHRGYFCYHCMTLVKEMGLQNVEGYF 357

Query: 578 FLQDDTILNYW 588
           F+ DDT+ N W
Sbjct: 358 FVADDTVFNMW 368


>gi|323447327|gb|EGB03253.1| hypothetical protein AURANDRAFT_68172 [Aureococcus anophagefferens]
          Length = 408

 Score = 95.5 bits (236), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 89/364 (24%), Positives = 139/364 (38%), Gaps = 57/364 (15%)

Query: 105 SEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKN-------WNLKGAIFLSLDMQ 157
            E+  VV+    P++++ ++    GW ++ +G+ +T                 +LS   Q
Sbjct: 71  CERCGVVTTINPPSEAILRVGNASGWCLVVVGDRKTADGPYEALAAAAPATVAYLSAAAQ 130

Query: 158 ANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEG 217
             L + +    P+D + RK+ GYL+AI  GA  IFD DD  +V+   LG           
Sbjct: 131 ETLPYGLASATPWDHFARKNLGYLYAIHAGAATIFDFDD-DNVL---LGAPPASAGAAAR 186

Query: 218 ARQETILQYSHENPNRTIVNPY-VHFGQRSVWPRGLPLENV-GEISHEEFYTEVFGGKQF 275
           A    +      +      N Y   FG    WPRGLPL+ + G  +              
Sbjct: 187 ASDPRLAAPDAPDAGSAFFNAYAASFGAEKAWPRGLPLDAINGPAAAAAADDPRAADDVV 246

Query: 276 IQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAF 335
           + Q ++N  PD                                         T++  +AF
Sbjct: 247 VAQLLANHDPDA----------------------------------------TLFDRAAF 266

Query: 336 WALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVH--RYDKIEAYPFSEEKDLH 393
           WAL+LP SV    +D+ RGF  QR+L   G  +   PP V   R D      +  E+ L+
Sbjct: 267 WALLLPASVHGRVADIWRGFVAQRVLRAAGLRLAFLPPGVTQLRNDHDALADYMSERPLY 326

Query: 394 VNVGRLIKFL--VSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQ 451
                +++ L  V+           V +   ++ E G     DV +   WL DL AVG  
Sbjct: 327 EKADAVVRVLDGVAPHRPGGSVARAVEDAYVALYEHGLVALDDVAYAQLWLADLYAVGLA 386

Query: 452 QPRL 455
            P+L
Sbjct: 387 LPKL 390


>gi|341886559|gb|EGT42494.1| hypothetical protein CAEBREN_29347 [Caenorhabditis brenneri]
          Length = 697

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 50/138 (36%), Positives = 76/138 (55%), Gaps = 5/138 (3%)

Query: 84  VINWNSIQPIADKS--SVYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTP 141
           +I+W  I P  +K+  S+       KWIVV+    PT+ +K+L     W ++ + +++TP
Sbjct: 19  IISW--IYPSKNKTIQSIAPVKNGNKWIVVTSISSPTNDVKRLASFDDWNLVVVADTKTP 76

Query: 142 KNWNLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVI 201
            +W L+   FLS++ Q  L F ++  LPY SY RK+ GYL+AI HGA+ I+D DD     
Sbjct: 77  LDWKLENVHFLSVEYQNQLPFSLVSSLPYKSYTRKNIGYLYAISHGAEWIYDTDDDNKPF 136

Query: 202 GDDLGKHFDVELVGEGAR 219
              L + F  E    G R
Sbjct: 137 DKGLNQ-FQYEDTVSGVR 153



 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 61/231 (26%), Positives = 104/231 (45%), Gaps = 25/231 (10%)

Query: 298 SLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWG 357
           S    D++F++  P + L  G   P NS NT++  SAF  L LP +VS   +D+ R F  
Sbjct: 170 SKTGLDVKFNEFAPPITLSVGTYSPWNSQNTLFHKSAFHTLFLPTTVSFRTTDIWRSFIS 229

Query: 358 QRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNK 410
           Q++L  + G  V + PT        H Y K     F +EK ++ + G++IKFL  W+ + 
Sbjct: 230 QKIL-HLSGLTVSFVPTNAVQFRNAHDYLK----DFKDEKQVYEDSGKMIKFLHEWKCSN 284

Query: 411 ---HRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASI 467
              +     + EL + +  E  W ++D +    +L DL +VG++ P ++      P +  
Sbjct: 285 AISNNLENCIYELMNELVVENLWGKKDSELMKMFLNDLKSVGFEFPVMVGESYRDPYSPS 344

Query: 468 GHGDRKEFVPRKL--------PSVHLGVEETGTVSY--EIGNLIRWRKNFG 508
            +   ++   R++        P  H    +   V      GNL+ W    G
Sbjct: 345 TNETSRDVNCRRMNLEFELIDPKEHHRKNKKRAVQKLNYFGNLVEWCNETG 395


>gi|308509514|ref|XP_003116940.1| hypothetical protein CRE_01640 [Caenorhabditis remanei]
 gi|308241854|gb|EFO85806.1| hypothetical protein CRE_01640 [Caenorhabditis remanei]
          Length = 565

 Score = 90.1 bits (222), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 63/198 (31%), Positives = 93/198 (46%), Gaps = 25/198 (12%)

Query: 275 FIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSA 334
           F  + ISNGL + DS             D++F+   P +AL  G   P NS NT++  SA
Sbjct: 9   FDLKSISNGLLNADSN---------SGLDVKFNKFAPPIALSVGTFSPWNSQNTLFHKSA 59

Query: 335 FWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFS 387
           F  L LP +VS   +D+ R F  Q++L  + G  V + PT        H Y K     F 
Sbjct: 60  FHTLFLPTTVSFRTTDIWRSFILQKIL-HLSGLTVSFVPTNTIQFRNAHDYLK----DFK 114

Query: 388 EEKDLHVNVGRLIKFLVSWRSNKHRFFEKVL-ELSHSMAEEGFWTERDVKFTAAWLQDLI 446
            EK ++ + G++I+FL  W+ +K    E  +  LS  + E   W E D K    +L DL 
Sbjct: 115 NEKQVYEDSGKIIEFLNDWKCSKDINLEDCINNLSEDLVENNLWGEDDSKLIKLFLNDLK 174

Query: 447 AVGYQQPRLMSLELDRPR 464
           ++G      +  EL  P+
Sbjct: 175 SMGRMN---LEFELIDPK 189


>gi|341902699|gb|EGT58634.1| hypothetical protein CAEBREN_24535 [Caenorhabditis brenneri]
          Length = 632

 Score = 89.7 bits (221), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 85/356 (23%), Positives = 146/356 (41%), Gaps = 77/356 (21%)

Query: 301 AFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRL 360
             DI+F+   P + L  G   P NS NT++  SAF  L LP +VS   +D+ R F  Q++
Sbjct: 109 GLDIKFNKFAPPITLSVGTYSPWNSQNTLFHKSAFHTLFLPTTVSFRTTDIWRSFISQKI 168

Query: 361 LWEIGGYVVVYPPT-------VHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKHRF 413
           L  + G  V + PT        H Y K     F +EK ++ + GR+I+FL  W+  K + 
Sbjct: 169 L-HLSGLTVSFVPTNAVQFRNAHDYLK----DFKDEKQVYEDSGRMIEFLHGWKCQK-KI 222

Query: 414 FEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPRASIGHGDRK 473
            + ++ L+  +  E  W+E D +    ++ DL  +G++ P L++     P +   +   +
Sbjct: 223 EDCMVLLAKDLVTEELWSEEDSELLEMFITDLKLMGFEFPELVTENYQDPYSPSTNESSR 282

Query: 474 EFVPRKLPSVHLGVEETGTVSYE-------------IGNLIRWR---------------- 504
           +   R++   +L  E      Y+              G+L+ W                 
Sbjct: 283 DVNCRRM---NLEFELVDPREYDEQNLKKAVQKLNYFGDLVDWCNETGHSNLSQSFPSPE 339

Query: 505 ------------KNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVE 552
                       + + N VLI+  + P +      + LY   F TVI       E++  +
Sbjct: 340 QLKNEHDNSVVLQKYSNSVLILVNNYPWQYGMGLLQRLYQPYFATVIFCGSWYPENIIDQ 399

Query: 553 AGQLEQV----YRHL-PK-----IFSRY----------TSAEGFLFLQDDTILNYW 588
                 +    Y HL P+      FS +          ++ EG+ F+ DDT+ N W
Sbjct: 400 DNYTSTLHPINYIHLNPEENHRGYFSYHCLTLVKEMGLSNVEGYFFVADDTVFNIW 455


>gi|159474070|ref|XP_001695152.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158276086|gb|EDP01860.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 904

 Score = 88.6 bits (218), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 75/287 (26%), Positives = 127/287 (44%), Gaps = 20/287 (6%)

Query: 130 WQVLAIGNSRTPKNW--NLKGAIFLSLDMQANLGFRVLDFLPYDSYVRKSCGYLFAIQHG 187
           W    + + ++P ++  N  G + L++  Q  L + V D +P++ + RK+ G+++A  HG
Sbjct: 4   WCKCFVLDRKSPPDFQANGPGMVVLTVAAQEKLKWAVADRMPWNHFGRKNLGFVYAALHG 63

Query: 188 AKKIFDADDRGDVIGDDLGKHFDVELVGEGAR-QETILQYSHENPNRTIVNPYVHFGQRS 246
           A+ I+D DD   V+  D  +     L        E +  +        + NPY  +G   
Sbjct: 64  AEYIYDTDDDNFVLDGD-ARFLPRSLTPPAPDGPEGVQVHWPAGTGARVFNPYPFWGV-D 121

Query: 247 VWPRGLPLENV-------GEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSL 299
            WPRG PL  +       G +            +  + Q ++N  PDVD+V   T +   
Sbjct: 122 AWPRGFPLTMITNETTRSGALPVTAADAPQLPPRVCVLQSLANADPDVDAVHRLTGR--- 178

Query: 300 EAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQ-SSAFWALMLPVSVSTMASDVLRGFWGQ 358
               + F  R   +A P G   P N+  T++   +   AL LPV+V    SD+ R +  Q
Sbjct: 179 --LPLFFAPRRAWLAYPAGTYAPFNAQATLFDARALAAALALPVTVHGRVSDIWRSYIMQ 236

Query: 359 RLLWEIGGYVVVYPPTVHRYDKIEAY--PFSEEKDLHVNVGRLIKFL 403
           R +W++G  +    P V +Y     Y   F  E DL++    L++ L
Sbjct: 237 RAMWDMGCGLAFADPWVTQYRNAHKYLRDFQSELDLYLKTEGLLEVL 283


>gi|159476456|ref|XP_001696327.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158282552|gb|EDP08304.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 555

 Score = 85.5 bits (210), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 75/286 (26%), Positives = 121/286 (42%), Gaps = 30/286 (10%)

Query: 181 LFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSHENPNRTIVNPYV 240
           ++A+ HGA+ IFD DD      D+     D + +    +       +      T+ NPY 
Sbjct: 1   MYAVLHGAEFIFDTDD------DNFVLDGDAKFLPRSTKLPEGWTLNTPTTGATVFNPYP 54

Query: 241 HFGQRSVWPRGLPLENVGEIS-------HEEFYTEVFGGKQFIQQGISNGLPDVDSVFYF 293
           H+G  + WPRG PL  +  ++                  +    Q ++N  PDVD+++  
Sbjct: 55  HWGVDT-WPRGFPLTQITNVTTRTGVRPAASTNAPALPPRVCALQSLANADPDVDALYRL 113

Query: 294 TRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALML-PVSVSTMASDVL 352
           T         + F  +   +A P G   P N+  T++ + A  A +  PV+V    SD+ 
Sbjct: 114 T-----GGLPLYFPPQRAWLAYPAGTYAPFNAQATLFDARALSAALALPVTVHGRVSDIW 168

Query: 353 RGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAY--PFSEEKDLHVNVGRL------IKFLV 404
           R +  QR +W++G  +    P V +Y     Y   FS E DL++    L      + F  
Sbjct: 169 RSYIMQRAMWDLGCGLAFADPWVTQYRNAHKYLKDFSSELDLYLKTEGLLVVLNGLAFPP 228

Query: 405 SWR--SNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAV 448
            WR  S   +   +VL L  ++ E G     DV    A+L DL A+
Sbjct: 229 DWRFASPDAQLAGRVLALYVALYEHGLLEVEDVLMVHAYLSDLGAL 274


>gi|321478604|gb|EFX89561.1| hypothetical protein DAPPUDRAFT_95105 [Daphnia pulex]
          Length = 1228

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 82/307 (26%), Positives = 125/307 (40%), Gaps = 62/307 (20%)

Query: 174 VRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQYSH----- 228
            R++ GYL+AIQHGA+ IFDA             +   ++  E  R+E   Q  +     
Sbjct: 9   ARRNAGYLYAIQHGARHIFDAYPE---------TYTSAKIPLETFRREMFRQLQNVALNV 59

Query: 229 ------ENPN-RTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGIS 281
                 E P  + + NPY HFG+  +W  G    N                ++FI   I 
Sbjct: 60  ALGVVSERPYVKRVQNPYAHFGRPDLWTEGFRRNN----------------QRFIHNHIY 103

Query: 282 NGLPDVDSVFYFTRKPSLEAF-DIR---FDDRVPKVALPQGMMVPVNSFNTIYQSSAFWA 337
                        R PS+E F DI    FD   P + LP   + P +S NT+Y   AFW 
Sbjct: 104 R--------ICEVRPPSIEKFLDIDEDYFDWAAPSLTLPGSTVAPFSSKNTLYSIEAFWG 155

Query: 338 LML-PVSVSTMASDV----LRGFWGQRLLWEIGGYV---VVYPPTVHRYDKIEAYPFSEE 389
           L+L P   S+ A  +    LR  W Q +L +I G +   +    T+ R +K    P    
Sbjct: 156 LVLFPTGNSSQAPTIPQHLLRTLWNQAVLGDIAGSLKLSLANVSTLQRREKSRKNP---- 211

Query: 390 KDLHVNVGRLIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVG 449
            DL    G L  FL+ W  +         +L  +M +  + + + +    +W+ DL    
Sbjct: 212 ADLREPDG-LASFLLKWTCSTRSSLSCTRDLFQTMFQLHYISFKSLGVLQSWINDLKRSH 270

Query: 450 YQQPRLM 456
           Y +P ++
Sbjct: 271 YLEPPVI 277


>gi|341896205|gb|EGT52140.1| hypothetical protein CAEBREN_02655 [Caenorhabditis brenneri]
          Length = 639

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 61/198 (30%), Positives = 95/198 (47%), Gaps = 26/198 (13%)

Query: 276 IQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAF 335
           +QQG+ +  PDVD+++      S    D++F+   P + L  G   P NS NT++  SAF
Sbjct: 84  VQQGLVHHDPDVDAIYRLLHADSSSGLDVKFNKFAPPITLSIGTYSPWNSQNTLFHKSAF 143

Query: 336 WALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSE 388
             L LP +VS   +D+ R F  Q++L  + G  V + PT        H Y K     F +
Sbjct: 144 HTLYLPTTVSFRTTDIWRSFISQKIL-HLSGLTVSFVPTNAVQFRNAHDYLK----DFKD 198

Query: 389 EKDLHVNVGRLIKFLVSWRSNKH--RFFEKVL-ELSHSMAEEGFWTERDVKFTAAWLQDL 445
           EK ++ + G++I+FL  W+ +       EK + +LS  M   G +           L +L
Sbjct: 199 EKQVYEDSGKMIEFLHKWKCSNESSNSLEKCINQLSDDMVINGLFK----------LPEL 248

Query: 446 IAVGYQQPRL-MSLELDR 462
           I   ++ P L  S E DR
Sbjct: 249 IKEVHEDPYLPSSNETDR 266


>gi|341896233|gb|EGT52168.1| hypothetical protein CAEBREN_09047 [Caenorhabditis brenneri]
          Length = 391

 Score = 75.9 bits (185), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 61/232 (26%), Positives = 102/232 (43%), Gaps = 49/232 (21%)

Query: 247 VWPRGLPLENV-----GEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFTRKPSLEA 301
           +WPRG PLE++     G  S    Y      +  +QQG+ +  PDVD+++          
Sbjct: 1   MWPRGFPLEHIEKHTNGNSSKVLCYQ---MKRAAVQQGLVHHDPDVDAIY---------- 47

Query: 302 FDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLL 361
                            ++   NS NT++   AF  L LP +VS   +D+ R F  Q++L
Sbjct: 48  ----------------RLLHAWNSQNTLFHKLAFHTLYLPTTVSFRTTDIWRSFISQKIL 91

Query: 362 WEIGGYVVVYPPT-------VHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWRSNKH--R 412
             + G  V +  T        H Y K     F +EK ++ + G++I+FL  W+ +     
Sbjct: 92  -HLSGLTVSFVSTNAVQFRNAHDYLK----DFKDEKQVYEDSGKMIEFLHKWKCSNESSN 146

Query: 413 FFEKVL-ELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRP 463
             EK + +LS  M     W   D +    +L DL ++G++ P L+  + + P
Sbjct: 147 SLEKCINQLSDDMVINDLWGTEDSELMKMFLSDLKSMGFKFPELIKEDYEDP 198


>gi|25396324|pir||B88989 protein F02C9.2 [imported] - Caenorhabditis elegans
          Length = 528

 Score = 73.2 bits (178), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 80/365 (21%), Positives = 139/365 (38%), Gaps = 69/365 (18%)

Query: 235 IVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFYFT 294
           + NPY  +G   +WPRG PL++                                      
Sbjct: 18  LFNPYRFYGMDQMWPRGFPLQHAD------------------------------------ 41

Query: 295 RKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRG 354
              S    +++F+   P + L  G   P NS NT++  SAF  L LP +VS   +D+ R 
Sbjct: 42  ---SRSGLNVKFNKFAPPITLSVGTYSPWNSQNTMFHKSAFHTLFLPTTVSFRTTDIWRS 98

Query: 355 FWGQRLLWEIGGYVVVYPPT-------VHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSWR 407
           F  Q++L  + G  V + PT        H Y K     F +E+ ++ + GR+I+FL +W 
Sbjct: 99  FISQKIL-HLSGLTVSFVPTNAVHFRNAHNYLK----DFKDEQQVYEDSGRIIEFLHNWN 153

Query: 408 SNKHRFFEK-VLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLMSLELDRPR-A 465
                  +  +++L++ + E      R  +    +  D+I     + R  S+ ++ P   
Sbjct: 154 CKTGSSIQSCIVQLANDLVE----ISRTSQEKLNYFGDIIKWC-NETRKSSVSINFPSPK 208

Query: 466 SIGHGDRKEFVPRKLPSVHLGVEETGTVSYEIGNLIRWRKNFGNVVLIMFCSG--PVERT 523
            +     K +V +K     L V       Y +G + R  + +     ++FC    P E +
Sbjct: 209 QLASLHEKSYVLKKHMDSVLIVVNNYPWKYGMGLIQRLYQPY--FATVIFCGSWYPAEFS 266

Query: 524 ALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDT 583
                        T+  ++        +E G        L K    +   EG+  + DDT
Sbjct: 267 DDT------NFTPTLFPINYIHMNPAEIEKGYFAYHCVTLAKELGLH-DVEGYFLVADDT 319

Query: 584 ILNYW 588
           + N W
Sbjct: 320 VFNIW 324


>gi|110669385|ref|YP_659196.1| protein transglucosylase [Haloquadratum walsbyi DSM 16790]
 gi|109627132|emb|CAJ53615.1| homolog to arabinopyranose mutase [Haloquadratum walsbyi DSM 16790]
          Length = 382

 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 63/225 (28%), Positives = 105/225 (46%), Gaps = 21/225 (9%)

Query: 163 RVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQET 222
           R+ ++LPY+S  R++ GYL A + GA  I   DD      DD+   F    VGE    ++
Sbjct: 84  RLDEYLPYNSIQRRNIGYLQASEAGADVIVSLDDDNLAQDDDIAGDFGT--VGE---TQS 138

Query: 223 ILQYSHENP--NRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQ-FIQQG 279
           +L+ S  N   N   +  Y     R ++ RG P     E   E+ Y+     +   I+ G
Sbjct: 139 VLEVSAPNNWYNSASMMEYEQESSRDIYHRGFPYSRRDE---EQGYSFTERNRMVMIRAG 195

Query: 280 ISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWAL- 338
           +   +PDVD + +  R P   +   RF++R+  VAL      PVN+ NT + +     + 
Sbjct: 196 LWLDVPDVDVITHLERGPRATSVKERFNNRL--VALDNETFCPVNTQNTAFHTDLMPLIH 253

Query: 339 MLPVSVSTMASDVLR------GFWGQRLLWEIGGYVVVYPP-TVH 376
            +P+       ++ R      GF+ +++L E+GG V    P ++H
Sbjct: 254 TIPMGDEVEGMEISRFDDIWLGFFAEKILQEMGGTVAYGSPVSIH 298


>gi|323446854|gb|EGB02873.1| hypothetical protein AURANDRAFT_68488 [Aureococcus anophagefferens]
          Length = 691

 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 43/143 (30%), Positives = 65/143 (45%), Gaps = 4/143 (2%)

Query: 328 TIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVH--RYDKIEAYP 385
           T++  +AFWAL+LP SV    +D+ RGF  QR+L   G  +   PP V   R D      
Sbjct: 2   TLFDRAAFWALLLPASVHGRVADIWRGFVAQRVLRAAGLRLAFLPPGVTQLRNDHDALAD 61

Query: 386 FSEEKDLHVNVGRLIKFL--VSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQ 443
           +  E+ L+     +++ L  V+           V +   ++ E G     DV +   WL 
Sbjct: 62  YMSERPLYEKADAVVRVLDGVAPHRPGGSVARAVEDAYVALYEHGLVALDDVAYAQLWLA 121

Query: 444 DLIAVGYQQPRLMSLELDRPRAS 466
           DL AVG   P+L   +   P A+
Sbjct: 122 DLYAVGLALPKLRRHKARTPGAT 144


>gi|323447188|gb|EGB03127.1| hypothetical protein AURANDRAFT_68275 [Aureococcus anophagefferens]
          Length = 151

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 45/150 (30%), Positives = 68/150 (45%), Gaps = 5/150 (3%)

Query: 328 TIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVH--RYDKIEAYP 385
           T++  +AFWAL+LP SV    +D+ RGF  QR+L   G  +   PP V   R D      
Sbjct: 2   TLFDRAAFWALLLPASVHGRVADIWRGFVAQRVLRAAGLRLAFLPPGVTQLRNDHDALAD 61

Query: 386 FSEEKDLHVNVGRLIKFL--VSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQ 443
           +  E+ L+     +++ L  V+           V +   ++ E G     DV +   WL 
Sbjct: 62  YMSERPLYEKADAVVRVLDGVAPHRPGGSVARAVEDAYVALYEHGLVALDDVAYAQLWLA 121

Query: 444 DLIAVGYQQPRLMSLEL-DRPRASIGHGDR 472
           DL AVG   P+L   +   RP   + +G R
Sbjct: 122 DLYAVGLALPKLRRHKARHRPVMMVKNGGR 151


>gi|453050261|gb|EME97807.1| hypothetical protein H340_24735 [Streptomyces mobaraensis NBRC
           13819 = DSM 40847]
          Length = 371

 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 88/356 (24%), Positives = 141/356 (39%), Gaps = 54/356 (15%)

Query: 129 GWQVLAIGNSRTPKNWNL-------KGAIFLSLDMQ------ANLGFRVLDFLPYDSYVR 175
           G +++ I + RTP  ++        +GA  LS D+       A LG  V + +PYDS  R
Sbjct: 29  GARLVVIPDRRTPAAFHAACDRARARGAAILSPDVAEQDRLLAKLG--VPELVPYDSDNR 86

Query: 176 KSCGYLFAIQHGAKKIFDADDRG-DVIGDDLGKHFDVELVGEG-ARQETILQYSH--ENP 231
           ++ GYL +  +G+      DD     +   L +H    +V EG AR  T+   S      
Sbjct: 87  RNIGYLLSYLNGSACAVSMDDDNLPAVSPFLDEH---RVVLEGPARHRTVSSPSGWFNCC 143

Query: 232 NRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVF 291
           +   V P        V PRG P     + +   +  E    +  +  G+  G PDVD+V 
Sbjct: 144 DLLDVTPC------RVHPRGFPYGPRTDPAAPTWTEETADVR--VNAGLWLGDPDVDAVT 195

Query: 292 YFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSA-----FWALMLPVSVST 346
               +P++ A+      R P   L +    PVNS NT     A     F  +  PV  + 
Sbjct: 196 RLAVRPTVTAY------RGPAAVLARDTWCPVNSQNTAVHRDALPAYYFLRMGQPVGGAP 249

Query: 347 MA--SDVLRGFWGQRLLWEIGGYVVVYPPTVHRYDKIEAYPFSEEKDLHVNVGRLIKFLV 404
           +    D+  G++       +G  V    P VH   +  A+    +    +   R +  L+
Sbjct: 250 LERFGDIFSGYFLAACTKHLGHSVRFGGPLVHH--ERNAHDLFADLTAELPAIRFMDELL 307

Query: 405 SW----RSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLM 456
            W    R +   + E    L+H + E   + E+      AW QD  A  ++   LM
Sbjct: 308 DWLREFRPDGSDYREAYASLAHGLRE---FAEQ--ARGPAWTQDARAFLHRSAHLM 358


>gi|297824115|ref|XP_002879940.1| hypothetical protein ARALYDRAFT_903495 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297325779|gb|EFH56199.1| hypothetical protein ARALYDRAFT_903495 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 67

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/67 (53%), Positives = 41/67 (61%), Gaps = 9/67 (13%)

Query: 1  MLVQDRTL---PKSPKSQIRT-SSH-----RFSDSKSLDFSTWVRDNLFKIVTVLLLIAT 51
          MLVQDR      K PKSQIR   +H     RFS+ K+LDFSTW  +NL +I    LLI T
Sbjct: 1  MLVQDRAASSPAKPPKSQIRELPTHQQIRRRFSEPKNLDFSTWFSENLSRIAVFSLLIVT 60

Query: 52 IAALSFL 58
          I AL FL
Sbjct: 61 IVALFFL 67


>gi|308471386|ref|XP_003097924.1| hypothetical protein CRE_12970 [Caenorhabditis remanei]
 gi|308239229|gb|EFO83181.1| hypothetical protein CRE_12970 [Caenorhabditis remanei]
          Length = 144

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 48/88 (54%), Gaps = 8/88 (9%)

Query: 90  IQPIADKSSVYSRFRSEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGA 149
           I P+AD        +  KWIVV+   YPT+ +K+L   + W ++ + +++TP +W L+  
Sbjct: 44  IIPVADVK------KGNKWIVVTSVNYPTEDVKRLSSFEEWNLVVVADTKTPVDWKLETV 97

Query: 150 IFLSLDMQAN--LGFRVLDFLPYDSYVR 175
            FLS+D Q +  LG    D+    S VR
Sbjct: 98  HFLSVDYQKHLRLGLNQFDYEDTVSGVR 125


>gi|323447594|gb|EGB03509.1| hypothetical protein AURANDRAFT_67942 [Aureococcus anophagefferens]
          Length = 229

 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 48/99 (48%), Gaps = 7/99 (7%)

Query: 105 SEKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKN-------WNLKGAIFLSLDMQ 157
            E+  VV+    P++++ ++    GW ++ +G+ +T                 +LS   Q
Sbjct: 71  CERCGVVTTINPPSEAILRVGNASGWCLVVVGDRKTADGPYEALAAAAPATVAYLSAAAQ 130

Query: 158 ANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADD 196
             L + +    P+D + RK+ GYL+AI  GA  IFD DD
Sbjct: 131 ETLPYGLASATPWDHFARKNLGYLYAIHAGAATIFDFDD 169


>gi|156406679|ref|XP_001641172.1| predicted protein [Nematostella vectensis]
 gi|156228310|gb|EDO49109.1| predicted protein [Nematostella vectensis]
          Length = 326

 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 44/95 (46%), Gaps = 3/95 (3%)

Query: 509 NVVLIMFCSGPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVY---RHLPK 565
           NVVL++    P   T    R  Y + F+ +I+   + NE L +     E  Y     L K
Sbjct: 48  NVVLVIIYHYPYYETLPIIRSFYEKAFRKIIVCGAEANETLGIMGVAHENGYWGYECLGK 107

Query: 566 IFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLWI 600
               Y   EG+L + DD +  +WN+   D+ K+W+
Sbjct: 108 AARDYPGYEGYLQIHDDILFQWWNVFSEDRTKIWL 142


>gi|449662267|ref|XP_004205507.1| PREDICTED: uncharacterized protein LOC101239808 [Hydra
           magnipapillata]
          Length = 363

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 54/111 (48%), Gaps = 6/111 (5%)

Query: 499 NLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTVIIL-SEQKNEDLAVEAGQLE 557
           N   + K F N+ LI+  + P   +      LYG IF+ V    S Q      V    + 
Sbjct: 71  NATCYSKYFNNIALIIVYNNPFYDSIPLLSELYGPIFQRVFFCGSIQAKSFTNVTVVNIH 130

Query: 558 QV---YRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQA--DKNKLWITDK 603
           +    Y  L +I   + S EG+L++ DD +LNYWNL++   + N +WI++ 
Sbjct: 131 RGLFGYECLAEIIRSHHSFEGYLYINDDVVLNYWNLIENKFNTNSIWISNN 181


>gi|357063961|gb|AET51853.1| transglycosylse [Marinactinospora thermotolerans]
          Length = 376

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 69/283 (24%), Positives = 102/283 (36%), Gaps = 45/283 (15%)

Query: 119 DSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVL------------- 165
           D L   ++  G +++ I       + N   A+F + +    LG  V+             
Sbjct: 22  DRLAPALRDAGARLIVI------PDRNTGPALFAACERHRRLGLDVVCPSVAEQQDLLER 75

Query: 166 ----DFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQE 221
               D +PY S  R++ GYL A   G   I   DD      DD  +   V  V +G R +
Sbjct: 76  LAVPDLIPYHSDNRRNVGYLMAWMEGFDVIVSMDDDNLPTTDDFVERHQV--VCQGPRTQ 133

Query: 222 TILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVF--GGKQFIQQG 279
            +   S    N   +   +      V+PRG P       +H +  T V        I  G
Sbjct: 134 PVTASSDGWFNNCAL---LEVEPTEVFPRGFPFH--ARPAHAQARTSVCERPADVRINAG 188

Query: 280 ISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSA----- 334
           +  G PDVD++     +P+  A           V L +G   PVNS NT     A     
Sbjct: 189 LWLGDPDVDAITRLAVRPNALAHSGG------SVVLAEGTWCPVNSQNTAVHRDALPAYY 242

Query: 335 FWALMLPVSVSTMA--SDVLRGFWGQRLLWEIGGYVVVYPPTV 375
           F  +  PV    M    D+  G++ Q     +G  V    P V
Sbjct: 243 FLRMGQPVDGVPMERFGDIFSGYFVQVCAQHLGHAVRFGDPVV 285


>gi|238059558|ref|ZP_04604267.1| Ata16 protein [Micromonospora sp. ATCC 39149]
 gi|237881369|gb|EEP70197.1| Ata16 protein [Micromonospora sp. ATCC 39149]
          Length = 383

 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 55/188 (29%), Positives = 74/188 (39%), Gaps = 24/188 (12%)

Query: 158 ANLGFRVLDFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRG-DVIGDDLGKHFDVELVGE 216
           A LG   L  +PYDS  R++ GYL + Q  A  +   DD    + GD L  H    +V  
Sbjct: 79  AGLGAPTL--IPYDSDNRRNVGYLLSWQSDADFLISVDDDNFPIDGDFLTAH---AVVAA 133

Query: 217 GARQETILQYSHE--NP-NRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGK 273
           G R   ++       NP  +  V P        V+PRG P  +       E  TE    +
Sbjct: 134 GPRPARVVTAESGWWNPCGQLTVAPM------PVYPRGFPYAHRSPTPTSE-RTETVDVR 186

Query: 274 QFIQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSS 333
             I  G+  G PDVD++     +P + A         P +    G   PVNS NT     
Sbjct: 187 --INAGLWLGDPDVDAITRIAVRPEVTAMP------APALVCDTGTWAPVNSQNTAVHRD 238

Query: 334 AFWALMLP 341
           A  A   P
Sbjct: 239 AIPAYYFP 246


>gi|294887539|ref|XP_002772156.1| UDP-glucose 4-epimerase, putative [Perkinsus marinus ATCC 50983]
 gi|239876102|gb|EER03972.1| UDP-glucose 4-epimerase, putative [Perkinsus marinus ATCC 50983]
          Length = 477

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 46/193 (23%), Positives = 80/193 (41%), Gaps = 28/193 (14%)

Query: 238 PYVHFGQRSVWPRGLPL-----ENVGEISHEEFYTEVFGGKQFIQQGISNGLPDVDSVFY 292
           P +   +  +WPRG PL     +     +      + +  +  + Q +++  PD D+++ 
Sbjct: 14  PGLDNAETVLWPRGYPLSYIRRDRATTTAKPSRTLDTWTREIAVVQTLADNDPDFDAIYR 73

Query: 293 FTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQS-SAFWALMLPVSVSTM---- 347
            TR   ++   +          L      P+N+   ++++  A W L LPV+VS      
Sbjct: 74  LTRPLPVDFHQLL----TSAFLLAPPTFTPLNAQACLFKAYDALWGLYLPVTVSIYPYSI 129

Query: 348 -----------ASDVLRGFWGQRLLWEIGGYVVVYPPT-VHRYDKIEAY--PFSEEKDLH 393
                       SD+ R F  QRLLW++G  V V   T V +      Y   F  E D++
Sbjct: 130 VWSHPEQVHGRVSDIWRSFVLQRLLWDLGASVAVAGRTWVRQLRNSHDYLADFIAEDDVY 189

Query: 394 VNVGRLIKFLVSW 406
                +++FLV W
Sbjct: 190 KKAEAMMRFLVGW 202


>gi|7504465|pir||T22803 hypothetical protein F56H6.10 - Caenorhabditis elegans
          Length = 609

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 37/158 (23%), Positives = 72/158 (45%), Gaps = 31/158 (19%)

Query: 309 RVPKVALPQGMMVPVNSFNTIYQSSAFWALMLPVSVSTMASDVLRGFWGQRLLWEIGGYV 368
           ++ + A+ QG++      + IY+++  W                R F  Q++L  + G  
Sbjct: 26  KMKRAAVQQGLVHHDPDVDAIYRTTDIW----------------RSFISQKIL-HLSGLT 68

Query: 369 VVYPPT-------VHRYDKIEAYPFSEEKDLHVNVGRLIKFLVSW---RSNKHRFFEKVL 418
           V +  T        H Y K     F  EK ++ + G++I+FL +W   R+N       + 
Sbjct: 69  VSFVSTNAVQFRNAHDYLK----DFKNEKQVYEDSGKMIEFLHNWNCTRNNSTVLENCIN 124

Query: 419 ELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLM 456
           +L   +A+E  W   D +    +L+DL ++G++ P+L+
Sbjct: 125 QLLVDLAKEKLWGSEDARLMGMYLEDLKSMGFKFPKLV 162


>gi|156392753|ref|XP_001636212.1| predicted protein [Nematostella vectensis]
 gi|156223313|gb|EDO44149.1| predicted protein [Nematostella vectensis]
          Length = 344

 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 3/100 (3%)

Query: 504 RKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQVY--- 560
           R  F  V LI+    P   +    R  Y   F  +I+   + N+   V     E+ Y   
Sbjct: 62  RPYFDTVALIIVYHYPYYESFPLLRSFYENGFDRIIVCGPEANDKFKVMQVSHEKGYWGY 121

Query: 561 RHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLWI 600
             L K    Y++ EG+L + DD +  +WN+   DKNK+W+
Sbjct: 122 ECLAKAARLYSNYEGYLQIHDDALFLWWNVKGTDKNKMWL 161


>gi|110669375|ref|YP_659186.1| hypothetical protein HQ3513A [Haloquadratum walsbyi DSM 16790]
          Length = 183

 Score = 45.4 bits (106), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 27/97 (27%), Positives = 47/97 (48%), Gaps = 9/97 (9%)

Query: 284 LPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSSAFWAL-MLPV 342
           +PDVD + +  R P   +   RF++R+  VAL      PVN+ NT + +     +  +P+
Sbjct: 1   MPDVDVITHLERGPRATSVKERFNNRL--VALDNETFCPVNTQNTAFHTDLMPLIHTIPM 58

Query: 343 SVSTMASDVLR------GFWGQRLLWEIGGYVVVYPP 373
                  ++ R      GF+ +++L E+GG V    P
Sbjct: 59  GDEVEGMEISRFDDIWLGFFAEKILQEMGGTVAYGSP 95


>gi|156352280|ref|XP_001622687.1| predicted protein [Nematostella vectensis]
 gi|156209284|gb|EDO30587.1| predicted protein [Nematostella vectensis]
          Length = 400

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 25/99 (25%), Positives = 49/99 (49%), Gaps = 3/99 (3%)

Query: 504 RKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTVIILSEQKNED---LAVEAGQLEQVY 560
           R  F +V+L++  + P   +   ++  Y  +F  +I      + +   + VE  +    Y
Sbjct: 117 RDVFSDVLLLIVFNYPYYESIKLFKSFYQPVFPHIIFCGPPDSSNKHVMNVEIFRGVLGY 176

Query: 561 RHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLW 599
             L +    +    G+L++ DD ILNYWNL+  +K+++W
Sbjct: 177 ECLGRAIREHPGYAGYLYINDDVILNYWNLVGFNKSQIW 215


>gi|443698350|gb|ELT98388.1| hypothetical protein CAPTEDRAFT_204969 [Capitella teleta]
          Length = 763

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 113/549 (20%), Positives = 195/549 (35%), Gaps = 78/549 (14%)

Query: 106 EKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVL 165
           ++W++V +  +P    +  +   GW +  +G      + + +   F S D Q       +
Sbjct: 64  KQWLIVQLTEHPEICTRLAMSFPGWTIALVGVKSFSDHSHSRCRYFSSQDAQDFWNSNRM 123

Query: 166 DFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQ 225
             L  ++       YL A++  A  I+  D   D+      +   +  +         L 
Sbjct: 124 TLLGENNPSLLQVAYLQAVKEKADVIYLPDANLDL------RELSMPSIAHPQSSFQGLT 177

Query: 226 YSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLP 285
           Y  E+ +    +P VHFG  ++    LP E           + ++    F Q  +     
Sbjct: 178 YIPESGHY--FDPNVHFGC-NISSAYLPSEQ----------STIYKLCTFPQSPVIQTPA 224

Query: 286 DVDSVFYFTRKPSLEAF---DIRF-DDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLP 341
            V  +     +   +A    +IR+ D   P V L  G   P++  N+ +   AFWAL   
Sbjct: 225 IVGPLQLVIAQDFSDALLQENIRYCDSYAPPVLLHPGTFAPMHFNNSAFLYDAFWALPFQ 284

Query: 342 VSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRY---DKIEAYPFSEEKDLHVNVGR 398
             +S +  D+   F  QRL+   G         VH     DKI   P    K   V V  
Sbjct: 285 FELS-IWDDLQWSFILQRLIGLTGSNNQTNSVLVHFQGIQDKIP--PAIAIKTESVRVK- 340

Query: 399 LIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRL--- 455
               L+ +R N   F +    L   ++ E F     V+    WL+ L  +GY+ P +   
Sbjct: 341 ----LLEFRCNVDSFVDCASNLLSDLSNEKFIENSTVESFLKWLKMLQMMGYRFPSIIHQ 396

Query: 456 -MSLELDRPRASIGHGD--------RKEFVPRKLPSV----------HLGVEETGTVSYE 496
            +S  +D     I             K ++P+ +  V          H   +     S  
Sbjct: 397 GLSSSIDCSEEHIKFHPLNFSTAMITKPYLPKPMLPVSNLDFISQLYHSTCDSVKIPSAN 456

Query: 497 IGNLIRWRKNFGNVVLIMFCSGPVERTALEWRLLYGRIFKTVI----------ILSEQKN 546
             +  R R  +  ++L++  +GPV      +  LY   F  ++          IL   K 
Sbjct: 457 NIDFARPRVQYSEILLLVIFNGPVYAALPYFEALYRSFFPNIVYCGPGHPNYQILQNFKQ 516

Query: 547 EDLAV------EAGQLEQV--YRHLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKL 598
             ++         G +E    Y  L        + +G+L + DD +L+    L + KN  
Sbjct: 517 LKISFISYHKSPKGHVEGALNYECLSIAMKMNYNVQGYLTIADDMVLS----LSSIKNHT 572

Query: 599 WITDKVLYL 607
              D V YL
Sbjct: 573 DHFDSVWYL 581


>gi|347755318|ref|YP_004862882.1| hypothetical protein [Candidatus Chloracidobacterium thermophilum
           B]
 gi|347587836|gb|AEP12366.1| hypothetical protein Cabther_A1616 [Candidatus Chloracidobacterium
           thermophilum B]
          Length = 378

 Score = 43.5 bits (101), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 59/230 (25%), Positives = 98/230 (42%), Gaps = 38/230 (16%)

Query: 168 LPYDSYVRKSCGYLFAIQHGAKKIFDADDRG------DVIGDDLGKHFDVEL-VGEGARQ 220
           +P+ S  R++ G+L A + G   I   DD        D +G+      DV L V  G+  
Sbjct: 81  IPWRSDNRRNVGFLMAYRDGCDPIISIDDDNYPTPGWDFLGEHAVTGCDVTLPVAVGSD- 139

Query: 221 ETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLEN----VGEISHEEFYTEVFGGKQFI 276
                + +     T+  P +  GQ +V+PRG P        G +S          G+  +
Sbjct: 140 ----NWFNICSMMTVDCPPLGGGQ-TVYPRGFPYPRRTLACGTVS-----ATAETGRVAV 189

Query: 277 QQGISNGLPDVDSVFYF-TRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNTIYQSS-- 333
             G+ +G PDVD+     TR  + EAF   +        L +G+  P+N+ NT    +  
Sbjct: 190 NAGLWSGDPDVDAATRIVTRCATREAFTQSY-------LLGRGVRSPINTQNTAVMRAAL 242

Query: 334 -AFWALMLPVSVSTMA----SDVLRGFWGQRLLWEIGGYVVVYPPTV-HR 377
            A++ + + VS++ +      D+  G++ Q     +G  V V  P V HR
Sbjct: 243 PAYYYVKMGVSLAGLKLDRFGDIFSGYFVQLCAEAVGHRVRVGSPVVEHR 292


>gi|156366207|ref|XP_001627031.1| predicted protein [Nematostella vectensis]
 gi|156213928|gb|EDO34931.1| predicted protein [Nematostella vectensis]
          Length = 314

 Score = 43.1 bits (100), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 27/98 (27%), Positives = 46/98 (46%), Gaps = 3/98 (3%)

Query: 509 NVVLIMFCSGPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQV---YRHLPK 565
           NV LI+    P   +    R  Y   F+ +I    + NE + V     ++    Y  + K
Sbjct: 32  NVALIIIYHYPYYDSFPLLRSFYENGFRKIIACGPKANETIGVLQVSHDRGFWGYECIGK 91

Query: 566 IFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLWITDK 603
               +   EG+L + DD++  +WN+L  DK+K+W  D+
Sbjct: 92  AARLHPGYEGYLQIHDDSLFLWWNVLGVDKDKMWKFDQ 129


>gi|413945236|gb|AFW77885.1| hypothetical protein ZEAMMB73_039824 [Zea mays]
          Length = 179

 Score = 42.7 bits (99), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 16/23 (69%), Positives = 21/23 (91%)

Query: 584 ILNYWNLLQADKNKLWITDKVLY 606
           +LNYWNL+QADK KLWIT+K+ +
Sbjct: 2   VLNYWNLMQADKEKLWITNKIAH 24


>gi|297621773|ref|YP_003709910.1| hypothetical protein wcw_1556 [Waddlia chondrophila WSU 86-1044]
 gi|297377074|gb|ADI38904.1| hypothetical protein wcw_1556 [Waddlia chondrophila WSU 86-1044]
 gi|337292402|emb|CCB90428.1| putative uncharacterized protein [Waddlia chondrophila 2032/99]
          Length = 281

 Score = 42.0 bits (97), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 23/94 (24%), Positives = 45/94 (47%), Gaps = 1/94 (1%)

Query: 507 FGNVVLIMFCSGPVERTALEWRLLYGRIFKTVIILSEQKN-EDLAVEAGQLEQVYRHLPK 565
           F +++LI+  + P        + +Y   F  ++   E  + E + ++ G    V+R L  
Sbjct: 37  FEDILLIINFNHPYYGNIEFLKEIYSPYFPNIVFYGEAAHPEVVKIKTGIGWHVHRVLKD 96

Query: 566 IFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLW 599
              RY    G++  QDD  + +WN  + +K+K+W
Sbjct: 97  ALIRYPGFRGYICTQDDCFIGFWNFQELNKDKIW 130


>gi|443698353|gb|ELT98391.1| hypothetical protein CAPTEDRAFT_204973 [Capitella teleta]
          Length = 768

 Score = 42.0 bits (97), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 78/358 (21%), Positives = 132/358 (36%), Gaps = 34/358 (9%)

Query: 106 EKWIVVSVDRYPTDSLKKLVKIKGWQVLAIGNSRTPKNWNLKGAIFLSLDMQANLGFRVL 165
           ++W++V +  +P    +  +   GW +  +G      + + +   F S D Q       +
Sbjct: 63  KQWLIVQLTEHPEICTRLAMSFPGWTIALVGVKSFSDHSHSRCRYFSSQDAQDIWNSNRM 122

Query: 166 DFLPYDSYVRKSCGYLFAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGEGARQETILQ 225
             L  +        YL A++  A  I+  D   D+      +   +  +         L 
Sbjct: 123 TLLSENYPSLLQVAYLQAVKENADVIYLPDANLDL------RELSMPSIAPPQSSFQGLT 176

Query: 226 YSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGLP 285
           Y  E+ +    +P VHFG  ++    LP E           + ++    F Q  +     
Sbjct: 177 YIPESGHY--FDPNVHFG-CNISSAYLPSEQ----------STIYKLCTFPQSPVIQTPA 223

Query: 286 DVDSVFYFTRKPSLEAF---DIRF-DDRVPKVALPQGMMVPVNSFNTIYQSSAFWALMLP 341
            V  +     +   +A    +IR+ D   P V L  G   P++  N+ +   AFWAL   
Sbjct: 224 IVGPLQLVIAQDFSDALLQENIRYCDSYAPPVLLHPGTFAPMHFNNSAFLYDAFWALPFQ 283

Query: 342 VSVSTMASDVLRGFWGQRLLWEIGGYVVVYPPTVHRY---DKIEAYPFSEEKDLHVNVGR 398
             +S +  D+   F  QRL+   G         VH     DKI   P    K   V V  
Sbjct: 284 FELS-IWDDLQWSFILQRLIGLTGSNNQTNSVLVHFQGIQDKIP--PAIAIKTESVRVK- 339

Query: 399 LIKFLVSWRSNKHRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQDLIAVGYQQPRLM 456
               L+ +R N   F E    L   ++ E F     V+    WL+ L  +GY+ P ++
Sbjct: 340 ----LLEFRCNVDSFVECASNLLSDLSNEKFIENSTVESFLKWLKMLQMMGYRFPSII 393


>gi|260785972|ref|XP_002588033.1| hypothetical protein BRAFLDRAFT_83011 [Branchiostoma floridae]
 gi|229273190|gb|EEN44044.1| hypothetical protein BRAFLDRAFT_83011 [Branchiostoma floridae]
          Length = 553

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 33/131 (25%), Positives = 57/131 (43%), Gaps = 23/131 (17%)

Query: 216 EGARQETILQYSHENPNRTIVNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQF 275
           E  R++  ++Y+  N N            +S+ PR   + N  ++S+  F   + GG  F
Sbjct: 35  ESQREDLAVRYAINNINSI----------KSLLPRTKLISNTQQVSNTSFTAAMQGGMSF 84

Query: 276 -------IQQGISNGLPDVDSVFYFTRKPSLEAFDIRFDDRVPKVALPQGMMVPVNSFNT 328
                  + +GIS  LP  DS F    +P+L  +  +  DR  +      +   +NS   
Sbjct: 85  EIYLCHSMDRGISGVLPWSDSGFAICTRPALLCWSDKGLDREKR------LHKSLNSTKR 138

Query: 329 IYQSSAFWALM 339
           +Y+S+  WA M
Sbjct: 139 LYRSAGPWAFM 149


>gi|449679524|ref|XP_002155119.2| PREDICTED: uncharacterized protein LOC100203850 [Hydra
           magnipapillata]
          Length = 715

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 40/73 (54%), Gaps = 6/73 (8%)

Query: 386 FSEEKDLHVNVGRLIKFLVSWRSNK-HRFFEKVLELSHSMAEEGFWTERDVKFTAAWLQD 444
           FSE++     +G L  +L  W+S++   FF  V +L+  +A  G+ ++   K +  WL  
Sbjct: 648 FSEKE-----IGDLSSYLYYWQSDETESFFHIVDQLNFELAVRGYLSQLHAKLSRWWLHK 702

Query: 445 LIAVGYQQPRLMS 457
           L+ +GY QP L S
Sbjct: 703 LVKLGYYQPSLPS 715


>gi|294886897|ref|XP_002771908.1| hypothetical protein Pmar_PMAR023022 [Perkinsus marinus ATCC 50983]
 gi|239875708|gb|EER03724.1| hypothetical protein Pmar_PMAR023022 [Perkinsus marinus ATCC 50983]
          Length = 137

 Score = 40.4 bits (93), Expect = 2.7,   Method: Composition-based stats.
 Identities = 31/133 (23%), Positives = 60/133 (45%), Gaps = 20/133 (15%)

Query: 182 FAIQHGAKKIFDADDRGDVIGDDLGKHFDVELVGE-GARQET-------ILQYSHENPNR 233
           +A+ HGAKK+FD DD   +  D + +    + +G  GA +ET        +  S  +   
Sbjct: 5   YALIHGAKKVFDLDDDNIIYADSVQEITKGDFMGYCGASRETTGCPGKHTVTISATSTVP 64

Query: 234 TIVNPY-------VHFGQRSVWPRGLPL-----ENVGEISHEEFYTEVFGGKQFIQQGIS 281
           ++ NPY       +   +  +WPRG PL     +     +     ++ +  +  + Q ++
Sbjct: 65  SVFNPYSTGMVPGLDNAETVLWPRGYPLSYIRRDRATTTAKPSSTSDTWTREIAVVQTLA 124

Query: 282 NGLPDVDSVFYFT 294
           +  PD D+++  T
Sbjct: 125 DNDPDFDAIYRLT 137


>gi|156370882|ref|XP_001628496.1| predicted protein [Nematostella vectensis]
 gi|156215474|gb|EDO36433.1| predicted protein [Nematostella vectensis]
          Length = 366

 Score = 39.7 bits (91), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 22/98 (22%), Positives = 48/98 (48%), Gaps = 5/98 (5%)

Query: 507 FG-NVVLIMFCSGPVERTALEWRLLYGRIFKTVIILSEQKNEDLAVEAGQLEQ----VYR 561
           FG +++L++  S PV  +    + LY  +F  +++   + +    ++   +       Y 
Sbjct: 51  FGIDLLLVIVYSVPVYDSLPTLKALYQDVFPNILVCGPEPSNIYKIQITDIGIRGFFSYE 110

Query: 562 HLPKIFSRYTSAEGFLFLQDDTILNYWNLLQADKNKLW 599
            + +         G+L++ DD I+N+WNL++ DK  +W
Sbjct: 111 CMGRAIRENPGYNGYLYINDDMIVNWWNLVRLDKTLIW 148


>gi|340959777|gb|EGS20958.1| NADP-dependent glutamate dehydrogenase-like protein [Chaetomium
           thermophilum var. thermophilum DSM 1495]
          Length = 451

 Score = 38.9 bits (89), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 8/109 (7%)

Query: 181 LFAIQHGAKKIFDADDRGDVIG-DDLGKHF-DVELVGE-GARQETILQYSHENPNRTI-- 235
           L AI+ GA  +  +D +G +I  DD G    D+E + +   R+  + +Y +++  R I  
Sbjct: 236 LKAIELGATVVSLSDSKGALIAVDDKGVTVEDIEAIMKLKERRRPLSEYEYKDNLRYIEG 295

Query: 236 VNPYVHFGQRSVWPRGLPLENVGEISHEEFYTEVFGGKQFIQQGISNGL 284
           V P+VH GQ  +    LP     E+S EE    V  G +FI +G + G 
Sbjct: 296 VRPWVHVGQVDI---ALPCATQNEVSKEEAEALVANGCKFIAEGSNMGC 341


>gi|449684486|ref|XP_002168503.2| PREDICTED: uncharacterized protein LOC100205841 [Hydra
           magnipapillata]
          Length = 351

 Score = 38.9 bits (89), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 18/69 (26%), Positives = 39/69 (56%), Gaps = 1/69 (1%)

Query: 532 GRIFKTVIILSEQKNEDL-AVEAGQLEQVYRHLPKIFSRYTSAEGFLFLQDDTILNYWNL 590
            RI+   + L+ Q   ++  V+      +Y  L ++   +T+  G+LF+ ++ +LNYWN+
Sbjct: 90  NRIYCGSVPLNNQTEINVKVVDTKHGAFLYDCLTEVMKTHTNFTGYLFIGEEILLNYWNM 149

Query: 591 LQADKNKLW 599
           ++ D  ++W
Sbjct: 150 IEFDLGRIW 158


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.138    0.420 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,864,670,042
Number of Sequences: 23463169
Number of extensions: 427211672
Number of successful extensions: 858114
Number of sequences better than 100.0: 143
Number of HSP's better than 100.0 without gapping: 116
Number of HSP's successfully gapped in prelim test: 27
Number of HSP's that attempted gapping in prelim test: 857650
Number of HSP's gapped (non-prelim): 198
length of query: 611
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 462
effective length of database: 8,863,183,186
effective search space: 4094790631932
effective search space used: 4094790631932
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 80 (35.4 bits)