BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 015859
         (399 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225465688|ref|XP_002272694.1| PREDICTED: uncharacterized protein LOC100251105 [Vitis vinifera]
          Length = 405

 Score =  634 bits (1634), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 303/405 (74%), Positives = 344/405 (84%), Gaps = 6/405 (1%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
           MAFLFQKFQEAVKTLAKSP FARDPR LQFEAD+NRLFLYTSYNRLGR+A+EADAE IID
Sbjct: 1   MAFLFQKFQEAVKTLAKSPVFARDPRHLQFEADINRLFLYTSYNRLGRNADEADAENIID 60

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPI--ESSAQSNAAPRRSGLS 118
           MAGKASL+DQQ+QV +NIHSQIK FC ++D +LL   KN+    ES  Q+ A PRRSGLS
Sbjct: 61  MAGKASLADQQKQVQENIHSQIKTFCNTLDGVLLPDSKNMNELPESCPQTKATPRRSGLS 120

Query: 119 FAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEA 178
           FAVG    P+D+  VPQT+ L R ELSQ LKD IGYTLDLKPSQIPH++AGQGLFL GEA
Sbjct: 121 FAVGRDGPPSDNSAVPQTKPLKRAELSQSLKDLIGYTLDLKPSQIPHKDAGQGLFLDGEA 180

Query: 179 NVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWD 238
           +VGAV+A+YPG+IY PAYY+YIPGYPRVDA NPYLITRYDGTVINAQPWG G +TRELWD
Sbjct: 181 DVGAVVALYPGVIYFPAYYQYIPGYPRVDAHNPYLITRYDGTVINAQPWGVGGETRELWD 240

Query: 239 GLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGM 298
           G ++ E+  N +G EKGSD+ WKLLSKPM+ K GGSG ++LERRNPLA AHFANHP+KGM
Sbjct: 241 GFSVTEVSANMQGDEKGSDRIWKLLSKPMEAKHGGSGCDVLERRNPLAFAHFANHPSKGM 300

Query: 299 VPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSS----TPVL 354
            PNVM+CPYDFPLTE+DMR YIPN+ FG+++EV M+R GSFWFK GS + S      PVL
Sbjct: 301 SPNVMVCPYDFPLTEQDMRAYIPNVLFGSSKEVKMKRLGSFWFKSGSSNKSELDAHVPVL 360

Query: 355 KTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           KTL LVATRA+CDEEVLLNYRLSNSKRRP WY+PVDEEEDRRRWS
Sbjct: 361 KTLVLVATRALCDEEVLLNYRLSNSKRRPAWYTPVDEEEDRRRWS 405


>gi|224083904|ref|XP_002307167.1| predicted protein [Populus trichocarpa]
 gi|222856616|gb|EEE94163.1| predicted protein [Populus trichocarpa]
          Length = 402

 Score =  608 bits (1568), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 300/402 (74%), Positives = 347/402 (86%), Gaps = 3/402 (0%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
           MA LF KFQE VKTLAKSPTFAR+PR+ QFE DMNRLFL+TSYNRLG++AEEADAEEII+
Sbjct: 1   MASLFNKFQEFVKTLAKSPTFARNPREFQFETDMNRLFLFTSYNRLGKNAEEADAEEIIE 60

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILL--LPEKNVPIESSAQSNAAPRRSGLS 118
           MA KAS++DQQ+QV +NIH QI  FC +M+EILL  L +++   E+ AQSNAAPRRSGLS
Sbjct: 61  MASKASVADQQKQVQENIHFQITNFCSAMNEILLPDLEKRSKKDEAPAQSNAAPRRSGLS 120

Query: 119 FAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEA 178
           FAVG +  P D P +P+T+QL+R+++SQ L+D IGYTLDLKPS IPH+EAG+GLFL GEA
Sbjct: 121 FAVGKSGPPIDRPAIPETKQLSRSKVSQSLRDEIGYTLDLKPSLIPHKEAGEGLFLSGEA 180

Query: 179 NVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWD 238
           +VG VIAIYPG+IYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINA PWGSG ++RE+WD
Sbjct: 181 DVGTVIAIYPGVIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINALPWGSGGESREVWD 240

Query: 239 GLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGM 298
           GLT+PEI  N +  EKG D+ W++LSKP++  R GS  ++LERRNPLALAHFANHPAKGM
Sbjct: 241 GLTVPEIRSNVQSVEKGQDRVWRMLSKPLEGTRVGSIGDVLERRNPLALAHFANHPAKGM 300

Query: 299 VPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGS-GSGSSTPVLKTL 357
            PNVMICPYDFPLTEKDMR +IPN+SFGN EEVNMRRFGSFWFK  +  S S  PVLKTL
Sbjct: 301 DPNVMICPYDFPLTEKDMRTFIPNVSFGNHEEVNMRRFGSFWFKSSAKNSVSDVPVLKTL 360

Query: 358 ALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
            LVATRA+C+EEVLLNYRLSN+KRRP WYSPVDEEEDRRRWS
Sbjct: 361 VLVATRALCNEEVLLNYRLSNTKRRPAWYSPVDEEEDRRRWS 402


>gi|255558659|ref|XP_002520354.1| conserved hypothetical protein [Ricinus communis]
 gi|223540452|gb|EEF42020.1| conserved hypothetical protein [Ricinus communis]
          Length = 405

 Score =  584 bits (1505), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 284/405 (70%), Positives = 334/405 (82%), Gaps = 6/405 (1%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFAR---DPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEE 57
           MAFLF KFQEAVKTLAKSPTFAR   +PRQLQF  D+NRLFLYTSYNRLGR+AE+AD EE
Sbjct: 1   MAFLFNKFQEAVKTLAKSPTFARFTREPRQLQFAIDINRLFLYTSYNRLGRNAEDADVEE 60

Query: 58  IIDMAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILL--LPEKNVPIESSAQSNAAPRRS 115
           II+MA  +S++DQQ QV +NIHSQI+ FC SMDEILL  + ++N   E+ AQ +  PRRS
Sbjct: 61  IIEMASTSSVTDQQNQVQENIHSQIESFCTSMDEILLPDMMKRNELGEALAQLDPVPRRS 120

Query: 116 GLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC 175
           GLS AVG +   T  P + +TR L   E+S RLK  IGYTL+LKPSQIPH+EAGQGLF+ 
Sbjct: 121 GLSLAVGKSTPSTGCPAITETRPLKSGEVSSRLKGLIGYTLNLKPSQIPHKEAGQGLFID 180

Query: 176 GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRE 235
           GEA+VG+VIA YPG+IY+PAYYRYIPGYPRVDA NPYLITRYDGTVINAQPWG G +TRE
Sbjct: 181 GEADVGSVIAFYPGVIYAPAYYRYIPGYPRVDACNPYLITRYDGTVINAQPWGFGGETRE 240

Query: 236 LWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPA 295
            W+GL +P++  + +  EKG D++WK+LSKP++    GS +E+LE+RNPLALAHFANHPA
Sbjct: 241 AWNGLLVPDVRHSVQIREKGVDRWWKMLSKPLEGTEFGSAAEVLEQRNPLALAHFANHPA 300

Query: 296 KGMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKW-GSGSGSSTPVL 354
           K + PNVMICPYDFPLTEK+MR YIPNI FGN+EEVNMRRFGSFWF+    GSGS  P+L
Sbjct: 301 KDIAPNVMICPYDFPLTEKNMRTYIPNIIFGNSEEVNMRRFGSFWFRSKAKGSGSGIPIL 360

Query: 355 KTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           K L LVATRA+ +EEVLLNYRLS+SKRRP WY+PVDEEEDRRRWS
Sbjct: 361 KALVLVATRALSNEEVLLNYRLSHSKRRPEWYTPVDEEEDRRRWS 405


>gi|449464032|ref|XP_004149733.1| PREDICTED: uncharacterized protein LOC101219994 [Cucumis sativus]
          Length = 396

 Score =  583 bits (1504), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 288/407 (70%), Positives = 332/407 (81%), Gaps = 19/407 (4%)

Query: 1   MAFLFQKFQEAVKTLAKSPTF------ARDPRQLQFEADMNRLFLYTSYNRLGRDAEEAD 54
           MAFLFQKFQEAVK LAK+P F      ARDPR+LQFEADMNRLFL+TSYNRLG DA EAD
Sbjct: 1   MAFLFQKFQEAVKVLAKNPLFTKNPLFARDPRRLQFEADMNRLFLFTSYNRLGMDATEAD 60

Query: 55  AEEIIDMAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRR 114
           AEEIIDMA KAS +DQQ+QV +NIHSQ+K FC+ MD ILL  E   P ES  +S  A R+
Sbjct: 61  AEEIIDMASKASFADQQKQVQENIHSQVKSFCMHMDGILLPDE---PAESPEKSKDAVRK 117

Query: 115 SGLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFL 174
           SGLSFAVG T        +P TR L R+ELSQ+LKD IGYTLD+KPS+IPH++AGQGLF+
Sbjct: 118 SGLSFAVGRTN------TIPTTRPLGRSELSQQLKDEIGYTLDIKPSRIPHKDAGQGLFI 171

Query: 175 CGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTR 234
            GEA+VG++IAIYPG++YSPA+Y+YIPGYPRVDAQNPYLITRYDGTVINAQPWG G DTR
Sbjct: 172 DGEADVGSIIAIYPGVVYSPAHYQYIPGYPRVDAQNPYLITRYDGTVINAQPWGLGADTR 231

Query: 235 ELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHP 294
           E+W GLT+P   P  +G EK SD+ W++LSKP++ KR   G + +ERRNPLA AHFANHP
Sbjct: 232 EVWHGLTVPNSNPTKQGDEK-SDRLWRMLSKPLEAKRVQHGGDAIERRNPLAFAHFANHP 290

Query: 295 AKGMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSG--SGSSTP 352
           AK MVPNVM+CPYDFPLTEKDMR YIPN+   N EEV M+RFGSFWFK G    +GS TP
Sbjct: 291 AKDMVPNVMLCPYDFPLTEKDMRVYIPNVPVAN-EEVGMKRFGSFWFKSGRSRINGSDTP 349

Query: 353 VLKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           +LKT+ LVATRA+C+EEVLLNYRLSNSKRRP WY+PVDE EDRRRWS
Sbjct: 350 ILKTIVLVATRALCNEEVLLNYRLSNSKRRPSWYTPVDEGEDRRRWS 396


>gi|297808317|ref|XP_002872042.1| hypothetical protein ARALYDRAFT_489175 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317879|gb|EFH48301.1| hypothetical protein ARALYDRAFT_489175 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 398

 Score =  558 bits (1437), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 268/402 (66%), Positives = 326/402 (81%), Gaps = 7/402 (1%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
           MA LF KFQ+AV  LAKS TFA++PRQLQFEAD+N+LF+YTSY RLGR+AEE DAEEII+
Sbjct: 1   MATLFNKFQQAVGVLAKSTTFAKNPRQLQFEADVNKLFMYTSYYRLGREAEETDAEEIIE 60

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAP-RRSGLSF 119
           MAGKA+LS+QQ+QV +NIH Q++ FC  MD ILL   +N   ES +QS + P RRSGL+F
Sbjct: 61  MAGKATLSEQQKQVQENIHYQVENFCSLMDGILLPDVRNN--ESGSQSTSPPPRRSGLTF 118

Query: 120 AVGPT-ARPT-DSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGE 177
           A+G   A P  D+P+VP+T+ L   E+SQRL D +GYTL++KPS IPH++AGQG F+ GE
Sbjct: 119 AIGSNNAFPAADTPLVPETKPLKLNEVSQRLMDQMGYTLEIKPSVIPHKDAGQGCFIKGE 178

Query: 178 ANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELW 237
           A+VG V+A YPG+IYSPA+YRYIPGYP+VD+QN YLITRYDGTVINAQPWG G ++RE+W
Sbjct: 179 ADVGTVLAFYPGVIYSPAFYRYIPGYPKVDSQNSYLITRYDGTVINAQPWGLGGESREVW 238

Query: 238 DGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKG 297
           +G   P +  NSK AE GSD+ WK LSKP++    G   ++LERRNPLA  H ANHPAK 
Sbjct: 239 NGSYTPAVKANSKAAENGSDRLWKALSKPLEG--SGEAKDVLERRNPLAFGHLANHPAKE 296

Query: 298 MVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTL 357
           M PNVMICPYDFPL  KD+RPYIPNISFG++ E+ M+RFGSFWFK GS +G   PVLKTL
Sbjct: 297 MTPNVMICPYDFPLMAKDLRPYIPNISFGDSGEIKMKRFGSFWFKTGSKNGLEAPVLKTL 356

Query: 358 ALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
            LVATR++C+EE+LLNYRLSNSKRRP WY+PV+EEEDRRRWS
Sbjct: 357 VLVATRSLCNEELLLNYRLSNSKRRPDWYTPVNEEEDRRRWS 398


>gi|22327005|ref|NP_197711.2| uncharacterized protein [Arabidopsis thaliana]
 gi|18491157|gb|AAL69481.1| unknown protein [Arabidopsis thaliana]
 gi|20258933|gb|AAM14182.1| unknown protein [Arabidopsis thaliana]
 gi|332005749|gb|AED93132.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 399

 Score =  556 bits (1434), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 265/401 (66%), Positives = 322/401 (80%), Gaps = 4/401 (0%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
           MA LF KFQ+AV  LAKS TFA++PRQLQFEAD+N+LF+YTSYNRLGR+AEE DAEEII+
Sbjct: 1   MATLFNKFQQAVGVLAKSTTFAKNPRQLQFEADVNKLFMYTSYNRLGREAEETDAEEIIE 60

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFA 120
           MAGKA+LS+QQ+QV +NIH Q+++FC  MD ILL   +     S + S   PRRSGL+FA
Sbjct: 61  MAGKATLSEQQKQVQENIHYQVEKFCSLMDGILLPDVRRNESGSQSTSPRPPRRSGLTFA 120

Query: 121 VGPT-ARP-TDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEA 178
           +G   A P  D P+VP+T+ L   ++SQRL D +GYTL+ KPS IPH++AGQG F+ GEA
Sbjct: 121 IGSNNAFPHADQPLVPETKPLKLNDVSQRLMDQMGYTLETKPSVIPHKDAGQGCFIKGEA 180

Query: 179 NVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWD 238
           +VG V+A YPG+IYSPA+Y+YIPGYP+VD+QN YLITRYDGTVINAQPWG G ++RE W+
Sbjct: 181 DVGTVLAFYPGVIYSPAFYKYIPGYPKVDSQNSYLITRYDGTVINAQPWGLGGESREAWN 240

Query: 239 GLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGM 298
           G   P +  N+K AE GSD+ WK LSKP++    G G E+LERRNPLA  H ANHPAK M
Sbjct: 241 GSYTPAVKANTKTAENGSDRLWKALSKPLEG--SGKGKEVLERRNPLAFGHLANHPAKEM 298

Query: 299 VPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLA 358
            PNVMICPYDFPL  KD+RPYIPNISFG++ E+ M+RFGSFWFK GS +G   PVLKTL 
Sbjct: 299 TPNVMICPYDFPLMAKDLRPYIPNISFGDSGEIKMKRFGSFWFKTGSKNGLEAPVLKTLV 358

Query: 359 LVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           LVATR++C+EE+LLNYRLSNSKRRP WY+PV+EEEDRRRWS
Sbjct: 359 LVATRSLCNEELLLNYRLSNSKRRPDWYTPVNEEEDRRRWS 399


>gi|10177809|dbj|BAB11175.1| unnamed protein product [Arabidopsis thaliana]
          Length = 437

 Score =  556 bits (1434), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 265/401 (66%), Positives = 322/401 (80%), Gaps = 4/401 (0%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
           MA LF KFQ+AV  LAKS TFA++PRQLQFEAD+N+LF+YTSYNRLGR+AEE DAEEII+
Sbjct: 39  MATLFNKFQQAVGVLAKSTTFAKNPRQLQFEADVNKLFMYTSYNRLGREAEETDAEEIIE 98

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFA 120
           MAGKA+LS+QQ+QV +NIH Q+++FC  MD ILL   +     S + S   PRRSGL+FA
Sbjct: 99  MAGKATLSEQQKQVQENIHYQVEKFCSLMDGILLPDVRRNESGSQSTSPRPPRRSGLTFA 158

Query: 121 VGPT-ARP-TDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEA 178
           +G   A P  D P+VP+T+ L   ++SQRL D +GYTL+ KPS IPH++AGQG F+ GEA
Sbjct: 159 IGSNNAFPHADQPLVPETKPLKLNDVSQRLMDQMGYTLETKPSVIPHKDAGQGCFIKGEA 218

Query: 179 NVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWD 238
           +VG V+A YPG+IYSPA+Y+YIPGYP+VD+QN YLITRYDGTVINAQPWG G ++RE W+
Sbjct: 219 DVGTVLAFYPGVIYSPAFYKYIPGYPKVDSQNSYLITRYDGTVINAQPWGLGGESREAWN 278

Query: 239 GLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGM 298
           G   P +  N+K AE GSD+ WK LSKP++    G G E+LERRNPLA  H ANHPAK M
Sbjct: 279 GSYTPAVKANTKTAENGSDRLWKALSKPLEG--SGKGKEVLERRNPLAFGHLANHPAKEM 336

Query: 299 VPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLA 358
            PNVMICPYDFPL  KD+RPYIPNISFG++ E+ M+RFGSFWFK GS +G   PVLKTL 
Sbjct: 337 TPNVMICPYDFPLMAKDLRPYIPNISFGDSGEIKMKRFGSFWFKTGSKNGLEAPVLKTLV 396

Query: 359 LVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           LVATR++C+EE+LLNYRLSNSKRRP WY+PV+EEEDRRRWS
Sbjct: 397 LVATRSLCNEELLLNYRLSNSKRRPDWYTPVNEEEDRRRWS 437


>gi|356532231|ref|XP_003534677.1| PREDICTED: uncharacterized protein LOC100782005 [Glycine max]
          Length = 402

 Score =  551 bits (1419), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 277/406 (68%), Positives = 321/406 (79%), Gaps = 11/406 (2%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
           MA LFQKFQEAVKTLAK P+F +DPRQLQFEAD+NRLFLY SYN LG++A+EA AEEII+
Sbjct: 1   MALLFQKFQEAVKTLAKRPSFTKDPRQLQFEADINRLFLYLSYNHLGKNADEAHAEEIIE 60

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFA 120
           MA KAS +DQQ QV +N+HSQIK FC  MDEILL  EK V        N  PRRSGL FA
Sbjct: 61  MASKASFADQQMQVQENVHSQIKTFCTFMDEILLPNEKMV--NEHQLQNTVPRRSGLGFA 118

Query: 121 VGPTARP--TDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEA 178
           VG    P        P+TR L++  +SQ LKD +GYT+++ PSQIPH++AGQGLFL G  
Sbjct: 119 VGSRNDPPLAFGAASPKTRPLSQAVVSQNLKDQLGYTVNVTPSQIPHKDAGQGLFLDGTV 178

Query: 179 NVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWD 238
           +VGAV+A YPGI+YSPAYYRYIPGYP+VD  NPYLITRYDG VINAQPWGSG D RELW+
Sbjct: 179 DVGAVVAFYPGIVYSPAYYRYIPGYPKVDTLNPYLITRYDGNVINAQPWGSGGDNRELWN 238

Query: 239 GLTLPEIMPNSKGA--EKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAK 296
           G    EI P+ KGA  EKGS++FWKLLSKP++  +GG+ +E++ERRNPLALAHFANHP K
Sbjct: 239 GRKTGEIKPDVKGAEPEKGSERFWKLLSKPLEGYQGGN-NEVIERRNPLALAHFANHPPK 297

Query: 297 GMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFK---WGSGSGSSTPV 353
           G+ PNVMICPYDFPLTE +MR YIPN+ FGNA EVNMRRFGSFWFK       SGS+ P 
Sbjct: 298 GVQPNVMICPYDFPLTESNMRVYIPNVLFGNA-EVNMRRFGSFWFKSGGGSRNSGSNVPT 356

Query: 354 LKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           LKTL L+ATR + DEE+ LNYRLSN+KRRP WY+PVDEEEDRRRWS
Sbjct: 357 LKTLVLIATRPLQDEELFLNYRLSNTKRRPEWYNPVDEEEDRRRWS 402


>gi|242062410|ref|XP_002452494.1| hypothetical protein SORBIDRAFT_04g026890 [Sorghum bicolor]
 gi|241932325|gb|EES05470.1| hypothetical protein SORBIDRAFT_04g026890 [Sorghum bicolor]
          Length = 403

 Score =  536 bits (1381), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 259/404 (64%), Positives = 314/404 (77%), Gaps = 6/404 (1%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
           MA LF K QEA++T+AKSP FARDPR LQFEAD+NRLFLYTSY+RLG +AEE DAEEIID
Sbjct: 1   MASLFNKIQEAIRTIAKSPMFARDPRHLQFEADVNRLFLYTSYSRLGENAEEKDAEEIID 60

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFA 120
           +A KAS++DQQ+QV +N+H Q++  C SMD IL     N P +  ++++   RRSGLSFA
Sbjct: 61  LASKASVADQQKQVQENVHYQLRHMCQSMDNILCPDTTNDPSKEPSEAHNHSRRSGLSFA 120

Query: 121 VGPTARPT-DSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEAN 179
           VG  A     S V+P TR LTR ELS++ +D  GYTLD+ PS IPH++AGQGLFL G+AN
Sbjct: 121 VGGAASTNKQSAVIPATRPLTRAELSKKFRDHFGYTLDIGPSGIPHKDAGQGLFLSGKAN 180

Query: 180 VGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDG 239
           VGAV+AIYPG+IYSPAYYRYIPGYPR+D  N YLITRYDGT+I+A+PW  G +TRELWDG
Sbjct: 181 VGAVLAIYPGVIYSPAYYRYIPGYPRIDVCNNYLITRYDGTIIDAKPWYFGGETRELWDG 240

Query: 240 LTLPEI--MPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKG 297
             L +   MP  KG+E  SD+ W++LSKP++     +  E+LERRNPLA  HFANHP KG
Sbjct: 241 SDLVDYNAMP-PKGSENNSDRVWRMLSKPLEKSVRENFGEVLERRNPLAFGHFANHPPKG 299

Query: 298 MVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSS--TPVLK 355
             PNVMICPYDFPLTEK+MR YIPN++FG  E + M+RFGSF+FK G     +  + VLK
Sbjct: 300 SSPNVMICPYDFPLTEKNMRVYIPNVTFGGEETIKMKRFGSFYFKSGGSDNQAGDSLVLK 359

Query: 356 TLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           TL LV+TR+ICDEE+ LNYR SNSKRRP WYSPVDEEED+RRWS
Sbjct: 360 TLVLVSTRSICDEELFLNYRYSNSKRRPEWYSPVDEEEDKRRWS 403


>gi|357150341|ref|XP_003575426.1| PREDICTED: uncharacterized protein LOC100822461 [Brachypodium
           distachyon]
          Length = 402

 Score =  535 bits (1378), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 262/403 (65%), Positives = 306/403 (75%), Gaps = 5/403 (1%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
           MAFLF KFQEA++TLAK+P FARDPR LQF+AD+NRLFLYTSY RLG  +EE DAEEIID
Sbjct: 1   MAFLFNKFQEAIRTLAKNPKFARDPRHLQFDADVNRLFLYTSYYRLGEHSEEKDAEEIID 60

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFA 120
           MA KAS+ DQQ+QV  N+H Q+K  C +MD IL    KN P +S + ++   RRSGLSFA
Sbjct: 61  MASKASVVDQQKQVQGNVHYQLKHMCQAMDSILRPDIKNDPSQSPSDTHNKSRRSGLSFA 120

Query: 121 VGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANV 180
           VG      + PVVP TR LT+ ELS + +D  GYTLD++PS IPH++AGQGLFL GEA V
Sbjct: 121 VGTGVSSANKPVVPATRPLTQAELSNKFRDQFGYTLDIRPSGIPHKDAGQGLFLSGEARV 180

Query: 181 GAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGL 240
           GAV+AIYPG+IYSPAYYRYIPGYPR+DA N YLITRYDGTVINA+PW  G +T ELWDG 
Sbjct: 181 GAVVAIYPGVIYSPAYYRYIPGYPRIDACNSYLITRYDGTVINAKPWHLGGETMELWDGS 240

Query: 241 TLPEIMPN-SKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMV 299
            L +     S   E  SD+ W++L+KP+   R  +  E+LERRNPLA  HFANHP  G  
Sbjct: 241 HLVDYNSTPSTSPESNSDRAWRMLNKPLVKSRNENFGEVLERRNPLAFGHFANHPPGGST 300

Query: 300 PNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSST---PVLKT 356
           PNVMICPYDFPLTEKDMR YIPNI+FG  E V M+RFGSFWFK   GSG  T    VLKT
Sbjct: 301 PNVMICPYDFPLTEKDMRLYIPNITFGGEEPVTMKRFGSFWFK-SRGSGQQTGESSVLKT 359

Query: 357 LALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           L LV+TR+I DEE+ LNYR S+SK RP WY+PVDEEED+RRWS
Sbjct: 360 LVLVSTRSIRDEEIFLNYRYSSSKARPEWYTPVDEEEDKRRWS 402


>gi|212723622|ref|NP_001131459.1| uncharacterized protein LOC100192794 [Zea mays]
 gi|194691586|gb|ACF79877.1| unknown [Zea mays]
 gi|413937913|gb|AFW72464.1| hypothetical protein ZEAMMB73_372625 [Zea mays]
          Length = 403

 Score =  533 bits (1372), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 255/403 (63%), Positives = 311/403 (77%), Gaps = 4/403 (0%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
           MAFLF KFQEA++T+AKSP FARDPR LQFEAD+NRLFLYTSY RLG +A+E DAEEIID
Sbjct: 1   MAFLFNKFQEAIRTIAKSPMFARDPRHLQFEADVNRLFLYTSYYRLGENADEKDAEEIID 60

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFA 120
           +A KAS++DQQ+QV DN+H ++   C +MD IL     N P +  ++++   +RSGLSFA
Sbjct: 61  LASKASIADQQKQVQDNVHYELTHMCQAMDSILCPDTINDPSKGPSEAHNHSQRSGLSFA 120

Query: 121 VGPTARPT-DSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEAN 179
           VG  A     S ++P TR LTR ELS++ +D  GYTLD+ PS IPH++AGQGLFL G+AN
Sbjct: 121 VGGAASANKQSAIIPATRPLTRAELSKKFRDHFGYTLDIGPSGIPHKDAGQGLFLSGQAN 180

Query: 180 VGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDG 239
            GAV+AIYPG+IYSPAYYRYIPGYPR+D  N YLITRYDGT+I+A+PW  G +TR LWDG
Sbjct: 181 AGAVLAIYPGVIYSPAYYRYIPGYPRIDVCNNYLITRYDGTIIDAKPWCFGGETRALWDG 240

Query: 240 LTLPEIMPNS-KGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGM 298
             L +    + KG+E  SD+ W++LSKP++     +  E+LERRNPLA  HFANHP KG 
Sbjct: 241 SDLVDYNAMAPKGSENNSDRVWRMLSKPLEKGVRENFGEVLERRNPLAFGHFANHPPKGS 300

Query: 299 VPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSS--TPVLKT 356
            PNVMICPYDFPLTEKDMR YIPNI+FG  E + M+RFGSF+FK G     +  + VLKT
Sbjct: 301 SPNVMICPYDFPLTEKDMRVYIPNITFGGQETIRMKRFGSFYFKSGGSDNQAGDSLVLKT 360

Query: 357 LALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           L LV+TR+ICDEE+ LNYR SNSKRRP WYSPVDEEED+RRWS
Sbjct: 361 LVLVSTRSICDEELFLNYRYSNSKRRPEWYSPVDEEEDKRRWS 403


>gi|326511880|dbj|BAJ92084.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326522819|dbj|BAJ88455.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 401

 Score =  530 bits (1365), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 256/403 (63%), Positives = 308/403 (76%), Gaps = 6/403 (1%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
           MAFLF KFQEA+KTLAKSP FARDPR +QF+AD+NRLFL+TSYNRLG+ +EE D EEIID
Sbjct: 1   MAFLFNKFQEAIKTLAKSPRFARDPRHIQFDADVNRLFLFTSYNRLGKHSEEKDVEEIID 60

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFA 120
           +A KAS++DQQ+QV  N+H Q+K  C +MD IL  P+ N P +S + +    RRSGLS A
Sbjct: 61  LANKASVTDQQKQVQQNVHYQLKHICKAMDSILR-PDTNDPSQSPSDAYNNSRRSGLSLA 119

Query: 121 VGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANV 180
           VG      + P VP TR LTR ELS + +D  GYTLD+KPS IPH++AGQGLFL GEANV
Sbjct: 120 VGTGVASANKPAVPATRPLTRAELSNKFRDQFGYTLDIKPSLIPHKDAGQGLFLSGEANV 179

Query: 181 GAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGL 240
           GAV+AIYPG+IYSPAYYRYIPGYPR+DA N YLITRYDGT+INA+PW  G +TRE+WDG 
Sbjct: 180 GAVLAIYPGVIYSPAYYRYIPGYPRIDACNSYLITRYDGTIINAKPWHLGGETREVWDGS 239

Query: 241 TLPEIMPN-SKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMV 299
              +      K  E  SD+ W++L+KP+      +  ++LERRNPLA  HFANHP  G  
Sbjct: 240 DPVDYSTTPPKSPESNSDRAWRMLNKPLQKSGSENFGDVLERRNPLAFGHFANHPPGGST 299

Query: 300 PNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKW---GSGSGSSTPVLKT 356
           PNVMICPYDFPLTEKD+R YIPNI+FG  E V M+RFGSFWFK    G  +G S PVL+T
Sbjct: 300 PNVMICPYDFPLTEKDIRAYIPNIAFGGEEPVTMKRFGSFWFKSRTPGKPTGES-PVLRT 358

Query: 357 LALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           L LV+TR++CDEE+ LNYR S+SK RP WY+PVDEEED+RRWS
Sbjct: 359 LVLVSTRSVCDEELYLNYRYSSSKGRPEWYTPVDEEEDKRRWS 401


>gi|115447467|ref|NP_001047513.1| Os02g0634600 [Oryza sativa Japonica Group]
 gi|49388185|dbj|BAD25311.1| unknown protein [Oryza sativa Japonica Group]
 gi|113537044|dbj|BAF09427.1| Os02g0634600 [Oryza sativa Japonica Group]
 gi|215678770|dbj|BAG95207.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623301|gb|EEE57433.1| hypothetical protein OsJ_07637 [Oryza sativa Japonica Group]
          Length = 405

 Score =  528 bits (1360), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 261/410 (63%), Positives = 309/410 (75%), Gaps = 16/410 (3%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFA------RDPRQLQFEADMNRLFLYTSYNRLGRDAEEAD 54
           M  LF +FQEAVKTLAK+P FA      RDPR LQFEAD+NRLFLYTSY RLG +AEE D
Sbjct: 1   MGSLFIRFQEAVKTLAKNPMFANNRLFARDPRHLQFEADVNRLFLYTSYYRLGANAEEKD 60

Query: 55  AEEIIDMAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRR 114
           AEEIIDMA KAS+S+QQ+QV +N+H Q+   C +MD ILL   KN     ++++N  PRR
Sbjct: 61  AEEIIDMASKASVSEQQKQVQENVHYQLTNMCQAMDSILLPDTKN----GASEANNYPRR 116

Query: 115 SGLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFL 174
           SGLSFAVG      + P VP TR L R ELS + +D   YTLD++PSQIPH++AGQGLFL
Sbjct: 117 SGLSFAVGTGVASANKPDVPSTRPLNRAELSNKFRDHFQYTLDIRPSQIPHKDAGQGLFL 176

Query: 175 CGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTR 234
            GE N GAV+AIYPG++YSPAYYRYIPGYP++DA N YLITRYDGT+I+A+PW  G D+R
Sbjct: 177 SGETNAGAVLAIYPGVVYSPAYYRYIPGYPKIDACNNYLITRYDGTIIDAKPWQLGGDSR 236

Query: 235 ELWDGLTLPEI--MPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFAN 292
           E+WDG  L +   +P SK  E  SD+ W++LSKP+      +  E+LERRNPLA  HFAN
Sbjct: 237 EIWDGSDLVDYNAVP-SKSQESNSDRAWRMLSKPLKKGHTENFGEVLERRNPLAFGHFAN 295

Query: 293 HPAKGMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEE-VNMRRFGSFWFKWGSGSG--S 349
           HP KG  PNVMICPYDFPLTEKDMR YIPNI+FG  EE V M+RFGSFWFK G       
Sbjct: 296 HPPKGSTPNVMICPYDFPLTEKDMRVYIPNITFGGEEEPVTMKRFGSFWFKSGRSGNQEG 355

Query: 350 STPVLKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
            +PVLKTL LV+TR+ICDEE+ LNYR SNSK+RP WY PVDEEED+RRWS
Sbjct: 356 ESPVLKTLVLVSTRSICDEELFLNYRYSNSKKRPEWYIPVDEEEDKRRWS 405


>gi|147812456|emb|CAN77337.1| hypothetical protein VITISV_003407 [Vitis vinifera]
          Length = 345

 Score =  520 bits (1338), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 249/345 (72%), Positives = 286/345 (82%), Gaps = 6/345 (1%)

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVP--IESSAQSNAAPRRSGLS 118
           MAGKASL+DQQ+QV +NIHSQIK FC ++D +LL   KN+    ES  Q+ A PRRSGLS
Sbjct: 1   MAGKASLADQQKQVQENIHSQIKTFCNTLDXVLLPDSKNMNELPESCPQTKAXPRRSGLS 60

Query: 119 FAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEA 178
           FAVG    P D+  VPQT+ L R ELSQ LKD IGYTLDLKPSQIPH++AGQGLFL GEA
Sbjct: 61  FAVGRDGPPXDNSAVPQTKPLKRAELSQSLKDLIGYTLDLKPSQIPHKDAGQGLFLDGEA 120

Query: 179 NVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWD 238
           +VGAV+A+YPG+IY PAYY+YIPGYPRVDA NPYLITRYDGTVINAQPWG G +TRELWD
Sbjct: 121 DVGAVVALYPGVIYFPAYYQYIPGYPRVDAHNPYLITRYDGTVINAQPWGVGGETRELWD 180

Query: 239 GLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGM 298
           G ++ E+  N +GAEKGSD+ WKLLSKPM++K  GSG ++LERRNPLA AHFANHP+KGM
Sbjct: 181 GFSVTEVSANMQGAEKGSDRIWKLLSKPMESKHRGSGCDVLERRNPLAFAHFANHPSKGM 240

Query: 299 VPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSS----TPVL 354
            PNVM+CPYDFPLTE+DMR YIPN+ FG++ EV M+R GSFWFK GS + S      PVL
Sbjct: 241 SPNVMVCPYDFPLTEQDMRAYIPNVLFGSSXEVKMKRXGSFWFKSGSSNKSELDAHVPVL 300

Query: 355 KTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           KTL LVATRA+CDEEVLLNYRLSNSKRRP WY+PVDEEEDRRRWS
Sbjct: 301 KTLVLVATRALCDEEVLLNYRLSNSKRRPAWYTPVDEEEDRRRWS 345


>gi|195645104|gb|ACG42020.1| hypothetical protein [Zea mays]
 gi|413937914|gb|AFW72465.1| hypothetical protein ZEAMMB73_372625 [Zea mays]
          Length = 384

 Score =  499 bits (1286), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 240/383 (62%), Positives = 293/383 (76%), Gaps = 4/383 (1%)

Query: 21  FARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIIDMAGKASLSDQQQQVLDNIHS 80
           FARDPR LQFEAD+NRLFLYTSY RLG +A+E DAEEIID+A KAS++DQQ+QV DN+H 
Sbjct: 2   FARDPRHLQFEADVNRLFLYTSYYRLGENADEKDAEEIIDLASKASIADQQKQVQDNVHY 61

Query: 81  QIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFAVGPTARPT-DSPVVPQTRQL 139
           ++   C +MD IL     N P +  ++++   +RSGLSFAVG  A     S ++P TR L
Sbjct: 62  ELTHMCQAMDSILCPDTINDPSKGPSEAHNHSQRSGLSFAVGGAASANKQSAIIPATRPL 121

Query: 140 TRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRY 199
           TR ELS++ +D  GYTLD+ PS IPH++AGQGLFL G+AN GAV+AIYPG+IYSPAYYRY
Sbjct: 122 TRAELSKKFRDHFGYTLDIGPSGIPHKDAGQGLFLSGQANAGAVLAIYPGVIYSPAYYRY 181

Query: 200 IPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNS-KGAEKGSDQ 258
           IPGYPR+D  N YLITRYDGT+I+A+PW  G +TR LWDG  L +    + KG+E  SD+
Sbjct: 182 IPGYPRIDVCNNYLITRYDGTIIDAKPWCFGGETRALWDGSDLVDYNAMAPKGSENNSDR 241

Query: 259 FWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLTEKDMRP 318
            W++LSKP++     +  E+LERRNPLA  HFANHP KG  PNVMICPYDFPLTEKDMR 
Sbjct: 242 VWRMLSKPLEKGVRENFGEVLERRNPLAFGHFANHPPKGSSPNVMICPYDFPLTEKDMRV 301

Query: 319 YIPNISFGNAEEVNMRRFGSFWFKWGSGSGSS--TPVLKTLALVATRAICDEEVLLNYRL 376
           YIPNI+FG  E + M+RFGSF+FK G     +  + VLKTL LV+TR+ICDEE+ LNYR 
Sbjct: 302 YIPNITFGGQETIRMKRFGSFYFKSGGSDNQAGDSLVLKTLVLVSTRSICDEELFLNYRY 361

Query: 377 SNSKRRPVWYSPVDEEEDRRRWS 399
           SNSKRRP WYSPVDEEED+RRWS
Sbjct: 362 SNSKRRPEWYSPVDEEEDKRRWS 384


>gi|102139939|gb|ABF70082.1| hypothetical protein MA4_54B05.28 [Musa acuminata]
          Length = 592

 Score =  486 bits (1250), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 235/364 (64%), Positives = 279/364 (76%), Gaps = 6/364 (1%)

Query: 38  FLYTSYNRLGRDAEEADAEEIIDMAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPE 97
           +  +SYNRLG  AEE DA+EII+MA KAS+ DQQ+QV +NIH QIK  C  MDEIL +  
Sbjct: 205 YAKSSYNRLGNSAEENDADEIIEMANKASVKDQQKQVQENIHFQIKNMCKIMDEILRVDN 264

Query: 98  KNVPIESSA---QSNAAPRRSGLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGY 154
           KN+P   S+     N   RRSGLSFA+G      + PVVP T+ LTR+ELS  L D +GY
Sbjct: 265 KNIPDPPSSIPCSQNGPRRRSGLSFAIGKGTPSANEPVVPTTQPLTRSELSHCLMDRLGY 324

Query: 155 TLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLI 214
             D+KPSQIPHEEAGQGLFL GEA+VGA+IA YPG+IYSPAYYRYIPGYPRVDA N YLI
Sbjct: 325 APDIKPSQIPHEEAGQGLFLNGEADVGAIIAFYPGMIYSPAYYRYIPGYPRVDASNSYLI 384

Query: 215 TRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPN-SKGAEKGSDQFWKLLSKPMDNKRGG 273
           TRYDG VINAQPWGSG +TRELWDG   P+  P+ S+   +GSD+ W++LSKP++    G
Sbjct: 385 TRYDGNVINAQPWGSGGETRELWDGFYHPQYNPDTSEETGRGSDRMWRMLSKPLEASNRG 444

Query: 274 SGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNM 333
           +  E+LERRNPLA  HFANHP KG+ PNVM+CPYDFPLTEKDMR YIPN+ +G+ E + M
Sbjct: 445 TIGEVLERRNPLAFGHFANHPPKGIAPNVMVCPYDFPLTEKDMRVYIPNLIYGDEESITM 504

Query: 334 RRFGSFWFKWGSG--SGSSTPVLKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDE 391
           RRFG+FWF+ GS       +PVLKTL LVATRA+C+EEV LNYRLSN KRRP WY+PVDE
Sbjct: 505 RRFGTFWFRLGSSVDPDGDSPVLKTLVLVATRALCNEEVFLNYRLSNQKRRPAWYTPVDE 564

Query: 392 EEDR 395
           EED+
Sbjct: 565 EEDK 568


>gi|297806861|ref|XP_002871314.1| hypothetical protein ARALYDRAFT_487659 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317151|gb|EFH47573.1| hypothetical protein ARALYDRAFT_487659 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 395

 Score =  468 bits (1205), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 239/400 (59%), Positives = 300/400 (75%), Gaps = 11/400 (2%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
           M FLF KFQ+ V  LAKS TFA+ PRQLQFEAD+N+LF++TSYNRLGR+AEEADAEEII+
Sbjct: 6   MMFLFSKFQQGVGILAKSTTFAKSPRQLQFEADINKLFMFTSYNRLGRNAEEADAEEIIE 65

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFA 120
           +AGKA+L +QQ+QV +NIH Q++ FC SMDEILL        E +   + + R + +S A
Sbjct: 66  IAGKATLFEQQKQVQENIHYQVQNFCSSMDEILLTDIDKTKEELNKPGSESRRDNNVSAA 125

Query: 121 VGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANV 180
                   D P+VP+T+ L   E+S RLKD IGYTL +KPS IPH++AGQG F+ GEA+V
Sbjct: 126 --------DKPIVPETKPLKLAEVSHRLKDRIGYTLQIKPSLIPHKDAGQGCFIEGEADV 177

Query: 181 GAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDG- 239
           GAV+A YPG+IYSPA+YRYIPGYP VDAQN YLIT YDGTVINAQPWG G ++RE+W+G 
Sbjct: 178 GAVLAFYPGVIYSPAFYRYIPGYPNVDAQNSYLITGYDGTVINAQPWGRGGESREVWNGS 237

Query: 240 LTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMV 299
            T PEI  ++  AE GS + WK+LS+P+ +   GS   +LE RNPLA  HF NHP K M 
Sbjct: 238 FTTPEIRTDTTTAENGSSKVWKMLSRPLGDL--GSVEAVLEMRNPLAFGHFINHPGKDME 295

Query: 300 PNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLAL 359
           PNVM+CPYDFPL E +MR YIPN++FGN  ++ M RFGSF  + G+ S     VLKTL L
Sbjct: 296 PNVMMCPYDFPLPETEMRAYIPNVAFGNTGDIKMGRFGSFLSRTGNNSSLDARVLKTLVL 355

Query: 360 VATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           VAT+A+C+EE++LN RLSNS++R  WY P+DE+EDRR WS
Sbjct: 356 VATKALCNEELMLNDRLSNSEKRLEWYIPLDEQEDRREWS 395


>gi|224083906|ref|XP_002307168.1| predicted protein [Populus trichocarpa]
 gi|222856617|gb|EEE94164.1| predicted protein [Populus trichocarpa]
          Length = 368

 Score =  455 bits (1171), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 230/383 (60%), Positives = 288/383 (75%), Gaps = 19/383 (4%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
           M  LF KFQE VKTLAKSPT     +      D+ RLF+ TS NRLGR+A++ADA++II+
Sbjct: 1   MTSLFHKFQEVVKTLAKSPTDGTSTK------DLKRLFILTSCNRLGRNADKADAKDIIE 54

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILL--LPEKNVPIESSAQSNAAPRRSGLS 118
           MA K  +++QQ+Q+ +NIH QIK FC +M++ILL    +++   E+ AQSNA        
Sbjct: 55  MANKFYVAEQQKQIQENIHFQIKNFCTAMNDILLPDARKRSRKDEAPAQSNA-------- 106

Query: 119 FAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEA 178
           F+VG +  PTD   +P+ R+L+  E+SQ LKD IGYTLD+KPS +PHE+AGQGL + GEA
Sbjct: 107 FSVGKSGPPTDRHAIPEMRRLSHAEVSQSLKDHIGYTLDVKPSLMPHEKAGQGLLINGEA 166

Query: 179 NVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWD 238
           + GAVIAIYPGII+SPAYY+YIPGYPR+ A+N YLITR+DGTVINAQPWG G + RE+WD
Sbjct: 167 DAGAVIAIYPGIIHSPAYYQYIPGYPRIHARNRYLITRHDGTVINAQPWGFGGEIREVWD 226

Query: 239 GLTLPEIMPNSKGAEK-GSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKG 297
           G T+PEIMPN +   K G +  W++LSKP++    GS  ++LERRNPLALAHFAN  A G
Sbjct: 227 GFTVPEIMPNVQSTGKDGPELVWRVLSKPLEGTGVGSIGDVLERRNPLALAHFANQVAVG 286

Query: 298 MVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTL 357
           M  NVM CPYDFPLTE DMR YIPN+SF N  EVN+RR+GSFWF   S S S+ PVLKTL
Sbjct: 287 MHQNVMACPYDFPLTEMDMRTYIPNVSFANP-EVNIRRYGSFWFSSKS-SESNVPVLKTL 344

Query: 358 ALVATRAICDEEVLLNYRLSNSK 380
            LVATRA+CDEEVL++YRL+N K
Sbjct: 345 VLVATRALCDEEVLVDYRLTNMK 367


>gi|357473943|ref|XP_003607256.1| hypothetical protein MTR_4g075120 [Medicago truncatula]
 gi|355508311|gb|AES89453.1| hypothetical protein MTR_4g075120 [Medicago truncatula]
          Length = 371

 Score =  449 bits (1156), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 240/401 (59%), Positives = 291/401 (72%), Gaps = 42/401 (10%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFA--RDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEI 58
           MAFLFQKFQEAVKTLAKSPTFA  RDPRQLQ++ D+NRLFLYTSYNRLG++A EADAEEI
Sbjct: 1   MAFLFQKFQEAVKTLAKSPTFAFARDPRQLQYQIDLNRLFLYTSYNRLGKNASEADAEEI 60

Query: 59  IDMAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEK---NVPIESSAQSNAAPRRS 115
           I++A KAS++DQQ  V +N+H+QIK FC  MD + L  EK   +V  E S Q+   P+ S
Sbjct: 61  IEIATKASVADQQMVVQENVHAQIKAFCTFMDAVFLPNEKKVNDVSFELSQQTKILPQHS 120

Query: 116 GLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC 175
            L  A G          VP+ R +++ ELSQ+LKD +GYTL++KPSQI H++AGQGLFL 
Sbjct: 121 DLRSANG----------VPKQRLISQVELSQKLKDELGYTLNVKPSQISHKDAGQGLFLD 170

Query: 176 GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRE 235
           G  +VGAV+A YPG++YSPAYY +IPGY  +D QNPYLITR+DG VI+AQ WG G D +E
Sbjct: 171 GVVDVGAVVAFYPGVVYSPAYYHHIPGY--LDEQNPYLITRHDGNVIDAQLWGRGGDKKE 228

Query: 236 LWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPA 295
           LW+G  + +        EKGS          +DN       ++LERRNPLALAHFANHP+
Sbjct: 229 LWNGRKMVD--------EKGSQ---------VDN-----SDDVLERRNPLALAHFANHPS 266

Query: 296 KGMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPV-- 353
           KGM+PNVMICPYDFPL E DMR YIPN+ FGNA E N  RFGSFWFK      + + V  
Sbjct: 267 KGMLPNVMICPYDFPLIENDMRAYIPNVLFGNAAEENTERFGSFWFKSRVPRNNESHVPT 326

Query: 354 -LKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEE 393
            LKT+ LVATRA+ DEE+LLNYRL N+KR P WY+PVDEEE
Sbjct: 327 TLKTVVLVATRALQDEELLLNYRLGNTKRCPEWYAPVDEEE 367


>gi|15241572|ref|NP_196444.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332003893|gb|AED91276.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 386

 Score =  447 bits (1151), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 232/398 (58%), Positives = 290/398 (72%), Gaps = 24/398 (6%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
           M FLF KFQ+ V  LAKS TFA++PRQLQFEAD+N+LF++TSYNRLGRDAEEADAEEII+
Sbjct: 6   MMFLFGKFQQGVGVLAKSTTFAKNPRQLQFEADINKLFMFTSYNRLGRDAEEADAEEIIE 65

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLL-----PEKNVPIESSAQSNAAPRRS 115
           MAGKA+ S+QQ+QV +NIH Q++ FC SM+EILL       + N P   S + N  P   
Sbjct: 66  MAGKATFSEQQKQVQENIHYQLQNFCSSMNEILLTNIDKTKDLNEPGSESRRDNIVP--- 122

Query: 116 GLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC 175
                        D P+VP+T+ L   E+S RLKD +GYTL++KPS IPH++AGQG F+ 
Sbjct: 123 -----------AADKPIVPETKPLKLDEVSHRLKDRVGYTLEIKPSLIPHKDAGQGCFIE 171

Query: 176 GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRE 235
           GEA+VGAV+A YPG+IYSPA++RYIPGYP VDAQN YLIT YDGTVINAQPWG G  +RE
Sbjct: 172 GEADVGAVLAFYPGVIYSPAFHRYIPGYPNVDAQNSYLITGYDGTVINAQPWGRGGISRE 231

Query: 236 LWDG-LTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSE-MLERRNPLALAHFANH 293
           +W+G  T PEI   +   E  SD+ WK+LS+P++   G  G E ++E RNPLA  HF NH
Sbjct: 232 VWNGSFTTPEIRTEATTTENSSDKVWKMLSRPLE---GSGGVEAVIEMRNPLAFGHFINH 288

Query: 294 PAKGMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPV 353
           P K M  NVMICPYDF L+E +MR YIPN++FGN  +V M RF SF  + GS S     V
Sbjct: 289 PGKEMESNVMICPYDFLLSETEMRAYIPNVAFGNTGDVKMGRFESFLSRTGSNSSLDAHV 348

Query: 354 LKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDE 391
           LKT+ LVAT+A+C+EE++LNYRLSNS++R  WY P+DE
Sbjct: 349 LKTIVLVATKALCNEELMLNYRLSNSEKRLEWYIPLDE 386


>gi|26453064|dbj|BAC43608.1| unknown protein [Arabidopsis thaliana]
          Length = 386

 Score =  447 bits (1149), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 231/398 (58%), Positives = 290/398 (72%), Gaps = 24/398 (6%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
           M FLF KFQ+ V  LAKS TFA++PRQLQFEAD+N+LF++TSYNRLGRDAEEADAEEII+
Sbjct: 6   MMFLFGKFQQGVGVLAKSTTFAKNPRQLQFEADINKLFMFTSYNRLGRDAEEADAEEIIE 65

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLL-----PEKNVPIESSAQSNAAPRRS 115
           MAGKA+ S+QQ+QV +NIH Q++ FC SM+EILL       + N P   S + N  P   
Sbjct: 66  MAGKATFSEQQKQVQENIHYQLQNFCSSMNEILLTNIDKTKDLNEPGSESRRDNIVP--- 122

Query: 116 GLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC 175
                        D P+VP+T+ L   E+S RLKD +GYTL++KPS IPH++AGQG F+ 
Sbjct: 123 -----------AADKPIVPETKPLKLDEVSHRLKDRVGYTLEIKPSLIPHKDAGQGCFIE 171

Query: 176 GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRE 235
           GEA+VGAV+A YPG+IY+PA++RYIPGYP VDAQN YLIT YDGTVINAQPWG G  +RE
Sbjct: 172 GEADVGAVLAFYPGVIYTPAFHRYIPGYPNVDAQNSYLITGYDGTVINAQPWGRGGISRE 231

Query: 236 LWDG-LTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSE-MLERRNPLALAHFANH 293
           +W+G  T PEI   +   E  SD+ WK+LS+P++   G  G E ++E RNPLA  HF NH
Sbjct: 232 VWNGSFTTPEIRTEATTTENSSDKVWKMLSRPLE---GSGGVEAVIEMRNPLAFGHFINH 288

Query: 294 PAKGMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPV 353
           P K M  NVMICPYDF L+E +MR YIPN++FGN  +V M RF SF  + GS S     V
Sbjct: 289 PGKEMESNVMICPYDFLLSETEMRAYIPNVAFGNTGDVKMGRFESFLSRTGSNSSLDAHV 348

Query: 354 LKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDE 391
           LKT+ LVAT+A+C+EE++LNYRLSNS++R  WY P+DE
Sbjct: 349 LKTIVLVATKALCNEELMLNYRLSNSEKRLEWYIPLDE 386


>gi|218191231|gb|EEC73658.1| hypothetical protein OsI_08190 [Oryza sativa Indica Group]
          Length = 352

 Score =  432 bits (1112), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 228/410 (55%), Positives = 269/410 (65%), Gaps = 69/410 (16%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFA------RDPRQLQFEADMNRLFLYTSYNRLGRDAEEAD 54
           M  LF +FQEAVKTLAK+P FA      RDPR LQFEAD+NRLFLYTSY RLG +AEE D
Sbjct: 1   MGSLFIRFQEAVKTLAKNPMFANNRLFARDPRHLQFEADVNRLFLYTSYYRLGANAEEKD 60

Query: 55  AEEIIDMAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRR 114
           AEEIIDMA KAS+S+QQ+QV +N+H Q+   C +MD ILL   KN     ++++N  PRR
Sbjct: 61  AEEIIDMASKASVSEQQKQVQENVHYQLTNMCQAMDSILLPDTKN----GASEANNYPRR 116

Query: 115 SGLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFL 174
           SGLSFAVG      + P                                           
Sbjct: 117 SGLSFAVGTGVASANKP------------------------------------------- 133

Query: 175 CGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTR 234
                     AIYPG++YSPAYYRYIPGYP++DA N YLITRYDGT+I+A+PW  G D+R
Sbjct: 134 ----------AIYPGVVYSPAYYRYIPGYPKIDACNNYLITRYDGTIIDAKPWQLGGDSR 183

Query: 235 ELWDGLTLPEI--MPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFAN 292
           E+WDG  L +   +P SK  E  SD+ W++LSKP+      +  E+LERRNPLA  HFAN
Sbjct: 184 EIWDGSDLVDYNAVP-SKSQESNSDRAWRMLSKPLKKGHTENFGEVLERRNPLAFGHFAN 242

Query: 293 HPAKGMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEE-VNMRRFGSFWFKWGSGSG--S 349
           HP KG  PNVMICPYDFPLTEKDMR YIPNI+FG  EE V M+RFGSFWFK G       
Sbjct: 243 HPPKGSTPNVMICPYDFPLTEKDMRVYIPNITFGGEEEPVTMKRFGSFWFKSGRSGNQVG 302

Query: 350 STPVLKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
            +PVLKTL LV+TR+ICDEE+ LNYR SNSK+RP WY PVDEEED+RRWS
Sbjct: 303 ESPVLKTLVLVSTRSICDEELFLNYRYSNSKKRPEWYIPVDEEEDKRRWS 352


>gi|449531573|ref|XP_004172760.1| PREDICTED: uncharacterized protein LOC101224517, partial [Cucumis
           sativus]
          Length = 270

 Score =  417 bits (1071), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 196/270 (72%), Positives = 230/270 (85%), Gaps = 4/270 (1%)

Query: 132 VVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGII 191
            +P TR L R+ELSQ+LKD IGYTLD+KPS+IPH++AGQGLF+ GEA+VG++IAIYPG++
Sbjct: 3   TIPTTRPLGRSELSQQLKDEIGYTLDIKPSRIPHKDAGQGLFIDGEADVGSIIAIYPGVV 62

Query: 192 YSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKG 251
           YSPA+Y+YIPGYPRVDAQNPYLITRYDGTVINAQPWG G DTRE+W GLT+P   P  +G
Sbjct: 63  YSPAHYQYIPGYPRVDAQNPYLITRYDGTVINAQPWGLGADTREVWHGLTVPNSNPTKQG 122

Query: 252 AEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPL 311
            EK SD+ W++LSKP++ KR   G + +ERRNPLA AHFANHPAK MVPNVM+CPYDFPL
Sbjct: 123 DEK-SDRLWRMLSKPLEAKRVQHGGDAIERRNPLAFAHFANHPAKDMVPNVMLCPYDFPL 181

Query: 312 TEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSG--SGSSTPVLKTLALVATRAICDEE 369
           TEKDMR YIPN+   N EEV M+RFGSFWFK G    +GS TP+LKT+ LVATRA+C+EE
Sbjct: 182 TEKDMRVYIPNVPVAN-EEVGMKRFGSFWFKSGRSRINGSDTPILKTIVLVATRALCNEE 240

Query: 370 VLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           VLLNYRLSNSKRRP WY+PVDE EDRRRWS
Sbjct: 241 VLLNYRLSNSKRRPSWYTPVDEGEDRRRWS 270


>gi|359806342|ref|NP_001241484.1| uncharacterized protein LOC100782051 [Glycine max]
 gi|255642207|gb|ACU21368.1| unknown [Glycine max]
          Length = 322

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 198/328 (60%), Positives = 247/328 (75%), Gaps = 23/328 (7%)

Query: 73  QVLDNIHSQIKRFCLSMDEILLLPEKNV--PIESSAQSNAAPRRSGLSFAVGPTARPTDS 130
           QV +N+HSQIK FC  MDE+LL  EK V  P+E S Q N++P  SGL+ A          
Sbjct: 2   QVKENVHSQIKAFCTYMDEMLLPNEKRVNDPMELSQQENSSPHHSGLNSA---------- 51

Query: 131 PVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGI 190
            VV + R  T+ E+SQ+LKD +GY L++KPSQI H+++GQGLFL G A+VGAVIA YPG+
Sbjct: 52  KVVSKQRSYTQAEVSQKLKDQLGYKLNVKPSQISHKDSGQGLFLDGAADVGAVIAFYPGV 111

Query: 191 IYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSK 250
           +YSPAYY+YIPGY  +D QNPYLITRYDGTVI+AQPWG G +  E ++G  + E   +++
Sbjct: 112 VYSPAYYQYIPGY--LDEQNPYLITRYDGTVIDAQPWGCGGEGPERFNGRKMLERKLDNE 169

Query: 251 GAEKGSDQFWKLLSKPMDNKRGGSGSEM--LERRNPLALAHFANHPAKGMVPNVMICPYD 308
           GAEKGSD+    +SKP++  +  + +E   +ERRNPLALAHFANHPAKGM+PNVMICPYD
Sbjct: 170 GAEKGSDR----MSKPLEGSQVENDNENDDVERRNPLALAHFANHPAKGMLPNVMICPYD 225

Query: 309 FPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPV--LKTLALVATRAIC 366
           FPLTE +MR YIPNI FGNA EVN++R GSFWFK G    S + V  LKT+ LVATR++ 
Sbjct: 226 FPLTENNMRVYIPNILFGNA-EVNIKRLGSFWFKSGVSRNSQSDVLTLKTVVLVATRSLQ 284

Query: 367 DEEVLLNYRLSNSKRRPVWYSPVDEEED 394
           DEE+LLNYRLSNSK+ P WY+PVDEEED
Sbjct: 285 DEELLLNYRLSNSKKWPEWYAPVDEEED 312


>gi|8346564|emb|CAB93728.1| hypothetical protein (fragment at BAC end) [Arabidopsis thaliana]
          Length = 325

 Score =  377 bits (967), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 197/335 (58%), Positives = 244/335 (72%), Gaps = 24/335 (7%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
           M FLF KFQ+ V  LAKS TFA++PRQLQFEAD+N+LF++TSYNRLGRDAEEADAEEII+
Sbjct: 6   MMFLFGKFQQGVGVLAKSTTFAKNPRQLQFEADINKLFMFTSYNRLGRDAEEADAEEIIE 65

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLL-----PEKNVPIESSAQSNAAPRRS 115
           MAGKA+ S+QQ+QV +NIH Q++ FC SM+EILL       + N P   S + N  P   
Sbjct: 66  MAGKATFSEQQKQVQENIHYQLQNFCSSMNEILLTNIDKTKDLNEPGSESRRDNIVP--- 122

Query: 116 GLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC 175
                        D P+VP+T+ L   E+S RLKD +GYTL++KPS IPH++AGQG F+ 
Sbjct: 123 -----------AADKPIVPETKPLKLDEVSHRLKDRVGYTLEIKPSLIPHKDAGQGCFIE 171

Query: 176 GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRE 235
           GEA+VGAV+A YPG+IYSPA++RYIPGYP VDAQN YLIT YDGTVINAQPWG G  +RE
Sbjct: 172 GEADVGAVLAFYPGVIYSPAFHRYIPGYPNVDAQNSYLITGYDGTVINAQPWGRGGISRE 231

Query: 236 LWDG-LTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSE-MLERRNPLALAHFANH 293
           +W+G  T PEI   +   E  SD+ WK+LS+P++   G  G E ++E RNPLA  HF NH
Sbjct: 232 VWNGSFTTPEIRTEATTTENSSDKVWKMLSRPLE---GSGGVEAVIEMRNPLAFGHFINH 288

Query: 294 PAKGMVPNVMICPYDFPLTEKDMRPYIPNISFGNA 328
           P K M  NVMICPYDF L+E +MR YIPN++FGN 
Sbjct: 289 PGKEMESNVMICPYDFLLSETEMRAYIPNVAFGNT 323


>gi|296085325|emb|CBI29057.3| unnamed protein product [Vitis vinifera]
          Length = 383

 Score =  332 bits (852), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 168/238 (70%), Positives = 191/238 (80%), Gaps = 7/238 (2%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
           MAFLFQKFQEAVKTLAKSP FARDPR LQFEAD+NRLFLYTSYNRLGR+A+EADAE IID
Sbjct: 83  MAFLFQKFQEAVKTLAKSPVFARDPRHLQFEADINRLFLYTSYNRLGRNADEADAENIID 142

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPI--ESSAQSNAAPRRSGLS 118
           MAGKASL+DQQ+QV +NIHSQIK FC ++D +LL   KN+    ES  Q+ A PRRSGLS
Sbjct: 143 MAGKASLADQQKQVQENIHSQIKTFCNTLDGVLLPDSKNMNELPESCPQTKATPRRSGLS 202

Query: 119 FAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEA 178
           FAVG    P+D+  VPQT+ L R ELSQ LKD IGYTLDLKPSQIPH++AGQGLFL GEA
Sbjct: 203 FAVGRDGPPSDNSAVPQTKPLKRAELSQSLKDLIGYTLDLKPSQIPHKDAGQGLFLDGEA 262

Query: 179 NVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITR-----YDGTVINAQPWGSGW 231
           +VGAV+A+YPG+IY PAYY+YIPGYPRVDA NPYLIT        G  +  +  GS W
Sbjct: 263 DVGAVVALYPGVIYFPAYYQYIPGYPRVDAHNPYLITSKPMEAKHGGSVKMKRLGSFW 320



 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 56/73 (76%), Positives = 60/73 (82%), Gaps = 4/73 (5%)

Query: 331 VNMRRFGSFWFKWGSGSGS----STPVLKTLALVATRAICDEEVLLNYRLSNSKRRPVWY 386
           V M+R GSFWFK GS + S      PVLKTL LVATRA+CDEEVLLNYRLSNSKRRP WY
Sbjct: 311 VKMKRLGSFWFKSGSSNKSELDAHVPVLKTLVLVATRALCDEEVLLNYRLSNSKRRPAWY 370

Query: 387 SPVDEEEDRRRWS 399
           +PVDEEEDRRRWS
Sbjct: 371 TPVDEEEDRRRWS 383


>gi|302763379|ref|XP_002965111.1| hypothetical protein SELMODRAFT_167135 [Selaginella moellendorffii]
 gi|300167344|gb|EFJ33949.1| hypothetical protein SELMODRAFT_167135 [Selaginella moellendorffii]
          Length = 399

 Score =  322 bits (826), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 176/393 (44%), Positives = 235/393 (59%), Gaps = 19/393 (4%)

Query: 16  AKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIIDMAGKASLSDQQQQVL 75
           AKS     D R++Q E D++RLFL TSY RL   A + D + I  +       DQ+ QV 
Sbjct: 10  AKSKVVVPDYRKVQKEIDISRLFLLTSYRRLDSSATDEDLDRIEQLGRTVPFPDQEHQVQ 69

Query: 76  DNIHSQIKRFCLSMDEILLLP-EKNVPIESSAQSNAAPRRSGLSFAVGPTARPTDSPVV- 133
           +NIH Q++    ++D I L   E+   +E   +     R SGL+FAV  +    +   + 
Sbjct: 70  ENIHDQVRNISRALDTIFLAGCERKKSLEEGFK-----RPSGLAFAVTKSKPDIEVQKLL 124

Query: 134 --PQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGII 191
             P T  L + ++ + LK+ +G++L LK S++P   AGQGLFL G A  G V+A+YPG+I
Sbjct: 125 GPPATASLGKYDIDEALKNQLGFSLHLKDSEVPGHNAGQGLFLEGIARTGTVVALYPGVI 184

Query: 192 YSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKG 251
           YSPA Y YIPGYP++D  NPYL++RY+G VI+ + WG G  +RE WDG   PE       
Sbjct: 185 YSPAKYWYIPGYPKLD--NPYLVSRYEGYVIDGKSWGKGGTSREFWDGYDEPEEKEEDLE 242

Query: 252 AEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPL 311
             K       ++ K  +   G + S+++ERRNPLALAHF NHP KG +PNVMICPYDFP 
Sbjct: 243 DRKEEQGEEAMVDKASE---GEAASQVIERRNPLALAHFVNHPPKGQLPNVMICPYDFPE 299

Query: 312 TEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWG-----SGSGSSTPVLKTLALVATRAIC 366
            E+DMRPYIPN+ F +    NM R G  W K       S        L+ L LV TR +C
Sbjct: 300 HEEDMRPYIPNLRFEDDTVKNMERRGKSWSKVPLPEVLSAKDRDPTYLRGLVLVTTRNVC 359

Query: 367 DEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           +EE+LLNYRLSN K +P WY PVDE E+RRRW+
Sbjct: 360 NEELLLNYRLSNPKTKPEWYHPVDEAEERRRWA 392


>gi|302757549|ref|XP_002962198.1| hypothetical protein SELMODRAFT_77390 [Selaginella moellendorffii]
 gi|300170857|gb|EFJ37458.1| hypothetical protein SELMODRAFT_77390 [Selaginella moellendorffii]
          Length = 407

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 176/393 (44%), Positives = 235/393 (59%), Gaps = 19/393 (4%)

Query: 16  AKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIIDMAGKASLSDQQQQVL 75
           AKS     D R++Q E D++RLFL TSY RL   A + D + I  +       DQ+ QV 
Sbjct: 10  AKSKIVVPDYRKVQKEIDISRLFLLTSYRRLDSSATDEDLDRIEQLGRTVPFPDQEHQVQ 69

Query: 76  DNIHSQIKRFCLSMDEILLLP-EKNVPIESSAQSNAAPRRSGLSFAVGPTARPTDSPVV- 133
           +NIH Q++    ++D I L   E+   +E   +     R SGL+FAV  +    +   + 
Sbjct: 70  ENIHDQVRNISRALDTIFLAGCERKKSLEEGFK-----RPSGLAFAVTKSKPDIEVQKLL 124

Query: 134 --PQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGII 191
             P T  L + ++ + LK+ +G++L LK S++P   AGQGLFL G A  G V+A+YPG+I
Sbjct: 125 GPPATASLGKYDIDEALKNQLGFSLHLKDSEVPGHNAGQGLFLEGIARTGTVVALYPGVI 184

Query: 192 YSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKG 251
           YSPA Y YIPGYP++D  NPYL++RY+G VI+ + WG G  +RE WDG   PE       
Sbjct: 185 YSPAKYWYIPGYPKLD--NPYLVSRYEGYVIDGKSWGKGGTSREFWDGYDEPEEKEEDLE 242

Query: 252 AEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPL 311
             K       ++ K  +   G + S+++ERRNPLALAHF NHP KG +PNVMICPYDFP 
Sbjct: 243 DRKEEQGEEAMVDKASE---GEAASQVIERRNPLALAHFVNHPPKGQLPNVMICPYDFPE 299

Query: 312 TEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWG-----SGSGSSTPVLKTLALVATRAIC 366
            E+DMRPYIPN+ F +    NM R G  W K       S        L+ L LV TR +C
Sbjct: 300 HEEDMRPYIPNLRFEDDTVKNMERRGKSWSKVPLPEVLSAKDRDPTYLRGLVLVTTRNVC 359

Query: 367 DEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           +EE+LLNYRLSN K +P WY PVDE E+RRRW+
Sbjct: 360 NEELLLNYRLSNPKTKPEWYHPVDEAEERRRWA 392


>gi|168018707|ref|XP_001761887.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686942|gb|EDQ73328.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 412

 Score =  298 bits (762), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 171/403 (42%), Positives = 236/403 (58%), Gaps = 35/403 (8%)

Query: 24  DPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIIDMAGKASLSDQQQQVLDNIHSQIK 83
           D R+LQ E D ++LFL TS+ R     E   AEE+  +A      +QQ+QV + +H QIK
Sbjct: 16  DKRRLQHEIDTHKLFLLTSFQREATSGEIGVAEELTKLAEATPFEEQQRQVQEQVHGQIK 75

Query: 84  RFCLSMDEILL---LPEKNVPIESSAQSNAAPRRSGLSFAVGPTARPTDSPVVP--QTRQ 138
           +    +D ILL   LPE  +P   S +     R SGL FA+G    P   P     +T+ 
Sbjct: 76  KLSTVLDSILLKDPLPE--IPSPPSEKK----RPSGLFFALGRQPEPEAPPRANAIETKP 129

Query: 139 LTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYR 198
           LTR +++   ++  G+TL L+ S IPHEEAG GLF+ G A  G V+++YPG++YSP+ YR
Sbjct: 130 LTREQVASAFREHTGFTLCLQRSGIPHEEAGTGLFIKGRAEPGTVVSMYPGVVYSPSQYR 189

Query: 199 YIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWD---GLTLPEIMPNSKGAEKG 255
           YIPGYP+VD  N YLI+RYDG VI+ +PWG G D RE WD   G++  +    +  A + 
Sbjct: 190 YIPGYPKVDVDNGYLISRYDGLVIDGKPWGKGDDIREWWDGHNGISFEQAAELTGPARQS 249

Query: 256 SDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLT-EK 314
              FW+ +      +    G+E LERR+P+AL HFANHP +G+ PNVMICPY F ++ ++
Sbjct: 250 LPSFWQFVGGGKVMQGPLEGAE-LERRSPIALGHFANHPPEGVHPNVMICPYTFSISGQQ 308

Query: 315 DMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPV-------------------LK 355
            MR YIPN+ FGN EE +M+R G  W   G       P                    + 
Sbjct: 309 SMRAYIPNVMFGNDEEFDMQRRGPLWINEGKKERPEDPADTRLSTVERSEQPITLISDVH 368

Query: 356 TLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRW 398
           TL LV TR + +EE+ LNYRLS+  ++P WY PV+ EE++RRW
Sbjct: 369 TLVLVTTREVDNEELFLNYRLSSYIQQPEWYHPVNVEEEKRRW 411


>gi|388519347|gb|AFK47735.1| unknown [Lotus japonicus]
          Length = 153

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 95/147 (64%), Positives = 114/147 (77%), Gaps = 7/147 (4%)

Query: 250 KGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDF 309
           +GAEKG+D     LSKP++  +  S  ++LE RNPL+LAHFANHPAKGM+PNVM+C YDF
Sbjct: 9   EGAEKGADS----LSKPLEGSQVDSSDDVLECRNPLSLAHFANHPAKGMLPNVMVCSYDF 64

Query: 310 PLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWG--SGSGSSTPVLKTLALVATRAICD 367
           PLTE +MR YIPNI +GNA EVN++ F  FWFK G    S S  P L T+ LVATRA+ D
Sbjct: 65  PLTENNMRVYIPNILYGNA-EVNVKSFSRFWFKNGVSRNSESKVPTLNTVVLVATRALQD 123

Query: 368 EEVLLNYRLSNSKRRPVWYSPVDEEED 394
           EE+LLNYRLSNSKRRP WY+P+DEEED
Sbjct: 124 EELLLNYRLSNSKRRPEWYAPLDEEED 150


>gi|384246392|gb|EIE19882.1| hypothetical protein COCSUDRAFT_44295 [Coccomyxa subellipsoidea
           C-169]
          Length = 261

 Score =  191 bits (486), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 108/260 (41%), Positives = 148/260 (56%), Gaps = 47/260 (18%)

Query: 149 KDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDA 208
           ++ +G++L L PS + H +AG+GL+L G A  GAV+ +YPGIIY+ ++YR +PGYP+VD 
Sbjct: 40  EERMGFSLCLAPSAVGHADAGEGLWLRGRAKTGAVVGLYPGIIYTRSHYREMPGYPKVDR 99

Query: 209 QNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMD 268
           ++ +LI+R+DG +++A+PWG G         L  P+   +    +  +D   +L      
Sbjct: 100 ESDFLISRFDGAILDAKPWGRG--------ALEEPQSAESLGYRDGLADALTQL------ 145

Query: 269 NKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLTEKDMRPYIPNISFGN- 327
                    +LERR+P +LAH+ANHP  G  PN+MI  YDF       R YIPN  FG+ 
Sbjct: 146 --------HVLERRHPFSLAHYANHPPAGRRPNIMIAAYDFQNCRDGTRRYIPNAMFGSP 197

Query: 328 --------AEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEEVLLNYRLSNS 379
                   AEE+ M                  P+L  LALVATR I DEE+LLNYRLS  
Sbjct: 198 RASPGLTGAEEIPM---------------DDRPLL-CLALVATREIEDEELLLNYRLSPG 241

Query: 380 KRRPVWYSPVDEEEDRRRWS 399
             RP WY+ VDE ED RRWS
Sbjct: 242 LARPSWYTAVDEAEDSRRWS 261


>gi|449512760|ref|XP_004164133.1| PREDICTED: uncharacterized LOC101224517 [Cucumis sativus]
          Length = 135

 Score =  181 bits (458), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 95/135 (70%), Positives = 106/135 (78%), Gaps = 9/135 (6%)

Query: 1   MAFLFQKFQEAVKTLAKSPTF------ARDPRQLQFEADMNRLFLYTSYNRLGRDAEEAD 54
           MAFLFQKFQEAVK LAK+P F      ARDPR+LQFEADMNRLFL+TSYNRLG DA EAD
Sbjct: 1   MAFLFQKFQEAVKVLAKNPMFTKNPLFARDPRRLQFEADMNRLFLFTSYNRLGMDATEAD 60

Query: 55  AEEIIDMAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRR 114
           AEEIIDMA KAS +DQQ+QV +NIHSQ+K FC+ MD ILL  E   P ES  +S  A R+
Sbjct: 61  AEEIIDMASKASFADQQKQVQENIHSQVKSFCMHMDGILLPDE---PAESPEKSKDAVRK 117

Query: 115 SGLSFAVGPTARPTD 129
           SGLSFAVG T+ PTD
Sbjct: 118 SGLSFAVGRTSPPTD 132


>gi|255089437|ref|XP_002506640.1| predicted protein [Micromonas sp. RCC299]
 gi|226521913|gb|ACO67898.1| predicted protein [Micromonas sp. RCC299]
          Length = 435

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 136/447 (30%), Positives = 205/447 (45%), Gaps = 110/447 (24%)

Query: 26  RQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIIDMAGKASLSDQQQQVLDNIHSQIKRF 85
           R++Q E D+  LF+ TS +   RD    D  ++ID+     LS++Q+             
Sbjct: 23  RRVQEELDLRELFVLTSSH--ARDVPVDDDADLIDV-----LSEEQR------------- 62

Query: 86  CLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFAVGPTARPTDSPVVPQTRQLTRTELS 145
            L+ DE+     KN  + ++  +     R  L        R TD          +R +++
Sbjct: 63  GLTYDEVKAEVTKN--LTANLHTMCGEVRDAL--------RATDG-----VGHGSRDDVT 107

Query: 146 QRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPR 205
              +   G++  +  S  P+  AG+GLF+ G+ + G ++ +YPG++Y+P   R++PGYP+
Sbjct: 108 AEYRRRAGFSAHVAESTAPN--AGRGLFVNGQVDPGEIVCVYPGVVYAPENLRHMPGYPK 165

Query: 206 VDAQNPYLITRYDGTVINAQPWG--SGWDTRELW----DGLT---------------LPE 244
           + A NPYL+ RYDG+V++A PWG  +GW + +      DG T                P 
Sbjct: 166 IAADNPYLVARYDGSVVDASPWGLGAGWSSAKTSLDGDDGWTDGERRRVQGFGRWPGAPI 225

Query: 245 IMPNSKGAEKGSDQ-------FWKLLS---KPMDNKR---GGSGSEMLERRNPLALAHFA 291
              +++G   G D         W   +    P+D  +     + +E ++RR+PLALAH A
Sbjct: 226 GTDDARGDSPGGDSPGGDSPGGWLRAAAGETPLDPTQLTDARTAAESVDRRHPLALAHLA 285

Query: 292 NHPAKGMVPNVMICPYDFPL---TE------KDMRPYIPNISFGN-----AEEVNMRRFG 337
           NHP KG  PNV +   D      TE      K +R Y+PN+  G            RR G
Sbjct: 286 NHPPKGTSPNVAVASVDVVFDAETEELDKDVKALRRYLPNVGLGRWRVPLGTGEAKRRGG 345

Query: 338 SFWFKWGSGS--GS-----------------------STPVLKTLALVATRAICDEEVLL 372
           +   K   G   GS                          V+ TL LVA R I DEEV L
Sbjct: 346 AGEAKRARGGFVGSLARMFGEDDEDGDIEEAEAEAPNGKVVVPTLVLVAKRFILDEEVFL 405

Query: 373 NYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           NYRLS   +RP WY PV+ EE++RRW+
Sbjct: 406 NYRLSTHVKRPDWYHPVNAEEEKRRWA 432


>gi|308811843|ref|XP_003083229.1| unnamed protein product [Ostreococcus tauri]
 gi|116055108|emb|CAL57504.1| unnamed protein product [Ostreococcus tauri]
          Length = 302

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 105/287 (36%), Positives = 138/287 (48%), Gaps = 43/287 (14%)

Query: 150 DAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQ 209
           +A G+   L  S I    AG G+F+ G    G V+A YPG+ Y  +  RY+ GYPRV   
Sbjct: 22  EAYGHHCYLDASTI-GAAAGTGVFVRGNVGAGRVVATYPGLTYVGSAVRYMKGYPRVGVN 80

Query: 210 NPYLITRYDGTVINAQPWGSGWDTREL---WDGLTLPEIMPNSKGAEKGSDQFWKLLSKP 266
           N YL+ R DG VI+ +PWG G D  +    W G  +       + A+ GS     L  + 
Sbjct: 81  NEYLMARSDGCVIDGKPWGRGSDEGDGFGDWPGSPVKLTEEEEREADGGSFFTRLLRPRL 140

Query: 267 MDNKRGGSGSEM--LERRNPLALAHFANHPAKGMVPNVMICPYDFPLTEKDMRPYIPNIS 324
              +R    +E   LER NP A AHFANHP +G  PNV++   D P ++ D R Y+PN++
Sbjct: 141 PRERRERLLAECMSLERDNPYACAHFANHPPEGTSPNVIVATVDIP-SDPDTRRYVPNVN 199

Query: 325 F------------------------GNAEEVNMRRFGSFWFK-------WGSGSGSSTPV 353
                                        +V+   FG    K         SG GS    
Sbjct: 200 IEPDDGDKSSESVWSIIRDGERTIGERVRDVSASLFGEDADKPKAKSTEPTSGDGS---- 255

Query: 354 LKTLALVATRAICD-EEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           ++TL LVATR I D EE+ LNYRLS     P WY+ VD EED RRW+
Sbjct: 256 IRTLVLVATRDIEDGEEIFLNYRLSTHVTPPSWYTRVDPEEDDRRWA 302


>gi|145353865|ref|XP_001421220.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581457|gb|ABO99513.1| predicted protein, partial [Ostreococcus lucimarinus CCE9901]
          Length = 295

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 97/289 (33%), Positives = 132/289 (45%), Gaps = 35/289 (12%)

Query: 143 ELSQRLKDAIGYTLDLKPSQIPHE--EAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYI 200
           +L++  +       D+ PS        AG GLF  G    G V+A YPG+ Y  A  R++
Sbjct: 8   QLAELARATPARACDVAPSTAGRSGASAGDGLFARGAVAAGRVVATYPGLTYVGAATRFM 67

Query: 201 PGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFW 260
             YPRV  +N YLI R DG+VI+A+PWG G +  + W G   P  +  ++        + 
Sbjct: 68  KDYPRVTRENEYLIARSDGSVIDAEPWGRGGENDDAWPGA--PVALSEAEARRADGASWL 125

Query: 261 KLLSKPM----DNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLTEKDM 316
           + L KP        R       L+R + LALAH+ANHP  G  PNV++   D P  + D 
Sbjct: 126 ERLRKPTLPRATRARLARECATLDRGDELALAHYANHPPSGTAPNVVVASVDVP-RDSDA 184

Query: 317 RPYIPNISFG------------------------NAEEVNMRRFGSFWFK-WGSGSGSST 351
           R    N++                           A + ++  FG    K    G     
Sbjct: 185 RRRTQNVNVECEEEGEETESVWSIVRDGERSLVERARDASLAMFGEDADKPRARGGTVRR 244

Query: 352 PVLKTLALVATRAICD-EEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
            V++ L LV+TR I D EE+ LNYRLS     P WY  VD EED RRWS
Sbjct: 245 DVVRGLVLVSTRDIEDGEEIWLNYRLSTHVTPPAWYVRVDPEEDERRWS 293


>gi|412986804|emb|CCO15230.1| predicted protein [Bathycoccus prasinos]
          Length = 449

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 130/477 (27%), Positives = 211/477 (44%), Gaps = 132/477 (27%)

Query: 18  SPTFARDPRQ---LQFEADMNRLFLYTSYNRLGRDAEEADAEEIIDMAGKASLSDQQQQV 74
           + +F++ PRQ   LQ E+D+ +LFL T +            E  + M  + +  + +++ 
Sbjct: 10  ASSFSKKPRQGRALQEESDVQKLFLLTQFR-----------ERTMMMTRERNEREGEEE- 57

Query: 75  LDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFAVGPTARPTDSPVVP 134
                S+ +R  ++  E+L         E+S + ++A R++ +  +V          ++ 
Sbjct: 58  ----DSRARREAVA-KEVL---------EASTKCSSAERKARVWRSVQTQLTTIGLDLMG 103

Query: 135 QTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSP 194
           +T  ++ T  + +          +KPS     E  +G+FL G A+VG V+++YPGI+Y+ 
Sbjct: 104 RTGDISETNTTNKNT--------VKPSDGVENE--EGVFLNGPASVGTVVSLYPGIVYAK 153

Query: 195 AYYRYIPGYPRVDAQNPYLITRYDGTVINAQPW-------------------------GS 229
              R++ G+P V  +N +L+ RYD TV++A+PW                           
Sbjct: 154 EALRFLKGFPNVAKENAFLVARYDDTVVDAKPWEEERRRSSSERSSSGVVEGSSDSESSI 213

Query: 230 GWDTRELWDGLTLPEIMPNSK--GAEKGSDQFWKL------LSKPMDN---KRG--GSGS 276
                  W G  L E +   K  GA K  +   KL      LS+P  +   KR      +
Sbjct: 214 NSSGSSSWPGFPLREDLEKEKIVGALK-KEWVEKLPPWLEKLSRPELHAKIKRACLDVST 272

Query: 277 EMLER--RNPLALAHFANHPAKGMVPNVMICPYDFPLTEKD------------------- 315
           ++L +  ++P+ALAH ANHP +G  PNVMIC +D+ + E                     
Sbjct: 273 DLLVQAGKHPIALAHKANHPPQGKTPNVMICMHDYDVEEGAKEALRMMNMDTEDSQKLER 332

Query: 316 ----MRPYIPNISFGN----------AEEVNMRRFGSFWFKWGSGSGSSTP--------- 352
               +R  IPN+   N          A+      F      + +GS    P         
Sbjct: 333 MKHFVRAAIPNVYLSNDIRDYERFDPAKSARNASFLKMVSSFAAGSVFEDPNEKAKDLNE 392

Query: 353 ------VLK---TLALVATRAI-CDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
                 V+K   T+ALVATR I  +EE+ L+YRLS+   RP WY PVDEE ++RRWS
Sbjct: 393 AQMVDIVMKAAPTIALVATRDIEANEELFLDYRLSSHVERPDWYVPVDEEAEKRRWS 449


>gi|449018475|dbj|BAM81877.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 377

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 111/393 (28%), Positives = 171/393 (43%), Gaps = 55/393 (13%)

Query: 26  RQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIIDMAGKASLSDQQQQVLDNIHSQIKRF 85
           R+LQ + D  +LFL       G +   A A+EI+  A +A+ +     +     S + + 
Sbjct: 20  RELQHQRDNMQLFLLAVSYEKGFEQARALAQEIVSSAEQATRASSPLLMFRRALS-VWKH 78

Query: 86  CLSMDEILLLPEKNVPIESSAQSNAAPR---RSGLSFAVGPTARPTDSPVVPQTRQLTRT 142
             S++     P+ ++  +  AQS    R   R   +      + PT +            
Sbjct: 79  SASLEMKTDSPQ-HILDDQGAQSRLIERVNARQVHNLLAAAASMPTSASAA--------- 128

Query: 143 ELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPG 202
                      +TL L+PSQ  +  AG G+ L G A +G+V+A YPGI Y   ++  IPG
Sbjct: 129 ------TSGPAFTLSLRPSQ-SNVNAGYGVILNGHARMGSVVAFYPGIAYRVRHWTSIPG 181

Query: 203 YPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKL 262
           YP + AQNPYL  RYD  +++A P        E+  GL    I+ ++            L
Sbjct: 182 YPYITAQNPYLFARYDRVIVDANP------NLEVSRGLMNIPILDHA------------L 223

Query: 263 LSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLT--EKDMRPYI 320
            S     ++  S  E     NP AL H  NHP +G   NVM C ++F L   E  +  +I
Sbjct: 224 ESVVRHTRQTESLHECWVGLNPYALGHRVNHPPRGREANVMPCAWNFDLQRLEPSLLSFI 283

Query: 321 PNISFGNAEEVN----------MRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEEV 370
           PN  F    +++            RF  ++   G     S   +  + L+A R + DEE+
Sbjct: 284 PNRLFTKYRQLSSGILGPTSMLWSRFRHWFDSLGHQDNMSLTEIPGVVLIALRDLQDEEL 343

Query: 371 LLNYRLSNSK----RRPVWYSPVDEEEDRRRWS 399
            LNYR + +     + P WY   D+E  +RRWS
Sbjct: 344 FLNYRYNPAALERLQAPSWYWQPDQEAAKRRWS 376


>gi|452820490|gb|EME27532.1| hypothetical protein isoform 1 [Galdieria sulphuraria]
 gi|452820491|gb|EME27533.1| hypothetical protein isoform 2 [Galdieria sulphuraria]
          Length = 436

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 108/429 (25%), Positives = 183/429 (42%), Gaps = 106/429 (24%)

Query: 31  EADMNRLFLYTSYNRLGRDAEE-----ADAEEIIDMAGKASLSD------QQQQVLD-NI 78
           + D  RL++ TS  R G+ A +      D  E ID   +    +      +Q+++L+ NI
Sbjct: 44  QNDTLRLYVLTSLKRDGKKAADFFSRIMDGSETIDRVTQKKFEEFLPSPEEQKEILERNI 103

Query: 79  HSQI------------KRFCLSMDEILLL-----------PEKNVPIESSAQSNAAPRRS 115
             Q+            ++ CL    + LL           P   +   +  ++     RS
Sbjct: 104 WEQLLNMFSVLDICSRRKPCLPYGSLALLEKPRFPDPLPDPSDLISFANRIKAGQLTDRS 163

Query: 116 GLSFAVGPTAR---PTDSPVVPQTRQLTR--------TELSQRLKDAIGYTLDLKPSQIP 164
           G   ++    +     D  ++   R++ +         E+S+R++  +G+ +    S IP
Sbjct: 164 GDLVSLSSNHKELQTIDRQILQIDRRIKQYVPTEQDYEEMSRRIEKELGFIVYRATSTIP 223

Query: 165 HEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINA 224
             +AGQGLF+ G+  VG+V+A+YPGIIY P+ +  +PGYP V   N +L+ R DG +I+ 
Sbjct: 224 --DAGQGLFIKGQCTVGSVVALYPGIIYFPSEWEKLPGYPYVSKYNVHLVKRQDGVIIDG 281

Query: 225 QPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNP 284
           +P    +D + +      PE+              W                   ER NP
Sbjct: 282 KP----YDEQTISFQGERPEVA-------------W-------------------ERINP 305

Query: 285 LALAHFANHPAKGMVPNVMICPYDFPLTEKD--MRPYIPNISFGNAEE--------VNMR 334
            AL  +ANHP KG  PN++   +D+PL      M+  +PN    + +E          M 
Sbjct: 306 FALGQYANHPPKGTEPNIIALSFDYPLYRMSPAMKILVPNRYLLDEQEDEAWSSHLSKME 365

Query: 335 RFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEEVLLNYRLSNSKRRPVWY-SPVDEE- 392
               F      G    + V++ + ++  R + +EE+ +NYR +     P WY  P  EE 
Sbjct: 366 DVMQF------GKDRKSRVMRGMVIITLRDVENEELFINYRYNPFATHPEWYWDPSPEET 419

Query: 393 ----EDRRR 397
               EDR++
Sbjct: 420 QWLIEDRQK 428


>gi|348688367|gb|EGZ28181.1| hypothetical protein PHYSODRAFT_309093 [Phytophthora sojae]
          Length = 312

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 103/387 (26%), Positives = 163/387 (42%), Gaps = 108/387 (27%)

Query: 26  RQLQFEADMNRLFLYTSYNRLGRDAEEADAE------EIIDMAGKASLSDQQQQVLDNIH 79
           R+ Q EAD+  + + + +  L  D + A+ E      + +++ G      Q+Q V   + 
Sbjct: 19  RRFQEEADVGVVAVLSCFRSLKLDPQNANKEHEELFDQAVEIVGTKKFEQQKQYVQGEVA 78

Query: 80  SQIKRFCLSMDEILLLP---EKNVPIESSAQSNAAPRRSGLSFAVGPTARPTDSPVVPQT 136
            Q++  C   + +L +P   EKNVP  S +      R + +   +G              
Sbjct: 79  KQVQS-CF--EAVLRMPKDSEKNVPKMSLS---GFERTTEVEEKLG-------------- 118

Query: 137 RQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAY 196
                  +S   +   G+TL+++ S +P  +AG G+ L G A +G+++A+YPGI+Y P +
Sbjct: 119 -------ISDNFQKLFGFTLNVEKSSVP--DAGDGVKLRGSAPIGSLVALYPGIVYLPEH 169

Query: 197 YRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGS 256
           Y+    Y    + NPY   R+D  +I+A+   S                           
Sbjct: 170 YKK-KHYLSELSNNPYARARFDSVIIDAKNESS--------------------------- 201

Query: 257 DQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP----LT 312
                   KP               RNPLA+AH  NHP  G  PNV+   ++FP     T
Sbjct: 202 --------KP---------------RNPLAVAHKINHPPNGTPPNVLGFAFEFPPEEPFT 238

Query: 313 EKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEE-VL 371
            K+    IPN        ++M  FG               ++ +LA +AT  I DEE + 
Sbjct: 239 TKEFDELIPNSFVKEPSRLSM--FGK------------RAIVHSLAFIATADIADEEELF 284

Query: 372 LNYRLSNSKRRPVWYSPVDEEEDRRRW 398
           LNYR +   + P WY+PVD E DR  W
Sbjct: 285 LNYRYNPDHQLPEWYTPVDVESDRAMW 311


>gi|301117376|ref|XP_002906416.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262107765|gb|EEY65817.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 312

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 100/384 (26%), Positives = 164/384 (42%), Gaps = 102/384 (26%)

Query: 26  RQLQFEADMNRLFLYTSYNRLGRDAEEADAE------EIIDMAGKASLSDQQQQVLDNIH 79
           R+ Q EAD+    + + +  L  D + A+ E      + +++ G      Q+Q V + + 
Sbjct: 19  RRFQEEADVGVAAVLSCFRSLKLDPQNANKEHEELFDQAVEIVGSKKFEQQKQYVQNEVA 78

Query: 80  SQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFAVGPTARPTDSPVVPQTRQL 139
            Q++   +  + +L +P+ +          +APR S   F      R T+        +L
Sbjct: 79  KQVQ---ICFEAVLKMPKDS--------EKSAPRMSLSGFE-----RTTEV-----EEKL 117

Query: 140 TRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRY 199
             T+  Q+L    G+TL+++ S +P   A +G+ L G A +G+++A+YPGI+Y P +Y+ 
Sbjct: 118 GITDNFQKL---FGFTLNVEESSVPG--ADEGVKLRGSAPIGSLVALYPGIVYLPEHYKK 172

Query: 200 IPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQF 259
              +  V + N Y   R+D  +I+A+   S                              
Sbjct: 173 -KHHLGVLSNNAYARARFDSVIIDAKNESS------------------------------ 201

Query: 260 WKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP----LTEKD 315
                KP               RNPLA+AH  NHP  G  PNV+   ++FP     + K+
Sbjct: 202 -----KP---------------RNPLAVAHKINHPPSGTPPNVIGFAFEFPPEEPFSTKE 241

Query: 316 MRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEE-VLLNY 374
               IPN        ++M  FG               ++  LA +AT  I DEE + LNY
Sbjct: 242 YDELIPNSFVQEPSRLSM--FGK------------RAIVHGLAFIATADIADEEELFLNY 287

Query: 375 RLSNSKRRPVWYSPVDEEEDRRRW 398
           R +  +  P WY+PVD + DR  W
Sbjct: 288 RYNPDRPLPEWYTPVDVDSDRAMW 311


>gi|66811938|ref|XP_640148.1| hypothetical protein DDB_G0282599 [Dictyostelium discoideum AX4]
 gi|60468149|gb|EAL66159.1| hypothetical protein DDB_G0282599 [Dictyostelium discoideum AX4]
          Length = 378

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 86/259 (33%), Positives = 117/259 (45%), Gaps = 60/259 (23%)

Query: 153 GYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPY 212
           G+ L ++ S I HEEAG G+ + GE   G VIAIYPG  Y       IP    V + N Y
Sbjct: 152 GFRLHVRKSDINHEEAGYGVHIEGECVPGTVIAIYPGDTYDGD---NIPS--NVISDNDY 206

Query: 213 LITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRG 272
           +I+RYDGTVI+    G  W+ R   D L L E             +F+K L         
Sbjct: 207 MISRYDGTVID----GRSWNKRA--DELILKE-------------KFFKKLGTK------ 241

Query: 273 GSGSEMLERRNPLALAHFANHP-----------AKGMVPNVMICPYDFPLT-EKDMRPYI 320
            SG+++L  +NP A+ +F NHP            +   PNV+   Y+F       +  YI
Sbjct: 242 SSGTDLLCYKNPFAIGNFINHPPIITKDGDGDTDQTKEPNVIAYSYNFRSNFPNHLSQYI 301

Query: 321 PNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVL-KTLALVATRAICDEEVLLNYRLSNS 379
           PN +       N                  T VL ++L L+A + I ++EV LNYR +  
Sbjct: 302 PNKTVTEHSIFN-----------------DTSVLRRSLLLIAFKPIKNQEVFLNYRFNPD 344

Query: 380 KRRPVWYSPVDEEEDRRRW 398
              P WY   D EE +RRW
Sbjct: 345 LPYPNWYKQPDLEEAKRRW 363


>gi|328867711|gb|EGG16093.1| hypothetical protein DFA_09765 [Dictyostelium fasciculatum]
          Length = 468

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 78/252 (30%), Positives = 116/252 (46%), Gaps = 53/252 (21%)

Query: 154 YTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYL 213
           ++L +K S I H EAG G+ + G  + G V+AIYPG  Y+ +   +         +N Y+
Sbjct: 247 FSLVIKKSSIDHGEAGYGVHVKGLVSPGTVLAIYPGDTYTASNMDHA-----AILENEYM 301

Query: 214 ITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGG 273
           ++RYDGTVI+ + W    D            I  +   +E G+                 
Sbjct: 302 MSRYDGTVIDGRSWLKKVDM----------SIYRSKYFSETGT---------------SV 336

Query: 274 SGSEMLERRNPLALAHFANHPAKGMV------PNVMICPYDFPLTEKD-MRPYIPNISFG 326
           S  E+L+ RNP A+  F NHP++          NV+   Y+FP T  D ++PYIP+ S  
Sbjct: 337 SEEELLKYRNPFAIGQFINHPSEHKQNPNFPHANVIGIKYNFPQTFPDHLKPYIPHTS-- 394

Query: 327 NAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEEVLLNYRLSNSKRRPVWY 386
              + N  R  + +              K+  L+A R I DEE+LLNYR + +   P WY
Sbjct: 395 -VTDTNFFRDKNIF-------------AKSYVLIANRYIQDEELLLNYRYNPANPYPPWY 440

Query: 387 SPVDEEEDRRRW 398
              D  E +RRW
Sbjct: 441 FQPDILEAKRRW 452


>gi|330806343|ref|XP_003291130.1| hypothetical protein DICPUDRAFT_81815 [Dictyostelium purpureum]
 gi|325078691|gb|EGC32328.1| hypothetical protein DICPUDRAFT_81815 [Dictyostelium purpureum]
          Length = 362

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/247 (29%), Positives = 114/247 (46%), Gaps = 47/247 (19%)

Query: 153 GYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPY 212
           G+ + +K S + HEEAG G+++ GE   G V+A+YPG  Y       IP  P +  +N Y
Sbjct: 146 GFKIHVKSSTVDHEEAGYGVYIEGEVIPGTVVALYPGDTYDG---ENIP--PSIVRENEY 200

Query: 213 LITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRG 272
           +++RYDGT+I+ + W    D       LTL   M    G             KP D    
Sbjct: 201 MMSRYDGTIIDGRSWNKKCDE------LTLKSKMFKEIGTN----------VKPTD---- 240

Query: 273 GSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLTEKD-MRPYIPNISFGNAEEV 331
                +L  RNP A+ +F NHP     PNV+   +++     D +  YIPN +    +  
Sbjct: 241 -----ILCYRNPFAIGNFINHPPADDQPNVIAYAFNYRRGFPDHLVHYIPNRTITENQ-- 293

Query: 332 NMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDE 391
                   +F        ++ + + + L+A + I + E+ LNYR +     P WY+  D 
Sbjct: 294 --------YF------SDNSVIRRGVVLIAFKPIINSELFLNYRFNPDLPYPDWYTQPDL 339

Query: 392 EEDRRRW 398
           EE +RRW
Sbjct: 340 EEAKRRW 346


>gi|298709446|emb|CBJ31352.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 384

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 74/274 (27%), Positives = 116/274 (42%), Gaps = 79/274 (28%)

Query: 141 RTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYI 200
           R  L + L+  +G+   +K S +    AG GLF+ G A  GAV+AIYPG+++ P + R  
Sbjct: 154 RKHLKEMLRSELGFNTVVKDSTL--SGAGLGLFVEGSAPEGAVVAIYPGLVHLPEHLRSQ 211

Query: 201 P-GYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQF 259
                 +  ++ +L+ R DG +++ +  G                ++P+S          
Sbjct: 212 KYAASLLPDEDHFLMGRADGCLVDGRTAG----------------LLPSS---------- 245

Query: 260 WKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKD--- 315
                                   PL  AH ANHP +G   NV+   YDFP L+E D   
Sbjct: 246 ------------------------PLTFAHAANHPPEGQAANVLQVAYDFPALSEDDVGI 281

Query: 316 ---------MRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRA-I 365
                    +RP +PN            R+         G    + +++T+ LVA R  +
Sbjct: 282 AKGGRFPERLRPLVPN------------RYHRAPKALFGGKLDQSAMVETVVLVACRGGL 329

Query: 366 CDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
            D+EV ++YR + + R P WYSPVD E     W+
Sbjct: 330 HDQEVFMDYRFNPNMRHPDWYSPVDPEATAEMWA 363


>gi|303284110|ref|XP_003061346.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457697|gb|EEH54996.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 609

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 64/198 (32%), Positives = 80/198 (40%), Gaps = 66/198 (33%)

Query: 268 DNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPL---------------- 311
           D +     +  +ERR+PLALAHFANHP  G  PNV +   D                   
Sbjct: 410 DRRVAFDAAAAVERRHPLALAHFANHPPGGTSPNVAVASVDVVFEEEEEEEEDAGAAEAG 469

Query: 312 ------------TEKDMRPYIPNISFGN---------AEEVNM---------------RR 335
                        +  +RPY+PN+  G+          EE  +               R 
Sbjct: 470 TAEGPRSSTTDSVDASLRPYLPNVGIGSWGVPTRGWELEEGTLDSEQNKTGGFAGLLRRL 529

Query: 336 FG-------SFWFKWGSGSGSSTP-------VLKTLALVATRAICDEEVLLNYRLSNSKR 381
           FG        F  K    +  S         V+ TLALVA  A+ DEEV LNYRLS   +
Sbjct: 530 FGEDDDDDDDFETKARKKAADSEAARRRNRVVVPTLALVALEALQDEEVFLNYRLSTHVK 589

Query: 382 RPVWYSPVDEEEDRRRWS 399
           RP WY PVD  ED RRW+
Sbjct: 590 RPEWYVPVDPAEDARRWA 607



 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 61/213 (28%), Positives = 98/213 (46%), Gaps = 26/213 (12%)

Query: 12  VKTLA---KSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAE-EADAEEIIDMAGKASL 67
           VKT A    +    R  R+ Q  AD++RL + T+Y    RD + EA   +++  A   S 
Sbjct: 38  VKTFAGRGSTDHLTRGARRAQEHADLHRLHVLTTY--YARDGDGEALLNDVLAGAEGKSY 95

Query: 68  SDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFAVGPTARP 127
              +  V DN+ SQI+  C ++   L  P     +           R+  S         
Sbjct: 96  EQLKASVSDNLTSQIRTICAAVRASLASPRVPPRVGPRGGGGGDRDRASSSS-------- 147

Query: 128 TDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIY 187
                   +R+    E++ RL    G+   ++ S      +G+GLF+ G+ + G ++A+Y
Sbjct: 148 -------SSREAIANEITIRL----GFACAIQRSTA-GAGSGRGLFVRGKVDPGEIVALY 195

Query: 188 PGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGT 220
           PG+ Y      ++PG+P V A NPYLI R+DGT
Sbjct: 196 PGVAYDVRSVTHLPGFPEVAAANPYLIARFDGT 228


>gi|302851678|ref|XP_002957362.1| hypothetical protein VOLCADRAFT_107629 [Volvox carteri f.
           nagariensis]
 gi|300257321|gb|EFJ41571.1| hypothetical protein VOLCADRAFT_107629 [Volvox carteri f.
           nagariensis]
          Length = 523

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 67/123 (54%), Gaps = 31/123 (25%)

Query: 139 LTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYR 198
           L++ ++++ L+  IG+ L+L PS +PH ++G GLFL GEA  GA++AI+PG++Y      
Sbjct: 159 LSQQQVAEALQRRIGFRLELGPSSVPHPDSGTGLFLRGEARPGAIVAIFPGVLYGRTQLA 218

Query: 199 YIPGYPR-------------------------------VDAQNPYLITRYDGTVINAQPW 227
           ++P +P+                               VD  NPYL  RYD ++++++PW
Sbjct: 219 HMPNFPKARPERFLVLVMKLVVKLVLALDAGLRRSGVAVDMDNPYLSCRYDQSIVDSKPW 278

Query: 228 GSG 230
           G G
Sbjct: 279 GRG 281



 Score = 43.1 bits (100), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 32/48 (66%), Gaps = 1/48 (2%)

Query: 353 VLKTLALVATRAICD-EEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           V++ L LV+T A+ D +E+L NYR++    RP WY   D E ++RRW+
Sbjct: 439 VVRLLVLVSTTAVRDGDELLQNYRMNPHVARPDWYVVHDAEAEQRRWA 486


>gi|156378496|ref|XP_001631178.1| predicted protein [Nematostella vectensis]
 gi|156218214|gb|EDO39115.1| predicted protein [Nematostella vectensis]
          Length = 375

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 53/159 (33%), Positives = 75/159 (47%), Gaps = 42/159 (26%)

Query: 154 YTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYL 213
           ++L +KPS I H EAG+G+FL G A  G V+  +PG++Y P Y R +P YP +D  N YL
Sbjct: 96  FSLSIKPSTIQHPEAGEGVFLDGSAVPGTVVMFHPGLVYLPPYIRKMPNYPWIDKDNTYL 155

Query: 214 ITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGG 273
           ++RYDG +++ +    G DT                                  D +R  
Sbjct: 156 MSRYDGCIVDPK----GLDT----------------------------------DIQR-- 175

Query: 274 SGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLT 312
           S   M+   +PLA  H  NH   G  PN++   YDF  T
Sbjct: 176 SDGTMVA--HPLAAGHKVNHAPPGTKPNILPFSYDFAKT 212



 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 34/47 (72%)

Query: 353 VLKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           VL+++  V T  + ++E+ +NYRL+ + + P WY+PVD +ED RRW+
Sbjct: 328 VLRSVVYVTTNHVHNQELFVNYRLNPAIKLPEWYTPVDRKEDLRRWN 374


>gi|413937912|gb|AFW72463.1| hypothetical protein ZEAMMB73_372625, partial [Zea mays]
          Length = 56

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 35/45 (77%), Positives = 39/45 (86%)

Query: 1  MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNR 45
          MAFLF KFQEA++T+AKSP FARDPR LQFEAD+NRLFLYT   R
Sbjct: 1  MAFLFNKFQEAIRTIAKSPMFARDPRHLQFEADVNRLFLYTRKCR 45


>gi|325186225|emb|CCA20726.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 133

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 44/121 (36%), Positives = 63/121 (52%), Gaps = 18/121 (14%)

Query: 282 RNPLALAHFANHPAKGMVPNVMICPYDFPL----TEKDMRPYIPNISFGNAEEVNMRRFG 337
           RNPL + H  NHP    +PNV+  PYDFP+    T+++  P IPN        ++M  FG
Sbjct: 26  RNPLGVGHMINHPPANELPNVIAFPYDFPMREPFTKEEHIPLIPNSFIDQPSRLSM--FG 83

Query: 338 SFWFKWGSGSGSSTPVLKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRR 397
                          ++ +LA ++ R I DEE+ LNYR + +   P WY+PVD E D+  
Sbjct: 84  K------------RILIHSLAFISLREIEDEELFLNYRYNPNLPYPEWYTPVDLESDKLL 131

Query: 398 W 398
           W
Sbjct: 132 W 132


>gi|440798065|gb|ELR19136.1| hypothetical protein ACA1_335990 [Acanthamoeba castellanii str.
           Neff]
          Length = 317

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 66/225 (29%), Positives = 95/225 (42%), Gaps = 46/225 (20%)

Query: 177 EANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWG-SGWDTRE 235
           E  V     I PG ++ P    Y      +   N Y+  RYDG V++ + W    +  R 
Sbjct: 127 EQGVHVKGHILPGTVHFP----YTLNENAIKGNN-YMYGRYDGVVVDGRDWERKAFAARR 181

Query: 236 LWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPA 295
           L   L   E +  S   EK                       +L  RNP  +A + NHP 
Sbjct: 182 LQTRL---ERVGQSIKDEKT----------------------LLRFRNPFGIAQYINHPP 216

Query: 296 KGMVPNVMICPYDFPLTEKD-MRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVL 354
            G  PNVM   Y+FP T  D ++PYIP+    + +E N+      +F     +  +   +
Sbjct: 217 GGKQPNVMCVGYEFPNTFPDELKPYIPH---EHVKEPNI------FF-----NTMTDCYI 262

Query: 355 KTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
           +++  +ATR I DEE+ LNYR +     P WY   D  E  RRW 
Sbjct: 263 RSIMCIATRHITDEELFLNYRYNPINPYPDWYVQPDVAEAERRWK 307


>gi|388521337|gb|AFK48730.1| unknown [Lotus japonicus]
          Length = 77

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 44/73 (60%), Positives = 51/73 (69%), Gaps = 3/73 (4%)

Query: 61  MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEK--NVPIESSAQSNAAPRRSGLS 118
           MA KAS  DQQ+QV +N+HSQI+ FC  MDEILL  E+  N P+  S Q+   PRRSGLS
Sbjct: 1   MASKASFDDQQKQVQENVHSQIRTFCSFMDEILLPNEEPLNDPLGLSQQATVFPRRSGLS 60

Query: 119 FAVGPT-ARPTDS 130
           FAVG T A P DS
Sbjct: 61  FAVGRTDAPPHDS 73


>gi|440802939|gb|ELR23854.1| hypothetical protein ACA1_123630 [Acanthamoeba castellanii str.
           Neff]
          Length = 326

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 67/232 (28%), Positives = 95/232 (40%), Gaps = 54/232 (23%)

Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVD 207
           ++   G+ L  + S IP  +AG GLFL G  + G  + IYPG +Y P     +P      
Sbjct: 131 MRTQFGFALVKRRSTIP--DAGHGLFLEGAVSEGTCVCIYPGAVYQPGDPLLLP-----S 183

Query: 208 AQNPYLITRYDGTVINAQPWG-SGWDTRELWDGLTLPEIMPNSKGAEKG-SDQFWKLLSK 265
            +N YLI R DG+ ++ +P G S +  R +++            G   G  D  W     
Sbjct: 184 LRNDYLIRRSDGSSVDGRPRGLSRYFYRSIYEKF----------GPFAGLCDATW----- 228

Query: 266 PMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDF-PLTEKDMRPYIPNIS 324
            +D K GG+        NP    H  NH     VPNVM   Y F P T       IPN+ 
Sbjct: 229 -LDIKEGGTNEH-----NPFGWGHLINHSD---VPNVMYFEYQFPPDTPAHYLSSIPNVF 279

Query: 325 FGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEEVLLNYRL 376
           + + +                        +KT+ L+A R I DEE+  NY  
Sbjct: 280 YSSKDPAC--------------------AVKTIVLMAMRRIRDEELFSNYNF 311


>gi|440797554|gb|ELR18638.1| hypothetical protein ACA1_298810 [Acanthamoeba castellanii str.
           Neff]
          Length = 317

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 67/248 (27%), Positives = 109/248 (43%), Gaps = 51/248 (20%)

Query: 151 AIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQN 210
           + G+TL  +PS I    AG+G+F+ G+A+ G V+A YPG +Y+ +    +  YP +   N
Sbjct: 100 SCGFTLAKRPSSI--SGAGEGVFIQGKASRGTVVAFYPGCVYAESDLPLV--YPHIYTDN 155

Query: 211 PYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNK 270
            YL  R D  +I+A                       +SKG+ + +  F    S+    +
Sbjct: 156 DYLAFRSDKMLIDA-----------------------SSKGSSRQA--FLAAWSRDHPRQ 190

Query: 271 RGGSGSEMLERR---NPLALAHFANHPAKGMVPNVMICPYDFPLTEKD----MRPYIPNI 323
           +  SG ++       N  A+ H  NHP     PNV+  P+DF   + D    +R +IPN+
Sbjct: 191 QPPSGDKIDFAEVFANSYAVGHLVNHPPADRRPNVVSLPFDFLAADLDAHDHLRRHIPNV 250

Query: 324 SFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICD-EEVLLNYRLS-NSKR 381
            F     +  R  G         +G+   V+  + +VA   + D  E+L++Y        
Sbjct: 251 YF-----LPRRHEGD--------AGADGVVMSGMVIVAADDLEDGAELLMDYHYGMRGAD 297

Query: 382 RPVWYSPV 389
            P WY P 
Sbjct: 298 VPSWYRPA 305


>gi|403368929|gb|EJY84299.1| hypothetical protein OXYTRI_17960 [Oxytricha trifallax]
          Length = 363

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 54/183 (29%), Positives = 82/183 (44%), Gaps = 36/183 (19%)

Query: 141 RTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFL-CGEAN----VGAVIAIYPGIIYSPA 195
           + +L ++LK  +G+ L +K S I H EAG+G+FL C         G ++ I+PG++  P 
Sbjct: 62  KEQLLKQLKYRVGHVLRIKDSSIDHHEAGRGVFLECPRRQPIVLPGTLLGIFPGVVCDPG 121

Query: 196 YYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKG 255
               +P  P+     PYL  R+DG  I+ +                LP  MP+      G
Sbjct: 122 C--PMPPTPK-RGLRPYL-KRFDGYWIDYEK--------------ELPYPMPSP-----G 158

Query: 256 SD--QFWKLLSKPMDNKRGGSGSEML----ERRNPLALAHFANHPAKGMVPNVMICPYDF 309
           S+   F +  S  +D  + G     +    E  NP AL H  NHP   +  NV +  +D 
Sbjct: 159 SNLIDFAENFSSQID--KSGMHKSFIEVPPEYMNPFALGHLINHPPPDIPANVQLIDFDL 216

Query: 310 PLT 312
           P T
Sbjct: 217 PYT 219


>gi|390360997|ref|XP_788396.2| PREDICTED: uncharacterized protein C5orf35-like [Strongylocentrotus
           purpuratus]
          Length = 297

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 65/235 (27%), Positives = 97/235 (41%), Gaps = 59/235 (25%)

Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVI-AIYPGIIYSPAYYRYIPGYPRV 206
           +   +GY +  +PSQI  E  G G+F+        V+ A+YPG IY        PG P +
Sbjct: 109 MHQTLGYAIGRRPSQI--EGGGVGVFVTKGVVPKGVVVAMYPGTIY-------YPGEPIL 159

Query: 207 --DAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLS 264
                NP++    DG  I+        + + L   +       +  G    SD  W L S
Sbjct: 160 LQSIANPFVFRCLDGIHIDG-------NDKRLSKMIYRSCARRDRLGPYMFSDLSW-LTS 211

Query: 265 KPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMI----CPYDFPLTEKDMRPYI 320
            P+               NP+A+  + N+  KG   NV       P DFP+    +R YI
Sbjct: 212 CPL---------------NPMAVGQYVNNQHKGAPANVCYQEFDVPLDFPIK---LRKYI 253

Query: 321 PNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAI-CDEEVLLNY 374
           PN  FG   +V                G++   ++T+ALV+ R I C EE+L +Y
Sbjct: 254 PNAFFGTLPDV----------------GNAGRNMRTVALVSVREIQCGEELLSSY 292


>gi|47227924|emb|CAF97553.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 279

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 48/180 (26%), Positives = 78/180 (43%), Gaps = 38/180 (21%)

Query: 151 AIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRV 206
            +G+ +D KPS +P   AG G+F+  G    GA++A+YPG +Y    P +++ I      
Sbjct: 100 TLGFCIDRKPSSLPF--AGTGVFVTKGFVPKGAIVALYPGTVYQAYEPIFFQSI------ 151

Query: 207 DAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKP 266
             +NP++    DG +I+    G          G        +  G    SD  W L S P
Sbjct: 152 --RNPFVFRCIDGILIDGNDKGISRMVFRSCSGR-------DRVGFFMMSDISW-LTSDP 201

Query: 267 MDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
           +               NPLA+  + N+ +     NV    YD P     ++R ++PN++F
Sbjct: 202 L---------------NPLAVGQYVNNCSNQRPANVCYQEYDVPDAFPTELRQFLPNVNF 246


>gi|190344028|gb|ACE75808.1| hypothetical protein [Sorex araneus]
          Length = 299

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 49/192 (25%), Positives = 85/192 (44%), Gaps = 38/192 (19%)

Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGY 203
           L   +G+++ L  S +    AG+G+F+  G    GAV+++YPG +Y    P +++ I   
Sbjct: 116 LYKTLGFSVALATSSLI--SAGKGVFVTKGLIPKGAVVSMYPGTVYQKYEPIFFQSI--- 170

Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
                 NP++    DG +I+    G         +G        +  G  K SD  W L 
Sbjct: 171 -----GNPFIFRCLDGVLIDGNDKGISKVVYRSCNG-------RDQLGPFKMSDSTW-LT 217

Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPN 322
           S               E RNPLAL  + N+ +     NV    +D P +   +++ Y+PN
Sbjct: 218 S---------------EIRNPLALGQYVNNCSNDRAANVCYQEFDVPAVFPLELKQYLPN 262

Query: 323 ISFGNAEEVNMR 334
           I++ + ++  +R
Sbjct: 263 IAYSSDKQSLLR 274


>gi|410924740|ref|XP_003975839.1| PREDICTED: SET domain-containing protein 9-like [Takifugu rubripes]
          Length = 283

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 82/189 (43%), Gaps = 38/189 (20%)

Query: 151 AIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQ 209
            +G+ +D KPS +    AG G+F+  G    GA++A+YPG +Y P    Y P + +   +
Sbjct: 103 TLGFCIDRKPSSLAF--AGTGVFVTKGLVPKGAIVAMYPGTVYQP----YEPIFFQ-SIR 155

Query: 210 NPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDN 269
           NP++    DG +I+    G          G        +  G    SD  W L + P+  
Sbjct: 156 NPFVFRCIDGILIDGNDKGISKMVFRSCSG-------RDRVGFFMVSDVSW-LTNDPL-- 205

Query: 270 KRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYD----FPLTEKDMRPYIPNISF 325
                        NPLA+  + N+ +     NV    YD    FP+   ++R Y+PN++F
Sbjct: 206 -------------NPLAVGQYVNNCSNERPANVCYQEYDVPDKFPI---ELRQYLPNVNF 249

Query: 326 GNAEEVNMR 334
               +  +R
Sbjct: 250 RQEVQAPLR 258


>gi|321459697|gb|EFX70748.1| hypothetical protein DAPPUDRAFT_327850 [Daphnia pulex]
          Length = 265

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 56/236 (23%), Positives = 94/236 (39%), Gaps = 52/236 (22%)

Query: 143 ELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIP 201
           E  + ++  +GYT+ L+ S IP  EAG G+F+  G    G+++A+YPG +     YR + 
Sbjct: 72  EALKVMERVLGYTIQLRKSNIP--EAGTGVFVSKGTVTAGSIVALYPGTV-----YRVME 124

Query: 202 GYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWK 261
                   NP++    DG  I+ +  G    +R+++                   D  W 
Sbjct: 125 PLLLQSIVNPFIFRCIDGLHIDGKDRGI---SRKIFKSCVY-------------RDLRWP 168

Query: 262 LLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLTE--KDMRPY 319
                +    G       +  NPL    + N+       NV     D P+     ++R Y
Sbjct: 169 FFPADLTWLTG-------DLINPLNCGQYVNNETPEYPCNVAYQECDLPIDRFPIELRCY 221

Query: 320 IPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAI-CDEEVLLNY 374
           +PNI + ++E                  G+ T  L+ +ALVA + I  D E+   Y
Sbjct: 222 LPNIYYSSSE------------------GNQTKFLRVVALVAVKDIGIDTELFSTY 259


>gi|301621891|ref|XP_002940280.1| PREDICTED: uncharacterized protein C5orf35 [Xenopus (Silurana)
           tropicalis]
          Length = 297

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 59/238 (24%), Positives = 99/238 (41%), Gaps = 60/238 (25%)

Query: 143 ELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYR 198
           E  + L   +G+ +   PS I  + AG G+F+  G    GAV+++YPG +Y    P +++
Sbjct: 109 EAREILHSTLGFCIARAPSSI--DSAGTGVFVSKGHVQKGAVVSMYPGTVYQQYEPIFFQ 166

Query: 199 YIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQ 258
            I        +NP++    DG +I+    G    ++ L+   +  + +    G  K  D 
Sbjct: 167 SI--------RNPFIFRCIDGILIDGNDKGI---SQYLYRSCSRRDQL----GPLKLCDA 211

Query: 259 FWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMR 317
            W L + P               +NP A+  + N+ +     NV    +D P     ++R
Sbjct: 212 TW-LSAFP---------------QNPWAVGQYVNNCSNEREANVCYQEFDVPEYFPIELR 255

Query: 318 PYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAI-CDEEVLLNY 374
            Y+PNI++                     S  S   L+ + LVA R I C EE+  NY
Sbjct: 256 QYLPNINY---------------------SPDSQGPLRCVVLVALRDIDCGEELFSNY 292


>gi|219111099|ref|XP_002177301.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411836|gb|EEC51764.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 470

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 61/271 (22%), Positives = 107/271 (39%), Gaps = 79/271 (29%)

Query: 135 QTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIY-- 192
           Q+  + R+ L   +++ +G+++    S IP   AG+G F+ G A  G+++A  PG ++  
Sbjct: 249 QSLNMCRSAL---IREELGFSVVSFRSLIP--GAGRGAFVDGNALAGSILAFQPGDVWPK 303

Query: 193 ------SPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIM 246
                 +P    +  G       + ++  R+D  V++++       TRE           
Sbjct: 304 EHLLTSAPDVIEHFAG-----EDDCHVSLRFDDYVVDSRQSPVTVLTRE----------- 347

Query: 247 PNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICP 306
                                       GS      NP A+ H ANHP     PNV    
Sbjct: 348 ----------------------------GS-----MNPWAIGHMANHPPSKTQPNVQSTM 374

Query: 307 YDFP----LTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVAT 362
            +F     + E D+  Y+PN ++        R F            +   V+  L L++ 
Sbjct: 375 LNFTERMNMAENDLFRYLPN-TYARLPSWRSRVF-----------EAEEVVMHGLCLLSK 422

Query: 363 RAICDEEVLLNYRLSNSKRRPVWYSPVDEEE 393
           R + +EE++ +YRL  S+  P WYS V  ++
Sbjct: 423 RDVSNEELVYDYRL-QSETTPDWYSVVRHDD 452


>gi|397566635|gb|EJK45127.1| hypothetical protein THAOC_36272 [Thalassiosira oceanica]
          Length = 472

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 58/138 (42%), Gaps = 31/138 (22%)

Query: 282 RNPLALAHFANHPAKGMV---------PNVMICPYDFPLT-----EKDMRPYIPNISFGN 327
           +N  A+AH  NHP              PN + C  +F  T     +  +R Y+PN     
Sbjct: 339 KNLWAIAHIVNHPPLRKTGERDDVFSGPNCVTCSINFTETMFRDQQDKLRDYLPNEYEQP 398

Query: 328 AEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEEVLLNYRLS------NSKR 381
            +E         W +  S       V+  L L A R + DEE+  +YRLS      N ++
Sbjct: 399 PQE---------WTR--SMFDDEKVVMHGLGLFALRDLEDEELFFDYRLSPDDRQPNGQQ 447

Query: 382 RPVWYSPVDEEEDRRRWS 399
            P WY  +DEE  + RW+
Sbjct: 448 YPHWYRVIDEEAVKNRWN 465


>gi|344272547|ref|XP_003408093.1| PREDICTED: uncharacterized protein C5orf35-like [Loxodonta
           africana]
          Length = 299

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/172 (26%), Positives = 74/172 (43%), Gaps = 36/172 (20%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 134 AGKGVFVTKGFVPKGAVVSMYPGTVYQKYEPVFFQSI--------GNPFIFRCLDGVLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W               SEM   RN
Sbjct: 186 GNDKGISKVVYRSCNG-------RDQLGPLKMSDSTW-------------LTSEM---RN 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFPLT-EKDMRPYIPNISFGNAEEVNMR 334
           PLA+  + N+ +     NV    +D P     ++R Y+PNI++   ++  +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAAFPVELRQYLPNIAYSYDKQSPLR 274


>gi|307110916|gb|EFN59151.1| hypothetical protein CHLNCDRAFT_49994 [Chlorella variabilis]
          Length = 988

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/230 (25%), Positives = 91/230 (39%), Gaps = 31/230 (13%)

Query: 144 LSQRLKDAIGYTLDLKPSQIPHEEAGQGLFL-CGEANVGAVIAIYPGIIYSPAYYRYI-- 200
           L++ L D IG+ L  +PS I H +AG G+FL  G    G ++ +YPG+++    +  +  
Sbjct: 596 LTEALADTIGFELQRRPSGIQHPDAGDGVFLTAGGCPAGGILTLYPGLVHDMGDFVLLLG 655

Query: 201 ---------PGYPRVDAQNPYLI---TRYDGTVINAQPWGSG---WDTRELWDGLTLPE- 244
                    P  P     N YL+         +++ QP G     +           PE 
Sbjct: 656 VSAPDDACAPAPPAWTLDNHYLLQLRCLNAACLVDGQPGGLSAQRFQAAAASWAAGQPEA 715

Query: 245 IMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANH-PAKGMVPNVM 303
            + N+     G  Q  +L         G  G E L R+   AL H  NH PA        
Sbjct: 716 ALVNASWLALGGQQRAEL--------DGRLGCEALLRQA--ALGHMLNHGPAAAANAAFD 765

Query: 304 ICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPV 353
           +CP    L     R Y+P++S G   E  +R       +   G+ + +PV
Sbjct: 766 MCPLPAALPPSLQR-YVPSLSVGGRGEEEVRWVPLVTARTALGATAGSPV 814


>gi|292617940|ref|XP_001332805.2| PREDICTED: uncharacterized protein C5orf35-like [Danio rerio]
          Length = 288

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 58/232 (25%), Positives = 97/232 (41%), Gaps = 62/232 (26%)

Query: 150 DAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDA 208
           + +G++++ K S +  E AG G+F+  G    G+++A+YPG IY P    +         
Sbjct: 107 ETLGFSVERKISSL--ERAGTGVFVMRGRVPAGSIVAMYPGTIYQPDEPIFFQS-----V 159

Query: 209 QNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMD 268
           +NP++    DG +I+              +  ++ +I+  S     G D+F  L      
Sbjct: 160 RNPFVFRCIDGILIDG-------------NDKSISKIVYRSCS---GRDRFGPL------ 197

Query: 269 NKRGGSGSEMLER-RNPLALAHFANHPAKGMVPNVMI----CPYDFPLTEKDMRPYIPNI 323
             R   GS +     NPLA+  + N+ +     NV       P  FPL   ++R Y+PN+
Sbjct: 198 --RLSDGSWLTPHPENPLAVGQYVNNCSNEKSANVCFQELDVPDGFPL---ELRQYLPNV 252

Query: 324 SFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICD-EEVLLNY 374
           ++                        S   L+ + LVA R I D EE+  NY
Sbjct: 253 NY---------------------RADSKRPLRCVVLVALRDIDDGEELFSNY 283


>gi|183398083|gb|ACC62499.1| hypothetical protein [Rhinolophus ferrumequinum]
          Length = 299

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 44/172 (25%), Positives = 76/172 (44%), Gaps = 36/172 (20%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYHTYEPIFFQSI--------GNPFIFRCLDGVLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W L S               E +N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIQN 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++ N ++  +R
Sbjct: 223 PLAIGQYVNNCSNDRPANVCYQEFDVPAVFPIELKQYLPNIAYSNDKQSTLR 274


>gi|426246475|ref|XP_004017019.1| PREDICTED: SET domain-containing protein 9 [Ovis aries]
          Length = 299

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 47/192 (24%), Positives = 82/192 (42%), Gaps = 38/192 (19%)

Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGY 203
           L   +G+++   PS +    AG+G+F+  G    GAV+++YPG +Y    P +++ I   
Sbjct: 116 LYKTLGFSVARAPSSLI--SAGKGVFVTKGLVRKGAVVSMYPGTVYQKYEPIFFQSI--- 170

Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
                 NP++    DG +I+    G         +G        +  G  K SD  W L 
Sbjct: 171 -----GNPFIFRCLDGVLIDGNDKGISKVVYRSCNG-------RDQLGPLKMSDSTW-LT 217

Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLT-EKDMRPYIPN 322
           S               E  NPLA+  + N+ +     NV    +D P     +++ Y+PN
Sbjct: 218 S---------------EIHNPLAIGQYVNNCSNDRAANVCYQEFDVPAAFPIELKQYLPN 262

Query: 323 ISFGNAEEVNMR 334
           I++   ++  +R
Sbjct: 263 IAYSYDKQSPLR 274


>gi|61841357|ref|XP_592703.1| PREDICTED: uncharacterized protein C5orf35 isoform 1 [Bos taurus]
 gi|297487643|ref|XP_002696366.1| PREDICTED: uncharacterized protein C5orf35 [Bos taurus]
 gi|296475835|tpg|DAA17950.1| TPA: hypothetical protein BOS_19749 [Bos taurus]
 gi|440904685|gb|ELR55160.1| hypothetical protein M91_11582 [Bos grunniens mutus]
          Length = 299

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 47/192 (24%), Positives = 82/192 (42%), Gaps = 38/192 (19%)

Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGY 203
           L   +G+++   PS +    AG+G+F+  G    GAV+++YPG +Y    P +++ I   
Sbjct: 116 LYKTLGFSVARAPSSLI--SAGKGVFVTKGLVRKGAVVSMYPGTVYQKYEPIFFQSI--- 170

Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
                 NP++    DG +I+    G          G        +  G  K SD  W L 
Sbjct: 171 -----GNPFIFRCLDGVLIDGNDKGISKVVYRSCSG-------RDQLGPLKMSDSTW-LT 217

Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPN 322
           S               E  NPLA+  + N+ +     NV    +D P +   +++ Y+PN
Sbjct: 218 S---------------EIHNPLAIGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPN 262

Query: 323 ISFGNAEEVNMR 334
           I++   ++  +R
Sbjct: 263 IAYSYDKQSPLR 274


>gi|255076193|ref|XP_002501771.1| predicted protein [Micromonas sp. RCC299]
 gi|226517035|gb|ACO63029.1| predicted protein [Micromonas sp. RCC299]
          Length = 298

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 65/248 (26%), Positives = 94/248 (37%), Gaps = 61/248 (24%)

Query: 156 LDLKPSQIPHEEAGQGLFLCGEANV---GAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPY 212
           ++++ SQ+  + AG GLFL  +A+    G +IAIY G+ Y+P                P 
Sbjct: 63  VEVRRSQVRSDSAGSGLFLRSDAHAAKPGDLIAIYAGV-YTPPAPDVTHAADGTSILIPL 121

Query: 213 LITRYDGT-VINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKR 271
              R +G  VIN Q  G GW              +     A KG    W           
Sbjct: 122 PALREEGAYVINLQ--GGGWIC-------GAESALQAKNNAAKGLCAGW----------- 161

Query: 272 GGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLTEKD-MRPYIPNISFGNAEE 330
                         A+A  ANHP +G +PNV     D+     D +R +I  I+      
Sbjct: 162 --------------AVAALANHPPRGSIPNVEAIAVDWDDAISDAVRKHINPITT----- 202

Query: 331 VNMRRFGSFWFKWGSGSGSSTPVLKT-----LALVATRAIC-DEEVLLNYRLSNSKRRPV 384
                       W    G S  V +T     +  +A+R I   +E+L NYRL+     P 
Sbjct: 203 ----------LPWYIDDGVSVFVPQTVRSRGIVFLASRTINPGDEILFNYRLNPQGNLPR 252

Query: 385 WYSPVDEE 392
           WY+ V +E
Sbjct: 253 WYAAVPDE 260


>gi|432913661|ref|XP_004079001.1| PREDICTED: SET domain-containing protein 9-like [Oryzias latipes]
          Length = 268

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 48/180 (26%), Positives = 78/180 (43%), Gaps = 38/180 (21%)

Query: 151 AIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQ 209
            +G+ +D +PS +P   AG G+F+  G    G V+A+YPG +Y P    Y P   +   +
Sbjct: 88  TLGFCIDRRPSTLP--LAGTGVFVVNGVIPKGTVVAMYPGTVYQP----YDPILFQ-SIR 140

Query: 210 NPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDN 269
           NP++    DG +I+    G      +   G        +  G    SD  W L + P   
Sbjct: 141 NPFIFRCIDGVLIDGSDKGMSKMVFKSCSG-------RDRLGPLVTSDTSW-LTASP--- 189

Query: 270 KRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYD----FPLTEKDMRPYIPNISF 325
                       RNPLA+  + N+ +     NV    YD    FP+   ++  ++PN++F
Sbjct: 190 ------------RNPLAVGQYVNNCSNTKPANVCYQEYDVPDRFPI---ELWQFLPNVNF 234


>gi|348501071|ref|XP_003438094.1| PREDICTED: uncharacterized protein C5orf35-like [Oreochromis
           niloticus]
          Length = 292

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 47/189 (24%), Positives = 80/189 (42%), Gaps = 38/189 (20%)

Query: 151 AIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQ 209
            +G+ ++ KPS +P   AG G+F+  G    GA +A+YPG +Y P    Y P   +   +
Sbjct: 112 TLGFCIERKPSTLPF--AGSGVFVTKGFVPKGATVAMYPGTVYQP----YEPILLQ-SIR 164

Query: 210 NPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDN 269
           NP++    DG +++    G      +   G        +  G    SD  W L   P   
Sbjct: 165 NPFVFRCIDGILVDGNDKGISRMVFKSCSGR-------DRIGPFLTSDTSW-LTDTP--- 213

Query: 270 KRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYD----FPLTEKDMRPYIPNISF 325
                       +NPLA+  + N+ +     NV    YD    FP+   ++  Y+PN+++
Sbjct: 214 ------------QNPLAVGQYVNNCSDERPANVCYQEYDVPDKFPI---ELHQYLPNVNY 258

Query: 326 GNAEEVNMR 334
               +  +R
Sbjct: 259 SQGTQRRLR 267


>gi|363744171|ref|XP_001233556.2| PREDICTED: uncharacterized protein C5orf35 isoform 1 [Gallus
           gallus]
          Length = 296

 Score = 47.4 bits (111), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 52/208 (25%), Positives = 93/208 (44%), Gaps = 40/208 (19%)

Query: 134 PQTRQLTRTELS--QRLKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGI 190
           P  R   R  LS  + L   +G++L  + S +    AG G+F+  G A  GAV+A+YPG 
Sbjct: 97  PPGRHSQREPLSPEEVLFRTLGFSLARERSSL--LSAGTGVFVSRGSAPQGAVVAMYPGT 154

Query: 191 IY---SPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMP 247
           +Y    P +++ +         NP++    DG +I+    G    +R ++   +  + + 
Sbjct: 155 VYRKHEPIFFQSL--------GNPFIFRCIDGILIDGNDKGL---SRSVYRSCSWRDQL- 202

Query: 248 NSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPY 307
              G  + SD  W L + P               +NPLA+  + N+ +     NV    +
Sbjct: 203 ---GPFQMSDVSW-LTAAP---------------QNPLAVGQYVNNCSYEKAANVCYQEF 243

Query: 308 DFPLT-EKDMRPYIPNISFGNAEEVNMR 334
           D P     +++ Y+PNI + +  E ++R
Sbjct: 244 DVPRHFPVELKQYLPNIVYSHEIESHLR 271


>gi|311273761|ref|XP_003134022.1| PREDICTED: uncharacterized protein C5orf35-like [Sus scrofa]
          Length = 299

 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 50/207 (24%), Positives = 88/207 (42%), Gaps = 40/207 (19%)

Query: 135 QTRQLTRTELS--QRLKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGII 191
           + R   RT L   + L   +G+++   PS +    AG+G+F+  G     AV+++YPG +
Sbjct: 101 EYRHQQRTTLKPEEILYKTLGFSVARAPSSLI--SAGKGVFVTKGLVPKDAVVSMYPGTV 158

Query: 192 YS---PAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPN 248
           Y    P +++ I         NP++    DG +I+    G         +G        +
Sbjct: 159 YQKYEPVFFQSI--------GNPFIFRCLDGVLIDGNDKGISKVVYRSCNG-------RD 203

Query: 249 SKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYD 308
             G  K SD  W L S               E  NPLA+  + N+ +     NV    +D
Sbjct: 204 RLGPLKMSDSTW-LTS---------------EIHNPLAVGQYVNNCSNDKAANVCYQEFD 247

Query: 309 FP-LTEKDMRPYIPNISFGNAEEVNMR 334
            P +   +++ Y+PNI++   ++  +R
Sbjct: 248 VPAVFPIELKQYLPNIAYSYDKQSPLR 274


>gi|390459961|ref|XP_003732396.1| PREDICTED: uncharacterized protein C5orf35 homolog isoform 2
           [Callithrix jacchus]
          Length = 273

 Score = 46.6 bits (109), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 78/183 (42%), Gaps = 38/183 (20%)

Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGY 203
           L   +G+++    S +    AG+G+F+  G    GAV+++YPG +Y    P +++ I   
Sbjct: 90  LYKTLGFSVAQATSSL--NSAGKGVFVTKGLVPKGAVVSMYPGTVYQNYEPIFFQSIG-- 145

Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
                 NP++    DG +I+    G         +G        +  G  K SD  W L 
Sbjct: 146 ------NPFIFRCLDGVLIDGNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LT 191

Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPN 322
           S               E  NPLA+  + N+ +     NV    +D P +   +++ Y+PN
Sbjct: 192 S---------------EIHNPLAMGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPN 236

Query: 323 ISF 325
           I++
Sbjct: 237 IAY 239


>gi|169409540|gb|ACA57888.1| hypothetical protein [Callicebus moloch]
          Length = 299

 Score = 46.2 bits (108), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 46/183 (25%), Positives = 78/183 (42%), Gaps = 38/183 (20%)

Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGY 203
           L   +GY++    S +    AG+G+F+  G    GAV+++YPG +Y    P +++ I   
Sbjct: 116 LYKTLGYSVAQATSSLI--SAGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--- 170

Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
                 NP++    DG +I+    G         +G        +  G  K SD  W L 
Sbjct: 171 -----GNPFIFRCLDGVLIDGNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LT 217

Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPN 322
           S               E  NPLA+  + N+ +     NV    +D P +   +++ Y+PN
Sbjct: 218 S---------------EIHNPLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPN 262

Query: 323 ISF 325
           I++
Sbjct: 263 IAY 265


>gi|296194560|ref|XP_002745008.1| PREDICTED: uncharacterized protein C5orf35 homolog isoform 1
           [Callithrix jacchus]
 gi|166064956|gb|ABY79120.1| hypothetical protein [Callithrix jacchus]
          Length = 299

 Score = 45.8 bits (107), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 78/183 (42%), Gaps = 38/183 (20%)

Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGY 203
           L   +G+++    S +    AG+G+F+  G    GAV+++YPG +Y    P +++ I   
Sbjct: 116 LYKTLGFSVAQATSSL--NSAGKGVFVTKGLVPKGAVVSMYPGTVYQNYEPIFFQSI--- 170

Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
                 NP++    DG +I+    G         +G        +  G  K SD  W L 
Sbjct: 171 -----GNPFIFRCLDGVLIDGNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LT 217

Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPN 322
           S               E  NPLA+  + N+ +     NV    +D P +   +++ Y+PN
Sbjct: 218 S---------------EIHNPLAMGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPN 262

Query: 323 ISF 325
           I++
Sbjct: 263 IAY 265


>gi|395510329|ref|XP_003759430.1| PREDICTED: SET domain-containing protein 9 [Sarcophilus harrisii]
          Length = 270

 Score = 45.8 bits (107), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 50/204 (24%), Positives = 86/204 (42%), Gaps = 46/204 (22%)

Query: 135 QTRQLTRTELSQR--------LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIA 185
           Q  +L R  + Q+        L + +G+++  + S +    AG+G+F+  G    GAV++
Sbjct: 66  QRTELPRCNIQQQNIYKPGEVLFNTLGFSISRETSSLI--SAGKGVFVTKGFVPKGAVVS 123

Query: 186 IYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTL 242
           +YPG +Y    P +++ I         NP++    DG +I+    G         +G   
Sbjct: 124 MYPGTVYQKHEPIFFQSI--------GNPFIFRCLDGMLIDGNDKGISKVVYRSCNG--- 172

Query: 243 PEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNV 302
                +  G  K SD  W L S               E +NPLA+  + N+ +     NV
Sbjct: 173 ----RDQLGPFKMSDSTW-LTS---------------EIQNPLAVGQYVNNCSNDKEANV 212

Query: 303 MICPYDFP-LTEKDMRPYIPNISF 325
               +D P L   + + Y+PNI +
Sbjct: 213 CYQEFDVPELFPLEFKQYLPNIMY 236


>gi|281182766|ref|NP_001162402.1| SET domain-containing protein 9 [Papio anubis]
 gi|163780989|gb|ABY40772.1| chromosome 5 open reading frame 35 (predicted) [Papio anubis]
          Length = 298

 Score = 45.8 bits (107), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 43/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 133 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 184

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W L S               E  N
Sbjct: 185 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 221

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++   ++  +R
Sbjct: 222 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPVELKQYLPNIAYSYDKQSPLR 273


>gi|302564946|ref|NP_001181100.1| SET domain containing 9 [Macaca mulatta]
 gi|355691322|gb|EHH26507.1| hypothetical protein EGK_16503 [Macaca mulatta]
 gi|355749929|gb|EHH54267.1| hypothetical protein EGM_15070 [Macaca fascicularis]
 gi|380809686|gb|AFE76718.1| hypothetical protein LOC133383 isoform 1 [Macaca mulatta]
          Length = 299

 Score = 45.8 bits (107), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 43/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W L S               E  N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++   ++  +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPVELKQYLPNIAYSYDKQSPLR 274


>gi|197102146|ref|NP_001125419.1| SET domain-containing protein 9 [Pongo abelii]
 gi|75055093|sp|Q5RBW0.1|SETD9_PONAB RecName: Full=SET domain-containing protein 9
 gi|55727995|emb|CAH90750.1| hypothetical protein [Pongo abelii]
          Length = 273

 Score = 45.8 bits (107), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 36/163 (22%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 108 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSIG--------NPFIFRCLDGVLID 159

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W L S               E  N
Sbjct: 160 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 196

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++
Sbjct: 197 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAY 239


>gi|397514313|ref|XP_003827435.1| PREDICTED: SET domain-containing protein 9 isoform 1 [Pan paniscus]
          Length = 299

 Score = 45.8 bits (107), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 43/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W L S               E  N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++   ++  +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAYSYDKQSPLR 274


>gi|426384667|ref|XP_004058879.1| PREDICTED: SET domain-containing protein 9 isoform 1 [Gorilla
           gorilla gorilla]
          Length = 299

 Score = 45.8 bits (107), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 43/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W L S               E  N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++   ++  +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAYSYDKQSPLR 274


>gi|338718749|ref|XP_001916598.2| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           C5orf35-like, partial [Equus caballus]
          Length = 279

 Score = 45.4 bits (106), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 43/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 114 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGILID 165

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W L S               E  N
Sbjct: 166 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 202

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++   ++  +R
Sbjct: 203 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAYSYDKQSPLR 254


>gi|114600191|ref|XP_517753.2| PREDICTED: SET domain-containing protein 9 isoform 2 [Pan
           troglodytes]
 gi|119575352|gb|EAW54957.1| hypothetical protein MGC33648, isoform CRA_b [Homo sapiens]
 gi|410225266|gb|JAA09852.1| chromosome 5 open reading frame 35 [Pan troglodytes]
 gi|410252724|gb|JAA14329.1| chromosome 5 open reading frame 35 [Pan troglodytes]
 gi|410301704|gb|JAA29452.1| chromosome 5 open reading frame 35 [Pan troglodytes]
 gi|410329565|gb|JAA33729.1| chromosome 5 open reading frame 35 [Pan troglodytes]
          Length = 299

 Score = 45.4 bits (106), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 43/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W L S               E  N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++   ++  +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAYSYDKQSPLR 274


>gi|22328098|gb|AAH36528.1| Chromosome 5 open reading frame 35 [Homo sapiens]
 gi|325464235|gb|ADZ15888.1| chromosome 5 open reading frame 35 [synthetic construct]
          Length = 299

 Score = 45.4 bits (106), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 43/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W L S               E  N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++   ++  +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAYSYDKQSPLR 274


>gi|285397547|ref|NP_714917.2| SET domain-containing protein 9 isoform 1 [Homo sapiens]
 gi|296434439|sp|Q8NE22.2|SETD9_HUMAN RecName: Full=SET domain-containing protein 9
          Length = 299

 Score = 45.4 bits (106), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 43/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W L S               E  N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++   ++  +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAYSYDKQSPLR 274


>gi|291395387|ref|XP_002714031.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
          Length = 628

 Score = 45.4 bits (106), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 44/183 (24%), Positives = 75/183 (40%), Gaps = 38/183 (20%)

Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGY 203
           L   +G+ +   PS +    AG+G+F+  G    GAV+++YPG +Y    P +++ I   
Sbjct: 445 LYKTLGFCVVRAPSSV--ISAGKGVFVAKGLVPKGAVVSMYPGTVYQKHEPIFFQSI--- 499

Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
                 NP++    DG +I+    G          G        +  G  K SD  W  L
Sbjct: 500 -----GNPFIFRCLDGVLIDGNDKGISKVVYRSCSGR-------DRLGPLKMSDSTW--L 545

Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLT-EKDMRPYIPN 322
           +   D              NPLA+  + N+ +     NV    +D P     +++ Y+PN
Sbjct: 546 TAETD--------------NPLAVGQYVNNCSNERAANVCYQEFDVPAVFPVELKQYLPN 591

Query: 323 ISF 325
            ++
Sbjct: 592 TAY 594


>gi|431908607|gb|ELK12200.1| hypothetical protein PAL_GLEAN10009679 [Pteropus alecto]
          Length = 299

 Score = 45.4 bits (106), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 44/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W L S               E  N
Sbjct: 186 GNDKGISKVVFRSCNG-------RDRLGPLKMSDSTW-LTS---------------EFHN 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++   +E  +R
Sbjct: 223 PLAVGQYVNNCSNDRPANVCYQEFDVPAVFPIELKQYLPNIAYSFDKESPLR 274


>gi|397514315|ref|XP_003827436.1| PREDICTED: SET domain-containing protein 9 isoform 2 [Pan paniscus]
          Length = 271

 Score = 45.1 bits (105), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 36/163 (22%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W L S               E  N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAY 265


>gi|297294316|ref|XP_002804415.1| PREDICTED: uncharacterized protein C5orf35-like isoform 2 [Macaca
           mulatta]
          Length = 271

 Score = 45.1 bits (105), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 36/163 (22%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W L S               E  N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPVELKQYLPNIAY 265


>gi|332821466|ref|XP_003310776.1| PREDICTED: SET domain-containing protein 9 isoform 1 [Pan
           troglodytes]
 gi|426384669|ref|XP_004058880.1| PREDICTED: SET domain-containing protein 9 isoform 2 [Gorilla
           gorilla gorilla]
          Length = 271

 Score = 45.1 bits (105), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 36/163 (22%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W L S               E  N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAY 265


>gi|285397577|ref|NP_001165461.1| SET domain-containing protein 9 isoform 2 [Homo sapiens]
          Length = 271

 Score = 45.1 bits (105), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 36/163 (22%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W L S               E  N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAY 265


>gi|345327968|ref|XP_001507930.2| PREDICTED: uncharacterized protein C5orf35-like [Ornithorhynchus
           anatinus]
          Length = 339

 Score = 45.1 bits (105), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 67/163 (41%), Gaps = 36/163 (22%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG IY    P +++ I         NP++    DG +I+
Sbjct: 174 AGKGVFVTKGFVPKGAVVSMYPGTIYQMSEPIFFQSI--------GNPFIFRCIDGVLID 225

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G          G        +  G  K SD  W   S                 +N
Sbjct: 226 GNDKGISKAVYRSCSGR-------DQLGPFKMSDSTWLTSSV----------------QN 262

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
           PLA+  + N+ +     NV    +D P L   + + Y+PNI++
Sbjct: 263 PLAVGQYVNNCSNDKAANVCYQEFDVPELFPIEFKRYLPNIAY 305


>gi|403267548|ref|XP_003925888.1| PREDICTED: SET domain-containing protein 9 [Saimiri boliviensis
           boliviensis]
          Length = 299

 Score = 44.7 bits (104), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 36/163 (22%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 134 AGKGVFVTKGFVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W L S               E  N
Sbjct: 186 GNNKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAY 265


>gi|301785768|ref|XP_002928298.1| PREDICTED: uncharacterized protein C5orf35-like [Ailuropoda
           melanoleuca]
 gi|281339574|gb|EFB15158.1| hypothetical protein PANDA_018216 [Ailuropoda melanoleuca]
          Length = 299

 Score = 44.7 bits (104), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 74/172 (43%), Gaps = 36/172 (20%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 134 AGKGVFVTRGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W               SE+    N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPFKMSDSTW-------------LTSEIC---N 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++   ++  +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAYSCDKQSPLR 274


>gi|145545702|ref|XP_001458535.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426355|emb|CAK91138.1| unnamed protein product [Paramecium tetraurelia]
          Length = 569

 Score = 44.7 bits (104), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 50/185 (27%), Positives = 77/185 (41%), Gaps = 41/185 (22%)

Query: 145 SQRLKDAIGYTLDLKPSQIPHEEAGQGLFL-CGEAN-----VGAVIAIYPGIIYSPAYYR 198
           +QR+   +G+ L+L+ S I H +AG+G+FL    AN      G ++   PG+IY      
Sbjct: 282 TQRVTQNLGFKLNLRESTIDHSDAGEGVFLETLPANKRLVPAGTLLGFVPGLIYD----- 336

Query: 199 YIPGYPRVDAQNP--YLITRYDGTVINAQ-----PW--GSGWDTRELWDGLTLPEIMPNS 249
               Y     + P      R+DG + +       P+  G G+D  E        +I    
Sbjct: 337 ---NYKSYSEKTPPDMHFFRFDGPIFDFTSRIYYPYVPGYGFDEYE-------QKISDIK 386

Query: 250 KGAEKGSDQFWKLLSKPMDNKRGGSGSEML--ERRNPLALAHFANHPAKGMVPNVMICPY 307
           + A++ S  +         NK  G G E++  E  NP AL H  NH  +    NV +   
Sbjct: 387 ELADRSSRSY--------RNKSKG-GYELVKGEDINPYALGHKINHTPRNKSTNVSLIDI 437

Query: 308 DFPLT 312
             P T
Sbjct: 438 FVPQT 442


>gi|73949541|ref|XP_849744.1| PREDICTED: uncharacterized protein C5orf35 isoform 1 [Canis lupus
           familiaris]
          Length = 299

 Score = 44.3 bits (103), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 74/172 (43%), Gaps = 36/172 (20%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG +I+
Sbjct: 134 AGKGVFVTRGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W               SE+    N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPFKMSDSTW-------------LTSEIC---N 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++   ++  +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAYSCDKQSPLR 274


>gi|410948609|ref|XP_003981023.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein 9
           [Felis catus]
          Length = 307

 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 70/163 (42%), Gaps = 36/163 (22%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+A+YPG +Y    P +++ I         NP++    DG +++
Sbjct: 142 AGKGVFVTRGLVPKGAVVAMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLVD 193

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  K SD  W               SE+    N
Sbjct: 194 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-------------LTSEIC---N 230

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++
Sbjct: 231 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAY 273


>gi|348569018|ref|XP_003470295.1| PREDICTED: uncharacterized protein C5orf35-like [Cavia porcellus]
          Length = 299

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 36/163 (22%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DGT+I+
Sbjct: 134 AGRGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGTLID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G          G        +  G  K SD  W L S               E  N
Sbjct: 186 GNDKGISKVVYRSCSG-------RDRLGPLKMSDATW-LTS---------------EIHN 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
           PLA+  + N+ +     NV    ++ P +   +++ Y+PNI++
Sbjct: 223 PLAIGQYVNNCSNDRAANVCYQEFEVPAVFPIELKQYLPNIAY 265


>gi|355736428|gb|AES11997.1| hypothetical protein [Mustela putorius furo]
          Length = 219

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 83/192 (43%), Gaps = 38/192 (19%)

Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGY 203
           L   +G+++  + S +    AG+G+F+  G    GAV+++YPG +Y    P +++ I   
Sbjct: 37  LYKTLGFSVARETSSLI--SAGKGVFVTRGWVPKGAVVSMYPGTVYQKYEPIFFQSI--- 91

Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
                 NP++    DG +I+    G         +G        +  G  K SD  W L 
Sbjct: 92  -----GNPFIFRCLDGVLIDGNDKGISKVVYRSCNG-------RDRLGPFKMSDSTW-LT 138

Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPN 322
           S               E  NPLA+  + N+ +     NV    +D P +   +++ Y+PN
Sbjct: 139 S---------------EICNPLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPN 183

Query: 323 ISFGNAEEVNMR 334
           I++   ++  +R
Sbjct: 184 IAYSCDKQSPLR 195


>gi|327262831|ref|XP_003216227.1| PREDICTED: uncharacterized protein C5orf35-like [Anolis
           carolinensis]
          Length = 295

 Score = 43.9 bits (102), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 44/189 (23%), Positives = 83/189 (43%), Gaps = 32/189 (16%)

Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIPGYPRV 206
           L + +G+++    S +    AG G+F+  G    G V+++YPG +Y     +Y P + + 
Sbjct: 112 LFNTLGFSITRSCSSLV--SAGTGVFVTKGLVPKGTVVSMYPGTVYE----KYEPIFFQ- 164

Query: 207 DAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKP 266
              NP++    DG +I+    G    ++ ++   +      +  G  K SD FW  L+  
Sbjct: 165 SIGNPFIFRCIDGVLIDGNDKGI---SKAIYRSCS----KRDQFGPLKMSDAFW--LTNS 215

Query: 267 MDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
           M              +NPLA+  + N+ +     NV    +D P     + + Y+PNI +
Sbjct: 216 M--------------QNPLAVGQYVNNCSTEKSANVCYQEFDVPEYFPIEFKQYLPNIKY 261

Query: 326 GNAEEVNMR 334
            +  +  +R
Sbjct: 262 SHEVQRPLR 270


>gi|351697671|gb|EHB00590.1| hypothetical protein GW7_10244 [Heterocephalus glaber]
          Length = 274

 Score = 43.9 bits (102), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 44/172 (25%), Positives = 73/172 (42%), Gaps = 36/172 (20%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV++IYPG IY    P +++ I         NP++    DG +I+
Sbjct: 109 AGRGVFVTKGLVPKGAVVSIYPGTIYQKYEPIFFQSI--------GNPFIFRCLDGVLID 160

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G          G        +  G  K SD  W L S               E  N
Sbjct: 161 GNDKGISKAVYRSCSG-------RDRIGPLKMSDSTW-LTS---------------EICN 197

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
           PLA+  + N+ +     NV    ++ P +   +++ Y+PNI++   ++  +R
Sbjct: 198 PLAVGQYVNNCSNDRPANVCYQEFEVPAVFPIELKQYLPNIAYSYDKQSPLR 249


>gi|444725095|gb|ELW65674.1| hypothetical protein TREES_T100005305 [Tupaia chinensis]
          Length = 299

 Score = 43.5 bits (101), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 73/172 (42%), Gaps = 36/172 (20%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG+G+F+  G    GAV+++YPG +Y    P +++ I         NP++    DG  I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVFID 185

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G         +G        +  G  + SD  W L S               E  N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLQMSDSTW-LTS---------------EIHN 222

Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
           PLA+  + N+ +     NV    +D P +   +++ Y+PNI++   ++  +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPTVFPIELKQYLPNIAYSYDKQSPLR 274


>gi|449270355|gb|EMC81040.1| hypothetical protein A306_11210, partial [Columba livia]
          Length = 297

 Score = 43.5 bits (101), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 42/192 (21%), Positives = 86/192 (44%), Gaps = 38/192 (19%)

Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIY---SPAYYRYIPGY 203
           L + +G+++    S +    AG G+F+  G    G V+++YPG +Y    P +++ +   
Sbjct: 114 LFNTLGFSITRDRSSLV--SAGTGVFVTKGFVPKGTVVSMYPGTVYRKHEPIFFQSLG-- 169

Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
                 NP++    DG +I+    G    +R ++   +  + +    G  + SD+ W  L
Sbjct: 170 ------NPFIFRCIDGILIDGNDKGL---SRSVYRSCSRRDQL----GPFQMSDESW--L 214

Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPN 322
           +  +              +NPLA+  + N+ +     NV    +D P     +++ Y+PN
Sbjct: 215 TAAL--------------QNPLAVGQYVNNCSHDKAANVCYQEFDVPGYFPVELKQYLPN 260

Query: 323 ISFGNAEEVNMR 334
           I + +  E ++R
Sbjct: 261 IVYSHDIESHLR 272


>gi|224090581|ref|XP_002187427.1| PREDICTED: SET domain-containing protein 9 [Taeniopygia guttata]
          Length = 294

 Score = 42.7 bits (99), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 40/175 (22%), Positives = 79/175 (45%), Gaps = 42/175 (24%)

Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIY---SPAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
           AG G+F+  G    G V+++YPG +Y    P +++ +         NP++    DG +I+
Sbjct: 129 AGTGVFVTKGFVPKGTVVSMYPGTVYRKHEPIFFQSLG--------NPFIFRCIDGVLID 180

Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
               G    +R ++   +  + +    G  + SD+ W  L+  +              +N
Sbjct: 181 GNDKGL---SRSVYRSCSRRDQL----GPFQMSDESW--LTATL--------------QN 217

Query: 284 PLALAHFANHPAKGMVPNVMICPYD----FPLTEKDMRPYIPNISFGNAEEVNMR 334
           PLA+  + N+ +     NV    +D    FP+   +++ Y+PNI + +  E ++R
Sbjct: 218 PLAVGQYVNNCSHEKTANVCYQEFDVPGHFPV---ELKQYLPNIVYSHDIESHLR 269


>gi|196011120|ref|XP_002115424.1| hypothetical protein TRIADDRAFT_50702 [Trichoplax adhaerens]
 gi|190582195|gb|EDV22269.1| hypothetical protein TRIADDRAFT_50702 [Trichoplax adhaerens]
          Length = 255

 Score = 42.4 bits (98), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 49/200 (24%), Positives = 86/200 (43%), Gaps = 43/200 (21%)

Query: 145 SQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIPGY 203
           S  + +  G+T+  K ++    + G G+++  G    G V+A+YPG IY           
Sbjct: 65  SMIMMNVCGFTIARKETK----DKGIGVYVSDGYIPPGKVVALYPGTIYQSH-----DSM 115

Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
                +N ++    DGT+I+ +  G    ++ ++   T  +I+    G    +D  W L 
Sbjct: 116 LFQSIRNSFMFRCVDGTIIDGKDHGI---SKAVYMSCTKRDIL----GHHILNDITW-LT 167

Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMI----CPYDFPLTEKDMRPY 319
           S P+               NP+A+  + N+ +K    NV+      P D PLT   +  +
Sbjct: 168 SNPV---------------NPMAIGQYINNQSKNSPANVIYHEINFPEDLPLT---LYRF 209

Query: 320 IPNISFGNA---EEVNMRRF 336
           IPN+ F N    EE+  R+ 
Sbjct: 210 IPNVYFRNLTYEEEMGTRKI 229


>gi|340384402|ref|XP_003390701.1| PREDICTED: hypothetical protein LOC100637857 [Amphimedon
           queenslandica]
          Length = 539

 Score = 41.6 bits (96), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 56/128 (43%), Gaps = 13/128 (10%)

Query: 103 ESSAQSNAAPRRSGLSFAVGPTARPTDSPVVPQTRQLT----RTELSQRLKDAIGYTLDL 158
           +S A S   P+   LS+ + P    T       T QL     R+   + L D   +TL++
Sbjct: 35  QSKALSQLKPQLETLSYIIKPLTPLTFKSNFSDTNQLKYQTLRSSAIKSLFDINNFTLNV 94

Query: 159 KPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRY 217
            PS    +  G G+F+  GE   G ++A+YPG IY P     +         N Y++   
Sbjct: 95  SPSL---QAEGLGVFVSQGEVREGQLVALYPGTIYLPLQPILLQSL-----HNSYILRCV 146

Query: 218 DGTVINAQ 225
           DG  I+ +
Sbjct: 147 DGIHIDGR 154


>gi|281212403|gb|EFA86563.1| hypothetical protein PPL_00364 [Polysphondylium pallidum PN500]
          Length = 141

 Score = 41.2 bits (95), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 24/42 (57%)

Query: 358 ALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
             VA + I DEE+ LNYR + +   P WY   D EE +RRW 
Sbjct: 85  GYVAYQPIKDEELFLNYRFNPANPYPAWYHQPDIEEAKRRWG 126


>gi|428179967|gb|EKX48836.1| hypothetical protein GUITHDRAFT_105460 [Guillardia theta CCMP2712]
          Length = 225

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 32/49 (65%), Gaps = 3/49 (6%)

Query: 354 LKTLALVATRAICD-EEVLLNYRLSNSKRR--PVWYSPVDEEEDRRRWS 399
           L  + L++   I D  E+LL+YRL+  K++  P WY+PVD +  +RRW+
Sbjct: 175 LGHMVLISKSDIPDGSEILLDYRLNPKKKKYWPSWYTPVDVDAAKRRWA 223


>gi|325186224|emb|CCA20725.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 179

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 45/209 (21%), Positives = 88/209 (42%), Gaps = 46/209 (22%)

Query: 1   MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEII- 59
           M+ +F++F+ A++    S       R+ Q E D   + L T + +L        A+ I+ 
Sbjct: 1   MSSIFRRFRLALEGTIPS-------RRRQEEVDTGWVALLTEFRKLSEAKHSKQAKRILV 53

Query: 60  ---------DMAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIE-SSAQSN 109
                    D+  K S  +Q+ Q+   I ++++    S          + P+   S  SN
Sbjct: 54  LDELLTEASDLIKKKSFKEQKAQIQAEISARLEEIFKSYS--------SRPMHWDSVGSN 105

Query: 110 AAPRRSGLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAG 169
            A     LS +        D  ++                D +G+ L ++ S +  ++AG
Sbjct: 106 KAK----LSISGFSRKHFIDEDILEN--------------DLLGFRLSVEQSNL--DKAG 145

Query: 170 QGLFLCGEANVGAVIAIYPGIIYSPAYYR 198
            G+ + G+   G+++A+YPG +Y P +Y+
Sbjct: 146 DGVKVYGKVPPGSLVALYPGTVYLPEHYK 174


>gi|223997052|ref|XP_002288199.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975307|gb|EED93635.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 598

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 40/141 (28%), Positives = 60/141 (42%), Gaps = 35/141 (24%)

Query: 283 NPLALAHFANHP-----------AKGMV-PNVMICPYDFP------LTEKDMRPYIPNIS 324
           N  AL H  NHP           ++ ++ PN +  P ++        T  D++ YIPN  
Sbjct: 465 NLWALGHIVNHPPAPLKSTESSSSQTLIGPNCVSVPINYTESMFKQHTSSDLKRYIPN-- 522

Query: 325 FGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEEVLLNYRLS---NSKR 381
                E+  + +    F           V+  + LVA R + DEE+  +YRLS   ++K+
Sbjct: 523 ---EYELPPQPWAKNVF------DREKVVMHGMGLVALRDVKDEELFYDYRLSPDESAKK 573

Query: 382 R---PVWYSPVDEEEDRRRWS 399
           R   P WY   D E    RWS
Sbjct: 574 RSQYPPWYHVWDAEAMNNRWS 594


>gi|225709596|gb|ACO10644.1| C5orf35 [Caligus rogercresseyi]
          Length = 241

 Score = 40.0 bits (92), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 9/85 (10%)

Query: 145 SQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIPGY 203
           S  L    GY L +  S+   E  G G+FL  G  + G ++ +YPG IY P     IP  
Sbjct: 61  SSILSSQFGYELCVSGSK---EGKGCGVFLSSGSVHPGQIVGLYPGSIYHPGDPILIP-- 115

Query: 204 PRVDAQNPYLITRYDGTVINAQPWG 228
                 NPY++   DG +I+ +  G
Sbjct: 116 ---SLFNPYILQLSDGVMIDGRSHG 137


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.134    0.409 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,567,799,126
Number of Sequences: 23463169
Number of extensions: 294714666
Number of successful extensions: 667880
Number of sequences better than 100.0: 118
Number of HSP's better than 100.0 without gapping: 52
Number of HSP's successfully gapped in prelim test: 66
Number of HSP's that attempted gapping in prelim test: 667597
Number of HSP's gapped (non-prelim): 164
length of query: 399
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 254
effective length of database: 8,957,035,862
effective search space: 2275087108948
effective search space used: 2275087108948
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)