BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 015859
(399 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225465688|ref|XP_002272694.1| PREDICTED: uncharacterized protein LOC100251105 [Vitis vinifera]
Length = 405
Score = 634 bits (1634), Expect = e-179, Method: Compositional matrix adjust.
Identities = 303/405 (74%), Positives = 344/405 (84%), Gaps = 6/405 (1%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
MAFLFQKFQEAVKTLAKSP FARDPR LQFEAD+NRLFLYTSYNRLGR+A+EADAE IID
Sbjct: 1 MAFLFQKFQEAVKTLAKSPVFARDPRHLQFEADINRLFLYTSYNRLGRNADEADAENIID 60
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPI--ESSAQSNAAPRRSGLS 118
MAGKASL+DQQ+QV +NIHSQIK FC ++D +LL KN+ ES Q+ A PRRSGLS
Sbjct: 61 MAGKASLADQQKQVQENIHSQIKTFCNTLDGVLLPDSKNMNELPESCPQTKATPRRSGLS 120
Query: 119 FAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEA 178
FAVG P+D+ VPQT+ L R ELSQ LKD IGYTLDLKPSQIPH++AGQGLFL GEA
Sbjct: 121 FAVGRDGPPSDNSAVPQTKPLKRAELSQSLKDLIGYTLDLKPSQIPHKDAGQGLFLDGEA 180
Query: 179 NVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWD 238
+VGAV+A+YPG+IY PAYY+YIPGYPRVDA NPYLITRYDGTVINAQPWG G +TRELWD
Sbjct: 181 DVGAVVALYPGVIYFPAYYQYIPGYPRVDAHNPYLITRYDGTVINAQPWGVGGETRELWD 240
Query: 239 GLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGM 298
G ++ E+ N +G EKGSD+ WKLLSKPM+ K GGSG ++LERRNPLA AHFANHP+KGM
Sbjct: 241 GFSVTEVSANMQGDEKGSDRIWKLLSKPMEAKHGGSGCDVLERRNPLAFAHFANHPSKGM 300
Query: 299 VPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSS----TPVL 354
PNVM+CPYDFPLTE+DMR YIPN+ FG+++EV M+R GSFWFK GS + S PVL
Sbjct: 301 SPNVMVCPYDFPLTEQDMRAYIPNVLFGSSKEVKMKRLGSFWFKSGSSNKSELDAHVPVL 360
Query: 355 KTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
KTL LVATRA+CDEEVLLNYRLSNSKRRP WY+PVDEEEDRRRWS
Sbjct: 361 KTLVLVATRALCDEEVLLNYRLSNSKRRPAWYTPVDEEEDRRRWS 405
>gi|224083904|ref|XP_002307167.1| predicted protein [Populus trichocarpa]
gi|222856616|gb|EEE94163.1| predicted protein [Populus trichocarpa]
Length = 402
Score = 608 bits (1568), Expect = e-171, Method: Compositional matrix adjust.
Identities = 300/402 (74%), Positives = 347/402 (86%), Gaps = 3/402 (0%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
MA LF KFQE VKTLAKSPTFAR+PR+ QFE DMNRLFL+TSYNRLG++AEEADAEEII+
Sbjct: 1 MASLFNKFQEFVKTLAKSPTFARNPREFQFETDMNRLFLFTSYNRLGKNAEEADAEEIIE 60
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILL--LPEKNVPIESSAQSNAAPRRSGLS 118
MA KAS++DQQ+QV +NIH QI FC +M+EILL L +++ E+ AQSNAAPRRSGLS
Sbjct: 61 MASKASVADQQKQVQENIHFQITNFCSAMNEILLPDLEKRSKKDEAPAQSNAAPRRSGLS 120
Query: 119 FAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEA 178
FAVG + P D P +P+T+QL+R+++SQ L+D IGYTLDLKPS IPH+EAG+GLFL GEA
Sbjct: 121 FAVGKSGPPIDRPAIPETKQLSRSKVSQSLRDEIGYTLDLKPSLIPHKEAGEGLFLSGEA 180
Query: 179 NVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWD 238
+VG VIAIYPG+IYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINA PWGSG ++RE+WD
Sbjct: 181 DVGTVIAIYPGVIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINALPWGSGGESREVWD 240
Query: 239 GLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGM 298
GLT+PEI N + EKG D+ W++LSKP++ R GS ++LERRNPLALAHFANHPAKGM
Sbjct: 241 GLTVPEIRSNVQSVEKGQDRVWRMLSKPLEGTRVGSIGDVLERRNPLALAHFANHPAKGM 300
Query: 299 VPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGS-GSGSSTPVLKTL 357
PNVMICPYDFPLTEKDMR +IPN+SFGN EEVNMRRFGSFWFK + S S PVLKTL
Sbjct: 301 DPNVMICPYDFPLTEKDMRTFIPNVSFGNHEEVNMRRFGSFWFKSSAKNSVSDVPVLKTL 360
Query: 358 ALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
LVATRA+C+EEVLLNYRLSN+KRRP WYSPVDEEEDRRRWS
Sbjct: 361 VLVATRALCNEEVLLNYRLSNTKRRPAWYSPVDEEEDRRRWS 402
>gi|255558659|ref|XP_002520354.1| conserved hypothetical protein [Ricinus communis]
gi|223540452|gb|EEF42020.1| conserved hypothetical protein [Ricinus communis]
Length = 405
Score = 584 bits (1505), Expect = e-164, Method: Compositional matrix adjust.
Identities = 284/405 (70%), Positives = 334/405 (82%), Gaps = 6/405 (1%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFAR---DPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEE 57
MAFLF KFQEAVKTLAKSPTFAR +PRQLQF D+NRLFLYTSYNRLGR+AE+AD EE
Sbjct: 1 MAFLFNKFQEAVKTLAKSPTFARFTREPRQLQFAIDINRLFLYTSYNRLGRNAEDADVEE 60
Query: 58 IIDMAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILL--LPEKNVPIESSAQSNAAPRRS 115
II+MA +S++DQQ QV +NIHSQI+ FC SMDEILL + ++N E+ AQ + PRRS
Sbjct: 61 IIEMASTSSVTDQQNQVQENIHSQIESFCTSMDEILLPDMMKRNELGEALAQLDPVPRRS 120
Query: 116 GLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC 175
GLS AVG + T P + +TR L E+S RLK IGYTL+LKPSQIPH+EAGQGLF+
Sbjct: 121 GLSLAVGKSTPSTGCPAITETRPLKSGEVSSRLKGLIGYTLNLKPSQIPHKEAGQGLFID 180
Query: 176 GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRE 235
GEA+VG+VIA YPG+IY+PAYYRYIPGYPRVDA NPYLITRYDGTVINAQPWG G +TRE
Sbjct: 181 GEADVGSVIAFYPGVIYAPAYYRYIPGYPRVDACNPYLITRYDGTVINAQPWGFGGETRE 240
Query: 236 LWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPA 295
W+GL +P++ + + EKG D++WK+LSKP++ GS +E+LE+RNPLALAHFANHPA
Sbjct: 241 AWNGLLVPDVRHSVQIREKGVDRWWKMLSKPLEGTEFGSAAEVLEQRNPLALAHFANHPA 300
Query: 296 KGMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKW-GSGSGSSTPVL 354
K + PNVMICPYDFPLTEK+MR YIPNI FGN+EEVNMRRFGSFWF+ GSGS P+L
Sbjct: 301 KDIAPNVMICPYDFPLTEKNMRTYIPNIIFGNSEEVNMRRFGSFWFRSKAKGSGSGIPIL 360
Query: 355 KTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
K L LVATRA+ +EEVLLNYRLS+SKRRP WY+PVDEEEDRRRWS
Sbjct: 361 KALVLVATRALSNEEVLLNYRLSHSKRRPEWYTPVDEEEDRRRWS 405
>gi|449464032|ref|XP_004149733.1| PREDICTED: uncharacterized protein LOC101219994 [Cucumis sativus]
Length = 396
Score = 583 bits (1504), Expect = e-164, Method: Compositional matrix adjust.
Identities = 288/407 (70%), Positives = 332/407 (81%), Gaps = 19/407 (4%)
Query: 1 MAFLFQKFQEAVKTLAKSPTF------ARDPRQLQFEADMNRLFLYTSYNRLGRDAEEAD 54
MAFLFQKFQEAVK LAK+P F ARDPR+LQFEADMNRLFL+TSYNRLG DA EAD
Sbjct: 1 MAFLFQKFQEAVKVLAKNPLFTKNPLFARDPRRLQFEADMNRLFLFTSYNRLGMDATEAD 60
Query: 55 AEEIIDMAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRR 114
AEEIIDMA KAS +DQQ+QV +NIHSQ+K FC+ MD ILL E P ES +S A R+
Sbjct: 61 AEEIIDMASKASFADQQKQVQENIHSQVKSFCMHMDGILLPDE---PAESPEKSKDAVRK 117
Query: 115 SGLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFL 174
SGLSFAVG T +P TR L R+ELSQ+LKD IGYTLD+KPS+IPH++AGQGLF+
Sbjct: 118 SGLSFAVGRTN------TIPTTRPLGRSELSQQLKDEIGYTLDIKPSRIPHKDAGQGLFI 171
Query: 175 CGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTR 234
GEA+VG++IAIYPG++YSPA+Y+YIPGYPRVDAQNPYLITRYDGTVINAQPWG G DTR
Sbjct: 172 DGEADVGSIIAIYPGVVYSPAHYQYIPGYPRVDAQNPYLITRYDGTVINAQPWGLGADTR 231
Query: 235 ELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHP 294
E+W GLT+P P +G EK SD+ W++LSKP++ KR G + +ERRNPLA AHFANHP
Sbjct: 232 EVWHGLTVPNSNPTKQGDEK-SDRLWRMLSKPLEAKRVQHGGDAIERRNPLAFAHFANHP 290
Query: 295 AKGMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSG--SGSSTP 352
AK MVPNVM+CPYDFPLTEKDMR YIPN+ N EEV M+RFGSFWFK G +GS TP
Sbjct: 291 AKDMVPNVMLCPYDFPLTEKDMRVYIPNVPVAN-EEVGMKRFGSFWFKSGRSRINGSDTP 349
Query: 353 VLKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
+LKT+ LVATRA+C+EEVLLNYRLSNSKRRP WY+PVDE EDRRRWS
Sbjct: 350 ILKTIVLVATRALCNEEVLLNYRLSNSKRRPSWYTPVDEGEDRRRWS 396
>gi|297808317|ref|XP_002872042.1| hypothetical protein ARALYDRAFT_489175 [Arabidopsis lyrata subsp.
lyrata]
gi|297317879|gb|EFH48301.1| hypothetical protein ARALYDRAFT_489175 [Arabidopsis lyrata subsp.
lyrata]
Length = 398
Score = 558 bits (1437), Expect = e-156, Method: Compositional matrix adjust.
Identities = 268/402 (66%), Positives = 326/402 (81%), Gaps = 7/402 (1%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
MA LF KFQ+AV LAKS TFA++PRQLQFEAD+N+LF+YTSY RLGR+AEE DAEEII+
Sbjct: 1 MATLFNKFQQAVGVLAKSTTFAKNPRQLQFEADVNKLFMYTSYYRLGREAEETDAEEIIE 60
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAP-RRSGLSF 119
MAGKA+LS+QQ+QV +NIH Q++ FC MD ILL +N ES +QS + P RRSGL+F
Sbjct: 61 MAGKATLSEQQKQVQENIHYQVENFCSLMDGILLPDVRNN--ESGSQSTSPPPRRSGLTF 118
Query: 120 AVGPT-ARPT-DSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGE 177
A+G A P D+P+VP+T+ L E+SQRL D +GYTL++KPS IPH++AGQG F+ GE
Sbjct: 119 AIGSNNAFPAADTPLVPETKPLKLNEVSQRLMDQMGYTLEIKPSVIPHKDAGQGCFIKGE 178
Query: 178 ANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELW 237
A+VG V+A YPG+IYSPA+YRYIPGYP+VD+QN YLITRYDGTVINAQPWG G ++RE+W
Sbjct: 179 ADVGTVLAFYPGVIYSPAFYRYIPGYPKVDSQNSYLITRYDGTVINAQPWGLGGESREVW 238
Query: 238 DGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKG 297
+G P + NSK AE GSD+ WK LSKP++ G ++LERRNPLA H ANHPAK
Sbjct: 239 NGSYTPAVKANSKAAENGSDRLWKALSKPLEG--SGEAKDVLERRNPLAFGHLANHPAKE 296
Query: 298 MVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTL 357
M PNVMICPYDFPL KD+RPYIPNISFG++ E+ M+RFGSFWFK GS +G PVLKTL
Sbjct: 297 MTPNVMICPYDFPLMAKDLRPYIPNISFGDSGEIKMKRFGSFWFKTGSKNGLEAPVLKTL 356
Query: 358 ALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
LVATR++C+EE+LLNYRLSNSKRRP WY+PV+EEEDRRRWS
Sbjct: 357 VLVATRSLCNEELLLNYRLSNSKRRPDWYTPVNEEEDRRRWS 398
>gi|22327005|ref|NP_197711.2| uncharacterized protein [Arabidopsis thaliana]
gi|18491157|gb|AAL69481.1| unknown protein [Arabidopsis thaliana]
gi|20258933|gb|AAM14182.1| unknown protein [Arabidopsis thaliana]
gi|332005749|gb|AED93132.1| uncharacterized protein [Arabidopsis thaliana]
Length = 399
Score = 556 bits (1434), Expect = e-156, Method: Compositional matrix adjust.
Identities = 265/401 (66%), Positives = 322/401 (80%), Gaps = 4/401 (0%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
MA LF KFQ+AV LAKS TFA++PRQLQFEAD+N+LF+YTSYNRLGR+AEE DAEEII+
Sbjct: 1 MATLFNKFQQAVGVLAKSTTFAKNPRQLQFEADVNKLFMYTSYNRLGREAEETDAEEIIE 60
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFA 120
MAGKA+LS+QQ+QV +NIH Q+++FC MD ILL + S + S PRRSGL+FA
Sbjct: 61 MAGKATLSEQQKQVQENIHYQVEKFCSLMDGILLPDVRRNESGSQSTSPRPPRRSGLTFA 120
Query: 121 VGPT-ARP-TDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEA 178
+G A P D P+VP+T+ L ++SQRL D +GYTL+ KPS IPH++AGQG F+ GEA
Sbjct: 121 IGSNNAFPHADQPLVPETKPLKLNDVSQRLMDQMGYTLETKPSVIPHKDAGQGCFIKGEA 180
Query: 179 NVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWD 238
+VG V+A YPG+IYSPA+Y+YIPGYP+VD+QN YLITRYDGTVINAQPWG G ++RE W+
Sbjct: 181 DVGTVLAFYPGVIYSPAFYKYIPGYPKVDSQNSYLITRYDGTVINAQPWGLGGESREAWN 240
Query: 239 GLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGM 298
G P + N+K AE GSD+ WK LSKP++ G G E+LERRNPLA H ANHPAK M
Sbjct: 241 GSYTPAVKANTKTAENGSDRLWKALSKPLEG--SGKGKEVLERRNPLAFGHLANHPAKEM 298
Query: 299 VPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLA 358
PNVMICPYDFPL KD+RPYIPNISFG++ E+ M+RFGSFWFK GS +G PVLKTL
Sbjct: 299 TPNVMICPYDFPLMAKDLRPYIPNISFGDSGEIKMKRFGSFWFKTGSKNGLEAPVLKTLV 358
Query: 359 LVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
LVATR++C+EE+LLNYRLSNSKRRP WY+PV+EEEDRRRWS
Sbjct: 359 LVATRSLCNEELLLNYRLSNSKRRPDWYTPVNEEEDRRRWS 399
>gi|10177809|dbj|BAB11175.1| unnamed protein product [Arabidopsis thaliana]
Length = 437
Score = 556 bits (1434), Expect = e-156, Method: Compositional matrix adjust.
Identities = 265/401 (66%), Positives = 322/401 (80%), Gaps = 4/401 (0%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
MA LF KFQ+AV LAKS TFA++PRQLQFEAD+N+LF+YTSYNRLGR+AEE DAEEII+
Sbjct: 39 MATLFNKFQQAVGVLAKSTTFAKNPRQLQFEADVNKLFMYTSYNRLGREAEETDAEEIIE 98
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFA 120
MAGKA+LS+QQ+QV +NIH Q+++FC MD ILL + S + S PRRSGL+FA
Sbjct: 99 MAGKATLSEQQKQVQENIHYQVEKFCSLMDGILLPDVRRNESGSQSTSPRPPRRSGLTFA 158
Query: 121 VGPT-ARP-TDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEA 178
+G A P D P+VP+T+ L ++SQRL D +GYTL+ KPS IPH++AGQG F+ GEA
Sbjct: 159 IGSNNAFPHADQPLVPETKPLKLNDVSQRLMDQMGYTLETKPSVIPHKDAGQGCFIKGEA 218
Query: 179 NVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWD 238
+VG V+A YPG+IYSPA+Y+YIPGYP+VD+QN YLITRYDGTVINAQPWG G ++RE W+
Sbjct: 219 DVGTVLAFYPGVIYSPAFYKYIPGYPKVDSQNSYLITRYDGTVINAQPWGLGGESREAWN 278
Query: 239 GLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGM 298
G P + N+K AE GSD+ WK LSKP++ G G E+LERRNPLA H ANHPAK M
Sbjct: 279 GSYTPAVKANTKTAENGSDRLWKALSKPLEG--SGKGKEVLERRNPLAFGHLANHPAKEM 336
Query: 299 VPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLA 358
PNVMICPYDFPL KD+RPYIPNISFG++ E+ M+RFGSFWFK GS +G PVLKTL
Sbjct: 337 TPNVMICPYDFPLMAKDLRPYIPNISFGDSGEIKMKRFGSFWFKTGSKNGLEAPVLKTLV 396
Query: 359 LVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
LVATR++C+EE+LLNYRLSNSKRRP WY+PV+EEEDRRRWS
Sbjct: 397 LVATRSLCNEELLLNYRLSNSKRRPDWYTPVNEEEDRRRWS 437
>gi|356532231|ref|XP_003534677.1| PREDICTED: uncharacterized protein LOC100782005 [Glycine max]
Length = 402
Score = 551 bits (1419), Expect = e-154, Method: Compositional matrix adjust.
Identities = 277/406 (68%), Positives = 321/406 (79%), Gaps = 11/406 (2%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
MA LFQKFQEAVKTLAK P+F +DPRQLQFEAD+NRLFLY SYN LG++A+EA AEEII+
Sbjct: 1 MALLFQKFQEAVKTLAKRPSFTKDPRQLQFEADINRLFLYLSYNHLGKNADEAHAEEIIE 60
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFA 120
MA KAS +DQQ QV +N+HSQIK FC MDEILL EK V N PRRSGL FA
Sbjct: 61 MASKASFADQQMQVQENVHSQIKTFCTFMDEILLPNEKMV--NEHQLQNTVPRRSGLGFA 118
Query: 121 VGPTARP--TDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEA 178
VG P P+TR L++ +SQ LKD +GYT+++ PSQIPH++AGQGLFL G
Sbjct: 119 VGSRNDPPLAFGAASPKTRPLSQAVVSQNLKDQLGYTVNVTPSQIPHKDAGQGLFLDGTV 178
Query: 179 NVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWD 238
+VGAV+A YPGI+YSPAYYRYIPGYP+VD NPYLITRYDG VINAQPWGSG D RELW+
Sbjct: 179 DVGAVVAFYPGIVYSPAYYRYIPGYPKVDTLNPYLITRYDGNVINAQPWGSGGDNRELWN 238
Query: 239 GLTLPEIMPNSKGA--EKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAK 296
G EI P+ KGA EKGS++FWKLLSKP++ +GG+ +E++ERRNPLALAHFANHP K
Sbjct: 239 GRKTGEIKPDVKGAEPEKGSERFWKLLSKPLEGYQGGN-NEVIERRNPLALAHFANHPPK 297
Query: 297 GMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFK---WGSGSGSSTPV 353
G+ PNVMICPYDFPLTE +MR YIPN+ FGNA EVNMRRFGSFWFK SGS+ P
Sbjct: 298 GVQPNVMICPYDFPLTESNMRVYIPNVLFGNA-EVNMRRFGSFWFKSGGGSRNSGSNVPT 356
Query: 354 LKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
LKTL L+ATR + DEE+ LNYRLSN+KRRP WY+PVDEEEDRRRWS
Sbjct: 357 LKTLVLIATRPLQDEELFLNYRLSNTKRRPEWYNPVDEEEDRRRWS 402
>gi|242062410|ref|XP_002452494.1| hypothetical protein SORBIDRAFT_04g026890 [Sorghum bicolor]
gi|241932325|gb|EES05470.1| hypothetical protein SORBIDRAFT_04g026890 [Sorghum bicolor]
Length = 403
Score = 536 bits (1381), Expect = e-150, Method: Compositional matrix adjust.
Identities = 259/404 (64%), Positives = 314/404 (77%), Gaps = 6/404 (1%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
MA LF K QEA++T+AKSP FARDPR LQFEAD+NRLFLYTSY+RLG +AEE DAEEIID
Sbjct: 1 MASLFNKIQEAIRTIAKSPMFARDPRHLQFEADVNRLFLYTSYSRLGENAEEKDAEEIID 60
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFA 120
+A KAS++DQQ+QV +N+H Q++ C SMD IL N P + ++++ RRSGLSFA
Sbjct: 61 LASKASVADQQKQVQENVHYQLRHMCQSMDNILCPDTTNDPSKEPSEAHNHSRRSGLSFA 120
Query: 121 VGPTARPT-DSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEAN 179
VG A S V+P TR LTR ELS++ +D GYTLD+ PS IPH++AGQGLFL G+AN
Sbjct: 121 VGGAASTNKQSAVIPATRPLTRAELSKKFRDHFGYTLDIGPSGIPHKDAGQGLFLSGKAN 180
Query: 180 VGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDG 239
VGAV+AIYPG+IYSPAYYRYIPGYPR+D N YLITRYDGT+I+A+PW G +TRELWDG
Sbjct: 181 VGAVLAIYPGVIYSPAYYRYIPGYPRIDVCNNYLITRYDGTIIDAKPWYFGGETRELWDG 240
Query: 240 LTLPEI--MPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKG 297
L + MP KG+E SD+ W++LSKP++ + E+LERRNPLA HFANHP KG
Sbjct: 241 SDLVDYNAMP-PKGSENNSDRVWRMLSKPLEKSVRENFGEVLERRNPLAFGHFANHPPKG 299
Query: 298 MVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSS--TPVLK 355
PNVMICPYDFPLTEK+MR YIPN++FG E + M+RFGSF+FK G + + VLK
Sbjct: 300 SSPNVMICPYDFPLTEKNMRVYIPNVTFGGEETIKMKRFGSFYFKSGGSDNQAGDSLVLK 359
Query: 356 TLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
TL LV+TR+ICDEE+ LNYR SNSKRRP WYSPVDEEED+RRWS
Sbjct: 360 TLVLVSTRSICDEELFLNYRYSNSKRRPEWYSPVDEEEDKRRWS 403
>gi|357150341|ref|XP_003575426.1| PREDICTED: uncharacterized protein LOC100822461 [Brachypodium
distachyon]
Length = 402
Score = 535 bits (1378), Expect = e-149, Method: Compositional matrix adjust.
Identities = 262/403 (65%), Positives = 306/403 (75%), Gaps = 5/403 (1%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
MAFLF KFQEA++TLAK+P FARDPR LQF+AD+NRLFLYTSY RLG +EE DAEEIID
Sbjct: 1 MAFLFNKFQEAIRTLAKNPKFARDPRHLQFDADVNRLFLYTSYYRLGEHSEEKDAEEIID 60
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFA 120
MA KAS+ DQQ+QV N+H Q+K C +MD IL KN P +S + ++ RRSGLSFA
Sbjct: 61 MASKASVVDQQKQVQGNVHYQLKHMCQAMDSILRPDIKNDPSQSPSDTHNKSRRSGLSFA 120
Query: 121 VGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANV 180
VG + PVVP TR LT+ ELS + +D GYTLD++PS IPH++AGQGLFL GEA V
Sbjct: 121 VGTGVSSANKPVVPATRPLTQAELSNKFRDQFGYTLDIRPSGIPHKDAGQGLFLSGEARV 180
Query: 181 GAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGL 240
GAV+AIYPG+IYSPAYYRYIPGYPR+DA N YLITRYDGTVINA+PW G +T ELWDG
Sbjct: 181 GAVVAIYPGVIYSPAYYRYIPGYPRIDACNSYLITRYDGTVINAKPWHLGGETMELWDGS 240
Query: 241 TLPEIMPN-SKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMV 299
L + S E SD+ W++L+KP+ R + E+LERRNPLA HFANHP G
Sbjct: 241 HLVDYNSTPSTSPESNSDRAWRMLNKPLVKSRNENFGEVLERRNPLAFGHFANHPPGGST 300
Query: 300 PNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSST---PVLKT 356
PNVMICPYDFPLTEKDMR YIPNI+FG E V M+RFGSFWFK GSG T VLKT
Sbjct: 301 PNVMICPYDFPLTEKDMRLYIPNITFGGEEPVTMKRFGSFWFK-SRGSGQQTGESSVLKT 359
Query: 357 LALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
L LV+TR+I DEE+ LNYR S+SK RP WY+PVDEEED+RRWS
Sbjct: 360 LVLVSTRSIRDEEIFLNYRYSSSKARPEWYTPVDEEEDKRRWS 402
>gi|212723622|ref|NP_001131459.1| uncharacterized protein LOC100192794 [Zea mays]
gi|194691586|gb|ACF79877.1| unknown [Zea mays]
gi|413937913|gb|AFW72464.1| hypothetical protein ZEAMMB73_372625 [Zea mays]
Length = 403
Score = 533 bits (1372), Expect = e-149, Method: Compositional matrix adjust.
Identities = 255/403 (63%), Positives = 311/403 (77%), Gaps = 4/403 (0%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
MAFLF KFQEA++T+AKSP FARDPR LQFEAD+NRLFLYTSY RLG +A+E DAEEIID
Sbjct: 1 MAFLFNKFQEAIRTIAKSPMFARDPRHLQFEADVNRLFLYTSYYRLGENADEKDAEEIID 60
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFA 120
+A KAS++DQQ+QV DN+H ++ C +MD IL N P + ++++ +RSGLSFA
Sbjct: 61 LASKASIADQQKQVQDNVHYELTHMCQAMDSILCPDTINDPSKGPSEAHNHSQRSGLSFA 120
Query: 121 VGPTARPT-DSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEAN 179
VG A S ++P TR LTR ELS++ +D GYTLD+ PS IPH++AGQGLFL G+AN
Sbjct: 121 VGGAASANKQSAIIPATRPLTRAELSKKFRDHFGYTLDIGPSGIPHKDAGQGLFLSGQAN 180
Query: 180 VGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDG 239
GAV+AIYPG+IYSPAYYRYIPGYPR+D N YLITRYDGT+I+A+PW G +TR LWDG
Sbjct: 181 AGAVLAIYPGVIYSPAYYRYIPGYPRIDVCNNYLITRYDGTIIDAKPWCFGGETRALWDG 240
Query: 240 LTLPEIMPNS-KGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGM 298
L + + KG+E SD+ W++LSKP++ + E+LERRNPLA HFANHP KG
Sbjct: 241 SDLVDYNAMAPKGSENNSDRVWRMLSKPLEKGVRENFGEVLERRNPLAFGHFANHPPKGS 300
Query: 299 VPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSS--TPVLKT 356
PNVMICPYDFPLTEKDMR YIPNI+FG E + M+RFGSF+FK G + + VLKT
Sbjct: 301 SPNVMICPYDFPLTEKDMRVYIPNITFGGQETIRMKRFGSFYFKSGGSDNQAGDSLVLKT 360
Query: 357 LALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
L LV+TR+ICDEE+ LNYR SNSKRRP WYSPVDEEED+RRWS
Sbjct: 361 LVLVSTRSICDEELFLNYRYSNSKRRPEWYSPVDEEEDKRRWS 403
>gi|326511880|dbj|BAJ92084.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326522819|dbj|BAJ88455.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 401
Score = 530 bits (1365), Expect = e-148, Method: Compositional matrix adjust.
Identities = 256/403 (63%), Positives = 308/403 (76%), Gaps = 6/403 (1%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
MAFLF KFQEA+KTLAKSP FARDPR +QF+AD+NRLFL+TSYNRLG+ +EE D EEIID
Sbjct: 1 MAFLFNKFQEAIKTLAKSPRFARDPRHIQFDADVNRLFLFTSYNRLGKHSEEKDVEEIID 60
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFA 120
+A KAS++DQQ+QV N+H Q+K C +MD IL P+ N P +S + + RRSGLS A
Sbjct: 61 LANKASVTDQQKQVQQNVHYQLKHICKAMDSILR-PDTNDPSQSPSDAYNNSRRSGLSLA 119
Query: 121 VGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANV 180
VG + P VP TR LTR ELS + +D GYTLD+KPS IPH++AGQGLFL GEANV
Sbjct: 120 VGTGVASANKPAVPATRPLTRAELSNKFRDQFGYTLDIKPSLIPHKDAGQGLFLSGEANV 179
Query: 181 GAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGL 240
GAV+AIYPG+IYSPAYYRYIPGYPR+DA N YLITRYDGT+INA+PW G +TRE+WDG
Sbjct: 180 GAVLAIYPGVIYSPAYYRYIPGYPRIDACNSYLITRYDGTIINAKPWHLGGETREVWDGS 239
Query: 241 TLPEIMPN-SKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMV 299
+ K E SD+ W++L+KP+ + ++LERRNPLA HFANHP G
Sbjct: 240 DPVDYSTTPPKSPESNSDRAWRMLNKPLQKSGSENFGDVLERRNPLAFGHFANHPPGGST 299
Query: 300 PNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKW---GSGSGSSTPVLKT 356
PNVMICPYDFPLTEKD+R YIPNI+FG E V M+RFGSFWFK G +G S PVL+T
Sbjct: 300 PNVMICPYDFPLTEKDIRAYIPNIAFGGEEPVTMKRFGSFWFKSRTPGKPTGES-PVLRT 358
Query: 357 LALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
L LV+TR++CDEE+ LNYR S+SK RP WY+PVDEEED+RRWS
Sbjct: 359 LVLVSTRSVCDEELYLNYRYSSSKGRPEWYTPVDEEEDKRRWS 401
>gi|115447467|ref|NP_001047513.1| Os02g0634600 [Oryza sativa Japonica Group]
gi|49388185|dbj|BAD25311.1| unknown protein [Oryza sativa Japonica Group]
gi|113537044|dbj|BAF09427.1| Os02g0634600 [Oryza sativa Japonica Group]
gi|215678770|dbj|BAG95207.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623301|gb|EEE57433.1| hypothetical protein OsJ_07637 [Oryza sativa Japonica Group]
Length = 405
Score = 528 bits (1360), Expect = e-147, Method: Compositional matrix adjust.
Identities = 261/410 (63%), Positives = 309/410 (75%), Gaps = 16/410 (3%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFA------RDPRQLQFEADMNRLFLYTSYNRLGRDAEEAD 54
M LF +FQEAVKTLAK+P FA RDPR LQFEAD+NRLFLYTSY RLG +AEE D
Sbjct: 1 MGSLFIRFQEAVKTLAKNPMFANNRLFARDPRHLQFEADVNRLFLYTSYYRLGANAEEKD 60
Query: 55 AEEIIDMAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRR 114
AEEIIDMA KAS+S+QQ+QV +N+H Q+ C +MD ILL KN ++++N PRR
Sbjct: 61 AEEIIDMASKASVSEQQKQVQENVHYQLTNMCQAMDSILLPDTKN----GASEANNYPRR 116
Query: 115 SGLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFL 174
SGLSFAVG + P VP TR L R ELS + +D YTLD++PSQIPH++AGQGLFL
Sbjct: 117 SGLSFAVGTGVASANKPDVPSTRPLNRAELSNKFRDHFQYTLDIRPSQIPHKDAGQGLFL 176
Query: 175 CGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTR 234
GE N GAV+AIYPG++YSPAYYRYIPGYP++DA N YLITRYDGT+I+A+PW G D+R
Sbjct: 177 SGETNAGAVLAIYPGVVYSPAYYRYIPGYPKIDACNNYLITRYDGTIIDAKPWQLGGDSR 236
Query: 235 ELWDGLTLPEI--MPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFAN 292
E+WDG L + +P SK E SD+ W++LSKP+ + E+LERRNPLA HFAN
Sbjct: 237 EIWDGSDLVDYNAVP-SKSQESNSDRAWRMLSKPLKKGHTENFGEVLERRNPLAFGHFAN 295
Query: 293 HPAKGMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEE-VNMRRFGSFWFKWGSGSG--S 349
HP KG PNVMICPYDFPLTEKDMR YIPNI+FG EE V M+RFGSFWFK G
Sbjct: 296 HPPKGSTPNVMICPYDFPLTEKDMRVYIPNITFGGEEEPVTMKRFGSFWFKSGRSGNQEG 355
Query: 350 STPVLKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
+PVLKTL LV+TR+ICDEE+ LNYR SNSK+RP WY PVDEEED+RRWS
Sbjct: 356 ESPVLKTLVLVSTRSICDEELFLNYRYSNSKKRPEWYIPVDEEEDKRRWS 405
>gi|147812456|emb|CAN77337.1| hypothetical protein VITISV_003407 [Vitis vinifera]
Length = 345
Score = 520 bits (1338), Expect = e-145, Method: Compositional matrix adjust.
Identities = 249/345 (72%), Positives = 286/345 (82%), Gaps = 6/345 (1%)
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVP--IESSAQSNAAPRRSGLS 118
MAGKASL+DQQ+QV +NIHSQIK FC ++D +LL KN+ ES Q+ A PRRSGLS
Sbjct: 1 MAGKASLADQQKQVQENIHSQIKTFCNTLDXVLLPDSKNMNELPESCPQTKAXPRRSGLS 60
Query: 119 FAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEA 178
FAVG P D+ VPQT+ L R ELSQ LKD IGYTLDLKPSQIPH++AGQGLFL GEA
Sbjct: 61 FAVGRDGPPXDNSAVPQTKPLKRAELSQSLKDLIGYTLDLKPSQIPHKDAGQGLFLDGEA 120
Query: 179 NVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWD 238
+VGAV+A+YPG+IY PAYY+YIPGYPRVDA NPYLITRYDGTVINAQPWG G +TRELWD
Sbjct: 121 DVGAVVALYPGVIYFPAYYQYIPGYPRVDAHNPYLITRYDGTVINAQPWGVGGETRELWD 180
Query: 239 GLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGM 298
G ++ E+ N +GAEKGSD+ WKLLSKPM++K GSG ++LERRNPLA AHFANHP+KGM
Sbjct: 181 GFSVTEVSANMQGAEKGSDRIWKLLSKPMESKHRGSGCDVLERRNPLAFAHFANHPSKGM 240
Query: 299 VPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSS----TPVL 354
PNVM+CPYDFPLTE+DMR YIPN+ FG++ EV M+R GSFWFK GS + S PVL
Sbjct: 241 SPNVMVCPYDFPLTEQDMRAYIPNVLFGSSXEVKMKRXGSFWFKSGSSNKSELDAHVPVL 300
Query: 355 KTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
KTL LVATRA+CDEEVLLNYRLSNSKRRP WY+PVDEEEDRRRWS
Sbjct: 301 KTLVLVATRALCDEEVLLNYRLSNSKRRPAWYTPVDEEEDRRRWS 345
>gi|195645104|gb|ACG42020.1| hypothetical protein [Zea mays]
gi|413937914|gb|AFW72465.1| hypothetical protein ZEAMMB73_372625 [Zea mays]
Length = 384
Score = 499 bits (1286), Expect = e-139, Method: Compositional matrix adjust.
Identities = 240/383 (62%), Positives = 293/383 (76%), Gaps = 4/383 (1%)
Query: 21 FARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIIDMAGKASLSDQQQQVLDNIHS 80
FARDPR LQFEAD+NRLFLYTSY RLG +A+E DAEEIID+A KAS++DQQ+QV DN+H
Sbjct: 2 FARDPRHLQFEADVNRLFLYTSYYRLGENADEKDAEEIIDLASKASIADQQKQVQDNVHY 61
Query: 81 QIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFAVGPTARPT-DSPVVPQTRQL 139
++ C +MD IL N P + ++++ +RSGLSFAVG A S ++P TR L
Sbjct: 62 ELTHMCQAMDSILCPDTINDPSKGPSEAHNHSQRSGLSFAVGGAASANKQSAIIPATRPL 121
Query: 140 TRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRY 199
TR ELS++ +D GYTLD+ PS IPH++AGQGLFL G+AN GAV+AIYPG+IYSPAYYRY
Sbjct: 122 TRAELSKKFRDHFGYTLDIGPSGIPHKDAGQGLFLSGQANAGAVLAIYPGVIYSPAYYRY 181
Query: 200 IPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNS-KGAEKGSDQ 258
IPGYPR+D N YLITRYDGT+I+A+PW G +TR LWDG L + + KG+E SD+
Sbjct: 182 IPGYPRIDVCNNYLITRYDGTIIDAKPWCFGGETRALWDGSDLVDYNAMAPKGSENNSDR 241
Query: 259 FWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLTEKDMRP 318
W++LSKP++ + E+LERRNPLA HFANHP KG PNVMICPYDFPLTEKDMR
Sbjct: 242 VWRMLSKPLEKGVRENFGEVLERRNPLAFGHFANHPPKGSSPNVMICPYDFPLTEKDMRV 301
Query: 319 YIPNISFGNAEEVNMRRFGSFWFKWGSGSGSS--TPVLKTLALVATRAICDEEVLLNYRL 376
YIPNI+FG E + M+RFGSF+FK G + + VLKTL LV+TR+ICDEE+ LNYR
Sbjct: 302 YIPNITFGGQETIRMKRFGSFYFKSGGSDNQAGDSLVLKTLVLVSTRSICDEELFLNYRY 361
Query: 377 SNSKRRPVWYSPVDEEEDRRRWS 399
SNSKRRP WYSPVDEEED+RRWS
Sbjct: 362 SNSKRRPEWYSPVDEEEDKRRWS 384
>gi|102139939|gb|ABF70082.1| hypothetical protein MA4_54B05.28 [Musa acuminata]
Length = 592
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 235/364 (64%), Positives = 279/364 (76%), Gaps = 6/364 (1%)
Query: 38 FLYTSYNRLGRDAEEADAEEIIDMAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPE 97
+ +SYNRLG AEE DA+EII+MA KAS+ DQQ+QV +NIH QIK C MDEIL +
Sbjct: 205 YAKSSYNRLGNSAEENDADEIIEMANKASVKDQQKQVQENIHFQIKNMCKIMDEILRVDN 264
Query: 98 KNVPIESSA---QSNAAPRRSGLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGY 154
KN+P S+ N RRSGLSFA+G + PVVP T+ LTR+ELS L D +GY
Sbjct: 265 KNIPDPPSSIPCSQNGPRRRSGLSFAIGKGTPSANEPVVPTTQPLTRSELSHCLMDRLGY 324
Query: 155 TLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLI 214
D+KPSQIPHEEAGQGLFL GEA+VGA+IA YPG+IYSPAYYRYIPGYPRVDA N YLI
Sbjct: 325 APDIKPSQIPHEEAGQGLFLNGEADVGAIIAFYPGMIYSPAYYRYIPGYPRVDASNSYLI 384
Query: 215 TRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPN-SKGAEKGSDQFWKLLSKPMDNKRGG 273
TRYDG VINAQPWGSG +TRELWDG P+ P+ S+ +GSD+ W++LSKP++ G
Sbjct: 385 TRYDGNVINAQPWGSGGETRELWDGFYHPQYNPDTSEETGRGSDRMWRMLSKPLEASNRG 444
Query: 274 SGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNM 333
+ E+LERRNPLA HFANHP KG+ PNVM+CPYDFPLTEKDMR YIPN+ +G+ E + M
Sbjct: 445 TIGEVLERRNPLAFGHFANHPPKGIAPNVMVCPYDFPLTEKDMRVYIPNLIYGDEESITM 504
Query: 334 RRFGSFWFKWGSG--SGSSTPVLKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDE 391
RRFG+FWF+ GS +PVLKTL LVATRA+C+EEV LNYRLSN KRRP WY+PVDE
Sbjct: 505 RRFGTFWFRLGSSVDPDGDSPVLKTLVLVATRALCNEEVFLNYRLSNQKRRPAWYTPVDE 564
Query: 392 EEDR 395
EED+
Sbjct: 565 EEDK 568
>gi|297806861|ref|XP_002871314.1| hypothetical protein ARALYDRAFT_487659 [Arabidopsis lyrata subsp.
lyrata]
gi|297317151|gb|EFH47573.1| hypothetical protein ARALYDRAFT_487659 [Arabidopsis lyrata subsp.
lyrata]
Length = 395
Score = 468 bits (1205), Expect = e-129, Method: Compositional matrix adjust.
Identities = 239/400 (59%), Positives = 300/400 (75%), Gaps = 11/400 (2%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
M FLF KFQ+ V LAKS TFA+ PRQLQFEAD+N+LF++TSYNRLGR+AEEADAEEII+
Sbjct: 6 MMFLFSKFQQGVGILAKSTTFAKSPRQLQFEADINKLFMFTSYNRLGRNAEEADAEEIIE 65
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFA 120
+AGKA+L +QQ+QV +NIH Q++ FC SMDEILL E + + + R + +S A
Sbjct: 66 IAGKATLFEQQKQVQENIHYQVQNFCSSMDEILLTDIDKTKEELNKPGSESRRDNNVSAA 125
Query: 121 VGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANV 180
D P+VP+T+ L E+S RLKD IGYTL +KPS IPH++AGQG F+ GEA+V
Sbjct: 126 --------DKPIVPETKPLKLAEVSHRLKDRIGYTLQIKPSLIPHKDAGQGCFIEGEADV 177
Query: 181 GAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDG- 239
GAV+A YPG+IYSPA+YRYIPGYP VDAQN YLIT YDGTVINAQPWG G ++RE+W+G
Sbjct: 178 GAVLAFYPGVIYSPAFYRYIPGYPNVDAQNSYLITGYDGTVINAQPWGRGGESREVWNGS 237
Query: 240 LTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMV 299
T PEI ++ AE GS + WK+LS+P+ + GS +LE RNPLA HF NHP K M
Sbjct: 238 FTTPEIRTDTTTAENGSSKVWKMLSRPLGDL--GSVEAVLEMRNPLAFGHFINHPGKDME 295
Query: 300 PNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLAL 359
PNVM+CPYDFPL E +MR YIPN++FGN ++ M RFGSF + G+ S VLKTL L
Sbjct: 296 PNVMMCPYDFPLPETEMRAYIPNVAFGNTGDIKMGRFGSFLSRTGNNSSLDARVLKTLVL 355
Query: 360 VATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
VAT+A+C+EE++LN RLSNS++R WY P+DE+EDRR WS
Sbjct: 356 VATKALCNEELMLNDRLSNSEKRLEWYIPLDEQEDRREWS 395
>gi|224083906|ref|XP_002307168.1| predicted protein [Populus trichocarpa]
gi|222856617|gb|EEE94164.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 230/383 (60%), Positives = 288/383 (75%), Gaps = 19/383 (4%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
M LF KFQE VKTLAKSPT + D+ RLF+ TS NRLGR+A++ADA++II+
Sbjct: 1 MTSLFHKFQEVVKTLAKSPTDGTSTK------DLKRLFILTSCNRLGRNADKADAKDIIE 54
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILL--LPEKNVPIESSAQSNAAPRRSGLS 118
MA K +++QQ+Q+ +NIH QIK FC +M++ILL +++ E+ AQSNA
Sbjct: 55 MANKFYVAEQQKQIQENIHFQIKNFCTAMNDILLPDARKRSRKDEAPAQSNA-------- 106
Query: 119 FAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEA 178
F+VG + PTD +P+ R+L+ E+SQ LKD IGYTLD+KPS +PHE+AGQGL + GEA
Sbjct: 107 FSVGKSGPPTDRHAIPEMRRLSHAEVSQSLKDHIGYTLDVKPSLMPHEKAGQGLLINGEA 166
Query: 179 NVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWD 238
+ GAVIAIYPGII+SPAYY+YIPGYPR+ A+N YLITR+DGTVINAQPWG G + RE+WD
Sbjct: 167 DAGAVIAIYPGIIHSPAYYQYIPGYPRIHARNRYLITRHDGTVINAQPWGFGGEIREVWD 226
Query: 239 GLTLPEIMPNSKGAEK-GSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKG 297
G T+PEIMPN + K G + W++LSKP++ GS ++LERRNPLALAHFAN A G
Sbjct: 227 GFTVPEIMPNVQSTGKDGPELVWRVLSKPLEGTGVGSIGDVLERRNPLALAHFANQVAVG 286
Query: 298 MVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTL 357
M NVM CPYDFPLTE DMR YIPN+SF N EVN+RR+GSFWF S S S+ PVLKTL
Sbjct: 287 MHQNVMACPYDFPLTEMDMRTYIPNVSFANP-EVNIRRYGSFWFSSKS-SESNVPVLKTL 344
Query: 358 ALVATRAICDEEVLLNYRLSNSK 380
LVATRA+CDEEVL++YRL+N K
Sbjct: 345 VLVATRALCDEEVLVDYRLTNMK 367
>gi|357473943|ref|XP_003607256.1| hypothetical protein MTR_4g075120 [Medicago truncatula]
gi|355508311|gb|AES89453.1| hypothetical protein MTR_4g075120 [Medicago truncatula]
Length = 371
Score = 449 bits (1156), Expect = e-124, Method: Compositional matrix adjust.
Identities = 240/401 (59%), Positives = 291/401 (72%), Gaps = 42/401 (10%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFA--RDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEI 58
MAFLFQKFQEAVKTLAKSPTFA RDPRQLQ++ D+NRLFLYTSYNRLG++A EADAEEI
Sbjct: 1 MAFLFQKFQEAVKTLAKSPTFAFARDPRQLQYQIDLNRLFLYTSYNRLGKNASEADAEEI 60
Query: 59 IDMAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEK---NVPIESSAQSNAAPRRS 115
I++A KAS++DQQ V +N+H+QIK FC MD + L EK +V E S Q+ P+ S
Sbjct: 61 IEIATKASVADQQMVVQENVHAQIKAFCTFMDAVFLPNEKKVNDVSFELSQQTKILPQHS 120
Query: 116 GLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC 175
L A G VP+ R +++ ELSQ+LKD +GYTL++KPSQI H++AGQGLFL
Sbjct: 121 DLRSANG----------VPKQRLISQVELSQKLKDELGYTLNVKPSQISHKDAGQGLFLD 170
Query: 176 GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRE 235
G +VGAV+A YPG++YSPAYY +IPGY +D QNPYLITR+DG VI+AQ WG G D +E
Sbjct: 171 GVVDVGAVVAFYPGVVYSPAYYHHIPGY--LDEQNPYLITRHDGNVIDAQLWGRGGDKKE 228
Query: 236 LWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPA 295
LW+G + + EKGS +DN ++LERRNPLALAHFANHP+
Sbjct: 229 LWNGRKMVD--------EKGSQ---------VDN-----SDDVLERRNPLALAHFANHPS 266
Query: 296 KGMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPV-- 353
KGM+PNVMICPYDFPL E DMR YIPN+ FGNA E N RFGSFWFK + + V
Sbjct: 267 KGMLPNVMICPYDFPLIENDMRAYIPNVLFGNAAEENTERFGSFWFKSRVPRNNESHVPT 326
Query: 354 -LKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEE 393
LKT+ LVATRA+ DEE+LLNYRL N+KR P WY+PVDEEE
Sbjct: 327 TLKTVVLVATRALQDEELLLNYRLGNTKRCPEWYAPVDEEE 367
>gi|15241572|ref|NP_196444.1| uncharacterized protein [Arabidopsis thaliana]
gi|332003893|gb|AED91276.1| uncharacterized protein [Arabidopsis thaliana]
Length = 386
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 232/398 (58%), Positives = 290/398 (72%), Gaps = 24/398 (6%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
M FLF KFQ+ V LAKS TFA++PRQLQFEAD+N+LF++TSYNRLGRDAEEADAEEII+
Sbjct: 6 MMFLFGKFQQGVGVLAKSTTFAKNPRQLQFEADINKLFMFTSYNRLGRDAEEADAEEIIE 65
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLL-----PEKNVPIESSAQSNAAPRRS 115
MAGKA+ S+QQ+QV +NIH Q++ FC SM+EILL + N P S + N P
Sbjct: 66 MAGKATFSEQQKQVQENIHYQLQNFCSSMNEILLTNIDKTKDLNEPGSESRRDNIVP--- 122
Query: 116 GLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC 175
D P+VP+T+ L E+S RLKD +GYTL++KPS IPH++AGQG F+
Sbjct: 123 -----------AADKPIVPETKPLKLDEVSHRLKDRVGYTLEIKPSLIPHKDAGQGCFIE 171
Query: 176 GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRE 235
GEA+VGAV+A YPG+IYSPA++RYIPGYP VDAQN YLIT YDGTVINAQPWG G +RE
Sbjct: 172 GEADVGAVLAFYPGVIYSPAFHRYIPGYPNVDAQNSYLITGYDGTVINAQPWGRGGISRE 231
Query: 236 LWDG-LTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSE-MLERRNPLALAHFANH 293
+W+G T PEI + E SD+ WK+LS+P++ G G E ++E RNPLA HF NH
Sbjct: 232 VWNGSFTTPEIRTEATTTENSSDKVWKMLSRPLE---GSGGVEAVIEMRNPLAFGHFINH 288
Query: 294 PAKGMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPV 353
P K M NVMICPYDF L+E +MR YIPN++FGN +V M RF SF + GS S V
Sbjct: 289 PGKEMESNVMICPYDFLLSETEMRAYIPNVAFGNTGDVKMGRFESFLSRTGSNSSLDAHV 348
Query: 354 LKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDE 391
LKT+ LVAT+A+C+EE++LNYRLSNS++R WY P+DE
Sbjct: 349 LKTIVLVATKALCNEELMLNYRLSNSEKRLEWYIPLDE 386
>gi|26453064|dbj|BAC43608.1| unknown protein [Arabidopsis thaliana]
Length = 386
Score = 447 bits (1149), Expect = e-123, Method: Compositional matrix adjust.
Identities = 231/398 (58%), Positives = 290/398 (72%), Gaps = 24/398 (6%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
M FLF KFQ+ V LAKS TFA++PRQLQFEAD+N+LF++TSYNRLGRDAEEADAEEII+
Sbjct: 6 MMFLFGKFQQGVGVLAKSTTFAKNPRQLQFEADINKLFMFTSYNRLGRDAEEADAEEIIE 65
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLL-----PEKNVPIESSAQSNAAPRRS 115
MAGKA+ S+QQ+QV +NIH Q++ FC SM+EILL + N P S + N P
Sbjct: 66 MAGKATFSEQQKQVQENIHYQLQNFCSSMNEILLTNIDKTKDLNEPGSESRRDNIVP--- 122
Query: 116 GLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC 175
D P+VP+T+ L E+S RLKD +GYTL++KPS IPH++AGQG F+
Sbjct: 123 -----------AADKPIVPETKPLKLDEVSHRLKDRVGYTLEIKPSLIPHKDAGQGCFIE 171
Query: 176 GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRE 235
GEA+VGAV+A YPG+IY+PA++RYIPGYP VDAQN YLIT YDGTVINAQPWG G +RE
Sbjct: 172 GEADVGAVLAFYPGVIYTPAFHRYIPGYPNVDAQNSYLITGYDGTVINAQPWGRGGISRE 231
Query: 236 LWDG-LTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSE-MLERRNPLALAHFANH 293
+W+G T PEI + E SD+ WK+LS+P++ G G E ++E RNPLA HF NH
Sbjct: 232 VWNGSFTTPEIRTEATTTENSSDKVWKMLSRPLE---GSGGVEAVIEMRNPLAFGHFINH 288
Query: 294 PAKGMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPV 353
P K M NVMICPYDF L+E +MR YIPN++FGN +V M RF SF + GS S V
Sbjct: 289 PGKEMESNVMICPYDFLLSETEMRAYIPNVAFGNTGDVKMGRFESFLSRTGSNSSLDAHV 348
Query: 354 LKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDE 391
LKT+ LVAT+A+C+EE++LNYRLSNS++R WY P+DE
Sbjct: 349 LKTIVLVATKALCNEELMLNYRLSNSEKRLEWYIPLDE 386
>gi|218191231|gb|EEC73658.1| hypothetical protein OsI_08190 [Oryza sativa Indica Group]
Length = 352
Score = 432 bits (1112), Expect = e-119, Method: Compositional matrix adjust.
Identities = 228/410 (55%), Positives = 269/410 (65%), Gaps = 69/410 (16%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFA------RDPRQLQFEADMNRLFLYTSYNRLGRDAEEAD 54
M LF +FQEAVKTLAK+P FA RDPR LQFEAD+NRLFLYTSY RLG +AEE D
Sbjct: 1 MGSLFIRFQEAVKTLAKNPMFANNRLFARDPRHLQFEADVNRLFLYTSYYRLGANAEEKD 60
Query: 55 AEEIIDMAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRR 114
AEEIIDMA KAS+S+QQ+QV +N+H Q+ C +MD ILL KN ++++N PRR
Sbjct: 61 AEEIIDMASKASVSEQQKQVQENVHYQLTNMCQAMDSILLPDTKN----GASEANNYPRR 116
Query: 115 SGLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFL 174
SGLSFAVG + P
Sbjct: 117 SGLSFAVGTGVASANKP------------------------------------------- 133
Query: 175 CGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTR 234
AIYPG++YSPAYYRYIPGYP++DA N YLITRYDGT+I+A+PW G D+R
Sbjct: 134 ----------AIYPGVVYSPAYYRYIPGYPKIDACNNYLITRYDGTIIDAKPWQLGGDSR 183
Query: 235 ELWDGLTLPEI--MPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFAN 292
E+WDG L + +P SK E SD+ W++LSKP+ + E+LERRNPLA HFAN
Sbjct: 184 EIWDGSDLVDYNAVP-SKSQESNSDRAWRMLSKPLKKGHTENFGEVLERRNPLAFGHFAN 242
Query: 293 HPAKGMVPNVMICPYDFPLTEKDMRPYIPNISFGNAEE-VNMRRFGSFWFKWGSGSG--S 349
HP KG PNVMICPYDFPLTEKDMR YIPNI+FG EE V M+RFGSFWFK G
Sbjct: 243 HPPKGSTPNVMICPYDFPLTEKDMRVYIPNITFGGEEEPVTMKRFGSFWFKSGRSGNQVG 302
Query: 350 STPVLKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
+PVLKTL LV+TR+ICDEE+ LNYR SNSK+RP WY PVDEEED+RRWS
Sbjct: 303 ESPVLKTLVLVSTRSICDEELFLNYRYSNSKKRPEWYIPVDEEEDKRRWS 352
>gi|449531573|ref|XP_004172760.1| PREDICTED: uncharacterized protein LOC101224517, partial [Cucumis
sativus]
Length = 270
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 196/270 (72%), Positives = 230/270 (85%), Gaps = 4/270 (1%)
Query: 132 VVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGII 191
+P TR L R+ELSQ+LKD IGYTLD+KPS+IPH++AGQGLF+ GEA+VG++IAIYPG++
Sbjct: 3 TIPTTRPLGRSELSQQLKDEIGYTLDIKPSRIPHKDAGQGLFIDGEADVGSIIAIYPGVV 62
Query: 192 YSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKG 251
YSPA+Y+YIPGYPRVDAQNPYLITRYDGTVINAQPWG G DTRE+W GLT+P P +G
Sbjct: 63 YSPAHYQYIPGYPRVDAQNPYLITRYDGTVINAQPWGLGADTREVWHGLTVPNSNPTKQG 122
Query: 252 AEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPL 311
EK SD+ W++LSKP++ KR G + +ERRNPLA AHFANHPAK MVPNVM+CPYDFPL
Sbjct: 123 DEK-SDRLWRMLSKPLEAKRVQHGGDAIERRNPLAFAHFANHPAKDMVPNVMLCPYDFPL 181
Query: 312 TEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSG--SGSSTPVLKTLALVATRAICDEE 369
TEKDMR YIPN+ N EEV M+RFGSFWFK G +GS TP+LKT+ LVATRA+C+EE
Sbjct: 182 TEKDMRVYIPNVPVAN-EEVGMKRFGSFWFKSGRSRINGSDTPILKTIVLVATRALCNEE 240
Query: 370 VLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
VLLNYRLSNSKRRP WY+PVDE EDRRRWS
Sbjct: 241 VLLNYRLSNSKRRPSWYTPVDEGEDRRRWS 270
>gi|359806342|ref|NP_001241484.1| uncharacterized protein LOC100782051 [Glycine max]
gi|255642207|gb|ACU21368.1| unknown [Glycine max]
Length = 322
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 198/328 (60%), Positives = 247/328 (75%), Gaps = 23/328 (7%)
Query: 73 QVLDNIHSQIKRFCLSMDEILLLPEKNV--PIESSAQSNAAPRRSGLSFAVGPTARPTDS 130
QV +N+HSQIK FC MDE+LL EK V P+E S Q N++P SGL+ A
Sbjct: 2 QVKENVHSQIKAFCTYMDEMLLPNEKRVNDPMELSQQENSSPHHSGLNSA---------- 51
Query: 131 PVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGI 190
VV + R T+ E+SQ+LKD +GY L++KPSQI H+++GQGLFL G A+VGAVIA YPG+
Sbjct: 52 KVVSKQRSYTQAEVSQKLKDQLGYKLNVKPSQISHKDSGQGLFLDGAADVGAVIAFYPGV 111
Query: 191 IYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSK 250
+YSPAYY+YIPGY +D QNPYLITRYDGTVI+AQPWG G + E ++G + E +++
Sbjct: 112 VYSPAYYQYIPGY--LDEQNPYLITRYDGTVIDAQPWGCGGEGPERFNGRKMLERKLDNE 169
Query: 251 GAEKGSDQFWKLLSKPMDNKRGGSGSEM--LERRNPLALAHFANHPAKGMVPNVMICPYD 308
GAEKGSD+ +SKP++ + + +E +ERRNPLALAHFANHPAKGM+PNVMICPYD
Sbjct: 170 GAEKGSDR----MSKPLEGSQVENDNENDDVERRNPLALAHFANHPAKGMLPNVMICPYD 225
Query: 309 FPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPV--LKTLALVATRAIC 366
FPLTE +MR YIPNI FGNA EVN++R GSFWFK G S + V LKT+ LVATR++
Sbjct: 226 FPLTENNMRVYIPNILFGNA-EVNIKRLGSFWFKSGVSRNSQSDVLTLKTVVLVATRSLQ 284
Query: 367 DEEVLLNYRLSNSKRRPVWYSPVDEEED 394
DEE+LLNYRLSNSK+ P WY+PVDEEED
Sbjct: 285 DEELLLNYRLSNSKKWPEWYAPVDEEED 312
>gi|8346564|emb|CAB93728.1| hypothetical protein (fragment at BAC end) [Arabidopsis thaliana]
Length = 325
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 197/335 (58%), Positives = 244/335 (72%), Gaps = 24/335 (7%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
M FLF KFQ+ V LAKS TFA++PRQLQFEAD+N+LF++TSYNRLGRDAEEADAEEII+
Sbjct: 6 MMFLFGKFQQGVGVLAKSTTFAKNPRQLQFEADINKLFMFTSYNRLGRDAEEADAEEIIE 65
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLL-----PEKNVPIESSAQSNAAPRRS 115
MAGKA+ S+QQ+QV +NIH Q++ FC SM+EILL + N P S + N P
Sbjct: 66 MAGKATFSEQQKQVQENIHYQLQNFCSSMNEILLTNIDKTKDLNEPGSESRRDNIVP--- 122
Query: 116 GLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC 175
D P+VP+T+ L E+S RLKD +GYTL++KPS IPH++AGQG F+
Sbjct: 123 -----------AADKPIVPETKPLKLDEVSHRLKDRVGYTLEIKPSLIPHKDAGQGCFIE 171
Query: 176 GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRE 235
GEA+VGAV+A YPG+IYSPA++RYIPGYP VDAQN YLIT YDGTVINAQPWG G +RE
Sbjct: 172 GEADVGAVLAFYPGVIYSPAFHRYIPGYPNVDAQNSYLITGYDGTVINAQPWGRGGISRE 231
Query: 236 LWDG-LTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSE-MLERRNPLALAHFANH 293
+W+G T PEI + E SD+ WK+LS+P++ G G E ++E RNPLA HF NH
Sbjct: 232 VWNGSFTTPEIRTEATTTENSSDKVWKMLSRPLE---GSGGVEAVIEMRNPLAFGHFINH 288
Query: 294 PAKGMVPNVMICPYDFPLTEKDMRPYIPNISFGNA 328
P K M NVMICPYDF L+E +MR YIPN++FGN
Sbjct: 289 PGKEMESNVMICPYDFLLSETEMRAYIPNVAFGNT 323
>gi|296085325|emb|CBI29057.3| unnamed protein product [Vitis vinifera]
Length = 383
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 168/238 (70%), Positives = 191/238 (80%), Gaps = 7/238 (2%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIID 60
MAFLFQKFQEAVKTLAKSP FARDPR LQFEAD+NRLFLYTSYNRLGR+A+EADAE IID
Sbjct: 83 MAFLFQKFQEAVKTLAKSPVFARDPRHLQFEADINRLFLYTSYNRLGRNADEADAENIID 142
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPI--ESSAQSNAAPRRSGLS 118
MAGKASL+DQQ+QV +NIHSQIK FC ++D +LL KN+ ES Q+ A PRRSGLS
Sbjct: 143 MAGKASLADQQKQVQENIHSQIKTFCNTLDGVLLPDSKNMNELPESCPQTKATPRRSGLS 202
Query: 119 FAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEA 178
FAVG P+D+ VPQT+ L R ELSQ LKD IGYTLDLKPSQIPH++AGQGLFL GEA
Sbjct: 203 FAVGRDGPPSDNSAVPQTKPLKRAELSQSLKDLIGYTLDLKPSQIPHKDAGQGLFLDGEA 262
Query: 179 NVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITR-----YDGTVINAQPWGSGW 231
+VGAV+A+YPG+IY PAYY+YIPGYPRVDA NPYLIT G + + GS W
Sbjct: 263 DVGAVVALYPGVIYFPAYYQYIPGYPRVDAHNPYLITSKPMEAKHGGSVKMKRLGSFW 320
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 56/73 (76%), Positives = 60/73 (82%), Gaps = 4/73 (5%)
Query: 331 VNMRRFGSFWFKWGSGSGS----STPVLKTLALVATRAICDEEVLLNYRLSNSKRRPVWY 386
V M+R GSFWFK GS + S PVLKTL LVATRA+CDEEVLLNYRLSNSKRRP WY
Sbjct: 311 VKMKRLGSFWFKSGSSNKSELDAHVPVLKTLVLVATRALCDEEVLLNYRLSNSKRRPAWY 370
Query: 387 SPVDEEEDRRRWS 399
+PVDEEEDRRRWS
Sbjct: 371 TPVDEEEDRRRWS 383
>gi|302763379|ref|XP_002965111.1| hypothetical protein SELMODRAFT_167135 [Selaginella moellendorffii]
gi|300167344|gb|EFJ33949.1| hypothetical protein SELMODRAFT_167135 [Selaginella moellendorffii]
Length = 399
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 176/393 (44%), Positives = 235/393 (59%), Gaps = 19/393 (4%)
Query: 16 AKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIIDMAGKASLSDQQQQVL 75
AKS D R++Q E D++RLFL TSY RL A + D + I + DQ+ QV
Sbjct: 10 AKSKVVVPDYRKVQKEIDISRLFLLTSYRRLDSSATDEDLDRIEQLGRTVPFPDQEHQVQ 69
Query: 76 DNIHSQIKRFCLSMDEILLLP-EKNVPIESSAQSNAAPRRSGLSFAVGPTARPTDSPVV- 133
+NIH Q++ ++D I L E+ +E + R SGL+FAV + + +
Sbjct: 70 ENIHDQVRNISRALDTIFLAGCERKKSLEEGFK-----RPSGLAFAVTKSKPDIEVQKLL 124
Query: 134 --PQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGII 191
P T L + ++ + LK+ +G++L LK S++P AGQGLFL G A G V+A+YPG+I
Sbjct: 125 GPPATASLGKYDIDEALKNQLGFSLHLKDSEVPGHNAGQGLFLEGIARTGTVVALYPGVI 184
Query: 192 YSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKG 251
YSPA Y YIPGYP++D NPYL++RY+G VI+ + WG G +RE WDG PE
Sbjct: 185 YSPAKYWYIPGYPKLD--NPYLVSRYEGYVIDGKSWGKGGTSREFWDGYDEPEEKEEDLE 242
Query: 252 AEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPL 311
K ++ K + G + S+++ERRNPLALAHF NHP KG +PNVMICPYDFP
Sbjct: 243 DRKEEQGEEAMVDKASE---GEAASQVIERRNPLALAHFVNHPPKGQLPNVMICPYDFPE 299
Query: 312 TEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWG-----SGSGSSTPVLKTLALVATRAIC 366
E+DMRPYIPN+ F + NM R G W K S L+ L LV TR +C
Sbjct: 300 HEEDMRPYIPNLRFEDDTVKNMERRGKSWSKVPLPEVLSAKDRDPTYLRGLVLVTTRNVC 359
Query: 367 DEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
+EE+LLNYRLSN K +P WY PVDE E+RRRW+
Sbjct: 360 NEELLLNYRLSNPKTKPEWYHPVDEAEERRRWA 392
>gi|302757549|ref|XP_002962198.1| hypothetical protein SELMODRAFT_77390 [Selaginella moellendorffii]
gi|300170857|gb|EFJ37458.1| hypothetical protein SELMODRAFT_77390 [Selaginella moellendorffii]
Length = 407
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 176/393 (44%), Positives = 235/393 (59%), Gaps = 19/393 (4%)
Query: 16 AKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIIDMAGKASLSDQQQQVL 75
AKS D R++Q E D++RLFL TSY RL A + D + I + DQ+ QV
Sbjct: 10 AKSKIVVPDYRKVQKEIDISRLFLLTSYRRLDSSATDEDLDRIEQLGRTVPFPDQEHQVQ 69
Query: 76 DNIHSQIKRFCLSMDEILLLP-EKNVPIESSAQSNAAPRRSGLSFAVGPTARPTDSPVV- 133
+NIH Q++ ++D I L E+ +E + R SGL+FAV + + +
Sbjct: 70 ENIHDQVRNISRALDTIFLAGCERKKSLEEGFK-----RPSGLAFAVTKSKPDIEVQKLL 124
Query: 134 --PQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGII 191
P T L + ++ + LK+ +G++L LK S++P AGQGLFL G A G V+A+YPG+I
Sbjct: 125 GPPATASLGKYDIDEALKNQLGFSLHLKDSEVPGHNAGQGLFLEGIARTGTVVALYPGVI 184
Query: 192 YSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKG 251
YSPA Y YIPGYP++D NPYL++RY+G VI+ + WG G +RE WDG PE
Sbjct: 185 YSPAKYWYIPGYPKLD--NPYLVSRYEGYVIDGKSWGKGGTSREFWDGYDEPEEKEEDLE 242
Query: 252 AEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPL 311
K ++ K + G + S+++ERRNPLALAHF NHP KG +PNVMICPYDFP
Sbjct: 243 DRKEEQGEEAMVDKASE---GEAASQVIERRNPLALAHFVNHPPKGQLPNVMICPYDFPE 299
Query: 312 TEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWG-----SGSGSSTPVLKTLALVATRAIC 366
E+DMRPYIPN+ F + NM R G W K S L+ L LV TR +C
Sbjct: 300 HEEDMRPYIPNLRFEDDTVKNMERRGKSWSKVPLPEVLSAKDRDPTYLRGLVLVTTRNVC 359
Query: 367 DEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
+EE+LLNYRLSN K +P WY PVDE E+RRRW+
Sbjct: 360 NEELLLNYRLSNPKTKPEWYHPVDEAEERRRWA 392
>gi|168018707|ref|XP_001761887.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686942|gb|EDQ73328.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 412
Score = 298 bits (762), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 171/403 (42%), Positives = 236/403 (58%), Gaps = 35/403 (8%)
Query: 24 DPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIIDMAGKASLSDQQQQVLDNIHSQIK 83
D R+LQ E D ++LFL TS+ R E AEE+ +A +QQ+QV + +H QIK
Sbjct: 16 DKRRLQHEIDTHKLFLLTSFQREATSGEIGVAEELTKLAEATPFEEQQRQVQEQVHGQIK 75
Query: 84 RFCLSMDEILL---LPEKNVPIESSAQSNAAPRRSGLSFAVGPTARPTDSPVVP--QTRQ 138
+ +D ILL LPE +P S + R SGL FA+G P P +T+
Sbjct: 76 KLSTVLDSILLKDPLPE--IPSPPSEKK----RPSGLFFALGRQPEPEAPPRANAIETKP 129
Query: 139 LTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYR 198
LTR +++ ++ G+TL L+ S IPHEEAG GLF+ G A G V+++YPG++YSP+ YR
Sbjct: 130 LTREQVASAFREHTGFTLCLQRSGIPHEEAGTGLFIKGRAEPGTVVSMYPGVVYSPSQYR 189
Query: 199 YIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWD---GLTLPEIMPNSKGAEKG 255
YIPGYP+VD N YLI+RYDG VI+ +PWG G D RE WD G++ + + A +
Sbjct: 190 YIPGYPKVDVDNGYLISRYDGLVIDGKPWGKGDDIREWWDGHNGISFEQAAELTGPARQS 249
Query: 256 SDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLT-EK 314
FW+ + + G+E LERR+P+AL HFANHP +G+ PNVMICPY F ++ ++
Sbjct: 250 LPSFWQFVGGGKVMQGPLEGAE-LERRSPIALGHFANHPPEGVHPNVMICPYTFSISGQQ 308
Query: 315 DMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPV-------------------LK 355
MR YIPN+ FGN EE +M+R G W G P +
Sbjct: 309 SMRAYIPNVMFGNDEEFDMQRRGPLWINEGKKERPEDPADTRLSTVERSEQPITLISDVH 368
Query: 356 TLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRW 398
TL LV TR + +EE+ LNYRLS+ ++P WY PV+ EE++RRW
Sbjct: 369 TLVLVTTREVDNEELFLNYRLSSYIQQPEWYHPVNVEEEKRRW 411
>gi|388519347|gb|AFK47735.1| unknown [Lotus japonicus]
Length = 153
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 95/147 (64%), Positives = 114/147 (77%), Gaps = 7/147 (4%)
Query: 250 KGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDF 309
+GAEKG+D LSKP++ + S ++LE RNPL+LAHFANHPAKGM+PNVM+C YDF
Sbjct: 9 EGAEKGADS----LSKPLEGSQVDSSDDVLECRNPLSLAHFANHPAKGMLPNVMVCSYDF 64
Query: 310 PLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWG--SGSGSSTPVLKTLALVATRAICD 367
PLTE +MR YIPNI +GNA EVN++ F FWFK G S S P L T+ LVATRA+ D
Sbjct: 65 PLTENNMRVYIPNILYGNA-EVNVKSFSRFWFKNGVSRNSESKVPTLNTVVLVATRALQD 123
Query: 368 EEVLLNYRLSNSKRRPVWYSPVDEEED 394
EE+LLNYRLSNSKRRP WY+P+DEEED
Sbjct: 124 EELLLNYRLSNSKRRPEWYAPLDEEED 150
>gi|384246392|gb|EIE19882.1| hypothetical protein COCSUDRAFT_44295 [Coccomyxa subellipsoidea
C-169]
Length = 261
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 108/260 (41%), Positives = 148/260 (56%), Gaps = 47/260 (18%)
Query: 149 KDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDA 208
++ +G++L L PS + H +AG+GL+L G A GAV+ +YPGIIY+ ++YR +PGYP+VD
Sbjct: 40 EERMGFSLCLAPSAVGHADAGEGLWLRGRAKTGAVVGLYPGIIYTRSHYREMPGYPKVDR 99
Query: 209 QNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMD 268
++ +LI+R+DG +++A+PWG G L P+ + + +D +L
Sbjct: 100 ESDFLISRFDGAILDAKPWGRG--------ALEEPQSAESLGYRDGLADALTQL------ 145
Query: 269 NKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLTEKDMRPYIPNISFGN- 327
+LERR+P +LAH+ANHP G PN+MI YDF R YIPN FG+
Sbjct: 146 --------HVLERRHPFSLAHYANHPPAGRRPNIMIAAYDFQNCRDGTRRYIPNAMFGSP 197
Query: 328 --------AEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEEVLLNYRLSNS 379
AEE+ M P+L LALVATR I DEE+LLNYRLS
Sbjct: 198 RASPGLTGAEEIPM---------------DDRPLL-CLALVATREIEDEELLLNYRLSPG 241
Query: 380 KRRPVWYSPVDEEEDRRRWS 399
RP WY+ VDE ED RRWS
Sbjct: 242 LARPSWYTAVDEAEDSRRWS 261
>gi|449512760|ref|XP_004164133.1| PREDICTED: uncharacterized LOC101224517 [Cucumis sativus]
Length = 135
Score = 181 bits (458), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 95/135 (70%), Positives = 106/135 (78%), Gaps = 9/135 (6%)
Query: 1 MAFLFQKFQEAVKTLAKSPTF------ARDPRQLQFEADMNRLFLYTSYNRLGRDAEEAD 54
MAFLFQKFQEAVK LAK+P F ARDPR+LQFEADMNRLFL+TSYNRLG DA EAD
Sbjct: 1 MAFLFQKFQEAVKVLAKNPMFTKNPLFARDPRRLQFEADMNRLFLFTSYNRLGMDATEAD 60
Query: 55 AEEIIDMAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRR 114
AEEIIDMA KAS +DQQ+QV +NIHSQ+K FC+ MD ILL E P ES +S A R+
Sbjct: 61 AEEIIDMASKASFADQQKQVQENIHSQVKSFCMHMDGILLPDE---PAESPEKSKDAVRK 117
Query: 115 SGLSFAVGPTARPTD 129
SGLSFAVG T+ PTD
Sbjct: 118 SGLSFAVGRTSPPTD 132
>gi|255089437|ref|XP_002506640.1| predicted protein [Micromonas sp. RCC299]
gi|226521913|gb|ACO67898.1| predicted protein [Micromonas sp. RCC299]
Length = 435
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 136/447 (30%), Positives = 205/447 (45%), Gaps = 110/447 (24%)
Query: 26 RQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIIDMAGKASLSDQQQQVLDNIHSQIKRF 85
R++Q E D+ LF+ TS + RD D ++ID+ LS++Q+
Sbjct: 23 RRVQEELDLRELFVLTSSH--ARDVPVDDDADLIDV-----LSEEQR------------- 62
Query: 86 CLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFAVGPTARPTDSPVVPQTRQLTRTELS 145
L+ DE+ KN + ++ + R L R TD +R +++
Sbjct: 63 GLTYDEVKAEVTKN--LTANLHTMCGEVRDAL--------RATDG-----VGHGSRDDVT 107
Query: 146 QRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPR 205
+ G++ + S P+ AG+GLF+ G+ + G ++ +YPG++Y+P R++PGYP+
Sbjct: 108 AEYRRRAGFSAHVAESTAPN--AGRGLFVNGQVDPGEIVCVYPGVVYAPENLRHMPGYPK 165
Query: 206 VDAQNPYLITRYDGTVINAQPWG--SGWDTRELW----DGLT---------------LPE 244
+ A NPYL+ RYDG+V++A PWG +GW + + DG T P
Sbjct: 166 IAADNPYLVARYDGSVVDASPWGLGAGWSSAKTSLDGDDGWTDGERRRVQGFGRWPGAPI 225
Query: 245 IMPNSKGAEKGSDQ-------FWKLLS---KPMDNKR---GGSGSEMLERRNPLALAHFA 291
+++G G D W + P+D + + +E ++RR+PLALAH A
Sbjct: 226 GTDDARGDSPGGDSPGGDSPGGWLRAAAGETPLDPTQLTDARTAAESVDRRHPLALAHLA 285
Query: 292 NHPAKGMVPNVMICPYDFPL---TE------KDMRPYIPNISFGN-----AEEVNMRRFG 337
NHP KG PNV + D TE K +R Y+PN+ G RR G
Sbjct: 286 NHPPKGTSPNVAVASVDVVFDAETEELDKDVKALRRYLPNVGLGRWRVPLGTGEAKRRGG 345
Query: 338 SFWFKWGSGS--GS-----------------------STPVLKTLALVATRAICDEEVLL 372
+ K G GS V+ TL LVA R I DEEV L
Sbjct: 346 AGEAKRARGGFVGSLARMFGEDDEDGDIEEAEAEAPNGKVVVPTLVLVAKRFILDEEVFL 405
Query: 373 NYRLSNSKRRPVWYSPVDEEEDRRRWS 399
NYRLS +RP WY PV+ EE++RRW+
Sbjct: 406 NYRLSTHVKRPDWYHPVNAEEEKRRWA 432
>gi|308811843|ref|XP_003083229.1| unnamed protein product [Ostreococcus tauri]
gi|116055108|emb|CAL57504.1| unnamed protein product [Ostreococcus tauri]
Length = 302
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 105/287 (36%), Positives = 138/287 (48%), Gaps = 43/287 (14%)
Query: 150 DAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQ 209
+A G+ L S I AG G+F+ G G V+A YPG+ Y + RY+ GYPRV
Sbjct: 22 EAYGHHCYLDASTI-GAAAGTGVFVRGNVGAGRVVATYPGLTYVGSAVRYMKGYPRVGVN 80
Query: 210 NPYLITRYDGTVINAQPWGSGWDTREL---WDGLTLPEIMPNSKGAEKGSDQFWKLLSKP 266
N YL+ R DG VI+ +PWG G D + W G + + A+ GS L +
Sbjct: 81 NEYLMARSDGCVIDGKPWGRGSDEGDGFGDWPGSPVKLTEEEEREADGGSFFTRLLRPRL 140
Query: 267 MDNKRGGSGSEM--LERRNPLALAHFANHPAKGMVPNVMICPYDFPLTEKDMRPYIPNIS 324
+R +E LER NP A AHFANHP +G PNV++ D P ++ D R Y+PN++
Sbjct: 141 PRERRERLLAECMSLERDNPYACAHFANHPPEGTSPNVIVATVDIP-SDPDTRRYVPNVN 199
Query: 325 F------------------------GNAEEVNMRRFGSFWFK-------WGSGSGSSTPV 353
+V+ FG K SG GS
Sbjct: 200 IEPDDGDKSSESVWSIIRDGERTIGERVRDVSASLFGEDADKPKAKSTEPTSGDGS---- 255
Query: 354 LKTLALVATRAICD-EEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
++TL LVATR I D EE+ LNYRLS P WY+ VD EED RRW+
Sbjct: 256 IRTLVLVATRDIEDGEEIFLNYRLSTHVTPPSWYTRVDPEEDDRRWA 302
>gi|145353865|ref|XP_001421220.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581457|gb|ABO99513.1| predicted protein, partial [Ostreococcus lucimarinus CCE9901]
Length = 295
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 97/289 (33%), Positives = 132/289 (45%), Gaps = 35/289 (12%)
Query: 143 ELSQRLKDAIGYTLDLKPSQIPHE--EAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYI 200
+L++ + D+ PS AG GLF G G V+A YPG+ Y A R++
Sbjct: 8 QLAELARATPARACDVAPSTAGRSGASAGDGLFARGAVAAGRVVATYPGLTYVGAATRFM 67
Query: 201 PGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFW 260
YPRV +N YLI R DG+VI+A+PWG G + + W G P + ++ +
Sbjct: 68 KDYPRVTRENEYLIARSDGSVIDAEPWGRGGENDDAWPGA--PVALSEAEARRADGASWL 125
Query: 261 KLLSKPM----DNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLTEKDM 316
+ L KP R L+R + LALAH+ANHP G PNV++ D P + D
Sbjct: 126 ERLRKPTLPRATRARLARECATLDRGDELALAHYANHPPSGTAPNVVVASVDVP-RDSDA 184
Query: 317 RPYIPNISFG------------------------NAEEVNMRRFGSFWFK-WGSGSGSST 351
R N++ A + ++ FG K G
Sbjct: 185 RRRTQNVNVECEEEGEETESVWSIVRDGERSLVERARDASLAMFGEDADKPRARGGTVRR 244
Query: 352 PVLKTLALVATRAICD-EEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
V++ L LV+TR I D EE+ LNYRLS P WY VD EED RRWS
Sbjct: 245 DVVRGLVLVSTRDIEDGEEIWLNYRLSTHVTPPAWYVRVDPEEDERRWS 293
>gi|412986804|emb|CCO15230.1| predicted protein [Bathycoccus prasinos]
Length = 449
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 130/477 (27%), Positives = 211/477 (44%), Gaps = 132/477 (27%)
Query: 18 SPTFARDPRQ---LQFEADMNRLFLYTSYNRLGRDAEEADAEEIIDMAGKASLSDQQQQV 74
+ +F++ PRQ LQ E+D+ +LFL T + E + M + + + +++
Sbjct: 10 ASSFSKKPRQGRALQEESDVQKLFLLTQFR-----------ERTMMMTRERNEREGEEE- 57
Query: 75 LDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFAVGPTARPTDSPVVP 134
S+ +R ++ E+L E+S + ++A R++ + +V ++
Sbjct: 58 ----DSRARREAVA-KEVL---------EASTKCSSAERKARVWRSVQTQLTTIGLDLMG 103
Query: 135 QTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSP 194
+T ++ T + + +KPS E +G+FL G A+VG V+++YPGI+Y+
Sbjct: 104 RTGDISETNTTNKNT--------VKPSDGVENE--EGVFLNGPASVGTVVSLYPGIVYAK 153
Query: 195 AYYRYIPGYPRVDAQNPYLITRYDGTVINAQPW-------------------------GS 229
R++ G+P V +N +L+ RYD TV++A+PW
Sbjct: 154 EALRFLKGFPNVAKENAFLVARYDDTVVDAKPWEEERRRSSSERSSSGVVEGSSDSESSI 213
Query: 230 GWDTRELWDGLTLPEIMPNSK--GAEKGSDQFWKL------LSKPMDN---KRG--GSGS 276
W G L E + K GA K + KL LS+P + KR +
Sbjct: 214 NSSGSSSWPGFPLREDLEKEKIVGALK-KEWVEKLPPWLEKLSRPELHAKIKRACLDVST 272
Query: 277 EMLER--RNPLALAHFANHPAKGMVPNVMICPYDFPLTEKD------------------- 315
++L + ++P+ALAH ANHP +G PNVMIC +D+ + E
Sbjct: 273 DLLVQAGKHPIALAHKANHPPQGKTPNVMICMHDYDVEEGAKEALRMMNMDTEDSQKLER 332
Query: 316 ----MRPYIPNISFGN----------AEEVNMRRFGSFWFKWGSGSGSSTP--------- 352
+R IPN+ N A+ F + +GS P
Sbjct: 333 MKHFVRAAIPNVYLSNDIRDYERFDPAKSARNASFLKMVSSFAAGSVFEDPNEKAKDLNE 392
Query: 353 ------VLK---TLALVATRAI-CDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
V+K T+ALVATR I +EE+ L+YRLS+ RP WY PVDEE ++RRWS
Sbjct: 393 AQMVDIVMKAAPTIALVATRDIEANEELFLDYRLSSHVERPDWYVPVDEEAEKRRWS 449
>gi|449018475|dbj|BAM81877.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 377
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 111/393 (28%), Positives = 171/393 (43%), Gaps = 55/393 (13%)
Query: 26 RQLQFEADMNRLFLYTSYNRLGRDAEEADAEEIIDMAGKASLSDQQQQVLDNIHSQIKRF 85
R+LQ + D +LFL G + A A+EI+ A +A+ + + S + +
Sbjct: 20 RELQHQRDNMQLFLLAVSYEKGFEQARALAQEIVSSAEQATRASSPLLMFRRALS-VWKH 78
Query: 86 CLSMDEILLLPEKNVPIESSAQSNAAPR---RSGLSFAVGPTARPTDSPVVPQTRQLTRT 142
S++ P+ ++ + AQS R R + + PT +
Sbjct: 79 SASLEMKTDSPQ-HILDDQGAQSRLIERVNARQVHNLLAAAASMPTSASAA--------- 128
Query: 143 ELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPG 202
+TL L+PSQ + AG G+ L G A +G+V+A YPGI Y ++ IPG
Sbjct: 129 ------TSGPAFTLSLRPSQ-SNVNAGYGVILNGHARMGSVVAFYPGIAYRVRHWTSIPG 181
Query: 203 YPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKL 262
YP + AQNPYL RYD +++A P E+ GL I+ ++ L
Sbjct: 182 YPYITAQNPYLFARYDRVIVDANP------NLEVSRGLMNIPILDHA------------L 223
Query: 263 LSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLT--EKDMRPYI 320
S ++ S E NP AL H NHP +G NVM C ++F L E + +I
Sbjct: 224 ESVVRHTRQTESLHECWVGLNPYALGHRVNHPPRGREANVMPCAWNFDLQRLEPSLLSFI 283
Query: 321 PNISFGNAEEVN----------MRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEEV 370
PN F +++ RF ++ G S + + L+A R + DEE+
Sbjct: 284 PNRLFTKYRQLSSGILGPTSMLWSRFRHWFDSLGHQDNMSLTEIPGVVLIALRDLQDEEL 343
Query: 371 LLNYRLSNSK----RRPVWYSPVDEEEDRRRWS 399
LNYR + + + P WY D+E +RRWS
Sbjct: 344 FLNYRYNPAALERLQAPSWYWQPDQEAAKRRWS 376
>gi|452820490|gb|EME27532.1| hypothetical protein isoform 1 [Galdieria sulphuraria]
gi|452820491|gb|EME27533.1| hypothetical protein isoform 2 [Galdieria sulphuraria]
Length = 436
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 108/429 (25%), Positives = 183/429 (42%), Gaps = 106/429 (24%)
Query: 31 EADMNRLFLYTSYNRLGRDAEE-----ADAEEIIDMAGKASLSD------QQQQVLD-NI 78
+ D RL++ TS R G+ A + D E ID + + +Q+++L+ NI
Sbjct: 44 QNDTLRLYVLTSLKRDGKKAADFFSRIMDGSETIDRVTQKKFEEFLPSPEEQKEILERNI 103
Query: 79 HSQI------------KRFCLSMDEILLL-----------PEKNVPIESSAQSNAAPRRS 115
Q+ ++ CL + LL P + + ++ RS
Sbjct: 104 WEQLLNMFSVLDICSRRKPCLPYGSLALLEKPRFPDPLPDPSDLISFANRIKAGQLTDRS 163
Query: 116 GLSFAVGPTAR---PTDSPVVPQTRQLTR--------TELSQRLKDAIGYTLDLKPSQIP 164
G ++ + D ++ R++ + E+S+R++ +G+ + S IP
Sbjct: 164 GDLVSLSSNHKELQTIDRQILQIDRRIKQYVPTEQDYEEMSRRIEKELGFIVYRATSTIP 223
Query: 165 HEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINA 224
+AGQGLF+ G+ VG+V+A+YPGIIY P+ + +PGYP V N +L+ R DG +I+
Sbjct: 224 --DAGQGLFIKGQCTVGSVVALYPGIIYFPSEWEKLPGYPYVSKYNVHLVKRQDGVIIDG 281
Query: 225 QPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNP 284
+P +D + + PE+ W ER NP
Sbjct: 282 KP----YDEQTISFQGERPEVA-------------W-------------------ERINP 305
Query: 285 LALAHFANHPAKGMVPNVMICPYDFPLTEKD--MRPYIPNISFGNAEE--------VNMR 334
AL +ANHP KG PN++ +D+PL M+ +PN + +E M
Sbjct: 306 FALGQYANHPPKGTEPNIIALSFDYPLYRMSPAMKILVPNRYLLDEQEDEAWSSHLSKME 365
Query: 335 RFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEEVLLNYRLSNSKRRPVWY-SPVDEE- 392
F G + V++ + ++ R + +EE+ +NYR + P WY P EE
Sbjct: 366 DVMQF------GKDRKSRVMRGMVIITLRDVENEELFINYRYNPFATHPEWYWDPSPEET 419
Query: 393 ----EDRRR 397
EDR++
Sbjct: 420 QWLIEDRQK 428
>gi|348688367|gb|EGZ28181.1| hypothetical protein PHYSODRAFT_309093 [Phytophthora sojae]
Length = 312
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 163/387 (42%), Gaps = 108/387 (27%)
Query: 26 RQLQFEADMNRLFLYTSYNRLGRDAEEADAE------EIIDMAGKASLSDQQQQVLDNIH 79
R+ Q EAD+ + + + + L D + A+ E + +++ G Q+Q V +
Sbjct: 19 RRFQEEADVGVVAVLSCFRSLKLDPQNANKEHEELFDQAVEIVGTKKFEQQKQYVQGEVA 78
Query: 80 SQIKRFCLSMDEILLLP---EKNVPIESSAQSNAAPRRSGLSFAVGPTARPTDSPVVPQT 136
Q++ C + +L +P EKNVP S + R + + +G
Sbjct: 79 KQVQS-CF--EAVLRMPKDSEKNVPKMSLS---GFERTTEVEEKLG-------------- 118
Query: 137 RQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAY 196
+S + G+TL+++ S +P +AG G+ L G A +G+++A+YPGI+Y P +
Sbjct: 119 -------ISDNFQKLFGFTLNVEKSSVP--DAGDGVKLRGSAPIGSLVALYPGIVYLPEH 169
Query: 197 YRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGS 256
Y+ Y + NPY R+D +I+A+ S
Sbjct: 170 YKK-KHYLSELSNNPYARARFDSVIIDAKNESS--------------------------- 201
Query: 257 DQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP----LT 312
KP RNPLA+AH NHP G PNV+ ++FP T
Sbjct: 202 --------KP---------------RNPLAVAHKINHPPNGTPPNVLGFAFEFPPEEPFT 238
Query: 313 EKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEE-VL 371
K+ IPN ++M FG ++ +LA +AT I DEE +
Sbjct: 239 TKEFDELIPNSFVKEPSRLSM--FGK------------RAIVHSLAFIATADIADEEELF 284
Query: 372 LNYRLSNSKRRPVWYSPVDEEEDRRRW 398
LNYR + + P WY+PVD E DR W
Sbjct: 285 LNYRYNPDHQLPEWYTPVDVESDRAMW 311
>gi|301117376|ref|XP_002906416.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262107765|gb|EEY65817.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 312
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 164/384 (42%), Gaps = 102/384 (26%)
Query: 26 RQLQFEADMNRLFLYTSYNRLGRDAEEADAE------EIIDMAGKASLSDQQQQVLDNIH 79
R+ Q EAD+ + + + L D + A+ E + +++ G Q+Q V + +
Sbjct: 19 RRFQEEADVGVAAVLSCFRSLKLDPQNANKEHEELFDQAVEIVGSKKFEQQKQYVQNEVA 78
Query: 80 SQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFAVGPTARPTDSPVVPQTRQL 139
Q++ + + +L +P+ + +APR S F R T+ +L
Sbjct: 79 KQVQ---ICFEAVLKMPKDS--------EKSAPRMSLSGFE-----RTTEV-----EEKL 117
Query: 140 TRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRY 199
T+ Q+L G+TL+++ S +P A +G+ L G A +G+++A+YPGI+Y P +Y+
Sbjct: 118 GITDNFQKL---FGFTLNVEESSVPG--ADEGVKLRGSAPIGSLVALYPGIVYLPEHYKK 172
Query: 200 IPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQF 259
+ V + N Y R+D +I+A+ S
Sbjct: 173 -KHHLGVLSNNAYARARFDSVIIDAKNESS------------------------------ 201
Query: 260 WKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP----LTEKD 315
KP RNPLA+AH NHP G PNV+ ++FP + K+
Sbjct: 202 -----KP---------------RNPLAVAHKINHPPSGTPPNVIGFAFEFPPEEPFSTKE 241
Query: 316 MRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEE-VLLNY 374
IPN ++M FG ++ LA +AT I DEE + LNY
Sbjct: 242 YDELIPNSFVQEPSRLSM--FGK------------RAIVHGLAFIATADIADEEELFLNY 287
Query: 375 RLSNSKRRPVWYSPVDEEEDRRRW 398
R + + P WY+PVD + DR W
Sbjct: 288 RYNPDRPLPEWYTPVDVDSDRAMW 311
>gi|66811938|ref|XP_640148.1| hypothetical protein DDB_G0282599 [Dictyostelium discoideum AX4]
gi|60468149|gb|EAL66159.1| hypothetical protein DDB_G0282599 [Dictyostelium discoideum AX4]
Length = 378
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/259 (33%), Positives = 117/259 (45%), Gaps = 60/259 (23%)
Query: 153 GYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPY 212
G+ L ++ S I HEEAG G+ + GE G VIAIYPG Y IP V + N Y
Sbjct: 152 GFRLHVRKSDINHEEAGYGVHIEGECVPGTVIAIYPGDTYDGD---NIPS--NVISDNDY 206
Query: 213 LITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRG 272
+I+RYDGTVI+ G W+ R D L L E +F+K L
Sbjct: 207 MISRYDGTVID----GRSWNKRA--DELILKE-------------KFFKKLGTK------ 241
Query: 273 GSGSEMLERRNPLALAHFANHP-----------AKGMVPNVMICPYDFPLT-EKDMRPYI 320
SG+++L +NP A+ +F NHP + PNV+ Y+F + YI
Sbjct: 242 SSGTDLLCYKNPFAIGNFINHPPIITKDGDGDTDQTKEPNVIAYSYNFRSNFPNHLSQYI 301
Query: 321 PNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVL-KTLALVATRAICDEEVLLNYRLSNS 379
PN + N T VL ++L L+A + I ++EV LNYR +
Sbjct: 302 PNKTVTEHSIFN-----------------DTSVLRRSLLLIAFKPIKNQEVFLNYRFNPD 344
Query: 380 KRRPVWYSPVDEEEDRRRW 398
P WY D EE +RRW
Sbjct: 345 LPYPNWYKQPDLEEAKRRW 363
>gi|328867711|gb|EGG16093.1| hypothetical protein DFA_09765 [Dictyostelium fasciculatum]
Length = 468
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/252 (30%), Positives = 116/252 (46%), Gaps = 53/252 (21%)
Query: 154 YTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYL 213
++L +K S I H EAG G+ + G + G V+AIYPG Y+ + + +N Y+
Sbjct: 247 FSLVIKKSSIDHGEAGYGVHVKGLVSPGTVLAIYPGDTYTASNMDHA-----AILENEYM 301
Query: 214 ITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGG 273
++RYDGTVI+ + W D I + +E G+
Sbjct: 302 MSRYDGTVIDGRSWLKKVDM----------SIYRSKYFSETGT---------------SV 336
Query: 274 SGSEMLERRNPLALAHFANHPAKGMV------PNVMICPYDFPLTEKD-MRPYIPNISFG 326
S E+L+ RNP A+ F NHP++ NV+ Y+FP T D ++PYIP+ S
Sbjct: 337 SEEELLKYRNPFAIGQFINHPSEHKQNPNFPHANVIGIKYNFPQTFPDHLKPYIPHTS-- 394
Query: 327 NAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEEVLLNYRLSNSKRRPVWY 386
+ N R + + K+ L+A R I DEE+LLNYR + + P WY
Sbjct: 395 -VTDTNFFRDKNIF-------------AKSYVLIANRYIQDEELLLNYRYNPANPYPPWY 440
Query: 387 SPVDEEEDRRRW 398
D E +RRW
Sbjct: 441 FQPDILEAKRRW 452
>gi|330806343|ref|XP_003291130.1| hypothetical protein DICPUDRAFT_81815 [Dictyostelium purpureum]
gi|325078691|gb|EGC32328.1| hypothetical protein DICPUDRAFT_81815 [Dictyostelium purpureum]
Length = 362
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/247 (29%), Positives = 114/247 (46%), Gaps = 47/247 (19%)
Query: 153 GYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPY 212
G+ + +K S + HEEAG G+++ GE G V+A+YPG Y IP P + +N Y
Sbjct: 146 GFKIHVKSSTVDHEEAGYGVYIEGEVIPGTVVALYPGDTYDG---ENIP--PSIVRENEY 200
Query: 213 LITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRG 272
+++RYDGT+I+ + W D LTL M G KP D
Sbjct: 201 MMSRYDGTIIDGRSWNKKCDE------LTLKSKMFKEIGTN----------VKPTD---- 240
Query: 273 GSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLTEKD-MRPYIPNISFGNAEEV 331
+L RNP A+ +F NHP PNV+ +++ D + YIPN + +
Sbjct: 241 -----ILCYRNPFAIGNFINHPPADDQPNVIAYAFNYRRGFPDHLVHYIPNRTITENQ-- 293
Query: 332 NMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDE 391
+F ++ + + + L+A + I + E+ LNYR + P WY+ D
Sbjct: 294 --------YF------SDNSVIRRGVVLIAFKPIINSELFLNYRFNPDLPYPDWYTQPDL 339
Query: 392 EEDRRRW 398
EE +RRW
Sbjct: 340 EEAKRRW 346
>gi|298709446|emb|CBJ31352.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 384
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 116/274 (42%), Gaps = 79/274 (28%)
Query: 141 RTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYI 200
R L + L+ +G+ +K S + AG GLF+ G A GAV+AIYPG+++ P + R
Sbjct: 154 RKHLKEMLRSELGFNTVVKDSTL--SGAGLGLFVEGSAPEGAVVAIYPGLVHLPEHLRSQ 211
Query: 201 P-GYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQF 259
+ ++ +L+ R DG +++ + G ++P+S
Sbjct: 212 KYAASLLPDEDHFLMGRADGCLVDGRTAG----------------LLPSS---------- 245
Query: 260 WKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKD--- 315
PL AH ANHP +G NV+ YDFP L+E D
Sbjct: 246 ------------------------PLTFAHAANHPPEGQAANVLQVAYDFPALSEDDVGI 281
Query: 316 ---------MRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRA-I 365
+RP +PN R+ G + +++T+ LVA R +
Sbjct: 282 AKGGRFPERLRPLVPN------------RYHRAPKALFGGKLDQSAMVETVVLVACRGGL 329
Query: 366 CDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
D+EV ++YR + + R P WYSPVD E W+
Sbjct: 330 HDQEVFMDYRFNPNMRHPDWYSPVDPEATAEMWA 363
>gi|303284110|ref|XP_003061346.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457697|gb|EEH54996.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 609
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 80/198 (40%), Gaps = 66/198 (33%)
Query: 268 DNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPL---------------- 311
D + + +ERR+PLALAHFANHP G PNV + D
Sbjct: 410 DRRVAFDAAAAVERRHPLALAHFANHPPGGTSPNVAVASVDVVFEEEEEEEEDAGAAEAG 469
Query: 312 ------------TEKDMRPYIPNISFGN---------AEEVNM---------------RR 335
+ +RPY+PN+ G+ EE + R
Sbjct: 470 TAEGPRSSTTDSVDASLRPYLPNVGIGSWGVPTRGWELEEGTLDSEQNKTGGFAGLLRRL 529
Query: 336 FG-------SFWFKWGSGSGSSTP-------VLKTLALVATRAICDEEVLLNYRLSNSKR 381
FG F K + S V+ TLALVA A+ DEEV LNYRLS +
Sbjct: 530 FGEDDDDDDDFETKARKKAADSEAARRRNRVVVPTLALVALEALQDEEVFLNYRLSTHVK 589
Query: 382 RPVWYSPVDEEEDRRRWS 399
RP WY PVD ED RRW+
Sbjct: 590 RPEWYVPVDPAEDARRWA 607
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 98/213 (46%), Gaps = 26/213 (12%)
Query: 12 VKTLA---KSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAE-EADAEEIIDMAGKASL 67
VKT A + R R+ Q AD++RL + T+Y RD + EA +++ A S
Sbjct: 38 VKTFAGRGSTDHLTRGARRAQEHADLHRLHVLTTY--YARDGDGEALLNDVLAGAEGKSY 95
Query: 68 SDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIESSAQSNAAPRRSGLSFAVGPTARP 127
+ V DN+ SQI+ C ++ L P + R+ S
Sbjct: 96 EQLKASVSDNLTSQIRTICAAVRASLASPRVPPRVGPRGGGGGDRDRASSSS-------- 147
Query: 128 TDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIY 187
+R+ E++ RL G+ ++ S +G+GLF+ G+ + G ++A+Y
Sbjct: 148 -------SSREAIANEITIRL----GFACAIQRSTA-GAGSGRGLFVRGKVDPGEIVALY 195
Query: 188 PGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGT 220
PG+ Y ++PG+P V A NPYLI R+DGT
Sbjct: 196 PGVAYDVRSVTHLPGFPEVAAANPYLIARFDGT 228
>gi|302851678|ref|XP_002957362.1| hypothetical protein VOLCADRAFT_107629 [Volvox carteri f.
nagariensis]
gi|300257321|gb|EFJ41571.1| hypothetical protein VOLCADRAFT_107629 [Volvox carteri f.
nagariensis]
Length = 523
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 67/123 (54%), Gaps = 31/123 (25%)
Query: 139 LTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYR 198
L++ ++++ L+ IG+ L+L PS +PH ++G GLFL GEA GA++AI+PG++Y
Sbjct: 159 LSQQQVAEALQRRIGFRLELGPSSVPHPDSGTGLFLRGEARPGAIVAIFPGVLYGRTQLA 218
Query: 199 YIPGYPR-------------------------------VDAQNPYLITRYDGTVINAQPW 227
++P +P+ VD NPYL RYD ++++++PW
Sbjct: 219 HMPNFPKARPERFLVLVMKLVVKLVLALDAGLRRSGVAVDMDNPYLSCRYDQSIVDSKPW 278
Query: 228 GSG 230
G G
Sbjct: 279 GRG 281
Score = 43.1 bits (100), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 32/48 (66%), Gaps = 1/48 (2%)
Query: 353 VLKTLALVATRAICD-EEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
V++ L LV+T A+ D +E+L NYR++ RP WY D E ++RRW+
Sbjct: 439 VVRLLVLVSTTAVRDGDELLQNYRMNPHVARPDWYVVHDAEAEQRRWA 486
>gi|156378496|ref|XP_001631178.1| predicted protein [Nematostella vectensis]
gi|156218214|gb|EDO39115.1| predicted protein [Nematostella vectensis]
Length = 375
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/159 (33%), Positives = 75/159 (47%), Gaps = 42/159 (26%)
Query: 154 YTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYL 213
++L +KPS I H EAG+G+FL G A G V+ +PG++Y P Y R +P YP +D N YL
Sbjct: 96 FSLSIKPSTIQHPEAGEGVFLDGSAVPGTVVMFHPGLVYLPPYIRKMPNYPWIDKDNTYL 155
Query: 214 ITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGG 273
++RYDG +++ + G DT D +R
Sbjct: 156 MSRYDGCIVDPK----GLDT----------------------------------DIQR-- 175
Query: 274 SGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLT 312
S M+ +PLA H NH G PN++ YDF T
Sbjct: 176 SDGTMVA--HPLAAGHKVNHAPPGTKPNILPFSYDFAKT 212
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 34/47 (72%)
Query: 353 VLKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
VL+++ V T + ++E+ +NYRL+ + + P WY+PVD +ED RRW+
Sbjct: 328 VLRSVVYVTTNHVHNQELFVNYRLNPAIKLPEWYTPVDRKEDLRRWN 374
>gi|413937912|gb|AFW72463.1| hypothetical protein ZEAMMB73_372625, partial [Zea mays]
Length = 56
Score = 78.6 bits (192), Expect = 5e-12, Method: Composition-based stats.
Identities = 35/45 (77%), Positives = 39/45 (86%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNR 45
MAFLF KFQEA++T+AKSP FARDPR LQFEAD+NRLFLYT R
Sbjct: 1 MAFLFNKFQEAIRTIAKSPMFARDPRHLQFEADVNRLFLYTRKCR 45
>gi|325186225|emb|CCA20726.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 133
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 63/121 (52%), Gaps = 18/121 (14%)
Query: 282 RNPLALAHFANHPAKGMVPNVMICPYDFPL----TEKDMRPYIPNISFGNAEEVNMRRFG 337
RNPL + H NHP +PNV+ PYDFP+ T+++ P IPN ++M FG
Sbjct: 26 RNPLGVGHMINHPPANELPNVIAFPYDFPMREPFTKEEHIPLIPNSFIDQPSRLSM--FG 83
Query: 338 SFWFKWGSGSGSSTPVLKTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRR 397
++ +LA ++ R I DEE+ LNYR + + P WY+PVD E D+
Sbjct: 84 K------------RILIHSLAFISLREIEDEELFLNYRYNPNLPYPEWYTPVDLESDKLL 131
Query: 398 W 398
W
Sbjct: 132 W 132
>gi|440798065|gb|ELR19136.1| hypothetical protein ACA1_335990 [Acanthamoeba castellanii str.
Neff]
Length = 317
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 66/225 (29%), Positives = 95/225 (42%), Gaps = 46/225 (20%)
Query: 177 EANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWG-SGWDTRE 235
E V I PG ++ P Y + N Y+ RYDG V++ + W + R
Sbjct: 127 EQGVHVKGHILPGTVHFP----YTLNENAIKGNN-YMYGRYDGVVVDGRDWERKAFAARR 181
Query: 236 LWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPA 295
L L E + S EK +L RNP +A + NHP
Sbjct: 182 LQTRL---ERVGQSIKDEKT----------------------LLRFRNPFGIAQYINHPP 216
Query: 296 KGMVPNVMICPYDFPLTEKD-MRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVL 354
G PNVM Y+FP T D ++PYIP+ + +E N+ +F + + +
Sbjct: 217 GGKQPNVMCVGYEFPNTFPDELKPYIPH---EHVKEPNI------FF-----NTMTDCYI 262
Query: 355 KTLALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
+++ +ATR I DEE+ LNYR + P WY D E RRW
Sbjct: 263 RSIMCIATRHITDEELFLNYRYNPINPYPDWYVQPDVAEAERRWK 307
>gi|388521337|gb|AFK48730.1| unknown [Lotus japonicus]
Length = 77
Score = 75.5 bits (184), Expect = 5e-11, Method: Composition-based stats.
Identities = 44/73 (60%), Positives = 51/73 (69%), Gaps = 3/73 (4%)
Query: 61 MAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEK--NVPIESSAQSNAAPRRSGLS 118
MA KAS DQQ+QV +N+HSQI+ FC MDEILL E+ N P+ S Q+ PRRSGLS
Sbjct: 1 MASKASFDDQQKQVQENVHSQIRTFCSFMDEILLPNEEPLNDPLGLSQQATVFPRRSGLS 60
Query: 119 FAVGPT-ARPTDS 130
FAVG T A P DS
Sbjct: 61 FAVGRTDAPPHDS 73
>gi|440802939|gb|ELR23854.1| hypothetical protein ACA1_123630 [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 67/232 (28%), Positives = 95/232 (40%), Gaps = 54/232 (23%)
Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVD 207
++ G+ L + S IP +AG GLFL G + G + IYPG +Y P +P
Sbjct: 131 MRTQFGFALVKRRSTIP--DAGHGLFLEGAVSEGTCVCIYPGAVYQPGDPLLLP-----S 183
Query: 208 AQNPYLITRYDGTVINAQPWG-SGWDTRELWDGLTLPEIMPNSKGAEKG-SDQFWKLLSK 265
+N YLI R DG+ ++ +P G S + R +++ G G D W
Sbjct: 184 LRNDYLIRRSDGSSVDGRPRGLSRYFYRSIYEKF----------GPFAGLCDATW----- 228
Query: 266 PMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDF-PLTEKDMRPYIPNIS 324
+D K GG+ NP H NH VPNVM Y F P T IPN+
Sbjct: 229 -LDIKEGGTNEH-----NPFGWGHLINHSD---VPNVMYFEYQFPPDTPAHYLSSIPNVF 279
Query: 325 FGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEEVLLNYRL 376
+ + + +KT+ L+A R I DEE+ NY
Sbjct: 280 YSSKDPAC--------------------AVKTIVLMAMRRIRDEELFSNYNF 311
>gi|440797554|gb|ELR18638.1| hypothetical protein ACA1_298810 [Acanthamoeba castellanii str.
Neff]
Length = 317
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 67/248 (27%), Positives = 109/248 (43%), Gaps = 51/248 (20%)
Query: 151 AIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQN 210
+ G+TL +PS I AG+G+F+ G+A+ G V+A YPG +Y+ + + YP + N
Sbjct: 100 SCGFTLAKRPSSI--SGAGEGVFIQGKASRGTVVAFYPGCVYAESDLPLV--YPHIYTDN 155
Query: 211 PYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNK 270
YL R D +I+A +SKG+ + + F S+ +
Sbjct: 156 DYLAFRSDKMLIDA-----------------------SSKGSSRQA--FLAAWSRDHPRQ 190
Query: 271 RGGSGSEMLERR---NPLALAHFANHPAKGMVPNVMICPYDFPLTEKD----MRPYIPNI 323
+ SG ++ N A+ H NHP PNV+ P+DF + D +R +IPN+
Sbjct: 191 QPPSGDKIDFAEVFANSYAVGHLVNHPPADRRPNVVSLPFDFLAADLDAHDHLRRHIPNV 250
Query: 324 SFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICD-EEVLLNYRLS-NSKR 381
F + R G +G+ V+ + +VA + D E+L++Y
Sbjct: 251 YF-----LPRRHEGD--------AGADGVVMSGMVIVAADDLEDGAELLMDYHYGMRGAD 297
Query: 382 RPVWYSPV 389
P WY P
Sbjct: 298 VPSWYRPA 305
>gi|403368929|gb|EJY84299.1| hypothetical protein OXYTRI_17960 [Oxytricha trifallax]
Length = 363
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 82/183 (44%), Gaps = 36/183 (19%)
Query: 141 RTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFL-CGEAN----VGAVIAIYPGIIYSPA 195
+ +L ++LK +G+ L +K S I H EAG+G+FL C G ++ I+PG++ P
Sbjct: 62 KEQLLKQLKYRVGHVLRIKDSSIDHHEAGRGVFLECPRRQPIVLPGTLLGIFPGVVCDPG 121
Query: 196 YYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKG 255
+P P+ PYL R+DG I+ + LP MP+ G
Sbjct: 122 C--PMPPTPK-RGLRPYL-KRFDGYWIDYEK--------------ELPYPMPSP-----G 158
Query: 256 SD--QFWKLLSKPMDNKRGGSGSEML----ERRNPLALAHFANHPAKGMVPNVMICPYDF 309
S+ F + S +D + G + E NP AL H NHP + NV + +D
Sbjct: 159 SNLIDFAENFSSQID--KSGMHKSFIEVPPEYMNPFALGHLINHPPPDIPANVQLIDFDL 216
Query: 310 PLT 312
P T
Sbjct: 217 PYT 219
>gi|390360997|ref|XP_788396.2| PREDICTED: uncharacterized protein C5orf35-like [Strongylocentrotus
purpuratus]
Length = 297
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 65/235 (27%), Positives = 97/235 (41%), Gaps = 59/235 (25%)
Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVI-AIYPGIIYSPAYYRYIPGYPRV 206
+ +GY + +PSQI E G G+F+ V+ A+YPG IY PG P +
Sbjct: 109 MHQTLGYAIGRRPSQI--EGGGVGVFVTKGVVPKGVVVAMYPGTIY-------YPGEPIL 159
Query: 207 --DAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLS 264
NP++ DG I+ + + L + + G SD W L S
Sbjct: 160 LQSIANPFVFRCLDGIHIDG-------NDKRLSKMIYRSCARRDRLGPYMFSDLSW-LTS 211
Query: 265 KPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMI----CPYDFPLTEKDMRPYI 320
P+ NP+A+ + N+ KG NV P DFP+ +R YI
Sbjct: 212 CPL---------------NPMAVGQYVNNQHKGAPANVCYQEFDVPLDFPIK---LRKYI 253
Query: 321 PNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAI-CDEEVLLNY 374
PN FG +V G++ ++T+ALV+ R I C EE+L +Y
Sbjct: 254 PNAFFGTLPDV----------------GNAGRNMRTVALVSVREIQCGEELLSSY 292
>gi|47227924|emb|CAF97553.1| unnamed protein product [Tetraodon nigroviridis]
Length = 279
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/180 (26%), Positives = 78/180 (43%), Gaps = 38/180 (21%)
Query: 151 AIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRV 206
+G+ +D KPS +P AG G+F+ G GA++A+YPG +Y P +++ I
Sbjct: 100 TLGFCIDRKPSSLPF--AGTGVFVTKGFVPKGAIVALYPGTVYQAYEPIFFQSI------ 151
Query: 207 DAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKP 266
+NP++ DG +I+ G G + G SD W L S P
Sbjct: 152 --RNPFVFRCIDGILIDGNDKGISRMVFRSCSGR-------DRVGFFMMSDISW-LTSDP 201
Query: 267 MDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
+ NPLA+ + N+ + NV YD P ++R ++PN++F
Sbjct: 202 L---------------NPLAVGQYVNNCSNQRPANVCYQEYDVPDAFPTELRQFLPNVNF 246
>gi|190344028|gb|ACE75808.1| hypothetical protein [Sorex araneus]
Length = 299
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 85/192 (44%), Gaps = 38/192 (19%)
Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGY 203
L +G+++ L S + AG+G+F+ G GAV+++YPG +Y P +++ I
Sbjct: 116 LYKTLGFSVALATSSLI--SAGKGVFVTKGLIPKGAVVSMYPGTVYQKYEPIFFQSI--- 170
Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
NP++ DG +I+ G +G + G K SD W L
Sbjct: 171 -----GNPFIFRCLDGVLIDGNDKGISKVVYRSCNG-------RDQLGPFKMSDSTW-LT 217
Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPN 322
S E RNPLAL + N+ + NV +D P + +++ Y+PN
Sbjct: 218 S---------------EIRNPLALGQYVNNCSNDRAANVCYQEFDVPAVFPLELKQYLPN 262
Query: 323 ISFGNAEEVNMR 334
I++ + ++ +R
Sbjct: 263 IAYSSDKQSLLR 274
>gi|410924740|ref|XP_003975839.1| PREDICTED: SET domain-containing protein 9-like [Takifugu rubripes]
Length = 283
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 82/189 (43%), Gaps = 38/189 (20%)
Query: 151 AIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQ 209
+G+ +D KPS + AG G+F+ G GA++A+YPG +Y P Y P + + +
Sbjct: 103 TLGFCIDRKPSSLAF--AGTGVFVTKGLVPKGAIVAMYPGTVYQP----YEPIFFQ-SIR 155
Query: 210 NPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDN 269
NP++ DG +I+ G G + G SD W L + P+
Sbjct: 156 NPFVFRCIDGILIDGNDKGISKMVFRSCSG-------RDRVGFFMVSDVSW-LTNDPL-- 205
Query: 270 KRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYD----FPLTEKDMRPYIPNISF 325
NPLA+ + N+ + NV YD FP+ ++R Y+PN++F
Sbjct: 206 -------------NPLAVGQYVNNCSNERPANVCYQEYDVPDKFPI---ELRQYLPNVNF 249
Query: 326 GNAEEVNMR 334
+ +R
Sbjct: 250 RQEVQAPLR 258
>gi|321459697|gb|EFX70748.1| hypothetical protein DAPPUDRAFT_327850 [Daphnia pulex]
Length = 265
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 56/236 (23%), Positives = 94/236 (39%), Gaps = 52/236 (22%)
Query: 143 ELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIP 201
E + ++ +GYT+ L+ S IP EAG G+F+ G G+++A+YPG + YR +
Sbjct: 72 EALKVMERVLGYTIQLRKSNIP--EAGTGVFVSKGTVTAGSIVALYPGTV-----YRVME 124
Query: 202 GYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWK 261
NP++ DG I+ + G +R+++ D W
Sbjct: 125 PLLLQSIVNPFIFRCIDGLHIDGKDRGI---SRKIFKSCVY-------------RDLRWP 168
Query: 262 LLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLTE--KDMRPY 319
+ G + NPL + N+ NV D P+ ++R Y
Sbjct: 169 FFPADLTWLTG-------DLINPLNCGQYVNNETPEYPCNVAYQECDLPIDRFPIELRCY 221
Query: 320 IPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAI-CDEEVLLNY 374
+PNI + ++E G+ T L+ +ALVA + I D E+ Y
Sbjct: 222 LPNIYYSSSE------------------GNQTKFLRVVALVAVKDIGIDTELFSTY 259
>gi|301621891|ref|XP_002940280.1| PREDICTED: uncharacterized protein C5orf35 [Xenopus (Silurana)
tropicalis]
Length = 297
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 59/238 (24%), Positives = 99/238 (41%), Gaps = 60/238 (25%)
Query: 143 ELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYR 198
E + L +G+ + PS I + AG G+F+ G GAV+++YPG +Y P +++
Sbjct: 109 EAREILHSTLGFCIARAPSSI--DSAGTGVFVSKGHVQKGAVVSMYPGTVYQQYEPIFFQ 166
Query: 199 YIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQ 258
I +NP++ DG +I+ G ++ L+ + + + G K D
Sbjct: 167 SI--------RNPFIFRCIDGILIDGNDKGI---SQYLYRSCSRRDQL----GPLKLCDA 211
Query: 259 FWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMR 317
W L + P +NP A+ + N+ + NV +D P ++R
Sbjct: 212 TW-LSAFP---------------QNPWAVGQYVNNCSNEREANVCYQEFDVPEYFPIELR 255
Query: 318 PYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAI-CDEEVLLNY 374
Y+PNI++ S S L+ + LVA R I C EE+ NY
Sbjct: 256 QYLPNINY---------------------SPDSQGPLRCVVLVALRDIDCGEELFSNY 292
>gi|219111099|ref|XP_002177301.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411836|gb|EEC51764.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 470
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 61/271 (22%), Positives = 107/271 (39%), Gaps = 79/271 (29%)
Query: 135 QTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAGQGLFLCGEANVGAVIAIYPGIIY-- 192
Q+ + R+ L +++ +G+++ S IP AG+G F+ G A G+++A PG ++
Sbjct: 249 QSLNMCRSAL---IREELGFSVVSFRSLIP--GAGRGAFVDGNALAGSILAFQPGDVWPK 303
Query: 193 ------SPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIM 246
+P + G + ++ R+D V++++ TRE
Sbjct: 304 EHLLTSAPDVIEHFAG-----EDDCHVSLRFDDYVVDSRQSPVTVLTRE----------- 347
Query: 247 PNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICP 306
GS NP A+ H ANHP PNV
Sbjct: 348 ----------------------------GS-----MNPWAIGHMANHPPSKTQPNVQSTM 374
Query: 307 YDFP----LTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVAT 362
+F + E D+ Y+PN ++ R F + V+ L L++
Sbjct: 375 LNFTERMNMAENDLFRYLPN-TYARLPSWRSRVF-----------EAEEVVMHGLCLLSK 422
Query: 363 RAICDEEVLLNYRLSNSKRRPVWYSPVDEEE 393
R + +EE++ +YRL S+ P WYS V ++
Sbjct: 423 RDVSNEELVYDYRL-QSETTPDWYSVVRHDD 452
>gi|397566635|gb|EJK45127.1| hypothetical protein THAOC_36272 [Thalassiosira oceanica]
Length = 472
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 58/138 (42%), Gaps = 31/138 (22%)
Query: 282 RNPLALAHFANHPAKGMV---------PNVMICPYDFPLT-----EKDMRPYIPNISFGN 327
+N A+AH NHP PN + C +F T + +R Y+PN
Sbjct: 339 KNLWAIAHIVNHPPLRKTGERDDVFSGPNCVTCSINFTETMFRDQQDKLRDYLPNEYEQP 398
Query: 328 AEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEEVLLNYRLS------NSKR 381
+E W + S V+ L L A R + DEE+ +YRLS N ++
Sbjct: 399 PQE---------WTR--SMFDDEKVVMHGLGLFALRDLEDEELFFDYRLSPDDRQPNGQQ 447
Query: 382 RPVWYSPVDEEEDRRRWS 399
P WY +DEE + RW+
Sbjct: 448 YPHWYRVIDEEAVKNRWN 465
>gi|344272547|ref|XP_003408093.1| PREDICTED: uncharacterized protein C5orf35-like [Loxodonta
africana]
Length = 299
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/172 (26%), Positives = 74/172 (43%), Gaps = 36/172 (20%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 134 AGKGVFVTKGFVPKGAVVSMYPGTVYQKYEPVFFQSI--------GNPFIFRCLDGVLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W SEM RN
Sbjct: 186 GNDKGISKVVYRSCNG-------RDQLGPLKMSDSTW-------------LTSEM---RN 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFPLT-EKDMRPYIPNISFGNAEEVNMR 334
PLA+ + N+ + NV +D P ++R Y+PNI++ ++ +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAAFPVELRQYLPNIAYSYDKQSPLR 274
>gi|307110916|gb|EFN59151.1| hypothetical protein CHLNCDRAFT_49994 [Chlorella variabilis]
Length = 988
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/230 (25%), Positives = 91/230 (39%), Gaps = 31/230 (13%)
Query: 144 LSQRLKDAIGYTLDLKPSQIPHEEAGQGLFL-CGEANVGAVIAIYPGIIYSPAYYRYI-- 200
L++ L D IG+ L +PS I H +AG G+FL G G ++ +YPG+++ + +
Sbjct: 596 LTEALADTIGFELQRRPSGIQHPDAGDGVFLTAGGCPAGGILTLYPGLVHDMGDFVLLLG 655
Query: 201 ---------PGYPRVDAQNPYLI---TRYDGTVINAQPWGSG---WDTRELWDGLTLPE- 244
P P N YL+ +++ QP G + PE
Sbjct: 656 VSAPDDACAPAPPAWTLDNHYLLQLRCLNAACLVDGQPGGLSAQRFQAAAASWAAGQPEA 715
Query: 245 IMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANH-PAKGMVPNVM 303
+ N+ G Q +L G G E L R+ AL H NH PA
Sbjct: 716 ALVNASWLALGGQQRAEL--------DGRLGCEALLRQA--ALGHMLNHGPAAAANAAFD 765
Query: 304 ICPYDFPLTEKDMRPYIPNISFGNAEEVNMRRFGSFWFKWGSGSGSSTPV 353
+CP L R Y+P++S G E +R + G+ + +PV
Sbjct: 766 MCPLPAALPPSLQR-YVPSLSVGGRGEEEVRWVPLVTARTALGATAGSPV 814
>gi|292617940|ref|XP_001332805.2| PREDICTED: uncharacterized protein C5orf35-like [Danio rerio]
Length = 288
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 58/232 (25%), Positives = 97/232 (41%), Gaps = 62/232 (26%)
Query: 150 DAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDA 208
+ +G++++ K S + E AG G+F+ G G+++A+YPG IY P +
Sbjct: 107 ETLGFSVERKISSL--ERAGTGVFVMRGRVPAGSIVAMYPGTIYQPDEPIFFQS-----V 159
Query: 209 QNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMD 268
+NP++ DG +I+ + ++ +I+ S G D+F L
Sbjct: 160 RNPFVFRCIDGILIDG-------------NDKSISKIVYRSCS---GRDRFGPL------ 197
Query: 269 NKRGGSGSEMLER-RNPLALAHFANHPAKGMVPNVMI----CPYDFPLTEKDMRPYIPNI 323
R GS + NPLA+ + N+ + NV P FPL ++R Y+PN+
Sbjct: 198 --RLSDGSWLTPHPENPLAVGQYVNNCSNEKSANVCFQELDVPDGFPL---ELRQYLPNV 252
Query: 324 SFGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICD-EEVLLNY 374
++ S L+ + LVA R I D EE+ NY
Sbjct: 253 NY---------------------RADSKRPLRCVVLVALRDIDDGEELFSNY 283
>gi|183398083|gb|ACC62499.1| hypothetical protein [Rhinolophus ferrumequinum]
Length = 299
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 76/172 (44%), Gaps = 36/172 (20%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYHTYEPIFFQSI--------GNPFIFRCLDGVLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W L S E +N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIQN 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
PLA+ + N+ + NV +D P + +++ Y+PNI++ N ++ +R
Sbjct: 223 PLAIGQYVNNCSNDRPANVCYQEFDVPAVFPIELKQYLPNIAYSNDKQSTLR 274
>gi|426246475|ref|XP_004017019.1| PREDICTED: SET domain-containing protein 9 [Ovis aries]
Length = 299
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 82/192 (42%), Gaps = 38/192 (19%)
Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGY 203
L +G+++ PS + AG+G+F+ G GAV+++YPG +Y P +++ I
Sbjct: 116 LYKTLGFSVARAPSSLI--SAGKGVFVTKGLVRKGAVVSMYPGTVYQKYEPIFFQSI--- 170
Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
NP++ DG +I+ G +G + G K SD W L
Sbjct: 171 -----GNPFIFRCLDGVLIDGNDKGISKVVYRSCNG-------RDQLGPLKMSDSTW-LT 217
Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLT-EKDMRPYIPN 322
S E NPLA+ + N+ + NV +D P +++ Y+PN
Sbjct: 218 S---------------EIHNPLAIGQYVNNCSNDRAANVCYQEFDVPAAFPIELKQYLPN 262
Query: 323 ISFGNAEEVNMR 334
I++ ++ +R
Sbjct: 263 IAYSYDKQSPLR 274
>gi|61841357|ref|XP_592703.1| PREDICTED: uncharacterized protein C5orf35 isoform 1 [Bos taurus]
gi|297487643|ref|XP_002696366.1| PREDICTED: uncharacterized protein C5orf35 [Bos taurus]
gi|296475835|tpg|DAA17950.1| TPA: hypothetical protein BOS_19749 [Bos taurus]
gi|440904685|gb|ELR55160.1| hypothetical protein M91_11582 [Bos grunniens mutus]
Length = 299
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 82/192 (42%), Gaps = 38/192 (19%)
Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGY 203
L +G+++ PS + AG+G+F+ G GAV+++YPG +Y P +++ I
Sbjct: 116 LYKTLGFSVARAPSSLI--SAGKGVFVTKGLVRKGAVVSMYPGTVYQKYEPIFFQSI--- 170
Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
NP++ DG +I+ G G + G K SD W L
Sbjct: 171 -----GNPFIFRCLDGVLIDGNDKGISKVVYRSCSG-------RDQLGPLKMSDSTW-LT 217
Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPN 322
S E NPLA+ + N+ + NV +D P + +++ Y+PN
Sbjct: 218 S---------------EIHNPLAIGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPN 262
Query: 323 ISFGNAEEVNMR 334
I++ ++ +R
Sbjct: 263 IAYSYDKQSPLR 274
>gi|255076193|ref|XP_002501771.1| predicted protein [Micromonas sp. RCC299]
gi|226517035|gb|ACO63029.1| predicted protein [Micromonas sp. RCC299]
Length = 298
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 94/248 (37%), Gaps = 61/248 (24%)
Query: 156 LDLKPSQIPHEEAGQGLFLCGEANV---GAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPY 212
++++ SQ+ + AG GLFL +A+ G +IAIY G+ Y+P P
Sbjct: 63 VEVRRSQVRSDSAGSGLFLRSDAHAAKPGDLIAIYAGV-YTPPAPDVTHAADGTSILIPL 121
Query: 213 LITRYDGT-VINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKR 271
R +G VIN Q G GW + A KG W
Sbjct: 122 PALREEGAYVINLQ--GGGWIC-------GAESALQAKNNAAKGLCAGW----------- 161
Query: 272 GGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLTEKD-MRPYIPNISFGNAEE 330
A+A ANHP +G +PNV D+ D +R +I I+
Sbjct: 162 --------------AVAALANHPPRGSIPNVEAIAVDWDDAISDAVRKHINPITT----- 202
Query: 331 VNMRRFGSFWFKWGSGSGSSTPVLKT-----LALVATRAIC-DEEVLLNYRLSNSKRRPV 384
W G S V +T + +A+R I +E+L NYRL+ P
Sbjct: 203 ----------LPWYIDDGVSVFVPQTVRSRGIVFLASRTINPGDEILFNYRLNPQGNLPR 252
Query: 385 WYSPVDEE 392
WY+ V +E
Sbjct: 253 WYAAVPDE 260
>gi|432913661|ref|XP_004079001.1| PREDICTED: SET domain-containing protein 9-like [Oryzias latipes]
Length = 268
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 48/180 (26%), Positives = 78/180 (43%), Gaps = 38/180 (21%)
Query: 151 AIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQ 209
+G+ +D +PS +P AG G+F+ G G V+A+YPG +Y P Y P + +
Sbjct: 88 TLGFCIDRRPSTLP--LAGTGVFVVNGVIPKGTVVAMYPGTVYQP----YDPILFQ-SIR 140
Query: 210 NPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDN 269
NP++ DG +I+ G + G + G SD W L + P
Sbjct: 141 NPFIFRCIDGVLIDGSDKGMSKMVFKSCSG-------RDRLGPLVTSDTSW-LTASP--- 189
Query: 270 KRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYD----FPLTEKDMRPYIPNISF 325
RNPLA+ + N+ + NV YD FP+ ++ ++PN++F
Sbjct: 190 ------------RNPLAVGQYVNNCSNTKPANVCYQEYDVPDRFPI---ELWQFLPNVNF 234
>gi|348501071|ref|XP_003438094.1| PREDICTED: uncharacterized protein C5orf35-like [Oreochromis
niloticus]
Length = 292
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 80/189 (42%), Gaps = 38/189 (20%)
Query: 151 AIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQ 209
+G+ ++ KPS +P AG G+F+ G GA +A+YPG +Y P Y P + +
Sbjct: 112 TLGFCIERKPSTLPF--AGSGVFVTKGFVPKGATVAMYPGTVYQP----YEPILLQ-SIR 164
Query: 210 NPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDN 269
NP++ DG +++ G + G + G SD W L P
Sbjct: 165 NPFVFRCIDGILVDGNDKGISRMVFKSCSGR-------DRIGPFLTSDTSW-LTDTP--- 213
Query: 270 KRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYD----FPLTEKDMRPYIPNISF 325
+NPLA+ + N+ + NV YD FP+ ++ Y+PN+++
Sbjct: 214 ------------QNPLAVGQYVNNCSDERPANVCYQEYDVPDKFPI---ELHQYLPNVNY 258
Query: 326 GNAEEVNMR 334
+ +R
Sbjct: 259 SQGTQRRLR 267
>gi|363744171|ref|XP_001233556.2| PREDICTED: uncharacterized protein C5orf35 isoform 1 [Gallus
gallus]
Length = 296
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 52/208 (25%), Positives = 93/208 (44%), Gaps = 40/208 (19%)
Query: 134 PQTRQLTRTELS--QRLKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGI 190
P R R LS + L +G++L + S + AG G+F+ G A GAV+A+YPG
Sbjct: 97 PPGRHSQREPLSPEEVLFRTLGFSLARERSSL--LSAGTGVFVSRGSAPQGAVVAMYPGT 154
Query: 191 IY---SPAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMP 247
+Y P +++ + NP++ DG +I+ G +R ++ + + +
Sbjct: 155 VYRKHEPIFFQSL--------GNPFIFRCIDGILIDGNDKGL---SRSVYRSCSWRDQL- 202
Query: 248 NSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPY 307
G + SD W L + P +NPLA+ + N+ + NV +
Sbjct: 203 ---GPFQMSDVSW-LTAAP---------------QNPLAVGQYVNNCSYEKAANVCYQEF 243
Query: 308 DFPLT-EKDMRPYIPNISFGNAEEVNMR 334
D P +++ Y+PNI + + E ++R
Sbjct: 244 DVPRHFPVELKQYLPNIVYSHEIESHLR 271
>gi|311273761|ref|XP_003134022.1| PREDICTED: uncharacterized protein C5orf35-like [Sus scrofa]
Length = 299
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 50/207 (24%), Positives = 88/207 (42%), Gaps = 40/207 (19%)
Query: 135 QTRQLTRTELS--QRLKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGII 191
+ R RT L + L +G+++ PS + AG+G+F+ G AV+++YPG +
Sbjct: 101 EYRHQQRTTLKPEEILYKTLGFSVARAPSSLI--SAGKGVFVTKGLVPKDAVVSMYPGTV 158
Query: 192 YS---PAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPN 248
Y P +++ I NP++ DG +I+ G +G +
Sbjct: 159 YQKYEPVFFQSI--------GNPFIFRCLDGVLIDGNDKGISKVVYRSCNG-------RD 203
Query: 249 SKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYD 308
G K SD W L S E NPLA+ + N+ + NV +D
Sbjct: 204 RLGPLKMSDSTW-LTS---------------EIHNPLAVGQYVNNCSNDKAANVCYQEFD 247
Query: 309 FP-LTEKDMRPYIPNISFGNAEEVNMR 334
P + +++ Y+PNI++ ++ +R
Sbjct: 248 VPAVFPIELKQYLPNIAYSYDKQSPLR 274
>gi|390459961|ref|XP_003732396.1| PREDICTED: uncharacterized protein C5orf35 homolog isoform 2
[Callithrix jacchus]
Length = 273
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 45/183 (24%), Positives = 78/183 (42%), Gaps = 38/183 (20%)
Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGY 203
L +G+++ S + AG+G+F+ G GAV+++YPG +Y P +++ I
Sbjct: 90 LYKTLGFSVAQATSSL--NSAGKGVFVTKGLVPKGAVVSMYPGTVYQNYEPIFFQSIG-- 145
Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
NP++ DG +I+ G +G + G K SD W L
Sbjct: 146 ------NPFIFRCLDGVLIDGNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LT 191
Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPN 322
S E NPLA+ + N+ + NV +D P + +++ Y+PN
Sbjct: 192 S---------------EIHNPLAMGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPN 236
Query: 323 ISF 325
I++
Sbjct: 237 IAY 239
>gi|169409540|gb|ACA57888.1| hypothetical protein [Callicebus moloch]
Length = 299
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 46/183 (25%), Positives = 78/183 (42%), Gaps = 38/183 (20%)
Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGY 203
L +GY++ S + AG+G+F+ G GAV+++YPG +Y P +++ I
Sbjct: 116 LYKTLGYSVAQATSSLI--SAGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--- 170
Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
NP++ DG +I+ G +G + G K SD W L
Sbjct: 171 -----GNPFIFRCLDGVLIDGNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LT 217
Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPN 322
S E NPLA+ + N+ + NV +D P + +++ Y+PN
Sbjct: 218 S---------------EIHNPLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPN 262
Query: 323 ISF 325
I++
Sbjct: 263 IAY 265
>gi|296194560|ref|XP_002745008.1| PREDICTED: uncharacterized protein C5orf35 homolog isoform 1
[Callithrix jacchus]
gi|166064956|gb|ABY79120.1| hypothetical protein [Callithrix jacchus]
Length = 299
Score = 45.8 bits (107), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 45/183 (24%), Positives = 78/183 (42%), Gaps = 38/183 (20%)
Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGY 203
L +G+++ S + AG+G+F+ G GAV+++YPG +Y P +++ I
Sbjct: 116 LYKTLGFSVAQATSSL--NSAGKGVFVTKGLVPKGAVVSMYPGTVYQNYEPIFFQSI--- 170
Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
NP++ DG +I+ G +G + G K SD W L
Sbjct: 171 -----GNPFIFRCLDGVLIDGNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LT 217
Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPN 322
S E NPLA+ + N+ + NV +D P + +++ Y+PN
Sbjct: 218 S---------------EIHNPLAMGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPN 262
Query: 323 ISF 325
I++
Sbjct: 263 IAY 265
>gi|395510329|ref|XP_003759430.1| PREDICTED: SET domain-containing protein 9 [Sarcophilus harrisii]
Length = 270
Score = 45.8 bits (107), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 50/204 (24%), Positives = 86/204 (42%), Gaps = 46/204 (22%)
Query: 135 QTRQLTRTELSQR--------LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIA 185
Q +L R + Q+ L + +G+++ + S + AG+G+F+ G GAV++
Sbjct: 66 QRTELPRCNIQQQNIYKPGEVLFNTLGFSISRETSSLI--SAGKGVFVTKGFVPKGAVVS 123
Query: 186 IYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTL 242
+YPG +Y P +++ I NP++ DG +I+ G +G
Sbjct: 124 MYPGTVYQKHEPIFFQSI--------GNPFIFRCLDGMLIDGNDKGISKVVYRSCNG--- 172
Query: 243 PEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNV 302
+ G K SD W L S E +NPLA+ + N+ + NV
Sbjct: 173 ----RDQLGPFKMSDSTW-LTS---------------EIQNPLAVGQYVNNCSNDKEANV 212
Query: 303 MICPYDFP-LTEKDMRPYIPNISF 325
+D P L + + Y+PNI +
Sbjct: 213 CYQEFDVPELFPLEFKQYLPNIMY 236
>gi|281182766|ref|NP_001162402.1| SET domain-containing protein 9 [Papio anubis]
gi|163780989|gb|ABY40772.1| chromosome 5 open reading frame 35 (predicted) [Papio anubis]
Length = 298
Score = 45.8 bits (107), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 133 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 184
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W L S E N
Sbjct: 185 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 221
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
PLA+ + N+ + NV +D P + +++ Y+PNI++ ++ +R
Sbjct: 222 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPVELKQYLPNIAYSYDKQSPLR 273
>gi|302564946|ref|NP_001181100.1| SET domain containing 9 [Macaca mulatta]
gi|355691322|gb|EHH26507.1| hypothetical protein EGK_16503 [Macaca mulatta]
gi|355749929|gb|EHH54267.1| hypothetical protein EGM_15070 [Macaca fascicularis]
gi|380809686|gb|AFE76718.1| hypothetical protein LOC133383 isoform 1 [Macaca mulatta]
Length = 299
Score = 45.8 bits (107), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W L S E N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
PLA+ + N+ + NV +D P + +++ Y+PNI++ ++ +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPVELKQYLPNIAYSYDKQSPLR 274
>gi|197102146|ref|NP_001125419.1| SET domain-containing protein 9 [Pongo abelii]
gi|75055093|sp|Q5RBW0.1|SETD9_PONAB RecName: Full=SET domain-containing protein 9
gi|55727995|emb|CAH90750.1| hypothetical protein [Pongo abelii]
Length = 273
Score = 45.8 bits (107), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 36/163 (22%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 108 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSIG--------NPFIFRCLDGVLID 159
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W L S E N
Sbjct: 160 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 196
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
PLA+ + N+ + NV +D P + +++ Y+PNI++
Sbjct: 197 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAY 239
>gi|397514313|ref|XP_003827435.1| PREDICTED: SET domain-containing protein 9 isoform 1 [Pan paniscus]
Length = 299
Score = 45.8 bits (107), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W L S E N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
PLA+ + N+ + NV +D P + +++ Y+PNI++ ++ +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAYSYDKQSPLR 274
>gi|426384667|ref|XP_004058879.1| PREDICTED: SET domain-containing protein 9 isoform 1 [Gorilla
gorilla gorilla]
Length = 299
Score = 45.8 bits (107), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W L S E N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
PLA+ + N+ + NV +D P + +++ Y+PNI++ ++ +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAYSYDKQSPLR 274
>gi|338718749|ref|XP_001916598.2| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
C5orf35-like, partial [Equus caballus]
Length = 279
Score = 45.4 bits (106), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 114 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGILID 165
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W L S E N
Sbjct: 166 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 202
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
PLA+ + N+ + NV +D P + +++ Y+PNI++ ++ +R
Sbjct: 203 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAYSYDKQSPLR 254
>gi|114600191|ref|XP_517753.2| PREDICTED: SET domain-containing protein 9 isoform 2 [Pan
troglodytes]
gi|119575352|gb|EAW54957.1| hypothetical protein MGC33648, isoform CRA_b [Homo sapiens]
gi|410225266|gb|JAA09852.1| chromosome 5 open reading frame 35 [Pan troglodytes]
gi|410252724|gb|JAA14329.1| chromosome 5 open reading frame 35 [Pan troglodytes]
gi|410301704|gb|JAA29452.1| chromosome 5 open reading frame 35 [Pan troglodytes]
gi|410329565|gb|JAA33729.1| chromosome 5 open reading frame 35 [Pan troglodytes]
Length = 299
Score = 45.4 bits (106), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W L S E N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
PLA+ + N+ + NV +D P + +++ Y+PNI++ ++ +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAYSYDKQSPLR 274
>gi|22328098|gb|AAH36528.1| Chromosome 5 open reading frame 35 [Homo sapiens]
gi|325464235|gb|ADZ15888.1| chromosome 5 open reading frame 35 [synthetic construct]
Length = 299
Score = 45.4 bits (106), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W L S E N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
PLA+ + N+ + NV +D P + +++ Y+PNI++ ++ +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAYSYDKQSPLR 274
>gi|285397547|ref|NP_714917.2| SET domain-containing protein 9 isoform 1 [Homo sapiens]
gi|296434439|sp|Q8NE22.2|SETD9_HUMAN RecName: Full=SET domain-containing protein 9
Length = 299
Score = 45.4 bits (106), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W L S E N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
PLA+ + N+ + NV +D P + +++ Y+PNI++ ++ +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAYSYDKQSPLR 274
>gi|291395387|ref|XP_002714031.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
Length = 628
Score = 45.4 bits (106), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 44/183 (24%), Positives = 75/183 (40%), Gaps = 38/183 (20%)
Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGY 203
L +G+ + PS + AG+G+F+ G GAV+++YPG +Y P +++ I
Sbjct: 445 LYKTLGFCVVRAPSSV--ISAGKGVFVAKGLVPKGAVVSMYPGTVYQKHEPIFFQSI--- 499
Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
NP++ DG +I+ G G + G K SD W L
Sbjct: 500 -----GNPFIFRCLDGVLIDGNDKGISKVVYRSCSGR-------DRLGPLKMSDSTW--L 545
Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFPLT-EKDMRPYIPN 322
+ D NPLA+ + N+ + NV +D P +++ Y+PN
Sbjct: 546 TAETD--------------NPLAVGQYVNNCSNERAANVCYQEFDVPAVFPVELKQYLPN 591
Query: 323 ISF 325
++
Sbjct: 592 TAY 594
>gi|431908607|gb|ELK12200.1| hypothetical protein PAL_GLEAN10009679 [Pteropus alecto]
Length = 299
Score = 45.4 bits (106), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 74/172 (43%), Gaps = 36/172 (20%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W L S E N
Sbjct: 186 GNDKGISKVVFRSCNG-------RDRLGPLKMSDSTW-LTS---------------EFHN 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
PLA+ + N+ + NV +D P + +++ Y+PNI++ +E +R
Sbjct: 223 PLAVGQYVNNCSNDRPANVCYQEFDVPAVFPIELKQYLPNIAYSFDKESPLR 274
>gi|397514315|ref|XP_003827436.1| PREDICTED: SET domain-containing protein 9 isoform 2 [Pan paniscus]
Length = 271
Score = 45.1 bits (105), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 36/163 (22%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W L S E N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
PLA+ + N+ + NV +D P + +++ Y+PNI++
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAY 265
>gi|297294316|ref|XP_002804415.1| PREDICTED: uncharacterized protein C5orf35-like isoform 2 [Macaca
mulatta]
Length = 271
Score = 45.1 bits (105), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 36/163 (22%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W L S E N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
PLA+ + N+ + NV +D P + +++ Y+PNI++
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPVELKQYLPNIAY 265
>gi|332821466|ref|XP_003310776.1| PREDICTED: SET domain-containing protein 9 isoform 1 [Pan
troglodytes]
gi|426384669|ref|XP_004058880.1| PREDICTED: SET domain-containing protein 9 isoform 2 [Gorilla
gorilla gorilla]
Length = 271
Score = 45.1 bits (105), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 36/163 (22%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W L S E N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
PLA+ + N+ + NV +D P + +++ Y+PNI++
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAY 265
>gi|285397577|ref|NP_001165461.1| SET domain-containing protein 9 isoform 2 [Homo sapiens]
Length = 271
Score = 45.1 bits (105), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 36/163 (22%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W L S E N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
PLA+ + N+ + NV +D P + +++ Y+PNI++
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAY 265
>gi|345327968|ref|XP_001507930.2| PREDICTED: uncharacterized protein C5orf35-like [Ornithorhynchus
anatinus]
Length = 339
Score = 45.1 bits (105), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 67/163 (41%), Gaps = 36/163 (22%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG IY P +++ I NP++ DG +I+
Sbjct: 174 AGKGVFVTKGFVPKGAVVSMYPGTIYQMSEPIFFQSI--------GNPFIFRCIDGVLID 225
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G G + G K SD W S +N
Sbjct: 226 GNDKGISKAVYRSCSGR-------DQLGPFKMSDSTWLTSSV----------------QN 262
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
PLA+ + N+ + NV +D P L + + Y+PNI++
Sbjct: 263 PLAVGQYVNNCSNDKAANVCYQEFDVPELFPIEFKRYLPNIAY 305
>gi|403267548|ref|XP_003925888.1| PREDICTED: SET domain-containing protein 9 [Saimiri boliviensis
boliviensis]
Length = 299
Score = 44.7 bits (104), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 36/163 (22%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 134 AGKGVFVTKGFVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W L S E N
Sbjct: 186 GNNKGISKVVYRSCNG-------RDRLGPLKMSDSTW-LTS---------------EIHN 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
PLA+ + N+ + NV +D P + +++ Y+PNI++
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAY 265
>gi|301785768|ref|XP_002928298.1| PREDICTED: uncharacterized protein C5orf35-like [Ailuropoda
melanoleuca]
gi|281339574|gb|EFB15158.1| hypothetical protein PANDA_018216 [Ailuropoda melanoleuca]
Length = 299
Score = 44.7 bits (104), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 74/172 (43%), Gaps = 36/172 (20%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 134 AGKGVFVTRGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W SE+ N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPFKMSDSTW-------------LTSEIC---N 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
PLA+ + N+ + NV +D P + +++ Y+PNI++ ++ +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAYSCDKQSPLR 274
>gi|145545702|ref|XP_001458535.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426355|emb|CAK91138.1| unnamed protein product [Paramecium tetraurelia]
Length = 569
Score = 44.7 bits (104), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 50/185 (27%), Positives = 77/185 (41%), Gaps = 41/185 (22%)
Query: 145 SQRLKDAIGYTLDLKPSQIPHEEAGQGLFL-CGEAN-----VGAVIAIYPGIIYSPAYYR 198
+QR+ +G+ L+L+ S I H +AG+G+FL AN G ++ PG+IY
Sbjct: 282 TQRVTQNLGFKLNLRESTIDHSDAGEGVFLETLPANKRLVPAGTLLGFVPGLIYD----- 336
Query: 199 YIPGYPRVDAQNP--YLITRYDGTVINAQ-----PW--GSGWDTRELWDGLTLPEIMPNS 249
Y + P R+DG + + P+ G G+D E +I
Sbjct: 337 ---NYKSYSEKTPPDMHFFRFDGPIFDFTSRIYYPYVPGYGFDEYE-------QKISDIK 386
Query: 250 KGAEKGSDQFWKLLSKPMDNKRGGSGSEML--ERRNPLALAHFANHPAKGMVPNVMICPY 307
+ A++ S + NK G G E++ E NP AL H NH + NV +
Sbjct: 387 ELADRSSRSY--------RNKSKG-GYELVKGEDINPYALGHKINHTPRNKSTNVSLIDI 437
Query: 308 DFPLT 312
P T
Sbjct: 438 FVPQT 442
>gi|73949541|ref|XP_849744.1| PREDICTED: uncharacterized protein C5orf35 isoform 1 [Canis lupus
familiaris]
Length = 299
Score = 44.3 bits (103), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 74/172 (43%), Gaps = 36/172 (20%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG +I+
Sbjct: 134 AGKGVFVTRGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W SE+ N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPFKMSDSTW-------------LTSEIC---N 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
PLA+ + N+ + NV +D P + +++ Y+PNI++ ++ +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAYSCDKQSPLR 274
>gi|410948609|ref|XP_003981023.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein 9
[Felis catus]
Length = 307
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 70/163 (42%), Gaps = 36/163 (22%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+A+YPG +Y P +++ I NP++ DG +++
Sbjct: 142 AGKGVFVTRGLVPKGAVVAMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVLVD 193
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G K SD W SE+ N
Sbjct: 194 GNDKGISKVVYRSCNG-------RDRLGPLKMSDSTW-------------LTSEIC---N 230
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
PLA+ + N+ + NV +D P + +++ Y+PNI++
Sbjct: 231 PLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPNIAY 273
>gi|348569018|ref|XP_003470295.1| PREDICTED: uncharacterized protein C5orf35-like [Cavia porcellus]
Length = 299
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 36/163 (22%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DGT+I+
Sbjct: 134 AGRGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGTLID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G G + G K SD W L S E N
Sbjct: 186 GNDKGISKVVYRSCSG-------RDRLGPLKMSDATW-LTS---------------EIHN 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
PLA+ + N+ + NV ++ P + +++ Y+PNI++
Sbjct: 223 PLAIGQYVNNCSNDRAANVCYQEFEVPAVFPIELKQYLPNIAY 265
>gi|355736428|gb|AES11997.1| hypothetical protein [Mustela putorius furo]
Length = 219
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 83/192 (43%), Gaps = 38/192 (19%)
Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGY 203
L +G+++ + S + AG+G+F+ G GAV+++YPG +Y P +++ I
Sbjct: 37 LYKTLGFSVARETSSLI--SAGKGVFVTRGWVPKGAVVSMYPGTVYQKYEPIFFQSI--- 91
Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
NP++ DG +I+ G +G + G K SD W L
Sbjct: 92 -----GNPFIFRCLDGVLIDGNDKGISKVVYRSCNG-------RDRLGPFKMSDSTW-LT 138
Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPN 322
S E NPLA+ + N+ + NV +D P + +++ Y+PN
Sbjct: 139 S---------------EICNPLAVGQYVNNCSNDRAANVCYQEFDVPAVFPIELKQYLPN 183
Query: 323 ISFGNAEEVNMR 334
I++ ++ +R
Sbjct: 184 IAYSCDKQSPLR 195
>gi|327262831|ref|XP_003216227.1| PREDICTED: uncharacterized protein C5orf35-like [Anolis
carolinensis]
Length = 295
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 44/189 (23%), Positives = 83/189 (43%), Gaps = 32/189 (16%)
Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIPGYPRV 206
L + +G+++ S + AG G+F+ G G V+++YPG +Y +Y P + +
Sbjct: 112 LFNTLGFSITRSCSSLV--SAGTGVFVTKGLVPKGTVVSMYPGTVYE----KYEPIFFQ- 164
Query: 207 DAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKP 266
NP++ DG +I+ G ++ ++ + + G K SD FW L+
Sbjct: 165 SIGNPFIFRCIDGVLIDGNDKGI---SKAIYRSCS----KRDQFGPLKMSDAFW--LTNS 215
Query: 267 MDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISF 325
M +NPLA+ + N+ + NV +D P + + Y+PNI +
Sbjct: 216 M--------------QNPLAVGQYVNNCSTEKSANVCYQEFDVPEYFPIEFKQYLPNIKY 261
Query: 326 GNAEEVNMR 334
+ + +R
Sbjct: 262 SHEVQRPLR 270
>gi|351697671|gb|EHB00590.1| hypothetical protein GW7_10244 [Heterocephalus glaber]
Length = 274
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 73/172 (42%), Gaps = 36/172 (20%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV++IYPG IY P +++ I NP++ DG +I+
Sbjct: 109 AGRGVFVTKGLVPKGAVVSIYPGTIYQKYEPIFFQSI--------GNPFIFRCLDGVLID 160
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G G + G K SD W L S E N
Sbjct: 161 GNDKGISKAVYRSCSG-------RDRIGPLKMSDSTW-LTS---------------EICN 197
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
PLA+ + N+ + NV ++ P + +++ Y+PNI++ ++ +R
Sbjct: 198 PLAVGQYVNNCSNDRPANVCYQEFEVPAVFPIELKQYLPNIAYSYDKQSPLR 249
>gi|444725095|gb|ELW65674.1| hypothetical protein TREES_T100005305 [Tupaia chinensis]
Length = 299
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 73/172 (42%), Gaps = 36/172 (20%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIYS---PAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG+G+F+ G GAV+++YPG +Y P +++ I NP++ DG I+
Sbjct: 134 AGKGVFVTKGLVPKGAVVSMYPGTVYQKYEPIFFQSI--------GNPFIFRCLDGVFID 185
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +G + G + SD W L S E N
Sbjct: 186 GNDKGISKVVYRSCNG-------RDRLGPLQMSDSTW-LTS---------------EIHN 222
Query: 284 PLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPNISFGNAEEVNMR 334
PLA+ + N+ + NV +D P + +++ Y+PNI++ ++ +R
Sbjct: 223 PLAVGQYVNNCSNDRAANVCYQEFDVPTVFPIELKQYLPNIAYSYDKQSPLR 274
>gi|449270355|gb|EMC81040.1| hypothetical protein A306_11210, partial [Columba livia]
Length = 297
Score = 43.5 bits (101), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 42/192 (21%), Positives = 86/192 (44%), Gaps = 38/192 (19%)
Query: 148 LKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIY---SPAYYRYIPGY 203
L + +G+++ S + AG G+F+ G G V+++YPG +Y P +++ +
Sbjct: 114 LFNTLGFSITRDRSSLV--SAGTGVFVTKGFVPKGTVVSMYPGTVYRKHEPIFFQSLG-- 169
Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
NP++ DG +I+ G +R ++ + + + G + SD+ W L
Sbjct: 170 ------NPFIFRCIDGILIDGNDKGL---SRSVYRSCSRRDQL----GPFQMSDESW--L 214
Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMICPYDFP-LTEKDMRPYIPN 322
+ + +NPLA+ + N+ + NV +D P +++ Y+PN
Sbjct: 215 TAAL--------------QNPLAVGQYVNNCSHDKAANVCYQEFDVPGYFPVELKQYLPN 260
Query: 323 ISFGNAEEVNMR 334
I + + E ++R
Sbjct: 261 IVYSHDIESHLR 272
>gi|224090581|ref|XP_002187427.1| PREDICTED: SET domain-containing protein 9 [Taeniopygia guttata]
Length = 294
Score = 42.7 bits (99), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 40/175 (22%), Positives = 79/175 (45%), Gaps = 42/175 (24%)
Query: 168 AGQGLFLC-GEANVGAVIAIYPGIIY---SPAYYRYIPGYPRVDAQNPYLITRYDGTVIN 223
AG G+F+ G G V+++YPG +Y P +++ + NP++ DG +I+
Sbjct: 129 AGTGVFVTKGFVPKGTVVSMYPGTVYRKHEPIFFQSLG--------NPFIFRCIDGVLID 180
Query: 224 AQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLLSKPMDNKRGGSGSEMLERRN 283
G +R ++ + + + G + SD+ W L+ + +N
Sbjct: 181 GNDKGL---SRSVYRSCSRRDQL----GPFQMSDESW--LTATL--------------QN 217
Query: 284 PLALAHFANHPAKGMVPNVMICPYD----FPLTEKDMRPYIPNISFGNAEEVNMR 334
PLA+ + N+ + NV +D FP+ +++ Y+PNI + + E ++R
Sbjct: 218 PLAVGQYVNNCSHEKTANVCYQEFDVPGHFPV---ELKQYLPNIVYSHDIESHLR 269
>gi|196011120|ref|XP_002115424.1| hypothetical protein TRIADDRAFT_50702 [Trichoplax adhaerens]
gi|190582195|gb|EDV22269.1| hypothetical protein TRIADDRAFT_50702 [Trichoplax adhaerens]
Length = 255
Score = 42.4 bits (98), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 49/200 (24%), Positives = 86/200 (43%), Gaps = 43/200 (21%)
Query: 145 SQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIPGY 203
S + + G+T+ K ++ + G G+++ G G V+A+YPG IY
Sbjct: 65 SMIMMNVCGFTIARKETK----DKGIGVYVSDGYIPPGKVVALYPGTIYQSH-----DSM 115
Query: 204 PRVDAQNPYLITRYDGTVINAQPWGSGWDTRELWDGLTLPEIMPNSKGAEKGSDQFWKLL 263
+N ++ DGT+I+ + G ++ ++ T +I+ G +D W L
Sbjct: 116 LFQSIRNSFMFRCVDGTIIDGKDHGI---SKAVYMSCTKRDIL----GHHILNDITW-LT 167
Query: 264 SKPMDNKRGGSGSEMLERRNPLALAHFANHPAKGMVPNVMI----CPYDFPLTEKDMRPY 319
S P+ NP+A+ + N+ +K NV+ P D PLT + +
Sbjct: 168 SNPV---------------NPMAIGQYINNQSKNSPANVIYHEINFPEDLPLT---LYRF 209
Query: 320 IPNISFGNA---EEVNMRRF 336
IPN+ F N EE+ R+
Sbjct: 210 IPNVYFRNLTYEEEMGTRKI 229
>gi|340384402|ref|XP_003390701.1| PREDICTED: hypothetical protein LOC100637857 [Amphimedon
queenslandica]
Length = 539
Score = 41.6 bits (96), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 56/128 (43%), Gaps = 13/128 (10%)
Query: 103 ESSAQSNAAPRRSGLSFAVGPTARPTDSPVVPQTRQLT----RTELSQRLKDAIGYTLDL 158
+S A S P+ LS+ + P T T QL R+ + L D +TL++
Sbjct: 35 QSKALSQLKPQLETLSYIIKPLTPLTFKSNFSDTNQLKYQTLRSSAIKSLFDINNFTLNV 94
Query: 159 KPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIPGYPRVDAQNPYLITRY 217
PS + G G+F+ GE G ++A+YPG IY P + N Y++
Sbjct: 95 SPSL---QAEGLGVFVSQGEVREGQLVALYPGTIYLPLQPILLQSL-----HNSYILRCV 146
Query: 218 DGTVINAQ 225
DG I+ +
Sbjct: 147 DGIHIDGR 154
>gi|281212403|gb|EFA86563.1| hypothetical protein PPL_00364 [Polysphondylium pallidum PN500]
Length = 141
Score = 41.2 bits (95), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 24/42 (57%)
Query: 358 ALVATRAICDEEVLLNYRLSNSKRRPVWYSPVDEEEDRRRWS 399
VA + I DEE+ LNYR + + P WY D EE +RRW
Sbjct: 85 GYVAYQPIKDEELFLNYRFNPANPYPAWYHQPDIEEAKRRWG 126
>gi|428179967|gb|EKX48836.1| hypothetical protein GUITHDRAFT_105460 [Guillardia theta CCMP2712]
Length = 225
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 32/49 (65%), Gaps = 3/49 (6%)
Query: 354 LKTLALVATRAICD-EEVLLNYRLSNSKRR--PVWYSPVDEEEDRRRWS 399
L + L++ I D E+LL+YRL+ K++ P WY+PVD + +RRW+
Sbjct: 175 LGHMVLISKSDIPDGSEILLDYRLNPKKKKYWPSWYTPVDVDAAKRRWA 223
>gi|325186224|emb|CCA20725.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 179
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 45/209 (21%), Positives = 88/209 (42%), Gaps = 46/209 (22%)
Query: 1 MAFLFQKFQEAVKTLAKSPTFARDPRQLQFEADMNRLFLYTSYNRLGRDAEEADAEEII- 59
M+ +F++F+ A++ S R+ Q E D + L T + +L A+ I+
Sbjct: 1 MSSIFRRFRLALEGTIPS-------RRRQEEVDTGWVALLTEFRKLSEAKHSKQAKRILV 53
Query: 60 ---------DMAGKASLSDQQQQVLDNIHSQIKRFCLSMDEILLLPEKNVPIE-SSAQSN 109
D+ K S +Q+ Q+ I ++++ S + P+ S SN
Sbjct: 54 LDELLTEASDLIKKKSFKEQKAQIQAEISARLEEIFKSYS--------SRPMHWDSVGSN 105
Query: 110 AAPRRSGLSFAVGPTARPTDSPVVPQTRQLTRTELSQRLKDAIGYTLDLKPSQIPHEEAG 169
A LS + D ++ D +G+ L ++ S + ++AG
Sbjct: 106 KAK----LSISGFSRKHFIDEDILEN--------------DLLGFRLSVEQSNL--DKAG 145
Query: 170 QGLFLCGEANVGAVIAIYPGIIYSPAYYR 198
G+ + G+ G+++A+YPG +Y P +Y+
Sbjct: 146 DGVKVYGKVPPGSLVALYPGTVYLPEHYK 174
>gi|223997052|ref|XP_002288199.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975307|gb|EED93635.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 598
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 60/141 (42%), Gaps = 35/141 (24%)
Query: 283 NPLALAHFANHP-----------AKGMV-PNVMICPYDFP------LTEKDMRPYIPNIS 324
N AL H NHP ++ ++ PN + P ++ T D++ YIPN
Sbjct: 465 NLWALGHIVNHPPAPLKSTESSSSQTLIGPNCVSVPINYTESMFKQHTSSDLKRYIPN-- 522
Query: 325 FGNAEEVNMRRFGSFWFKWGSGSGSSTPVLKTLALVATRAICDEEVLLNYRLS---NSKR 381
E+ + + F V+ + LVA R + DEE+ +YRLS ++K+
Sbjct: 523 ---EYELPPQPWAKNVF------DREKVVMHGMGLVALRDVKDEELFYDYRLSPDESAKK 573
Query: 382 R---PVWYSPVDEEEDRRRWS 399
R P WY D E RWS
Sbjct: 574 RSQYPPWYHVWDAEAMNNRWS 594
>gi|225709596|gb|ACO10644.1| C5orf35 [Caligus rogercresseyi]
Length = 241
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 9/85 (10%)
Query: 145 SQRLKDAIGYTLDLKPSQIPHEEAGQGLFLC-GEANVGAVIAIYPGIIYSPAYYRYIPGY 203
S L GY L + S+ E G G+FL G + G ++ +YPG IY P IP
Sbjct: 61 SSILSSQFGYELCVSGSK---EGKGCGVFLSSGSVHPGQIVGLYPGSIYHPGDPILIP-- 115
Query: 204 PRVDAQNPYLITRYDGTVINAQPWG 228
NPY++ DG +I+ + G
Sbjct: 116 ---SLFNPYILQLSDGVMIDGRSHG 137
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.409
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,567,799,126
Number of Sequences: 23463169
Number of extensions: 294714666
Number of successful extensions: 667880
Number of sequences better than 100.0: 118
Number of HSP's better than 100.0 without gapping: 52
Number of HSP's successfully gapped in prelim test: 66
Number of HSP's that attempted gapping in prelim test: 667597
Number of HSP's gapped (non-prelim): 164
length of query: 399
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 254
effective length of database: 8,957,035,862
effective search space: 2275087108948
effective search space used: 2275087108948
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)