BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 027770
(219 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255549355|ref|XP_002515731.1| conserved hypothetical protein [Ricinus communis]
gi|223545168|gb|EEF46678.1| conserved hypothetical protein [Ricinus communis]
Length = 279
Score = 347 bits (889), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 172/210 (81%), Positives = 186/210 (88%), Gaps = 2/210 (0%)
Query: 8 LSWYQ-DSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATST 66
+S+Y DSRSVGAIAGLA+AIVFTWRM RSPG PQRRQPKRQAP++S SV + Q+N+T
Sbjct: 36 ISFYNLDSRSVGAIAGLAVAIVFTWRMLRSPGGPQRRQPKRQAPTSSTSVVTPQSNSTLM 95
Query: 67 PFGASSSSEDLRTQ-IADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQ 125
P G SSSEDLR Q + D FQPVKPTLGQ+VRQKLSEGRKVTCRLLGVILEES PEELQ
Sbjct: 96 PSGVCSSSEDLRAQNVVDNFFQPVKPTLGQIVRQKLSEGRKVTCRLLGVILEESTPEELQ 155
Query: 126 KQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFC 185
Q TVKSSVLE LLEITKFCDLYLMERVLD+ESE+KVL ALE AGVFTSGGLVKDKVLFC
Sbjct: 156 TQATVKSSVLEVLLEITKFCDLYLMERVLDDESEQKVLAALENAGVFTSGGLVKDKVLFC 215
Query: 186 STEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
STE GR+SFVRQLEPDWHIDTNPEIVSQL+
Sbjct: 216 STENGRSSFVRQLEPDWHIDTNPEIVSQLS 245
>gi|224132998|ref|XP_002327932.1| predicted protein [Populus trichocarpa]
gi|222837341|gb|EEE75720.1| predicted protein [Populus trichocarpa]
Length = 273
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 163/205 (79%), Positives = 178/205 (86%), Gaps = 2/205 (0%)
Query: 13 DSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGA-S 71
DSRSVGA+AGLA+AIVFTWR RS PQRRQPKRQAP+TS+SV +TQ NA S P G
Sbjct: 35 DSRSVGAVAGLAVAIVFTWRWLRSNSGPQRRQPKRQAPTTSSSVVTTQPNAVSIPSGGVC 94
Query: 72 SSSEDLRTQ-IADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTV 130
SSSEDLR Q + D FQ VKP+LGQ+VRQKLSEGRKVTCRLLGVILEES PEELQ Q TV
Sbjct: 95 SSSEDLRVQNVVDEFFQTVKPSLGQIVRQKLSEGRKVTCRLLGVILEESSPEELQSQATV 154
Query: 131 KSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIG 190
+SSVLE LLEITKFCD+YLMERVLD+ESE+KVL ALE AGVFTSGGLVKDKVLFCSTE G
Sbjct: 155 RSSVLEVLLEITKFCDVYLMERVLDDESEQKVLAALENAGVFTSGGLVKDKVLFCSTETG 214
Query: 191 RTSFVRQLEPDWHIDTNPEIVSQLA 215
R+SFVRQLEPDWHIDTNPE++ QLA
Sbjct: 215 RSSFVRQLEPDWHIDTNPEVIFQLA 239
>gi|388498658|gb|AFK37395.1| unknown [Lotus japonicus]
Length = 264
Score = 324 bits (831), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 158/204 (77%), Positives = 177/204 (86%), Gaps = 1/204 (0%)
Query: 13 DSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGASS 72
DS S+GA+AGLA+AI FTWR+ RSPG PQ RQ KRQ ++SN ST +NA+ P G S
Sbjct: 27 DSHSIGAVAGLAIAIFFTWRLLRSPGEPQPRQRKRQTTTSSNPGVSTHSNASVVPSGVRS 86
Query: 73 SSEDLRTQ-IADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVK 131
S D R Q + D LFQPVKPTLGQ+VRQKL+EGRKVTCRLLGVILEE+ PEELQKQVTVK
Sbjct: 87 PSGDSRAQNVVDELFQPVKPTLGQIVRQKLNEGRKVTCRLLGVILEETSPEELQKQVTVK 146
Query: 132 SSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGR 191
S+VLE LLEITKFCDLYLMERVLD+ESEK+VL+ALE AGVFTSGGLVKDKVLFCSTEIGR
Sbjct: 147 STVLEVLLEITKFCDLYLMERVLDDESEKRVLVALEDAGVFTSGGLVKDKVLFCSTEIGR 206
Query: 192 TSFVRQLEPDWHIDTNPEIVSQLA 215
+SFVRQLEPDWHID+NPEI+SQLA
Sbjct: 207 SSFVRQLEPDWHIDSNPEIISQLA 230
>gi|118481139|gb|ABK92522.1| unknown [Populus trichocarpa]
Length = 275
Score = 322 bits (825), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 163/207 (78%), Positives = 178/207 (85%), Gaps = 4/207 (1%)
Query: 13 DSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGA-S 71
DSRSVGA+AGLA+AIVFTWR RS PQRRQPKRQAP+TS+SV +TQ NA S P G
Sbjct: 35 DSRSVGAVAGLAVAIVFTWRWLRSNSGPQRRQPKRQAPTTSSSVVTTQPNAVSIPSGGVC 94
Query: 72 SSSEDLRTQ-IADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQ--KQV 128
SSSEDLR Q + D FQ VKP+LGQ+VRQKLSEGRKVTCRLLGVILEES PEELQ Q
Sbjct: 95 SSSEDLRVQNVVDEFFQTVKPSLGQIVRQKLSEGRKVTCRLLGVILEESSPEELQGQSQA 154
Query: 129 TVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTE 188
TV+SSVLE LLEITKFCD+YLMERVLD+ESE+KVL ALE AGVFTSGGLVKDKVLFCSTE
Sbjct: 155 TVRSSVLEVLLEITKFCDVYLMERVLDDESEQKVLAALENAGVFTSGGLVKDKVLFCSTE 214
Query: 189 IGRTSFVRQLEPDWHIDTNPEIVSQLA 215
GR+SFVRQLEPDWHIDTNPE++ QLA
Sbjct: 215 TGRSSFVRQLEPDWHIDTNPEVIFQLA 241
>gi|225447340|ref|XP_002280526.1| PREDICTED: peroxisome biogenesis protein 22 [Vitis vinifera]
gi|297739312|emb|CBI28963.3| unnamed protein product [Vitis vinifera]
Length = 275
Score = 319 bits (817), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 163/205 (79%), Positives = 179/205 (87%), Gaps = 2/205 (0%)
Query: 13 DSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGASS 72
DSRSVGA+AG A+A++FTWR+ RSP PQRRQPKRQAP+ S+S S+Q+ T P G S
Sbjct: 37 DSRSVGALAGFAVAMIFTWRLLRSPSGPQRRQPKRQAPAPSSSGVSSQSTTTLAPSGVCS 96
Query: 73 SSEDLRTQ-IADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQ-KQVTV 130
SSED R Q + D FQPVKPTLGQ+VRQKLSEGRKVTCRLLGVILEES PEELQ KQ TV
Sbjct: 97 SSEDSRAQNVIDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGVILEESSPEELQAKQATV 156
Query: 131 KSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIG 190
+SSVLE LLEITKFCDLYLMERVLD+ESE+KVLLALE AGVFTSGGLVKDKVLFCSTE G
Sbjct: 157 RSSVLEVLLEITKFCDLYLMERVLDDESERKVLLALEDAGVFTSGGLVKDKVLFCSTETG 216
Query: 191 RTSFVRQLEPDWHIDTNPEIVSQLA 215
R+SFVRQLEPDWHIDTNPE+VSQLA
Sbjct: 217 RSSFVRQLEPDWHIDTNPEVVSQLA 241
>gi|356550004|ref|XP_003543380.1| PREDICTED: peroxisome biogenesis protein 22-like [Glycine max]
Length = 280
Score = 318 bits (815), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 158/206 (76%), Positives = 178/206 (86%), Gaps = 3/206 (1%)
Query: 13 DSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSN-SVG-STQANATSTPFGA 70
D RS+GA+AGLA+AIVFTWR+ RSP QRRQ KRQ PS+SN VG ++ +NA+ P A
Sbjct: 41 DLRSIGAVAGLAVAIVFTWRLLRSPSGSQRRQQKRQGPSSSNPGVGTNSNSNASVVPSDA 100
Query: 71 SSSSEDLRTQ-IADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVT 129
S S+D R Q + D FQPVKPTLGQ+VRQKLSEGRKVTCRLLGVILEES PEELQKQ T
Sbjct: 101 CSPSDDSRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGVILEESSPEELQKQAT 160
Query: 130 VKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEI 189
V+SSVLE LLE+TKFCDLYLMERVLD+ESEK+VL+ALE AGVFTSGGLVKDKVLFCSTE
Sbjct: 161 VRSSVLEVLLEVTKFCDLYLMERVLDDESEKRVLVALEEAGVFTSGGLVKDKVLFCSTEN 220
Query: 190 GRTSFVRQLEPDWHIDTNPEIVSQLA 215
GR+SFVRQLEPDWHID+NPEIV+QLA
Sbjct: 221 GRSSFVRQLEPDWHIDSNPEIVTQLA 246
>gi|356543742|ref|XP_003540319.1| PREDICTED: peroxisome biogenesis protein 22-like [Glycine max]
Length = 278
Score = 317 bits (813), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 155/204 (75%), Positives = 174/204 (85%), Gaps = 1/204 (0%)
Query: 13 DSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGASS 72
D RS+GA+AGLA+AIVFTWR+ RSP QRRQ KRQ PS+SN +T +N + P A S
Sbjct: 41 DIRSIGAVAGLAVAIVFTWRLLRSPSGSQRRQQKRQGPSSSNPGVTTHSNVSDVPSDACS 100
Query: 73 SSEDLRTQ-IADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVK 131
S+D R Q + D FQPVKPTLGQ+VRQKLSEGRKVTCRLLGVILEES PEELQKQ TV+
Sbjct: 101 PSDDSRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGVILEESSPEELQKQATVR 160
Query: 132 SSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGR 191
SSVLE LLE+TKFCDLYLMERVLD+ESEK+VL+ALE AGVFTSG LVKDKVLFCSTE GR
Sbjct: 161 SSVLEVLLEVTKFCDLYLMERVLDDESEKRVLVALEEAGVFTSGRLVKDKVLFCSTENGR 220
Query: 192 TSFVRQLEPDWHIDTNPEIVSQLA 215
+SFVRQLEPDWHID+NPEIV+QLA
Sbjct: 221 SSFVRQLEPDWHIDSNPEIVTQLA 244
>gi|449468331|ref|XP_004151875.1| PREDICTED: peroxisome biogenesis protein 22-like [Cucumis sativus]
gi|449484063|ref|XP_004156773.1| PREDICTED: peroxisome biogenesis protein 22-like [Cucumis sativus]
Length = 276
Score = 313 bits (801), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 155/206 (75%), Positives = 170/206 (82%), Gaps = 3/206 (1%)
Query: 13 DSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQ--APSTSNSVGSTQANATSTPFGA 70
DSRSVGAIAG A+AI+FTWR+ RS QR+QPKRQ APS+S S N S P G
Sbjct: 37 DSRSVGAIAGFAVAIIFTWRLLRSSNGHQRQQPKRQMPAPSSSASNAGLNTNEQSIPSGV 96
Query: 71 SSSSEDLRTQ-IADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVT 129
S SEDLR + D FQPVKPTLGQ+VRQKLSEGRKVTCRLLG+ILEE PE+LQKQ T
Sbjct: 97 CSPSEDLRAHNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGIILEEYSPEDLQKQAT 156
Query: 130 VKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEI 189
V+SSVLE LLEITK+CDLYLME VLD+ESEKKVL ALE AGVFTSGGLVKDKVLFCSTE
Sbjct: 157 VRSSVLEVLLEITKYCDLYLMETVLDDESEKKVLSALEDAGVFTSGGLVKDKVLFCSTEN 216
Query: 190 GRTSFVRQLEPDWHIDTNPEIVSQLA 215
GRTSFVRQLEPDWHID+NPEI++QLA
Sbjct: 217 GRTSFVRQLEPDWHIDSNPEIITQLA 242
>gi|217074268|gb|ACJ85494.1| unknown [Medicago truncatula]
Length = 275
Score = 308 bits (790), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 152/204 (74%), Positives = 173/204 (84%), Gaps = 1/204 (0%)
Query: 13 DSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGASS 72
D R++GA+AGLA+AIVFTWR+ RSP Q RQ KRQ ++SN ST +NA+ P AS
Sbjct: 38 DIRTIGAVAGLAVAIVFTWRLLRSPNPTQGRQRKRQGGASSNPGASTNSNASVVPSDASL 97
Query: 73 SSEDLRTQ-IADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVK 131
S++LR Q + D FQPVKPTLGQ+VRQKLSEGRKVTCRLLGVILEES PEELQ + TV+
Sbjct: 98 PSDELRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGVILEESSPEELQTKATVR 157
Query: 132 SSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGR 191
SSVLE LLEITKFCDLYLM+RVLD+ESEK+VL+ALE AGVFTSGGLVKDKVLF STE GR
Sbjct: 158 SSVLEVLLEITKFCDLYLMDRVLDDESEKRVLVALEEAGVFTSGGLVKDKVLFSSTENGR 217
Query: 192 TSFVRQLEPDWHIDTNPEIVSQLA 215
+SFVRQLEPDWHIDTNPEIV+QLA
Sbjct: 218 SSFVRQLEPDWHIDTNPEIVTQLA 241
>gi|357453503|ref|XP_003597029.1| hypothetical protein MTR_2g088850 [Medicago truncatula]
gi|355486077|gb|AES67280.1| hypothetical protein MTR_2g088850 [Medicago truncatula]
gi|388510392|gb|AFK43262.1| unknown [Medicago truncatula]
Length = 275
Score = 308 bits (789), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 152/204 (74%), Positives = 173/204 (84%), Gaps = 1/204 (0%)
Query: 13 DSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGASS 72
D R++GA+AGLA+AIVFTWR+ RSP Q RQ KRQ ++SN ST +NA+ P AS
Sbjct: 38 DIRTIGAVAGLAVAIVFTWRLLRSPNPTQGRQRKRQGGASSNPGASTNSNASVVPSDASL 97
Query: 73 SSEDLRTQ-IADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVK 131
S++LR Q + D FQPVKPTLGQ+VRQKLSEGRKVTCRLLGVILEES PEELQ + TV+
Sbjct: 98 PSDELRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGVILEESSPEELQTKATVR 157
Query: 132 SSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGR 191
SSVLE LLEITKFCDLYLM+RVLD+ESEK+VL+ALE AGVFTSGGLVKDKVLF STE GR
Sbjct: 158 SSVLEVLLEITKFCDLYLMDRVLDDESEKRVLVALEEAGVFTSGGLVKDKVLFSSTENGR 217
Query: 192 TSFVRQLEPDWHIDTNPEIVSQLA 215
+SFVRQLEPDWHIDTNPEIV+QLA
Sbjct: 218 SSFVRQLEPDWHIDTNPEIVTQLA 241
>gi|118481087|gb|ABK92497.1| unknown [Populus trichocarpa]
Length = 275
Score = 307 bits (787), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 161/207 (77%), Positives = 176/207 (85%), Gaps = 4/207 (1%)
Query: 13 DSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGA-S 71
DSRSVGA+AGLA+AIVFTWR+ RS P+RRQPKRQA +TS+SV +TQ NA S P G
Sbjct: 35 DSRSVGAVAGLAVAIVFTWRLLRSNSGPRRRQPKRQASTTSSSVVTTQPNAVSIPSGGVC 94
Query: 72 SSSEDLRTQ-IADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQ--KQV 128
SSSEDLR Q + D FQPVKPTL Q+VRQKLSEGRKVTCRL VILEES PEELQ Q
Sbjct: 95 SSSEDLRVQNVVDEFFQPVKPTLAQIVRQKLSEGRKVTCRLFEVILEESSPEELQIVSQA 154
Query: 129 TVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTE 188
TV+SSVLE LLEITKFCDLYLMERVLD+ESE+K+L ALE AGVFTSGGLVKDKVLFCSTE
Sbjct: 155 TVRSSVLEVLLEITKFCDLYLMERVLDDESEQKILAALENAGVFTSGGLVKDKVLFCSTE 214
Query: 189 IGRTSFVRQLEPDWHIDTNPEIVSQLA 215
GR+SFVRQLEPDWHIDTNPEI+ QLA
Sbjct: 215 TGRSSFVRQLEPDWHIDTNPEILFQLA 241
>gi|343172840|gb|AEL99123.1| peroxin, partial [Silene latifolia]
Length = 234
Score = 303 bits (775), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 151/204 (74%), Positives = 171/204 (83%), Gaps = 3/204 (1%)
Query: 13 DSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGASS 72
D SVGAIAGLA+A+VFTWR+ RS APQR QPKRQ S+S ++Q NAT GAS
Sbjct: 1 DIISVGAIAGLAIALVFTWRILRS--APQRTQPKRQPNVASSSRVTSQPNATVVSSGASV 58
Query: 73 SSEDLRTQ-IADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVK 131
S ED R Q + D FQP+KPTL Q+VRQKL EGRK+TCRL+GVILEE+ PEELQK+ TV+
Sbjct: 59 SLEDSRAQNVVDDFFQPIKPTLAQIVRQKLCEGRKLTCRLIGVILEETSPEELQKKATVR 118
Query: 132 SSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGR 191
SSVLE LLEITK+CDLY ME VLD+ESEK+VLLALE AG+FTSGGL+KDKVLFCSTEIGR
Sbjct: 119 SSVLEVLLEITKYCDLYFMESVLDDESEKRVLLALEEAGIFTSGGLIKDKVLFCSTEIGR 178
Query: 192 TSFVRQLEPDWHIDTNPEIVSQLA 215
TSFVRQLEPDWHIDTNPEI+SQLA
Sbjct: 179 TSFVRQLEPDWHIDTNPEIISQLA 202
>gi|343172842|gb|AEL99124.1| peroxin, partial [Silene latifolia]
Length = 234
Score = 296 bits (759), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 149/204 (73%), Positives = 169/204 (82%), Gaps = 3/204 (1%)
Query: 13 DSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGASS 72
D SVGAIAGLA+A+VFTWR+ RS APQR QPKRQ S+S ++Q NA GAS
Sbjct: 1 DIISVGAIAGLAIALVFTWRILRS--APQRTQPKRQPNVASSSRVTSQPNAAVVSSGASV 58
Query: 73 SSEDLRTQ-IADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVK 131
S ED R Q + D FQP+KPTL Q+VRQKL EGRKVTC L+GVILEE+ PEELQK+ TV+
Sbjct: 59 SLEDSRAQNVVDDFFQPIKPTLAQIVRQKLCEGRKVTCCLIGVILEETSPEELQKKATVR 118
Query: 132 SSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGR 191
SSVLE LLEITK+CDLY +E VLD+ESEK+VLLALE AG+FTSGGL+KDKVLFCSTEIGR
Sbjct: 119 SSVLEVLLEITKYCDLYFVESVLDDESEKRVLLALEEAGIFTSGGLIKDKVLFCSTEIGR 178
Query: 192 TSFVRQLEPDWHIDTNPEIVSQLA 215
TSFVRQLEPDWHIDTNPEI+SQLA
Sbjct: 179 TSFVRQLEPDWHIDTNPEIISQLA 202
>gi|297835182|ref|XP_002885473.1| hypothetical protein ARALYDRAFT_479706 [Arabidopsis lyrata subsp.
lyrata]
gi|297331313|gb|EFH61732.1| hypothetical protein ARALYDRAFT_479706 [Arabidopsis lyrata subsp.
lyrata]
Length = 282
Score = 288 bits (737), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 147/208 (70%), Positives = 167/208 (80%), Gaps = 5/208 (2%)
Query: 13 DSRSVGAIAGLALAIVFTWRMFRSPGAP-QRRQPKRQAPSTSNSVGSTQANATST----P 67
DSRS+GAIAGLA+A++FTWR R+PG QRRQPKR+ + S + A ++ P
Sbjct: 41 DSRSIGAIAGLAIAVIFTWRAIRTPGEHRQRRQPKRRINNAETSSAAAAAAQSNVASVLP 100
Query: 68 FGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQ 127
+S ++ + D FQPVKPTLGQ+VRQKLSEGRKVTCRLLGVILEE+ PEELQKQ
Sbjct: 101 EFSSPREDNAVQDVVDQFFQPVKPTLGQIVRQKLSEGRKVTCRLLGVILEETSPEELQKQ 160
Query: 128 VTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCST 187
TV+SSVLE LLEITK+ DLYLMERVLD+ESE KVL ALE AGVFTSGGLVKDKVLFCST
Sbjct: 161 ATVRSSVLEVLLEITKYSDLYLMERVLDDESEAKVLQALENAGVFTSGGLVKDKVLFCST 220
Query: 188 EIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
EIGRTSFVRQLEPDWHIDTNPEI +QLA
Sbjct: 221 EIGRTSFVRQLEPDWHIDTNPEISTQLA 248
>gi|42565097|ref|NP_566696.2| peroxin 22 [Arabidopsis thaliana]
gi|75274037|sp|Q9LSX7.1|PEX22_ARATH RecName: Full=Peroxisome biogenesis protein 22; AltName:
Full=Peroxin-22; Short=AtPEX22
gi|11994654|dbj|BAB02849.1| unnamed protein product [Arabidopsis thaliana]
gi|17381010|gb|AAL36317.1| unknown protein [Arabidopsis thaliana]
gi|20465875|gb|AAM20042.1| unknown protein [Arabidopsis thaliana]
gi|332643039|gb|AEE76560.1| peroxin 22 [Arabidopsis thaliana]
Length = 283
Score = 275 bits (703), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 141/209 (67%), Positives = 165/209 (78%), Gaps = 6/209 (2%)
Query: 13 DSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATS------T 66
DSRS+GAIAGLA+A++FTWR R+PG ++R+ ++ + + + A + S
Sbjct: 41 DSRSIGAIAGLAIAVIFTWRAIRTPGEQRQRRQPKRRIHNAETSSAAAAASQSNLASSVA 100
Query: 67 PFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQK 126
P +S ++ + D FQPVKPTLGQ+VRQKLSEGRKVTCRLLGVILEE+ PEELQK
Sbjct: 101 PEVSSPREDNAVQDVVDQFFQPVKPTLGQIVRQKLSEGRKVTCRLLGVILEETSPEELQK 160
Query: 127 QVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCS 186
Q TV+SSVLE LLEITK+ DLYLMERVLD+ESE KVL ALE AGVFTSGGLVKDKVLFCS
Sbjct: 161 QATVRSSVLEVLLEITKYSDLYLMERVLDDESEAKVLQALENAGVFTSGGLVKDKVLFCS 220
Query: 187 TEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
TEIGRTSFVRQLEPDWHIDTNPEI +QLA
Sbjct: 221 TEIGRTSFVRQLEPDWHIDTNPEISTQLA 249
>gi|449493124|ref|XP_004159199.1| PREDICTED: peroxisome biogenesis protein 22-like [Cucumis sativus]
Length = 266
Score = 260 bits (665), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 131/214 (61%), Positives = 160/214 (74%), Gaps = 11/214 (5%)
Query: 4 VLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANA 63
+ ++L ++ RS+ AIAGLALA++F+W++ R P +RRQPKRQ S NS ST +N+
Sbjct: 28 LFSILFQTRNFRSLSAIAGLALAVIFSWKLLRPPSGHRRRQPKRQTSSAGNSDISTSSNS 87
Query: 64 TSTPFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEE 123
S S+D Q KPTL Q+VRQKL GRKVTCRLLG+ILEE PEE
Sbjct: 88 QLITSAVFSPSDDAGAQ---------KPTLEQIVRQKLGGGRKVTCRLLGIILEEKSPEE 138
Query: 124 LQ--KQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDK 181
LQ Q TV+SSV++ LLEIT +CDLYLMERVLD ESE+KV++ALE AGVF SGGL+K+K
Sbjct: 139 LQAVNQATVRSSVVDVLLEITNYCDLYLMERVLDEESERKVIVALEDAGVFASGGLIKEK 198
Query: 182 VLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
VLFCSTE GRTSFVRQLEPDWHID++PEI+SQLA
Sbjct: 199 VLFCSTENGRTSFVRQLEPDWHIDSDPEIISQLA 232
>gi|449453482|ref|XP_004144486.1| PREDICTED: peroxisome biogenesis protein 22-like [Cucumis sativus]
Length = 266
Score = 260 bits (665), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 131/214 (61%), Positives = 160/214 (74%), Gaps = 11/214 (5%)
Query: 4 VLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANA 63
+ ++L ++ RS+ AIAGLALA++F+W++ R P +RRQPKRQ S NS ST +N+
Sbjct: 28 LFSILFQTRNFRSLSAIAGLALAVIFSWKLLRPPSGHRRRQPKRQTSSAGNSDISTSSNS 87
Query: 64 TSTPFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEE 123
S S+D Q KPTL Q+VRQKL GRKVTCRLLG+ILEE PEE
Sbjct: 88 QLITSAVFSPSDDAGAQ---------KPTLEQIVRQKLGGGRKVTCRLLGIILEEKSPEE 138
Query: 124 LQ--KQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDK 181
LQ Q TV+SSV++ LLEIT +CDLYLMERVLD ESE+KV++ALE AGVF SGGL+K+K
Sbjct: 139 LQAVNQATVRSSVVDVLLEITNYCDLYLMERVLDEESERKVIVALEDAGVFASGGLIKEK 198
Query: 182 VLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
VLFCSTE GRTSFVRQLEPDWHID++PEI+SQLA
Sbjct: 199 VLFCSTENGRTSFVRQLEPDWHIDSDPEIISQLA 232
>gi|115460728|ref|NP_001053964.1| Os04g0629000 [Oryza sativa Japonica Group]
gi|113565535|dbj|BAF15878.1| Os04g0629000, partial [Oryza sativa Japonica Group]
Length = 299
Score = 240 bits (612), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 130/206 (63%), Positives = 155/206 (75%), Gaps = 4/206 (1%)
Query: 12 QDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANA-TSTPFGA 70
Q RS +A + + ++F W++ R+P RR +R APS SN+ ++ A TST A
Sbjct: 59 QKLRSFTTLAAITITLIFAWKLLRAPQEQPRRPRRRVAPSPSNTSSRSRPGALTST--DA 116
Query: 71 SSSSEDLRTQIA-DALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVT 129
SSS D R Q A + LFQPV TL Q+VR KLSEGR+VTCRLLGVILEE+ PEELQ VT
Sbjct: 117 CSSSADSRAQEAINQLFQPVNLTLEQLVRHKLSEGRRVTCRLLGVILEETTPEELQNHVT 176
Query: 130 VKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEI 189
V+ SVLE LLEI KFCD+YLMER+LD+ES +KVL AL AG+FT+ GL+KDKVLFCSTE
Sbjct: 177 VRPSVLEVLLEIAKFCDVYLMERILDDESGEKVLSALSEAGLFTNSGLIKDKVLFCSTEN 236
Query: 190 GRTSFVRQLEPDWHIDTNPEIVSQLA 215
GRTSFVRQLEPDWHIDT+PEIV QLA
Sbjct: 237 GRTSFVRQLEPDWHIDTSPEIVHQLA 262
>gi|222629610|gb|EEE61742.1| hypothetical protein OsJ_16268 [Oryza sativa Japonica Group]
Length = 262
Score = 238 bits (606), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 130/207 (62%), Positives = 156/207 (75%), Gaps = 6/207 (2%)
Query: 12 QDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANA-TSTPFGA 70
Q RS +A + + ++F W++ R+P RR +R APS SN+ ++ A TST A
Sbjct: 50 QKLRSFTTLAAITITLIFAWKLLRAPQEQPRRPRRRVAPSPSNTSSRSRPGALTST--DA 107
Query: 71 SSSSEDLRTQIA-DALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVT 129
SSS D R Q A + LFQP+ TL Q+VR KLSEGR+VTCRLLGVILEE+ PEELQ VT
Sbjct: 108 CSSSADSRAQEAINQLFQPL--TLEQLVRHKLSEGRRVTCRLLGVILEETTPEELQNHVT 165
Query: 130 VKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEI 189
V+ SVLE LLEI KFCD+YLMER+LD+ES +KVL AL AG+FT+ GL+KDKVLFCSTE
Sbjct: 166 VRPSVLEVLLEIAKFCDVYLMERILDDESGEKVLSALSEAGLFTNSGLIKDKVLFCSTEN 225
Query: 190 GRTSFVRQLEPDWHIDTNPEIVSQLAV 216
GRTSFVRQLEPDWHIDT+PEIV QLAV
Sbjct: 226 GRTSFVRQLEPDWHIDTSPEIVHQLAV 252
>gi|242074426|ref|XP_002447149.1| hypothetical protein SORBIDRAFT_06g029430 [Sorghum bicolor]
gi|241938332|gb|EES11477.1| hypothetical protein SORBIDRAFT_06g029430 [Sorghum bicolor]
Length = 289
Score = 238 bits (606), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 127/204 (62%), Positives = 154/204 (75%), Gaps = 1/204 (0%)
Query: 12 QDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGAS 71
Q RS+ +A +++ ++F W+M R+P RR +R APS+SN+ +Q A +T A
Sbjct: 50 QKLRSLTTLAAISITLLFAWKMLRAPQEQARRPRRRAAPSSSNTSTRSQPGALTTT-DAC 108
Query: 72 SSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVK 131
SS++ + + LFQPV TL Q+VR KLSEGR+ TCRLLGVILEE+ PEELQ VTVK
Sbjct: 109 SSADSRAHEAVNQLFQPVNLTLEQLVRHKLSEGRRFTCRLLGVILEETSPEELQNHVTVK 168
Query: 132 SSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGR 191
SV+E LLEI KFCD+YLMERVLD+ESE KVLLAL AG+F GGL+KDKVLFCSTE GR
Sbjct: 169 PSVVEVLLEIAKFCDVYLMERVLDDESEGKVLLALSEAGLFAGGGLIKDKVLFCSTENGR 228
Query: 192 TSFVRQLEPDWHIDTNPEIVSQLA 215
TSFVRQLEPDWHIDT+PEIV QLA
Sbjct: 229 TSFVRQLEPDWHIDTSPEIVHQLA 252
>gi|357166123|ref|XP_003580606.1| PREDICTED: peroxisome biogenesis protein 22-like [Brachypodium
distachyon]
Length = 288
Score = 231 bits (588), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 124/204 (60%), Positives = 155/204 (75%), Gaps = 1/204 (0%)
Query: 12 QDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGAS 71
Q RS+ +A +A+ +VF W++ R+P RR +R APS+SN+ ++ A ++ A
Sbjct: 49 QKLRSLTTLAAIAITLVFAWKLLRAPQEQPRRPRRRVAPSSSNTSSRSRPGALAST-DAC 107
Query: 72 SSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVK 131
SS++ + + LFQPV TL Q+VR KLSEGR+VTCRLLGVILEE+ PEELQ TV+
Sbjct: 108 SSADSRAHEAINQLFQPVNLTLEQLVRHKLSEGRRVTCRLLGVILEETTPEELQSHATVR 167
Query: 132 SSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGR 191
SSVLE LLEI KFCD+YLME +LD+ESE+KVL AL AG+FTSGG VKDKVLFCSTE GR
Sbjct: 168 SSVLEVLLEIAKFCDVYLMECILDDESEEKVLSALSEAGLFTSGGFVKDKVLFCSTENGR 227
Query: 192 TSFVRQLEPDWHIDTNPEIVSQLA 215
TSFVRQLEPDWHID++PEIV QL+
Sbjct: 228 TSFVRQLEPDWHIDSSPEIVHQLS 251
>gi|326494736|dbj|BAJ94487.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 286
Score = 231 bits (588), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 128/222 (57%), Positives = 160/222 (72%), Gaps = 11/222 (4%)
Query: 4 VLTLLSWYQDS----------RSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTS 53
++ +LS Y D RS+ +A +A+ IVF W++ R+P RR +R APS+S
Sbjct: 29 IVDVLSRYSDRLPFDLDRQKLRSLTTLAAIAITIVFAWKLLRAPLEQPRRPRRRDAPSSS 88
Query: 54 NSVGSTQANATSTPFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLG 113
N+ ++ A P A SS++ + + LFQPV TL Q+VR KLSEGR+VTCRLLG
Sbjct: 89 NTSSRSRPGALVGP-DACSSADSRAHEAINQLFQPVNLTLEQLVRHKLSEGRRVTCRLLG 147
Query: 114 VILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFT 173
VILEE+ PEELQ VTV+ SVLE LLEI K CD+YLME VLD+ESE+KVL AL AG+FT
Sbjct: 148 VILEETTPEELQNHVTVRPSVLEVLLEIAKVCDVYLMEHVLDDESEEKVLSALSEAGLFT 207
Query: 174 SGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
SGGLV++KVLFCSTE GRTSFVRQLEPDWHID++PEIV QL+
Sbjct: 208 SGGLVREKVLFCSTENGRTSFVRQLEPDWHIDSSPEIVHQLS 249
>gi|116781530|gb|ABK22140.1| unknown [Picea sitchensis]
Length = 285
Score = 228 bits (580), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 115/218 (52%), Positives = 160/218 (73%), Gaps = 4/218 (1%)
Query: 1 MFWVLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQ 60
M ++ LL ++++ S+GA+AG ALAIVFTW+ +SPG RR KR+ T+NS ++Q
Sbjct: 24 MSNLIALLINHKNAGSIGALAGFALAIVFTWKYLKSPGGHHRRLEKRRNSPTANSDDNSQ 83
Query: 61 ANAT---STPFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILE 117
+ S+ SSS + +T ++ F ++ +L QVVRQ+L+ GRK+TC+LLGV+LE
Sbjct: 84 STGETVLSSNVCQPSSSLN-QTAVSPQEFTTMQLSLAQVVRQQLNGGRKITCQLLGVVLE 142
Query: 118 ESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGL 177
ES PEEL + V+ +V++ LLEI K CDLYL+ RVLD++SE+KV+ AL+ GVFT GG+
Sbjct: 143 ESTPEELLEHAVVRPTVVDVLLEIAKGCDLYLIARVLDDDSEEKVISALDAVGVFTIGGM 202
Query: 178 VKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
++KVLFCSTE GR+SFVRQLEPDWHIDT+ EI+SQLA
Sbjct: 203 NRNKVLFCSTETGRSSFVRQLEPDWHIDTSAEIISQLA 240
>gi|148908950|gb|ABR17579.1| unknown [Picea sitchensis]
Length = 242
Score = 227 bits (579), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 116/220 (52%), Positives = 160/220 (72%), Gaps = 6/220 (2%)
Query: 1 MFWVLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQ 60
M ++ LL ++++ S+GA+AG ALAIVFTW+ +SPG RR KR+ +NS ++Q
Sbjct: 24 MSNLIALLINHKNAGSIGALAGFALAIVFTWKYLKSPGGHHRRLEKRRNSPPANSDDNSQ 83
Query: 61 ANA----TSTPFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVIL 116
+ +ST SSS +T ++ F ++ +L QVVRQ+L+ GRK+TC+LLGV+L
Sbjct: 84 STGETVLSSTVCQPSSSLN--QTAVSPQEFTTMQLSLAQVVRQQLNGGRKITCQLLGVVL 141
Query: 117 EESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGG 176
EES PEEL + V+ +V++ LLEI K CDLYL+ RVLD++SE+KV+ AL+ GVFT GG
Sbjct: 142 EESTPEELLEHAVVRPTVVDVLLEIAKGCDLYLIARVLDDDSEEKVISALDAVGVFTIGG 201
Query: 177 LVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLAV 216
+ ++KVLFCSTE GR+SFVRQLEPDWHIDT+ EI+SQLAV
Sbjct: 202 MNRNKVLFCSTETGRSSFVRQLEPDWHIDTSAEIISQLAV 241
>gi|226503855|ref|NP_001143443.1| hypothetical protein [Zea mays]
gi|195620568|gb|ACG32114.1| hypothetical protein [Zea mays]
gi|224034339|gb|ACN36245.1| unknown [Zea mays]
gi|238010548|gb|ACR36309.1| unknown [Zea mays]
gi|413919612|gb|AFW59544.1| hypothetical protein ZEAMMB73_475591 [Zea mays]
Length = 287
Score = 225 bits (574), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 120/204 (58%), Positives = 151/204 (74%), Gaps = 1/204 (0%)
Query: 12 QDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGAS 71
Q RS+ +A +++ ++F W+M R+P Q R+P+R+A +S++ + T A
Sbjct: 48 QKLRSLTTLAAISITLLFAWKMLRAP-QEQPRRPRRRAAPSSSNTSTRSRPGALTVADAC 106
Query: 72 SSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVK 131
SS++ + + LFQPV TL Q+VR KLSEGR+ TCRLLGVILEE+ PEELQ VTVK
Sbjct: 107 SSADSRAHEAVNQLFQPVNLTLEQLVRHKLSEGRRFTCRLLGVILEETAPEELQNHVTVK 166
Query: 132 SSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGR 191
SV+E LLEI KFCD+YLMER+LD+ESE KVL AL AG+F GGL+KDKVLFCSTE GR
Sbjct: 167 PSVVEVLLEIAKFCDVYLMERILDDESEGKVLSALSEAGLFGGGGLIKDKVLFCSTENGR 226
Query: 192 TSFVRQLEPDWHIDTNPEIVSQLA 215
TSFVRQLEPDWHIDT+PE+V QLA
Sbjct: 227 TSFVRQLEPDWHIDTSPEVVHQLA 250
>gi|90399290|emb|CAJ86212.1| H0323C08.2 [Oryza sativa Indica Group]
gi|116312062|emb|CAJ86426.1| H0303G06.15 [Oryza sativa Indica Group]
Length = 248
Score = 221 bits (562), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 120/207 (57%), Positives = 143/207 (69%), Gaps = 20/207 (9%)
Query: 12 QDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANA-TSTPFGA 70
Q RS +A + + ++F W++ R+P RR +R APS SN+ ++ A TST A
Sbjct: 50 QKLRSFTTLAAITITLIFAWKLLRAPQEQPRRPHRRVAPSPSNTSSRSRPGALTST--DA 107
Query: 71 SSSSEDLRTQIA-DALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVT 129
SSS D R Q A + LFQPV VTCRLLGVILEE+ PEELQ VT
Sbjct: 108 CSSSADSRAQEAINQLFQPVN----------------VTCRLLGVILEETTPEELQNHVT 151
Query: 130 VKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEI 189
V+ SVLE LLEI KFCD+YLMER+LD+ES +KVL AL AG+FT+ GL+KDKVLFCSTE
Sbjct: 152 VRPSVLEVLLEIAKFCDVYLMERILDDESGEKVLSALSEAGLFTNSGLIKDKVLFCSTEN 211
Query: 190 GRTSFVRQLEPDWHIDTNPEIVSQLAV 216
GRTSFVRQLEPDWHIDT+PEIV QLAV
Sbjct: 212 GRTSFVRQLEPDWHIDTSPEIVHQLAV 238
>gi|226500778|ref|NP_001143704.1| uncharacterized protein LOC100276441 [Zea mays]
gi|195625084|gb|ACG34372.1| hypothetical protein [Zea mays]
gi|414585278|tpg|DAA35849.1| TPA: hypothetical protein ZEAMMB73_262425 [Zea mays]
Length = 283
Score = 218 bits (554), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 119/204 (58%), Positives = 148/204 (72%), Gaps = 1/204 (0%)
Query: 12 QDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGAS 71
Q RS+ +A +++ ++F W+M R P R +R APS+SN+ ++ A +T A
Sbjct: 44 QKLRSLTTLAAISITLLFAWKMLRGPQEQPPRPRRRIAPSSSNTSTRSRPGALTTT-DAC 102
Query: 72 SSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVK 131
SS++ + + LFQPV TL Q+V+ KLS GR+ TCRLLGVILEE+ PEELQ VTVK
Sbjct: 103 SSADSRAHEAVNQLFQPVNLTLEQLVKHKLSGGRRFTCRLLGVILEETAPEELQNHVTVK 162
Query: 132 SSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGR 191
SV+E LLEI K CD+YLME VLD+ESE KVL AL AG+F GGL+KDKVLFCST+ GR
Sbjct: 163 PSVVEVLLEIAKICDVYLMECVLDDESEGKVLSALSEAGLFAGGGLIKDKVLFCSTDNGR 222
Query: 192 TSFVRQLEPDWHIDTNPEIVSQLA 215
TSFVRQLEPDWHIDT+PEIV QLA
Sbjct: 223 TSFVRQLEPDWHIDTSPEIVHQLA 246
>gi|414585277|tpg|DAA35848.1| TPA: hypothetical protein ZEAMMB73_262425 [Zea mays]
Length = 281
Score = 213 bits (541), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 118/204 (57%), Positives = 148/204 (72%), Gaps = 3/204 (1%)
Query: 12 QDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGAS 71
Q RS+ +A +++ ++F W+M R P R +R APS+SN+ ++ A +T A
Sbjct: 44 QKLRSLTTLAAISITLLFAWKMLRGPQEQPPRPRRRIAPSSSNTSTRSRPGALTTT-DAC 102
Query: 72 SSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVK 131
SS++ + + LFQP+ TL Q+V+ KLS GR+ TCRLLGVILEE+ PEELQ VTVK
Sbjct: 103 SSADSRAHEAVNQLFQPL--TLEQLVKHKLSGGRRFTCRLLGVILEETAPEELQNHVTVK 160
Query: 132 SSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGR 191
SV+E LLEI K CD+YLME VLD+ESE KVL AL AG+F GGL+KDKVLFCST+ GR
Sbjct: 161 PSVVEVLLEIAKICDVYLMECVLDDESEGKVLSALSEAGLFAGGGLIKDKVLFCSTDNGR 220
Query: 192 TSFVRQLEPDWHIDTNPEIVSQLA 215
TSFVRQLEPDWHIDT+PEIV QLA
Sbjct: 221 TSFVRQLEPDWHIDTSPEIVHQLA 244
>gi|21618311|gb|AAM67361.1| unknown [Arabidopsis thaliana]
Length = 151
Score = 211 bits (538), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 103/117 (88%), Positives = 109/117 (93%)
Query: 99 QKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNES 158
QKLSEGRKVTCRLLGVILEE+ PEELQKQ TV+SSVLE LLEITK+ DLYLME+VLD+ES
Sbjct: 1 QKLSEGRKVTCRLLGVILEETSPEELQKQATVRSSVLEVLLEITKYSDLYLMEKVLDDES 60
Query: 159 EKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
E KVL ALE AGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEI +QLA
Sbjct: 61 EAKVLQALENAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEISTQLA 117
>gi|218195643|gb|EEC78070.1| hypothetical protein OsI_17536 [Oryza sativa Indica Group]
Length = 186
Score = 203 bits (516), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 98/125 (78%), Positives = 110/125 (88%)
Query: 92 TLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLME 151
TL Q+VR KLSEGR+VTCRLLGVILEE+ PEELQ VTV+ SVLE LLEI KFCD+YLME
Sbjct: 52 TLEQLVRHKLSEGRRVTCRLLGVILEETTPEELQNHVTVRPSVLEVLLEIAKFCDVYLME 111
Query: 152 RVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIV 211
R+LD+ES +KVL AL AG+FT+ GL+KDKVLFCSTE GRTSFVRQLEPDWHIDT+PEIV
Sbjct: 112 RILDDESGEKVLSALSEAGLFTNSGLIKDKVLFCSTENGRTSFVRQLEPDWHIDTSPEIV 171
Query: 212 SQLAV 216
QLAV
Sbjct: 172 HQLAV 176
>gi|224093214|ref|XP_002309836.1| predicted protein [Populus trichocarpa]
gi|222852739|gb|EEE90286.1| predicted protein [Populus trichocarpa]
Length = 153
Score = 202 bits (515), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 101/119 (84%), Positives = 107/119 (89%), Gaps = 2/119 (1%)
Query: 99 QKLSEGRKVTCRLLGVILEESCPEELQ--KQVTVKSSVLEALLEITKFCDLYLMERVLDN 156
QKLSEGRKVTCRL VILEES PEELQ Q TV+SSVLE LLEITKFCDLYLMERVLD+
Sbjct: 1 QKLSEGRKVTCRLFEVILEESSPEELQIVSQATVRSSVLEVLLEITKFCDLYLMERVLDD 60
Query: 157 ESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
ESE+K+L ALE AGVFTSGGLVKDKVLFCSTE GR+SFVRQLEPDWHIDTNPEI+ QLA
Sbjct: 61 ESEQKILAALENAGVFTSGGLVKDKVLFCSTETGRSSFVRQLEPDWHIDTNPEILFQLA 119
>gi|359359136|gb|AEV41042.1| hypothetical protein [Oryza minuta]
Length = 252
Score = 200 bits (508), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 104/213 (48%), Positives = 142/213 (66%), Gaps = 21/213 (9%)
Query: 4 VLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRS-PGAPQRRQPKRQAPSTSNSVGSTQAN 62
++ LL ++ + S+GA+AG A+A+ F W+ RS P P+R + KR P + + A+
Sbjct: 28 LVGLLFHHKSAGSLGAVAGFAIAVAFAWKFLRSSPARPRRPESKR--PLVGRAAPDSAAH 85
Query: 63 ATSTPFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPE 122
P + KPT Q+V ++L RKVTC+LLGV+ EE PE
Sbjct: 86 DAVEPANSG------------------KPTTRQIVAKQLGGCRKVTCQLLGVVFEEKTPE 127
Query: 123 ELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKV 182
ELQ TV+ SV+E LLEI KFCDLYLME VLD++SE+ LLALE+AG+F +GGL+K+KV
Sbjct: 128 ELQTHATVRPSVVELLLEIAKFCDLYLMETVLDDKSEENALLALESAGLFKTGGLMKEKV 187
Query: 183 LFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
LFCS+E+GRTSFVRQLE D+HIDT+ +I+SQL+
Sbjct: 188 LFCSSEVGRTSFVRQLEADFHIDTSLDIISQLS 220
>gi|39546241|emb|CAE04250.3| OSJNBa0089N06.11 [Oryza sativa Japonica Group]
Length = 170
Score = 183 bits (465), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 88/113 (77%), Positives = 99/113 (87%)
Query: 104 GRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVL 163
R+VTCRLLGVILEE+ PEELQ VTV+ SVLE LLEI KFCD+YLMER+LD+ES +KVL
Sbjct: 48 DRQVTCRLLGVILEETTPEELQNHVTVRPSVLEVLLEIAKFCDVYLMERILDDESGEKVL 107
Query: 164 LALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLAV 216
AL AG+FT+ GL+KDKVLFCSTE GRTSFVRQLEPDWHIDT+PEIV QLAV
Sbjct: 108 SALSEAGLFTNSGLIKDKVLFCSTENGRTSFVRQLEPDWHIDTSPEIVHQLAV 160
>gi|356544834|ref|XP_003540852.1| PREDICTED: LOW QUALITY PROTEIN: peroxisome biogenesis protein
22-like [Glycine max]
Length = 150
Score = 183 bits (465), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 88/113 (77%), Positives = 100/113 (88%)
Query: 103 EGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKV 162
E +VTCR LGVILE+S EEL KQ T++SSVLE LLEITKFCDLYLM+RVLD+ESEK+V
Sbjct: 4 EELEVTCRHLGVILEKSNTEELXKQATIRSSVLEVLLEITKFCDLYLMDRVLDDESEKRV 63
Query: 163 LLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
L+ LE +FTSGGLVKDKVLFCSTE GR+SFVRQLEPDWHID+NPEI+SQLA
Sbjct: 64 LVVLEDVEIFTSGGLVKDKVLFCSTENGRSSFVRQLEPDWHIDSNPEIISQLA 116
>gi|70663949|emb|CAE54553.2| OSJNBb0004A17.13 [Oryza sativa Japonica Group]
Length = 250
Score = 183 bits (464), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 103/210 (49%), Positives = 139/210 (66%), Gaps = 18/210 (8%)
Query: 7 LLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATST 66
LL ++ + S+GA+AG A+A+VF W+ RS A RR P + P + + A +
Sbjct: 31 LLFHHKSAGSLGAVAGFAIAVVFAWKFLRSSPARPRRPPGPKRPLVGPAAPDSAARDAAE 90
Query: 67 PFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQK 126
P P K T Q+V ++L RKVTC+LLGV+ EE P+ELQ
Sbjct: 91 PA------------------NPGKLTTRQIVVKRLGGCRKVTCQLLGVVFEEITPQELQT 132
Query: 127 QVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCS 186
TV+ SV+E LLEI K+CDLYLME VLD++SE+ LLALE+AG+F +GGL+K+KVLFCS
Sbjct: 133 HATVRPSVVELLLEIAKYCDLYLMETVLDDKSEENALLALESAGLFKTGGLMKEKVLFCS 192
Query: 187 TEIGRTSFVRQLEPDWHIDTNPEIVSQLAV 216
+E+GRTSFVRQLE D+HIDT+ +I+SQL+V
Sbjct: 193 SEVGRTSFVRQLEADFHIDTSLDIISQLSV 222
>gi|357166686|ref|XP_003580800.1| PREDICTED: peroxisome biogenesis protein 22-like [Brachypodium
distachyon]
Length = 256
Score = 182 bits (462), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 101/212 (47%), Positives = 140/212 (66%), Gaps = 17/212 (8%)
Query: 4 VLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANA 63
++ LL ++ + S GA+AG A+A+VF W+ RS R+P +S + A
Sbjct: 30 LVALLFRHKSAGSFGAVAGFAIAVVFAWKFLRSSPPRPPRRPAAPKRPAPSSAAAPAAAD 89
Query: 64 TSTPFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEE 123
+ P G S K T Q+V ++LS RKVTC+LLGV+ EE+ PEE
Sbjct: 90 AAEPVGDSG-----------------KLTTRQIVSRRLSGCRKVTCQLLGVVFEENTPEE 132
Query: 124 LQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVL 183
LQK TV+ SV++ LLEI+K CD YLME VLD++SE+ L+ALE+AG+F +GGL+K+KVL
Sbjct: 133 LQKHATVRPSVVDLLLEISKCCDFYLMETVLDDKSEENALMALESAGLFRTGGLMKEKVL 192
Query: 184 FCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
FCS+E+GRTSFVRQLE D+HIDT+ +I+SQL+
Sbjct: 193 FCSSEVGRTSFVRQLEADFHIDTSLDIISQLS 224
>gi|326514842|dbj|BAJ99782.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 255
Score = 182 bits (461), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 98/219 (44%), Positives = 140/219 (63%), Gaps = 31/219 (14%)
Query: 4 VLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANA 63
++ LL ++ + S+GA+AG A+A+VF W+ RS S+ A
Sbjct: 29 LVALLFRHKSAGSLGAVAGFAIAVVFAWKFLRS--------------SSPARPRRPAAPK 74
Query: 64 TSTPFGASSSSEDLRTQIADALFQPVKP-------TLGQVVRQKLSEGRKVTCRLLGVIL 116
+P G+++ + D P +P T Q+V ++L R+VTC+L+G++
Sbjct: 75 RPSPAGSAAPAPD----------DPAQPAGDSGTLTTRQIVARRLGGCRRVTCQLIGIVF 124
Query: 117 EESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGG 176
EE PEELQ TVK SV+E LLEI+K+CD YLME VLD++SE+ L+ALE AG+F +GG
Sbjct: 125 EEKTPEELQNHATVKPSVVELLLEISKYCDFYLMETVLDDKSEENALMALENAGLFKTGG 184
Query: 177 LVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
L+K+KVLFCS+E+GRTSFVRQLE D+HIDT+ EI+SQL+
Sbjct: 185 LMKEKVLFCSSEVGRTSFVRQLESDFHIDTSLEIISQLS 223
>gi|222629759|gb|EEE61891.1| hypothetical protein OsJ_16589 [Oryza sativa Japonica Group]
Length = 666
Score = 181 bits (459), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 98/215 (45%), Positives = 136/215 (63%), Gaps = 18/215 (8%)
Query: 4 VLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANA 63
++ LL ++ + S+GA+AG A+A+VF W+ ++
Sbjct: 28 LVGLLFHHKSAGSLGAVAGFAIAVVFAWKFL-------------RSSPARPRRPPGPKRP 74
Query: 64 TSTPFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEE 123
P S++ D A P K T Q+V ++L RKVTC+LLGV+ EE P+E
Sbjct: 75 LVGPAAPDSAARD-----AAEPANPGKLTTRQIVVKRLGGCRKVTCQLLGVVFEEITPQE 129
Query: 124 LQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVL 183
LQ TV+ SV+E LLEI K+CDLYLME VLD++SE+ LLALE+AG+F +GGL+K+KVL
Sbjct: 130 LQTHATVRPSVVELLLEIAKYCDLYLMETVLDDKSEENALLALESAGLFKTGGLMKEKVL 189
Query: 184 FCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLAVHL 218
FCS+E+GRTSFVRQLE D+HIDT+ +I+SQL+ L
Sbjct: 190 FCSSEVGRTSFVRQLEADFHIDTSLDIISQLSATL 224
>gi|242074742|ref|XP_002447307.1| hypothetical protein SORBIDRAFT_06g032570 [Sorghum bicolor]
gi|241938490|gb|EES11635.1| hypothetical protein SORBIDRAFT_06g032570 [Sorghum bicolor]
Length = 250
Score = 181 bits (459), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 100/212 (47%), Positives = 138/212 (65%), Gaps = 22/212 (10%)
Query: 4 VLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANA 63
+ LL ++ + S+GA+AG A+A+VF W+ R +R P A + +V
Sbjct: 29 IFALLFSHKGAGSLGAVAGFAIAVVFAWKFLRPRRPAPKRPPPTPAADPAATVPDA---- 84
Query: 64 TSTPFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEE 123
+ P G S K ++V ++L RKVTC+LLGV+ EE PEE
Sbjct: 85 -AEPIGDSG-----------------KVVTREIVVKRLRGCRKVTCQLLGVVFEEKTPEE 126
Query: 124 LQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVL 183
LQ+ TV+ SV+E LLEI+++CDLYLME VLD++SE+ L+ALET+G+F +GGL+K+KVL
Sbjct: 127 LQEHATVRPSVVELLLEISRYCDLYLMETVLDDKSEENALMALETSGLFRTGGLMKEKVL 186
Query: 184 FCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
FCSTE+GRTSFVRQLE D+HIDTN +IVSQL+
Sbjct: 187 FCSTEVGRTSFVRQLESDFHIDTNLDIVSQLS 218
>gi|115461256|ref|NP_001054228.1| Os04g0672800 [Oryza sativa Japonica Group]
gi|90265233|emb|CAH67768.1| H0322F07.5 [Oryza sativa Indica Group]
gi|113565799|dbj|BAF16142.1| Os04g0672800 [Oryza sativa Japonica Group]
gi|215687054|dbj|BAG90900.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 253
Score = 181 bits (458), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 102/209 (48%), Positives = 138/209 (66%), Gaps = 18/209 (8%)
Query: 7 LLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATST 66
LL ++ + S+GA+AG A+A+VF W+ RS A RR P + P + + A +
Sbjct: 31 LLFHHKSAGSLGAVAGFAIAVVFAWKFLRSSPARPRRPPGPKRPLVGPAAPDSAARDAAE 90
Query: 67 PFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQK 126
P P K T Q+V ++L RKVTC+LLGV+ EE P+ELQ
Sbjct: 91 PA------------------NPGKLTTRQIVVKRLGGCRKVTCQLLGVVFEEITPQELQT 132
Query: 127 QVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCS 186
TV+ SV+E LLEI K+CDLYLME VLD++SE+ LLALE+AG+F +GGL+K+KVLFCS
Sbjct: 133 HATVRPSVVELLLEIAKYCDLYLMETVLDDKSEENALLALESAGLFKTGGLMKEKVLFCS 192
Query: 187 TEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
+E+GRTSFVRQLE D+HIDT+ +I+SQL+
Sbjct: 193 SEVGRTSFVRQLEADFHIDTSLDIISQLS 221
>gi|218195806|gb|EEC78233.1| hypothetical protein OsI_17880 [Oryza sativa Indica Group]
Length = 253
Score = 181 bits (458), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 102/212 (48%), Positives = 140/212 (66%), Gaps = 18/212 (8%)
Query: 4 VLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANA 63
++ LL ++ + S+GA+AG A+A+VF W+ RS A RR P + P + + A
Sbjct: 28 LVGLLFHHKSAGSLGAVAGFAIAVVFAWKFLRSSPARPRRPPGPKRPLVGPAAPDSAARD 87
Query: 64 TSTPFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEE 123
+ P P K T Q+V ++L RKVTC+LLGV+ EE P+E
Sbjct: 88 AAEPA------------------NPGKLTTRQIVVKRLGGCRKVTCQLLGVVFEEITPQE 129
Query: 124 LQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVL 183
LQ TV+ SV+E LLEI K+CDLYLME VLD++SE+ LLALE+AG+F +GGL+K+KVL
Sbjct: 130 LQTHATVRPSVVELLLEIAKYCDLYLMETVLDDKSEENALLALESAGLFKTGGLMKEKVL 189
Query: 184 FCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
FCS+E+GRTSFVRQLE D+HIDT+ +I+SQL+
Sbjct: 190 FCSSEVGRTSFVRQLEADFHIDTSLDIISQLS 221
>gi|414584841|tpg|DAA35412.1| TPA: hypothetical protein ZEAMMB73_564779 [Zea mays]
Length = 238
Score = 179 bits (455), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 99/216 (45%), Positives = 143/216 (66%), Gaps = 22/216 (10%)
Query: 4 VLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANA 63
++ LL ++ + S+GA+AG A+A+VF W+ R R+ ++ P T + +
Sbjct: 29 IVALLFSHKGAGSLGAVAGFAIAVVFAWKFLRPR-----RRAPKRPPPTPAAAPAATVPD 83
Query: 64 TSTPFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEE 123
+ P G S K ++V ++L RKVTC+LLGV+ EE+ PEE
Sbjct: 84 AAEPIGDSG-----------------KVVTREIVVKRLKGCRKVTCQLLGVVFEETIPEE 126
Query: 124 LQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVL 183
LQK TV+ SV+E LLEI+++CDLYLME V+D++SE+ L+ALET+G+F +GGL+K+KVL
Sbjct: 127 LQKHATVRPSVVELLLEISRYCDLYLMETVIDDKSEENALMALETSGLFRTGGLMKEKVL 186
Query: 184 FCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLAVHLL 219
FCS+E+GRTSFVRQLE D+HIDT+ +IVSQL+V L
Sbjct: 187 FCSSEVGRTSFVRQLESDFHIDTSLDIVSQLSVCYL 222
>gi|224028645|gb|ACN33398.1| unknown [Zea mays]
Length = 250
Score = 179 bits (453), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 141/213 (66%), Gaps = 22/213 (10%)
Query: 3 WVLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQAN 62
+++ LL ++ + S+GA+AG A+A+VF W+ R R+ ++ P T + +
Sbjct: 28 YIVALLFSHKGAGSLGAVAGFAIAVVFAWKFLRPR-----RRAPKRPPPTPAAAPAATVP 82
Query: 63 ATSTPFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPE 122
+ P G S K ++V ++L RKVTC+LLGV+ EE PE
Sbjct: 83 DAAEPIGDSG-----------------KVVTREIVVKRLKGCRKVTCQLLGVVFEEKIPE 125
Query: 123 ELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKV 182
ELQK TV+ SV+E LLEI+++CDLYLME V+D++SE+ L+ALET+G+F +GGL+K+KV
Sbjct: 126 ELQKHATVRPSVVELLLEISRYCDLYLMETVIDDKSEENALMALETSGLFRTGGLMKEKV 185
Query: 183 LFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
LFCS+E+GRTSFVRQLE D+HIDT+ +IVSQL+
Sbjct: 186 LFCSSEVGRTSFVRQLESDFHIDTSLDIVSQLS 218
>gi|414584842|tpg|DAA35413.1| TPA: hypothetical protein ZEAMMB73_564779 [Zea mays]
Length = 250
Score = 177 bits (450), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 97/212 (45%), Positives = 141/212 (66%), Gaps = 22/212 (10%)
Query: 4 VLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANA 63
++ LL ++ + S+GA+AG A+A+VF W+ R R+ ++ P T + +
Sbjct: 29 IVALLFSHKGAGSLGAVAGFAIAVVFAWKFLRPR-----RRAPKRPPPTPAAAPAATVPD 83
Query: 64 TSTPFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEE 123
+ P G S K ++V ++L RKVTC+LLGV+ EE+ PEE
Sbjct: 84 AAEPIGDSG-----------------KVVTREIVVKRLKGCRKVTCQLLGVVFEETIPEE 126
Query: 124 LQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVL 183
LQK TV+ SV+E LLEI+++CDLYLME V+D++SE+ L+ALET+G+F +GGL+K+KVL
Sbjct: 127 LQKHATVRPSVVELLLEISRYCDLYLMETVIDDKSEENALMALETSGLFRTGGLMKEKVL 186
Query: 184 FCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
FCS+E+GRTSFVRQLE D+HIDT+ +IVSQL+
Sbjct: 187 FCSSEVGRTSFVRQLESDFHIDTSLDIVSQLS 218
>gi|148908102|gb|ABR17168.1| unknown [Picea sitchensis]
Length = 172
Score = 176 bits (447), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 82/124 (66%), Positives = 105/124 (84%)
Query: 92 TLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLME 151
+L QVVRQ+L+ GRK+TC+LLGV+LEES PEEL + V+ +V++ LLEI K CDLYL+
Sbjct: 4 SLAQVVRQQLNGGRKITCQLLGVVLEESTPEELLEHAVVRPTVVDVLLEIAKGCDLYLIA 63
Query: 152 RVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIV 211
RVLD++SE+KV+ AL+ GVFT GG+ ++KVLFCSTE GR+SFVRQLEPDWHIDT+ EI+
Sbjct: 64 RVLDDDSEEKVISALDAVGVFTIGGMNRNKVLFCSTETGRSSFVRQLEPDWHIDTSAEII 123
Query: 212 SQLA 215
SQLA
Sbjct: 124 SQLA 127
>gi|302814322|ref|XP_002988845.1| hypothetical protein SELMODRAFT_184106 [Selaginella moellendorffii]
gi|300143416|gb|EFJ10107.1| hypothetical protein SELMODRAFT_184106 [Selaginella moellendorffii]
Length = 266
Score = 176 bits (447), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 96/221 (43%), Positives = 146/221 (66%), Gaps = 8/221 (3%)
Query: 1 MFWVLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQP--KRQAPSTSNSVGS 58
M +++ L ++D+ S+GAIAG A+A+V TWR RS +R +P K +APS+S+S +
Sbjct: 20 MSALVSFLMHHKDASSIGAIAGFAIALVCTWRYMRSSNPVRRPKPTAKVEAPSSSSSDST 79
Query: 59 TQANATSTPFGASSSSEDLRTQIADALFQPV-KPTLGQVVRQKLSEGRKVTCRLLGVILE 117
+ ++ G+ + + Q A P+ K TL Q V+++L GRKVTC+LLGV+L+
Sbjct: 80 SGSSTAIESRGSQARQRGVAIQEA-----PLSKLTLAQSVKRQLKGGRKVTCQLLGVVLQ 134
Query: 118 ESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGL 177
E+ +L+ V+ + + LLE+ + CDLYLM RV D+ SE+ +L AL+ G+F+
Sbjct: 135 ENSTADLENHAVVRPAAADVLLELAQCCDLYLMSRVNDDASEEAILAALDQIGLFSKANF 194
Query: 178 VKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLAVHL 218
++KVLFCS+E GR SFVRQLEPDWHIDT+ +IV+QLA ++
Sbjct: 195 NRNKVLFCSSEAGRCSFVRQLEPDWHIDTSVDIVNQLARYI 235
>gi|302761590|ref|XP_002964217.1| hypothetical protein SELMODRAFT_142584 [Selaginella moellendorffii]
gi|300167946|gb|EFJ34550.1| hypothetical protein SELMODRAFT_142584 [Selaginella moellendorffii]
Length = 266
Score = 176 bits (446), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 96/221 (43%), Positives = 146/221 (66%), Gaps = 8/221 (3%)
Query: 1 MFWVLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQP--KRQAPSTSNSVGS 58
M +++ L ++D+ S+GAIAG A+A+V TWR RS +R +P K +APS+S+S +
Sbjct: 20 MSALVSFLMHHKDASSIGAIAGFAIALVCTWRYMRSSNPVRRPKPTVKVEAPSSSSSDST 79
Query: 59 TQANATSTPFGASSSSEDLRTQIADALFQPV-KPTLGQVVRQKLSEGRKVTCRLLGVILE 117
+ ++ G+ + + Q A P+ K TL Q V+++L GRKVTC+LLGV+L+
Sbjct: 80 SGSSTAIESRGSQARQRGVAIQEA-----PLSKLTLAQSVKRQLKGGRKVTCQLLGVVLQ 134
Query: 118 ESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGL 177
E+ +L+ V+ + + LLE+ + CDLYLM RV D+ SE+ +L AL+ G+F+
Sbjct: 135 ENSTADLENHAVVRPAAADVLLELAQCCDLYLMSRVNDDASEEAILAALDQIGLFSKANF 194
Query: 178 VKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLAVHL 218
++KVLFCS+E GR SFVRQLEPDWHIDT+ +IV+QLA ++
Sbjct: 195 NRNKVLFCSSEAGRCSFVRQLEPDWHIDTSVDIVNQLARYI 235
>gi|226510333|ref|NP_001143649.1| uncharacterized protein LOC100276371 [Zea mays]
gi|195623864|gb|ACG33762.1| hypothetical protein [Zea mays]
Length = 250
Score = 175 bits (443), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 96/212 (45%), Positives = 139/212 (65%), Gaps = 22/212 (10%)
Query: 4 VLTLLSWYQDSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANA 63
++ LL ++ + S+GA+AG A+A+VF W+ R R+ ++ P T + +
Sbjct: 29 IVALLFSHKGAGSLGAVAGFAIAVVFAWKFLRPR-----RRAPKRPPPTPAAAPAATVPD 83
Query: 64 TSTPFGASSSSEDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEE 123
+ P G S K ++V ++L RKVTC+LLGV+ EE PEE
Sbjct: 84 AAEPIGDSG-----------------KVVTREIVVKRLKGCRKVTCQLLGVVFEEKIPEE 126
Query: 124 LQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVL 183
LQK TV+ SV+E LLEI+++CDLYLME V+D++SE+ L+ALE +G+F +GGL+K+KVL
Sbjct: 127 LQKHATVRPSVVELLLEISRYCDLYLMETVIDDKSEENALMALEISGLFRTGGLMKEKVL 186
Query: 184 FCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
FCS+E+GRTSFVRQLE D+HIDT+ +IVSQL+
Sbjct: 187 FCSSEVGRTSFVRQLESDFHIDTSLDIVSQLS 218
>gi|168008529|ref|XP_001756959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|168008745|ref|XP_001757067.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691830|gb|EDQ78190.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691938|gb|EDQ78298.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 262
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 88/188 (46%), Positives = 121/188 (64%), Gaps = 15/188 (7%)
Query: 29 FTWRMFRSPGA-PQRRQPKRQAPSTSNSVGSTQANATSTPFGASSSSEDLRTQIADALFQ 87
+TW+ +R PGA P R KR +++ + S G + S L
Sbjct: 52 WTWKCWRIPGARPHPRVDKRDGADAASASATRAPAGASMDSGPGALSTAL---------- 101
Query: 88 PVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDL 147
++ Q+VR++L+ RK+TC+LLGVILEES PEE+QK V+ +V+E LLEI + CDL
Sbjct: 102 ----SVAQIVRRQLNGSRKMTCQLLGVILEESSPEEVQKHAVVRPAVVEVLLEIARTCDL 157
Query: 148 YLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTN 207
YL+ RV+D+ SE V+ ALE G+F+ G L ++KVLFCSTE GRTSFVRQLEPDWH+DT+
Sbjct: 158 YLVARVVDDASEGLVMGALEAVGLFSEGKLNREKVLFCSTETGRTSFVRQLEPDWHVDTS 217
Query: 208 PEIVSQLA 215
+I + LA
Sbjct: 218 RDINAGLA 225
>gi|255646655|gb|ACU23801.1| unknown [Glycine max]
Length = 163
Score = 164 bits (415), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 82/119 (68%), Positives = 93/119 (78%), Gaps = 1/119 (0%)
Query: 13 DSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGASS 72
D RS+GA+AGLA+AIVFTWR+ RSP QRRQ KRQ PS+SN +T +N + P A S
Sbjct: 41 DIRSIGAVAGLAVAIVFTWRLLRSPSGSQRRQQKRQGPSSSNPGVTTHSNVSDVPSDACS 100
Query: 73 SSEDLRTQ-IADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTV 130
S+D R Q + D FQPVKPTLGQ+VRQKLSEGRKVTCRLLGVILEES PEELQKQ TV
Sbjct: 101 PSDDSRAQNVVDEFFQPVKPTLGQIVRQKLSEGRKVTCRLLGVILEESSPEELQKQATV 159
>gi|388494478|gb|AFK35305.1| unknown [Lotus japonicus]
Length = 131
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/101 (64%), Positives = 76/101 (75%), Gaps = 1/101 (0%)
Query: 13 DSRSVGAIAGLALAIVFTWRMFRSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGASS 72
DS S+GA+AGLA+AI FTWR+ RSPG PQ RQ KRQ ++SN ST +NA+ P G S
Sbjct: 27 DSHSIGAVAGLAIAIFFTWRLLRSPGEPQPRQRKRQTTTSSNPGVSTHSNASVVPSGVRS 86
Query: 73 SSEDLRTQ-IADALFQPVKPTLGQVVRQKLSEGRKVTCRLL 112
S D R Q + D LFQPVKPTLGQ+VRQKL+EGRKVT RLL
Sbjct: 87 PSGDSRAQNVVDELFQPVKPTLGQIVRQKLNEGRKVTRRLL 127
>gi|302141740|emb|CBI18943.3| unnamed protein product [Vitis vinifera]
Length = 179
Score = 127 bits (318), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 68/105 (64%), Positives = 75/105 (71%), Gaps = 18/105 (17%)
Query: 80 QIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALL 139
+ D FQPV+PTLGQ+VRQKLSEGRK KQ TV+SSV E LL
Sbjct: 71 NVIDEFFQPVEPTLGQIVRQKLSEGRKA------------------KQATVRSSVPEVLL 112
Query: 140 EITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLF 184
EITKFCDLYLMERVLD+ESE+KV LALE AGVFTSG LV+ KVLF
Sbjct: 113 EITKFCDLYLMERVLDDESERKVFLALEDAGVFTSGDLVQGKVLF 157
>gi|215697004|dbj|BAG90998.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765458|dbj|BAG87155.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 103
Score = 112 bits (281), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 52/66 (78%), Positives = 59/66 (89%)
Query: 150 MERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPE 209
MER+LD+ES +KVL AL AG+FT+ GL+KDKVLFCSTE GRTSFVRQLEPDWHIDT+PE
Sbjct: 1 MERILDDESGEKVLSALSEAGLFTNSGLIKDKVLFCSTENGRTSFVRQLEPDWHIDTSPE 60
Query: 210 IVSQLA 215
IV QLA
Sbjct: 61 IVHQLA 66
>gi|414584840|tpg|DAA35411.1| TPA: hypothetical protein ZEAMMB73_564779 [Zea mays]
Length = 98
Score = 100 bits (248), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 44/66 (66%), Positives = 60/66 (90%)
Query: 150 MERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPE 209
ME V+D++SE+ L+ALET+G+F +GGL+K+KVLFCS+E+GRTSFVRQLE D+HIDT+ +
Sbjct: 1 METVIDDKSEENALMALETSGLFRTGGLMKEKVLFCSSEVGRTSFVRQLESDFHIDTSLD 60
Query: 210 IVSQLA 215
IVSQL+
Sbjct: 61 IVSQLS 66
>gi|302854160|ref|XP_002958590.1| hypothetical protein VOLCADRAFT_84514 [Volvox carteri f.
nagariensis]
gi|300256051|gb|EFJ40327.1| hypothetical protein VOLCADRAFT_84514 [Volvox carteri f.
nagariensis]
Length = 259
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 53/124 (42%), Positives = 72/124 (58%), Gaps = 2/124 (1%)
Query: 93 LGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLMER 152
LG+ V +LS ++VT L GV+ E P +LQ+ TV LE L EI++ D+YL+
Sbjct: 99 LGRAVAAQLSGVKRVTLSLPGVVFLERTPAQLQESATVCPKALEVLQEISRVSDVYLIAH 158
Query: 153 VLDNESEKKVLLALETAGVFTSG-GLVK-DKVLFCSTEIGRTSFVRQLEPDWHIDTNPEI 210
V D+ E V ALE AGV +G G VK VL CS+ G+ RQLEP+ H+D +P
Sbjct: 159 VEDDVGEAVVTGALEAAGVLGNGPGQVKQHHVLCCSSLDGKVPIARQLEPELHVDGHPAS 218
Query: 211 VSQL 214
V +L
Sbjct: 219 VDEL 222
>gi|307105185|gb|EFN53435.1| hypothetical protein CHLNCDRAFT_136676 [Chlorella variabilis]
Length = 252
Score = 86.7 bits (213), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 48/120 (40%), Positives = 73/120 (60%), Gaps = 4/120 (3%)
Query: 99 QKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCD--LYLMERVLDN 156
++L+ ++ T + GV+LEES P EL++ V++ + E+ + C LYLM +V D+
Sbjct: 84 RRLAGVQRATMSVPGVLLEESSPGELEESACVRADAVALAREVARCCSAGLYLMCQVADD 143
Query: 157 ESEKKVLLALETAGVFTS--GGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQL 214
E V ALE AG+ + G + ++LFCST G+ SFVRQLEP+ H+D +P VS L
Sbjct: 144 VGEAVVRGALEHAGLLGTAPGQVPPHRLLFCSTLEGKASFVRQLEPELHVDAHPRTVSDL 203
>gi|384245649|gb|EIE19142.1| hypothetical protein COCSUDRAFT_59626 [Coccomyxa subellipsoidea
C-169]
Length = 499
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 49/132 (37%), Positives = 73/132 (55%), Gaps = 2/132 (1%)
Query: 85 LFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKF 144
L Q + L VR +LS R VT GV+LEE P +L++ ++ L E+ K
Sbjct: 14 LVQGEERGLAWAVRSQLSGIRSVTVSAPGVLLEEWTPTDLEESAVLREDAAAVLREMLKT 73
Query: 145 CDLYLMERVLDNESEKKVLLALETAGVF--TSGGLVKDKVLFCSTEIGRTSFVRQLEPDW 202
++Y++ V+D+ E V ALE G+ + G + +VLFCST G+ S VRQLEP+
Sbjct: 74 ANVYVLAHVIDDIGEATVRGALEAGGLVGPSVGQIPPHRVLFCSTLEGKVSIVRQLEPEL 133
Query: 203 HIDTNPEIVSQL 214
HID +P+ + L
Sbjct: 134 HIDGHPQTIEAL 145
>gi|428161933|gb|EKX31161.1| hypothetical protein GUITHDRAFT_122633 [Guillardia theta CCMP2712]
Length = 197
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 40/90 (44%), Positives = 59/90 (65%), Gaps = 1/90 (1%)
Query: 125 QKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLF 184
+ + +K S L LLE+ + CDLY++ +V +E E KV+ ALE AGVF GL KVLF
Sbjct: 58 RNSIALKGSSLAILLEMARTCDLYVITQVTSDEVENKVVDALEQAGVF-DAGLSPHKVLF 116
Query: 185 CSTEIGRTSFVRQLEPDWHIDTNPEIVSQL 214
C TE+GR + VRQ+EP H++++ ++ L
Sbjct: 117 CETELGRAAMVRQIEPMLHVESSAVVLESL 146
>gi|452821155|gb|EME28189.1| hypothetical protein Gasu_43540 [Galdieria sulphuraria]
Length = 253
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 62/115 (53%), Gaps = 6/115 (5%)
Query: 105 RKVTCRLLGVILEESC-----PEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESE 159
R V+ LG LE+SC +++Q+ V L L++ +++L+ RV +E E
Sbjct: 106 RGVSRITLGTDLEKSCLFRRVSSSVKEQLQVDPDKLSGFLQLISLFEVFLIIRVSSDEEE 165
Query: 160 KKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQL 214
ALE G F SG L KVLFC T G T+ VRQLEP HID+NP ++ L
Sbjct: 166 VSCRNALEVTGTFQSG-LHPAKVLFCETLQGTTAIVRQLEPQMHIDSNPSVIVDL 219
>gi|159474688|ref|XP_001695457.1| predicted protein [Chlamydomonas reinhardtii]
gi|158275940|gb|EDP01715.1| predicted protein [Chlamydomonas reinhardtii]
Length = 214
Score = 70.1 bits (170), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 46/119 (38%), Positives = 71/119 (59%), Gaps = 2/119 (1%)
Query: 98 RQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNE 157
RQ+L+ ++VT + GV+L ES P +LQ+ +V+ LEA+ E+ + D+YL+ V D+
Sbjct: 60 RQRLAGAKRVTLSIPGVLLAESSPAQLQESASVQLEALEAVREMARVSDVYLLAHVEDDV 119
Query: 158 SEKKVLLALETAGVFTSGG--LVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQL 214
E V ALE AG+ +G + +L CST G+ VRQL+PD H+D + V +L
Sbjct: 120 GEAVVAGALEAAGLLGAGAGQVAPHHLLCCSTLDGKVPIVRQLDPDLHVDGHAVSVDEL 178
>gi|328875444|gb|EGG23808.1| splicing factor [Dictyostelium fasciculatum]
Length = 297
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 66/121 (54%), Gaps = 6/121 (4%)
Query: 103 EGRKVTCRLLGVILEESCPEELQKQVTVKS-----SVLEALLEITKFCDLYLMERVLDNE 157
+ +K++ + VI+E++ E + S S +AL+++T CD+YLM + D+
Sbjct: 123 KSKKISISTINVIIEKTVHYESPSDRRICSYKFMESEKQALIKLTSQCDIYLMTMIEDDS 182
Query: 158 SEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLAVH 217
E K+ L+ +G+F GL K+LFCST GR R LE HID + +++S L H
Sbjct: 183 EEDKIKSILKDSGIF-DNGLNPHKILFCSTTQGRAHMSRHLECLMHIDDDYQVLSMLKPH 241
Query: 218 L 218
+
Sbjct: 242 V 242
>gi|320167810|gb|EFW44709.1| hypothetical protein CAOG_02734 [Capsaspora owczarzaki ATCC 30864]
Length = 206
Score = 63.9 bits (154), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 58/112 (51%), Gaps = 4/112 (3%)
Query: 107 VTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLAL 166
+TC L V+L + ++TV + L +L ++ + L+++ + E AL
Sbjct: 69 MTCAALNVVLRR---DATTGKLTVMAESLAHVLSMSSHFTIVLIQQCASDADEDSARQAL 125
Query: 167 ETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLAVHL 218
E AG F GL+++ +LFC T G R LEP H+DT+ E+V LA H+
Sbjct: 126 EEAGAF-QNGLLRNNLLFCDTPEGLMHMARHLEPALHVDTDAEVVKTLAPHV 176
>gi|281210874|gb|EFA85040.1| splicing factor [Polysphondylium pallidum PN500]
Length = 324
Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 57/116 (49%), Gaps = 1/116 (0%)
Query: 103 EGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKV 162
+KVT + + E + T+ S + LL + DLYL+ ++ D+ E+ V
Sbjct: 148 HKKKVTLSTIDTVYYEHITDRKVCNFTLIESERDTLLRLASTADLYLLTQITDDNEEEMV 207
Query: 163 LLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLAVHL 218
+ L++ G+F GL KVLFCST GR R LE HID + ++ L H+
Sbjct: 208 INLLKSYGIF-EAGLNPHKVLFCSTTQGRAHIARHLESILHIDDDLTALTMLKPHV 262
>gi|440804537|gb|ELR25414.1| hypothetical protein ACA1_293850 [Acanthamoeba castellanii str.
Neff]
Length = 179
Score = 56.6 bits (135), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 62/121 (51%), Gaps = 6/121 (4%)
Query: 103 EGRKVTCRLLGVILE-----ESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNE 157
+GR+V+C GV+LE ++ + + S + LL + ++ DLYL+ D++
Sbjct: 25 KGRRVSCATRGVVLEFDNDADADAAAGHHRPRLIESAVPVLLLLARYTDLYLITEC-DDD 83
Query: 158 SEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLAVH 217
S ++ + +L A + GL K LFCST +GR RQLE HID N + L H
Sbjct: 84 STEEAVRSLLAARGLLAAGLNPHKALFCSTPLGRVHMARQLESHMHIDGNAAVAEGLHRH 143
Query: 218 L 218
+
Sbjct: 144 V 144
>gi|348687079|gb|EGZ26893.1| hypothetical protein PHYSODRAFT_553679 [Phytophthora sojae]
Length = 255
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 48/89 (53%), Gaps = 8/89 (8%)
Query: 138 LLEITKFCDLYLMERVLDNESEKKVLLALE--------TAGVFTSGGLVKDKVLFCSTEI 189
L ++++ D+YL+ V D + K+ E + T GG+ K+LFC+T I
Sbjct: 126 LADLSRVADVYLLCTVQDANDKGKMQRVREFVTTHPDLKSNESTPGGVKAHKILFCTTAI 185
Query: 190 GRTSFVRQLEPDWHIDTNPEIVSQLAVHL 218
G+ +FVRQ+EP H++ + +V L H+
Sbjct: 186 GKVAFVRQIEPQIHVEVDSGVVRDLEKHV 214
>gi|70934301|ref|XP_738397.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56514583|emb|CAH76351.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 169
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 63/114 (55%), Gaps = 8/114 (7%)
Query: 104 GRKVTCR------LLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNE 157
GRK T + ++ + L + + + + +V+E L+++ +++++ +V ++
Sbjct: 11 GRKTTNKSEYSRPIISLCLNDIVLKMIGNNAHILENVIEPLIKLCSISEVFVVAQVSNDT 70
Query: 158 SEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIV 211
E ++ L+ G+F G L + +++FCST GR S +RQL P H+D N EIV
Sbjct: 71 QETNIINLLKKTGLFDKG-LKEHRLMFCSTSNGRASMIRQLRPLTHVD-NDEIV 122
>gi|301123529|ref|XP_002909491.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100253|gb|EEY58305.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 255
Score = 53.9 bits (128), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 49/89 (55%), Gaps = 8/89 (8%)
Query: 138 LLEITKFCDLYLMERVLDNESEKKVLLALETAGVF--------TSGGLVKDKVLFCSTEI 189
L ++++ D+YL+ V D +++ + E T+GG+ K+LFC+T I
Sbjct: 126 LADLSRVGDVYLLCMVTDATNKQSMQRIREFVATHPDLKSNDSTTGGIKAHKILFCATSI 185
Query: 190 GRTSFVRQLEPDWHIDTNPEIVSQLAVHL 218
G+ +FVRQ+EP H++ + +V L H+
Sbjct: 186 GKIAFVRQIEPLVHVEVDAAVVRDLERHV 214
>gi|68061577|ref|XP_672789.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|68076943|ref|XP_680391.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56490145|emb|CAI02193.1| conserved hypothetical protein [Plasmodium berghei]
gi|56501314|emb|CAH98127.1| conserved hypothetical protein [Plasmodium berghei]
Length = 217
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 50/82 (60%), Gaps = 1/82 (1%)
Query: 133 SVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRT 192
+V+E L ++ +L+++ ++ ++ E ++ L+ G+F +G L + +++FCST GR
Sbjct: 94 NVIEPLRKLCSISELFVVAQISNDAQETNIINLLKKIGLFNTG-LKEHRLMFCSTSNGRA 152
Query: 193 SFVRQLEPDWHIDTNPEIVSQL 214
S +RQL P H+D + ++ L
Sbjct: 153 SMIRQLRPLTHVDNDETVIKTL 174
>gi|82597111|ref|XP_726543.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23481990|gb|EAA18108.1| hypothetical protein [Plasmodium yoelii yoelii]
Length = 217
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 55/94 (58%), Gaps = 5/94 (5%)
Query: 126 KQVTVKSSVLEALLE-ITKFC---DLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDK 181
K + + ++E ++E +TK C +L+++ +V ++ E ++ L+ G+F G L + +
Sbjct: 83 KIIGNNAHIIENVIEPLTKLCSISELFVVAQVSNDIQETNIINLLKKTGLFNKG-LKEHR 141
Query: 182 VLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
++FCST GR S +RQL P H+D + ++ L
Sbjct: 142 LMFCSTSNGRASMIRQLSPLTHVDNDETVIKTLT 175
>gi|221056680|ref|XP_002259478.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193809549|emb|CAQ40251.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 218
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/105 (22%), Positives = 58/105 (55%), Gaps = 1/105 (0%)
Query: 111 LLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAG 170
++ + L + + + +V + S +E +++ +L+++ ++L++ EK ++ + G
Sbjct: 73 IISLCLNDIVLKIIGNKVHIVESSIEPFIKLCAISELFVIAQILNDVQEKNIIDLFKRLG 132
Query: 171 VFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
+F G + +++FC+T GR S +RQL P H+D + ++ L
Sbjct: 133 LF-DKGFKEHRLMFCTTSNGRASMIRQLCPLTHVDNDETVIKTLT 176
>gi|237838925|ref|XP_002368760.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
gi|211966424|gb|EEB01620.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
Length = 316
Score = 50.8 bits (120), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 50/89 (56%), Gaps = 2/89 (2%)
Query: 126 KQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFC 185
K VTV +++ + L + + L++ +V +E E++VL LE F G L + +V+F
Sbjct: 199 KFVTVDAAI-DPFLSLCSWSKLFVFVQVNTDEEEQQVLDELEKLQAFERG-LQRHRVMFS 256
Query: 186 STEIGRTSFVRQLEPDWHIDTNPEIVSQL 214
ST GR S VRQL+P HID + I L
Sbjct: 257 STRNGRASMVRQLQPLTHIDADDFIAVTL 285
>gi|221481345|gb|EEE19739.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 316
Score = 50.8 bits (120), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 75/158 (47%), Gaps = 28/158 (17%)
Query: 58 STQANATSTPFGASSSSEDLRTQIADALFQPVKP-TLGQVVRQKLSEGRKVTCRLLGVIL 116
++ A AT G ++ +D D++F+P L +++ +K ++G K
Sbjct: 155 TSSAAATGVIGGNATEGKD-----DDSVFRPCASICLNEILLKKSADGDKF--------- 200
Query: 117 EESCPEELQKQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGG 176
VTV +++ + L + + L++ +V +E E++VL LE F G
Sbjct: 201 -----------VTVDAAI-DPFLSLCSWSKLFVFVQVNTDEEEQQVLDELEKLQAFERG- 247
Query: 177 LVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQL 214
L + +V+F ST GR S VRQL+P HID + I L
Sbjct: 248 LQRHRVMFSSTRNGRASMVRQLQPLTHIDADDFIAVTL 285
>gi|221502067|gb|EEE27813.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 316
Score = 50.8 bits (120), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 50/89 (56%), Gaps = 2/89 (2%)
Query: 126 KQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFC 185
K VTV +++ + L + + L++ +V +E E++VL LE F G L + +V+F
Sbjct: 199 KFVTVDAAI-DPFLSLCSWSKLFVFVQVNTDEEEQQVLDELEKLQAFERG-LQRHRVMFS 256
Query: 186 STEIGRTSFVRQLEPDWHIDTNPEIVSQL 214
ST GR S VRQL+P HID + I L
Sbjct: 257 STRNGRASMVRQLQPLTHIDADDFIAVTL 285
>gi|401400887|ref|XP_003880881.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325115293|emb|CBZ50848.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 281
Score = 50.8 bits (120), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 45/81 (55%), Gaps = 1/81 (1%)
Query: 127 QVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCS 186
+ ++ + + L + + L++ +V +E E++VL LE F G L + +V+F S
Sbjct: 164 KFVMEEAAIAPFLSLCSWSKLFVFAQVNSDEEEQQVLDELEGIQAFDRG-LQRHRVMFSS 222
Query: 187 TEIGRTSFVRQLEPDWHIDTN 207
T GR S VRQL+P HID +
Sbjct: 223 TRNGRASMVRQLQPLTHIDAD 243
>gi|124804933|ref|XP_001348154.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
gi|23496411|gb|AAN36067.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
Length = 217
Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 26/124 (20%), Positives = 66/124 (53%), Gaps = 1/124 (0%)
Query: 92 TLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYLME 151
T + R+K+++ + ++ + L + + + + + +E L +++ +L+++
Sbjct: 53 TSNESYRRKMNKKNISSKPVISLCLNDIVIKIIGNNAHIMENSVEPLNKLSSISELFVIA 112
Query: 152 RVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIV 211
++ ++ EK ++ + G+F GL + +++FC+T GR S +RQL P H+D + ++
Sbjct: 113 QISNDVQEKNIIDLFKKLGLF-DKGLKEHRLMFCNTSNGRASMIRQLSPLTHVDNDETVI 171
Query: 212 SQLA 215
L
Sbjct: 172 KTLT 175
>gi|156099171|ref|XP_001615588.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148804462|gb|EDL45861.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 218
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 49/90 (54%), Gaps = 1/90 (1%)
Query: 126 KQVTVKSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFC 185
V + S LE ++ +L+++ ++ ++ EK ++ + G+F GL + +++FC
Sbjct: 88 NNVHIVESSLEPFNKLCAISELFVIAQISNDVQEKNIIDLFKKLGLF-DKGLKEHRLMFC 146
Query: 186 STEIGRTSFVRQLEPDWHIDTNPEIVSQLA 215
+T GR S +RQL P H+D + ++ L
Sbjct: 147 TTSNGRASMIRQLCPLTHVDNDETVIKTLT 176
>gi|156088335|ref|XP_001611574.1| hypothetical protein [Babesia bovis T2Bo]
gi|154798828|gb|EDO08006.1| conserved hypothetical protein [Babesia bovis]
Length = 160
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 41/74 (55%), Gaps = 14/74 (18%)
Query: 154 LDNESE-KKVLLALETAGVFTSGGLVKD------------KVLFCSTEIGRTSFVRQLEP 200
+ NES+ +L +L++AG F GGL+K +VLF T GR S VRQL+P
Sbjct: 37 VSNESDIANILASLDSAGAF-RGGLMKHVRFTLYFNQHEQRVLFSQTSAGRGSMVRQLQP 95
Query: 201 DWHIDTNPEIVSQL 214
H++TN +V +
Sbjct: 96 RIHMETNAAVVQAI 109
>gi|66357068|ref|XP_625712.1| hypothetical protein [Cryptosporidium parvum Iowa II]
gi|46226651|gb|EAK87630.1| hypothetical conserved protein with possible signal peptide
[Cryptosporidium parvum Iowa II]
Length = 238
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 47/84 (55%), Gaps = 1/84 (1%)
Query: 131 KSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIG 190
K V E+L++++ F DL+L +V + E +L ++ G+F GL K +++FC
Sbjct: 123 KLKVKESLIKLSGFTDLFLFIQVDTEQDELVILNYMQEFGIF-EAGLKKHRLVFCEKPES 181
Query: 191 RTSFVRQLEPDWHIDTNPEIVSQL 214
+ RQL+ + HIDTN + +L
Sbjct: 182 IPNMARQLQANMHIDTNQDNAQKL 205
>gi|384501212|gb|EIE91703.1| hypothetical protein RO3G_16414 [Rhizopus delemar RA 99-880]
Length = 217
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/75 (28%), Positives = 43/75 (57%), Gaps = 2/75 (2%)
Query: 133 SVLEALLEITKFCDLYLMERVLDNESEKKVLLALET--AGVFTSGGLVKDKVLFCSTEIG 190
+VL+ ++ +T D+Y++ + + ++ LE G+FT G + + K+++CS E G
Sbjct: 86 NVLQLIIRLTYAYDIYIIIHINSVKERDRIQFLLENTQGGLFTGGYMDQRKIIYCSEEEG 145
Query: 191 RTSFVRQLEPDWHID 205
+ +R +EP HI+
Sbjct: 146 KVHIIRHIEPVIHIE 160
>gi|209879525|ref|XP_002141203.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209556809|gb|EEA06854.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 205
Score = 46.6 bits (109), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 39/70 (55%), Gaps = 4/70 (5%)
Query: 141 ITKFC---DLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQ 197
+TK C D+YL+ +V +E E + +E G+ +G L + +++FC S RQ
Sbjct: 97 LTKLCLKADIYLIVQVNSDEEESVFMKTIEDMGILDNG-LKRHRIVFCEKSTSIPSLARQ 155
Query: 198 LEPDWHIDTN 207
L+P HIDTN
Sbjct: 156 LQPLMHIDTN 165
>gi|429327358|gb|AFZ79118.1| hypothetical protein BEWA_019640 [Babesia equi]
Length = 183
Score = 45.4 bits (106), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 41/78 (52%), Gaps = 1/78 (1%)
Query: 135 LEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSF 194
L L + LYL+ ++ K VL LE VF +G L + +FC++ GR S
Sbjct: 80 LNRFLNVCSRTKLYLITQIKREHEMKVVLNNLEVHNVFKNG-LAAHRAMFCNSPSGRVSM 138
Query: 195 VRQLEPDWHIDTNPEIVS 212
++QL+P H++ + E V+
Sbjct: 139 IQQLQPHIHVEFDAEGVN 156
>gi|407849040|gb|EKG03903.1| hypothetical protein TCSYLVIO_005036 [Trypanosoma cruzi]
Length = 201
Score = 45.4 bits (106), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 57/111 (51%), Gaps = 15/111 (13%)
Query: 90 KPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYL 149
KP+L +RQ+ +G +V E L +Q + EAL+ ++Y+
Sbjct: 58 KPSLLNRMRQQQFKGHRVCIAW----------EVLAEQGQWRDHAKEALVVFAVDMEVYV 107
Query: 150 MERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEP 200
M RV + + EK +LA+ T G+ GLV+ ++LFC T G SF RQ++P
Sbjct: 108 MCRV-NCKQEKNEVLAMLT-GI---SGLVRHRILFCETAKGYESFCRQIKP 153
>gi|71665879|ref|XP_819905.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70885226|gb|EAN98054.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 201
Score = 45.1 bits (105), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 57/111 (51%), Gaps = 15/111 (13%)
Query: 90 KPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYL 149
KP+L +RQ+ +G +V E L +Q + EAL+ ++Y+
Sbjct: 58 KPSLLNRMRQQQFKGHRVCIAW----------EVLAEQGQWRDHAKEALVVFAVDMEVYV 107
Query: 150 MERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEP 200
M RV + + EK +LA+ T G+ GLV+ ++LFC T G SF RQ++P
Sbjct: 108 MCRV-NCKQEKNEVLAMLT-GI---SGLVRHRILFCETAKGYESFCRQIKP 153
>gi|294952915|ref|XP_002787513.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239902515|gb|EER19309.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 213
Score = 45.1 bits (105), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 41/84 (48%), Gaps = 9/84 (10%)
Query: 133 SVLEALLEITKFCDLY--LMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIG 190
++ L + FCDL ++ D E+ K ++ + G+ +VL ST G
Sbjct: 94 ALYTGLRRLADFCDLVVEMIAATDDEENTAKEKISRKV-------GIPGHRVLCSSTANG 146
Query: 191 RTSFVRQLEPDWHIDTNPEIVSQL 214
R VRQL P WH+DT+ E++ L
Sbjct: 147 RAYMVRQLHPAWHVDTDEEVLKYL 170
>gi|146104035|ref|XP_001469712.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398024548|ref|XP_003865435.1| hypothetical protein, conserved [Leishmania donovani]
gi|134074082|emb|CAM72824.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322503672|emb|CBZ38758.1| hypothetical protein, conserved [Leishmania donovani]
Length = 142
Score = 45.1 bits (105), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 45/80 (56%), Gaps = 9/80 (11%)
Query: 139 LEITKF----CDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSF 194
LEI +F ++YLM R+ D + ++++L ++ G+ + +VLFC+TE G +F
Sbjct: 34 LEILRFYAFSSEVYLMCRIRDADDKRQILNLVKAVD-----GIERHRVLFCTTEKGYEAF 88
Query: 195 VRQLEPDWHIDTNPEIVSQL 214
RQ++P I N V+ L
Sbjct: 89 TRQIDPSLLITNNAAQVAFL 108
>gi|401420336|ref|XP_003874657.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322490893|emb|CBZ26157.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 142
Score = 44.7 bits (104), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 42/74 (56%), Gaps = 9/74 (12%)
Query: 139 LEITKF----CDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSF 194
LEI +F ++YLM R+ D + +K++L ++ G+ + +VLFC+TE G +F
Sbjct: 34 LEILRFYAFSSEVYLMCRIRDADDKKQILNLVKAVD-----GIERHRVLFCTTEKGYEAF 88
Query: 195 VRQLEPDWHIDTNP 208
RQ++P + N
Sbjct: 89 TRQIDPSLLVSHNA 102
>gi|71649469|ref|XP_813458.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70878341|gb|EAN91607.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 201
Score = 44.3 bits (103), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 56/111 (50%), Gaps = 15/111 (13%)
Query: 90 KPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYL 149
KP+L +RQ+ +G +V E L +Q + EAL+ ++YL
Sbjct: 58 KPSLLNRMRQQQFKGHRVCIAW----------EVLAEQGQWRDHAKEALVVFAVDMEVYL 107
Query: 150 MERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEP 200
M RV + + EK +L + T G+ GLV+ ++LFC T G SF RQ++P
Sbjct: 108 MCRV-NCKQEKNEVLEMLT-GI---SGLVRHRILFCETAKGYESFCRQIKP 153
>gi|407407936|gb|EKF31545.1| hypothetical protein MOQ_004618 [Trypanosoma cruzi marinkellei]
Length = 255
Score = 43.9 bits (102), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 60/118 (50%), Gaps = 15/118 (12%)
Query: 90 KPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEITKFCDLYL 149
KP+L +RQ+ +G +V C V+ + + K EAL+ ++YL
Sbjct: 112 KPSLLNRMRQQQFKGHRV-CIAWEVLADHGQWRDHAK---------EALVVFAVDMEVYL 161
Query: 150 MERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDTN 207
M RV + + EK +LA+ T G+ G+V+ ++LFC T G SF RQ++P + N
Sbjct: 162 MCRV-NCKEEKNEVLAMLT-GI---SGIVRHRILFCETAKGYESFCRQIKPSLVVIQN 214
>gi|154346018|ref|XP_001568946.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134066288|emb|CAM44079.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 209
Score = 43.1 bits (100), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 41/73 (56%), Gaps = 9/73 (12%)
Query: 139 LEITKF----CDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSF 194
L+I +F ++YLM R+ D +K++L ++ G+ + ++LFC+TE G +F
Sbjct: 101 LDILRFYAFSSEVYLMCRIRDAADKKQILNLVKAVD-----GIERHRILFCTTEKGYEAF 155
Query: 195 VRQLEPDWHIDTN 207
RQ++P + N
Sbjct: 156 TRQIDPSLLVTNN 168
>gi|157877105|ref|XP_001686884.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68129959|emb|CAJ09267.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 142
Score = 42.7 bits (99), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 44/80 (55%), Gaps = 9/80 (11%)
Query: 139 LEITKF----CDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSF 194
L+I +F ++YLM R+ D + +K++ ++ G+ + +VLFC+TE G +F
Sbjct: 34 LQILRFYAFSSEVYLMCRIRDADDKKQICNLVKAVD-----GIERHRVLFCTTEKGYEAF 88
Query: 195 VRQLEPDWHIDTNPEIVSQL 214
RQ++P I N V+ L
Sbjct: 89 TRQIDPSLLITNNAAQVAFL 108
>gi|111226806|ref|XP_642776.2| hypothetical protein DDB_G0277361 [Dictyostelium discoideum AX4]
gi|90970778|gb|EAL68867.2| hypothetical protein DDB_G0277361 [Dictyostelium discoideum AX4]
Length = 344
Score = 42.7 bits (99), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 43/78 (55%), Gaps = 2/78 (2%)
Query: 138 LLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQ 197
LL++ K DLYL+ + D + E +V L+ G+ GL K L+CST G+ R
Sbjct: 194 LLKLAKETDLYLITQTEDLQEEDQVARLLKQYGII-DAGLNPHKSLYCSTSQGKGHMSRH 252
Query: 198 LEPDWHID-TNPEIVSQL 214
LE HID ++ E+++ L
Sbjct: 253 LEIILHIDGSDIEVLTML 270
>gi|399219056|emb|CCF75943.1| unnamed protein product [Babesia microti strain RI]
Length = 86
Score = 40.8 bits (94), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 34/80 (42%), Gaps = 28/80 (35%)
Query: 135 LEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSF 194
LE LLEI K +LYL + +++ CST GR S
Sbjct: 4 LEILLEIAKISNLYL----------------------------ITQRIMTCSTAKGRGSM 35
Query: 195 VRQLEPDWHIDTNPEIVSQL 214
VRQL+P H+D + IV L
Sbjct: 36 VRQLQPILHVDIDRTIVDYL 55
>gi|328771599|gb|EGF81639.1| expressed protein [Batrachochytrium dendrobatidis JAM81]
Length = 500
Score = 40.4 bits (93), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 28/110 (25%), Positives = 52/110 (47%), Gaps = 3/110 (2%)
Query: 100 KLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSVLEALLEI--TKFCDLYLMERVLDNE 157
+LS + +T ++ P+ K S+L ++E+ T+ ++ L+ V +
Sbjct: 204 RLSSKKLLTISTRNLLFWNPSPDACAPNFAFKESMLPVMIELLKTRKYEITLITLVKSDA 263
Query: 158 SEKKVL-LALETAGVFTSGGLVKDKVLFCSTEIGRTSFVRQLEPDWHIDT 206
E++V L L+ GL +VLFCS+ G V+ + PD H+D+
Sbjct: 264 EEEQVRQLLLDERHELVKSGLDSRRVLFCSSLQGVVHMVKHIGPDVHVDS 313
>gi|342184389|emb|CCC93871.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 196
Score = 40.0 bits (92), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 40/70 (57%), Gaps = 5/70 (7%)
Query: 131 KSSVLEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIG 190
+++ EAL+ ++ +YLM RV E ++ +L L+ G+ LV+ +LFC T G
Sbjct: 88 RAAAREALITLSTSMTVYLMCRVASEEEKRSILAMLK--GIPR---LVRHNILFCETAKG 142
Query: 191 RTSFVRQLEP 200
+F RQ++P
Sbjct: 143 YEAFSRQIKP 152
>gi|330844078|ref|XP_003293964.1| hypothetical protein DICPUDRAFT_158893 [Dictyostelium purpureum]
gi|325075642|gb|EGC29504.1| hypothetical protein DICPUDRAFT_158893 [Dictyostelium purpureum]
Length = 173
Score = 39.7 bits (91), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 44/70 (62%), Gaps = 1/70 (1%)
Query: 136 EALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSFV 195
+ L+++ K DLYL+ ++ +++ E++VL L++ G+ +G L +K LFCST G+
Sbjct: 28 DVLIKLAKESDLYLITQIEEDQEEEQVLDLLKSYGIIDAG-LNPNKCLFCSTSQGKGHIS 86
Query: 196 RQLEPDWHID 205
R LE HI+
Sbjct: 87 RHLEVYLHIE 96
>gi|297572219|ref|YP_003697993.1| hypothetical protein Arch_1691 [Arcanobacterium haemolyticum DSM
20595]
gi|296932566|gb|ADH93374.1| Protein of unknown function DUF2335, membrane [Arcanobacterium
haemolyticum DSM 20595]
Length = 215
Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 15/94 (15%)
Query: 37 PGAPQRRQPKRQAPSTSNSV-----GSTQANATSTPFGASSSSEDLRTQIADALFQPVKP 91
P + R P+ Q S +SV G A+ + TPFG S+ IA P +P
Sbjct: 18 PDHSESRSPQEQEISQPDSVHQSSGGGVDADGSVTPFGDGRDSKKESLDIA-----PKEP 72
Query: 92 TLGQVVRQKLSEGRKVT----CRLLGVILEESCP 121
T+GQVV Q L++G ++ R++G ++ S P
Sbjct: 73 TIGQVV-QALAQGEDISNELVTRVVGHSVQYSGP 105
>gi|123413681|ref|XP_001304326.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121885770|gb|EAX91396.1| hypothetical protein TVAG_350900 [Trichomonas vaginalis G3]
Length = 189
Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 175 GGLVK-DKVLFCSTEIGRTSFVRQLEPDWHIDTNPEIVSQLAVHL 218
GLV+ + +L C T +G S RQLE H D NPE + Q ++ +
Sbjct: 106 NGLVESNHILPCQTAMGCASMARQLESSAHFDYNPEAIHQASIFI 150
>gi|290982745|ref|XP_002674090.1| hypothetical protein NAEGRDRAFT_80749 [Naegleria gruberi]
gi|284087678|gb|EFC41346.1| hypothetical protein NAEGRDRAFT_80749 [Naegleria gruberi]
Length = 243
Score = 37.4 bits (85), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 41/81 (50%), Gaps = 6/81 (7%)
Query: 135 LEALLEITKFCDLYLMERVLDNESEKKVLLALET-AGVFTSGGLVKDKVLFCSTEIGRTS 193
+++L ++TK D+YL+ +V +E + LET G F K LFC T G+ +
Sbjct: 128 IKSLKDLTKSNDVYLICQVSSDEEQNYYQDLLETRYGFFNQV-----KTLFCETSKGKIA 182
Query: 194 FVRQLEPDWHIDTNPEIVSQL 214
F RQ+ P + + E +L
Sbjct: 183 FCRQINPQLCFEVDRETTREL 203
>gi|338730819|ref|YP_004660211.1| peptidase S16 lon domain-containing protein [Thermotoga thermarum
DSM 5069]
gi|335365170|gb|AEH51115.1| peptidase S16 lon domain-containing protein [Thermotoga thermarum
DSM 5069]
Length = 803
Score = 37.0 bits (84), Expect = 4.9, Method: Composition-based stats.
Identities = 33/127 (25%), Positives = 62/127 (48%), Gaps = 11/127 (8%)
Query: 75 EDLRTQIADALFQPVKPTLGQVVRQKLSEGRKVTCRLLGVILEESCPEELQKQVTVKSSV 134
E+LR + A F V+ T V+ + EG+ VT +L E+ PEE++ S
Sbjct: 159 EELREKAAQLGFA-VQLTPTGVITVPVYEGKPVTPEVL-----ETLPEEVKNAFNENSRK 212
Query: 135 LEALLEITKFCDLYLMERVLDNESEKKVLLALETAGVFTSGGLVKDKVLFCSTEIGRTSF 194
L+ ++E T + R LD E ++K++ + A +FT G + ++ + G +
Sbjct: 213 LKHIIEGTLY-----KSRKLDREYKEKLIELDKYAALFTVGSMFEEMIKQYEENEGVVEY 267
Query: 195 VRQLEPD 201
++++E D
Sbjct: 268 LKEVEKD 274
>gi|312385705|gb|EFR30134.1| hypothetical protein AND_00458 [Anopheles darlingi]
Length = 1614
Score = 36.2 bits (82), Expect = 8.5, Method: Composition-based stats.
Identities = 32/97 (32%), Positives = 48/97 (49%), Gaps = 7/97 (7%)
Query: 35 RSPGAPQRRQPKRQAPSTSNSVGSTQANATSTPFGASSSSEDLRTQIADALFQPVKPTLG 94
RSPGAP P R AP+ + VGS++ + P GA+S L T AD P + +
Sbjct: 954 RSPGAPVA-VPIRPAPTLPSVVGSSEPTVSFAPSGAAS----LATNPADTGSLPAEVLIN 1008
Query: 95 QVVRQKLSEGRKVTCRLLGVILEESCPEELQ-KQVTV 130
V R K + K+ R+ + +E P + + KQ+ V
Sbjct: 1009 PVSRDKKLKESKLN-RITSYLKKEEKPTKPEPKQLRV 1044
>gi|302844562|ref|XP_002953821.1| hypothetical protein VOLCADRAFT_106102 [Volvox carteri f.
nagariensis]
gi|300260929|gb|EFJ45145.1| hypothetical protein VOLCADRAFT_106102 [Volvox carteri f.
nagariensis]
Length = 487
Score = 36.2 bits (82), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 34/66 (51%), Gaps = 6/66 (9%)
Query: 40 PQRRQPKRQAPSTSN--SVGSTQANATSTPFGASSSSEDLRTQIADALFQPVKPTLGQVV 97
PQ P+RQAPST + + G+T N ST G S+S R AL ++G V
Sbjct: 386 PQHEGPRRQAPSTGHAEAQGTTGYNGPSTGAGTSNSDSTSRQPNGTALLL----SIGSTV 441
Query: 98 RQKLSE 103
+KLSE
Sbjct: 442 SRKLSE 447
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.132 0.380
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,131,582,933
Number of Sequences: 23463169
Number of extensions: 117280417
Number of successful extensions: 472889
Number of sequences better than 100.0: 122
Number of HSP's better than 100.0 without gapping: 83
Number of HSP's successfully gapped in prelim test: 39
Number of HSP's that attempted gapping in prelim test: 472713
Number of HSP's gapped (non-prelim): 128
length of query: 219
length of database: 8,064,228,071
effective HSP length: 137
effective length of query: 82
effective length of database: 9,144,741,214
effective search space: 749868779548
effective search space used: 749868779548
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 74 (33.1 bits)