BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019937
(333 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224082810|ref|XP_002306848.1| predicted protein [Populus trichocarpa]
gi|222856297|gb|EEE93844.1| predicted protein [Populus trichocarpa]
Length = 428
Score = 586 bits (1510), Expect = e-165, Method: Compositional matrix adjust.
Identities = 283/329 (86%), Positives = 305/329 (92%), Gaps = 4/329 (1%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
MNLL+KIK QKK EPDK+ EIP+YW ETSDSV+RH+QF+PDG LS+KVVDD+RPVY
Sbjct: 1 MNLLDKIKMQKK-EPDKT---PEIPVYWIETSDSVSRHFQFEPDGQLSMKVVDDARPVYR 56
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
RVV+SFLNKFFPSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQ
Sbjct: 57 RVVESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQ 116
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
AT VSWILKDGMQH GKLICSNLGARMDSEPKRWRILAD LYDLGTGLEVLSPLCP LFL
Sbjct: 117 ATAVSWILKDGMQHAGKLICSNLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCPHLFL 176
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
E+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+GLG GIQLA
Sbjct: 177 EVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVLGLGVGIQLA 236
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
STVCSSMQGK + GPLLS++HV VIEEMRA PVNTLNPQRTAM+VADFVKTG+ISSPAD
Sbjct: 237 STVCSSMQGKFVAGPLLSIVHVCCVIEEMRATPVNTLNPQRTAMVVADFVKTGKISSPAD 296
Query: 301 LRYREDLLFPGRLIEDAGNVKMDRHGIRA 329
LRY EDLLFPGRLIE+AGNVK+ + RA
Sbjct: 297 LRYHEDLLFPGRLIENAGNVKVGQALHRA 325
>gi|297744143|emb|CBI37113.3| unnamed protein product [Vitis vinifera]
Length = 424
Score = 575 bits (1483), Expect = e-162, Method: Compositional matrix adjust.
Identities = 276/318 (86%), Positives = 299/318 (94%), Gaps = 2/318 (0%)
Query: 7 IKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSF 66
+KK+ KKE +++ V + ++W ETSDSV+RHYQF +G LS+KVVDDSR V HRVV+SF
Sbjct: 1 MKKEMKKEQERA--VAQGAVFWIETSDSVSRHYQFHSNGDLSMKVVDDSRTVVHRVVESF 58
Query: 67 LNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSW 126
LNKFFPSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQAT VSW
Sbjct: 59 LNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQATAVSW 118
Query: 127 ILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+LKDGMQHVGKLICSNLGARMDSEPK WRILADALYDLGTGLEVLSPLCP LFLEMAGLG
Sbjct: 119 VLKDGMQHVGKLICSNLGARMDSEPKLWRILADALYDLGTGLEVLSPLCPHLFLEMAGLG 178
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
NFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG+GAGIQLAST+CSS
Sbjct: 179 NFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGMGAGIQLASTICSS 238
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
MQGKMI GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF+KTG+ISSPADLRY+ED
Sbjct: 239 MQGKMIAGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFIKTGKISSPADLRYQED 298
Query: 307 LLFPGRLIEDAGNVKMDR 324
L+F GRLIEDAGNVK+ R
Sbjct: 299 LIFTGRLIEDAGNVKVGR 316
>gi|255567274|ref|XP_002524618.1| conserved hypothetical protein [Ricinus communis]
gi|223536171|gb|EEF37826.1| conserved hypothetical protein [Ricinus communis]
Length = 436
Score = 575 bits (1483), Expect = e-162, Method: Compositional matrix adjust.
Identities = 276/329 (83%), Positives = 304/329 (92%), Gaps = 6/329 (1%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIP-----IYWFETSDSVTRHYQFQPDGHLSVKVVDDS 55
MNLL IK QKK EPDK+ + P ++W ETSDSV+RH+QFQPDG +S+KV+DDS
Sbjct: 1 MNLLNTIKMQKK-EPDKAPLLPPPPLPELPVHWIETSDSVSRHFQFQPDGRISMKVIDDS 59
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
R ++HRVV+SF NKFFPSGYP+SVNEGYLRYTQFRALQH +SAALSVLSTQSLLFAAGLR
Sbjct: 60 RSMFHRVVESFQNKFFPSGYPYSVNEGYLRYTQFRALQHLSSAALSVLSTQSLLFAAGLR 119
Query: 116 PTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
PTPAQAT VSW+LKDGMQHVGKLICSNLGARMDSEPKRWRILAD LYDLGTGLEVLSPLC
Sbjct: 120 PTPAQATAVSWVLKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLEVLSPLC 179
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
P LFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG
Sbjct: 180 PHLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGV 239
Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
GIQLASTVCSS+QGK++VGPLLS++HV+SV+EEMRAAPVNTLNPQRTAMIVAD+VK G+I
Sbjct: 240 GIQLASTVCSSIQGKLVVGPLLSILHVFSVVEEMRAAPVNTLNPQRTAMIVADYVKAGKI 299
Query: 296 SSPADLRYREDLLFPGRLIEDAGNVKMDR 324
SSPADLRYREDL+FPGRLIEDAGNVK+ R
Sbjct: 300 SSPADLRYREDLIFPGRLIEDAGNVKVGR 328
>gi|225438149|ref|XP_002273202.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
Length = 420
Score = 571 bits (1471), Expect = e-160, Method: Compositional matrix adjust.
Identities = 274/313 (87%), Positives = 295/313 (94%), Gaps = 2/313 (0%)
Query: 12 KKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFF 71
KKE +++ V + ++W ETSDSV+RHYQF +G LS+KVVDDSR V HRVV+SFLNKFF
Sbjct: 2 KKEQERA--VAQGAVFWIETSDSVSRHYQFHSNGDLSMKVVDDSRTVVHRVVESFLNKFF 59
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
PSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQAT VSW+LKDG
Sbjct: 60 PSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQATAVSWVLKDG 119
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
MQHVGKLICSNLGARMDSEPK WRILADALYDLGTGLEVLSPLCP LFLEMAGLGNFAKG
Sbjct: 120 MQHVGKLICSNLGARMDSEPKLWRILADALYDLGTGLEVLSPLCPHLFLEMAGLGNFAKG 179
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG+GAGIQLAST+CSSMQGKM
Sbjct: 180 MAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGMGAGIQLASTICSSMQGKM 239
Query: 252 IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPG 311
I GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF+KTG+ISSPADLRY+EDL+F G
Sbjct: 240 IAGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFIKTGKISSPADLRYQEDLIFTG 299
Query: 312 RLIEDAGNVKMDR 324
RLIEDAGNVK+ R
Sbjct: 300 RLIEDAGNVKVGR 312
>gi|334184613|ref|NP_001189649.1| uncharacterized protein [Arabidopsis thaliana]
gi|330253412|gb|AEC08506.1| uncharacterized protein [Arabidopsis thaliana]
Length = 432
Score = 555 bits (1431), Expect = e-156, Method: Compositional matrix adjust.
Identities = 259/324 (79%), Positives = 297/324 (91%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
M LEK+K KK++P S ++ P+YWFETSDSV+ YQFQ DGHLS+KVVDD+RPV
Sbjct: 1 MQFLEKVKLIKKEDPVMLKSPEDFPVYWFETSDSVSHRYQFQSDGHLSMKVVDDARPVPQ 60
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
++V+SFLNKFFPSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQ
Sbjct: 61 KMVESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQ 120
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILAD LYDLGTGLE++SPLCP LFL
Sbjct: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFL 180
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
EMAGLGNFAKG+A VAARATRLPIYSSFAKEGNLSD+FAKGEAISTLFNV G+GAGIQLA
Sbjct: 181 EMAGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLA 240
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
ST+CSSM+GK++VG +LSV+HVYSV+E+MR P+NTLNPQRTA+IVA+F+KTG++ SP D
Sbjct: 241 STICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGKVPSPPD 300
Query: 301 LRYREDLLFPGRLIEDAGNVKMDR 324
LR++EDL+FP R I+DAGNVK+ R
Sbjct: 301 LRFQEDLMFPERPIQDAGNVKVGR 324
>gi|18402585|ref|NP_565718.1| uncharacterized protein [Arabidopsis thaliana]
gi|20197897|gb|AAD20664.2| expressed protein [Arabidopsis thaliana]
gi|21553562|gb|AAM62655.1| unknown [Arabidopsis thaliana]
gi|109134157|gb|ABG25076.1| At2g31190 [Arabidopsis thaliana]
gi|330253411|gb|AEC08505.1| uncharacterized protein [Arabidopsis thaliana]
Length = 433
Score = 552 bits (1422), Expect = e-155, Method: Compositional matrix adjust.
Identities = 258/322 (80%), Positives = 296/322 (91%)
Query: 3 LLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRV 62
L EK+K KK++P S ++ P+YWFETSDSV+ YQFQ DGHLS+KVVDD+RPV ++
Sbjct: 4 LQEKVKLIKKEDPVMLKSPEDFPVYWFETSDSVSHRYQFQSDGHLSMKVVDDARPVPQKM 63
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
V+SFLNKFFPSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQAT
Sbjct: 64 VESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQAT 123
Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILAD LYDLGTGLE++SPLCP LFLEM
Sbjct: 124 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFLEM 183
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
AGLGNFAKG+A VAARATRLPIYSSFAKEGNLSD+FAKGEAISTLFNV G+GAGIQLAST
Sbjct: 184 AGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLAST 243
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
+CSSM+GK++VG +LSV+HVYSV+E+MR P+NTLNPQRTA+IVA+F+KTG++ SP DLR
Sbjct: 244 ICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGKVPSPPDLR 303
Query: 303 YREDLLFPGRLIEDAGNVKMDR 324
++EDL+FP R I+DAGNVK+ R
Sbjct: 304 FQEDLMFPERPIQDAGNVKVGR 325
>gi|449462449|ref|XP_004148953.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 428
Score = 552 bits (1422), Expect = e-155, Method: Compositional matrix adjust.
Identities = 260/324 (80%), Positives = 293/324 (90%), Gaps = 4/324 (1%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
M+LL K + K+P+KS S +P+ W E SDSV+R QFQPDGHLSVK++DDSRP
Sbjct: 1 MDLLNKFSA-RNKDPEKSPS---LPVSWIEVSDSVSRRCQFQPDGHLSVKIIDDSRPAIQ 56
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
R+VDSFLN FFPSGYP+SV+EGYLRYTQFRALQH TSAALSVLSTQSLLFAAGLRPT AQ
Sbjct: 57 RIVDSFLNTFFPSGYPYSVSEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQ 116
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
ATVVSW+LKDGMQHVGKLICSNLGARMDSEPKRWR++AD LYDLG GLEV+SPLCP LFL
Sbjct: 117 ATVVSWVLKDGMQHVGKLICSNLGARMDSEPKRWRVIADVLYDLGAGLEVISPLCPHLFL 176
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
EMAGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG+GAG+QLA
Sbjct: 177 EMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLA 236
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
ST+CSS+QGK++ PLLS++HVY V+E+MRA P+NTLNPQRTAMIVADFVK GRI SPAD
Sbjct: 237 STICSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKAGRIPSPAD 296
Query: 301 LRYREDLLFPGRLIEDAGNVKMDR 324
+RY+EDL+FPGRLIE+AGNVK+ R
Sbjct: 297 IRYQEDLVFPGRLIEEAGNVKVGR 320
>gi|449502048|ref|XP_004161529.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 428
Score = 551 bits (1421), Expect = e-154, Method: Compositional matrix adjust.
Identities = 260/324 (80%), Positives = 293/324 (90%), Gaps = 4/324 (1%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
M+LL K + K+P+KS S +P+ W E SDSV+R QFQPDGHLSVK++DDSRP
Sbjct: 1 MDLLNKFSA-RNKDPEKSPS---LPVSWIEVSDSVSRRCQFQPDGHLSVKIIDDSRPAIQ 56
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
R+VDSFLN FFPSGYP+SV+EGYLRYTQFRALQH TSAALSVLSTQSLLFAAGLRPT AQ
Sbjct: 57 RIVDSFLNTFFPSGYPYSVSEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQ 116
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
ATVVSW+LKDGMQHVGKLICSNLGARMDSEPKRWR++AD LYDLG GLEV+SPLCP LFL
Sbjct: 117 ATVVSWVLKDGMQHVGKLICSNLGARMDSEPKRWRVIADVLYDLGAGLEVISPLCPHLFL 176
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
EMAGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG+GAG+QLA
Sbjct: 177 EMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLA 236
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
ST+CSS+QGK++ PLLS++HVY V+E+MRA P+NTLNPQRTAMIVADFVK GRI SPAD
Sbjct: 237 STICSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKAGRIPSPAD 296
Query: 301 LRYREDLLFPGRLIEDAGNVKMDR 324
+RY+EDL+FPGRLIE+AGNVK+ R
Sbjct: 297 IRYQEDLVFPGRLIEEAGNVKVGR 320
>gi|356545865|ref|XP_003541354.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 431
Score = 543 bits (1399), Expect = e-152, Method: Compositional matrix adjust.
Identities = 257/322 (79%), Positives = 289/322 (89%), Gaps = 3/322 (0%)
Query: 3 LLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRV 62
LEK+K KKK + + P+ WFETS +V+R +QF+P+G LSV +V DSRP+YHR+
Sbjct: 5 FLEKLKIWKKKAEQEE---EGPPLVWFETSATVSRRFQFEPNGQLSVTLVGDSRPLYHRI 61
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
VDSFLNKFFPSGYP+SVNEGYLRYTQFRALQH SAALSVLSTQSLLFAAGLRPTPAQAT
Sbjct: 62 VDSFLNKFFPSGYPYSVNEGYLRYTQFRALQHTASAALSVLSTQSLLFAAGLRPTPAQAT 121
Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
VSW+LKDGMQH+G +ICS LGARMDSEPKRWRILAD LYDLGTGLEVLSPLCPQ FLEM
Sbjct: 122 AVSWVLKDGMQHLGNIICSKLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCPQFFLEM 181
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
AG+GN AKG+++VA+RATRLPIYSSFAKEGNLSDLFA+GEAISTLFNVVG+G GIQLAST
Sbjct: 182 AGIGNLAKGMSIVASRATRLPIYSSFAKEGNLSDLFARGEAISTLFNVVGIGIGIQLAST 241
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
VC+SMQGK++VGPLLS+IH+YSV EEMRA PVNTLNPQRTAMIVADF+K G +SSPADLR
Sbjct: 242 VCASMQGKLVVGPLLSIIHIYSVSEEMRATPVNTLNPQRTAMIVADFLKAGSVSSPADLR 301
Query: 303 YREDLLFPGRLIEDAGNVKMDR 324
YREDLLFP RLIEDAGNV++ R
Sbjct: 302 YREDLLFPRRLIEDAGNVRVGR 323
>gi|356505015|ref|XP_003521288.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 415
Score = 532 bits (1370), Expect = e-149, Method: Compositional matrix adjust.
Identities = 249/306 (81%), Positives = 278/306 (90%)
Query: 19 MSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFS 78
M E P+ WFETSDSV R +QF+PDG LSV +VDDSRP+Y RV SF+NKFFPSGYP+S
Sbjct: 1 MKEGEAPVLWFETSDSVCRRHQFEPDGQLSVVIVDDSRPLYQRVAGSFMNKFFPSGYPYS 60
Query: 79 VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKL 138
VNEGYLRYTQFRA+QH TSAALSVLSTQSLLFAAGLRPTPAQAT VSWILKDGMQHVGKL
Sbjct: 61 VNEGYLRYTQFRAVQHVTSAALSVLSTQSLLFAAGLRPTPAQATAVSWILKDGMQHVGKL 120
Query: 139 ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAAR 198
ICSN GARMDSEPKRWR+LADALYD+GTGLEVLSPLCP LFLEMAGLGNFAKG+AVVAAR
Sbjct: 121 ICSNWGARMDSEPKRWRLLADALYDIGTGLEVLSPLCPHLFLEMAGLGNFAKGMAVVAAR 180
Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLS 258
ATRLPIYSSFAKEGNLSDLFAKGEA STLFNV+G+G GIQLAST+C+SMQGK++ GPLLS
Sbjct: 181 ATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLASTICASMQGKLVAGPLLS 240
Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAG 318
+IH+YSV EEMRA P+NTLNP+RTAM+VADF+K G +SSPADLRYRE+LLF + EDAG
Sbjct: 241 IIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAGIVSSPADLRYRENLLFNVHVKEDAG 300
Query: 319 NVKMDR 324
NV++ +
Sbjct: 301 NVRVGK 306
>gi|255639094|gb|ACU19847.1| unknown [Glycine max]
Length = 302
Score = 531 bits (1367), Expect = e-148, Method: Compositional matrix adjust.
Identities = 249/302 (82%), Positives = 275/302 (91%)
Query: 19 MSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFS 78
M E P+ WFETSDSV R +QF+PDG LSV +VDDSRP+Y RV SF+NKFFPSGYP+S
Sbjct: 1 MKEGEAPVLWFETSDSVCRRHQFEPDGQLSVVIVDDSRPLYQRVAGSFMNKFFPSGYPYS 60
Query: 79 VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKL 138
VNEGYLRYTQFRA+QH TSAALSVLSTQSLLFAAGLRPTPAQAT VSWILKDGMQHVGKL
Sbjct: 61 VNEGYLRYTQFRAVQHVTSAALSVLSTQSLLFAAGLRPTPAQATAVSWILKDGMQHVGKL 120
Query: 139 ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAAR 198
ICSN GARMDSEPKRWR+LADALYD+GTGLEVLSPLCP LFLEMAGLGNFAKG+AVVAAR
Sbjct: 121 ICSNWGARMDSEPKRWRLLADALYDIGTGLEVLSPLCPHLFLEMAGLGNFAKGMAVVAAR 180
Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLS 258
ATRLPIYSSFAKEGNLSDLFAKGEA STLFNV+G+G GIQLAST+C+SMQGK++ GPLLS
Sbjct: 181 ATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLASTICASMQGKLVAGPLLS 240
Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAG 318
+IH+YSV EEMRA P+NTLNP+RTAM+VADF+K G +SSPADLRYRE+LLF + EDAG
Sbjct: 241 IIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAGIVSSPADLRYRENLLFNVHVKEDAG 300
Query: 319 NV 320
NV
Sbjct: 301 NV 302
>gi|388508448|gb|AFK42290.1| unknown [Lotus japonicus]
Length = 414
Score = 521 bits (1343), Expect = e-145, Method: Compositional matrix adjust.
Identities = 244/306 (79%), Positives = 274/306 (89%)
Query: 19 MSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFS 78
M E P++WFETS SV + FQ +G LSV +VDDSRP+Y +V SF+NKFFPSGYP+S
Sbjct: 1 MEKGEGPVFWFETSGSVCHRHHFQSNGQLSVMLVDDSRPLYQKVAGSFMNKFFPSGYPYS 60
Query: 79 VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKL 138
VNEGYLRYTQFR++QH TSAALSVLSTQSLLFAAGLRPTPAQAT VSWILKDGMQHVGKL
Sbjct: 61 VNEGYLRYTQFRSVQHVTSAALSVLSTQSLLFAAGLRPTPAQATAVSWILKDGMQHVGKL 120
Query: 139 ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAAR 198
ICSN GARMDSEPKRWR++AD LYDLGTGLEVLSPLCP LFL+MAGLGNFAKG+AVVAAR
Sbjct: 121 ICSNWGARMDSEPKRWRLIADVLYDLGTGLEVLSPLCPHLFLQMAGLGNFAKGMAVVAAR 180
Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLS 258
ATRLPIYSSFAKEGNLSDLFAKGEA STL NV+G+G GIQLAST+C+S+QGKM+VGPLLS
Sbjct: 181 ATRLPIYSSFAKEGNLSDLFAKGEAFSTLCNVIGIGVGIQLASTICASVQGKMVVGPLLS 240
Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAG 318
++H+YSV EEMRA P+NTLNP+RTAMIV DF+K G +SSPADLRYREDLLF RL EDAG
Sbjct: 241 ILHIYSVSEEMRATPINTLNPRRTAMIVTDFLKAGVVSSPADLRYREDLLFRVRLTEDAG 300
Query: 319 NVKMDR 324
NV++ R
Sbjct: 301 NVRVGR 306
>gi|356572258|ref|XP_003554286.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 419
Score = 521 bits (1342), Expect = e-145, Method: Compositional matrix adjust.
Identities = 243/303 (80%), Positives = 275/303 (90%)
Query: 23 EIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEG 82
E P+ WFETSDSV +QF+PDG LSV +VDDSRP+Y R+ SF+NKFFPSGYP+SVNEG
Sbjct: 9 EAPVLWFETSDSVCCQHQFEPDGRLSVVIVDDSRPLYQRMAGSFMNKFFPSGYPYSVNEG 68
Query: 83 YLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSN 142
YLRYTQFRALQH TSAALSVLSTQSLLFAAGLRPTPAQAT VSWILKDGMQHVGKLICSN
Sbjct: 69 YLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTPAQATAVSWILKDGMQHVGKLICSN 128
Query: 143 LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRL 202
GARMDSEPKRWR+LAD LYD+G GLEVLSPLCP LFLEMAGLGNFAKG+AVVAARATRL
Sbjct: 129 WGARMDSEPKRWRLLADVLYDIGIGLEVLSPLCPHLFLEMAGLGNFAKGMAVVAARATRL 188
Query: 203 PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHV 262
PIYSSFAKEGNLSDLFAKGEA STLFNVVG+G GIQLAST+C+S+QGK++ GPLLS+IH+
Sbjct: 189 PIYSSFAKEGNLSDLFAKGEAFSTLFNVVGIGVGIQLASTICASIQGKLVAGPLLSIIHL 248
Query: 263 YSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKM 322
YSV EEMRA P+NTLNP+RTAM+VADF+K G +SSPADLRYR++LLF ++ ED GNV++
Sbjct: 249 YSVSEEMRATPINTLNPRRTAMVVADFLKAGIVSSPADLRYRDNLLFNVQVKEDTGNVRV 308
Query: 323 DRH 325
++
Sbjct: 309 GKN 311
>gi|297826561|ref|XP_002881163.1| hypothetical protein ARALYDRAFT_482047 [Arabidopsis lyrata subsp.
lyrata]
gi|297327002|gb|EFH57422.1| hypothetical protein ARALYDRAFT_482047 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 520 bits (1339), Expect = e-145, Method: Compositional matrix adjust.
Identities = 249/324 (76%), Positives = 284/324 (87%), Gaps = 13/324 (4%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
M LEK+K KK+EP + ++ P+YWFETSDSV+ YQFQ DGHLS+KVVDD+RPV
Sbjct: 1 MQFLEKVKLIKKEEPLMVKNPEDFPVYWFETSDSVSHRYQFQSDGHLSMKVVDDARPVPQ 60
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
++V+SFLNKFFPSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQ
Sbjct: 61 KMVESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQ 120
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILAD LYDLGTGLE++SPLCP LFL
Sbjct: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFL 180
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
EMAGLGNFAKG+A VAARATRLPIYSSFAKEGNLSD+FAKGEAISTLFNV G+GAGIQLA
Sbjct: 181 EMAGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLA 240
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
ST+CSSM+GK++VG +LSV+HVYSV+E+MR P+NTLNPQRTA+IVA+++K I
Sbjct: 241 STICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANYLKVVSI----- 295
Query: 301 LRYREDLLFPGRLIEDAGNVKMDR 324
+F RLI+DAGNVK+ R
Sbjct: 296 -------IFK-RLIQDAGNVKVGR 311
>gi|115459438|ref|NP_001053319.1| Os04g0517300 [Oryza sativa Japonica Group]
gi|113564890|dbj|BAF15233.1| Os04g0517300 [Oryza sativa Japonica Group]
gi|116310213|emb|CAH67223.1| OSIGBa0145M07.5 [Oryza sativa Indica Group]
gi|215697660|dbj|BAG91654.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 504 bits (1299), Expect = e-140, Method: Compositional matrix adjust.
Identities = 240/324 (74%), Positives = 279/324 (86%), Gaps = 3/324 (0%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
MN+LE+I+ + + P W E S+S++R F G +SVKV+ DSRP++
Sbjct: 1 MNILERIRGGGDRAAVGEGPREPEP--WVEISESISRLCSFDA-GRVSVKVIQDSRPIHD 57
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
+++DSFLNKFFPSGYP+SVNEGYL YT+FRALQHF+SA L VLSTQSLLFAAGLRPTPAQ
Sbjct: 58 KMIDSFLNKFFPSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGLRPTPAQ 117
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
AT VSWILKDGMQH GKLICS +GARMDSEPK WRILAD LYD GT LEV+SPLCPQLFL
Sbjct: 118 ATAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRILADVLYDFGTALEVISPLCPQLFL 177
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
E+AG GNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LA
Sbjct: 178 EVAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLA 237
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
STVCS+ QGK+I GPLLSV+H+Y V++EMRA PVNTLNPQRTAMIVADF+K+G++SSPA+
Sbjct: 238 STVCSTTQGKLIAGPLLSVVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAE 297
Query: 301 LRYREDLLFPGRLIEDAGNVKMDR 324
LRYREDLLFP RLIE+AG+VK+ +
Sbjct: 298 LRYREDLLFPNRLIEEAGSVKIGQ 321
>gi|242076506|ref|XP_002448189.1| hypothetical protein SORBIDRAFT_06g022670 [Sorghum bicolor]
gi|241939372|gb|EES12517.1| hypothetical protein SORBIDRAFT_06g022670 [Sorghum bicolor]
Length = 439
Score = 504 bits (1298), Expect = e-140, Method: Compositional matrix adjust.
Identities = 243/332 (73%), Positives = 281/332 (84%), Gaps = 13/332 (3%)
Query: 1 MNLLEKIK----KQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQP----DGHLSVKVV 52
MN+L +I + EP P+ W E S+SV+R F G +SVKV+
Sbjct: 1 MNILARISGGGDRAAATEPPPPQ-----PMCWVEISESVSRLCSFDAAGSGGGSISVKVI 55
Query: 53 DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
D+RP++ +VVDSFLNKFFPSGYP+SVNEGYL YT+FRALQHF+SA L VLSTQSLLFAA
Sbjct: 56 QDARPIHDKVVDSFLNKFFPSGYPYSVNEGYLTYTRFRALQHFSSAMLHVLSTQSLLFAA 115
Query: 113 GLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLS 172
GLRPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+S
Sbjct: 116 GLRPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVS 175
Query: 173 PLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG 232
PLCPQLFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G
Sbjct: 176 PLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMG 235
Query: 233 LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
+GAGI LASTVCS+ QGK+I GPLLS +H+Y V++EMRA PVNTLNPQRTAMIVADF+K+
Sbjct: 236 IGAGIGLASTVCSTTQGKLIAGPLLSAVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKS 295
Query: 293 GRISSPADLRYREDLLFPGRLIEDAGNVKMDR 324
G++SSPA+LRYREDLLFP R+IE+AG+VK+ +
Sbjct: 296 GKVSSPAELRYREDLLFPNRVIEEAGSVKIGQ 327
>gi|414586417|tpg|DAA36988.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 439
Score = 503 bits (1294), Expect = e-140, Method: Compositional matrix adjust.
Identities = 242/328 (73%), Positives = 282/328 (85%), Gaps = 10/328 (3%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPI--YWFETSDSVTRHYQFQPDGH----LSVKVVDD 54
MN+LE+I+ D++ + + P W E S+SV+R F G +SVKV+ D
Sbjct: 1 MNILERIRGGG----DRAAATERPPRPECWVEISESVSRLCSFDAAGSGSRSISVKVIQD 56
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
SRP++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQSLLFAAGL
Sbjct: 57 SRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL 116
Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
RPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPL
Sbjct: 117 RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPL 176
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
CPQLFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+G
Sbjct: 177 CPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIG 236
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
AGI LASTVCS+ QGK+I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K+G+
Sbjct: 237 AGIGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGK 296
Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKM 322
+SSPA+LRY+EDLLFP R+IE+AG+VK+
Sbjct: 297 VSSPAELRYKEDLLFPNRVIEEAGSVKI 324
>gi|226494744|ref|NP_001146435.1| uncharacterized protein LOC100280018 [Zea mays]
gi|219887187|gb|ACL53968.1| unknown [Zea mays]
Length = 439
Score = 500 bits (1287), Expect = e-139, Method: Compositional matrix adjust.
Identities = 241/328 (73%), Positives = 281/328 (85%), Gaps = 10/328 (3%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPI--YWFETSDSVTRHYQFQPDGH----LSVKVVDD 54
MN+LE+I+ D++ + + P W E S+SV+R F G +SVKV+ D
Sbjct: 1 MNILERIRGGG----DRAAATERPPRPECWVEISESVSRLCSFDAAGSGSRSISVKVIQD 56
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
SRP++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQSLLFAAGL
Sbjct: 57 SRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL 116
Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
RPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK W ILAD LYDLGT LEV+SPL
Sbjct: 117 RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWTILADVLYDLGTALEVVSPL 176
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
CPQLFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+G
Sbjct: 177 CPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIG 236
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
AGI LASTVCS+ QGK+I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K+G+
Sbjct: 237 AGIGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGK 296
Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKM 322
+SSPA+LRY+EDLLFP R+IE+AG+VK+
Sbjct: 297 VSSPAELRYKEDLLFPNRVIEEAGSVKI 324
>gi|357164706|ref|XP_003580140.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
distachyon]
Length = 437
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 234/325 (72%), Positives = 276/325 (84%), Gaps = 5/325 (1%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQP---DGHLSVKVVDDSRP 57
MN+LE+I+ K + P W E S+SV+R F G +SVKV+ DSRP
Sbjct: 1 MNILERIRGGGDKTTGTETPRKLEP--WVEISESVSRLCCFDARSRGGGISVKVIQDSRP 58
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT 117
++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FRALQHF+SA L VLSTQSLLFAAGLRPT
Sbjct: 59 IHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGLRPT 118
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
PAQAT VSWILKDGMQH GKLICS +GARMDSEPK WRI AD LYD GT L+ +SPLCPQ
Sbjct: 119 PAQATAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRIFADVLYDFGTALDFISPLCPQ 178
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
LFLE+AGLGNFAKG+AVVAARATRLPIYS+FAKEGNLSDLFAKGEAISTL NV+G+GAGI
Sbjct: 179 LFLEVAGLGNFAKGMAVVAARATRLPIYSAFAKEGNLSDLFAKGEAISTLSNVMGIGAGI 238
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
L+STVCS+ QGK+I+GPLLS +H+Y V++EMRA P+N+LNPQRTAM+VADF+K+G++SS
Sbjct: 239 GLSSTVCSTTQGKLIIGPLLSAVHLYGVVQEMRAIPINSLNPQRTAMVVADFIKSGKVSS 298
Query: 298 PADLRYREDLLFPGRLIEDAGNVKM 322
PA+LRYREDLLFP RLIE+AG+VK+
Sbjct: 299 PAELRYREDLLFPNRLIEEAGSVKV 323
>gi|38568058|emb|CAE05447.3| OSJNBa0073E02.7 [Oryza sativa Japonica Group]
gi|125549033|gb|EAY94855.1| hypothetical protein OsI_16651 [Oryza sativa Indica Group]
gi|125590999|gb|EAZ31349.1| hypothetical protein OsJ_15471 [Oryza sativa Japonica Group]
Length = 380
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 219/263 (83%), Positives = 245/263 (93%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
++DSFLNKFFPSGYP+SVNEGYL YT+FRALQHF+SA L VLSTQSLLFAAGLRPTPAQA
Sbjct: 1 MIDSFLNKFFPSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGLRPTPAQA 60
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
T VSWILKDGMQH GKLICS +GARMDSEPK WRILAD LYD GT LEV+SPLCPQLFLE
Sbjct: 61 TAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRILADVLYDFGTALEVISPLCPQLFLE 120
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+AG GNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LAS
Sbjct: 121 VAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLAS 180
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
TVCS+ QGK+I GPLLSV+H+Y V++EMRA PVNTLNPQRTAMIVADF+K+G++SSPA+L
Sbjct: 181 TVCSTTQGKLIAGPLLSVVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAEL 240
Query: 302 RYREDLLFPGRLIEDAGNVKMDR 324
RYREDLLFP RLIE+AG+VK+ +
Sbjct: 241 RYREDLLFPNRLIEEAGSVKIGQ 263
>gi|414586418|tpg|DAA36989.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 297
Score = 454 bits (1168), Expect = e-125, Method: Compositional matrix adjust.
Identities = 221/297 (74%), Positives = 252/297 (84%), Gaps = 10/297 (3%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPI--YWFETSDSVTRHYQFQPDGH----LSVKVVDD 54
MN+LE+I+ D++ + + P W E S+SV+R F G +SVKV+ D
Sbjct: 1 MNILERIRGGG----DRAAATERPPRPECWVEISESVSRLCSFDAAGSGSRSISVKVIQD 56
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
SRP++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQSLLFAAGL
Sbjct: 57 SRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL 116
Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
RPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPL
Sbjct: 117 RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPL 176
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
CPQLFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+G
Sbjct: 177 CPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIG 236
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
AGI LASTVCS+ QGK+I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K
Sbjct: 237 AGIGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIK 293
>gi|302756337|ref|XP_002961592.1| hypothetical protein SELMODRAFT_77235 [Selaginella moellendorffii]
gi|300170251|gb|EFJ36852.1| hypothetical protein SELMODRAFT_77235 [Selaginella moellendorffii]
Length = 399
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 196/295 (66%), Positives = 248/295 (84%)
Query: 28 WFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYT 87
W E S++ R Y + DG L +++DD+RP+ H++VD FL++FFP GYP SV EGYL Y+
Sbjct: 5 WRERSENARREYVSKGDGTLQARLLDDNRPLKHKLVDGFLDQFFPVGYPQSVGEGYLTYS 64
Query: 88 QFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARM 147
QFRALQHF+SA LSV STQSL+FAAGLRPTPAQATV+SW+LKDGMQH+GKL+ S +GARM
Sbjct: 65 QFRALQHFSSAILSVFSTQSLVFAAGLRPTPAQATVISWVLKDGMQHIGKLVGSRMGARM 124
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
DSEPKRWRI AD LYD GT LE++SP CPQ FL +AGL N AKG+A+V ARATRLP+Y++
Sbjct: 125 DSEPKRWRIFADVLYDAGTALEIISPACPQYFLGVAGLANLAKGVAMVTARATRLPLYTA 184
Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIE 267
FAKEGNLSDL+AKGEA+STL NV GLG GI+LAS+ C+S+QGK++V PLLSVIH+YSV+E
Sbjct: 185 FAKEGNLSDLYAKGEAMSTLANVAGLGVGIRLASSACTSLQGKLLVAPLLSVIHLYSVME 244
Query: 268 EMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKM 322
EMRA P++TLN QRTAM+VAD+++ GR+ SPA++RYRE L+ P + AG+V++
Sbjct: 245 EMRAVPLDTLNAQRTAMLVADYLELGRVPSPAEIRYRERLILPVKPNVHAGSVRI 299
>gi|302775588|ref|XP_002971211.1| hypothetical protein SELMODRAFT_94812 [Selaginella moellendorffii]
gi|300161193|gb|EFJ27809.1| hypothetical protein SELMODRAFT_94812 [Selaginella moellendorffii]
Length = 399
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 196/295 (66%), Positives = 248/295 (84%)
Query: 28 WFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYT 87
W E S++ R Y + DG L +++DD+RP+ H++VD FL++FFP GYP SV EGYL Y+
Sbjct: 5 WRERSENARREYVSKGDGTLQARLLDDNRPLKHKLVDGFLDQFFPVGYPQSVGEGYLTYS 64
Query: 88 QFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARM 147
QFRALQHF+SA LSV STQSL+FAAGLRPTPAQATV+SW+LKDGMQH+GKL+ S +GARM
Sbjct: 65 QFRALQHFSSAILSVFSTQSLVFAAGLRPTPAQATVISWVLKDGMQHIGKLVGSRMGARM 124
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
DSEPKRWRI AD LYD GT LE++SP CPQ FL +AGL N AKG+A+V ARATRLP+Y++
Sbjct: 125 DSEPKRWRIFADVLYDAGTALEMISPACPQYFLGVAGLANLAKGVAMVTARATRLPLYTA 184
Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIE 267
FAKEGNLSDL+AKGEA+STL NV GLG GI+LAS+ C+S+QGK++V PLLSVIH+YSV+E
Sbjct: 185 FAKEGNLSDLYAKGEAMSTLANVAGLGVGIRLASSACTSLQGKLLVAPLLSVIHLYSVME 244
Query: 268 EMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKM 322
EMRA P++TLN QRTAM+VAD+++ GR+ SPA++RYRE L+ P + AG+V++
Sbjct: 245 EMRAVPLDTLNAQRTAMLVADYLELGRVPSPAEIRYRERLILPVKPNVHAGSVRI 299
>gi|168027023|ref|XP_001766030.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682673|gb|EDQ69089.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 397
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 195/274 (71%), Positives = 234/274 (85%)
Query: 49 VKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSL 108
VKV+ ++RP VDSFL++FFP GYP SV EGYL Y+++RA+QHF SA + VLSTQSL
Sbjct: 9 VKVLKETRPPVQIAVDSFLSRFFPLGYPNSVAEGYLDYSKYRAVQHFASAVMHVLSTQSL 68
Query: 109 LFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGL 168
LFAAGLRPTPAQAT+VSW+LKDGMQHVGKL+CS +GARMDSEPKRWRI AD +YD+G GL
Sbjct: 69 LFAAGLRPTPAQATIVSWVLKDGMQHVGKLLCSRMGARMDSEPKRWRIFADFMYDVGAGL 128
Query: 169 EVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
EV+SPLCPQ FL +AG+ N AKG+A+V+ARATRLP+YSSF KEGNLSDL+AKGEAISTL
Sbjct: 129 EVISPLCPQHFLPVAGMANLAKGMALVSARATRLPVYSSFVKEGNLSDLYAKGEAISTLS 188
Query: 229 NVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
NV+GLG GI LAS S++QGK+I P+LS IH+Y V +EMRAAP+NTLN QRTAM+VAD
Sbjct: 189 NVLGLGVGIHLASNFASTIQGKLIWAPILSAIHLYCVQQEMRAAPINTLNSQRTAMLVAD 248
Query: 289 FVKTGRISSPADLRYREDLLFPGRLIEDAGNVKM 322
FVKTG +SSPA+LRYRE L+ P L +DAGNV++
Sbjct: 249 FVKTGSVSSPAELRYRERLILPVGLEKDAGNVRV 282
>gi|414586414|tpg|DAA36985.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
Length = 288
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/269 (72%), Positives = 225/269 (83%), Gaps = 10/269 (3%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPI--YWFETSDSVTRHYQFQPDGH----LSVKVVDD 54
MN+LE+I+ D++ + + P W E S+SV+R F G +SVKV+ D
Sbjct: 1 MNILERIRGGG----DRAAATERPPRPECWVEISESVSRLCSFDAAGSGSRSISVKVIQD 56
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
SRP++ ++VDSFLNK FPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQSLLFAAGL
Sbjct: 57 SRPIHDKLVDSFLNKIFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL 116
Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
RPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPL
Sbjct: 117 RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPL 176
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
CPQLFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+G
Sbjct: 177 CPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIG 236
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVY 263
AGI LASTVCS+ QGK+I GPLLS +H++
Sbjct: 237 AGIGLASTVCSTTQGKLIAGPLLSAVHIW 265
>gi|414586420|tpg|DAA36991.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 306
Score = 342 bits (877), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 158/191 (82%), Positives = 180/191 (94%)
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
MQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPLCPQLFLE+AGLGNFAKG
Sbjct: 1 MQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKG 60
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LASTVCS+ QGK+
Sbjct: 61 MAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKL 120
Query: 252 IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPG 311
I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K+G++SSPA+LRY+EDLLFP
Sbjct: 121 IAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAELRYKEDLLFPN 180
Query: 312 RLIEDAGNVKM 322
R+IE+AG+VK+
Sbjct: 181 RVIEEAGSVKI 191
>gi|414586419|tpg|DAA36990.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 311
Score = 293 bits (750), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 145/204 (71%), Positives = 167/204 (81%), Gaps = 18/204 (8%)
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
A V WIL Q VG + N AD LYDLGT LEV+SPLCPQLFL
Sbjct: 13 AWVQEWILN---QRVGGYLVRN---------------ADVLYDLGTALEVVSPLCPQLFL 54
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
E+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LA
Sbjct: 55 EVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLA 114
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
STVCS+ QGK+I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K+G++SSPA+
Sbjct: 115 STVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAE 174
Query: 301 LRYREDLLFPGRLIEDAGNVKMDR 324
LRY+EDLLFP R+IE+AG+VK+ +
Sbjct: 175 LRYKEDLLFPNRVIEEAGSVKIGQ 198
>gi|414586421|tpg|DAA36992.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 164
Score = 293 bits (750), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 137/160 (85%), Positives = 150/160 (93%)
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
MQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPLCPQLFLE+AGLGNFAKG
Sbjct: 1 MQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKG 60
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LASTVCS+ QGK+
Sbjct: 61 MAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKL 120
Query: 252 IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K
Sbjct: 121 IAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIK 160
>gi|149392593|gb|ABR26099.1| unknown [Oryza sativa Indica Group]
Length = 282
Score = 291 bits (744), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 135/166 (81%), Positives = 155/166 (93%)
Query: 159 DALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLF 218
D LYD GT LEV+SPLCPQLFLE+AG GNFAKG+AVVAARATRLPIYSSFAKEGNLSDLF
Sbjct: 1 DVLYDFGTALEVISPLCPQLFLEVAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLF 60
Query: 219 AKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLN 278
AKGEAISTLFNV+G+GAGI LASTVCS+ QGK+I GPLLSV+H+Y V++EMRA PVNTLN
Sbjct: 61 AKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAGPLLSVVHIYGVVQEMRATPVNTLN 120
Query: 279 PQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKMDR 324
PQRTAMIVADF+K+G++SSPA+LRYREDLLFP RLIE+AG+VK+ +
Sbjct: 121 PQRTAMIVADFIKSGKVSSPAELRYREDLLFPNRLIEEAGSVKIGQ 166
>gi|413942077|gb|AFW74726.1| hypothetical protein ZEAMMB73_469739 [Zea mays]
Length = 425
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 159/302 (52%), Positives = 184/302 (60%), Gaps = 79/302 (26%)
Query: 48 SVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQS 107
+VKV+ DSRP++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQ
Sbjct: 57 TVKVIQDSRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQL 116
Query: 108 LL-----------------FAAGLRPTPAQATVVSWILK--------------------- 129
+ +A L +V++ +
Sbjct: 117 VFQNRLLSVLSSLTSAHFNYARQLTAQVGCGSVLAVCCRFATYPDASNCCKLGKHMMTEM 176
Query: 130 --------DGMQHVGKLICSNLGARMDSEPKRWRIL------------------------ 157
DGMQH KLICS++GARMDSEPK WRIL
Sbjct: 177 LKSKQILKDGMQHTRKLICSSMGARMDSEPKSWRILGLTAYVLYGCHCLGSHFTIFPIIC 236
Query: 158 -------ADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAK 210
AD LYDLGT LEV+SPLCPQLFLE+AGLGNFAKG+ VVAARATRLPIYSSFAK
Sbjct: 237 LYCSILTADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMVVVAARATRLPIYSSFAK 296
Query: 211 EGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMR 270
EGNLSDLFAKGEAISTLFNV+G+GAGI LASTVCS+ QGK + PLL SV E +
Sbjct: 297 EGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKEV--PLLEKYKCRSVNEFVD 354
Query: 271 AA 272
A
Sbjct: 355 AC 356
>gi|413942078|gb|AFW74727.1| hypothetical protein ZEAMMB73_469739, partial [Zea mays]
Length = 199
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 116/143 (81%), Positives = 129/143 (90%)
Query: 48 SVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQS 107
+VKV+ DSRP++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQS
Sbjct: 57 TVKVIQDSRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQS 116
Query: 108 LLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTG 167
LLFA GLRPTP QAT VSWILKDGMQH KLICS++GARMDSEPK WRILAD LYDLGT
Sbjct: 117 LLFAVGLRPTPTQATAVSWILKDGMQHTRKLICSSMGARMDSEPKSWRILADVLYDLGTA 176
Query: 168 LEVLSPLCPQLFLEMAGLGNFAK 190
LEV+SPLCPQLFLE+AGLGNFAK
Sbjct: 177 LEVVSPLCPQLFLEVAGLGNFAK 199
>gi|414586416|tpg|DAA36987.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
Length = 155
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 112/132 (84%), Positives = 124/132 (93%)
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
MQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPLCPQLFLE+AGLGNFAKG
Sbjct: 1 MQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKG 60
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LASTVCS+ QGK+
Sbjct: 61 MAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKL 120
Query: 252 IVGPLLSVIHVY 263
I GPLLS +H++
Sbjct: 121 IAGPLLSAVHIW 132
>gi|356510851|ref|XP_003524147.1| PREDICTED: UPF0420 protein-like, partial [Glycine max]
Length = 158
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 107/123 (86%), Positives = 115/123 (93%)
Query: 128 LKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
LKDGMQHVGKLICSN G MDSEPKRWR+LADALYD+GTGLEVLSP CP LFLEMAGLGN
Sbjct: 1 LKDGMQHVGKLICSNWGGTMDSEPKRWRLLADALYDIGTGLEVLSPRCPHLFLEMAGLGN 60
Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
FAKG++VVAARATRLPIYSSFAKEGNLSDL AKGEA STLFNV+G+G GIQLAST+C+SM
Sbjct: 61 FAKGMSVVAARATRLPIYSSFAKEGNLSDLLAKGEAFSTLFNVIGIGVGIQLASTICASM 120
Query: 248 QGK 250
QGK
Sbjct: 121 QGK 123
>gi|356498113|ref|XP_003517898.1| PREDICTED: UPF0420 protein-like, partial [Glycine max]
Length = 171
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 106/126 (84%), Positives = 117/126 (92%)
Query: 128 LKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
LK+GMQHVGKLI SN G MD EPKRWR+LADALYD+GTGLEVLSP CP LFLEMAGLGN
Sbjct: 1 LKNGMQHVGKLIGSNWGGTMDFEPKRWRLLADALYDIGTGLEVLSPWCPHLFLEMAGLGN 60
Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
FAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEA STLFNV+G+G GIQLAST+C+SM
Sbjct: 61 FAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLASTICASM 120
Query: 248 QGKMIV 253
QGK+++
Sbjct: 121 QGKVVI 126
>gi|414586415|tpg|DAA36986.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
Length = 163
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 105/165 (63%), Positives = 126/165 (76%), Gaps = 10/165 (6%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPI--YWFETSDSVTRHYQFQPDGH----LSVKVVDD 54
MN+LE+I+ D++ + + P W E S+SV+R F G +SVKV+ D
Sbjct: 1 MNILERIRGGG----DRAAATERPPRPECWVEISESVSRLCSFDAAGSGSRSISVKVIQD 56
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
SRP++ ++VDSFLNK FPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQSLLFAAGL
Sbjct: 57 SRPIHDKLVDSFLNKIFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL 116
Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILAD 159
RPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK WRIL +
Sbjct: 117 RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILGE 161
>gi|452824810|gb|EME31810.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
Length = 636
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 96/275 (34%), Positives = 159/275 (57%), Gaps = 4/275 (1%)
Query: 39 YQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSA 98
YQF D + DD RP++ +++D + P GYP SV + YL Y+ +RA+Q+ ++
Sbjct: 249 YQFSDDMIQEGRFKDD-RPIFEKLIDWLSRVYLPDGYPNSVTKDYLSYSIWRAIQNLAAS 307
Query: 99 ALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWR 155
+ V +T++LLF GL + TPA + +SW+LKDG+ ++GK+ ++ G + D +PK WR
Sbjct: 308 VMGVFATEALLFGLGLGRKSTPATSAAISWVLKDGLGYIGKVFYGSIAGNQFDVDPKSWR 367
Query: 156 ILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLS 215
+ ADA+ D G LE+++PL P FL +A + N KG+A + ATR IY S A N
Sbjct: 368 LAADAVEDAGGVLEIITPLFPGHFLLLASVANAMKGVAAMTGTATRHAIYKSLALHENQG 427
Query: 216 DLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVN 275
D+ KGE+ +VGLG GI +S + + + L +H+++ + +
Sbjct: 428 DIATKGESQGVTCKMVGLGLGIVASSKMGQNYFALLSAYALGCFVHLFANWKSLSCVQFA 487
Query: 276 TLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
TLN QR ++ + +F++TG + SP ++ RE ++ P
Sbjct: 488 TLNRQRCSLALQEFLQTGNVPSPYEVSRREKVVVP 522
>gi|449018781|dbj|BAM82183.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 631
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 94/225 (41%), Positives = 137/225 (60%), Gaps = 5/225 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILK 129
F P G+P SV YLRY+ +R Q+ SA +VLST +LL A GL TP A SW+LK
Sbjct: 242 FLPEGFPESVGPSYLRYSLWRGAQNVVSAMTAVLSTHALLAAVGLSSTPV-AAATSWVLK 300
Query: 130 DGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFA 189
DG+ +GKL+ + G D++PKR+R+ +D LYD G LE+L+PL PQ FL +A LGNF
Sbjct: 301 DGIGQLGKLLTARRGRDFDADPKRYRLSSDLLYDAGLSLEILTPLFPQYFLALAALGNFT 360
Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
K +A+ A R + SSF GNL D+ AK +A + + N+ G+G GI A + +
Sbjct: 361 KSVAITVGMACRNSVLSSFVLRGNLGDISAKNDAQNVVTNLTGMGLGIVTARAL--PPKP 418
Query: 250 KMIVGPLLSVIHVYSVI--EEMRAAPVNTLNPQRTAMIVADFVKT 292
+ +G ++ +YSV+ + M+A +N +N QR A+I FV++
Sbjct: 419 SVRLGAFAALTALYSVLNYQSMKAVKLNVVNRQRGAIIADSFVRS 463
>gi|452823212|gb|EME30224.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
Length = 849
Score = 167 bits (422), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 90/246 (36%), Positives = 145/246 (58%), Gaps = 7/246 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP-AQATVVSWIL 128
FFP G V++ Y +T +R Q S+ + V TQ+LL A G++ +A SW+L
Sbjct: 422 FFPEG----VSQDYYSFTMWRVFQRLVSSTVGVFGTQALLLALGIKAGRIGRAAATSWVL 477
Query: 129 KDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
KDG+ +GK++ S +G D++PKRWR + LY LG GLE+++ + P FL A L N
Sbjct: 478 KDGLGRLGKMVWASQMGRDFDADPKRWRFRSALLYALGNGLEIVTQIFPASFLLFATLAN 537
Query: 188 FAKGIAVVAARATRLPIYSSFA-KEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
K I+++ A ATR +Y +FA + N++D+ AKGEA + +++G+ GIQL+ + +S
Sbjct: 538 SMKQISMLTASATRNAMYKNFAGRSENIADITAKGEAQIVVADLLGMALGIQLSKLIGTS 597
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ LSV+ +Y+ +E+RA TLN +R++MIV FV+ G + P ++ RE+
Sbjct: 598 RPNVLTAYSFLSVLDIYAFYKELRAVQFRTLNYERSSMIVDYFVRRGYVPRPDEVSQREN 657
Query: 307 LLFPGR 312
+ R
Sbjct: 658 IFLGPR 663
>gi|449016609|dbj|BAM80011.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 496
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 96/276 (34%), Positives = 149/276 (53%), Gaps = 3/276 (1%)
Query: 37 RHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFT 96
R Y+ GHL V V R + D F F P GYP SV Y YT++R F
Sbjct: 91 REYRINSAGHLEVAEVRTRRAWGLKAADRFSRMFLPDGYPDSVGPDYAEYTKWRGRAFFF 150
Query: 97 SAALSVLSTQSLLFAAGL-RPTPAQ-ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRW 154
+ V STQSLL A G+ RP+ A A + W+++DG+ G+++ S +G D+E K++
Sbjct: 151 GGMVGVFSTQSLLLAVGVGRPSAAPIAAALQWVIRDGLGRAGRMLFSQVGTGFDAETKQY 210
Query: 155 RILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNL 214
R+ A + ++ LE L+PL P LFL +A L N AKG + VAA +TR IY SF + NL
Sbjct: 211 RLAAAFVLNVSCALEALTPLFPHLFLPLASLANMAKGASTVAAASTRGAIYRSFMRRENL 270
Query: 215 SDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPV 274
D+ AK E + +++G G+ L+ S + +I S+ H+ S E+++ +
Sbjct: 271 GDITAKQETVGVAGDLLGTAVGVILSRASSHSHRLSVIAFIGASLGHLISAYYEVKSVEL 330
Query: 275 NTLNPQRTAMIVADFVK-TGRISSPADLRYREDLLF 309
TLN QR M++ ++++ R+ D+ RE L++
Sbjct: 331 RTLNRQRAHMLIWNYLEDQDRVLPIRDVNRRERLVY 366
>gi|449018940|dbj|BAM82342.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 738
Score = 160 bits (406), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 88/239 (36%), Positives = 144/239 (60%), Gaps = 4/239 (1%)
Query: 76 PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP--AQATVVSWILKDGMQ 133
P V Y + ++R Q S+ + V TQ+LL A G++ +QA +SW+LKDG+
Sbjct: 380 PEDVTADYYEFAKWRVFQRLMSSTVGVFGTQALLLALGIKSGNKISQAATISWVLKDGLS 439
Query: 134 HVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
VGK++ S LG MD++PKR+R + LY LG GLE+ + + PQ FL +A L N AK
Sbjct: 440 RVGKMLWASQLGRDMDADPKRFRFGSALLYSLGNGLEIATRIVPQSFLLVATLANTAKLC 499
Query: 193 AVVAARATRLPIYSSFA-KEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
+++ A ATR +Y SFA + N++D+ AKGEA T+ ++ G+ GIQL+ + +S +
Sbjct: 500 SMLTASATRNAMYRSFADRNENIADITAKGEAQITIADLGGMALGIQLSKIIGTSRPKVV 559
Query: 252 IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
+LS I +Y++ +E+R TLN QR+++++ ++KTG + +P + E ++ P
Sbjct: 560 ATYLILSTIDLYAIWKELRVIEFRTLNLQRSSLVIEHYLKTGCVPTPKQVSAAERVVLP 618
>gi|242055389|ref|XP_002456840.1| hypothetical protein SORBIDRAFT_03g043876 [Sorghum bicolor]
gi|241928815|gb|EES01960.1| hypothetical protein SORBIDRAFT_03g043876 [Sorghum bicolor]
Length = 569
Score = 160 bits (406), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 85/252 (33%), Positives = 144/252 (57%), Gaps = 1/252 (0%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
+ R D + P GYP SV+ Y+ Y+ +R +Q S VLSTQ+LL+A GL R
Sbjct: 161 WRRCADVAVQLLLPDGYPHSVSSDYMNYSLWRGVQGVASQISGVLSTQALLYAVGLGRGA 220
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A V+W+LKDG+ ++ K++ S G D PK WR+ AD L ++ GLE+L+P+ P
Sbjct: 221 IPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPH 280
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
LF+ + ++ A + ATR Y+ FA + N +++ AKGEA + VG+G GI
Sbjct: 281 LFVPIGAAAGASRSAASLIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGIGLGI 340
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
LA+ + SS+ +I ++ +H+Y ++ ++ + TLNP R +++ ++++ +G++ S
Sbjct: 341 ALANQIGSSVPLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPS 400
Query: 298 PADLRYREDLLF 309
++ E L F
Sbjct: 401 VKEVNDEEPLFF 412
>gi|357167308|ref|XP_003581100.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
distachyon]
Length = 604
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 87/252 (34%), Positives = 144/252 (57%), Gaps = 1/252 (0%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
+ R D L P GYP SV+ YL Y+ +R LQ S VLSTQ+LL+A GL +
Sbjct: 195 WRRCTDVALQLLLPDGYPNSVSSDYLHYSLWRGLQGVASQISGVLSTQALLYAVGLGKGV 254
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A ++W+LKDG+ ++ K++ S G D PKRWR+ AD L + GLE+L+P+ P
Sbjct: 255 IPTAAAINWVLKDGLGYLSKILLSKFGRHFDVNPKRWRLFADLLENTAYGLEILTPVFPH 314
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
LF+ + A+ A + ATR Y+ FA + N +++ AKGEA + +G+ GI
Sbjct: 315 LFVPIGAAAGAARSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFLGIMLGI 374
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
LA+ + SS+ +I ++V+H+Y ++ ++ + TLNP R +++ ++++ +G++SS
Sbjct: 375 ALANQIGSSVPLALISFAGVTVVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVSS 434
Query: 298 PADLRYREDLLF 309
++ E L
Sbjct: 435 VKEVNDEEPLFL 446
>gi|226491780|ref|NP_001140263.1| uncharacterized protein LOC100272306 [Zea mays]
gi|194698746|gb|ACF83457.1| unknown [Zea mays]
Length = 451
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 84/252 (33%), Positives = 144/252 (57%), Gaps = 1/252 (0%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
+ R + ++ P GYP SV+ YL Y+ +R +Q S VLSTQ+LL+A GL +
Sbjct: 44 WRRYAEVAVHLLLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGLGKGA 103
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A V+W+LKDG+ ++ K++ S G D PK WR+ AD L ++ GLE+L+P+ P
Sbjct: 104 IPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPH 163
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
LF+ + + A + ATR Y+ FA + N +++ AKGEA + VG+G GI
Sbjct: 164 LFVPIGAAAGAGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGIGLGI 223
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
LA+ + SS+ +I ++ +H+Y ++ ++ + TLNP R +++ ++++ +G++ S
Sbjct: 224 ALANHIGSSVPLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPS 283
Query: 298 PADLRYREDLLF 309
++ E L F
Sbjct: 284 VKEVNDEEPLFF 295
>gi|414879009|tpg|DAA56140.1| TPA: hypothetical protein ZEAMMB73_060175 [Zea mays]
Length = 593
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 84/252 (33%), Positives = 144/252 (57%), Gaps = 1/252 (0%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
+ R + ++ P GYP SV+ YL Y+ +R +Q S VLSTQ+LL+A GL +
Sbjct: 186 WRRYAEVAVHLLLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGLGKGA 245
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A V+W+LKDG+ ++ K++ S G D PK WR+ AD L ++ GLE+L+P+ P
Sbjct: 246 IPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPH 305
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
LF+ + + A + ATR Y+ FA + N +++ AKGEA + VG+G GI
Sbjct: 306 LFVPIGAAAGAGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGIGLGI 365
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
LA+ + SS+ +I ++ +H+Y ++ ++ + TLNP R +++ ++++ +G++ S
Sbjct: 366 ALANHIGSSVPLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPS 425
Query: 298 PADLRYREDLLF 309
++ E L F
Sbjct: 426 VKEVNDEEPLFF 437
>gi|302771033|ref|XP_002968935.1| hypothetical protein SELMODRAFT_31929 [Selaginella moellendorffii]
gi|300163440|gb|EFJ30051.1| hypothetical protein SELMODRAFT_31929 [Selaginella moellendorffii]
Length = 389
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 88/251 (35%), Positives = 138/251 (54%), Gaps = 15/251 (5%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQA 121
+D+ P GYP +V++ Y YT +R Q S VL+TQ+LL+A GL + A
Sbjct: 5 LDALKTTMLPIGYPKTVSDDYTEYTLWRMGQIIASQISGVLTTQALLYAVGLGKGAIPTA 64
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
++W+LKDG+ ++ K++ S G D PK WR+L+D + D GLE+L+P P LF+
Sbjct: 65 AAINWVLKDGIGYLSKIVLSKYGRHFDVHPKGWRLLSDLIEDGACGLELLTPTFPHLFVP 124
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+ + K A + ATR Y+ FA + N +++ AKGEA + +G+ GI ++S
Sbjct: 125 LTAVAGAGKSAAGLIQAATRSCFYAGFATQNNFAEVIAKGEAQGMVSKSLGIALGIGISS 184
Query: 242 TVCSSMQGKMIVGPLL-------SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
V SS GPLL + +HV ++ +A ++TLNP RT +I+A+++ T R
Sbjct: 185 KVGSS-------GPLLVLSFGTVTAVHVLCNLKSYQAVQLHTLNPYRTGLIMAEYLTTQR 237
Query: 295 ISSPADLRYRE 305
I S D+ E
Sbjct: 238 IPSVRDVNSVE 248
>gi|302816607|ref|XP_002989982.1| hypothetical protein SELMODRAFT_130761 [Selaginella moellendorffii]
gi|300142293|gb|EFJ08995.1| hypothetical protein SELMODRAFT_130761 [Selaginella moellendorffii]
Length = 440
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 88/251 (35%), Positives = 138/251 (54%), Gaps = 15/251 (5%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQA 121
+D+ P GYP +V++ Y YT +R Q S VL+TQ+LL+A GL + A
Sbjct: 46 LDALKTTMLPIGYPKTVSDDYTEYTLWRMGQIIASQISGVLTTQALLYAVGLGKGAIPTA 105
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
++W+LKDG+ ++ K++ S G D PK WR+L+D + D GLE+L+P P LF+
Sbjct: 106 AAINWVLKDGIGYLSKIVLSKYGRHFDVHPKGWRLLSDLIEDGACGLELLTPTFPHLFVP 165
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+ + K A + ATR Y+ FA + N +++ AKGEA + +G+ GI ++S
Sbjct: 166 LTAVAGAGKSAAGLIQAATRSCFYAGFATQNNFAEVIAKGEAQGMVSKSLGIALGIGISS 225
Query: 242 TVCSSMQGKMIVGPLL-------SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
V SS GPLL + +HV ++ +A ++TLNP RT +I+A+++ T R
Sbjct: 226 KVGSS-------GPLLVLSFGTVTAVHVLCNLKSYQAVQLHTLNPYRTGLIMAEYLTTQR 278
Query: 295 ISSPADLRYRE 305
I S D+ E
Sbjct: 279 IPSVRDVNSVE 289
>gi|449431896|ref|XP_004133736.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
gi|449478104|ref|XP_004155224.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 507
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 92/253 (36%), Positives = 144/253 (56%), Gaps = 14/253 (5%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQ 120
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +TQ+LL + G+ + TP
Sbjct: 118 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVARNKATPG- 176
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
A ++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + P LFL
Sbjct: 177 AIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAAPHLFL 236
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+A N AK ++ V + +TR PIY +FAK N+ D+ AKGE + + +++G G I +
Sbjct: 237 PLACAANVAKNVSAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI- 295
Query: 241 STVCSSMQGKMIVGP--LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
S + +V LLS +++S +E+R+ ++TLN R + V F+KTGR+ S
Sbjct: 296 -----SKRNPSLVTTFGLLSCGYIFSSYQEVRSVVLHTLNRARFNVAVETFLKTGRVPSL 350
Query: 299 ADLRYREDLL-FP 310
E +L FP
Sbjct: 351 QKGNMNERILSFP 363
>gi|8978258|dbj|BAA98149.1| unnamed protein product [Arabidopsis thaliana]
Length = 407
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 93/252 (36%), Positives = 143/252 (56%), Gaps = 8/252 (3%)
Query: 63 VDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
V SFL + P G+P SVNE Y+ Y +RAL+HF A+ V +TQ+LL + G + +
Sbjct: 14 VGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSSAS 73
Query: 122 TV--VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + P LF
Sbjct: 74 AAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 133
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
L +A N K +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G I L
Sbjct: 134 LPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSI-L 192
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
S S+ + LLS ++ S +E+R+ ++TLN R + V F+KTGR+ S
Sbjct: 193 ISKRNPSL---VTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQ 249
Query: 300 DLRYREDLL-FP 310
+ +E + FP
Sbjct: 250 EGNIQEKIFTFP 261
>gi|297795749|ref|XP_002865759.1| EMB1879 [Arabidopsis lyrata subsp. lyrata]
gi|297311594|gb|EFH42018.1| EMB1879 [Arabidopsis lyrata subsp. lyrata]
Length = 499
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 93/252 (36%), Positives = 143/252 (56%), Gaps = 8/252 (3%)
Query: 63 VDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
V SFL + P G+P SVNE Y+ Y +RAL+HF A+ V +TQ+LL + G + +
Sbjct: 107 VGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSSAS 166
Query: 122 TV--VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + P LF
Sbjct: 167 AAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 226
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
L +A N K +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G I L
Sbjct: 227 LPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSI-L 285
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
S S+ + LLS ++ S +E+R+ ++TLN R + V F+KTGR+ S
Sbjct: 286 ISKRNPSL---VTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQ 342
Query: 300 DLRYREDLL-FP 310
+ +E + FP
Sbjct: 343 EGNIQEKIFTFP 354
>gi|18423070|ref|NP_568713.1| uncharacterized protein [Arabidopsis thaliana]
gi|16604643|gb|AAL24114.1| unknown protein [Arabidopsis thaliana]
gi|23296703|gb|AAN13151.1| unknown protein [Arabidopsis thaliana]
gi|332008476|gb|AED95859.1| uncharacterized protein [Arabidopsis thaliana]
Length = 497
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 93/252 (36%), Positives = 143/252 (56%), Gaps = 8/252 (3%)
Query: 63 VDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
V SFL + P G+P SVNE Y+ Y +RAL+HF A+ V +TQ+LL + G + +
Sbjct: 104 VGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSSAS 163
Query: 122 TV--VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + P LF
Sbjct: 164 AAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 223
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
L +A N K +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G I L
Sbjct: 224 LPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSI-L 282
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
S S+ + LLS ++ S +E+R+ ++TLN R + V F+KTGR+ S
Sbjct: 283 ISKRNPSL---VTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQ 339
Query: 300 DLRYREDLL-FP 310
+ +E + FP
Sbjct: 340 EGNIQEKIFTFP 351
>gi|326490149|dbj|BAJ94148.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 519
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/249 (35%), Positives = 142/249 (57%), Gaps = 7/249 (2%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
SF+ + P G+P SV Y+ Y +RAL+HF A+SV +T++LL + G+ + A A
Sbjct: 123 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQSRATSGA 182
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
++WIL+DG VGK++ + G + D + K+ R D + +LG G+E+ + PQLFL
Sbjct: 183 VAINWILRDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLMMELGAGIELATAAFPQLFLP 242
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+A + N K +A V + +TR PIY ++AK N+ D+ AKGE++ + +++G G I L S
Sbjct: 243 LACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSI-LMS 301
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
S+ V LS ++ S +E+R+ +NTLN R + V F+KTG + S +
Sbjct: 302 KRNPSLVASFAV---LSCGYLLSSYQEVRSVVLNTLNTARFTVAVDSFIKTGHVPSLKEG 358
Query: 302 RYREDLLFP 310
E + P
Sbjct: 359 NLEETIFNP 367
>gi|226492898|ref|NP_001145431.1| uncharacterized protein LOC100278802 [Zea mays]
gi|195656119|gb|ACG47527.1| hypothetical protein [Zea mays]
gi|224035549|gb|ACN36850.1| unknown [Zea mays]
gi|413945393|gb|AFW78042.1| hypothetical protein ZEAMMB73_132907 [Zea mays]
Length = 511
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 87/249 (34%), Positives = 141/249 (56%), Gaps = 7/249 (2%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + A A
Sbjct: 119 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVSQSRAASGA 178
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + PQLFL
Sbjct: 179 VAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQLFLP 238
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
MA + N K +A V + +TR PIY ++AK N+ D+ AKGE++ + +++G G I ++
Sbjct: 239 MACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSILISK 298
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
+ S + LS ++ S E+R+ +NTLN R + V F++TG + S D
Sbjct: 299 SNPSLVTSFA----FLSCGYLLSSYHEVRSVVLNTLNRARFTVAVDSFIRTGYVPSLKDG 354
Query: 302 RYREDLLFP 310
+E + P
Sbjct: 355 NSQETIFNP 363
>gi|242090561|ref|XP_002441113.1| hypothetical protein SORBIDRAFT_09g020620 [Sorghum bicolor]
gi|241946398|gb|EES19543.1| hypothetical protein SORBIDRAFT_09g020620 [Sorghum bicolor]
Length = 517
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 86/249 (34%), Positives = 142/249 (57%), Gaps = 7/249 (2%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
SF+ + P G+P SV Y+ Y +RA++HF A+ V +T++LL + G+ + A A
Sbjct: 125 SFIRSYVVPEGFPESVTPSYVPYMTWRAMKHFFGGAMGVFTTRTLLNSVGVSQSRAASGA 184
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + PQLFL
Sbjct: 185 VAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQLFLP 244
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
MA + N K +A V + +TR PIY ++AK N+ D+ AKGE++ + +++G G I ++
Sbjct: 245 MACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSILISK 304
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
+ S + + LS ++ S E+R+ +NTLN R + V F++TG + S D
Sbjct: 305 SNPSLVTSFAV----LSCGYLLSSYYEVRSVVLNTLNRARFTVAVDSFIRTGYVPSLKDG 360
Query: 302 RYREDLLFP 310
+E + P
Sbjct: 361 NSQETIFNP 369
>gi|302764074|ref|XP_002965458.1| hypothetical protein SELMODRAFT_230646 [Selaginella moellendorffii]
gi|300166272|gb|EFJ32878.1| hypothetical protein SELMODRAFT_230646 [Selaginella moellendorffii]
Length = 430
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 85/250 (34%), Positives = 135/250 (54%), Gaps = 10/250 (4%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRP-TPAQ 120
++D + P G+P SV Y Y Q+R LQ+F A+ V +T+SLL A G+R +
Sbjct: 63 IMDFVRSHVVPEGFPDSVLPCYTPYMQWRCLQYFFGGAMGVFTTRSLLHALGVRNGASSS 122
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
A V+W++KDG +GK+I + G + D K+ R D L L LE+ + PQ FL
Sbjct: 123 AIAVNWVVKDGAGRIGKMIFARHGKKFDCHLKQIRFAGDVLMQLAAALELATSATPQFFL 182
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+A + N K +A VA+ +TR PIY +FA+ N+ D+ AKGE I + +++G G GI ++
Sbjct: 183 PLACIANIGKNVAAVASTSTRAPIYKAFARRENIGDITAKGECIGNIADLLGTGMGILMS 242
Query: 241 STVCSSMQGKMIVGPL--LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
+ P L+ +VYS E++A + TLN R + V F++TG++
Sbjct: 243 KNF-------PVFAPFSALAFGYVYSSFREVKAIQLPTLNRHRFGIAVDTFLETGKVPGL 295
Query: 299 ADLRYREDLL 308
+ RE ++
Sbjct: 296 VEANERERII 305
>gi|218196816|gb|EEC79243.1| hypothetical protein OsI_19997 [Oryza sativa Indica Group]
Length = 401
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 86/250 (34%), Positives = 140/250 (56%), Gaps = 9/250 (3%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + A A
Sbjct: 8 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVAQSRATSGA 67
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + PQLFL
Sbjct: 68 VAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQLFLP 127
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
MA + N K +A V + +TR PIY ++AK N+ D+ AKGE++ + +++G G I
Sbjct: 128 MACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGLSI---- 183
Query: 242 TVCSSMQGKMIVG-PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
+ S ++ LS ++ S E+R+ +NTLN R + V F+K+G + S +
Sbjct: 184 -LISKRNPSLVTSFAFLSCGYLLSSYHEVRSVVLNTLNTARFTVAVDSFIKSGHVPSLKE 242
Query: 301 LRYREDLLFP 310
+E + P
Sbjct: 243 GNSQETIFNP 252
>gi|297604521|ref|NP_001055571.2| Os05g0419200 [Oryza sativa Japonica Group]
gi|53982666|gb|AAV25645.1| unknown protein [Oryza sativa Japonica Group]
gi|222631625|gb|EEE63757.1| hypothetical protein OsJ_18576 [Oryza sativa Japonica Group]
gi|255676375|dbj|BAF17485.2| Os05g0419200 [Oryza sativa Japonica Group]
Length = 415
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 86/250 (34%), Positives = 140/250 (56%), Gaps = 9/250 (3%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + A A
Sbjct: 22 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVAQSRATSGA 81
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + PQLFL
Sbjct: 82 VAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQLFLP 141
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
MA + N K +A V + +TR PIY ++AK N+ D+ AKGE++ + +++G G I
Sbjct: 142 MACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGLSI---- 197
Query: 242 TVCSSMQGKMIVG-PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
+ S ++ LS ++ S E+R+ +NTLN R + V F+K+G + S +
Sbjct: 198 -LISKRNPSLVTSFAFLSCGYLLSSYHEVRSVVLNTLNTARFTVAVDSFIKSGHVPSLKE 256
Query: 301 LRYREDLLFP 310
+E + P
Sbjct: 257 GNSQETIFNP 266
>gi|225462223|ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257731 [Vitis vinifera]
Length = 713
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 80/246 (32%), Positives = 137/246 (55%), Gaps = 2/246 (0%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQATVV 124
FL P G+P SV YL YT +R +Q S VL+TQ+LL+A GL + A V
Sbjct: 285 FLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAV 344
Query: 125 SWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
+W+LKDG+ ++ K++ S G D PK WR+ AD L + GLE+L+P P FL +
Sbjct: 345 NWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGA 404
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
+ + A + +TR Y+ FA + N +++ AKGEA + +G+ GI LA+ +
Sbjct: 405 VAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIG 464
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
SS +++ +H++ ++ ++ + TLNP R +++ ++++ +G++ S ++
Sbjct: 465 SSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVN-E 523
Query: 305 EDLLFP 310
E+ LFP
Sbjct: 524 EEPLFP 529
>gi|356540900|ref|XP_003538922.1| PREDICTED: uncharacterized protein LOC100786144 [Glycine max]
Length = 592
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 83/261 (31%), Positives = 142/261 (54%), Gaps = 2/261 (0%)
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RP 116
V+ + D F P G+P SV YL Y+ +RA+Q VL+TQSLL+A GL +
Sbjct: 173 VWLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKG 232
Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
A ++W+LKDG+ ++ K++ SN G D PK WR+ AD L + GLE+ +P CP
Sbjct: 233 AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMSTPACP 292
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
Q F+ + + ++ A + +TR ++ FA + N +++ AKGE +G+ G
Sbjct: 293 QFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIVLG 352
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
I L + + SS + +L+ IH+Y ++ ++ + TLNP R +++ ++++ +G+ +
Sbjct: 353 IGLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQ-A 411
Query: 297 SPADLRYREDLLFPGRLIEDA 317
P E+ LFP I +A
Sbjct: 412 PPVKEVNDEEPLFPAVPILNA 432
>gi|357133637|ref|XP_003568430.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1 [Brachypodium
distachyon]
Length = 521
Score = 150 bits (379), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 88/249 (35%), Positives = 139/249 (55%), Gaps = 7/249 (2%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
SF+ + P G+P SV Y+ Y +RAL+HF A+SV +T++LL + G+ + A A
Sbjct: 125 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQSRATSGA 184
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + P LFL
Sbjct: 185 VAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPHLFLP 244
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+A + N K +A V + +TR PIY ++AK N+ D+ AKGE++ + +++G G I L S
Sbjct: 245 LACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSI-LIS 303
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
S+ LS ++ S E+R+ +NTLN R + V F+KTG + S +
Sbjct: 304 KRNPSLVASFA---FLSCGYLLSSYREVRSVVLNTLNTARFTVAVDSFIKTGHVPSLKEG 360
Query: 302 RYREDLLFP 310
E + P
Sbjct: 361 NLEETIFNP 369
>gi|296082804|emb|CBI21809.3| unnamed protein product [Vitis vinifera]
Length = 537
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 80/246 (32%), Positives = 137/246 (55%), Gaps = 2/246 (0%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQATVV 124
FL P G+P SV YL YT +R +Q S VL+TQ+LL+A GL + A V
Sbjct: 83 FLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAV 142
Query: 125 SWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
+W+LKDG+ ++ K++ S G D PK WR+ AD L + GLE+L+P P FL +
Sbjct: 143 NWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGA 202
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
+ + A + +TR Y+ FA + N +++ AKGEA + +G+ GI LA+ +
Sbjct: 203 VAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIG 262
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
SS +++ +H++ ++ ++ + TLNP R +++ ++++ +G++ S ++
Sbjct: 263 SSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVN-E 321
Query: 305 EDLLFP 310
E+ LFP
Sbjct: 322 EEPLFP 327
>gi|357126248|ref|XP_003564800.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
distachyon]
Length = 506
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 84/240 (35%), Positives = 138/240 (57%), Gaps = 8/240 (3%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWIL 128
P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + TP A ++WIL
Sbjct: 123 PEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTPG-AIAINWIL 181
Query: 129 KDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
KDG VGK++ + G + D++ K+ R +D L ++G G+E+ + PQ FL MA + N
Sbjct: 182 KDGAGRVGKMLFARQGKKFDNDLKQLRFSSDLLLEIGAGIELATAAFPQFFLPMACVANV 241
Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
K +A V + +TR PIY ++A+ N+ D+ AKGE++ + +++G G I + S +
Sbjct: 242 VKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIYITKRNPSLVT 301
Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
LLS ++ S +E+R+ +NTLN R + V F+KTG + S + +E L
Sbjct: 302 SFA----LLSCGYLMSSYQEVRSVVLNTLNRARFTVAVDSFIKTGYVPSLKEGNSQETLF 357
>gi|222619643|gb|EEE55775.1| hypothetical protein OsJ_04341 [Oryza sativa Japonica Group]
Length = 508
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 84/242 (34%), Positives = 138/242 (57%), Gaps = 8/242 (3%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWIL 128
P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + TP A ++WIL
Sbjct: 128 PEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTPG-AIAINWIL 186
Query: 129 KDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
KDG VGK++ + G + D + K+ R +D L ++G G+E+ + PQ FL MA + N
Sbjct: 187 KDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELATAAFPQFFLPMACVANV 246
Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
K +A V + +TR PIY ++A+ N+ D+ AKGE++ + +++G G I ++ S +
Sbjct: 247 VKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIFISKRNPSLVT 306
Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
LS +++S E+R+ +NTLN R + V F+KTG I S + +E +
Sbjct: 307 SFA----FLSCGYLWSSYHEVRSVVLNTLNRARFTVAVDSFIKTGHIPSLKEGNSQETIF 362
Query: 309 FP 310
P
Sbjct: 363 NP 364
>gi|56784316|dbj|BAD82242.1| unknown protein [Oryza sativa Japonica Group]
gi|56785239|dbj|BAD82127.1| unknown protein [Oryza sativa Japonica Group]
Length = 508
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 84/242 (34%), Positives = 138/242 (57%), Gaps = 8/242 (3%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWIL 128
P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + TP A ++WIL
Sbjct: 128 PEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTPG-AIAINWIL 186
Query: 129 KDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
KDG VGK++ + G + D + K+ R +D L ++G G+E+ + PQ FL MA + N
Sbjct: 187 KDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELATAAFPQFFLPMACVANV 246
Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
K +A V + +TR PIY ++A+ N+ D+ AKGE++ + +++G G I ++ S +
Sbjct: 247 VKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIFISKRNPSLVT 306
Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
LS +++S E+R+ +NTLN R + V F+KTG I S + +E +
Sbjct: 307 SFA----FLSCGYLWSSYHEVRSVVLNTLNRARFTVAVDSFIKTGHIPSLKEGNSQETIF 362
Query: 309 FP 310
P
Sbjct: 363 NP 364
>gi|38344695|emb|CAE02373.2| OSJNBb0096E05.17 [Oryza sativa Japonica Group]
Length = 593
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 81/252 (32%), Positives = 140/252 (55%), Gaps = 1/252 (0%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
+ R + P GYP SV+ YL+Y+ +R +Q S VLSTQ+LL+A GL +
Sbjct: 185 WRRCATVAVQLLLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGLGKGA 244
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A V+W+LKDG+ ++ K++ S G D PK WR+ AD L + GLE+L+P+ P
Sbjct: 245 IPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTPVFPH 304
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
LF+ + + A + ATR Y+ FA + N +++ AKGEA + +G+ GI
Sbjct: 305 LFVPIGAAAGAGRSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFLGIMLGI 364
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
LA+ + SS+ +I ++ +H+Y ++ ++ + TLNP R +++ ++++ +G++ S
Sbjct: 365 ALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPS 424
Query: 298 PADLRYREDLLF 309
++ E L
Sbjct: 425 VKEVNDEEPLFL 436
>gi|356522111|ref|XP_003529693.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 488
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 91/277 (32%), Positives = 149/277 (53%), Gaps = 10/277 (3%)
Query: 41 FQPDGHLSVKVVDDS-RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAA 99
F+ + L V + + R H V+ + P G+P SV Y+ Y +RAL+HF A
Sbjct: 72 FKKNSFLPVSLNSNHPRDPLHEVLSFVTSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGA 131
Query: 100 LSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRI 156
+ V +TQ+LL + G+ R P A ++WILKDG VGK++ + G + D + K+ R
Sbjct: 132 MGVFTTQTLLSSVGVCRNRAAPG-AVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRF 190
Query: 157 LADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSD 216
D L +LG G+E+ + P LFL +A N K +A V + +TR PIY +FAK N+ D
Sbjct: 191 TGDLLMELGAGVELATAAVPHLFLPLACAANVLKNVAAVTSTSTRTPIYKAFAKGENIGD 250
Query: 217 LFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNT 276
+ AKGE + + +++G G I +A S + LLS ++ S E+++ ++T
Sbjct: 251 VTAKGECVGNIADLLGTGLSILIAKRNPS----LVTTFSLLSCGYILSSYREVKSVVLHT 306
Query: 277 LNPQRTAMIVADFVKTGRISSPADLRYREDLL-FPGR 312
LN R ++ V F+ TG++ + + E++ FP +
Sbjct: 307 LNCGRFSVAVEHFLMTGQVPTLQEGNMNENIFSFPWK 343
>gi|226502346|ref|NP_001144348.1| uncharacterized protein LOC100277253 [Zea mays]
gi|195640526|gb|ACG39731.1| hypothetical protein [Zea mays]
Length = 503
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 84/242 (34%), Positives = 138/242 (57%), Gaps = 8/242 (3%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWIL 128
P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + TP A ++WIL
Sbjct: 117 PEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTPG-AIAINWIL 175
Query: 129 KDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
KDG VGK++ + G + D + K+ R +D L ++G G+E+ + PQ FL MA + N
Sbjct: 176 KDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELTTAAFPQFFLPMACVANV 235
Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
K +A V + +TR PIY ++A+ N+ D+ AKGE++ + +++G G I ++ S +
Sbjct: 236 VKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIFISKRNPSLVT 295
Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
LS ++ S +E+R+ +NTLN R + V F+KTG I S + +E +
Sbjct: 296 SFA----FLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGHIPSLKEGNSQETIF 351
Query: 309 FP 310
P
Sbjct: 352 NP 353
>gi|414879331|tpg|DAA56462.1| TPA: hypothetical protein ZEAMMB73_966463 [Zea mays]
gi|414879332|tpg|DAA56463.1| TPA: hypothetical protein ZEAMMB73_966463 [Zea mays]
Length = 503
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 84/242 (34%), Positives = 138/242 (57%), Gaps = 8/242 (3%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWIL 128
P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + TP A ++WIL
Sbjct: 117 PEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTPG-AIAINWIL 175
Query: 129 KDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
KDG VGK++ + G + D + K+ R +D L ++G G+E+ + PQ FL MA + N
Sbjct: 176 KDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELTTAAFPQFFLPMACVANV 235
Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
K +A V + +TR PIY ++A+ N+ D+ AKGE++ + +++G G I ++ S +
Sbjct: 236 VKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIFISKRNPSLVT 295
Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
LS ++ S +E+R+ +NTLN R + V F+KTG I S + +E +
Sbjct: 296 SFA----FLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGHIPSLKEGNSQETIF 351
Query: 309 FP 310
P
Sbjct: 352 NP 353
>gi|357474239|ref|XP_003607404.1| hypothetical protein MTR_4g077650 [Medicago truncatula]
gi|355508459|gb|AES89601.1| hypothetical protein MTR_4g077650 [Medicago truncatula]
Length = 490
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 86/254 (33%), Positives = 141/254 (55%), Gaps = 10/254 (3%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR----PTPA 119
SF+ + P G+P +V Y+ Y +RAL+HF A+ V +TQ+LL + G+ A
Sbjct: 99 SFITSYVVPEGFPHTVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLSSVGVSRNRATPAA 158
Query: 120 QATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
A ++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + P LF
Sbjct: 159 AAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFTGDLLMELGAGVELATAAMPHLF 218
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
L +A N K + V + +TR PIY +FAK N+ D+ AKGE + + +++G G I +
Sbjct: 219 LPLACAANVLKNVGAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGFSIMI 278
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
A S + LLS ++ S +E+++ ++TLN R ++ V F+KTG++ +
Sbjct: 279 A----KSNPSLITTFSLLSCGYILSSYKEVKSVVLHTLNSARFSVAVDSFLKTGQVPTLR 334
Query: 300 DLRYREDLL-FPGR 312
+ ED+ FP +
Sbjct: 335 EGNLNEDIFSFPWK 348
>gi|356563749|ref|XP_003550122.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 497
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 87/257 (33%), Positives = 142/257 (55%), Gaps = 9/257 (3%)
Query: 60 HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RP 116
H V+ + P G+P SV Y+ Y +RAL+HF A+ V +TQ+LL + G+ R
Sbjct: 100 HEVLAFVTSYVVPEGFPDSVIPSYVPYMTWRALKHFFGGAMGVFTTQTLLSSVGVSRNRA 159
Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
P A ++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + P
Sbjct: 160 APG-AVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 218
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
LFL +A N K +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G
Sbjct: 219 HLFLPLACAANVLKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 278
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
I +A S + LLS ++ S E+++ ++TLN R ++ V F++TG++
Sbjct: 279 ILIAKRNPS----LVTTFSLLSCGYILSSYREVKSVVLHTLNCGRFSVAVESFLRTGQVP 334
Query: 297 SPADLRYREDLL-FPGR 312
+ + E++ FP +
Sbjct: 335 TLQEGNMNENIFSFPWK 351
>gi|449455818|ref|XP_004145647.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 611
Score = 147 bits (370), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 86/270 (31%), Positives = 143/270 (52%), Gaps = 6/270 (2%)
Query: 43 PDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSV 102
P LS V+ V+ R D F P G+P SV YL Y+ +R +Q S V
Sbjct: 188 PSSSLSFSFVN----VWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGV 243
Query: 103 LSTQSLLFAAGL-RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADAL 161
L+TQ+LL+A GL + A V+W+LKDG ++ K+ S G D PK WR+ AD L
Sbjct: 244 LATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLL 303
Query: 162 YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKG 221
+ G+E+L+P P F+ + + A + ATR Y+ FA + N +++ AKG
Sbjct: 304 ENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKG 363
Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
EA + +G+ GI LA+ + SS + ++++IH++ ++ ++ + TLNP R
Sbjct: 364 EAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYR 423
Query: 282 TAMIVADFVKTGRISSPADLRYREDLLFPG 311
+++ ++++ +G + S D+ E+ LFP
Sbjct: 424 ASLVFSEYLLSGEVPSIKDVN-NEEPLFPA 452
>gi|307105811|gb|EFN54059.1| hypothetical protein CHLNCDRAFT_8919, partial [Chlorella
variabilis]
Length = 325
Score = 147 bits (370), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 85/254 (33%), Positives = 140/254 (55%), Gaps = 12/254 (4%)
Query: 60 HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPT 117
R+ D + P +P SV Y Y + R++Q+F A+SV +T+SLL + G+ + +
Sbjct: 1 QRLGDLLADYLLPQDFPHSVAPQYSDYMRARSVQYFFGGAMSVFTTRSLLASLGVANKHS 60
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A ++W++KDG +G+ + + G +D E K++R++ D L + G LE+ + L P+
Sbjct: 61 SEAAAAINWVVKDGAGRLGRFLFARWGRELDCELKQFRLMGDVLMETGAALELSTVLMPR 120
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
+FL +A N AK +A V A +TR PIY +FAK+ NL+D+ AKGE+++ L +VVG GI
Sbjct: 121 MFLPLACTANLAKNLAAVTASSTRAPIYRTFAKQNNLADVTAKGESVANLADVVGTAFGI 180
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVI---EEMRAAPVNTLNPQRTAMIVADFVKTGR 294
LA + + V P + + V +I E+ + + LN R + F TGR
Sbjct: 181 ALA-------KANLPVLPTFAALSVGYLIASRREVDSVVLPYLNRARLSYTTRAFYSTGR 233
Query: 295 ISSPADLRYREDLL 308
+ + YRE L+
Sbjct: 234 VPETLEGNYREPLM 247
>gi|449484579|ref|XP_004156921.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 443
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 86/270 (31%), Positives = 143/270 (52%), Gaps = 6/270 (2%)
Query: 43 PDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSV 102
P LS V+ V+ R D F P G+P SV YL Y+ +R +Q S V
Sbjct: 158 PSSSLSFSFVN----VWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGV 213
Query: 103 LSTQSLLFAAGL-RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADAL 161
L+TQ+LL+A GL + A V+W+LKDG ++ K+ S G D PK WR+ AD L
Sbjct: 214 LATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLL 273
Query: 162 YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKG 221
+ G+E+L+P P F+ + + A + ATR Y+ FA + N +++ AKG
Sbjct: 274 ENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKG 333
Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
EA + +G+ GI LA+ + SS + ++++IH++ ++ ++ + TLNP R
Sbjct: 334 EAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYR 393
Query: 282 TAMIVADFVKTGRISSPADLRYREDLLFPG 311
+++ ++++ +G + S D+ E+ LFP
Sbjct: 394 ASLVFSEYLLSGEVPSIKDVN-NEEPLFPA 422
>gi|255569810|ref|XP_002525869.1| conserved hypothetical protein [Ricinus communis]
gi|223534874|gb|EEF36563.1| conserved hypothetical protein [Ricinus communis]
Length = 520
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 95/275 (34%), Positives = 156/275 (56%), Gaps = 10/275 (3%)
Query: 40 QFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSA 98
+F+ + SV + + PV + SF+ + P G+P SV Y+ Y +RAL+HF
Sbjct: 106 RFKKNSFRSVSLQNPQAPVEELM--SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGG 163
Query: 99 ALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRI 156
A+ V +T++LL + G+ + A A ++WILKDG VGK++ + G + D + K+ R
Sbjct: 164 AMGVFTTKTLLSSVGVSKSTAIPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRF 223
Query: 157 LADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSD 216
D L +LG G+E+ + P LFL +A N AK +A V + +TR PIY +FAK N+ D
Sbjct: 224 AGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGD 283
Query: 217 LFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNT 276
+ AKGE + + +++G G I ++ S + LLS +V+S +E+++ ++T
Sbjct: 284 VTAKGECVGNVADLLGTGLSIMISKRNPS----LVTTFALLSCGYVFSSYQEVKSVVLHT 339
Query: 277 LNPQRTAMIVADFVKTGRISSPADLRYREDLL-FP 310
LN R ++ V F+KTGR+ S + RE++ FP
Sbjct: 340 LNRARFSVAVESFLKTGRVPSLQEGNTRENIFSFP 374
>gi|452823170|gb|EME30183.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
Length = 519
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 147/286 (51%), Gaps = 4/286 (1%)
Query: 29 FETSDSVTRHYQFQP-DGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYT 87
+ET Y P +G L + DSR V+ +++D + F P +V++ Y +T
Sbjct: 128 YETCTHQRIDYFLNPSNGDLLIAPSIDSRTVWRKILDLLNSSFLPENCKKTVSKDYFWFT 187
Query: 88 QFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLIC-SNLG 144
F LQ+ + + V+STQ LL A GL + ++ ++W+LKDG+ +G ++ S +G
Sbjct: 188 TFNFLQNVVNCCIGVISTQQLLRAVGLSASQGIGKSASLNWVLKDGIGRLGAIVFGSFIG 247
Query: 145 ARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPI 204
R D++PKRWR+ D LY G G E++SPL P+ FL +A L N K + + T I
Sbjct: 248 NRFDADPKRWRLWGDILYAFGIGTEIISPLLPRYFLLVASLANMVKATSYMLRLPTTAAI 307
Query: 205 YSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYS 264
SFAK N D+ AK + L N+ G GI + + S + + ++
Sbjct: 308 RRSFAKRENFGDISAKANSQEVLSNLFGTFLGIGFSIILGHSWWQLSVAYFVYFIMFFLL 367
Query: 265 VIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
++ + TLN QR+ ++ + K+ ++S+ ++ YRE+++ P
Sbjct: 368 NYLGVKGLHLRTLNLQRSLLLGWSYWKSSQVSTVEEVNYRENVILP 413
>gi|301622642|ref|XP_002940632.1| PREDICTED: UPF0420 protein C16orf58 [Xenopus (Silurana) tropicalis]
Length = 473
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 90/257 (35%), Positives = 144/257 (56%), Gaps = 6/257 (2%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
Y + D F++ P GYP SV++ YL Y + +Q F+S+ L+TQSLL +G+ +
Sbjct: 62 YRSLRDFFVSLLLPHGYPDSVSKDYLAYQLWDTIQAFSSSVTGSLATQSLLRGSGVGDST 121
Query: 119 AQATVVS--WILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A + WIL+DG VG+++ + + G+R+D + KRWR+ AD L D+ +E+++P+
Sbjct: 122 ATVAAATITWILRDGTGMVGRILFAWMKGSRLDCDAKRWRLFADLLNDIAIFMEIMAPVF 181
Query: 176 PQLF-LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
P F L + G F K I VA ATR I AK N++D+ AK + TL N+ GL
Sbjct: 182 PSCFTLTVCAAGVF-KCIVGVAGGATRAAITVHQAKRDNMADVCAKDGSQETLVNLAGLL 240
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
+ L V SM ++ LL+++H+Y+ +R+ + TLN R ++++ ++K G+
Sbjct: 241 VSLFLVPLVSDSMMATYLLFFLLTILHLYANYRAVRSVIMETLNQSRLSIVLHHYLKEGK 300
Query: 295 ISSPADLRYREDLLFPG 311
I P RE LL PG
Sbjct: 301 ILDPEAANLREPLL-PG 316
>gi|255557849|ref|XP_002519954.1| conserved hypothetical protein [Ricinus communis]
gi|223541000|gb|EEF42558.1| conserved hypothetical protein [Ricinus communis]
Length = 541
Score = 144 bits (362), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 80/247 (32%), Positives = 137/247 (55%), Gaps = 2/247 (0%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQATVV 124
F+ P GYP SV YL Y+ +R +Q S VL+TQ+LL+A GL + A +
Sbjct: 132 FVRLMLPEGYPHSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 191
Query: 125 SWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
+W+LKDG+ ++ K++ S G D PK WR+ AD L + GLE+L+P P LF+ +
Sbjct: 192 NWVLKDGIGYLSKIVLSKYGRHFDVNPKGWRLFADLLENAAFGLEILTPAFPHLFVFIGA 251
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
+ A + ATR Y+ FA + N +++ AKGEA + +G+ GI LA+ +
Sbjct: 252 AAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLGIGLANCIG 311
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
SS+ + +++ IH++ ++ ++ + TLNP R +++ ++++ +G+ D+
Sbjct: 312 SSIPLALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPIKDVN-D 370
Query: 305 EDLLFPG 311
E+ LFP
Sbjct: 371 EEPLFPA 377
>gi|219129733|ref|XP_002185036.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403531|gb|EEC43483.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 627
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 78/229 (34%), Positives = 132/229 (57%), Gaps = 11/229 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ----ATVVS 125
F PSG V Y R+ ++R LQ F +A L+V TQSLL A ++ + +Q + ++
Sbjct: 195 FLPSG----VTNDYYRFVRWRILQRFVNANLNVFGTQSLLLALRIKSSASQLGALSAALN 250
Query: 126 WILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
W+LKD + + +++ S +G R DS+ KRWR + ++ G GLE+++ + P LFL A
Sbjct: 251 WVLKDALGKIVRMLWASRMGRRFDSDAKRWRFRSSFVFAAGNGLEIITYVFPSLFLLWAT 310
Query: 185 LGNFAKGIAVVAARATRLPIYSSF--AKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
L N K I+++ + +TR IY+SF N+ D+ AKGEA + +++G+ +G+ L+ T
Sbjct: 311 LANCCKQISMLTSSSTRTSIYNSFRDGSRENIGDITAKGEAQIAIVDLLGIASGVTLSRT 370
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
V +S++ + V L I + V ++RA +N +R +VADF +
Sbjct: 371 VGTSIRAVLAVYVTLQAIEIVCVYHQLRAVTYRVMNFERMISVVADFCQ 419
>gi|452819618|gb|EME26673.1| glutamate N-acetyltransferase isoform 2 [Galdieria sulphuraria]
gi|452819619|gb|EME26674.1| glutamate N-acetyltransferase isoform 1 [Galdieria sulphuraria]
Length = 540
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/239 (31%), Positives = 137/239 (57%), Gaps = 2/239 (0%)
Query: 54 DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
+ + ++ ++ S F P G+P SV + YL+Y+ +R +Q+ S VLSTQ+LL AAG
Sbjct: 128 NRQNLWEQITSSLSTFFLPEGFPESVGDSYLQYSIWRGIQNIISNISQVLSTQALLLAAG 187
Query: 114 LRPTPAQATVVSWIL--KDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVL 171
+ T ++ + KDG+ VG+L + LG + D++PKRWR+++D LYD+G LE+L
Sbjct: 188 VGRTHSETLAAATAWAWKDGIGQVGRLFTAVLGNQYDADPKRWRLVSDLLYDIGLSLEIL 247
Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
SP + FL MA LGN +K IA + R + +F + NL ++ AK +A + + ++
Sbjct: 248 SPAFSKYFLFMAALGNISKSIAFAIGISCRYSVLKTFVRRENLGEISAKNDAQNVVTGML 307
Query: 232 GLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
G G+ ++ + ++ + ++ V + I+ + + M+ A + T+N QR ++ ++
Sbjct: 308 GTLIGVWMSKCLPAAPKVRLGVFLFFTAIYSFFNYKSMQVAELTTINRQRGNILAHSYL 366
>gi|359486217|ref|XP_003633414.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
gi|297739449|emb|CBI29631.3| unnamed protein product [Vitis vinifera]
Length = 504
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 90/251 (35%), Positives = 145/251 (57%), Gaps = 10/251 (3%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQ 120
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +TQ+LL + G+ R P
Sbjct: 113 SFIRSYVVPEGFPDSVTPSYVPYMSWRALKHFFGGAMGVFTTQTLLNSVGVSRNRSAPG- 171
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
A ++WILKDG VGK++ + G + D + K+ R+ D L +LG G+E+ + P LFL
Sbjct: 172 AVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRLAGDLLMELGAGVELATAAVPHLFL 231
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+A N AK +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G I ++
Sbjct: 232 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 291
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
S + LLS +V S +E+++ ++TLN R ++ + F+KTGR+ S +
Sbjct: 292 KRNPS----LVTTFALLSCGYVLSSYQEVKSVVLHTLNRARFSVAIDSFLKTGRVPSLQE 347
Query: 301 LRYREDLL-FP 310
+E++ FP
Sbjct: 348 GNMKENIFSFP 358
>gi|168024902|ref|XP_001764974.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683783|gb|EDQ70190.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 91/254 (35%), Positives = 143/254 (56%), Gaps = 21/254 (8%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQAT 122
SF+ + P GYP SV Y Y Q+RA+Q+F A+SV +T+SL+ A G+ R T A +
Sbjct: 34 SFVKAYIVPEGYPDSVAPSYTPYMQWRAVQYFFGGAMSVFTTRSLMHALGVSRGTSASSA 93
Query: 123 V-VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
V V+W++KDG +GK++ + G + D + K+ R L LG G+E+ + P LFL
Sbjct: 94 VAVNWVIKDGAGRIGKMLFARHGKKFDVDLKQLRFKGAMLMQLGAGVELTTMAVPHLFLP 153
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI---- 237
+A N AK +A V + +TR PIY +FA+ N+ D+ AKGE IS + +++G G GI
Sbjct: 154 LACAANVAKNVAAVTSSSTRAPIYKAFARRENIGDVTAKGECISNIADLLGTGLGIFISK 213
Query: 238 ---QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
L +T C +LS ++ S E+++ + TLN R ++V F++TG+
Sbjct: 214 RSPSLVATFC-----------VLSCGYMISSFNEIKSVRLATLNRARFGVVVQSFLETGQ 262
Query: 295 ISSPADLRYREDLL 308
+ S D RE ++
Sbjct: 263 VPSIEDANRRESVI 276
>gi|302835848|ref|XP_002949485.1| hypothetical protein VOLCADRAFT_80645 [Volvox carteri f.
nagariensis]
gi|300265312|gb|EFJ49504.1| hypothetical protein VOLCADRAFT_80645 [Volvox carteri f.
nagariensis]
Length = 495
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 84/245 (34%), Positives = 134/245 (54%), Gaps = 6/245 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
P+G+P SV Y Y +R +Q+F A+SV +TQSLL A G+ R A ++W+
Sbjct: 113 LLPNGFPDSVGPQYAPYMAWRGVQYFFGGAISVFTTQSLLGALGVAGRFQGEAAAAINWV 172
Query: 128 LKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
+KDG +G+L+ + G +D E K++R+ D L + G LE+ + P FL +A N
Sbjct: 173 IKDGAGRLGRLLFARWGRELDCELKQFRLAGDLLMEAGAALELATVYAPPAFLPLACTAN 232
Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
+K +A VAA +TR PIY +FA + NL+D+ AKGE+++ L +++G AGI L+ M
Sbjct: 233 LSKNLAAVAASSTRAPIYRTFALQNNLADITAKGESVANLADILGTVAGIALSRMKLPRM 292
Query: 248 QGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+ LS ++ S +E+ + + +N R A ++ G I A+ + E L
Sbjct: 293 PTFCV----LSAGYLLSSRKEVDSVELPYMNRARLAYATQRYLSDGFIPGVAEANHNEPL 348
Query: 308 LFPGR 312
L GR
Sbjct: 349 LPWGR 353
>gi|412993492|emb|CCO14003.1| predicted protein [Bathycoccus prasinos]
Length = 584
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 83/263 (31%), Positives = 141/263 (53%), Gaps = 13/263 (4%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPT 117
R +D+ + FFP SV Y Y ++R Q S+ ++VL+TQ +L A GL R
Sbjct: 163 RQIDNIRDSFFPQKT--SVTSDYWDYAKYRFTQRIASSCMTVLATQQMLAAIGLGASRSI 220
Query: 118 PAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
PA A ++W+LKDG+ +GKL + +N G DS+ KR R + +Y +E ++P P
Sbjct: 221 PA-AAALNWVLKDGLGRLGKLSVATNFGRSFDSDVKRLRFSSSLVYTGAVFIETITPFFP 279
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
+ FL +A + N K I + R PI +FA E NL ++ AK A L + +GL G
Sbjct: 280 KHFLAIATVANVGKSIGITTGNVVRPPIQKTFALEENLGEIAAKTSAQQVLADNIGLALG 339
Query: 237 IQL----ASTVCSSMQGKMIVG--PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+ + V ++Q + + P+L+ I + S+ +++++ + T+N +R +I FV
Sbjct: 340 VSVMKFQTKFVSLALQRAVPLALFPILASIDLVSIHKQLKSVQLRTVNKERAEIIAEAFV 399
Query: 291 KTGRISSPADLRYREDLLFPGRL 313
+ ++++ + +E LLFP RL
Sbjct: 400 REKKLATQLQVAEKERLLFPARL 422
>gi|222628554|gb|EEE60686.1| hypothetical protein OsJ_14164 [Oryza sativa Japonica Group]
Length = 609
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 81/268 (30%), Positives = 140/268 (52%), Gaps = 17/268 (6%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
+ R + P GYP SV+ YL+Y+ +R +Q S VLSTQ+LL+A GL +
Sbjct: 185 WRRCATVAVQLLLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGLGKGA 244
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A V+W+LKDG+ ++ K++ S G D PK WR+ AD L + GLE+L+P+ P
Sbjct: 245 IPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTPVFPH 304
Query: 178 LFLEMAGLGNFAKGIAVVAAR----------------ATRLPIYSSFAKEGNLSDLFAKG 221
LF+ + + A + ATR Y+ FA + N +++ AKG
Sbjct: 305 LFVPIGAAAGAGRSAAALIQTIFTHFMLENANHMRKAATRSCFYAGFAVQRNFAEVIAKG 364
Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
EA + +G+ GI LA+ + SS+ +I ++ +H+Y ++ ++ + TLNP R
Sbjct: 365 EAQGMVSKFLGIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYR 424
Query: 282 TAMIVADFVKTGRISSPADLRYREDLLF 309
+++ ++++ +G++ S ++ E L
Sbjct: 425 ASLVFSEYLLSGQVPSVKEVNDEEPLFL 452
>gi|299115992|emb|CBN75993.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 479
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 87/249 (34%), Positives = 141/249 (56%), Gaps = 7/249 (2%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT--PAQATV 123
L PSG+P SV+ Y+ Y ++ SAA V+STQ+LL+A GL P AT+
Sbjct: 1 MLGLVLPSGFPASVHPPYMAYAGWQFTGMAASAAAGVMSTQALLYAMGLGAGALPLAATL 60
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+W++KDG+ +G + S+L R D+ PK WR++A D LE+LSPL P FL +
Sbjct: 61 -NWVIKDGLGQLGGVAFSSLVSTRFDANPKLWRVVAAVSLDASMVLELLSPLAPAYFLPI 119
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
A + N K ++ +AA A+R I++ + +GNL+D+ AK A + L + G GAG+ L+++
Sbjct: 120 ASVANIGKNVSFLAASASRAAIHNVLSLKGNLADVTAKSGAQTILACMAGTGAGVTLSAS 179
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV---ADFVKTGRISSPA 299
+ S + LSV+H+ + MR + T + QR ++V AD ++ + SP
Sbjct: 180 LGSEFHAIVPACLCLSVVHLGANHMSMRRLGIPTFDHQRLELVVNGMADGDRSRGMPSPQ 239
Query: 300 DLRYREDLL 308
D+ +ED++
Sbjct: 240 DIAAQEDVV 248
>gi|168019221|ref|XP_001762143.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686547|gb|EDQ72935.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 390
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 87/254 (34%), Positives = 139/254 (54%), Gaps = 21/254 (8%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR--PTPAQA 121
SF+ + P YP SV Y Y Q+RA+Q+ A+SV +T+SL+ A G+ + + A
Sbjct: 18 SFVKSYIVPENYPDSVAPSYTPYMQWRAVQYLFGGAMSVFTTRSLMHALGVSRGASASSA 77
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
V+W++KDG +GK+I + G + D + K+ R L LG G+E+ + P LFL
Sbjct: 78 VAVNWVIKDGAGRIGKMIFARHGKKFDVDLKQLRFKGAMLMQLGAGVELATMAVPHLFLP 137
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI---- 237
+A N AK +A V + +TR PIY +FA+ N+ D+ AKGE IS + +++G G GI
Sbjct: 138 LACAANVAKNVAAVTSSSTRAPIYKAFARRENIGDVTAKGECISNIADLMGTGLGIFISK 197
Query: 238 ---QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
L +T C +LS ++ S E+++ + TLN R ++V F++TG+
Sbjct: 198 KSPSLVATFC-----------VLSCGYMISSFNEIKSVRLATLNRARFGVVVQSFLETGK 246
Query: 295 ISSPADLRYREDLL 308
+ S D RE ++
Sbjct: 247 VPSVEDANRRESVI 260
>gi|168005499|ref|XP_001755448.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693576|gb|EDQ79928.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 503
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 76/243 (31%), Positives = 137/243 (56%), Gaps = 6/243 (2%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQATVVSWILK 129
P GYP SV + Y YT +R Q S VL+TQ+LL+A GL + A V+W+L+
Sbjct: 98 LPDGYPQSVTDDYFEYTLWRMGQVIASQISGVLTTQALLYAVGLGKGAIPTAAAVNWVLR 157
Query: 130 DGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFA 189
DG+ ++ K++ S G D PK WR+++D + ++ GLE+L+P P LF+ +A
Sbjct: 158 DGIGYLSKIVLSKYGRHFDVHPKGWRLVSDLIENMSYGLELLTPAFPHLFVYLAAAAGAG 217
Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
+ + + AT+ + A N +++ AKGEA + +G+ GI +++ V S+ G
Sbjct: 218 RSASGLIQAATKSCFNAGMAANRNFAEIIAKGEAQGMVSKSIGIALGIAVSAYVGST--G 275
Query: 250 KMIVGPLLSV--IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
++VG V +H++ ++ +A + TLNP R ++++A++ ++G + S ++ E+
Sbjct: 276 PLLVGTFFGVTALHIFCNLKSYQAVQLRTLNPYRASLVLAEYFRSGNVVSVREVN-GEEP 334
Query: 308 LFP 310
+FP
Sbjct: 335 IFP 337
>gi|226505832|ref|NP_001140509.1| uncharacterized protein LOC100272571 [Zea mays]
gi|194699768|gb|ACF83968.1| unknown [Zea mays]
Length = 491
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 80/248 (32%), Positives = 134/248 (54%), Gaps = 18/248 (7%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV 123
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ +
Sbjct: 110 SFIRSYVVPEGFPDSVTPSYVPYMNWRALKHFFGGAMGVFTTRTLLSSVGVSQS------ 163
Query: 124 VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
KDG VGK++ + G + D++ K+ R +D L ++G G+E+ + PQ FL MA
Sbjct: 164 -----KDGAGRVGKMLFARQGKKFDNDLKQLRFSSDLLLEIGAGIELTTAAFPQFFLPMA 218
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
+ N K +A V + +TR PIY ++A+ N+ D+ AKGE++ + +++G G I
Sbjct: 219 CVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSI-----F 273
Query: 244 CSSMQGKMIVG-PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
S ++ LS ++ S +E+R+ +NTLN R + V F+KTG I S +
Sbjct: 274 ISKRNPSLVTSFAFLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGHIPSLKEGN 333
Query: 303 YREDLLFP 310
+E + P
Sbjct: 334 SQETIFNP 341
>gi|449016577|dbj|BAM79979.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 719
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/268 (31%), Positives = 141/268 (52%), Gaps = 13/268 (4%)
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL- 114
R + R+ + + + P GYP +V + YL +T +R LQ+ ++ + V ST++LL GL
Sbjct: 328 RSLVERLRNWLIQVYLPHGYPHTVTQDYLAFTLWRLLQNTAASVMGVFSTEALLLGLGLG 387
Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSP 173
+ A V WILKDG H GK+ + + G + D +P+ WRI++D L DL LE+++P
Sbjct: 388 KNVSGTAQAVQWILKDGFGHAGKIAYAAVAGKQFDMDPRSWRIMSDLLEDLAGVLEIITP 447
Query: 174 LCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
+ FL +A + + IA + ATR I+ S A N DL KGE+ ++GL
Sbjct: 448 IFSGQFLVLASIATTMRAIAAMTGTATRHAIFKSMALAENQGDLATKGESQGVTTKLIGL 507
Query: 234 GAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV--- 290
G GI L+ + +I + +VIH+ + + M+ +TLN QR A+++ F+
Sbjct: 508 GLGITLSKRIGQDYTNLLIAYGIAAVIHLAANFKAMQCVEFSTLNRQRMALLLDAFMSAR 567
Query: 291 -KTGRISS-------PADLRYREDLLFP 310
+T R +S P ++ +E + P
Sbjct: 568 AQTSRQASRQLTLPTPREICAKERFMIP 595
>gi|356495287|ref|XP_003516510.1| PREDICTED: uncharacterized protein LOC100778944 [Glycine max]
Length = 649
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 83/261 (31%), Positives = 143/261 (54%), Gaps = 2/261 (0%)
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RP 116
V+ + D F P G+P SV YL Y+ +RA+Q VL+TQSLL+A GL +
Sbjct: 174 VWLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKG 233
Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
A ++W+LKDG+ ++ K++ SN G D +PK WR+ AD L + GLE+ +P P
Sbjct: 234 AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVDPKGWRLFADLLENAAFGLEMCTPAFP 293
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
Q F+ + + ++ A + +TR ++ FA + N +++ AKGE +G+G G
Sbjct: 294 QFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIGLG 353
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
I L + + SS + +L+ IH+Y ++ ++ + TLNP R +++ ++++ +G+ +
Sbjct: 354 IGLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQ-A 412
Query: 297 SPADLRYREDLLFPGRLIEDA 317
P E+ LFP I +A
Sbjct: 413 PPVKEVNDEEPLFPAVPILNA 433
>gi|42565615|ref|NP_190175.2| uncharacterized protein [Arabidopsis thaliana]
gi|30793915|gb|AAP40410.1| unknown protein [Arabidopsis thaliana]
gi|30794095|gb|AAP40490.1| unknown protein [Arabidopsis thaliana]
gi|110739240|dbj|BAF01534.1| hypothetical protein [Arabidopsis thaliana]
gi|332644566|gb|AEE78087.1| uncharacterized protein [Arabidopsis thaliana]
Length = 608
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/248 (31%), Positives = 138/248 (55%), Gaps = 3/248 (1%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQAT 122
+ L +F P G+P SV YL Y+ +R +Q S VL+TQSLL+A GL + A
Sbjct: 192 NLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAA 251
Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+LKDG+ ++ K++ S G D PK WR+ AD L + G+E+L+P+ PQ F+ +
Sbjct: 252 AINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMI 311
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ A + ATR + FA + N +++ AKGEA + VG+ GI +A+
Sbjct: 312 GAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSVGILLGIVVANC 371
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
+ +S + +++ IH+Y+ ++ + + TLNP R +++ ++++ +G+ ++
Sbjct: 372 IGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVN 431
Query: 303 YREDLLFP 310
E+ LFP
Sbjct: 432 -DEEPLFP 438
>gi|308807048|ref|XP_003080835.1| putative glutamate/ornithine acetyltransferase (ISS) [Ostreococcus
tauri]
gi|116059296|emb|CAL55003.1| putative glutamate/ornithine acetyltransferase (ISS) [Ostreococcus
tauri]
Length = 1023
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 93/285 (32%), Positives = 148/285 (51%), Gaps = 19/285 (6%)
Query: 60 HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RP 116
R V+ + P P SV Y Y ++R Q S+ +SV +TQ +L A GL R
Sbjct: 573 RRCVERASHALLPE--PTSVTSDYWEYARYRFAQRIASSCISVFATQQMLTAVGLGASRT 630
Query: 117 TPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
PA A V+W+LKDG+ +GKL + +N G DS+ KR+R + +YD + +E+++P
Sbjct: 631 LPA-AAAVNWVLKDGLGRLGKLSVAANFGRTFDSDVKRFRFTSSVVYDASSFIEIITPYF 689
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL-- 233
PQ FL +A + N K + + A R PI +F E NL+++ AK A L + +GL
Sbjct: 690 PQHFLPLATVANIGKSVGITTANVVRAPIQRTFILEENLAEVAAKTSAQQVLADNIGLAL 749
Query: 234 --GA--GIQLASTVCSSMQGKMIV---GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
GA G+ +TV ++ + + GP L+V+ + + +E++A + T+N +R +I
Sbjct: 750 AVGAAKGMSKVATVRPEIRRALPLFAFGP-LAVLDLICIYKELKAVQLRTINKERAEIIA 808
Query: 287 ADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKMDRHGIRAEC 331
+VK I + A + E LL P RL D N+ + I C
Sbjct: 809 EAYVKEREIPTRARVANAERLLIPARL--DKSNLPLKVTSIGEVC 851
>gi|297815746|ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
lyrata]
gi|297321594|gb|EFH52015.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
lyrata]
Length = 613
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 77/248 (31%), Positives = 138/248 (55%), Gaps = 3/248 (1%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQAT 122
+ L +F P G+P SV YL Y+ +R +Q S VL+TQSLL+A GL + A
Sbjct: 198 NLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQVSGVLATQSLLYAVGLGKGAIPTAA 257
Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+LKDG+ ++ K++ S G D PK WR+ AD L + G+E+L+P+ PQ F+ +
Sbjct: 258 AINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMI 317
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ A + ATR + FA + N +++ AKGEA + +G+ GI +A+
Sbjct: 318 GAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSMGILLGIVVANC 377
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
+ +S + +++ IH+Y+ ++ + + TLNP R +++ ++++ +G+ ++
Sbjct: 378 IGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVN 437
Query: 303 YREDLLFP 310
E+ LFP
Sbjct: 438 -DEEPLFP 444
>gi|357133639|ref|XP_003568431.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2 [Brachypodium
distachyon]
Length = 508
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 82/247 (33%), Positives = 132/247 (53%), Gaps = 16/247 (6%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV 123
SF+ + P G+P SV Y+ Y +RAL+HF A+SV +T++LL + G+ +
Sbjct: 125 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQS------ 178
Query: 124 VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
+DG VGK++ + G + D + K+ R D L +LG G+E+ + P LFL +A
Sbjct: 179 -----RDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPHLFLPLA 233
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
+ N K +A V + +TR PIY ++AK N+ D+ AKGE++ + +++G G I L S
Sbjct: 234 CMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSI-LISKR 292
Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRY 303
S+ LS ++ S E+R+ +NTLN R + V F+KTG + S +
Sbjct: 293 NPSLVASFA---FLSCGYLLSSYREVRSVVLNTLNTARFTVAVDSFIKTGHVPSLKEGNL 349
Query: 304 REDLLFP 310
E + P
Sbjct: 350 EETIFNP 356
>gi|224129512|ref|XP_002328735.1| predicted protein [Populus trichocarpa]
gi|222839033|gb|EEE77384.1| predicted protein [Populus trichocarpa]
Length = 398
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 85/242 (35%), Positives = 139/242 (57%), Gaps = 7/242 (2%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILK 129
P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + A A ++WILK
Sbjct: 15 PEGFPDSVIPSYVPYMTWRALKHFFGGAMGVFTTKTLLNSVGVSKSQAIPGAVAINWILK 74
Query: 130 DGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFA 189
DG VGK++ + G + D + K+ R D L +LG +E+ + P LFL +A N A
Sbjct: 75 DGAGRVGKMLYARQGKKFDYDLKQLRFAGDLLMELGAAVELATAAVPHLFLPLACAANVA 134
Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
K +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G I ++ S
Sbjct: 135 KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNVADLLGTGLSIMISKRNPS---- 190
Query: 250 KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL- 308
+ LLS +V S +E+++ ++TLN R ++ V F+KTG++ S + +E++
Sbjct: 191 LVTTFALLSCGYVLSSYQEVKSVVLHTLNGARFSVAVESFLKTGQVPSLHEGNVKENIFN 250
Query: 309 FP 310
FP
Sbjct: 251 FP 252
>gi|327286048|ref|XP_003227743.1| PREDICTED: UPF0420 protein C16orf58-like [Anolis carolinensis]
Length = 420
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 88/277 (31%), Positives = 145/277 (52%), Gaps = 3/277 (1%)
Query: 36 TRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHF 95
T HY+ PDG LS + Y + D F F P GYP SV+ YL Y + +Q F
Sbjct: 19 TCHYKSLPDGRLSCVLDRLEDEGYRSIRDFFTAVFLPQGYPESVSPDYLAYQIWDTVQAF 78
Query: 96 TSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLICS-NLGARMDSEPK 152
S+ L+TQ++L G+ + A V+WILKDG +G+++ + + G+++D + K
Sbjct: 79 ASSITGTLATQAVLKGVGVGDETSTVTAATVTWILKDGTGMLGRILFAWSKGSKLDCDAK 138
Query: 153 RWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG 212
+WR+ AD L D+ +E+L+P P F + + F K I VA ATR + A+
Sbjct: 139 QWRLFADVLNDVAILMEILAPAFPACFTLIVCISGFFKCIVGVAGGATRAALTMHQARRD 198
Query: 213 NLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAA 272
N++D+ AK + TL N+ GL + L V +++ + L +++H+Y+ +RA
Sbjct: 199 NMADVSAKDGSQETLVNLAGLLFSLLLIPLVADNLRLTYSLYGLFTILHLYANYRAVRAV 258
Query: 273 PVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF 309
+ TLN R +++ F+K+G + P +E LL
Sbjct: 259 CMETLNRARLCLVLQSFLKSGEVPGPDVTNPQEPLLL 295
>gi|432867528|ref|XP_004071227.1| PREDICTED: UPF0420 protein C16orf58 homolog [Oryzias latipes]
Length = 423
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 79/247 (31%), Positives = 131/247 (53%), Gaps = 3/247 (1%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPA 119
+V +F + F P GYP SV+E YL+Y + LQ F S L TQ+ L G+ +
Sbjct: 41 LVGAFKSVFLPQGYPESVSEDYLQYQFWDTLQAFASTLSWTLVTQASLKGVGVGNQDATV 100
Query: 120 QATVVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
A V+WILKDG G+++ + G+R+DSE K+WR+ AD L D+ +E+L+P P
Sbjct: 101 AAATVTWILKDGTGMFGRILFAWQKGSRLDSEAKKWRLFADVLNDIAMFIEILAPFFPSF 160
Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
F + K I VA ATR + A+ N++D+ AK + TL N+ GL +
Sbjct: 161 FTLIVCTAGVFKAIVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLVVSLI 220
Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
L + + + + L +++H+++ + +R+ + T N R ++++ ++K RI SP
Sbjct: 221 LIPLITDNSSLSLGLFFLFTILHLFANYKAVRSVVMETFNQARLSIVLQQYLKDKRILSP 280
Query: 299 ADLRYRE 305
+ RE
Sbjct: 281 FEANRRE 287
>gi|340372549|ref|XP_003384806.1| PREDICTED: UPF0420 protein C16orf58-like [Amphimedon queenslandica]
Length = 432
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 141/279 (50%), Gaps = 8/279 (2%)
Query: 34 SVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQ 93
S R Y D +V+V +R R+ D+ F P GYP SV+E YL+Y + LQ
Sbjct: 14 SSARSYMMIGD---TVRVTSKARNT--RLADTLKEVFLPQGYPNSVSEDYLQYQLWDTLQ 68
Query: 94 HFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSE 150
F S L+T ++L G+ + A A ++WILKDG+ G+++ + L G +D
Sbjct: 69 AFCSYITGTLATHAMLKGVGVGDSSASPVAATLTWILKDGLGMTGRILFAWLRGYDLDCN 128
Query: 151 PKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAK 210
K+WR++AD L D+ ++++SP P FL +A L + + + VA ATR + A+
Sbjct: 129 AKKWRLIADILNDIAICMQLVSPFFPSCFLLIACLASITQSVVGVAGGATRAALVQHQAR 188
Query: 211 EGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMR 270
N++D+ AK + TL N+ GL G+ + + + +++H+YS + +
Sbjct: 189 RDNMADVSAKDGSQETLVNLCGLLIGLIITPLIAGQTVFVWSLFFSFTLLHLYSNYKAVS 248
Query: 271 AAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF 309
+ TLN R +++ + G IS P + E LLF
Sbjct: 249 VVSMETLNCNRLHLLMRNLFLNGTISEPNIVNREEPLLF 287
>gi|145349487|ref|XP_001419164.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579395|gb|ABO97457.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 269
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 91/270 (33%), Positives = 138/270 (51%), Gaps = 21/270 (7%)
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL- 114
R R V+ + P P SV Y Y +FR Q S+ +SV +TQ +L A GL
Sbjct: 6 RDAARRCVERAQHALLPE--PSSVTNDYWEYAKFRFAQRIASSCISVFATQQMLTAVGLG 63
Query: 115 --RPTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVL 171
R PA A V+W+LKDG+ +GKL + +N G DS+ KR+R + +YD + +E++
Sbjct: 64 ASRTLPA-AAAVNWVLKDGLGRLGKLSVAANFGRTFDSDVKRFRFTSSVVYDASSFIEII 122
Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
+P PQ FL +A + N K + + A R PI +F E NL+++ AK A L + +
Sbjct: 123 TPYFPQHFLPLATVANIGKSVGITTANVVRAPIQRTFILEENLAEVAAKTSAQQVLADNI 182
Query: 232 GLGAGIQLA------STVCSSMQGKMIV---GPL--LSVIHVYSVIEEMRAAPVNTLNPQ 280
GL + A +TV ++ + V GPL L +I +Y +E++A + T+N +
Sbjct: 183 GLALAVGAAKAVSKVATVRPEIRRALPVFAFGPLAALDLICIY---KELKAVQLRTINKE 239
Query: 281 RTAMIVADFVKTGRISSPADLRYREDLLFP 310
R +I FVK I S A + E L P
Sbjct: 240 RAEIIAEAFVKEREIPSRAHVADAERLFIP 269
>gi|348510102|ref|XP_003442585.1| PREDICTED: UPF0420 protein C16orf58 homolog [Oreochromis niloticus]
Length = 433
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 78/253 (30%), Positives = 135/253 (53%), Gaps = 3/253 (1%)
Query: 60 HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPT 117
+ V F + F P GYP SV++ YL+Y + +Q F+S+ L+TQ+ L F G +
Sbjct: 43 NSVFGVFKSVFLPQGYPESVSDDYLQYQFWDTMQAFSSSLSGTLATQASLRGFGVGNQEA 102
Query: 118 PAQATVVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
A V+W+ KDG +G+++ + G+++DSE K+WR+ AD L D+ +E+L+P P
Sbjct: 103 TVAAATVTWLFKDGTGMLGRILFAWRKGSKLDSEAKKWRLFADVLNDIAMFMEILAPYFP 162
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
F + K I VA ATR + A+ N++D+ AK + TL N+ GL
Sbjct: 163 ACFTFIVCTAGIFKSIVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVS 222
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
+ L V + + + L +V+H+++ + +R+ + T N R ++++ ++K RI
Sbjct: 223 LILIPLVTDNPVLTLCLFFLFTVLHLFANYKAVRSVVMETFNEARLSIVLQQYLKDKRIL 282
Query: 297 SPADLRYREDLLF 309
SP + RE + F
Sbjct: 283 SPPEANQREPVFF 295
>gi|298709448|emb|CBJ31354.1| putative glutamate/ornithine acetyltransferase (ISS) [Ectocarpus
siliculosus]
Length = 789
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 79/235 (33%), Positives = 130/235 (55%), Gaps = 17/235 (7%)
Query: 76 PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP-AQATVVSWILKDGMQH 134
P V Y +T +R Q F + ++V TQ+LL A G++ A +SW+LKD +
Sbjct: 187 PEDVTPDYYHFTFWRMFQRFVAGTINVFGTQALLLALGIKAKRLGAAAAMSWVLKDALGK 246
Query: 135 VGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIA 193
G+++ S +G R DS+ KRWR + LY G GLE+++ + P FL +A + N K ++
Sbjct: 247 FGRILWASKMGRRFDSDAKRWRFRSSLLYAAGNGLEIVTYVFPASFLVLAAMANSFKQMS 306
Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAK---------------GEAISTLFNVVGLGAGIQ 238
++ + ATR IY SFA+ N+ D+ AK GEA + +V+G+ +GI
Sbjct: 307 MLTSSATRNTIYRSFARGENIGDITAKACPPAAAMPNAKKHEGEAQIAVVDVLGMLSGIG 366
Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
L + V +S Q + +LS++ + ++ E+RA + LN +RT ++V DFV +G
Sbjct: 367 LCTLVGTSRQTIVAAYVILSLVDISAIYNEIRAVCFSVLNHERTHLLVRDFVASG 421
>gi|160333925|ref|NP_001103923.1| uncharacterized protein LOC555936 [Danio rerio]
gi|126631351|gb|AAI34047.1| Zgc:162613 protein [Danio rerio]
Length = 432
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 77/250 (30%), Positives = 136/250 (54%), Gaps = 3/250 (1%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
F + F P GYP SV+E YL+Y + +Q F+S+ L+TQ+ L G+ + A
Sbjct: 48 FKSVFLPQGYPESVSEDYLQYQLWDTVQAFSSSLSGTLATQASLRGVGVGNQEATVAAAT 107
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+L+DG +G+++ + L G+++DSE K+WR+ AD L D+ +E+ +P P F +
Sbjct: 108 ITWLLRDGTGMLGRILFAWLKGSKLDSEAKKWRLFADILNDVAMFIEIAAPHFPPFFTII 167
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ K I VA ATR + A+ N++D+ AK + TL N+ GL + L
Sbjct: 168 VCIAGIFKSIVGVAGGATRAALTVHQARRNNMADISAKDGSQETLVNLAGLLVSLALIPL 227
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
V + I+ L +V+H+++ +R+ + TLN R ++++ ++ G++ SPA+
Sbjct: 228 VTDNPLLTFILFYLFTVLHLFANYRAVRSVVMETLNEARLSIVLHQYLLHGQVLSPAEAN 287
Query: 303 YREDLLFPGR 312
RE + R
Sbjct: 288 QREPVFLSFR 297
>gi|47217805|emb|CAG07219.1| unnamed protein product [Tetraodon nigroviridis]
Length = 412
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 77/247 (31%), Positives = 137/247 (55%), Gaps = 3/247 (1%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQAT 122
+F + F P GYP SV+ YL+Y + LQ F+S+ L+T++ L G+ + A
Sbjct: 74 TFQSVFLPQGYPDSVSSDYLQYQFWDTLQAFSSSLSGTLATRASLRGVGVGNQQATVAAA 133
Query: 123 VVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
+W+LKDG G+++ + L G ++DSE K+WR+ AD L D+ +++L+P P F
Sbjct: 134 TTTWLLKDGTGMCGRILFAWLKGTKLDSEAKKWRLFADILNDIAMFMDILAPYFPAFFTL 193
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+ + K + VA ATR + A+ N++D+ AK + TL N+ GL + L S
Sbjct: 194 IMCVSGVFKSLVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVSLALVS 253
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
V + + + L +V+H+++ + +R+ + TLN R ++++ +++K GR+ SP +
Sbjct: 254 FVADNAALTLSLFFLFTVLHLFANYKAVRSVVMETLNEARLSIVLQEYLKDGRMLSPLEA 313
Query: 302 RYREDLL 308
RE +L
Sbjct: 314 NQREPVL 320
>gi|320170693|gb|EFW47592.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 710
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 82/256 (32%), Positives = 140/256 (54%), Gaps = 9/256 (3%)
Query: 41 FQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAAL 100
Q G + ++ +D+R HR++++FL P GYP SV Y+ Y++++A+Q T
Sbjct: 139 LQNAGQRAWRLREDTR---HRLIEAFL----PKGYPHSVTPNYMGYSRWQAVQSVTGTMT 191
Query: 101 SVLSTQSLLFAAGLRPTPAQ-ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILA 158
VLSTQ+LL+A GL A ++WI+KDG+ +G ++ S + ++ DS+PKR R +
Sbjct: 192 GVLSTQALLYAVGLGAGAIPLAGALNWIVKDGLGQLGGVVYSTFISSKFDSDPKRHRFWS 251
Query: 159 DALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLF 218
+A T LE+L+PL P +FL +A + N K I+ +AA +TR +++S NL D+
Sbjct: 252 NAALQASTLLEILTPLAPGMFLFLASVSNIGKNISWLAASSTRAQMHNSLTLRDNLGDVT 311
Query: 219 AKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLN 278
K + + +++G G GI +A V + ++ LS++++ S + TLN
Sbjct: 312 GKAGSQAIATSLIGTGLGIAIAPFVGTDPIAVLLAFLPLSIVNMVSNYRSNTIVHMRTLN 371
Query: 279 PQRTAMIVADFVKTGR 294
QR + F+ R
Sbjct: 372 VQRAERLFMHFLLHNR 387
>gi|255081626|ref|XP_002508035.1| predicted protein [Micromonas sp. RCC299]
gi|226523311|gb|ACO69293.1| predicted protein [Micromonas sp. RCC299]
Length = 665
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 81/260 (31%), Positives = 138/260 (53%), Gaps = 17/260 (6%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVV 124
N FP+ SV E Y Y +FR LQ S+ ++V +TQ +L A G+ R PA A +
Sbjct: 196 NSLFPARE--SVTEDYWEYAKFRFLQRMASSCITVFATQQMLAAIGMGASRRLPA-AAAI 252
Query: 125 SWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
+W+LKDG+ +GKL + +N G DS+ KR+R + +YD + +E+++P P+ FL +A
Sbjct: 253 NWVLKDGLGRLGKLGVAANFGREFDSDVKRFRFTSSVVYDASSLVEMITPFFPKKFLALA 312
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
L N K + + A R PI SF E NL+++ AK A + + +GL + LA+ +
Sbjct: 313 TLANIGKSVGITTANVVRAPIQRSFVLEENLAEVAAKTSAQQVVADNLGLAVAV-LATGL 371
Query: 244 CSSM---QGKMIVGPL-----LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ + ++I+ P L+ + +Y + E++A + T+N +R +I ++ G +
Sbjct: 372 TGKVVNDRARLII-PFVAFIPLATMDLYCIYRELKAVQLTTINKERAEIICDHWINVGTV 430
Query: 296 SSPADLRYREDLLFPGRLIE 315
+ E L P R+ E
Sbjct: 431 PGTKYVSDAERLFIPARMDE 450
>gi|157422883|gb|AAI53381.1| Zgc:162613 protein [Danio rerio]
Length = 432
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 77/250 (30%), Positives = 137/250 (54%), Gaps = 3/250 (1%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
F + F P GYP SV+E YL+Y + +Q F+S+ L+TQ+ L G+ + A
Sbjct: 48 FKSVFLPQGYPESVSEDYLQYQLWDTVQAFSSSLSGTLATQASLRGVGVGNQEATVAAAT 107
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+L+DG +G+++ + L G+++DSE K+WR+ AD L D+ +E+ +P P F +
Sbjct: 108 ITWLLRDGTGMLGRILFAWLKGSKLDSEAKKWRLFADILNDVAMFIEIAAPHFPPFFTLI 167
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ K I VA ATR + A+ N++D+ AK + TL N+ GL + L
Sbjct: 168 VCIAGIFKSIVGVAGGATRAALTVHQARRNNMADISAKDGSQETLVNLAGLLVSLALIPL 227
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
V + I+ L +V+H+++ + +R+ + TLN R ++++ ++ G++ SPA+
Sbjct: 228 VTDNPLLTFILFFLFTVLHLFANYKAVRSVVMETLNEARLSIVLHQYLLHGQVLSPAEAN 287
Query: 303 YREDLLFPGR 312
RE + R
Sbjct: 288 QREPVFLSFR 297
>gi|348671592|gb|EGZ11413.1| hypothetical protein PHYSODRAFT_338124 [Phytophthora sojae]
Length = 478
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 81/228 (35%), Positives = 126/228 (55%), Gaps = 5/228 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
F P+GYP SV+E YL + + LQ S VL+TQS+L + G+ + TP A + W
Sbjct: 83 FLPAGYPDSVSEDYLSFQFWDTLQAMCSYLRGVLATQSVLQSVGVGDDKATPL-AAALQW 141
Query: 127 ILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
+L+DG +G L + L G + D KRWR+ AD + D+G L++++P P L E+ +
Sbjct: 142 VLRDGSGMIGGLTFAYLVGPKFDVNVKRWRLFADVINDVGLTLDMVAPYFPTLVTEVLCV 201
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
+ K + VAA ATR + + FAK+ N++D AK + T + GL G+ A+ V S
Sbjct: 202 SSVCKTMCGVAAGATRSSLMTHFAKKDNMADCAAKEGSQETAVKLFGLMFGMYFANAVNS 261
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
S Q + LL+VIHVY+ + + T+N R ++V F+ +G
Sbjct: 262 SPQAVWVAFLLLTVIHVYANYNAVSCLCIPTVNCSRGLILVRRFLASG 309
>gi|410903033|ref|XP_003964998.1| PREDICTED: UPF0420 protein C16orf58 homolog [Takifugu rubripes]
Length = 442
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/249 (30%), Positives = 135/249 (54%), Gaps = 3/249 (1%)
Query: 60 HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPT 117
+ VV +F + F P GYP SV+ YLRY + +Q F S+ L+T++ L G+ +
Sbjct: 42 NSVVGAFKSVFLPQGYPESVSGDYLRYQFWDTVQAFFSSLSGTLATRASLRGVGVGNQEA 101
Query: 118 PAQATVVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
A ++W+LKDG G+++ + G ++DSE K+WR+ AD L D+ +E+L+P P
Sbjct: 102 TVAAATMTWLLKDGTGMCGRILFAWRKGTKLDSEAKKWRLFADILNDVAMFMEILAPYFP 161
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
F + + K + VA ATR + A+ N++D+ AK + TL N+ GL
Sbjct: 162 AFFTLVMCVSGIFKSLVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVS 221
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
+ L V + + + L +V+H+++ + +R+ + TLN R ++++ +++ G+I
Sbjct: 222 LVLIRFVSDNAALTLGLFFLFAVLHLFANYKAVRSVVMETLNEARLSVVLQQYLENGQIL 281
Query: 297 SPADLRYRE 305
SP + RE
Sbjct: 282 SPLEANERE 290
>gi|357484331|ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
gi|355513788|gb|AES95411.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
Length = 630
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 76/261 (29%), Positives = 142/261 (54%), Gaps = 2/261 (0%)
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RP 116
+Y++ + + P G+P SV YL Y+ +R +Q VL+TQ+LL+A GL +
Sbjct: 167 LYNKCREFVVRLMLPEGFPNSVTSDYLEYSLWRGVQGVACQVSGVLATQALLYAVGLGKG 226
Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
A ++W+LKDG+ ++ K++ S+ G D PK WR+ AD L + GLE+ +P P
Sbjct: 227 AIPTAAAINWVLKDGIGYLSKILLSDFGRHFDVNPKGWRLFADLLENAAFGLEMCTPAFP 286
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
LF+ + ++ A + +TR ++ FA + N +++ AKGE + +G+G G
Sbjct: 287 HLFVPIGAFAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMVSRFIGIGIG 346
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
I L + + SS + +++ +H+Y ++ ++ + TLNP R +++ ++++ +G+ +
Sbjct: 347 IGLGNCIGSSTPLVLASFCVVTWVHMYCNLKSYQSIQLRTLNPHRASLVFSEYLLSGQ-A 405
Query: 297 SPADLRYREDLLFPGRLIEDA 317
P E+ LFP I +A
Sbjct: 406 PPVKEVNAEEPLFPAVPILNA 426
>gi|168013684|ref|XP_001759421.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689351|gb|EDQ75723.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 712
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 81/240 (33%), Positives = 137/240 (57%), Gaps = 9/240 (3%)
Query: 76 PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGM 132
P V YL Y +++ + F S+ L TQ++L+A G+ R PA A ++W++KDG+
Sbjct: 283 PDQVWPHYLVYLKWKLVHRFLSSVLHFQCTQAMLWAVGVGAKRRLPA-AAALNWVMKDGV 341
Query: 133 QHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
+GKL+ +N G DS+ KR R L+ GLE+L+PL PQ FL +A L N AK
Sbjct: 342 GRLGKLLFTANFGRTFDSDLKRVRFWTSVLFSSSVGLEILTPLFPQHFLLLATLANIAKS 401
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK- 250
IA A AT I+ SFA NL+D+ AKG A + + + +GL A + L+ + + + +
Sbjct: 402 IAYAAYLATSSAIHRSFALGDNLADISAKGMAQTVVADNLGLAAAVCLSQFIHNFPKIEK 461
Query: 251 ---MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+ + P+L+ ++++ ++++A + TLN +R +I+ +VK RI S ++ E++
Sbjct: 462 FLPLAMYPVLASAELFAIYQQLQAVHLQTLNKERLEIIINIWVKDRRIPSFEEVSKIENV 521
>gi|168022981|ref|XP_001764017.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162684756|gb|EDQ71156.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 437
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 80/259 (30%), Positives = 145/259 (55%), Gaps = 11/259 (4%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
SR V++ ++ +FL P GYP SV YL + + LQ ++ S+LSTQ+LL G+
Sbjct: 42 SRAVWNNILSAFL----PEGYPNSVTPDYLPFQTWDTLQGLSTYIRSMLSTQALLGGIGV 97
Query: 115 RPTPAQATVVS----WILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLE 169
T ATVV W ++D +G ++ + + G+ +DS K+WR+ AD L D+G ++
Sbjct: 98 GET--TATVVGATFLWFMRDFTGMLGSILFTLHQGSNLDSSAKQWRLTADLLNDVGMLMD 155
Query: 170 VLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
++SPL P F+ + +G+ A+ + VA+ ATR + FA N +D+ AK + T
Sbjct: 156 LVSPLFPGAFVTILCIGSMARSVTGVASGATRAALTQHFALRKNAADVSAKEGSQETAAT 215
Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
+VG+ G+ LA ++ + LL+ H+Y+ +R+ + +LN +RT+++++ F
Sbjct: 216 MVGMLLGMVLARLTAENVVALWVSFLLLTAFHMYANYRAVRSLCLTSLNAERTSIVLSAF 275
Query: 290 VKTGRISSPADLRYREDLL 308
+ ++ +P ++ +E LL
Sbjct: 276 KEGRKVPTPREVADQEHLL 294
>gi|156379270|ref|XP_001631381.1| predicted protein [Nematostella vectensis]
gi|156218420|gb|EDO39318.1| predicted protein [Nematostella vectensis]
Length = 259
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 131/245 (53%), Gaps = 3/245 (1%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVS 125
+ F P GYP SV++ YL Y + +Q F S+ L+TQ++L + G A A ++
Sbjct: 3 SAFLPQGYPESVSKDYLTYQFWDTVQAFCSSITGTLATQAMLKGYGVGNESATALAATMT 62
Query: 126 WILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
WIL+ G VG ++ + + G+ +D K+WR+ AD + DL +E++SP F +A
Sbjct: 63 WILRSGTGMVGSILFAWMQGSNLDCNAKKWRLFADIMNDLAILVELISPYFKAYFTLLAC 122
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
L + AK I VA +TR + A+ N++D+ AK + TL N++ L AG+ + V
Sbjct: 123 LSSVAKSIVGVAGGSTRAALTQHQARRDNMADVAAKDGSQETLVNLMALLAGLVITPLVT 182
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
++ + + +H+Y+ + + + TLN R ++ +F++TG + SP ++
Sbjct: 183 GHTLLTWVLFIIFTFLHLYANYCAVSSVVMETLNTTRLQIVTNEFLRTGHVLSPDEVGKM 242
Query: 305 EDLLF 309
E ++F
Sbjct: 243 EPVIF 247
>gi|356497988|ref|XP_003517837.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 473
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 82/298 (27%), Positives = 149/298 (50%), Gaps = 3/298 (1%)
Query: 14 EPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPS 73
E S+S + I + + S F S ++ S + V L F P
Sbjct: 36 ETSSSVSNKTITLEEWNGSSPTKLSKTFTIKASSSSVLITRSGARFTHVWRRLLQAFVPE 95
Query: 74 GYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDG 131
G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+ A W L+D
Sbjct: 96 GFPSSVTADYVPFQIWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGATFQWFLRDL 155
Query: 132 MQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAK 190
+G ++ + G+ +DS K WR++AD + DLG ++++SPL P F+ + LG+ ++
Sbjct: 156 TGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLISPLFPSAFVFVVCLGSISR 215
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
VA+ ATR + FA + N +D+ AK + T+ ++G+ G+ +A
Sbjct: 216 SFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVARLTIGHPLAI 275
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
L+V H+Y+ +R +N+LNP+R+++++ F++TG++ SP + +E +L
Sbjct: 276 WFSFLSLTVFHMYANYRAVRCLALNSLNPERSSILLQHFMETGQVLSPKQVSSQEHVL 333
>gi|348675633|gb|EGZ15451.1| hypothetical protein PHYSODRAFT_546234 [Phytophthora sojae]
Length = 408
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 80/223 (35%), Positives = 118/223 (52%), Gaps = 8/223 (3%)
Query: 64 DSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR----PTPA 119
D + F P S+ + Y Y ++ + S A SVLS QSLL+A GL PT
Sbjct: 109 DVLWDLFLPKDARASLTKDYFPYAKWSFVGSVASCAASVLSMQSLLYAIGLGAGAIPT-- 166
Query: 120 QATVVSWILKDGM-QHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
A V+W+LKDG+ Q G L S + R D++PKRWR+ + D+ E+L+PL P
Sbjct: 167 -AAAVNWVLKDGLGQFGGVLFASIVNNRYDADPKRWRVASSVALDVSVLGEILTPLAPGS 225
Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
FL +A L N AK +A ++A ATR ++SFA NL+D+ AK + + ++ G G GI
Sbjct: 226 FLAIASLANVAKNVAWLSASATRAGFHNSFAIRENLADVTAKAGSQAIASSIFGTGLGIL 285
Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
++ SS +LS +H+ S + + + TLN QR
Sbjct: 286 ISQFTGSSTLNVAAAFAVLSTVHLISTYKSVDCVQLRTLNCQR 328
>gi|449017769|dbj|BAM81171.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 827
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 142/286 (49%), Gaps = 12/286 (4%)
Query: 56 RPVYHRVVDSFLNKFF--PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
R V+ +V S L + F + +P V++ YL YT +Q+ AA SV++TQS+L A G
Sbjct: 328 RNVWRALVGSLLPEGFGDKTRFPPVVSDDYLPYTVLHFMQNLVHAAASVIATQSMLIAIG 387
Query: 114 L--RPTPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEV 170
+ + T + V W+LKDG+ + +I S LG R D++PKRWR+ D LY LG G E+
Sbjct: 388 VAKQATIGVSAAVQWVLKDGLGRIAAIIFGSLLGNRYDADPKRWRMRGDMLYTLGVGTEI 447
Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
+ L PQ FL + + N K ++ + T I F+ N DL AK + L +
Sbjct: 448 ATRLVPQYFLLVGTMANAIKSVSYMMRLPTAAAIRRHFSLRENFGDLSAKANSQEVLSML 507
Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+G GI L+ + +S++ + + + +R + TLN QRT +++ +
Sbjct: 508 LGTFVGIGLSYMIDNSLKALFAFYIIYVQMFILFNYWSLRVLRMRTLNLQRTLIVMRRYW 567
Query: 291 KTGRIS--SPADLRYREDLLFPGRLIEDAGNVKMDRHGIR-AECFR 333
+G + S A E+ L P RL G + R G R E FR
Sbjct: 568 NSGGVETCSVAMANQAENFLLPSRL----GPARRVRFGCRLGEAFR 609
>gi|42562046|ref|NP_172832.3| uncharacterized protein [Arabidopsis thaliana]
gi|28393739|gb|AAO42280.1| unknown protein [Arabidopsis thaliana]
gi|28973405|gb|AAO64027.1| unknown protein [Arabidopsis thaliana]
gi|332190945|gb|AEE29066.1| uncharacterized protein [Arabidopsis thaliana]
Length = 440
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 76/254 (29%), Positives = 136/254 (53%), Gaps = 7/254 (2%)
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT 117
V+ RV L F P G+P SV Y+ + + LQ ++ +LSTQ+LL A G+
Sbjct: 49 VWRRV----LQAFVPEGFPGSVTPDYVGFQLWDTLQGLSTYTKMMLSTQALLSAIGVGEK 104
Query: 118 PAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPL 174
A W L+D +G ++ + G+ +DS K WR++AD + D+G +++LSPL
Sbjct: 105 SATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPL 164
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
P F+ + LG+ ++ VA+ ATR + FA + N +D+ AK + T+ ++G+
Sbjct: 165 FPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETMATMMGMS 224
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
G+ LA + + L+V H+Y+ +R +N+LN +R+++++ F++TG+
Sbjct: 225 LGMLLARFTSGNPMAIWLSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTHFIQTGQ 284
Query: 295 ISSPADLRYREDLL 308
+ SP + E +L
Sbjct: 285 VLSPEQVSSMEGVL 298
>gi|357505287|ref|XP_003622932.1| hypothetical protein MTR_7g058500 [Medicago truncatula]
gi|355497947|gb|AES79150.1| hypothetical protein MTR_7g058500 [Medicago truncatula]
Length = 590
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/258 (29%), Positives = 138/258 (53%), Gaps = 10/258 (3%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA- 119
+V S + FF P VNE Y+ + +++ L S+AL VL+TQ++ A G+ + +
Sbjct: 105 KVFGSVVRDFF---IPREVNENYVDFVKWKLLHRVFSSALQVLATQAMFTAIGVGYSSSL 161
Query: 120 -QATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A ++W+LKDG+ + K I ++L + D+ KR R L+ GLE+L+P P+
Sbjct: 162 PSAAALNWVLKDGLGRLSKCIYTASLASAFDTNLKRVRFTTSVLFVGSIGLELLTPTFPR 221
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
FL +A + N AK I++ ATR ++ SFAK NL ++ AK + + F+++GL
Sbjct: 222 FFLLLATIANIAKQISLACYLATRSAVHQSFAKADNLGEISAKAQIQTVCFDILGLMLAA 281
Query: 238 QLASTVCSSMQGK----MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
+ + + + + V PL + + ++ + + ++ + TL R +I+ +++ G
Sbjct: 282 LVNMWIGNHQRPQAGLHFFVYPLFAFMDLFGIYQGLKHVHLKTLTKDRLEIILKTWIEFG 341
Query: 294 RISSPADLRYREDLLFPG 311
+ SPAD+ +E + F G
Sbjct: 342 YVPSPADVSEKEVINFMG 359
>gi|301096265|ref|XP_002897230.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262107315|gb|EEY65367.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 474
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 76/228 (33%), Positives = 124/228 (54%), Gaps = 5/228 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
F P+GYP SV+E YL + + LQ S VL+TQS+L + G+ + TP A + W
Sbjct: 79 FLPAGYPDSVSEDYLGFQFWDTLQAMCSYLRGVLATQSVLQSVGVGDDKATPL-AAALQW 137
Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
+L+DG +G L + +G + D KRWR+ AD + D+G L++++P P L E+ +
Sbjct: 138 VLRDGSGMIGGLTFAYFVGPKFDVNVKRWRLFADVVNDVGLTLDMVAPYFPTLVTEVLCV 197
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
+ K + VAA ATR + + FAK N++D AK + T + GL G+ A+ V S
Sbjct: 198 SSVCKAMCGVAAGATRSSLMTHFAKRDNMADCAAKEGSQETAVKLFGLMFGMYFANAVNS 257
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
S + +L+++HVY+ + + T+N R ++V F+ +G
Sbjct: 258 SPHAVWMAFLVLTLVHVYANYNAVSCLCIPTVNCSRGLILVQRFLMSG 305
>gi|148905902|gb|ABR16113.1| unknown [Picea sitchensis]
Length = 446
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 78/243 (32%), Positives = 134/243 (55%), Gaps = 5/243 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P G+P SV YL + + +LQ ++ +LSTQ+LL A G+ T A W
Sbjct: 59 FLPEGFPNSVTSDYLPFQIWDSLQGLSTYIRCMLSTQALLSAIGVGETTATVVGATFQWF 118
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L+D +G ++ + G+ +DS K WR++AD + DLG ++++SPL P F+ + LG
Sbjct: 119 LRDFTGMLGGILFTLYQGSNLDSNAKMWRLIADLMNDLGMLMDLVSPLYPASFVIILCLG 178
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
+ A+ VA+ ATR + FA + N +D+ AK + T+ ++G+ G+ LA V +
Sbjct: 179 SIARSFTGVASGATRAALTQHFALQHNAADISAKEGSQETMATMIGMLLGMLLAH-VTAG 237
Query: 247 MQGKMIVG-PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
G M LL+ H+++ + +R +N+LN RT++++ F+ G++ +P D+ E
Sbjct: 238 HPGAMWSSFLLLTFFHMFANYQAVRCLCLNSLNEGRTSVLLKCFITNGQVLNPKDVSSME 297
Query: 306 DLL 308
LL
Sbjct: 298 YLL 300
>gi|440913323|gb|ELR62787.1| hypothetical protein M91_19503, partial [Bos grunniens mutus]
Length = 437
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 125/245 (51%), Gaps = 3/245 (1%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
L F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A
Sbjct: 42 LLAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAAT 101
Query: 124 VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+W++KD +G+++ + +G++MD K+WR+ AD L D+ LE+++P+ P F
Sbjct: 102 ATWLVKDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTIT 161
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ N AK + VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 162 VCISNLAKCLVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLAGLLVSLLMLPM 221
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
V S + L+ +H+Y+ +RA + TLN QR +++ F++ G + P
Sbjct: 222 VSDSPSLSLGCFFFLTALHIYANYRAVRALVIETLNEQRLWLVLRHFLQRGEVLGPTSAN 281
Query: 303 YREDL 307
E L
Sbjct: 282 QMEPL 286
>gi|170037702|ref|XP_001846695.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167880979|gb|EDS44362.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 417
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/239 (30%), Positives = 128/239 (53%), Gaps = 9/239 (3%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
F N F P+GYP SV+ YL Y ++ +Q F S L+T ++L G+ A +
Sbjct: 49 FQNLFLPAGYPDSVSSDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDVANPLSAT 108
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLS-PLCPQLFLE 181
++W+LKDGM H G+++ + G +D + K+WRI AD L D+ G+++ PL P++
Sbjct: 109 ITWVLKDGMGHFGRILFAWWKGTELDIDSKKWRIRADILNDIAMGVDLFVLPLYPKVSTY 168
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+ + K I VA ATR + A GNL+D+ +K A T N++ G+ L +
Sbjct: 169 ILCATSTMKAIVGVAGGATRSALTQHHALRGNLADVASKDSAQETCVNLIASFVGLGLLT 228
Query: 242 TVCSSMQGKMIVGPLL--SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
V +++ G L +++H+Y+ I+ ++A + T N R + + ++ K+G + SP
Sbjct: 229 HV---QNQRVLYGLFLFVTLLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGTMLSP 284
>gi|328872765|gb|EGG21132.1| DUF647 family protein [Dictyostelium fasciculatum]
Length = 534
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 137/277 (49%), Gaps = 6/277 (2%)
Query: 37 RHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFT 96
+H H S V+D ++ F F P GYP SV Y +Y ++ LQ+
Sbjct: 88 KHVNNNDTTHTSSLVIDTDNQ--QKLTKMFQKLFLPKGYPQSVTHDYSQYMKWMFLQNTL 145
Query: 97 SAALSVLSTQSLL--FAAGLRPTPAQATVVSWILKDGMQHVG-KLICSNLGARMDSEPKR 153
+ VLST +LL GL + A +SW+LKDG+ L S +D K
Sbjct: 146 GSTTYVLSTHALLTSVGVGLSTSLPFAAAISWVLKDGLGASALVLFASKYSTSLDYNLKS 205
Query: 154 WRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGN 213
++ D L+++G LE+L+P P FL +A + N +KG+A + ATR + SF+ + N
Sbjct: 206 FKFRGDILHNVGVMLEMLTPFVPGYFLPLASVSNLSKGLAGLIYGATRASLNRSFSLQEN 265
Query: 214 LSDLFAKGEAISTLFNVVGLGAGIQLASTVC-SSMQGKMIVGPLLSVIHVYSVIEEMRAA 272
L D+ AK ++ S + G+G G + + + + ++S++H+ ++ +++
Sbjct: 266 LGDITAKYQSQSMASYLTGMGLGTTIGLLMSPMTTTANLCAVFIISLLHMSCSVKSLKSI 325
Query: 273 PVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF 309
+ TLN QR+ +I+ +++ I SP + E+ +F
Sbjct: 326 CLKTLNSQRSKIIIDHWLREKIILSPEIVNKNENFMF 362
>gi|225423472|ref|XP_002274189.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
gi|297738087|emb|CBI27288.3| unnamed protein product [Vitis vinifera]
Length = 437
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/254 (29%), Positives = 135/254 (53%), Gaps = 7/254 (2%)
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT 117
V+ R+ L F P G+P SV Y + + +LQ ++ ++LSTQ+LL A G+
Sbjct: 47 VWRRI----LQAFVPEGFPSSVTPDYAPFQLWDSLQGLSTYIRTMLSTQALLSAIGVGEK 102
Query: 118 PAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPL 174
A W L+D +G ++ + G+ +DS K WR++AD + DLG ++++SPL
Sbjct: 103 SATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLVSPL 162
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
P F+ + LG+ ++ VA+ ATR + FA + N +D+ AK + T+ +VG+
Sbjct: 163 FPSAFVFIVCLGSLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMVGMA 222
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
G+ LA L++ H+Y+ +R + TLN +R+++++ F++TG+
Sbjct: 223 FGMLLARITMGHSVAIWFSFLSLTMFHMYANYRAVRCLILTTLNSERSSILIEHFMETGQ 282
Query: 295 ISSPADLRYREDLL 308
+ SP ++ E +L
Sbjct: 283 VLSPQEVSKMEHVL 296
>gi|300176775|emb|CBK25344.2| unnamed protein product [Blastocystis hominis]
Length = 409
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 74/214 (34%), Positives = 116/214 (54%), Gaps = 3/214 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA-QATVVSWIL 128
F P YP SV YL Y + A SAA VLSTQS+LFA G+ A ++W+L
Sbjct: 105 FLPRNYPSSVKGKYLNYCSWAAFGMGLSAAGGVLSTQSMLFAIGMGAGAVPMAAALNWVL 164
Query: 129 KDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
KDG+ +G + + L +R+D++ + WRIL+ L ++ T LEV++PL P FL +A L N
Sbjct: 165 KDGLGQLGGMAFTALVNSRLDADSRGWRILSAWLLEISTWLEVMTPLFPHSFLLLATLAN 224
Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC-SS 246
K I+ +A ATR I F N+ D+ AK + + V G GI +++ +
Sbjct: 225 VGKNISWLAGSATRAGIRYGFVNAHNMGDITAKEGSQTVAITVFGTFLGIVISNLIGHGH 284
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQ 280
M+ ++ +S + ++S+ + +R + TLN Q
Sbjct: 285 MEYVLMSSMCISAVSLFSIYQSLRCVSLPTLNYQ 318
>gi|356502614|ref|XP_003520113.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
[Glycine max]
Length = 486
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 72/253 (28%), Positives = 133/253 (52%), Gaps = 3/253 (1%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ V L F P G+P SV Y+ + + L ++ ++LSTQ+LL A G+
Sbjct: 94 FTHVWRRLLQAFVPEGFPSSVTADYVPFQIWDLLXGLSTYIRTMLSTQALLIAIGVEEKS 153
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + DLG ++++SPL
Sbjct: 154 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSYAKMWRLVADLMNDLGMLMDLISPLF 213
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
P F+ + LG+ ++ VA+ ATR + FA + N +D+ AK + T+ ++G+
Sbjct: 214 PSAFVFVVCLGSISRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGMAL 273
Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
G+ +A L+V H+Y+ ++ +N+LNP+R++++ F++TG++
Sbjct: 274 GMLVARLTIGHPLAIWFSFLSLTVFHMYANYRAVQXLTLNSLNPERSSILFQHFMETGQV 333
Query: 296 SSPADLRYREDLL 308
SP + +E +L
Sbjct: 334 LSPEQVSSQEHIL 346
>gi|383859063|ref|XP_003705017.1| PREDICTED: UPF0420 protein C16orf58-like [Megachile rotundata]
Length = 464
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 77/260 (29%), Positives = 128/260 (49%), Gaps = 8/260 (3%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
++P Y V F P GYP SV+ Y Y + +Q F S + L+T S++ G+
Sbjct: 38 TKPFYSGFVSILKEVFLPQGYPDSVHPDYTAYQIWDTVQAFASTIMGTLTTHSIMQGVGV 97
Query: 115 RPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVL 171
A A ++WILKDG +G+++ + G +D + K+WR+ AD L D+ GLE+L
Sbjct: 98 GEPAATPLAAAITWILKDGTGMIGRIVFAWWNGTDLDGQCKKWRLFADILNDVAMGLELL 157
Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
P L + + K I VA ATR + A + NL+D+ AK + T N++
Sbjct: 158 LPYYSSYSLAILCVSTAMKSIVGVAGGATRAAVTQHQALQNNLADVSAKDGSQETCVNLI 217
Query: 232 GLGAGIQLASTVCSSMQGKMIVGP--LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
GI + S G+ I+ L ++H+Y+ +++ +N+LN R A+IV +
Sbjct: 218 ASLVGILILSIF---HNGRYIMELYFFLIIVHLYANYSAVKSLCLNSLNEDRLALIVKSY 274
Query: 290 VKTGRISSPADLRYREDLLF 309
+ I P ++ +E +L
Sbjct: 275 ITNEVIPEPEEVNKKESVLL 294
>gi|388492306|gb|AFK34219.1| unknown [Lotus japonicus]
Length = 182
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 55/74 (74%), Positives = 65/74 (87%)
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
M+VGPLLS++H+YSV EEMRA P+NTLNP+RTAMIV DF+K G +SSPADLRYREDLLF
Sbjct: 1 MVVGPLLSILHIYSVSEEMRATPINTLNPRRTAMIVTDFLKAGVVSSPADLRYREDLLFR 60
Query: 311 GRLIEDAGNVKMDR 324
RL EDAGNV++ R
Sbjct: 61 VRLTEDAGNVRVGR 74
>gi|281201954|gb|EFA76161.1| DUF647 family protein [Polysphondylium pallidum PN500]
Length = 430
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 73/250 (29%), Positives = 128/250 (51%), Gaps = 10/250 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P+GYP SV Y+ Y + Q +S+ VL+T+++L G+ A + V+ WI
Sbjct: 51 FLPAGYPDSVTSDYMSYQVWDTAQALSSSLSGVLATRAMLRGYGVGNAAASVGSAVLQWI 110
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L+DG +G++ + G ++D K WR +AD L D+G LE+LSPL LF+ ++ G
Sbjct: 111 LRDGAGMIGRIYFAWKTGRQLDCNAKTWRFMADILNDIGMTLEMLSPLSNVLFMPLSVCG 170
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--- 243
+K I VA T+ + FAK N+ D+ AK + T +VG+ G L++++
Sbjct: 171 IVSKSICGVAGGCTKASLTMHFAKNDNMGDVSAKDGSQETAVGLVGMFLGAALSASIPDE 230
Query: 244 ----CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
++ ++ +L H+Y + + ++N R +I +++ G I +P
Sbjct: 231 SEQTTTATVAIWMLFFILIAFHLYCNYRAVTGVVLTSINRYRATLIYKHYLQYGEIPTPI 290
Query: 300 DLRYREDLLF 309
+ RE++LF
Sbjct: 291 QVSKRENILF 300
>gi|297844278|ref|XP_002890020.1| hypothetical protein ARALYDRAFT_888749 [Arabidopsis lyrata subsp.
lyrata]
gi|297335862|gb|EFH66279.1| hypothetical protein ARALYDRAFT_888749 [Arabidopsis lyrata subsp.
lyrata]
Length = 440
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 75/254 (29%), Positives = 137/254 (53%), Gaps = 7/254 (2%)
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT 117
V+ RV+ +F+ P G+P SV Y+ + + LQ ++ +LSTQ+LL A G+
Sbjct: 49 VWRRVLQAFV----PEGFPGSVTPDYVGFQFWDTLQGLSTYIKMMLSTQALLSAIGVGEK 104
Query: 118 PAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPL 174
A W L+D +G ++ + G+ +DS K WR++AD + D+G +++LSPL
Sbjct: 105 SATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPL 164
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
P F+ + LG+ ++ VA+ ATR + FA + N +D+ AK + T+ ++G+
Sbjct: 165 FPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETMATMMGMS 224
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
G+ LA + L+V H+Y+ +R +N+LN +R+++++ F++TG+
Sbjct: 225 LGMLLARFTSGNPMAIWFSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTHFMQTGQ 284
Query: 295 ISSPADLRYREDLL 308
+ SP + E +L
Sbjct: 285 VLSPEQVSSMEGVL 298
>gi|403368796|gb|EJY84239.1| hypothetical protein OXYTRI_18022 [Oxytricha trifallax]
Length = 504
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 73/241 (30%), Positives = 121/241 (50%), Gaps = 4/241 (1%)
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT 117
V +++ F P GYP SV EGYL+++ F L + A+S LS QSL A G T
Sbjct: 85 VKNKIYIGLFETFLPVGYPNSVAEGYLKFSIFNNLSALSITAMSFLSAQSLFVAIGSTMT 144
Query: 118 PAQ--ATVVSWILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
A A +W+LKDG+ +G L S G D + K+WR +A ++ +E+L+
Sbjct: 145 QANLAAAAYTWVLKDGIGQLGAILFASRYGRNFDEDIKKWRFMAMIALNISIYIEILTLR 204
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
P FL +A + N K I + + A+R I FAK N+ D+ K + T + G+G
Sbjct: 205 FPNHFLALASIANIGKNICFLLSAASRASINVQFAKRNNIGDISGKSVSQFTASTLCGVG 264
Query: 235 AGIQLASTV-CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
G+ L+ + SS+ V +L+++++ + + + LN QR +I ++ K+
Sbjct: 265 IGLMLSKLIDISSISQLYPVFMILTILNIATSYQASKVIDEVYLNNQRAFIIFNEYFKSN 324
Query: 294 R 294
+
Sbjct: 325 K 325
>gi|307205774|gb|EFN84004.1| UPF0420 protein C16orf58 [Harpegnathos saltator]
Length = 466
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 76/264 (28%), Positives = 128/264 (48%), Gaps = 8/264 (3%)
Query: 54 DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
+S+ +Y R V F P GYP SV YL Y + +Q F S + L+T S++ G
Sbjct: 35 ESQDIYFRFVSFVKEVFLPQGYPDSVRADYLAYQIWDTVQAFASTIMGTLTTHSVMQGIG 94
Query: 114 LRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV 170
+ + A A ++WILK G +G ++ + G ++D + K+WR+ AD L D+ GLE+
Sbjct: 95 VGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTQLDGQCKKWRLFADILNDIAMGLEL 154
Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
+ P + + K + VA ATR + A NL+D+ AK + T N+
Sbjct: 155 MVPYLSPYSVPILCTSTIMKSVVGVAGGATRAALTQHQAIRNNLADVSAKDSSQETCVNL 214
Query: 231 VGLGAGIQLASTVCSSMQGKMIVG--PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
+ GI + S G+ ++ L +H+Y+ ++A ++TLN R A+I+ +
Sbjct: 215 IASFVGIFVLSLF---HDGRYLIELYLFLVAVHLYANYSAVKALCLDTLNEDRLALIIKN 271
Query: 289 FVKTGRISSPADLRYREDLLFPGR 312
++ RIS P + E + G
Sbjct: 272 YMMNERISEPQKINKEESVFLLGN 295
>gi|7339490|emb|CAB82813.1| putative protein [Arabidopsis thaliana]
Length = 631
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 76/271 (28%), Positives = 137/271 (50%), Gaps = 26/271 (9%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQAT 122
+ L +F P G+P SV YL Y+ +R +Q S VL+TQSLL+A GL + A
Sbjct: 192 NLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAA 251
Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+LKDG+ ++ K++ S G D PK WR+ AD L + G+E+L+P+ PQ F+ +
Sbjct: 252 AINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMI 311
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK---------------------- 220
+ A + ATR + FA + N ++++
Sbjct: 312 GAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVYNNFYMALVLITYQQLFVFLNYSGS 371
Query: 221 -GEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNP 279
GEA + VG+ GI +A+ + +S + +++ IH+Y+ ++ + + TLNP
Sbjct: 372 LGEAQGMVSKSVGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNP 431
Query: 280 QRTAMIVADFVKTGRISSPADLRYREDLLFP 310
R +++ ++++ +G+ ++ E+ LFP
Sbjct: 432 YRASLVFSEYLISGQAPLIKEVN-DEEPLFP 461
>gi|328778772|ref|XP_394441.3| PREDICTED: UPF0420 protein C16orf58 homolog [Apis mellifera]
Length = 464
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 78/258 (30%), Positives = 128/258 (49%), Gaps = 4/258 (1%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
++ Y ++ F P GYP SV+ Y Y + +Q F S + L+T S++ G+
Sbjct: 38 TKSFYSGLISIIKEVFLPQGYPDSVHPDYTSYQIWDTVQAFASTIMGTLTTHSIMQGVGV 97
Query: 115 --RPTPAQATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVL 171
A ++WILKDG VG+++ + G +D + K+WR+ AD L DL GLE+L
Sbjct: 98 GEAAATPLAAAITWILKDGTGMVGRIMFAWWNGTDLDGQCKKWRLFADILNDLAMGLELL 157
Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
P L + + K I VA ATR + A + NL+D+ AK + T N++
Sbjct: 158 LPYFSSYSLGILCISTAMKSIVGVAGGATRAALTQHQALQNNLADVSAKDGSQETCVNLI 217
Query: 232 GLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
GI + S + + Q M + L +IH+Y+ ++A +N+LN R A+IV ++
Sbjct: 218 ASFVGILILS-IFHNGQYIMELYLFLVIIHLYANYSAVKALCLNSLNEDRLALIVKSYIS 276
Query: 292 TGRISSPADLRYREDLLF 309
I P ++ +E +L
Sbjct: 277 NEVIPKPEEVNKKESVLL 294
>gi|426254633|ref|XP_004020981.1| PREDICTED: UPF0420 protein C16orf58 homolog [Ovis aries]
Length = 462
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 125/245 (51%), Gaps = 3/245 (1%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
L F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A
Sbjct: 67 LLAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAAT 126
Query: 124 VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+W++KD +G+++ + +G++MD K+WR+ AD L D+ LE+++P+ P F
Sbjct: 127 ATWLVKDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTVT 186
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ N AK + VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 187 VCISNLAKCLVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLAGLLVSLLMLPL 246
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
V S + L+ +H+Y+ +RA + TLN QR +++ F++ G + P
Sbjct: 247 VSDSPSLSLGCFFFLTALHIYANYRAVRALVIETLNEQRLWLVLRHFLQWGEVLGPTSAN 306
Query: 303 YREDL 307
E L
Sbjct: 307 RMEPL 311
>gi|354505561|ref|XP_003514836.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cricetulus griseus]
gi|344256981|gb|EGW13085.1| UPF0420 protein C16orf58-like [Cricetulus griseus]
Length = 466
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 75/243 (30%), Positives = 127/243 (52%), Gaps = 7/243 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSTDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGDAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
LKD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F +
Sbjct: 135 LKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADVLNDIAMFLEIMAPMYPIFFTMTVSIS 194
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--C 244
N AK I VA ATR + A+ N++D+ AK + T+ N+ GL + + V C
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
S+ + LL+ +H+Y+ +RA + TLN R +++ F++ G + +PA
Sbjct: 255 PSLSLGCFI--LLTALHIYANYRAVRALVLETLNESRLQLLLKHFLQRGEVLAPASANQM 312
Query: 305 EDL 307
E L
Sbjct: 313 EPL 315
>gi|330802843|ref|XP_003289422.1| hypothetical protein DICPUDRAFT_153796 [Dictyostelium purpureum]
gi|325080505|gb|EGC34058.1| hypothetical protein DICPUDRAFT_153796 [Dictyostelium purpureum]
Length = 552
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 71/250 (28%), Positives = 131/250 (52%), Gaps = 7/250 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F PSGYP SV Y RY + ++Q S+ L+T+++L + G + W+
Sbjct: 60 FLPSGYPDSVTSDYFRYQFWDSIQALCSSITGTLATRAILKGYGVGNSSATVASATTQWV 119
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGL 185
++ +G++I + G +D K+WR +AD L D+G E++SPL + LFL ++ +
Sbjct: 120 IQSATGMIGRIIFAWANGTDLDCNSKKWRFMADILNDIGMTFEMISPLFNEALFLPLSCI 179
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC- 244
G FAK I VA T+ + FAK NL+D+ AK + T +++G+ + ++S +
Sbjct: 180 GLFAKSICGVAGGCTKASLTQHFAKNDNLADVSAKDGSQETAVSLLGMILSVFVSSVLND 239
Query: 245 -SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPADLR 302
+S+ + + + H+Y + + A +N++N R +I +++ G I SP ++
Sbjct: 240 DTSILVTWFIFFIFTGFHLYCNYKAVSAVQLNSINKYRAHLIYEYYIRNQGSIPSPKEIS 299
Query: 303 YREDLLFPGR 312
E++LF +
Sbjct: 300 NFENILFNSK 309
>gi|449452843|ref|XP_004144168.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
[Cucumis sativus]
Length = 437
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 73/256 (28%), Positives = 135/256 (52%), Gaps = 7/256 (2%)
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
R V+ R D+F+ P G+P SV Y+ + + +LQ ++ ++LSTQ+LL A G+
Sbjct: 45 RLVWRRFFDAFV----PEGFPSSVTPDYVPFQIWDSLQGLSTYIRTMLSTQALLSAIGVG 100
Query: 116 PTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLS 172
A W L+D +G ++ + G+ +DS K WR++AD + DLG ++++S
Sbjct: 101 EQSATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLVS 160
Query: 173 PLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG 232
PL P F+ + LG+ ++ VA+ ATR + FA + N +D+ AK + T+ ++G
Sbjct: 161 PLFPSAFMLIVCLGSLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMIG 220
Query: 233 LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
+ G+ LA + L++ H+Y+ + + + +LNPQR ++ + +++T
Sbjct: 221 MAIGMLLARITMGCPLAIWLSFLSLTMFHMYANYKAVCCLSLTSLNPQRCSIALQHYIQT 280
Query: 293 GRISSPADLRYREDLL 308
+ SP + E +L
Sbjct: 281 AEVLSPQQVSMMEHVL 296
>gi|431902902|gb|ELK09111.1| hypothetical protein PAL_GLEAN10000828 [Pteropus alecto]
Length = 468
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/245 (29%), Positives = 125/245 (51%), Gaps = 3/245 (1%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
L F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A
Sbjct: 73 LLTVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNANASVSAAT 132
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+W++KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F +
Sbjct: 133 ATWLVKDSAGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPICFTMI 192
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ N AK I VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 193 VCISNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPL 252
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
V + LL+ +H+Y+ +RA + TLN R +++ F+++G + P
Sbjct: 253 VSECPSFSLGCFFLLTALHIYANYRAVRALVIETLNEGRLWLVLKHFLQSGEVLDPTSAN 312
Query: 303 YREDL 307
E L
Sbjct: 313 KMEPL 317
>gi|147768659|emb|CAN60615.1| hypothetical protein VITISV_003254 [Vitis vinifera]
Length = 470
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 83/251 (33%), Positives = 130/251 (51%), Gaps = 44/251 (17%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQ 120
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +TQ+LL + G+ R P
Sbjct: 113 SFIRSYVVPEGFPDSVTPSYVPYMSWRALKHFFGGAMGVFTTQTLLNSVGVSRNRSAPG- 171
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
A ++WILKDG VGK++ + G + D + K+ R+ D L +LG G+E+ + P LFL
Sbjct: 172 AVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRLAGDLLMELGAGVELATAAVPHLFL 231
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+A N AK +A V + +TR PIY +FAK N+ D+ AKGE + + ++V
Sbjct: 232 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLV--------- 282
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
K +V ++TLN R ++ + F+KTGR+ S +
Sbjct: 283 ---------KSVV--------------------LHTLNRARYSVAIDCFLKTGRVPSLQE 313
Query: 301 LRYREDLL-FP 310
+E++ FP
Sbjct: 314 GNMKENIFSFP 324
>gi|194758785|ref|XP_001961639.1| GF15067 [Drosophila ananassae]
gi|190615336|gb|EDV30860.1| GF15067 [Drosophila ananassae]
Length = 417
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 79/257 (30%), Positives = 130/257 (50%), Gaps = 6/257 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F P GYP SV++ Y Y + Q F S L T ++L G A + V+WI
Sbjct: 53 FLPKGYPDSVSDDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAYSATVTWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
LK+G HVG++ + G+++D + K+WR+ AD L DL G+E+ + P P L ++
Sbjct: 113 LKEGSGHVGRIFFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQILCG 172
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K I VA ATR + A GNL+D+ +K + T N+V G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
I ++S +H+YS ++ +RA + T N R + + +F ++ R+ SP + E
Sbjct: 233 QAVLYTIFYVVVS-LHLYSNLKAVRAVCLRTFNESRYLIALEEFFRSSRMLSPQQVNAME 291
Query: 306 DLLFPGRLIEDAGNVKM 322
+ G+ + + N+K+
Sbjct: 292 RVTI-GQTVSVSLNIKL 307
>gi|344294431|ref|XP_003418921.1| PREDICTED: UPF0420 protein C16orf58 homolog [Loxodonta africana]
Length = 468
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 75/248 (30%), Positives = 124/248 (50%), Gaps = 3/248 (1%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
+ L F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A
Sbjct: 70 LSGLLAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGALATQAVLLGIGVGDAKASVS 129
Query: 121 ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
A +W++KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 130 AATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVLPICF 189
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 190 TITVCTSNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
V + L+ +H+Y+ +RA + TLN R +I+ F++ G + SP
Sbjct: 250 LPLVSDCPSVSLGCFFFLTALHIYANYRAVRALVMETLNEGRLWLILKHFLQKGEVLSPT 309
Query: 300 DLRYREDL 307
E L
Sbjct: 310 SANQMEPL 317
>gi|417401458|gb|JAA47615.1| Hypothetical protein [Desmodus rotundus]
Length = 468
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 73/248 (29%), Positives = 124/248 (50%), Gaps = 3/248 (1%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
+ L F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A
Sbjct: 70 LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129
Query: 121 ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
A +WI+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 130 AATATWIVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMCF 189
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
+ N AK + VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 190 TVTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
V + LL+ +HVY+ ++A + TLN R +++ F++ G + P
Sbjct: 250 LPLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPT 309
Query: 300 DLRYREDL 307
E L
Sbjct: 310 SANQMEPL 317
>gi|290999547|ref|XP_002682341.1| predicted protein [Naegleria gruberi]
gi|284095968|gb|EFC49597.1| predicted protein [Naegleria gruberi]
Length = 261
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 77/257 (29%), Positives = 130/257 (50%), Gaps = 16/257 (6%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT------- 122
F P YP SV+ Y+ Y +++ALQ+ + VL+ QSLL AAG+ A
Sbjct: 5 FLPKNYPLSVSSNYINYCKWQALQYTAGSFSGVLTMQSLLHAAGITSQLALGGNEYFLAF 64
Query: 123 ---VVSWILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC-PQ 177
++W++KDG+ +G L S + D++PK WR+ + LEV +P+ P
Sbjct: 65 LGGALAWVIKDGLGQLGGILFASKVNTNFDADPKFWRMGGEYALVASALLEVTTPITGPS 124
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
F+ A L N K ++ AA ATR + SFAK NL+D+ AK + + +++G GI
Sbjct: 125 WFIAQASLANIGKNVSCFAASATRAAMNQSFAKSDNLADVTAKATSQALACSLIGTTLGI 184
Query: 238 QLASTVCSSMQ---GKMI-VGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
++S + Q K+ + +LSVI ++S+ + + + LN QR + + ++K
Sbjct: 185 FVSSALSLGTQFVFAKVFPIFCVLSVIQLFSLYKAVSNVKLKVLNKQRFLIACSQYLKDR 244
Query: 294 RISSPADLRYREDLLFP 310
+ P+ + RE + P
Sbjct: 245 AVLDPSKVSEREKFVLP 261
>gi|66818028|ref|XP_642707.1| DUF647 family protein [Dictyostelium discoideum AX4]
gi|74861464|sp|Q86K80.1|U420_DICDI RecName: Full=UPF0420 protein
gi|60470803|gb|EAL68775.1| DUF647 family protein [Dictyostelium discoideum AX4]
Length = 527
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 73/247 (29%), Positives = 130/247 (52%), Gaps = 7/247 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P+GYP SV Y Y + ++Q S L+T+++L G+ + A + W+
Sbjct: 63 FLPNGYPDSVTTDYFGYQFWDSIQALCSTITGTLATRAILKGYGVGDSSATVASATTQWL 122
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPL-CPQLFLEMAGL 185
++DGM +G+++ + G +D K+WR AD L ++G E++SPL QLFL ++ +
Sbjct: 123 IRDGMGMIGRIVFAWRKGTDLDCNSKKWRYTADILNNIGMAFEMISPLFSSQLFLPLSCI 182
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV-- 243
G AK I VA T+ + FAK NL+D+ AK + T N+VG+ + ++S +
Sbjct: 183 GLIAKSICGVAGGCTKASLTQHFAKRDNLADVSAKDGSQETAVNLVGMLLSVIVSSFIND 242
Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPADLR 302
+S+ +V + +H++ + A + ++N R +I F+ G I SP+++
Sbjct: 243 NTSLIVTWLVFLFFTSLHLFCNYRAVSAVQLKSINRYRAYLIYDYFIHNQGSIPSPSEIS 302
Query: 303 YREDLLF 309
E++LF
Sbjct: 303 KLENILF 309
>gi|380030107|ref|XP_003698699.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
[Apis florea]
Length = 464
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 77/258 (29%), Positives = 129/258 (50%), Gaps = 4/258 (1%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
++ Y ++ F P G+P SV+ Y Y + +Q F S + L+T S++ G+
Sbjct: 38 TKSFYSGLISIIKEVFLPQGFPDSVHPDYTSYQVWDTVQAFASTIMGTLTTHSIMQGVGV 97
Query: 115 --RPTPAQATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVL 171
A ++WILKDG VG+++ + G +D + K+WR+ AD L DL GLE+L
Sbjct: 98 GEAAATPLAAAITWILKDGTGMVGRIMFAWWNGTDLDGQCKKWRLFADILNDLAMGLELL 157
Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
P L + + K I VA ATR + + A + NL+D+ AK + T N++
Sbjct: 158 LPYFSSYSLGILCISTAMKSIVGVAGGATRAALTNIKALQNNLADVSAKDGSQETCVNLI 217
Query: 232 GLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
GI + S + + Q M + L +IH+Y+ ++A +N+LN R A+IV ++
Sbjct: 218 ASFVGILILS-IFHNGQYIMELYLFLVIIHLYANYSAVKALCLNSLNEDRLALIVKSYIS 276
Query: 292 TGRISSPADLRYREDLLF 309
I P ++ +E +L
Sbjct: 277 NEVIPKPEEVNKKESVLL 294
>gi|157113953|ref|XP_001652155.1| hypothetical protein AaeL_AAEL006646 [Aedes aegypti]
gi|108877519|gb|EAT41744.1| AAEL006646-PA [Aedes aegypti]
Length = 417
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 75/239 (31%), Positives = 124/239 (51%), Gaps = 9/239 (3%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
F N F P+GYP SV+ YL Y ++ +Q F S L+T ++L G+ A +
Sbjct: 49 FQNLFLPAGYPDSVSGDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDVANPLSAT 108
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLS-PLCPQLFLE 181
++W+LKDGM H G++I + G +D + K+WRI AD L DL +++ P P+
Sbjct: 109 ITWVLKDGMGHFGRIIFAWWKGTELDIDSKKWRIRADVLNDLAMSIDLFVLPYYPKASTF 168
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+ K I VA ATR + A GNL+D+ +K A T N++ G+ L +
Sbjct: 169 ILCATTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLFLLT 228
Query: 242 TVCSSMQGKMIVGPLL--SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
V +++ G L +++H+Y+ I+ ++A + T N R + + ++ KTG + SP
Sbjct: 229 HV---QDQRVLYGLFLFVTLMHIYANIKAVKAVCLRTFNEARYLIALEEYFKTGTMLSP 284
>gi|340711243|ref|XP_003394188.1| PREDICTED: UPF0420 protein C16orf58 homolog [Bombus terrestris]
Length = 464
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 75/243 (30%), Positives = 123/243 (50%), Gaps = 4/243 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P GYP SV+ Y Y + +Q F S + L+T S++ G+ A A ++WI
Sbjct: 53 FLPQGYPDSVHPDYTSYQIWDTVQAFASTIMGTLTTHSIMQGVGVGEASATPLAAAITWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
LKDG +G+++ + G +D + K+WR+ AD L DL GLE+ P L + +
Sbjct: 113 LKDGTGMIGRIVFAWWNGTDLDGQCKKWRLFADILNDLAMGLELFLPYFSSYSLGILCIS 172
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
K I VA ATR + A + NL+D+ AK + T N++ GI + S + +
Sbjct: 173 TAMKSIVGVAGGATRAALTQHQALQNNLADVSAKDGSQETCVNLIASFVGILILS-IFHN 231
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ M + L +IH+Y+ ++A +N+LN R A+IV ++ I P ++ +E
Sbjct: 232 GRYIMELYLFLVMIHLYANYMAVKALCLNSLNEDRLALIVKSYISNEVIPKPEEVNKKES 291
Query: 307 LLF 309
+L
Sbjct: 292 VLL 294
>gi|334332835|ref|XP_001370394.2| PREDICTED: UPF0420 protein C16orf58 homolog [Monodelphis domestica]
Length = 442
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 86/295 (29%), Positives = 142/295 (48%), Gaps = 9/295 (3%)
Query: 19 MSVQEIPI---YWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGY 75
MS E P+ W + R PDG LS + + + F P G+
Sbjct: 1 MSGSEGPVAIKCWERFGSRLPRSCTVAPDGSLSWEAGGGPGQGLRAALTAV---FLPHGF 57
Query: 76 PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQ 133
P SV+ YL Y + +Q F S+ L+TQ++L G+ A A ++W+LKDG
Sbjct: 58 PDSVSPDYLPYQLWDTVQAFASSLTGSLATQAVLQGVGVGDAAASVSAATITWLLKDGTG 117
Query: 134 HVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
+G++ + L G+R+D E K+WR+ AD L D LE+++P P F + + + AK I
Sbjct: 118 MLGRIGFAWLMGSRLDCEAKKWRLFADVLNDAAMFLEIVAPGFPSCFTLIVCVSSLAKCI 177
Query: 193 AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMI 252
VA ATR + A+ N++D+ AK + TL N+ GL + L V + +
Sbjct: 178 VGVAGGATRAALTMHQARRDNMADVSAKDGSQETLVNLAGLLVSLLLLPLVSNRPSLSLA 237
Query: 253 VGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
L +++H+Y+ +RA + T N R +++ F++ G++ +P E L
Sbjct: 238 AFFLFTILHLYANFRAVRAVIMETFNEARLYLVLKHFLQWGKVLTPPSANRMEPL 292
>gi|291410995|ref|XP_002721793.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
Length = 468
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 73/241 (30%), Positives = 122/241 (50%), Gaps = 3/241 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 77 FLPQGFPESVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGLGVGNAKATVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDIAMFLEIMAPVFPVCFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ LL+ +H+Y+ +RA + TLN R +++ F++ G + P E
Sbjct: 257 PSVSLGCFFLLTALHIYANYRAVRALVMETLNESRLRLVLKHFLQQGEVLDPTPANQMEP 316
Query: 307 L 307
L
Sbjct: 317 L 317
>gi|14424560|gb|AAH09308.1| C16orf58 protein [Homo sapiens]
Length = 468
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 76/276 (27%), Positives = 135/276 (48%), Gaps = 5/276 (1%)
Query: 35 VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
++R + +P+G + +V P + F P G+P SV+ YL Y + ++Q
Sbjct: 44 LSRAFTVKPEGRDAGEVGASGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
F S+ L+TQ++L G+ A A +W++KD +G+++ + G+++D
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161
Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
K+WR+ AD L D+ LE+++P+CP F N AK I VA ATR + A+
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVCPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221
Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
N++D+ AK + TL N+ GL + + V + L+ +H+Y+ +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281
Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+ TLN R +++ +++ G + P E L
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 317
>gi|417401377|gb|JAA47577.1| Hypothetical protein [Desmodus rotundus]
Length = 464
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 73/248 (29%), Positives = 124/248 (50%), Gaps = 3/248 (1%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
+ L F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A
Sbjct: 70 LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129
Query: 121 ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
A +WI+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 130 AATATWIVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMCF 189
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
+ N AK + VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 190 TVTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
V + LL+ +HVY+ ++A + TLN R +++ F++ G + P
Sbjct: 250 LPLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPT 309
Query: 300 DLRYREDL 307
E L
Sbjct: 310 SANQMEPL 317
>gi|357487621|ref|XP_003614098.1| hypothetical protein MTR_5g044720 [Medicago truncatula]
gi|355515433|gb|AES97056.1| hypothetical protein MTR_5g044720 [Medicago truncatula]
Length = 749
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 137/255 (53%), Gaps = 7/255 (2%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ V FL F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 356 FTHVWRRFLQAFVPEGFPSSVTPDYVPFQIWDLLQGLSTYIRTMLSTQALLSAIGV--GE 413
Query: 119 AQATVV----SWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSP 173
ATV+ W L+D +G ++ + G+ +DS K WR++AD + DLG ++++SP
Sbjct: 414 KSATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLISP 473
Query: 174 LCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
L P F+ + LG+ ++ VA+ ATR + FA + N +D+ AK + T+ ++G+
Sbjct: 474 LFPSAFVFIVCLGSISRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGM 533
Query: 234 GAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
G+ +A L++ H+Y+ ++ +N+LNP+R+++++ F+++
Sbjct: 534 ALGMLVARITTGHPLAIWFSFLSLTLFHMYANYRAVQCLSLNSLNPERSSILLQRFMESR 593
Query: 294 RISSPADLRYREDLL 308
++ SP + E +L
Sbjct: 594 QVLSPKQVSPLEHVL 608
>gi|170582654|ref|XP_001896227.1| cDNA sequence BC017158 [Brugia malayi]
gi|158596613|gb|EDP34929.1| cDNA sequence BC017158, putative [Brugia malayi]
Length = 389
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 133/243 (54%), Gaps = 3/243 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
F P GYP SV+ Y+ Y + +Q F S+ S LST+++L G+ + A V+W+
Sbjct: 42 FMPRGYPQSVSPDYMNYQMWDTIQAFASSMSSALSTEAILRGVGVGNKAASTMAAAVAWL 101
Query: 128 LKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
LKDG+ + +++ + L + +D++ K+WR++AD DL L++++P+ P LF+ + L
Sbjct: 102 LKDGIGMLARILFAWLYSPYLDADCKQWRLIADCFNDLAFCLDLITPIFPSLFMPIICLS 161
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
+ + + VA ATR + + A N+ D+ AK + TL NV L + L V +
Sbjct: 162 SMIRAVVGVAGSATRTTVTNHQAILDNVGDVAAKDGSQETLINVFALLCSLLLLPVVSRN 221
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ ++ L + IH+Y +++ TLN ++V D+++T +IS+ +++ +E
Sbjct: 222 IVSVWLLFSLFTFIHLYGNYRAVKSLQFRTLNQSLLRIVVKDYIETRKISTVSEMNSKEP 281
Query: 307 LLF 309
+L
Sbjct: 282 ILL 284
>gi|345801540|ref|XP_547054.3| PREDICTED: UPF0420 protein C16orf58 homolog [Canis lupus
familiaris]
Length = 468
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 71/241 (29%), Positives = 122/241 (50%), Gaps = 3/241 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAEASVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F +
Sbjct: 137 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFFTMTVCIS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V +
Sbjct: 197 NLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSAC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +RA + TLN R +++ F++ G + P E
Sbjct: 257 PSFSLSCFFFLTALHIYANYRAVRALVLETLNEDRLRLVLKHFLQRGEVLDPTSANQMEP 316
Query: 307 L 307
L
Sbjct: 317 L 317
>gi|326527127|dbj|BAK04505.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 74/253 (29%), Positives = 134/253 (52%), Gaps = 3/253 (1%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ R+ L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 55 WGRIGSRVLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKS 114
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + DLG +++LSPL
Sbjct: 115 ATVIGATFQWFLRDLTGMLGGILFAFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 174
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
P + + LG+ ++ VA+ ATR + FA N +D+ AK + TL + G+G
Sbjct: 175 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMSGMGL 234
Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
G+ LA + L+V H+Y+ + +++ ++TLN +RT++++ F++ G +
Sbjct: 235 GMLLAHVTRGHDLVVWVSFLSLTVFHMYANYKAVQSLSLSTLNYERTSILLQYFMEHGEV 294
Query: 296 SSPADLRYREDLL 308
+P + +E +L
Sbjct: 295 LTPQQVSKQEHIL 307
>gi|255562731|ref|XP_002522371.1| mRNA, complete cds, clone: rafl24-09-C04, putative [Ricinus
communis]
gi|223538449|gb|EEF40055.1| mRNA, complete cds, clone: rafl24-09-C04, putative [Ricinus
communis]
Length = 523
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 74/242 (30%), Positives = 129/242 (53%), Gaps = 9/242 (3%)
Query: 76 PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGM 132
P V E Y+ Y +++ L S+AL VL+TQ++ A G+ R P+ A ++W+LKDG+
Sbjct: 132 PKQVTENYMDYVKWKLLHRVFSSALQVLATQAMFRAVGIGYSRSLPS-AAALNWVLKDGL 190
Query: 133 QHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
+ + I ++L + D+ KR R L+ LG GLE+L+P P+ FL +A + N AK
Sbjct: 191 GRLSRCIYTASLASAFDTNLKRVRFSTSILFSLGIGLELLTPAFPKQFLLLATIANIAKQ 250
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ--- 248
I++ AT I+ SFA NL ++ AK + + F+ +GL L ++ +
Sbjct: 251 ISLACHLATGSAIHRSFAVADNLGEVSAKAQIQTVCFDNLGLMLAALLNMLFKNNQRLLA 310
Query: 249 -GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+V P+ S I ++ + + ++ + TL R MI+ ++ GR+ +PA++ +E
Sbjct: 311 GLPFVVYPIFSAIDLFGIYQGLKHVHLQTLTKDRLEMILDSWIALGRVPTPAEVSKKEGF 370
Query: 308 LF 309
F
Sbjct: 371 EF 372
>gi|307111229|gb|EFN59464.1| hypothetical protein CHLNCDRAFT_138052 [Chlorella variabilis]
Length = 657
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 70/214 (32%), Positives = 117/214 (54%), Gaps = 11/214 (5%)
Query: 106 QSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADAL 161
+SLL A G+ R PA AT+ +W+LKDG+ +G+L + + G DS+ KR+R +
Sbjct: 78 ESLLQAVGVGARRSLPAAATI-NWVLKDGLGRLGRLTVATRFGESFDSDLKRFRYATSII 136
Query: 162 YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKG 221
Y + LE L+PL PQ FL MA L N K I + AT+ + SF NL+D+ AK
Sbjct: 137 YAVSLSLEFLTPLAPQHFLVMASLANVGKSIGLTTFIATQPAFHRSFCLRENLADISAKT 196
Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGK-----MIVGPLLSVIHVYSVIEEMRAAPVNT 276
+A + + +GL A + L + +C + +++ PLL+ ++++ E+R+ + T
Sbjct: 197 QAQQMVMDNIGLAAAVGL-TYLCRHTEAARRALPLVMFPLLAAGDLWAIYSELRSIHLRT 255
Query: 277 LNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
LN +R +I +++ GR+ SP + E + P
Sbjct: 256 LNKERAEIIAQHWLREGRVPSPRQVSEEERFVLP 289
>gi|357612137|gb|EHJ67828.1| hypothetical protein KGM_02553 [Danaus plexippus]
Length = 525
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 73/247 (29%), Positives = 122/247 (49%), Gaps = 7/247 (2%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
F F P GYP SV++ Y+ Y + Q F S +L+TQ + G+ T A A
Sbjct: 47 FAKIFLPQGYPNSVSKDYIFYQIWDTAQAFCSTITGILATQEVFRGVGVGDTNASPLAAT 106
Query: 124 VSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
V+W+ KDG H+GK++ + G +D+ K+WR+ AD L D +E+ PL
Sbjct: 107 VTWVFKDGCGHIGKILFAYTHGTYLDAYSKKWRLYADTLNDAAMCIEIALPLFKNYITFA 166
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ K I VA ATR+ + A GNL+D+ AK A T N++ A + L S
Sbjct: 167 LCVSTCMKAIVGVAGGATRVAMTQHHALRGNLADVSAKDSAQETAVNLIASFAALFLISL 226
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
+ +S+ + +L ++H+ +RA + TLN R ++ +++ I++P ++
Sbjct: 227 IGNSVT----IFIILLIMHIVFNYMAVRAVCLRTLNEPRFLQVIDTYLRKEVIANPCEIN 282
Query: 303 YREDLLF 309
E ++F
Sbjct: 283 RNEPIIF 289
>gi|78486556|ref|NP_001030594.1| UPF0420 protein C16orf58 homolog [Rattus norvegicus]
gi|123781627|sp|Q499P8.1|CP058_RAT RecName: Full=UPF0420 protein C16orf58 homolog
gi|71682994|gb|AAH99813.1| Similar to cDNA sequence BC017158 [Rattus norvegicus]
Length = 466
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 72/241 (29%), Positives = 123/241 (51%), Gaps = 3/241 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL+Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D LE+++P+ P F
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTVSTS 194
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + T+ N+ GL + + V
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ + LL+ +H+Y+ +RA + TLN R +++ F++ G + PA E
Sbjct: 255 LSLSLGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHFLQRGEVLEPASANQMEP 314
Query: 307 L 307
L
Sbjct: 315 L 315
>gi|195344882|ref|XP_002039005.1| GM17287 [Drosophila sechellia]
gi|194134135|gb|EDW55651.1| GM17287 [Drosophila sechellia]
Length = 417
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 77/257 (29%), Positives = 130/257 (50%), Gaps = 6/257 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F P GYP SV+E Y Y + Q F S L T ++L G A + +WI
Sbjct: 53 FLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATATWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
LK+G HVG+++ + G+++D + K+WR+ AD L DL G+E+ + P P L ++
Sbjct: 113 LKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQILCC 172
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K I VA ATR + A GNL+D+ +K + T N+V G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
I ++S +H+Y+ ++ +RA + + N R + + +F ++ R+ SP + E
Sbjct: 233 QAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNAME 291
Query: 306 DLLFPGRLIEDAGNVKM 322
+ G+ + + N+K+
Sbjct: 292 RVTV-GQTVSVSLNIKL 307
>gi|81879541|sp|Q91W34.1|CP058_MOUSE RecName: Full=UPF0420 protein C16orf58 homolog
gi|16877856|gb|AAH17158.1| CDNA sequence BC017158 [Mus musculus]
Length = 466
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 75/243 (30%), Positives = 125/243 (51%), Gaps = 7/243 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 135 VKDSTGMLGRIILAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--C 244
N AK I VA ATR + A+ N++D+ AK + T+ N+ GL + + V C
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
S+ V LL+ +H+Y+ +RA + TLN R +++ F++ G + PA
Sbjct: 255 PSLSLGCFV--LLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQM 312
Query: 305 EDL 307
E L
Sbjct: 313 EPL 315
>gi|405974674|gb|EKC39300.1| UPF0420 protein C16orf58-like protein [Crassostrea gigas]
Length = 444
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 135/279 (48%), Gaps = 28/279 (10%)
Query: 49 VKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSL 108
+K VD S P + + F F P GYP SV+ YL+Y + +Q F S+ L+ Q+L
Sbjct: 24 LKTVDLS-PKWTSLSQFFTVVFLPQGYPESVSSDYLQYQIWDTIQAFASSITGTLAAQAL 82
Query: 109 LFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLG 165
L G+ A A ++WILKDG +G+++ + + G +D + KRWR+ AD L D+
Sbjct: 83 LKGVGVGDESATVMAATLTWILKDGTGMIGRILFAWMQGTNLDCDAKRWRLFADILNDMA 142
Query: 166 TGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIS 225
+E+L+P F + K I VA ATR + A+ N++D+ AK +
Sbjct: 143 IFMEILAPYFKAYFTPIVCTAGVCKSIVGVAGGATRAALTQHQARRNNMADVSAKDGSQE 202
Query: 226 TLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHV------------YSVIEEMRAAP 273
TL N LA+ VCS ++ G + + + YS + +
Sbjct: 203 TLVN---------LAALVCSLTLVPLVTGKDVLIWSLFLLFTLLHLFANYSAV---TSVV 250
Query: 274 VNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGR 312
+ TLN R ++V +++TG + S ++ Y+E +L+ R
Sbjct: 251 METLNQARLHILVHHYLQTGEVLSLQEVNYKEPVLWKTR 289
>gi|324514354|gb|ADY45840.1| Unknown [Ascaris suum]
Length = 394
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 77/247 (31%), Positives = 129/247 (52%), Gaps = 3/247 (1%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
F++ F P GYP SV+ Y+ Y + +Q F S+ S L+T+++L AG+ A A
Sbjct: 44 FVDVFLPRGYPHSVSADYMSYQIWDTVQAFASSMTSALATEAVLRGAGVGDENASALAAT 103
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+++DG+ + ++ + L A++D++ KRWR++AD L DL LE+L+ P LF +
Sbjct: 104 LTWLIRDGLGMLSRIGFAWLKSAQLDADCKRWRLVADFLNDLAFSLELLAAALPSLFTLL 163
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
L + A+ I VA ATR + A+ N++D+ AK + TL NV L + L
Sbjct: 164 VCLSSLARSIVAVAGGATRTTVVQHQARCNNVADVSAKDGSQETLVNVTALLCSLILLPL 223
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
+ + L + +H+++ + A TLN + +I FV+TG + S
Sbjct: 224 ISGRTVLIWMFYILFTGVHLFANYAAVTALHFETLNQKLLRIITNHFVETGDVFSVKVAN 283
Query: 303 YREDLLF 309
E LLF
Sbjct: 284 ELEPLLF 290
>gi|66816549|ref|XP_642284.1| DUF647 family protein [Dictyostelium discoideum AX4]
gi|60470350|gb|EAL68330.1| DUF647 family protein [Dictyostelium discoideum AX4]
Length = 567
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 71/245 (28%), Positives = 134/245 (54%), Gaps = 11/245 (4%)
Query: 60 HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA-GLRPTP 118
+++ DS F P GYP +V Y+ Y ++ +Q+ + VLST +LL + G+ T
Sbjct: 117 NKLFDSIKKLFLPKGYPSTVTSDYIGYQKWMFIQNVIGSTTYVLSTHALLTSVIGISSTS 176
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A A +SW+LKDG+ ++ +N D + K+++ D L++ G LE+ +P
Sbjct: 177 ALPFAAAISWVLKDGLGATALVLFANKYSTLFDYDIKKYKFRGDLLHNFGVLLEMFTPFF 236
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK--GEAISTLFNVVGL 233
Q FL +A + N +KG+A + +TR + SF+ + N+ D+ AK ++++ + + +
Sbjct: 237 QQFFLPLASISNLSKGLAGLIYGSTRASLNRSFSLKENIGDITAKYQSQSMAAYLSGMAI 296
Query: 234 GAGIQLASTV---CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
G+ I + +V S M+ G LS IH++ I +++ + TLN QRT++++ +++
Sbjct: 297 GSSIGMILSVFPLTSVSNLSMVFG--LSFIHIFCGINAIKSIDLKTLNQQRTSILIDNWL 354
Query: 291 KTGRI 295
K+ I
Sbjct: 355 KSNLI 359
>gi|8920582|gb|AAF81304.1|AC027656_21 Contains similarity to CG10338 gene product from Drosophila
melanogaster gb|AE003660 [Arabidopsis thaliana]
Length = 438
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 71/242 (29%), Positives = 130/242 (53%), Gaps = 3/242 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F G+P SV Y+ + + LQ ++ +LSTQ+LL A G+ A W
Sbjct: 55 FSSPGFPGSVTPDYVGFQLWDTLQGLSTYTKMMLSTQALLSAIGVGEKSATVIGATFQWF 114
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L+D +G ++ + G+ +DS K WR++AD + D+G +++LSPL P F+ + LG
Sbjct: 115 LRDFTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLFPSAFIVVVCLG 174
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
+ ++ VA+ ATR + FA + N +D+ AK + T+ ++G+ G+ LA +
Sbjct: 175 SLSRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETMATMMGMSLGMLLARFTSGN 234
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+V H+Y+ +R +N+LN +R+++++ F++TG++ SP + E
Sbjct: 235 PMAIWLSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTHFIQTGQVLSPEQVSSMEG 294
Query: 307 LL 308
+L
Sbjct: 295 VL 296
>gi|342881518|gb|EGU82408.1| hypothetical protein FOXB_07090 [Fusarium oxysporum Fo5176]
Length = 448
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 88/307 (28%), Positives = 152/307 (49%), Gaps = 10/307 (3%)
Query: 22 QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNE 81
+ + I + S +V+R + D H VV S+P ++ S F P GYP SV+
Sbjct: 12 EPVEIVELDKSGNVSRTWNHTVDNH----VVAKSKPSI-PLIPSLREAFMPVGYPHSVSS 66
Query: 82 GYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLI 139
YL Y F +LQ F S S+L+ ++LL G+ + AT ++ +LKD M + ++
Sbjct: 67 DYLNYQFFDSLQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVLKDAMSRIATIV 126
Query: 140 CSN-LGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAKGIAVVAA 197
++ G R++ + KR+R LAD D LE+ SP P + G + + VAA
Sbjct: 127 FAHQFGLRIEPDAKRFRFLADLFNDTAFFLELYSPYFGPYGKILALTSGETLRALCGVAA 186
Query: 198 RATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLL 257
A++ + FAK NLS+L AK + T ++GL G + V + + +L
Sbjct: 187 GASKAALSVHFAKHDNLSELNAKEASQETAVGLIGLLVGTIVVKYV-EDHHSVLFLMTVL 245
Query: 258 SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDA 317
H++ +RA ++ LN QR ++ +++KTG + SP ++ +RE++L ++ +
Sbjct: 246 VFAHLWMNYLGVRAVCMDNLNQQRATILFEEYLKTGNVLSPEEVAHRENILLWRPVVRNQ 305
Query: 318 GNVKMDR 324
K+ R
Sbjct: 306 HGQKIAR 312
>gi|414879010|tpg|DAA56141.1| TPA: hypothetical protein ZEAMMB73_060175 [Zea mays]
Length = 368
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 62/166 (37%), Positives = 95/166 (57%), Gaps = 1/166 (0%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
+ R + ++ P GYP SV+ YL Y+ +R +Q S VLSTQ+LL+A GL +
Sbjct: 186 WRRYAEVAVHLLLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGLGKGA 245
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A V+W+LKDG+ ++ K++ S G D PK WR+ AD L ++ GLE+L+P+ P
Sbjct: 246 IPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPH 305
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
LF+ + + A + ATR Y+ FA + N +++ AKGEA
Sbjct: 306 LFVPIGAAAGAGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEA 351
>gi|395514351|ref|XP_003761381.1| PREDICTED: UPF0420 protein C16orf58 homolog [Sarcophilus harrisii]
Length = 455
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 78/261 (29%), Positives = 131/261 (50%), Gaps = 10/261 (3%)
Query: 57 PVYHRVVDSFLNK-------FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL 109
P HR + S L+ F P G+P SV+ YL Y + +Q F S+ L+TQ++L
Sbjct: 45 PRLHRPLTSLLSSLPPSQAVFLPHGFPDSVSPDYLPYQLWDTVQAFASSLTGSLATQAVL 104
Query: 110 FAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGT 166
G+ A A ++W+LKDG +G++ + L G+++D + K+WR+ AD L D
Sbjct: 105 QGVGVGDAAASVSAATITWLLKDGTGMLGRIGFAWLMGSKLDCDAKKWRLFADVLNDAAM 164
Query: 167 GLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIST 226
LE+++P P F + + + AK I VA ATR + A+ N++D+ AK + T
Sbjct: 165 FLEIVAPAFPSCFTLIVCVSSLAKCIVGVAGGATRAALTMHQARRDNMADVSAKDGSQET 224
Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
L N+ GL + L V + + L + +H+Y+ +RA + T N R +++
Sbjct: 225 LVNLAGLLVSLLLLPLVSERPRLSLAAFFLFTALHLYANYRAVRAVIMETFNETRLYLVL 284
Query: 287 ADFVKTGRISSPADLRYREDL 307
F++ G + +P E L
Sbjct: 285 KHFIQWGEVLTPPSANRMEPL 305
>gi|74201602|dbj|BAE28428.1| unnamed protein product [Mus musculus]
Length = 466
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 75/243 (30%), Positives = 125/243 (51%), Gaps = 7/243 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--C 244
N AK I VA ATR + A+ N++D+ AK + T+ N+ GL + + V C
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
S+ V LL+ +H+Y+ +RA + TLN R +++ F++ G + PA
Sbjct: 255 PSLSLGCFV--LLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQM 312
Query: 305 EDL 307
E L
Sbjct: 313 EPL 315
>gi|225579041|ref|NP_663565.2| UPF0420 protein C16orf58 homolog [Mus musculus]
gi|74204001|dbj|BAE29003.1| unnamed protein product [Mus musculus]
gi|148685697|gb|EDL17644.1| cDNA sequence BC017158, isoform CRA_b [Mus musculus]
Length = 466
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 75/243 (30%), Positives = 125/243 (51%), Gaps = 7/243 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--C 244
N AK I VA ATR + A+ N++D+ AK + T+ N+ GL + + V C
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
S+ V LL+ +H+Y+ +RA + TLN R +++ F++ G + PA
Sbjct: 255 PSLSLGCFV--LLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQM 312
Query: 305 EDL 307
E L
Sbjct: 313 EPL 315
>gi|149067630|gb|EDM17182.1| similar to cDNA sequence BC017158, isoform CRA_b [Rattus
norvegicus]
Length = 466
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 72/241 (29%), Positives = 122/241 (50%), Gaps = 3/241 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL+Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D LE+++P+ P F
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTVSTS 194
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + T+ N+ GL + + V
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ LL+ +H+Y+ +RA + TLN R +++ F++ G + PA E
Sbjct: 255 LSLSFGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHFLQRGEVLEPASANQMEP 314
Query: 307 L 307
L
Sbjct: 315 L 315
>gi|219119716|ref|XP_002180612.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408085|gb|EEC48020.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 408
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 75/254 (29%), Positives = 132/254 (51%), Gaps = 11/254 (4%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS 125
+ F P+ +P SV GY ++ F L + +A VLSTQ+LL A G+ A+V++
Sbjct: 26 LIQHFLPAQFPASVAPGYAQFCGFCFLASVSGSAGMVLSTQTLLLAIGVVGNAHSASVMA 85
Query: 126 ----WILKDGMQHVGKLI-CSNLGA--RMDSEPKRWRILADALYDLGTGLEVLSPLCPQ- 177
W+LKDG+ +G +I S +G R D+ PK+WR++ D + L++LSP P
Sbjct: 86 GALNWVLKDGIGQLGGVIFASRMGETRRFDANPKKWRMMGALCLDSASLLDILSPFAPSA 145
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
+ L +A N K I + A+R ++ + + GNL D+ AK + S ++G G GI
Sbjct: 146 VVLPLACFANALKNIGYLTVGASRAALHQALTRAGNLGDVTAKAGSQSIAAGLLGTGVGI 205
Query: 238 QLASTVCSSMQGKMIVGPL-LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
L++ + G ++G LS+IH ++ ++ N R +++ +++ G++
Sbjct: 206 GLSAAMGHD-AGNFVLGFCGLSLIHQLCNFAALQHVDLHHFNRHRLYLVLDHYLRHGKVL 264
Query: 297 SPADLRYREDLLFP 310
SP ++ +E FP
Sbjct: 265 SPREIATKES-FFP 277
>gi|339250912|ref|XP_003374441.1| conserved hypothetical protein [Trichinella spiralis]
gi|316969248|gb|EFV53376.1| conserved hypothetical protein [Trichinella spiralis]
Length = 340
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 78/275 (28%), Positives = 132/275 (48%), Gaps = 12/275 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P GYP SV E Y Y + +LQ F S+ + + ++ A G+ A A V W+
Sbjct: 55 FLPQGYPDSVTEDYFDYQCWDSLQAFCSSVTNAFALHAVFKAVGVGNESASLLAATVVWL 114
Query: 128 LKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L++G+ + +++ + A +DS+ K WR +AD L D L++++PL F+ +A L
Sbjct: 115 LRNGVGMMSQIMFAWFCAGHLDSDCKSWRFVADVLNDCSIFLDLIAPLSESTFVYVACLA 174
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N + I +A ATR I ++ N++D+ AK + T+ N++ LG + L V
Sbjct: 175 NVCRSIVGIAGGATRAAITQHQSRRHNMADVAAKDSSQETMVNLIALGFNLALLPLVAHH 234
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK--TGRISSPADLRYR 304
+ + + + +H+Y+ +RA TLN +R M+V F+K T + Y
Sbjct: 235 VLLVWTLFLMFTFMHIYANYRAIRALKFETLNEKRFRMLVQSFIKDNTNAKMGVEIVNYE 294
Query: 305 EDLL------FPGRLIEDAGNVKMDRHGIRAECFR 333
E L+ F G I G + R ++ CF
Sbjct: 295 ESLVPFALTNFIGDSIS-LGTSLIHRKNLKLLCFH 328
>gi|149725803|ref|XP_001495510.1| PREDICTED: UPF0420 protein C16orf58 homolog [Equus caballus]
Length = 468
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 72/245 (29%), Positives = 122/245 (49%), Gaps = 3/245 (1%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
L F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A
Sbjct: 73 LLALFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGLGVGDAKASVSAAT 132
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+W++KD +G++I + G+++D K WR+ AD L D+ LE+++P+ P F +
Sbjct: 133 ATWLVKDSTGMLGRIIFAWWKGSKLDCNAKHWRLFADILNDVAMFLEIMAPVYPICFTMI 192
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ N AK I VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 193 VCISNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPL 252
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
V + L + +H+Y+ +RA + TLN R +++ F++ G + P
Sbjct: 253 VSGCPSFSLGCFFLFTALHIYANYRAVRALVIETLNEGRLWLVLKHFLQRGEVLGPTSAN 312
Query: 303 YREDL 307
E L
Sbjct: 313 RMEPL 317
>gi|357120354|ref|XP_003561892.1| PREDICTED: UPF0420 protein C16orf58-like [Brachypodium distachyon]
Length = 452
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 73/253 (28%), Positives = 133/253 (52%), Gaps = 3/253 (1%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ R+ L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 60 WGRIGGRVLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLSAIGVGEKS 119
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + DLG +++LSPL
Sbjct: 120 ATVIGATFQWFLRDLTGMLGGILFAFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 179
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
P + + LG+ ++ VA+ ATR + FA N +D+ AK + TL + G+G
Sbjct: 180 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALAKNAADISAKEGSQETLATMSGMGL 239
Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
G+ LA + L++ H+Y+ + +++ + TLN +RT++++ F++ G +
Sbjct: 240 GMLLAHVTRGHDLVVWVSFLSLTIFHMYANYKAVQSLSLVTLNYERTSILLQYFLEDGEV 299
Query: 296 SSPADLRYREDLL 308
+P + +E +L
Sbjct: 300 RTPQQVSKQEHIL 312
>gi|260815171|ref|XP_002602347.1| hypothetical protein BRAFLDRAFT_234382 [Branchiostoma floridae]
gi|229287656|gb|EEN58359.1| hypothetical protein BRAFLDRAFT_234382 [Branchiostoma floridae]
Length = 244
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 70/239 (29%), Positives = 125/239 (52%), Gaps = 3/239 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
F P GYP SV++ YL Y + +Q F S+ L+TQ++L G+ A ++WI
Sbjct: 4 FLPQGYPESVSDDYLSYQIWDTVQAFCSSITGTLATQAVLKGVGVGDETATAAGATITWI 63
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
LKDG +G+++ + L G +D + KRWR+ AD + D LE+++P P F + +
Sbjct: 64 LKDGTGMLGRILFAWLQGTSLDCDAKRWRLFADIVNDFSIFLELVAPAFPLFFTGIVCVA 123
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
K I V+ ATR + A+ N++D+ AK + TL N++ L AG+ + V +
Sbjct: 124 GVGKAIVGVSGGATRAALTMHQARRDNMADVSAKDGSQETLVNLMALLAGLVVTPMVADN 183
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
+ + + +H+++ + + TLN R ++V+ +++TG + +PA +E
Sbjct: 184 PLLTWFLFFVFTCLHLFANYNAVTCVVMETLNQSRLHILVSHYLRTGVVLNPAQANVQE 242
>gi|302766892|ref|XP_002966866.1| hypothetical protein SELMODRAFT_64357 [Selaginella moellendorffii]
gi|300164857|gb|EFJ31465.1| hypothetical protein SELMODRAFT_64357 [Selaginella moellendorffii]
Length = 265
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 75/242 (30%), Positives = 130/242 (53%), Gaps = 3/242 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV--VSWI 127
F P GYP SV+ YL + + LQ ++ S+LSTQ+LL G+ A A W
Sbjct: 23 FLPEGYPSSVSSDYLAFQTWDTLQGLSTYIRSMLSTQALLSGIGVGAATATAIGATFQWF 82
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L+D VG ++ + G+ +D+ K+WR+ AD + D+G ++++SPL P+ F+ LG
Sbjct: 83 LRDLTGMVGGIVFTLYQGSNLDNRAKQWRLAADFMNDIGMLMDLVSPLFPRAFMFFLCLG 142
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
+ A+ + VA ATR + FA++ N +D+ AK + T +VG+ G+ LA ++
Sbjct: 143 SAARAVTGVAGGATRAALTQHFARKQNAADVSAKEGSQETAATLVGMIVGMFLARLTANN 202
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ HVY+ + + A + ++N +R A+++ F+K G+ SP + +E
Sbjct: 203 IVLMWTSFMSLTAFHVYANYKAVCALCLTSINAERMAIVLQSFLKDGKAPSPEEASAQES 262
Query: 307 LL 308
L
Sbjct: 263 AL 264
>gi|195484210|ref|XP_002090597.1| GE13200 [Drosophila yakuba]
gi|194176698|gb|EDW90309.1| GE13200 [Drosophila yakuba]
Length = 417
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 76/257 (29%), Positives = 129/257 (50%), Gaps = 6/257 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F P GYP SV+E Y Y + Q F S L T ++L G A + +WI
Sbjct: 53 FLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATATWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
LK+G HVG+++ + G+++D + K+WR+ AD L DL G+E+ + P P ++
Sbjct: 113 LKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQILCC 172
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K I VA ATR + A GNL+D+ +K + T N+V G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
I ++S +H+Y+ ++ +RA + + N R + + +F ++ R+ SP + E
Sbjct: 233 QAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNAME 291
Query: 306 DLLFPGRLIEDAGNVKM 322
+ G+ + + N+K+
Sbjct: 292 RVTV-GQTVSVSLNIKL 307
>gi|395842960|ref|XP_003794274.1| PREDICTED: UPF0420 protein C16orf58 homolog [Otolemur garnettii]
Length = 468
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 71/241 (29%), Positives = 121/241 (50%), Gaps = 3/241 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGLGVGNAKASVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPICFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLILPLVSDC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +RA + TLN R +++ F++ G + P E
Sbjct: 257 PSFSLGCFFFLTALHIYANYRAVRAIVMETLNQGRLRLVLKHFLQRGEVLDPTSANKMEP 316
Query: 307 L 307
L
Sbjct: 317 L 317
>gi|197101105|ref|NP_001126133.1| UPF0420 protein C16orf58 homolog [Pongo abelii]
gi|75041556|sp|Q5R8F6.1|CP058_PONAB RecName: Full=UPF0420 protein C16orf58 homolog
gi|55730464|emb|CAH91954.1| hypothetical protein [Pongo abelii]
Length = 468
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 77/276 (27%), Positives = 135/276 (48%), Gaps = 5/276 (1%)
Query: 35 VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
++R + +P+G S +V P + F P G+P SV+ YL Y + ++Q
Sbjct: 44 LSRAFTVKPEGRDSGEVGAPGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
F S L+TQ++L G+ A A +W++KD +G+++ + G+++D
Sbjct: 102 FASGLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161
Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
K+WR+ AD L D+ LE+++P+ P F N AK I VA ATR + A+
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221
Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
N++D+ AK + TL N+VGL + + V + L+ +H+Y+ +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLVGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281
Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+ TLN R +++ +++ G + +P E L
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLNPTAANRMEPL 317
>gi|195579948|ref|XP_002079818.1| GD24150 [Drosophila simulans]
gi|194191827|gb|EDX05403.1| GD24150 [Drosophila simulans]
Length = 417
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 76/257 (29%), Positives = 129/257 (50%), Gaps = 6/257 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F P GYP SV+E Y Y + Q F S L T ++L G A + +WI
Sbjct: 53 FLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATATWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
LK+G HVG+++ + G+++D + K+WR+ AD L DL G+E+ + P P ++
Sbjct: 113 LKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQILCC 172
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K I VA ATR + A GNL+D+ +K + T N+V G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
I ++S +H+Y+ ++ +RA + + N R + + +F ++ R+ SP + E
Sbjct: 233 QAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNAME 291
Query: 306 DLLFPGRLIEDAGNVKM 322
+ G+ + + N+K+
Sbjct: 292 RVTV-GQTVSVSLNIKL 307
>gi|345492163|ref|XP_001602603.2| PREDICTED: UPF0420 protein C16orf58 homolog [Nasonia vitripennis]
Length = 466
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 82/267 (30%), Positives = 131/267 (49%), Gaps = 11/267 (4%)
Query: 51 VVDDSRPVY-----HRVVDSFLNK-FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLS 104
VV + +PV V SF + F P G+P SV++ Y+ Y + LQ F S L+
Sbjct: 29 VVTNFKPVRGTKFAESVTYSFFKEVFLPQGFPDSVHKDYVPYQIWDTLQAFASTINGTLT 88
Query: 105 TQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADAL 161
T S++ G+ + A A ++WILKDG +G++I + G +DS+ K+WR+ AD L
Sbjct: 89 THSIMRGVGVGESTATPLAAAITWILKDGTGMIGRIIFAWWQGNNLDSQCKKWRLFADIL 148
Query: 162 YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKG 221
D LEV P L K I +A ATR I A + N++D+ AK
Sbjct: 149 NDAAMALEVTVPYISTFSSYTLCLTTGMKAIVGIAGGATRTAIMQHHAIKDNMADVSAKE 208
Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
+ TL N+ G GI + + + + V LL ++H+ S + + +NTLN R
Sbjct: 209 HSQGTLVNLAGSIVGILILLL--INEKFFIYVCLLLVIVHIVSNYFAVTSLELNTLNEDR 266
Query: 282 TAMIVADFVKTGRISSPADLRYREDLL 308
+I+ D++ +S+ D+ RE ++
Sbjct: 267 LCLIIEDYILNHTMSNVEDINSRESVI 293
>gi|24585018|ref|NP_609897.2| CG10338 [Drosophila melanogaster]
gi|22946770|gb|AAF53692.2| CG10338 [Drosophila melanogaster]
gi|384475974|gb|AFH89820.1| FI20169p1 [Drosophila melanogaster]
Length = 417
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 76/257 (29%), Positives = 129/257 (50%), Gaps = 6/257 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F P GYP SV+E Y Y + Q F S L T ++L G A + +WI
Sbjct: 53 FLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATATWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
LK+G HVG+++ + G+++D + K+WR+ AD L DL G+E+ + P P ++
Sbjct: 113 LKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQILCC 172
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K I VA ATR + A GNL+D+ +K + T N+V G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
I ++S +H+Y+ ++ +RA + + N R + + +F ++ R+ SP + E
Sbjct: 233 QAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNAME 291
Query: 306 DLLFPGRLIEDAGNVKM 322
+ G+ + + N+K+
Sbjct: 292 RVTV-GQTVSVSLNIKL 307
>gi|218194530|gb|EEC76957.1| hypothetical protein OsI_15243 [Oryza sativa Indica Group]
Length = 836
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 115/204 (56%), Gaps = 1/204 (0%)
Query: 107 SLLFAAGL-RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLG 165
+LL+A GL + A V+W+LKDG+ ++ K++ S G D PK WR+ AD L +
Sbjct: 476 ALLYAVGLGKGAIPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTA 535
Query: 166 TGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIS 225
GLE+L+P+ P LF+ + + A + ATR Y+ FA + N +++ AKGEA
Sbjct: 536 YGLEILTPVFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQG 595
Query: 226 TLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
+ +G+ GI LA+ + SS+ +I ++ +H+Y ++ ++ + TLNP R +++
Sbjct: 596 MVSKFLGIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYRASLV 655
Query: 286 VADFVKTGRISSPADLRYREDLLF 309
++++ +G++ S ++ E L
Sbjct: 656 FSEYLLSGQVPSVKEVNDEEPLFL 679
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 64/182 (35%), Positives = 95/182 (52%), Gaps = 9/182 (4%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
+ R + P GYP SV+ YL+Y+ +R +Q S VLSTQ+LL+A GL +
Sbjct: 310 WRRCATVAVQLLLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGLGKGA 369
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A V+W+LKDG+ ++ K++ S G D PK WR+ AD L + GLE+L+P+ P
Sbjct: 370 IPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTPVFPH 429
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE--GNLSDLFAKGEAIS-TLFNVVGLG 234
LF+ + + A + I++ F E ++ + AKGEA L VGLG
Sbjct: 430 LFVPIGAAAGAGRSAAALIQT-----IFAHFMLENANHMRKVIAKGEAQGMALLYAVGLG 484
Query: 235 AG 236
G
Sbjct: 485 KG 486
>gi|387019749|gb|AFJ51992.1| UPF0420 protein C16orf58-like [Crotalus adamanteus]
Length = 437
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 136/284 (47%), Gaps = 24/284 (8%)
Query: 37 RHYQFQPDGHLSVKVVDDSRP----VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRAL 92
R YQ P+G L+ K D S+ +HR+ S F P GYP SV+ YL Y + +
Sbjct: 20 RRYQALPNGKLNSKT-DASKDWHLNSFHRIFKSI---FLPQGYPESVSTDYLAYQFWDTI 75
Query: 93 QHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSE 150
Q F S+ L+TQ++L G+ + A V+WILKDG +G++ +
Sbjct: 76 QAFASSITGTLATQAVLKGVGVGDETSTVAAAAVTWILKDGTGMLGRIAFA--------- 126
Query: 151 PKRWR--ILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSF 208
W + AD L D+ +E+++P P F + F K I VA ATR +
Sbjct: 127 ---WSKGLFADVLNDVAIFMEIMAPAFPACFTLIVCTSGFFKCIVGVAGGATRAALTMHQ 183
Query: 209 AKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEE 268
A+ N++D+ AK + TL N+ GL + L V ++ L +++H+Y+ +
Sbjct: 184 ARRDNMADVSAKDGSQETLVNLAGLLFSLFLIPLVVDNLPLTYAFYALFTILHLYANYQA 243
Query: 269 MRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGR 312
+RA + TLN R +++ ++K + PA + +E LL R
Sbjct: 244 VRAVCMETLNRARLHLVLQHYLKREEVLGPAVINPQEPLLLGFR 287
>gi|194880075|ref|XP_001974360.1| GG21126 [Drosophila erecta]
gi|190657547|gb|EDV54760.1| GG21126 [Drosophila erecta]
Length = 417
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 76/257 (29%), Positives = 129/257 (50%), Gaps = 6/257 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F P GYP SV+E Y Y + Q F S L T ++L G A + +WI
Sbjct: 53 FLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATATWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
LK+G HVG+++ + G+++D + K+WR+ AD L DL G+E+ + P P ++
Sbjct: 113 LKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQILCC 172
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K I VA ATR + A GNL+D+ +K + T N+V G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
I ++S +H+Y+ ++ +RA + + N R + + +F ++ R+ SP + E
Sbjct: 233 QAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNAME 291
Query: 306 DLLFPGRLIEDAGNVKM 322
+ G+ + + N+K+
Sbjct: 292 RVTV-GQTVSVSLNIKL 307
>gi|195387984|ref|XP_002052672.1| GJ17681 [Drosophila virilis]
gi|194149129|gb|EDW64827.1| GJ17681 [Drosophila virilis]
Length = 417
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 73/259 (28%), Positives = 132/259 (50%), Gaps = 6/259 (2%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVS 125
N F P GYP SV++ Y Y + +Q F S L T ++L G A + V+
Sbjct: 51 NIFLPKGYPDSVSDDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENVNAYSATVT 110
Query: 126 WILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMA 183
WILK+G H+G+++ + G+++D + K+WR+ AD L DL G+E+ + P P L ++
Sbjct: 111 WILKEGSGHLGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQIL 170
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
K I VA ATR + A GNL+D+ +K + T N++ G+ L T+
Sbjct: 171 CGSTMLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLIASFVGLYLL-TM 229
Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRY 303
S V ++ +H+Y+ ++ +RA + T N R + + +F ++ + +P +
Sbjct: 230 IKSQAMLYTVFYIVVTLHLYANLKAVRAVCLRTFNESRYLIALEEFFRSSHMLTPQQVNK 289
Query: 304 REDLLFPGRLIEDAGNVKM 322
E + G+++ + N+++
Sbjct: 290 LERVTV-GQMVSVSLNIRL 307
>gi|195996945|ref|XP_002108341.1| hypothetical protein TRIADDRAFT_19061 [Trichoplax adhaerens]
gi|190589117|gb|EDV29139.1| hypothetical protein TRIADDRAFT_19061, partial [Trichoplax
adhaerens]
Length = 272
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 75/248 (30%), Positives = 132/248 (53%), Gaps = 9/248 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P GYP SV+ YL Y + LQ F S+ L+TQ++L G+ A A ++W+
Sbjct: 3 FLPQGYPESVSGDYLTYQLWDTLQAFCSSITGTLATQAILKGVGVGDEKATVAAAAMTWM 62
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKR------WRILADALYDLGTGLEVLSPLCPQLFL 180
LKDG +G+++ + L G +D + K+ +R+ AD L D +E+++P P LF
Sbjct: 63 LKDGTGMIGRILFAWLRGTSLDCDAKKMLIFISYRLFADFLNDGALFIEMIAPNFPHLFT 122
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ + + AK I VA ATR + A+ N++D+ AK + TL N+ L G+ L
Sbjct: 123 LLLCVASVAKSIVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLSALIVGLFLL 182
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
V + + + + +++H+Y+ + + + + TLN +R ++ +F + G I SPA+
Sbjct: 183 PAVTGNARIIWTLFFMFTMLHLYANYKAVSSVVMETLNRERLHILAYNFFRYGSILSPAE 242
Query: 301 LRYREDLL 308
+ RE ++
Sbjct: 243 VCQRESVI 250
>gi|410984724|ref|XP_003998676.1| PREDICTED: UPF0420 protein C16orf58 homolog [Felis catus]
Length = 468
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 123/248 (49%), Gaps = 3/248 (1%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
+ S + F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A
Sbjct: 70 LSSLIAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129
Query: 121 ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
A +W++KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 130 AATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFF 189
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 190 TITICTSNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
V + L+ +H+Y+ +RA + TLN R +++ F++ G + P
Sbjct: 250 LPLVSDCPGFSLGCFFFLTALHIYANYRAVRALVIETLNEGRLRLVLKHFLQRGEVLGPT 309
Query: 300 DLRYREDL 307
E L
Sbjct: 310 SANQMEPL 317
>gi|449018513|dbj|BAM81915.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 600
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 80/246 (32%), Positives = 122/246 (49%), Gaps = 23/246 (9%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT------- 122
F P+ YP SV Y Y + L + +A SVL T +LL +AG+ P++A
Sbjct: 158 FLPTNYPHSVTASYAPYAFWHFLHNTLTACNSVLGTTALLVSAGVVADPSRAASTALTRD 217
Query: 123 --------------VVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTG 167
V SWILKDG+ H+ +L S R D + KR R+LAD L+ LGTG
Sbjct: 218 PALEHLPTAASASAVTSWILKDGVGHLVRLAFGSRYAHRFDGDLKRLRVLADVLWHLGTG 277
Query: 168 LEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTL 227
LE++S P F+ +A KG+A V +TR IY + A N+ D+ AKG+A S
Sbjct: 278 LELVSRSFPDSFIGLASGAYALKGMAHVVFNSTRSTIYRTVATRSNIGDVTAKGDAQSIA 337
Query: 228 FNVVGLGAGIQLASTVC-SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
++G+ G+ L +++ + ++ +YS I+ MR + LN QR ++V
Sbjct: 338 AELLGMSIGLSLYRGFAENNLFWTWAIFGIVVAGQMYSRIQSMRVLVLPLLNLQRAVLLV 397
Query: 287 ADFVKT 292
+V +
Sbjct: 398 DAYVAS 403
>gi|189241392|ref|XP_968556.2| PREDICTED: similar to UPF0420 protein C16orf58 homolog [Tribolium
castaneum]
Length = 472
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 81/267 (30%), Positives = 127/267 (47%), Gaps = 6/267 (2%)
Query: 46 HLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLST 105
+S + V DS V V+ F P GYP SV+E Y Y + +Q F S + +T
Sbjct: 34 EISQESVADS--VLQSVLGFFREVLLPHGYPDSVSEDYFEYQLWDTVQAFCSTITGIFTT 91
Query: 106 QSLLFAAGLRPTPAQA--TVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALY 162
Q++L G+ A A ++WI+KDG VG++I + G +DS K+WR+ AD L
Sbjct: 92 QAILKGVGVGSAEASALSATITWIMKDGTGMVGRIIFAWWKGKSLDSNCKKWRLFADLLN 151
Query: 163 DLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGE 222
D LE+ P L E+ L K + VA ATR I + A +GN++++ AK
Sbjct: 152 DGAMILELSIPHVTHLSTEILCLSTLMKAMVGVAGGATRASITNHQAIKGNMAEIAAKDG 211
Query: 223 AISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRT 282
+ TL N++ GI L + + + IV LL +H+YS + + A LN R
Sbjct: 212 SQETLVNLIASFTGIFLINYFTDATT-QWIVTFLLIFLHLYSNYQAVTALIFKNLNNLRM 270
Query: 283 AMIVADFVKTGRISSPADLRYREDLLF 309
+I+ ++ + P + E +
Sbjct: 271 TLILKSYLAVNAVLGPECINKTESVFL 297
>gi|125986275|ref|XP_001356901.1| GA10253 [Drosophila pseudoobscura pseudoobscura]
gi|195148693|ref|XP_002015302.1| GL19630 [Drosophila persimilis]
gi|54645227|gb|EAL33967.1| GA10253 [Drosophila pseudoobscura pseudoobscura]
gi|194107255|gb|EDW29298.1| GL19630 [Drosophila persimilis]
Length = 417
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 74/257 (28%), Positives = 130/257 (50%), Gaps = 6/257 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F P GYP SV+E Y Y + +Q F S L T ++L G A + V+WI
Sbjct: 53 FLPKGYPDSVSEDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENINAYSATVTWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
+K+G HVG+++ + G ++D + K+WR+ AD L DL G+E+ + P P L ++
Sbjct: 113 MKEGCGHVGRILFAWWKGCQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQILCG 172
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K I VA ATR + A GNL+D+ +K + T N+V G+ L T+
Sbjct: 173 STMLKAIVGVAGGATRSALTQHHAVRGNLADVASKDSSQETCVNLVASFVGLYLL-TLIK 231
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
S V ++ +H+Y+ ++ +R+ + T N R + + ++ ++ R+ +P + E
Sbjct: 232 SQAVLYTVFYVVVALHLYANLKAVRSVCLRTFNESRYLIALEEYFRSSRMLTPQQVNAME 291
Query: 306 DLLFPGRLIEDAGNVKM 322
+ G+ + + NV++
Sbjct: 292 RVTL-GQTVSVSLNVRL 307
>gi|74215416|dbj|BAE41911.1| unnamed protein product [Mus musculus]
Length = 337
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 75/243 (30%), Positives = 125/243 (51%), Gaps = 7/243 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--C 244
N AK I VA ATR + A+ N++D+ AK + T+ N+ GL + + V C
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
S+ V LL+ +H+Y+ +RA + TLN R +++ F++ G + PA
Sbjct: 255 PSLSLGCFV--LLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQM 312
Query: 305 EDL 307
E L
Sbjct: 313 EPL 315
>gi|195437813|ref|XP_002066834.1| GK24688 [Drosophila willistoni]
gi|194162919|gb|EDW77820.1| GK24688 [Drosophila willistoni]
Length = 417
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 73/257 (28%), Positives = 133/257 (51%), Gaps = 6/257 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
F P GYP SV++ Y Y + +Q F S L T ++L + G+ A + V+WI
Sbjct: 53 FLPKGYPDSVSDDYAAYQIYDTIQAFCSTICGTLCTHAILKSIGVGSENINAYSATVTWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
LK+G HVG+++ + G+++D + K+WR+ AD L DL G+E+ + P P L ++
Sbjct: 113 LKEGSGHVGRILFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQILCG 172
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K + VA ATR + A GNL+D+ +K + T N++ G+ + T+
Sbjct: 173 STMLKAVVSVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLLASCVGLYVL-TLIK 231
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
S V L+ +H+Y+ ++ +RA + T N R + + +F ++ R+ +P + E
Sbjct: 232 SPAVLYTVFYLVVTLHLYANLKAVRAVCLRTFNESRYLIALEEFFRSSRMLTPQQVNGME 291
Query: 306 DLLFPGRLIEDAGNVKM 322
+ G+ + + N+++
Sbjct: 292 RVTL-GQTVSVSLNIRL 307
>gi|270013094|gb|EFA09542.1| hypothetical protein TcasGA2_TC011650 [Tribolium castaneum]
Length = 450
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 81/267 (30%), Positives = 127/267 (47%), Gaps = 6/267 (2%)
Query: 46 HLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLST 105
+S + V DS V V+ F P GYP SV+E Y Y + +Q F S + +T
Sbjct: 34 EISQESVADS--VLQSVLGFFREVLLPHGYPDSVSEDYFEYQLWDTVQAFCSTITGIFTT 91
Query: 106 QSLLFAAGLRPTPAQA--TVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALY 162
Q++L G+ A A ++WI+KDG VG++I + G +DS K+WR+ AD L
Sbjct: 92 QAILKGVGVGSAEASALSATITWIMKDGTGMVGRIIFAWWKGKSLDSNCKKWRLFADLLN 151
Query: 163 DLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGE 222
D LE+ P L E+ L K + VA ATR I + A +GN++++ AK
Sbjct: 152 DGAMILELSIPHVTHLSTEILCLSTLMKAMVGVAGGATRASITNHQAIKGNMAEIAAKDG 211
Query: 223 AISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRT 282
+ TL N++ GI L + + + IV LL +H+YS + + A LN R
Sbjct: 212 SQETLVNLIASFTGIFLINYFTDATT-QWIVTFLLIFLHLYSNYQAVTALIFKNLNNLRM 270
Query: 283 AMIVADFVKTGRISSPADLRYREDLLF 309
+I+ ++ + P + E +
Sbjct: 271 TLILKSYLAVNAVLGPECINKTESVFL 297
>gi|58392188|ref|XP_319167.2| AGAP010020-PA [Anopheles gambiae str. PEST]
gi|55236239|gb|EAA13864.2| AGAP010020-PA [Anopheles gambiae str. PEST]
Length = 417
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 71/239 (29%), Positives = 124/239 (51%), Gaps = 9/239 (3%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
F + F P GYP SV+ YL Y ++ LQ F S L+T ++L G+ A +
Sbjct: 49 FQHLFLPVGYPDSVSSDYLSYQKWDTLQAFCSTISGTLTTHAILKGVGVGSDAANPLSAT 108
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLS-PLCPQLFLE 181
++W+LKDG H+G+++ + G+ +D + K+WRI AD L D+ +++ P P+
Sbjct: 109 ITWVLKDGTGHLGRILFAWWKGSELDIDSKKWRIRADILNDVAMAIDLFVLPYYPKASTY 168
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+ K I VA ATR + A GNL+D+ +K A T N++ G+ L +
Sbjct: 169 ILCATTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLFLLT 228
Query: 242 TVCSSMQGKMIVG--PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
+ K++ G +++IH+Y+ I+ ++A + T N R + + ++ K+G + SP
Sbjct: 229 YL---QNQKVLYGLFAFVTLIHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGMMLSP 284
>gi|207080128|ref|NP_001128778.1| DKFZP468D197 protein [Pongo abelii]
gi|55727564|emb|CAH90537.1| hypothetical protein [Pongo abelii]
Length = 466
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 79/278 (28%), Positives = 138/278 (49%), Gaps = 11/278 (3%)
Query: 35 VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
++R + +P+G S +V P + F P G+P SV+ YL Y + ++Q
Sbjct: 44 LSRAFTVKPEGRDSGEVGAPGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
F S+ L+TQ++L G+ A A +W++KD +G+++ + G+++D
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161
Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
K+WR+ AD L D+ LE+++P+ P F N AK I VA ATR + A+
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221
Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--CSSMQGKMIVGPLLSVIHVYSVIEEM 269
N++D+ AK + TL N+VGL + + V C + G L+ +H+Y+ +
Sbjct: 222 NNMADVSAKDSSQETLVNLVGLLVSLLMLPLVSGCPGL-GCFF---FLTALHIYANYRAV 277
Query: 270 RAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
RA + TLN R +++ +++ G + +P E L
Sbjct: 278 RALVMETLNEGRLRLVLKHYLQRGEVLNPTAANRMEPL 315
>gi|195115010|ref|XP_002002060.1| GI17175 [Drosophila mojavensis]
gi|193912635|gb|EDW11502.1| GI17175 [Drosophila mojavensis]
Length = 417
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 120/235 (51%), Gaps = 5/235 (2%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVS 125
N F P GYP SV++ Y Y + +Q F S L T ++L G A + V+
Sbjct: 51 NIFLPKGYPDSVSDDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENVNAYSATVT 110
Query: 126 WILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMA 183
WILK+G H+G++I + G+++D + K+WR+ AD L DL G+E+ + P P ++
Sbjct: 111 WILKEGSGHMGRIIFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQIL 170
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
K I VA ATR + A GNL+D+ +K + T N++ G+ L T+
Sbjct: 171 CGSTLLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLIASFMGLYLL-TM 229
Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
S V ++ +H+Y+ ++ +RA + T N R + + +F ++ ++ +P
Sbjct: 230 IKSQAMLYTVFYIVVTLHLYANLKAVRAVCLRTFNESRYLITLEEFFRSSQMLTP 284
>gi|195035521|ref|XP_001989226.1| GH11606 [Drosophila grimshawi]
gi|193905226|gb|EDW04093.1| GH11606 [Drosophila grimshawi]
Length = 417
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 73/259 (28%), Positives = 132/259 (50%), Gaps = 6/259 (2%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVS 125
N F P GYP SV++ Y+ Y + LQ F S L T ++L G A + V+
Sbjct: 51 NIFLPKGYPDSVSDDYVVYQIWDTLQAFCSTICGTLCTHAILKGIGVGSENINAYSATVT 110
Query: 126 WILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMA 183
WILK+G HVG+++ + G+++D + K+WR+ AD L DL G+E+ + P P ++
Sbjct: 111 WILKEGSGHVGRILFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQIL 170
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
K I VA ATR + A GNL+D+ +K + T N+V G+ L + +
Sbjct: 171 CGSTMLKAIVGVAGGATRSALTQHHAVRGNLADVASKDSSQETCVNLVASFVGLYLLTMI 230
Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRY 303
S I +++ +H+Y+ ++ +R+ + T N R + + +F ++ + +P +
Sbjct: 231 KSQAMLYTIFYIVVT-LHLYANLKAVRSICLRTFNESRYLIALEEFFRSSHMLTPQQVNK 289
Query: 304 REDLLFPGRLIEDAGNVKM 322
E + G+ + + N+++
Sbjct: 290 MERVTV-GQTVSVSLNIRL 307
>gi|350411769|ref|XP_003489447.1| PREDICTED: UPF0420 protein C16orf58 homolog [Bombus impatiens]
Length = 464
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 74/243 (30%), Positives = 122/243 (50%), Gaps = 4/243 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
F P GYP SV+ Y Y + +Q F S + L+T S++ G+ A ++WI
Sbjct: 53 FLPQGYPDSVHPDYTPYQIWDTVQAFASTIMGTLTTHSIMQGVGVGEAAATPLAAAITWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
LKDG +G+++ + G +D + K+WR+ AD L DL GLE+ P L + +
Sbjct: 113 LKDGTGMIGRIVFAWWNGTDLDGQCKKWRLFADILNDLAMGLELFLPYFSSYSLGILCIS 172
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
K I VA ATR + A + NL+D+ AK + T N++ GI + S + +
Sbjct: 173 TAMKSIVGVAGGATRAALTQHQALQNNLADVSAKDGSQETCVNLIASFVGILILS-IFHN 231
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ M + L +IH+Y+ ++A +N+LN R A+IV ++ I P ++ +E
Sbjct: 232 GRYIMELYLFLVMIHLYANYMAVKALCLNSLNEDRLALIVKSYISNEVIPKPEEVNKKES 291
Query: 307 LLF 309
+L
Sbjct: 292 VLL 294
>gi|242036507|ref|XP_002465648.1| hypothetical protein SORBIDRAFT_01g042950 [Sorghum bicolor]
gi|241919502|gb|EER92646.1| hypothetical protein SORBIDRAFT_01g042950 [Sorghum bicolor]
Length = 440
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 72/253 (28%), Positives = 131/253 (51%), Gaps = 3/253 (1%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ ++ L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 52 WGQIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQS 111
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + DLG +++LSPL
Sbjct: 112 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 171
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
P + + LG+ ++ VA+ ATR + FA N +D+ AK + TL ++G+G
Sbjct: 172 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALAKNAADISAKEGSQETLATMIGMGL 231
Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
G+ LA L+V H+Y+ + +++ + TLN +R ++++ F + G +
Sbjct: 232 GMLLAHITRGHALSVWTSFLSLTVFHMYANYKAVQSLSLTTLNYERASILLQYFKECGDV 291
Query: 296 SSPADLRYREDLL 308
+P + +E +L
Sbjct: 292 LAPQKVSQQEHIL 304
>gi|198431163|ref|XP_002119233.1| PREDICTED: similar to UPF0420 protein C16orf58 homolog [Ciona
intestinalis]
Length = 414
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 139/281 (49%), Gaps = 18/281 (6%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
F + F P GYP SV+E YL Y + +Q F S+ L+T ++L +G+ A +
Sbjct: 49 FKDIFLPKGYPESVSEDYLSYQIWDTVQAFCSSITGTLATHAVLKGSGVGDETANVASAT 108
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+LKDG +G++ + G+ +D K+WR+ AD + D +++++P+ P+ + +
Sbjct: 109 ITWLLKDGTGMLGRIAFAYFKGSSLDCNAKQWRLFADVMNDCAIFIQLVAPVLPKAYFTL 168
Query: 183 A-GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
L A + VA TR + AK N++D+ AK + TL N+ L G+ +
Sbjct: 169 VMCLAGLAFSLVGVAGGCTRAALTMHQAKCNNMADVSAKDGSQETLVNLAALLTGLVIMP 228
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
V +++ ++ + + +HV++ + + + + N R + ++ F ++ +P +
Sbjct: 229 IVATNIPLMWLLFLIFTFLHVFANYKAVASVVMELFNRNRLSKVIKVFCDMNKVLTPKEA 288
Query: 302 RYREDLLFP--------GRLIEDAG------NVKMDRHGIR 328
+E + FP G +ED G + K+D G+R
Sbjct: 289 NEKEAIFFPPSYSWLNLGCKLEDLGLTSSEFHQKVDDGGVR 329
>gi|257900512|ref|NP_073581.2| UPF0420 protein C16orf58 [Homo sapiens]
gi|143955305|sp|Q96GQ5.2|CP058_HUMAN RecName: Full=UPF0420 protein C16orf58
gi|62897385|dbj|BAD96633.1| hypothetical protein FLJ13868 variant [Homo sapiens]
gi|119572504|gb|EAW52119.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
gi|119572505|gb|EAW52120.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
gi|119572506|gb|EAW52121.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
Length = 468
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 75/276 (27%), Positives = 134/276 (48%), Gaps = 5/276 (1%)
Query: 35 VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
++R + +P+G + +V P + F P G+P SV+ YL Y + ++Q
Sbjct: 44 LSRAFTVKPEGRDAGEVGASGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
F S+ L+TQ++L G+ A A +W++KD +G+++ + G+++D
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161
Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
K+WR+ AD L D+ LE+++P+ P F N AK I VA ATR + A+
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221
Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
N++D+ AK + TL N+ GL + + V + L+ +H+Y+ +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281
Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+ TLN R +++ +++ G + P E L
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 317
>gi|397615116|gb|EJK63231.1| hypothetical protein THAOC_16128 [Thalassiosira oceanica]
Length = 621
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 84/280 (30%), Positives = 134/280 (47%), Gaps = 58/280 (20%)
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
+ + + S F P G V Y R+ ++R LQ F +A + V+ TQSLL GLR
Sbjct: 141 KSIIRKCRQSLHKNFVPEG----VTPSYYRFMRWRILQRFMNANVHVIGTQSLLM--GLR 194
Query: 116 PTPAQATVVS----------------WILKDGMQHVGKLI----CSNLGARMDSEPKRWR 155
++TVVS W+LKD + GKL+ S +G + DS+ KRWR
Sbjct: 195 GM-QRSTVVSAAAKGGAALGAVAATNWVLKDTL---GKLVRMAWASKMGRKFDSDAKRWR 250
Query: 156 ILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG--- 212
A +Y LG GLEV + L P+ FL +A L N K ++++ + ATR +Y+SF +
Sbjct: 251 YRACFIYALGNGLEVSTYLHPRFFLVLAMLANSCKQVSMLTSSATRNALYNSFRRMDDDV 310
Query: 213 -------------------------NLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
N+ D+ AKGEA + +++G+ +GI L+ + S+
Sbjct: 311 DTVGTGSNSTLKSSINGGRRRGVVENIGDITAKGEAQIAVVDLLGIASGICLSRAIGVSV 370
Query: 248 QGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVA 287
Q + V +L + +Y + E+R+ TLN +R ++
Sbjct: 371 QNVLTVWSVLQLGEIYCMYHEIRSIVYQTLNFERMYTVLG 410
>gi|10436021|dbj|BAB14730.1| unnamed protein product [Homo sapiens]
Length = 468
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 75/276 (27%), Positives = 134/276 (48%), Gaps = 5/276 (1%)
Query: 35 VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
++R + +P+G + +V P + F P G+P SV+ YL Y + ++Q
Sbjct: 44 LSRAFTVKPEGRDAGEVGASGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
F S+ L+TQ++L G+ A A +W++KD +G+++ + G+++D
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161
Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
K+WR+ AD L D+ LE+++P+ P F N AK I VA ATR + A+
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221
Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
N++D+ AK + TL N+ GL + + V + L+ +H+Y+ +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281
Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+ TLN R +++ +++ G + P E L
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 317
>gi|312374874|gb|EFR22346.1| hypothetical protein AND_15402 [Anopheles darlingi]
Length = 417
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 72/240 (30%), Positives = 123/240 (51%), Gaps = 11/240 (4%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
F + F P GYP SV+ YL Y ++ +Q F S L+T ++L G+ A +
Sbjct: 49 FQHLFLPVGYPDSVSSDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDSVTALSAT 108
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLS-PLCPQLFLE 181
V+W+LKDGM H+G+++ + G +D + K+WRI AD L D+ +++ P P+
Sbjct: 109 VTWVLKDGMGHLGRILFAWWKGTELDIDSKKWRIRADVLNDIAMAIDLFVLPYYPKASTY 168
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+ K I VA ATR + A GNL+D+ +K A T N++ G+ L
Sbjct: 169 ILCGTTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLLLL- 227
Query: 242 TVCSSMQGKMIVGPLLSVI---HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
+ +Q + I+ L + + H+Y+ I+ ++A + T N R + + ++ K+G + SP
Sbjct: 228 ---TYLQNQRILYGLFAFVTFLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGMMLSP 284
>gi|225452488|ref|XP_002278851.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
Length = 514
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 77/267 (28%), Positives = 136/267 (50%), Gaps = 14/267 (5%)
Query: 53 DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
DD + S + FF P V+ Y+ Y +++ L S+AL VL+TQ++ A
Sbjct: 99 DDGLNNLFALCSSAVRDFF---IPQKVSGNYMDYVKWKFLHRVFSSALQVLATQAMFRAI 155
Query: 113 GL---RPTPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGL 168
G+ R P+ A ++W+LKDG+ + + I ++L + D+ KR R L+ L G+
Sbjct: 156 GIGYSRSLPS-AAALNWVLKDGLGRLCRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGV 214
Query: 169 EVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
E+L+P P FL +A + N AK + V AT ++ SFA NL ++ AK + F
Sbjct: 215 ELLTPAFPNYFLLLATIANIAKQVTVACYLATGSAVHRSFAIADNLGEVSAKAQIQMVCF 274
Query: 229 NVVGLGAGIQLASTVCSSMQGKM-----IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
+ +GL L + +C + Q + +V P+ S I ++ + + ++ + TL R
Sbjct: 275 DNLGLMLAAVL-NILCKNNQRLLAGLPFVVYPIFSAIDLFGIYQGLKHVHLQTLTKDRLE 333
Query: 284 MIVADFVKTGRISSPADLRYREDLLFP 310
+I+ +++ G + SPA++ E + FP
Sbjct: 334 IILNMWIQLGYVPSPAEVSKEEGISFP 360
>gi|301780856|ref|XP_002925848.1| PREDICTED: UPF0420 protein C16orf58 homolog [Ailuropoda
melanoleuca]
gi|281353232|gb|EFB28816.1| hypothetical protein PANDA_015415 [Ailuropoda melanoleuca]
Length = 468
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 70/241 (29%), Positives = 121/241 (50%), Gaps = 3/241 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGLGVGNAKASVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRILFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFFTMTVCTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V +
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSAC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +RA + TLN R +++ F++ G + P E
Sbjct: 257 PSFSLGCFFFLTALHIYANYRAVRALVLETLNEGRLRLVLKHFLQRGEVLGPTSANQMEP 316
Query: 307 L 307
L
Sbjct: 317 L 317
>gi|296087683|emb|CBI34939.3| unnamed protein product [Vitis vinifera]
Length = 559
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 77/267 (28%), Positives = 136/267 (50%), Gaps = 14/267 (5%)
Query: 53 DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
DD + S + FF P V+ Y+ Y +++ L S+AL VL+TQ++ A
Sbjct: 144 DDGLNNLFALCSSAVRDFF---IPQKVSGNYMDYVKWKFLHRVFSSALQVLATQAMFRAI 200
Query: 113 GL---RPTPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGL 168
G+ R P+ A ++W+LKDG+ + + I ++L + D+ KR R L+ L G+
Sbjct: 201 GIGYSRSLPS-AAALNWVLKDGLGRLCRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGV 259
Query: 169 EVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
E+L+P P FL +A + N AK + V AT ++ SFA NL ++ AK + F
Sbjct: 260 ELLTPAFPNYFLLLATIANIAKQVTVACYLATGSAVHRSFAIADNLGEVSAKAQIQMVCF 319
Query: 229 NVVGLGAGIQLASTVCSSMQGKM-----IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
+ +GL L + +C + Q + +V P+ S I ++ + + ++ + TL R
Sbjct: 320 DNLGLMLAAVL-NILCKNNQRLLAGLPFVVYPIFSAIDLFGIYQGLKHVHLQTLTKDRLE 378
Query: 284 MIVADFVKTGRISSPADLRYREDLLFP 310
+I+ +++ G + SPA++ E + FP
Sbjct: 379 IILNMWIQLGYVPSPAEVSKEEGISFP 405
>gi|426381987|ref|XP_004057610.1| PREDICTED: UPF0420 protein C16orf58 homolog [Gorilla gorilla
gorilla]
Length = 468
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 75/276 (27%), Positives = 134/276 (48%), Gaps = 5/276 (1%)
Query: 35 VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
++R + +P+G + +V P + F P G+P SV+ YL Y + ++Q
Sbjct: 44 LSRAFTVKPEGRDAGEVGAPGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
F S+ L+TQ++L G+ A A +W++KD +G+++ + G+++D
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161
Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
K+WR+ AD L D+ LE+++P+ P F N AK I VA ATR + A+
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221
Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
N++D+ AK + TL N+ GL + + V + L+ +H+Y+ +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLAGLLVSLLILPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281
Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+ TLN R +++ +++ G + P E L
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 317
>gi|114662190|ref|XP_001159530.1| PREDICTED: UPF0420 protein C16orf58 homolog [Pan troglodytes]
gi|397472000|ref|XP_003807550.1| PREDICTED: UPF0420 protein C16orf58 homolog [Pan paniscus]
gi|410212432|gb|JAA03435.1| chromosome 16 open reading frame 58 [Pan troglodytes]
gi|410264128|gb|JAA20030.1| chromosome 16 open reading frame 58 [Pan troglodytes]
gi|410292910|gb|JAA25055.1| chromosome 16 open reading frame 58 [Pan troglodytes]
gi|410340381|gb|JAA39137.1| chromosome 16 open reading frame 58 [Pan troglodytes]
Length = 468
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 75/276 (27%), Positives = 134/276 (48%), Gaps = 5/276 (1%)
Query: 35 VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
++R + +P+G + +V P + F P G+P SV+ YL Y + ++Q
Sbjct: 44 LSRAFTVKPEGRDAGEVGAPGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
F S+ L+TQ++L G+ A A +W++KD +G+++ + G+++D
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161
Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
K+WR+ AD L D+ LE+++P+ P F N AK I VA ATR + A+
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221
Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
N++D+ AK + TL N+ GL + + V + L+ +H+Y+ +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281
Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+ TLN R +++ +++ G + P E L
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 317
>gi|321455034|gb|EFX66180.1| hypothetical protein DAPPUDRAFT_302922 [Daphnia pulex]
Length = 315
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 76/265 (28%), Positives = 133/265 (50%), Gaps = 16/265 (6%)
Query: 54 DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
DS + + + F P GYP SV++ YL Y + +Q F S+ LST+++L G
Sbjct: 40 DSNGKFRGITHLLSSVFLPQGYPSSVSDDYLEYQIWDTIQAFASSLSGSLSTRAILEGVG 99
Query: 114 L---RPTPAQATVVSWILKDGMQHVGKLI---CSNLGARMDSEPKRWRILADALYDLGTG 167
+ TP AT++ W++KDG VG+++ C+ G ++D++ K+WR AD L D
Sbjct: 100 VGSSTSTPLAATLM-WLIKDGTGMVGRILFAWCN--GTKLDADSKKWRFFADLLNDCALS 156
Query: 168 LEVLSPLCPQ-------LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK 220
LE+ +P + + AK I VA ATR + A++GNL+D+ AK
Sbjct: 157 LELCAPYAVAAVGGSNGTMTSILCVAGVAKSIVGVAGGATRAALTQHQARQGNLADVSAK 216
Query: 221 GEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQ 280
+ TL N+ L + L + + ++ L +++HV++ ++ ++A + TLN
Sbjct: 217 DGSQETLVNLAALITSLWLLPILDGATGLTWLMFILFTLLHVFANLKAVKAVTMETLNRA 276
Query: 281 RTAMIVADFVKTGRISSPADLRYRE 305
R +I+ F T ++S ++ E
Sbjct: 277 RYMIILKQFASTKYVASVKEVNRLE 301
>gi|449489204|ref|XP_004158245.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 435
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 70/247 (28%), Positives = 129/247 (52%), Gaps = 4/247 (1%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
F + F P G+P SV Y+ + + +LQ ++ ++LSTQ+LL A G+ A
Sbjct: 51 FFDAFVPEGFPSSVTPDYVPFQIWDSLQGLSTYIRTMLSTQALLSAIGVGEQSATVIGAT 110
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLE-VLSPLCPQLFLE 181
W L+D +G ++ + G+ +DS K WR++AD + DLG ++ +SPL P F+
Sbjct: 111 FQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMYVDGPVSPLFPSAFML 170
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+ LG+ ++ VA+ ATR + FA + N +D+ AK + T+ ++G+ G+ LA
Sbjct: 171 IVCLGSLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMIGMAIGMLLAR 230
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
+ L++ H+Y+ + + + +LNPQR ++ + +++T + SP +
Sbjct: 231 ITMGCPLAIWLSFLSLTMFHMYANYKAVCCLSLTSLNPQRCSIALQHYIQTAEVLSPQQV 290
Query: 302 RYREDLL 308
E +L
Sbjct: 291 SMMEHVL 297
>gi|332262940|ref|XP_003280516.1| PREDICTED: UPF0420 protein C16orf58 homolog [Nomascus leucogenys]
Length = 468
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 70/241 (29%), Positives = 121/241 (50%), Gaps = 3/241 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +RA + TLN R +++ +++ G + P E
Sbjct: 257 PGFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEP 316
Query: 307 L 307
L
Sbjct: 317 L 317
>gi|307106356|gb|EFN54602.1| hypothetical protein CHLNCDRAFT_135084 [Chlorella variabilis]
Length = 486
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 76/250 (30%), Positives = 124/250 (49%), Gaps = 29/250 (11%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS 125
F F P G+P SV+ YL Y + ++Q +S +LS+Q++L G
Sbjct: 65 FAACFLPEGFPHSVSSDYLPYQLWDSIQGLSSYLRGMLSSQAMLAGVGF----------- 113
Query: 126 WILKD--GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
LKD GM G L S G+ +D K+WR+LAD + D+G LE+ SPL P FL +A
Sbjct: 114 -FLKDFTGMLG-GILFASTQGSGLDCYAKQWRLLADVMNDIGLALELASPLLPGAFLLLA 171
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
LG+ A+ + VA ATR+ + FA + N +D+ AK + T +VG+ G+
Sbjct: 172 CLGSLARAVTGVAGSATRMALTQHFALQRNAADIAAKEGSQETAVTLVGMVLGMAF---- 227
Query: 244 CSSMQGKMIVGPL-------LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
++ + PL L+ +HV++ + MR + +LN R +++ +++TG
Sbjct: 228 ---IRAAHGLEPLIWSAFWALTALHVWANVRAMRCLRIASLNQARLGLLLRHYLRTGEAL 284
Query: 297 SPADLRYRED 306
+P + E
Sbjct: 285 TPGQMAAEES 294
>gi|356524511|ref|XP_003530872.1| PREDICTED: uncharacterized protein LOC100783222 [Glycine max]
Length = 513
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 71/260 (27%), Positives = 133/260 (51%), Gaps = 10/260 (3%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFA--AGLRP 116
+ R S + FF P V Y+ Y +++ L S+AL VL+TQ++ A G
Sbjct: 104 WLRAFGSVVWDFF---IPRGVTGNYVEYVKWKLLHRVFSSALQVLATQAMFTAMGVGFSN 160
Query: 117 TPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
+ A ++W+LKDG+ + + I ++L + D+ KR R L+ GLE+L+P
Sbjct: 161 SLPSAAALNWVLKDGLGRLSRCIYTASLASAFDTNLKRVRFTTSVLFVASIGLELLTPAF 220
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG--L 233
P+ FL +A + N +K I++ ATR ++ SFA NL ++ AK + + F+++G L
Sbjct: 221 PRCFLLLATIANISKQISLACYLATRSAVHQSFAIGDNLGEISAKAQIQTVCFDILGLML 280
Query: 234 GAGIQLASTVCSSMQGKM--IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
A + L Q + + P + + ++ + + ++ + TL R +I++ +++
Sbjct: 281 AALVNLWIENHRRQQAGLHYFIYPFFAAMDLFGIYQGLKTVHLQTLTKDRLEIILSTWIE 340
Query: 292 TGRISSPADLRYREDLLFPG 311
G + SPA++ +E + F G
Sbjct: 341 CGYVPSPAEVSEKEGINFLG 360
>gi|297283910|ref|XP_002802508.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2 [Macaca
mulatta]
Length = 429
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 69/241 (28%), Positives = 121/241 (50%), Gaps = 3/241 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 38 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 97
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 98 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 157
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 158 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 217
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +R+ + TLN R +++ +++ G + P E
Sbjct: 218 PGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEP 277
Query: 307 L 307
L
Sbjct: 278 L 278
>gi|328869935|gb|EGG18310.1| DUF647 family protein [Dictyostelium fasciculatum]
Length = 407
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 137/279 (49%), Gaps = 15/279 (5%)
Query: 44 DGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVL 103
D +VK+ +R + ++ + F P GYP SV Y Y + LQ S+ VL
Sbjct: 24 DEQTNVKITKQTRK-HGSFIEVLRDMFLPKGYPNSVTSDYTGYQTWDTLQALCSSLTGVL 82
Query: 104 STQSLLFAAG---LRPTPAQATVVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILAD 159
+T+++L G L + A A++ WI +D +G++I + G +D K WR AD
Sbjct: 83 ATRAILKGYGVGNLNASVASASI-QWITRDAAGMLGRIIFAWKWGTDLDCNSKTWRFAAD 141
Query: 160 ALYDLGTGLEVLSPLC-PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLF 218
L D G LE++SPL P FL ++ LG +K I VA T+ + FAK NL+D+
Sbjct: 142 LLNDFGMTLEMISPLFGPSYFLPISCLGLISKSICGVAGGCTKASLTQHFAKRDNLADVS 201
Query: 219 AKGEAISTLFNVVGLGAGIQLASTV---CSSMQ---GKMIVGPLLSVIHVYSVIEEMRAA 272
AK + T +VG+ G+ +A+++ S+Q + + +VIH+ +
Sbjct: 202 AKDGSQETAIALVGMLLGMVMAASIPDDDDSIQVYFKTWLFFIIFTVIHLLCNYRAVSLV 261
Query: 273 PVNTLNPQRTAMIVADFVK--TGRISSPADLRYREDLLF 309
+ ++N R +I F + +I +P + +E++LF
Sbjct: 262 QLTSINRYRANLIYQHFNENNNNQIPTPKQISLKENILF 300
>gi|322787029|gb|EFZ13253.1| hypothetical protein SINV_10096 [Solenopsis invicta]
Length = 468
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 75/267 (28%), Positives = 129/267 (48%), Gaps = 11/267 (4%)
Query: 54 DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
+S + R+ F F P GYP SV+ Y Y + +Q F S L+T S++ G
Sbjct: 35 ESLAIRSRITSVFKQIFLPQGYPDSVHPDYTAYQIWDTVQAFASTITGTLTTHSIMKGIG 94
Query: 114 LRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV 170
+ + A A ++WILK G +G ++ + G ++D + K+WR+ AD L D+ G+E+
Sbjct: 95 VGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTKLDGQCKKWRLFADILDDIAKGIEL 154
Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
L P + + K I VA ATR + + A + N++D+ AK + T N+
Sbjct: 155 LVPFFSSYSTVILCISTTMKSIVGVAGGATRTALINHQAIQNNVADVSAKDGSQETCVNL 214
Query: 231 VGLGAGIQLASTVCSSMQGKMIVG--PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
+ GI + S G+ ++ L +H+Y+ ++A ++TLN R A+IV +
Sbjct: 215 IASFFGIFILSFF---HDGRYLLELYLFLVAVHLYANYSAVKALCLDTLNEDRLAIIVKN 271
Query: 289 FVKTGRISSPADLRYREDLLF---PGR 312
++ +I P + E + PGR
Sbjct: 272 YMINEQIPKPGKVNKEESIFLLTNPGR 298
>gi|332030173|gb|EGI69967.1| UPF0420 protein C16orf58-like protein [Acromyrmex echinatior]
Length = 471
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 73/259 (28%), Positives = 126/259 (48%), Gaps = 4/259 (1%)
Query: 54 DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
+S+ V+ R+ F F P GYP SV+ Y Y + +Q F S L+T S++ G
Sbjct: 41 ESQTVHSRITSVFKQIFLPQGYPNSVHPDYTAYQIWDTVQAFASTITGTLTTHSIMKGIG 100
Query: 114 LRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV 170
+ + A A ++WILK G +G ++ + G +D + K+WR+ AD L D+ G+E+
Sbjct: 101 VGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTELDGQCKKWRLFADILDDIAKGIEL 160
Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
L P + + + K I VA ATR + A + N++D+ AK + T N+
Sbjct: 161 LVPYFSSYSVIILCISTTMKSIVGVAGGATRTALIHHQAIQNNVADVSAKDGSQETCVNL 220
Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+ GI + S + Q + + L +H+Y+ ++A ++TLN R A+IV ++
Sbjct: 221 IASFFGIFILS-LFHDGQYVLELYLFLVAVHLYANYSAVKALCLDTLNEDRLAIIVKKYM 279
Query: 291 KTGRISSPADLRYREDLLF 309
+I P + E +
Sbjct: 280 MNEQIPEPWKVNKEESVFL 298
>gi|298708275|emb|CBJ48338.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 284
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 66/203 (32%), Positives = 109/203 (53%), Gaps = 3/203 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS--WI 127
F P+GYP SV + Y+ Y + +Q S VL ++L A+G+ A A + W+
Sbjct: 71 FLPTGYPSSVRKEYMEYQTWDTVQALCSYLRGVLCLTAMLHASGVGDENASAIAAAMVWV 130
Query: 128 LKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L+DG +G L+ S + MDS K WR+ AD D+G L++++P+ F ++ L
Sbjct: 131 LRDGFGMIGSLLFSYAASSHMDSNIKEWRLFADIANDVGLTLDLVAPVFRGNFAVVSSLA 190
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
K + VAA T+ I S FA + N++D+ AK A T NV G+ G+++AS + +
Sbjct: 191 TVFKALCGVAAGCTKTSITSHFALKQNMADISAKENAQETAVNVCGMLLGVKVASYLSDA 250
Query: 247 MQGKMIVGPLLSVIHVYSVIEEM 269
G++++ S++HVY+ M
Sbjct: 251 PTGRVLIFVFFSLLHVYANYRYM 273
>gi|356566391|ref|XP_003551415.1| PREDICTED: uncharacterized protein LOC100783455 [Glycine max]
Length = 525
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 71/260 (27%), Positives = 132/260 (50%), Gaps = 10/260 (3%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFA--AGLRP 116
+ R S + FF P V Y+ Y +++ L S+AL VL+TQ++ A G
Sbjct: 115 WLRACGSVVWDFF---IPRGVTGNYVEYVKWKLLHRVFSSALQVLATQAMFTAMGVGFSC 171
Query: 117 TPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
+ A ++W+LKDG+ + + I ++L + D+ KR R L+ GLE+L+P
Sbjct: 172 SLPSAAALNWVLKDGLGRLSRCIYTASLASAFDTNLKRVRFTTSVLFVASIGLELLTPAF 231
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG--L 233
P+ FL +A + N +K I++ ATR ++ SFA NL ++ AK + + F+++G L
Sbjct: 232 PRCFLLLATIANISKQISLACYLATRSAVHQSFAIGDNLGEISAKAQIQTVCFDILGLML 291
Query: 234 GAGIQLASTVCSSMQG--KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
A + L Q + P + + ++ + + ++ + TL R +I++ +++
Sbjct: 292 AALVNLWIENHRRQQAGFHYFIYPFFAAMDLFGIYQGLKTVHLQTLTKDRLEIILSTWIE 351
Query: 292 TGRISSPADLRYREDLLFPG 311
G + SPA++ +E + F G
Sbjct: 352 CGYVPSPAEVSEKEGINFLG 371
>gi|302564091|ref|NP_001181525.1| UPF0420 protein C16orf58 [Macaca mulatta]
gi|380815644|gb|AFE79696.1| hypothetical protein LOC64755 [Macaca mulatta]
Length = 468
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 69/241 (28%), Positives = 121/241 (50%), Gaps = 3/241 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +R+ + TLN R +++ +++ G + P E
Sbjct: 257 PGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEP 316
Query: 307 L 307
L
Sbjct: 317 L 317
>gi|355756739|gb|EHH60347.1| hypothetical protein EGM_11687 [Macaca fascicularis]
Length = 468
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 69/241 (28%), Positives = 121/241 (50%), Gaps = 3/241 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +R+ + TLN R +++ +++ G + P E
Sbjct: 257 PGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEP 316
Query: 307 L 307
L
Sbjct: 317 L 317
>gi|402908238|ref|XP_003916859.1| PREDICTED: UPF0420 protein C16orf58 homolog [Papio anubis]
Length = 468
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 69/241 (28%), Positives = 121/241 (50%), Gaps = 3/241 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +R+ + TLN R +++ +++ G + P E
Sbjct: 257 PGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEP 316
Query: 307 L 307
L
Sbjct: 317 L 317
>gi|355710161|gb|EHH31625.1| hypothetical protein EGK_12728 [Macaca mulatta]
Length = 468
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 69/241 (28%), Positives = 121/241 (50%), Gaps = 3/241 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +R+ + TLN R +++ +++ G + P E
Sbjct: 257 PGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEP 316
Query: 307 L 307
L
Sbjct: 317 L 317
>gi|219127360|ref|XP_002183905.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404628|gb|EEC44574.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 891
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 128/245 (52%), Gaps = 27/245 (11%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV--VSWI 127
F P GYP+SV EGYL Y + ++Q +S +VL + +L AAG+ A A + ++W
Sbjct: 438 FMPVGYPYSVREGYLSYQIYDSIQGLSSYLRAVLCSAQVLQAAGVGDANATAIMAALTWA 497
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL-FLEMAGL 185
LKDG+ +G LI S + + +D+ K +R+ AD + D+G L++L+P P+ + ++
Sbjct: 498 LKDGLGMIGGLIFSYGVASHLDAYVKEFRLFADVINDVGMLLDMLAPYVPRADLIWISSA 557
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K + +AA AT+ I FA +GN++DL AK TL N++G+ G+ LA +
Sbjct: 558 ATLCKVMCGIAAGATKSSITQHFALQGNMADLNAKEATQETLVNLIGMVCGVTLARYLSQ 617
Query: 246 SMQG-----------------------KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRT 282
G + +V +L+++H++S + + + TLN +RT
Sbjct: 618 LEAGNEEGDTDRIGYRSWIPCISHDSIQWMVFTILTLLHIWSNWKGVSLLRLETLNRERT 677
Query: 283 AMIVA 287
+++
Sbjct: 678 KVVLC 682
>gi|403276814|ref|XP_003930078.1| PREDICTED: UPF0420 protein C16orf58 homolog [Saimiri boliviensis
boliviensis]
Length = 468
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 70/241 (29%), Positives = 120/241 (49%), Gaps = 3/241 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGF 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
L+ +H+Y+ +RA + TLN R +++ +++ G + P E
Sbjct: 257 PGFSFGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTTANRMEP 316
Query: 307 L 307
L
Sbjct: 317 L 317
>gi|194388018|dbj|BAG65393.1| unnamed protein product [Homo sapiens]
Length = 422
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 69/240 (28%), Positives = 120/240 (50%), Gaps = 3/240 (1%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWIL 128
P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W++
Sbjct: 32 LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLV 91
Query: 129 KDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F N
Sbjct: 92 KDSTDMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSN 151
Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 152 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCP 211
Query: 248 QGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+ L+ +H+Y+ +RA + TLN R +++ +++ G + P E L
Sbjct: 212 GFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 271
>gi|403362828|gb|EJY81150.1| DUF647 family protein [Oxytricha trifallax]
Length = 507
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 75/248 (30%), Positives = 121/248 (48%), Gaps = 16/248 (6%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
++ +Y+ + ++FL P+GYP +V+ YL++T + + F A+ LS QSL A G
Sbjct: 77 NKNIYNGLFETFL----PAGYPHTVSSSYLKFTIYSNISAFMYTAMGFLSAQSLFVAIGS 132
Query: 115 RPTPAQ--ATVVSWILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLEVL 171
T A A +W+LKDG+ +G L S G D + K+WR +A ++ +E+L
Sbjct: 133 TMTQANLAAAAYTWVLKDGLGQLGGILFASRYGRNFDVDIKKWRFMAIFALNISMYIEIL 192
Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
+ PQ FL +A L N K I + A A+R I AK N+ D+ K + T ++
Sbjct: 193 TLSFPQYFLALASLANVGKNICFLLASASRASINLQLAKNNNIGDISGKSVSQFTASTLI 252
Query: 232 G--LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNT---LNPQRTAMIV 286
G G GI + S Q + P+ V+ V ++ +A V LN QR ++
Sbjct: 253 GGACGLGISKIIDITSIYQ----LYPVFFVLTVMNIAASYYSAKVIDELYLNNQRAFLLF 308
Query: 287 ADFVKTGR 294
+F K+
Sbjct: 309 NEFFKSNE 316
>gi|351711359|gb|EHB14278.1| hypothetical protein GW7_05117 [Heterocephalus glaber]
Length = 469
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 66/229 (28%), Positives = 121/229 (52%), Gaps = 3/229 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
F P G+P SV+ YL Y Q+ ++Q F S+ S L+T S++ G+ A +W+
Sbjct: 78 FLPQGFPESVSPDYLPYQQWDSVQAFASSLCSALATHSVMLGLGVGNAEASLAAATATWL 137
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
++D +G++I + G+++D K+WR+ AD L D LE+++P+ F + +
Sbjct: 138 VRDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDAAMLLELVAPVLSVSFTMIVSIS 197
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + + +
Sbjct: 198 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLVSLIMLPLMSNC 257
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ + LL+ +H+Y+ +R + TLN R +++ F+++G +
Sbjct: 258 PRLSLGCFFLLTALHIYANYRAVRTLVMETLNESRLRLVLKHFLQSGEV 306
>gi|358397603|gb|EHK46971.1| hypothetical protein TRIATDRAFT_291240 [Trichoderma atroviride IMI
206040]
Length = 468
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 148/294 (50%), Gaps = 8/294 (2%)
Query: 22 QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDD--SRPVYHRVVDSFLNKFFPSGYPFSV 79
++ I + S + + DGH+ K + SR + + ++ F P G+P SV
Sbjct: 12 NDVEIVELDKSGHPQQQWLHAHDGHILPKTSNSLFSRS-FREWRKAIMDSFLPVGFPHSV 70
Query: 80 NEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGK 137
++ YL Y F +LQ F S S+L+ ++LL G+ + AT ++ I+KD M +
Sbjct: 71 SKDYLAYQTFDSLQAFFSTISSLLANRALLQGLGVGDASSSATFALLLTIMKDAMSRIAT 130
Query: 138 LICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA-GLGNFAKGIAVV 195
++ ++ G R++ + KR+R LAD D LE+ SP +A G + + V
Sbjct: 131 IVFAHRFGLRIEPDAKRYRFLADVFNDSAFFLELYSPYLGSWGKVIALSTGQALRALCGV 190
Query: 196 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGP 255
AA A++ + FAK NL++L AK + T ++GL G L + +I+
Sbjct: 191 AAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLLVG-TLVVKMIEDHSSVVILMI 249
Query: 256 LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF 309
LL + H+ + +R+ + TLN QR ++ +F+++G++ +P + RE+++F
Sbjct: 250 LLVIAHLGTNYLGVRSVCMATLNRQRATIVFEEFLRSGKVLTPEQVAERENIIF 303
>gi|449460628|ref|XP_004148047.1| PREDICTED: UPF0420 protein C16orf58-like [Cucumis sativus]
Length = 523
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 146/277 (52%), Gaps = 11/277 (3%)
Query: 53 DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
D R +Y V + + F P +V+E Y+ Y +++ L S+AL V++TQ++ A
Sbjct: 107 DGFRTLYRVSVLAVKDFFIPK----NVSEHYVIYVKWKLLHRVFSSALQVIATQAMFRAI 162
Query: 113 GL--RPTPAQATVVSWILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLE 169
G+ + A A ++W+LKDG+ + + L +++ + D+ KR R L+ L G+E
Sbjct: 163 GVGHSRSLASAAALNWVLKDGLGRLSRCLFTASIASAFDTNLKRVRFSTAVLFSLSIGVE 222
Query: 170 VLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
+L+P PQ FL +A + N K I++ +T ++ SFA NL ++ AK + S F+
Sbjct: 223 LLTPAFPQYFLLLASIANIVKQISLGCYLSTASAVHRSFAVADNLGEVSAKAQIQSVCFD 282
Query: 230 VVG--LGAGIQLASTVCSSMQGKM--IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
+G L A + S +Q + +V P+ + + ++ + ++ + TL R +I
Sbjct: 283 NLGLVLAAFLNFLSKNDRRLQAALPFVVYPIFAAMDLFGTYQGLKHVHLQTLTKDRLEII 342
Query: 286 VADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKM 322
++++++ G + +PA++ RE + R +D+ +++
Sbjct: 343 LSNWIEQGYVPTPAEVSEREGIDLLCRQGKDSWPIRI 379
>gi|302881823|ref|XP_003039822.1| hypothetical protein NECHADRAFT_49969 [Nectria haematococca mpVI
77-13-4]
gi|256720689|gb|EEU34109.1| hypothetical protein NECHADRAFT_49969 [Nectria haematococca mpVI
77-13-4]
Length = 444
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 157/312 (50%), Gaps = 17/312 (5%)
Query: 22 QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRP-VYHRVVDSFL-NKFFPSGYPFSV 79
+E+ + + S +V R + P S +++ ++P ++ R + L + F P GYP SV
Sbjct: 12 REVEVIDLDKSGNVARRWVHMP----SNRIIRKAKPSLFKRSMRKILMDAFLPVGYPHSV 67
Query: 80 NEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGK 137
++ YL Y F +LQ F S ++L+ ++LL G+ + AT ++ +LKD M V
Sbjct: 68 SDDYLAYQFFDSLQAFFSTITALLANRALLQGLGVGDANSSATFAMLLTVLKDAMSRVAT 127
Query: 138 LI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAKGIAVV 195
+I G R++ + KR+R LAD D +E+ SP P + G + + V
Sbjct: 128 IIFAQQFGLRIEPDAKRYRFLADLFNDTAFFMELYSPYFGPYGKILALTSGEALRALCGV 187
Query: 196 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGP 255
A A++ + FAK NL++L AK + T +VGL G + V ++ V
Sbjct: 188 AGGASKAALSVHFAKHDNLAELNAKEASQETAVGLVGLLVG----TFVVKHVEDHTSVVY 243
Query: 256 LLSVI---HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGR 312
L++V+ H++ +R+ +N LN QR ++ +++K+G + +P ++ RE +LF
Sbjct: 244 LMTVLVLAHLWMNYLGVRSVCMNVLNRQRATILYKEYLKSGNVLTPNEVARRESILFWSP 303
Query: 313 LIEDAGNVKMDR 324
++ + ++ R
Sbjct: 304 VVRNQHGQRIAR 315
>gi|296233362|ref|XP_002761979.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1 [Callithrix
jacchus]
Length = 468
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 70/241 (29%), Positives = 119/241 (49%), Gaps = 3/241 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKATVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGF 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
L+ +H+Y+ +RA + TLN R +++ +++ G + P E
Sbjct: 257 PGFSFGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTTANRMEP 316
Query: 307 L 307
L
Sbjct: 317 L 317
>gi|449509553|ref|XP_004163622.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58-like
[Cucumis sativus]
Length = 523
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 68/254 (26%), Positives = 138/254 (54%), Gaps = 7/254 (2%)
Query: 76 PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQ 133
P +V+E Y+ Y +++ L S+AL V++TQ++ A G+ + A A ++W+LKDG+
Sbjct: 126 PKNVSEHYVIYVKWKLLHRVFSSALQVIATQAMFRAIGVGHSRSLASAAALNWVLKDGLG 185
Query: 134 HVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
+ + L +++ + D+ KR R L+ L G+E+L+P PQ FL +A + N K I
Sbjct: 186 RLSRCLFTASIASAFDTNLKRVRFSTAVLFSLSIGVELLTPAFPQYFLLLASIANIVKQI 245
Query: 193 AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG--LGAGIQLASTVCSSMQGK 250
++ +T ++ SFA NL ++ AK + S F+ +G L A + S +Q
Sbjct: 246 SLGCYLSTASAVHRSFAVADNLGEVSAKAQIQSVCFDNLGLVLAAFLNFLSKNDRRLQAA 305
Query: 251 M--IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
+ +V P+ + + ++ + ++ + TL R +I++++++ G + +PA++ RE +
Sbjct: 306 LPFVVYPIFAAMDLFGTYQGLKHVHLQTLTKDRLEIILSNWIEQGYVPTPAEVSEREGID 365
Query: 309 FPGRLIEDAGNVKM 322
R +D+ +++
Sbjct: 366 LLCRQGKDSWPIRI 379
>gi|255542032|ref|XP_002512080.1| conserved hypothetical protein [Ricinus communis]
gi|223549260|gb|EEF50749.1| conserved hypothetical protein [Ricinus communis]
Length = 380
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 69/238 (28%), Positives = 126/238 (52%), Gaps = 3/238 (1%)
Query: 74 GYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDG 131
G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+ A W L+D
Sbjct: 3 GFPSSVTPDYVPFQVWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGATFQWFLRDL 62
Query: 132 MQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAK 190
+G ++ + G+ +DS K WR++AD + DLG ++++SPL P F+ + LG+ ++
Sbjct: 63 TGMLGGILFTFYQGSDLDSNAKMWRLVADLMNDLGMLMDLVSPLFPSAFVVVVCLGSLSR 122
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
VA+ ATR + FA + N +D+ AK + T+ + G+ G+ LA
Sbjct: 123 SFTGVASGATRAALTQHFALQSNAADISAKEGSQETVATMTGMALGMLLARITIGHPLAI 182
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
L++ H+Y+ +R + +LN QR+++++ F++TG++ SP + E +L
Sbjct: 183 WFSFLSLTMFHMYANYRAVRCLVLTSLNMQRSSILLQHFMETGQVLSPEQVSGMEHIL 240
>gi|224002080|ref|XP_002290712.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974134|gb|EED92464.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 656
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 79/271 (29%), Positives = 129/271 (47%), Gaps = 51/271 (18%)
Query: 69 KFFPSGY-PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV---- 123
+F S + P V Y R+ ++R LQ F +A + V+ TQSLL GLR +V
Sbjct: 147 QFLHSAFVPEGVTPSYYRFMRWRILQRFINANVHVIGTQSLLM--GLRGMTRSGSVGEAA 204
Query: 124 ---------------VSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTG 167
+W+LKD + + +++ S +G + D + KRWR + +Y LG G
Sbjct: 205 SFAAKEGAALGAAAATNWVLKDTLGKIVRMVWASKMGRKFDPDAKRWRFRSALIYALGNG 264
Query: 168 LEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG----------NLS-- 215
LEV + L PQ FL +A L N K ++++ + ATR +Y+SF + G N S
Sbjct: 265 LEVSTYLHPQYFLLLAMLANSCKQMSMLTSSATRNALYNSFKRVGTTSSTVSTSNNASGS 324
Query: 216 ----------------DLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSV 259
D+ AKGEA + +++G+ +GI L+ V S+Q V +L +
Sbjct: 325 SAIVRRGGNGGIENIGDITAKGEAQIAVVDLLGIASGICLSRAVGVSVQNVFAVWVVLQI 384
Query: 260 IHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
++ + EMR+ T N +R M++ +
Sbjct: 385 GEIFCMYHEMRSIVYRTFNFERMYMVLGKLM 415
>gi|321459639|gb|EFX70690.1| hypothetical protein DAPPUDRAFT_60988 [Daphnia pulex]
Length = 264
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 74/249 (29%), Positives = 128/249 (51%), Gaps = 16/249 (6%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
F P GYP SV++ YL Y + +Q F S+ LST+++L G+ TP AT++ W
Sbjct: 5 FLPQGYPSSVSDDYLEYQIWDTIQAFASSLSGSLSTRAILEGVGVGSSTSTPLAATLM-W 63
Query: 127 ILKDGMQHVGKLI---CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ------ 177
++KDG VG+++ C+ G ++D++ K+WR AD L D LE+ +P
Sbjct: 64 LIKDGTGMVGRILFAWCN--GTKLDADSKKWRFFADLLNDCALSLELCAPYAVAAVGGSN 121
Query: 178 -LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
+ + AK I VA ATR + A++GNL+D+ AK + TL N+ L
Sbjct: 122 GTMTSILCVAGVAKSIVGVAGGATRAALTQHQARQGNLADVSAKDGSQETLVNLAALITS 181
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
+ L + + ++ L +++HV++ ++ ++A + TLN R +I+ F T ++
Sbjct: 182 LWLLPILDGATGLTWLMFFLFTLLHVFANLKAVKAVTMETLNRARYMIILKQFASTKYVA 241
Query: 297 SPADLRYRE 305
S ++ E
Sbjct: 242 SVKEVNRLE 250
>gi|400593658|gb|EJP61584.1| DUF647 domain protein [Beauveria bassiana ARSEF 2860]
Length = 467
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 91/317 (28%), Positives = 154/317 (48%), Gaps = 21/317 (6%)
Query: 14 EPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPS 73
E DKS VQ+ W R +P GH + R + DSFL P
Sbjct: 28 ELDKSGHVQK---RWLHEPSG--RIIHIRPQGHTNDLFSRSLREWKKILSDSFL----PV 78
Query: 74 GYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDG 131
G+P SV+ YL Y F +LQ F S S+L+ ++LL G+ + AT ++ IL+D
Sbjct: 79 GFPHSVSSDYLAYQTFDSLQAFFSTITSLLANRALLQGLGVGDANSSATFALLLTILRDA 138
Query: 132 MQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFA 189
V ++ ++ G R++ + KR+R LAD D LE+ SP L P + + +G
Sbjct: 139 TSRVATIVFAHRFGLRIEPDAKRYRFLADLFNDTAFFLELYSPYLSPLAKVVVLSVGEAL 198
Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
+ + V+A A++ + FAK NL++L AK + T ++GL G + V +
Sbjct: 199 RALCGVSAGASKAALSMHFAKHDNLAELNAKEASQETAVGLIGLLVG-TIVVKVVEDHRS 257
Query: 250 KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF 309
+ + LL H++ +R+ + TLN QR ++ ++++TG++ SPA++ RE ++
Sbjct: 258 VVSLMILLVFGHLWMNYLGVRSVQMATLNRQRATILFQEYLRTGKVLSPAEVAKREYIIV 317
Query: 310 PGRLIEDAGNVKMDRHG 326
++ +RHG
Sbjct: 318 WSSVVS-------NRHG 327
>gi|320164887|gb|EFW41786.1| GTP-binding protein [Capsaspora owczarzaki ATCC 30864]
Length = 436
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 74/247 (29%), Positives = 125/247 (50%), Gaps = 4/247 (1%)
Query: 49 VKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSL 108
+++VD+ P F P GYP SV+ Y Y ++ LQ F S+ L+T ++
Sbjct: 31 LRLVDEKHPEASSFAQLLKAVFLPDGYPDSVSADYASYQRWDTLQAFCSSINGSLATLAV 90
Query: 109 LFAAGL--RPTPAQATVVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLG 165
L G+ A A VSW+L+DG+ +G++ + G +D+ KRWR++AD DL
Sbjct: 91 LKGVGVGDETATATAAAVSWMLRDGVGMLGRIFFAWQKGPALDANAKRWRLVADIFNDLA 150
Query: 166 TGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIS 225
+E+LSPL + FL A LG+ + I VA ATR I A+ N D+ AK +
Sbjct: 151 MFVELLSPLVGEWFLAFACLGSVLRSIVGVAGGATRAAITQHQARRNNHGDVSAKDGSQE 210
Query: 226 TLFNVVGLGAGIQLASTVCSSMQGKM-IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAM 284
TL N+ L + L + S + ++ + +H+++ +RA +T N +R +
Sbjct: 211 TLVNLAALLVSLWLLPALGSDHPTLVWLLFCSFTSLHLFANYRGVRATVFDTFNRERLFI 270
Query: 285 IVADFVK 291
+ +D+++
Sbjct: 271 VASDYMQ 277
>gi|340516831|gb|EGR47078.1| predicted protein [Trichoderma reesei QM6a]
Length = 448
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 84/304 (27%), Positives = 153/304 (50%), Gaps = 10/304 (3%)
Query: 22 QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVV----DSFLNKFFPSGYPF 77
++ I + S V + + DGH+ + + S + R V + + F P G+P
Sbjct: 23 NDVDIIELDKSGHVVQRWLHAHDGHV-LPLRRRSNSFFSRSVREWRKALFDSFLPVGFPH 81
Query: 78 SVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHV 135
SV++ YL Y F +LQ F S S+L+ ++LL G+ + AT ++ ILKD M +
Sbjct: 82 SVSKDYLSYQFFDSLQAFFSTISSLLANRALLQGLGVGDASSSATFALLLTILKDAMSRI 141
Query: 136 GKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA-GLGNFAKGIA 193
++ ++ G R++ + KR+R LAD D LE+ SP +A G + +
Sbjct: 142 ATIVFAHRFGLRIEPDAKRYRFLADLFNDTAFFLELYSPYLGSWGKVVALSTGEALRALC 201
Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIV 253
VAA A++ + FA++ NL++L AK + T ++GL G + TV + +++
Sbjct: 202 GVAAGASKAALSVHFARQDNLAELNAKEASQETAVGLIGLLVGTLVVKTV-ENHSAVVLL 260
Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRL 313
+L + H+ +R+ +NT N QR ++ +++K+G++ SP + RE ++F
Sbjct: 261 MIVLVLAHLGMNYLGVRSVCMNTFNRQRATILFEEYLKSGKVLSPEQVANRESIIFWSPA 320
Query: 314 IEDA 317
I +A
Sbjct: 321 IRNA 324
>gi|348585092|ref|XP_003478306.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cavia porcellus]
Length = 468
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 67/229 (29%), Positives = 115/229 (50%), Gaps = 3/229 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ S L+T +++ G+ A A +W+
Sbjct: 77 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSSALATHAVMLGLGVGNAEASLAAATTTWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
++D +G+++ + G+++D K+WR+ AD L D LE+ +P P F
Sbjct: 137 VRDSTGMLGRIVFAWWKGSKLDCNVKQWRLFADILNDAAMFLEIAAPTLPFCFTVTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLFSLVMLPLVSGC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ LL+ +H+Y+ +RA + TLN R +++ F++ G I
Sbjct: 257 PSFSLGCFFLLTGLHIYANYRAVRALVMETLNESRLQLVLKHFLQRGEI 305
>gi|256077542|ref|XP_002575062.1| hypothetical protein [Schistosoma mansoni]
Length = 410
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 71/229 (31%), Positives = 112/229 (48%), Gaps = 4/229 (1%)
Query: 67 LNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
L KFF P GYP SV++ YL Y + +Q F S+ L++Q++L G+ + A +
Sbjct: 41 LKKFFLPVGYPSSVSDDYLEYQIWDTIQAFASSITGALASQAVLIGVGVGDSSATILSAS 100
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+ KDG +G++I + G ++D + K WR +AD L D LE++SPL LF +
Sbjct: 101 LTWMFKDGSGMIGRIIFAGYHGIKLDCDCKFWRFVADILNDCALFLEIISPLFNYLFTPL 160
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
L N K + VA ATR I A NL+D+ AK + TL N++ L
Sbjct: 161 LCLANVLKSLVGVAGSATRAAIVQHQAINNNLADVSAKDGSQETLSNLMAWLLNFILLYM 220
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
V + + IH+YS ++ + T N R + + + K
Sbjct: 221 VTGNQFLIWFCFICCTSIHLYSNYRAVKCLKLRTFNRTRFHLAIQQWFK 269
>gi|330805147|ref|XP_003290548.1| hypothetical protein DICPUDRAFT_155062 [Dictyostelium purpureum]
gi|325079335|gb|EGC32940.1| hypothetical protein DICPUDRAFT_155062 [Dictyostelium purpureum]
Length = 493
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 66/247 (26%), Positives = 127/247 (51%), Gaps = 4/247 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA-GLRPTPAQ--ATVVSW 126
F P GYP SV+ Y+ Y ++ Q+ + VLST +LL G+ A A +SW
Sbjct: 100 FLPKGYPNSVSPDYIHYYKWLFCQNVVGSVTYVLSTHALLTTVVGMSVASALPFAAAISW 159
Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
+LKDG+ + +N +D + K+++ D L++LG LE+ +P FL +A +
Sbjct: 160 VLKDGLGAFALVFFANKFSTLLDFDLKKYKFRGDILHNLGVLLEMCTPFIQGYFLPLASI 219
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
N +KG+A + +TR + SF+ + N+ D+ AK ++ S ++G+G G +
Sbjct: 220 SNLSKGLAGLIYGSTRASLNKSFSIKDNIGDITAKYQSQSMAAYLMGMGIGSTFGLLLSG 279
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
S + + LS +H++ + +++ + +LN QR ++I+ +++ I + + E
Sbjct: 280 STFSGISIVFSLSFVHMFLGYKAVKSINLKSLNKQRLSIIIDNWLLNSEILNSKIVNNNE 339
Query: 306 DLLFPGR 312
+ +F +
Sbjct: 340 EFIFDSK 346
>gi|323455525|gb|EGB11393.1| hypothetical protein AURANDRAFT_11851, partial [Aureococcus
anophagefferens]
Length = 248
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 77/248 (31%), Positives = 126/248 (50%), Gaps = 9/248 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR--PTPAQATVVSWI 127
F P+G+P +V GY Y + + S+A VLST+ LL A G+ A +W
Sbjct: 1 FLPAGFPTTVTPGYGTYAALQFAAYACSSACGVLSTRVLLAAVGVGDGAAAPLAAAANWA 60
Query: 128 LKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLS-PLCPQLFLEMAGL 185
+KDG+ +G + + +D+ P++WR+ + A DL + +E+ + P PQ F+ +AGL
Sbjct: 61 IKDGLGMLGGVAFAATWSNSLDARPRQWRLRSSAALDLASLVELAALPAFPQYFVPIAGL 120
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEG---NLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
N AK ++ +AA A+R I+ S + NL+DL AK + + + ++ GL G+ AS
Sbjct: 121 ANVAKNVSYLAASASRAAIHQSLSARSDASNLADLTAKTGSQTIVASLCGLALGVA-ASN 179
Query: 243 VCSSMQGKMI-VGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
C + + LS H+ ++ TL+ R A +V F +TG +PAD+
Sbjct: 180 ACGAEASDVWPAWAALSACHLGCTYASLKYVNTTTLDDARLAALVDAFRETGACPTPADV 239
Query: 302 RYREDLLF 309
RE L
Sbjct: 240 SERESLFL 247
>gi|312082381|ref|XP_003143421.1| hypothetical protein LOAG_07840 [Loa loa]
gi|307761414|gb|EFO20648.1| hypothetical protein LOAG_07840 [Loa loa]
Length = 393
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 70/249 (28%), Positives = 130/249 (52%), Gaps = 10/249 (4%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
F + F P GYP SV+ Y+ Y + +Q F S+ S L+T+++L G+ A
Sbjct: 43 FRDIFMPRGYPQSVSPDYMNYQIWDTIQAFASSMSSALATEAILRGVGVGNETASTMAAA 102
Query: 124 VSWILKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
V+W+L+DG+ + +++ + + +D++ K+WR++AD DL L++++P+ P LF+ +
Sbjct: 103 VAWLLRDGIGMLTRILFAWFYSPYLDADCKQWRLIADCFNDLAFCLDLITPIFPNLFMPI 162
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
L + +G+ VA ATR + + A N+ D+ AK + TL N+ L + L
Sbjct: 163 VCLSSMVRGVVGVAGGATRTAVVNHQAILDNVGDVAAKDGSQETLINIFALFCSLLLLPI 222
Query: 243 VCSSMQGKMIVGPLLSV---IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
V G + V L + IH+Y +++ TLN +IV ++ +T +I + +
Sbjct: 223 V----NGNICVWLLFCIFTFIHLYGNYRAVKSLQFRTLNQSLLQIIVKNYAETKKIDTVS 278
Query: 300 DLRYREDLL 308
+ +E +L
Sbjct: 279 KVNNKESML 287
>gi|46137815|ref|XP_390599.1| hypothetical protein FG10423.1 [Gibberella zeae PH-1]
Length = 425
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 145/301 (48%), Gaps = 20/301 (6%)
Query: 29 FETSDSVTRHYQFQPDG--HLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRY 86
++ S + R + PD L KV P H F P GYP SV+ YL Y
Sbjct: 33 YDKSGNFVRSWTHGPDNIVVLKSKVCTSWLPSMHEA-------FMPVGYPHSVSSDYLNY 85
Query: 87 TQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKL-ICSNL 143
F ++Q F S S+L+ ++LL G+ + AT ++ +LKD + V +
Sbjct: 86 QFFDSMQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVLKDAISRVATIAFAQQF 145
Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAKGIAVVAARATRL 202
G R++ + KR+R LAD D +E+ SP L P + G + + VAA A++
Sbjct: 146 GLRIEPDAKRFRFLADLFNDTAFFMELYSPYLGPYGKILALTTGEALRALCGVAAGASKA 205
Query: 203 PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--CSSMQGKMIVGPLLSVI 260
+ FAK NL++L AK + T ++GL G + V +++ MIV L +
Sbjct: 206 ALSVHFAKHDNLAELNAKEASQETAIGLIGLAVGTLVVKYVEDHNTVVFLMIV---LVLA 262
Query: 261 HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL--FPGRLIEDAG 318
H++ +R ++ LN QR ++ +++KTG+I +P ++ RE +L + IE AG
Sbjct: 263 HLWMNYLGVRCVCMDNLNRQRATILFDEYLKTGKIMTPEEVATRESILIWWSSTRIEMAG 322
Query: 319 N 319
+
Sbjct: 323 S 323
>gi|449017626|dbj|BAM81028.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 518
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 80/260 (30%), Positives = 130/260 (50%), Gaps = 13/260 (5%)
Query: 33 DSVTRHYQFQPDGHLSVKVVDDSRPVY--HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFR 90
D RH++ P+ V +RP R+V + L P+ YP +V+ Y R+ ++
Sbjct: 66 DEEQRHWRVVPEPPAGTAV---TRPENWPQRLVQTLL---LPADYPSAVSSDYDRFWRWN 119
Query: 91 ALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKL-ICSNLGARM 147
L+H A VL TQS+L A G+ A A WILKDG+ + K+ + + L R+
Sbjct: 120 ILRHTLLEAAEVLGTQSMLLALGVGGASALPLAAAWKWILKDGLGYFAKVALATQLAPRV 179
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVA-ARATRLPIYS 206
D++PKR+R D + LGT E+L+ P FL +A GN + A VA A R+ +Y
Sbjct: 180 DNDPKRFRFYGDFVMALGTLFEMLTLAFPPWFLALASTGNLLRKAADVATGPAYRVFLY- 238
Query: 207 SFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVI 266
F+ N D+ +K E+ + + G+GAGI +++ ++ LLS H+Y+
Sbjct: 239 HFSIRNNSGDVSSKSESQVVVGRLSGIGAGIAVSAVTSHDASALVMAYGLLSAGHLYATY 298
Query: 267 EEMRAAPVNTLNPQRTAMIV 286
+ + + TLN R ++
Sbjct: 299 QSVIRLELRTLNRTRLEYVL 318
>gi|327294507|ref|XP_003231949.1| hypothetical protein TERG_07567 [Trichophyton rubrum CBS 118892]
gi|326465894|gb|EGD91347.1| hypothetical protein TERG_07567 [Trichophyton rubrum CBS 118892]
Length = 483
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 75/276 (27%), Positives = 148/276 (53%), Gaps = 15/276 (5%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPA 119
+ S LN F P+GYP SV + Y+ Y + +LQ F+S+ +LS++++L G+ +P
Sbjct: 55 LTSILNVFLPAGYPHSVTDDYIEYQIYDSLQAFSSSIAGLLSSRAVLQGVGVGDATASPT 114
Query: 120 QATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ- 177
A +++ +L++ M + ++ ++ LG ++ E K +R+ AD L D L+ LSP+ P+
Sbjct: 115 AALLLN-VLQESMGRIATILFAHRLGTSLEPECKLYRLAADILNDSAMVLDCLSPIFPKP 173
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
+ + + L + + + VAA +++ + + FA+ GNL +L AK + T+ ++ G+ G
Sbjct: 174 VRVGLLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAKDSSQETVISLAGMLVGS 233
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
+ S + S ++ I +L ++H+ + +RA + TLN QR ++ + + R+ +
Sbjct: 234 VVVSYITSPLE-TWIALIILLIVHLGTNHAAVRAVKMTTLNRQRANIVFSYLFEDDRVLT 292
Query: 298 PADLRYREDLLFPGRLIEDAGNVKMDRH--GIRAEC 331
PA+ E R+ E G ++ + GI C
Sbjct: 293 PAEASEEE------RIFERDGVLRWNAESSGILGTC 322
>gi|219129806|ref|XP_002185071.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403566|gb|EEC43518.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 500
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 80/256 (31%), Positives = 128/256 (50%), Gaps = 5/256 (1%)
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
R + V + F PSGYP GYLRY+ + +Q ++ SVL+TQ +L G+
Sbjct: 71 RVLLRNVQSGLRSTFLPSGYPARTPPGYLRYSVWSWVQDLSTQLRSVLATQRVLEGVGVG 130
Query: 116 PTPAQA--TVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLS 172
A A +++++++DG L S +R S+ KRWRI AD + D+G LEV +
Sbjct: 131 REGATALSALLNFLVRDGCGMAASLAFTSAAASRFQSDVKRWRIFADVMVDIGITLEVAA 190
Query: 173 PLC-PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
C P LFL M LGN K + VAA A I +AK ++SD+ AK A T+ +
Sbjct: 191 SACPPTLFLPMISLGNACKAMCGVAAGACGGAINLHWAKGSDISDIQAKFGAQHTVTGSL 250
Query: 232 GLGAGIQLASTVCSSMQGKM-IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
GL A +V + G++ I+ L+ +HV + + MR + N R +++
Sbjct: 251 GLVFAAFFAKSVSTVPGGRLWILYAALTALHVMANLRCMRIIHFDVFNTTRLRLVLRALF 310
Query: 291 KTGRISSPADLRYRED 306
++G +S+ + E+
Sbjct: 311 ESGTLSNKTAVTTAEN 326
>gi|357506015|ref|XP_003623296.1| hypothetical protein MTR_7g068310 [Medicago truncatula]
gi|355498311|gb|AES79514.1| hypothetical protein MTR_7g068310 [Medicago truncatula]
Length = 492
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 81/298 (27%), Positives = 132/298 (44%), Gaps = 21/298 (7%)
Query: 32 SDSVTRHYQFQPDGHLSVKVVDDSRPVYHR-------------VVDSFLNKFFPSGYPFS 78
S+ + Y D L ++++ R +R + D+ P+G+P S
Sbjct: 60 SNGTAKRYIIGDDSRLRTILIEEDRSTQNRFGVLHSPDKRLSWLPDTVKAFILPAGFPGS 119
Query: 79 VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVSWILKDGMQ 133
V++ YL+Y + + T + T SLL A G+ A A + W+ KDG+
Sbjct: 120 VSDDYLQYMLLQFPTNVTGWICHTIVTSSLLKAVGVGSFSGTTAAASAAAIRWVSKDGIG 179
Query: 134 HVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
VG+L I G D +PK+WR+ AD + G+ ++ +PL P FL +A LGN K I
Sbjct: 180 AVGRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIFDLTTPLYPGYFLPLASLGNLTKAI 239
Query: 193 AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST--VCSSMQGK 250
A + I S FA NL ++ AK E + +VGLG GI + T + S
Sbjct: 240 ARGLKDPSSRVIQSHFAISANLGEIAAKEEVWEVVAQLVGLGLGILILDTPGLVKSYTVL 299
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
++ ++H++ E + NT+N +R +V V + D E++L
Sbjct: 300 SSTWLIVRLLHLWLRYESLSVLQFNTINLKRARTLVKSHVLHSTVPGCMDCNKEENIL 357
>gi|358380832|gb|EHK18509.1| hypothetical protein TRIVIDRAFT_14438, partial [Trichoderma virens
Gv29-8]
Length = 451
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 80/296 (27%), Positives = 147/296 (49%), Gaps = 10/296 (3%)
Query: 22 QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSF----LNKFFPSGYPF 77
++ I + S V + + DGH+ +++ + S + R + + F P G+P
Sbjct: 23 NDVEIVELDKSGHVQQQWLHAHDGHV-LRLKNRSNGFFSRSAREWRKVLFDSFLPVGFPH 81
Query: 78 SVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHV 135
SV++ YL Y F +LQ F S S+L+ ++LL G+ + AT ++ ILKD M +
Sbjct: 82 SVSKDYLAYQMFDSLQAFFSTISSLLANRALLQGLGVGDASSSATFALLLTILKDAMSRI 141
Query: 136 GKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA-GLGNFAKGIA 193
++ ++ G R++ + KR+R LAD D LE+ SP +A G + +
Sbjct: 142 ATIVFAHRFGLRIEPDAKRYRFLADIFNDSAFFLELYSPYLGSWGKVIALSTGEALRALC 201
Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIV 253
VAA A++ + FA++ NL++L AK + T ++GL G + T+ +I+
Sbjct: 202 GVAAGASKAALSVHFARQDNLAELNAKEASQETAVGLIGLLVGTLVVKTI-EDHSSVVIL 260
Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF 309
LL H+ +R+ + TLN QR ++ ++++K + +P + RE ++F
Sbjct: 261 MILLVFAHLGMNYLGVRSVCMTTLNRQRATILFSEYLKNRVVLTPEQVAERESIIF 316
>gi|186478458|ref|NP_001117280.1| uncharacterized protein [Arabidopsis thaliana]
gi|332190946|gb|AEE29067.1| uncharacterized protein [Arabidopsis thaliana]
Length = 353
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 63/210 (30%), Positives = 116/210 (55%), Gaps = 3/210 (1%)
Query: 102 VLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILA 158
+LSTQ+LL A G+ A W L+D +G ++ + G+ +DS K WR++A
Sbjct: 2 MLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVA 61
Query: 159 DALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLF 218
D + D+G +++LSPL P F+ + LG+ ++ VA+ ATR + FA + N +D+
Sbjct: 62 DLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAADIS 121
Query: 219 AKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLN 278
AK + T+ ++G+ G+ LA + + L+V H+Y+ +R +N+LN
Sbjct: 122 AKEGSQETMATMMGMSLGMLLARFTSGNPMAIWLSFLSLTVFHMYANYRAVRCLVLNSLN 181
Query: 279 PQRTAMIVADFVKTGRISSPADLRYREDLL 308
+R+++++ F++TG++ SP + E +L
Sbjct: 182 FERSSILLTHFIQTGQVLSPEQVSSMEGVL 211
>gi|42567566|ref|NP_195771.2| uncharacterized protein [Arabidopsis thaliana]
gi|209863158|gb|ACI88737.1| At5g01510 [Arabidopsis thaliana]
gi|332002971|gb|AED90354.1| uncharacterized protein [Arabidopsis thaliana]
Length = 509
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 79/261 (30%), Positives = 127/261 (48%), Gaps = 20/261 (7%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---------RPTPAQA 121
FPSG+P SV++ YL Y ++ + T +VL T SLL A G+ A A
Sbjct: 120 FPSGFPGSVSDDYLDYMLWQFPTNITGWICNVLVTSSLLKAVGVGSFSGTSAAATAAASA 179
Query: 122 TVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
+ W+ KDG+ +G+L I G+ D +PK+WR+ AD + G+ ++ + L P FL
Sbjct: 180 AAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFL 239
Query: 181 EMAGLGNFAKGIAVVAARATRLP----IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
+A GN AK + AR R P I + FA GNL ++ AK E ++GLG G
Sbjct: 240 LLASTGNLAKAV----ARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFG 295
Query: 237 IQLAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
I + T + S ++ + ++H++ + + NT+N +R +IV V
Sbjct: 296 ILIIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHSV 355
Query: 295 ISSPADLRYREDLLFPGRLIE 315
+ D RE++L R ++
Sbjct: 356 VPGYVDCNKRENILLWQRFMK 376
>gi|449461779|ref|XP_004148619.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
gi|449518467|ref|XP_004166263.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 495
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 72/251 (28%), Positives = 118/251 (47%), Gaps = 8/251 (3%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVS 125
P+G+P SV++ YL+Y + + T L T SLL A G+ T A A +
Sbjct: 119 LPTGFPESVSDDYLQYMIRQFPTNVTGWICHTLVTSSLLKAVGIGSFSGTTTAASAVAIR 178
Query: 126 WILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
W+ KDG+ VG+L I G D +PK+WR+ AD + G+ ++ +PL P FL +A
Sbjct: 179 WVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIFDLATPLYPSYFLPLAS 238
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
LGN K +A + I + FA GNL ++ AK E + ++GL GI + T
Sbjct: 239 LGNLTKAVARGLKDPSFRVIQNHFAVSGNLGEIAAKEEVWEVVAQLLGLAIGILILDTPG 298
Query: 245 SSMQGKMIVGPLLS--VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
++ LS ++H++ + + NT+N +R ++ V ++ +
Sbjct: 299 LVNSYSVLSATWLSMRLLHLWLRYQSLAVLHFNTINLKRARILARAHVLHNKVPGTVECN 358
Query: 303 YREDLLFPGRL 313
E +L R
Sbjct: 359 SEESILVWERF 369
>gi|358335880|dbj|GAA28290.2| UPF0420 protein C16orf58 [Clonorchis sinensis]
Length = 428
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 68/231 (29%), Positives = 111/231 (48%), Gaps = 3/231 (1%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILK 129
PSGYP SV+ YL Y + +Q S+ L+ QS+L G+ A +SW+LK
Sbjct: 21 PSGYPNSVSPDYLEYQIWDTVQALASSVTGALAAQSVLIGVGVGDVKATILGASLSWMLK 80
Query: 130 DGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
DG +GK++ G+ +D + KRWR+LAD + D LE+ S P +F + L N
Sbjct: 81 DGSGMIGKIVFVGWQGSNLDCDCKRWRLLADIINDCALFLELASSFFPSVFTFIICLANL 140
Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
K + VA ATR + A + N++D+ AK + TL N++ L + + V +
Sbjct: 141 CKSVVSVAGGATRAAVTLHQALDNNVADVAAKDGSQETLSNLLALFFNLSVVYFVTGNWS 200
Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
+L+ IHVY+ ++ ++ N R + V + + P+
Sbjct: 201 IIWTGFVILTFIHVYANYRAVQCLRLSNFNRNRFHIAVQQWFRQRLFHDPS 251
>gi|357127303|ref|XP_003565322.1| PREDICTED: UPF0420 protein C16orf58-like [Brachypodium distachyon]
Length = 486
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 82/302 (27%), Positives = 138/302 (45%), Gaps = 22/302 (7%)
Query: 33 DSVTRHYQFQPDGHLSVKV---------VDDSRP--VYHRVVDSFLNKFFPSGYPFSVNE 81
D V + Y D L V++ +D +P R + F+ P+G+P SV++
Sbjct: 54 DGVAKRYMLDGDSKLQVRLEKHEHLLNNAEDGKPDSSIPRAITDFV---LPAGFPGSVSD 110
Query: 82 GYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVSWILKDGMQHVG 136
YL+Y ++ + T L T SLL A G+ A A + W+ KDG+ G
Sbjct: 111 DYLQYMLWQFPTNVTGWICHTLVTSSLLKAVGVGSFTGTSAAASAAAIRWVSKDGIGAFG 170
Query: 137 KL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVV 195
+L I G D++PK+WR+ AD + +G+ E+ +PL P FL +A LGN AK +
Sbjct: 171 RLLIGGRFGTLFDNDPKKWRMYADFIGSVGSIFELTTPLYPGYFLPLASLGNLAKAVGRG 230
Query: 196 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST--VCSSMQGKMIV 253
+ I + FAK GNL ++ AK E ++GL G+ + T V SS +
Sbjct: 231 FRDPSNRVIQNHFAKSGNLGEIAAKEEVWGVGAQLLGLSIGVLILDTSGVQSSYPTLTLT 290
Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRL 313
+ ++H++ + + T+N +R ++V V + E++L R
Sbjct: 291 WLGVRLLHLWFRYQSLIVLKFRTVNLKRARILVRSHVAHHTVPGYVACNEEENILTWERF 350
Query: 314 IE 315
++
Sbjct: 351 LQ 352
>gi|356523306|ref|XP_003530281.1| PREDICTED: UPF0420 protein-like [Glycine max]
Length = 480
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 84/298 (28%), Positives = 134/298 (44%), Gaps = 21/298 (7%)
Query: 32 SDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVD--------SFLNKF-----FPSGYPFS 78
S+ + Y D L +V++ R +R D S+L + P+G+P S
Sbjct: 47 SNGTAKRYVLGDDSQLQAFLVEEDRSTPNRFQDLHSSDESLSWLPEIIKDFVLPAGFPGS 106
Query: 79 VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVSWILKDGMQ 133
V++ YL Y + + T L T SLL A G+ A A + W+ KDG+
Sbjct: 107 VSDDYLDYMLLQFPTNVTGWICHTLVTSSLLKAVGIGSFTGTTAAASAAAIRWVSKDGIG 166
Query: 134 HVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
VG+L I G+ D +PK+WR+ AD + G+ ++ + L P FL +A LGN K +
Sbjct: 167 AVGRLFIGGRFGSLFDDDPKQWRMYADFIGSAGSIFDLTTQLYPAYFLPLASLGNLTKAV 226
Query: 193 AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMI 252
A + I + FA GNL ++ AK E + +VGL GI + T +I
Sbjct: 227 ARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVVAQLVGLSLGILILDTPGLVKSYGVI 286
Query: 253 VGPLLS--VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
LS ++H++ E + NT+N +R ++V V + D E++L
Sbjct: 287 SLTWLSMRLLHLWLRYESLSVLQFNTINIKRARILVKSHVLHSTVPGCTDCNREENIL 344
>gi|242019564|ref|XP_002430230.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212515330|gb|EEB17492.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 527
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 67/240 (27%), Positives = 118/240 (49%), Gaps = 6/240 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
F P GYP+SV+ YL Y + Q F S+ L+ ++++ G+ TP A + +W
Sbjct: 44 FLPKGYPYSVSSDYLEYQIWDFCQAFCSSICGTLTVRAVMKGIGVGSNVATPMSAAL-TW 102
Query: 127 ILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
I+KDG +G ++ + L G+++DS+ K+WR+ AD L D+ +E+ P+ + M
Sbjct: 103 IIKDGTGMIGSIMFAWLKGSKLDSDCKKWRLFADILNDIAMFIELSIPIIMDYSMYMLCA 162
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K + A +TR I + A +GN +D+ AK + N++G GI L V
Sbjct: 163 ATIFKALVGTAGSSTRAAIITHQAVKGNTADVSAKDGSQELFTNLLGFIFGIFLLKNVNE 222
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
S Q IV ++ H+ + +R +N N +R + + ++ T + + + RE
Sbjct: 223 SNQ-VWIVMCTATIFHLIFNYKAVRCLKINVFNGERLRLTLLSYLPTSCVPTIEQVNRRE 281
>gi|408393145|gb|EKJ72412.1| hypothetical protein FPSE_07436 [Fusarium pseudograminearum CS3096]
Length = 427
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 87/301 (28%), Positives = 144/301 (47%), Gaps = 20/301 (6%)
Query: 29 FETSDSVTRHYQFQPDG--HLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRY 86
++ S + R + PD L K P H F P GYP SV+ YL Y
Sbjct: 33 YDKSGNFVRSWTHGPDNIVVLQSKARTSWLPSMHEA-------FMPVGYPHSVSSDYLNY 85
Query: 87 TQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKL-ICSNL 143
F ++Q F S S+L+ ++LL G+ + AT ++ +LKD + V +
Sbjct: 86 QFFDSMQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVLKDAISRVATIAFAQQF 145
Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAKGIAVVAARATRL 202
G R++ + KR+R LAD D +E+ SP L P + G + + VAA A++
Sbjct: 146 GLRIEPDAKRFRFLADLFNDTAFFMELYSPYLGPYGKILALTTGEALRALCGVAAGASKA 205
Query: 203 PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--CSSMQGKMIVGPLLSVI 260
+ FAK NL++L AK + T ++GL G + V +++ MIV L +
Sbjct: 206 ALSVHFAKHDNLAELNAKEASQETAIGLIGLAVGTIVVKYVEDHNTVVFLMIV---LVLA 262
Query: 261 HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL--FPGRLIEDAG 318
H++ +R ++ LN QR ++ +++KTG+I +P ++ RE +L + IE AG
Sbjct: 263 HLWMNYLGVRCVCMDNLNRQRATILFDEYLKTGQIMTPEEVAKRESILLWWSSTRIEMAG 322
Query: 319 N 319
+
Sbjct: 323 S 323
>gi|302768148|ref|XP_002967494.1| hypothetical protein SELMODRAFT_86604 [Selaginella moellendorffii]
gi|300165485|gb|EFJ32093.1| hypothetical protein SELMODRAFT_86604 [Selaginella moellendorffii]
Length = 412
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 74/260 (28%), Positives = 127/260 (48%), Gaps = 10/260 (3%)
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT 117
V+ R +SF F +G SV++ YL+Y ++ + T L T SLL A GL
Sbjct: 4 VFWRFSESFGFLQFSAG---SVSQDYLQYMVWQLPTNITGWICRALITSSLLQAVGLSTE 60
Query: 118 PAQATV----VSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLS 172
P A + W+ KDG+ VG+L I G D +PK+WR+ A+ + G E+ +
Sbjct: 61 PGSAAAASAAIKWVTKDGLGAVGRLFIGGRFGGVFDQDPKQWRMYAEVIGSFGGLFELAT 120
Query: 173 PLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG 232
PL P FL +A LG+ K +A + I + FA N+ D+ AK E ++G
Sbjct: 121 PLAPDYFLLLASLGHLTKAVARGLKDPSFRVIQNHFAISENVGDVAAKEEVWEVAGQLIG 180
Query: 233 LGAGIQLAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
L G+ L ST V +S + ++ H++ + + A + T+N +R +++ + +
Sbjct: 181 LSLGVALLSTPVVATSYSQLALSWMIIRSFHLWLRYQTLAALELPTMNFKRLNVLIMEHI 240
Query: 291 KTGRISSPADLRYREDLLFP 310
+ +S ++ +E+L+ P
Sbjct: 241 QGRPLSGVSECNKKENLILP 260
>gi|298706870|emb|CBJ25834.1| Unknown protein [Ectocarpus siliculosus]
Length = 673
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 76/242 (31%), Positives = 126/242 (52%), Gaps = 15/242 (6%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTP 118
V +N F P+GYP SV Y+RY + LQ +S VL+TQ++L G+ TP
Sbjct: 196 VRGGLMNTFLPAGYPGSVRPEYVRYRCWDILQDLSSHLRGVLATQAVLEGMGVGRAGSTP 255
Query: 119 AQATVVSWILKDGMQHVGKLICSNLG-ARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
AT + W+ +DG +G L+ +++ A K WR+ AD D+G LE+L+PL P
Sbjct: 256 LAAT-LQWMARDGASMIGGLLFTSVASANFGVNIKTWRLFADTSNDIGLALEMLAPL-PA 313
Query: 178 L-----FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG 232
FL + + + K VA AT I +A + N++D+ AK A T+ +++G
Sbjct: 314 FREGGRFLRLICIASVFKAACGVAGGATGAAITEHWAIDNNIADIGAKNGAQHTVASLLG 373
Query: 233 LGAGIQLAS--TVCSSMQGKMIVG--PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
LG + A +V S G + G +L+++H+ + MR + +LNP R +++V +
Sbjct: 374 LGLSVWFARAVSVGGSASGSRVWGWYAVLTLVHLLANYAAMRTLALRSLNPTRASLLVTE 433
Query: 289 FV 290
++
Sbjct: 434 YL 435
>gi|326427989|gb|EGD73559.1| hypothetical protein PTSG_05267 [Salpingoeca sp. ATCC 50818]
Length = 404
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 72/231 (31%), Positives = 118/231 (51%), Gaps = 3/231 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-ATVVSWIL 128
F P Y SV+ +L Y +++ + T A VLS QSLL+A G+ A ++WIL
Sbjct: 120 FLPRDYRASVSPKFLDYCKWQFVHMTTMTASGVLSMQSLLYAVGVGAGSVPLAAAINWIL 179
Query: 129 KDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
KDG+ +G ++ ++ G R D +PKR R A + +++++PL P FL +A + N
Sbjct: 180 KDGLGQLGGMLYGSIFGTRFDDDPKRQRFNAVLSLQVSGIVDIITPLFPHHFLLLASVSN 239
Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
F K ++ +A+ ATR + SF + NL D+ AK + S +V G G GI ++ +
Sbjct: 240 FGKNVSYLASSATRAQMNLSFTRTSNLGDVTAKMTSQSIAASVFGTGLGILVSKVTGTEA 299
Query: 248 QGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
M LS++ + + + T N QR A I+A T ++ +P
Sbjct: 300 ALLMAAYVPLSLVSIGGNYYSSKFVSLKTFNVQR-AEILARHYFTTKVCTP 349
>gi|307191039|gb|EFN74793.1| UPF0420 protein C16orf58-like protein [Camponotus floridanus]
Length = 825
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 68/257 (26%), Positives = 123/257 (47%), Gaps = 4/257 (1%)
Query: 54 DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
+ + + R+V F P GYP SV+ Y Y + +Q S ++ L+T S++ G
Sbjct: 392 EKQSICSRIVSIVKEIFLPQGYPDSVHSDYTAYQIWDTIQASASTIMNTLATHSIMIGIG 451
Query: 114 LRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV 170
+ + A A ++WILK+G VG ++ + G ++D + K+WR+ AD L D+ G+E+
Sbjct: 452 VGESNATPLAAAITWILKNGAGMVGSIMFAWWNGTKLDGQCKKWRLFADILNDMAAGMEL 511
Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
L + + + K + VA ATR+ + A NL+D+ AK + N+
Sbjct: 512 LVSYFSSYSVVILCISTIMKSMVGVAGGATRVALTQHQAIRNNLADVAAKDGSQEMCVNL 571
Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+ GI + S + Q + + L +H+Y+ ++ ++ LN R A+I+ ++
Sbjct: 572 IASFIGIFILS-LFHDRQYLIELYLFLVAVHLYANYSAVKVLCLDILNEDRLALIIKYYM 630
Query: 291 KTGRISSPADLRYREDL 307
RI P L E +
Sbjct: 631 INERIPEPKRLNKEESV 647
>gi|297810247|ref|XP_002873007.1| hypothetical protein ARALYDRAFT_907999 [Arabidopsis lyrata subsp.
lyrata]
gi|297318844|gb|EFH49266.1| hypothetical protein ARALYDRAFT_907999 [Arabidopsis lyrata subsp.
lyrata]
Length = 510
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/261 (29%), Positives = 126/261 (48%), Gaps = 20/261 (7%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---------RPTPAQA 121
FP+G+P SV++ YL Y ++ + T +VL T SLL A G+ A A
Sbjct: 121 FPTGFPASVSDDYLDYMLWQFPTNVTGWICNVLVTSSLLKAVGVGSFSGTSAAATAAASA 180
Query: 122 TVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
+ W+ KDG+ +G+L I G+ D +PK+WR+ AD + G+ ++ + L P FL
Sbjct: 181 AAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFL 240
Query: 181 EMAGLGNFAKGIAVVAARATRLP----IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
+A GN AK + AR R P I + FA GNL ++ AK E ++GLG G
Sbjct: 241 LLASTGNLAKAV----ARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFG 296
Query: 237 IQLAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
I + T + S + + ++H++ + + NT+N +R +IV V
Sbjct: 297 ILIIDTPGLVKSFPFVSLTWTSVRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHSV 356
Query: 295 ISSPADLRYREDLLFPGRLIE 315
+ D RE++L R ++
Sbjct: 357 VPGYVDCNKRENILLGQRFMK 377
>gi|195643336|gb|ACG41136.1| hypothetical protein [Zea mays]
Length = 442
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 69/256 (26%), Positives = 129/256 (50%), Gaps = 9/256 (3%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ R+ L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 50 WERIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQS 109
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + DLG +++LSPL
Sbjct: 110 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 169
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK---GEAISTLFNVVG 232
P + + LG+ ++ VA+ ATR + FA N +D+ AK E ++T+ +
Sbjct: 170 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNATDISAKEGSQETLATMLGMGL 229
Query: 233 LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
Q+ S+ + L++ H+Y+ + +++ + TLN +R ++++ F +
Sbjct: 230 GMLLAQITRGHALSVWASFLS---LTMFHMYANYKAVQSLSLTTLNYERASILLQYFKEC 286
Query: 293 GRISSPADLRYREDLL 308
G + P + +E +L
Sbjct: 287 GEVLVPRKVSQQEHIL 302
>gi|212274801|ref|NP_001130544.1| uncharacterized protein LOC100191643 [Zea mays]
gi|194689438|gb|ACF78803.1| unknown [Zea mays]
gi|223974155|gb|ACN31265.1| unknown [Zea mays]
gi|413956565|gb|AFW89214.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
Length = 442
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 69/256 (26%), Positives = 129/256 (50%), Gaps = 9/256 (3%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ R+ L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 50 WERIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQS 109
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + DLG +++LSPL
Sbjct: 110 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 169
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK---GEAISTLFNVVG 232
P + + LG+ ++ VA+ ATR + FA N +D+ AK E ++T+ +
Sbjct: 170 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGL 229
Query: 233 LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
Q+ S+ + L++ H+Y+ + +++ + TLN +R ++++ F +
Sbjct: 230 GMLLAQITRGHALSVWASFLS---LTMFHMYANYKAVQSLSLTTLNYERASILLQYFKEC 286
Query: 293 GRISSPADLRYREDLL 308
G + P + +E +L
Sbjct: 287 GEVLVPRKVSQQEHIL 302
>gi|345316269|ref|XP_001510147.2| PREDICTED: UPF0420 protein C16orf58 homolog, partial
[Ornithorhynchus anatinus]
Length = 353
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 72/247 (29%), Positives = 121/247 (48%), Gaps = 15/247 (6%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--------RPTPAQA 121
F P G+P SV+ YL Y + +Q F S+ L+TQ++L G+ PT A
Sbjct: 3 FLPRGFPDSVSPDYLSYQLWDTVQAFASSLTGSLATQAVLRGVGVGDGTMTVDSPTGAL- 61
Query: 122 TVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
++K G +G++ G+++D + K+WR+ AD L D LE+++P P F
Sbjct: 62 -----LIKHGTGMLGRIGFAWKKGSKLDCDAKKWRLFADVLNDAAMFLEIVAPAFPAYFT 116
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ + AK I VA ATR + A+ N++D+ AK + TL N+ GL + L
Sbjct: 117 LTVCVSSLAKCIVGVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLVSLMLL 176
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
V + + L + +H+Y+ +RA + TLN R +++ F++ G++ S A
Sbjct: 177 PLVSDRPSLCLPLFLLFTSLHLYANFRAVRAVVMETLNESRLRLVLEHFLREGQVLSLAS 236
Query: 301 LRYREDL 307
+E L
Sbjct: 237 ANEKEPL 243
>gi|413956564|gb|AFW89213.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
Length = 485
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 69/256 (26%), Positives = 129/256 (50%), Gaps = 9/256 (3%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ R+ L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 50 WERIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQS 109
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + DLG +++LSPL
Sbjct: 110 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 169
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK---GEAISTLFNVVG 232
P + + LG+ ++ VA+ ATR + FA N +D+ AK E ++T+ +
Sbjct: 170 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGL 229
Query: 233 LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
Q+ S+ + L++ H+Y+ + +++ + TLN +R ++++ F +
Sbjct: 230 GMLLAQITRGHALSVWASFLS---LTMFHMYANYKAVQSLSLTTLNYERASILLQYFKEC 286
Query: 293 GRISSPADLRYREDLL 308
G + P + +E +L
Sbjct: 287 GEVLVPRKVSQQEHIL 302
>gi|242056489|ref|XP_002457390.1| hypothetical protein SORBIDRAFT_03g006560 [Sorghum bicolor]
gi|241929365|gb|EES02510.1| hypothetical protein SORBIDRAFT_03g006560 [Sorghum bicolor]
Length = 624
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 75/252 (29%), Positives = 120/252 (47%), Gaps = 8/252 (3%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVS 125
P+G+P SV+ YL Y + + T VL T SLL A G+ A A +
Sbjct: 240 LPAGFPGSVSVDYLDYMLLQFPTNVTGWICHVLVTSSLLKAVGVGSFTGTSAAASAAAIR 299
Query: 126 WILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
W+ KDG+ G+L I GA D +PK+WR+ AD + G+ E+++PL P FL +A
Sbjct: 300 WVSKDGIGAFGRLLIGGRFGALFDDDPKKWRMYADIIGSAGSIFELMTPLYPGYFLPLAS 359
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST-- 242
LGN AK +A + I + FA+ GNL ++ AK E ++GL AG+ + T
Sbjct: 360 LGNLAKAVARGFKDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSAGVLIMDTAG 419
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
V SS + + ++H++ + + T+N +R ++V V +
Sbjct: 420 VKSSYLTLALTWLSVRLLHLWLRYQSLSVLKFRTINLKRGRILVRSHVAQHTVPGYVACN 479
Query: 303 YREDLLFPGRLI 314
E++L R +
Sbjct: 480 EEENILTWERFL 491
>gi|222624448|gb|EEE58580.1| hypothetical protein OsJ_09902 [Oryza sativa Japonica Group]
Length = 479
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 80/281 (28%), Positives = 136/281 (48%), Gaps = 31/281 (11%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ RV L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 59 WGRVGGRLLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKS 118
Query: 119 AQ--ATVVSWILKD--GM---------QHVGKLICSN------------------LGARM 147
A W L+D GM Q L SN LG+ +
Sbjct: 119 ATVIGATFQWFLRDLTGMLGGILFTFYQVSNSLANSNRQCHLIAIDSLFTDEITCLGSNL 178
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
DS K WR++AD + DLG +++LSPL P + + LG+ ++ VA+ ATR +
Sbjct: 179 DSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQH 238
Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIE 267
FA N +D+ AK + TL ++G+G G+ LA G + L++ H+Y+ +
Sbjct: 239 FALANNAADISAKEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYK 298
Query: 268 EMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
+++ + TLN +R+++++ F+ G + +P + +E +L
Sbjct: 299 AVQSLSLTTLNYERSSILLQYFMDNGEVLTPQQVSKQEHIL 339
>gi|218192328|gb|EEC74755.1| hypothetical protein OsI_10514 [Oryza sativa Indica Group]
Length = 479
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 80/281 (28%), Positives = 136/281 (48%), Gaps = 31/281 (11%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ RV L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 59 WGRVGGRLLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKS 118
Query: 119 AQ--ATVVSWILKD--GM---------QHVGKLICSN------------------LGARM 147
A W L+D GM Q L SN LG+ +
Sbjct: 119 ATVIGATFQWFLRDLTGMLGGILFTFYQVSNSLANSNRQCHRIAIDSLFTDEITCLGSNL 178
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
DS K WR++AD + DLG +++LSPL P + + LG+ ++ VA+ ATR +
Sbjct: 179 DSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQH 238
Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIE 267
FA N +D+ AK + TL ++G+G G+ LA G + L++ H+Y+ +
Sbjct: 239 FALANNAADISAKEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYK 298
Query: 268 EMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
+++ + TLN +R+++++ F+ G + +P + +E +L
Sbjct: 299 AVQSLSLTTLNYERSSIMLQYFMDNGEVLTPQQVSKQEHIL 339
>gi|302849057|ref|XP_002956059.1| hypothetical protein VOLCADRAFT_119273 [Volvox carteri f.
nagariensis]
gi|300258564|gb|EFJ42799.1| hypothetical protein VOLCADRAFT_119273 [Volvox carteri f.
nagariensis]
Length = 208
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 64/190 (33%), Positives = 95/190 (50%), Gaps = 5/190 (2%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
+ DS + F P G+P SV YL Y + H T L+T SLL A G+ +PA
Sbjct: 17 IPDSIRDLFLPPGFPDSVTPDYLPYQLWSLPTHVTGHLSHSLATSSLLTAVGVSTSPAAT 76
Query: 122 TVVS----WILKDGMQHVGKLIC-SNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
+S WI+KDG+ +G+ I S A D +P+RWR++A+ L G GLEV + L P
Sbjct: 77 VALSASIKWIIKDGVGALGRFIVGSRFSAEFDEDPRRWRLVAELLSTAGLGLEVATSLYP 136
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
Q F+ +A G F + + + I + FA+ N+ + AK E +VGL A
Sbjct: 137 QSFMLLASAGKFGQALGKGMGKPVFRVIQTHFARSQNVGAVAAKEEVWEVAAQMVGLMAS 196
Query: 237 IQLASTVCSS 246
+ + T+ S
Sbjct: 197 VAVLRTLEGS 206
>gi|453080329|gb|EMF08380.1| DUF647-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 464
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 76/256 (29%), Positives = 143/256 (55%), Gaps = 11/256 (4%)
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT 117
++H+ VD FL P G+P SV + YL Y + +LQ F+S+ ++LS++++L G+
Sbjct: 50 LWHKTVDVFL----PVGFPQSVTDDYLEYQIYDSLQAFSSSIAAMLSSRAVLEGVGVGDA 105
Query: 118 PAQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
A AT ++ IL+D + + ++ ++ G ++ E K +R+LAD L D G L+ LSP+
Sbjct: 106 TANATTALLLSILQDSVGRITTIMFAHRFGMSLEPECKMYRLLADILNDTGFVLDCLSPI 165
Query: 175 CPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
P+ + + + + + VAA + + + + FAK+GNL +L AK + T+ ++ G+
Sbjct: 166 FPKPTRVVILSFSSILRSLCGVAAGSAKASLSAHFAKQGNLGELNAKDSSQETVISLAGM 225
Query: 234 GAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
G + + V + + + LLS IH+ + +RA + TLN QR A ++ ++ G
Sbjct: 226 LFGSWVITWVKTPVATWTALILLLS-IHLETNRRAVRAVKMQTLNRQR-ATLLYHHLRKG 283
Query: 294 RISSPADLRYREDLLF 309
+ +P ++ +E L+F
Sbjct: 284 HLPTPIEVADQE-LIF 298
>gi|346318934|gb|EGX88536.1| DUF647 domain-containing protein [Cordyceps militaris CM01]
Length = 467
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 142/296 (47%), Gaps = 14/296 (4%)
Query: 14 EPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPS 73
E DKS VQ+ +Y + R +P GH + R + DSFL P
Sbjct: 28 ELDKSGHVQQKWLY-----EPSGRIVHIRPKGHTNDLFSRSLREWKKILSDSFL----PV 78
Query: 74 GYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDG 131
G+P SV+ YL Y F +LQ F S S+L+ ++LL G+ + AT ++ IL+D
Sbjct: 79 GFPHSVSSDYLAYQTFDSLQAFFSTITSLLANRALLQGLGVGDANSSATFALLLTILRDA 138
Query: 132 MQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFA 189
V + G R++ + KR+R LAD D LE+ SP L P + +G
Sbjct: 139 TSRVATIAFAHRFGLRIEPDAKRYRFLADLFNDSAFFLELYSPYLSPWGKVAALSIGEAL 198
Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
+ + V+A A++ + FAK NL++L AK + T ++GL + V +
Sbjct: 199 RALCGVSAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGL-LVGTVVVKVVEDHRS 257
Query: 250 KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
+ + LL H++ + + + TLN QR ++ ++++TG++ SPA++ RE
Sbjct: 258 VVFLMILLVFGHLWMNYLGVCSVEMTTLNRQRATILFQEYLQTGKVLSPAEVSKRE 313
>gi|417401281|gb|JAA47531.1| Hypothetical protein [Desmodus rotundus]
Length = 457
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 69/248 (27%), Positives = 117/248 (47%), Gaps = 14/248 (5%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
+ L F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A
Sbjct: 70 LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWR-ILADALYDLGTGLEVLSPLCPQLF 179
A +WI+KD +G++I + W+ + AD L D+ LE+++P+ P F
Sbjct: 130 AATATWIVKDSTGMLGRIIFA-----------WWKGLFADILNDVAMFLEIMAPIYPMCF 178
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
+ N AK + VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 179 TVTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 238
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
V + LL+ +HVY+ ++A + TLN R +++ F++ G + P
Sbjct: 239 LPLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPT 298
Query: 300 DLRYREDL 307
E L
Sbjct: 299 SANQMEPL 306
>gi|417401214|gb|JAA47499.1| Hypothetical protein [Desmodus rotundus]
Length = 453
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 69/248 (27%), Positives = 117/248 (47%), Gaps = 14/248 (5%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
+ L F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A
Sbjct: 70 LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWR-ILADALYDLGTGLEVLSPLCPQLF 179
A +WI+KD +G++I + W+ + AD L D+ LE+++P+ P F
Sbjct: 130 AATATWIVKDSTGMLGRIIFA-----------WWKGLFADILNDVAMFLEIMAPIYPMCF 178
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
+ N AK + VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 179 TVTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 238
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
V + LL+ +HVY+ ++A + TLN R +++ F++ G + P
Sbjct: 239 LPLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPT 298
Query: 300 DLRYREDL 307
E L
Sbjct: 299 SANQMEPL 306
>gi|302753580|ref|XP_002960214.1| hypothetical protein SELMODRAFT_75120 [Selaginella moellendorffii]
gi|300171153|gb|EFJ37753.1| hypothetical protein SELMODRAFT_75120 [Selaginella moellendorffii]
Length = 381
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 68/240 (28%), Positives = 117/240 (48%), Gaps = 7/240 (2%)
Query: 78 SVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV----VSWILKDGMQ 133
SV++ YL+Y ++ + T L T SLL A GL P A + W+ KDG+
Sbjct: 2 SVSQDYLQYMLWQLPTNITGWICRALITSSLLQAVGLSTEPGSAAAASAAIKWVTKDGLG 61
Query: 134 HVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
VG+L I G D +PK+WR+ A+ + G E+ +PL P FL +A LG+ K +
Sbjct: 62 AVGRLFIGGRFGGVFDEDPKQWRLYAEVIGSFGGLFELATPLAPDYFLLLASLGHLTKAV 121
Query: 193 AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST--VCSSMQGK 250
A + I + FA N+ D+ AK E ++GL G+ L ST V +S
Sbjct: 122 ARGLKDPSFRVIQNHFAISENVGDVAAKEEVWEVAGQLIGLSLGVVLLSTPVVATSYSQL 181
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
+ ++ H++ + + A + T+N +R +++ + ++ +S + +E+L+ P
Sbjct: 182 ALSWMIIRSFHLWLRYQTLAALELPTMNFKRLNVLIMEHIQGRPLSGVGECNKKENLILP 241
>gi|322707810|gb|EFY99388.1| DUF647 domain-containing protein [Metarhizium anisopliae ARSEF 23]
Length = 462
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 86/323 (26%), Positives = 155/323 (47%), Gaps = 19/323 (5%)
Query: 2 NLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHR 61
+L+++ + E DKS +VQ W SD+ + + + + ++ S + R
Sbjct: 20 DLIKQGDDVEISELDKSGNVQR---RWLHRSDNTIAILESKTEPN---SIIHSSVREWKR 73
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
+ + F P G+P SV+ YL Y + +LQ F S S+L+ ++LL G+ + A
Sbjct: 74 FLS---DAFLPIGFPHSVSNDYLAYQTYDSLQAFFSTITSLLANRALLQGLGVGDADSSA 130
Query: 122 T--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
T ++ IL D + ++ ++ G R++ + KR+R LAD D LE+ SP
Sbjct: 131 TFAMLITILTDATSRIATIVFAHRFGLRIEPDAKRFRFLADIFNDSAFFLELYSPYFGDY 190
Query: 179 FLEMA-GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
+A +G + + VAA A++ + FAK NL++L AK + T ++GL G
Sbjct: 191 GKILALSVGQALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLVVGT 250
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
L + Q + + L H++ +R+ + TLN QR ++ +++ TGR+ +
Sbjct: 251 -LVVKLVQDHQSVVYLMVALVFAHLWMNYLGVRSVCMTTLNRQRATILFREYMTTGRVLT 309
Query: 298 PADLRYREDLLF-----PGRLIE 315
P + RE+++ GRL E
Sbjct: 310 PEAVAARENIVLWLPVVGGRLAE 332
>gi|159467675|ref|XP_001692017.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278744|gb|EDP04507.1| predicted protein [Chlamydomonas reinhardtii]
Length = 216
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 68/211 (32%), Positives = 101/211 (47%), Gaps = 10/211 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS---- 125
F P GYP +V YL Y + H T L T SLL A G+ PA +S
Sbjct: 5 FLPPGYPQTVTPDYLSYQLWSVPTHITGHLSHALVTSSLLSAVGVSAGPAATVALSASIK 64
Query: 126 WILKDGMQHVGKLIC-SNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
WI+KDG+ +G+LI S A D +P+RWR++A+ L G GLEV + L P FL +A
Sbjct: 65 WIVKDGVGALGRLIVGSRFSAEFDEDPRRWRMVAELLSTCGMGLEVATVLYPHYFLLLAC 124
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
G FA+ + + I + FA+ N+ + AK E + GL A + + +
Sbjct: 125 SGKFAQALGKGMGKPVFRVIQTHFARAQNVGAVAAKEEVWEVTAQMAGLAASVAVLRALE 184
Query: 245 SSMQGKMIVGPLLSV--IHV---YSVIEEMR 270
+S +VG ++ +HV Y + +R
Sbjct: 185 ASGAADGVVGTWAAIQAVHVALRYRALATLR 215
>gi|452005321|gb|EMD97777.1| hypothetical protein COCHEDRAFT_1165025 [Cochliobolus
heterostrophus C5]
Length = 421
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 73/271 (26%), Positives = 143/271 (52%), Gaps = 13/271 (4%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQAT 122
++ F PSG+P SV E Y+ Y F +LQ F+S+ +L+++++L G+ TP A
Sbjct: 1 MIDVFLPSGFPQSVTEDYVEYQIFDSLQAFSSSIAGMLASRAVLQGVGVGDSNATPTAAL 60
Query: 123 VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFL 180
++S +L++ + + + ++ G ++ E K +R+ AD L D G L+ LSP P+ L +
Sbjct: 61 LLS-VLQESVGRMATIAFADRFGTALEPECKMYRLTADILNDSGMILDCLSPGFPKPLRV 119
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ + + + V A +++ + + FA GNL ++ AK + T+ +++G+ AG +
Sbjct: 120 AVLSFASCLRSLCGVCAGSSKASLSAHFATAGNLGEVNAKDSSQETVISLLGMLAGSFVV 179
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
S + S + + LL VIH+ + +RA + +LN QR ++ + ++ G + SP +
Sbjct: 180 SHITSDIATWCTL-ILLLVIHLATNYAAVRAVSMQSLNRQRANILFSHILQHGIVLSPKE 238
Query: 301 LRYREDLLFPGRLIEDAGNVKMDRHGIRAEC 331
+ RE R+ E G ++ + +C
Sbjct: 239 VSARE------RIFERGGVLRWADDEVLGKC 263
>gi|440638734|gb|ELR08653.1| hypothetical protein GMDG_03339 [Geomyces destructans 20631-21]
Length = 714
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 69/227 (30%), Positives = 121/227 (53%), Gaps = 7/227 (3%)
Query: 60 HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RP 116
H SF+ F P+GYP SV E Y+ Y + +LQ F+S+ +++++++L G+
Sbjct: 82 HSWAKSFMEIFLPAGYPNSVTEDYMEYQIYDSLQAFSSSIAGLIASRAVLEGVGVGDATA 141
Query: 117 TPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
+P A ++S +L+D + + + LG ++ E KRWR++AD D L+ LSP
Sbjct: 142 SPTTALLLS-VLQDSTGRIATIFFAARLGLSLEPECKRWRLVADIFNDAAMILDCLSPAF 200
Query: 176 PQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
P+ L + + + + VAA + + + + FAK GNL +L AK + T+ +++G+
Sbjct: 201 PKPLRVSLLSASAALRALCGVAAGSAKASLSAHFAKSGNLGELNAKDSSQETIISLLGML 260
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
G L + + +Q + LLS IH+ + +RA + TLN QR
Sbjct: 261 VGTALIPYLTTPLQTWTALLLLLS-IHLTTNYFAVRAVSMRTLNRQR 306
>gi|449301217|gb|EMC97228.1| hypothetical protein BAUCODRAFT_147370 [Baudoinia compniacensis
UAMH 10762]
Length = 454
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 81/293 (27%), Positives = 145/293 (49%), Gaps = 8/293 (2%)
Query: 30 ETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQF 89
+ + +VT Y P V VD P L+ F P+G+P SV Y+ Y +
Sbjct: 12 DEAGNVTATYVASPTSSDGVCRVDVIHPATKAPWQRLLDIFLPAGFPHSVTVDYVEYQIY 71
Query: 90 RALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLICSNL-GAR 146
+LQ F+S + +LS++++L + G+ A T ++ I ++ + +G + L G
Sbjct: 72 DSLQAFSSNIVGLLSSRAVLLSVGVGDADASPTYALLLHIAQESLGRIGTIAFVALYGTA 131
Query: 147 MDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL--GNFAKGIAVVAARATRLPI 204
++ E K +R+LAD D L+ LSP P+ +L + L + K + VAA + + +
Sbjct: 132 LEPECKMYRLLADVFNDFAFVLDCLSPAIPK-YLRVVTLSFSSLLKALCGVAAGSAKASL 190
Query: 205 YSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYS 264
+ FA+ GNL +L AK + T+ +++G+ G + S V + Q LL IH+ +
Sbjct: 191 SAHFAQWGNLGELNAKDSSQETVISLLGMWVGSLVLSWV-RTPQATWTTLILLLAIHLET 249
Query: 265 VIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDA 317
+RA + TLN QR A +V ++ G + +P ++ +E + G ++ A
Sbjct: 250 NRRAVRAVVMRTLNRQR-ATLVFHHLQRGHVPTPEEVSKQERIFERGGVLRSA 301
>gi|355736030|gb|AES11869.1| hypothetical protein [Mustela putorius furo]
Length = 218
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 56/167 (33%), Positives = 90/167 (53%), Gaps = 3/167 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A +W+
Sbjct: 52 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLAGSLATHAVLLGIGVGNAEASVSAATATWL 111
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G ++D K+WR+ AD L D+ LE+++P+ P F +
Sbjct: 112 VKDSAGMLGRIIFAWWKGTKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMFFTMIVCTS 171
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
N AK I VA ATR + A+ N++D+ AK + TL N+ GL
Sbjct: 172 NLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGL 218
>gi|380471002|emb|CCF47486.1| hypothetical protein CH063_04176 [Colletotrichum higginsianum]
Length = 478
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/258 (29%), Positives = 127/258 (49%), Gaps = 11/258 (4%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--V 123
+ F P GYP SV Y+ Y + +LQ F S +LS +++L G+ + AT +
Sbjct: 64 LFDAFLPIGYPDSVTPDYIGYQAYDSLQAFFSTITGLLSNRAILQGLGVGDPNSSATYAL 123
Query: 124 VSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLE 181
+ ILKDG+ V + G ++ E KR+R LAD D L++ SP P +
Sbjct: 124 LLTILKDGISRVATIAFAYRFGLVIEPECKRYRFLADIFNDSAFFLDLFSPYFDPWTKVA 183
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
L + + VAA A++ + FA+ NLS+L AK + T +VGL G S
Sbjct: 184 AIVLAEALRAMCGVAAGASKAALSKHFARRNNLSELNAKEASQETAIGLVGLLVG----S 239
Query: 242 TVCSSMQGKMIVGPL---LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
V ++G+ V L L +H+ +R ++TLN QR ++ + +T R+ +P
Sbjct: 240 IVVRFVEGREAVFVLMVFLVFVHLAMNYFGVRCVQLDTLNQQRATILFDHYAQTKRVLTP 299
Query: 299 ADLRYREDLLFPGRLIED 316
+ +RE+++F +I++
Sbjct: 300 KQVAWRENIVFWSPVIKN 317
>gi|397563898|gb|EJK43986.1| hypothetical protein THAOC_37516 [Thalassiosira oceanica]
Length = 542
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 85/293 (29%), Positives = 133/293 (45%), Gaps = 48/293 (16%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QAT 122
SF++ F P+ YP SV Y + + + +A VLSTQ+LL A G+ A A
Sbjct: 110 SFVHHFLPAKYPESVCSSYSTFASYCMCANIAGSAAMVLSTQALLVAVGVGSQSAAPMAA 169
Query: 123 VVSWILKD-GMQHVGKLICSNLG---------------------------------ARMD 148
++W+LKD Q G L S LG D
Sbjct: 170 ALNWLLKDFAGQLGGVLFVSQLGIDHEFWKRKINRLFRENNQGNNTGRPKKGNYQRGTAD 229
Query: 149 SEPKRWRILADALYDLGTGLEVLSPLC-PQLFLEMAGLGNFAKGIAVVAARATRLPI--- 204
+ PKRWR++A DL T LE+ +PL PQ FL A + N K I +AA A+R I
Sbjct: 230 TNPKRWRMVAALALDLSTLLEICTPLLGPQYFLPCASVANVGKNIGWLAASASRAAINQS 289
Query: 205 -----YSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ--GKMIVGPLL 257
S + NL D+ AK + + + ++VG GI + T+ + G M +L
Sbjct: 290 LSAGGSYSSSSGSNLGDVTAKSGSQAIVASLVGTALGIFFSKTLYADHGTVGVMSGFIVL 349
Query: 258 SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
S +H + + + A P+ +L+ R +++++F +T + SP+ + +E FP
Sbjct: 350 SAVHQVATYKALMAVPLRSLDRHRLHIVLSEFSRTSSVPSPSQVATKEK-FFP 401
>gi|79561141|ref|NP_179928.2| uncharacterized protein [Arabidopsis thaliana]
gi|51970902|dbj|BAD44143.1| unnamed protein product [Arabidopsis thaliana]
gi|330252365|gb|AEC07459.1| uncharacterized protein [Arabidopsis thaliana]
Length = 520
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 70/263 (26%), Positives = 134/263 (50%), Gaps = 10/263 (3%)
Query: 52 VDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFA 111
+DD R+ S FF P V++ Y+ Y +++ L S+AL VL+TQ++ A
Sbjct: 104 LDDGFWKLIRLTSSAAKDFF---LPKQVSDNYISYVKWKFLHRVFSSALQVLATQAMFRA 160
Query: 112 AGLRPTP--AQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGL 168
G+ + A + +WILKDG+ + + I ++L + D+ KR R L+ L G+
Sbjct: 161 IGIGQSRSLASSAAFNWILKDGLGRLSRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGV 220
Query: 169 EVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
E+++P+ PQ FL +A + N AK I++ AT ++ SFA NL ++ AK + + F
Sbjct: 221 ELMTPVFPQYFLLLASIANIAKQISLSCYLATGSAVHRSFAVADNLGEVSAKAQIQTVCF 280
Query: 229 NVVGLGAGIQLASTVCSSMQGK----MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAM 284
+ +GL + L + + + ++ P+ S + + + ++ + TL R +
Sbjct: 281 DNLGLLLAVLLNMLFQHNQRLQACLPFVLYPIFSTFDLLGIYQGLKHINLQTLTKDRLEI 340
Query: 285 IVADFVKTGRISSPADLRYREDL 307
I+ +++ ++ SPA++ E +
Sbjct: 341 ILERWIEFRQVPSPAEVSEEEGI 363
>gi|297825247|ref|XP_002880506.1| hypothetical protein ARALYDRAFT_481217 [Arabidopsis lyrata subsp.
lyrata]
gi|297326345|gb|EFH56765.1| hypothetical protein ARALYDRAFT_481217 [Arabidopsis lyrata subsp.
lyrata]
Length = 519
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 74/266 (27%), Positives = 137/266 (51%), Gaps = 12/266 (4%)
Query: 52 VDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFA 111
+DD R+ S FF P V++ Y+ Y +++ L S+AL VL+TQ + A
Sbjct: 103 LDDGFWKLIRLSSSAAKDFF---LPKQVSDNYISYVKWKFLHRVFSSALQVLATQVMFRA 159
Query: 112 AGL--RPTPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGL 168
G+ + A + +WILKDG+ + + I ++L + D+ KR R L+ L G+
Sbjct: 160 IGIGQSRSLASSAAFNWILKDGLGRLSRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGV 219
Query: 169 EVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
E+++P+ PQ FL +A + N AK I++ AT ++ SFA NL ++ AK + + F
Sbjct: 220 ELMTPVFPQYFLLLASIANIAKQISLSCYLATGSAVHRSFAVADNLGEVSAKAQIQTVCF 279
Query: 229 NVVGLGAGIQLASTVCSS--MQGKM--IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAM 284
+ +GL + L ++ +Q + ++ P+ S + + + ++ + TL R +
Sbjct: 280 DNLGLLLAVLLNMLFQNNQRLQASLPFVLYPIFSTFDLLGIYQGLKHINLQTLTKDRLEI 339
Query: 285 IVADFVKTGRISSPADLRYRE--DLL 308
I+ +++ ++ SPA++ E DLL
Sbjct: 340 ILERWIEFRQVPSPAEVSEEEGIDLL 365
>gi|171678219|ref|XP_001904059.1| hypothetical protein [Podospora anserina S mat+]
gi|170937179|emb|CAP61836.1| unnamed protein product [Podospora anserina S mat+]
Length = 448
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 71/254 (27%), Positives = 134/254 (52%), Gaps = 9/254 (3%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT-- 122
+ L F P+GYP +V YL Y + +LQ F S+ S+L+++++L G+ + A T
Sbjct: 41 ALLYAFLPAGYPHTVTTDYLPYQTYDSLQAFASSITSLLASRAVLEGLGVGSSEASPTGA 100
Query: 123 VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL-FL 180
++ I D + + ++ ++ +G ++ E K +R LAD D L++L+P P L L
Sbjct: 101 LILKITGDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDAAQFLDLLTPALPYLPKL 160
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ + + VAA A++ + + FA GNL++L AK + T+ +++G+ G +
Sbjct: 161 GVIVSAGVLRSLCGVAANASKASLSAHFAVTGNLAELNAKEASQETVVSLLGMLVGSLVV 220
Query: 241 STVCSSMQ--GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
V + G MI+ L+ H+ +RA + +LN QR ++ +++++G + +P
Sbjct: 221 RLVEDKHKVWGLMII---LAGCHLAMNYRAVRAVRMTSLNRQRATIVFREWLESGTVLNP 277
Query: 299 ADLRYREDLLFPGR 312
A + RE +L GR
Sbjct: 278 AQVSQRESILMNGR 291
>gi|429848918|gb|ELA24353.1| duf647 domain-containing protein [Colletotrichum gloeosporioides
Nara gc5]
Length = 440
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 153/309 (49%), Gaps = 20/309 (6%)
Query: 13 KEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVV-----DSFL 67
KEP K++ + E ++S + R + DG L+ ++S R++ F
Sbjct: 5 KEP-KAIVIDER-----DSSGLIRRRWLHDHDGRLTEMPTENSPTTISRLLTMNIRQVFS 58
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVS 125
+ FFP GYP SV++ YL Y + +LQ F S S+L+ +++L G+ + AT ++
Sbjct: 59 DAFFPIGYPDSVSKDYLGYQLYDSLQAFFSTITSLLANRAILQGLGVGDADSSATYALLL 118
Query: 126 WILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
ILKDG+ + + G ++ E K++R LAD D L++ SPL + ++A
Sbjct: 119 TILKDGISRLATIGFAYRFGLVIEPECKKYRFLADIFNDSAFFLDLFSPLFGS-WTKVAA 177
Query: 185 L--GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
L + + VAA A++ + FA GNLS+L AK + T ++GL G +
Sbjct: 178 LVVAEALRAMCGVAAGASKAALSKHFALRGNLSELNAKESSQETAVGLIGLIVGSFVVRH 237
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRIS-SPAD 300
V S + + +L +H+ +R +N N QR + +++KT G + +PA+
Sbjct: 238 V-ESREAVFALMIILVFVHLGMNYLGVRCVQLNNFNQQRATIFFEEYMKTMGNVRLTPAE 296
Query: 301 LRYREDLLF 309
+ RE+++F
Sbjct: 297 VAKRENIIF 305
>gi|350646547|emb|CCD58759.1| hypothetical protein Smp_142110 [Schistosoma mansoni]
Length = 423
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 71/242 (29%), Positives = 112/242 (46%), Gaps = 17/242 (7%)
Query: 67 LNKFF-PSGYPFSVNEGYLRYTQFRALQH-------------FTSAALSVLSTQSLLFAA 112
L KFF P GYP SV++ YL Y + +Q F S+ L++Q++L
Sbjct: 41 LKKFFLPVGYPSSVSDDYLEYQIWDTIQAGIFFITKISLFSAFASSITGALASQAVLIGV 100
Query: 113 GLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLE 169
G+ + A + ++W+ KDG +G++I + G ++D + K WR +AD L D LE
Sbjct: 101 GVGDSSATILSASLTWMFKDGSGMIGRIIFAGYHGIKLDCDCKFWRFVADILNDCALFLE 160
Query: 170 VLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
++SPL LF + L N K + VA ATR I A NL+D+ AK + TL N
Sbjct: 161 IISPLFNYLFTPLLCLANVLKSLVGVAGSATRAAIVQHQAINNNLADVSAKDGSQETLSN 220
Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
++ L V + + IH+YS ++ + T N R + + +
Sbjct: 221 LMAWLLNFILLYMVTGNQFLIWFCFICCTSIHLYSNYRAVKCLKLRTFNRTRFHLAIQQW 280
Query: 290 VK 291
K
Sbjct: 281 FK 282
>gi|398388772|ref|XP_003847847.1| hypothetical protein MYCGRDRAFT_63982 [Zymoseptoria tritici IPO323]
gi|339467721|gb|EGP82823.1| hypothetical protein MYCGRDRAFT_63982 [Zymoseptoria tritici IPO323]
Length = 457
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 66/224 (29%), Positives = 124/224 (55%), Gaps = 7/224 (3%)
Query: 67 LNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSW 126
++ F P+G+P SV E Y Y + +LQ F+S+ ++LS++++L + G+ T A T +
Sbjct: 54 MDVFLPAGFPHSVTEDYTEYQIYDSLQAFSSSIAAMLSSRAVLSSIGVGDTKASPT-AAL 112
Query: 127 ILKDGMQHVGK----LICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLE 181
+L + VG+ L G ++ E K +R++AD L D L+VLSP+ P+ + +
Sbjct: 113 LLSVLQESVGRLATILFAHRFGTSLEPECKMYRLMADVLNDSAFILDVLSPIFPKPIRIV 172
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+ + + + VAA +++ + + FA+ GNL +L AK + T+ +++G+ AG + S
Sbjct: 173 ILSFSSILRSLCGVAAGSSKASLSAHFARWGNLGELNAKDSSQETVISLMGMLAGSLVIS 232
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
+ + + + LLS IH+ + +RA + TLN QR ++
Sbjct: 233 WITTPLATWTALIGLLS-IHLETNRRAVRAVKMRTLNRQRATLV 275
>gi|145347876|ref|XP_001418386.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578615|gb|ABO96679.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 362
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 70/240 (29%), Positives = 111/240 (46%), Gaps = 20/240 (8%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ---ATVVSW 126
F PSG+P + + Y+ + +++ L L ++S QSLL A G+ TP W
Sbjct: 8 FLPSGFPATTSADYVEWLRWQLLSLLFRDVLEIMSAQSLLVALGMGNTPGALPLTAAAKW 67
Query: 127 ILKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
+LKDG+ LI + G R D +PKRW L++ L D+ +E+++P P LFL +A
Sbjct: 68 VLKDGVGSFATLIAGSFGGQRYDEDPKRWWGLSNTLEDVARAIELVTPAAPGLFLPLAAS 127
Query: 186 GNFAKGIAVVAARATRL--PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
F + A + R + L F + NL D+ AK E ++ L AGIQ+ V
Sbjct: 128 ATFVRA-AALTGRGSLLNGTFMQHFGRNNNLGDIRAKLEVQGRWLALIALPAGIQVFQLV 186
Query: 244 CSSMQ---------GKMIV--GPLLSVI--HVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
++ G + G VI HV++ + ++ +TLN R + FV
Sbjct: 187 SAAAADLAADGDEFGAFVAAFGAYGGVIGVHVFACWQAAKSLKFDTLNRFRLLKLAEAFV 246
>gi|367049005|ref|XP_003654882.1| hypothetical protein THITE_131994 [Thielavia terrestris NRRL 8126]
gi|347002145|gb|AEO68546.1| hypothetical protein THITE_131994 [Thielavia terrestris NRRL 8126]
Length = 451
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 71/259 (27%), Positives = 138/259 (53%), Gaps = 13/259 (5%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
++ + L F P+GYP +V + YL Y + +LQ F S+ S+L+ +++L G+ + +
Sbjct: 37 LLKAILYAFLPAGYPHTVTDDYLAYQTYDSLQAFASSITSLLANRAVLEGLGVGDSSSSP 96
Query: 122 T--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
T ++ I D + + ++ ++ +G ++ E K +R LAD D L++L+P P
Sbjct: 97 TGALILKITGDTISRIATILFAHRMGQAIEPECKFYRFLADLFNDSAQFLDLLTPALP-Y 155
Query: 179 FLEMAGL--GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
F ++A + + + VAA A++ + + FA GNL++L AK + T+ +++G+ AG
Sbjct: 156 FPKLAVIVSAGVLRSLCGVAAGASKASLSAHFAVTGNLAELNAKEASQETVVSLLGMLAG 215
Query: 237 IQLASTVCSSMQGKMIVGPL---LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
S V ++ K +V L L+ +H+ +R + +LN QR ++ +++ G
Sbjct: 216 ----SLVVRLVEDKQLVWLLMVALAAVHLIMNYRAVRCVRMRSLNRQRATIVFREWLDHG 271
Query: 294 RISSPADLRYREDLLFPGR 312
+ +PA + RE +L GR
Sbjct: 272 AVLTPAQVARRESILRNGR 290
>gi|281211285|gb|EFA85450.1| DUF647 family protein [Polysphondylium pallidum PN500]
Length = 427
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 56/166 (33%), Positives = 91/166 (54%), Gaps = 3/166 (1%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTP 118
++ ++F F PSGYP SV++ Y Y ++ +Q+ + +LST +LL GL +
Sbjct: 101 KLYETFKTLFLPSGYPESVSQDYASYQRWIFVQNTLGSVTYMLSTHALLTSVGVGLSASL 160
Query: 119 AQATVVSWILKDGMQHVG-KLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
+ +SW+LKDG+ L S +D + KR++ D L++ G LE +P P
Sbjct: 161 PFSAAISWVLKDGLGASALVLFASKYSTSLDFDLKRFKFRGDFLHNFGVFLETCTPFLPG 220
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
FL MA + N AKGIA + +TR ++ SFA + N+ D+ AK ++
Sbjct: 221 YFLLMASISNLAKGIAGLIYGSTRASLHKSFALKENIGDITAKYQS 266
>gi|224013726|ref|XP_002296527.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968879|gb|EED87223.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 719
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 81/324 (25%), Positives = 134/324 (41%), Gaps = 80/324 (24%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATV 123
++ F P+ YP SV Y Y + L ++ VLSTQ+LL A G+ A A
Sbjct: 239 LIHHFLPANYPQSVCPSYATYASYCFLGSIAGSSAMVLSTQALLVAVGVGTQSAAPMAAA 298
Query: 124 VSWILKDGMQHVGKLI-CSNLG-----------------------------------ARM 147
++W++KDG+ +G ++ S LG
Sbjct: 299 LNWVMKDGVGQLGGVVFASQLGKGGVDVDYWKVKLNKLMGSSSSKSTETSARGNFQRGTA 358
Query: 148 DSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAKGIAVVAARATRLPIYS 206
DS PKRWR++A DL T LE+ +P + PQ FL A + N K + +AA A+R ++
Sbjct: 359 DSNPKRWRMVAALALDLSTLLEICTPWMGPQWFLPCASIANIGKNVGFLAASASRAAVHQ 418
Query: 207 SFAKEG-------------------------------NLSDLFAKGEAISTLFNVVGLGA 235
S G NL D+ AK + + + +++G
Sbjct: 419 SLCTGGSFLPPVPDNTTEKDESYDNPKKGKNVTSSSNNLGDVTAKSGSQAIVASLLGTAL 478
Query: 236 GIQLASTVCSSMQGKMIVGP--LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
GI L+ T CS I+ +LS +H ++A P+ +++ R +I+ ++
Sbjct: 479 GIWLSRTFCSDYGTAGILAGFVILSAVHQVCTYRALKAVPLRSVDRHRLHIILNAYIT-- 536
Query: 294 RISSPADLRYREDL-LFPGRLIED 316
AD R + + L P ++ E+
Sbjct: 537 -----ADGRSVQSMVLTPSQVAEE 555
>gi|116205700|ref|XP_001228659.1| hypothetical protein CHGG_02143 [Chaetomium globosum CBS 148.51]
gi|88182740|gb|EAQ90208.1| hypothetical protein CHGG_02143 [Chaetomium globosum CBS 148.51]
Length = 448
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 70/255 (27%), Positives = 132/255 (51%), Gaps = 11/255 (4%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT-- 122
+ L F P+GYP SV + YL Y + +LQ F S+ S+L+ +++L G+ + + T
Sbjct: 39 AILYAFLPAGYPHSVTDDYLPYQTYDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGA 98
Query: 123 VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL-FL 180
++ I D + + ++ ++ +G ++ E K +R LAD D L++L+P P L
Sbjct: 99 LILKITGDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDSAQFLDLLTPALPYFPKL 158
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ + + VAA A++ + + FA GNL++L AK + T+ +++G+ G
Sbjct: 159 GVIVSAGVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLVG---- 214
Query: 241 STVCSSMQGKMIVGPLLSV---IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
S V ++ K +V L+ V +H+ +RA + +LN QR ++ +++ G + +
Sbjct: 215 SLVVKMVEDKQVVWMLMVVLAGVHLAMNYHAVRAVKMRSLNRQRATLVFREWLDRGTVLT 274
Query: 298 PADLRYREDLLFPGR 312
P + RE +L GR
Sbjct: 275 PEQVSQRESILRKGR 289
>gi|322700393|gb|EFY92148.1| DUF647 domain-containing protein [Metarhizium acridum CQMa 102]
Length = 466
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 81/312 (25%), Positives = 149/312 (47%), Gaps = 14/312 (4%)
Query: 2 NLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHR 61
+L++K + E DKS +VQ W SD+ + + + ++ S + R
Sbjct: 20 DLIKKGDDVEISELDKSGNVQR---RWLHRSDNTIAVLESETE---PSSIIHSSVREWKR 73
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
+ + F P G+P SV+ YL Y + +LQ F S S+L+ ++LL G+ + A
Sbjct: 74 FLS---DAFLPIGFPHSVSNDYLAYQTYDSLQAFFSTITSLLANRALLQGLGVGDADSSA 130
Query: 122 T--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
T ++ IL D + ++ ++ G R++ + KR+R LAD D LE+ SP
Sbjct: 131 TFAMLITILTDATSRIATIVFAHRFGLRIEPDAKRFRFLADIFNDTAFFLELYSPYFGDY 190
Query: 179 FLEMA-GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
+A +G + + VAA A++ + FAK NL++L AK + T ++GL G
Sbjct: 191 GKILALSVGQALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLLVGT 250
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
L + Q + + L + H++ +R+ + T N QR ++ +++ TG + +
Sbjct: 251 -LVVKLVQDHQSVVYLMVTLVLAHLWMNYLGVRSVCMTTFNRQRATILFREYMTTGTVLT 309
Query: 298 PADLRYREDLLF 309
P + RE+++
Sbjct: 310 PEAVAARENIVL 321
>gi|224094527|ref|XP_002310175.1| predicted protein [Populus trichocarpa]
gi|222853078|gb|EEE90625.1| predicted protein [Populus trichocarpa]
Length = 333
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 65/232 (28%), Positives = 118/232 (50%), Gaps = 18/232 (7%)
Query: 86 YTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLI-CS 141
Y +++ L S+AL VL+TQ++ A G+ R P+ A ++W+LKDG+ + + I +
Sbjct: 3 YVKWKFLHRVFSSALQVLATQAMFRAIGIGYSRSLPS-AAALNWVLKDGLGRLSRCIYTA 61
Query: 142 NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATR 201
+L + D+ KR R L+ L G+E+L+P PQ FL +A L N AK I++ AT
Sbjct: 62 SLASAFDTNLKRVRFTTSVLFSLSIGIELLTPTFPQYFLLLATLANIAKQISLACYLATG 121
Query: 202 LPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM---------- 251
++ SFA N+ ++ AK + + F+ +GL L ++ + +
Sbjct: 122 SAVHRSFAIADNIGEVSAKAQIQTVSFDNLGLMLAALLNMLFKNNQRFSLSFPTMSFCFC 181
Query: 252 ---IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
V P+ S I ++ + + ++ + TL R +I+ ++ G + SPA+
Sbjct: 182 LPFFVYPIFSAIDLFGIYQGLQHVHLQTLTKDRLEIILNSWIDFGHVPSPAE 233
>gi|356566668|ref|XP_003551552.1| PREDICTED: uncharacterized protein LOC100787355 [Glycine max]
Length = 1041
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 65/226 (28%), Positives = 109/226 (48%), Gaps = 8/226 (3%)
Query: 64 DSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTP 118
D+ + P+G+P SV++ YL Y + + T L T SLL A G+
Sbjct: 649 DTIKDFILPAGFPGSVSDDYLDYMLLQFPTNVTGWICHTLVTSSLLKAVGIGSFSGTSAT 708
Query: 119 AQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A A+ + W+ KDG+ VG+L + G+ D +PK+WR+ AD + G+ + + + P
Sbjct: 709 ASASAIRWVSKDGIGAVGRLCLGGRFGSLFDDDPKQWRMYADFIGSAGSIFYLTTQVYPD 768
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
FL +A LGN K +A + I + FA GNL ++ AK E + ++GL GI
Sbjct: 769 YFLPLASLGNLTKAVARGLKDPSFCVIQNHFAISGNLGEVAAKEEIWEVVAQLIGLALGI 828
Query: 238 QLASTVCSSMQGKMIVGPLLSV--IHVYSVIEEMRAAPVNTLNPQR 281
+ T ++ P L + +H++ + + NT+N +R
Sbjct: 829 LILDTPSLVKSYGVLSLPWLGMQFLHLWLRYKSLSVLQFNTINLKR 874
>gi|443714847|gb|ELU07084.1| hypothetical protein CAPTEDRAFT_141059, partial [Capitella teleta]
Length = 266
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 123/251 (49%), Gaps = 5/251 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P G+P SV++ Y++Y + +Q F S+ + L+TQ++L G+ A A ++W+
Sbjct: 6 FLPKGFPASVSDDYVQYQIWDTVQAFASSITNTLATQAVLKGVGVGDEQATVLAATLTWL 65
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KDG +G+++ + + G+ +D + KRWR+ AD L D+ +++ +P F + +
Sbjct: 66 MKDGTGMLGRILFTWIQGSYLDCDCKRWRLFADILNDVSILMDICAPFFRVYFTLIVCVA 125
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
+ I VA ATR + A N++D+ AK + TL N+ L + L V
Sbjct: 126 GVCRSIVGVAGGATRAAVTQHQACRSNMADVSAKDGSQETLVNLAALLCNLALVPMVSDR 185
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV--KTGRISSPADLRYR 304
I+ + +H+Y+ + +R+ + T N R + +F+ TG + R
Sbjct: 186 QWLVWILFTCFTTLHIYANFKAVRSLDMPTFNQVRLHLAAQEFISDNTGNFPGVKTVNQR 245
Query: 305 EDLLFPGRLIE 315
E +L L+
Sbjct: 246 EPVLCSESLLS 256
>gi|427789495|gb|JAA60199.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 464
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 121/248 (48%), Gaps = 8/248 (3%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
F F P GYP SV+ Y Y + ++Q F S+ L+TQS+L G+ A A
Sbjct: 45 FSEIFLPVGYPDSVSRDYTEYQIWDSIQAFASSITGTLATQSVLSGVGVGDQGASVLAAT 104
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+WIL+DG G+++ + G+ +D + K+WR+ AD L D+ LE++ +
Sbjct: 105 TTWILRDGTGMTGRILFAWFQGSNLDYDSKKWRLFADILNDVAIFLELMCQYMKGYVTAV 164
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ + AK I VA ATR + A+ N++D+ AK + TL N+ + L
Sbjct: 165 LCVSSVAKSIVGVAGGATRAALTQHQARNNNMADVSAKDGSQETLVNLAAFLFSLLLLRI 224
Query: 243 VCSSMQGKMIVGPLLS--VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPA 299
V + ++ G +S ++H+++ + + T N R A++V F++T G I+
Sbjct: 225 VAGNPW--LLYGIFISFTLLHIFANYRAVSCVVMETFNRSRYAIVVRRFLETAGDIAPVT 282
Query: 300 DLRYREDL 307
+ RE +
Sbjct: 283 WVNARESV 290
>gi|147807901|emb|CAN62157.1| hypothetical protein VITISV_025492 [Vitis vinifera]
Length = 582
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/131 (38%), Positives = 72/131 (54%), Gaps = 1/131 (0%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQATVV 124
FL P G+P SV YL YT +R +Q S VL+TQ+LL+A GL + A V
Sbjct: 216 FLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAV 275
Query: 125 SWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
+W+LKDG+ ++ K++ S G D PK WR+ AD L + GLE+L+P P FL +
Sbjct: 276 NWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGA 335
Query: 185 LGNFAKGIAVV 195
+ + A +
Sbjct: 336 VAGAGRSAAAL 346
>gi|302847291|ref|XP_002955180.1| hypothetical protein VOLCADRAFT_121395 [Volvox carteri f.
nagariensis]
gi|300259472|gb|EFJ43699.1| hypothetical protein VOLCADRAFT_121395 [Volvox carteri f.
nagariensis]
Length = 1055
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 77/328 (23%), Positives = 153/328 (46%), Gaps = 32/328 (9%)
Query: 5 EKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQP---DGH----LSVKVVDDSRP 57
E ++ P S++ +P+ +V + Q P DG +S + DS
Sbjct: 218 EASERHSNGHPSAPASLERLPV-------AVKKGDQLLPYVWDGRRITAMSEAALQDSDV 270
Query: 58 VYHRVVDSFL-------NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLF 110
R+ + L F P P V Y + ++R Q F S+ + S +LL
Sbjct: 271 TTGRLRNWLLGVQRQLKEAFLPD--PRDVTPDYWEWLRWRLTQRFFSSTMQNFSFSALLM 328
Query: 111 AAGL--RPTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTG 167
A GL + A + ++W+LKDG+ + ++ + ++ G D++ KR R + ++
Sbjct: 329 ATGLGAKKAFAASAAINWLLKDGVSRIVRMSVSTSFGQTFDADLKRMRFITSLIFTACMA 388
Query: 168 LEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTL 227
E +P PQ F+ +A + + + + + A AT+ ++ A GNL+DL +K +A +
Sbjct: 389 GEFATPFYPQHFVALASISSVGRAVGLSAFVATQPAFQAALATGGNLADLTSKNQAQHMV 448
Query: 228 FNVVGLGAGIQLASTVCSSM-QGKMIVG----PLLSVIHVYSVIEEMRAAPVNTLNPQRT 282
+++ LG L + +C M +G +++ PL + + + E++A + TLN +R
Sbjct: 449 MDMLALGVSAGL-TWLCRGMPRGGLLLPAVMYPLCAAGDLTCIWHELKAVQLRTLNRERA 507
Query: 283 AMIVADFVKTGRISSPADLRYREDLLFP 310
M++ +++ G + A++ E+L+ P
Sbjct: 508 EMLIERWMRRGAVPDAAEISAAENLILP 535
>gi|320593995|gb|EFX06398.1| duf647 domain containing protein [Grosmannia clavigera kw1407]
Length = 458
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 73/253 (28%), Positives = 141/253 (55%), Gaps = 17/253 (6%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
F P+GYP SV + YL Y + +LQ F+S+ S+L+ +++L G+ +P A V+
Sbjct: 47 FLPAGYPQSVTDDYLAYQTYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPTAALVLK- 105
Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLS---PLCPQLFLEM 182
I++D + ++ ++ +G ++ E K +R +AD D L++L+ PL P++ + +
Sbjct: 106 IIQDTFSRMATILFAHRMGQAIEPECKTYRFMADLFNDSALMLDLLTPVLPLYPKVCIMV 165
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
A + + + +AA A++ + + FA+ GNL++L AK + T+ +++G+ AG +
Sbjct: 166 A--SSILRSLCGIAASASKASLSAHFARTGNLAELNAKEASQETVISLLGMLAG----TA 219
Query: 243 VCSSMQGKMIV---GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
V +Q K V +L ++H+ +R + TLN QR ++ ++++TG+I SP
Sbjct: 220 VVHVVQDKTAVLYWMVMLLMVHLVMNYRGVRCVKMLTLNRQRATIVFREYLETGKILSPG 279
Query: 300 DLRYREDLLFPGR 312
++ RE +L GR
Sbjct: 280 EVAERESVLLGGR 292
>gi|384249834|gb|EIE23315.1| hypothetical protein COCSUDRAFT_15701 [Coccomyxa subellipsoidea
C-169]
Length = 265
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/228 (28%), Positives = 112/228 (49%), Gaps = 7/228 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV----VS 125
F P GYP +V YLR+ + H T + L+T SLL A G+ + AT +
Sbjct: 9 FLPRGYPETVTPDYLRFQLWAIPAHITGWMSTSLATSSLLKAVGVGNSAVGATAAAAAIK 68
Query: 126 WILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
WI KDG+ G+ L+ LG D +PKRWR++A+A +G LE+ + P F+ +AG
Sbjct: 69 WITKDGIGAAGRVLVGGRLGNVFDEDPKRWRMVAEAFLTVGLALEIATAFSPANFIVLAG 128
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
GN ++ + I + FA N+ D+ AK E ++GL + L +
Sbjct: 129 AGNLSRAVGRGMTNPCFRIIQTHFAAASNVGDVAAKEEVWEVSAQLLGLAGSVGLLKAIE 188
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVN--TLNPQRTAMIVADFV 290
++ + + ++G +V ++S++ A + ++N +R ++ A V
Sbjct: 189 ATGKPENVLGVWAAVQVLHSLLRYKSLAVLQFPSVNQKRACLLAAAHV 236
>gi|367028787|ref|XP_003663677.1| hypothetical protein MYCTH_103025 [Myceliophthora thermophila ATCC
42464]
gi|347010947|gb|AEO58432.1| hypothetical protein MYCTH_103025 [Myceliophthora thermophila ATCC
42464]
Length = 448
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 69/258 (26%), Positives = 134/258 (51%), Gaps = 11/258 (4%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
++ + L F P+GYP +V YL Y F +LQ F S+ S+L+ +++L G+ + +
Sbjct: 36 LLKAILYAFLPAGYPHTVTGDYLAYQTFDSLQAFASSITSLLANRAVLEGLGVGDSSSSP 95
Query: 122 T--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
T ++ I D + + ++ ++ +G ++ E K +R LAD D L++L+P P L
Sbjct: 96 TGALILKITGDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDSAQFLDLLTPALPYL 155
Query: 179 -FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
L + + + VAA A++ + + FA GNL++L AK + T+ +++G+ G
Sbjct: 156 PKLGVIVSAGVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLVG- 214
Query: 238 QLASTVCSSMQGKMIVGPLLSV---IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
S V ++ K +V L+ + +H+ +R+ + +LN QR ++ +++ G
Sbjct: 215 ---SLVVRMVEDKQVVWMLMVILAGVHLTMNYRAVRSVKMRSLNRQRATIVFREWLDHGV 271
Query: 295 ISSPADLRYREDLLFPGR 312
+ +P + RE +L GR
Sbjct: 272 VLTPEQVAQRESILGNGR 289
>gi|328703155|ref|XP_001949028.2| PREDICTED: UPF0420 protein C16orf58 homolog [Acyrthosiphon pisum]
Length = 457
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 61/219 (27%), Positives = 112/219 (51%), Gaps = 9/219 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV--VSWI 127
F P G+P SV++ Y Y + +Q F S ++L+T ++L G+ A A +++I
Sbjct: 56 FLPRGFPNSVSKDYTEYQIWDTIQAFCSTISNILATNAILKGVGVGDAQATALAASITYI 115
Query: 128 LKDGMQHVGKLI---CSNLGARMDSEPKRWRILADALYDLGTGLEVL--SPLCPQLFLEM 182
LKDG +G++ C G+ +D++ K+WR+ ADAL DL +E+L P Q L +
Sbjct: 116 LKDGAGMIGRITFAWCK--GSELDTQCKKWRLRADALNDLAIFIELLLSIPWIRQFSLII 173
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ AK I VA ATR + A N+ D+ AK + T N++ G+ +
Sbjct: 174 LSFSSCAKSIVSVAGGATRAALTQHQAIRDNMGDVSAKDGSQETCINLIAFLIGLIMLPI 233
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
V + + ++ +++ +H+++ + +++ +N N R
Sbjct: 234 VENRILLIWMIYLVVTSLHLFANYKAVKSLNINVFNSAR 272
>gi|384252100|gb|EIE25577.1| DUF647-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 286
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 111/221 (50%), Gaps = 5/221 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
F P GYP +V Y+ + + ++Q S ++ + ++L G+ + T V +
Sbjct: 49 FLPDGYPDAVTSDYVGFQIWDSIQALCSYVRGMICSNAILTGIGVGKQATSPLVAVFQFF 108
Query: 128 LKD--GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
++D GM G L G+++D+ K+WR+ AD ++G L++ SPL P FL +A L
Sbjct: 109 VRDLAGMCG-GVLFAFTQGSQLDAYAKQWRLFADIFNNVGYALDLASPLFPDQFLLLACL 167
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
G+ A+ + VA ATR + F++ N D+ AK + T ++G+ G+
Sbjct: 168 GSLARAVTGVAGGATRAALTQHFSRADNAGDISAKEGSQETATTLIGMILGMAFLQAASD 227
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
S V LL+ +H+Y+ + +RA + +LN R +++
Sbjct: 228 SPVAVWAVFLLLTFLHIYANVRAVRALQLTSLNCSRLDILL 268
>gi|293336742|ref|NP_001169416.1| hypothetical protein [Zea mays]
gi|224029221|gb|ACN33686.1| unknown [Zea mays]
gi|414876408|tpg|DAA53539.1| TPA: hypothetical protein ZEAMMB73_477539 [Zea mays]
Length = 481
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 74/252 (29%), Positives = 116/252 (46%), Gaps = 8/252 (3%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPA 119
+ L+ P+G+P SV+ YL Y + + T VL T SLL A G+ A
Sbjct: 90 AILDFVLPAGFPGSVSADYLDYMLLQFPTNVTGWICHVLVTSSLLKAVGVGSFTGTSAAA 149
Query: 120 QATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
A + W+ KDG+ G+L I GA D +PK+WR+ AD + G+ E+ + L P
Sbjct: 150 SAAAIRWVSKDGIGAFGRLLIGGRFGALFDDDPKKWRMYADFIGSAGSIFELSTTLYPGY 209
Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
FL +A LGN AK IA + I + FA+ GNL ++ AK E ++GL G+
Sbjct: 210 FLPLASLGNLAKAIARGFKDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSIGVL 269
Query: 239 LASTVCSSMQGKMIVGPLLSV--IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
+ +V LSV +H++ + + T+N +R ++V V +
Sbjct: 270 IMDKTGVKSSYLTLVLTWLSVRLLHLWLRYQSISVLKFRTINLKRGRILVKSHVAQHTVP 329
Query: 297 SPADLRYREDLL 308
E++L
Sbjct: 330 GYVACNEEENIL 341
>gi|108706828|gb|ABF94623.1| expressed protein [Oryza sativa Japonica Group]
Length = 497
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 77/266 (28%), Positives = 128/266 (48%), Gaps = 31/266 (11%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ RV L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 136 WGRVGGRLLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKS 195
Query: 119 AQ--ATVVSWILKD--GM---------QHVGKLICSN------------------LGARM 147
A W L+D GM Q L SN LG+ +
Sbjct: 196 ATVIGATFQWFLRDLTGMLGGILFTFYQVSNSLANSNRQCHLIAIDSLFTDEITCLGSNL 255
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
DS K WR++AD + DLG +++LSPL P + + LG+ ++ VA+ ATR +
Sbjct: 256 DSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQH 315
Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIE 267
FA N +D+ AK + TL ++G+G G+ LA G + L++ H+Y+ +
Sbjct: 316 FALANNAADISAKEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYK 375
Query: 268 EMRAAPVNTLNPQRTAMIVADFVKTG 293
+++ + TLN +R+++++ F+ G
Sbjct: 376 AVQSLSLTTLNYERSSILLQYFMDNG 401
>gi|225440682|ref|XP_002274737.1| PREDICTED: UPF0420 protein-like [Vitis vinifera]
Length = 503
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 70/257 (27%), Positives = 120/257 (46%), Gaps = 12/257 (4%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---------RPTPAQA 121
P+G+P SV++ YL Y + + T+ L T SLL A G+ A A
Sbjct: 119 LPAGFPGSVSDDYLEYMLLQFPTNVTAWICHTLVTSSLLKAVGVGSFSATTAAASAAASA 178
Query: 122 TVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
+ W+ KDG+ VG+L I G D +PK+WR+ AD + G+ ++ + L P FL
Sbjct: 179 AAIRWVSKDGIGAVGRLFIGGQFGNLFDDDPKQWRMYADLIGSAGSIFDLSTQLYPAYFL 238
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
++A LGN AK +A + I + FA GNL ++ AK E ++GL GI +
Sbjct: 239 QLASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLALGIMVL 298
Query: 241 ST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
++ V +S + ++H++ + + ++N +R ++V V +
Sbjct: 299 TSPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHSIVPGY 358
Query: 299 ADLRYREDLLFPGRLIE 315
D +E +L R ++
Sbjct: 359 VDCNRKEKILSWQRFLK 375
>gi|432112420|gb|ELK35212.1| hypothetical protein MDA_GLEAN10003470, partial [Myotis davidii]
Length = 638
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 105/216 (48%), Gaps = 3/216 (1%)
Query: 95 FTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
F S+ L+T ++L G+ A +W+LKD +G++I + G+++D
Sbjct: 272 FASSLSGSLATHAVLLGIGVGNASASVSAATATWLLKDSTGMLGRIIFAWWKGSKLDCNA 331
Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
K+WR+ AD L D LE+++PL P F + N AK I VA ATR + A+
Sbjct: 332 KQWRLFADILNDTAMFLEIMAPLLPICFTMTICISNLAKCIVSVAGGATRAALTMHQARR 391
Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
N++D+ AK + TL N+ GL + + V + LL+ +HVY+ +RA
Sbjct: 392 NNMADVSAKDGSQETLVNLAGLLVSLLMLPLVSDCPSFSLGCFFLLTALHVYANYRAVRA 451
Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+ TLN R +++ F++ G + P E L
Sbjct: 452 LVIETLNEGRLWLVLKHFLQRGEVLDPTSANQMEPL 487
>gi|239610850|gb|EEQ87837.1| DUF647 domain-containing protein [Ajellomyces dermatitidis ER-3]
Length = 498
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 72/254 (28%), Positives = 132/254 (51%), Gaps = 5/254 (1%)
Query: 49 VKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSL 108
V +V SR + S LN F P GYP SV++ YL Y + +LQ F+S+ +L+++++
Sbjct: 53 VDIVPPSRKESLSSLTSLLNVFLPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAV 112
Query: 109 LFAAGLRPTPAQATV--VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLG 165
L G+ A TV + +L D M + ++ ++ LG ++ E K +R+ AD L D
Sbjct: 113 LQGVGVGDASASPTVALLHSVLHDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSA 172
Query: 166 TGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAI 224
+ LSP P+ L + + + + + VAA +++ + S FA+ GNL +L AK +
Sbjct: 173 MVFDCLSPAFPKHLRIVVLACSSVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQ 232
Query: 225 STLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAM 284
T+ +++G+ G + S V + L ++H+ + +R+ + TLN QR +
Sbjct: 233 ETVISLLGMLCGSLVVSRVSTPF-ATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANI 291
Query: 285 IVADFVKTGRISSP 298
+ + + GR+ +P
Sbjct: 292 VFSTLFEDGRVLTP 305
>gi|327350378|gb|EGE79235.1| DUF647 domain-containing protein [Ajellomyces dermatitidis ATCC
18188]
Length = 497
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 72/254 (28%), Positives = 132/254 (51%), Gaps = 5/254 (1%)
Query: 49 VKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSL 108
V +V SR + S LN F P GYP SV++ YL Y + +LQ F+S+ +L+++++
Sbjct: 52 VDIVPPSRKESLSSLTSLLNVFLPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAV 111
Query: 109 LFAAGLRPTPAQATV--VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLG 165
L G+ A TV + +L D M + ++ ++ LG ++ E K +R+ AD L D
Sbjct: 112 LQGVGVGDASASPTVALLHSVLHDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSA 171
Query: 166 TGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAI 224
+ LSP P+ L + + + + + VAA +++ + S FA+ GNL +L AK +
Sbjct: 172 MVFDCLSPAFPKHLRIVVLACSSVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQ 231
Query: 225 STLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAM 284
T+ +++G+ G + S V + L ++H+ + +R+ + TLN QR +
Sbjct: 232 ETVISLLGMLCGSLVVSRVSTPF-ATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANI 290
Query: 285 IVADFVKTGRISSP 298
+ + + GR+ +P
Sbjct: 291 VFSTLFEDGRVLTP 304
>gi|255082448|ref|XP_002504210.1| predicted protein [Micromonas sp. RCC299]
gi|226519478|gb|ACO65468.1| predicted protein [Micromonas sp. RCC299]
Length = 226
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/225 (28%), Positives = 114/225 (50%), Gaps = 5/225 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVV--SWI 127
F P YP SV++ Y + + Q +S L+T++LL G+ T A A +I
Sbjct: 2 FLPEDYPNSVSDDYAAFQAWDTAQGLSSYVRGSLTTKALLEGVGVGATGATAASAAAQFI 61
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ--LFLEMAG 184
+D VG L+ + G +D+E K+WR+ AD ++G +E+ +PL FL +A
Sbjct: 62 ARDMCGMVGGLVFTLAKGRHLDAEAKQWRLFADLANNVGMAMELAAPLANDRWWFLALAC 121
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
G+ A+ + AA ATR + FA+ N +D+ AK + T +VG+ G+ + ++
Sbjct: 122 AGSVARSLCGCAAGATRAALTQHFARARNAADIAAKEGSQETAVTLVGMSLGVAVTTSTD 181
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
+ V L+V HVY + +R+ +++LN +R ++A +
Sbjct: 182 GKTAWQWAVFVALTVAHVYCNVRAVRSLVIDSLNRERVVRLLARY 226
>gi|261206350|ref|XP_002627912.1| DUF647 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|239592971|gb|EEQ75552.1| DUF647 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
Length = 497
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 72/254 (28%), Positives = 132/254 (51%), Gaps = 5/254 (1%)
Query: 49 VKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSL 108
V +V SR + S LN F P GYP SV++ YL Y + +LQ F+S+ +L+++++
Sbjct: 52 VDIVPPSRKESLSSLTSLLNVFLPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAV 111
Query: 109 LFAAGLRPTPAQATV--VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLG 165
L G+ A TV + +L D M + ++ ++ LG ++ E K +R+ AD L D
Sbjct: 112 LQGVGVGDASASPTVALLHSVLHDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSA 171
Query: 166 TGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAI 224
+ LSP P+ L + + + + + VAA +++ + S FA+ GNL +L AK +
Sbjct: 172 MVFDCLSPAFPKHLRIVVLACSSVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQ 231
Query: 225 STLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAM 284
T+ +++G+ G + S V + L ++H+ + +R+ + TLN QR +
Sbjct: 232 ETVISLLGMLCGSLVVSRVSTPF-ATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANI 290
Query: 285 IVADFVKTGRISSP 298
+ + + GR+ +P
Sbjct: 291 VFSTLFEDGRVLTP 304
>gi|307111734|gb|EFN59968.1| hypothetical protein CHLNCDRAFT_133075 [Chlorella variabilis]
Length = 434
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/265 (26%), Positives = 116/265 (43%), Gaps = 44/265 (16%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ----ATVVS 125
F P+G+P SV YLRY H T L+T S++ A G+ PA + V
Sbjct: 39 FLPTGWPHSVTPDYLRYQLASVPAHITGWMSHSLATSSMITALGVDAGPATIAATSAAVK 98
Query: 126 WILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCP-------- 176
W+ KDG+ G+LI L + D +PKRWR++A+ + +G +E+ + + P
Sbjct: 99 WLTKDGLGAAGRLIVGGRLSSVFDEDPKRWRMVAEGVTTVGLAMEIATQIYPGGLRSKFR 158
Query: 177 ------QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
F+ +AG G K +A I + F+ N+ D+ AK E +
Sbjct: 159 VVSVVWSYFIALAGTGTLCKSMAKGMGTPCFRIIQTHFSATNNVGDVAAKEEVWEVAAQL 218
Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+GL A SV+ + ++ E +P +T+ P+R A++ + V
Sbjct: 219 LGLAA----------------------SVVALQAL--EAAGSP-DTVVPKRAALMASSHV 253
Query: 291 KTGRISSPADLRYREDLLFPGRLIE 315
+TG + S + E +L P + +
Sbjct: 254 RTGLVPSVEQVNLSEQVLLPPTMCQ 278
>gi|340967040|gb|EGS22547.1| diphosphomevalonate decarboxylase-like protein [Chaetomium
thermophilum var. thermophilum DSM 1495]
Length = 425
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 138/274 (50%), Gaps = 21/274 (7%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQA 121
+ L F P+G+P +V E YL Y + +LQ F S+ S+L+ +++L G+ +P A
Sbjct: 39 AILYAFLPAGFPHTVTEDYLSYQLYDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGA 98
Query: 122 TVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL-F 179
++ I D + + ++ ++ +G ++ E K +R LAD D L++L+P P
Sbjct: 99 LILK-ITGDTISRIATILFAHRMGRAIEPECKFYRFLADIFNDSAQFLDLLTPALPYFPK 157
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
L + + + VAA A++ + + FA GNL++L AK + T+ +++G+ AG
Sbjct: 158 LSVIVSAGVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLAG--- 214
Query: 240 ASTVCSSMQGKMIVG---PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
S V ++ K +V LL+ +H+ +R+ + +LN QR ++ +++ G +
Sbjct: 215 -SLVVRLVEDKQVVWFLMILLAGVHLTMNYRAVRSVKMRSLNRQRATIVFREWLDHGTVL 273
Query: 297 SPADLRYREDLLFPGRLIEDAGNVKM---DRHGI 327
+P + RE +L R GN+ D HG
Sbjct: 274 TPDQVAKRESILCKSR-----GNLASKSGDYHGF 302
>gi|336273098|ref|XP_003351304.1| hypothetical protein SMAC_03608 [Sordaria macrospora k-hell]
gi|380092824|emb|CCC09577.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 498
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 72/254 (28%), Positives = 131/254 (51%), Gaps = 13/254 (5%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPA 119
+ + L+ F P+GYP SV YL Y + +LQ F S+ S+L+ +++L G+ +P
Sbjct: 38 LQALLHAFLPAGYPHSVTPDYLPYQTYDSLQAFFSSITSLLANRAVLEGLGVGDSSSSPT 97
Query: 120 QATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
A ++ I D + + ++ ++ +G ++ E K +R LAD L D L++L+P P
Sbjct: 98 GALILK-IAGDTISRIATILFAHRMGTAIEPECKFYRFLADLLNDSAQLLDLLTPALPYF 156
Query: 179 -FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
L + + + VAA A++ + + FA GNL++L AK + T+ +++ GI
Sbjct: 157 PKLGVIVSAGVLRSLCGVAANASKASLSAHFALRGNLAELNAKEASQETVVSLL----GI 212
Query: 238 QLASTVCSSMQGKMIVGPL---LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
+ S V +Q + +V L L +H+ +RA TLN QR ++V +++ GR
Sbjct: 213 LVGSLVVKVVQDRGVVWALMVGLLGVHLLMNYWAVRAVRCRTLNRQRATVVVRGWLEWGR 272
Query: 295 ISSPADLRYREDLL 308
+ P ++ E +L
Sbjct: 273 VLGPEEVARMESIL 286
>gi|326428268|gb|EGD73838.1| hypothetical protein PTSG_05532 [Salpingoeca sp. ATCC 50818]
Length = 376
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 73/246 (29%), Positives = 117/246 (47%), Gaps = 10/246 (4%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWI 127
P GYP SV+ Y Y + Q F+S+ + +ST ++L G+ + T A AT + +
Sbjct: 51 LPEGYPASVSSDYASYQIWDTAQAFSSSVMGTISTYAVLKGVGVGDAQATVAAATYTN-L 109
Query: 128 LKDGMQHV-GKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L+D + G MD K+WR++AD + D+ +E++SPL P FL + +
Sbjct: 110 LRDATGMLAGIWFAWYSSTDMDRNAKQWRLVADVVNDMAMLVELVSPLIPAAFLPLMCMA 169
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
+F + I VA ATR + A+ N++D+ AK + TL N+ L + + S
Sbjct: 170 SFLRAIVGVAGGATRAALVRHQARLNNMADVSAKDGSQETLVNLAALVVSLAITPLAAES 229
Query: 247 --MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
M M + PL+ +H+Y +RA + N R A F++T R+ S
Sbjct: 230 RVMAWAMFL-PLV-FMHLYCNYRAVRALEMPAFNAHRLHHACAVFMRTNRVPSVCAANQA 287
Query: 305 EDLLFP 310
E LFP
Sbjct: 288 EP-LFP 292
>gi|308804794|ref|XP_003079709.1| unnamed protein product [Ostreococcus tauri]
gi|116058166|emb|CAL53355.1| unnamed protein product [Ostreococcus tauri]
Length = 499
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 67/249 (26%), Positives = 109/249 (43%), Gaps = 21/249 (8%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ---ATVVSW 126
F PSG+P +V+ Y + ++ + ++S QSLL A G+ TP W
Sbjct: 120 FLPSGFPQTVSADYADWLRWHLTSLLFRDVIEIMSAQSLLVALGVGSTPGALPLTAAAKW 179
Query: 127 ILKDGMQHVGKLICSNLGARM-DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
+LKDG+ L+ G R D +PKRW +++AL D+ +E+++P P LFL +A
Sbjct: 180 VLKDGVGSFATLLAGAFGGRWYDEDPKRWWAVSNALEDVARAIELVTPAAPGLFLPLAAS 239
Query: 186 GNFAKGIAVVAARATRL--PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
F + A + R + + F + NL D+ AK E +V L GIQ+ V
Sbjct: 240 ATFVRA-AALTGRGSLMNGTFMQHFGRNNNLGDIRAKLEVQGRWLALVALPVGIQVFQLV 298
Query: 244 CSSMQGKMIVGPLLSVI-------------HVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+ G L H++S + R+ ++LN R + +V
Sbjct: 299 NAQASALHAEGDELGAAAAAFGGYGLVIGAHIFSCWQAARSLKFDSLNGFRLLRLAEAYV 358
Query: 291 KT-GRISSP 298
+ G+I P
Sbjct: 359 ASDGKIELP 367
>gi|167517110|ref|XP_001742896.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163779520|gb|EDQ93134.1| predicted protein [Monosiga brevicollis MX1]
Length = 272
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 103/195 (52%), Gaps = 2/195 (1%)
Query: 99 ALSVLSTQSLLFAAGLRPTPAQ-ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRI 156
A VLS Q+LL+AAGL A ++W+LKDG+ +G ++ L G R D +PKR R
Sbjct: 3 ASGVLSMQALLYAAGLGAGSVPLAAAINWVLKDGLGQLGGVLYGTLFGPRFDRDPKRQRF 62
Query: 157 LADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSD 216
+ E+L PL P LFL +A N AK I+ +A+ ATR +++SF + NL D
Sbjct: 63 WSLVALQGANLTELLIPLVPHLFLPLASAANIAKNISFLASSATRAQMHASFVRRANLGD 122
Query: 217 LFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNT 276
+ AK A S +VVG GI +++ S + + L++ +Y + ++T
Sbjct: 123 ITAKIGAQSIGASVVGTALGIFVSTHTGSDVTNIIPAFVPLALACLYFNHLSSKHVVLST 182
Query: 277 LNPQRTAMIVADFVK 291
LN QR + + +++
Sbjct: 183 LNEQRLELAMHPWLQ 197
>gi|401421841|ref|XP_003875409.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322491646|emb|CBZ26919.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 675
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 64/234 (27%), Positives = 111/234 (47%), Gaps = 7/234 (2%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
+R RVV + F P GYP SV E L + + +Q + LST+++L G+
Sbjct: 57 ARQALRRVVTALERIFLPDGYPSSVTEDLLPFAFWAHVQVLAKNVTATLSTRAVLLGVGV 116
Query: 115 RPTPAQAT--VVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVL 171
+ A T +SW+++DG + +G ++ S + ++ K W+++ D ++D LE+
Sbjct: 117 GESKADLTSSTLSWMMQDGARMIGSVVFASVIPQGLECRAKTWQLVTDFIHDSAHLLELC 176
Query: 172 SPLCP---QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
SP P +F + + K +A V TR P FA N +D+ AK +
Sbjct: 177 SPWLPGGQAIFRVVLVAASVIKALASVCGNGTRAPFTQHFALRNNGADIAAKAATRGHVG 236
Query: 229 NVVGLGAGIQLASTV-CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
++GL GI +A V +S + + V L S +H+++ +R + LN R
Sbjct: 237 GLIGLALGIVIAYIVPATSRRLNLAVFALCSAVHIFANYRGVRGVQLRHLNAPR 290
>gi|402074762|gb|EJT70271.1| hypothetical protein GGTG_12444 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 438
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 72/252 (28%), Positives = 133/252 (52%), Gaps = 15/252 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
F P+G+P SV + YL Y + +LQ F+S+ S+L+ +++L G+ +P A V+
Sbjct: 48 FLPAGFPHSVTDDYLAYQTYDSLQAFSSSIASLLANRAVLEGLGVGNASQSPTAALVLQ- 106
Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLG---TGLEVLSPLCPQLFLEM 182
IL+D + ++ ++ +G ++ E K +R +AD D L + P P++ + +
Sbjct: 107 ILQDTFSRLATILFAHRMGTAIEPECKTYRFMADLFNDASQLLDLLVPVLPFLPKVSVMV 166
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ + + VAA A++ + + FAK GNL++L AK + T+ ++VG+ G +
Sbjct: 167 SAA--ILRSLCGVAAGASKASLSAHFAKMGNLAELNAKEASQETVVSLVGMLVGTAVVHM 224
Query: 243 VCS--SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
V ++ MIV L IH+ +R + TLN QR ++ +F++TG++ +P
Sbjct: 225 VEDKRAVWCWMIV---LIAIHLMMNYRAVRCVKMLTLNRQRATIVFREFLETGKVLTPGQ 281
Query: 301 LRYREDLLFPGR 312
+ RE +L GR
Sbjct: 282 VAARESILRNGR 293
>gi|115434452|ref|NP_001041984.1| Os01g0141600 [Oryza sativa Japonica Group]
gi|54290177|dbj|BAD61065.1| unknown protein [Oryza sativa Japonica Group]
gi|113531515|dbj|BAF03898.1| Os01g0141600 [Oryza sativa Japonica Group]
Length = 323
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 95/192 (49%), Gaps = 17/192 (8%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
R + F+ P+G+P SV++ YL Y ++ + T L T +LL A G+
Sbjct: 101 RAIKDFV---LPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGT 157
Query: 121 ATV-----VSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
+ + W+ KDG+ G+L I G D +PK+WR+ AD + G+ E+++PL
Sbjct: 158 SAAASAAAIRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPL 217
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLP----IYSSFAKEGNLSDLFAKGEAISTLFNV 230
P FL +A LGN AK + R R P I + FA+ GNL ++ AK E +
Sbjct: 218 YPGYFLPLASLGNLAKAV----GRGFRDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQL 273
Query: 231 VGLGAGIQLAST 242
+GL G+ + T
Sbjct: 274 LGLSIGVFIMDT 285
>gi|295659580|ref|XP_002790348.1| DUF647 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226281800|gb|EEH37366.1| DUF647 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 485
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 67/240 (27%), Positives = 122/240 (50%), Gaps = 20/240 (8%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
+ S L F P+GYP SV++ Y+ Y + +LQ F+S+ +L+++++L G+ A T
Sbjct: 65 LTSLLIVFLPAGYPHSVSDDYMEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPT 124
Query: 123 V--VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-L 178
V + +L++ M + ++ ++ LG ++ E K +R+ AD L D +E LSP P+ L
Sbjct: 125 VALLHSVLQESMGRIATILFAHRLGTSLEPECKLYRLAADVLNDSAMVVECLSPAFPKHL 184
Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
+ + + + + VAA +++ + S FAK GNL +L A G + + + L G+
Sbjct: 185 RVVLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNACGSLVVSHIS-TPLTTGLT 243
Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
L L +IH+ + +RA + TLN QR ++ + + GR +P
Sbjct: 244 LI---------------FLLLIHLSTNYAAVRAVNMTTLNRQRANIVFSTLFEEGRALTP 288
>gi|413949258|gb|AFW81907.1| hypothetical protein ZEAMMB73_025152 [Zea mays]
Length = 253
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 73/126 (57%), Gaps = 3/126 (2%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + A A
Sbjct: 120 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVSQSRAASGA 179
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
++WILKDG VGK+ + G + D + K+ R D L +LG G+E+ + PQ FL
Sbjct: 180 VAINWILKDGAGRVGKMFFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQFFLP 239
Query: 182 MAGLGN 187
MA + N
Sbjct: 240 MACIAN 245
>gi|389639940|ref|XP_003717603.1| hypothetical protein MGG_09749 [Magnaporthe oryzae 70-15]
gi|351643422|gb|EHA51284.1| hypothetical protein MGG_09749 [Magnaporthe oryzae 70-15]
Length = 479
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 67/258 (25%), Positives = 134/258 (51%), Gaps = 11/258 (4%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTP 118
++ + F P+G+P +V E YL Y + +LQ F+S+ S+L+ +++L G+ +P
Sbjct: 41 LIKAIFYAFLPAGFPHTVTEDYLHYQIYDSLQAFSSSIASLLANRAVLEGLGVGDASQSP 100
Query: 119 AQATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGT-GLEVLSPLCP 176
A V+ I++D + + ++ ++ +G ++ E K +R LAD D ++ L
Sbjct: 101 TAALVLK-IIQDTISRLATIMFAHRMGTAIEPECKSYRFLADIFNDAAQILDLLVPVLPL 159
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
+ + + + VAA A++ + + FAK GNL++L AK + T+ ++VG+ G
Sbjct: 160 LPKVSVLVTSGCLRSLCGVAAGASKASLSAHFAKTGNLAELNAKEASQETVVSLVGMLTG 219
Query: 237 IQLASTV--CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
+ V +++ M+V + ++ Y + +R + TLN QR ++ +F++TG+
Sbjct: 220 TAVVHMVEDKTAVWCWMVVLICMHLMMNYRAVRNVR---MLTLNRQRATIVFREFLETGK 276
Query: 295 ISSPADLRYREDLLFPGR 312
+ +PA RE +L R
Sbjct: 277 VLTPAQCSARESILLNSR 294
>gi|156059886|ref|XP_001595866.1| hypothetical protein SS1G_03956 [Sclerotinia sclerotiorum 1980]
gi|154701742|gb|EDO01481.1| hypothetical protein SS1G_03956 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 477
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 80/281 (28%), Positives = 143/281 (50%), Gaps = 32/281 (11%)
Query: 63 VDSFLNKFFPSGYPFSVNEGY---------LRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
V S F P GYP SV E Y L Y L F S +L+++++L G
Sbjct: 49 VSSLTGAFLPVGYPDSVTEDYTACVPCPIHLLYLLCEKLA-FASTIAGLLASRAVLQGLG 107
Query: 114 LRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEV 170
+ + A AT V+ +L++ + ++ ++ LG+ ++ E K++R++AD D L+
Sbjct: 108 VGDSTASATGAVLLNVLQESAGRIATILFAHRLGSALEPECKKYRLMADIFNDGAMILDC 167
Query: 171 LSPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTL 227
+SP P++FL A + K + VAA +++ + + FAK+GNL++L AK + TL
Sbjct: 168 ISPAFPKVPRVFLLSAS--SVCKSLCGVAAGSSKASLSAHFAKQGNLAELNAKDASQETL 225
Query: 228 FNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVA 287
+++G+ G + S + S + + LLS IH+ + +R+ + TLN QR ++++
Sbjct: 226 ISLLGMLVGTFVVSRISSQTATWIALISLLS-IHLGTNYLAVRSVTMRTLNRQRANLVIS 284
Query: 288 DFV------KTG-RISSPADLRYREDLLFPGRLIEDAGNVK 321
DFV KT + +P D+ +E R+ E G ++
Sbjct: 285 DFVSNINAEKTRFNLPTPKDISRKE------RIFERDGAIR 319
>gi|125568971|gb|EAZ10486.1| hypothetical protein OsJ_00318 [Oryza sativa Japonica Group]
Length = 459
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 96/198 (48%), Gaps = 19/198 (9%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----R 115
R + F+ P+G+P SV++ YL Y ++ + T L T +LL A G+
Sbjct: 118 RAIKDFV---LPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGT 174
Query: 116 PTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
A A + W+ KDG+ G+L I G D +PK+WR+ AD + G+ E+++PL
Sbjct: 175 SAAASAAAIRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPL 234
Query: 175 CPQLFLEMAGLGNFAKGI------AVVAARATRLP----IYSSFAKEGNLSDLFAKGEAI 224
P FL +A LGN AK I R R P I + FA+ GNL ++ AK E
Sbjct: 235 YPGYFLPLASLGNLAKKIIDSGLLKQAVGRGFRDPSFRVIQNHFAESGNLGEVAAKEEVW 294
Query: 225 STLFNVVGLGAGIQLAST 242
++GL G+ + T
Sbjct: 295 EVGAQLLGLSIGVFIMDT 312
>gi|303278700|ref|XP_003058643.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459803|gb|EEH57098.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 679
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/186 (30%), Positives = 95/186 (51%), Gaps = 16/186 (8%)
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
DS+ KR+R + +YD + +E+++P P FL +A + N K + + A R PI S
Sbjct: 314 DSDVKRFRFTSSVVYDCASLIEMITPFYPSKFLLLATIANVGKSVGITTANVVRAPIQMS 373
Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM------IVGPLLSV-- 259
FA E NL+++ AK A L + +GL + V + GK+ + PLL+
Sbjct: 374 FALEENLAEIAAKTSAQQVLADNIGLALAV-----VATGWTGKIANPRLRLAVPLLAFAP 428
Query: 260 ---IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
+ +Y + E+++ + T+N +R +I A FV+TG I S + E L P R+ E
Sbjct: 429 LASVDLYCIYRELKSVRLRTINKERGEIIAAGFVETGTIPSAVSVAAAERLFIPARMDES 488
Query: 317 AGNVKM 322
+ +K+
Sbjct: 489 SLPLKI 494
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 49/87 (56%), Gaps = 11/87 (12%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTP 118
VVD+FL P +V Y Y +FR Q S+ ++V +TQ +L A GL R P
Sbjct: 170 VVDAFL----PCAN--AVTPDYWAYAKFRFAQRVASSCITVFATQQMLQAVGLGATRRLP 223
Query: 119 AQATVVSWILKDGMQHVGKL-ICSNLG 144
A A V+W+LKDG+ +GKL + +N G
Sbjct: 224 A-AAAVNWVLKDGLGRLGKLSVATNFG 249
>gi|440465939|gb|ELQ35234.1| hypothetical protein OOU_Y34scaffold00720g28 [Magnaporthe oryzae
Y34]
gi|440485195|gb|ELQ65173.1| hypothetical protein OOW_P131scaffold00517g6 [Magnaporthe oryzae
P131]
Length = 431
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 67/250 (26%), Positives = 131/250 (52%), Gaps = 11/250 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
F P+G+P +V E YL Y + +LQ F+S+ S+L+ +++L G+ +P A V+
Sbjct: 40 FLPAGFPHTVTEDYLHYQIYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPTAALVLK- 98
Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLG-TGLEVLSPLCPQLFLEMAG 184
I++D + + ++ ++ +G ++ E K +R LAD D ++ L + +
Sbjct: 99 IIQDTISRLATIMFAHRMGTAIEPECKSYRFLADIFNDAAQILDLLVPVLPLLPKVSVLV 158
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV- 243
+ + VAA A++ + + FAK GNL++L AK + T+ ++VG+ G + V
Sbjct: 159 TSGCLRSLCGVAAGASKASLSAHFAKTGNLAELNAKEASQETVVSLVGMLTGTAVVHMVE 218
Query: 244 -CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
+++ M+V + ++ Y + +R + TLN QR ++ +F++TG++ +PA
Sbjct: 219 DKTAVWCWMVVLICMHLMMNYRAVRNVR---MLTLNRQRATIVFREFLETGKVLTPAQCS 275
Query: 303 YREDLLFPGR 312
RE +L R
Sbjct: 276 ARESILLNSR 285
>gi|395332415|gb|EJF64794.1| hypothetical protein DICSQDRAFT_80493 [Dichomitus squalens LYAD-421
SS1]
Length = 506
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 77/308 (25%), Positives = 126/308 (40%), Gaps = 13/308 (4%)
Query: 22 QEIPIYWFETSDSVTRHYQ--FQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSV 79
+E I W + + + ++ P+ V V R+ F P+ YP SV
Sbjct: 70 RECHISWSDGRSGIRKEWRDTASPEATEGNVAVSGRNSVTGRISSWLRQMFLPTNYPQSV 129
Query: 80 NEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGK 137
+ Y + +AL+ + SVL Q+LL + G+ + A V WI+KDG V K
Sbjct: 130 HSSYASFHTLQALETTMATITSVLCNQALLTSVGMSAEGSVFGAVAVQWIIKDGAGEVAK 189
Query: 138 L-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL---CPQLFLEMAGLGNFAKGIA 193
L + DS PK + + + L LG+GL++ + L P FL A GN K +
Sbjct: 190 LFFIRHFSTYFDSHPKTFTLFGEVLGCLGSGLQIATVLIAPSPLNFLLCAAGGNIFKLVG 249
Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIV 253
T + F+ +GN D+ AK E+ +++ + G AGI L + S +
Sbjct: 250 NAIWSTTHIKFIRYFSMQGNDGDVAAKDESQASVAQLAGYAAGISL---LTFSHAPAYLY 306
Query: 254 GPLLSVI--HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPG 311
+ + H+ MR A L R + + +V G + S DL G
Sbjct: 307 AIFFAAVPLHLTMTAYMMRVATFELLTLPRISHLAQTYVNDGVVGSQKDLDREHATGLFG 366
Query: 312 RLIEDAGN 319
++ G+
Sbjct: 367 EFYKNNGD 374
>gi|451846811|gb|EMD60120.1| hypothetical protein COCSADRAFT_151456 [Cochliobolus sativus
ND90Pr]
Length = 482
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 144/281 (51%), Gaps = 13/281 (4%)
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL- 114
+ + V+ ++ F PSG+P SV E Y+ Y F +LQ F+S+ +L+++++L G+
Sbjct: 51 KQAHKSVLQYIIDVFLPSGFPQSVTEDYIEYQVFDSLQAFSSSIAGMLASRAVLQGVGVG 110
Query: 115 --RPTPAQATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVL 171
TP A ++S +L++ + + + ++ G ++ E K +R+ AD L D L+ L
Sbjct: 111 DSNATPTAALLLS-VLQESVGRMATIAFADRFGTALEPECKMYRLTADILNDSAMILDCL 169
Query: 172 SPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
SP P+ L + + + + + V A +++ + + FA GNL ++ AK + T+ ++
Sbjct: 170 SPGFPKPLRVAVLSFASCLRSLCGVCAGSSKASLSAHFATAGNLGEVNAKDSSQETVISL 229
Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+G+ AG + S + S + LL IH+ + +RA + +LN QR ++ + +
Sbjct: 230 LGMLAGSFVVSHITSDL-ATWCTLILLLAIHLATNYAAVRAVSMQSLNRQRANILFSHIL 288
Query: 291 KTGRISSPADLRYREDLLFPGRLIEDAGNVKMDRHGIRAEC 331
+ G + +P ++ RE R+ E G ++ + +C
Sbjct: 289 QHGIVLNPKEVSARE------RIFERGGVLRWADDEVLGKC 323
>gi|167526040|ref|XP_001747354.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774189|gb|EDQ87821.1| predicted protein [Monosiga brevicollis MX1]
Length = 283
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 94/184 (51%), Gaps = 1/184 (0%)
Query: 127 ILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
IL+D +G + + + G+ +D+ K+WR++AD L D LE++SPL P FL + +
Sbjct: 4 ILRDATGMIGSISFAWIQGSDLDNNAKQWRLVADVLNDTAMLLELISPLLPAYFLFLVCV 63
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
+ A+ I VA ATR + A+ N+ D+ AK + TL N+V L + L V
Sbjct: 64 ASLARAIVGVAGGATRAALIQHQARNNNMGDVAAKDGSQETLVNLVALLVNLILVPLVSQ 123
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
+ Q + L + H++ + +R+ + LN +R +IV + + +I SP RE
Sbjct: 124 NAQVVWSLFGLSTCCHIFFNYKAVRSLCMTRLNGRRAHLIVEHYTRHQQILSPKIANQRE 183
Query: 306 DLLF 309
L F
Sbjct: 184 PLFF 187
>gi|428168470|gb|EKX37414.1| hypothetical protein GUITHDRAFT_89666 [Guillardia theta CCMP2712]
Length = 331
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 55/165 (33%), Positives = 85/165 (51%), Gaps = 3/165 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F PSGYP S+ + Y +Y F LQ +S +LSTQ +L G + A + W+
Sbjct: 130 FLPSGYPASLPDEYTKYQAFNLLQDLSSNLRGILSTQKILEGMGVGKAGVTSLAATMQWM 189
Query: 128 LKDGMQHVGKLICSNLGAR-MDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+DG +G L+ + + + KRWR+ AD + D+ L++L+P P LFL + L
Sbjct: 190 ARDGSSMLGGLLFTAFASSSFGIDIKRWRLFADLINDVALLLDMLAPAYPDLFLPIICLS 249
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
+ K + VAA A I +A GN++D+ AK A T +V+
Sbjct: 250 SVFKAMCGVAAGACNTAIVQHWAARGNIADVGAKTGAQHTFVSVL 294
>gi|392564112|gb|EIW57290.1| DUF647-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 516
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 69/258 (26%), Positives = 114/258 (44%), Gaps = 7/258 (2%)
Query: 50 KVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL 109
K+ + + R+ F P+ YP SV+ YL + + + + SVL Q+LL
Sbjct: 95 KIAAPTNSLTGRISSWFGQMVLPTNYPHSVHRSYLPFHILQFFETTVATITSVLCNQALL 154
Query: 110 FAAGLRPTPA--QATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGT 166
+ G+ + A V WI+KDG V KL DS PK + + + LG+
Sbjct: 155 TSVGVSAEGSVFGAVAVQWIIKDGAGEVAKLFFIRRFSPYFDSHPKTFTFFGELMGCLGS 214
Query: 167 GLEVLSPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
GL++ + L P FL A GN K + T + F+++GN D+ AK E+
Sbjct: 215 GLQIATLLIAPSPGNFLLCAAGGNIFKIVGNAIWFTTHIKFMRYFSEQGNTGDVAAKDES 274
Query: 224 ISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
S++ +VG +GI L + V + + +H+ MR A L R +
Sbjct: 275 QSSIAQLVGYASGISLLA-VSHTAPYLYAIFAFAVPVHLAMTAYMMRVATFELLTLPRVS 333
Query: 284 MIVADFVKTGRISSPADL 301
+ ++V G++SS +L
Sbjct: 334 CLAQEYVTRGQVSSLGEL 351
>gi|413956563|gb|AFW89212.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
Length = 263
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 55/172 (31%), Positives = 90/172 (52%), Gaps = 3/172 (1%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ R+ L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 50 WERIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQS 109
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + DLG +++LSPL
Sbjct: 110 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 169
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTL 227
P + + LG+ ++ VA+ ATR + FA N +D+ AK + TL
Sbjct: 170 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETL 221
>gi|452823886|gb|EME30893.1| hypothetical protein Gasu_19060 [Galdieria sulphuraria]
Length = 346
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 67/253 (26%), Positives = 117/253 (46%), Gaps = 17/253 (6%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWIL- 128
F PSGYP SV + Y + ++ +LQ S V+ T+S L G+ A T + +L
Sbjct: 68 FLPSGYPESVTDDYANFEKWDSLQGLCSYLRQVMVTRSTLSGLGVGNNFAHPTAAALMLL 127
Query: 129 -KDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGL 185
++G VG L+ + +R D + K WR+ AD + ++ +E +PL P +F +
Sbjct: 128 WREGFGVVGGLLFTWGKSSRFDKDSKSWRLFADVINNVALTMEWTAPLFPTPVFFALHSS 187
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC- 244
N + +A AT + FAK N+ D+ +K I ++VGL L ST+C
Sbjct: 188 ANIFHSLCGIAGGATCTAFQAHFAKRNNIGDICSKHGNIGRAVSLVGL-----LLSTLCF 242
Query: 245 -------SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
+S+ V L+V H+Y I +R+ + LN +R ++ ++ ++
Sbjct: 243 WSSSLLENSLPFSFGVFCFLTVFHIYFNILALRSLHLAVLNKERGYIVAEHYICHKQLLD 302
Query: 298 PADLRYREDLLFP 310
+ + +E P
Sbjct: 303 VSSVSKKERFWRP 315
>gi|255074355|ref|XP_002500852.1| predicted protein [Micromonas sp. RCC299]
gi|226516115|gb|ACO62110.1| predicted protein [Micromonas sp. RCC299]
Length = 561
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 91/184 (49%), Gaps = 7/184 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ---ATVVSW 126
F P+GYP +V+ Y + ++ L V++ QSLL A G+ TP W
Sbjct: 126 FLPAGYPDTVSPDYAPFIRWHLGSLMFRNVLEVITAQSLLVALGMGSTPGALPLTAATKW 185
Query: 127 ILKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
+LKDG+ + L +LG + D +PKRW ++++A D+ +E+++P P LFL +A
Sbjct: 186 VLKDGIGSLATLAAGSLGGQKCDEDPKRWWMVSNAFEDVARVIELVTPAAPALFLPLAAT 245
Query: 186 GNFAKGIAVVAARATRL--PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
F + A + R + + + FA+ N +D+ A+ E +V L GI + TV
Sbjct: 246 ATFVR-TAALTGRGSLVNGSLMQHFARNENTADVRARLEVQGRWLALVALPVGIGIFRTV 304
Query: 244 CSSM 247
S
Sbjct: 305 SQSF 308
>gi|225683629|gb|EEH21913.1| DUF647 domain-containing protein [Paracoccidioides brasiliensis
Pb03]
Length = 509
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 67/249 (26%), Positives = 130/249 (52%), Gaps = 18/249 (7%)
Query: 65 SFLNKFFPSGYPFSVNEGYL-----------RYTQFRALQHFTSAALSVLSTQSLLFAAG 113
S L F P+GYP SV++ Y+ +Y +LQ F+S+ +L+++++L G
Sbjct: 67 SLLIVFLPAGYPHSVSDDYMDLLFLALENDRKYKD--SLQAFSSSIAGLLASRAVLQGVG 124
Query: 114 LRPTPAQATV--VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEV 170
+ A TV + +L++ + + ++ ++ LG ++ E K +R AD L D ++
Sbjct: 125 VGDASASPTVALLHSVLQESIGRIATILFAHRLGTSLEPECKLYRFAADVLNDSAMVVDC 184
Query: 171 LSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
LSP P+ L + + + + + VAA +++ + S FAK GNL +L AK + T+ +
Sbjct: 185 LSPAFPKHLRVVLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNAKDSSQETVIS 244
Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
++G+ G + S + + + + + LL +IH+ + +RA + TLN QR ++ +
Sbjct: 245 LMGMLCGSLVVSHISTPLTTGLALIFLL-LIHLSTNYAAVRAVNMTTLNRQRANIVFSTL 303
Query: 290 VKTGRISSP 298
+ GR +P
Sbjct: 304 FEEGRALTP 312
>gi|157869108|ref|XP_001683106.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68223989|emb|CAJ04889.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 678
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 62/234 (26%), Positives = 110/234 (47%), Gaps = 7/234 (2%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
+R RVV + P GYP SV E L + + Q + LST+++L G+
Sbjct: 57 ARQARRRVVTTLERILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGV 116
Query: 115 RPTPAQAT--VVSWILKDGMQHVGKLICSNLGAR-MDSEPKRWRILADALYDLGTGLEVL 171
+ A T +SW+++DG + +G ++ + L + ++ K W+++AD D+ LE+
Sbjct: 117 GESKADLTSATLSWMMQDGTRMIGSVVFAPLIPQGLECRAKTWQLVADFTTDIANLLELC 176
Query: 172 SPLCP---QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
+P P F + + K + V TR FA N +D+ AK S++
Sbjct: 177 APWLPGGQTTFRVVLVAASVVKALGSVCRNCTRASFTRHFALRNNAADIAAKASTRSSVG 236
Query: 229 NVVGLGAGIQLASTV-CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
+ +GL G+ +A V +S ++V L SV+H+++ +R + LN R
Sbjct: 237 SFIGLVLGVAVAYIVPATSRSLNLVVFALGSVLHIFASYRSVRGVQLKHLNAPR 290
>gi|259484769|tpe|CBF81274.1| TPA: DUF647 domain protein (AFU_orthologue; AFUA_6G06850)
[Aspergillus nidulans FGSC A4]
Length = 564
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 74/271 (27%), Positives = 137/271 (50%), Gaps = 21/271 (7%)
Query: 64 DSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQ 120
D + F P+GYP SV++ Y Y F +LQ F+S+ +L+++++L G+ +P
Sbjct: 67 DVLIEVFLPAGYPHSVSDDYTAYQIFDSLQAFSSSIAGLLASRAVLQGVGVGNANASPTS 126
Query: 121 ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
A ++ IL+D + ++ ++ +G ++ E K +R AD DL L+ SP+ P F
Sbjct: 127 ALLLH-ILQDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCFSPMVPAGF 185
Query: 180 -----LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
L AG+ + + VA +++ + + FA+ GNL+++ AK + T+ +++G+
Sbjct: 186 NRVTVLSAAGV---LRALCGVAGGSSKASLSAHFARWGNLAEVNAKDSSQETVISLIGML 242
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
G + S + S + + LLS +H+ +R+ + TLN QR A+ V +
Sbjct: 243 VGSFVVSHITSFTATWLTLVFLLS-MHLSLNYAAVRSVQMTTLNRQR-----ANIVFSTL 296
Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKMDRH 325
+SS DL + + P R E ++K H
Sbjct: 297 LSSDPDL--ADFVSSPSRKDERRASLKQQEH 325
>gi|226292974|gb|EEH48394.1| DUF647 domain-containing protein [Paracoccidioides brasiliensis
Pb18]
Length = 495
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 65/238 (27%), Positives = 120/238 (50%), Gaps = 20/238 (8%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV- 123
S L F P+GYP SV++ Y+ Y + +LQ F+S+ +L+++++L G+ A TV
Sbjct: 67 SLLIVFLPAGYPHSVSDDYMEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVA 126
Query: 124 -VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFL 180
+ +L++ + + ++ ++ LG ++ E K +R AD L D ++ LSP P+ L +
Sbjct: 127 LLHSVLQESIGRIATILFAHRLGTSLEPECKLYRFAADVLNDSAMVVDCLSPAFPKHLRV 186
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ + + + VAA +++ + S FAK GNL +L A G + + + L G+ L
Sbjct: 187 VLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNACGSLVVSHIS-TPLTTGLALI 245
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
L +IH+ + +RA + TLN QR ++ + + GR +P
Sbjct: 246 ---------------FLLLIHLSTNYAAVRAVNMTTLNRQRANIVFSTLFEEGRALTP 288
>gi|449690512|ref|XP_002160836.2| PREDICTED: UPF0420 protein-like, partial [Hydra magnipapillata]
Length = 172
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 78/128 (60%), Gaps = 3/128 (2%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
F N F P G+P SV+E YLRY + LQ F S+ +L+TQ++L A G+ A
Sbjct: 45 FQNVFLPQGFPESVSEDYLRYQLWDTLQAFCSSITGMLATQAMLKAYGVGDNTATVTAAT 104
Query: 124 VSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+LKDG +G+++ + G ++D + KRWR++AD L + L++++PL Q F+ +
Sbjct: 105 ITWMLKDGAGMIGRILFAWKKGTQLDCDAKRWRLVADVLNNCAIMLDLVAPLFKQYFVLI 164
Query: 183 AGLGNFAK 190
A L + A+
Sbjct: 165 ACLSSVAR 172
>gi|170118219|ref|XP_001890291.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164634725|gb|EDQ99048.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 471
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 78/285 (27%), Positives = 142/285 (49%), Gaps = 25/285 (8%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ RV+D F P+GYP SV+ YLRY ALQ F ++ + +LS++++L G+
Sbjct: 42 HRRVLDLISKVFLPAGYPDSVSPDYLRYQILNALQAFCNSLVGLLSSRAILEGFGVGDPS 101
Query: 119 AQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A AT ++ +L+D + ++ ++ LG+ + + K++R+LAD L D L+ +SP
Sbjct: 102 ATATNALLLTVLQDVFSRLTTIVAAHILGSSLAPDVKKYRLLADMLNDAAVILDTISPRL 161
Query: 176 PQLF---LEMAGLGNFA--KGIAVVAARATRLPIYSSFAK----EGNLSDLFAKGEAIST 226
LF L +A L A + + ++A ++ I FA G++ DL AK + T
Sbjct: 162 DTLFFPGLRVAALCLSALFRSLCAISAGGSKASISLHFATPLKGTGDVGDLNAKDASKET 221
Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEM--RAAPVNTLNPQRTAM 284
+ ++G+ G + + + I LL ++ ++ I + R + TLN QRT +
Sbjct: 222 VLALLGMLLGTLIVPHLTTPWT---IYTTLLVLVGLHLTINYIGVRGLVLRTLNGQRTWL 278
Query: 285 IVADFVKT--GRISSPADLRYREDLLFPGRLIEDAGNVKMDRHGI 327
+ +++T GR +P ++ E R+ E G + R G+
Sbjct: 279 AWSAYMRTQPGRAPTPDEVASLE------RIFERPGAFRDPRSGV 317
>gi|302837684|ref|XP_002950401.1| hypothetical protein VOLCADRAFT_90737 [Volvox carteri f.
nagariensis]
gi|300264406|gb|EFJ48602.1| hypothetical protein VOLCADRAFT_90737 [Volvox carteri f.
nagariensis]
Length = 500
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/241 (27%), Positives = 125/241 (51%), Gaps = 16/241 (6%)
Query: 86 YTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLICSNL 143
+ + +Q +S VL++Q++L G+ + A V ++ ++D +G ++ S L
Sbjct: 4 FQTWDTIQALSSYVRGVLTSQAILRGVGVGQEASTPLAAVFTFFVRDLAGMLGGILFSYL 63
Query: 144 -GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATR 201
G+ D+ K+WR+ AD DLG +E+ SPL P+ +FL +A LG+ A+ + VA ATR
Sbjct: 64 EGSSFDACAKQWRLFADITNDLGMTVELASPLFPRVMFLPIACLGSVARSLTGVAGGATR 123
Query: 202 LPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS--------SMQGKMIV 253
+ FA+ GN +D+ AK ++ T ++G+ G+ + + + + ++
Sbjct: 124 AALTQHFARRGNAADVSAKEQSQETATTILGMVTGMAVTRLLAAGEGPAGGPTGAAGFVI 183
Query: 254 GPL----LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF 309
L L+ +HV + + MR + +LN R ++V +++ GR+ SP + EDL
Sbjct: 184 AWLVFLGLTAVHVVANVAAMRVLLLTSLNTPRLELLVDRYLQDGRVLSPRAVSQLEDLTP 243
Query: 310 P 310
P
Sbjct: 244 P 244
>gi|310800465|gb|EFQ35358.1| hypothetical protein GLRG_10502 [Glomerella graminicola M1.001]
Length = 416
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 77/256 (30%), Positives = 126/256 (49%), Gaps = 11/256 (4%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVS 125
+ F P GYP SV YL Y F +LQ F S +LS +++L G+ + AT ++
Sbjct: 7 DAFLPIGYPDSVTPDYLGYQTFDSLQAFFSTITGLLSNRAILQGLGVGDPNSSATYALLL 66
Query: 126 WILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMA 183
ILKDG+ V ++ + G ++ E KR+R LAD D L + SP P +
Sbjct: 67 TILKDGISRVATIVFAYRFGLVIEPECKRYRFLADIFNDSAFFLNLFSPYFDPWTKVGAL 126
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
L + VAA A++ + FA+ NLS+L AK + T +VGL G + V
Sbjct: 127 VLAEALGAMCGVAAGASKAALSKHFARRNNLSELNAKEASQETAVGLVGLLVGSVVVRFV 186
Query: 244 CSSMQGKMIVGPLLS---VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
+G+ V L++ +H+ +R + TLN QR ++ + +T ++ +PA
Sbjct: 187 ----EGREAVFALMTFLVFVHLGMNYLGVRCVQLATLNQQRATILFEHYARTKQVLTPAQ 242
Query: 301 LRYREDLLFPGRLIED 316
+ RE++LF +I++
Sbjct: 243 VASRENILFWSPVIKN 258
>gi|223974077|gb|ACN31226.1| unknown [Zea mays]
Length = 207
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 87/164 (53%), Gaps = 3/164 (1%)
Query: 67 LNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVV 124
L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+ A
Sbjct: 2 LGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGATF 61
Query: 125 SWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
W L+D +G ++ + G+ +DS K WR++AD + DLG +++LSPL P + +
Sbjct: 62 QWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIM 121
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTL 227
LG+ ++ VA+ ATR + FA N +D+ AK + TL
Sbjct: 122 CLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETL 165
>gi|302755442|ref|XP_002961145.1| hypothetical protein SELMODRAFT_402795 [Selaginella moellendorffii]
gi|300172084|gb|EFJ38684.1| hypothetical protein SELMODRAFT_402795 [Selaginella moellendorffii]
Length = 343
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 112/242 (46%), Gaps = 31/242 (12%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
F P GYP SV+ YL + + LQ ++ S+LSTQ+LL G+ A W
Sbjct: 54 FLPEGYPSSVSSDYLAFQTWDTLQGLSTYIRSMLSTQALLSGIGVGAATATAIGATFQWF 113
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L+D VG ++ + G+ +D+ K+WR+ AD + D
Sbjct: 114 LRDLTGMVGGIVFTLYQGSNLDNRAKQWRLAADFMND----------------------- 150
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
IA VA ATR + FA++ N +D+ AK + T +VG+ G+ LA ++
Sbjct: 151 -----IAGVAGGATRAALTQHFARKQNAADVSAKEGSQETAATLVGMIVGMFLARLTANN 205
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ H+Y+ + + A + ++N +R A+++ F+K G+ SP + +E
Sbjct: 206 IFLMWTSFMSLTAFHMYANYKAVCALCLTSINAERMAIVLQSFLKDGKAPSPEEASAQES 265
Query: 307 LL 308
L
Sbjct: 266 AL 267
>gi|428170130|gb|EKX39058.1| hypothetical protein GUITHDRAFT_114933 [Guillardia theta CCMP2712]
Length = 405
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 76/265 (28%), Positives = 109/265 (41%), Gaps = 59/265 (22%)
Query: 50 KVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL 109
K V + V +V SFL P V+E Y +T +R LQ S+ +SV TQSLL
Sbjct: 82 KFVRKASNVAFKVKQSFL--------PRGVSEDYYSFTIWRILQRIVSSTVSVFGTQSLL 133
Query: 110 FAAGLRPTPAQATVVS-WILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTG 167
A G++ S W+ + GK+I + G DS+ KRWR
Sbjct: 134 LALGVKTNKVGVAAASGWVFANAFGKFGKMIFAARWGRDFDSDAKRWR------------ 181
Query: 168 LEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFA-KEGNLSDLFAKGEAIST 226
ATR Y FA + NL D+ AKG+A
Sbjct: 182 ------------------------------SATRGTFYKQFAGRNENLGDITAKGDAQVA 211
Query: 227 LFNVVGLGAGIQ---LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
+ ++VGL GI+ LA T + G +V LS + +++ + LN +R
Sbjct: 212 VADLVGLVVGIELARLAGTNALCVWGSYVV---LSALDFIFILKSLDTIVFRFLNLERAT 268
Query: 284 MIVADFVKTGRISSPADLRYREDLL 308
M+ + FV G I SPAD+ E +L
Sbjct: 269 MLASAFVSEGIIKSPADISRSEPVL 293
>gi|317037751|ref|XP_001399065.2| hypothetical protein ANI_1_1452164 [Aspergillus niger CBS 513.88]
Length = 525
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 63/239 (26%), Positives = 123/239 (51%), Gaps = 12/239 (5%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ- 120
V D + F P+GYP SV + Y Y F +LQ F+S+ +LS++++L G+ A
Sbjct: 68 VQDFLVEVFLPAGYPHSVTDDYAPYQLFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASP 127
Query: 121 -ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ- 177
A ++ IL+D + ++ ++ +G ++ E K +R+ AD DL L+ LSP+ P
Sbjct: 128 TAALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMILDCLSPMIPAG 187
Query: 178 ----LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
L AG+ + + VA +++ + + F++ GNL+++ AK + T+ +++G+
Sbjct: 188 APRVTVLSTAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGM 244
Query: 234 GAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
G + S V ++ I +L +H+ +R+ + +LN QR ++ + + T
Sbjct: 245 LVGSFVVSRV-TNFSTTWICLLMLQALHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNT 302
>gi|297740212|emb|CBI30394.3| unnamed protein product [Vitis vinifera]
Length = 462
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 117/264 (44%), Gaps = 29/264 (10%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWIL-- 128
P+G+P SV++ YL Y + + T+ L T SLL +A+ SW
Sbjct: 81 LPAGFPGSVSDDYLEYMLLQFPTNVTAWICHTLVTSSLL----------KASCWSWFFFS 130
Query: 129 --------------KDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP 173
+DG+ VG+L I G D +PK+WR+ AD + G+ ++ +
Sbjct: 131 NHCCCFCCCFCCCHQDGIGAVGRLFIGGQFGNLFDDDPKQWRMYADLIGSAGSIFDLSTQ 190
Query: 174 LCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
L P FL++A LGN AK +A + I + FA GNL ++ AK E ++GL
Sbjct: 191 LYPAYFLQLASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGL 250
Query: 234 GAGIQLAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
GI + ++ V +S + ++H++ + + ++N +R ++V V
Sbjct: 251 ALGIMVLTSPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVL 310
Query: 292 TGRISSPADLRYREDLLFPGRLIE 315
+ D +E +L R ++
Sbjct: 311 HSIVPGYVDCNRKEKILSWQRFLK 334
>gi|218187487|gb|EEC69914.1| hypothetical protein OsI_00335 [Oryza sativa Indica Group]
Length = 386
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 90/184 (48%), Gaps = 19/184 (10%)
Query: 53 DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
+D+ R + F+ P+G+P SV++ YL Y ++ + T L T +LL A
Sbjct: 57 EDANSSIPRAIKDFV---LPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAV 113
Query: 113 GL-----RPTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGT 166
G+ A A + W+ KDG+ G+L I G D +PK+WR+ AD + G+
Sbjct: 114 GVGSFTGTSAAASAAAIRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGS 173
Query: 167 GLEVLSPLCPQLFLEMAGLGNFAKGI------AVVAARATRLP----IYSSFAKEGNLSD 216
E+++PL P FL +A LGN AK I R R P I + FA+ GNL +
Sbjct: 174 IFELITPLYPGYFLPLASLGNLAKKIIDSGLLKQAVGRGFRDPSFRVIQNHFAESGNLGE 233
Query: 217 LFAK 220
+ AK
Sbjct: 234 VAAK 237
>gi|403412621|emb|CCL99321.1| predicted protein [Fibroporia radiculosa]
Length = 4586
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 78/295 (26%), Positives = 136/295 (46%), Gaps = 26/295 (8%)
Query: 22 QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLN----KFFPSGYPF 77
+E + W E+ +T+ + + G L + + + + +V F + F P+ YP
Sbjct: 68 RECHVSWTESG--ITKEWH-EMSGALQERGEEQTTSTTNSIVGKFTSWLRQMFLPTNYPQ 124
Query: 78 SVNEGYLRYTQFRALQHFTS---AALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGM 132
SV+ YLR F ALQ F + +SVL Q+LL + G+ + A V WI+KDG
Sbjct: 125 SVHPSYLR---FHALQFFETTFGTVVSVLCNQALLTSVGVSAEGSIFGAVAVQWIIKDGA 181
Query: 133 QHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL---CPQLFLEMAGLGNF 188
V KL DS PK + + + + +G+ L++ + L P FL A GN
Sbjct: 182 GEVAKLFFIRRFSPYFDSHPKSFTLFGEGIVAIGSCLQMATLLITPTPGNFLLCAAGGNI 241
Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
K + T + F+ +GN+ D+ AK E+ +++ + G AGI L TV S
Sbjct: 242 FKLVGYAVWFTTHIKWVRYFSLQGNVGDVAAKDESQTSIAQLCGYAAGIGLL-TVSHSPA 300
Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK------TGRISS 297
+ L+ +H+ + + MR A L R +++ ++ + TG++++
Sbjct: 301 YLYSLFFALTPLHLTATMFMMRDAKFEALTLPRLSLLAREYAREDGEAATGKVNT 355
>gi|317143185|ref|XP_001819302.2| hypothetical protein AOR_1_114154 [Aspergillus oryzae RIB40]
Length = 510
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 61/231 (26%), Positives = 122/231 (52%), Gaps = 12/231 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P+GYP SV++ Y+ Y F +LQ F+S+ +LS++++L G+ A A ++ I
Sbjct: 75 FLPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHI 134
Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF-----LE 181
L+D + ++ ++ +G ++ E K +R+ AD D L+ LSP+ P F L
Sbjct: 135 LQDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLS 194
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
AG+ + + VA +++ + + F++ GNL+++ AK + T+ +++G+ G + S
Sbjct: 195 TAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLVGSFVVS 251
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
V +S I +L +H+ +R+ + +LN QR ++ + + +
Sbjct: 252 RV-TSYTATWISLVMLLTMHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNS 301
>gi|449542984|gb|EMD33961.1| hypothetical protein CERSUDRAFT_117486 [Ceriporiopsis subvermispora
B]
Length = 505
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 117/280 (41%), Gaps = 7/280 (2%)
Query: 22 QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNE 81
+E + W E S + + + S R+ F F P+ YP SV+
Sbjct: 70 RECHVSWSEDSGVEKEWRELSQPTEAEERAISSSGTFVGRLSAWFRQMFLPTNYPQSVHR 129
Query: 82 GYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKL- 138
Y+ + + ++ +SVL Q+LL + G+ + A V WI+KDG V KL
Sbjct: 130 SYMPFHVLQFVEGTLGTLVSVLCNQALLTSVGVSAEGSIFGAVAVQWIIKDGAGEVAKLF 189
Query: 139 ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL---CPQLFLEMAGLGNFAKGIAVV 195
DS PK + + + L LG+GL++ + L P FL A GN K I
Sbjct: 190 FIRKFSPYFDSHPKTFTLSGEVLVALGSGLQMATLLVNPTPLSFLICAAGGNAFKLIGYA 249
Query: 196 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGP 255
T + F+++GN D+ AK E+ +++ + G AGI L T+ +
Sbjct: 250 IWFTTHIKFIRYFSQQGNTGDVAAKDESQASVAQLAGYAAGIGLL-TLSHDPSYLYTIFA 308
Query: 256 LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
L H+ + MR A L R +++ + + G +
Sbjct: 309 LAVPTHLVATTLMMRCATFELLTIPRLSLLAETYSRKGDV 348
>gi|7327810|emb|CAB82267.1| putative protein [Arabidopsis thaliana]
Length = 447
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 58/199 (29%), Positives = 98/199 (49%), Gaps = 11/199 (5%)
Query: 124 VSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+ W+ KDG+ +G+L I G+ D +PK+WR+ AD + G+ ++ + L P FL +
Sbjct: 120 IRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFLLL 179
Query: 183 AGLGNFAKGIAVVAARATRLP----IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
A GN AK + AR R P I + FA GNL ++ AK E ++GLG GI
Sbjct: 180 ASTGNLAKAV----ARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFGIL 235
Query: 239 LAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
+ T + S ++ + ++H++ + + NT+N +R +IV V +
Sbjct: 236 IIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHSVVP 295
Query: 297 SPADLRYREDLLFPGRLIE 315
D RE++L R ++
Sbjct: 296 GYVDCNKRENILLWQRFMK 314
>gi|452842814|gb|EME44750.1| hypothetical protein DOTSEDRAFT_130464 [Dothistroma septosporum
NZE10]
Length = 447
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/266 (26%), Positives = 135/266 (50%), Gaps = 12/266 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
F P G+P SV YL Y + +LQ F+S+ S+L+++++L + G+ +P A ++S
Sbjct: 47 FMPVGFPNSVTHDYLEYQIYDSLQAFSSSIASLLASRAVLSSVGVGDADASPTAAILLSI 106
Query: 127 ILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGL 185
+ L LG + E K +R+LAD L D G L+++S P+ + + +
Sbjct: 107 FQTSAGRLATILFAHRLGTSLGPECKMYRLLADILNDTGFLLDMISVAFPKPISVIILSF 166
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
+ + + VAA + + + + FA+ GNL++L AK + T+ +++G+ AG + S +
Sbjct: 167 SSILRSLCGVAAGSAKASLSAHFARWGNLAELNAKDSSQETVISLLGMWAGSVVISYITE 226
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
+ LL+ IH+ + +RA + TLN QR A ++ ++ G + +P ++ +E
Sbjct: 227 PTAAWTTLIVLLA-IHLETNRRAVRAVTMQTLNRQR-ATLMFHHLQRGHVPTPVEIASQE 284
Query: 306 DLLFPGRLIEDAGNVKMDRHGIRAEC 331
R+ E G ++ + I C
Sbjct: 285 ------RIFERDGVLRGNDDRILGHC 304
>gi|380480425|emb|CCF42443.1| hypothetical protein CH063_12442 [Colletotrichum higginsianum]
Length = 484
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/289 (24%), Positives = 131/289 (45%), Gaps = 25/289 (8%)
Query: 40 QFQPDGHLSVKVVDDSRPVYHRVVD------------SFLNKFFPSGYPFSVNEGYLRYT 87
++ G+L K V+ + + VD L F P GYP SV E Y Y
Sbjct: 12 EYDKAGNLKAKYVESASRIGPSRVDVLLPFKRKKPWLRLLEIFLPDGYPHSVTEDYAAYQ 71
Query: 88 QFRALQHFTSAALSVLSTQSLLFAAGLR---PTPAQATVVSWILKDGMQHVGKLICSNL- 143
+ ++Q F + ++S++++ G+ +P A +V ++++ + ++L
Sbjct: 72 IYDSVQAFAGSIAGMISSRAVWEGLGVGDSLASPTGAMLVQ-VIRESTGRFATIAFAHLF 130
Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ----LFLEMAGLGNFAKGIAVVAARA 199
G +++E K WR+ AD L D+ L+ LSP P L L + L A G VA A
Sbjct: 131 GTSIEAECKAWRLAADVLCDVAMVLDCLSPFFPHGVRFLVLCFSSLLYSASG---VAGNA 187
Query: 200 TRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSV 259
++ + FAK NL +L AK + T +++G+ +G + S + + + L
Sbjct: 188 SKSSLSGHFAKWNNLGELNAKDASQETAISLIGMISGTFVVSLL-TDQRATWATLIFLLA 246
Query: 260 IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
+H++ + +RA +LN QR + + G++ +P + RE +
Sbjct: 247 LHLFLNWKGVRAVKSRSLNRQRANIAFSALFSKGQVLTPHHVAERERIF 295
>gi|378729487|gb|EHY55946.1| hypothetical protein HMPREF1120_04055 [Exophiala dermatitidis
NIH/UT8656]
Length = 485
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 62/230 (26%), Positives = 116/230 (50%), Gaps = 11/230 (4%)
Query: 61 RVVDSFLNK----FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRP 116
R S LNK F P+GYP +V Y Y + +LQ F S +L+++++ G+
Sbjct: 45 RTGTSILNKCMSPFLPAGYPSTVTPDYTPYQIYDSLQAFASTIAGLLASRAVFVGMGVGS 104
Query: 117 TPAQATVVSWILKDGMQHVGK----LICSNLGARMDSEPKRWRILADALYDLGTGLEVLS 172
A + + + +L + VG+ L R+++E K +R AD + D+ L+ +S
Sbjct: 105 EEA-SVLTTMLLYIAQETVGRVATILFAHQFSQRIEAEVKFYRFFADIVNDVAFVLDCMS 163
Query: 173 PLCPQL-FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
P P L + L N + + VA +++ + S FAK GN+ +L AK + T+ +++
Sbjct: 164 PAMPILGRVVTLCLSNACRAVCGVAGGSSKAILSSHFAKAGNIGELNAKDGSQETVISLL 223
Query: 232 GLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
G+ G + S V ++ + PLL+ +H+++ + +R+ +N+ N R
Sbjct: 224 GMWVGGVVVSKVHGTLATWSWLVPLLA-LHLWANWKAVRSVRLNSFNTNR 272
>gi|402582701|gb|EJW76646.1| hypothetical protein WUBG_12446, partial [Wuchereria bancrofti]
Length = 184
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 81/138 (58%), Gaps = 3/138 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
F P GYP SV+ Y+ Y + +Q F S+ S LST+++L G+ + A V+W+
Sbjct: 43 FMPRGYPQSVSPDYMNYQIWDTIQAFASSMSSALSTEAILRGVGVGNKAASTMAAAVAWL 102
Query: 128 LKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
LKDG+ + +++ + L + +D++ K+WR++AD DL L++++P+ P LF+ + L
Sbjct: 103 LKDGIGMLARILFAWLYSPYLDADCKQWRLIADCFNDLAYCLDLITPIFPNLFMPIICLS 162
Query: 187 NFAKGIAVVAARATRLPI 204
+ + + VA ATR +
Sbjct: 163 SMIRAVVGVAGSATRTTV 180
>gi|170102118|ref|XP_001882275.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164642647|gb|EDR06902.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 285
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 77/285 (27%), Positives = 140/285 (49%), Gaps = 25/285 (8%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ RV+D F P+GYP SV+ YLRY ALQ F ++ +LS++++L G+
Sbjct: 1 HRRVLDLISKVFLPAGYPDSVSPDYLRYQILNALQAFCNSLAGLLSSRAILEGFGVGDPS 60
Query: 119 AQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A AT ++ +L+D + ++ ++ LG+ + + K++R+LAD L D L+ +SP
Sbjct: 61 ATATNALLLTVLQDVFSRLTTIVAAHILGSSLAPDAKKYRLLADMLNDAAVILDTISPRL 120
Query: 176 PQLF---LEMAGLGNFA--KGIAVVAARATRLPIYSSFAK----EGNLSDLFAKGEAIST 226
LF L +A L A + + ++A ++ I FA G++ DL AK + T
Sbjct: 121 DTLFFPGLRVAALCLSALFRSLCAISAGGSKASISLHFATPLKGTGDVGDLNAKDASKET 180
Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEM--RAAPVNTLNPQRTAM 284
+ ++G+ G + + + I LL ++ ++ I + R + TLN QRT +
Sbjct: 181 VLALLGMLLGTLIVPHLTTPWT---IYATLLVLVGLHLTINYIGVRGLVLRTLNGQRTWI 237
Query: 285 IVADFVKTG--RISSPADLRYREDLLFPGRLIEDAGNVKMDRHGI 327
+ +++T R +P ++ E R+ E G + R G+
Sbjct: 238 AWSAYMRTQPRRAPTPVEVASLE------RIFERPGGFRDPRRGV 276
>gi|358373469|dbj|GAA90067.1| DUF647 domain protein [Aspergillus kawachii IFO 4308]
Length = 527
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 64/239 (26%), Positives = 124/239 (51%), Gaps = 12/239 (5%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ- 120
V+D + F P+GYP SV + Y Y F +LQ F+S+ +LS++++L G+ A
Sbjct: 68 VLDFLVEVFLPAGYPHSVTDDYAPYQLFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASP 127
Query: 121 -ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ- 177
A ++ IL+D + ++ ++ +G ++ E K +R+ AD DL L+ LSP+ P
Sbjct: 128 TAALLLHILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMILDCLSPMIPAG 187
Query: 178 ----LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
L AG+ + + VA +++ + + F++ GNL+++ AK + T+ +++G+
Sbjct: 188 APRVTVLSTAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGM 244
Query: 234 GAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
G + S V +S I +L +H+ +R+ + +LN QR ++ + + T
Sbjct: 245 LVGSFVVSRV-TSFSTTWICLLMLLALHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNT 302
>gi|224090990|ref|XP_002309136.1| predicted protein [Populus trichocarpa]
gi|222855112|gb|EEE92659.1| predicted protein [Populus trichocarpa]
Length = 500
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 67/253 (26%), Positives = 116/253 (45%), Gaps = 22/253 (8%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVS 125
P+G+P SV++ YL+Y + + T L T SLL A G A A +
Sbjct: 107 LPAGFPGSVSDDYLQYMVLQFPTNITGWICHTLVTSSLLKAVGAGSFTGTDAAASAAAIR 166
Query: 126 WILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
W+ KDG+ +G+L I G D +PK+WR+ AD + G+ ++ + + P FL +A
Sbjct: 167 WVSKDGIGALGRLFIGGRFGDLFDDDPKQWRMYADFIGSAGSIFDLTTQVYPAYFLPLAS 226
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
LGN K +A + I + FA GNL ++ AK E V +GA + +
Sbjct: 227 LGNLTKAVARGLKDPSFRVIQNHFAVSGNLGEVAAKEE-------VWEVGAQLLGLALGI 279
Query: 245 SSMQGKMIVG--PLLS-------VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ +V PLL+ ++H++ + + +T+N +R +++ + R+
Sbjct: 280 LILDTPDLVKSFPLLTSTWMSMRLLHLWLRYQSLSVLRFDTINLKRARILIKSHILQSRV 339
Query: 296 SSPADLRYREDLL 308
D E++L
Sbjct: 340 LGCVDCNQDENIL 352
>gi|409048331|gb|EKM57809.1| hypothetical protein PHACADRAFT_251666 [Phanerochaete carnosa
HHB-10118-sp]
Length = 532
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 67/244 (27%), Positives = 107/244 (43%), Gaps = 9/244 (3%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA- 119
R+ F P+ YP SV+ YL++ + ++ + VL Q+LL + G+ +
Sbjct: 105 RLTAWLRQMFLPTNYPQSVHPSYLKFHLLQFMECIVGTVVGVLCNQALLVSVGVSAEGSI 164
Query: 120 -QATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A V W++KDG V KL + DS PK + + LG+GL++ + + P
Sbjct: 165 LGAVAVQWVIKDGAGEVAKLWVIRRFSPYFDSHPKTFTFTGATMGLLGSGLQIAALVVPP 224
Query: 178 L---FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
FL A GN K + T + FA++GN+ D+ AK E+ ++ + G
Sbjct: 225 TTLNFLLCAAGGNIFKCVGGAVWLTTHIKFVRFFARQGNMGDVAAKSESQGSVGILSGYA 284
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
AG+ L TV S + L IH+ +RAA L R + + +F R
Sbjct: 285 AGVGLL-TVSHSAPYLYTIFALAIPIHLLITQWMLRAATFELLTLPRLSFLAREFAS--R 341
Query: 295 ISSP 298
S P
Sbjct: 342 ASRP 345
>gi|393227224|gb|EJD34913.1| DUF647-domain-containing protein [Auricularia delicata TFB-10046
SS5]
Length = 377
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/235 (26%), Positives = 116/235 (49%), Gaps = 7/235 (2%)
Query: 89 FRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLICS-NLGA 145
F +LQ F S +L+ +++L G+ A AT V+ I++D V ++ + LG
Sbjct: 2 FDSLQAFFSEIAGLLAARAVLTGLGVGDAGATATGAVLLKIIQDTCGRVATILSAWKLGP 61
Query: 146 RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF-LEMAGLGNFAKGIAVVAARATRLPI 204
++ E K+WR+ AD D G LE LSP P + + + + + V+A + +
Sbjct: 62 MLEPECKQWRLAADLFNDAGIILECLSPTLPHTMKISLLCVARVVRSLCGVSAGGAKAAL 121
Query: 205 YSSFAK-EGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVY 263
F G+++D+ AK + T+ ++G+ G + + +S +G L +H+Y
Sbjct: 122 SQHFVGPGGSIADINAKESSQETVVALIGMLVGSMIVPRI-TSFEGTWTALLFLIGVHLY 180
Query: 264 SVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF-PGRLIEDA 317
+ +RA + TLN QR +++ + ++ GR+ SP +R RE + RL+ D
Sbjct: 181 ANWRGVRAVAMLTLNRQRATIVLGELIQRGRVLSPEQVRQRESVFCRSSRLVLDG 235
>gi|164423593|ref|XP_001728067.1| hypothetical protein NCU11380 [Neurospora crassa OR74A]
gi|157070159|gb|EDO64976.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 357
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 77/288 (26%), Positives = 136/288 (47%), Gaps = 37/288 (12%)
Query: 39 YQFQPDGHLSVKVVDDSRPVY--HRV----VDSFLNKFFPSGYPFSVNEGYLRYTQFRAL 92
Y P G++ +SR H+ + + LN F P+GYP SV YL Y + +L
Sbjct: 6 YDIDPAGNVLCTYNGESRESLKAHQTKVAPLHALLNAFLPAGYPHSVTPDYLPYQTYDSL 65
Query: 93 QHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSN-LGARMD 148
Q F S+ S+L+ +++L G+ +P A ++ I D + ++ ++ LG ++
Sbjct: 66 QAFFSSISSLLANRAVLEGLGVGDASSSPTAALILK-ITSDTISRFATILFAHRLGTAIE 124
Query: 149 SEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAV------------VA 196
E K +R LAD L D L++L P+ M G + + V VA
Sbjct: 125 PECKFFRFLADVLNDAAQLLDLLMPVM------MMGETRYGLKLGVVVGSGVLRSWCGVA 178
Query: 197 ARATRLPIYSSFAKE-GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGP 255
A A++ + + F + +L +L AK + T+ +++G+ G S V +Q +++V
Sbjct: 179 AGASKASLSAHFVTQRSSLGELNAKEASQETVVSLLGMLVG----SLVVKVVQDRLVVWV 234
Query: 256 L---LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
L L +H+ +R + TLN QR M+V +++TGR+ P +
Sbjct: 235 LMVGLLGVHLLMNYRGVRCVRLRTLNRQRATMVVRSWLETGRVLGPEE 282
>gi|328773288|gb|EGF83325.1| hypothetical protein BATDEDRAFT_85860 [Batrachochytrium
dendrobatidis JAM81]
Length = 434
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 69/261 (26%), Positives = 121/261 (46%), Gaps = 14/261 (5%)
Query: 41 FQPDGHLSVKVVDDSRPVYHRVVDS-FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAA 99
+ D H S D S + V+ F N F P GYP SV+ Y + + ++ +A
Sbjct: 10 YASDRHPSTTRSDQSHIRFLGHVNQWFRNMFLPVGYPSSVHACYKKVHMWLFAENVAGSA 69
Query: 100 LSVLSTQSLLFAAG----LRPTPAQATVVSWILKDGMQHVGK-LICSNLGARMDSEPKRW 154
+SVL+ Q++L + G T A A V W+LKDG+ +GK L+ + D+ PKRW
Sbjct: 70 ISVLTAQAMLTSIGASTAAHETAALAIAVDWVLKDGIGELGKMLMIQRFAHQFDTHPKRW 129
Query: 155 RILADALYDLGTGLEVLSPLC-PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG- 212
++ +A G +++ + + P FL A +G + + AT + A
Sbjct: 130 KLYGEACSISGALMQLATCITNPHHFLFFASIGVGLRSMHYSIWAATHTTFTRNMATHNG 189
Query: 213 -NLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVG--PLLSVIHVYSVIEEM 269
N+ D+ AK ++ +L +++G+ +GI + + S Q + +LS I + +
Sbjct: 190 VNVGDIVAKADSQLSLAHLLGMVSGIGM---LAVSFQPAALFAWFSVLSAIQIVCTGLLL 246
Query: 270 RAAPVNTLNPQRTAMIVADFV 290
R A L+ R +I +F+
Sbjct: 247 REARFEVLDQYRLVLIPREFM 267
>gi|397627073|gb|EJK68340.1| hypothetical protein THAOC_10484 [Thalassiosira oceanica]
Length = 478
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 108/235 (45%), Gaps = 22/235 (9%)
Query: 82 GYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLI 139
GYL Y LQ T+A SVL+TQ +L G+ A A + ++++DG + L+
Sbjct: 92 GYLEYVCMDNLQDLTTALRSVLATQRVLEGVGVGREGATALAATLDFLIRDGCGMIASLV 151
Query: 140 -CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP-QLFLEMAGLGNFAKGIAVVAA 197
S + + KRW++ ADA D G LEV++P P + FL + N K + VAA
Sbjct: 152 LTSGFSSTFRRDIKRWKLFADAAVDAGITLEVVAPSLPSRWFLPVLCGANVCKALCGVAA 211
Query: 198 RATRLPIYSSFAK-----EGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMI 252
A P+ +A + LS++ AK A T+ GL L+ M G+
Sbjct: 212 GAVCGPVQLHWAARLGLGDEGLSEVAAKAGAQRTIVGATGL----VLSGLFARWMGGRGA 267
Query: 253 VGP-------LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
P +L+ +H+ + MR +N LN +R +I+ DF+ GR D
Sbjct: 268 NRPVWVGMYAVLTALHLLANYRSMRIVRLNWLNRERMDLILEDFL--GRDGGSGD 320
>gi|303280291|ref|XP_003059438.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459274|gb|EEH56570.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 568
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 79/161 (49%), Gaps = 7/161 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ---ATVVSW 126
F P+GYP +V+ Y + ++ L VL++QSLL A G+ TP A W
Sbjct: 150 FLPAGYPDTVSSDYATFIRWHLASLMFRNVLEVLTSQSLLVALGMGSTPGALPLAAATKW 209
Query: 127 ILKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
+LKDG+ L+ G + D +PKRW + ++L D+ LE+++P P LFL +A
Sbjct: 210 VLKDGVGSFATLLAGVFGGQKYDEDPKRWWAVTNSLEDVARVLELVTPAYPALFLPLAAS 269
Query: 186 GNFAKGIAVVAARATRL--PIYSSFAKEGNLSDLFAKGEAI 224
F + A+ R + + + N D+ AK EA+
Sbjct: 270 ATFVRAGALT-GRGSLINGTFMQHLGRNQNFGDVRAKMEAL 309
>gi|315056527|ref|XP_003177638.1| DUF647 domain-containing protein [Arthroderma gypseum CBS 118893]
gi|311339484|gb|EFQ98686.1| DUF647 domain-containing protein [Arthroderma gypseum CBS 118893]
Length = 495
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 129/268 (48%), Gaps = 18/268 (6%)
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQ-FRALQHFTSAALSVLSTQSLLFAAGL 114
+P + S LN F P GYP SV + Y+ + FR + +L T S+L
Sbjct: 48 KPSSSSYLTSILNVFLPVGYPHSVTDDYIESSSLFRDYEKKELTSLISGLTSSILKLNCW 107
Query: 115 RP-------------TPAQATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADA 160
P +P A +++ +L++ M + ++ ++ LG ++ E K +R+ AD
Sbjct: 108 SPVIQSSVGVGDATASPTAALLLN-VLQESMGRIATILFAHRLGTSLEPECKMYRLAADL 166
Query: 161 LYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFA 219
L D L+ +SP P+ + + + L + + + VAA +++ + + FA+ GNL +L A
Sbjct: 167 LNDSAIVLDCMSPAFPKPVRVGLLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNA 226
Query: 220 KGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNP 279
K + T+ ++ G+ G + S + S + I LL ++H+ + +RA + TLN
Sbjct: 227 KDSSQETVISLFGMLVGSVVVSYISSPL-ATWIALILLLIVHLGTNHAAVRAVKMTTLNR 285
Query: 280 QRTAMIVADFVKTGRISSPADLRYREDL 307
QR ++ + + R+ +PA E +
Sbjct: 286 QRANIVFSYLFEDDRVLTPAQASEEERI 313
>gi|390599626|gb|EIN09022.1| DUF647-domain-containing protein [Punctularia strigosozonata
HHB-11173 SS5]
Length = 461
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 80/292 (27%), Positives = 135/292 (46%), Gaps = 20/292 (6%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
SR + H + D FL P+GYP SV+ YLRY + A Q F S+ ++++++++L G+
Sbjct: 26 SRGLKHLLTDVFL----PAGYPSSVSPDYLRYQTYNAFQAFCSSLANLIASRAVLEGFGV 81
Query: 115 RPTPAQAT--VVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVL 171
A AT ++ +L+D + + LG + E K +R+LAD D L+V
Sbjct: 82 GNASASATHAMLLTVLQDVFGRLTTVAGGYYLGTALSPEAKTYRLLADVFNDASIVLDVF 141
Query: 172 SPLCPQL------FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIS 225
P + L + G + + V A ++ + FA G+L DL AK +
Sbjct: 142 QPFYASVPVAGARVLALCSSGCL-RALCGVCAGGSKAALAVHFASAGDLGDLNAKDGSRE 200
Query: 226 TLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
T+ +VGL +G L S + + + LL H+ +RA + TL QR ++
Sbjct: 201 TVIGLVGLLSGTILLSYIGTPGATYAALCALLGA-HLALNYLAVRAVAMRTLTRQRASIA 259
Query: 286 VADFVK--TGRISSPADLRYREDLLFPGRLIEDAGNVKMDR---HGIRAECF 332
+++ G +P ++ RE LL LI D ++ R G R++ F
Sbjct: 260 WRVWMERGAGAAPTPEEVAEREGLLADPTLIADGTGRRLGRCSFGGARSQGF 311
>gi|444725757|gb|ELW66311.1| hypothetical protein TREES_T100000484 [Tupaia chinensis]
Length = 406
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 61/228 (26%), Positives = 106/228 (46%), Gaps = 16/228 (7%)
Query: 83 YLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLIC 140
+ R+T R F S+ L+TQ++L G+ A A +W++KD +G+++
Sbjct: 41 WWRFT--RPFAAFASSLSGSLATQAVLLGIGVGNARASVSAATATWLVKDSTGMLGRIVF 98
Query: 141 SNLGARMDSEPKRWR-ILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARA 199
+ W+ + AD L D+ LE+++P+ P F N AK + VA A
Sbjct: 99 A-----------WWKGLFADILNDVAMFLEIMAPIYPIFFTMTICTSNLAKCVVSVAGGA 147
Query: 200 TRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSV 259
TR + A+ N++D+ AK + TL N+ GL + + V + LL+
Sbjct: 148 TRAALTMHQARRNNMADVAAKDGSQETLVNLAGLLVSLLMLPLVSDFPSVSLGCFFLLTA 207
Query: 260 IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+H+Y+ +RA + TLN R +++ F++TG + P E L
Sbjct: 208 LHIYANYRAVRALVMETLNEGRLRLVLKHFLRTGNVLDPTSANQMEPL 255
>gi|67539168|ref|XP_663358.1| hypothetical protein AN5754.2 [Aspergillus nidulans FGSC A4]
gi|40743657|gb|EAA62847.1| hypothetical protein AN5754.2 [Aspergillus nidulans FGSC A4]
Length = 559
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 73/271 (26%), Positives = 136/271 (50%), Gaps = 26/271 (9%)
Query: 64 DSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQ 120
D + F P+GYP SV++ Y Y F +LQ F+S+ +L+++++L G+ +P
Sbjct: 67 DVLIEVFLPAGYPHSVSDDYTAYQIFDSLQAFSSSIAGLLASRAVLQGVGVGNANASPTS 126
Query: 121 ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
A ++ IL+D + ++ ++ +G ++ E K +R AD DL L+ SP+ P F
Sbjct: 127 ALLLH-ILQDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCFSPMVPAGF 185
Query: 180 -----LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
L AG+ + + VA +++ + + FA+ GNL+++ A+ T+ +++G+
Sbjct: 186 NRVTVLSAAGV---LRALCGVAGGSSKASLSAHFARWGNLAEVNAE-----TVISLIGML 237
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
G + S + S + + LLS +H+ +R+ + TLN QR A+ V +
Sbjct: 238 VGSFVVSHITSFTATWLTLVFLLS-MHLSLNYAAVRSVQMTTLNRQR-----ANIVFSTL 291
Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKMDRH 325
+SS DL + + P R E ++K H
Sbjct: 292 LSSDPDL--ADFVSSPSRKDERRASLKQQEH 320
>gi|452984686|gb|EME84443.1| hypothetical protein MYCFIDRAFT_187449 [Pseudocercospora fijiensis
CIRAD86]
Length = 435
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/272 (30%), Positives = 139/272 (51%), Gaps = 30/272 (11%)
Query: 19 MSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFS 78
M+ Q + I ++ +D+VT Y Q +G D + R VD FL P G+P S
Sbjct: 1 MADQPLRITEYDEADTVTATY-VQSEGKSQGLSRVDVITAWQRAVDVFL----PVGFPHS 55
Query: 79 VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHV 135
V+E YL Y + +LQ F+S+ S+LS++++L + G+ +P A ++S IL+D + +
Sbjct: 56 VSEDYLEYQIYDSLQAFSSSIASLLSSRAVLSSVGVGDSTASPTAAIILS-ILQDSVGRL 114
Query: 136 GK-LICSNLGARMDSEPKRWRILADAL-YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIA 193
L + G ++ E K +R+LAD + + + VL LC
Sbjct: 115 ATILFAHSFGTSLEPECKMYRLLADPIRVVVLSLSSVLRSLCG----------------- 157
Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIV 253
VAA +++ + + FAK+GNL +L AK + T+ ++VG+ AG + S + + + +
Sbjct: 158 -VAAGSSKASLSAHFAKQGNLGELNAKDSSQETVISLVGMMAGSLVISWITTPIATWTAL 216
Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
LLS IH+ + +RA + TL QR A+I
Sbjct: 217 ILLLS-IHLETNRRAVRAVKMRTLTRQRAALI 247
>gi|412993086|emb|CCO16619.1| predicted protein [Bathycoccus prasinos]
Length = 454
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/190 (30%), Positives = 89/190 (46%), Gaps = 15/190 (7%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP-----AQATVV 124
F P G+P +V+ YL + +++ L F L VLS QSLL A G+ A
Sbjct: 40 FLPDGFPNTVSRDYLPWLKWQMLSLFFRDVLEVLSAQSLLVAVGMDVNQANAAAPVAGAA 99
Query: 125 SWILKDGMQHVGKLICSNLGA----RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
W+LKDG G L LGA + D PK + A++L D+ +E+L+PL P FL
Sbjct: 100 KWVLKDG---TGGLATLALGALNTKKFDENPKFFWSTANSLEDVSRAMELLTPLFPSHFL 156
Query: 181 EMAGLGNFAKGIAVVAARATRL--PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
+AG F + A + R++ + + F++ N D+ K E + L GI
Sbjct: 157 LIAGTATFVRS-AALTGRSSLINGTFMTHFSRNENAGDIRTKLEVQGRWLQIAALPVGIA 215
Query: 239 LASTVCSSMQ 248
L V + ++
Sbjct: 216 LFRAVTNGLE 225
>gi|440794597|gb|ELR15757.1| hypothetical protein ACA1_379390, partial [Acanthamoeba castellanii
str. Neff]
Length = 449
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 93/218 (42%), Gaps = 30/218 (13%)
Query: 63 VDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
VD L F P+GYP SV + Y+ + ++ + S+A V L G
Sbjct: 69 VDLMLKSLFLPTGYPASVRQDYIHFQKWEVAKGIVSSAGFV----GHLVGLGCL------ 118
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
GM L+ G D+E KR R AD ++ G LE+ +PL P FL
Sbjct: 119 ---------GMM----LVAGAFGKFFDTETKRIRWAADCIHVAGVALELATPLFPAYFLP 165
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+A L N AKGIA + AT+ I FA NL D+ AKG + + ++G+G GI
Sbjct: 166 LASLANSAKGIAGLTTGATKAAINQGFALRDNLGDITAKGHSQGIVAYLMGMGIGIGTTY 225
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNP 279
++G L +V V + + P T P
Sbjct: 226 VTGG------VLGALFAVYGVLAGGSTSSSGPFTTAAP 257
>gi|238488016|ref|XP_002375246.1| DUF647 domain protein [Aspergillus flavus NRRL3357]
gi|220700125|gb|EED56464.1| DUF647 domain protein [Aspergillus flavus NRRL3357]
Length = 513
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 122/234 (52%), Gaps = 15/234 (6%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P+GYP SV++ Y+ Y F +LQ F+S+ +LS++++L G+ A A ++ I
Sbjct: 75 FLPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHI 134
Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF-----LE 181
L+D + ++ ++ +G ++ E K +R+ AD D L+ LSP+ P F L
Sbjct: 135 LQDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLS 194
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL---GAGIQ 238
AG+ + + VA +++ + + F++ GNL+++ AK + T+ +++G+ G
Sbjct: 195 TAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLVRFVGSF 251
Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
+ S V +S I +L +H+ +R+ + +LN QR ++ + + +
Sbjct: 252 VVSRV-TSYTATWISLVMLLTMHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNS 304
>gi|241595406|ref|XP_002404468.1| conserved hypothetical protein [Ixodes scapularis]
gi|215502351|gb|EEC11845.1| conserved hypothetical protein [Ixodes scapularis]
Length = 472
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 66/271 (24%), Positives = 119/271 (43%), Gaps = 24/271 (8%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
F F PSG+P +V+ Y Y + ++Q F S+ L+TQ++L G+ A A
Sbjct: 45 FSEIFLPSGFPDTVSRDYKEYQIWDSIQAFASSITGTLATQAVLTGVGVGDQTASVLAAT 104
Query: 124 VSWILKDGMQHVGKLI------------------C--SNLGARMDSEPKRWRILADALYD 163
+WILKDG +G+++ C SN ++ + AD L D
Sbjct: 105 TTWILKDGTGMIGRILFALVSRYLGMCCATHNKYCAHSNDSTTLNHADNYCSLFADILND 164
Query: 164 LGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
LE++S + + + AK + VA ATR + A+ N++D+ AK +
Sbjct: 165 AAILLELMSQHLKGYITLVLCISSVAKSVVGVAGGATRAALTQHQARSNNMADVSAKDGS 224
Query: 224 ISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
TL N+ + L + S + + +++H+++ + + T N R +
Sbjct: 225 QETLVNLAAFLCSLLLLRLLAGSQILVYGIFTIFTLLHIFANFRAVSCVVMETFNRSRYS 284
Query: 284 MIVADFV-KTGRISSPADLRYREDL-LFPGR 312
++V F+ + G I+S + +E + L GR
Sbjct: 285 LLVKHFLEEAGDIASVEKVNKQESVWLRCGR 315
>gi|393217163|gb|EJD02652.1| DUF647-domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 455
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 67/262 (25%), Positives = 122/262 (46%), Gaps = 12/262 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F PSG+P SV Y+ Y F +LQ F+S+ +LS++++L F G + V+ +
Sbjct: 50 FLPSGFPDSVTPDYIGYQIFDSLQAFSSSIAGLLSSRAVLEGFGVGNENASSTHAVLLTV 109
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP-QLFLEMAGL 185
L+D + ++ + G + E K +R+ AD D L+ LSP P QL + L
Sbjct: 110 LQDATGRLATILFAWKFGPALAPEAKMYRLAADIFNDSAFVLDCLSPALPTQLRIAALCL 169
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
+ I V + + FA+ GN+ +L AK + T+ + G+ G + + + +
Sbjct: 170 AGALRAICGVCGGGAKAALSVHFARTGNVGELNAKDASQETVIGLFGMLCGSFVMAHI-T 228
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV-KTGRISSPADLRYR 304
S + I L H+ + +RA + TL+ QRT + + ++ + +P+++ R
Sbjct: 229 SRRTTWIALIFLLTAHLVTNYLAVRAVTLTTLSRQRTNIAYGVYCEQSNNVLTPSEVARR 288
Query: 305 EDLLFPGRLIEDAGNVKMDRHG 326
E R+ E G ++ G
Sbjct: 289 E------RIFEVPGAIRDSLQG 304
>gi|119187997|ref|XP_001244605.1| hypothetical protein CIMG_04046 [Coccidioides immitis RS]
Length = 472
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 70/285 (24%), Positives = 130/285 (45%), Gaps = 49/285 (17%)
Query: 43 PDGHLSVKVVDDSRPVYHRVVDSF----LNKFFPSGYPFSVNEGYLRYTQF--------- 89
P + K V S P R S+ L+ F P GYP SV+ YL Y +
Sbjct: 26 PAASIGTKPVTISAPTSRRTWSSYFTAILDAFLPVGYPHSVSGDYLEYQVYVRPIFLKLH 85
Query: 90 --------------RALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGM 132
+LQ F+S+ +LS++++L G+ +P A ++S IL+D M
Sbjct: 86 WSFTDVANNRLFVQDSLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLS-ILQDSM 144
Query: 133 QHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAK 190
+ ++ ++ LG ++ E K +R+ AD D ++ LSP+ P+ + + + L + +
Sbjct: 145 GRIATILFAHQLGTSLEPECKMYRLAADIFNDSSMIMDCLSPMFPKSIRVGLLSLSSVLR 204
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
+ VAA +++ + + FAK GNL++L A G + S + S +
Sbjct: 205 ALCGVAAGSSKASLSAHFAKWGNLAELNAVGSLV---------------VSYITSPLATW 249
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ + LL ++H+ + +RA + +LN QR ++ + R+
Sbjct: 250 VALISLL-IVHITTNYLAVRAVSMTSLNRQRANIVFSTLFDENRV 293
>gi|346978906|gb|EGY22358.1| hypothetical protein VDAG_03796 [Verticillium dahliae VdLs.17]
Length = 455
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 61/263 (23%), Positives = 121/263 (46%), Gaps = 19/263 (7%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVS 125
+ F P YP SV+ Y++Y + + Q F S +L+ +++L G+ ++
Sbjct: 79 DAFLPVDYPDSVSPDYIKYQMYDSFQAFFSTITGMLANRAILEGLGVGDASGSVNHALLL 138
Query: 126 WILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
LKDG+ + + + G+ +++ K++R LAD D LE+ +P AG
Sbjct: 139 TCLKDGVSRLATITFAYRFGSVIEANAKKYRFLADLFNDTAFFLELANPF-------FAG 191
Query: 185 LGNF--------AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
+ + +AA A++ + + FA+ NL++L K + T ++GL AG
Sbjct: 192 WAKILILVSAESLRALCGIAAGASKAALSTHFARRNNLAELNTKEASQETAIGLIGLLAG 251
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
+ V + + L IH+ + A ++T+N QR +++ +++T +
Sbjct: 252 TAIIHYV-QNRGAVFFLTIFLVFIHLGMNYLGVCALQLDTVNRQRATIVMQHYLRTQEVL 310
Query: 297 SPADLRYREDLLFPGRLIEDAGN 319
+PA+ +E+++F I D N
Sbjct: 311 TPAEASKKENIIFWRPHIADEHN 333
>gi|320038116|gb|EFW20052.1| hypothetical protein CPSG_03227 [Coccidioides posadasii str.
Silveira]
Length = 478
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 57/220 (25%), Positives = 119/220 (54%), Gaps = 7/220 (3%)
Query: 91 ALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSN-LGAR 146
+LQ F+S+ +LS++++L G+ +P A ++S IL+D M + ++ ++ LG
Sbjct: 92 SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLS-ILQDSMGRIATILFAHQLGTS 150
Query: 147 MDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIY 205
++ E K +R+ AD D ++ LSP P+ + + + L + + + VAA +++ +
Sbjct: 151 LEPECKMYRLAADIFNDSSMIMDCLSPTFPKSIRVGLLSLSSVLRALCGVAAGSSKASLS 210
Query: 206 SSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSV 265
+ FAK GNL++L AK + T+ +++G+ G + S + S + + + LL ++H+ +
Sbjct: 211 AHFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLL-IVHITTN 269
Query: 266 IEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
+RA + +LN QR ++ + R+ +P +E
Sbjct: 270 YLAVRAVSMTSLNRQRANIVFSTLFDENRVLTPKQASKQE 309
>gi|303316636|ref|XP_003068320.1| hypothetical protein CPC735_003440 [Coccidioides posadasii C735
delta SOWgp]
gi|240108001|gb|EER26175.1| hypothetical protein CPC735_003440 [Coccidioides posadasii C735
delta SOWgp]
Length = 478
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 57/220 (25%), Positives = 119/220 (54%), Gaps = 7/220 (3%)
Query: 91 ALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSN-LGAR 146
+LQ F+S+ +LS++++L G+ +P A ++S IL+D M + ++ ++ LG
Sbjct: 92 SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLS-ILQDSMGRIATILFAHQLGTS 150
Query: 147 MDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIY 205
++ E K +R+ AD D ++ LSP P+ + + + L + + + VAA +++ +
Sbjct: 151 LEPECKMYRLAADIFNDSSMIMDCLSPTFPKSIRVGLLSLSSVLRALCGVAAGSSKASLS 210
Query: 206 SSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSV 265
+ FAK GNL++L AK + T+ +++G+ G + S + S + + + LL ++H+ +
Sbjct: 211 AHFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLL-IVHITTN 269
Query: 266 IEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
+RA + +LN QR ++ + R+ +P +E
Sbjct: 270 YLAVRAVSMTSLNRQRANIVFSTLFDENRVLTPKQASKQE 309
>gi|255574377|ref|XP_002528102.1| conserved hypothetical protein [Ricinus communis]
gi|223532491|gb|EEF34281.1| conserved hypothetical protein [Ricinus communis]
Length = 485
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 79/160 (49%), Gaps = 8/160 (5%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV------- 123
P+G+P SV++ Y +Y + + T L T SLL A G+
Sbjct: 103 LPAGFPGSVSDDYFQYMLLQFPTNVTGWICHTLVTSSLLKAVGVGSFTGSTAAAAASAAA 162
Query: 124 VSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+ W+ KDG+ +G+L I G+ D +PK+WR+ AD + G+ ++++ + P FL +
Sbjct: 163 IRWVSKDGIGALGRLFIGGRFGSLFDDDPKQWRMYADFIGSAGSIFDLITQVYPAYFLPL 222
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGE 222
A LGN K +A + I + FA GNL ++ AK E
Sbjct: 223 ASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEE 262
>gi|194382298|dbj|BAG58904.1| unnamed protein product [Homo sapiens]
Length = 194
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 80/164 (48%)
Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLP 203
G+++D K+WR+ AD L D+ LE+++P+ P F N AK I VA ATR
Sbjct: 12 GSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAA 71
Query: 204 IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVY 263
+ A+ N++D+ AK + TL N+ GL + + V + L+ +H+Y
Sbjct: 72 LTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIY 131
Query: 264 SVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+ +RA + TLN R +++ +++ G + P E L
Sbjct: 132 ANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 175
>gi|67972176|dbj|BAE02430.1| unnamed protein product [Macaca fascicularis]
Length = 326
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 80/164 (48%)
Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLP 203
G+++D K+WR+ AD L D+ LE+++P+ P F N AK I VA ATR
Sbjct: 12 GSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAA 71
Query: 204 IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVY 263
+ A+ N++D+ AK + TL N+ GL + + V + L+ +H+Y
Sbjct: 72 LTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIY 131
Query: 264 SVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+ +R+ + TLN R +++ +++ G + P E L
Sbjct: 132 ANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 175
>gi|154292218|ref|XP_001546685.1| hypothetical protein BC1G_14192 [Botryotinia fuckeliana B05.10]
Length = 449
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 68/250 (27%), Positives = 131/250 (52%), Gaps = 22/250 (8%)
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEP 151
F S +L+++++L G+ + A AT V+ IL++ + ++ ++ LG+ ++ E
Sbjct: 59 FASTIAGLLASRAVLQGLGVGDSTASATHAVLLSILQESAGRISTILFAHRLGSALEPEC 118
Query: 152 KRWRILADALYDLGTGLEVLSPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSF 208
K++R++AD D L+ +SP P++FL A + K + VAA +++ + + F
Sbjct: 119 KKYRLMADIFNDAAMILDCISPAFPKIPRVFLLSAS--SVCKSLCGVAAGSSKASLSAHF 176
Query: 209 AKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEE 268
AK GNL++L AK + TL +++G+ G + S + S + + + LLS IH+ +
Sbjct: 177 AKMGNLAELNAKDASQETLISLLGMLVGTFVVSKISSQVATWIALLALLS-IHLGTNYMA 235
Query: 269 MRAAPVNTLNPQRTAMIVADFVKT-----GRIS--SPADLRYREDLLFPGRLIEDAGNVK 321
+R+ + TLN QR ++++ + + +IS SP D+ +E R+ E G ++
Sbjct: 236 VRSVTMRTLNRQRANLVISSLLSSPEHEKNKISLPSPRDISLQE------RIFERDGAIR 289
Query: 322 MDRHGIRAEC 331
I A C
Sbjct: 290 NTHGTILAYC 299
>gi|387198702|gb|AFJ68865.1| hypothetical protein NGATSA_2057300, partial [Nannochloropsis
gaditana CCMP526]
Length = 277
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 5/134 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR---PTPAQATVVSW 126
F P G+P SV YLRY Q+ +Q ++ +L+TQ++L G+ TP AT+ W
Sbjct: 145 FLPVGFPSSVAPEYLRYQQWNVVQDLSTYLRGILATQAILEGVGVGREGATPLAATL-QW 203
Query: 127 ILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
I +DG + L+ S L K WR+ AD + D+G L++L+PL P+ FL L
Sbjct: 204 ITRDGASMMSGLVFTSFLSTNFGVNAKSWRLFADFMVDIGITLDMLAPLFPKNFLLCICL 263
Query: 186 GNFAKGIAVVAARA 199
+ K + +AA A
Sbjct: 264 ASVCKSLCGIAAGA 277
>gi|422292654|gb|EKU19956.1| hypothetical protein NGA_2057300, partial [Nannochloropsis gaditana
CCMP526]
Length = 274
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 5/134 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR---PTPAQATVVSW 126
F P G+P SV YLRY Q+ +Q ++ +L+TQ++L G+ TP AT+ W
Sbjct: 142 FLPVGFPSSVAPEYLRYQQWNVVQDLSTYLRGILATQAILEGVGVGREGATPLAATL-QW 200
Query: 127 ILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
I +DG + L+ S L K WR+ AD + D+G L++L+PL P+ FL L
Sbjct: 201 ITRDGASMMSGLVFTSFLSTNFGVNAKSWRLFADFMVDIGITLDMLAPLFPKNFLLCICL 260
Query: 186 GNFAKGIAVVAARA 199
+ K + +AA A
Sbjct: 261 ASVCKSLCGIAAGA 274
>gi|392871321|gb|EAS33216.2| hypothetical protein CIMG_04046 [Coccidioides immitis RS]
Length = 407
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 116/210 (55%), Gaps = 7/210 (3%)
Query: 91 ALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSN-LGAR 146
+LQ F+S+ +LS++++L G+ +P A ++S IL+D M + ++ ++ LG
Sbjct: 21 SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLS-ILQDSMGRIATILFAHQLGTS 79
Query: 147 MDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIY 205
++ E K +R+ AD D ++ LSP+ P+ + + + L + + + VAA +++ +
Sbjct: 80 LEPECKMYRLAADIFNDSSMIMDCLSPMFPKSIRVGLLSLSSVLRALCGVAAGSSKASLS 139
Query: 206 SSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSV 265
+ FAK GNL++L AK + T+ +++G+ G + S + S + + + LL ++H+ +
Sbjct: 140 AHFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLL-IVHITTN 198
Query: 266 IEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+RA + +LN QR ++ + R+
Sbjct: 199 YLAVRAVSMTSLNRQRANIVFSTLFDENRV 228
>gi|336467636|gb|EGO55800.1| hypothetical protein NEUTE1DRAFT_103211 [Neurospora tetrasperma
FGSC 2508]
gi|350287710|gb|EGZ68946.1| hypothetical protein NEUTE2DRAFT_72230 [Neurospora tetrasperma FGSC
2509]
Length = 361
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 72/254 (28%), Positives = 127/254 (50%), Gaps = 23/254 (9%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQA 121
+ L+ F P+GYP SV YL Y + +LQ F S+ S+L+ +++L G+ +P A
Sbjct: 38 ALLHAFLPAGYPHSVTPDYLPYQTYDSLQAFFSSISSLLANRAVLEGLGVGDASSSPTGA 97
Query: 122 TVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF- 179
++ I D + V ++ ++ LG ++ E K +R LAD L D L++L P + +
Sbjct: 98 LILK-IASDTISRVATILFAHRLGTAIEPECKFFRFLADCLNDAAQLLDLLVPAMMERYG 156
Query: 180 LEMA---GLGNFAKGIAV-----VAARATRLPIYSSFAKEG--NLSDLFAKGEAISTLFN 229
L++ G G V A+ + + +S K G NL++L AK + T+ +
Sbjct: 157 LKLGVIVGAGVLRSWCGVAASASKASLSAHFIVVASQQKGGGNNLAELNAKEASQETVVS 216
Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPL---LSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
++G+ G S V +Q +++V L L +H+ +R TLN QR M+V
Sbjct: 217 LLGMLVG----SLVVQVVQDRLVVWVLMVGLLGVHLLMNYWGVRCVRCRTLNRQRATMVV 272
Query: 287 ADFVKTGRISSPAD 300
++++TGR+ P +
Sbjct: 273 REWLETGRVLGPEE 286
>gi|296411034|ref|XP_002835240.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295628015|emb|CAZ79361.1| unnamed protein product [Tuber melanosporum]
Length = 413
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 68/244 (27%), Positives = 124/244 (50%), Gaps = 17/244 (6%)
Query: 91 ALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARM 147
+LQ F+S+ +LS++++L G+ A AT ++ I +D + + ++ ++ G +
Sbjct: 6 SLQAFSSSIAGLLSSRAVLEGFGVGDANASATGALLLTIAQDCVSRLATILFAHRFGPAL 65
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQ----LFLEMAGLGNFAKGIAVVAARATRLP 203
++E KR+R+ AD D L+ LSP P+ L L +G A G+A ++A+ L
Sbjct: 66 EAETKRYRLAADVFNDSAMILDCLSPAFPRSVRILMLCCSGSLRAACGVAGGGSKAS-LS 124
Query: 204 IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVY 263
++ FA+ GN+ +L AK + T+ ++G+ AG + S + SS I LL +H+
Sbjct: 125 VH--FARSGNVGELNAKDSSQETVIGLMGMLAGSFVVSHI-SSTWATWISLTLLQAVHLG 181
Query: 264 SVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKMD 323
+ + + A + TLN QR ++ + F TG I +P E R+ E G ++
Sbjct: 182 TNFKAVTAVALRTLNRQRANIVCSAFFDTGEILTPRQAARHE------RIFEKDGVLRWG 235
Query: 324 RHGI 327
G+
Sbjct: 236 GSGV 239
>gi|116004169|ref|NP_001070441.1| UPF0420 protein C16orf58 homolog [Bos taurus]
gi|61554841|gb|AAX46623.1| hypothetical protein FLJ13868 [Bos taurus]
Length = 232
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 91/184 (49%), Gaps = 13/184 (7%)
Query: 12 KKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVV--DDSRPVYHRVVDSFLNK 69
+ PD S+ + W+ S + T +P G V + P + R++ FL
Sbjct: 19 RAAPDGSLQWEAWGRRWWGFSGAFTA----KPGGRDGGGGVAPGTASPPFSRLLAVFL-- 72
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A +W+
Sbjct: 73 --PQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWL 130
Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + +G++MD K+WR+ AD L D+ LE+++P+ P F +
Sbjct: 131 VKDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTITVCIS 190
Query: 187 NFAK 190
N AK
Sbjct: 191 NLAK 194
>gi|429856893|gb|ELA31783.1| duf647 domain-containing protein [Colletotrichum gloeosporioides
Nara gc5]
Length = 500
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 67/273 (24%), Positives = 135/273 (49%), Gaps = 18/273 (6%)
Query: 48 SVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQS 107
SV++ + + +V+D FL P GYP SV + Y Y + + Q F ++S+++
Sbjct: 63 SVQLPTIRKKPWRKVLDIFL----PDGYPHSVTDDYAAYQIYDSFQAFAGTIAGMISSRA 118
Query: 108 LLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYD 163
+ G+ +P A ++ ++++ M +G + ++L G +++E K +R+ +D L
Sbjct: 119 VWEGMGVGDSMASPIGAMMIQ-VIRESMGRLGTIAFAHLMGTSIEAECKAYRMASDVLCS 177
Query: 164 LGTGLEVLSPLCPQ----LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFA 219
+ L+ LSPL P+ L L + + A G VA A++ + FAK NL +L A
Sbjct: 178 IAFILDCLSPLFPRSIRFLILCCSSILFAASG---VAGNASKSSLSGHFAKWNNLGELNA 234
Query: 220 KGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNP 279
K + T +++G+ G + S + + + L++ I+++ + +RA +LN
Sbjct: 235 KDASQETAISLMGMLTGTLVVSCLTTPTTTWTTLFILIA-IYLFLNWKGVRAVKSRSLNR 293
Query: 280 QRTAMIVADFVKTGRISSPADLRYREDLLFPGR 312
QR + + + ++ +P + +E L+F R
Sbjct: 294 QRANIAFSALLSKDQVLTPGKISEKE-LIFERR 325
>gi|296473307|tpg|DAA15422.1| TPA: hypothetical protein LOC767862 [Bos taurus]
Length = 232
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 91/184 (49%), Gaps = 13/184 (7%)
Query: 12 KKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVV--DDSRPVYHRVVDSFLNK 69
+ PD S+ + W+ S + T +P G V + P + R++ FL
Sbjct: 19 RAAPDGSLQREAWGRRWWGFSGAFTA----KPGGRDGGGGVAPGTASPPFSRLLAVFL-- 72
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A +W+
Sbjct: 73 --PQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWL 130
Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + +G++MD K+WR+ AD L D+ LE+++P+ P F +
Sbjct: 131 VKDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTITVCIS 190
Query: 187 NFAK 190
N AK
Sbjct: 191 NLAK 194
>gi|296826902|ref|XP_002851048.1| DUF647 domain-containing protein [Arthroderma otae CBS 113480]
gi|238838602|gb|EEQ28264.1| DUF647 domain-containing protein [Arthroderma otae CBS 113480]
Length = 479
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 75/306 (24%), Positives = 139/306 (45%), Gaps = 58/306 (18%)
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQF---------------------RALQH 94
+P + S LN F PSGYP SV + YL Y + +LQ
Sbjct: 47 QPSSTSYLTSILNVFLPSGYPHSVTDDYLEYQIYVSQQPFPSSDRKEKKLTVSYQDSLQA 106
Query: 95 FTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEP 151
F+S+ +LS++++L G+ +P A +++ +L++ M + ++ +
Sbjct: 107 FSSSIAGLLSSRAVLQGVGVGDATASPTAALLLN-VLQESMGRIATILFA---------- 155
Query: 152 KRWRILADALYDLGTGLE---VLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
+ LGT LE LSP+ P+ + + L + + + VAA +++ + +
Sbjct: 156 ----------HRLGTSLEPEYCLSPVFPKPVRVGFLSLSSVLRALCGVAAGSSKASLSAH 205
Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIE 267
FA+ GNL +L AK + T+ ++VG+ G + + V S + + + LL+ IH+ +
Sbjct: 206 FARWGNLGELNAKDSSQETVISLVGMLVGSVVVAYVSSPLATWVALLFLLA-IHLATNHA 264
Query: 268 EMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNV--KMDRH 325
+RA + TLN QR ++ + R+ +PA + E R+ E G + K +
Sbjct: 265 AVRAVNMTTLNRQRANIVFSSLFDDDRVLTPAQVSKEE------RIFERDGVLRWKAEAT 318
Query: 326 GIRAEC 331
G+ C
Sbjct: 319 GVLGTC 324
>gi|310789554|gb|EFQ25087.1| hypothetical protein GLRG_00231 [Glomerella graminicola M1.001]
Length = 484
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 68/286 (23%), Positives = 131/286 (45%), Gaps = 25/286 (8%)
Query: 40 QFQPDGHLSVKVVDDSRPVYHRVVD------------SFLNKFFPSGYPFSVNEGYLRYT 87
++ G L K V+ + H VD L F P GYP SV++ Y Y
Sbjct: 27 EYDKAGSLKAKYVESIPIIGHPRVDVLLPVRRKKIWLQLLEIFLPDGYPHSVSDDYAAYQ 86
Query: 88 QFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSNL- 143
+ ++Q F + ++S++++ G+ +P A ++ ++++ + ++L
Sbjct: 87 IYDSIQAFAGSIAGMISSRAVWEGLGVGDSLASPTGAMLIQ-VIRESTGRFATITFAHLF 145
Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ----LFLEMAGLGNFAKGIAVVAARA 199
G +++E K +R+ +D L D+ L+ LSP P+ L L + L A G VA A
Sbjct: 146 GTSIEAECKAYRLASDILCDVAMVLDCLSPFFPRSIRFLVLCFSSLLYSASG---VAGNA 202
Query: 200 TRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSV 259
++ + FAK NL +L AK + T +++G+ G + S + + + LL+
Sbjct: 203 SKSSLSGHFAKWNNLGELNAKDASQETAISLMGMITGTFMVSLLTGPKATWVALISLLA- 261
Query: 260 IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
+H++ + +RA +LN QR + + ++ +P + RE
Sbjct: 262 LHLFLNWKGVRAVKSRSLNRQRANIAFSALFSKDQVLTPVIVSERE 307
>gi|299740308|ref|XP_001838896.2| aldo-keto reductase [Coprinopsis cinerea okayama7#130]
gi|298404184|gb|EAU82827.2| aldo-keto reductase [Coprinopsis cinerea okayama7#130]
Length = 768
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 108/224 (48%), Gaps = 13/224 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F P+GYP SV YL Y ALQ F S+ S++S+++ L F G A ++ I
Sbjct: 331 FLPAGYPNSVTPDYLTYQVLNALQAFCSSLASLISSRATLQGFGVGSASASANDALLLTI 390
Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPL-----CPQLFLE 181
L D + ++ ++ LG+ + +E K++R +AD L D+ L+ LSP+ PQL +
Sbjct: 391 LLDIYSRLTTIVSAHLLGSSLATEAKKYRFVADILNDVAVVLDTLSPVFKAQGYPQLRVV 450
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAK----EGNLSDLFAKGEAISTLFNVVGLGAGI 237
L + + +AA ++ I FA +G+L DL AK + T+ + G+ G
Sbjct: 451 ALCLSASCRALCGIAAGGSKAAISVHFATPVKGKGDLGDLNAKDSSKETVLALFGMLLGS 510
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
+ + ++ + LLS +H+ + + T+N QR
Sbjct: 511 LIVPRLQTAWSTYTTLFILLS-LHIAINYVAVTGLVLRTMNWQR 553
>gi|345563243|gb|EGX46246.1| hypothetical protein AOL_s00110g70 [Arthrobotrys oligospora ATCC
24927]
Length = 445
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 64/259 (24%), Positives = 120/259 (46%), Gaps = 41/259 (15%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
+ + +N F P+GYP SV + YL A G + A T
Sbjct: 54 IQNIINIFLPAGYPKSVTDDYLDGD----------------------LAVGFEDSNASTT 91
Query: 123 --VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-- 177
V+ I++D + V ++ ++ LG+ ++ E KR+R AD D ++ LSP+ P+
Sbjct: 92 AAVLLKIVQDSIGRVATILFAHRLGSSLEPECKRFRFAADIFNDSALIVDCLSPVFPKPL 151
Query: 178 --LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL-- 233
+ L AG + + VA ++ + F+K GN+++L AK + T+ +++G+
Sbjct: 152 RVVLLSTAGT---LRALCGVAGGGSKAQLSVHFSKWGNVAELNAKDSSQETVISLMGMLM 208
Query: 234 ----GAGIQLASTVCSSMQGKMIVGP---LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
++ S V SS+ K L +H+++ ++A + TLN QR ++
Sbjct: 209 RADQTLRVKAGSLVISSVTSKTATWASMLFLIAVHLWTNYRAVKAVTMVTLNRQRANSVM 268
Query: 287 ADFVKTGRISSPADLRYRE 305
+ F +TG + +P D+ +E
Sbjct: 269 STFAETGVMLTPKDVARQE 287
>gi|347842141|emb|CCD56713.1| hypothetical protein [Botryotinia fuckeliana]
Length = 376
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/232 (27%), Positives = 121/232 (52%), Gaps = 22/232 (9%)
Query: 113 GLRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLE 169
G+ + A AT V+ IL++ + ++ ++ LG+ ++ E K++R++AD D L+
Sbjct: 4 GVGDSTASATHAVLLSILQESAGRISTILFAHRLGSALEPECKKYRLMADIFNDAAMILD 63
Query: 170 VLSPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIST 226
+SP P++FL A + K + VAA +++ + + FAK GNL++L AK + T
Sbjct: 64 CISPAFPKIPRVFLLSAS--SVCKSLCGVAAGSSKASLSAHFAKMGNLAELNAKDASQET 121
Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
L +++G+ G + S + S + + + LLS IH+ + +R+ + TLN QR +++
Sbjct: 122 LISLLGMLVGTFVVSKISSQVATWIALLALLS-IHLGTNYMAVRSVTMRTLNRQRANLVI 180
Query: 287 ADFVKT-----GRIS--SPADLRYREDLLFPGRLIEDAGNVKMDRHGIRAEC 331
+ + + +IS SP D+ +E R+ E G ++ I A C
Sbjct: 181 SSLLSSPEHEKNKISLPSPRDISLQE------RIFERDGAIRNTHGTILAYC 226
>gi|396461797|ref|XP_003835510.1| hypothetical protein LEMA_P048510.1 [Leptosphaeria maculans JN3]
gi|312212061|emb|CBX92145.1| hypothetical protein LEMA_P048510.1 [Leptosphaeria maculans JN3]
Length = 543
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 102/191 (53%), Gaps = 4/191 (2%)
Query: 117 TPAQATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
+P A ++S +L+D M ++ ++ LG + E K +R AD L D G LE LSP
Sbjct: 187 SPTAALLLS-VLQDSMGRTATILFADRLGTAFEPECKMYRFTADILNDTGMVLECLSPAF 245
Query: 176 PQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
P+ + + + + + + V A + + + + FA++GNL ++ AK + T+ +++G+
Sbjct: 246 PKPIRVCVLSCSSVLRSLCGVCAGSAKASLSAHFARKGNLGEVNAKDSSQETVISLLGML 305
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
AG + S + S M + LL+ IH+ + +RA + LN QR ++ + ++ G+
Sbjct: 306 AGSVVVSWISSPMATWSTLIGLLA-IHLATNYAAVRAVSMRCLNRQRANILFSRLLEHGQ 364
Query: 295 ISSPADLRYRE 305
+ SP ++ RE
Sbjct: 365 LLSPLEVSRRE 375
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 38/56 (67%), Gaps = 4/56 (7%)
Query: 54 DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL 109
D++ RV+D FL P+GYP SV E Y++Y + +LQ F+S+ S+L+++++L
Sbjct: 45 DAKGYAQRVLDVFL----PAGYPHSVTEDYIQYQIYDSLQAFSSSIASMLASRAVL 96
>gi|398015008|ref|XP_003860694.1| hypothetical protein, conserved [Leishmania donovani]
gi|322498916|emb|CBZ33989.1| hypothetical protein, conserved [Leishmania donovani]
Length = 675
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 107/234 (45%), Gaps = 7/234 (2%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
+R RVV + P GYP SV E L + + Q + LST+++L G+
Sbjct: 57 ARQARRRVVTALERILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGV 116
Query: 115 RPTPAQAT--VVSWILKDGMQHVGKLICSNLGAR-MDSEPKRWRILADALYDLGTGLEVL 171
+ A T +SW+++DG + +G ++ ++L + ++ K W+++AD D+ LE+
Sbjct: 117 GESKADLTSSTLSWMMQDGTRMIGSIVFASLIPQGLECRAKTWQLVADFTTDIANLLELC 176
Query: 172 SPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
+P C +F + + K + V TR FA N +D+ AK S++
Sbjct: 177 APWLPGCQTIFRVVLVAASAIKALGGVCDSGTRASFTQHFALRNNAADIAAKAATRSSVG 236
Query: 229 NVVGLGAGIQLASTV-CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
+ +GL G +A +V +S + L S +++ +R + LN R
Sbjct: 237 SFIGLALGTAVAYSVPATSRSLNLAAFALCSAFRIFAHYRGVRGVQLRHLNAPR 290
>gi|406860777|gb|EKD13834.1| DUF647 domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 501
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 70/255 (27%), Positives = 121/255 (47%), Gaps = 40/255 (15%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---------- 114
S L+ F P+GYP SV E YL Y Q + AA + TQS L + GL
Sbjct: 52 SVLDAFLPAGYPNSVTEDYLEY------QIYVRAA-NKPWTQSELISEGLSTSVLKLHSW 104
Query: 115 -------------RPTPAQATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADA 160
+P A ++S +L+ M + ++ ++ LG ++ E K +R+ AD
Sbjct: 105 DAVLESRIGVGDSHASPTAALLLS-VLQQSMGRIATIVFAHRLGTALEPECKMYRLAADI 163
Query: 161 LYDLGTGLEVLSPLCPQLF-LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFA 219
D L+ LSP P+ + + L + + + V A +++ + + FA +GNL +L A
Sbjct: 164 FNDAAMILDCLSPALPKASRVALLSLSSVLRSMCGVTAGSSKASLSAHFATQGNLGELNA 223
Query: 220 KGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVG---PLLSVIHVYSVIEEMRAAPVNT 276
K + T+ +++G+ AG S V S + K I LL H+ + +RA + T
Sbjct: 224 KDSSQETVISLLGMLAG----SVVVSYITSKTITWIALILLLATHLGTNYLAVRAVCMQT 279
Query: 277 LNPQRTAMIVADFVK 291
LN QR +++++ ++
Sbjct: 280 LNRQRANLVLSNILQ 294
>gi|224010239|ref|XP_002294077.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220970094|gb|EED88432.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 354
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 91/180 (50%), Gaps = 21/180 (11%)
Query: 75 YPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGM 132
YP +V GYL Y + +LQ +S V+ST ++L AAG+ A + ++W ++DG+
Sbjct: 19 YPHTVAPGYLEYQFYDSLQGLSSYLRGVVSTTAVLSAAGVGDAAATAMSAAMTWAIRDGL 78
Query: 133 QHVGKLICSNLG-ARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
VG L+ S + + D+ K +R++AD L D+G L++ P
Sbjct: 79 GMVGGLLFSYVASSHFDAHVKEFRLVADVLNDVGLTLDMALP------------------ 120
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
+ + RAT+ I F+ GN +D+ AK TL +++G+ G+ L++ S G++
Sbjct: 121 VVLSWTRATKGNITDHFSISGNRADVSAKESTQETLVSLIGMALGVWLSNLDSHSRVGEL 180
>gi|339898153|ref|XP_001465441.2| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|321399425|emb|CAM67862.2| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 675
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 107/234 (45%), Gaps = 7/234 (2%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
+R RVV + P GYP SV E L + + Q + LST+++L G+
Sbjct: 57 ARQARRRVVTALERILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGV 116
Query: 115 RPTPAQAT--VVSWILKDGMQHVGKLICSNLGAR-MDSEPKRWRILADALYDLGTGLEVL 171
+ A T +SW+++DG + +G ++ ++L + ++ K W+++AD D+ LE+
Sbjct: 117 GESKADLTSSTLSWMMQDGTRMIGSIVFASLIPQGLECRAKTWQLVADFTTDIANLLELC 176
Query: 172 SPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
+P C +F + + K + V TR FA N +D+ AK S++
Sbjct: 177 APWLPGCQTIFRVVLVAASAIKALGGVCDSGTRASFAQHFALRNNAADIAAKAATRSSVG 236
Query: 229 NVVGLGAGIQLASTV-CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
+ +GL G +A +V +S + L S +++ +R + LN R
Sbjct: 237 SFIGLALGTAVAYSVPATSRSLNLAAFALCSAFRIFAHYRGVRGVQLRHLNAPR 290
>gi|149067629|gb|EDM17181.1| similar to cDNA sequence BC017158, isoform CRA_a [Rattus
norvegicus]
Length = 246
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 67/124 (54%), Gaps = 3/124 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL+Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D LE+++P+ P F
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTVSTS 194
Query: 187 NFAK 190
N AK
Sbjct: 195 NLAK 198
>gi|189196610|ref|XP_001934643.1| hypothetical protein PTRG_04310 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187980522|gb|EDU47148.1| hypothetical protein PTRG_04310 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 333
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 122/253 (48%), Gaps = 15/253 (5%)
Query: 85 RYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICS 141
R +LQ F+S+ +L+++++L G+ TP A ++S I Q L
Sbjct: 36 RVDVIDSLQAFSSSIAGMLASRAILEGVGVGDSTATPTTALLLSVIQTSVGQIATILFAH 95
Query: 142 NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARAT 200
LG ++ E K +R+ AD D L+ LSP P+ + + + + + + V A ++
Sbjct: 96 RLGTALEPECKMYRLAADVFNDTAMVLDCLSPAFPKSIRVAVLSFSSCLRALCGVCAGSS 155
Query: 201 RLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ--GKMIVGPLLS 258
+ + + FA +GNL ++ AK + T+ +++G+ AG + S + S + G +I+ +
Sbjct: 156 KASLSAHFAHKGNLGEVNAKDSSQETVISLLGMLAGSLVVSQITSPLATWGTLILLLTIH 215
Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAG 318
+ Y+ + RA ++ LN QR ++ ++ + G + SP D+ RE R+ E G
Sbjct: 216 LATNYAAV---RAVNMHCLNRQRANILFSNMFQHGLVLSPRDVSQRE------RVFERGG 266
Query: 319 NVKMDRHGIRAEC 331
++ + C
Sbjct: 267 VLRWSDDKVLGRC 279
>gi|346467075|gb|AEO33382.1| hypothetical protein [Amblyomma maculatum]
Length = 388
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 94/199 (47%), Gaps = 3/199 (1%)
Query: 103 LSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILAD 159
L+ QS+L G+ A A +WIL+DG G+++ + G+ +D + K+WR+ AD
Sbjct: 4 LAIQSVLTGVGVGDQGASVLAATTTWILRDGAGMTGRILFAWFQGSNLDYDSKKWRLFAD 63
Query: 160 ALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFA 219
L D LE++ + + + AK + VA ATR + A+ N++D+ A
Sbjct: 64 ILNDAAIFLELMCQYFKGYVTPILCISSVAKSVVGVAGGATRAALTQHQARSNNMADVSA 123
Query: 220 KGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNP 279
K + TL N+ + L V + + L +++H+++ + + T N
Sbjct: 124 KDGSQETLVNLAAFLFSLLLLQLVAGNTWLVSTIFILFTLLHIFANYRAVSCVVMETFNR 183
Query: 280 QRTAMIVADFVKTGRISSP 298
R +++V F++ G +P
Sbjct: 184 SRYSVVVRHFLQKGGGIAP 202
>gi|148685696|gb|EDL17643.1| cDNA sequence BC017158, isoform CRA_a [Mus musculus]
Length = 226
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 67/124 (54%), Gaps = 3/124 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194
Query: 187 NFAK 190
N AK
Sbjct: 195 NLAK 198
>gi|391863517|gb|EIT72825.1| hypothetical protein Ao3042_00909 [Aspergillus oryzae 3.042]
Length = 236
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 87/158 (55%), Gaps = 11/158 (6%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P+GYP SV++ Y+ Y F +LQ F+S+ +LS++++L G+ A A ++ I
Sbjct: 75 FLPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHI 134
Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF-----LE 181
L+D + ++ ++ +G ++ E K +R+ AD D L+ LSP+ P F L
Sbjct: 135 LQDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLS 194
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFA 219
AG+ + + VA +++ + + F++ GNL+++ A
Sbjct: 195 TAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNA 229
>gi|403418386|emb|CCM05086.1| predicted protein [Fibroporia radiculosa]
Length = 497
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 61/242 (25%), Positives = 117/242 (48%), Gaps = 25/242 (10%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVS 125
N F P+GYP SV+ YL+Y + ALQ F S+ S+++++++L G+ A AT +
Sbjct: 49 NVFLPAGYPASVSPDYLQYQIYNALQAFCSSLASLIASRAVLEGHGVGNADASATHAIFL 108
Query: 126 WILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE--- 181
+L+D + ++ LG + E K +R+LAD D L+ LSP + L
Sbjct: 109 TVLQDIFSRLTTIVSGYYLGTSLFPEAKTYRLLADIFNDAAIVLDTLSPHLDGVVLSLRY 168
Query: 182 ---MAGLGN-----------FAKGIAVVAARATRLPIYSSFAK----EGNLSDLFAKGEA 223
++G G+ ++ + + A ++ + FA G++ DL AK +
Sbjct: 169 PFVLSGSGSSLRVSALCLSGASRALCGMVAGGSKAALTVHFASPGQGTGDVGDLNAKDGS 228
Query: 224 ISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
T+ ++G+ +G + V S+ + +V +L+ +H++ +R + +LN QR
Sbjct: 229 KETVLALLGMLSGTAVMHYVHSA-RATYLVLFVLTFLHLFGNYLAVRVVVLRSLNRQRAN 287
Query: 284 MI 285
++
Sbjct: 288 IV 289
>gi|407259493|gb|AFT91377.1| hypothetical protein [Emericella rugulosa]
Length = 549
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 60/241 (24%), Positives = 114/241 (47%), Gaps = 23/241 (9%)
Query: 64 DSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV 123
D ++ F P+GYP SV++ Y + AL+ + + + +P A +
Sbjct: 67 DVLIDVFLPAGYPHSVSDDY-------------TPALAQPAIHAGVGVGNANASPTSALL 113
Query: 124 VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+ IL+D + ++ ++ +G ++ E K +R AD DL L+ LSP+ P F +
Sbjct: 114 LH-ILQDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCLSPMIPAGFNRV 172
Query: 183 AGL--GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
L + + VA +++ + + FA+ GNL+D+ AK + T+ +++G+ G +
Sbjct: 173 TVLSAAGVLRALCGVAGGSSKASLSAHFARWGNLADVNAKDSSQETVISLIGMLVGSFVV 232
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
S + +S + L +H+ +R+ + TLN QR A+ V + +SS D
Sbjct: 233 SHI-TSFTATWLTLVFLVSMHLSLNYAAVRSVQMTTLNRQR-----ANIVFSTLLSSDPD 286
Query: 301 L 301
L
Sbjct: 287 L 287
>gi|388520739|gb|AFK48431.1| unknown [Medicago truncatula]
Length = 236
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 55/98 (56%), Gaps = 4/98 (4%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR----PTPAQATVVSWI 127
P G+P SV Y+ Y +RAL+HF A+ V +TQ+LL + G+ A A ++WI
Sbjct: 118 PEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLTSVGVSRNRATPAAAAVAINWI 177
Query: 128 LKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLG 165
L DG VGK++ + G + D + K+ R D L +LG
Sbjct: 178 LNDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELG 215
>gi|452981822|gb|EME81581.1| hypothetical protein MYCFIDRAFT_100197, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 272
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 63/239 (26%), Positives = 113/239 (47%), Gaps = 6/239 (2%)
Query: 85 RYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICS 141
R ++Q F S+ +LS++++L + G+ +P A ++S + + +
Sbjct: 28 RVDAIDSIQAFVSSIAGLLSSRAVLLSVGVGDSSASPTSALLLSMLQQSLSRIAKVAFAY 87
Query: 142 NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL-EMAGLGNFAKGIAVVAARAT 200
LG ++ E K +R AD L+D L LSP+ P+ F + L + + V A +T
Sbjct: 88 RLGKAIEPECKMYRFCADLLHDSSFVLNCLSPMLPKAFRPPILALSSICFAVCDVVASST 147
Query: 201 RLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVI 260
+ + + FAK NL +L AK + T+ +++G+ G L + + + +LSV
Sbjct: 148 KASLSAHFAKWENLGELNAKDSSQETIISLLGMLTGSLLVNWISGHWATWTALILMLSV- 206
Query: 261 HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGN 319
H+ + +RA + TLN QR A +V + G I PA++ E + + DA N
Sbjct: 207 HLEANRRAVRAIVMQTLNRQR-ATVVYHRLCVGHIPRPAEVSRVERIFERNGAVRDAEN 264
>gi|326479045|gb|EGE03055.1| DUF647 domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 494
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/272 (23%), Positives = 114/272 (41%), Gaps = 64/272 (23%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
+ S LN F P+GYP SV + Y+ Y + ++ + +LFA
Sbjct: 123 LTSILNVFLPAGYPHSVTDDYIEYQIYESMGRIAT----------ILFA----------- 161
Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLE 181
LG ++ E K +R+ AD L D L+ LSP+ P+ + +
Sbjct: 162 ------------------HRLGTSLEPECKMYRLAADILNDSAMVLDCLSPIFPKPVRVG 203
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+ L + + + VAA +++ + + FA+ GNL +L A G + S
Sbjct: 204 LLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAVGSVV---------------VS 248
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
+ S ++ + + LL ++H+ + +RA + TLN QR ++ + + I +PA+
Sbjct: 249 YITSPLETWIALIVLL-IVHLGTNHAAVRAVKMTTLNRQRANIVFSHLFEDNLILTPAET 307
Query: 302 RYREDLLFPGRLIEDAGNV--KMDRHGIRAEC 331
E R+ E G + K + GI C
Sbjct: 308 SKEE------RIFERDGVLRWKAESSGILGTC 333
>gi|240280131|gb|EER43635.1| DUF647 domain-containing protein [Ajellomyces capsulatus H143]
Length = 521
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/308 (25%), Positives = 127/308 (41%), Gaps = 48/308 (15%)
Query: 29 FETSDSV-TRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYT 87
F T DS RH + + D +V SR + S LN F P GYP SV++ YL
Sbjct: 34 FRTPDSSGVRHERRRVD------IVQPSRKDSSSYLASLLNVFLPVGYPHSVSDDYLD-- 85
Query: 88 QFRALQHFTSAALSV-LSTQSLLFAAGLRPTPAQATV--VSWILKDGMQHVGKLICSN-L 143
F S L T + G+ A TV + +L+D M + ++ ++ L
Sbjct: 86 ----CLGFASGVLKFNCRTAGVPGWVGVGDASASPTVALLHSVLQDSMGRIATILFAHRL 141
Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRL 202
G ++ E K +R+ AD L D + LSP P+ L + + + + + VAA +++
Sbjct: 142 GTSLEPECKMYRLAADVLNDSALIFDCLSPALPKRLRVVVLAFSSVLRALCGVAAGSSKA 201
Query: 203 PIYSSFAKEGNLSDL----------------------FAKGEAISTLFNVVGLGAGIQLA 240
+ S FAK GNL +L F +A L I L
Sbjct: 202 SLSSHFAKCGNLGELNAVRFPNVLVIAISGAAQRGFPFLFSDADQKLQKDSSQETVISLL 261
Query: 241 STVCSSMQGKMIVGPLLS--------VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
+C S+ I P + ++H+ + +R+ + TLN QRT ++ + +
Sbjct: 262 GILCGSVVVSHISTPFATWTTLLLLLLVHLSTNYAAVRSVNMTTLNRQRTNIVFSTLFEK 321
Query: 293 GRISSPAD 300
G + +P +
Sbjct: 322 GSVLTPTE 329
>gi|409046959|gb|EKM56438.1| hypothetical protein PHACADRAFT_253558 [Phanerochaete carnosa
HHB-10118-sp]
Length = 464
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 71/259 (27%), Positives = 116/259 (44%), Gaps = 40/259 (15%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWI 127
F P+GYP SV+ Y RY ALQ F S+ +++ +++L G+ A AT ++ I
Sbjct: 2 FLPAGYPASVSGDYTRYQVLNALQGFFSSLGGLVAARAVLEGHGVGSADASATNALLLTI 61
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L+D V ++ LG + E K +R LAD D+ L+ LSPL + L+ L
Sbjct: 62 LQDIFSRVTAILSGYYLGTALFPEAKTYRFLADIWNDIPLVLDTLSPLLSHVSLDFVWLA 121
Query: 187 -------------NFAKGIAVVAARATR-------------LPIYSS----FAKEGNLSD 216
+GIA+ + R L ++ S A G++SD
Sbjct: 122 FPPFLLPTLVHRPGPWRGIALCFSGIFRSMCAVAAAGSKAALTLHFSQPDHGAASGDISD 181
Query: 217 LFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVI---HVYSVIEEMRAAP 273
L AK + T+ +VG+ G S + +Q + LL+VI H+Y +R+
Sbjct: 182 LSAKDGSKETVLALVGMLCG----SIIMPLIQTQTQTCGLLAVIIVFHIYINYYAVRSVV 237
Query: 274 VNTLNPQRTAMIVADFVKT 292
+ N QRT ++ + F ++
Sbjct: 238 FKSYNRQRTCILWSAFRRS 256
>gi|426194386|gb|EKV44317.1| hypothetical protein AGABI2DRAFT_120451 [Agaricus bisporus var.
bisporus H97]
Length = 462
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 65/242 (26%), Positives = 111/242 (45%), Gaps = 18/242 (7%)
Query: 60 HRVVDSFLNK-FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
H+ L K F P+GYP +V YL Y A Q F ++ S+L++++ L G+
Sbjct: 44 HQDTKELLTKIFLPAGYPNTVTPDYLLYQILNAAQAFCNSLASLLASRAALEGIGVGDPS 103
Query: 119 AQAT--VVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A +T ++ +L+D + +I + LG R+ E K++R LAD L D+ ++V +P+
Sbjct: 104 ATSTGAMLISVLQDIFGRIATIIGAYYLGTRLVPEAKKYRFLADVLNDIAVVVDVFNPVF 163
Query: 176 PQLFLEMA---GLGNFA--KGIAVVAARATRLPIYSSFAKE----GNLSDLFAKGEAIST 226
A GL A + + V A + I FA G++ DL AK + T
Sbjct: 164 SAFMFPGARVIGLCTSAALRAVCGVLAGGAKAAITLHFATPRGGVGDVGDLNAKDSSKET 223
Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGP-LLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
+ ++G+ L S + + + P LL +I V+ VI + + +P R
Sbjct: 224 VLALLGM----LLGSVLIPHITSPWVTYPVLLLLISVHLVINYIAVRGLAIRSPNRHRAT 279
Query: 286 VA 287
+A
Sbjct: 280 IA 281
>gi|409076043|gb|EKM76417.1| hypothetical protein AGABI1DRAFT_131245 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 466
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 65/242 (26%), Positives = 111/242 (45%), Gaps = 18/242 (7%)
Query: 60 HRVVDSFLNK-FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
H+ L K F P+GYP +V YL Y A Q F ++ S+L++++ L G+
Sbjct: 44 HQDTKELLTKIFLPAGYPNTVTPDYLLYQILNAAQAFCNSLASLLASRAALEGIGVGDPS 103
Query: 119 AQAT--VVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A +T ++ +L+D + +I + LG R+ E K++R LAD L D+ ++V +P+
Sbjct: 104 ATSTGAMLISVLQDIFGRIATIIGAYYLGTRLVPEAKKYRFLADVLNDIAVVVDVFNPVF 163
Query: 176 PQLFLEMA---GLGNFA--KGIAVVAARATRLPIYSSFAKE----GNLSDLFAKGEAIST 226
A GL A + + V A + I FA G++ DL AK + T
Sbjct: 164 SAFMFPGARVIGLCTSAALRAVCGVLAGGAKAAITLHFATPRGGVGDVGDLNAKDSSKET 223
Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGP-LLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
+ ++G+ L S + + + P LL +I V+ VI + + +P R
Sbjct: 224 VLALLGM----LLGSVMIPHITSPWVTYPVLLLLISVHLVINYIAVRGLAIRSPNRHRAT 279
Query: 286 VA 287
+A
Sbjct: 280 IA 281
>gi|303288586|ref|XP_003063581.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454649|gb|EEH51954.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 290
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 49/88 (55%), Gaps = 4/88 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ---ATVVSW 126
F P+GYP +V+ Y R+ ++ +F L VL++QSLL A G+ TP A W
Sbjct: 2 FLPAGYPDTVSSDYARWLRWHLFSNFFRNVLEVLTSQSLLVALGMGSTPGALPLAAATKW 61
Query: 127 ILKDGMQHVGKLICSNLGA-RMDSEPKR 153
+LKDG+ L+ +GA R D +PKR
Sbjct: 62 VLKDGVGSFATLLSGVVGAQRYDEDPKR 89
>gi|159474944|ref|XP_001695583.1| hypothetical protein CHLREDRAFT_191958 [Chlamydomonas reinhardtii]
gi|158275594|gb|EDP01370.1| predicted protein [Chlamydomonas reinhardtii]
Length = 487
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 50/138 (36%), Positives = 74/138 (53%), Gaps = 8/138 (5%)
Query: 91 ALQHFTSAALSVLSTQSLLFAAGLR----PTPAQATVVSWILKDGMQHVGKLI-CSNLGA 145
L + A SVL++ +L+A GL PT A ++W+LKDGM +G L+ +
Sbjct: 142 GLNNIAVTANSVLASTFMLYAVGLGAGAIPT---AGALNWVLKDGMGQLGTLVFGKTIAH 198
Query: 146 RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIY 205
D K W L+ L LE+L+ L P FL M L N KG+A +AA +TR +
Sbjct: 199 NFDVHSKTWFFLSAVLLQAAAALEMLTVLVPGHFLLMGSLANMLKGLAWMAAGSTRSVFH 258
Query: 206 SSFAKEGNLSDLFAKGEA 223
SFA++ N++D+ AKG +
Sbjct: 259 LSFARDNNIADVTAKGTS 276
>gi|167378255|ref|XP_001734737.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165903647|gb|EDR29104.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 400
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 57/230 (24%), Positives = 102/230 (44%), Gaps = 8/230 (3%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL-FAAGLRPTPAQATVVSWILKD 130
P GYP SV GY R+ F + + + V+S ++L F +G A VV +
Sbjct: 51 PVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSVSAILRFNSG---HSAAFFVVFREVMT 107
Query: 131 GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAK 190
GM H+ +I G + K+WR+ D + +L +E+ P +F M+ + +
Sbjct: 108 GMMHL--IITERWGTSIVFFAKQWRMRIDLISELLRIIEIAFISTP-IFYSMSVTCSVIE 164
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
G+ + + + I +++AK N+++L K + + TLF VV L G + + +
Sbjct: 165 GVLQASRQVIKTRILNNYAKGNNVAELTEKVQNLETLFRVVSLLIGWLILHVLGDFTESG 224
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPA 299
+IV + H TLN +R ++++ F+ I SP
Sbjct: 225 IIVFICALISHCVCNFLLSNVIVFKTLNYERLTILMSYFMNIKNEILSPV 274
>gi|121709968|ref|XP_001272600.1| DUF647 domain protein [Aspergillus clavatus NRRL 1]
gi|119400750|gb|EAW11174.1| DUF647 domain protein [Aspergillus clavatus NRRL 1]
Length = 512
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 92/189 (48%), Gaps = 18/189 (9%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFR-ALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
+VD ++ F PSGYP SV + YL F A T+ L V TP
Sbjct: 65 LVDLLVDVFLPSGYPHSVTDDYLPRVYFHLATPQPTNNPLGV-------GVGNADATPTS 117
Query: 121 ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-- 177
A ++ IL+D + ++ ++ +G ++ E K +R+ AD DL L+ LSP+ P
Sbjct: 118 ALLLH-ILQDTSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMILDCLSPMIPAGV 176
Query: 178 ---LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
L AG+ + + VA +++ + + FA+ GNL+++ AK + T+ ++VG+
Sbjct: 177 SRITVLSTAGV---LRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLVGML 233
Query: 235 AGIQLASTV 243
G + S V
Sbjct: 234 VGSLVVSHV 242
>gi|302678497|ref|XP_003028931.1| hypothetical protein SCHCODRAFT_40050 [Schizophyllum commune H4-8]
gi|300102620|gb|EFI94028.1| hypothetical protein SCHCODRAFT_40050 [Schizophyllum commune H4-8]
Length = 278
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 72/273 (26%), Positives = 121/273 (44%), Gaps = 16/273 (5%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
VD F P+G+P SV+ YLRY + A+Q F S+ +LS+++LL G+ A AT
Sbjct: 1 VDVLAKVFLPAGFPNSVSPDYLRYQIYNAIQAFCSSLAGLLSSRALLEGFGVGDANATAT 60
Query: 123 --VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLC---- 175
V+ + +D + +I + LG + E K +R LAD D + +P+
Sbjct: 61 NAVLLTVAQDVFSRLTTIIAAYLLGPSLFPEAKTFRFLADVFNDTAIIFDASTPILRASG 120
Query: 176 -PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAK----EGNLSDLFAKGEAISTLFNV 230
L + L ++ I + + FA G++ D+ AK + TL +
Sbjct: 121 HANLRVAALCLAGASRAICGTICGGAKAALTLHFATPLRGTGDIGDVNAKDASKETLLAL 180
Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+G+ AG + + SS Q + L H+ +R + TLN QR ++ F
Sbjct: 181 MGMLAGSFVVHHL-SSTQATHALLLALLAAHLTVNYLGVRGVCLRTLNRQRASLAWTSFR 239
Query: 291 KTGRISSPADLRYREDLLF-PGRLIE--DAGNV 320
+ G SP + E +L PG ++ D G++
Sbjct: 240 EYGSAPSPTIVAGMEGILARPGAIVHAGDGGHI 272
>gi|159124134|gb|EDP49252.1| DUF647 domain protein [Aspergillus fumigatus A1163]
Length = 542
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 111/251 (44%), Gaps = 36/251 (14%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS 125
++ F PSGYP SV + YL LQ + T +LL
Sbjct: 70 LVDVFLPSGYPNSVTDDYLPAV----LQGVGVGNANASPTSALLLH-------------- 111
Query: 126 WILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
IL+D + ++ ++ +G ++ E K +R+ AD DL L+ LSP+ P AG
Sbjct: 112 -ILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMVLDCLSPMIP------AG 164
Query: 185 LGNFA--------KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
+G + + VA +++ + + FA+ GNL+++ AK + T+ +++G+ G
Sbjct: 165 VGRVGVLSAAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLLGMLVG 224
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
+ S V +S L +H+ +R+ + +LN QR ++ + +++
Sbjct: 225 SFIVSHV-TSFAATWTSLVFLLTVHLAMNYAAVRSVQMTSLNRQRANIVFSTLLESDPAL 283
Query: 297 SPADL-RYRED 306
P L R R D
Sbjct: 284 DPHRLSRSRVD 294
>gi|70991459|ref|XP_750578.1| DUF647 domain protein [Aspergillus fumigatus Af293]
gi|66848211|gb|EAL88540.1| DUF647 domain protein [Aspergillus fumigatus Af293]
Length = 542
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 111/251 (44%), Gaps = 36/251 (14%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS 125
++ F PSGYP SV + YL LQ + T +LL
Sbjct: 70 LVDVFLPSGYPNSVTDDYLPAV----LQGVGVGNANASPTSALLLH-------------- 111
Query: 126 WILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
IL+D + ++ ++ +G ++ E K +R+ AD DL L+ LSP+ P AG
Sbjct: 112 -ILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMVLDCLSPMIP------AG 164
Query: 185 LGNFA--------KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
+G + + VA +++ + + FA+ GNL+++ AK + T+ +++G+ G
Sbjct: 165 VGRVGVLSAAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLLGMLVG 224
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
+ S V +S L +H+ +R+ + +LN QR ++ + +++
Sbjct: 225 SFIVSHV-TSFAATWTSLVFLLTVHLAMNYAAVRSVQMTSLNRQRANIVFSTLLESDPAL 283
Query: 297 SPADL-RYRED 306
P L R R D
Sbjct: 284 DPHRLSRSRVD 294
>gi|342186523|emb|CCC96010.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 596
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 62/249 (24%), Positives = 102/249 (40%), Gaps = 7/249 (2%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
P+GYP S EG+ RY F S+ S ++ QS+L L +P W+LKD
Sbjct: 143 PTGYPDSCAEGFRRYFLLSLCSTFVSSFASSIAYQSILNGFLLVSSPQL-----WMLKDL 197
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ +N ++ PK W +++ L++ LE++ P + P L A L + +
Sbjct: 198 APALAAAYLANRVVSYENRPKFWFVVSVVLHNTSIVLEMIIPSVVPHNLLTAAVLTSCVR 257
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
A + TR FA GNL++L K + + V GI V +S+ +
Sbjct: 258 QSASLMFLVTRASALQHFAVSGNLAELTKKFNSFGIVIYTVSTALGIAYTYYV-ASLTAQ 316
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
++ ++ M LN ++I+ +VK G + L RE F
Sbjct: 317 LVTVLFCCGANLVLSYMSMCNIAFRVLNGTTISVILRAYVKEGCLGPGHVLSPREVSDFI 376
Query: 311 GRLIEDAGN 319
G + D G
Sbjct: 377 GLRMLDVGT 385
>gi|407034154|gb|EKE37096.1| hypothetical protein ENU1_209440 [Entamoeba nuttalli P19]
Length = 400
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 53/232 (22%), Positives = 101/232 (43%), Gaps = 12/232 (5%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKD- 130
P GYP SV GY R+ F + + + V+S +A LR P + + ++
Sbjct: 51 PVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSV-----SAILRINPGHSAAFFVVFREV 105
Query: 131 --GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
GM H+ +I G + K+WR+ + + +L +E+ P +F ++ +
Sbjct: 106 MTGMMHL--IITERWGTSIVFFAKQWRMRIELISELLRIIEIAFISTP-IFYSISVTCSV 162
Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
+G+ + + + I +++AK N+++L K + + TLF VV L G + + +
Sbjct: 163 IEGVLQASRQVIKTRILNNYAKGNNVAELTEKVQNLETLFRVVSLLIGWVILHVLGDFTE 222
Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPA 299
+IV + H TLN +R ++++ F+ I SP
Sbjct: 223 SGIIVFICALISHCICNFLLSNVIVFKTLNYERLTILMSYFMNIKNEILSPV 274
>gi|67467345|ref|XP_649789.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56466294|gb|EAL44403.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449707066|gb|EMD46790.1| Hypothetical protein EHI5A_044670 [Entamoeba histolytica KU27]
Length = 400
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 53/232 (22%), Positives = 101/232 (43%), Gaps = 12/232 (5%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKD- 130
P GYP SV GY R+ F + + + V+S +A LR P + + ++
Sbjct: 51 PVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSV-----SAILRINPGHSAAFFVVFREV 105
Query: 131 --GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
GM H+ +I G + K+WR+ + + +L +E+ P +F ++ +
Sbjct: 106 MTGMMHL--IITERWGTSIVFFAKQWRMRIELISELLRIIEIAFISTP-IFYSISVTCSV 162
Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
+G+ + + + I +++AK N+++L K + + TLF VV L G + + +
Sbjct: 163 IEGVLQASRQVIKTRILNNYAKGNNVAELTEKVQNLETLFRVVSLLIGWLILHVLGDFTE 222
Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPA 299
+IV + H TLN +R ++++ F+ I SP
Sbjct: 223 SGIIVFICALISHCICNFLLSNVIVFKTLNYERLTILMSYFMNIKNEILSPV 274
>gi|392585079|gb|EIW74420.1| DUF647-domain-containing protein [Coniophora puteana RWD-64-598
SS2]
Length = 515
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 121/280 (43%), Gaps = 46/280 (16%)
Query: 57 PVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRP 116
P R+ ++ F P+GYP SV YL Y + ALQ F S+ +++++ L G+
Sbjct: 50 PSLDRIRRVLVDLFLPAGYPASVTPDYLPYHAYNALQAFFSSLAGLIASRGALQGVGV-- 107
Query: 117 TPAQATVVSWILKDGMQHV-GKLICSN----LGARMDSEPKRWRILADALYDLGTGLEVL 171
A AT S +L +Q V G++ + G + E K +R++AD + D VL
Sbjct: 108 GDASATATSALLLTVLQDVFGRITTISAAYVFGTSLYPETKTFRLVADLVNDASI---VL 164
Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATR---------------------LPIYSSFAK 210
LCP+L L L F I++ A+ A R +P ++
Sbjct: 165 DTLCPRLSLSF--LHTFLLAISLCASAALRALCGLVAGGSKAALTNHFACTVPASTAAGG 222
Query: 211 E----GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVI 266
E G++ +L AK + T+ + G+ G L + +S++ + L H+ +
Sbjct: 223 EAVVAGDVGELNAKDASRETVVGLFGMLLGTLLVPRL-TSVRATWLTLVFLIAGHLLANY 281
Query: 267 EEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+R+ + TLN QR ++ + A +R RED
Sbjct: 282 LAVRSVSLRTLNRQRAGILWTAWA--------ARMRERED 313
>gi|303281718|ref|XP_003060151.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458806|gb|EEH56103.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 814
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 60/120 (50%), Gaps = 3/120 (2%)
Query: 57 PVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-- 114
P+ + + F P GYP SV+ YL + A Q S A +L + +LL AG+
Sbjct: 224 PLLRACLRALRRAFLPEGYPRSVSSDYLAFQLCDAAQGVCSYARGILCSTALLRGAGVGS 283
Query: 115 RPTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP 173
A + ++ +D +G + + G MD++ KRWR+ ADA+ DLG +E+ SP
Sbjct: 284 STATAASATAQFVARDLTSMLGGVAFAATRGRAMDADAKRWRLFADAMNDLGMAIELASP 343
Score = 44.7 bits (104), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 31/117 (26%), Positives = 51/117 (43%), Gaps = 17/117 (14%)
Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFA-----------------KEGNLSDLFAKG 221
FL A LG A+ + V +RATR + FA G+++D+ AK
Sbjct: 410 FLVAACLGGLARSLCGVTSRATRAALTQHFAAGDDAGRSFTRGRTGGSTTGSVADVDAKE 469
Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLN 278
T ++G+GAG+ + LL+ +HV+ + MR+ +NT+N
Sbjct: 470 GTQETAATLIGMGAGVLVTRLAADDAAAIWFWFALLTALHVFFNVRAMRSLRLNTMN 526
>gi|384494915|gb|EIE85406.1| hypothetical protein RO3G_10116 [Rhizopus delemar RA 99-880]
Length = 188
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 51/92 (55%), Gaps = 3/92 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P GYP SV+ Y ++ + L+ + +A+ VL +Q++L + GL A A + W+
Sbjct: 58 FLPVGYPESVHSCYKKFHSWLFLETYVGSAVGVLCSQAMLASLGLGTVEATGGAVAIQWV 117
Query: 128 LKDGMQHVGKL-ICSNLGARMDSEPKRWRILA 158
LKDG+ +GKL + DS PK W+ ++
Sbjct: 118 LKDGIGEIGKLFFIKKYASTFDSHPKTWKFVS 149
>gi|350630832|gb|EHA19204.1| hypothetical protein ASPNIDRAFT_186948 [Aspergillus niger ATCC
1015]
Length = 504
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 58/237 (24%), Positives = 111/237 (46%), Gaps = 29/237 (12%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
V D + F P+GYP SV + Y A +VL + A PT A
Sbjct: 68 VQDFLVEVFLPAGYPHSVTDDY---------------APAVLQGVGV-GNADASPT---A 108
Query: 122 TVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ--- 177
++ IL+D + ++ ++ +G ++ E K +R+ AD DL L+ LSP+ P
Sbjct: 109 ALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMTLDCLSPMIPAGAP 168
Query: 178 --LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
L AG+ + + VA +++ + + F++ GNL+++ AK + T+ +++G+
Sbjct: 169 RVTVLSTAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLV 225
Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
G + S V ++ I +L +H+ +R+ + +LN QR ++ + + T
Sbjct: 226 GSFVVSRV-TNFSTTWICLLMLQALHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNT 281
>gi|258576451|ref|XP_002542407.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902673|gb|EEP77074.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 442
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 120/239 (50%), Gaps = 28/239 (11%)
Query: 91 ALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSNLGARM 147
+LQ F+S+ +LS++++L G+ +P A +++ +L++ M + ++ ++ R+
Sbjct: 40 SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTAALLLN-VLQESMGRIATILFAH---RL 95
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYS 206
E K +R+ AD D ++ LSP+ P+ + + + L + + + VAA +++ + +
Sbjct: 96 --ECKMYRLAADIFNDSAIVMDCLSPMFPKPMRVGVLSLSSVLRAMCGVAAGSSKASLSA 153
Query: 207 SFAKEGNLSDLFAKGEAISTLFNVVG------LGAGIQLASTVCSSMQGKMIVGP----- 255
FA+ GNL++L AK + T+ +++G LG ++ + + G ++V
Sbjct: 154 HFARWGNLAELNAKDSSQETVISLMGMLVRPILGFLESISGLILTQQVGSLVVSRVTSTL 213
Query: 256 -------LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
LL IH+ + +RA + TLN QR ++ + R +P+ ++E +
Sbjct: 214 ATWAALILLLTIHITTNYFAVRAVNMTTLNRQRANIVFSTIFDENRTLTPSQAAHQERI 272
>gi|300175334|emb|CBK20645.2| unnamed protein product [Blastocystis hominis]
Length = 290
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 38/151 (25%), Positives = 72/151 (47%), Gaps = 1/151 (0%)
Query: 170 VLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
+++PL P FL +A + N K I+ +A AT I FA N+ D+ AK + ++
Sbjct: 1 MITPLFPNCFLALATISNIGKSISFLANGATGAGIRYGFANAMNIGDVTAKEGSQTSAVY 60
Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPL-LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
++G+ G+ ++S + + + +S + +Y++ + +R + TLN QR+ + D
Sbjct: 61 LLGMLFGVSVSSLLGDQSMSTIFASVICMSCLSLYAMNKSLRCVTLPTLNTQRSELACVD 120
Query: 289 FVKTGRISSPADLRYREDLLFPGRLIEDAGN 319
+ G I P + E P + D G
Sbjct: 121 YCSQGIIYQPEAVAKLERYYLPYLHVADCGT 151
>gi|449543750|gb|EMD34725.1| hypothetical protein CERSUDRAFT_116920 [Ceriporiopsis subvermispora
B]
Length = 507
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 65/252 (25%), Positives = 112/252 (44%), Gaps = 33/252 (13%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--V 123
F F P+GYP +V+ YL Y F LQ S+ S+L+++++L G+ A AT +
Sbjct: 53 FSGIFLPAGYPNTVSPDYLHYQIFNGLQAGCSSIASLLASRAVLEGHGVGEASASATNAM 112
Query: 124 VSWILKDGMQHVGKLICSN----LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-- 177
+ I++D V +LI LG + E K++R+LAD D L+V+SPL
Sbjct: 113 LVNIIQDV---VSRLITIGSGYYLGTSLFPEAKKYRLLADVYNDAAVVLDVISPLFTHWS 169
Query: 178 -------------LFLEMAGL---GNFAKGIAVVAARATRLPIYSSFAKE----GNLSDL 217
+L + L G+F + + + A ++ + FA G++ DL
Sbjct: 170 ISLVYPFIRREQGFYLRVLALCLSGSF-RALCGMVAGGSKAALTMHFATAGPVPGDIGDL 228
Query: 218 FAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTL 277
AK + T+ ++G+ G + V S ++ +L + H+ +R + T
Sbjct: 229 NAKDGSKETVLALIGMLLGTFVIKHVDSGFATYAVLS-MLILTHLAINFVAVRGVVMRTF 287
Query: 278 NPQRTAMIVADF 289
N R A A +
Sbjct: 288 NRHRAAAAWASY 299
>gi|119467992|ref|XP_001257802.1| hypothetical protein NFIA_052500 [Neosartorya fischeri NRRL 181]
gi|119405954|gb|EAW15905.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 515
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 52/224 (23%), Positives = 112/224 (50%), Gaps = 22/224 (9%)
Query: 91 ALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSN-LGAR 146
+LQ F S+ +LS++++L G+ +P A ++ IL+D + ++ ++ +G
Sbjct: 86 SLQAFCSSIAGLLSSRAVLQGVGVGNANASPTSALLLH-ILQDSSGRIATILFAHRVGTA 144
Query: 147 MDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFA--------KGIAVVAAR 198
++ E K +R+ AD D+ L+ LSP+ P AG+G + + VA
Sbjct: 145 LEPECKMYRLAADVFNDVAMVLDCLSPMIP------AGVGRVGVLSAAGVLRALCGVAGG 198
Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV-CSSMQGKMIVGPLL 257
+++ + + FA+ GNL+++ AK + T+ +++G+ G + S V C + ++ L
Sbjct: 199 SSKASLSAHFARGGNLAEVNAKDSSQETIISLIGMLVGSFIVSHVTCFAATWASLL--FL 256
Query: 258 SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
+H+ +R+ + +LN QR ++++ +++ P L
Sbjct: 257 LTVHLGMNYAAVRSVQMTSLNRQRANIVLSTLLESDLALDPRSL 300
>gi|340059715|emb|CCC54109.1| conserved hypothetical protein, fragment, partial [Trypanosoma
vivax Y486]
Length = 618
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 54/227 (23%), Positives = 96/227 (42%), Gaps = 7/227 (3%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
PSGYP S G+ RY S+ S + QS+L L +P W+LKD
Sbjct: 183 PSGYPHSCGVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLASSPQL-----WMLKDL 237
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ + +N ++ PK W +++ +++L E++ P L PQ L A + +
Sbjct: 238 VPALAAAYLANRIVSYENRPKFWFVVSVGMHNLSVIAEMIVPSLVPQHLLFAAVTTSCIR 297
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
A + TR FA NL++L K + + V GI S V +S+ +
Sbjct: 298 QSASLMFLVTRASALQHFAISNNLAELTKKFNSFGIVIYTVATALGIAFTSVV-TSLTAQ 356
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
+I + +++ + M LN ++I+ ++++ G S
Sbjct: 357 LITVLMCCAVNLIISHQSMCKIAFRMLNGTTISVILHEYIQNGSTCS 403
>gi|242208267|ref|XP_002469985.1| predicted protein [Postia placenta Mad-698-R]
gi|242226469|ref|XP_002477622.1| predicted protein [Postia placenta Mad-698-R]
gi|220722555|gb|EED77179.1| predicted protein [Postia placenta Mad-698-R]
gi|220731047|gb|EED84896.1| predicted protein [Postia placenta Mad-698-R]
Length = 226
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 109/225 (48%), Gaps = 12/225 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQAT--VVS 125
F P+G+P +V+ G +Y + ALQ F S+ +++++++L F G+ A AT +
Sbjct: 5 FLPAGFPATVSPG--KYQIYNALQAFCSSLAGLIASRAVLQGFRHGVGKAGASATHAIFL 62
Query: 126 WILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
+L+D + ++ LG + E K +R++AD D L+ LSP +
Sbjct: 63 TVLQDIFSRLTTIVAGYYLGTSLFPEAKAYRLMADIFNDAAIILDTLSPHLGSFRVASLC 122
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKE----GNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
L + + V A ++ + FA G++ DL AK + T+ ++G+ +G +
Sbjct: 123 LSGAFRALCGVVAGGSKAALTVHFATTAKGAGDVGDLSAKDGSKETVLALLGMLSGTAVM 182
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
V S+ +++ L++ +H++ +R +LN QR +++
Sbjct: 183 HYVHSTRATYVVLFALIA-LHLFCNYLAVRVVVFRSLNRQRASIV 226
>gi|330932017|ref|XP_003303615.1| hypothetical protein PTT_15911 [Pyrenophora teres f. teres 0-1]
gi|311320239|gb|EFQ88270.1| hypothetical protein PTT_15911 [Pyrenophora teres f. teres 0-1]
Length = 391
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 53/218 (24%), Positives = 103/218 (47%), Gaps = 12/218 (5%)
Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
TP A ++S I + L LG ++ E K +R AD D L+ LSP P
Sbjct: 34 TPTTALLLSVIHSSTGRIATILFAHRLGTALEPECKMYRFAADIFNDTAMVLDCLSPAFP 93
Query: 177 Q-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
+ + + + + + V A +++ + + FA++GNL ++ AK + T+ +++G+ A
Sbjct: 94 KPIRVAVLSFSSCLSALCGVCAGSSKASLSAHFARKGNLGEVNAKDSSQETVISLLGMLA 153
Query: 236 GIQLASTVCSSMQ--GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
G + S + S + G +I+ + + Y+ + RA ++ LN QR ++ ++ + G
Sbjct: 154 GSVVISYITSPLATWGTLILLLTIHLATNYAAV---RAVIMHCLNRQRANILFSNMFQHG 210
Query: 294 RISSPADLRYREDLLFPGRLIEDAGNVKMDRHGIRAEC 331
+ SP D+ RE R+ E G ++ + C
Sbjct: 211 LVLSPRDVSQRE------RVFERGGVLRWSDDKVLGRC 242
>gi|326469821|gb|EGD93830.1| hypothetical protein TESG_01362 [Trichophyton tonsurans CBS 112818]
Length = 413
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 104/211 (49%), Gaps = 21/211 (9%)
Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAG 184
+L++ M + ++ ++ LG ++ E K +R+ AD L D L+ LSP+ P+ + + +
Sbjct: 57 VLQESMGRIATILFAHRLGTSLEPECKMYRLAADILNDSAMVLDCLSPIFPKPVRVGLLS 116
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAI--STLFNVVGLGAGIQLAST 242
L + + + VAA +++ + + FA+ GNL +L A G I S L VV S
Sbjct: 117 LSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAVGLNIPPSQLTGVV--------LSY 168
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
+ S ++ + + LL ++H+ + +RA + TLN QR ++ + + I +PA+
Sbjct: 169 ITSPLETWIALIVLL-IVHLGTNHAAVRAVKMTTLNRQRANIVFSHLFEDNLILTPAETS 227
Query: 303 YREDLLFPGRLIEDAGNV--KMDRHGIRAEC 331
E R+ E G + K + GI C
Sbjct: 228 KEE------RIFERDGVLRWKAESSGILGTC 252
>gi|169604674|ref|XP_001795758.1| hypothetical protein SNOG_05352 [Phaeosphaeria nodorum SN15]
gi|111066623|gb|EAT87743.1| hypothetical protein SNOG_05352 [Phaeosphaeria nodorum SN15]
Length = 473
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 65/285 (22%), Positives = 121/285 (42%), Gaps = 38/285 (13%)
Query: 52 VDDSRPVYHR-VVDSFLNKFFPSGYPFS------------------VNEGYLRYTQFRAL 92
VD +P + V + F P+GYP S V EG+L L
Sbjct: 28 VDVVKPAERKSYVQQVFDVFLPAGYPQSDSLQAFSSSIAGMLASRAVLEGHLDAEDTLRL 87
Query: 93 QHFTSAALS--------VLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSN-L 143
Q+ + + + S QS G A ++ +L++ M + ++ ++ L
Sbjct: 88 QNLVPSVYAASIGDQRHMSSHQSRCVGVGDSSASPTAALLLSVLQESMGRIATILFAHRL 147
Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA-GLGNFAKGIAVVAARATRL 202
G ++ E K +R+ AD D L+ LSP P+ F + + + + V A + +
Sbjct: 148 GTALEPECKMYRLAADVFNDTAMILDCLSPAFPKPFRVLVLSFSSVLRSLCGVCAGSAKA 207
Query: 203 PIYSSFAKEGNLSDLFAKGEAI--STLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVI 260
+ + FA++GNL ++ A + + F++ GL G +T I+ LLS I
Sbjct: 208 SLSAHFARKGNLGEVNAHITHMNPNVDFDIEGLEPGNGHIAT------WNAILIALLS-I 260
Query: 261 HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
H+ + ++A + LN QR ++ + + G + +P ++ RE
Sbjct: 261 HLATNYAAVKAVSMRCLNRQRANIVFGNLMHHGSVLTPTEVSQRE 305
>gi|115389162|ref|XP_001212086.1| predicted protein [Aspergillus terreus NIH2624]
gi|114194482|gb|EAU36182.1| predicted protein [Aspergillus terreus NIH2624]
Length = 411
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 62/115 (53%), Gaps = 5/115 (4%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQAT 122
+ F P+GYP SV + Y Y F +LQ F+S+ +LS++++L G+ +P A
Sbjct: 57 LIEVFLPAGYPHSVTDDYTAYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNPNASPTSAL 116
Query: 123 VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
++ IL+D + ++ ++ +G + E K +R+ AD DL L+ LSP P
Sbjct: 117 LLH-ILQDTSGRLATILFAHRVGTALQPECKSYRLAADLFNDLAMILDCLSPAAP 170
>gi|392558453|gb|EIW51641.1| DUF647-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 450
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 70/292 (23%), Positives = 125/292 (42%), Gaps = 37/292 (12%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWI 127
F P GYPF+V+ +RY + A+Q F S+ + +++++L G+ A AT + I
Sbjct: 28 FLPEGYPFTVSPQCVRYQIWNAVQAFCSSLAGLFASRAVLQGHGVGNASASATDAIFLTI 87
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ--------- 177
L+D + ++ LG + E K +R+LAD L D + LSP
Sbjct: 88 LQDVFSRLTTIVSGYYLGTSLYPEAKTYRLLADILNDAAMVTDTLSPHLAHFSFSLSYPF 147
Query: 178 -----------LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE--GNLSDLFAKGEAI 224
+ L M+G G AV L ++ + A E G++ DL AK +
Sbjct: 148 VVPASNSSLRVVALCMSGAFRALCG-AVAGGSKAALTVHFATAGERPGDVGDLSAKDGSK 206
Query: 225 STLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAM 284
T+ ++G+ +G + V + Q +V L H+ + +R + N QR +
Sbjct: 207 ETVLALLGMLSGSVVLRYV-HTAQATYVVLFGLIFCHLAANFIAVRVIAMRAFNRQRAGI 265
Query: 285 IVADFVKTGRISS--PADLRY-REDLLFPGRLIEDAGNVKMDRH--GIRAEC 331
+F ++ + + P L R++ R+ D+ + + +H G A C
Sbjct: 266 AWCEFRRSFDVVTVVPDYLHVARQE-----RIFTDSARIPISQHARGTYAHC 312
>gi|440299036|gb|ELP91648.1| hypothetical protein EIN_206450 [Entamoeba invadens IP1]
Length = 360
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 59/271 (21%), Positives = 111/271 (40%), Gaps = 17/271 (6%)
Query: 30 ETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQF 89
E+ S +R Y F D + +P + + + P GYP +V GY R+ F
Sbjct: 18 ESKGSWSRRYIFDEDYN---------QPPFLTTRNHLQDIIVPVGYPTTVERGYFRFVAF 68
Query: 90 RALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSN-LGARMD 148
+ L V+S +A LR + + I ++ M LI + G+ +
Sbjct: 69 SVAMESCNMMLLVMS-----ISASLRIDSSHSAAFFVIFREVMAGTMHLIITERWGSSIV 123
Query: 149 SEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSF 208
K+WR+ + + +L +++ P +F + + + + + + + I S++
Sbjct: 124 FYAKQWRMRIEIISELLRLIQIAFIASP-IFYSINVTCSTIESVLNASRQVIKTKILSNY 182
Query: 209 AKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEE 268
AKE N+++L K + I TL V+ L G + + + +IV L + HV
Sbjct: 183 AKENNVAELTEKAQNIETLARVISLLIGFIVLHLLGDFTEHGIIVFMLALIAHVVCNFLM 242
Query: 269 MRAAPVNTLNPQRTAMIVADFVKT-GRISSP 298
R +N +R +++ F I SP
Sbjct: 243 GRVIVFKAINYERLMILMKYFTTVKNEILSP 273
>gi|159490370|ref|XP_001703152.1| hypothetical protein CHLREDRAFT_195109 [Chlamydomonas reinhardtii]
gi|158270782|gb|EDO96617.1| predicted protein [Chlamydomonas reinhardtii]
Length = 280
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 48/77 (62%), Gaps = 2/77 (2%)
Query: 83 YLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLIC 140
Y Y +R +Q+F A+SV +TQSLL A G+ R + A ++W++KDG +G+L+
Sbjct: 11 YAPYMAWRGVQYFFGGAISVFTTQSLLGALGVAGRYSGEAAAAINWVIKDGAGRLGRLLF 70
Query: 141 SNLGARMDSEPKRWRIL 157
+ G +D E K++R++
Sbjct: 71 ARWGRELDCELKQFRLM 87
>gi|429329557|gb|AFZ81316.1| hypothetical protein BEWA_007250 [Babesia equi]
Length = 743
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 57/299 (19%), Positives = 125/299 (41%), Gaps = 48/299 (16%)
Query: 32 SDSVTRHYQFQPDGHLSVKVVDDS-------RPVYHRVVDSFLNKFFPSG-----YPFSV 79
S S +R+ +PD + + ++ + R ++R++ +F F +P
Sbjct: 122 SSSDSRNNSAKPDPTIVSRGINQNYGRKLSVRSFFNRIIGAF--NFLKRCINIIIHPSVA 179
Query: 80 NEGYLRYTQFRALQHFTSAALSVLSTQS------------------LLFAAGLRPTPAQA 121
+ Y +Y +R + F+ + L +++ L+ + + ++
Sbjct: 180 TKQYYKYAHWRMFERFSFSVLQTMASNIKSAPTNLQQNKVENTNIILMLKNKISESKMKS 239
Query: 122 TVVSWILKDGMQHVGKLIC-SNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC---PQ 177
+ + I KD + + S +G D+ PK +R+L L T ++ + + P+
Sbjct: 240 ILATMIFKDAFSRIFHFLWFSEIGVGFDNNPKAFRLLGSILCSTATLIDFVCNVFTFGPK 299
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG---NLSDLFAKGEAISTLFNVVGLG 234
+ M N + I ++ A++ P Y+SF + N+ ++ AK EA + + + G+
Sbjct: 300 MI--MGACTNAVRQIGLLTMSASQGPFYNSFHVKNTATNIGEITAKLEAQNPICDFSGIA 357
Query: 235 AGIQLASTVCSS----MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
GI L + + +I+ L+S + Y + ++ LNPQR +++ DF
Sbjct: 358 LGIYLTNLLNEQPFVIQATTVILTSLISNVATYMCV---KSVSFKNLNPQRCFVVLEDF 413
>gi|301123209|ref|XP_002909331.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100093|gb|EEY58145.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 122
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 60/119 (50%), Gaps = 26/119 (21%)
Query: 106 QSLLFAAGLR----PTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADAL 161
QSLL+ AGL PT A V+ +L+DG+ +WRI +
Sbjct: 2 QSLLYDAGLVAGAIPT---AAAVNRVLRDGL-------------------GQWRITSSVA 39
Query: 162 YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK 220
D+ E+L+PL P FL +A L N AK +A ++A ATR ++SF NL+D+ AK
Sbjct: 40 LDVAVLGEILTPLAPGSFLVIASLTNVAKNVAWLSASATRAGFHNSFVIRENLADVTAK 98
>gi|62320358|dbj|BAD94740.1| putative protein [Arabidopsis thaliana]
Length = 295
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 76/164 (46%), Gaps = 10/164 (6%)
Query: 158 ADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLP----IYSSFAKEGN 213
AD + G+ ++ + L P FL +A GN AK +A R R P I + FA GN
Sbjct: 3 ADFIGSAGSFFDLATQLYPSQFLLLASTGNLAKAVA----RGLRDPSFRVIQNHFAISGN 58
Query: 214 LSDLFAKGEAISTLFNVVGLGAGIQLAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
L ++ AK E ++GLG GI + T + S ++ + ++H++ + +
Sbjct: 59 LGEVAAKEEVWEVAAQLIGLGFGILIIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAV 118
Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIE 315
NT+N +R +IV V + D RE++L R ++
Sbjct: 119 LQFNTVNLKRARIIVESHVVHSVVPGYVDCNKRENILLWQRFMK 162
>gi|255945167|ref|XP_002563351.1| Pc20g08280 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588086|emb|CAP86157.1| Pc20g08280 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 504
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 89/198 (44%), Gaps = 14/198 (7%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYL-----------RYTQFRALQHFTSAALSVLSTQSLLF 110
++ S + F PSGYP SV++ YL + L T A + TQ
Sbjct: 61 IISSLSDVFLPSGYPQSVSDDYLPPPLLPSGSSRSISHPAQLHRPTQRAKQLTPTQHTGV 120
Query: 111 AAGLRPTPAQATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLE 169
G + ++ IL+D + + ++ +G ++ E K +R+ AD D+ L+
Sbjct: 121 GVGNANASPTSALLLHILQDISGRIATICFAHRIGTALEPECKTYRLAADVFNDIAMILD 180
Query: 170 VLSPLCPQLFLEMAGLGNFA--KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTL 227
LSP P +A L + + VA +++ + + FAK GNL++L AK + T+
Sbjct: 181 CLSPGVPAGPARVAVLSTAGVLRALCGVAGGSSKASLSAHFAKWGNLAELNAKDSSQETV 240
Query: 228 FNVVGLGAGIQLASTVCS 245
++ G+ G + S + S
Sbjct: 241 ISLFGMLVGSVVISHITS 258
>gi|147803314|emb|CAN66551.1| hypothetical protein VITISV_039658 [Vitis vinifera]
Length = 470
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/194 (24%), Positives = 88/194 (45%), Gaps = 15/194 (7%)
Query: 124 VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
+ W+ KDG+ VG+L +G + K ++ +++DL T L P FL++A
Sbjct: 162 IRWVSKDGIGAVGRLF---IGLSVIKRAKGYK----SIFDLST------QLYPAYFLQLA 208
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST- 242
LGN AK +A + I + FA GNL ++ AK E ++GL GI + ++
Sbjct: 209 SLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLALGIMVLTSP 268
Query: 243 -VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
V +S + ++H++ + + ++N +R ++V V + D
Sbjct: 269 LVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHSIVPGYVDC 328
Query: 302 RYREDLLFPGRLIE 315
+E +L R ++
Sbjct: 329 NRKEKILSWQRFLK 342
>gi|390341859|ref|XP_795645.3| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2
[Strongylocentrotus purpuratus]
Length = 311
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 36/151 (23%), Positives = 74/151 (49%), Gaps = 2/151 (1%)
Query: 161 LYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK 220
+ D +E++S PQ F+ +A + + K + VA ATR + A+ N++D+ AK
Sbjct: 1 MNDAAMCIELISVFFPQYFVIIACVSSLFKSMVGVAGGATRAALTMHQARRNNMADVSAK 60
Query: 221 GEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQ 280
+ TL N+ L G+ + ++ + L + +H+Y+ + + TLN
Sbjct: 61 DGSQETLVNLAALFVGLIITPIASKNILLTWSLFFLCTCLHLYANYRAVTCVVMETLNQN 120
Query: 281 RTAMIVADFVKTGR--ISSPADLRYREDLLF 309
R ++V D++ + + +S P + RE +++
Sbjct: 121 RFHILVQDYLHSSQFSMSGPDAVNSREPVIW 151
>gi|390341857|ref|XP_003725545.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1
[Strongylocentrotus purpuratus]
Length = 311
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 36/151 (23%), Positives = 74/151 (49%), Gaps = 2/151 (1%)
Query: 161 LYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK 220
+ D +E++S PQ F+ +A + + K + VA ATR + A+ N++D+ AK
Sbjct: 1 MNDAAMCIELISVFFPQYFVIIACVSSLFKSMVGVAGGATRAALTMHQARRNNMADVSAK 60
Query: 221 GEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQ 280
+ TL N+ L G+ + ++ + L + +H+Y+ + + TLN
Sbjct: 61 DGSQETLVNLAALFVGLIITPIASKNILLTWSLFFLCTCLHLYANYRAVTCVVMETLNQN 120
Query: 281 RTAMIVADFVKTGR--ISSPADLRYREDLLF 309
R ++V D++ + + +S P + RE +++
Sbjct: 121 RFHILVQDYLHSSQFSMSGPDAVNSREPVIW 151
>gi|422292653|gb|EKU19955.1| hypothetical protein NGA_2040500, partial [Nannochloropsis gaditana
CCMP526]
Length = 181
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 58/114 (50%)
Query: 195 VAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVG 254
+AA A+ I A+ NL+++ AKG A T ++ GLG G+ A S +
Sbjct: 2 IAAGASNGNIVEHMARTNNLAEVLAKGGAQHTAVSLFGLGFGMWFARVANQSPRRVWTAY 61
Query: 255 PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
LL+VIH+ + MR ++N +R +++ F + + SPA++ RE ++
Sbjct: 62 TLLTVIHLTANYAAMRVLAFTSINRRRLNVLLTAFRENASVLSPAEVARRETII 115
>gi|17945727|gb|AAL48912.1| RE31721p [Drosophila melanogaster]
Length = 266
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 76/157 (48%), Gaps = 3/157 (1%)
Query: 167 GLEV-LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIS 225
G+E+ + P P ++ K I VA ATR + A GNL+D+ +K +
Sbjct: 2 GIEIYVLPKYPHFSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQE 61
Query: 226 TLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
T N+V G+ L S + S I ++S +H+Y+ ++ +RA + + N R +
Sbjct: 62 TCVNLVASFVGLYLLSLIKSQAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIA 120
Query: 286 VADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKM 322
+ +F ++ R+ SP + E + G+ + + N+K+
Sbjct: 121 LEEFFRSSRMLSPQQVNAMERVTV-GQTVSVSLNIKL 156
>gi|397621626|gb|EJK66381.1| hypothetical protein THAOC_12704 [Thalassiosira oceanica]
Length = 480
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 72/161 (44%), Gaps = 16/161 (9%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA--TV 123
F + F P+G+P + GYL Y +Q S V+ST + L AAG+ + A A
Sbjct: 57 FGDLFLPAGHPSTTAGGYLHYQVCDGVQGLCSYLRGVVSTGAALAAAGVGDSDATAMGAA 116
Query: 124 VSWILKDGMQHVGKLICSNLG-ARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
V+W ++DG+ +G L S ++ DS K +R+LAD D G L++ P
Sbjct: 117 VTWAVRDGLGMLGGLAFSYYASSKFDSRVKEYRLLADVANDAGMILDMARP--------- 167
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
+ A+G V A P A +G L D+ E
Sbjct: 168 ----SDARGGLVAATPGVPRPRLGQRALQGRLRDVRGSNEG 204
>gi|407850424|gb|EKG04824.1| hypothetical protein TCSYLVIO_004113 [Trypanosoma cruzi]
Length = 631
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 96/229 (41%), Gaps = 15/229 (6%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
P GYP S G+ RY S+ S + QS+L L +P W++KD
Sbjct: 164 PRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLGSSPQL-----WMMKDL 218
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ + +N ++ PK W ++ L +L E+L P L PQ L A + + A+
Sbjct: 219 LPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLLLGAIMTSCAR 278
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAK----GEAISTLFNVVGLGAGIQLASTVCSS 246
A + +R FA NL++L K G I T+F G+ + ++V +S
Sbjct: 279 QSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMMIYTIFTAFGI-----MYTSVVTS 333
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ ++ +++ + M LN ++++ D+V G I
Sbjct: 334 LVAQLATVLFCCALNLVLSYQSMSKIAFRILNATTLSVVLRDYVGRGGI 382
>gi|261335583|emb|CBH18577.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 595
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/241 (25%), Positives = 97/241 (40%), Gaps = 20/241 (8%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKD 130
P+GYP S EG+ RY F S+ S + QS+L L +P W+LKD
Sbjct: 166 MPAGYPDSCGEGFRRYFLLSLCSSFVSSFASSIGYQSILNGFLLASSPQL-----WMLKD 220
Query: 131 GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFA 189
+ +N ++ PK W + + AL++L E++ P P L A L +
Sbjct: 221 LAPALAAAYLANRVISYENRPKFWFVFSVALHNLSVIAEMIIPTTVPNHLLAAAILTSCV 280
Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
+ A + TR FA NL++L K + + V GI S V S
Sbjct: 281 RQSASLMFLVTRASALQHFAISNNLAELTKKFNSFGMVIYTVSTALGIAYTSLVPSVTAQ 340
Query: 250 ----KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-----GRISSPAD 300
M G L++ H+ M LN ++I+ +++ GR+ SP +
Sbjct: 341 LATVLMCCGANLAISHL-----SMCNIAFRILNETTLSVILRFYMREEGHQRGRVLSPRE 395
Query: 301 L 301
+
Sbjct: 396 V 396
>gi|74026006|ref|XP_829569.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70834955|gb|EAN80457.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 595
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/240 (25%), Positives = 97/240 (40%), Gaps = 20/240 (8%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
P+GYP S EG+ RY F S+ S + QS+L L +P W+LKD
Sbjct: 167 PAGYPDSCGEGFRRYFLLSLCSSFVSSFASSIGYQSILNGFLLASSPQL-----WMLKDL 221
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ +N ++ PK W + + AL++L E++ P P L A L + +
Sbjct: 222 APALAAAYLANRVISYENRPKFWFVFSVALHNLSVIAEMIIPTTVPNHLLAAAILTSCVR 281
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG- 249
A + TR FA NL++L K + + V GI S V S
Sbjct: 282 QSASLMFLVTRASALQHFAISNNLAELTKKFNSFGMVIYTVSTALGIAYTSLVPSVTAQL 341
Query: 250 ---KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-----GRISSPADL 301
M G L++ H+ M LN ++I+ +++ GR+ SP ++
Sbjct: 342 ATVLMCCGANLAISHL-----SMCNIAFRILNETTLSVILRFYMREEGHQRGRVLSPREV 396
>gi|71661222|ref|XP_817635.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70882839|gb|EAN95784.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 630
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 96/229 (41%), Gaps = 15/229 (6%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
P GYP S G+ RY S+ S + QS+L L +P W++KD
Sbjct: 164 PRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLGSSPQL-----WMMKDL 218
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ + +N ++ PK W ++ L +L E+L P L PQ L A + + A+
Sbjct: 219 LPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLLLGAIMTSCAR 278
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAK----GEAISTLFNVVGLGAGIQLASTVCSS 246
A + +R FA NL++L K G I T+F G+ + ++V +S
Sbjct: 279 QSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMVIYTIFTAFGI-----MYTSVVTS 333
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ ++ +++ + M LN ++++ D+V G I
Sbjct: 334 LVAQLATVLFCCALNLVLSYQSMSKIAFRILNVTTLSVVLRDYVGRGGI 382
>gi|407411065|gb|EKF33281.1| hypothetical protein MOQ_002854, partial [Trypanosoma cruzi
marinkellei]
Length = 636
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 96/229 (41%), Gaps = 15/229 (6%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
P GYP S G+ RY S+ S + QS+L L +P W++KD
Sbjct: 170 PRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLGSSPQL-----WMMKDL 224
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ + +N ++ PK W ++ L +L E+L P L PQ L A + + A+
Sbjct: 225 LPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLLLGAIMTSCAR 284
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAK----GEAISTLFNVVGLGAGIQLASTVCSS 246
A + +R FA NL++L K G I T+F G+ + ++V +S
Sbjct: 285 QSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMVIYTIFTAFGI-----MYTSVVTS 339
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ ++ +++ + M LN ++++ D+V G I
Sbjct: 340 LVAQLATVLFCCALNLVLSYQSMSKIAFRILNVTTLSVVLRDYVGRGGI 388
>gi|71414333|ref|XP_809272.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70873629|gb|EAN87421.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 630
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 96/229 (41%), Gaps = 15/229 (6%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
P GYP S G+ RY S+ S + QS+L L +P W++KD
Sbjct: 164 PRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLGSSPQL-----WMMKDL 218
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ + +N ++ PK W ++ L +L E+L P L PQ L A + + A+
Sbjct: 219 LPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLLLGAIMTSCAR 278
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAK----GEAISTLFNVVGLGAGIQLASTVCSS 246
A + +R FA NL++L K G I T+F G+ + ++V +S
Sbjct: 279 QSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMVIYTIFTAFGI-----MYTSVVTS 333
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ ++ +++ + M LN ++++ D+V G I
Sbjct: 334 LVAQLATVLFCCALNLVLSYQSMSKIAFRILNVTTLSVVLRDYVGRGGI 382
>gi|387203446|gb|AFJ68984.1| hypothetical protein NGATSA_2040500, partial [Nannochloropsis
gaditana CCMP526]
Length = 181
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 57/114 (50%)
Query: 195 VAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVG 254
+AA A+ I A+ NL+++ AKG A T ++ GLG + A S +
Sbjct: 2 IAAGASNGNIVEHMARTNNLAEVLAKGGAQHTAVSLFGLGFRMWFARVANQSPRRVWTAY 61
Query: 255 PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
LL+VIH+ + MR ++N +R +++ F + + SPA++ RE ++
Sbjct: 62 TLLTVIHLTANYAAMRVLAFTSINRRRLNVLLTAFRENASVLSPAEVARRETII 115
>gi|325088852|gb|EGC42162.1| DUF647 domain-containing protein [Ajellomyces capsulatus H88]
Length = 507
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 47/206 (22%), Positives = 88/206 (42%), Gaps = 32/206 (15%)
Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAG 184
+L+D M + ++ ++ LG ++ E K +R+ AD L D + LSP P+ L + +
Sbjct: 48 VLQDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSALIFDCLSPALPKRLRVVVLA 107
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDL----------------------FAKGE 222
+ + + VAA +++ + S FAK GNL +L F +
Sbjct: 108 FSSVLRALCGVAAGSSKASLSSHFAKCGNLGELNAVRFPNVLVIAISGAAQRGFPFLFSD 167
Query: 223 AISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLS--------VIHVYSVIEEMRAAPV 274
A L I L +C S+ I P + ++H+ + +R+ +
Sbjct: 168 ADQKLQKDSSQETVISLLGILCGSVVVSHISTPFATWTTLLLLLLVHLSTNYAAVRSVNM 227
Query: 275 NTLNPQRTAMIVADFVKTGRISSPAD 300
TLN QRT ++ + + G + +P +
Sbjct: 228 TTLNRQRTNIVFSTLFEKGSVLTPTE 253
>gi|56752623|gb|AAW24525.1| SJCHGC08554 protein [Schistosoma japonicum]
Length = 127
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 41/72 (56%), Gaps = 2/72 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P GYP SV++ YL Y + +Q F S+ L++Q++L G+ + A ++W+
Sbjct: 45 FLPVGYPSSVSDDYLEYQIWDTIQAFASSITGALASQAVLIGVGVGNSSATILGASLTWM 104
Query: 128 LKDGMQHVGKLI 139
KDG +G+++
Sbjct: 105 FKDGSGMIGRIL 116
>gi|242208074|ref|XP_002469889.1| predicted protein [Postia placenta Mad-698-R]
gi|220731120|gb|EED84968.1| predicted protein [Postia placenta Mad-698-R]
Length = 214
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/215 (23%), Positives = 101/215 (46%), Gaps = 10/215 (4%)
Query: 84 LRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQAT--VVSWILKDGMQHVGKLI 139
+RY + ALQ F S+ +++++++L F G+ A AT + +L+D + ++
Sbjct: 1 VRYQIYNALQAFCSSLAGLIASRAVLQGFRHGVGKAGASATHAIFLTVLQDIFSRLTTIV 60
Query: 140 CS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAAR 198
LG + E K +R+ AD D L+ LSP + L + + V A
Sbjct: 61 AGYYLGTSLFPEAKAYRLTADIFNDAAIILDTLSPHLGSFRVASLCLSGAFRALCGVVAG 120
Query: 199 ATRLPIYSSFAKE----GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVG 254
++ + FA G++ DL AK + T+ ++G+ +G + V S+ +++
Sbjct: 121 GSKAALTVHFATTAKGAGDVGDLSAKDGSKETVLALLGMLSGTAVMHYVHSTRATYVVLF 180
Query: 255 PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
L++ +H++ +R +LN QR +++ A +
Sbjct: 181 ALIA-LHLFCNYLAVRVVVFRSLNRQRASIVWAAY 214
>gi|414876406|tpg|DAA53537.1| TPA: hypothetical protein ZEAMMB73_477539 [Zea mays]
Length = 295
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 67/153 (43%), Gaps = 2/153 (1%)
Query: 158 ADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDL 217
AD + G+ E+ + L P FL +A LGN AK IA + I + FA+ GNL ++
Sbjct: 3 ADFIGSAGSIFELSTTLYPGYFLPLASLGNLAKAIARGFKDPSFRVIQNHFAESGNLGEV 62
Query: 218 FAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSV--IHVYSVIEEMRAAPVN 275
AK E ++GL G+ + +V LSV +H++ + +
Sbjct: 63 AAKEEVWEVGAQLLGLSIGVLIMDKTGVKSSYLTLVLTWLSVRLLHLWLRYQSISVLKFR 122
Query: 276 TLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
T+N +R ++V V + E++L
Sbjct: 123 TINLKRGRILVKSHVAQHTVPGYVACNEEENIL 155
>gi|134084659|emb|CAK43337.1| unnamed protein product [Aspergillus niger]
Length = 484
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 90/206 (43%), Gaps = 26/206 (12%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHF-TSAALSVL---------STQSLLFA 111
V D + F P+GYP SV + Y Q R L H S +L L S L
Sbjct: 68 VQDFLVEVFLPAGYPHSVTDDYAPPYQTRRLIHEGLSPSLQQLDCWPPILESSAARLPLG 127
Query: 112 AGLRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGL 168
G+ A T ++ IL+D + ++ ++ +G ++ E K +R+ AD DL L
Sbjct: 128 VGVGNADASPTAALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMIL 187
Query: 169 EVLSPLCPQ-----LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNL-----SDLF 218
+ LSP+ P L AG+ G+A +++A+ + + F++ GNL +D
Sbjct: 188 DCLSPMIPAGAPRVTVLSTAGVLRALCGVAGGSSKAS---LSAHFSRWGNLAEAAITDFL 244
Query: 219 AKGEAISTLFNVVGLGAGIQLASTVC 244
+ +S ++ AG S +C
Sbjct: 245 STERFVSRDNHITHRNAGPAPQSQLC 270
>gi|215259607|gb|ACJ64295.1| conserved hypothetical protein [Culex tarsalis]
Length = 234
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 63/126 (50%), Gaps = 6/126 (4%)
Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLL- 257
ATR + A GNL+D+ +K A T N++ G+ L + V +++ G L
Sbjct: 3 ATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLGLLTHV---QNQRVLYGLFLF 59
Query: 258 -SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
+++H+Y+ I+ ++A + T N R + + ++ K+G + SP + E + G+ +
Sbjct: 60 VTLLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGSMLSPQQVNKLERVTI-GQTVSL 118
Query: 317 AGNVKM 322
VKM
Sbjct: 119 TARVKM 124
>gi|302417830|ref|XP_003006746.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
gi|261354348|gb|EEY16776.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
Length = 228
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/153 (23%), Positives = 71/153 (46%), Gaps = 13/153 (8%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVS 125
+ F P YP SV+ Y++Y + + Q F S +L+ +++L G+ ++
Sbjct: 79 DAFLPVDYPDSVSPDYIKYQMYDSFQAFFSTITGMLANRAILEGLGVGDASGSVNHALLL 138
Query: 126 WILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
LKDG+ + + + G+ +++ K++R LAD D LE+ +P
Sbjct: 139 TCLKDGVSRLAMITFAYRFGSVIEANAKKYRFLADLFNDTAFFLELANPF---------- 188
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDL 217
+AK + +V+A + R + A++ LS +
Sbjct: 189 FSGWAKILILVSAESLRALCGIALAQQAALSTI 221
>gi|332372572|gb|AEE61428.1| unknown [Dendroctonus ponderosae]
Length = 328
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 57/115 (49%), Gaps = 1/115 (0%)
Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIV 253
V ATR I A N++++ AK T+ N++G I L + SS +++
Sbjct: 37 VCYCGATRASITHHQAIRDNMAEISAKDGTQETIVNLIGSFGSIFLLNYFSSSRAEWILI 96
Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
L+ V+H+Y+ ++A NT N +R A+++ + G + +PA + E +L
Sbjct: 97 ISLM-VLHLYTNFLAVKALIFNTFNRERLALVLKSYFTIGSVLNPAKVNVNESVL 150
>gi|157874033|ref|XP_001685512.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68128584|emb|CAJ08716.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 468
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 56/255 (21%), Positives = 105/255 (41%), Gaps = 10/255 (3%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
P G+P + G+ R+ F S S + QSLL L +P W+LKD
Sbjct: 36 PKGFPDTTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLGSSPQL-----WMLKDL 90
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ + +N +S PK W +++ + +L ++L P + P L A + + K
Sbjct: 91 VPALFAAYMANKVVSYESRPKFWFVVSVFVNNLTVISDMLIPSVLPNHLLAAAIITSTVK 150
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
+ + TR +A NL++L K + + V GI L T +S +
Sbjct: 151 QSSALMYFVTRASALQHYAINNNLAELTKKFNSFGMVSYTVATALGI-LYCTYIASFTVQ 209
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV-KTGRISSPADLRYREDL-L 308
+I V +++ M LN +++ F+ + R+ +P ++ + +
Sbjct: 210 LITVAACCVTNMFLSSMSMMPITFRLLNVDTMKLLLGAFILEERRVMTPQEISTLLGVRM 269
Query: 309 FPGRLIEDAGNVKMD 323
P + ++ AG + MD
Sbjct: 270 LPTQALKKAG-IAMD 283
>gi|194383144|dbj|BAG59128.1| unnamed protein product [Homo sapiens]
Length = 156
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 4/107 (3%)
Query: 35 VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
++R + +P+G + +V P + F P G+P SV+ YL Y + ++Q
Sbjct: 44 LSRAFTVKPEGRDAGEVGASGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLI 139
F S+ L+TQ++L G+ A A +W++KD +G+++
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIV 148
>gi|146096657|ref|XP_001467882.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398020910|ref|XP_003863618.1| hypothetical protein, conserved [Leishmania donovani]
gi|134072248|emb|CAM70952.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322501851|emb|CBZ36933.1| hypothetical protein, conserved [Leishmania donovani]
Length = 468
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 57/248 (22%), Positives = 100/248 (40%), Gaps = 10/248 (4%)
Query: 58 VYHRVVDSFLNKFF--PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
V HR S + F P G+P + G+ R+ F S S + QSLL L
Sbjct: 20 VQHRETVSNRIRLFGMPKGFPDTTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLG 79
Query: 116 PTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-L 174
+P W+LKD + + +N +S PK W +++ + +L ++L P +
Sbjct: 80 SSPQL-----WMLKDLVPALFAAYMANKVVSYESRPKFWFVVSVFINNLTVISDMLIPSV 134
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
P L A + + K + + TR +A NL++L K + + V
Sbjct: 135 LPNHLLAAAIITSTVKQSSALMYFVTRASALQHYAINNNLAELTKKFNSFGMVSYTVATA 194
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV-KTG 293
GI L T +S ++I V +++ M LN +++ F+ +
Sbjct: 195 LGI-LYCTYIASFTVQLITVAACCVTNMFLSSMSMMPITFRLLNFDTMKLLLRAFILEKR 253
Query: 294 RISSPADL 301
RI +P ++
Sbjct: 254 RILTPQEI 261
>gi|215259609|gb|ACJ64296.1| conserved hypothetical protein [Culex tarsalis]
Length = 234
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 62/126 (49%), Gaps = 6/126 (4%)
Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLL- 257
ATR + A GNL+D+ +K A T N++ G+ L + V +++ G L
Sbjct: 3 ATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLGLLTHV---QNHRVLYGLFLF 59
Query: 258 -SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
+++H+Y I+ ++A + T N R + + ++ K+G + SP + E + G+ +
Sbjct: 60 LTLLHIYPNIKAVQAVCLRTFNEARYLIALEEYFKSGSMLSPQQVNKLERVTI-GQTVSL 118
Query: 317 AGNVKM 322
VKM
Sbjct: 119 TARVKM 124
>gi|401427109|ref|XP_003878038.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494285|emb|CBZ29584.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 468
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 62/274 (22%), Positives = 112/274 (40%), Gaps = 18/274 (6%)
Query: 58 VYHRVVDSFLNKFF--PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
V HR S + F P G+P + G+ R+ F S S + QSLL L
Sbjct: 20 VQHRETVSNRIRLFGMPKGFPETTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLG 79
Query: 116 PTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-L 174
+P W+LKD + + +N ++ PK W +++ + +L ++L P +
Sbjct: 80 SSPQL-----WMLKDLVPALFAAYMANKVVSYENRPKFWFVVSVFMSNLMVISDMLIPSV 134
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
P L A + + K + + TR +A NL++L K + + V
Sbjct: 135 LPNHLLAAAIITSTVKQSSALMYFVTRASALQHYAINNNLAELTKKFNSFGMVSYTVATA 194
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV-KTG 293
GI L T +S ++I V++++ M LN +++ F+ +
Sbjct: 195 LGI-LYCTYIASFTVQLITVAACCVMNMFLSSMSMMPITFRLLNFDTMKLLLRAFILEKR 253
Query: 294 RISSPADLRYREDLL----FPGRLIEDAGNVKMD 323
R+ +P ++ +LL P ++ AG + MD
Sbjct: 254 RVMTPQEI---SNLLGVRMLPTHALKKAG-IAMD 283
>gi|225560567|gb|EEH08848.1| DUF647 domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 472
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 44/194 (22%), Positives = 78/194 (40%), Gaps = 31/194 (15%)
Query: 138 LICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVA 196
L LG ++ E K +R+ AD L D + LSP P+ L + + + + + VA
Sbjct: 8 LFAHRLGTSLEPECKMYRLAADVLNDSALIFDCLSPALPKRLRVVVLAFSSVLRALCGVA 67
Query: 197 ARATRLPIYSSFAKEGNLSDL----------------------FAKGEAISTLFNVVGLG 234
A +++ + S FAK GNL +L F +A L
Sbjct: 68 AGSSKASLSSHFAKWGNLGELNAVRFPNVLVIAISGAAQRGFPFLFSDADQKLQKDSSQE 127
Query: 235 AGIQLASTVCSSMQGKMIVGPLLS--------VIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
I L +C S+ I P + ++H+ + +R+ + TLN QR ++
Sbjct: 128 TVISLLGMLCGSVVVSHISTPFATWTTLLLLLLVHLSTNYAAVRSVNMTTLNRQRANIVF 187
Query: 287 ADFVKTGRISSPAD 300
+ + G + +P +
Sbjct: 188 STLFEKGSVLTPTE 201
>gi|156088815|ref|XP_001611814.1| hypothetical protein [Babesia bovis T2Bo]
gi|154799068|gb|EDO08246.1| hypothetical protein BBOV_III006850 [Babesia bovis]
Length = 540
Score = 45.1 bits (105), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 88/193 (45%), Gaps = 19/193 (9%)
Query: 108 LLFAAGLRPTPAQATVVSWILKDGMQHVGKLIC-SNLGARMDSEPKRWRILADALYD--- 163
L +A + + A T+V LKD + + +GA D PK +R+L L
Sbjct: 42 LTWATSGKVSSALTTIV---LKDIFSRILNFVWFGQIGAGFDDNPKAFRLLGGLLCSAAG 98
Query: 164 LGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG---NLSDLFAK 220
+ + + L P++ L + L N K + V A++ +S+F + N+ ++ AK
Sbjct: 99 VACFIGNMMQLEPKVVLTV--LVNVIKHVGAVTMVASQATFHSTFCVKNAATNVGEITAK 156
Query: 221 GEAISTLFNVVGLGAGIQ---LASTVCSSMQGKMIVGPLL-SVIHVYSVIEEMRAAPVNT 276
E S + + +G+ AG L + + +SMQ + G L S I Y ++ T
Sbjct: 157 LETQSPICDFLGMAAGTSLGALTANMAASMQTVLFTGMCLTSNIATYMC---AKSVCFRT 213
Query: 277 LNPQRTAMIVADF 289
LNPQR +++ +F
Sbjct: 214 LNPQRCFVLLENF 226
>gi|399218598|emb|CCF75485.1| unnamed protein product [Babesia microti strain RI]
Length = 635
Score = 45.1 bits (105), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 44/197 (22%), Positives = 81/197 (41%), Gaps = 25/197 (12%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
++ FLN +PSG V Y +Y +R L+ F+ + +S L L +
Sbjct: 106 KIKKRFLNIVYPSG----VTSNYYKYANWRLLERFSHSMSQSISNNPLAAKILLNQSTYS 161
Query: 121 --------ATVVSWILKDGMQHVGKLICS---------NLGARMDSEPKRWRILADALYD 163
A + +I + M + K I S N+G D P+ + +
Sbjct: 162 TNDGVINIAYKIDFIATNLMTQIFKEIVSRIVKTMWVGNIGIGFDMNPRAFCFWGSLICT 221
Query: 164 LGTGLEVLSPLCPQLFLEMA-GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSD---LFA 219
+ + LS L ++ + + + I ++ A+ PIY SF + N+S+ + A
Sbjct: 222 ITNTADFLSTLFNFKYISITRHITTLFRQIGMLTTSASIGPIYGSFKSQNNVSNIGEITA 281
Query: 220 KGEAISTLFNVVGLGAG 236
K EA++ L +++G G
Sbjct: 282 KMEAVTPLCDLLGSATG 298
>gi|151555803|gb|AAI49327.1| FLJ13868 protein [Bos taurus]
Length = 158
Score = 44.7 bits (104), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 34/132 (25%), Positives = 60/132 (45%), Gaps = 12/132 (9%)
Query: 12 KKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVV--DDSRPVYHRVVDSFLNK 69
+ PD S+ + W+ S + T +P G V + P + R+ L
Sbjct: 29 RAAPDGSLQWEAWGRRWWGFSGAFTA----KPGGRDGGGGVAPGTASPPFSRL----LAV 80
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A +W+
Sbjct: 81 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWL 140
Query: 128 LKDGMQHVGKLI 139
+KD +G+++
Sbjct: 141 VKDSTGMLGRIV 152
>gi|159467581|ref|XP_001691970.1| hypothetical protein CHLREDRAFT_170864 [Chlamydomonas reinhardtii]
gi|158278697|gb|EDP04460.1| predicted protein [Chlamydomonas reinhardtii]
Length = 131
Score = 44.3 bits (103), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 35/62 (56%), Gaps = 1/62 (1%)
Query: 136 GKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAV 194
G L G+ D+ K+WR+ AD DLG +E+ SPL P+ LFL A G+ A+ + V
Sbjct: 7 GILFAYAEGSSFDACAKQWRLFADIANDLGMTVELASPLLPRALFLPCACFGSIARSVTV 66
Query: 195 VA 196
A
Sbjct: 67 FA 68
>gi|425769885|gb|EKV08364.1| hypothetical protein PDIP_68690 [Penicillium digitatum Pd1]
gi|425771531|gb|EKV09972.1| hypothetical protein PDIG_59240 [Penicillium digitatum PHI26]
Length = 406
Score = 44.3 bits (103), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 58/114 (50%), Gaps = 10/114 (8%)
Query: 139 IC--SNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-----LFLEMAGLGNFAKG 191
IC +G ++ E K +R+ AD D+ L+ LSP P + L AG+ +
Sbjct: 34 ICFAHRVGTALEPECKTYRLAADVFNDIAMILDCLSPGVPAGPARVIVLSTAGV---LRA 90
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
+ VA +++ + + FAK GNL++L AK + T+ ++ G+ G + S + S
Sbjct: 91 LCGVAGGSSKASLSAHFAKWGNLAELNAKDSSQETVISLFGMLVGSVVISHITS 144
>gi|397575094|gb|EJK49531.1| hypothetical protein THAOC_31586, partial [Thalassiosira oceanica]
Length = 194
Score = 44.3 bits (103), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 40/77 (51%), Gaps = 2/77 (2%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QAT 122
S ++ F P+ YP SV Y + + + +A VLSTQ+LL A G+ A A
Sbjct: 113 SIVHHFLPAKYPESVCSSYSTFASYCMCANIAGSAAMVLSTQALLVAVGVGSQSAAPMAA 172
Query: 123 VVSWILKDGMQHVGKLI 139
++W+LKD +G ++
Sbjct: 173 ALNWLLKDFAGQLGGVL 189
>gi|321459665|gb|EFX70716.1| hypothetical protein DAPPUDRAFT_327911 [Daphnia pulex]
Length = 269
Score = 43.9 bits (102), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 72/142 (50%), Gaps = 29/142 (20%)
Query: 89 FRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDG-MQHVGKLI---CS 141
A+Q F + LST+++L G+ TP AT++ W++KDG + VG+++ C+
Sbjct: 9 IHAIQAFVGSLSGSLSTRAILEGVGVGSSTSTPLAATLM-WLIKDGSTRMVGRILFAWCN 67
Query: 142 NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATR 201
G ++D++ G+ + S LC +AG+ AK I VA ATR
Sbjct: 68 --GTKLDADTAVG----------GSNGTMTSILC------VAGV---AKSIVGVAGGATR 106
Query: 202 LPIYSSFAKEGNLSDLFAKGEA 223
+ A++GNL+D+ AK EA
Sbjct: 107 AALTQHQARQGNLADVSAKVEA 128
>gi|3242710|gb|AAC23762.1| hypothetical protein [Arabidopsis thaliana]
Length = 405
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 5/83 (6%)
Query: 52 VDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFA 111
+DD R+ S FF P V++ Y+ Y +++ L S+AL VL+TQ++ A
Sbjct: 104 LDDGFWKLIRLTSSAAKDFF---LPKQVSDNYISYVKWKFLHRVFSSALQVLATQAMFRA 160
Query: 112 AGLRPTP--AQATVVSWILKDGM 132
G+ + A + +WILKDG+
Sbjct: 161 IGIGQSRSLASSAAFNWILKDGL 183
>gi|440799746|gb|ELR20790.1| protein C16orf58 family protein [Acanthamoeba castellanii str.
Neff]
Length = 101
Score = 43.5 bits (101), Expect = 0.14, Method: Composition-based stats.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 2/71 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P GYP SV+ Y Y + +Q S+ +L T++LL G+ A A V W
Sbjct: 21 LLPEGYPESVSGDYATYQTWDTVQALCSSITGLLCTRALLQGVGVGDVGATPGAAAVQWA 80
Query: 128 LKDGMQHVGKL 138
L+DG VG++
Sbjct: 81 LRDGAGMVGRI 91
>gi|224044670|ref|XP_002188933.1| PREDICTED: condensin-2 complex subunit G2 [Taeniopygia guttata]
Length = 1150
Score = 41.2 bits (95), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 24/43 (55%)
Query: 37 RHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSV 79
+ + P HL ++ DSRPV R+V+ LN FFPS P V
Sbjct: 494 KFWNISPMEHLLSRLESDSRPVSRRIVNLLLNSFFPSAQPEDV 536
>gi|268318089|ref|YP_003291808.1| polynucleotide adenylyltransferase/metal dependent phosphohydrolase
[Rhodothermus marinus DSM 4252]
gi|262335623|gb|ACY49420.1| polynucleotide adenylyltransferase/metal dependent phosphohydrolase
[Rhodothermus marinus DSM 4252]
Length = 561
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 50/115 (43%), Gaps = 22/115 (19%)
Query: 202 LPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIH 261
+P+Y A G + DLF F VG G GI+LA V ++ G+ +H
Sbjct: 46 IPVY---AVGGVVRDLFLDRPTTDIDFVTVGPGTGIRLARLVARALGGR--------TVH 94
Query: 262 VYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
VY E A + P R+ + V +FV R S YR+D P ++ED
Sbjct: 95 VY---ENFGTAAIRVPTPDRSGVFVLEFVAARRES------YRKDSRKP--IVED 138
>gi|345304379|ref|YP_004826281.1| polynucleotide adenylyltransferase/metal dependent phosphohydrolase
[Rhodothermus marinus SG0.5JP17-172]
gi|345113612|gb|AEN74444.1| polynucleotide adenylyltransferase/metal dependent phosphohydrolase
[Rhodothermus marinus SG0.5JP17-172]
Length = 552
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 50/115 (43%), Gaps = 22/115 (19%)
Query: 202 LPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIH 261
+P+Y A G + DLF F VG G GI+LA V ++ G+ +H
Sbjct: 37 IPVY---AVGGVVRDLFLDRPTTDIDFVTVGPGTGIRLARLVARALGGR--------TVH 85
Query: 262 VYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
VY E A + P R+ + V +FV R S YR+D P ++ED
Sbjct: 86 VY---ENFGTAAIRVPTPDRSGVFVLEFVAARRES------YRKDSRKP--IVED 129
>gi|449682081|ref|XP_002168725.2| PREDICTED: UPF0420 protein C16orf58 homolog [Hydra magnipapillata]
Length = 237
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 25/111 (22%), Positives = 53/111 (47%)
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
I VA ATR + A+ N++D+ AK + TL N++ L G + V ++
Sbjct: 7 IVGVAGGATRAALTLHQARRNNMADVSAKDGSQETLVNLMALFTGFLITPLVVDNLFLTW 66
Query: 252 IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
+ + +H+Y+ + + + T+N R ++V + G++ +P ++
Sbjct: 67 YFFLVFTFLHLYANYQAVSTVVMETINLPRLHILVKHYFIYGKVLTPKEVN 117
>gi|402580829|gb|EJW74778.1| hypothetical protein WUBG_14315 [Wuchereria bancrofti]
Length = 191
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 54/115 (46%)
Query: 195 VAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVG 254
VA ATR + + A N+ D+ AK + TL NV L + L V + ++
Sbjct: 8 VAGSATRTTVINHQAILDNVGDVAAKDGSQETLINVFALLCSLLLLPVVSKNAVFVWLLF 67
Query: 255 PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF 309
L + IH+Y ++ TLN ++V D+++T +I + ++ +E +L
Sbjct: 68 CLFTFIHLYGNYRAVKMLQFRTLNQSLLRIVVKDYIQTRKIGTVNEMNNKEPILL 122
>gi|154343241|ref|XP_001567566.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134064898|emb|CAM43006.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 439
Score = 38.9 bits (89), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 39/179 (21%), Positives = 72/179 (40%), Gaps = 7/179 (3%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
P G+P + G+ R+ F S S + QSLL L +P W+LKD
Sbjct: 7 PKGFPDTTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLGSSPQL-----WMLKDL 61
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ +N ++ PK W +++ + ++ ++L P + P L A + + K
Sbjct: 62 APALLAAYMANQVVSYENRPKFWFVVSVLMNNIMVMSDMLIPSVVPNHLLAAAIVTSTVK 121
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS-SMQ 248
+ + R +A NL++L K + + + GI + + S +MQ
Sbjct: 122 QSSALMYFVARASTLQHYAINNNLAELTKKFNSFGMVSYTIATALGILYCTYIASFTMQ 180
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.136 0.400
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,087,759,653
Number of Sequences: 23463169
Number of extensions: 208622883
Number of successful extensions: 486737
Number of sequences better than 100.0: 502
Number of HSP's better than 100.0 without gapping: 392
Number of HSP's successfully gapped in prelim test: 110
Number of HSP's that attempted gapping in prelim test: 485484
Number of HSP's gapped (non-prelim): 535
length of query: 333
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 190
effective length of database: 9,003,962,200
effective search space: 1710752818000
effective search space used: 1710752818000
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 77 (34.3 bits)