BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 013984
(432 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224082810|ref|XP_002306848.1| predicted protein [Populus trichocarpa]
gi|222856297|gb|EEE93844.1| predicted protein [Populus trichocarpa]
Length = 428
Score = 754 bits (1947), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/432 (83%), Positives = 393/432 (90%), Gaps = 4/432 (0%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
MNLL+KIK QKK EPDK+ EIP+YW ETSDSV+RH+QF+PDG LS+KVVDD+RPVY
Sbjct: 1 MNLLDKIKMQKK-EPDKT---PEIPVYWIETSDSVSRHFQFEPDGQLSMKVVDDARPVYR 56
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
RVV+SFLNKFFPSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQ
Sbjct: 57 RVVESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQ 116
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
AT VSWILKDGMQH GKLICSNLGARMDSEPKRWRILAD LYDLGTGLEVLSPLCP LFL
Sbjct: 117 ATAVSWILKDGMQHAGKLICSNLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCPHLFL 176
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
E+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+GLG GIQLA
Sbjct: 177 EVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVLGLGVGIQLA 236
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
STVCSSMQGK + GPLLS++HV VIEEMRA PVNTLNPQRTAM+VADFVKTG+ISSPAD
Sbjct: 237 STVCSSMQGKFVAGPLLSIVHVCCVIEEMRATPVNTLNPQRTAMVVADFVKTGKISSPAD 296
Query: 301 LRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVL 360
LRY EDLLFPGRLIE+AGNVKVG+ H+ V+PSK ELKE+F EKFIL G++WTD+VL
Sbjct: 297 LRYHEDLLFPGRLIENAGNVKVGQALHRAVRPSKLRELKEIFPGEKFILSPGNKWTDLVL 356
Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
EQNASGEDALR WLVAAYA+SM KS H+ + LQDAY+KMN VF P LSELQAKGWHTD
Sbjct: 357 EQNASGEDALRAWLVAAYASSMKKSSHESTSVTLQDAYEKMNSVFDPFLSELQAKGWHTD 416
Query: 421 RFLDGTGTRFAW 432
RFLDGTG+RF+W
Sbjct: 417 RFLDGTGSRFSW 428
>gi|297744143|emb|CBI37113.3| unnamed protein product [Vitis vinifera]
Length = 424
Score = 747 bits (1929), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/426 (83%), Positives = 393/426 (92%), Gaps = 2/426 (0%)
Query: 7 IKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSF 66
+KK+ KKE +++ V + ++W ETSDSV+RHYQF +G LS+KVVDDSR V HRVV+SF
Sbjct: 1 MKKEMKKEQERA--VAQGAVFWIETSDSVSRHYQFHSNGDLSMKVVDDSRTVVHRVVESF 58
Query: 67 LNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSW 126
LNKFFPSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQAT VSW
Sbjct: 59 LNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQATAVSW 118
Query: 127 ILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+LKDGMQHVGKLICSNLGARMDSEPK WRILADALYDLGTGLEVLSPLCP LFLEMAGLG
Sbjct: 119 VLKDGMQHVGKLICSNLGARMDSEPKLWRILADALYDLGTGLEVLSPLCPHLFLEMAGLG 178
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
NFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG+GAGIQLAST+CSS
Sbjct: 179 NFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGMGAGIQLASTICSS 238
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
MQGKMI GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF+KTG+ISSPADLRY+ED
Sbjct: 239 MQGKMIAGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFIKTGKISSPADLRYQED 298
Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASG 366
L+F GRLIEDAGNVKVGRD HKVVKPSK +LK +F EEKF+LC G +WTDMVLE +A+G
Sbjct: 299 LIFTGRLIEDAGNVKVGRDLHKVVKPSKLCKLKGVFPEEKFLLCHGKKWTDMVLEHDATG 358
Query: 367 EDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGT 426
EDALRGWLVAAY A++ K+ H+PS+TVLQ+AYDKMN VF+ +S+LQAKGWHTD FLDGT
Sbjct: 359 EDALRGWLVAAYVANLEKTDHEPSMTVLQEAYDKMNSVFSTFVSKLQAKGWHTDCFLDGT 418
Query: 427 GTRFAW 432
G+RFAW
Sbjct: 419 GSRFAW 424
>gi|225438149|ref|XP_002273202.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
Length = 420
Score = 742 bits (1916), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/421 (83%), Positives = 389/421 (92%), Gaps = 2/421 (0%)
Query: 12 KKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFF 71
KKE +++ V + ++W ETSDSV+RHYQF +G LS+KVVDDSR V HRVV+SFLNKFF
Sbjct: 2 KKEQERA--VAQGAVFWIETSDSVSRHYQFHSNGDLSMKVVDDSRTVVHRVVESFLNKFF 59
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
PSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQAT VSW+LKDG
Sbjct: 60 PSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQATAVSWVLKDG 119
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
MQHVGKLICSNLGARMDSEPK WRILADALYDLGTGLEVLSPLCP LFLEMAGLGNFAKG
Sbjct: 120 MQHVGKLICSNLGARMDSEPKLWRILADALYDLGTGLEVLSPLCPHLFLEMAGLGNFAKG 179
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG+GAGIQLAST+CSSMQGKM
Sbjct: 180 MAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGMGAGIQLASTICSSMQGKM 239
Query: 252 IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPG 311
I GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF+KTG+ISSPADLRY+EDL+F G
Sbjct: 240 IAGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFIKTGKISSPADLRYQEDLIFTG 299
Query: 312 RLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
RLIEDAGNVKVGRD HKVVKPSK +LK +F EEKF+LC G +WTDMVLE +A+GEDALR
Sbjct: 300 RLIEDAGNVKVGRDLHKVVKPSKLCKLKGVFPEEKFLLCHGKKWTDMVLEHDATGEDALR 359
Query: 372 GWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTRFA 431
GWLVAAY A++ K+ H+PS+TVLQ+AYDKMN VF+ +S+LQAKGWHTD FLDGTG+RFA
Sbjct: 360 GWLVAAYVANLEKTDHEPSMTVLQEAYDKMNSVFSTFVSKLQAKGWHTDCFLDGTGSRFA 419
Query: 432 W 432
W
Sbjct: 420 W 420
>gi|255567274|ref|XP_002524618.1| conserved hypothetical protein [Ricinus communis]
gi|223536171|gb|EEF37826.1| conserved hypothetical protein [Ricinus communis]
Length = 436
Score = 740 bits (1911), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/437 (81%), Positives = 391/437 (89%), Gaps = 6/437 (1%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIP-----IYWFETSDSVTRHYQFQPDGHLSVKVVDDS 55
MNLL IK QKK EPDK+ + P ++W ETSDSV+RH+QFQPDG +S+KV+DDS
Sbjct: 1 MNLLNTIKMQKK-EPDKAPLLPPPPLPELPVHWIETSDSVSRHFQFQPDGRISMKVIDDS 59
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
R ++HRVV+SF NKFFPSGYP+SVNEGYLRYTQFRALQH +SAALSVLSTQSLLFAAGLR
Sbjct: 60 RSMFHRVVESFQNKFFPSGYPYSVNEGYLRYTQFRALQHLSSAALSVLSTQSLLFAAGLR 119
Query: 116 PTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
PTPAQAT VSW+LKDGMQHVGKLICSNLGARMDSEPKRWRILAD LYDLGTGLEVLSPLC
Sbjct: 120 PTPAQATAVSWVLKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLEVLSPLC 179
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
P LFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG
Sbjct: 180 PHLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGV 239
Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
GIQLASTVCSS+QGK++VGPLLS++HV+SV+EEMRAAPVNTLNPQRTAMIVAD+VK G+I
Sbjct: 240 GIQLASTVCSSIQGKLVVGPLLSILHVFSVVEEMRAAPVNTLNPQRTAMIVADYVKAGKI 299
Query: 296 SSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRW 355
SSPADLRYREDL+FPGRLIEDAGNVKVGR HK PSK HELK++F EEKF+L G+ W
Sbjct: 300 SSPADLRYREDLIFPGRLIEDAGNVKVGRALHKAFTPSKVHELKDIFPEEKFLLNRGNTW 359
Query: 356 TDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAK 415
TDMVLEQNASGEDALRGWLVAAYA SM KS H + +VLQDAY+KMN F LSELQAK
Sbjct: 360 TDMVLEQNASGEDALRGWLVAAYATSMEKSSHMCTPSVLQDAYEKMNSTFDSFLSELQAK 419
Query: 416 GWHTDRFLDGTGTRFAW 432
GWHTDRFLDG G+RFAW
Sbjct: 420 GWHTDRFLDGVGSRFAW 436
>gi|334184613|ref|NP_001189649.1| uncharacterized protein [Arabidopsis thaliana]
gi|330253412|gb|AEC08506.1| uncharacterized protein [Arabidopsis thaliana]
Length = 432
Score = 721 bits (1861), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/432 (77%), Positives = 385/432 (89%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
M LEK+K KK++P S ++ P+YWFETSDSV+ YQFQ DGHLS+KVVDD+RPV
Sbjct: 1 MQFLEKVKLIKKEDPVMLKSPEDFPVYWFETSDSVSHRYQFQSDGHLSMKVVDDARPVPQ 60
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
++V+SFLNKFFPSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQ
Sbjct: 61 KMVESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQ 120
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILAD LYDLGTGLE++SPLCP LFL
Sbjct: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFL 180
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
EMAGLGNFAKG+A VAARATRLPIYSSFAKEGNLSD+FAKGEAISTLFNV G+GAGIQLA
Sbjct: 181 EMAGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLA 240
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
ST+CSSM+GK++VG +LSV+HVYSV+E+MR P+NTLNPQRTA+IVA+F+KTG++ SP D
Sbjct: 241 STICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGKVPSPPD 300
Query: 301 LRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVL 360
LR++EDL+FP R I+DAGNVKVGR HK VKPS+ LK++F EEKF+L G WTDMVL
Sbjct: 301 LRFQEDLMFPERPIQDAGNVKVGRALHKAVKPSEVQRLKQVFVEEKFLLSHGKSWTDMVL 360
Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
E +A+GEDALRGWLVAAY SM K ++DP +LQDAYDKMNDVF P LS++QAKGW+TD
Sbjct: 361 EHDATGEDALRGWLVAAYVKSMTKIYNDPDDIILQDAYDKMNDVFNPFLSQVQAKGWYTD 420
Query: 421 RFLDGTGTRFAW 432
RFLDGTGTRFAW
Sbjct: 421 RFLDGTGTRFAW 432
>gi|18402585|ref|NP_565718.1| uncharacterized protein [Arabidopsis thaliana]
gi|20197897|gb|AAD20664.2| expressed protein [Arabidopsis thaliana]
gi|21553562|gb|AAM62655.1| unknown [Arabidopsis thaliana]
gi|109134157|gb|ABG25076.1| At2g31190 [Arabidopsis thaliana]
gi|330253411|gb|AEC08505.1| uncharacterized protein [Arabidopsis thaliana]
Length = 433
Score = 718 bits (1854), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/430 (77%), Positives = 384/430 (89%)
Query: 3 LLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRV 62
L EK+K KK++P S ++ P+YWFETSDSV+ YQFQ DGHLS+KVVDD+RPV ++
Sbjct: 4 LQEKVKLIKKEDPVMLKSPEDFPVYWFETSDSVSHRYQFQSDGHLSMKVVDDARPVPQKM 63
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
V+SFLNKFFPSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQAT
Sbjct: 64 VESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQAT 123
Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILAD LYDLGTGLE++SPLCP LFLEM
Sbjct: 124 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFLEM 183
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
AGLGNFAKG+A VAARATRLPIYSSFAKEGNLSD+FAKGEAISTLFNV G+GAGIQLAST
Sbjct: 184 AGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLAST 243
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
+CSSM+GK++VG +LSV+HVYSV+E+MR P+NTLNPQRTA+IVA+F+KTG++ SP DLR
Sbjct: 244 ICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGKVPSPPDLR 303
Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQ 362
++EDL+FP R I+DAGNVKVGR HK VKPS+ LK++F EEKF+L G WTDMVLE
Sbjct: 304 FQEDLMFPERPIQDAGNVKVGRALHKAVKPSEVQRLKQVFVEEKFLLSHGKSWTDMVLEH 363
Query: 363 NASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRF 422
+A+GEDALRGWLVAAY SM K ++DP +LQDAYDKMNDVF P LS++QAKGW+TDRF
Sbjct: 364 DATGEDALRGWLVAAYVKSMTKIYNDPDDIILQDAYDKMNDVFNPFLSQVQAKGWYTDRF 423
Query: 423 LDGTGTRFAW 432
LDGTGTRFAW
Sbjct: 424 LDGTGTRFAW 433
>gi|449462449|ref|XP_004148953.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 428
Score = 708 bits (1827), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/432 (76%), Positives = 381/432 (88%), Gaps = 4/432 (0%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
M+LL K + K+P+KS S +P+ W E SDSV+R QFQPDGHLSVK++DDSRP
Sbjct: 1 MDLLNKFSA-RNKDPEKSPS---LPVSWIEVSDSVSRRCQFQPDGHLSVKIIDDSRPAIQ 56
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
R+VDSFLN FFPSGYP+SV+EGYLRYTQFRALQH TSAALSVLSTQSLLFAAGLRPT AQ
Sbjct: 57 RIVDSFLNTFFPSGYPYSVSEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQ 116
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
ATVVSW+LKDGMQHVGKLICSNLGARMDSEPKRWR++AD LYDLG GLEV+SPLCP LFL
Sbjct: 117 ATVVSWVLKDGMQHVGKLICSNLGARMDSEPKRWRVIADVLYDLGAGLEVISPLCPHLFL 176
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
EMAGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG+GAG+QLA
Sbjct: 177 EMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLA 236
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
ST+CSS+QGK++ PLLS++HVY V+E+MRA P+NTLNPQRTAMIVADFVK GRI SPAD
Sbjct: 237 STICSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKAGRIPSPAD 296
Query: 301 LRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVL 360
+RY+EDL+FPGRLIE+AGNVKVGR H+V+KPSK E+K++F EKF+L +W DMVL
Sbjct: 297 IRYQEDLVFPGRLIEEAGNVKVGRALHEVIKPSKLVEMKQIFPGEKFVLNQSKKWVDMVL 356
Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
E +ASGEDALRGWLVAAY ++ + H+P+ +VL +AY+KMNDVFTP LSELQAKGW+TD
Sbjct: 357 EHDASGEDALRGWLVAAYTTNIKEPSHEPTASVLLEAYEKMNDVFTPFLSELQAKGWYTD 416
Query: 421 RFLDGTGTRFAW 432
RFLDG G+RFAW
Sbjct: 417 RFLDGAGSRFAW 428
>gi|449502048|ref|XP_004161529.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 428
Score = 705 bits (1820), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 330/432 (76%), Positives = 379/432 (87%), Gaps = 4/432 (0%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
M+LL K + K+P+KS S +P+ W E SDSV+R QFQPDGHLSVK++DDSRP
Sbjct: 1 MDLLNKFSA-RNKDPEKSPS---LPVSWIEVSDSVSRRCQFQPDGHLSVKIIDDSRPAIQ 56
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
R+VDSFLN FFPSGYP+SV+EGYLRYTQFRALQH TSAALSVLSTQSLLFAAGLRPT AQ
Sbjct: 57 RIVDSFLNTFFPSGYPYSVSEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQ 116
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
ATVVSW+LKDGMQHVGKLICSNLGARMDSEPKRWR++AD LYDLG GLEV+SPLCP LFL
Sbjct: 117 ATVVSWVLKDGMQHVGKLICSNLGARMDSEPKRWRVIADVLYDLGAGLEVISPLCPHLFL 176
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
EMAGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG+GAG+QLA
Sbjct: 177 EMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLA 236
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
ST+CSS+QGK++ PLLS++HVY V+E+MRA P+NTLNPQRTAMIVADFVK GRI SPAD
Sbjct: 237 STICSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKAGRIPSPAD 296
Query: 301 LRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVL 360
+RY+EDL+FPGRLIE+AGNVKVGR H+V+KPSK E+K++F EKF+L +W DMVL
Sbjct: 297 IRYQEDLVFPGRLIEEAGNVKVGRALHEVIKPSKLVEMKQIFPGEKFVLNQSKKWVDMVL 356
Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
E +ASGEDALRGWLVAAY ++ H+P+ +VL +AY+KMND FTP LSELQAKGW+TD
Sbjct: 357 EHDASGEDALRGWLVAAYTTNIKGPSHEPTASVLLEAYEKMNDAFTPFLSELQAKGWYTD 416
Query: 421 RFLDGTGTRFAW 432
RFLDG G+RFAW
Sbjct: 417 RFLDGAGSRFAW 428
>gi|297826561|ref|XP_002881163.1| hypothetical protein ARALYDRAFT_482047 [Arabidopsis lyrata subsp.
lyrata]
gi|297327002|gb|EFH57422.1| hypothetical protein ARALYDRAFT_482047 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 682 bits (1761), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/432 (75%), Positives = 372/432 (86%), Gaps = 13/432 (3%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
M LEK+K KK+EP + ++ P+YWFETSDSV+ YQFQ DGHLS+KVVDD+RPV
Sbjct: 1 MQFLEKVKLIKKEEPLMVKNPEDFPVYWFETSDSVSHRYQFQSDGHLSMKVVDDARPVPQ 60
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
++V+SFLNKFFPSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQ
Sbjct: 61 KMVESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQ 120
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILAD LYDLGTGLE++SPLCP LFL
Sbjct: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFL 180
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
EMAGLGNFAKG+A VAARATRLPIYSSFAKEGNLSD+FAKGEAISTLFNV G+GAGIQLA
Sbjct: 181 EMAGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLA 240
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
ST+CSSM+GK++VG +LSV+HVYSV+E+MR P+NTLNPQRTA+IVA+++K I
Sbjct: 241 STICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANYLKVVSI----- 295
Query: 301 LRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVL 360
+F RLI+DAGNVKVGR HK VKPS+ LK++ EEKF+L G WTD+VL
Sbjct: 296 -------IFK-RLIQDAGNVKVGRALHKAVKPSEVQRLKQVSVEEKFLLSHGKSWTDVVL 347
Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
E +A+GEDALRGWLVAAYA SM K ++DP +LQDAYDKMNDVF P LS++QAKGW+TD
Sbjct: 348 EHDATGEDALRGWLVAAYAKSMTKIYNDPDDIILQDAYDKMNDVFNPFLSQVQAKGWYTD 407
Query: 421 RFLDGTGTRFAW 432
RFLDGTGTRFAW
Sbjct: 408 RFLDGTGTRFAW 419
>gi|356545865|ref|XP_003541354.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 431
Score = 680 bits (1755), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/429 (75%), Positives = 370/429 (86%), Gaps = 3/429 (0%)
Query: 3 LLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRV 62
LEK+K KKK + + P+ WFETS +V+R +QF+P+G LSV +V DSRP+YHR+
Sbjct: 5 FLEKLKIWKKKAEQEE---EGPPLVWFETSATVSRRFQFEPNGQLSVTLVGDSRPLYHRI 61
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
VDSFLNKFFPSGYP+SVNEGYLRYTQFRALQH SAALSVLSTQSLLFAAGLRPTPAQAT
Sbjct: 62 VDSFLNKFFPSGYPYSVNEGYLRYTQFRALQHTASAALSVLSTQSLLFAAGLRPTPAQAT 121
Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
VSW+LKDGMQH+G +ICS LGARMDSEPKRWRILAD LYDLGTGLEVLSPLCPQ FLEM
Sbjct: 122 AVSWVLKDGMQHLGNIICSKLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCPQFFLEM 181
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
AG+GN AKG+++VA+RATRLPIYSSFAKEGNLSDLFA+GEAISTLFNVVG+G GIQLAST
Sbjct: 182 AGIGNLAKGMSIVASRATRLPIYSSFAKEGNLSDLFARGEAISTLFNVVGIGIGIQLAST 241
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
VC+SMQGK++VGPLLS+IH+YSV EEMRA PVNTLNPQRTAMIVADF+K G +SSPADLR
Sbjct: 242 VCASMQGKLVVGPLLSIIHIYSVSEEMRATPVNTLNPQRTAMIVADFLKAGSVSSPADLR 301
Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQ 362
YREDLLFP RLIEDAGNV+VGRD HKV+KPS+ E K++F EKFIL +R DMVLEQ
Sbjct: 302 YREDLLFPRRLIEDAGNVRVGRDLHKVIKPSRLLESKQVFPGEKFILNGDNRCIDMVLEQ 361
Query: 363 NASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRF 422
+A G+DALRGWLVA+YA + KS H+ S + L AY+KMN+VF + ELQ KGWHTDRF
Sbjct: 362 DAIGKDALRGWLVASYAVQIGKSSHELSTSTLLQAYEKMNEVFPAFIKELQCKGWHTDRF 421
Query: 423 LDGTGTRFA 431
LDG+G RFA
Sbjct: 422 LDGSGCRFA 430
>gi|356505015|ref|XP_003521288.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 415
Score = 677 bits (1748), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/415 (77%), Positives = 364/415 (87%), Gaps = 1/415 (0%)
Query: 19 MSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFS 78
M E P+ WFETSDSV R +QF+PDG LSV +VDDSRP+Y RV SF+NKFFPSGYP+S
Sbjct: 1 MKEGEAPVLWFETSDSVCRRHQFEPDGQLSVVIVDDSRPLYQRVAGSFMNKFFPSGYPYS 60
Query: 79 VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKL 138
VNEGYLRYTQFRA+QH TSAALSVLSTQSLLFAAGLRPTPAQAT VSWILKDGMQHVGKL
Sbjct: 61 VNEGYLRYTQFRAVQHVTSAALSVLSTQSLLFAAGLRPTPAQATAVSWILKDGMQHVGKL 120
Query: 139 ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAAR 198
ICSN GARMDSEPKRWR+LADALYD+GTGLEVLSPLCP LFLEMAGLGNFAKG+AVVAAR
Sbjct: 121 ICSNWGARMDSEPKRWRLLADALYDIGTGLEVLSPLCPHLFLEMAGLGNFAKGMAVVAAR 180
Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLS 258
ATRLPIYSSFAKEGNLSDLFAKGEA STLFNV+G+G GIQLAST+C+SMQGK++ GPLLS
Sbjct: 181 ATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLASTICASMQGKLVAGPLLS 240
Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAG 318
+IH+YSV EEMRA P+NTLNP+RTAM+VADF+K G +SSPADLRYRE+LLF + EDAG
Sbjct: 241 IIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAGIVSSPADLRYRENLLFNVHVKEDAG 300
Query: 319 NVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAY 378
NV+VG+D HKV+KPS+ ELK++F EEKF+L G++ DMVLEQ+ASGEDALRGWLVAAY
Sbjct: 301 NVRVGKDVHKVIKPSRLLELKQVFPEEKFLLNFGNKCIDMVLEQDASGEDALRGWLVAAY 360
Query: 379 AASMAKSFHDPSLT-VLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTRFAW 432
AA H+ S T VL +AY+KMN VF L ELQ KGWHTDRFLDGTG+RFA+
Sbjct: 361 AAQTHSFSHELSATSVLHEAYEKMNGVFPVFLRELQNKGWHTDRFLDGTGSRFAF 415
>gi|356572258|ref|XP_003554286.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 419
Score = 666 bits (1719), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/411 (76%), Positives = 361/411 (87%), Gaps = 1/411 (0%)
Query: 23 EIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEG 82
E P+ WFETSDSV +QF+PDG LSV +VDDSRP+Y R+ SF+NKFFPSGYP+SVNEG
Sbjct: 9 EAPVLWFETSDSVCCQHQFEPDGRLSVVIVDDSRPLYQRMAGSFMNKFFPSGYPYSVNEG 68
Query: 83 YLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSN 142
YLRYTQFRALQH TSAALSVLSTQSLLFAAGLRPTPAQAT VSWILKDGMQHVGKLICSN
Sbjct: 69 YLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTPAQATAVSWILKDGMQHVGKLICSN 128
Query: 143 LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRL 202
GARMDSEPKRWR+LAD LYD+G GLEVLSPLCP LFLEMAGLGNFAKG+AVVAARATRL
Sbjct: 129 WGARMDSEPKRWRLLADVLYDIGIGLEVLSPLCPHLFLEMAGLGNFAKGMAVVAARATRL 188
Query: 203 PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHV 262
PIYSSFAKEGNLSDLFAKGEA STLFNVVG+G GIQLAST+C+S+QGK++ GPLLS+IH+
Sbjct: 189 PIYSSFAKEGNLSDLFAKGEAFSTLFNVVGIGVGIQLASTICASIQGKLVAGPLLSIIHL 248
Query: 263 YSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKV 322
YSV EEMRA P+NTLNP+RTAM+VADF+K G +SSPADLRYR++LLF ++ ED GNV+V
Sbjct: 249 YSVSEEMRATPINTLNPRRTAMVVADFLKAGIVSSPADLRYRDNLLFNVQVKEDTGNVRV 308
Query: 323 GRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASM 382
G++ HKV+KPS+ ELK++F EEKF+L G++ DMVLEQ+ASGEDALRGWLVAAYAA
Sbjct: 309 GKNVHKVIKPSRLLELKQVFPEEKFLLNFGNKCIDMVLEQDASGEDALRGWLVAAYAART 368
Query: 383 AKSFHDPSLT-VLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTRFAW 432
S H+ S T VL +AY+KMN VF L ELQ KGWHTDRFLDGTG+RF++
Sbjct: 369 ESSSHELSATSVLHEAYEKMNGVFPVFLRELQNKGWHTDRFLDGTGSRFSF 419
>gi|388508448|gb|AFK42290.1| unknown [Lotus japonicus]
Length = 414
Score = 660 bits (1702), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/414 (75%), Positives = 357/414 (86%)
Query: 19 MSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFS 78
M E P++WFETS SV + FQ +G LSV +VDDSRP+Y +V SF+NKFFPSGYP+S
Sbjct: 1 MEKGEGPVFWFETSGSVCHRHHFQSNGQLSVMLVDDSRPLYQKVAGSFMNKFFPSGYPYS 60
Query: 79 VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKL 138
VNEGYLRYTQFR++QH TSAALSVLSTQSLLFAAGLRPTPAQAT VSWILKDGMQHVGKL
Sbjct: 61 VNEGYLRYTQFRSVQHVTSAALSVLSTQSLLFAAGLRPTPAQATAVSWILKDGMQHVGKL 120
Query: 139 ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAAR 198
ICSN GARMDSEPKRWR++AD LYDLGTGLEVLSPLCP LFL+MAGLGNFAKG+AVVAAR
Sbjct: 121 ICSNWGARMDSEPKRWRLIADVLYDLGTGLEVLSPLCPHLFLQMAGLGNFAKGMAVVAAR 180
Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLS 258
ATRLPIYSSFAKEGNLSDLFAKGEA STL NV+G+G GIQLAST+C+S+QGKM+VGPLLS
Sbjct: 181 ATRLPIYSSFAKEGNLSDLFAKGEAFSTLCNVIGIGVGIQLASTICASVQGKMVVGPLLS 240
Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAG 318
++H+YSV EEMRA P+NTLNP+RTAMIV DF+K G +SSPADLRYREDLLF RL EDAG
Sbjct: 241 ILHIYSVSEEMRATPINTLNPRRTAMIVTDFLKAGVVSSPADLRYREDLLFRVRLTEDAG 300
Query: 319 NVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAY 378
NV+VGR H+V+KPS+ EL+++ EKF+L G + DMVLEQ+ASGEDALRGWLVAAY
Sbjct: 301 NVRVGRALHEVIKPSRLLELEQVLPGEKFLLNRGGKCVDMVLEQDASGEDALRGWLVAAY 360
Query: 379 AASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTRFAW 432
AA + S H+ S +VL +AY KM VF L ELQ+KGWHTDRFLDGTG+RFA+
Sbjct: 361 AAQIENSSHELSASVLHEAYKKMTGVFPVFLKELQSKGWHTDRFLDGTGSRFAF 414
>gi|242076506|ref|XP_002448189.1| hypothetical protein SORBIDRAFT_06g022670 [Sorghum bicolor]
gi|241939372|gb|EES12517.1| hypothetical protein SORBIDRAFT_06g022670 [Sorghum bicolor]
Length = 439
Score = 613 bits (1580), Expect = e-173, Method: Compositional matrix adjust.
Identities = 298/440 (67%), Positives = 358/440 (81%), Gaps = 13/440 (2%)
Query: 1 MNLLEKIK----KQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQP----DGHLSVKVV 52
MN+L +I + EP P+ W E S+SV+R F G +SVKV+
Sbjct: 1 MNILARISGGGDRAAATEPPPPQ-----PMCWVEISESVSRLCSFDAAGSGGGSISVKVI 55
Query: 53 DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
D+RP++ +VVDSFLNKFFPSGYP+SVNEGYL YT+FRALQHF+SA L VLSTQSLLFAA
Sbjct: 56 QDARPIHDKVVDSFLNKFFPSGYPYSVNEGYLTYTRFRALQHFSSAMLHVLSTQSLLFAA 115
Query: 113 GLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLS 172
GLRPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+S
Sbjct: 116 GLRPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVS 175
Query: 173 PLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG 232
PLCPQLFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G
Sbjct: 176 PLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMG 235
Query: 233 LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
+GAGI LASTVCS+ QGK+I GPLLS +H+Y V++EMRA PVNTLNPQRTAMIVADF+K+
Sbjct: 236 IGAGIGLASTVCSTTQGKLIAGPLLSAVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKS 295
Query: 293 GRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLG 352
G++SSPA+LRYREDLLFP R+IE+AG+VK+G+ +V+ PS +L+ F +EKF+L
Sbjct: 296 GKVSSPAELRYREDLLFPNRVIEEAGSVKIGQPLRRVLSPSLVEQLRATFPDEKFLLTQK 355
Query: 353 SRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSEL 412
S T MVLEQ+ASGEDALRGWLVAA+A+ M +S TVL++AY+K VF +SE+
Sbjct: 356 SNKTYMVLEQSASGEDALRGWLVAAFASEMERSGIVSRDTVLKEAYEKTKMVFPSFVSEV 415
Query: 413 QAKGWHTDRFLDGTGTRFAW 432
+++GW+TD+FLDG G+R A+
Sbjct: 416 RSRGWYTDQFLDGNGSRVAF 435
>gi|115459438|ref|NP_001053319.1| Os04g0517300 [Oryza sativa Japonica Group]
gi|113564890|dbj|BAF15233.1| Os04g0517300 [Oryza sativa Japonica Group]
gi|116310213|emb|CAH67223.1| OSIGBa0145M07.5 [Oryza sativa Indica Group]
gi|215697660|dbj|BAG91654.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 608 bits (1568), Expect = e-171, Method: Compositional matrix adjust.
Identities = 290/432 (67%), Positives = 354/432 (81%), Gaps = 3/432 (0%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
MN+LE+I+ + + P W E S+S++R F G +SVKV+ DSRP++
Sbjct: 1 MNILERIRGGGDRAAVGEGPREPEP--WVEISESISRLCSFDA-GRVSVKVIQDSRPIHD 57
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
+++DSFLNKFFPSGYP+SVNEGYL YT+FRALQHF+SA L VLSTQSLLFAAGLRPTPAQ
Sbjct: 58 KMIDSFLNKFFPSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGLRPTPAQ 117
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
AT VSWILKDGMQH GKLICS +GARMDSEPK WRILAD LYD GT LEV+SPLCPQLFL
Sbjct: 118 ATAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRILADVLYDFGTALEVISPLCPQLFL 177
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
E+AG GNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LA
Sbjct: 178 EVAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLA 237
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
STVCS+ QGK+I GPLLSV+H+Y V++EMRA PVNTLNPQRTAMIVADF+K+G++SSPA+
Sbjct: 238 STVCSTTQGKLIAGPLLSVVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAE 297
Query: 301 LRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVL 360
LRYREDLLFP RLIE+AG+VK+G+ +V+ P + +LK F++EKF+L MVL
Sbjct: 298 LRYREDLLFPNRLIEEAGSVKIGQPVRRVLSPQRIEQLKATFSKEKFLLSRKDNSAYMVL 357
Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
EQ+A+GEDALRGWLVAA+A+ M +S TVL AY++M +VF ++E++++GW+TD
Sbjct: 358 EQSATGEDALRGWLVAAFASEMERSGVGSGDTVLNVAYERMENVFPMFVAEVKSRGWYTD 417
Query: 421 RFLDGTGTRFAW 432
+FLDG +R A+
Sbjct: 418 QFLDGNRSRIAY 429
>gi|357164706|ref|XP_003580140.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
distachyon]
Length = 437
Score = 600 bits (1548), Expect = e-169, Method: Compositional matrix adjust.
Identities = 288/435 (66%), Positives = 353/435 (81%), Gaps = 5/435 (1%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQP---DGHLSVKVVDDSRP 57
MN+LE+I+ K + P W E S+SV+R F G +SVKV+ DSRP
Sbjct: 1 MNILERIRGGGDKTTGTETPRKLEP--WVEISESVSRLCCFDARSRGGGISVKVIQDSRP 58
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT 117
++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FRALQHF+SA L VLSTQSLLFAAGLRPT
Sbjct: 59 IHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGLRPT 118
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
PAQAT VSWILKDGMQH GKLICS +GARMDSEPK WRI AD LYD GT L+ +SPLCPQ
Sbjct: 119 PAQATAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRIFADVLYDFGTALDFISPLCPQ 178
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
LFLE+AGLGNFAKG+AVVAARATRLPIYS+FAKEGNLSDLFAKGEAISTL NV+G+GAGI
Sbjct: 179 LFLEVAGLGNFAKGMAVVAARATRLPIYSAFAKEGNLSDLFAKGEAISTLSNVMGIGAGI 238
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
L+STVCS+ QGK+I+GPLLS +H+Y V++EMRA P+N+LNPQRTAM+VADF+K+G++SS
Sbjct: 239 GLSSTVCSTTQGKLIIGPLLSAVHLYGVVQEMRAIPINSLNPQRTAMVVADFIKSGKVSS 298
Query: 298 PADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTD 357
PA+LRYREDLLFP RLIE+AG+VKVG+ +++ P + +L+ + EKF+L L S T
Sbjct: 299 PAELRYREDLLFPNRLIEEAGSVKVGQPLCRILSPQRVEQLRATYPNEKFLLSLKSNKTY 358
Query: 358 MVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGW 417
MVLEQ+A+GEDALRGWLVAA+A+ M KS +VL +AY KM DVF +SE++++GW
Sbjct: 359 MVLEQSATGEDALRGWLVAAFASEMEKSGVGSRDSVLNEAYQKMEDVFPMFVSEVRSRGW 418
Query: 418 HTDRFLDGTGTRFAW 432
+TD+FLDG +R A+
Sbjct: 419 YTDQFLDGNRSRIAY 433
>gi|414586417|tpg|DAA36988.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 439
Score = 600 bits (1546), Expect = e-169, Method: Compositional matrix adjust.
Identities = 294/439 (66%), Positives = 355/439 (80%), Gaps = 11/439 (2%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPI--YWFETSDSVTRHYQFQPDGH----LSVKVVDD 54
MN+LE+I+ D++ + + P W E S+SV+R F G +SVKV+ D
Sbjct: 1 MNILERIRGGG----DRAAATERPPRPECWVEISESVSRLCSFDAAGSGSRSISVKVIQD 56
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
SRP++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQSLLFAAGL
Sbjct: 57 SRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL 116
Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
RPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPL
Sbjct: 117 RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPL 176
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
CPQLFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+G
Sbjct: 177 CPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIG 236
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
AGI LASTVCS+ QGK+I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K+G+
Sbjct: 237 AGIGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGK 296
Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSR 354
+SSPA+LRY+EDLLFP R+IE+AG+VK+G+ KV P +L+ F EKF+L S
Sbjct: 297 VSSPAELRYKEDLLFPNRVIEEAGSVKIGQPLRKVFSPRLVEQLRATFPGEKFLLSQKSN 356
Query: 355 WTDMVLEQNASGEDALRGWLVAAYAASMAKS-FHDPSLTVLQDAYDKMNDVFTPLLSELQ 413
T MVLE++ASGEDALRGWLVAA+A+ M +S VL +AY+KM VF +SE++
Sbjct: 357 RTYMVLERSASGEDALRGWLVAAFASEMERSGAGSRDDAVLSEAYEKMERVFPTFVSEVR 416
Query: 414 AKGWHTDRFLDGTGTRFAW 432
++GW+TD+FLDG +R ++
Sbjct: 417 SRGWYTDQFLDGNCSRVSF 435
>gi|226494744|ref|NP_001146435.1| uncharacterized protein LOC100280018 [Zea mays]
gi|219887187|gb|ACL53968.1| unknown [Zea mays]
Length = 439
Score = 597 bits (1539), Expect = e-168, Method: Compositional matrix adjust.
Identities = 293/439 (66%), Positives = 354/439 (80%), Gaps = 11/439 (2%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPI--YWFETSDSVTRHYQFQPDGH----LSVKVVDD 54
MN+LE+I+ D++ + + P W E S+SV+R F G +SVKV+ D
Sbjct: 1 MNILERIRGGG----DRAAATERPPRPECWVEISESVSRLCSFDAAGSGSRSISVKVIQD 56
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
SRP++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQSLLFAAGL
Sbjct: 57 SRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL 116
Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
RPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK W ILAD LYDLGT LEV+SPL
Sbjct: 117 RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWTILADVLYDLGTALEVVSPL 176
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
CPQLFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+G
Sbjct: 177 CPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIG 236
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
AGI LASTVCS+ QGK+I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K+G+
Sbjct: 237 AGIGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGK 296
Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSR 354
+SSPA+LRY+EDLLFP R+IE+AG+VK+G+ KV P +L+ F EKF+L S
Sbjct: 297 VSSPAELRYKEDLLFPNRVIEEAGSVKIGQPLRKVFSPRLVEQLRATFPGEKFLLSQKSN 356
Query: 355 WTDMVLEQNASGEDALRGWLVAAYAASMAKS-FHDPSLTVLQDAYDKMNDVFTPLLSELQ 413
T MVLE++ASGEDALRGWLVAA+A+ M +S VL +AY+KM VF +SE++
Sbjct: 357 RTYMVLERSASGEDALRGWLVAAFASEMERSGAGSRDDAVLSEAYEKMERVFPTFVSEVR 416
Query: 414 AKGWHTDRFLDGTGTRFAW 432
++GW+TD+FLDG +R ++
Sbjct: 417 SRGWYTDQFLDGNCSRVSF 435
>gi|38568058|emb|CAE05447.3| OSJNBa0073E02.7 [Oryza sativa Japonica Group]
gi|125549033|gb|EAY94855.1| hypothetical protein OsI_16651 [Oryza sativa Indica Group]
gi|125590999|gb|EAZ31349.1| hypothetical protein OsJ_15471 [Oryza sativa Japonica Group]
Length = 380
Score = 576 bits (1484), Expect = e-162, Method: Compositional matrix adjust.
Identities = 269/371 (72%), Positives = 320/371 (86%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
++DSFLNKFFPSGYP+SVNEGYL YT+FRALQHF+SA L VLSTQSLLFAAGLRPTPAQA
Sbjct: 1 MIDSFLNKFFPSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGLRPTPAQA 60
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
T VSWILKDGMQH GKLICS +GARMDSEPK WRILAD LYD GT LEV+SPLCPQLFLE
Sbjct: 61 TAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRILADVLYDFGTALEVISPLCPQLFLE 120
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+AG GNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LAS
Sbjct: 121 VAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLAS 180
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
TVCS+ QGK+I GPLLSV+H+Y V++EMRA PVNTLNPQRTAMIVADF+K+G++SSPA+L
Sbjct: 181 TVCSTTQGKLIAGPLLSVVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAEL 240
Query: 302 RYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLE 361
RYREDLLFP RLIE+AG+VK+G+ +V+ P + +LK F++EKF+L MVLE
Sbjct: 241 RYREDLLFPNRLIEEAGSVKIGQPVRRVLSPQRIEQLKATFSKEKFLLSRKDNSAYMVLE 300
Query: 362 QNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
Q+A+GEDALRGWLVAA+A+ M +S TVL AY++M +VF ++E++++GW+TD+
Sbjct: 301 QSATGEDALRGWLVAAFASEMERSGVGSGDTVLNVAYERMENVFPMFVAEVKSRGWYTDQ 360
Query: 422 FLDGTGTRFAW 432
FLDG +R A+
Sbjct: 361 FLDGNRSRIAY 371
>gi|255639094|gb|ACU19847.1| unknown [Glycine max]
Length = 302
Score = 530 bits (1364), Expect = e-148, Method: Compositional matrix adjust.
Identities = 249/302 (82%), Positives = 275/302 (91%)
Query: 19 MSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFS 78
M E P+ WFETSDSV R +QF+PDG LSV +VDDSRP+Y RV SF+NKFFPSGYP+S
Sbjct: 1 MKEGEAPVLWFETSDSVCRRHQFEPDGQLSVVIVDDSRPLYQRVAGSFMNKFFPSGYPYS 60
Query: 79 VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKL 138
VNEGYLRYTQFRA+QH TSAALSVLSTQSLLFAAGLRPTPAQAT VSWILKDGMQHVGKL
Sbjct: 61 VNEGYLRYTQFRAVQHVTSAALSVLSTQSLLFAAGLRPTPAQATAVSWILKDGMQHVGKL 120
Query: 139 ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAAR 198
ICSN GARMDSEPKRWR+LADALYD+GTGLEVLSPLCP LFLEMAGLGNFAKG+AVVAAR
Sbjct: 121 ICSNWGARMDSEPKRWRLLADALYDIGTGLEVLSPLCPHLFLEMAGLGNFAKGMAVVAAR 180
Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLS 258
ATRLPIYSSFAKEGNLSDLFAKGEA STLFNV+G+G GIQLAST+C+SMQGK++ GPLLS
Sbjct: 181 ATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLASTICASMQGKLVAGPLLS 240
Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAG 318
+IH+YSV EEMRA P+NTLNP+RTAM+VADF+K G +SSPADLRYRE+LLF + EDAG
Sbjct: 241 IIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAGIVSSPADLRYRENLLFNVHVKEDAG 300
Query: 319 NV 320
NV
Sbjct: 301 NV 302
>gi|168027023|ref|XP_001766030.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682673|gb|EDQ69089.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 397
Score = 498 bits (1282), Expect = e-138, Method: Compositional matrix adjust.
Identities = 244/389 (62%), Positives = 300/389 (77%), Gaps = 5/389 (1%)
Query: 49 VKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSL 108
VKV+ ++RP VDSFL++FFP GYP SV EGYL Y+++RA+QHF SA + VLSTQSL
Sbjct: 9 VKVLKETRPPVQIAVDSFLSRFFPLGYPNSVAEGYLDYSKYRAVQHFASAVMHVLSTQSL 68
Query: 109 LFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGL 168
LFAAGLRPTPAQAT+VSW+LKDGMQHVGKL+CS +GARMDSEPKRWRI AD +YD+G GL
Sbjct: 69 LFAAGLRPTPAQATIVSWVLKDGMQHVGKLLCSRMGARMDSEPKRWRIFADFMYDVGAGL 128
Query: 169 EVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
EV+SPLCPQ FL +AG+ N AKG+A+V+ARATRLP+YSSF KEGNLSDL+AKGEAISTL
Sbjct: 129 EVISPLCPQHFLPVAGMANLAKGMALVSARATRLPVYSSFVKEGNLSDLYAKGEAISTLS 188
Query: 229 NVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
NV+GLG GI LAS S++QGK+I P+LS IH+Y V +EMRAAP+NTLN QRTAM+VAD
Sbjct: 189 NVLGLGVGIHLASNFASTIQGKLIWAPILSAIHLYCVQQEMRAAPINTLNSQRTAMLVAD 248
Query: 289 FVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHK-VVKPSKFHELKELFAEEKF 347
FVKTG +SSPA+LRYRE L+ P L +DAGNV+VG + KPS ELKE F E+F
Sbjct: 249 FVKTGSVSSPAELRYRERLILPVGLEKDAGNVRVGAPLQSGIGKPSVMAELKEKFQGEQF 308
Query: 348 ILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKS--FHDPSL--TVLQDAYDKMND 403
+L G + TDMVL Q ASGEDA+RGWL+A + + + K F D + +L+ AYD+
Sbjct: 309 LLHFGEQATDMVLHQKASGEDAVRGWLLATHVSRLCKGRKFQDVDVYGKILEKAYDETER 368
Query: 404 VFTPLLSELQAKGWHTDRFLDGTGTRFAW 432
+S L+ GWH+ FL+G+G R W
Sbjct: 369 CLPAFISGLKDHGWHSHLFLEGSGVRAVW 397
>gi|302756337|ref|XP_002961592.1| hypothetical protein SELMODRAFT_77235 [Selaginella moellendorffii]
gi|300170251|gb|EFJ36852.1| hypothetical protein SELMODRAFT_77235 [Selaginella moellendorffii]
Length = 399
Score = 498 bits (1281), Expect = e-138, Method: Compositional matrix adjust.
Identities = 238/406 (58%), Positives = 309/406 (76%), Gaps = 12/406 (2%)
Query: 28 WFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYT 87
W E S++ R Y + DG L +++DD+RP+ H++VD FL++FFP GYP SV EGYL Y+
Sbjct: 5 WRERSENARREYVSKGDGTLQARLLDDNRPLKHKLVDGFLDQFFPVGYPQSVGEGYLTYS 64
Query: 88 QFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARM 147
QFRALQHF+SA LSV STQSL+FAAGLRPTPAQATV+SW+LKDGMQH+GKL+ S +GARM
Sbjct: 65 QFRALQHFSSAILSVFSTQSLVFAAGLRPTPAQATVISWVLKDGMQHIGKLVGSRMGARM 124
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
DSEPKRWRI AD LYD GT LE++SP CPQ FL +AGL N AKG+A+V ARATRLP+Y++
Sbjct: 125 DSEPKRWRIFADVLYDAGTALEIISPACPQYFLGVAGLANLAKGVAMVTARATRLPLYTA 184
Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIE 267
FAKEGNLSDL+AKGEA+STL NV GLG GI+LAS+ C+S+QGK++V PLLSVIH+YSV+E
Sbjct: 185 FAKEGNLSDLYAKGEAMSTLANVAGLGVGIRLASSACTSLQGKLLVAPLLSVIHLYSVME 244
Query: 268 EMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPH 327
EMRA P++TLN QRTAM+VAD+++ GR+ SPA++RYRE L+ P + AG+V++ D
Sbjct: 245 EMRAVPLDTLNAQRTAMLVADYLELGRVPSPAEIRYRERLILPVKPNVHAGSVRIANDIL 304
Query: 328 KVVKPS-KFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSF 386
++ PS K ELK F E+F+L T+MVL +ASGED +RGWL+AA ++ A
Sbjct: 305 QI--PSGKIRELKHRFQSERFLLDFHETSTEMVLHHSASGEDVVRGWLLAACSSKEAS-- 360
Query: 387 HDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTRFAW 432
L DAY +++ + LS+L+++GW T FL G+ R W
Sbjct: 361 -------LSDAYSRVDKMLPEFLSQLKSRGWQTQLFLHGSAARAFW 399
>gi|302775588|ref|XP_002971211.1| hypothetical protein SELMODRAFT_94812 [Selaginella moellendorffii]
gi|300161193|gb|EFJ27809.1| hypothetical protein SELMODRAFT_94812 [Selaginella moellendorffii]
Length = 399
Score = 497 bits (1280), Expect = e-138, Method: Compositional matrix adjust.
Identities = 238/406 (58%), Positives = 309/406 (76%), Gaps = 12/406 (2%)
Query: 28 WFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYT 87
W E S++ R Y + DG L +++DD+RP+ H++VD FL++FFP GYP SV EGYL Y+
Sbjct: 5 WRERSENARREYVSKGDGTLQARLLDDNRPLKHKLVDGFLDQFFPVGYPQSVGEGYLTYS 64
Query: 88 QFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARM 147
QFRALQHF+SA LSV STQSL+FAAGLRPTPAQATV+SW+LKDGMQH+GKL+ S +GARM
Sbjct: 65 QFRALQHFSSAILSVFSTQSLVFAAGLRPTPAQATVISWVLKDGMQHIGKLVGSRMGARM 124
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
DSEPKRWRI AD LYD GT LE++SP CPQ FL +AGL N AKG+A+V ARATRLP+Y++
Sbjct: 125 DSEPKRWRIFADVLYDAGTALEMISPACPQYFLGVAGLANLAKGVAMVTARATRLPLYTA 184
Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIE 267
FAKEGNLSDL+AKGEA+STL NV GLG GI+LAS+ C+S+QGK++V PLLSVIH+YSV+E
Sbjct: 185 FAKEGNLSDLYAKGEAMSTLANVAGLGVGIRLASSACTSLQGKLLVAPLLSVIHLYSVME 244
Query: 268 EMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPH 327
EMRA P++TLN QRTAM+VAD+++ GR+ SPA++RYRE L+ P + AG+V++ D
Sbjct: 245 EMRAVPLDTLNAQRTAMLVADYLELGRVPSPAEIRYRERLILPVKPNVHAGSVRIANDIL 304
Query: 328 KVVKPS-KFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSF 386
++ PS K ELK F E+F+L T+MVL +ASGED +RGWL+AA ++ A
Sbjct: 305 QI--PSGKIRELKHRFQSERFLLDFHETSTEMVLHHSASGEDVVRGWLLAACSSKEAS-- 360
Query: 387 HDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTRFAW 432
L DAY +++ + LS+L+++GW T FL G+ R W
Sbjct: 361 -------LSDAYSRVDKMLPEFLSQLKSRGWQTQLFLHGSAARAFW 399
>gi|414586418|tpg|DAA36989.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 297
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 221/297 (74%), Positives = 252/297 (84%), Gaps = 10/297 (3%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPI--YWFETSDSVTRHYQFQPDGH----LSVKVVDD 54
MN+LE+I+ D++ + + P W E S+SV+R F G +SVKV+ D
Sbjct: 1 MNILERIRGGG----DRAAATERPPRPECWVEISESVSRLCSFDAAGSGSRSISVKVIQD 56
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
SRP++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQSLLFAAGL
Sbjct: 57 SRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL 116
Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
RPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPL
Sbjct: 117 RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPL 176
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
CPQLFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+G
Sbjct: 177 CPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIG 236
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
AGI LASTVCS+ QGK+I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K
Sbjct: 237 AGIGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIK 293
>gi|414586420|tpg|DAA36991.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 306
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 210/302 (69%), Positives = 253/302 (83%), Gaps = 1/302 (0%)
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
MQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPLCPQLFLE+AGLGNFAKG
Sbjct: 1 MQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKG 60
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LASTVCS+ QGK+
Sbjct: 61 MAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKL 120
Query: 252 IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPG 311
I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K+G++SSPA+LRY+EDLLFP
Sbjct: 121 IAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAELRYKEDLLFPN 180
Query: 312 RLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
R+IE+AG+VK+G+ KV P +L+ F EKF+L S T MVLE++ASGEDALR
Sbjct: 181 RVIEEAGSVKIGQPLRKVFSPRLVEQLRATFPGEKFLLSQKSNRTYMVLERSASGEDALR 240
Query: 372 GWLVAAYAASMAKS-FHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTRF 430
GWLVAA+A+ M +S VL +AY+KM VF +SE++++GW+TD+FLDG +R
Sbjct: 241 GWLVAAFASEMERSGAGSRDDAVLSEAYEKMERVFPTFVSEVRSRGWYTDQFLDGNCSRV 300
Query: 431 AW 432
++
Sbjct: 301 SF 302
>gi|414586414|tpg|DAA36985.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
Length = 288
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/269 (72%), Positives = 225/269 (83%), Gaps = 10/269 (3%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPI--YWFETSDSVTRHYQFQPDGH----LSVKVVDD 54
MN+LE+I+ D++ + + P W E S+SV+R F G +SVKV+ D
Sbjct: 1 MNILERIRGGG----DRAAATERPPRPECWVEISESVSRLCSFDAAGSGSRSISVKVIQD 56
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
SRP++ ++VDSFLNK FPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQSLLFAAGL
Sbjct: 57 SRPIHDKLVDSFLNKIFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL 116
Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
RPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPL
Sbjct: 117 RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPL 176
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
CPQLFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+G
Sbjct: 177 CPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIG 236
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVY 263
AGI LASTVCS+ QGK+I GPLLS +H++
Sbjct: 237 AGIGLASTVCSTTQGKLIAGPLLSAVHIW 265
>gi|149392593|gb|ABR26099.1| unknown [Oryza sativa Indica Group]
Length = 282
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 185/274 (67%), Positives = 230/274 (83%)
Query: 159 DALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLF 218
D LYD GT LEV+SPLCPQLFLE+AG GNFAKG+AVVAARATRLPIYSSFAKEGNLSDLF
Sbjct: 1 DVLYDFGTALEVISPLCPQLFLEVAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLF 60
Query: 219 AKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLN 278
AKGEAISTLFNV+G+GAGI LASTVCS+ QGK+I GPLLSV+H+Y V++EMRA PVNTLN
Sbjct: 61 AKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAGPLLSVVHIYGVVQEMRATPVNTLN 120
Query: 279 PQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHEL 338
PQRTAMIVADF+K+G++SSPA+LRYREDLLFP RLIE+AG+VK+G+ +V+ P + +L
Sbjct: 121 PQRTAMIVADFIKSGKVSSPAELRYREDLLFPNRLIEEAGSVKIGQPVRRVLSPQRIEQL 180
Query: 339 KELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAY 398
K F++EKF+L MVLEQ+A+GEDALRGWLVAA+A+ M +S TVL AY
Sbjct: 181 KATFSKEKFLLSRKDNSAYMVLEQSATGEDALRGWLVAAFASEMERSGVGSGDTVLNVAY 240
Query: 399 DKMNDVFTPLLSELQAKGWHTDRFLDGTGTRFAW 432
++M +VF ++E++++GW+TD+FLDG +R A+
Sbjct: 241 ERMENVFPMFVAEVKSRGWYTDQFLDGNRSRIAY 274
>gi|414586419|tpg|DAA36990.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 311
Score = 390 bits (1001), Expect = e-106, Method: Compositional matrix adjust.
Identities = 197/313 (62%), Positives = 239/313 (76%), Gaps = 19/313 (6%)
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
A V WIL Q VG + N AD LYDLGT LEV+SPLCPQLFL
Sbjct: 13 AWVQEWILN---QRVGGYLVRN---------------ADVLYDLGTALEVVSPLCPQLFL 54
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
E+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LA
Sbjct: 55 EVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLA 114
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
STVCS+ QGK+I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K+G++SSPA+
Sbjct: 115 STVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAE 174
Query: 301 LRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVL 360
LRY+EDLLFP R+IE+AG+VK+G+ KV P +L+ F EKF+L S T MVL
Sbjct: 175 LRYKEDLLFPNRVIEEAGSVKIGQPLRKVFSPRLVEQLRATFPGEKFLLSQKSNRTYMVL 234
Query: 361 EQNASGEDALRGWLVAAYAASMAKS-FHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHT 419
E++ASGEDALRGWLVAA+A+ M +S VL +AY+KM VF +SE++++GW+T
Sbjct: 235 ERSASGEDALRGWLVAAFASEMERSGAGSRDDAVLSEAYEKMERVFPTFVSEVRSRGWYT 294
Query: 420 DRFLDGTGTRFAW 432
D+FLDG +R ++
Sbjct: 295 DQFLDGNCSRVSF 307
>gi|414586421|tpg|DAA36992.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 164
Score = 293 bits (750), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 137/160 (85%), Positives = 150/160 (93%)
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
MQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPLCPQLFLE+AGLGNFAKG
Sbjct: 1 MQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKG 60
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LASTVCS+ QGK+
Sbjct: 61 MAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKL 120
Query: 252 IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K
Sbjct: 121 IAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIK 160
>gi|413942077|gb|AFW74726.1| hypothetical protein ZEAMMB73_469739 [Zea mays]
Length = 425
Score = 286 bits (733), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 162/302 (53%), Positives = 186/302 (61%), Gaps = 79/302 (26%)
Query: 48 SVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQS 107
+VKV+ DSRP++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQ
Sbjct: 57 TVKVIQDSRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQL 116
Query: 108 LL-----------------FAAGLRPTPAQATVVS------------------------- 125
+ +A L +V++
Sbjct: 117 VFQNRLLSVLSSLTSAHFNYARQLTAQVGCGSVLAVCCRFATYPDASNCCKLGKHMMTEM 176
Query: 126 ----WILKDGMQHVGKLICSNLGARMDSEPKRWRIL------------------------ 157
ILKDGMQH KLICS++GARMDSEPK WRIL
Sbjct: 177 LKSKQILKDGMQHTRKLICSSMGARMDSEPKSWRILGLTAYVLYGCHCLGSHFTIFPIIC 236
Query: 158 -------ADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAK 210
AD LYDLGT LEV+SPLCPQLFLE+AGLGNFAKG+ VVAARATRLPIYSSFAK
Sbjct: 237 LYCSILTADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMVVVAARATRLPIYSSFAK 296
Query: 211 EGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMR 270
EGNLSDLFAKGEAISTLFNV+G+GAGI LASTVCS+ QGK + PLL SV E +
Sbjct: 297 EGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKEV--PLLEKYKCRSVNEFVD 354
Query: 271 AA 272
A
Sbjct: 355 AC 356
>gi|388492306|gb|AFK34219.1| unknown [Lotus japonicus]
Length = 182
Score = 261 bits (668), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 123/182 (67%), Positives = 148/182 (81%)
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
M+VGPLLS++H+YSV EEMRA P+NTLNP+RTAMIV DF+K G +SSPADLRYREDLLF
Sbjct: 1 MVVGPLLSILHIYSVSEEMRATPINTLNPRRTAMIVTDFLKAGVVSSPADLRYREDLLFR 60
Query: 311 GRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDAL 370
RL EDAGNV+VGR H+V+KPS+ EL+++ EKF+L G + DMVLEQ+ASGEDAL
Sbjct: 61 VRLTEDAGNVRVGRALHEVIKPSRLLELEQVLPGEKFLLNRGGKCVDMVLEQDASGEDAL 120
Query: 371 RGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTRF 430
RGWLVAAYAA + S H+ S +VL +AY KM VF L ELQ+KGWHTDRFLDGTG+RF
Sbjct: 121 RGWLVAAYAAQIENSSHELSASVLHEAYKKMTGVFPVFLKELQSKGWHTDRFLDGTGSRF 180
Query: 431 AW 432
A+
Sbjct: 181 AF 182
>gi|413942078|gb|AFW74727.1| hypothetical protein ZEAMMB73_469739, partial [Zea mays]
Length = 199
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 116/143 (81%), Positives = 129/143 (90%)
Query: 48 SVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQS 107
+VKV+ DSRP++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQS
Sbjct: 57 TVKVIQDSRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQS 116
Query: 108 LLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTG 167
LLFA GLRPTP QAT VSWILKDGMQH KLICS++GARMDSEPK WRILAD LYDLGT
Sbjct: 117 LLFAVGLRPTPTQATAVSWILKDGMQHTRKLICSSMGARMDSEPKSWRILADVLYDLGTA 176
Query: 168 LEVLSPLCPQLFLEMAGLGNFAK 190
LEV+SPLCPQLFLE+AGLGNFAK
Sbjct: 177 LEVVSPLCPQLFLEVAGLGNFAK 199
>gi|414586416|tpg|DAA36987.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
Length = 155
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 112/132 (84%), Positives = 124/132 (93%)
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
MQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPLCPQLFLE+AGLGNFAKG
Sbjct: 1 MQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKG 60
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LASTVCS+ QGK+
Sbjct: 61 MAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKL 120
Query: 252 IVGPLLSVIHVY 263
I GPLLS +H++
Sbjct: 121 IAGPLLSAVHIW 132
>gi|356510851|ref|XP_003524147.1| PREDICTED: UPF0420 protein-like, partial [Glycine max]
Length = 158
Score = 227 bits (579), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 107/123 (86%), Positives = 115/123 (93%)
Query: 128 LKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
LKDGMQHVGKLICSN G MDSEPKRWR+LADALYD+GTGLEVLSP CP LFLEMAGLGN
Sbjct: 1 LKDGMQHVGKLICSNWGGTMDSEPKRWRLLADALYDIGTGLEVLSPRCPHLFLEMAGLGN 60
Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
FAKG++VVAARATRLPIYSSFAKEGNLSDL AKGEA STLFNV+G+G GIQLAST+C+SM
Sbjct: 61 FAKGMSVVAARATRLPIYSSFAKEGNLSDLLAKGEAFSTLFNVIGIGVGIQLASTICASM 120
Query: 248 QGK 250
QGK
Sbjct: 121 QGK 123
>gi|356498113|ref|XP_003517898.1| PREDICTED: UPF0420 protein-like, partial [Glycine max]
Length = 171
Score = 224 bits (571), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 106/126 (84%), Positives = 117/126 (92%)
Query: 128 LKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
LK+GMQHVGKLI SN G MD EPKRWR+LADALYD+GTGLEVLSP CP LFLEMAGLGN
Sbjct: 1 LKNGMQHVGKLIGSNWGGTMDFEPKRWRLLADALYDIGTGLEVLSPWCPHLFLEMAGLGN 60
Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
FAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEA STLFNV+G+G GIQLAST+C+SM
Sbjct: 61 FAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLASTICASM 120
Query: 248 QGKMIV 253
QGK+++
Sbjct: 121 QGKVVI 126
>gi|414586415|tpg|DAA36986.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
Length = 163
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 105/165 (63%), Positives = 126/165 (76%), Gaps = 10/165 (6%)
Query: 1 MNLLEKIKKQKKKEPDKSMSVQEIPI--YWFETSDSVTRHYQFQPDGH----LSVKVVDD 54
MN+LE+I+ D++ + + P W E S+SV+R F G +SVKV+ D
Sbjct: 1 MNILERIRGGG----DRAAATERPPRPECWVEISESVSRLCSFDAAGSGSRSISVKVIQD 56
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
SRP++ ++VDSFLNK FPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQSLLFAAGL
Sbjct: 57 SRPIHDKLVDSFLNKIFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL 116
Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILAD 159
RPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK WRIL +
Sbjct: 117 RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILGE 161
>gi|449016609|dbj|BAM80011.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 496
Score = 191 bits (484), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 125/409 (30%), Positives = 205/409 (50%), Gaps = 12/409 (2%)
Query: 22 QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNE 81
Q P+ E R Y+ GHL V V R + D F F P GYP SV
Sbjct: 76 QVPPVLAVEQIRYGQREYRINSAGHLEVAEVRTRRAWGLKAADRFSRMFLPDGYPDSVGP 135
Query: 82 GYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQ-ATVVSWILKDGMQHVGKLI 139
Y YT++R F + V STQSLL A G+ RP+ A A + W+++DG+ G+++
Sbjct: 136 DYAEYTKWRGRAFFFGGMVGVFSTQSLLLAVGVGRPSAAPIAAALQWVIRDGLGRAGRML 195
Query: 140 CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARA 199
S +G D+E K++R+ A + ++ LE L+PL P LFL +A L N AKG + VAA +
Sbjct: 196 FSQVGTGFDAETKQYRLAAAFVLNVSCALEALTPLFPHLFLPLASLANMAKGASTVAAAS 255
Query: 200 TRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSV 259
TR IY SF + NL D+ AK E + +++G G+ L+ S + +I S+
Sbjct: 256 TRGAIYRSFMRRENLGDITAKQETVGVAGDLLGTAVGVILSRASSHSHRLSVIAFIGASL 315
Query: 260 IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK-TGRISSPADLRYREDLLFPGRLIE-DA 317
H+ S E+++ + TLN QR M++ ++++ R+ D+ RE L++ L A
Sbjct: 316 GHLISAYYEVKSVELRTLNRQRAHMLIWNYLEDQDRVLPIRDVNRRERLVYRPWLDSLHA 375
Query: 318 GNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVA 376
NV++G R L L+ +EK+++ +VL +A+ +D L+ + +
Sbjct: 376 PNVEIGARLSDAAPDADALKFLLRLYRDEKYMMNYAGGRVRIVLRNDATHQDQLKAYFQS 435
Query: 377 A-----YAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
Y S + P+ +++D+Y N +F + + +A+GW+ D
Sbjct: 436 RAFWRLYMTSGER--FSPNRALIEDSYRATNQLFPSFVRDCRAEGWNVD 482
>gi|452824810|gb|EME31810.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
Length = 636
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 110/347 (31%), Positives = 190/347 (54%), Gaps = 16/347 (4%)
Query: 39 YQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSA 98
YQF D + DD RP++ +++D + P GYP SV + YL Y+ +RA+Q+ ++
Sbjct: 249 YQFSDDMIQEGRFKDD-RPIFEKLIDWLSRVYLPDGYPNSVTKDYLSYSIWRAIQNLAAS 307
Query: 99 ALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWR 155
+ V +T++LLF GL + TPA + +SW+LKDG+ ++GK+ ++ G + D +PK WR
Sbjct: 308 VMGVFATEALLFGLGLGRKSTPATSAAISWVLKDGLGYIGKVFYGSIAGNQFDVDPKSWR 367
Query: 156 ILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLS 215
+ ADA+ D G LE+++PL P FL +A + N KG+A + ATR IY S A N
Sbjct: 368 LAADAVEDAGGVLEIITPLFPGHFLLLASVANAMKGVAAMTGTATRHAIYKSLALHENQG 427
Query: 216 DLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVN 275
D+ KGE+ +VGLG GI +S + + + L +H+++ + +
Sbjct: 428 DIATKGESQGVTCKMVGLGLGIVASSKMGQNYFALLSAYALGCFVHLFANWKSLSCVQFA 487
Query: 276 TLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP---GRLIEDAGNVKVGRDPHKVVKP 332
TLN QR ++ + +F++TG + SP ++ RE ++ P G L + VG + + +
Sbjct: 488 TLNRQRCSLALQEFLQTGNVPSPYEVSRREKVVVPPWKGYL----DRIVVGANLKECFQN 543
Query: 333 SKFHELKE---LFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVA 376
+ EL+E L+ + ++IL +VL ++ +D + ++ A
Sbjct: 544 GR--ELREAVALYRDCRYILTQRDNKILIVLHESCRPQDIFQAFMQA 588
>gi|449018781|dbj|BAM82183.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 631
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 94/225 (41%), Positives = 137/225 (60%), Gaps = 5/225 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILK 129
F P G+P SV YLRY+ +R Q+ SA +VLST +LL A GL TP A SW+LK
Sbjct: 242 FLPEGFPESVGPSYLRYSLWRGAQNVVSAMTAVLSTHALLAAVGLSSTPV-AAATSWVLK 300
Query: 130 DGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFA 189
DG+ +GKL+ + G D++PKR+R+ +D LYD G LE+L+PL PQ FL +A LGNF
Sbjct: 301 DGIGQLGKLLTARRGRDFDADPKRYRLSSDLLYDAGLSLEILTPLFPQYFLALAALGNFT 360
Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
K +A+ A R + SSF GNL D+ AK +A + + N+ G+G GI A + +
Sbjct: 361 KSVAITVGMACRNSVLSSFVLRGNLGDISAKNDAQNVVTNLTGMGLGIVTARAL--PPKP 418
Query: 250 KMIVGPLLSVIHVYSVI--EEMRAAPVNTLNPQRTAMIVADFVKT 292
+ +G ++ +YSV+ + M+A +N +N QR A+I FV++
Sbjct: 419 SVRLGAFAALTALYSVLNYQSMKAVKLNVVNRQRGAIIADSFVRS 463
>gi|452823212|gb|EME30224.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
Length = 849
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 105/315 (33%), Positives = 173/315 (54%), Gaps = 15/315 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRP-TPAQATVVSWIL 128
FFP G V++ Y +T +R Q S+ + V TQ+LL A G++ +A SW+L
Sbjct: 422 FFPEG----VSQDYYSFTMWRVFQRLVSSTVGVFGTQALLLALGIKAGRIGRAAATSWVL 477
Query: 129 KDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
KDG+ +GK++ S +G D++PKRWR + LY LG GLE+++ + P FL A L N
Sbjct: 478 KDGLGRLGKMVWASQMGRDFDADPKRWRFRSALLYALGNGLEIVTQIFPASFLLFATLAN 537
Query: 188 FAKGIAVVAARATRLPIYSSFA-KEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
K I+++ A ATR +Y +FA + N++D+ AKGEA + +++G+ GIQL+ + +S
Sbjct: 538 SMKQISMLTASATRNAMYKNFAGRSENIADITAKGEAQIVVADLLGMALGIQLSKLIGTS 597
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ LSV+ +Y+ +E+RA TLN +R++MIV FV+ G + P ++ RE+
Sbjct: 598 RPNVLTAYSFLSVLDIYAFYKELRAVQFRTLNYERSSMIVDYFVRRGYVPRPDEVSQREN 657
Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWT---DMVLEQN 363
+ R + + V P + + L ++F E+F++ G W +VL ++
Sbjct: 658 IFLGPRYDVRSMFASLS---DAVSCPQELNSLIKIFKGEQFMVTRG--WNGQYRIVLREH 712
Query: 364 ASGEDALRGWLVAAY 378
A D LR L +
Sbjct: 713 ARNGDILRALLTLGF 727
>gi|302771033|ref|XP_002968935.1| hypothetical protein SELMODRAFT_31929 [Selaginella moellendorffii]
gi|300163440|gb|EFJ30051.1| hypothetical protein SELMODRAFT_31929 [Selaginella moellendorffii]
Length = 389
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 187/387 (48%), Gaps = 35/387 (9%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQ 120
+D+ P GYP +V++ Y YT +R Q S VL+TQ+LL+A GL +
Sbjct: 4 CLDALKTTMLPIGYPKTVSDDYTEYTLWRMGQIIASQISGVLTTQALLYAVGLGKGAIPT 63
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
A ++W+LKDG+ ++ K++ S G D PK WR+L+D + D GLE+L+P P LF+
Sbjct: 64 AAAINWVLKDGIGYLSKIVLSKYGRHFDVHPKGWRLLSDLIEDGACGLELLTPTFPHLFV 123
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ + K A + ATR Y+ FA + N +++ AKGEA + +G+ GI ++
Sbjct: 124 PLTAVAGAGKSAAGLIQAATRSCFYAGFATQNNFAEVIAKGEAQGMVSKSLGIALGIGIS 183
Query: 241 STVCSSMQGKMIVGPLL-------SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
S V SS GPLL + +HV ++ +A ++TLNP RT +I+A+++ T
Sbjct: 184 SKVGSS-------GPLLVLSFGTVTAVHVLCNLKSYQAVQLHTLNPYRTGLIMAEYLTTQ 236
Query: 294 RISSPADLRYREDLLFPGRLIEDA------------------GNVKVGRD-PHKVVKPSK 334
RI S D+ E + N+K+G V
Sbjct: 237 RIPSVRDVNSVEPIFTVADFFSSKKAPLSISCESKCAAADIEKNLKLGVSFGEAVASRGA 296
Query: 335 FHELKELFAEEKFILCL-GSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTV 393
L +L++ E+++L S +VL+++AS D LR + A+Y + +
Sbjct: 297 AQALLDLYSGEEYLLTYDDSHCFKVVLKKSASPHDMLRAVVQASYLRHIHNQQSLSKDEL 356
Query: 394 LQDAYDKMNDVFTPLLSELQAKGWHTD 420
L+ +Y M F L+ LQA GW T
Sbjct: 357 LKLSYQSMQQSFNELIQGLQAAGWSTS 383
>gi|302816607|ref|XP_002989982.1| hypothetical protein SELMODRAFT_130761 [Selaginella moellendorffii]
gi|300142293|gb|EFJ08995.1| hypothetical protein SELMODRAFT_130761 [Selaginella moellendorffii]
Length = 440
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 118/400 (29%), Positives = 190/400 (47%), Gaps = 39/400 (9%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQ 120
+D+ P GYP +V++ Y YT +R Q S VL+TQ+LL+A GL +
Sbjct: 45 CLDALKTTMLPIGYPKTVSDDYTEYTLWRMGQIIASQISGVLTTQALLYAVGLGKGAIPT 104
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
A ++W+LKDG+ ++ K++ S G D PK WR+L+D + D GLE+L+P P LF+
Sbjct: 105 AAAINWVLKDGIGYLSKIVLSKYGRHFDVHPKGWRLLSDLIEDGACGLELLTPTFPHLFV 164
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ + K A + ATR Y+ FA + N +++ AKGEA + +G+ GI ++
Sbjct: 165 PLTAVAGAGKSAAGLIQAATRSCFYAGFATQNNFAEVIAKGEAQGMVSKSLGIALGIGIS 224
Query: 241 STVCSSMQGKMIVGPLL-------SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
S V SS GPLL + +HV ++ +A ++TLNP RT +I+A+++ T
Sbjct: 225 SKVGSS-------GPLLVLSFGTVTAVHVLCNLKSYQAVQLHTLNPYRTGLIMAEYLTTQ 277
Query: 294 RISSPADLRYREDLLFPGRLIEDA----------------------GNVKVGRD-PHKVV 330
RI S D+ E + N+K+G V
Sbjct: 278 RIPSVRDVNSVEPIFTVADFFSSKKAPVSTHLSISCESKCAAADIEKNLKLGVSFGEAVA 337
Query: 331 KPSKFHELKELFAEEKFILCL-GSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDP 389
L +L++ E+++L S +VL+++AS D LR + A+Y +
Sbjct: 338 SRGAAQALLDLYSGEEYLLTYDDSHCFKVVLKKSASPHDMLRAVVQASYLRHIHNQQSLS 397
Query: 390 SLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTR 429
+L+ +Y M F L+ LQA GW T + G+
Sbjct: 398 KDELLKLSYQSMQQSFNELIQGLQAAGWSTSGLVARPGSN 437
>gi|449018940|dbj|BAM82342.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 738
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 111/363 (30%), Positives = 190/363 (52%), Gaps = 35/363 (9%)
Query: 76 PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRP--TPAQATVVSWILKDGMQ 133
P V Y + ++R Q S+ + V TQ+LL A G++ +QA +SW+LKDG+
Sbjct: 380 PEDVTADYYEFAKWRVFQRLMSSTVGVFGTQALLLALGIKSGNKISQAATISWVLKDGLS 439
Query: 134 HVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
VGK++ S LG MD++PKR+R + LY LG GLE+ + + PQ FL +A L N AK
Sbjct: 440 RVGKMLWASQLGRDMDADPKRFRFGSALLYSLGNGLEIATRIVPQSFLLVATLANTAKLC 499
Query: 193 AVVAARATRLPIYSSFA-KEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
+++ A ATR +Y SFA + N++D+ AKGEA T+ ++ G+ GIQL+ + +S +
Sbjct: 500 SMLTASATRNAMYRSFADRNENIADITAKGEAQITIADLGGMALGIQLSKIIGTSRPKVV 559
Query: 252 IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPG 311
+LS I +Y++ +E+R TLN QR+++++ ++KTG + +P + E ++ P
Sbjct: 560 ATYLILSTIDLYAIWKELRVIEFRTLNLQRSSLVIEHYLKTGCVPTPKQVSAAERVVLPE 619
Query: 312 RLIEDA--GNVKVGRDPHKVVKPSKFHELKELFAEEKFIL----------CLGSRWTDMV 359
+ A K+ RD + ++ + + +F++ CL S +V
Sbjct: 620 PIPRHAFVAFSKLVRDEQEA------EQMIKRYGSARFLMRYGEVHWPWTCLHSSRGGIV 673
Query: 360 LEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYD---KMNDVFTPLLSELQAKG 416
L ++A +D R L ++ L ++Q+ + + + V PLL + G
Sbjct: 674 LHRDADQQDIFRAILC----------WNLRRLGIVQEEDEVLARADVVMEPLLKAAREAG 723
Query: 417 WHT 419
W T
Sbjct: 724 WRT 726
>gi|8978258|dbj|BAA98149.1| unnamed protein product [Arabidopsis thaliana]
Length = 407
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 113/359 (31%), Positives = 190/359 (52%), Gaps = 16/359 (4%)
Query: 63 VDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
V SFL + P G+P SVNE Y+ Y +RAL+HF A+ V +TQ+LL + G + +
Sbjct: 14 VGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSSAS 73
Query: 122 TV--VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + P LF
Sbjct: 74 AAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 133
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
L +A N K +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G I L
Sbjct: 134 LPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSI-L 192
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
S S+ + LLS ++ S +E+R+ ++TLN R + V F+KTGR+ S
Sbjct: 193 ISKRNPSL---VTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQ 249
Query: 300 DLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM 358
+ +E + FP ++D + R PS + +K F +E++++ +
Sbjct: 250 EGNIQEKIFTFP--WVDDRPVMLGARFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKV 307
Query: 359 --VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAK 415
+L+ A+ +D L+ AA+ A + F + S + ++++ F P EL+++
Sbjct: 308 YALLKHQANSDDILK----AAFHAHVLLHFMNQSKDGNPRSVEQLDPAFAPTEYELESR 362
>gi|18423070|ref|NP_568713.1| uncharacterized protein [Arabidopsis thaliana]
gi|16604643|gb|AAL24114.1| unknown protein [Arabidopsis thaliana]
gi|23296703|gb|AAN13151.1| unknown protein [Arabidopsis thaliana]
gi|332008476|gb|AED95859.1| uncharacterized protein [Arabidopsis thaliana]
Length = 497
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 113/359 (31%), Positives = 190/359 (52%), Gaps = 16/359 (4%)
Query: 63 VDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
V SFL + P G+P SVNE Y+ Y +RAL+HF A+ V +TQ+LL + G + +
Sbjct: 104 VGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSSAS 163
Query: 122 TV--VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + P LF
Sbjct: 164 AAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 223
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
L +A N K +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G I L
Sbjct: 224 LPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSI-L 282
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
S S+ + LLS ++ S +E+R+ ++TLN R + V F+KTGR+ S
Sbjct: 283 ISKRNPSL---VTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQ 339
Query: 300 DLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM 358
+ +E + FP ++D + R PS + +K F +E++++ +
Sbjct: 340 EGNIQEKIFTFP--WVDDRPVMLGARFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKV 397
Query: 359 --VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAK 415
+L+ A+ +D L+ AA+ A + F + S + ++++ F P EL+++
Sbjct: 398 YALLKHQANSDDILK----AAFHAHVLLHFMNQSKDGNPRSVEQLDPAFAPTEYELESR 452
>gi|297795749|ref|XP_002865759.1| EMB1879 [Arabidopsis lyrata subsp. lyrata]
gi|297311594|gb|EFH42018.1| EMB1879 [Arabidopsis lyrata subsp. lyrata]
Length = 499
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 113/359 (31%), Positives = 189/359 (52%), Gaps = 16/359 (4%)
Query: 63 VDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
V SFL + P G+P SVNE Y+ Y +RAL+HF A+ V +TQ+LL + G + +
Sbjct: 107 VGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSSAS 166
Query: 122 TV--VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + P LF
Sbjct: 167 AAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 226
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
L +A N K +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G I L
Sbjct: 227 LPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSI-L 285
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
S S+ + LLS ++ S +E+R+ ++TLN R + V F+KTGR+ S
Sbjct: 286 ISKRNPSL---VTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQ 342
Query: 300 DLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM 358
+ +E + FP ++D + R PS + +K F +E++++ +
Sbjct: 343 EGNIQEKIFTFP--WVDDRPVMLGARFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKV 400
Query: 359 --VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAK 415
+L+ A+ +D L+ AA+ A + F + S ++++ F P EL+++
Sbjct: 401 YALLKHQANSDDILK----AAFHAHVLLHFMNQSKDGTPRLVEQLDPAFAPTEYELESR 455
>gi|302764074|ref|XP_002965458.1| hypothetical protein SELMODRAFT_230646 [Selaginella moellendorffii]
gi|300166272|gb|EFJ32878.1| hypothetical protein SELMODRAFT_230646 [Selaginella moellendorffii]
Length = 430
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 108/359 (30%), Positives = 181/359 (50%), Gaps = 22/359 (6%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRP-TPAQ 120
++D + P G+P SV Y Y Q+R LQ+F A+ V +T+SLL A G+R +
Sbjct: 63 IMDFVRSHVVPEGFPDSVLPCYTPYMQWRCLQYFFGGAMGVFTTRSLLHALGVRNGASSS 122
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
A V+W++KDG +GK+I + G + D K+ R D L L LE+ + PQ FL
Sbjct: 123 AIAVNWVVKDGAGRIGKMIFARHGKKFDCHLKQIRFAGDVLMQLAAALELATSATPQFFL 182
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+A + N K +A VA+ +TR PIY +FA+ N+ D+ AKGE I + +++G G GI ++
Sbjct: 183 PLACIANIGKNVAAVASTSTRAPIYKAFARRENIGDITAKGECIGNIADLLGTGMGILMS 242
Query: 241 STVCSSMQGKMIVGPL--LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
+ P L+ +VYS E++A + TLN R + V F++TG++
Sbjct: 243 KNF-------PVFAPFSALAFGYVYSSFREVKAIQLPTLNRHRFGIAVDTFLETGKVPGL 295
Query: 299 ADLRYREDLLF-PGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSR-- 354
+ RE ++ P +E +++G R P + + LF E +++ +
Sbjct: 296 VEANERERIIVGPQWSLE---KLELGARVSDAFSAPQDYQYIDSLFKGENYLVSYNPKRQ 352
Query: 355 WTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQ 413
T +VL++ A+G+D LR AA+ + + D + + AY + +PL +
Sbjct: 353 RTYVVLKERANGDDVLR----AAFHGQLFVNIFDKCSSA-EGAYQESCKYISPLYEHFK 406
>gi|219129733|ref|XP_002185036.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403531|gb|EEC43483.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 627
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 118/416 (28%), Positives = 199/416 (47%), Gaps = 65/416 (15%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ----ATVVS 125
F PSG V Y R+ ++R LQ F +A L+V TQSLL A ++ + +Q + ++
Sbjct: 195 FLPSG----VTNDYYRFVRWRILQRFVNANLNVFGTQSLLLALRIKSSASQLGALSAALN 250
Query: 126 WILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
W+LKD + + +++ S +G R DS+ KRWR + ++ G GLE+++ + P LFL A
Sbjct: 251 WVLKDALGKIVRMLWASRMGRRFDSDAKRWRFRSSFVFAAGNGLEIITYVFPSLFLLWAT 310
Query: 185 LGNFAKGIAVVAARATRLPIYSSF--AKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
L N K I+++ + +TR IY+SF N+ D+ AKGEA + +++G+ +G+ L+ T
Sbjct: 311 LANCCKQISMLTSSSTRTSIYNSFRDGSRENIGDITAKGEAQIAIVDLLGIASGVTLSRT 370
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR-------- 294
V +S++ + V L I + V ++RA +N +R +VADF + +
Sbjct: 371 VGTSIRAVLAVYVTLQAIEIVCVYHQLRAVTYRVMNFERMISVVADFCQARQGPKDGLEG 430
Query: 295 ------------ISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELF 342
I +P L +E + P + + + G + P + L E+F
Sbjct: 431 LAASCTTPTPAGIPTPQTLASQERIFLPPKHLTRRA-IAFGSIGRARLSPDELGTLLEIF 489
Query: 343 AEEKFILCLG---------------------SRWTD---MVLEQNASGEDALRGWL-VAA 377
E+FIL +G SR + +VL + A+ D ++ L +
Sbjct: 490 KRERFILVVGKNVKHPRPFMAKTAKQNEDPVSRIQENCHIVLHEAATNMDIVKSTLALTL 549
Query: 378 YAASMAKSFHDPSLTVLQDAYDKM-------NDVFTPLLSELQAKGWHTD-RFLDG 425
+A S DPS + +D M ND+F LL E+ +GW + RF+ G
Sbjct: 550 LRRKLALSKFDPSQVRSSNCFDIMKVTQEETNDLFPLLLREMNTQGWESPARFMFG 605
>gi|226492898|ref|NP_001145431.1| uncharacterized protein LOC100278802 [Zea mays]
gi|195656119|gb|ACG47527.1| hypothetical protein [Zea mays]
gi|224035549|gb|ACN36850.1| unknown [Zea mays]
gi|413945393|gb|AFW78042.1| hypothetical protein ZEAMMB73_132907 [Zea mays]
Length = 511
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 114/363 (31%), Positives = 186/363 (51%), Gaps = 27/363 (7%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + A A
Sbjct: 119 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVSQSRAASGA 178
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + PQLFL
Sbjct: 179 VAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQLFLP 238
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
MA + N K +A V + +TR PIY ++AK N+ D+ AKGE++ + +++G G I ++
Sbjct: 239 MACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSILISK 298
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
+ S + LS ++ S E+R+ +NTLN R + V F++TG + S D
Sbjct: 299 SNPSLVTSFA----FLSCGYLLSSYHEVRSVVLNTLNRARFTVAVDSFIRTGYVPSLKDG 354
Query: 302 RYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM-- 358
+E + P E V +G R +P+ F +K LF +E++I+ +
Sbjct: 355 NSQETIFNPPWRHEP---VAIGSRFGDAFQEPASFLAIKPLFEDERYIVTYNPTKDKVYA 411
Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQD----------AYDKMNDVFTPL 408
+L+ A +D L+ AA+ A + F + S L YD +N F P
Sbjct: 412 LLKDQAKPDDILK----AAFHAHVLLHFINASHANLNARKRMNSNRSYQYDPLNMDFVPH 467
Query: 409 LSE 411
+ E
Sbjct: 468 IEE 470
>gi|242055389|ref|XP_002456840.1| hypothetical protein SORBIDRAFT_03g043876 [Sorghum bicolor]
gi|241928815|gb|EES01960.1| hypothetical protein SORBIDRAFT_03g043876 [Sorghum bicolor]
Length = 569
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 101/333 (30%), Positives = 173/333 (51%), Gaps = 20/333 (6%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
+ R D + P GYP SV+ Y+ Y+ +R +Q S VLSTQ+LL+A GL R
Sbjct: 161 WRRCADVAVQLLLPDGYPHSVSSDYMNYSLWRGVQGVASQISGVLSTQALLYAVGLGRGA 220
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A V+W+LKDG+ ++ K++ S G D PK WR+ AD L ++ GLE+L+P+ P
Sbjct: 221 IPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPH 280
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
LF+ + ++ A + ATR Y+ FA + N +++ AKGEA + VG+G GI
Sbjct: 281 LFVPIGAAAGASRSAASLIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGIGLGI 340
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
LA+ + SS+ +I ++ +H+Y ++ ++ + TLNP R +++ ++++ +G++ S
Sbjct: 341 ALANQIGSSVPLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPS 400
Query: 298 PADLRYREDLLFPGRLIEDAGNVKV---------GRDPHKVVKPSKFHELKE-------- 340
++ E L F L K+ R ++ SK E+ E
Sbjct: 401 VKEVNDEEPLFFNLSLGASCQQSKILSAEAKDAADRICRRLQMGSKLSEIIESKEDACAL 460
Query: 341 --LFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
L+ E+++L +VL++ +S ED L+
Sbjct: 461 FDLYKNEQYLLMDYKGKFCIVLKEGSSPEDMLK 493
>gi|357167308|ref|XP_003581100.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
distachyon]
Length = 604
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 102/333 (30%), Positives = 175/333 (52%), Gaps = 20/333 (6%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
+ R D L P GYP SV+ YL Y+ +R LQ S VLSTQ+LL+A GL +
Sbjct: 195 WRRCTDVALQLLLPDGYPNSVSSDYLHYSLWRGLQGVASQISGVLSTQALLYAVGLGKGV 254
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A ++W+LKDG+ ++ K++ S G D PKRWR+ AD L + GLE+L+P+ P
Sbjct: 255 IPTAAAINWVLKDGLGYLSKILLSKFGRHFDVNPKRWRLFADLLENTAYGLEILTPVFPH 314
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
LF+ + A+ A + ATR Y+ FA + N +++ AKGEA + +G+ GI
Sbjct: 315 LFVPIGAAAGAARSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFLGIMLGI 374
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
LA+ + SS+ +I ++V+H+Y ++ ++ + TLNP R +++ ++++ +G++SS
Sbjct: 375 ALANQIGSSVPLALISFAGVTVVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVSS 434
Query: 298 PADLRYREDL---------LFPGRLIEDAGNVKVGRDPHKVVKPSKFHE----------L 338
++ E L L +++ R ++ SK E L
Sbjct: 435 VKEVNDEEPLFLNSSMGAKLKQNKILSAEAKDAADRICRRLQLGSKLSEIIDNEEDACAL 494
Query: 339 KELFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
+L+ +E+++L +VL++ +S ED L+
Sbjct: 495 FDLYKDEQYLLLEYKGKFCVVLKEGSSPEDMLK 527
>gi|226491780|ref|NP_001140263.1| uncharacterized protein LOC100272306 [Zea mays]
gi|194698746|gb|ACF83457.1| unknown [Zea mays]
Length = 451
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 98/332 (29%), Positives = 173/332 (52%), Gaps = 19/332 (5%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
+ R + ++ P GYP SV+ YL Y+ +R +Q S VLSTQ+LL+A GL +
Sbjct: 44 WRRYAEVAVHLLLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGLGKGA 103
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A V+W+LKDG+ ++ K++ S G D PK WR+ AD L ++ GLE+L+P+ P
Sbjct: 104 IPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPH 163
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
LF+ + + A + ATR Y+ FA + N +++ AKGEA + VG+G GI
Sbjct: 164 LFVPIGAAAGAGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGIGLGI 223
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
LA+ + SS+ +I ++ +H+Y ++ ++ + TLNP R +++ ++++ +G++ S
Sbjct: 224 ALANHIGSSVPLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPS 283
Query: 298 PADLRYREDLLF--------PGRLIEDAGNVKVGRDPHKVVKPSKFHELKE--------- 340
++ E L F +++ R ++ SK E+ E
Sbjct: 284 VKEVNDEEPLFFNLSLGASRQSKILSAEAKDAADRICRRLQMGSKLSEIIESKEDACALF 343
Query: 341 -LFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
L+ E+++L +VL++ +S ED L+
Sbjct: 344 DLYKNEQYLLMDYKGKFCIVLKEGSSPEDMLK 375
>gi|242090561|ref|XP_002441113.1| hypothetical protein SORBIDRAFT_09g020620 [Sorghum bicolor]
gi|241946398|gb|EES19543.1| hypothetical protein SORBIDRAFT_09g020620 [Sorghum bicolor]
Length = 517
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 110/344 (31%), Positives = 184/344 (53%), Gaps = 18/344 (5%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
SF+ + P G+P SV Y+ Y +RA++HF A+ V +T++LL + G+ + A A
Sbjct: 125 SFIRSYVVPEGFPESVTPSYVPYMTWRAMKHFFGGAMGVFTTRTLLNSVGVSQSRAASGA 184
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + PQLFL
Sbjct: 185 VAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQLFLP 244
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
MA + N K +A V + +TR PIY ++AK N+ D+ AKGE++ + +++G G I ++
Sbjct: 245 MACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSILISK 304
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
+ S + + LS ++ S E+R+ +NTLN R + V F++TG + S D
Sbjct: 305 SNPSLVTSFAV----LSCGYLLSSYYEVRSVVLNTLNRARFTVAVDSFIRTGYVPSLKDG 360
Query: 302 RYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM-- 358
+E + P E V +G R +P+ F +K LF +E++I+ +
Sbjct: 361 NSQETIFNPPWRHEP---VAIGSRFGEAFQEPASFIAIKPLFEDERYIVTYNPTKDKVYA 417
Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMN 402
+L+ A +D L+ AA+ A + F + S L +A +MN
Sbjct: 418 LLKDQAKPDDILK----AAFHAHVLLHFINASHANL-NARKRMN 456
>gi|414879009|tpg|DAA56140.1| TPA: hypothetical protein ZEAMMB73_060175 [Zea mays]
Length = 593
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 98/332 (29%), Positives = 173/332 (52%), Gaps = 19/332 (5%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
+ R + ++ P GYP SV+ YL Y+ +R +Q S VLSTQ+LL+A GL +
Sbjct: 186 WRRYAEVAVHLLLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGLGKGA 245
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A V+W+LKDG+ ++ K++ S G D PK WR+ AD L ++ GLE+L+P+ P
Sbjct: 246 IPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPH 305
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
LF+ + + A + ATR Y+ FA + N +++ AKGEA + VG+G GI
Sbjct: 306 LFVPIGAAAGAGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGIGLGI 365
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
LA+ + SS+ +I ++ +H+Y ++ ++ + TLNP R +++ ++++ +G++ S
Sbjct: 366 ALANHIGSSVPLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPS 425
Query: 298 PADLRYREDLLF--------PGRLIEDAGNVKVGRDPHKVVKPSKFHELKE--------- 340
++ E L F +++ R ++ SK E+ E
Sbjct: 426 VKEVNDEEPLFFNLSLGASRQSKILSAEAKDAADRICRRLQMGSKLSEIIESKEDACALF 485
Query: 341 -LFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
L+ E+++L +VL++ +S ED L+
Sbjct: 486 DLYKNEQYLLMDYKGKFCIVLKEGSSPEDMLK 517
>gi|357474239|ref|XP_003607404.1| hypothetical protein MTR_4g077650 [Medicago truncatula]
gi|355508459|gb|AES89601.1| hypothetical protein MTR_4g077650 [Medicago truncatula]
Length = 490
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 103/332 (31%), Positives = 174/332 (52%), Gaps = 15/332 (4%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR----PTPA 119
SF+ + P G+P +V Y+ Y +RAL+HF A+ V +TQ+LL + G+ A
Sbjct: 99 SFITSYVVPEGFPHTVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLSSVGVSRNRATPAA 158
Query: 120 QATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
A ++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + P LF
Sbjct: 159 AAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFTGDLLMELGAGVELATAAMPHLF 218
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
L +A N K + V + +TR PIY +FAK N+ D+ AKGE + + +++G G I +
Sbjct: 219 LPLACAANVLKNVGAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGFSIMI 278
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
A S + LLS ++ S +E+++ ++TLN R ++ V F+KTG++ +
Sbjct: 279 A----KSNPSLITTFSLLSCGYILSSYKEVKSVVLHTLNSARFSVAVDSFLKTGQVPTLR 334
Query: 300 DLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM 358
+ ED+ FP + D V R PS + ++ LF +E++I+ + +
Sbjct: 335 EGNLNEDIFSFPWK---DRPVVLGSRIKDAFQSPSAYVAIEPLFDKERYIVTYNPSKSKV 391
Query: 359 --VLEQNASGEDALRGWLVAAYAASMAKSFHD 388
VL+ A +D L+ A ++ KS ++
Sbjct: 392 HAVLKDQAKSDDILKAAFHAHVLSNFIKSLNE 423
>gi|326490149|dbj|BAJ94148.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 519
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 110/344 (31%), Positives = 183/344 (53%), Gaps = 18/344 (5%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
SF+ + P G+P SV Y+ Y +RAL+HF A+SV +T++LL + G+ + A A
Sbjct: 123 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQSRATSGA 182
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
++WIL+DG VGK++ + G + D + K+ R D + +LG G+E+ + PQLFL
Sbjct: 183 VAINWILRDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLMMELGAGIELATAAFPQLFLP 242
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+A + N K +A V + +TR PIY ++AK N+ D+ AKGE++ + +++G G I L S
Sbjct: 243 LACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSI-LMS 301
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
S+ V LS ++ S +E+R+ +NTLN R + V F+KTG + S +
Sbjct: 302 KRNPSLVASFAV---LSCGYLLSSYQEVRSVVLNTLNTARFTVAVDSFIKTGHVPSLKEG 358
Query: 302 RYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM-- 358
E + P + V +G R +P+ F + LF +E++I+ +
Sbjct: 359 NLEETIFNPPWRHQP---VAIGSRFGEAFQEPASFVATRPLFEDERYIVTYNPAKDKVYA 415
Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMN 402
+L+ A +D L+ AA+ A + F + S L+ A +MN
Sbjct: 416 LLKDQAKQDDILK----AAFHAHVLLHFINASHANLK-ARKRMN 454
>gi|218196816|gb|EEC79243.1| hypothetical protein OsI_19997 [Oryza sativa Indica Group]
Length = 401
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 108/346 (31%), Positives = 182/346 (52%), Gaps = 20/346 (5%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + A A
Sbjct: 8 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVAQSRATSGA 67
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + PQLFL
Sbjct: 68 VAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQLFLP 127
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
MA + N K +A V + +TR PIY ++AK N+ D+ AKGE++ + +++G G I
Sbjct: 128 MACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGLSI---- 183
Query: 242 TVCSSMQGKMIVG-PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
+ S ++ LS ++ S E+R+ +NTLN R + V F+K+G + S +
Sbjct: 184 -LISKRNPSLVTSFAFLSCGYLLSSYHEVRSVVLNTLNTARFTVAVDSFIKSGHVPSLKE 242
Query: 301 LRYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM- 358
+E + P + V +G R +P+ F ++ LF +E++I+ +
Sbjct: 243 GNSQETIFNPPWRHQP---VAIGSRFGEAFQEPASFVAIRPLFEDERYIVTYNPTKDKVY 299
Query: 359 -VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMND 403
+L+ A +D L+ AA+ A + F + S L +A +MN
Sbjct: 300 ALLKDQAKPDDILK----AAFHAHVLLHFINASHANL-NARKRMNS 340
>gi|356522111|ref|XP_003529693.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 488
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 116/387 (29%), Positives = 196/387 (50%), Gaps = 26/387 (6%)
Query: 41 FQPDGHLSVKVVDDS-RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAA 99
F+ + L V + + R H V+ + P G+P SV Y+ Y +RAL+HF A
Sbjct: 72 FKKNSFLPVSLNSNHPRDPLHEVLSFVTSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGA 131
Query: 100 LSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRI 156
+ V +TQ+LL + G+ R P A ++WILKDG VGK++ + G + D + K+ R
Sbjct: 132 MGVFTTQTLLSSVGVCRNRAAPG-AVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRF 190
Query: 157 LADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSD 216
D L +LG G+E+ + P LFL +A N K +A V + +TR PIY +FAK N+ D
Sbjct: 191 TGDLLMELGAGVELATAAVPHLFLPLACAANVLKNVAAVTSTSTRTPIYKAFAKGENIGD 250
Query: 217 LFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNT 276
+ AKGE + + +++G G I +A S + LLS ++ S E+++ ++T
Sbjct: 251 VTAKGECVGNIADLLGTGLSILIAKRNPS----LVTTFSLLSCGYILSSYREVKSVVLHT 306
Query: 277 LNPQRTAMIVADFVKTGRISSPADLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKF 335
LN R ++ V F+ TG++ + + E++ FP + D V R PS +
Sbjct: 307 LNCGRFSVAVEHFLMTGQVPTLQEGNMNENIFSFPWK---DRPVVLGSRIKEAFQDPSAY 363
Query: 336 HELKELFAEEKFILCLGSRWTDM--VLEQNASGEDALRGWLVAAYAA-----SMAKSFHD 388
++ LF E++I+ + VL+ A +D L+ AA+ A S+ KS ++
Sbjct: 364 VAIEPLFDRERYIVTYNPSKHKVYAVLKDQAKSDDILK----AAFHAHVLFFSLMKSLNE 419
Query: 389 PSLTVLQDAYDKMNDVFTPLLSELQAK 415
+ L+ D N T +++++A+
Sbjct: 420 NKASSLKQREDLSN--MTHTVADIEAR 444
>gi|297604521|ref|NP_001055571.2| Os05g0419200 [Oryza sativa Japonica Group]
gi|53982666|gb|AAV25645.1| unknown protein [Oryza sativa Japonica Group]
gi|222631625|gb|EEE63757.1| hypothetical protein OsJ_18576 [Oryza sativa Japonica Group]
gi|255676375|dbj|BAF17485.2| Os05g0419200 [Oryza sativa Japonica Group]
Length = 415
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 108/345 (31%), Positives = 182/345 (52%), Gaps = 18/345 (5%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + A A
Sbjct: 22 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVAQSRATSGA 81
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + PQLFL
Sbjct: 82 VAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQLFLP 141
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
MA + N K +A V + +TR PIY ++AK N+ D+ AKGE++ + +++G G I ++
Sbjct: 142 MACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGLSILISK 201
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
S + LS ++ S E+R+ +NTLN R + V F+K+G + S +
Sbjct: 202 RNPSLVTSFA----FLSCGYLLSSYHEVRSVVLNTLNTARFTVAVDSFIKSGHVPSLKEG 257
Query: 302 RYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM-- 358
+E + P + V +G R +P+ F ++ LF +E++I+ +
Sbjct: 258 NSQETIFNPPWRHQP---VAIGSRFGEAFQEPASFVAIRPLFEDERYIVTYNPTKDKVYA 314
Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMND 403
+L+ A +D L+ AA+ A + F + S L +A +MN
Sbjct: 315 LLKDQAKPDDILK----AAFHAHVLLHFINASHANL-NARKRMNS 354
>gi|356563749|ref|XP_003550122.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 497
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 110/358 (30%), Positives = 184/358 (51%), Gaps = 23/358 (6%)
Query: 60 HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RP 116
H V+ + P G+P SV Y+ Y +RAL+HF A+ V +TQ+LL + G+ R
Sbjct: 100 HEVLAFVTSYVVPEGFPDSVIPSYVPYMTWRALKHFFGGAMGVFTTQTLLSSVGVSRNRA 159
Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
P A ++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + P
Sbjct: 160 APG-AVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 218
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
LFL +A N K +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G
Sbjct: 219 HLFLPLACAANVLKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 278
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
I +A S + LLS ++ S E+++ ++TLN R ++ V F++TG++
Sbjct: 279 ILIAKRNPS----LVTTFSLLSCGYILSSYREVKSVVLHTLNCGRFSVAVESFLRTGQVP 334
Query: 297 SPADLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRW 355
+ + E++ FP + D V R PS + ++ LF E++I+
Sbjct: 335 TLQEGNMNENIFSFPWK---DRPVVLGSRIKEAFQDPSAYFAIEPLFDRERYIVTYNPSK 391
Query: 356 TDM--VLEQNASGEDALRGWLVAAYAA-----SMAKSFHDPSLTVLQDAYDKMNDVFT 406
+ VL+ A +D L+ AA+ A ++ KS+++ + + L+ D N T
Sbjct: 392 HKVYAVLKDQAKSDDILK----AAFHAHVLLFNLIKSWNESNASSLKQREDLSNMTHT 445
>gi|449431896|ref|XP_004133736.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
gi|449478104|ref|XP_004155224.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 507
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 114/363 (31%), Positives = 189/363 (52%), Gaps = 26/363 (7%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQ 120
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +TQ+LL + G+ + TP
Sbjct: 118 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVARNKATPG- 176
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
A ++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + P LFL
Sbjct: 177 AIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAAPHLFL 236
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+A N AK ++ V + +TR PIY +FAK N+ D+ AKGE + + +++G G I +
Sbjct: 237 PLACAANVAKNVSAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI- 295
Query: 241 STVCSSMQGKMIVGP--LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
S + +V LLS +++S +E+R+ ++TLN R + V F+KTGR+ S
Sbjct: 296 -----SKRNPSLVTTFGLLSCGYIFSSYQEVRSVVLHTLNRARFNVAVETFLKTGRVPSL 350
Query: 299 ADLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTD 357
E +L FP +++ V R F ++ LF EK+++
Sbjct: 351 QKGNMNERILSFP--WLKENPIVLGPRFKDAFQDAGSFLAIEPLFDREKYLVTYNQTKGK 408
Query: 358 M--VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFT---PLLSEL 412
+ +L+ A +D ++ AA+ A + F S + Q++ K D F+ P + L
Sbjct: 409 VYALLKDQAKSDDIIK----AAFHAHVLLHFIRSS-SGAQNSNQKQVDAFSSSVPTTTNL 463
Query: 413 QAK 415
+A+
Sbjct: 464 EAE 466
>gi|168019221|ref|XP_001762143.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686547|gb|EDQ72935.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 390
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 191/371 (51%), Gaps = 32/371 (8%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR--PTPAQA 121
SF+ + P YP SV Y Y Q+RA+Q+ A+SV +T+SL+ A G+ + + A
Sbjct: 18 SFVKSYIVPENYPDSVAPSYTPYMQWRAVQYLFGGAMSVFTTRSLMHALGVSRGASASSA 77
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
V+W++KDG +GK+I + G + D + K+ R L LG G+E+ + P LFL
Sbjct: 78 VAVNWVIKDGAGRIGKMIFARHGKKFDVDLKQLRFKGAMLMQLGAGVELATMAVPHLFLP 137
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI---- 237
+A N AK +A V + +TR PIY +FA+ N+ D+ AKGE IS + +++G G GI
Sbjct: 138 LACAANVAKNVAAVTSSSTRAPIYKAFARRENIGDVTAKGECISNIADLMGTGLGIFISK 197
Query: 238 ---QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
L +T C +LS ++ S E+++ + TLN R ++V F++TG+
Sbjct: 198 KSPSLVATFC-----------VLSCGYMISSFNEIKSVRLATLNRARFGVVVQSFLETGK 246
Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILC--L 351
+ S D RE ++ L +++G R P +F +E F +EK+++ L
Sbjct: 247 VPSVEDANRRESVI---TLPWQEKPLELGSRVAQAYSNPKEFLRAREQFQKEKYLVTYSL 303
Query: 352 GSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLT-----VLQDAYDKMNDVFT 406
+ +V++++AS +D +R A + + +T +++++ +D++
Sbjct: 304 KKKRAYVVIKESASSDDVVRATFQAHVMLHILRGLRSTLITSNVGAIVEESTKLTHDLYD 363
Query: 407 PLLSELQAKGW 417
E +++GW
Sbjct: 364 SFKEEAKSQGW 374
>gi|222619643|gb|EEE55775.1| hypothetical protein OsJ_04341 [Oryza sativa Japonica Group]
Length = 508
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 95/292 (32%), Positives = 162/292 (55%), Gaps = 13/292 (4%)
Query: 63 VDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTP 118
V +F+ + P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + TP
Sbjct: 118 VMAFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTP 177
Query: 119 AQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
A ++WILKDG VGK++ + G + D + K+ R +D L ++G G+E+ + PQ
Sbjct: 178 G-AIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELATAAFPQF 236
Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
FL MA + N K +A V + +TR PIY ++A+ N+ D+ AKGE++ + +++G G I
Sbjct: 237 FLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIF 296
Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
++ S + LS +++S E+R+ +NTLN R + V F+KTG I S
Sbjct: 297 ISKRNPSLVTSFA----FLSCGYLWSSYHEVRSVVLNTLNRARFTVAVDSFIKTGHIPSL 352
Query: 299 ADLRYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFIL 349
+ +E + P E V +G R +P+ F ++ LF +E++++
Sbjct: 353 KEGNSQETIFNPPWRHEP---VAIGSRFGEAFQEPASFVAIRPLFEDERYMV 401
>gi|56784316|dbj|BAD82242.1| unknown protein [Oryza sativa Japonica Group]
gi|56785239|dbj|BAD82127.1| unknown protein [Oryza sativa Japonica Group]
Length = 508
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 95/292 (32%), Positives = 162/292 (55%), Gaps = 13/292 (4%)
Query: 63 VDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTP 118
V +F+ + P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + TP
Sbjct: 118 VMAFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTP 177
Query: 119 AQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
A ++WILKDG VGK++ + G + D + K+ R +D L ++G G+E+ + PQ
Sbjct: 178 G-AIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELATAAFPQF 236
Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
FL MA + N K +A V + +TR PIY ++A+ N+ D+ AKGE++ + +++G G I
Sbjct: 237 FLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIF 296
Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
++ S + LS +++S E+R+ +NTLN R + V F+KTG I S
Sbjct: 297 ISKRNPSLVTSFA----FLSCGYLWSSYHEVRSVVLNTLNRARFTVAVDSFIKTGHIPSL 352
Query: 299 ADLRYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFIL 349
+ +E + P E V +G R +P+ F ++ LF +E++++
Sbjct: 353 KEGNSQETIFNPPWRHEP---VAIGSRFGEAFQEPASFVAIRPLFEDERYMV 401
>gi|414879331|tpg|DAA56462.1| TPA: hypothetical protein ZEAMMB73_966463 [Zea mays]
gi|414879332|tpg|DAA56463.1| TPA: hypothetical protein ZEAMMB73_966463 [Zea mays]
Length = 503
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 93/282 (32%), Positives = 157/282 (55%), Gaps = 12/282 (4%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWIL 128
P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + TP A ++WIL
Sbjct: 117 PEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTPG-AIAINWIL 175
Query: 129 KDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
KDG VGK++ + G + D + K+ R +D L ++G G+E+ + PQ FL MA + N
Sbjct: 176 KDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELTTAAFPQFFLPMACVANV 235
Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
K +A V + +TR PIY ++A+ N+ D+ AKGE++ + +++G G I ++ S +
Sbjct: 236 VKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIFISKRNPSLVT 295
Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
LS ++ S +E+R+ +NTLN R + V F+KTG I S + +E +
Sbjct: 296 SFA----FLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGHIPSLKEGNSQETIF 351
Query: 309 FPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFIL 349
P E V +G R +P+ F ++ LF +E++++
Sbjct: 352 NPPWRHEP---VAIGSRFGEAFQEPASFLAIRPLFEDERYMV 390
>gi|357133637|ref|XP_003568430.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1 [Brachypodium
distachyon]
Length = 521
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 97/289 (33%), Positives = 156/289 (53%), Gaps = 11/289 (3%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
SF+ + P G+P SV Y+ Y +RAL+HF A+SV +T++LL + G+ + A A
Sbjct: 125 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQSRATSGA 184
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
++WILKDG VGK++ + G + D + K+ R D L +LG G+E+ + P LFL
Sbjct: 185 VAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPHLFLP 244
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+A + N K +A V + +TR PIY ++AK N+ D+ AKGE++ + +++G G I L S
Sbjct: 245 LACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSI-LIS 303
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
S+ LS ++ S E+R+ +NTLN R + V F+KTG + S +
Sbjct: 304 KRNPSLVASFA---FLSCGYLLSSYREVRSVVLNTLNTARFTVAVDSFIKTGHVPSLKEG 360
Query: 302 RYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFIL 349
E + P + V +G R +P+ F LF +E++I+
Sbjct: 361 NLEETIFNPPWRHQP---VAIGSRFGEAFQEPASFVSTSPLFEDERYIV 406
>gi|168024902|ref|XP_001764974.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683783|gb|EDQ70190.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 191/371 (51%), Gaps = 32/371 (8%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQAT 122
SF+ + P GYP SV Y Y Q+RA+Q+F A+SV +T+SL+ A G+ R T A +
Sbjct: 34 SFVKAYIVPEGYPDSVAPSYTPYMQWRAVQYFFGGAMSVFTTRSLMHALGVSRGTSASSA 93
Query: 123 V-VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
V V+W++KDG +GK++ + G + D + K+ R L LG G+E+ + P LFL
Sbjct: 94 VAVNWVIKDGAGRIGKMLFARHGKKFDVDLKQLRFKGAMLMQLGAGVELTTMAVPHLFLP 153
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI---- 237
+A N AK +A V + +TR PIY +FA+ N+ D+ AKGE IS + +++G G GI
Sbjct: 154 LACAANVAKNVAAVTSSSTRAPIYKAFARRENIGDVTAKGECISNIADLLGTGLGIFISK 213
Query: 238 ---QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
L +T C +LS ++ S E+++ + TLN R ++V F++TG+
Sbjct: 214 RSPSLVATFC-----------VLSCGYMISSFNEIKSVRLATLNRARFGVVVQSFLETGQ 262
Query: 295 ISSPADLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCL-- 351
+ S D RE ++ P + +VG P +F +E F +E +++
Sbjct: 263 VPSIEDANRRESVITLPWQEKPLELGSRVG---QAFSNPREFIRTQEQFQKENYLVTYRP 319
Query: 352 GSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDP-SLTVLQDAY----DKMNDVFT 406
+ +V++++AS +D +R A M + + + ++DA + D++
Sbjct: 320 NKKRAYVVIKESASSDDVVRATFQAHVMLHMLRGLEGTITNSNVEDAVAESTKRTYDLYD 379
Query: 407 PLLSELQAKGW 417
E ++KGW
Sbjct: 380 SFKEEAKSKGW 390
>gi|226502346|ref|NP_001144348.1| uncharacterized protein LOC100277253 [Zea mays]
gi|195640526|gb|ACG39731.1| hypothetical protein [Zea mays]
Length = 503
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 93/282 (32%), Positives = 157/282 (55%), Gaps = 12/282 (4%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWIL 128
P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + TP A ++WIL
Sbjct: 117 PEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTPG-AIAINWIL 175
Query: 129 KDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
KDG VGK++ + G + D + K+ R +D L ++G G+E+ + PQ FL MA + N
Sbjct: 176 KDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELTTAAFPQFFLPMACVANV 235
Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
K +A V + +TR PIY ++A+ N+ D+ AKGE++ + +++G G I ++ S +
Sbjct: 236 VKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIFISKRNPSLVT 295
Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
LS ++ S +E+R+ +NTLN R + V F+KTG I S + +E +
Sbjct: 296 SFA----FLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGHIPSLKEGNSQETIF 351
Query: 309 FPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFIL 349
P E V +G R +P+ F ++ LF +E++++
Sbjct: 352 NPPWRHEP---VAIGSRFGEAFQEPASFLAIRPLFEDERYMV 390
>gi|357126248|ref|XP_003564800.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
distachyon]
Length = 506
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 97/307 (31%), Positives = 165/307 (53%), Gaps = 14/307 (4%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWIL 128
P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + TP A ++WIL
Sbjct: 123 PEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTPG-AIAINWIL 181
Query: 129 KDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
KDG VGK++ + G + D++ K+ R +D L ++G G+E+ + PQ FL MA + N
Sbjct: 182 KDGAGRVGKMLFARQGKKFDNDLKQLRFSSDLLLEIGAGIELATAAFPQFFLPMACVANV 241
Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
K +A V + +TR PIY ++A+ N+ D+ AKGE++ + +++G G I + S +
Sbjct: 242 VKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIYITKRNPSLVT 301
Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL- 307
LLS ++ S +E+R+ +NTLN R + V F+KTG + S + +E L
Sbjct: 302 SFA----LLSCGYLMSSYQEVRSVVLNTLNRARFTVAVDSFIKTGYVPSLKEGNSQETLF 357
Query: 308 LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM--VLEQNAS 365
+ P R A + G +P F + LF +E++++ + +L+ A
Sbjct: 358 IHPWRHEPVAIGSRFG---EAFQEPVSFVATRPLFEDERYMVTYNPTKDKVYALLKDQAK 414
Query: 366 GEDALRG 372
+D L+
Sbjct: 415 PDDVLKA 421
>gi|225462223|ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257731 [Vitis vinifera]
Length = 713
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 95/306 (31%), Positives = 163/306 (53%), Gaps = 12/306 (3%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQATVV 124
FL P G+P SV YL YT +R +Q S VL+TQ+LL+A GL + A V
Sbjct: 285 FLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAV 344
Query: 125 SWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
+W+LKDG+ ++ K++ S G D PK WR+ AD L + GLE+L+P P FL +
Sbjct: 345 NWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGA 404
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
+ + A + +TR Y+ FA + N +++ AKGEA + +G+ GI LA+ +
Sbjct: 405 VAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIG 464
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
SS +++ +H++ ++ ++ + TLNP R +++ ++++ +G++ S ++
Sbjct: 465 SSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVN-E 523
Query: 305 EDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNA 364
E+ LFP + +A + +K E K+ AE + L LGS+ +++V +
Sbjct: 524 EEPLFPVVPLLNA------KPTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVV----S 573
Query: 365 SGEDAL 370
S ED L
Sbjct: 574 SKEDVL 579
>gi|255569810|ref|XP_002525869.1| conserved hypothetical protein [Ricinus communis]
gi|223534874|gb|EEF36563.1| conserved hypothetical protein [Ricinus communis]
Length = 520
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 108/339 (31%), Positives = 183/339 (53%), Gaps = 14/339 (4%)
Query: 40 QFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSA 98
+F+ + SV + + PV + SF+ + P G+P SV Y+ Y +RAL+HF
Sbjct: 106 RFKKNSFRSVSLQNPQAPVEELM--SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGG 163
Query: 99 ALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRI 156
A+ V +T++LL + G+ + A A ++WILKDG VGK++ + G + D + K+ R
Sbjct: 164 AMGVFTTKTLLSSVGVSKSTAIPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRF 223
Query: 157 LADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSD 216
D L +LG G+E+ + P LFL +A N AK +A V + +TR PIY +FAK N+ D
Sbjct: 224 AGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGD 283
Query: 217 LFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNT 276
+ AKGE + + +++G G I ++ S + LLS +V+S +E+++ ++T
Sbjct: 284 VTAKGECVGNVADLLGTGLSIMISKRNPS----LVTTFALLSCGYVFSSYQEVKSVVLHT 339
Query: 277 LNPQRTAMIVADFVKTGRISSPADLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKF 335
LN R ++ V F+KTGR+ S + RE++ FP +++ V R PS F
Sbjct: 340 LNRARFSVAVESFLKTGRVPSLQEGNTRENIFSFP--WLKERPIVIGSRFKDAFQDPSAF 397
Query: 336 HELKELFAEEKFILCLGSRWTDM--VLEQNASGEDALRG 372
++ LF +E++I+ + +L+ A +D L+
Sbjct: 398 LAIEPLFEKERYIVTYNPTKDKVYALLKDQAKADDILKA 436
>gi|296082804|emb|CBI21809.3| unnamed protein product [Vitis vinifera]
Length = 537
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 95/306 (31%), Positives = 163/306 (53%), Gaps = 12/306 (3%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQATVV 124
FL P G+P SV YL YT +R +Q S VL+TQ+LL+A GL + A V
Sbjct: 83 FLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAV 142
Query: 125 SWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
+W+LKDG+ ++ K++ S G D PK WR+ AD L + GLE+L+P P FL +
Sbjct: 143 NWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGA 202
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
+ + A + +TR Y+ FA + N +++ AKGEA + +G+ GI LA+ +
Sbjct: 203 VAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIG 262
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
SS +++ +H++ ++ ++ + TLNP R +++ ++++ +G++ S ++
Sbjct: 263 SSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVN-E 321
Query: 305 EDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNA 364
E+ LFP + +A + +K E K+ AE + L LGS+ +++V +
Sbjct: 322 EEPLFPVVPLLNA------KPTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVV----S 371
Query: 365 SGEDAL 370
S ED L
Sbjct: 372 SKEDVL 377
>gi|356540900|ref|XP_003538922.1| PREDICTED: uncharacterized protein LOC100786144 [Glycine max]
Length = 592
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 97/314 (30%), Positives = 163/314 (51%), Gaps = 12/314 (3%)
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RP 116
V+ + D F P G+P SV YL Y+ +RA+Q VL+TQSLL+A GL +
Sbjct: 173 VWLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKG 232
Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
A ++W+LKDG+ ++ K++ SN G D PK WR+ AD L + GLE+ +P CP
Sbjct: 233 AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMSTPACP 292
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
Q F+ + + ++ A + +TR ++ FA + N +++ AKGE +G+ G
Sbjct: 293 QFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIVLG 352
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
I L + + SS + +L+ IH+Y ++ ++ + TLNP R +++ ++++ +G+ +
Sbjct: 353 IGLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQ-A 411
Query: 297 SPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWT 356
P E+ LFP I +A K + E K+ AE + L LGS+ +
Sbjct: 412 PPVKEVNDEEPLFPAVPILNATFAS------KAQSFALSSEAKDAAAEIEHRLQLGSKLS 465
Query: 357 DMVLEQNASGEDAL 370
++V S ED L
Sbjct: 466 EIV----NSKEDVL 475
>gi|38344695|emb|CAE02373.2| OSJNBb0096E05.17 [Oryza sativa Japonica Group]
Length = 593
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 98/342 (28%), Positives = 172/342 (50%), Gaps = 38/342 (11%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
+ R + P GYP SV+ YL+Y+ +R +Q S VLSTQ+LL+A GL +
Sbjct: 185 WRRCATVAVQLLLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGLGKGA 244
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A V+W+LKDG+ ++ K++ S G D PK WR+ AD L + GLE+L+P+ P
Sbjct: 245 IPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTPVFPH 304
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
LF+ + + A + ATR Y+ FA + N +++ AKGEA + +G+ GI
Sbjct: 305 LFVPIGAAAGAGRSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFLGIMLGI 364
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
LA+ + SS+ +I ++ +H+Y ++ ++ + TLNP R +++ ++++ +G++ S
Sbjct: 365 ALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPS 424
Query: 298 PADLRYREDLLFPGRLIEDAGNVKVG--RDPHKVVKP----------------SKFHELK 339
++ E L N+ +G R K++ P SK E+
Sbjct: 425 VKEVNDEEPLFL---------NLSIGTSRKESKILSPQAKDAAEIICRRLQLGSKLSEII 475
Query: 340 E----------LFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
E L+ E+++L ++L++ +S ED L+
Sbjct: 476 ENKEDACALFDLYKNEQYLLTEYKGKFCVILKEGSSPEDMLK 517
>gi|452823170|gb|EME30183.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
Length = 519
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 103/361 (28%), Positives = 178/361 (49%), Gaps = 14/361 (3%)
Query: 29 FETSDSVTRHYQFQP-DGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYT 87
+ET Y P +G L + DSR V+ +++D + F P +V++ Y +T
Sbjct: 128 YETCTHQRIDYFLNPSNGDLLIAPSIDSRTVWRKILDLLNSSFLPENCKKTVSKDYFWFT 187
Query: 88 QFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLIC-SNLG 144
F LQ+ + + V+STQ LL A GL + ++ ++W+LKDG+ +G ++ S +G
Sbjct: 188 TFNFLQNVVNCCIGVISTQQLLRAVGLSASQGIGKSASLNWVLKDGIGRLGAIVFGSFIG 247
Query: 145 ARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPI 204
R D++PKRWR+ D LY G G E++SPL P+ FL +A L N K + + T I
Sbjct: 248 NRFDADPKRWRLWGDILYAFGIGTEIISPLLPRYFLLVASLANMVKATSYMLRLPTTAAI 307
Query: 205 YSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYS 264
SFAK N D+ AK + L N+ G GI + + S + + ++
Sbjct: 308 RRSFAKRENFGDISAKANSQEVLSNLFGTFLGIGFSIILGHSWWQLSVAYFVYFIMFFLL 367
Query: 265 VIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP-----GRLIEDAGN 319
++ + TLN QR+ ++ + K+ ++S+ ++ YRE+++ P GR I +
Sbjct: 368 NYLGVKGLHLRTLNLQRSLLLGWSYWKSSQVSTVEEVNYRENVILPSIMSFGRYIRFGVS 427
Query: 320 VKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYA 379
+ RD H + S+ L + + + K++L + +V A +D L+ L A +
Sbjct: 428 L---RDIH--LTGSELAALLDKYRQAKYLLTVRHGQVYIVFHMEARVKDELQAILQATFL 482
Query: 380 A 380
A
Sbjct: 483 A 483
>gi|301622642|ref|XP_002940632.1| PREDICTED: UPF0420 protein C16orf58 [Xenopus (Silurana) tropicalis]
Length = 473
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 121/412 (29%), Positives = 195/412 (47%), Gaps = 44/412 (10%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
Y + D F++ P GYP SV++ YL Y + +Q F+S+ L+TQSLL +G+ +
Sbjct: 62 YRSLRDFFVSLLLPHGYPDSVSKDYLAYQLWDTIQAFSSSVTGSLATQSLLRGSGVGDST 121
Query: 119 AQATVVS--WILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A + WIL+DG VG+++ + + G+R+D + KRWR+ AD L D+ +E+++P+
Sbjct: 122 ATVAAATITWILRDGTGMVGRILFAWMKGSRLDCDAKRWRLFADLLNDIAIFMEIMAPVF 181
Query: 176 PQLF-LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
P F L + G F K I VA ATR I AK N++D+ AK + TL N+ GL
Sbjct: 182 PSCFTLTVCAAGVF-KCIVGVAGGATRAAITVHQAKRDNMADVCAKDGSQETLVNLAGLL 240
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
+ L V SM ++ LL+++H+Y+ +R+ + TLN R ++++ ++K G+
Sbjct: 241 VSLFLVPLVSDSMMATYLLFFLLTILHLYANYRAVRSVIMETLNQSRLSIVLHHYLKEGK 300
Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSR 354
I P RE LL PG + + +G + S EL++L + + LG R
Sbjct: 301 ILDPEAANLREPLL-PG--LGSQAPISLGVPLSYLA--SSVPELEQLKKDNHSLYLLGWR 355
Query: 355 WTD----MVLEQNASGEDALRGWL----------------------------VAAYAASM 382
+VL + AS D +R + V +
Sbjct: 356 EDTGSLAVVLHERASSVDMIRACVHAEMLHRKTSSSDHHKSSSFPILSALQCVRGRGSRP 415
Query: 383 AKSFHDPS-LTVLQDAYDKMNDVFTPLLSELQAKGWHTDR-FLDGTGTRFAW 432
F T L+ ++++N +F L L A GW T+R LD R W
Sbjct: 416 CTGFEQHGRFTALEQTHNQINSMFPTFLQGLTASGWVTNRNLLDSDDWRANW 467
>gi|452819618|gb|EME26673.1| glutamate N-acetyltransferase isoform 2 [Galdieria sulphuraria]
gi|452819619|gb|EME26674.1| glutamate N-acetyltransferase isoform 1 [Galdieria sulphuraria]
Length = 540
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 97/345 (28%), Positives = 177/345 (51%), Gaps = 7/345 (2%)
Query: 54 DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
+ + ++ ++ S F P G+P SV + YL+Y+ +R +Q+ S VLSTQ+LL AAG
Sbjct: 128 NRQNLWEQITSSLSTFFLPEGFPESVGDSYLQYSIWRGIQNIISNISQVLSTQALLLAAG 187
Query: 114 LRPTPAQATVVSWIL--KDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVL 171
+ T ++ + KDG+ VG+L + LG + D++PKRWR+++D LYD+G LE+L
Sbjct: 188 VGRTHSETLAAATAWAWKDGIGQVGRLFTAVLGNQYDADPKRWRLVSDLLYDIGLSLEIL 247
Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
SP + FL MA LGN +K IA + R + +F + NL ++ AK +A + + ++
Sbjct: 248 SPAFSKYFLFMAALGNISKSIAFAIGISCRYSVLKTFVRRENLGEISAKNDAQNVVTGML 307
Query: 232 GLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
G G+ ++ + ++ + ++ V + I+ + + M+ A + T+N QR ++ ++
Sbjct: 308 GTLIGVWMSKCLPAAPKVRLGVFLFFTAIYSFFNYKSMQVAELTTINRQRGNILAHSYLM 367
Query: 292 TGRISSPADLRYREDL--LFPGRLIEDAGNVKVGRDPHKV-VKPSKFHELKELFAEEKFI 348
+ S + E LFP + +V++G ++ + K F++EK+I
Sbjct: 368 FQNVPSVSMSNSYERFVPLFP--TVFQYPHVRLGVGLTELSISIEKIANYTRQFSKEKYI 425
Query: 349 LCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTV 393
L L + L +N D L+ L A+ + P T+
Sbjct: 426 LSLSDGEIRVALHRNHESVDLLQLLLHIAFIRDEIQRRLGPQCTI 470
>gi|432867528|ref|XP_004071227.1| PREDICTED: UPF0420 protein C16orf58 homolog [Oryzias latipes]
Length = 423
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 110/372 (29%), Positives = 180/372 (48%), Gaps = 14/372 (3%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPA 119
+V +F + F P GYP SV+E YL+Y + LQ F S L TQ+ L G+ +
Sbjct: 41 LVGAFKSVFLPQGYPESVSEDYLQYQFWDTLQAFASTLSWTLVTQASLKGVGVGNQDATV 100
Query: 120 QATVVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
A V+WILKDG G+++ + G+R+DSE K+WR+ AD L D+ +E+L+P P
Sbjct: 101 AAATVTWILKDGTGMFGRILFAWQKGSRLDSEAKKWRLFADVLNDIAMFIEILAPFFPSF 160
Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
F + K I VA ATR + A+ N++D+ AK + TL N+ GL +
Sbjct: 161 FTLIVCTAGVFKAIVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLVVSLI 220
Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
L + + + + L +++H+++ + +R+ + T N R ++++ ++K RI SP
Sbjct: 221 LIPLITDNSSLSLGLFFLFTILHLFANYKAVRSVVMETFNQARLSIVLQQYLKDKRILSP 280
Query: 299 ADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK-PSKFHELKELFAEEKFILCLGSRWTD 357
+ RE + ++ +K+G VV+ P FH L ++L + +
Sbjct: 281 FEANRREPVFLEFKITTP---IKLGVKLQDVVQSPEDFH-LALRNNSMAYLLGVRNGCIC 336
Query: 358 MVLEQNASGEDALRGWLVAAYAASMAK--SFHDPSLTVLQDAYDKMNDV----FTPLLSE 411
+ L AS D +R A ++M K SF S L + K +V F L
Sbjct: 337 VCLGSGASVHDEIRAACQALCLSNMLKSPSFSQTSKQSLWEMVQKSQEVIDANFHTFLKG 396
Query: 412 LQAKGWHTDRFL 423
++A GW T R L
Sbjct: 397 VEAAGWDTKRTL 408
>gi|307105811|gb|EFN54059.1| hypothetical protein CHLNCDRAFT_8919, partial [Chlorella
variabilis]
Length = 325
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 98/314 (31%), Positives = 164/314 (52%), Gaps = 17/314 (5%)
Query: 60 HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPT 117
R+ D + P +P SV Y Y + R++Q+F A+SV +T+SLL + G+ + +
Sbjct: 1 QRLGDLLADYLLPQDFPHSVAPQYSDYMRARSVQYFFGGAMSVFTTRSLLASLGVANKHS 60
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A ++W++KDG +G+ + + G +D E K++R++ D L + G LE+ + L P+
Sbjct: 61 SEAAAAINWVVKDGAGRLGRFLFARWGRELDCELKQFRLMGDVLMETGAALELSTVLMPR 120
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
+FL +A N AK +A V A +TR PIY +FAK+ NL+D+ AKGE+++ L +VVG GI
Sbjct: 121 MFLPLACTANLAKNLAAVTASSTRAPIYRTFAKQNNLADVTAKGESVANLADVVGTAFGI 180
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVI---EEMRAAPVNTLNPQRTAMIVADFVKTGR 294
LA + + V P + + V +I E+ + + LN R + F TGR
Sbjct: 181 ALA-------KANLPVLPTFAALSVGYLIASRREVDSVVLPYLNRARLSYTTRAFYSTGR 233
Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK-PSKFHELKELFAEEKFILCL-- 351
+ + YRE L+ P + G V +G + P + H+ F+ ++ L
Sbjct: 234 VPETLEGNYREPLM-PWSDPHN-GRVVLGATVEEACAGPQQLHDALAAFSGRQYALTYRP 291
Query: 352 GSRWTDMVLEQNAS 365
+R +L+Q AS
Sbjct: 292 DTRKCYALLKQGAS 305
>gi|449455818|ref|XP_004145647.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 611
Score = 147 bits (372), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 170/354 (48%), Gaps = 28/354 (7%)
Query: 43 PDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSV 102
P LS V+ V+ R D F P G+P SV YL Y+ +R +Q S V
Sbjct: 188 PSSSLSFSFVN----VWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGV 243
Query: 103 LSTQSLLFAAGL-RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADAL 161
L+TQ+LL+A GL + A V+W+LKDG ++ K+ S G D PK WR+ AD L
Sbjct: 244 LATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLL 303
Query: 162 YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKG 221
+ G+E+L+P P F+ + + A + ATR Y+ FA + N +++ AKG
Sbjct: 304 ENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKG 363
Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
EA + +G+ GI LA+ + SS + ++++IH++ ++ ++ + TLNP R
Sbjct: 364 EAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYR 423
Query: 282 TAMIVADFVKTGRISSPADLRYRE------------------DLLFPGRLIEDAGNV--- 320
+++ ++++ +G + S D+ E D L E A N+
Sbjct: 424 ASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKAPDWSRDFLLSAEAKESAANIEKR 483
Query: 321 -KVGRDPHKVVKPSK-FHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRG 372
++G V + EL LF +E +IL ++L+++AS D L+
Sbjct: 484 LQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKA 537
>gi|359486217|ref|XP_003633414.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
gi|297739449|emb|CBI29631.3| unnamed protein product [Vitis vinifera]
Length = 504
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 106/329 (32%), Positives = 176/329 (53%), Gaps = 18/329 (5%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQ 120
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +TQ+LL + G+ R P
Sbjct: 113 SFIRSYVVPEGFPDSVTPSYVPYMSWRALKHFFGGAMGVFTTQTLLNSVGVSRNRSAPG- 171
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
A ++WILKDG VGK++ + G + D + K+ R+ D L +LG G+E+ + P LFL
Sbjct: 172 AVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRLAGDLLMELGAGVELATAAVPHLFL 231
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+A N AK +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G I ++
Sbjct: 232 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 291
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
S + LLS +V S +E+++ ++TLN R ++ + F+KTGR+ S +
Sbjct: 292 KRNPS----LVTTFALLSCGYVLSSYQEVKSVVLHTLNRARFSVAIDSFLKTGRVPSLQE 347
Query: 301 LRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM- 358
+E++ FP +ED V R + ++ LF +E++++ +
Sbjct: 348 GNMKENIFSFP--WLEDKPIVLGSRFKDAFQDAGSYLSIEPLFEKERYLVTYNPSKGKVY 405
Query: 359 -VLEQNASGEDALRGWLVAAYAASMAKSF 386
+L+ A +D L+ AA+ A M F
Sbjct: 406 ALLKDQAKSDDILK----AAFHAHMLLHF 430
>gi|301096265|ref|XP_002897230.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262107315|gb|EEY65367.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 474
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 111/391 (28%), Positives = 183/391 (46%), Gaps = 36/391 (9%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
F P+GYP SV+E YL + + LQ S VL+TQS+L + G+ + TP A + W
Sbjct: 79 FLPAGYPDSVSEDYLGFQFWDTLQAMCSYLRGVLATQSVLQSVGVGDDKATPL-AAALQW 137
Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
+L+DG +G L + +G + D KRWR+ AD + D+G L++++P P L E+ +
Sbjct: 138 VLRDGSGMIGGLTFAYFVGPKFDVNVKRWRLFADVVNDVGLTLDMVAPYFPTLVTEVLCV 197
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
+ K + VAA ATR + + FAK N++D AK + T + GL G+ A+ V S
Sbjct: 198 SSVCKAMCGVAAGATRSSLMTHFAKRDNMADCAAKEGSQETAVKLFGLMFGMYFANAVNS 257
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL---- 301
S + +L+++HVY+ + + T+N R ++V F+ +G ++
Sbjct: 258 SPHAVWMAFLVLTLVHVYANYNAVSCLCIPTVNCSRGLILVQRFLMSGNKPGKENVADHQ 317
Query: 302 -RY------REDLLFPGRLIEDAGNVKVGRDPHKVVKPSK-FHELKELFAEEKFILCLGS 353
RY +E+ +F ++ ++ +G + H+ V S L +++ +E ++L +
Sbjct: 318 GRYSIRSVNQEEPIFKDPVLPVTSHLMMGSELHQAVPTSTDLDRLLQIYEDETYLLTVVG 377
Query: 354 RWTDMVLEQNASGEDALRGWLVAAYAASMAKSF----------------HDPSLTVLQDA 397
++ E A D LR + A S H P +L+
Sbjct: 378 DRVHVMFEAEAKPTDELRAFFQAVLVLQALASVPPTTSTPSSSNEGSTKHQP---LLEST 434
Query: 398 YDKMNDVFTPLLSELQAKGWHTDRFLDGTGT 428
Y KM F L LQ GW T FL T T
Sbjct: 435 YRKMTSDFPLFLQVLQNSGWRTHMFLLNTST 465
>gi|449484579|ref|XP_004156921.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 443
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 86/270 (31%), Positives = 143/270 (52%), Gaps = 6/270 (2%)
Query: 43 PDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSV 102
P LS V+ V+ R D F P G+P SV YL Y+ +R +Q S V
Sbjct: 158 PSSSLSFSFVN----VWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGV 213
Query: 103 LSTQSLLFAAGL-RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADAL 161
L+TQ+LL+A GL + A V+W+LKDG ++ K+ S G D PK WR+ AD L
Sbjct: 214 LATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLL 273
Query: 162 YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKG 221
+ G+E+L+P P F+ + + A + ATR Y+ FA + N +++ AKG
Sbjct: 274 ENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKG 333
Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
EA + +G+ GI LA+ + SS + ++++IH++ ++ ++ + TLNP R
Sbjct: 334 EAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYR 393
Query: 282 TAMIVADFVKTGRISSPADLRYREDLLFPG 311
+++ ++++ +G + S D+ E+ LFP
Sbjct: 394 ASLVFSEYLLSGEVPSIKDVN-NEEPLFPA 422
>gi|340372549|ref|XP_003384806.1| PREDICTED: UPF0420 protein C16orf58-like [Amphimedon queenslandica]
Length = 432
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 115/418 (27%), Positives = 190/418 (45%), Gaps = 34/418 (8%)
Query: 34 SVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQ 93
S R Y D +V+V +R R+ D+ F P GYP SV+E YL+Y + LQ
Sbjct: 14 SSARSYMMIGD---TVRVTSKARNT--RLADTLKEVFLPQGYPNSVSEDYLQYQLWDTLQ 68
Query: 94 HFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSE 150
F S L+T ++L G+ + A A ++WILKDG+ G+++ + L G +D
Sbjct: 69 AFCSYITGTLATHAMLKGVGVGDSSASPVAATLTWILKDGLGMTGRILFAWLRGYDLDCN 128
Query: 151 PKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAK 210
K+WR++AD L D+ ++++SP P FL +A L + + + VA ATR + A+
Sbjct: 129 AKKWRLIADILNDIAICMQLVSPFFPSCFLLIACLASITQSVVGVAGGATRAALVQHQAR 188
Query: 211 EGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMR 270
N++D+ AK + TL N+ GL G+ + + + +++H+YS + +
Sbjct: 189 RDNMADVSAKDGSQETLVNLCGLLIGLIITPLIAGQTVFVWSLFFSFTLLHLYSNYKAVS 248
Query: 271 AAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV 330
+ TLN R +++ + G IS P + E LLF E V+ G V+
Sbjct: 249 VVSMETLNCNRLHLLMRNLFLNGTISEPNIVNREEPLLFRQ---ERFFTVEYGSSLSSVL 305
Query: 331 KPSKFH--ELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAA--------- 377
S + + + +KFI+ L R + +S D L+ L
Sbjct: 306 IHSSDYSRSILQFNKGQKFIIKLSKTKRQIRVAFHCGSSSSDQLKAGLTVELIEFVCGGC 365
Query: 378 ---------YAASMAKSFHDPSLTVLQD-AYDKMNDVFTPLLSELQAKGWHTDRFLDG 425
YA + K +VL D D +ND+F+ +L +L+ +GW L G
Sbjct: 366 YGDSNYLKDYALLIRKGVESTDESVLVDVCQDIINDLFSSILDQLRKEGWTVSHHLLG 423
>gi|255557849|ref|XP_002519954.1| conserved hypothetical protein [Ricinus communis]
gi|223541000|gb|EEF42558.1| conserved hypothetical protein [Ricinus communis]
Length = 541
Score = 144 bits (363), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 96/306 (31%), Positives = 160/306 (52%), Gaps = 13/306 (4%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQATVV 124
F+ P GYP SV YL Y+ +R +Q S VL+TQ+LL+A GL + A +
Sbjct: 132 FVRLMLPEGYPHSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 191
Query: 125 SWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
+W+LKDG+ ++ K++ S G D PK WR+ AD L + GLE+L+P P LF+ +
Sbjct: 192 NWVLKDGIGYLSKIVLSKYGRHFDVNPKGWRLFADLLENAAFGLEILTPAFPHLFVFIGA 251
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
+ A + ATR Y+ FA + N +++ AKGEA + +G+ GI LA+ +
Sbjct: 252 AAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLGIGLANCIG 311
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
SS+ + +++ IH++ ++ ++ + TLNP R +++ ++++ +G+ D+
Sbjct: 312 SSIPLALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPIKDVN-D 370
Query: 305 EDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNA 364
E+ LFP + K P VV E ++ E + L LGS+ +D+V
Sbjct: 371 EEPLFPAVF----PHFKSADKPSLVVLS---LEARDAATEIERRLQLGSKLSDVV----N 419
Query: 365 SGEDAL 370
S ED L
Sbjct: 420 SKEDVL 425
>gi|348671592|gb|EGZ11413.1| hypothetical protein PHYSODRAFT_338124 [Phytophthora sojae]
Length = 478
Score = 144 bits (363), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 186/391 (47%), Gaps = 36/391 (9%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
F P+GYP SV+E YL + + LQ S VL+TQS+L + G+ + TP A + W
Sbjct: 83 FLPAGYPDSVSEDYLSFQFWDTLQAMCSYLRGVLATQSVLQSVGVGDDKATPL-AAALQW 141
Query: 127 ILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
+L+DG +G L + L G + D KRWR+ AD + D+G L++++P P L E+ +
Sbjct: 142 VLRDGSGMIGGLTFAYLVGPKFDVNVKRWRLFADVINDVGLTLDMVAPYFPTLVTEVLCV 201
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
+ K + VAA ATR + + FAK+ N++D AK + T + GL G+ A+ V S
Sbjct: 202 SSVCKTMCGVAAGATRSSLMTHFAKKDNMADCAAKEGSQETAVKLFGLMFGMYFANAVNS 261
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL---- 301
S Q + LL+VIHVY+ + + T+N R ++V F+ +G ++
Sbjct: 262 SPQAVWVAFLLLTVIHVYANYNAVSCLCIPTVNCSRGLILVRRFLASGNKPGKENVADHN 321
Query: 302 -RY------REDLLFPGRLIEDAGNVKVGRDPHKVVKPSK-FHELKELFAEEKFILCLGS 353
RY +E+ +F ++ ++ +G + H+ V S L ++++E ++L +
Sbjct: 322 GRYSIRSVNQEEPIFKDPVLPVTSHLVMGAELHQAVPSSHDLDRLLHIYSDEAYLLTVVG 381
Query: 354 RWTDMVLEQNASGEDALRGWLVAAYA----------------ASMAKSFHDPSLTVLQDA 397
++ + A +D LR + A ++ H P +L+
Sbjct: 382 DRVLVMFQAEAKPKDELRAFFQAVLVLEALASVSPTASPPPSSNGGTITHQP---LLEST 438
Query: 398 YDKMNDVFTPLLSELQAKGWHTDRFLDGTGT 428
KM F+ LQ GW T FL T T
Sbjct: 439 LRKMTADFSLFHQVLQNSGWRTHMFLLNTST 469
>gi|160333925|ref|NP_001103923.1| uncharacterized protein LOC555936 [Danio rerio]
gi|126631351|gb|AAI34047.1| Zgc:162613 protein [Danio rerio]
Length = 432
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 107/397 (26%), Positives = 184/397 (46%), Gaps = 47/397 (11%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
F + F P GYP SV+E YL+Y + +Q F+S+ L+TQ+ L G+ + A
Sbjct: 48 FKSVFLPQGYPESVSEDYLQYQLWDTVQAFSSSLSGTLATQASLRGVGVGNQEATVAAAT 107
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+L+DG +G+++ + L G+++DSE K+WR+ AD L D+ +E+ +P P F +
Sbjct: 108 ITWLLRDGTGMLGRILFAWLKGSKLDSEAKKWRLFADILNDVAMFIEIAAPHFPPFFTII 167
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ K I VA ATR + A+ N++D+ AK + TL N+ GL + L
Sbjct: 168 VCIAGIFKSIVGVAGGATRAALTVHQARRNNMADISAKDGSQETLVNLAGLLVSLALIPL 227
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
V + I+ L +V+H+++ +R+ + TLN R ++++ ++ G++ SPA+
Sbjct: 228 VTDNPLLTFILFYLFTVLHLFANYRAVRSVVMETLNEARLSIVLHQYLLHGQVLSPAEAN 287
Query: 303 YREDLLFP-------------GRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFIL 349
RE + G LI + G++++ + KP +
Sbjct: 288 QREPVFLSFRRTVPIKLGVRLGDLIHEPGDLQLALKNNS--KPYLIG-----IKNGAVCV 340
Query: 350 CLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQ-------------D 396
CLGS +AS +D ++ + A S P VLQ
Sbjct: 341 CLGS---------DASVDDEIKA-VCQALCVSAVLHNESPLPAVLQQLSNTKDPWALISQ 390
Query: 397 AYDKMNDVFTPLLSELQAKGWHTDR-FLDGTGTRFAW 432
++ ++++F L EL W TDR LD R W
Sbjct: 391 SHKMIDEIFQTFLKELDRAKWQTDRTLLDWNEWRVEW 427
>gi|168005499|ref|XP_001755448.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693576|gb|EDQ79928.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 503
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 104/395 (26%), Positives = 190/395 (48%), Gaps = 37/395 (9%)
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RP 116
V R V+ P GYP SV + Y YT +R Q S VL+TQ+LL+A GL +
Sbjct: 85 VITRFVELGKQIVLPDGYPQSVTDDYFEYTLWRMGQVIASQISGVLTTQALLYAVGLGKG 144
Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
A V+W+L+DG+ ++ K++ S G D PK WR+++D + ++ GLE+L+P P
Sbjct: 145 AIPTAAAVNWVLRDGIGYLSKIVLSKYGRHFDVHPKGWRLVSDLIENMSYGLELLTPAFP 204
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
LF+ +A + + + AT+ + A N +++ AKGEA + +G+ G
Sbjct: 205 HLFVYLAAAAGAGRSASGLIQAATKSCFNAGMAANRNFAEIIAKGEAQGMVSKSIGIALG 264
Query: 237 IQLASTVCSSMQGKMIVGPLLSV--IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
I +++ V S+ G ++VG V +H++ ++ +A + TLNP R ++++A++ ++G
Sbjct: 265 IAVSAYVGST--GPLLVGTFFGVTALHIFCNLKSYQAVQLRTLNPYRASLVLAEYFRSGN 322
Query: 295 ISSPADLRYREDL----LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELK----------- 339
+ S ++ E + F RL G+ + + K V S + L+
Sbjct: 323 VVSVREVNGEEPIFPFSFFNFRLRGSKGDGTLLSEESKAVAHSIENHLEFGATFGAAVKT 382
Query: 340 --------ELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYA--------ASMA 383
+++ E+++L +VL++ A+ D LR + A Y S
Sbjct: 383 REEADALLDIYKSERYLLSQHDDRLLVVLKEGATPRDMLRAMMQAIYLDYLQGTGFTSHR 442
Query: 384 KSFHDP-SLTVLQDAYDKMNDVFTPLLSELQAKGW 417
HD + VL+ ++D + + F + ++ GW
Sbjct: 443 NLMHDSGNGGVLRVSHDFVVNQFEQIRQDIAGVGW 477
>gi|226505832|ref|NP_001140509.1| uncharacterized protein LOC100272571 [Zea mays]
gi|194699768|gb|ACF83968.1| unknown [Zea mays]
Length = 491
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 89/287 (31%), Positives = 154/287 (53%), Gaps = 20/287 (6%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV 123
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ +
Sbjct: 110 SFIRSYVVPEGFPDSVTPSYVPYMNWRALKHFFGGAMGVFTTRTLLSSVGVSQS------ 163
Query: 124 VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
KDG VGK++ + G + D++ K+ R +D L ++G G+E+ + PQ FL MA
Sbjct: 164 -----KDGAGRVGKMLFARQGKKFDNDLKQLRFSSDLLLEIGAGIELTTAAFPQFFLPMA 218
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
+ N K +A V + +TR PIY ++A+ N+ D+ AKGE++ + +++G G I ++
Sbjct: 219 CVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIFISKRN 278
Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRY 303
S + LS ++ S +E+R+ +NTLN R + V F+KTG I S +
Sbjct: 279 PSLVTSFA----FLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGHIPSLKEGNS 334
Query: 304 REDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFIL 349
+E + P E V +G R +P+ F ++ LF +E++++
Sbjct: 335 QETIFNPPWRHE---TVAIGSRFGEAFQEPASFLAIRPLFEDERYMV 378
>gi|168022981|ref|XP_001764017.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162684756|gb|EDQ71156.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 437
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 110/392 (28%), Positives = 198/392 (50%), Gaps = 28/392 (7%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
SR V++ ++ +FL P GYP SV YL + + LQ ++ S+LSTQ+LL G+
Sbjct: 42 SRAVWNNILSAFL----PEGYPNSVTPDYLPFQTWDTLQGLSTYIRSMLSTQALLGGIGV 97
Query: 115 RPTPAQATVVS----WILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLE 169
T ATVV W ++D +G ++ + + G+ +DS K+WR+ AD L D+G ++
Sbjct: 98 GET--TATVVGATFLWFMRDFTGMLGSILFTLHQGSNLDSSAKQWRLTADLLNDVGMLMD 155
Query: 170 VLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
++SPL P F+ + +G+ A+ + VA+ ATR + FA N +D+ AK + T
Sbjct: 156 LVSPLFPGAFVTILCIGSMARSVTGVASGATRAALTQHFALRKNAADVSAKEGSQETAAT 215
Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
+VG+ G+ LA ++ + LL+ H+Y+ +R+ + +LN +RT+++++ F
Sbjct: 216 MVGMLLGMVLARLTAENVVALWVSFLLLTAFHMYANYRAVRSLCLTSLNAERTSIVLSAF 275
Query: 290 VKTGRISSPADLRYREDLL-----FPGRLIEDAGNVKVGRDPHKVVKPS---------KF 335
+ ++ +P ++ +E LL P + + AG+ + + V+ S F
Sbjct: 276 KEGRKVPTPREVADQEHLLPRIPRLPWQ-KQRAGDAWLPKSIRFGVQISSLKLDDGWKSF 334
Query: 336 HELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQ 395
L E ++ + +++ +VL + A+ +D L G++ A A + P+ +
Sbjct: 335 LSLVERYSRDPYLMVPKGAIVHVVLHKLATPQDFLHGYVHALCLADL--QLRSPADQSEE 392
Query: 396 DAYDKMNDVFTPLLSELQAKGWHTDRFLDGTG 427
A M + +T L L GW TDR L TG
Sbjct: 393 LAISWMKNNYTAFLDSLGKSGWATDRILVETG 424
>gi|356497988|ref|XP_003517837.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 473
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 116/438 (26%), Positives = 208/438 (47%), Gaps = 29/438 (6%)
Query: 14 EPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPS 73
E S+S + I + + S F S ++ S + V L F P
Sbjct: 36 ETSSSVSNKTITLEEWNGSSPTKLSKTFTIKASSSSVLITRSGARFTHVWRRLLQAFVPE 95
Query: 74 GYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDG 131
G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+ A W L+D
Sbjct: 96 GFPSSVTADYVPFQIWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGATFQWFLRDL 155
Query: 132 MQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAK 190
+G ++ + G+ +DS K WR++AD + DLG ++++SPL P F+ + LG+ ++
Sbjct: 156 TGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLISPLFPSAFVFVVCLGSISR 215
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
VA+ ATR + FA + N +D+ AK + T+ ++G+ G+ +A
Sbjct: 216 SFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVARLTIGHPLAI 275
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
L+V H+Y+ +R +N+LNP+R+++++ F++TG++ SP + +E +L P
Sbjct: 276 WFSFLSLTVFHMYANYRAVRCLALNSLNPERSSILLQHFMETGQVLSPKQVSSQEHVL-P 334
Query: 311 GRLIEDAGNVKVGRDPHKVV----KPSKFH--ELKE-------LFAEEKFILCLGSRWTD 357
+ I + + K HK V + S F E+KE + + K++L D
Sbjct: 335 IQFI--SWSSKKANCLHKKVCLGMRISSFDNMEIKEHLLCAAPYYTKAKYLLVERKGIID 392
Query: 358 MVLEQNASGEDALRGW---LVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQA 414
+++ ++++ D L+ + LV A +KS H Q D +VF + +L++
Sbjct: 393 VIVHKDSNAADVLKSFFHALVLAKNVHKSKSLHSEG----QKWIDYQYEVF---IQKLKS 445
Query: 415 KGWHTDRFLDGTGTRFAW 432
GW T+R L R W
Sbjct: 446 LGWKTERLLSPIIWRAYW 463
>gi|410903033|ref|XP_003964998.1| PREDICTED: UPF0420 protein C16orf58 homolog [Takifugu rubripes]
Length = 442
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 103/399 (25%), Positives = 194/399 (48%), Gaps = 29/399 (7%)
Query: 60 HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPT 117
+ VV +F + F P GYP SV+ YLRY + +Q F S+ L+T++ L G+ +
Sbjct: 42 NSVVGAFKSVFLPQGYPESVSGDYLRYQFWDTVQAFFSSLSGTLATRASLRGVGVGNQEA 101
Query: 118 PAQATVVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
A ++W+LKDG G+++ + G ++DSE K+WR+ AD L D+ +E+L+P P
Sbjct: 102 TVAAATMTWLLKDGTGMCGRILFAWRKGTKLDSEAKKWRLFADILNDVAMFMEILAPYFP 161
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
F + + K + VA ATR + A+ N++D+ AK + TL N+ GL
Sbjct: 162 AFFTLVMCVSGIFKSLVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVS 221
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
+ L V + + + L +V+H+++ + +R+ + TLN R ++++ +++ G+I
Sbjct: 222 LVLIRFVSDNAALTLGLFFLFAVLHLFANYKAVRSVVMETLNEARLSVVLQQYLENGQIL 281
Query: 297 SPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWT 356
SP + RE + + + +++G +V++ S+ + + + F+L + +
Sbjct: 282 SPLEANEREPVFMDFKKVVP---IRLGVRLQEVIQSSEELKFALKLSNKPFLLGVKNGGV 338
Query: 357 DMVLEQNASGEDALRGWLVAAYAASMAKS---------------FHDPSL-------TVL 394
+ L AS +D +R A + M S F+ P ++
Sbjct: 339 CVCLGPEASVQDEIRVMCQALWLRRMLSSQPSRGASAEQQVLFLFNIPLFPSSKGHWDLV 398
Query: 395 QDAYDKMNDVFTPLLSELQAKGWHTDR-FLDGTGTRFAW 432
++++ ++++F+P L L A GW R LD R W
Sbjct: 399 HESHNVIDNIFSPFLKGLVAAGWDLKRNLLDWDEWRVEW 437
>gi|348510102|ref|XP_003442585.1| PREDICTED: UPF0420 protein C16orf58 homolog [Oreochromis niloticus]
Length = 433
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 103/393 (26%), Positives = 180/393 (45%), Gaps = 27/393 (6%)
Query: 60 HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPT 117
+ V F + F P GYP SV++ YL+Y + +Q F+S+ L+TQ+ L F G +
Sbjct: 43 NSVFGVFKSVFLPQGYPESVSDDYLQYQFWDTMQAFSSSLSGTLATQASLRGFGVGNQEA 102
Query: 118 PAQATVVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
A V+W+ KDG +G+++ + G+++DSE K+WR+ AD L D+ +E+L+P P
Sbjct: 103 TVAAATVTWLFKDGTGMLGRILFAWRKGSKLDSEAKKWRLFADVLNDIAMFMEILAPYFP 162
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
F + K I VA ATR + A+ N++D+ AK + TL N+ GL
Sbjct: 163 ACFTFIVCTAGIFKSIVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVS 222
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
+ L V + + + L +V+H+++ + +R+ + T N R ++++ ++K RI
Sbjct: 223 LILIPLVTDNPVLTLCLFFLFTVLHLFANYKAVRSVVMETFNEARLSIVLQQYLKDKRIL 282
Query: 297 SPADLRYREDLLFP-GRLIEDAGNVK---VGRDPHKVVKPSKFHELKELFAEEKFILCLG 352
SP + RE + F G+ + V+ + + P + K + + L +C+
Sbjct: 283 SPPEANQREPVFFEFGQTVPMKLGVRLQEIAQSPEDLQLALKENSMPYLLGIRNGCVCV- 341
Query: 353 SRWTDMVLEQNASGEDALRGWLVAAY------------AASMAKSFHDPSLTVLQDAYDK 400
L AS D +R A A A ++ +++
Sbjct: 342 ------CLGPEASASDEIRAMCQAVVLSNMLSSSNSREATGAALKQQKGHWEMVHESHKL 395
Query: 401 MNDVFTPLLSELQAKGWHTDR-FLDGTGTRFAW 432
M+ +F P L +A GW R +D R W
Sbjct: 396 MDAIFNPFLKGAEAAGWDMKRTMIDWDEWRVEW 428
>gi|222628554|gb|EEE60686.1| hypothetical protein OsJ_14164 [Oryza sativa Japonica Group]
Length = 609
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 98/358 (27%), Positives = 172/358 (48%), Gaps = 54/358 (15%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
+ R + P GYP SV+ YL+Y+ +R +Q S VLSTQ+LL+A GL +
Sbjct: 185 WRRCATVAVQLLLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGLGKGA 244
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A V+W+LKDG+ ++ K++ S G D PK WR+ AD L + GLE+L+P+ P
Sbjct: 245 IPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTPVFPH 304
Query: 178 LFLEMAGLGNFAKGIAVVAAR----------------ATRLPIYSSFAKEGNLSDLFAKG 221
LF+ + + A + ATR Y+ FA + N +++ AKG
Sbjct: 305 LFVPIGAAAGAGRSAAALIQTIFTHFMLENANHMRKAATRSCFYAGFAVQRNFAEVIAKG 364
Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
EA + +G+ GI LA+ + SS+ +I ++ +H+Y ++ ++ + TLNP R
Sbjct: 365 EAQGMVSKFLGIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYR 424
Query: 282 TAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG--RDPHKVVKP------- 332
+++ ++++ +G++ S ++ E L N+ +G R K++ P
Sbjct: 425 ASLVFSEYLLSGQVPSVKEVNDEEPLFL---------NLSIGTSRKESKILSPQAKDAAE 475
Query: 333 ---------SKFHELKE----------LFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
SK E+ E L+ E+++L ++L++ +S ED L+
Sbjct: 476 IICRRLQLGSKLSEIIENKEDACALFDLYKNEQYLLTEYKGKFCVILKEGSSPEDMLK 533
>gi|449016577|dbj|BAM79979.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 719
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 106/384 (27%), Positives = 181/384 (47%), Gaps = 33/384 (8%)
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL- 114
R + R+ + + + P GYP +V + YL +T +R LQ+ ++ + V ST++LL GL
Sbjct: 328 RSLVERLRNWLIQVYLPHGYPHTVTQDYLAFTLWRLLQNTAASVMGVFSTEALLLGLGLG 387
Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSP 173
+ A V WILKDG H GK+ + + G + D +P+ WRI++D L DL LE+++P
Sbjct: 388 KNVSGTAQAVQWILKDGFGHAGKIAYAAVAGKQFDMDPRSWRIMSDLLEDLAGVLEIITP 447
Query: 174 LCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
+ FL +A + + IA + ATR I+ S A N DL KGE+ ++GL
Sbjct: 448 IFSGQFLVLASIATTMRAIAAMTGTATRHAIFKSMALAENQGDLATKGESQGVTTKLIGL 507
Query: 234 GAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV--- 290
G GI L+ + +I + +VIH+ + + M+ +TLN QR A+++ F+
Sbjct: 508 GLGITLSKRIGQDYTNLLIAYGIAAVIHLAANFKAMQCVEFSTLNRQRMALLLDAFMSAR 567
Query: 291 -KTGRISS-------PADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELF 342
+T R +S P ++ +E + P + + G +V F L+
Sbjct: 568 AQTSRQASRQLTLPTPREICAKERFMIPP-WRDYRPEIVFGASLQRVKHARHFLHCAALY 626
Query: 343 AEEKFILCL--GSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVL----QD 396
E++I+ ++ + L A+ +D L K+F + +L +
Sbjct: 627 RNERYIIYAEPSTKHVLVFLRDGATVKDQL-------------KAFMNAQRYLLDRDAER 673
Query: 397 AYDKMNDVFTPLLSELQAKGWHTD 420
AY+ F+ L A GW T+
Sbjct: 674 AYNFTQREFSTFLQATVASGWSTN 697
>gi|356495287|ref|XP_003516510.1| PREDICTED: uncharacterized protein LOC100778944 [Glycine max]
Length = 649
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/314 (31%), Positives = 164/314 (52%), Gaps = 12/314 (3%)
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RP 116
V+ + D F P G+P SV YL Y+ +RA+Q VL+TQSLL+A GL +
Sbjct: 174 VWLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKG 233
Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
A ++W+LKDG+ ++ K++ SN G D +PK WR+ AD L + GLE+ +P P
Sbjct: 234 AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVDPKGWRLFADLLENAAFGLEMCTPAFP 293
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
Q F+ + + ++ A + +TR ++ FA + N +++ AKGE +G+G G
Sbjct: 294 QFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIGLG 353
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
I L + + SS + +L+ IH+Y ++ ++ + TLNP R +++ ++++ +G+ +
Sbjct: 354 IGLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQ-A 412
Query: 297 SPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWT 356
P E+ LFP I N +V S E K+ AE + L LGS+ +
Sbjct: 413 PPVKEVNDEEPLFPAVPIL---NATFANKAQSIVLSS---EAKDAAAEIEHRLQLGSKLS 466
Query: 357 DMVLEQNASGEDAL 370
++V S ED L
Sbjct: 467 EIV----NSKEDVL 476
>gi|357487621|ref|XP_003614098.1| hypothetical protein MTR_5g044720 [Medicago truncatula]
gi|355515433|gb|AES97056.1| hypothetical protein MTR_5g044720 [Medicago truncatula]
Length = 749
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 192/378 (50%), Gaps = 31/378 (8%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVV- 124
FL F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+ ATV+
Sbjct: 363 FLQAFVPEGFPSSVTPDYVPFQIWDLLQGLSTYIRTMLSTQALLSAIGV--GEKSATVIG 420
Query: 125 ---SWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
W L+D +G ++ + G+ +DS K WR++AD + DLG ++++SPL P F+
Sbjct: 421 ATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLISPLFPSAFV 480
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ LG+ ++ VA+ ATR + FA + N +D+ AK + T+ ++G+ G+ +A
Sbjct: 481 FIVCLGSISRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVA 540
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
L++ H+Y+ ++ +N+LNP+R+++++ F+++ ++ SP
Sbjct: 541 RITTGHPLAIWFSFLSLTLFHMYANYRAVQCLSLNSLNPERSSILLQRFMESRQVLSPKQ 600
Query: 301 LRYREDLLFPGRLIEDAGNVKVGRDPHKV---VKPSKFHE---------LKELFAEEKFI 348
+ E +L P +L + KV KV + S F E L +A+ K+I
Sbjct: 601 VSPLEHVL-PIQLTSMSSK-KVNSLDSKVRLGTRISSFDEMEIKEHLLSLASYYAKAKYI 658
Query: 349 LCLGSRWTDMVLEQNASGEDALRGW---LVAAYAASMAKSFHDPSLTVLQDAYDKMNDVF 405
L D+++ +++S D L+ + LV A KS H S +++ Y +VF
Sbjct: 659 LVEKKGIIDVIVHKDSSSADVLKSFIHALVLANNVYKGKSLHSDSQMWMENQY----EVF 714
Query: 406 TPLLSELQAKGWHTDRFL 423
+ +L++ GW T+R L
Sbjct: 715 ---IQKLKSLGWKTERLL 729
>gi|157422883|gb|AAI53381.1| Zgc:162613 protein [Danio rerio]
Length = 432
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 102/384 (26%), Positives = 185/384 (48%), Gaps = 21/384 (5%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
F + F P GYP SV+E YL+Y + +Q F+S+ L+TQ+ L G+ + A
Sbjct: 48 FKSVFLPQGYPESVSEDYLQYQLWDTVQAFSSSLSGTLATQASLRGVGVGNQEATVAAAT 107
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+L+DG +G+++ + L G+++DSE K+WR+ AD L D+ +E+ +P P F +
Sbjct: 108 ITWLLRDGTGMLGRILFAWLKGSKLDSEAKKWRLFADILNDVAMFIEIAAPHFPPFFTLI 167
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ K I VA ATR + A+ N++D+ AK + TL N+ GL + L
Sbjct: 168 VCIAGIFKSIVGVAGGATRAALTVHQARRNNMADISAKDGSQETLVNLAGLLVSLALIPL 227
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
V + I+ L +V+H+++ + +R+ + TLN R ++++ ++ G++ SPA+
Sbjct: 228 VTDNPLLTFILFFLFTVLHLFANYKAVRSVVMETLNEARLSIVLHQYLLHGQVLSPAEAN 287
Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQ 362
RE + R +K+G ++ +L + +++ + + + L
Sbjct: 288 QREPVFLSFR---RTVPIKLGVRLGDIIHEPGDLQLALKNNSKPYLIGIKNGAVCVCLGS 344
Query: 363 NASGEDALRGWLVAAYAASMAKSFHDPSLTVLQ-------------DAYDKMNDVFTPLL 409
+AS +D ++ + A S P VLQ ++ ++++F L
Sbjct: 345 DASVDDEIKA-VCQALCVSAVLHNESPLPAVLQQLSNTKDPWALISQSHKMIDEIFQTFL 403
Query: 410 SELQAKGWHTDR-FLDGTGTRFAW 432
EL W TDR LD R W
Sbjct: 404 KELDRAKWQTDRTLLDWNEWRVEW 427
>gi|412993492|emb|CCO14003.1| predicted protein [Bathycoccus prasinos]
Length = 584
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 83/263 (31%), Positives = 141/263 (53%), Gaps = 13/263 (4%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPT 117
R +D+ + FFP SV Y Y ++R Q S+ ++VL+TQ +L A GL R
Sbjct: 163 RQIDNIRDSFFPQKT--SVTSDYWDYAKYRFTQRIASSCMTVLATQQMLAAIGLGASRSI 220
Query: 118 PAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
PA A ++W+LKDG+ +GKL + +N G DS+ KR R + +Y +E ++P P
Sbjct: 221 PA-AAALNWVLKDGLGRLGKLSVATNFGRSFDSDVKRLRFSSSLVYTGAVFIETITPFFP 279
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
+ FL +A + N K I + R PI +FA E NL ++ AK A L + +GL G
Sbjct: 280 KHFLAIATVANVGKSIGITTGNVVRPPIQKTFALEENLGEIAAKTSAQQVLADNIGLALG 339
Query: 237 IQL----ASTVCSSMQGKMIVG--PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+ + V ++Q + + P+L+ I + S+ +++++ + T+N +R +I FV
Sbjct: 340 VSVMKFQTKFVSLALQRAVPLALFPILASIDLVSIHKQLKSVQLRTVNKERAEIIAEAFV 399
Query: 291 KTGRISSPADLRYREDLLFPGRL 313
+ ++++ + +E LLFP RL
Sbjct: 400 REKKLATQLQVAEKERLLFPARL 422
>gi|302835848|ref|XP_002949485.1| hypothetical protein VOLCADRAFT_80645 [Volvox carteri f.
nagariensis]
gi|300265312|gb|EFJ49504.1| hypothetical protein VOLCADRAFT_80645 [Volvox carteri f.
nagariensis]
Length = 495
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 84/245 (34%), Positives = 134/245 (54%), Gaps = 6/245 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
P+G+P SV Y Y +R +Q+F A+SV +TQSLL A G+ R A ++W+
Sbjct: 113 LLPNGFPDSVGPQYAPYMAWRGVQYFFGGAISVFTTQSLLGALGVAGRFQGEAAAAINWV 172
Query: 128 LKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
+KDG +G+L+ + G +D E K++R+ D L + G LE+ + P FL +A N
Sbjct: 173 IKDGAGRLGRLLFARWGRELDCELKQFRLAGDLLMEAGAALELATVYAPPAFLPLACTAN 232
Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
+K +A VAA +TR PIY +FA + NL+D+ AKGE+++ L +++G AGI L+ M
Sbjct: 233 LSKNLAAVAASSTRAPIYRTFALQNNLADITAKGESVANLADILGTVAGIALSRMKLPRM 292
Query: 248 QGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+ LS ++ S +E+ + + +N R A ++ G I A+ + E L
Sbjct: 293 PTFCV----LSAGYLLSSRKEVDSVELPYMNRARLAYATQRYLSDGFIPGVAEANHNEPL 348
Query: 308 LFPGR 312
L GR
Sbjct: 349 LPWGR 353
>gi|357133639|ref|XP_003568431.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2 [Brachypodium
distachyon]
Length = 508
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 104/343 (30%), Positives = 171/343 (49%), Gaps = 27/343 (7%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV 123
SF+ + P G+P SV Y+ Y +RAL+HF A+SV +T++LL + G+ +
Sbjct: 125 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQS------ 178
Query: 124 VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
+DG VGK++ + G + D + K+ R D L +LG G+E+ + P LFL +A
Sbjct: 179 -----RDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPHLFLPLA 233
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
+ N K +A V + +TR PIY ++AK N+ D+ AKGE++ + +++G G I L S
Sbjct: 234 CMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSI-LISKR 292
Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRY 303
S+ LS ++ S E+R+ +NTLN R + V F+KTG + S +
Sbjct: 293 NPSLVASFA---FLSCGYLLSSYREVRSVVLNTLNTARFTVAVDSFIKTGHVPSLKEGNL 349
Query: 304 REDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM--VL 360
E + P + V +G R +P+ F LF +E++I+ + +L
Sbjct: 350 EETIFNPPWRHQP---VAIGSRFGEAFQEPASFVSTSPLFEDERYIVTYNPTKDKVYALL 406
Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMND 403
+ A +D L+ AA+ A + F + S L +A MN
Sbjct: 407 KDQAKPDDILK----AAFHAHVLLHFINASHANL-NARKHMNS 444
>gi|225423472|ref|XP_002274189.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
gi|297738087|emb|CBI27288.3| unnamed protein product [Vitis vinifera]
Length = 437
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 97/382 (25%), Positives = 187/382 (48%), Gaps = 23/382 (6%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ V L F P G+P SV Y + + +LQ ++ ++LSTQ+LL A G+
Sbjct: 44 FSHVWRRILQAFVPEGFPSSVTPDYAPFQLWDSLQGLSTYIRTMLSTQALLSAIGVGEKS 103
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + DLG ++++SPL
Sbjct: 104 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLVSPLF 163
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
P F+ + LG+ ++ VA+ ATR + FA + N +D+ AK + T+ +VG+
Sbjct: 164 PSAFVFIVCLGSLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMVGMAF 223
Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
G+ LA L++ H+Y+ +R + TLN +R+++++ F++TG++
Sbjct: 224 GMLLARITMGHSVAIWFSFLSLTMFHMYANYRAVRCLILTTLNSERSSILIEHFMETGQV 283
Query: 296 SSPADLRYREDLLFPGRLIEDAGNVKVGRDPH-------------KVVKPSKFHELKELF 342
SP ++ E +L P L + ++K + H K+ H + +
Sbjct: 284 LSPQEVSKMEHVL-P--LWASSWSLKNAKSLHTRIHLGVRVSSLDKLEVTDLLHSAESHY 340
Query: 343 AEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMN 402
+ K++L +++ ++++ D L+ ++ A A++ D S+ + ++ M+
Sbjct: 341 KKAKYLLLERKGIISVIMHKDSTASDVLQSFIHALVMANLV----DKSIYIHLESRSWMD 396
Query: 403 DVFTPLLSELQAKGWHTDRFLD 424
+ L +LQ+ GW T+R L
Sbjct: 397 KQYENFLLKLQSSGWKTERLLS 418
>gi|299115992|emb|CBN75993.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 479
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 87/249 (34%), Positives = 141/249 (56%), Gaps = 7/249 (2%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT--PAQATV 123
L PSG+P SV+ Y+ Y ++ SAA V+STQ+LL+A GL P AT+
Sbjct: 1 MLGLVLPSGFPASVHPPYMAYAGWQFTGMAASAAAGVMSTQALLYAMGLGAGALPLAATL 60
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+W++KDG+ +G + S+L R D+ PK WR++A D LE+LSPL P FL +
Sbjct: 61 -NWVIKDGLGQLGGVAFSSLVSTRFDANPKLWRVVAAVSLDASMVLELLSPLAPAYFLPI 119
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
A + N K ++ +AA A+R I++ + +GNL+D+ AK A + L + G GAG+ L+++
Sbjct: 120 ASVANIGKNVSFLAASASRAAIHNVLSLKGNLADVTAKSGAQTILACMAGTGAGVTLSAS 179
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV---ADFVKTGRISSPA 299
+ S + LSV+H+ + MR + T + QR ++V AD ++ + SP
Sbjct: 180 LGSEFHAIVPACLCLSVVHLGANHMSMRRLGIPTFDHQRLELVVNGMADGDRSRGMPSPQ 239
Query: 300 DLRYREDLL 308
D+ +ED++
Sbjct: 240 DIAAQEDVV 248
>gi|42565615|ref|NP_190175.2| uncharacterized protein [Arabidopsis thaliana]
gi|30793915|gb|AAP40410.1| unknown protein [Arabidopsis thaliana]
gi|30794095|gb|AAP40490.1| unknown protein [Arabidopsis thaliana]
gi|110739240|dbj|BAF01534.1| hypothetical protein [Arabidopsis thaliana]
gi|332644566|gb|AEE78087.1| uncharacterized protein [Arabidopsis thaliana]
Length = 608
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 88/297 (29%), Positives = 157/297 (52%), Gaps = 9/297 (3%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQAT 122
+ L +F P G+P SV YL Y+ +R +Q S VL+TQSLL+A GL + A
Sbjct: 192 NLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAA 251
Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+LKDG+ ++ K++ S G D PK WR+ AD L + G+E+L+P+ PQ F+ +
Sbjct: 252 AINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMI 311
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ A + ATR + FA + N +++ AKGEA + VG+ GI +A+
Sbjct: 312 GAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSVGILLGIVVANC 371
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
+ +S + +++ IH+Y+ ++ + + TLNP R +++ ++++ +G+ ++
Sbjct: 372 IGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVN 431
Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMV 359
E+ LFP + + P K+ E K A+ + L LGS+ +D++
Sbjct: 432 -DEEPLFPTVRFSNM------KSPEKLQDFVLSSEAKAAAADIEERLQLGSKLSDVI 481
>gi|327286048|ref|XP_003227743.1| PREDICTED: UPF0420 protein C16orf58-like [Anolis carolinensis]
Length = 420
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 112/404 (27%), Positives = 185/404 (45%), Gaps = 32/404 (7%)
Query: 36 TRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHF 95
T HY+ PDG LS + Y + D F F P GYP SV+ YL Y + +Q F
Sbjct: 19 TCHYKSLPDGRLSCVLDRLEDEGYRSIRDFFTAVFLPQGYPESVSPDYLAYQIWDTVQAF 78
Query: 96 TSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLICS-NLGARMDSEPK 152
S+ L+TQ++L G+ + A V+WILKDG +G+++ + + G+++D + K
Sbjct: 79 ASSITGTLATQAVLKGVGVGDETSTVTAATVTWILKDGTGMLGRILFAWSKGSKLDCDAK 138
Query: 153 RWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG 212
+WR+ AD L D+ +E+L+P P F + + F K I VA ATR + A+
Sbjct: 139 QWRLFADVLNDVAILMEILAPAFPACFTLIVCISGFFKCIVGVAGGATRAALTMHQARRD 198
Query: 213 NLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAA 272
N++D+ AK + TL N+ GL + L V +++ + L +++H+Y+ +RA
Sbjct: 199 NMADVSAKDGSQETLVNLAGLLFSLLLIPLVADNLRLTYSLYGLFTILHLYANYRAVRAV 258
Query: 273 PVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLI-------EDAGNVKV--- 322
+ TLN R +++ F+K+G + P +E LL I G V +
Sbjct: 259 CMETLNRARLCLVLQSFLKSGEVPGPDVTNPQEPLLLVDFSITIYVPWSSSTGTVSILLH 318
Query: 323 -GRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAAS 381
R V+K L E L +VLE+ + + A
Sbjct: 319 QQRRSVDVIKACSHALLLEAL--------LSWDVASLVLEKES----------LLAVQKQ 360
Query: 382 MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDG 425
+ + V+ + + ++ +F L+ L A GW T+R L G
Sbjct: 361 LREGSKSKHCGVVSETHRILDRIFPKFLAGLMAAGWVTERNLLG 404
>gi|224129512|ref|XP_002328735.1| predicted protein [Populus trichocarpa]
gi|222839033|gb|EEE77384.1| predicted protein [Populus trichocarpa]
Length = 398
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 92/281 (32%), Positives = 154/281 (54%), Gaps = 9/281 (3%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILK 129
P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + A A ++WILK
Sbjct: 15 PEGFPDSVIPSYVPYMTWRALKHFFGGAMGVFTTKTLLNSVGVSKSQAIPGAVAINWILK 74
Query: 130 DGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFA 189
DG VGK++ + G + D + K+ R D L +LG +E+ + P LFL +A N A
Sbjct: 75 DGAGRVGKMLYARQGKKFDYDLKQLRFAGDLLMELGAAVELATAAVPHLFLPLACAANVA 134
Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
K +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G I ++ S
Sbjct: 135 KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNVADLLGTGLSIMISKRNPS---- 190
Query: 250 KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL- 308
+ LLS +V S +E+++ ++TLN R ++ V F+KTG++ S + +E++
Sbjct: 191 LVTTFALLSCGYVLSSYQEVKSVVLHTLNGARFSVAVESFLKTGQVPSLHEGNVKENIFN 250
Query: 309 FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFIL 349
FP ++D R P + ++ LF E++I+
Sbjct: 251 FP--WLKDRSISLGPRFKDAFQDPRAYLAIEPLFERERYIV 289
>gi|148905902|gb|ABR16113.1| unknown [Picea sitchensis]
Length = 446
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 104/374 (27%), Positives = 188/374 (50%), Gaps = 26/374 (6%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P G+P SV YL + + +LQ ++ +LSTQ+LL A G+ T A W
Sbjct: 59 FLPEGFPNSVTSDYLPFQIWDSLQGLSTYIRCMLSTQALLSAIGVGETTATVVGATFQWF 118
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L+D +G ++ + G+ +DS K WR++AD + DLG ++++SPL P F+ + LG
Sbjct: 119 LRDFTGMLGGILFTLYQGSNLDSNAKMWRLIADLMNDLGMLMDLVSPLYPASFVIILCLG 178
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
+ A+ VA+ ATR + FA + N +D+ AK + T+ ++G+ G+ LA V +
Sbjct: 179 SIARSFTGVASGATRAALTQHFALQHNAADISAKEGSQETMATMIGMLLGMLLAH-VTAG 237
Query: 247 MQGKMIVG-PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
G M LL+ H+++ + +R +N+LN RT++++ F+ G++ +P D+ E
Sbjct: 238 HPGAMWSSFLLLTFFHMFANYQAVRCLCLNSLNEGRTSVLLKCFITNGQVLNPKDVSSME 297
Query: 306 DLL---FPGRLIEDA----GNVKVGRDPHKVVKPSKFHELK---ELFAEEKFILCLGSRW 355
LL R++ K+G + + LK + + K++L +
Sbjct: 298 YLLPNWMQLRMLTKGRPFNATFKLGAKISSLSQSDMSKLLKLSASCYGKGKYLLLCRNGT 357
Query: 356 TDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDK-----MNDVFTPLLS 410
+++L +++ D L+ +Y + S+H S + + DA + M ++ +S
Sbjct: 358 IEVILNKDSVSADILQ-----SYIHGLVLSYHMGSSSSVNDANNADSCLWMEKYYSMFIS 412
Query: 411 -ELQAKGWHTDRFL 423
+LQA GW+T R L
Sbjct: 413 KQLQASGWNTGRLL 426
>gi|297815746|ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
lyrata]
gi|297321594|gb|EFH52015.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
lyrata]
Length = 613
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 87/297 (29%), Positives = 154/297 (51%), Gaps = 9/297 (3%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQAT 122
+ L +F P G+P SV YL Y+ +R +Q S VL+TQSLL+A GL + A
Sbjct: 198 NLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQVSGVLATQSLLYAVGLGKGAIPTAA 257
Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+LKDG+ ++ K++ S G D PK WR+ AD L + G+E+L+P+ PQ F+ +
Sbjct: 258 AINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMI 317
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ A + ATR + FA + N +++ AKGEA + +G+ GI +A+
Sbjct: 318 GAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSMGILLGIVVANC 377
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
+ +S + +++ IH+Y+ ++ + + TLNP R +++ ++++ +G+ ++
Sbjct: 378 IGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVN 437
Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMV 359
E L R + N+K V S+ E E L LGS+ +D++
Sbjct: 438 DEEPLFPTVRFL----NMKSPEKLQDFVLSSEAKAAAEDIEER---LQLGSKLSDVI 487
>gi|308807048|ref|XP_003080835.1| putative glutamate/ornithine acetyltransferase (ISS) [Ostreococcus
tauri]
gi|116059296|emb|CAL55003.1| putative glutamate/ornithine acetyltransferase (ISS) [Ostreococcus
tauri]
Length = 1023
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 89/267 (33%), Positives = 142/267 (53%), Gaps = 17/267 (6%)
Query: 60 HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RP 116
R V+ + P P SV Y Y ++R Q S+ +SV +TQ +L A GL R
Sbjct: 573 RRCVERASHALLPE--PTSVTSDYWEYARYRFAQRIASSCISVFATQQMLTAVGLGASRT 630
Query: 117 TPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
PA A V+W+LKDG+ +GKL + +N G DS+ KR+R + +YD + +E+++P
Sbjct: 631 LPA-AAAVNWVLKDGLGRLGKLSVAANFGRTFDSDVKRFRFTSSVVYDASSFIEIITPYF 689
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL-- 233
PQ FL +A + N K + + A R PI +F E NL+++ AK A L + +GL
Sbjct: 690 PQHFLPLATVANIGKSVGITTANVVRAPIQRTFILEENLAEVAAKTSAQQVLADNIGLAL 749
Query: 234 --GA--GIQLASTVCSSMQGKMIV---GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
GA G+ +TV ++ + + GP L+V+ + + +E++A + T+N +R +I
Sbjct: 750 AVGAAKGMSKVATVRPEIRRALPLFAFGP-LAVLDLICIYKELKAVQLRTINKERAEIIA 808
Query: 287 ADFVKTGRISSPADLRYREDLLFPGRL 313
+VK I + A + E LL P RL
Sbjct: 809 EAYVKEREIPTRARVANAERLLIPARL 835
>gi|145349487|ref|XP_001419164.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579395|gb|ABO97457.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 269
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 91/270 (33%), Positives = 138/270 (51%), Gaps = 21/270 (7%)
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL- 114
R R V+ + P P SV Y Y +FR Q S+ +SV +TQ +L A GL
Sbjct: 6 RDAARRCVERAQHALLPE--PSSVTNDYWEYAKFRFAQRIASSCISVFATQQMLTAVGLG 63
Query: 115 --RPTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVL 171
R PA A V+W+LKDG+ +GKL + +N G DS+ KR+R + +YD + +E++
Sbjct: 64 ASRTLPA-AAAVNWVLKDGLGRLGKLSVAANFGRTFDSDVKRFRFTSSVVYDASSFIEII 122
Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
+P PQ FL +A + N K + + A R PI +F E NL+++ AK A L + +
Sbjct: 123 TPYFPQHFLPLATVANIGKSVGITTANVVRAPIQRTFILEENLAEVAAKTSAQQVLADNI 182
Query: 232 GLGAGIQLA------STVCSSMQGKMIV---GPL--LSVIHVYSVIEEMRAAPVNTLNPQ 280
GL + A +TV ++ + V GPL L +I +Y +E++A + T+N +
Sbjct: 183 GLALAVGAAKAVSKVATVRPEIRRALPVFAFGPLAALDLICIY---KELKAVQLRTINKE 239
Query: 281 RTAMIVADFVKTGRISSPADLRYREDLLFP 310
R +I FVK I S A + E L P
Sbjct: 240 RAEIIAEAFVKEREIPSRAHVADAERLFIP 269
>gi|298709448|emb|CBJ31354.1| putative glutamate/ornithine acetyltransferase (ISS) [Ectocarpus
siliculosus]
Length = 789
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 79/235 (33%), Positives = 130/235 (55%), Gaps = 17/235 (7%)
Query: 76 PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP-AQATVVSWILKDGMQH 134
P V Y +T +R Q F + ++V TQ+LL A G++ A +SW+LKD +
Sbjct: 187 PEDVTPDYYHFTFWRMFQRFVAGTINVFGTQALLLALGIKAKRLGAAAAMSWVLKDALGK 246
Query: 135 VGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIA 193
G+++ S +G R DS+ KRWR + LY G GLE+++ + P FL +A + N K ++
Sbjct: 247 FGRILWASKMGRRFDSDAKRWRFRSSLLYAAGNGLEIVTYVFPASFLVLAAMANSFKQMS 306
Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAK---------------GEAISTLFNVVGLGAGIQ 238
++ + ATR IY SFA+ N+ D+ AK GEA + +V+G+ +GI
Sbjct: 307 MLTSSATRNTIYRSFARGENIGDITAKACPPAAAMPNAKKHEGEAQIAVVDVLGMLSGIG 366
Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
L + V +S Q + +LS++ + ++ E+RA + LN +RT ++V DFV +G
Sbjct: 367 LCTLVGTSRQTIVAAYVILSLVDISAIYNEIRAVCFSVLNHERTHLLVRDFVASG 421
>gi|170037702|ref|XP_001846695.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167880979|gb|EDS44362.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 417
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 185/381 (48%), Gaps = 28/381 (7%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR---PTPAQAT 122
F N F P+GYP SV+ YL Y ++ +Q F S L+T ++L G+ P AT
Sbjct: 49 FQNLFLPAGYPDSVSSDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDVANPLSAT 108
Query: 123 VVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLS-PLCPQLFL 180
++W+LKDGM H G+++ + G +D + K+WRI AD L D+ G+++ PL P++
Sbjct: 109 -ITWVLKDGMGHFGRILFAWWKGTELDIDSKKWRIRADILNDIAMGVDLFVLPLYPKVST 167
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ + K I VA ATR + A GNL+D+ +K A T N++ G+ L
Sbjct: 168 YILCATSTMKAIVGVAGGATRSALTQHHALRGNLADVASKDSAQETCVNLIASFVGLGLL 227
Query: 241 STVCSSMQGKMIVGPLL--SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
+ V +++ G L +++H+Y+ I+ ++A + T N R + + ++ K+G + SP
Sbjct: 228 THV---QNQRVLYGLFLFVTLLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGTMLSP 284
Query: 299 ADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSK-FHELKELFA----EEKFILCLGS 353
+ E + G+ + VK+G ++ + + ++L+ + A EKF+L
Sbjct: 285 QQVNKLERVTI-GQTVSLTARVKIGCSARELAQYYRNCYDLENVIACFDSREKFLLAETR 343
Query: 354 RWTDMVLEQNASGEDALRGWL-VAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSEL 412
+ + L D ++ + VA+Y ++ L+D + + + + LS
Sbjct: 344 HYVGVYLHFTVKPLDIIKSYFYVASYLQDKSQ---------LRDRFWETQNKWNEFLSMA 394
Query: 413 QAKGWHTDRFLDGTGT-RFAW 432
Q +GW+ L T R W
Sbjct: 395 QREGWNVQAHLLKTDEYRLDW 415
>gi|356502614|ref|XP_003520113.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
[Glycine max]
Length = 486
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 101/390 (25%), Positives = 193/390 (49%), Gaps = 23/390 (5%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ V L F P G+P SV Y+ + + L ++ ++LSTQ+LL A G+
Sbjct: 94 FTHVWRRLLQAFVPEGFPSSVTADYVPFQIWDLLXGLSTYIRTMLSTQALLIAIGVEEKS 153
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + DLG ++++SPL
Sbjct: 154 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSYAKMWRLVADLMNDLGMLMDLISPLF 213
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
P F+ + LG+ ++ VA+ ATR + FA + N +D+ AK + T+ ++G+
Sbjct: 214 PSAFVFVVCLGSISRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGMAL 273
Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
G+ +A L+V H+Y+ ++ +N+LNP+R++++ F++TG++
Sbjct: 274 GMLVARLTIGHPLAIWFSFLSLTVFHMYANYRAVQXLTLNSLNPERSSILFQHFMETGQV 333
Query: 296 SSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV----KPSKFH--ELKE-------LF 342
SP + +E +L P + I + + K HK V + S F E+KE +
Sbjct: 334 LSPEQVSSQEHIL-PIQFI--SWSSKKANCLHKKVNLGMRISSFDNMEIKEHLLSAAPYY 390
Query: 343 AEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMN 402
+ K++L D+++ ++++ D L+ + +A +AK+ H S +V + ++
Sbjct: 391 TKAKYLLVERKGIIDVIVHKDSNAADVLK---LFFHALVLAKNVHK-SKSVHSEGQKWID 446
Query: 403 DVFTPLLSELQAKGWHTDRFLDGTGTRFAW 432
+ + +L++ GW +R L R W
Sbjct: 447 YQYEVFIQKLKSLGWKIERLLSPIIWRAYW 476
>gi|297844278|ref|XP_002890020.1| hypothetical protein ARALYDRAFT_888749 [Arabidopsis lyrata subsp.
lyrata]
gi|297335862|gb|EFH66279.1| hypothetical protein ARALYDRAFT_888749 [Arabidopsis lyrata subsp.
lyrata]
Length = 440
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 100/390 (25%), Positives = 188/390 (48%), Gaps = 24/390 (6%)
Query: 52 VDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFA 111
+ S ++ V L F P G+P SV Y+ + + LQ ++ +LSTQ+LL A
Sbjct: 39 IQRSANRFNHVWRRVLQAFVPEGFPGSVTPDYVGFQFWDTLQGLSTYIKMMLSTQALLSA 98
Query: 112 AGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGL 168
G+ A W L+D +G ++ + G+ +DS K WR++AD + D+G +
Sbjct: 99 IGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLM 158
Query: 169 EVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
++LSPL P F+ + LG+ ++ VA+ ATR + FA + N +D+ AK + T+
Sbjct: 159 DLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETMA 218
Query: 229 NVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
++G+ G+ LA + L+V H+Y+ +R +N+LN +R+++++
Sbjct: 219 TMMGMSLGMLLARFTSGNPMAIWFSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTH 278
Query: 289 FVKTGRISSPADLRYREDLLFPGRLIEDAGNVK-VGRDPHKVVKPSKFHELKEL------ 341
F++TG++ SP + E +L + N K + + H V+ S L+ +
Sbjct: 279 FMQTGQVLSPEQVSSMEGVLPVWATSLRSTNSKLLHKRVHLGVRVSSLPRLEMMQLLNGV 338
Query: 342 ----FAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASM---AKSFHDPSLTVL 394
+ K++L ++L +++S D L+ ++ A A++ + SF+ +
Sbjct: 339 GASSYKNAKYLLAHRKGNVSVILHKDSSAADVLKSYIHAIVLANLMEKSTSFYSEGEAWI 398
Query: 395 QDAYDKMNDVFTPLLSELQAKGWHTDRFLD 424
YD+ L +L++ GW T+R L
Sbjct: 399 DKHYDE-------FLHKLRSGGWKTERLLS 421
>gi|47217805|emb|CAG07219.1| unnamed protein product [Tetraodon nigroviridis]
Length = 412
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 77/247 (31%), Positives = 137/247 (55%), Gaps = 3/247 (1%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQAT 122
+F + F P GYP SV+ YL+Y + LQ F+S+ L+T++ L G+ + A
Sbjct: 74 TFQSVFLPQGYPDSVSSDYLQYQFWDTLQAFSSSLSGTLATRASLRGVGVGNQQATVAAA 133
Query: 123 VVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
+W+LKDG G+++ + L G ++DSE K+WR+ AD L D+ +++L+P P F
Sbjct: 134 TTTWLLKDGTGMCGRILFAWLKGTKLDSEAKKWRLFADILNDIAMFMDILAPYFPAFFTL 193
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+ + K + VA ATR + A+ N++D+ AK + TL N+ GL + L S
Sbjct: 194 IMCVSGVFKSLVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVSLALVS 253
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
V + + + L +V+H+++ + +R+ + TLN R ++++ +++K GR+ SP +
Sbjct: 254 FVADNAALTLSLFFLFTVLHLFANYKAVRSVVMETLNEARLSIVLQEYLKDGRMLSPLEA 313
Query: 302 RYREDLL 308
RE +L
Sbjct: 314 NQREPVL 320
>gi|157113953|ref|XP_001652155.1| hypothetical protein AaeL_AAEL006646 [Aedes aegypti]
gi|108877519|gb|EAT41744.1| AAEL006646-PA [Aedes aegypti]
Length = 417
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 101/370 (27%), Positives = 179/370 (48%), Gaps = 25/370 (6%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR---PTPAQAT 122
F N F P+GYP SV+ YL Y ++ +Q F S L+T ++L G+ P AT
Sbjct: 49 FQNLFLPAGYPDSVSGDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDVANPLSAT 108
Query: 123 VVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLS-PLCPQLFL 180
++W+LKDGM H G++I + G +D + K+WRI AD L DL +++ P P+
Sbjct: 109 -ITWVLKDGMGHFGRIIFAWWKGTELDIDSKKWRIRADVLNDLAMSIDLFVLPYYPKAST 167
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ K I VA ATR + A GNL+D+ +K A T N++ G+ L
Sbjct: 168 FILCATTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLFLL 227
Query: 241 STVCSSMQGKMIVGPLL--SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
+ V +++ G L +++H+Y+ I+ ++A + T N R + + ++ KTG + SP
Sbjct: 228 THV---QDQRVLYGLFLFVTLMHIYANIKAVKAVCLRTFNEARYLIALEEYFKTGTMLSP 284
Query: 299 ADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKF-HELKELFA----EEKFILCLGS 353
+ E + G+ + VK+G ++ + + ++L+ L A +KF++
Sbjct: 285 EQVNKLERVTI-GQTVTLTARVKIGCSARELAQYYRVCYDLENLMACFDSRDKFLIAETR 343
Query: 354 RWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQ 413
+ + L A D ++ + Y AS + + L+D Y ++ + + L+ Q
Sbjct: 344 NYVGVYLHFTAKPLDIIKSYF---YVASYLQDKNQ-----LRDRYWEIQNKWNEFLNLAQ 395
Query: 414 AKGWHTDRFL 423
+GW+ L
Sbjct: 396 CEGWNVQAHL 405
>gi|320170693|gb|EFW47592.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 710
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 82/256 (32%), Positives = 140/256 (54%), Gaps = 9/256 (3%)
Query: 41 FQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAAL 100
Q G + ++ +D+R HR++++FL P GYP SV Y+ Y++++A+Q T
Sbjct: 139 LQNAGQRAWRLREDTR---HRLIEAFL----PKGYPHSVTPNYMGYSRWQAVQSVTGTMT 191
Query: 101 SVLSTQSLLFAAGLRPTPAQ-ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILA 158
VLSTQ+LL+A GL A ++WI+KDG+ +G ++ S + ++ DS+PKR R +
Sbjct: 192 GVLSTQALLYAVGLGAGAIPLAGALNWIVKDGLGQLGGVVYSTFISSKFDSDPKRHRFWS 251
Query: 159 DALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLF 218
+A T LE+L+PL P +FL +A + N K I+ +AA +TR +++S NL D+
Sbjct: 252 NAALQASTLLEILTPLAPGMFLFLASVSNIGKNISWLAASSTRAQMHNSLTLRDNLGDVT 311
Query: 219 AKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLN 278
K + + +++G G GI +A V + ++ LS++++ S + TLN
Sbjct: 312 GKAGSQAIATSLIGTGLGIAIAPFVGTDPIAVLLAFLPLSIVNMVSNYRSNTIVHMRTLN 371
Query: 279 PQRTAMIVADFVKTGR 294
QR + F+ R
Sbjct: 372 VQRAERLFMHFLLHNR 387
>gi|81879541|sp|Q91W34.1|CP058_MOUSE RecName: Full=UPF0420 protein C16orf58 homolog
gi|16877856|gb|AAH17158.1| CDNA sequence BC017158 [Mus musculus]
Length = 466
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 108/381 (28%), Positives = 181/381 (47%), Gaps = 39/381 (10%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 135 VKDSTGMLGRIILAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--C 244
N AK I VA ATR + A+ N++D+ AK + T+ N+ GL + + V C
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
S+ V LL+ +H+Y+ +RA + TLN R +++ F++ G + PA
Sbjct: 255 PSLSLGCFV--LLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQM 312
Query: 305 EDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGS--RWTD 357
E L +P ++ +G H +V S ELK+L E ++LC
Sbjct: 313 EPLWTGFWPSL------SLSLGVPLHHLV--SSVSELKQLVEGHHEPYLLCWNKSRNQVQ 364
Query: 358 MVLEQNASGEDALR----GWLVAAYA------ASMAKSFH----DP---SLTVLQDAYDK 400
+ L Q A E LR G ++ A +A+ H DP S ++++ +
Sbjct: 365 VALSQEAGPETVLRAATHGLILGALQEDGPLPGELAELRHQVQADPKKESWILVRETHQV 424
Query: 401 MNDVFTPLLSELQAKGWHTDR 421
++ +F L LQA GW T++
Sbjct: 425 LDTLFPKFLKGLQAAGWKTEK 445
>gi|42562046|ref|NP_172832.3| uncharacterized protein [Arabidopsis thaliana]
gi|28393739|gb|AAO42280.1| unknown protein [Arabidopsis thaliana]
gi|28973405|gb|AAO64027.1| unknown protein [Arabidopsis thaliana]
gi|332190945|gb|AEE29066.1| uncharacterized protein [Arabidopsis thaliana]
Length = 440
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 97/383 (25%), Positives = 184/383 (48%), Gaps = 24/383 (6%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
++ V L F P G+P SV Y+ + + LQ ++ +LSTQ+LL A G+
Sbjct: 46 FNHVWRRVLQAFVPEGFPGSVTPDYVGFQLWDTLQGLSTYTKMMLSTQALLSAIGVGEKS 105
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + D+G +++LSPL
Sbjct: 106 ATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLF 165
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
P F+ + LG+ ++ VA+ ATR + FA + N +D+ AK + T+ ++G+
Sbjct: 166 PSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETMATMMGMSL 225
Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
G+ LA + + L+V H+Y+ +R +N+LN +R+++++ F++TG++
Sbjct: 226 GMLLARFTSGNPMAIWLSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTHFIQTGQV 285
Query: 296 SSPADLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKEL----------FAE 344
SP + E +L + + + + V+ S L L +
Sbjct: 286 LSPEQVSSMEGVLPLWATSLRSTNSKPLHKRVQLGVRVSSLPRLDMLQLLNGVGASSYKN 345
Query: 345 EKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASM---AKSFHDPSLTVLQDAYDKM 401
K++L ++L +++ D L+ ++ A A++ + SF+ + YD+
Sbjct: 346 AKYLLAHIKGNVSVILHKDSKPADVLKSYIHAIVLANLMEKSTSFYSEGEAWIDKHYDE- 404
Query: 402 NDVFTPLLSELQAKGWHTDRFLD 424
LL +L++ GW T+R L
Sbjct: 405 ------LLHKLRSGGWKTERLLS 421
>gi|255081626|ref|XP_002508035.1| predicted protein [Micromonas sp. RCC299]
gi|226523311|gb|ACO69293.1| predicted protein [Micromonas sp. RCC299]
Length = 665
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 81/260 (31%), Positives = 138/260 (53%), Gaps = 17/260 (6%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVV 124
N FP+ SV E Y Y +FR LQ S+ ++V +TQ +L A G+ R PA A +
Sbjct: 196 NSLFPARE--SVTEDYWEYAKFRFLQRMASSCITVFATQQMLAAIGMGASRRLPA-AAAI 252
Query: 125 SWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
+W+LKDG+ +GKL + +N G DS+ KR+R + +YD + +E+++P P+ FL +A
Sbjct: 253 NWVLKDGLGRLGKLGVAANFGREFDSDVKRFRFTSSVVYDASSLVEMITPFFPKKFLALA 312
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
L N K + + A R PI SF E NL+++ AK A + + +GL + LA+ +
Sbjct: 313 TLANIGKSVGITTANVVRAPIQRSFVLEENLAEVAAKTSAQQVVADNLGLAVAV-LATGL 371
Query: 244 CSSM---QGKMIVGPL-----LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ + ++I+ P L+ + +Y + E++A + T+N +R +I ++ G +
Sbjct: 372 TGKVVNDRARLII-PFVAFIPLATMDLYCIYRELKAVQLTTINKERAEIICDHWINVGTV 430
Query: 296 SSPADLRYREDLLFPGRLIE 315
+ E L P R+ E
Sbjct: 431 PGTKYVSDAERLFIPARMDE 450
>gi|14424560|gb|AAH09308.1| C16orf58 protein [Homo sapiens]
Length = 468
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 105/411 (25%), Positives = 191/411 (46%), Gaps = 31/411 (7%)
Query: 35 VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
++R + +P+G + +V P + F P G+P SV+ YL Y + ++Q
Sbjct: 44 LSRAFTVKPEGRDAGEVGASGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
F S+ L+TQ++L G+ A A +W++KD +G+++ + G+++D
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161
Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
K+WR+ AD L D+ LE+++P+CP F N AK I VA ATR + A+
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVCPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221
Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
N++D+ AK + TL N+ GL + + V + L+ +H+Y+ +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281
Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
+ TLN R +++ +++ G + P R + L+ G A ++ +G H++V
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RMEPLWTG--FWPAPSLSLGVPLHRLV- 337
Query: 332 PSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR----GWLVAAYAAS-- 381
S EL++L +E ++LC +VL Q A + LR G ++ A
Sbjct: 338 -SSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQKAGPKTILRAATHGLMLGALQGDGP 396
Query: 382 -----------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
+ S V+++ ++ ++ +F L LQ GW T++
Sbjct: 397 LPAELEELRNRVRAGPKKESWVVVKETHEVLDMLFPKFLKGLQDAGWKTEK 447
>gi|225579041|ref|NP_663565.2| UPF0420 protein C16orf58 homolog [Mus musculus]
gi|74204001|dbj|BAE29003.1| unnamed protein product [Mus musculus]
gi|148685697|gb|EDL17644.1| cDNA sequence BC017158, isoform CRA_b [Mus musculus]
Length = 466
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 108/381 (28%), Positives = 181/381 (47%), Gaps = 39/381 (10%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--C 244
N AK I VA ATR + A+ N++D+ AK + T+ N+ GL + + V C
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
S+ V LL+ +H+Y+ +RA + TLN R +++ F++ G + PA
Sbjct: 255 PSLSLGCFV--LLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQM 312
Query: 305 EDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSRWTD-- 357
E L +P ++ +G H +V S ELK+L E ++LC
Sbjct: 313 EPLWTGFWPSL------SLSLGVPLHHLV--SSVSELKQLVEGHHEPYLLCWNKSRNQVR 364
Query: 358 MVLEQNASGEDALR----GWLVAAYA------ASMAKSFH----DP---SLTVLQDAYDK 400
+ L Q A E LR G ++ A +A+ H DP S ++++ +
Sbjct: 365 VALSQEAGPETVLRAATHGLILGALQEDGPLPGELAQLRHQVQADPKKESWILVRETHQV 424
Query: 401 MNDVFTPLLSELQAKGWHTDR 421
++ +F L LQA GW T++
Sbjct: 425 LDTLFPKFLKGLQAAGWKTEK 445
>gi|357612137|gb|EHJ67828.1| hypothetical protein KGM_02553 [Danaus plexippus]
Length = 525
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/367 (25%), Positives = 176/367 (47%), Gaps = 21/367 (5%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
F F P GYP SV++ Y+ Y + Q F S +L+TQ + G+ T A A
Sbjct: 47 FAKIFLPQGYPNSVSKDYIFYQIWDTAQAFCSTITGILATQEVFRGVGVGDTNASPLAAT 106
Query: 124 VSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
V+W+ KDG H+GK++ + G +D+ K+WR+ AD L D +E+ PL
Sbjct: 107 VTWVFKDGCGHIGKILFAYTHGTYLDAYSKKWRLYADTLNDAAMCIEIALPLFKNYITFA 166
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ K I VA ATR+ + A GNL+D+ AK A T N++ A + L S
Sbjct: 167 LCVSTCMKAIVGVAGGATRVAMTQHHALRGNLADVSAKDSAQETAVNLIASFAALFLISL 226
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
+ +S+ + +L ++H+ +RA + TLN R ++ +++ I++P ++
Sbjct: 227 IGNSVT----IFIILLIMHIVFNYMAVRAVCLRTLNEPRFLQVIDTYLRKEVIANPCEIN 282
Query: 303 YREDLLF----PGRL-IEDAG-NVKVGRDPHKVVKPSK----FHELKELFAEEKFILC-- 350
E ++F P L ++ G ++ +G K++ P +++K+++ ++K+I+
Sbjct: 283 RNEPIIFYQLGPNLLDLKICGFHIIIGDSISKILNPRTNAVYINKVKDIYNDKKYIIHPD 342
Query: 351 LGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLS 410
G+R + +++AS +D L + + A + + +D L + + F +
Sbjct: 343 TGNRVMYVFPKEDASVDDMLCAYFQSVLLAIITCAINDHQLAIFSS--NNNTKPFAQVCV 400
Query: 411 ELQAKGW 417
LQ+ W
Sbjct: 401 TLQSAEW 407
>gi|74201602|dbj|BAE28428.1| unnamed protein product [Mus musculus]
Length = 466
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 108/381 (28%), Positives = 181/381 (47%), Gaps = 39/381 (10%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--C 244
N AK I VA ATR + A+ N++D+ AK + T+ N+ GL + + V C
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
S+ V LL+ +H+Y+ +RA + TLN R +++ F++ G + PA
Sbjct: 255 PSLSLGCFV--LLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQM 312
Query: 305 EDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSRWTD-- 357
E L +P ++ +G H +V S ELK+L E ++LC
Sbjct: 313 EPLWTGFWPSL------SLSLGVPLHHLV--SSVSELKQLVEGHHEPYLLCWNKSRNQVR 364
Query: 358 MVLEQNASGEDALR----GWLVAAYA------ASMAKSFH----DP---SLTVLQDAYDK 400
+ L Q A E LR G ++ A +A+ H DP S ++++ +
Sbjct: 365 VALSQEAGPETVLRAATHGLILGALQEDGPLPGELAQLRHQVQADPKKESWILVRETHQV 424
Query: 401 MNDVFTPLLSELQAKGWHTDR 421
++ +F L LQA GW T++
Sbjct: 425 LDTLFPKFLKGLQAAGWKTEK 445
>gi|291410995|ref|XP_002721793.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
Length = 468
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 174/376 (46%), Gaps = 29/376 (7%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 77 FLPQGFPESVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGLGVGNAKATVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDIAMFLEIMAPVFPVCFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ LL+ +H+Y+ +RA + TLN R +++ F++ G + P E
Sbjct: 257 PSVSLGCFFLLTALHIYANYRAVRALVMETLNESRLRLVLKHFLQQGEVLDPTPANQMEP 316
Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELF--AEEKFILCL--GSRWTDMVLEQ 362
L+ G + + +G H +V S EL++L +E ++LC +VL Q
Sbjct: 317 -LWTG--FSPSPALSLGVPLHCLV--SSMSELQQLIDGHQEPYLLCWDQSRNQVQVVLSQ 371
Query: 363 NASGEDALRGWLVAAYAASMAKSFHDP-----------------SLTVLQDAYDKMNDVF 405
A E LR ++ + P S V+++ + ++ +F
Sbjct: 372 MAGPETVLRAATHGLVLGALREDGPLPGELEELRRWVRAGPEKDSWVVVKETHKVLDKLF 431
Query: 406 TPLLSELQAKGWHTDR 421
L LQA GW T++
Sbjct: 432 PKFLKGLQAAGWKTEK 447
>gi|395514351|ref|XP_003761381.1| PREDICTED: UPF0420 protein C16orf58 homolog [Sarcophilus harrisii]
Length = 455
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 115/410 (28%), Positives = 195/410 (47%), Gaps = 41/410 (10%)
Query: 57 PVYHRVVDSFLNK-------FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL 109
P HR + S L+ F P G+P SV+ YL Y + +Q F S+ L+TQ++L
Sbjct: 45 PRLHRPLTSLLSSLPPSQAVFLPHGFPDSVSPDYLPYQLWDTVQAFASSLTGSLATQAVL 104
Query: 110 FAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGT 166
G+ A A ++W+LKDG +G++ + L G+++D + K+WR+ AD L D
Sbjct: 105 QGVGVGDAAASVSAATITWLLKDGTGMLGRIGFAWLMGSKLDCDAKKWRLFADVLNDAAM 164
Query: 167 GLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIST 226
LE+++P P F + + + AK I VA ATR + A+ N++D+ AK + T
Sbjct: 165 FLEIVAPAFPSCFTLIVCVSSLAKCIVGVAGGATRAALTMHQARRDNMADVSAKDGSQET 224
Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
L N+ GL + L V + + L + +H+Y+ +RA + T N R +++
Sbjct: 225 LVNLAGLLVSLLLLPLVSERPRLSLAAFFLFTALHLYANYRAVRAVIMETFNETRLYLVL 284
Query: 287 ADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--E 344
F++ G + +P R + L+ G L A ++ +G H +V S EL+++ +
Sbjct: 285 KHFIQWGEVLTPPSAN-RMEPLWTGCL--PALSLSLGVALHSLV--SSAFELQQVSEGHQ 339
Query: 345 EKFIL----CLGSRWTDMVLEQNASGEDALR----GWLVAA-------------YAASMA 383
+ ++L C G + L Q AS LR G L+AA +
Sbjct: 340 QPYLLRWNQCRGK--IQVTLGQEASAGVILRAATHGLLLAALLEDVPLTEELETLRGQIR 397
Query: 384 KSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD-RFLDGTGTRFAW 432
+ S T++++ Y ++ ++ L ELQA GW T+ L+ R AW
Sbjct: 398 AAPEGDSWTIVKETYQILDKLYPKFLKELQAVGWETEHHLLEVDEWRAAW 447
>gi|219119716|ref|XP_002180612.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408085|gb|EEC48020.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 408
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/366 (26%), Positives = 175/366 (47%), Gaps = 16/366 (4%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS 125
+ F P+ +P SV GY ++ F L + +A VLSTQ+LL A G+ A+V++
Sbjct: 26 LIQHFLPAQFPASVAPGYAQFCGFCFLASVSGSAGMVLSTQTLLLAIGVVGNAHSASVMA 85
Query: 126 ----WILKDGMQHVGKLI-CSNLGA--RMDSEPKRWRILADALYDLGTGLEVLSPLCPQ- 177
W+LKDG+ +G +I S +G R D+ PK+WR++ D + L++LSP P
Sbjct: 86 GALNWVLKDGIGQLGGVIFASRMGETRRFDANPKKWRMMGALCLDSASLLDILSPFAPSA 145
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
+ L +A N K I + A+R ++ + + GNL D+ AK + S ++G G GI
Sbjct: 146 VVLPLACFANALKNIGYLTVGASRAALHQALTRAGNLGDVTAKAGSQSIAAGLLGTGVGI 205
Query: 238 QLASTVCSSMQGKMIVGPL-LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
L++ + G ++G LS+IH ++ ++ N R +++ +++ G++
Sbjct: 206 GLSAAMGHD-AGNFVLGFCGLSLIHQLCNFAALQHVDLHHFNRHRLYLVLDHYLRHGKVL 264
Query: 297 SPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA---EEKFIL-CLG 352
SP ++ +E R + A + +G + P +EL +L +E +I+ C
Sbjct: 265 SPREIATKESFFPLLREDDTASWLSIG-SALTAISPGGPNELTQLLKACPDESYIINCTN 323
Query: 353 SRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSEL 412
+V Q+A GED ++G A + D V+ +++ F L EL
Sbjct: 324 DGKIHVVFLQDAQGEDLVKGMFHACFLRKKTGLSIDGVEAVVA-THEETKVRFPHFLEEL 382
Query: 413 QAKGWH 418
+ W+
Sbjct: 383 NRQAWN 388
>gi|334332835|ref|XP_001370394.2| PREDICTED: UPF0420 protein C16orf58 homolog [Monodelphis domestica]
Length = 442
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 116/437 (26%), Positives = 198/437 (45%), Gaps = 45/437 (10%)
Query: 19 MSVQEIPI---YWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGY 75
MS E P+ W + R PDG LS + + + F P G+
Sbjct: 1 MSGSEGPVAIKCWERFGSRLPRSCTVAPDGSLSWEAGGGPGQGLRAALTAV---FLPHGF 57
Query: 76 PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQ 133
P SV+ YL Y + +Q F S+ L+TQ++L G+ A A ++W+LKDG
Sbjct: 58 PDSVSPDYLPYQLWDTVQAFASSLTGSLATQAVLQGVGVGDAAASVSAATITWLLKDGTG 117
Query: 134 HVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
+G++ + L G+R+D E K+WR+ AD L D LE+++P P F + + + AK I
Sbjct: 118 MLGRIGFAWLMGSRLDCEAKKWRLFADVLNDAAMFLEIVAPGFPSCFTLIVCVSSLAKCI 177
Query: 193 AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMI 252
VA ATR + A+ N++D+ AK + TL N+ GL + L V + +
Sbjct: 178 VGVAGGATRAALTMHQARRDNMADVSAKDGSQETLVNLAGLLVSLLLLPLVSNRPSLSLA 237
Query: 253 VGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGR 312
L +++H+Y+ +RA + T N R +++ F++ G++ +P R + L+ G
Sbjct: 238 AFFLFTILHLYANFRAVRAVIMETFNEARLYLVLKHFLQWGKVLTPPSAN-RMEPLWTG- 295
Query: 313 LIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFIL----CLGSRWTDMVLEQNASG 366
A ++ +G H ++ S EL+++ ++ ++L C G + L Q S
Sbjct: 296 -CHSALSLSLGVPLHSLI--SSVFELQQVSEGHQQPYLLRWDRCRGQ--IQVTLGQEVST 350
Query: 367 EDALRGWLVAAYAASMAKSFHD--------------------PSLTVLQDAYDKMNDVFT 406
LR A + +A D S T++++ Y ++ ++
Sbjct: 351 GVILRA---ATHGLLLATLLEDVPLMEELETLRGQIRAAPEGDSWTIVKETYQILDKLYP 407
Query: 407 PLLSELQAKGWHTDRFL 423
L ELQA GW T+ L
Sbjct: 408 KFLKELQAVGWETEHHL 424
>gi|440913323|gb|ELR62787.1| hypothetical protein M91_19503, partial [Bos grunniens mutus]
Length = 437
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 104/384 (27%), Positives = 182/384 (47%), Gaps = 37/384 (9%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
L F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A
Sbjct: 42 LLAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAAT 101
Query: 124 VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+W++KD +G+++ + +G++MD K+WR+ AD L D+ LE+++P+ P F
Sbjct: 102 ATWLVKDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTIT 161
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ N AK + VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 162 VCISNLAKCLVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLAGLLVSLLMLPM 221
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
V S + L+ +H+Y+ +RA + TLN QR +++ F++ G + P
Sbjct: 222 VSDSPSLSLGCFFFLTALHIYANYRAVRALVIETLNEQRLWLVLRHFLQRGEVLGPTSAN 281
Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSRW----- 355
E L+ G + ++ +G H V S EL++L +E ++L RW
Sbjct: 282 QMEP-LWTG--FWSSVSLSLGAPLHHVT--SSVSELQQLVEGHQEPYLL----RWDQSRN 332
Query: 356 -TDMVLEQNASGEDALR----GWLVAAYAA------------SMAKSFHD-PSLTVLQDA 397
+VL Q A E LR G ++ A + A++ + S V+++
Sbjct: 333 QVQVVLSQMAGPEAILRAATHGLVLGALRGDGPLPEELEELRNQARAGPEKESWVVVRET 392
Query: 398 YDKMNDVFTPLLSELQAKGWHTDR 421
+ ++ +F L LQ GW T++
Sbjct: 393 HQVLDKLFPKFLKGLQDAGWSTEK 416
>gi|281201954|gb|EFA76161.1| DUF647 family protein [Polysphondylium pallidum PN500]
Length = 430
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 103/395 (26%), Positives = 182/395 (46%), Gaps = 51/395 (12%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P+GYP SV Y+ Y + Q +S+ VL+T+++L G+ A + V+ WI
Sbjct: 51 FLPAGYPDSVTSDYMSYQVWDTAQALSSSLSGVLATRAMLRGYGVGNAAASVGSAVLQWI 110
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L+DG +G++ + G ++D K WR +AD L D+G LE+LSPL LF+ ++ G
Sbjct: 111 LRDGAGMIGRIYFAWKTGRQLDCNAKTWRFMADILNDIGMTLEMLSPLSNVLFMPLSVCG 170
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--- 243
+K I VA T+ + FAK N+ D+ AK + T +VG+ G L++++
Sbjct: 171 IVSKSICGVAGGCTKASLTMHFAKNDNMGDVSAKDGSQETAVGLVGMFLGAALSASIPDE 230
Query: 244 ----CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
++ ++ +L H+Y + + ++N R +I +++ G I +P
Sbjct: 231 SEQTTTATVAIWMLFFILIAFHLYCNYRAVTGVVLTSINRYRATLIYKHYLQYGEIPTPI 290
Query: 300 DLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKF-------HELKELFAEEKFILCLG 352
+ RE++LF +D + +G ++ +K E++E L
Sbjct: 291 QVSKRENILFR----DDKQPIYLGASLLNIINLNKQQQQQQQSKEIRESKTNSIIRHKLE 346
Query: 353 SRWTD---------MVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMND 403
+R+ D + L+ + + D L+ AY S+A +L + N
Sbjct: 347 NRYIDIWQFQQEIYISLKVHCTAVD-----LIQAYLYSVA---------ILHQTKQQNNR 392
Query: 404 VFTPLLS------ELQAKGWHTDR-FLDGTGTRFA 431
F L++ L +KGW+TDR LD G R++
Sbjct: 393 AFIDLVTSKSFNDRLLSKGWNTDRQALDAEGWRYS 427
>gi|354505561|ref|XP_003514836.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cricetulus griseus]
gi|344256981|gb|EGW13085.1| UPF0420 protein C16orf58-like [Cricetulus griseus]
Length = 466
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 105/381 (27%), Positives = 180/381 (47%), Gaps = 39/381 (10%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSTDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGDAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
LKD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F +
Sbjct: 135 LKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADVLNDIAMFLEIMAPMYPIFFTMTVSIS 194
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--C 244
N AK I VA ATR + A+ N++D+ AK + T+ N+ GL + + V C
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
S+ + LL+ +H+Y+ +RA + TLN R +++ F++ G + +PA
Sbjct: 255 PSLSLGCFI--LLTALHIYANYRAVRALVLETLNESRLQLLLKHFLQRGEVLAPASANQM 312
Query: 305 EDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLG--SRWTD 357
E L +P ++ +G H +V S EL++L +E ++LC
Sbjct: 313 EPLWTGFWPSL------SLSLGVPLHHLV--SSVFELEQLVEGHQEPYLLCWNQSQNQVQ 364
Query: 358 MVLEQNASGEDALR--------------GWLVAAYAASMAKSFHDP---SLTVLQDAYDK 400
+ L Q A E LR G L A ++ P S ++++ +
Sbjct: 365 VALSQVAGPEAVLRAATHGLVLGALQEDGPLPKELAELREQARAGPKKESWILVRETHQV 424
Query: 401 MNDVFTPLLSELQAKGWHTDR 421
++ +F L LQA GW T++
Sbjct: 425 LDTLFPKFLKGLQAAGWKTEK 445
>gi|78486556|ref|NP_001030594.1| UPF0420 protein C16orf58 homolog [Rattus norvegicus]
gi|123781627|sp|Q499P8.1|CP058_RAT RecName: Full=UPF0420 protein C16orf58 homolog
gi|71682994|gb|AAH99813.1| Similar to cDNA sequence BC017158 [Rattus norvegicus]
Length = 466
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 101/379 (26%), Positives = 176/379 (46%), Gaps = 35/379 (9%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL+Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D LE+++P+ P F
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTVSTS 194
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + T+ N+ GL + + V
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ + LL+ +H+Y+ +RA + TLN R +++ F++ G + PA E
Sbjct: 255 LSLSLGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHFLQRGEVLEPASANQMEP 314
Query: 307 L---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLG--SRWTDMV 359
L +P ++ +G H +V S ELK+L +E ++LC +
Sbjct: 315 LWTGFWPSL------SLSLGVPLHHLV--SSVSELKQLVEGHQEPYLLCWNQSQNQVQVA 366
Query: 360 LEQNASGEDALR----GWLVAAYAAS-------------MAKSFHDPSLTVLQDAYDKMN 402
L Q A E LR G ++ A + + S ++++ + ++
Sbjct: 367 LSQVAGPETVLRAATHGLILGALQEDGPLPGELAELRDMVQAGPKNESWILVRETHQVLD 426
Query: 403 DVFTPLLSELQAKGWHTDR 421
+F L LQA GW T++
Sbjct: 427 TLFPKFLKGLQAAGWKTEK 445
>gi|357484331|ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
gi|355513788|gb|AES95411.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
Length = 630
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 85/303 (28%), Positives = 161/303 (53%), Gaps = 7/303 (2%)
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RP 116
+Y++ + + P G+P SV YL Y+ +R +Q VL+TQ+LL+A GL +
Sbjct: 167 LYNKCREFVVRLMLPEGFPNSVTSDYLEYSLWRGVQGVACQVSGVLATQALLYAVGLGKG 226
Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
A ++W+LKDG+ ++ K++ S+ G D PK WR+ AD L + GLE+ +P P
Sbjct: 227 AIPTAAAINWVLKDGIGYLSKILLSDFGRHFDVNPKGWRLFADLLENAAFGLEMCTPAFP 286
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
LF+ + ++ A + +TR ++ FA + N +++ AKGE + +G+G G
Sbjct: 287 HLFVPIGAFAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMVSRFIGIGIG 346
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
I L + + SS + +++ +H+Y ++ ++ + TLNP R +++ ++++ +G+ +
Sbjct: 347 IGLGNCIGSSTPLVLASFCVVTWVHMYCNLKSYQSIQLRTLNPHRASLVFSEYLLSGQ-A 405
Query: 297 SPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWT 356
P E+ LFP I +A ++ +V S E K+ E + L LGS+ +
Sbjct: 406 PPVKEVNAEEPLFPAVPILNAPFAN--KETQSIVLSS---EAKDAAVEIESRLQLGSKLS 460
Query: 357 DMV 359
+++
Sbjct: 461 EII 463
>gi|168013684|ref|XP_001759421.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689351|gb|EDQ75723.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 712
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 91/302 (30%), Positives = 158/302 (52%), Gaps = 27/302 (8%)
Query: 76 PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGM 132
P V YL Y +++ + F S+ L TQ++L+A G+ R PA A ++W++KDG+
Sbjct: 283 PDQVWPHYLVYLKWKLVHRFLSSVLHFQCTQAMLWAVGVGAKRRLPA-AAALNWVMKDGV 341
Query: 133 QHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
+GKL+ +N G DS+ KR R L+ GLE+L+PL PQ FL +A L N AK
Sbjct: 342 GRLGKLLFTANFGRTFDSDLKRVRFWTSVLFSSSVGLEILTPLFPQHFLLLATLANIAKS 401
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK- 250
IA A AT I+ SFA NL+D+ AKG A + + + +GL A + L+ + + + +
Sbjct: 402 IAYAAYLATSSAIHRSFALGDNLADISAKGMAQTVVADNLGLAAAVCLSQFIHNFPKIEK 461
Query: 251 ---MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+ + P+L+ ++++ ++++A + TLN +R +I+ +VK RI S ++ E++
Sbjct: 462 FLPLAMYPVLASAELFAIYQQLQAVHLQTLNKERLEIIINIWVKDRRIPSFEEVSKIENV 521
Query: 308 ----LFP--------------GRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFIL 349
LF G ++ ++++G + + P + L +K+ L
Sbjct: 522 ALLDLFRVFGVNMVFANGGEGGSGFKNQLSLRIGAVSLRYLTPEGLMNIVNLRETQKYTL 581
Query: 350 CL 351
CL
Sbjct: 582 CL 583
>gi|197101105|ref|NP_001126133.1| UPF0420 protein C16orf58 homolog [Pongo abelii]
gi|75041556|sp|Q5R8F6.1|CP058_PONAB RecName: Full=UPF0420 protein C16orf58 homolog
gi|55730464|emb|CAH91954.1| hypothetical protein [Pongo abelii]
Length = 468
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 105/411 (25%), Positives = 191/411 (46%), Gaps = 31/411 (7%)
Query: 35 VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
++R + +P+G S +V P + F P G+P SV+ YL Y + ++Q
Sbjct: 44 LSRAFTVKPEGRDSGEVGAPGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
F S L+TQ++L G+ A A +W++KD +G+++ + G+++D
Sbjct: 102 FASGLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161
Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
K+WR+ AD L D+ LE+++P+ P F N AK I VA ATR + A+
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221
Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
N++D+ AK + TL N+VGL + + V + L+ +H+Y+ +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLVGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281
Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
+ TLN R +++ +++ G + +P R + L+ G A ++ +G H++V
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLNPTAAN-RMEPLWTG--FWPAPSLSLGVPLHRLV- 337
Query: 332 PSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR----GWLVAAYAAS-- 381
S EL++L +E ++LC +VL Q A + LR G ++ A
Sbjct: 338 -SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTILRAATHGLMLGALQGDGP 396
Query: 382 -----------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
+ S ++++ ++ ++ +F L LQ GW T++
Sbjct: 397 LPAELEELRNRVQAGPKKESWVIVKETHEVLDMLFPKFLKGLQDAGWKTEK 447
>gi|149067630|gb|EDM17182.1| similar to cDNA sequence BC017158, isoform CRA_b [Rattus
norvegicus]
Length = 466
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 101/379 (26%), Positives = 175/379 (46%), Gaps = 35/379 (9%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL+Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D LE+++P+ P F
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTVSTS 194
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + T+ N+ GL + + V
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ LL+ +H+Y+ +RA + TLN R +++ F++ G + PA E
Sbjct: 255 LSLSFGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHFLQRGEVLEPASANQMEP 314
Query: 307 L---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLG--SRWTDMV 359
L +P ++ +G H +V S ELK+L +E ++LC +
Sbjct: 315 LWTGFWPSL------SLSLGVPLHHLV--SSVSELKQLVEGHQEPYLLCWNQSQNQVQVA 366
Query: 360 LEQNASGEDALR----GWLVAAYAAS-------------MAKSFHDPSLTVLQDAYDKMN 402
L Q A E LR G ++ A + + S ++++ + ++
Sbjct: 367 LSQVAGPETVLRAATHGLILGALQEDGPLPGELAELRDRVQAGPKNESWILVRETHQVLD 426
Query: 403 DVFTPLLSELQAKGWHTDR 421
+F L LQA GW T++
Sbjct: 427 TLFPKFLKGLQAAGWKTEK 445
>gi|449452843|ref|XP_004144168.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
[Cucumis sativus]
Length = 437
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 97/393 (24%), Positives = 185/393 (47%), Gaps = 39/393 (9%)
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
R V+ R D+F+ P G+P SV Y+ + + +LQ ++ ++LSTQ+LL A G+
Sbjct: 45 RLVWRRFFDAFV----PEGFPSSVTPDYVPFQIWDSLQGLSTYIRTMLSTQALLSAIGVG 100
Query: 116 PTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLS 172
A W L+D +G ++ + G+ +DS K WR++AD + DLG ++++S
Sbjct: 101 EQSATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLVS 160
Query: 173 PLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG 232
PL P F+ + LG+ ++ VA+ ATR + FA + N +D+ AK + T+ ++G
Sbjct: 161 PLFPSAFMLIVCLGSLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMIG 220
Query: 233 LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
+ G+ LA + L++ H+Y+ + + + +LNPQR ++ + +++T
Sbjct: 221 MAIGMLLARITMGCPLAIWLSFLSLTMFHMYANYKAVCCLSLTSLNPQRCSIALQHYIQT 280
Query: 293 GRISSPADLRYREDLLFPG----------RLIEDAGNVKVG---RDPHKVVKPSKFHELK 339
+ SP + E +L P +LI V++G H + F++L
Sbjct: 281 AEVLSPQQVSMMEHVL-PAWASSWSLRSIKLIHT--RVRLGARISSLHHTEMXAFFNQLM 337
Query: 340 EL-----FAEEKFILCLGSRWTDMVLEQNASGEDALRGW---LVAAYAASMAKSFHDPSL 391
+ K++L ++++++A+ D + + LV AY + +
Sbjct: 338 NFVKSSGYKGAKYLLVERKETISVIVQKDATANDIFQSFVHALVMAYVPDQESRYFE--- 394
Query: 392 TVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLD 424
+ M+ + + +L+ GW TDR L
Sbjct: 395 -----SMSWMDKHYECFIEKLKISGWKTDRLLS 422
>gi|328778772|ref|XP_394441.3| PREDICTED: UPF0420 protein C16orf58 homolog [Apis mellifera]
Length = 464
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 178/370 (48%), Gaps = 21/370 (5%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
++ Y ++ F P GYP SV+ Y Y + +Q F S + L+T S++ G+
Sbjct: 38 TKSFYSGLISIIKEVFLPQGYPDSVHPDYTSYQIWDTVQAFASTIMGTLTTHSIMQGVGV 97
Query: 115 --RPTPAQATVVSWILKDGMQHVGKLICS--NLGARMDSEPKRWRILADALYDLGTGLEV 170
A ++WILKDG VG+++ + N G +D + K+WR+ AD L DL GLE+
Sbjct: 98 GEAAATPLAAAITWILKDGTGMVGRIMFAWWN-GTDLDGQCKKWRLFADILNDLAMGLEL 156
Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
L P L + + K I VA ATR + A + NL+D+ AK + T N+
Sbjct: 157 LLPYFSSYSLGILCISTAMKSIVGVAGGATRAALTQHQALQNNLADVSAKDGSQETCVNL 216
Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+ GI + S + + Q M + L +IH+Y+ ++A +N+LN R A+IV ++
Sbjct: 217 IASFVGILILS-IFHNGQYIMELYLFLVIIHLYANYSAVKALCLNSLNEDRLALIVKSYI 275
Query: 291 KTGRISSPADLRYRED-LLFPGRLIEDAG-NVKVGRDPHKVVKPSKF----HELK-ELFA 343
I P ++ +E LL + G N+K+G ++K EL +LF
Sbjct: 276 SNEVIPKPEEVNKKESVLLLTKSTMNICGFNIKIGVSLAALIKKDLILTSDTELSLKLFL 335
Query: 344 EEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDK- 400
+ K+++ + ++ + L+++A D L + + A + F SL V D + K
Sbjct: 336 DRKYLISIDVQNKTIFICLKKDAQPYDVLEAY----FHACLCGLFICMSLKVPLDVFLKP 391
Query: 401 -MNDVFTPLL 409
++D+ PL+
Sbjct: 392 EVSDLSYPLM 401
>gi|207080128|ref|NP_001128778.1| DKFZP468D197 protein [Pongo abelii]
gi|55727564|emb|CAH90537.1| hypothetical protein [Pongo abelii]
Length = 466
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 106/413 (25%), Positives = 193/413 (46%), Gaps = 37/413 (8%)
Query: 35 VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
++R + +P+G S +V P + F P G+P SV+ YL Y + ++Q
Sbjct: 44 LSRAFTVKPEGRDSGEVGAPGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
F S+ L+TQ++L G+ A A +W++KD +G+++ + G+++D
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161
Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
K+WR+ AD L D+ LE+++P+ P F N AK I VA ATR + A+
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221
Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--CSSMQGKMIVGPLLSVIHVYSVIEEM 269
N++D+ AK + TL N+VGL + + V C + L+ +H+Y+ +
Sbjct: 222 NNMADVSAKDSSQETLVNLVGLLVSLLMLPLVSGCPGLGCFF----FLTALHIYANYRAV 277
Query: 270 RAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKV 329
RA + TLN R +++ +++ G + +P R + L+ G A ++ +G H++
Sbjct: 278 RALVMETLNEGRLRLVLKHYLQRGEVLNPTAAN-RMEPLWTG--FWPAPSLSLGVPLHRL 334
Query: 330 VKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR----GWLVAAYAAS 381
V S EL++L +E ++LC +VL Q A + LR G ++ A
Sbjct: 335 V--SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTILRAATHGLMLGALQGD 392
Query: 382 -------------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
+ S ++++ ++ ++ +F L LQ GW T++
Sbjct: 393 GPLPAELEELRNRVQAGPKKESWVIVKETHEVLDMLFPKFLKGLQDAGWKTEK 445
>gi|312374874|gb|EFR22346.1| hypothetical protein AND_15402 [Anopheles darlingi]
Length = 417
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 99/381 (25%), Positives = 181/381 (47%), Gaps = 28/381 (7%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
F + F P GYP SV+ YL Y ++ +Q F S L+T ++L G+ A +
Sbjct: 49 FQHLFLPVGYPDSVSSDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDSVTALSAT 108
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLS-PLCPQLFLE 181
V+W+LKDGM H+G+++ + G +D + K+WRI AD L D+ +++ P P+
Sbjct: 109 VTWVLKDGMGHLGRILFAWWKGTELDIDSKKWRIRADVLNDIAMAIDLFVLPYYPKASTY 168
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+ K I VA ATR + A GNL+D+ +K A T N++ G+ L
Sbjct: 169 ILCGTTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLLLL- 227
Query: 242 TVCSSMQGKMIVGPLLSVI---HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
+ +Q + I+ L + + H+Y+ I+ ++A + T N R + + ++ K+G + SP
Sbjct: 228 ---TYLQNQRILYGLFAFVTFLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGMMLSP 284
Query: 299 ADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSK-FHELKELFA----EEKFILCLGS 353
+ E + G+ + VK+G ++ + + ++L+ L A EKF++
Sbjct: 285 RQVNKLERVTV-GQTVSLTARVKIGCSVRELTEFYRNCYDLENLIACFDSREKFLIAETR 343
Query: 354 RWTDMVLEQNASGEDALRGWL-VAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSEL 412
+ + L + D ++ + VA+Y ++ L+D Y ++ + + L+
Sbjct: 344 HYVGVYLHFSVKPLDIIKSYFYVASYLQDKSQ---------LRDRYWEIQNKWNEFLNLA 394
Query: 413 QAKGWHTDRFLDGTGT-RFAW 432
Q +GW+ L T R W
Sbjct: 395 QCEGWNVQAHLLKTDEYRLDW 415
>gi|344294431|ref|XP_003418921.1| PREDICTED: UPF0420 protein C16orf58 homolog [Loxodonta africana]
Length = 468
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 103/390 (26%), Positives = 176/390 (45%), Gaps = 43/390 (11%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
+ L F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A
Sbjct: 70 LSGLLAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGALATQAVLLGIGVGDAKASVS 129
Query: 121 ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
A +W++KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 130 AATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVLPICF 189
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 190 TITVCTSNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
V + L+ +H+Y+ +RA + TLN R +I+ F++ G + SP
Sbjct: 250 LPLVSDCPSVSLGCFFFLTALHIYANYRAVRALVMETLNEGRLWLILKHFLQKGEVLSPT 309
Query: 300 DLRYREDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSR 354
E L +P ++ +G H ++ S EL++L +E ++L
Sbjct: 310 SANQMEPLWTGFWPSL------SLSLGVPLHCLI--STVSELQQLIGGHQEPYLL----H 357
Query: 355 W------TDMVLEQNASGEDALR----GWLVAAYAA-------------SMAKSFHDPSL 391
W +VL Q A E LR G ++ A M S
Sbjct: 358 WDQSRNQVQVVLSQTAGPETVLRAATHGLVLGALQGDGPLPGELEELRNQMWAGPEKDSW 417
Query: 392 TVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
++++ + ++ +F L LQ GW T++
Sbjct: 418 AIVKETHQVLDKLFPKFLKGLQDAGWKTEK 447
>gi|257900512|ref|NP_073581.2| UPF0420 protein C16orf58 [Homo sapiens]
gi|143955305|sp|Q96GQ5.2|CP058_HUMAN RecName: Full=UPF0420 protein C16orf58
gi|62897385|dbj|BAD96633.1| hypothetical protein FLJ13868 variant [Homo sapiens]
gi|119572504|gb|EAW52119.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
gi|119572505|gb|EAW52120.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
gi|119572506|gb|EAW52121.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
Length = 468
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 104/411 (25%), Positives = 190/411 (46%), Gaps = 31/411 (7%)
Query: 35 VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
++R + +P+G + +V P + F P G+P SV+ YL Y + ++Q
Sbjct: 44 LSRAFTVKPEGRDAGEVGASGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
F S+ L+TQ++L G+ A A +W++KD +G+++ + G+++D
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161
Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
K+WR+ AD L D+ LE+++P+ P F N AK I VA ATR + A+
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221
Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
N++D+ AK + TL N+ GL + + V + L+ +H+Y+ +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281
Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
+ TLN R +++ +++ G + P R + L+ G A ++ +G H++V
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RMEPLWTG--FWPAPSLSLGVPLHRLV- 337
Query: 332 PSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR----GWLVAAYAAS-- 381
S EL++L +E ++LC +VL Q A + LR G ++ A
Sbjct: 338 -SSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQKAGPKTILRAATHGLMLGALQGDGP 396
Query: 382 -----------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
+ S V+++ ++ ++ +F L LQ GW T++
Sbjct: 397 LPAELEELRNRVRAGPKKESWVVVKETHEVLDMLFPKFLKGLQDAGWKTEK 447
>gi|156379270|ref|XP_001631381.1| predicted protein [Nematostella vectensis]
gi|156218420|gb|EDO39318.1| predicted protein [Nematostella vectensis]
Length = 259
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 131/245 (53%), Gaps = 3/245 (1%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVS 125
+ F P GYP SV++ YL Y + +Q F S+ L+TQ++L + G A A ++
Sbjct: 3 SAFLPQGYPESVSKDYLTYQFWDTVQAFCSSITGTLATQAMLKGYGVGNESATALAATMT 62
Query: 126 WILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
WIL+ G VG ++ + + G+ +D K+WR+ AD + DL +E++SP F +A
Sbjct: 63 WILRSGTGMVGSILFAWMQGSNLDCNAKKWRLFADIMNDLAILVELISPYFKAYFTLLAC 122
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
L + AK I VA +TR + A+ N++D+ AK + TL N++ L AG+ + V
Sbjct: 123 LSSVAKSIVGVAGGSTRAALTQHQARRDNMADVAAKDGSQETLVNLMALLAGLVITPLVT 182
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
++ + + +H+Y+ + + + TLN R ++ +F++TG + SP ++
Sbjct: 183 GHTLLTWVLFIIFTFLHLYANYCAVSSVVMETLNTTRLQIVTNEFLRTGHVLSPDEVGKM 242
Query: 305 EDLLF 309
E ++F
Sbjct: 243 EPVIF 247
>gi|10436021|dbj|BAB14730.1| unnamed protein product [Homo sapiens]
Length = 468
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 104/411 (25%), Positives = 190/411 (46%), Gaps = 31/411 (7%)
Query: 35 VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
++R + +P+G + +V P + F P G+P SV+ YL Y + ++Q
Sbjct: 44 LSRAFTVKPEGRDAGEVGASGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
F S+ L+TQ++L G+ A A +W++KD +G+++ + G+++D
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161
Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
K+WR+ AD L D+ LE+++P+ P F N AK I VA ATR + A+
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221
Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
N++D+ AK + TL N+ GL + + V + L+ +H+Y+ +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281
Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
+ TLN R +++ +++ G + P R + L+ G A ++ +G H++V
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RMEPLWTG--FWPAPSLSLGVPLHRLV- 337
Query: 332 PSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR----GWLVAAYAAS-- 381
S EL++L +E ++LC +VL Q A + LR G ++ A
Sbjct: 338 -SSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQKAGPKTILRAATHGLMLGALQGDGP 396
Query: 382 -----------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
+ S V+++ ++ ++ +F L LQ GW T++
Sbjct: 397 LPAELEELRNRVRAGPKKESWVVVKETHEVLDMLFPKFLKGLQDAGWKTEK 447
>gi|255562731|ref|XP_002522371.1| mRNA, complete cds, clone: rafl24-09-C04, putative [Ricinus
communis]
gi|223538449|gb|EEF40055.1| mRNA, complete cds, clone: rafl24-09-C04, putative [Ricinus
communis]
Length = 523
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 99/390 (25%), Positives = 184/390 (47%), Gaps = 34/390 (8%)
Query: 76 PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGM 132
P V E Y+ Y +++ L S+AL VL+TQ++ A G+ R P+ A ++W+LKDG+
Sbjct: 132 PKQVTENYMDYVKWKLLHRVFSSALQVLATQAMFRAVGIGYSRSLPS-AAALNWVLKDGL 190
Query: 133 QHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
+ + I ++L + D+ KR R L+ LG GLE+L+P P+ FL +A + N AK
Sbjct: 191 GRLSRCIYTASLASAFDTNLKRVRFSTSILFSLGIGLELLTPAFPKQFLLLATIANIAKQ 250
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ--- 248
I++ AT I+ SFA NL ++ AK + + F+ +GL L ++ +
Sbjct: 251 ISLACHLATGSAIHRSFAVADNLGEVSAKAQIQTVCFDNLGLMLAALLNMLFKNNQRLLA 310
Query: 249 -GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+V P+ S I ++ + + ++ + TL R MI+ ++ GR+ +PA++ +E
Sbjct: 311 GLPFVVYPIFSAIDLFGIYQGLKHVHLQTLTKDRLEMILDSWIALGRVPTPAEVSKKEGF 370
Query: 308 LFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMV------- 359
F ++ +++G + + ++ L +E+ + +C+ + + +
Sbjct: 371 EFLRTKGKELWPIRIGCLNAKDQISKLSVIAMQSLSSEDYYFICVENSYRGLTRTKQQGI 430
Query: 360 ---LEQNASGEDALRGWLVAAYAA-----SMAKSFHDPSLTV-------LQDAYDKMNDV 404
L + A D + G L A Y S ++ DP TV ++D+
Sbjct: 431 LLSLREGAGTADVIMGLLQACYIRKALLFSRWETISDPGSTVHKECIKLIEDSKQSAEKD 490
Query: 405 FTPLLSELQAKGWHTDRFLDGT--GTRFAW 432
+ L ++ A GW L T TR+++
Sbjct: 491 LSMLGDQMTALGWAVKNILLSTEEQTRYSF 520
>gi|307205774|gb|EFN84004.1| UPF0420 protein C16orf58 [Harpegnathos saltator]
Length = 466
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 87/337 (25%), Positives = 160/337 (47%), Gaps = 17/337 (5%)
Query: 54 DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
+S+ +Y R V F P GYP SV YL Y + +Q F S + L+T S++ G
Sbjct: 35 ESQDIYFRFVSFVKEVFLPQGYPDSVRADYLAYQIWDTVQAFASTIMGTLTTHSVMQGIG 94
Query: 114 LRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV 170
+ + A A ++WILK G +G ++ + G ++D + K+WR+ AD L D+ GLE+
Sbjct: 95 VGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTQLDGQCKKWRLFADILNDIAMGLEL 154
Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
+ P + + K + VA ATR + A NL+D+ AK + T N+
Sbjct: 155 MVPYLSPYSVPILCTSTIMKSVVGVAGGATRAALTQHQAIRNNLADVSAKDSSQETCVNL 214
Query: 231 VGLGAGIQLASTVCSSMQGKMIVG--PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
+ GI + S G+ ++ L +H+Y+ ++A ++TLN R A+I+ +
Sbjct: 215 IASFVGIFVLSLF---HDGRYLIELYLFLVAVHLYANYSAVKALCLDTLNEDRLALIIKN 271
Query: 289 FVKTGRISSPADLRYREDLLFPGRLIEDAG--NVKVGRD-----PHKVVKPSKFHELKEL 341
++ RIS P + E + G ++ N+K+G V+ + L +
Sbjct: 272 YMMNERISEPQKINKEESVFLLGNPSKEICGFNIKIGASFASLLKRNVISSMEMEFLLKF 331
Query: 342 FAEEKFILCLG--SRWTDMVLEQNASGEDALRGWLVA 376
F + ++++ + +R ++L+++ + L G+ A
Sbjct: 332 FEDRRYLITIDIETRTIFIILKKDIKPIEVLEGYFYA 368
>gi|58392188|ref|XP_319167.2| AGAP010020-PA [Anopheles gambiae str. PEST]
gi|55236239|gb|EAA13864.2| AGAP010020-PA [Anopheles gambiae str. PEST]
Length = 417
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 97/380 (25%), Positives = 181/380 (47%), Gaps = 26/380 (6%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
F + F P GYP SV+ YL Y ++ LQ F S L+T ++L G+ A +
Sbjct: 49 FQHLFLPVGYPDSVSSDYLSYQKWDTLQAFCSTISGTLTTHAILKGVGVGSDAANPLSAT 108
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLS-PLCPQLFLE 181
++W+LKDG H+G+++ + G+ +D + K+WRI AD L D+ +++ P P+
Sbjct: 109 ITWVLKDGTGHLGRILFAWWKGSELDIDSKKWRIRADILNDVAMAIDLFVLPYYPKASTY 168
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+ K I VA ATR + A GNL+D+ +K A T N++ G+ L +
Sbjct: 169 ILCATTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLFLLT 228
Query: 242 TVCSSMQGKMIVG--PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
+ K++ G +++IH+Y+ I+ ++A + T N R + + ++ K+G + SP
Sbjct: 229 YL---QNQKVLYGLFAFVTLIHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGMMLSPQ 285
Query: 300 DLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSK-FHELKELFA----EEKFILCLGSR 354
+ E + G+ + VK+G ++ + + ++L+ L A +KF++
Sbjct: 286 QVNKLERVTV-GQTVTLTARVKIGCSVRELTEFYRNCYDLENLIACFDSRDKFLIAETRH 344
Query: 355 WTDMVLEQNASGEDALRGWL-VAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQ 413
+ + L D ++ + VA+Y ++ L+D Y ++ + + L+ Q
Sbjct: 345 YVGVYLHFTVKPLDIIKSYFYVASYLQDKSQ---------LRDRYWEIQNKWNEFLNMAQ 395
Query: 414 AKGWHTDRFLDGTGT-RFAW 432
+GW+ L T R W
Sbjct: 396 CEGWNVHAHLLKTDEYRLDW 415
>gi|194758785|ref|XP_001961639.1| GF15067 [Drosophila ananassae]
gi|190615336|gb|EDV30860.1| GF15067 [Drosophila ananassae]
Length = 417
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 164/357 (45%), Gaps = 19/357 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F P GYP SV++ Y Y + Q F S L T ++L G A + V+WI
Sbjct: 53 FLPKGYPDSVSDDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAYSATVTWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
LK+G HVG++ + G+++D + K+WR+ AD L DL G+E+ + P P L ++
Sbjct: 113 LKEGSGHVGRIFFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQILCG 172
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K I VA ATR + A GNL+D+ +K + T N+V G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
I ++S +H+YS ++ +RA + T N R + + +F ++ R+ SP + E
Sbjct: 233 QAVLYTIFYVVVS-LHLYSNLKAVRAVCLRTFNESRYLIALEEFFRSSRMLSPQQVNAME 291
Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDMVL 360
+ G+ + + N+K+G ++ K + + E FI+ G + + L
Sbjct: 292 RVTI-GQTVSVSLNIKLGLSVKNLIDEYKSSSVIDNIVSSFDPHEHFIIAEGKKCLGVYL 350
Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGW 417
+ D L+ + A D S ++D Y + + ++ Q +GW
Sbjct: 351 HFDTRPLDVLKAYFFA------VSYLQDRSQ--IKDNYWDIQSKWQEFMALAQKEGW 399
>gi|402908238|ref|XP_003916859.1| PREDICTED: UPF0420 protein C16orf58 homolog [Papio anubis]
Length = 468
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 177/376 (47%), Gaps = 29/376 (7%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +R+ + TLN R +++ +++ G + P R +
Sbjct: 257 PGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RME 315
Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGS--RWTDMVLEQ 362
L+ G A ++ +G H++V S EL++L +E ++LC +VL Q
Sbjct: 316 PLWTG--FWPAPSLSLGVPLHRLV--SSVFELQQLVEGHQEPYLLCWNQSRNQVQVVLNQ 371
Query: 363 NASGEDALR----GWLVAAYAA-------------SMAKSFHDPSLTVLQDAYDKMNDVF 405
NA + LR G ++ A + S ++++ + ++ +F
Sbjct: 372 NAGPKTVLRAATHGLMLGALQGDGPLPAELEELRNQVRAGPKKESWVIVKETHQVLDMLF 431
Query: 406 TPLLSELQAKGWHTDR 421
L LQ GW T++
Sbjct: 432 PKFLKGLQDAGWKTEK 447
>gi|357505287|ref|XP_003622932.1| hypothetical protein MTR_7g058500 [Medicago truncatula]
gi|355497947|gb|AES79150.1| hypothetical protein MTR_7g058500 [Medicago truncatula]
Length = 590
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 86/335 (25%), Positives = 165/335 (49%), Gaps = 20/335 (5%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA- 119
+V S + FF P VNE Y+ + +++ L S+AL VL+TQ++ A G+ + +
Sbjct: 105 KVFGSVVRDFF---IPREVNENYVDFVKWKLLHRVFSSALQVLATQAMFTAIGVGYSSSL 161
Query: 120 -QATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A ++W+LKDG+ + K I ++L + D+ KR R L+ GLE+L+P P+
Sbjct: 162 PSAAALNWVLKDGLGRLSKCIYTASLASAFDTNLKRVRFTTSVLFVGSIGLELLTPTFPR 221
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
FL +A + N AK I++ ATR ++ SFAK NL ++ AK + + F+++GL
Sbjct: 222 FFLLLATIANIAKQISLACYLATRSAVHQSFAKADNLGEISAKAQIQTVCFDILGLMLAA 281
Query: 238 QLASTVCSSMQGK----MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
+ + + + + V PL + + ++ + + ++ + TL R +I+ +++ G
Sbjct: 282 LVNMWIGNHQRPQAGLHFFVYPLFAFMDLFGIYQGLKHVHLKTLTKDRLEIILKTWIEFG 341
Query: 294 RISSPADLRYREDLLFPGRLIEDAGNVKVGR-DPHKVVKPSKFHELKELFAEEKFILCL- 351
+ SPAD+ +E + F G + V++G P + ++ + E+ + +C+
Sbjct: 342 YVPSPADVSEKEVINFMGVKGKSLWPVRIGCISPKDQIPEWSMKTIQSITDEDYYFVCME 401
Query: 352 --------GSRWTDMVLEQNASGEDALRGWLVAAY 378
G + + + A + + G L A Y
Sbjct: 402 IFEGLKRTGQLRILLSIREGAESKHIIMGLLQACY 436
>gi|114662190|ref|XP_001159530.1| PREDICTED: UPF0420 protein C16orf58 homolog [Pan troglodytes]
gi|397472000|ref|XP_003807550.1| PREDICTED: UPF0420 protein C16orf58 homolog [Pan paniscus]
gi|410212432|gb|JAA03435.1| chromosome 16 open reading frame 58 [Pan troglodytes]
gi|410264128|gb|JAA20030.1| chromosome 16 open reading frame 58 [Pan troglodytes]
gi|410292910|gb|JAA25055.1| chromosome 16 open reading frame 58 [Pan troglodytes]
gi|410340381|gb|JAA39137.1| chromosome 16 open reading frame 58 [Pan troglodytes]
Length = 468
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 104/411 (25%), Positives = 190/411 (46%), Gaps = 31/411 (7%)
Query: 35 VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
++R + +P+G + +V P + F P G+P SV+ YL Y + ++Q
Sbjct: 44 LSRAFTVKPEGRDAGEVGAPGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
F S+ L+TQ++L G+ A A +W++KD +G+++ + G+++D
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161
Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
K+WR+ AD L D+ LE+++P+ P F N AK I VA ATR + A+
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221
Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
N++D+ AK + TL N+ GL + + V + L+ +H+Y+ +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281
Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
+ TLN R +++ +++ G + P R + L+ G A ++ +G H++V
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RMEPLWTG--FWPAPSLSLGVPLHRLV- 337
Query: 332 PSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR----GWLVAAYAAS-- 381
S EL++L +E ++LC +VL Q A + LR G ++ A
Sbjct: 338 -SSVFELQQLVEGHQESYLLCWDQSRNQVQVVLNQKAGPKTILRAATHGLMLGALQGDGP 396
Query: 382 -----------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
+ S V+++ ++ ++ +F L LQ GW T++
Sbjct: 397 LPAELEELRNRVRAGPKKESWVVVKETHEVLDMLFPKFLKGLQDAGWKTEK 447
>gi|426381987|ref|XP_004057610.1| PREDICTED: UPF0420 protein C16orf58 homolog [Gorilla gorilla
gorilla]
Length = 468
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 104/411 (25%), Positives = 190/411 (46%), Gaps = 31/411 (7%)
Query: 35 VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
++R + +P+G + +V P + F P G+P SV+ YL Y + ++Q
Sbjct: 44 LSRAFTVKPEGRDAGEVGAPGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
F S+ L+TQ++L G+ A A +W++KD +G+++ + G+++D
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161
Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
K+WR+ AD L D+ LE+++P+ P F N AK I VA ATR + A+
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221
Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
N++D+ AK + TL N+ GL + + V + L+ +H+Y+ +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLAGLLVSLLILPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281
Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
+ TLN R +++ +++ G + P R + L+ G A ++ +G H++V
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RMEPLWTG--FWPAPSLSLGVPLHRLV- 337
Query: 332 PSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR----GWLVAAYAAS-- 381
S EL++L +E ++LC +VL Q A + LR G ++ A
Sbjct: 338 -SSVFELQQLVEGHQESYLLCWDQSRNQVQVVLNQKAGPKTILRAATHGLMLGALQGDGP 396
Query: 382 -----------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
+ S V+++ ++ ++ +F L LQ GW T++
Sbjct: 397 LPAELEELRNRVRAGPKKESWVVVKETHEVLDMLFPKFLKGLQDAGWKTEK 447
>gi|8920582|gb|AAF81304.1|AC027656_21 Contains similarity to CG10338 gene product from Drosophila
melanogaster gb|AE003660 [Arabidopsis thaliana]
Length = 438
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 94/372 (25%), Positives = 179/372 (48%), Gaps = 24/372 (6%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F G+P SV Y+ + + LQ ++ +LSTQ+LL A G+ A W
Sbjct: 55 FSSPGFPGSVTPDYVGFQLWDTLQGLSTYTKMMLSTQALLSAIGVGEKSATVIGATFQWF 114
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L+D +G ++ + G+ +DS K WR++AD + D+G +++LSPL P F+ + LG
Sbjct: 115 LRDFTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLFPSAFIVVVCLG 174
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
+ ++ VA+ ATR + FA + N +D+ AK + T+ ++G+ G+ LA +
Sbjct: 175 SLSRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETMATMMGMSLGMLLARFTSGN 234
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+V H+Y+ +R +N+LN +R+++++ F++TG++ SP + E
Sbjct: 235 PMAIWLSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTHFIQTGQVLSPEQVSSMEG 294
Query: 307 LL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKEL----------FAEEKFILCLGSRW 355
+L + + + + V+ S L L + K++L
Sbjct: 295 VLPLWATSLRSTNSKPLHKRVQLGVRVSSLPRLDMLQLLNGVGASSYKNAKYLLAHIKGN 354
Query: 356 TDMVLEQNASGEDALRGWLVAAYAASM---AKSFHDPSLTVLQDAYDKMNDVFTPLLSEL 412
++L +++ D L+ ++ A A++ + SF+ + YD+ LL +L
Sbjct: 355 VSVILHKDSKPADVLKSYIHAIVLANLMEKSTSFYSEGEAWIDKHYDE-------LLHKL 407
Query: 413 QAKGWHTDRFLD 424
++ GW T+R L
Sbjct: 408 RSGGWKTERLLS 419
>gi|348675633|gb|EGZ15451.1| hypothetical protein PHYSODRAFT_546234 [Phytophthora sojae]
Length = 408
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 80/223 (35%), Positives = 118/223 (52%), Gaps = 8/223 (3%)
Query: 64 DSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR----PTPA 119
D + F P S+ + Y Y ++ + S A SVLS QSLL+A GL PT
Sbjct: 109 DVLWDLFLPKDARASLTKDYFPYAKWSFVGSVASCAASVLSMQSLLYAIGLGAGAIPT-- 166
Query: 120 QATVVSWILKDGM-QHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
A V+W+LKDG+ Q G L S + R D++PKRWR+ + D+ E+L+PL P
Sbjct: 167 -AAAVNWVLKDGLGQFGGVLFASIVNNRYDADPKRWRVASSVALDVSVLGEILTPLAPGS 225
Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
FL +A L N AK +A ++A ATR ++SFA NL+D+ AK + + ++ G G GI
Sbjct: 226 FLAIASLANVAKNVAWLSASATRAGFHNSFAIRENLADVTAKAGSQAIASSIFGTGLGIL 285
Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
++ SS +LS +H+ S + + + TLN QR
Sbjct: 286 ISQFTGSSTLNVAAAFAVLSTVHLISTYKSVDCVQLRTLNCQR 328
>gi|149725803|ref|XP_001495510.1| PREDICTED: UPF0420 protein C16orf58 homolog [Equus caballus]
Length = 468
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 181/388 (46%), Gaps = 45/388 (11%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
L F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A
Sbjct: 73 LLALFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGLGVGDAKASVSAAT 132
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+W++KD +G++I + G+++D K WR+ AD L D+ LE+++P+ P F +
Sbjct: 133 ATWLVKDSTGMLGRIIFAWWKGSKLDCNAKHWRLFADILNDVAMFLEIMAPVYPICFTMI 192
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ N AK I VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 193 VCISNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPL 252
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
V + L + +H+Y+ +RA + TLN R +++ F++ G + P
Sbjct: 253 VSGCPSFSLGCFFLFTALHIYANYRAVRALVIETLNEGRLWLVLKHFLQRGEVLGPTSAN 312
Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSRWT---- 356
R + L+ G + ++ +G H+++ S EL++L +E ++L RW
Sbjct: 313 -RMEPLWTG--FWPSVSLSLGVPLHRLI--SSVSELQQLVEGHQEPYLL----RWNQSQN 363
Query: 357 --DMVLEQNASGEDALR----GWLVAAY-----------------AASMAKSFHDPSLTV 393
+VL Q A E LR G ++ A A +AK S +
Sbjct: 364 QVQVVLSQMAGPETVLRAATHGLVLGALRRDGPLPRELEELRNRVQAGLAKE----SWVI 419
Query: 394 LQDAYDKMNDVFTPLLSELQAKGWHTDR 421
+++ + ++ +F L LQ GW T++
Sbjct: 420 VRETHHVLDKLFPKFLKGLQDAGWKTEK 447
>gi|42567566|ref|NP_195771.2| uncharacterized protein [Arabidopsis thaliana]
gi|209863158|gb|ACI88737.1| At5g01510 [Arabidopsis thaliana]
gi|332002971|gb|AED90354.1| uncharacterized protein [Arabidopsis thaliana]
Length = 509
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 106/373 (28%), Positives = 174/373 (46%), Gaps = 29/373 (7%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---------RPTPAQA 121
FPSG+P SV++ YL Y ++ + T +VL T SLL A G+ A A
Sbjct: 120 FPSGFPGSVSDDYLDYMLWQFPTNITGWICNVLVTSSLLKAVGVGSFSGTSAAATAAASA 179
Query: 122 TVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
+ W+ KDG+ +G+L I G+ D +PK+WR+ AD + G+ ++ + L P FL
Sbjct: 180 AAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFL 239
Query: 181 EMAGLGNFAKGIAVVAARATRLP----IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
+A GN AK + AR R P I + FA GNL ++ AK E ++GLG G
Sbjct: 240 LLASTGNLAKAV----ARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFG 295
Query: 237 IQLAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
I + T + S ++ + ++H++ + + NT+N +R +IV V
Sbjct: 296 ILIIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHSV 355
Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKP--SKFHELKELFAEEKFILCLG 352
+ D RE++L R ++ V + ++ SK L +++ +EK+IL L
Sbjct: 356 VPGYVDCNKRENILLWQRFMKPRIIFGVSLEELSGLEKSVSKVKALLKMYTKEKYILTLN 415
Query: 353 SRWTD----MVLEQNASGEDALRG-WLVAAYAASMAKSFHDPSLTV--LQDAYDKMNDVF 405
D + + NA+ D LR W +M +SF D L+ + +M++ F
Sbjct: 416 KLNKDTEFSVSFKVNATSRDVLRCLWQAYWLEENMEESFKDKDSVFHWLKQSLSEMDNKF 475
Query: 406 TPLLSELQAKGWH 418
L +L GW+
Sbjct: 476 DDFLFKLDTAGWN 488
>gi|449489204|ref|XP_004158245.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 435
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 180/378 (47%), Gaps = 27/378 (7%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
F + F P G+P SV Y+ + + +LQ ++ ++LSTQ+LL A G+ A
Sbjct: 51 FFDAFVPEGFPSSVTPDYVPFQIWDSLQGLSTYIRTMLSTQALLSAIGVGEQSATVIGAT 110
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLE-VLSPLCPQLFLE 181
W L+D +G ++ + G+ +DS K WR++AD + DLG ++ +SPL P F+
Sbjct: 111 FQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMYVDGPVSPLFPSAFML 170
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+ LG+ ++ VA+ ATR + FA + N +D+ AK + T+ ++G+ G+ LA
Sbjct: 171 IVCLGSLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMIGMAIGMLLAR 230
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
+ L++ H+Y+ + + + +LNPQR ++ + +++T + SP +
Sbjct: 231 ITMGCPLAIWLSFLSLTMFHMYANYKAVCCLSLTSLNPQRCSIALQHYIQTAEVLSPQQV 290
Query: 302 RYREDLLFPGRLIEDAGNVKVGRDPHKV-VKPSKFH--ELKELF---------AEEKFIL 349
E +L ++K+ ++ + S H E+KEL +EK++L
Sbjct: 291 SMMEHVLPAWASSWSLRSIKLIHTRVRLGARISSLHHTEIKELLHLAGANNNDKQEKYLL 350
Query: 350 CLGSRWTDMVLEQNASGEDALRGW---LVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFT 406
++++++A+ D + + LV AY + ++ M+ +
Sbjct: 351 VERKETISVIVQKDATANDIFQSFVHALVMAYVPDQESRYF--------ESMSWMDKHYE 402
Query: 407 PLLSELQAKGWHTDRFLD 424
+ +L+ GW TDR L
Sbjct: 403 CFIEKLKISGWKTDRLLS 420
>gi|195387984|ref|XP_002052672.1| GJ17681 [Drosophila virilis]
gi|194149129|gb|EDW64827.1| GJ17681 [Drosophila virilis]
Length = 417
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 173/365 (47%), Gaps = 19/365 (5%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVS 125
N F P GYP SV++ Y Y + +Q F S L T ++L G A + V+
Sbjct: 51 NIFLPKGYPDSVSDDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENVNAYSATVT 110
Query: 126 WILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMA 183
WILK+G H+G+++ + G+++D + K+WR+ AD L DL G+E+ + P P L ++
Sbjct: 111 WILKEGSGHLGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQIL 170
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
K I VA ATR + A GNL+D+ +K + T N++ G+ L T+
Sbjct: 171 CGSTMLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLIASFVGLYLL-TM 229
Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRY 303
S V ++ +H+Y+ ++ +RA + T N R + + +F ++ + +P +
Sbjct: 230 IKSQAMLYTVFYIVVTLHLYANLKAVRAVCLRTFNESRYLIALEEFFRSSHMLTPQQVNK 289
Query: 304 REDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDM 358
E + G+++ + N+++G ++ K + E E+FI+ ++ +
Sbjct: 290 LERVTV-GQMVSVSLNIRLGLSVKHLIDEYKSSNVIENIVSSFDPHERFIIAESKKYLGV 348
Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWH 418
L + +D L+ + +A S + + +++ Y + + L+ Q +GW
Sbjct: 349 YLHFDTRPQDVLKAYF---FAVSYLQDRNQ-----IKEQYWDIQGKWQEFLNLAQKEGWL 400
Query: 419 TDRFL 423
++ L
Sbjct: 401 INQHL 405
>gi|431902902|gb|ELK09111.1| hypothetical protein PAL_GLEAN10000828 [Pteropus alecto]
Length = 468
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 101/387 (26%), Positives = 182/387 (47%), Gaps = 43/387 (11%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
L F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A
Sbjct: 73 LLTVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNANASVSAAT 132
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+W++KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F +
Sbjct: 133 ATWLVKDSAGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPICFTMI 192
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ N AK I VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 193 VCISNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPL 252
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
V + LL+ +H+Y+ +RA + TLN R +++ F+++G + P
Sbjct: 253 VSECPSFSLGCFFLLTALHIYANYRAVRALVIETLNEGRLWLVLKHFLQSGEVLDPTSAN 312
Query: 303 YREDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSRW-- 355
E L +P + ++ +G H+++ S+ EL++L +E ++L W
Sbjct: 313 KMEPLWTGFWP------SPSLFLGVPLHRLI--SRVFELQQLIEGHQEPYLL----HWDQ 360
Query: 356 ----TDMVLEQNASGEDALR----GWLVAAYA------ASMAK-------SFHDPSLTVL 394
+VL Q A E LR G ++ A A + K S ++
Sbjct: 361 SQNQVQVVLSQMAGTETILRAATHGLVLGALQEDGPLPAELEKLRNRVRAGPEKESWVIV 420
Query: 395 QDAYDKMNDVFTPLLSELQAKGWHTDR 421
++ + ++ +F L LQ GW T++
Sbjct: 421 KETHQVLDKLFPKFLKGLQDVGWKTEK 447
>gi|380030107|ref|XP_003698699.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
[Apis florea]
Length = 464
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 102/370 (27%), Positives = 180/370 (48%), Gaps = 21/370 (5%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
++ Y ++ F P G+P SV+ Y Y + +Q F S + L+T S++ G+
Sbjct: 38 TKSFYSGLISIIKEVFLPQGFPDSVHPDYTSYQVWDTVQAFASTIMGTLTTHSIMQGVGV 97
Query: 115 --RPTPAQATVVSWILKDGMQHVGKLICS--NLGARMDSEPKRWRILADALYDLGTGLEV 170
A ++WILKDG VG+++ + N G +D + K+WR+ AD L DL GLE+
Sbjct: 98 GEAAATPLAAAITWILKDGTGMVGRIMFAWWN-GTDLDGQCKKWRLFADILNDLAMGLEL 156
Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
L P L + + K I VA ATR + + A + NL+D+ AK + T N+
Sbjct: 157 LLPYFSSYSLGILCISTAMKSIVGVAGGATRAALTNIKALQNNLADVSAKDGSQETCVNL 216
Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+ GI + S + + Q M + L +IH+Y+ ++A +N+LN R A+IV ++
Sbjct: 217 IASFVGILILS-IFHNGQYIMELYLFLVIIHLYANYSAVKALCLNSLNEDRLALIVKSYI 275
Query: 291 KTGRISSPADLRYREDLLFPGR-LIEDAG-NVKVGRDPHKVVKPSKF----HELK-ELFA 343
I P ++ +E +L + + G N+K+G ++K EL +LF
Sbjct: 276 SNEVIPKPEEVNKKESVLLLTKPTMNICGFNIKIGVSLAALIKKDLILTSDTELSLKLFL 335
Query: 344 EEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDK- 400
+ K+++ + ++ + L+++A D L + + A + F SL V D + K
Sbjct: 336 DRKYLISIDVQNKTIFICLKKDAQPYDVLEAY----FHACLCGLFICMSLKVPLDVFLKP 391
Query: 401 -MNDVFTPLL 409
++D+ PL+
Sbjct: 392 EVSDLSYPLM 401
>gi|332262940|ref|XP_003280516.1| PREDICTED: UPF0420 protein C16orf58 homolog [Nomascus leucogenys]
Length = 468
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 177/376 (47%), Gaps = 29/376 (7%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +RA + TLN R +++ +++ G + P R +
Sbjct: 257 PGFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RME 315
Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQ 362
L+ G A ++ +G H++V S EL++L +E ++LC +VL Q
Sbjct: 316 PLWTG--FWPAPSLSLGVPLHRLV--SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQ 371
Query: 363 NASGEDALR----GWLVAAYAAS-------------MAKSFHDPSLTVLQDAYDKMNDVF 405
A ++ LR G ++ A + S ++++ + ++ +F
Sbjct: 372 KAGPKNILRAATHGLMLGALQGDGPLPAELEELRNRVQAGPKKESWVIVKETHQVLDMLF 431
Query: 406 TPLLSELQAKGWHTDR 421
L LQ GW T++
Sbjct: 432 PKFLKGLQDAGWKTEK 447
>gi|255542032|ref|XP_002512080.1| conserved hypothetical protein [Ricinus communis]
gi|223549260|gb|EEF50749.1| conserved hypothetical protein [Ricinus communis]
Length = 380
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 179/365 (49%), Gaps = 20/365 (5%)
Query: 74 GYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDG 131
G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+ A W L+D
Sbjct: 3 GFPSSVTPDYVPFQVWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGATFQWFLRDL 62
Query: 132 MQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAK 190
+G ++ + G+ +DS K WR++AD + DLG ++++SPL P F+ + LG+ ++
Sbjct: 63 TGMLGGILFTFYQGSDLDSNAKMWRLVADLMNDLGMLMDLVSPLFPSAFVVVVCLGSLSR 122
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
VA+ ATR + FA + N +D+ AK + T+ + G+ G+ LA
Sbjct: 123 SFTGVASGATRAALTQHFALQSNAADISAKEGSQETVATMTGMALGMLLARITIGHPLAI 182
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
L++ H+Y+ +R + +LN QR+++++ F++TG++ SP + E +L
Sbjct: 183 WFSFLSLTMFHMYANYRAVRCLVLTSLNMQRSSILLQHFMETGQVLSPEQVSGMEHILPR 242
Query: 311 GRLIEDAGNVKVGRDPHKV---VKPSKFHELKELF------AEEKFILCLGSRWTDMVLE 361
+VK+ + V EL EL + K++L ++++
Sbjct: 243 WTNFWRTKHVKLLHSRMCLGVRVSSLDHQELMELLHSAGSHKKAKYLLMERKGTINILIH 302
Query: 362 QNASGEDALRGWLVAAYAASMAKSFHDPSLTVL--QDAYDKMNDVFTPLLSELQAKGWHT 419
+N++ D L+ ++ +A MAK + + T L Q DK ++F L +L + GW T
Sbjct: 303 KNSTTSDVLQSFI---HALVMAKLMEENNSTYLESQSWMDKHYELF---LGKLSSSGWKT 356
Query: 420 DRFLD 424
+R L
Sbjct: 357 ERLLS 361
>gi|351711359|gb|EHB14278.1| hypothetical protein GW7_05117 [Heterocephalus glaber]
Length = 469
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 175/376 (46%), Gaps = 29/376 (7%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
F P G+P SV+ YL Y Q+ ++Q F S+ S L+T S++ G+ A +W+
Sbjct: 78 FLPQGFPESVSPDYLPYQQWDSVQAFASSLCSALATHSVMLGLGVGNAEASLAAATATWL 137
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
++D +G++I + G+++D K+WR+ AD L D LE+++P+ F + +
Sbjct: 138 VRDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDAAMLLELVAPVLSVSFTMIVSIS 197
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + + +
Sbjct: 198 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLVSLIMLPLMSNC 257
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ + LL+ +H+Y+ +R + TLN R +++ F+++G + E
Sbjct: 258 PRLSLGCFFLLTALHIYANYRAVRTLVMETLNESRLRLVLKHFLQSGEVLDTTSANQMEP 317
Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLG--SRWTDMVLEQ 362
L+ G I + ++ +G H VV S EL++L +E ++LC +VL Q
Sbjct: 318 -LWTG--IWPSPSLSLGVPLHCVV--SSVSELQQLVEGHQEPYLLCWNQSENQVQVVLSQ 372
Query: 363 NASGEDALRGWLVAAYAASMAKSFHDP-----------------SLTVLQDAYDKMNDVF 405
A E LR ++ K P ++++ ++ +F
Sbjct: 373 MAGPEAVLRAATHGLLLGALKKDGPLPRELEELRNQVQAGPEKDDWAIVRETNQVLDKLF 432
Query: 406 TPLLSELQAKGWHTDR 421
L LQA GW T++
Sbjct: 433 PRFLEGLQAAGWQTEK 448
>gi|357120354|ref|XP_003561892.1| PREDICTED: UPF0420 protein C16orf58-like [Brachypodium distachyon]
Length = 452
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 100/389 (25%), Positives = 186/389 (47%), Gaps = 38/389 (9%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ R+ L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 60 WGRIGGRVLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLSAIGVGEKS 119
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + DLG +++LSPL
Sbjct: 120 ATVIGATFQWFLRDLTGMLGGILFAFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 179
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
P + + LG+ ++ VA+ ATR + FA N +D+ AK + TL + G+G
Sbjct: 180 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALAKNAADISAKEGSQETLATMSGMGL 239
Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
G+ LA + L++ H+Y+ + +++ + TLN +RT++++ F++ G +
Sbjct: 240 GMLLAHVTRGHDLVVWVSFLSLTIFHMYANYKAVQSLSLVTLNYERTSILLQYFLEDGEV 299
Query: 296 SSPADLRYREDLLFP-----GRLIEDAGNVKVGRDPHKVV----KPSKF--------HEL 338
+P + +E +L P RL+ + PH++V K S +
Sbjct: 300 RTPQQVSKQEHIL-PFWSSWRRLLRT-------KLPHELVHLGAKASMLTHSDMLLISKT 351
Query: 339 KELFAEEKFILCLGSRWTDMVLEQNASGEDALRGW---LVAAYAASMAKSFHDPSLTVLQ 395
+ + + + L + + + A+ D L+ + LV A +KS H
Sbjct: 352 RSYYTDANYFLLDKEGSVHIFIHKQAAATDVLKSFVHGLVLARLTRKSKSCH-------A 404
Query: 396 DAYDKMNDVFTPLLSELQAKGWHTDRFLD 424
+A+ M+D + +S+L+ +G+ T+R L
Sbjct: 405 EAHQWMDDKYNIFISKLKMEGYSTERLLS 433
>gi|147768659|emb|CAN60615.1| hypothetical protein VITISV_003254 [Vitis vinifera]
Length = 470
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 99/329 (30%), Positives = 161/329 (48%), Gaps = 52/329 (15%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQ 120
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +TQ+LL + G+ R P
Sbjct: 113 SFIRSYVVPEGFPDSVTPSYVPYMSWRALKHFFGGAMGVFTTQTLLNSVGVSRNRSAPG- 171
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
A ++WILKDG VGK++ + G + D + K+ R+ D L +LG G+E+ + P LFL
Sbjct: 172 AVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRLAGDLLMELGAGVELATAAVPHLFL 231
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+A N AK +A V + +TR PIY +FAK N+ D+ AKGE + + ++V
Sbjct: 232 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLV--------- 282
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
K +V ++TLN R ++ + F+KTGR+ S +
Sbjct: 283 ---------KSVV--------------------LHTLNRARYSVAIDCFLKTGRVPSLQE 313
Query: 301 LRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM- 358
+E++ FP +ED V R + ++ LF +E++++ +
Sbjct: 314 GNMKENIFSFP--WLEDKPIVLGSRFKDAFQDAGSYLSIEPLFEKERYLVTYNPSKGKVY 371
Query: 359 -VLEQNASGEDALRGWLVAAYAASMAKSF 386
+L+ A +D L+ AA+ A M F
Sbjct: 372 ALLKDQAKSDDILK----AAFHAHMLLHF 396
>gi|195437813|ref|XP_002066834.1| GK24688 [Drosophila willistoni]
gi|194162919|gb|EDW77820.1| GK24688 [Drosophila willistoni]
Length = 417
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 90/363 (24%), Positives = 172/363 (47%), Gaps = 19/363 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
F P GYP SV++ Y Y + +Q F S L T ++L + G+ A + V+WI
Sbjct: 53 FLPKGYPDSVSDDYAAYQIYDTIQAFCSTICGTLCTHAILKSIGVGSENINAYSATVTWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
LK+G HVG+++ + G+++D + K+WR+ AD L DL G+E+ + P P L ++
Sbjct: 113 LKEGSGHVGRILFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQILCG 172
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K + VA ATR + A GNL+D+ +K + T N++ G+ + T+
Sbjct: 173 STMLKAVVSVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLLASCVGLYVL-TLIK 231
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
S V L+ +H+Y+ ++ +RA + T N R + + +F ++ R+ +P + E
Sbjct: 232 SPAVLYTVFYLVVTLHLYANLKAVRAVCLRTFNESRYLIALEEFFRSSRMLTPQQVNGME 291
Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHE-----LKELFAEEKFILCLGSRWTDMVL 360
+ G+ + + N+++G +V K + E+FI+ ++ + L
Sbjct: 292 RVTL-GQTVSVSLNIRLGLSVKNLVDEYKSSSDIENIVSSFDTHERFIIAESKKYLGVYL 350
Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
+ +D L+ + +A S + + L++ Y + + ++ Q +GW+
Sbjct: 351 HFDTRPQDVLKAYF---FAVSYLQDRNQ-----LKEKYWDIQAKWQEFFTQAQKEGWNIH 402
Query: 421 RFL 423
L
Sbjct: 403 THL 405
>gi|340711243|ref|XP_003394188.1| PREDICTED: UPF0420 protein C16orf58 homolog [Bombus terrestris]
Length = 464
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 98/354 (27%), Positives = 172/354 (48%), Gaps = 19/354 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P GYP SV+ Y Y + +Q F S + L+T S++ G+ A A ++WI
Sbjct: 53 FLPQGYPDSVHPDYTSYQIWDTVQAFASTIMGTLTTHSIMQGVGVGEASATPLAAAITWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
LKDG +G+++ + G +D + K+WR+ AD L DL GLE+ P L + +
Sbjct: 113 LKDGTGMIGRIVFAWWNGTDLDGQCKKWRLFADILNDLAMGLELFLPYFSSYSLGILCIS 172
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
K I VA ATR + A + NL+D+ AK + T N++ GI + S + +
Sbjct: 173 TAMKSIVGVAGGATRAALTQHQALQNNLADVSAKDGSQETCVNLIASFVGILILS-IFHN 231
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ M + L +IH+Y+ ++A +N+LN R A+IV ++ I P ++ +E
Sbjct: 232 GRYIMELYLFLVMIHLYANYMAVKALCLNSLNEDRLALIVKSYISNEVIPKPEEVNKKES 291
Query: 307 LLFPGR-LIEDAG-NVKVGRDPHKVVK-----PSKFHELKELFAEEKFILCLG--SRWTD 357
+L + + G N+K+G ++K S +LF + K+++ + ++
Sbjct: 292 VLLLTKPTMSICGFNIKIGVSLAALIKEDIILTSDTELALKLFLDRKYLISIDVQNKTIF 351
Query: 358 MVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDK--MNDVFTPLL 409
+ L+++A D L + + A + F SL V D + K ++D+ PL+
Sbjct: 352 ICLKKDAQPYDVLEAY----FHACLCGLFICMSLKVPLDVFVKPEVSDLSYPLM 401
>gi|345801540|ref|XP_547054.3| PREDICTED: UPF0420 protein C16orf58 homolog [Canis lupus
familiaris]
Length = 468
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 98/383 (25%), Positives = 177/383 (46%), Gaps = 43/383 (11%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAEASVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F +
Sbjct: 137 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFFTMTVCIS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V +
Sbjct: 197 NLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSAC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +RA + TLN R +++ F++ G + P E
Sbjct: 257 PSFSLSCFFFLTALHIYANYRAVRALVLETLNEDRLRLVLKHFLQRGEVLDPTSANQMEP 316
Query: 307 L---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSRW------ 355
L +P ++ +G H+++ S EL++L +E ++L W
Sbjct: 317 LWTGFWPSL------SLSLGVPLHRLI--SSVFELQQLVEGHQEPYLL----HWDKSKSQ 364
Query: 356 TDMVLEQNASGEDALR----GWLVAAYAAS--MAKSFHD-----------PSLTVLQDAY 398
+VL Q A E LR G ++ A + + + S ++++ +
Sbjct: 365 VQVVLSQKAGPETILRAATHGLVLEALQGDGPLPRELEELRNRVLAGADKESWVIIKETH 424
Query: 399 DKMNDVFTPLLSELQAKGWHTDR 421
++ +F L LQ GW T++
Sbjct: 425 QVLDKLFPKFLKGLQDAGWKTEK 447
>gi|225452488|ref|XP_002278851.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
Length = 514
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/345 (26%), Positives = 168/345 (48%), Gaps = 24/345 (6%)
Query: 53 DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
DD + S + FF P V+ Y+ Y +++ L S+AL VL+TQ++ A
Sbjct: 99 DDGLNNLFALCSSAVRDFF---IPQKVSGNYMDYVKWKFLHRVFSSALQVLATQAMFRAI 155
Query: 113 GL---RPTPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGL 168
G+ R P+ A ++W+LKDG+ + + I ++L + D+ KR R L+ L G+
Sbjct: 156 GIGYSRSLPS-AAALNWVLKDGLGRLCRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGV 214
Query: 169 EVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
E+L+P P FL +A + N AK + V AT ++ SFA NL ++ AK + F
Sbjct: 215 ELLTPAFPNYFLLLATIANIAKQVTVACYLATGSAVHRSFAIADNLGEVSAKAQIQMVCF 274
Query: 229 NVVGLGAGIQLASTVCSSMQGKM-----IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
+ +GL L + +C + Q + +V P+ S I ++ + + ++ + TL R
Sbjct: 275 DNLGLMLAAVL-NILCKNNQRLLAGLPFVVYPIFSAIDLFGIYQGLKHVHLQTLTKDRLE 333
Query: 284 MIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELF 342
+I+ +++ G + SPA++ E + FP + +++G DP + ++ L
Sbjct: 334 IILNMWIQLGYVPSPAEVSKEEGISFPWSKGRELWPIRIGCLDPKGRLPKLSMLMMQSLS 393
Query: 343 AEEKFILCLGS--RWTD-------MVLEQNASGEDALRGWLVAAY 378
+E+ + +C+ + R T + L + AS D + G L A Y
Sbjct: 394 SEDFYFICMEACRRLTRPEQQGILLCLREGASTTDVISGLLQACY 438
>gi|296087683|emb|CBI34939.3| unnamed protein product [Vitis vinifera]
Length = 559
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/345 (26%), Positives = 168/345 (48%), Gaps = 24/345 (6%)
Query: 53 DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
DD + S + FF P V+ Y+ Y +++ L S+AL VL+TQ++ A
Sbjct: 144 DDGLNNLFALCSSAVRDFF---IPQKVSGNYMDYVKWKFLHRVFSSALQVLATQAMFRAI 200
Query: 113 GL---RPTPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGL 168
G+ R P+ A ++W+LKDG+ + + I ++L + D+ KR R L+ L G+
Sbjct: 201 GIGYSRSLPS-AAALNWVLKDGLGRLCRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGV 259
Query: 169 EVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
E+L+P P FL +A + N AK + V AT ++ SFA NL ++ AK + F
Sbjct: 260 ELLTPAFPNYFLLLATIANIAKQVTVACYLATGSAVHRSFAIADNLGEVSAKAQIQMVCF 319
Query: 229 NVVGLGAGIQLASTVCSSMQGKM-----IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
+ +GL L + +C + Q + +V P+ S I ++ + + ++ + TL R
Sbjct: 320 DNLGLMLAAVL-NILCKNNQRLLAGLPFVVYPIFSAIDLFGIYQGLKHVHLQTLTKDRLE 378
Query: 284 MIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELF 342
+I+ +++ G + SPA++ E + FP + +++G DP + ++ L
Sbjct: 379 IILNMWIQLGYVPSPAEVSKEEGISFPWSKGRELWPIRIGCLDPKGRLPKLSMLMMQSLS 438
Query: 343 AEEKFILCLGS--RWTD-------MVLEQNASGEDALRGWLVAAY 378
+E+ + +C+ + R T + L + AS D + G L A Y
Sbjct: 439 SEDFYFICMEACRRLTRPEQQGILLCLREGASTTDVISGLLQACY 483
>gi|326527127|dbj|BAK04505.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 97/384 (25%), Positives = 181/384 (47%), Gaps = 28/384 (7%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ R+ L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 55 WGRIGSRVLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKS 114
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + DLG +++LSPL
Sbjct: 115 ATVIGATFQWFLRDLTGMLGGILFAFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 174
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
P + + LG+ ++ VA+ ATR + FA N +D+ AK + TL + G+G
Sbjct: 175 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMSGMGL 234
Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
G+ LA + L+V H+Y+ + +++ ++TLN +RT++++ F++ G +
Sbjct: 235 GMLLAHVTRGHDLVVWVSFLSLTVFHMYANYKAVQSLSLSTLNYERTSILLQYFMEHGEV 294
Query: 296 SSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSK------------FHELKELFA 343
+P + +E +L VK+ PH++V + + +
Sbjct: 295 LTPQQVSKQEHILPFWSSWRKLLRVKL---PHELVNLGAKASMLTHSDMLLIAKTRSYYT 351
Query: 344 EEKFILCLGSRWTDMVLEQNASGEDALRGW---LVAAYAASMAKSFHDPSLTVLQDAYDK 400
+ L + + + A D L+ + LV A +KS H +A+
Sbjct: 352 NANYFLLDKDGSVCIFIHKQAVATDVLKSFIHGLVLARLVPKSKSCH-------MEAHQW 404
Query: 401 MNDVFTPLLSELQAKGWHTDRFLD 424
M++ + +S+L+ +G+ T+R L
Sbjct: 405 MDEKYNTFISKLKVEGYSTERLLS 428
>gi|194388018|dbj|BAG65393.1| unnamed protein product [Homo sapiens]
Length = 422
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 98/375 (26%), Positives = 176/375 (46%), Gaps = 29/375 (7%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWIL 128
P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W++
Sbjct: 32 LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLV 91
Query: 129 KDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F N
Sbjct: 92 KDSTDMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSN 151
Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 152 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCP 211
Query: 248 QGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+ L+ +H+Y+ +RA + TLN R +++ +++ G + P R +
Sbjct: 212 GFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RMEP 270
Query: 308 LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQN 363
L+ G A ++ +G H++V S EL++L +E ++LC +VL Q
Sbjct: 271 LWTG--FWPAPSLSLGVPLHRLV--SSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQK 326
Query: 364 ASGEDALR----GWLVAAYAAS-------------MAKSFHDPSLTVLQDAYDKMNDVFT 406
A + LR G ++ A + S V+++ ++ ++ +F
Sbjct: 327 AGPKTILRAATHGLMLGALQGDGPLPAELEELRNRVRAGPKKESWVVVKETHEVLDMLFP 386
Query: 407 PLLSELQAKGWHTDR 421
L LQ GW T++
Sbjct: 387 KFLKGLQDAGWKTEK 401
>gi|297283910|ref|XP_002802508.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2 [Macaca
mulatta]
Length = 429
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 97/376 (25%), Positives = 176/376 (46%), Gaps = 29/376 (7%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 38 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 97
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 98 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 157
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 158 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 217
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +R+ + TLN R +++ +++ G + P R +
Sbjct: 218 PGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RME 276
Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQ 362
L+ G A ++ +G H++V S EL++L +E ++LC +VL Q
Sbjct: 277 PLWTG--FWPAPSLSLGVPLHRLV--SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQ 332
Query: 363 NASGEDALR----GWLVAAYAA-------------SMAKSFHDPSLTVLQDAYDKMNDVF 405
A + LR G ++ A + S ++++ + ++ +F
Sbjct: 333 KAGPKTVLRAATHGLMLGALQGDGPLPAELEELRNQVRAGPKKESWVIVKETHQVLDMLF 392
Query: 406 TPLLSELQAKGWHTDR 421
L LQ GW T++
Sbjct: 393 PKFLKGLQDAGWKTEK 408
>gi|383859063|ref|XP_003705017.1| PREDICTED: UPF0420 protein C16orf58-like [Megachile rotundata]
Length = 464
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 175/372 (47%), Gaps = 25/372 (6%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
++P Y V F P GYP SV+ Y Y + +Q F S + L+T S++ G+
Sbjct: 38 TKPFYSGFVSILKEVFLPQGYPDSVHPDYTAYQIWDTVQAFASTIMGTLTTHSIMQGVGV 97
Query: 115 RPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVL 171
A A ++WILKDG +G+++ + G +D + K+WR+ AD L D+ GLE+L
Sbjct: 98 GEPAATPLAAAITWILKDGTGMIGRIVFAWWNGTDLDGQCKKWRLFADILNDVAMGLELL 157
Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
P L + + K I VA ATR + A + NL+D+ AK + T N++
Sbjct: 158 LPYYSSYSLAILCVSTAMKSIVGVAGGATRAAVTQHQALQNNLADVSAKDGSQETCVNLI 217
Query: 232 GLGAGIQLASTVCSSMQGKMIVGP--LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
GI + S G+ I+ L ++H+Y+ +++ +N+LN R A+IV +
Sbjct: 218 ASLVGILILSIF---HNGRYIMELYFFLIIVHLYANYSAVKSLCLNSLNEDRLALIVKSY 274
Query: 290 VKTGRISSPADLRYREDLLF---PGRLIEDAGNVKVGRD-----PHKVVKPSKFHELKEL 341
+ I P ++ +E +L P I N+K+G + ++ S +L
Sbjct: 275 ITNEVIPEPEEVNKKESVLLLTKPTMSIY-GFNIKMGVSLAALIKNNIISTSDTELSLKL 333
Query: 342 FAEEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYD 399
F + K+++ + ++ + +++A D L + + A + F SL V D +
Sbjct: 334 FLDRKYLISIDVQNKTIFICFKKDAQPCDVLEAY----FHACLCGLFICMSLRVPLDLFL 389
Query: 400 K--MNDVFTPLL 409
K ++D+ PL+
Sbjct: 390 KPEVSDMSYPLM 401
>gi|297810247|ref|XP_002873007.1| hypothetical protein ARALYDRAFT_907999 [Arabidopsis lyrata subsp.
lyrata]
gi|297318844|gb|EFH49266.1| hypothetical protein ARALYDRAFT_907999 [Arabidopsis lyrata subsp.
lyrata]
Length = 510
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 105/373 (28%), Positives = 174/373 (46%), Gaps = 29/373 (7%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---------RPTPAQA 121
FP+G+P SV++ YL Y ++ + T +VL T SLL A G+ A A
Sbjct: 121 FPTGFPASVSDDYLDYMLWQFPTNVTGWICNVLVTSSLLKAVGVGSFSGTSAAATAAASA 180
Query: 122 TVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
+ W+ KDG+ +G+L I G+ D +PK+WR+ AD + G+ ++ + L P FL
Sbjct: 181 AAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFL 240
Query: 181 EMAGLGNFAKGIAVVAARATRLP----IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
+A GN AK + AR R P I + FA GNL ++ AK E ++GLG G
Sbjct: 241 LLASTGNLAKAV----ARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFG 296
Query: 237 IQLAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
I + T + S + + ++H++ + + NT+N +R +IV V
Sbjct: 297 ILIIDTPGLVKSFPFVSLTWTSVRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHSV 356
Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKP--SKFHELKELFAEEKFILCLG 352
+ D RE++L R ++ V + ++ SK L +++ +EK+IL L
Sbjct: 357 VPGYVDCNKRENILLGQRFMKPRIIFGVSLEEVSGLEKSVSKVKALLKMYTKEKYILTLN 416
Query: 353 SRWTD----MVLEQNASGEDALRG-WLVAAYAASMAKSFHDPSLTV--LQDAYDKMNDVF 405
+D + + NA+ D LR W +M +SF D L+ + +M++ F
Sbjct: 417 KLNSDTEFSVSFKVNATSRDVLRCLWQAYWLEENMEESFKDKDSVFHWLKQSLSEMDNKF 476
Query: 406 TPLLSELQAKGWH 418
L +L GW+
Sbjct: 477 DDFLFKLDTAGWN 489
>gi|355756739|gb|EHH60347.1| hypothetical protein EGM_11687 [Macaca fascicularis]
Length = 468
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 97/376 (25%), Positives = 176/376 (46%), Gaps = 29/376 (7%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +R+ + TLN R +++ +++ G + P R +
Sbjct: 257 PGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RME 315
Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQ 362
L+ G A ++ +G H++V S EL++L +E ++LC +VL Q
Sbjct: 316 PLWTG--FWPAPSLSLGVPLHRLV--SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQ 371
Query: 363 NASGEDALR----GWLVAAYAA-------------SMAKSFHDPSLTVLQDAYDKMNDVF 405
A + LR G ++ A + S ++++ + ++ +F
Sbjct: 372 KAGPKTVLRAATHGLILGALQGDGPLPAELEELRNQVRAGPKKESWVIVKETHQVLDMLF 431
Query: 406 TPLLSELQAKGWHTDR 421
L LQ GW T++
Sbjct: 432 PKFLKGLQDAGWKTEK 447
>gi|302564091|ref|NP_001181525.1| UPF0420 protein C16orf58 [Macaca mulatta]
gi|380815644|gb|AFE79696.1| hypothetical protein LOC64755 [Macaca mulatta]
Length = 468
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 97/376 (25%), Positives = 176/376 (46%), Gaps = 29/376 (7%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +R+ + TLN R +++ +++ G + P R +
Sbjct: 257 PGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RME 315
Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQ 362
L+ G A ++ +G H++V S EL++L +E ++LC +VL Q
Sbjct: 316 PLWTG--FWPAPSLSLGVPLHRLV--SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQ 371
Query: 363 NASGEDALR----GWLVAAYAA-------------SMAKSFHDPSLTVLQDAYDKMNDVF 405
A + LR G ++ A + S ++++ + ++ +F
Sbjct: 372 KAGPKTVLRAATHGLMLGALQGDGPLPAELEELRNQVRAGPKKESWVIVKETHQVLDMLF 431
Query: 406 TPLLSELQAKGWHTDR 421
L LQ GW T++
Sbjct: 432 PKFLKGLQDAGWKTEK 447
>gi|355710161|gb|EHH31625.1| hypothetical protein EGK_12728 [Macaca mulatta]
Length = 468
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 97/376 (25%), Positives = 176/376 (46%), Gaps = 29/376 (7%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +R+ + TLN R +++ +++ G + P R +
Sbjct: 257 PGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RME 315
Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQ 362
L+ G A ++ +G H++V S EL++L +E ++LC +VL Q
Sbjct: 316 PLWTG--FWPAPSLSLGVPLHRLV--SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQ 371
Query: 363 NASGEDALR----GWLVAAYAA-------------SMAKSFHDPSLTVLQDAYDKMNDVF 405
A + LR G ++ A + S ++++ + ++ +F
Sbjct: 372 KAGPKTVLRAATHGLMLGALQGDGPLPAELEELRNQVRAGPKKESWVIVRETHQVLDMLF 431
Query: 406 TPLLSELQAKGWHTDR 421
L LQ GW T++
Sbjct: 432 PKFLKGLQDAGWKTEK 447
>gi|7339490|emb|CAB82813.1| putative protein [Arabidopsis thaliana]
Length = 631
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 86/320 (26%), Positives = 156/320 (48%), Gaps = 32/320 (10%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQAT 122
+ L +F P G+P SV YL Y+ +R +Q S VL+TQSLL+A GL + A
Sbjct: 192 NLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAA 251
Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+LKDG+ ++ K++ S G D PK WR+ AD L + G+E+L+P+ PQ F+ +
Sbjct: 252 AINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMI 311
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK---------------------- 220
+ A + ATR + FA + N ++++
Sbjct: 312 GAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVYNNFYMALVLITYQQLFVFLNYSGS 371
Query: 221 -GEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNP 279
GEA + VG+ GI +A+ + +S + +++ IH+Y+ ++ + + TLNP
Sbjct: 372 LGEAQGMVSKSVGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNP 431
Query: 280 QRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELK 339
R +++ ++++ +G+ ++ E+ LFP + + P K+ E K
Sbjct: 432 YRASLVFSEYLISGQAPLIKEVN-DEEPLFPTVRFSNM------KSPEKLQDFVLSSEAK 484
Query: 340 ELFAEEKFILCLGSRWTDMV 359
A+ + L LGS+ +D++
Sbjct: 485 AAAADIEERLQLGSKLSDVI 504
>gi|417401458|gb|JAA47615.1| Hypothetical protein [Desmodus rotundus]
Length = 468
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 99/390 (25%), Positives = 175/390 (44%), Gaps = 43/390 (11%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
+ L F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A
Sbjct: 70 LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129
Query: 121 ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
A +WI+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 130 AATATWIVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMCF 189
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
+ N AK + VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 190 TVTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
V + LL+ +HVY+ ++A + TLN R +++ F++ G + P
Sbjct: 250 LPLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPT 309
Query: 300 DLRYREDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSR 354
E L +P ++ +G H+++ S EL++L E ++L
Sbjct: 310 SANQMEPLWTGFWPSL------SLSLGAPLHRLI--SSVFELQQLVEGHREPYLL----H 357
Query: 355 W------TDMVLEQNASGEDALRGWLVAAYAASMAKSFHDP-----------------SL 391
W +VL Q A E LR ++ + P S
Sbjct: 358 WDQSQNRVQVVLSQMAGPETILRAATHGLVLGALREDGPLPRELEELRNQIRAGPKKDSW 417
Query: 392 TVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
++++ + ++ +F L LQ GW T++
Sbjct: 418 VIVKETHQVLDKLFPKFLKGLQDVGWKTEK 447
>gi|328872765|gb|EGG21132.1| DUF647 family protein [Dictyostelium fasciculatum]
Length = 534
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 137/277 (49%), Gaps = 6/277 (2%)
Query: 37 RHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFT 96
+H H S V+D ++ F F P GYP SV Y +Y ++ LQ+
Sbjct: 88 KHVNNNDTTHTSSLVIDTDNQ--QKLTKMFQKLFLPKGYPQSVTHDYSQYMKWMFLQNTL 145
Query: 97 SAALSVLSTQSLL--FAAGLRPTPAQATVVSWILKDGMQHVG-KLICSNLGARMDSEPKR 153
+ VLST +LL GL + A +SW+LKDG+ L S +D K
Sbjct: 146 GSTTYVLSTHALLTSVGVGLSTSLPFAAAISWVLKDGLGASALVLFASKYSTSLDYNLKS 205
Query: 154 WRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGN 213
++ D L+++G LE+L+P P FL +A + N +KG+A + ATR + SF+ + N
Sbjct: 206 FKFRGDILHNVGVMLEMLTPFVPGYFLPLASVSNLSKGLAGLIYGATRASLNRSFSLQEN 265
Query: 214 LSDLFAKGEAISTLFNVVGLGAGIQLASTVC-SSMQGKMIVGPLLSVIHVYSVIEEMRAA 272
L D+ AK ++ S + G+G G + + + + ++S++H+ ++ +++
Sbjct: 266 LGDITAKYQSQSMASYLTGMGLGTTIGLLMSPMTTTANLCAVFIISLLHMSCSVKSLKSI 325
Query: 273 PVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF 309
+ TLN QR+ +I+ +++ I SP + E+ +F
Sbjct: 326 CLKTLNSQRSKIIIDHWLREKIILSPEIVNKNENFMF 362
>gi|195344882|ref|XP_002039005.1| GM17287 [Drosophila sechellia]
gi|194134135|gb|EDW55651.1| GM17287 [Drosophila sechellia]
Length = 417
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 165/357 (46%), Gaps = 19/357 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F P GYP SV+E Y Y + Q F S L T ++L G A + +WI
Sbjct: 53 FLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATATWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
LK+G HVG+++ + G+++D + K+WR+ AD L DL G+E+ + P P L ++
Sbjct: 113 LKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQILCC 172
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K I VA ATR + A GNL+D+ +K + T N+V G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
I ++S +H+Y+ ++ +RA + + N R + + +F ++ R+ SP + E
Sbjct: 233 QAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNAME 291
Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDMVL 360
+ G+ + + N+K+G ++ K + E E FI+ + + L
Sbjct: 292 RVTV-GQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSSFDPHEHFIIAQTKKCLGVYL 350
Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGW 417
+D L+ + +A S + + +++ Y + + L+ Q +GW
Sbjct: 351 HFETRPQDVLKAYF---FAVSYLQDRNQ-----IKEKYWDIQTKWQEFLALAQQEGW 399
>gi|342881518|gb|EGU82408.1| hypothetical protein FOXB_07090 [Fusarium oxysporum Fo5176]
Length = 448
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 109/425 (25%), Positives = 196/425 (46%), Gaps = 36/425 (8%)
Query: 22 QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNE 81
+ + I + S +V+R + D H VV S+P ++ S F P GYP SV+
Sbjct: 12 EPVEIVELDKSGNVSRTWNHTVDNH----VVAKSKPSI-PLIPSLREAFMPVGYPHSVSS 66
Query: 82 GYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLI 139
YL Y F +LQ F S S+L+ ++LL G+ + AT ++ +LKD M + ++
Sbjct: 67 DYLNYQFFDSLQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVLKDAMSRIATIV 126
Query: 140 CSN-LGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAKGIAVVAA 197
++ G R++ + KR+R LAD D LE+ SP P + G + + VAA
Sbjct: 127 FAHQFGLRIEPDAKRFRFLADLFNDTAFFLELYSPYFGPYGKILALTSGETLRALCGVAA 186
Query: 198 RATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLL 257
A++ + FAK NLS+L AK + T ++GL G + V + + +L
Sbjct: 187 GASKAALSVHFAKHDNLSELNAKEASQETAVGLIGLLVGTIVVKYV-EDHHSVLFLMTVL 245
Query: 258 SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDA 317
H++ +RA ++ LN QR ++ +++KTG + SP ++ +RE++L ++ +
Sbjct: 246 VFAHLWMNYLGVRAVCMDNLNQQRATILFEEYLKTGNVLSPEEVAHRENILLWRPVVRNQ 305
Query: 318 GNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSR-----W---------TDMVLEQN 363
K+ R + S + + ++ G+R W ++L ++
Sbjct: 306 HGQKIARIE---MAKSYWDAMGAKAPNSNVVIIDGARSSLFVWYHSKRSYTPIKIMLWKD 362
Query: 364 ASGEDALRGWLVA---AYAASMAKSFHDPSLTVLQ-----DAYDKMNDVFTPLLSELQAK 415
AS A+ W +A A+ ++ + D + + D Y++ D L E+ AK
Sbjct: 363 ASAIHAISAWFMAIEMAWLLEKSQGYTDKLDKIFRVKETVDGYEQKYDK-RDLWDEMLAK 421
Query: 416 GWHTD 420
GW+ +
Sbjct: 422 GWNIE 426
>gi|24585018|ref|NP_609897.2| CG10338 [Drosophila melanogaster]
gi|22946770|gb|AAF53692.2| CG10338 [Drosophila melanogaster]
gi|384475974|gb|AFH89820.1| FI20169p1 [Drosophila melanogaster]
Length = 417
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 164/357 (45%), Gaps = 19/357 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F P GYP SV+E Y Y + Q F S L T ++L G A + +WI
Sbjct: 53 FLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATATWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
LK+G HVG+++ + G+++D + K+WR+ AD L DL G+E+ + P P ++
Sbjct: 113 LKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQILCC 172
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K I VA ATR + A GNL+D+ +K + T N+V G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
I ++S +H+Y+ ++ +RA + + N R + + +F ++ R+ SP + E
Sbjct: 233 QAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNAME 291
Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDMVL 360
+ G+ + + N+K+G ++ K + E E FI+ + + L
Sbjct: 292 RVTV-GQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSSFDPHEHFIIAQTKKCLGVYL 350
Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGW 417
+D L+ + +A S + + L++ Y + + L+ Q +GW
Sbjct: 351 HFETRPQDVLKAYF---FAVSYLQDRNQ-----LKEKYWDIQTKWQEFLALAQQEGW 399
>gi|410984724|ref|XP_003998676.1| PREDICTED: UPF0420 protein C16orf58 homolog [Felis catus]
Length = 468
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 99/390 (25%), Positives = 179/390 (45%), Gaps = 43/390 (11%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
+ S + F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A
Sbjct: 70 LSSLIAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129
Query: 121 ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
A +W++KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 130 AATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFF 189
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 190 TITICTSNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
V + L+ +H+Y+ +RA + TLN R +++ F++ G + P
Sbjct: 250 LPLVSDCPGFSLGCFFFLTALHIYANYRAVRALVIETLNEGRLRLVLKHFLQRGEVLGPT 309
Query: 300 DLRYREDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSR 354
E L +P + ++ +G H++ S +L++L +E ++L R
Sbjct: 310 SANQMEPLWTGFWPSQ------SLSLGVPLHRLT--SSVFDLQQLVEGHQEPYLL----R 357
Query: 355 W------TDMVLEQNASGEDALR----GWLVAAYAAS--MAKSFHD-----------PSL 391
W +VL Q A E LR G ++ A + + + +
Sbjct: 358 WDLSQNQVQVVLSQMAGPETILRAATHGLVLEALQGDGPLPRGLEELRNQVRAGPEKDNW 417
Query: 392 TVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
+++D + ++ +F L LQ GW T++
Sbjct: 418 VIVRDTHQVLDKLFPKFLKGLQDAGWKTEK 447
>gi|403368796|gb|EJY84239.1| hypothetical protein OXYTRI_18022 [Oxytricha trifallax]
Length = 504
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 73/241 (30%), Positives = 121/241 (50%), Gaps = 4/241 (1%)
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT 117
V +++ F P GYP SV EGYL+++ F L + A+S LS QSL A G T
Sbjct: 85 VKNKIYIGLFETFLPVGYPNSVAEGYLKFSIFNNLSALSITAMSFLSAQSLFVAIGSTMT 144
Query: 118 PAQ--ATVVSWILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
A A +W+LKDG+ +G L S G D + K+WR +A ++ +E+L+
Sbjct: 145 QANLAAAAYTWVLKDGIGQLGAILFASRYGRNFDEDIKKWRFMAMIALNISIYIEILTLR 204
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
P FL +A + N K I + + A+R I FAK N+ D+ K + T + G+G
Sbjct: 205 FPNHFLALASIANIGKNICFLLSAASRASINVQFAKRNNIGDISGKSVSQFTASTLCGVG 264
Query: 235 AGIQLASTV-CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
G+ L+ + SS+ V +L+++++ + + + LN QR +I ++ K+
Sbjct: 265 IGLMLSKLIDISSISQLYPVFMILTILNIATSYQASKVIDEVYLNNQRAFIIFNEYFKSN 324
Query: 294 R 294
+
Sbjct: 325 K 325
>gi|195115010|ref|XP_002002060.1| GI17175 [Drosophila mojavensis]
gi|193912635|gb|EDW11502.1| GI17175 [Drosophila mojavensis]
Length = 417
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 89/365 (24%), Positives = 168/365 (46%), Gaps = 19/365 (5%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVS 125
N F P GYP SV++ Y Y + +Q F S L T ++L G A + V+
Sbjct: 51 NIFLPKGYPDSVSDDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENVNAYSATVT 110
Query: 126 WILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMA 183
WILK+G H+G++I + G+++D + K+WR+ AD L DL G+E+ + P P ++
Sbjct: 111 WILKEGSGHMGRIIFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQIL 170
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
K I VA ATR + A GNL+D+ +K + T N++ G+ L T+
Sbjct: 171 CGSTLLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLIASFMGLYLL-TM 229
Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRY 303
S V ++ +H+Y+ ++ +RA + T N R + + +F ++ ++ +P +
Sbjct: 230 IKSQAMLYTVFYIVVTLHLYANLKAVRAVCLRTFNESRYLITLEEFFRSSQMLTPQQVNK 289
Query: 304 REDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDM 358
E + G+++ + +++G ++ K ++ E E FI+ ++ +
Sbjct: 290 LERVTL-GQMVSVSLKIRLGLSVKHLIDEYKSSDVIEDIVSSFDPHEHFIIAESKKFLGV 348
Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWH 418
L + +D L+ + K +++ Y + + LS Q +GW
Sbjct: 349 YLHFDTRPQDVLKAYFFGVSYLQDRKH--------IKEKYWDIQGKWQEFLSLAQKEGWI 400
Query: 419 TDRFL 423
++ L
Sbjct: 401 INQHL 405
>gi|426254633|ref|XP_004020981.1| PREDICTED: UPF0420 protein C16orf58 homolog [Ovis aries]
Length = 462
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 101/384 (26%), Positives = 176/384 (45%), Gaps = 37/384 (9%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
L F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A
Sbjct: 67 LLAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAAT 126
Query: 124 VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+W++KD +G+++ + +G++MD K+WR+ AD L D+ LE+++P+ P F
Sbjct: 127 ATWLVKDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTVT 186
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ N AK + VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 187 VCISNLAKCLVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLAGLLVSLLMLPL 246
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
V S + L+ +H+Y+ +RA + TLN QR +++ F++ G + P
Sbjct: 247 VSDSPSLSLGCFFFLTALHIYANYRAVRALVIETLNEQRLWLVLRHFLQWGEVLGPTSAN 306
Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSRW----- 355
R + L+ G + ++ +G H V S EL+ L +E ++L W
Sbjct: 307 -RMEPLWTG--FWSSVSLSLGAPLHCVT--SSVSELQHLVEGHQEPYLL----HWDQSRN 357
Query: 356 -TDMVLEQNASGEDALR----GWLVAAYAA-------------SMAKSFHDPSLTVLQDA 397
+VL A E LR G ++ A + S V+++
Sbjct: 358 QVQVVLSPMAGPEAVLRAATHGLVLGALRGDGPLPEELEELRNQVRAGPEKESWVVVRET 417
Query: 398 YDKMNDVFTPLLSELQAKGWHTDR 421
+ ++ +F L LQ GW T++
Sbjct: 418 HQVLDKLFPKFLKGLQDGGWSTEK 441
>gi|449017769|dbj|BAM81171.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 827
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 116/425 (27%), Positives = 187/425 (44%), Gaps = 61/425 (14%)
Query: 56 RPVYHRVVDSFLNKFF--PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
R V+ +V S L + F + +P V++ YL YT +Q+ AA SV++TQS+L A G
Sbjct: 328 RNVWRALVGSLLPEGFGDKTRFPPVVSDDYLPYTVLHFMQNLVHAAASVIATQSMLIAIG 387
Query: 114 L--RPTPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEV 170
+ + T + V W+LKDG+ + +I S LG R D++PKRWR+ D LY LG G E+
Sbjct: 388 VAKQATIGVSAAVQWVLKDGLGRIAAIIFGSLLGNRYDADPKRWRMRGDMLYTLGVGTEI 447
Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
+ L PQ FL + + N K ++ + T I F+ N DL AK + L +
Sbjct: 448 ATRLVPQYFLLVGTMANAIKSVSYMMRLPTAAAIRRHFSLRENFGDLSAKANSQEVLSML 507
Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+G GI L+ + +S++ + + + +R + TLN QRT +++ +
Sbjct: 508 LGTFVGIGLSYMIDNSLKALFAFYIIYVQMFILFNYWSLRVLRMRTLNLQRTLIVMRRYW 567
Query: 291 KTGRI--SSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFI 348
+G + S A E+ L P RL A V+ G + + E +E A ++
Sbjct: 568 NSGGVETCSVAMANQAENFLLPSRL-GPARRVRFGCRLGEAFR--TVGEFEEAIARQRL- 623
Query: 349 LCLGSRWTDMVLEQ--------NAS-----------------GEDALRGWLVA-----AY 378
W L+Q NA G DA RG + A
Sbjct: 624 ------WRGQYLQQTDVSKRNGNADADFTSVPDDVLDERYLLGVDAKRGRVFVVLHRDAK 677
Query: 379 AASMAKSFHDPSLTVLQD-------AYDKMNDVF------TPLLSE-LQAKGWHTDRFLD 424
A ++ + V D + D++ D + TP+L + ++A GW+ D F+
Sbjct: 678 VADELQALMHATYLVRADWGPPPDMSIDRVRDGYLLACAETPVLVQRMRAVGWNVDEFVH 737
Query: 425 GTGTR 429
G R
Sbjct: 738 AGGER 742
>gi|403276814|ref|XP_003930078.1| PREDICTED: UPF0420 protein C16orf58 homolog [Saimiri boliviensis
boliviensis]
Length = 468
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 176/376 (46%), Gaps = 29/376 (7%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGF 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
L+ +H+Y+ +RA + TLN R +++ +++ G + P R +
Sbjct: 257 PGFSFGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTTAN-RME 315
Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQ 362
L+ G ++ +G H++V S EL++L +E ++LC +VL Q
Sbjct: 316 PLWTG--FWPTPSLSLGVPLHRLV--SSVSELQQLVEGHQEPYLLCWDQSQNQVQVVLNQ 371
Query: 363 NASGEDALR----GWLVAAYAA-----SMAKSFHD--------PSLTVLQDAYDKMNDVF 405
A + LR G ++ A + K + S ++++ + ++ +F
Sbjct: 372 KADPKTVLRAATHGLMLGALQGDGPLPAELKELRNRVRAGPKKESWVIVKETHQVLDMLF 431
Query: 406 TPLLSELQAKGWHTDR 421
L LQ GW T++
Sbjct: 432 PKFLKGLQDAGWKTEK 447
>gi|300176775|emb|CBK25344.2| unnamed protein product [Blastocystis hominis]
Length = 409
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 74/214 (34%), Positives = 116/214 (54%), Gaps = 3/214 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA-QATVVSWIL 128
F P YP SV YL Y + A SAA VLSTQS+LFA G+ A ++W+L
Sbjct: 105 FLPRNYPSSVKGKYLNYCSWAAFGMGLSAAGGVLSTQSMLFAIGMGAGAVPMAAALNWVL 164
Query: 129 KDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
KDG+ +G + + L +R+D++ + WRIL+ L ++ T LEV++PL P FL +A L N
Sbjct: 165 KDGLGQLGGMAFTALVNSRLDADSRGWRILSAWLLEISTWLEVMTPLFPHSFLLLATLAN 224
Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC-SS 246
K I+ +A ATR I F N+ D+ AK + + V G GI +++ +
Sbjct: 225 VGKNISWLAGSATRAGIRYGFVNAHNMGDITAKEGSQTVAITVFGTFLGIVISNLIGHGH 284
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQ 280
M+ ++ +S + ++S+ + +R + TLN Q
Sbjct: 285 MEYVLMSSMCISAVSLFSIYQSLRCVSLPTLNYQ 318
>gi|195579948|ref|XP_002079818.1| GD24150 [Drosophila simulans]
gi|194191827|gb|EDX05403.1| GD24150 [Drosophila simulans]
Length = 417
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 164/357 (45%), Gaps = 19/357 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F P GYP SV+E Y Y + Q F S L T ++L G A + +WI
Sbjct: 53 FLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATATWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
LK+G HVG+++ + G+++D + K+WR+ AD L DL G+E+ + P P ++
Sbjct: 113 LKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQILCC 172
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K I VA ATR + A GNL+D+ +K + T N+V G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
I ++S +H+Y+ ++ +RA + + N R + + +F ++ R+ SP + E
Sbjct: 233 QAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNAME 291
Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDMVL 360
+ G+ + + N+K+G ++ K + E E FI+ + + L
Sbjct: 292 RVTV-GQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSSFDPHEHFIIAQTKKCLGVYL 350
Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGW 417
+D L+ + +A S + + +++ Y + + L+ Q +GW
Sbjct: 351 HFETRPQDVLKAYF---FAVSYLQDRNQ-----IKEKYWDIQTKWQEFLALAQQEGW 399
>gi|125986275|ref|XP_001356901.1| GA10253 [Drosophila pseudoobscura pseudoobscura]
gi|195148693|ref|XP_002015302.1| GL19630 [Drosophila persimilis]
gi|54645227|gb|EAL33967.1| GA10253 [Drosophila pseudoobscura pseudoobscura]
gi|194107255|gb|EDW29298.1| GL19630 [Drosophila persimilis]
Length = 417
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/317 (26%), Positives = 149/317 (47%), Gaps = 11/317 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F P GYP SV+E Y Y + +Q F S L T ++L G A + V+WI
Sbjct: 53 FLPKGYPDSVSEDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENINAYSATVTWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
+K+G HVG+++ + G ++D + K+WR+ AD L DL G+E+ + P P L ++
Sbjct: 113 MKEGCGHVGRILFAWWKGCQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQILCG 172
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K I VA ATR + A GNL+D+ +K + T N+V G+ L T+
Sbjct: 173 STMLKAIVGVAGGATRSALTQHHAVRGNLADVASKDSSQETCVNLVASFVGLYLL-TLIK 231
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
S V ++ +H+Y+ ++ +R+ + T N R + + ++ ++ R+ +P + E
Sbjct: 232 SQAVLYTVFYVVVALHLYANLKAVRSVCLRTFNESRYLIALEEYFRSSRMLTPQQVNAME 291
Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDMVL 360
+ G+ + + NV++G ++ K E E FI+ ++ + L
Sbjct: 292 RVTL-GQTVSVSLNVRLGLSVKNLIDEYKTSSSIENIVSSFDPHEHFIIAESKKYLGVYL 350
Query: 361 EQNASGEDALRGWLVAA 377
+D L+ + A
Sbjct: 351 HFETRQQDVLKAYFFAV 367
>gi|195035521|ref|XP_001989226.1| GH11606 [Drosophila grimshawi]
gi|193905226|gb|EDW04093.1| GH11606 [Drosophila grimshawi]
Length = 417
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 83/319 (26%), Positives = 153/319 (47%), Gaps = 11/319 (3%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVS 125
N F P GYP SV++ Y+ Y + LQ F S L T ++L G A + V+
Sbjct: 51 NIFLPKGYPDSVSDDYVVYQIWDTLQAFCSTICGTLCTHAILKGIGVGSENINAYSATVT 110
Query: 126 WILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMA 183
WILK+G HVG+++ + G+++D + K+WR+ AD L DL G+E+ + P P ++
Sbjct: 111 WILKEGSGHVGRILFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQIL 170
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
K I VA ATR + A GNL+D+ +K + T N+V G+ L + +
Sbjct: 171 CGSTMLKAIVGVAGGATRSALTQHHAVRGNLADVASKDSSQETCVNLVASFVGLYLLTMI 230
Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRY 303
S I +++ +H+Y+ ++ +R+ + T N R + + +F ++ + +P +
Sbjct: 231 KSQAMLYTIFYIVVT-LHLYANLKAVRSICLRTFNESRYLIALEEFFRSSHMLTPQQVNK 289
Query: 304 REDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDM 358
E + G+ + + N+++G ++ K + E E+FI+ + +
Sbjct: 290 MERVTV-GQTVSVSLNIRLGLSVKHLIDEYKSSSVIENIVSSFDPHERFIIAESKKHLGV 348
Query: 359 VLEQNASGEDALRGWLVAA 377
L + +D L+ + A
Sbjct: 349 YLHFDTRPQDVLKAYFFAV 367
>gi|189241392|ref|XP_968556.2| PREDICTED: similar to UPF0420 protein C16orf58 homolog [Tribolium
castaneum]
Length = 472
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 95/338 (28%), Positives = 161/338 (47%), Gaps = 14/338 (4%)
Query: 47 LSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQ 106
+S + V DS V V+ F P GYP SV+E Y Y + +Q F S + +TQ
Sbjct: 35 ISQESVADS--VLQSVLGFFREVLLPHGYPDSVSEDYFEYQLWDTVQAFCSTITGIFTTQ 92
Query: 107 SLLFAAGLRPTPAQA--TVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYD 163
++L G+ A A ++WI+KDG VG++I + G +DS K+WR+ AD L D
Sbjct: 93 AILKGVGVGSAEASALSATITWIMKDGTGMVGRIIFAWWKGKSLDSNCKKWRLFADLLND 152
Query: 164 LGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
LE+ P L E+ L K + VA ATR I + A +GN++++ AK +
Sbjct: 153 GAMILELSIPHVTHLSTEILCLSTLMKAMVGVAGGATRASITNHQAIKGNMAEIAAKDGS 212
Query: 224 ISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
TL N++ GI L + + + IV LL +H+YS + + A LN R
Sbjct: 213 QETLVNLIASFTGIFLINYFTDATT-QWIVTFLLIFLHLYSNYQAVTALIFKNLNNLRMT 271
Query: 284 MIVADFVKTGRISSPADLRYREDLLFP-GRLIEDAGNVKV--GRDPHKVVKPSKFHELK- 339
+I+ ++ + P + E + G +E K+ G+ K ++ ELK
Sbjct: 272 LILKSYLAVNAVLGPECINKTESVFLGFGTSVEKFCGFKIVLGQALGKTLRRYNSDELKS 331
Query: 340 --ELFAEEKFILC--LGSRWTDMVLEQNASGEDALRGW 373
E++ ++K+++ L +R + E+ + ++ + G+
Sbjct: 332 LTEIYKDKKYLIVPDLKTRTIYVTFEKGETPQEVIAGF 369
>gi|195484210|ref|XP_002090597.1| GE13200 [Drosophila yakuba]
gi|194176698|gb|EDW90309.1| GE13200 [Drosophila yakuba]
Length = 417
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 164/357 (45%), Gaps = 19/357 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F P GYP SV+E Y Y + Q F S L T ++L G A + +WI
Sbjct: 53 FLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATATWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
LK+G HVG+++ + G+++D + K+WR+ AD L DL G+E+ + P P ++
Sbjct: 113 LKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQILCC 172
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K I VA ATR + A GNL+D+ +K + T N+V G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
I ++S +H+Y+ ++ +RA + + N R + + +F ++ R+ SP + E
Sbjct: 233 QAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNAME 291
Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDMVL 360
+ G+ + + N+K+G ++ K + E E FI+ + + L
Sbjct: 292 RVTV-GQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSSFDPHEHFIIAETKKCLGVYL 350
Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGW 417
+D L+ + +A S + + +++ Y + + L+ Q +GW
Sbjct: 351 HFETRPQDVLKAYF---FAVSYLQDRNQ-----IKEKYWDIQTKWQEFLALAQQEGW 399
>gi|322787029|gb|EFZ13253.1| hypothetical protein SINV_10096 [Solenopsis invicta]
Length = 468
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 87/342 (25%), Positives = 161/342 (47%), Gaps = 19/342 (5%)
Query: 54 DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
+S + R+ F F P GYP SV+ Y Y + +Q F S L+T S++ G
Sbjct: 35 ESLAIRSRITSVFKQIFLPQGYPDSVHPDYTAYQIWDTVQAFASTITGTLTTHSIMKGIG 94
Query: 114 LRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV 170
+ + A A ++WILK G +G ++ + G ++D + K+WR+ AD L D+ G+E+
Sbjct: 95 VGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTKLDGQCKKWRLFADILDDIAKGIEL 154
Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
L P + + K I VA ATR + + A + N++D+ AK + T N+
Sbjct: 155 LVPFFSSYSTVILCISTTMKSIVGVAGGATRTALINHQAIQNNVADVSAKDGSQETCVNL 214
Query: 231 VGLGAGIQLASTVCSSMQGKMIVG--PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
+ GI + S G+ ++ L +H+Y+ ++A ++TLN R A+IV +
Sbjct: 215 IASFFGIFILSFF---HDGRYLLELYLFLVAVHLYANYSAVKALCLDTLNEDRLAIIVKN 271
Query: 289 FVKTGRISSPADLRYREDLLF---PGRLIEDAGNVKVGRD-----PHKVVKPSKFHELKE 340
++ +I P + E + PGR + N+K+G ++ ++ L +
Sbjct: 272 YMINEQIPKPGKVNKEESIFLLTNPGRNVH-GFNIKIGVSFTSILKRNIISLTEMEFLLK 330
Query: 341 LFAEEKFILCLGSRWTD--MVLEQNASGEDALRGWLVAAYAA 380
F K+I+ + + ++L+++ + L G+ A+ A
Sbjct: 331 FFENRKYIITIDIEKKNIFIILKKDIQSIEILEGYFYASMCA 372
>gi|330802843|ref|XP_003289422.1| hypothetical protein DICPUDRAFT_153796 [Dictyostelium purpureum]
gi|325080505|gb|EGC34058.1| hypothetical protein DICPUDRAFT_153796 [Dictyostelium purpureum]
Length = 552
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 71/250 (28%), Positives = 131/250 (52%), Gaps = 7/250 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F PSGYP SV Y RY + ++Q S+ L+T+++L + G + W+
Sbjct: 60 FLPSGYPDSVTSDYFRYQFWDSIQALCSSITGTLATRAILKGYGVGNSSATVASATTQWV 119
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGL 185
++ +G++I + G +D K+WR +AD L D+G E++SPL + LFL ++ +
Sbjct: 120 IQSATGMIGRIIFAWANGTDLDCNSKKWRFMADILNDIGMTFEMISPLFNEALFLPLSCI 179
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC- 244
G FAK I VA T+ + FAK NL+D+ AK + T +++G+ + ++S +
Sbjct: 180 GLFAKSICGVAGGCTKASLTQHFAKNDNLADVSAKDGSQETAVSLLGMILSVFVSSVLND 239
Query: 245 -SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPADLR 302
+S+ + + + H+Y + + A +N++N R +I +++ G I SP ++
Sbjct: 240 DTSILVTWFIFFIFTGFHLYCNYKAVSAVQLNSINKYRAHLIYEYYIRNQGSIPSPKEIS 299
Query: 303 YREDLLFPGR 312
E++LF +
Sbjct: 300 NFENILFNSK 309
>gi|194880075|ref|XP_001974360.1| GG21126 [Drosophila erecta]
gi|190657547|gb|EDV54760.1| GG21126 [Drosophila erecta]
Length = 417
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/316 (27%), Positives = 148/316 (46%), Gaps = 11/316 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F P GYP SV+E Y Y + Q F S L T ++L G A + +WI
Sbjct: 53 FLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATATWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
LK+G HVG+++ + G+++D + K+WR+ AD L DL G+E+ + P P ++
Sbjct: 113 LKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQILCC 172
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K I VA ATR + A GNL+D+ +K + T N+V G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
I ++S +H+Y+ ++ +RA + + N R + + +F ++ R+ SP + E
Sbjct: 233 QAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNAME 291
Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDMVL 360
+ G+ + + N+K+G ++ K + E E FI+ + + L
Sbjct: 292 RVTV-GQTVSVSLNIKLGLSVKNLIDEYKSSSVIEDIVSSFDPHEHFIIAETKKCLGVYL 350
Query: 361 EQNASGEDALRGWLVA 376
+D L+ + A
Sbjct: 351 HFETRPQDVLKAYFFA 366
>gi|270013094|gb|EFA09542.1| hypothetical protein TcasGA2_TC011650 [Tribolium castaneum]
Length = 450
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 95/339 (28%), Positives = 161/339 (47%), Gaps = 14/339 (4%)
Query: 46 HLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLST 105
+S + V DS V V+ F P GYP SV+E Y Y + +Q F S + +T
Sbjct: 34 EISQESVADS--VLQSVLGFFREVLLPHGYPDSVSEDYFEYQLWDTVQAFCSTITGIFTT 91
Query: 106 QSLLFAAGLRPTPAQA--TVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALY 162
Q++L G+ A A ++WI+KDG VG++I + G +DS K+WR+ AD L
Sbjct: 92 QAILKGVGVGSAEASALSATITWIMKDGTGMVGRIIFAWWKGKSLDSNCKKWRLFADLLN 151
Query: 163 DLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGE 222
D LE+ P L E+ L K + VA ATR I + A +GN++++ AK
Sbjct: 152 DGAMILELSIPHVTHLSTEILCLSTLMKAMVGVAGGATRASITNHQAIKGNMAEIAAKDG 211
Query: 223 AISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRT 282
+ TL N++ GI L + + + IV LL +H+YS + + A LN R
Sbjct: 212 SQETLVNLIASFTGIFLINYFTDATT-QWIVTFLLIFLHLYSNYQAVTALIFKNLNNLRM 270
Query: 283 AMIVADFVKTGRISSPADLRYREDLLFP-GRLIEDAGNVKV--GRDPHKVVKPSKFHELK 339
+I+ ++ + P + E + G +E K+ G+ K ++ ELK
Sbjct: 271 TLILKSYLAVNAVLGPECINKTESVFLGFGTSVEKFCGFKIVLGQALGKTLRRYNSDELK 330
Query: 340 ---ELFAEEKFILC--LGSRWTDMVLEQNASGEDALRGW 373
E++ ++K+++ L +R + E+ + ++ + G+
Sbjct: 331 SLTEIYKDKKYLIVPDLKTRTIYVTFEKGETPQEVIAGF 369
>gi|307111229|gb|EFN59464.1| hypothetical protein CHLNCDRAFT_138052 [Chlorella variabilis]
Length = 657
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 74/230 (32%), Positives = 124/230 (53%), Gaps = 18/230 (7%)
Query: 106 QSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADAL 161
+SLL A G+ R PA AT+ +W+LKDG+ +G+L + + G DS+ KR+R +
Sbjct: 78 ESLLQAVGVGARRSLPAAATI-NWVLKDGLGRLGRLTVATRFGESFDSDLKRFRYATSII 136
Query: 162 YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKG 221
Y + LE L+PL PQ FL MA L N K I + AT+ + SF NL+D+ AK
Sbjct: 137 YAVSLSLEFLTPLAPQHFLVMASLANVGKSIGLTTFIATQPAFHRSFCLRENLADISAKT 196
Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGK-----MIVGPLLSVIHVYSVIEEMRAAPVNT 276
+A + + +GL A + L + +C + +++ PLL+ ++++ E+R+ + T
Sbjct: 197 QAQQMVMDNIGLAAAVGL-TYLCRHTEAARRALPLVMFPLLAAGDLWAIYSELRSIHLRT 255
Query: 277 LNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDP 326
LN +R +I +++ GR+ SP + E + P +++VGR P
Sbjct: 256 LNKERAEIIAQHWLREGRVPSPRQVSEEERFVLP-------PHIEVGRMP 298
>gi|405974674|gb|EKC39300.1| UPF0420 protein C16orf58-like protein [Crassostrea gigas]
Length = 444
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 91/312 (29%), Positives = 147/312 (47%), Gaps = 22/312 (7%)
Query: 49 VKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSL 108
+K VD S P + + F F P GYP SV+ YL+Y + +Q F S+ L+ Q+L
Sbjct: 24 LKTVDLS-PKWTSLSQFFTVVFLPQGYPESVSSDYLQYQIWDTIQAFASSITGTLAAQAL 82
Query: 109 LFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLG 165
L G+ A A ++WILKDG +G+++ + + G +D + KRWR+ AD L D+
Sbjct: 83 LKGVGVGDESATVMAATLTWILKDGTGMIGRILFAWMQGTNLDCDAKRWRLFADILNDMA 142
Query: 166 TGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIS 225
+E+L+P F + K I VA ATR + A+ N++D+ AK +
Sbjct: 143 IFMEILAPYFKAYFTPIVCTAGVCKSIVGVAGGATRAALTQHQARRNNMADVSAKDGSQE 202
Query: 226 TLFNVVGLGAGIQLASTVC-------SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLN 278
TL N+ L + L V S ++ + V SV+ E TLN
Sbjct: 203 TLVNLAALVCSLTLVPLVTGKDVLIWSLFLLFTLLHLFANYSAVTSVVME-------TLN 255
Query: 279 PQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV-KPSKFHE 337
R ++V +++TG + S ++ Y+E +L+ R NV +G KV + S E
Sbjct: 256 QARLHILVHHYLQTGEVLSLQEVNYKEPVLWKTR---RKLNVHLGTSIGKVFSRASILEE 312
Query: 338 LKELFAEEKFIL 349
L+ + ++L
Sbjct: 313 SIALYKDSNYLL 324
>gi|417401377|gb|JAA47577.1| Hypothetical protein [Desmodus rotundus]
Length = 464
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 90/323 (27%), Positives = 153/323 (47%), Gaps = 26/323 (8%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
+ L F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A
Sbjct: 70 LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129
Query: 121 ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
A +WI+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 130 AATATWIVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMCF 189
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
+ N AK + VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 190 TVTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
V + LL+ +HVY+ ++A + TLN R +++ F++ G + P
Sbjct: 250 LPLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPT 309
Query: 300 DLRYREDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSR 354
E L +P ++ +G H+++ S EL++L E ++L
Sbjct: 310 SANQMEPLWTGFWPSL------SLSLGAPLHRLI--SSVFELQQLVEGHREPYLL----H 357
Query: 355 W------TDMVLEQNASGEDALR 371
W +VL Q A E LR
Sbjct: 358 WDQSQNRVQVVLSQMAGPETILR 380
>gi|397615116|gb|EJK63231.1| hypothetical protein THAOC_16128 [Thalassiosira oceanica]
Length = 621
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 166/365 (45%), Gaps = 82/365 (22%)
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
+ + + S F P G V Y R+ ++R LQ F +A + V+ TQSLL GLR
Sbjct: 141 KSIIRKCRQSLHKNFVPEG----VTPSYYRFMRWRILQRFMNANVHVIGTQSLLM--GLR 194
Query: 116 PTPAQATVVS----------------WILKDGMQHVGKLI----CSNLGARMDSEPKRWR 155
++TVVS W+LKD + GKL+ S +G + DS+ KRWR
Sbjct: 195 GM-QRSTVVSAAAKGGAALGAVAATNWVLKDTL---GKLVRMAWASKMGRKFDSDAKRWR 250
Query: 156 ILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG--- 212
A +Y LG GLEV + L P+ FL +A L N K ++++ + ATR +Y+SF +
Sbjct: 251 YRACFIYALGNGLEVSTYLHPRFFLVLAMLANSCKQVSMLTSSATRNALYNSFRRMDDDV 310
Query: 213 -------------------------NLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
N+ D+ AKGEA + +++G+ +GI L+ + S+
Sbjct: 311 DTVGTGSNSTLKSSINGGRRRGVVENIGDITAKGEAQIAVVDLLGIASGICLSRAIGVSV 370
Query: 248 QGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV----KTG---------- 293
Q + V +L + +Y + E+R+ TLN +R ++ + ++G
Sbjct: 371 QNVLTVWSVLQLGEIYCMYHEIRSIVYQTLNFERMYTVLGRLLPVDDESGSANGESCCCD 430
Query: 294 RISSPADLRYREDLLFP----GRLIEDAGNVKVGRDPHKVVKPSKFHEL-KELFAEEKFI 348
RI +P + E + P GR + G V+ P + L +++F E+F
Sbjct: 431 RIPTPDQIAAEEKVFLPPDHLGR-----KEIAFGSAGRSVLSPGELDVLIRDVFRGERFF 485
Query: 349 LCLGS 353
L +G+
Sbjct: 486 LAVGA 490
>gi|350411769|ref|XP_003489447.1| PREDICTED: UPF0420 protein C16orf58 homolog [Bombus impatiens]
Length = 464
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 97/354 (27%), Positives = 171/354 (48%), Gaps = 19/354 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
F P GYP SV+ Y Y + +Q F S + L+T S++ G+ A ++WI
Sbjct: 53 FLPQGYPDSVHPDYTPYQIWDTVQAFASTIMGTLTTHSIMQGVGVGEAAATPLAAAITWI 112
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
LKDG +G+++ + G +D + K+WR+ AD L DL GLE+ P L + +
Sbjct: 113 LKDGTGMIGRIVFAWWNGTDLDGQCKKWRLFADILNDLAMGLELFLPYFSSYSLGILCIS 172
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
K I VA ATR + A + NL+D+ AK + T N++ GI + S + +
Sbjct: 173 TAMKSIVGVAGGATRAALTQHQALQNNLADVSAKDGSQETCVNLIASFVGILILS-IFHN 231
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ M + L +IH+Y+ ++A +N+LN R A+IV ++ I P ++ +E
Sbjct: 232 GRYIMELYLFLVMIHLYANYMAVKALCLNSLNEDRLALIVKSYISNEVIPKPEEVNKKES 291
Query: 307 LLFPGR-LIEDAG-NVKVGRDPHKVVK-----PSKFHELKELFAEEKFILCLG--SRWTD 357
+L + + G N+K+G ++K S +LF + K+++ + ++
Sbjct: 292 VLLLTKPTMSICGFNIKIGVSLAALIKEDIILTSDTELALKLFLDRKYLISIDVQNKTIF 351
Query: 358 MVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDK--MNDVFTPLL 409
+ L+++A D L + + A + F SL V D + K ++D+ PL+
Sbjct: 352 ICLKKDAQPYDVLEAY----FHACLCGLFICMSLKVPLDVFVKPEVSDLSYPLM 401
>gi|242036507|ref|XP_002465648.1| hypothetical protein SORBIDRAFT_01g042950 [Sorghum bicolor]
gi|241919502|gb|EER92646.1| hypothetical protein SORBIDRAFT_01g042950 [Sorghum bicolor]
Length = 440
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 96/379 (25%), Positives = 180/379 (47%), Gaps = 26/379 (6%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
++ L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+ A
Sbjct: 54 QIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSAT 113
Query: 121 --ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
W L+D +G ++ + G+ +DS K WR++AD + DLG +++LSPL P
Sbjct: 114 VIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPS 173
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
+ + LG+ ++ VA+ ATR + FA N +D+ AK + TL ++G+G G+
Sbjct: 174 SLIVIMCLGSLSRSFTGVASGATRAALTQHFALAKNAADISAKEGSQETLATMIGMGLGM 233
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
LA L+V H+Y+ + +++ + TLN +R ++++ F + G + +
Sbjct: 234 LLAHITRGHALSVWTSFLSLTVFHMYANYKAVQSLSLTTLNYERASILLQYFKECGDVLA 293
Query: 298 PADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV----KPSKFH--------ELKELFAEE 345
P + +E +L +K+ PH+ V K S + + +
Sbjct: 294 PQKVSQQEHILPFWSTWRKLNKIKL---PHERVHLGAKASMLTHSDMLVIAKTRSHYENT 350
Query: 346 KFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVF 405
++L + + + A+ D LR ++ AS S + +A M++++
Sbjct: 351 NYLLLNKQGSVYIFIHKLATPADVLRSFVHGLVLAS--------STSQHLEARRWMDEMY 402
Query: 406 TPLLSELQAKGWHTDRFLD 424
T +S+LQ++G+ T+R L
Sbjct: 403 TNFISKLQSEGYSTERLLS 421
>gi|66818028|ref|XP_642707.1| DUF647 family protein [Dictyostelium discoideum AX4]
gi|74861464|sp|Q86K80.1|U420_DICDI RecName: Full=UPF0420 protein
gi|60470803|gb|EAL68775.1| DUF647 family protein [Dictyostelium discoideum AX4]
Length = 527
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 76/269 (28%), Positives = 139/269 (51%), Gaps = 11/269 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P+GYP SV Y Y + ++Q S L+T+++L G+ + A + W+
Sbjct: 63 FLPNGYPDSVTTDYFGYQFWDSIQALCSTITGTLATRAILKGYGVGDSSATVASATTQWL 122
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPL-CPQLFLEMAGL 185
++DGM +G+++ + G +D K+WR AD L ++G E++SPL QLFL ++ +
Sbjct: 123 IRDGMGMIGRIVFAWRKGTDLDCNSKKWRYTADILNNIGMAFEMISPLFSSQLFLPLSCI 182
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV-- 243
G AK I VA T+ + FAK NL+D+ AK + T N+VG+ + ++S +
Sbjct: 183 GLIAKSICGVAGGCTKASLTQHFAKRDNLADVSAKDGSQETAVNLVGMLLSVIVSSFIND 242
Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPADLR 302
+S+ +V + +H++ + A + ++N R +I F+ G I SP+++
Sbjct: 243 NTSLIVTWLVFLFFTSLHLFCNYRAVSAVQLKSINRYRAYLIYDYFIHNQGSIPSPSEIS 302
Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
E++LF + ++ ++VG + K
Sbjct: 303 KLENILFSIKELD----IRVGVSLCNIYK 327
>gi|301780856|ref|XP_002925848.1| PREDICTED: UPF0420 protein C16orf58 homolog [Ailuropoda
melanoleuca]
gi|281353232|gb|EFB28816.1| hypothetical protein PANDA_015415 [Ailuropoda melanoleuca]
Length = 468
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 98/383 (25%), Positives = 177/383 (46%), Gaps = 43/383 (11%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGLGVGNAKASVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRILFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFFTMTVCTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V +
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSAC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +RA + TLN R +++ F++ G + P E
Sbjct: 257 PSFSLGCFFFLTALHIYANYRAVRALVLETLNEGRLRLVLKHFLQRGEVLGPTSANQMEP 316
Query: 307 L---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSRW------ 355
L +P ++ +G H ++ S EL++L +E ++L RW
Sbjct: 317 LWTGFWPSL------SLSLGVPLHCMI--SSVFELQQLVEGHQEPYLL----RWDKSQRQ 364
Query: 356 TDMVLEQNASGEDALR----GWLVAAYAAS--MAKSFHD-----------PSLTVLQDAY 398
+VL Q A E LR G ++ A + + + + S ++++ +
Sbjct: 365 VQVVLSQMAGPETILRAATHGLVLEALQGNGPLPRELEELRNQVRAGPEKDSWVIVRETH 424
Query: 399 DKMNDVFTPLLSELQAKGWHTDR 421
++ +F L LQ GW T++
Sbjct: 425 QVLDKLFPKFLKGLQDAGWKTEK 447
>gi|218194530|gb|EEC76957.1| hypothetical protein OsI_15243 [Oryza sativa Indica Group]
Length = 836
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 147/294 (50%), Gaps = 38/294 (12%)
Query: 107 SLLFAAGL-RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLG 165
+LL+A GL + A V+W+LKDG+ ++ K++ S G D PK WR+ AD L +
Sbjct: 476 ALLYAVGLGKGAIPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTA 535
Query: 166 TGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIS 225
GLE+L+P+ P LF+ + + A + ATR Y+ FA + N +++ AKGEA
Sbjct: 536 YGLEILTPVFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQG 595
Query: 226 TLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
+ +G+ GI LA+ + SS+ +I ++ +H+Y ++ ++ + TLNP R +++
Sbjct: 596 MVSKFLGIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYRASLV 655
Query: 286 VADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG--RDPHKVVKP----------- 332
++++ +G++ S ++ E L N+ VG R K++ P
Sbjct: 656 FSEYLLSGQVPSVKEVNDEEPLFL---------NLSVGTSRKESKILSPQAKDAAEIICR 706
Query: 333 -----SKFHELKE----------LFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
SK E+ E L+ E+++L ++L++ +S ED L+
Sbjct: 707 RLQLGSKLSEIIENKEDACALFDLYKNEQYLLTEYKGKFCVILKEGSSPEDMLK 760
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 64/182 (35%), Positives = 95/182 (52%), Gaps = 9/182 (4%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
+ R + P GYP SV+ YL+Y+ +R +Q S VLSTQ+LL+A GL +
Sbjct: 310 WRRCATVAVQLLLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGLGKGA 369
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A V+W+LKDG+ ++ K++ S G D PK WR+ AD L + GLE+L+P+ P
Sbjct: 370 IPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTPVFPH 429
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE--GNLSDLFAKGEAIS-TLFNVVGLG 234
LF+ + + A + I++ F E ++ + AKGEA L VGLG
Sbjct: 430 LFVPIGAAAGAGRSAAALIQT-----IFAHFMLENANHMRKVIAKGEAQGMALLYAVGLG 484
Query: 235 AG 236
G
Sbjct: 485 KG 486
>gi|296233362|ref|XP_002761979.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1 [Callithrix
jacchus]
Length = 468
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 173/376 (46%), Gaps = 29/376 (7%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKATVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGF 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
L+ +H+Y+ +RA + TLN R +++ +++ G + P R +
Sbjct: 257 PGFSFGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTTAN-RME 315
Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQ 362
L+ G ++ +G H++V S EL++L +E ++LC +VL Q
Sbjct: 316 PLWTG--FWPTPSLSLGVPLHRLV--SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQ 371
Query: 363 NASGEDALR----GWLVAAYAAS-------------MAKSFHDPSLTVLQDAYDKMNDVF 405
A + LR G ++ A + + ++++ + ++ +F
Sbjct: 372 KADPKTVLRAATHGLMLGALQGDGPLPAELEELRNHVQAGPKKETWVIVKETHQVLDMLF 431
Query: 406 TPLLSELQAKGWHTDR 421
L LQ GW T++
Sbjct: 432 PKFLKGLQDAGWKTEK 447
>gi|66816549|ref|XP_642284.1| DUF647 family protein [Dictyostelium discoideum AX4]
gi|60470350|gb|EAL68330.1| DUF647 family protein [Dictyostelium discoideum AX4]
Length = 567
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 75/273 (27%), Positives = 146/273 (53%), Gaps = 13/273 (4%)
Query: 60 HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA-GLRPTP 118
+++ DS F P GYP +V Y+ Y ++ +Q+ + VLST +LL + G+ T
Sbjct: 117 NKLFDSIKKLFLPKGYPSTVTSDYIGYQKWMFIQNVIGSTTYVLSTHALLTSVIGISSTS 176
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A A +SW+LKDG+ ++ +N D + K+++ D L++ G LE+ +P
Sbjct: 177 ALPFAAAISWVLKDGLGATALVLFANKYSTLFDYDIKKYKFRGDLLHNFGVLLEMFTPFF 236
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK--GEAISTLFNVVGL 233
Q FL +A + N +KG+A + +TR + SF+ + N+ D+ AK ++++ + + +
Sbjct: 237 QQFFLPLASISNLSKGLAGLIYGSTRASLNRSFSLKENIGDITAKYQSQSMAAYLSGMAI 296
Query: 234 GAGIQLASTV---CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
G+ I + +V S M+ G LS IH++ I +++ + TLN QRT++++ +++
Sbjct: 297 GSSIGMILSVFPLTSVSNLSMVFG--LSFIHIFCGINAIKSIDLKTLNQQRTSILIDNWL 354
Query: 291 KTGRISSPADLRYREDLLFPGRLIEDAGNVKVG 323
K+ I + + E + + ++ N+K+G
Sbjct: 355 KSNLILTSKIVNQEEKFITNDKYLK--PNIKLG 385
>gi|332030173|gb|EGI69967.1| UPF0420 protein C16orf58-like protein [Acromyrmex echinatior]
Length = 471
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 97/379 (25%), Positives = 178/379 (46%), Gaps = 17/379 (4%)
Query: 54 DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
+S+ V+ R+ F F P GYP SV+ Y Y + +Q F S L+T S++ G
Sbjct: 41 ESQTVHSRITSVFKQIFLPQGYPNSVHPDYTAYQIWDTVQAFASTITGTLTTHSIMKGIG 100
Query: 114 LRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV 170
+ + A A ++WILK G +G ++ + G +D + K+WR+ AD L D+ G+E+
Sbjct: 101 VGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTELDGQCKKWRLFADILDDIAKGIEL 160
Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
L P + + + K I VA ATR + A + N++D+ AK + T N+
Sbjct: 161 LVPYFSSYSVIILCISTTMKSIVGVAGGATRTALIHHQAIQNNVADVSAKDGSQETCVNL 220
Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+ GI + S + Q + + L +H+Y+ ++A ++TLN R A+IV ++
Sbjct: 221 IASFFGIFILS-LFHDGQYVLELYLFLVAVHLYANYSAVKALCLDTLNEDRLAIIVKKYM 279
Query: 291 KTGRISSPADLRYREDLLF---PGRLIEDAGNVKVGRDPHKVVKPS--KFHELK---ELF 342
+I P + E + P R I ++K+G ++K + F E++ + F
Sbjct: 280 MNEQIPEPWKVNKEESVFLLKNPTRDIH-GFDIKIGVSFANILKRNIISFTEMEFLLKFF 338
Query: 343 AEEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDK 400
K+I+ + R ++L+++ + L + A+ A S+ VL + +
Sbjct: 339 ENRKYIILIDIEKRNIFVILKKDIQSVEILEAYFYASMCAFYLCIARKISIDVLLKS--E 396
Query: 401 MNDVFTPLLSELQAKGWHT 419
+++ PLL+ + HT
Sbjct: 397 TSEISYPLLNTYILQRIHT 415
>gi|290999547|ref|XP_002682341.1| predicted protein [Naegleria gruberi]
gi|284095968|gb|EFC49597.1| predicted protein [Naegleria gruberi]
Length = 261
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 77/260 (29%), Positives = 132/260 (50%), Gaps = 16/260 (6%)
Query: 67 LNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT---- 122
++ F P YP SV+ Y+ Y +++ALQ+ + VL+ QSLL AAG+ A
Sbjct: 2 ISVFLPKNYPLSVSSNYINYCKWQALQYTAGSFSGVLTMQSLLHAAGITSQLALGGNEYF 61
Query: 123 ------VVSWILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
++W++KDG+ +G L S + D++PK WR+ + LEV +P+
Sbjct: 62 LAFLGGALAWVIKDGLGQLGGILFASKVNTNFDADPKFWRMGGEYALVASALLEVTTPIT 121
Query: 176 -PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
P F+ A L N K ++ AA ATR + SFAK NL+D+ AK + + +++G
Sbjct: 122 GPSWFIAQASLANIGKNVSCFAASATRAAMNQSFAKSDNLADVTAKATSQALACSLIGTT 181
Query: 235 AGIQLASTVCSSMQ---GKMI-VGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
GI ++S + Q K+ + +LSVI ++S+ + + + LN QR + + ++
Sbjct: 182 LGIFVSSALSLGTQFVFAKVFPIFCVLSVIQLFSLYKAVSNVKLKVLNKQRFLIACSQYL 241
Query: 291 KTGRISSPADLRYREDLLFP 310
K + P+ + RE + P
Sbjct: 242 KDRAVLDPSKVSEREKFVLP 261
>gi|170582654|ref|XP_001896227.1| cDNA sequence BC017158 [Brugia malayi]
gi|158596613|gb|EDP34929.1| cDNA sequence BC017158, putative [Brugia malayi]
Length = 389
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 133/243 (54%), Gaps = 3/243 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
F P GYP SV+ Y+ Y + +Q F S+ S LST+++L G+ + A V+W+
Sbjct: 42 FMPRGYPQSVSPDYMNYQMWDTIQAFASSMSSALSTEAILRGVGVGNKAASTMAAAVAWL 101
Query: 128 LKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
LKDG+ + +++ + L + +D++ K+WR++AD DL L++++P+ P LF+ + L
Sbjct: 102 LKDGIGMLARILFAWLYSPYLDADCKQWRLIADCFNDLAFCLDLITPIFPSLFMPIICLS 161
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
+ + + VA ATR + + A N+ D+ AK + TL NV L + L V +
Sbjct: 162 SMIRAVVGVAGSATRTTVTNHQAILDNVGDVAAKDGSQETLINVFALLCSLLLLPVVSRN 221
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ ++ L + IH+Y +++ TLN ++V D+++T +IS+ +++ +E
Sbjct: 222 IVSVWLLFSLFTFIHLYGNYRAVKSLQFRTLNQSLLRIVVKDYIETRKISTVSEMNSKEP 281
Query: 307 LLF 309
+L
Sbjct: 282 ILL 284
>gi|395842960|ref|XP_003794274.1| PREDICTED: UPF0420 protein C16orf58 homolog [Otolemur garnettii]
Length = 468
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 96/375 (25%), Positives = 172/375 (45%), Gaps = 29/375 (7%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 77 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGLGVGNAKASVSAATATWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPICFTMTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLILPLVSDC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ +H+Y+ +RA + TLN R +++ F++ G + P E
Sbjct: 257 PSFSLGCFFFLTALHIYANYRAVRAIVMETLNQGRLRLVLKHFLQRGEVLDPTSANKMEP 316
Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQ 362
L R + ++ +G H ++ S +L++L +E ++L +VL Q
Sbjct: 317 LWTGFR---SSPSLSLGVPLHHLI--SSVFDLQQLVEGHQEPYLLYWDQSRNQVQVVLSQ 371
Query: 363 NASGEDALR----GWLVAAYAA-------------SMAKSFHDPSLTVLQDAYDKMNDVF 405
A+ E LR G ++ A + + S ++++ + ++ F
Sbjct: 372 MAASETILRAATHGLVLGALQGDGPLPRELEELRNQVREGPEKNSWVIVREVHRVLDVFF 431
Query: 406 TPLLSELQAKGWHTD 420
L LQ GW T+
Sbjct: 432 PKFLKGLQDAGWKTE 446
>gi|449461779|ref|XP_004148619.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
gi|449518467|ref|XP_004166263.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 495
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 97/367 (26%), Positives = 165/367 (44%), Gaps = 18/367 (4%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVS 125
P+G+P SV++ YL+Y + + T L T SLL A G+ T A A +
Sbjct: 119 LPTGFPESVSDDYLQYMIRQFPTNVTGWICHTLVTSSLLKAVGIGSFSGTTTAASAVAIR 178
Query: 126 WILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
W+ KDG+ VG+L I G D +PK+WR+ AD + G+ ++ +PL P FL +A
Sbjct: 179 WVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIFDLATPLYPSYFLPLAS 238
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
LGN K +A + I + FA GNL ++ AK E + ++GL GI + T
Sbjct: 239 LGNLTKAVARGLKDPSFRVIQNHFAVSGNLGEIAAKEEVWEVVAQLLGLAIGILILDTPG 298
Query: 245 SSMQGKMIVGPLLS--VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
++ LS ++H++ + + NT+N +R ++ V ++ +
Sbjct: 299 LVNSYSVLSATWLSMRLLHLWLRYQSLAVLHFNTINLKRARILARAHVLHNKVPGTVECN 358
Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVV---KPSKFHELKELFAEEKFILCLGSRWTDM- 358
E +L R + + G +++ + S +L +L+A EK+IL L S+ D+
Sbjct: 359 SEESILVWERFTRPS--IIFGVPLEEMMGSERSSTVMKLLKLYANEKYILMLDSQDKDLK 416
Query: 359 ---VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAK 415
+ AS LR + Y + + + L + +M D F + L+
Sbjct: 417 VFVSFKVGASSMTVLRS-IWQTYWLNKHLDTTENVIHQLAQSLSEMEDKFNEFVQLLEGA 475
Query: 416 GWHTDRF 422
GW T +
Sbjct: 476 GWDTHQL 482
>gi|46137815|ref|XP_390599.1| hypothetical protein FG10423.1 [Gibberella zeae PH-1]
Length = 425
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 109/409 (26%), Positives = 186/409 (45%), Gaps = 41/409 (10%)
Query: 22 QEIPIYWFETSDSVTRHYQFQPDGH--LSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSV 79
+ I ++ S + R + PD L KV P H F P GYP SV
Sbjct: 26 ERTEIIEYDKSGNFVRSWTHGPDNIVVLKSKVCTSWLPSMHEA-------FMPVGYPHSV 78
Query: 80 NEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGK 137
+ YL Y F ++Q F S S+L+ ++LL G+ + AT ++ +LKD + V
Sbjct: 79 SSDYLNYQFFDSMQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVLKDAISRVAT 138
Query: 138 L-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAKGIAVV 195
+ G R++ + KR+R LAD D +E+ SP L P + G + + V
Sbjct: 139 IAFAQQFGLRIEPDAKRFRFLADLFNDTAFFMELYSPYLGPYGKILALTTGEALRALCGV 198
Query: 196 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--CSSMQGKMIV 253
AA A++ + FAK NL++L AK + T ++GL G + V +++ MIV
Sbjct: 199 AAGASKAALSVHFAKHDNLAELNAKEASQETAIGLIGLAVGTLVVKYVEDHNTVVFLMIV 258
Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL--FPG 311
L + H++ +R ++ LN QR ++ +++KTG+I +P ++ RE +L +
Sbjct: 259 ---LVLAHLWMNYLGVRCVCMDNLNRQRATILFDEYLKTGKIMTPEEVATRESILIWWSS 315
Query: 312 RLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
IE AG+ +K K ++ + A + L +G ++L ++++ A+
Sbjct: 316 TRIEMAGSY------YKATK-----DMVNVIATDGSTLFVGPHGIKIMLWEDSTAVQAID 364
Query: 372 GWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
W A M S+ S D ++K + LL ++ KGW D
Sbjct: 365 AWFTA-----MKISYSKDS-----DYHEKKGALEESLLEGMRVKGWRLD 403
>gi|340516831|gb|EGR47078.1| predicted protein [Trichoderma reesei QM6a]
Length = 448
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 111/421 (26%), Positives = 193/421 (45%), Gaps = 36/421 (8%)
Query: 22 QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVV----DSFLNKFFPSGYPF 77
++ I + S V + + DGH+ + + S + R V + + F P G+P
Sbjct: 23 NDVDIIELDKSGHVVQRWLHAHDGHV-LPLRRRSNSFFSRSVREWRKALFDSFLPVGFPH 81
Query: 78 SVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHV 135
SV++ YL Y F +LQ F S S+L+ ++LL G+ + AT ++ ILKD M +
Sbjct: 82 SVSKDYLSYQFFDSLQAFFSTISSLLANRALLQGLGVGDASSSATFALLLTILKDAMSRI 141
Query: 136 GKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA-GLGNFAKGIA 193
++ ++ G R++ + KR+R LAD D LE+ SP +A G + +
Sbjct: 142 ATIVFAHRFGLRIEPDAKRYRFLADLFNDTAFFLELYSPYLGSWGKVVALSTGEALRALC 201
Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIV 253
VAA A++ + FA++ NL++L AK + T ++GL G + TV + +++
Sbjct: 202 GVAAGASKAALSVHFARQDNLAELNAKEASQETAVGLIGLLVGTLVVKTV-ENHSAVVLL 260
Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRL 313
+L + H+ +R+ +NT N QR ++ +++K+G++ SP + RE ++F
Sbjct: 261 MIVLVLAHLGMNYLGVRSVCMNTFNRQRATILFEEYLKSGKVLSPEQVANRESIIFWSPA 320
Query: 314 IEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWT--------------DMV 359
I +A +V H S H + A EK I+ G+ T ++
Sbjct: 321 IRNAQGKEV---VHLEFAESYSHAMSCSDAHEKLIIVDGAMHTRFICSRVPGRLTRIKLL 377
Query: 360 LEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHT 419
L AS DA+ W AA ++A D + D LL +L+ +GW
Sbjct: 378 LWDGASPMDAVLAWF-AAMEIAIAGKDRDSEFVPRYKSED--------LLEQLKRQGWSA 428
Query: 420 D 420
D
Sbjct: 429 D 429
>gi|414879010|tpg|DAA56141.1| TPA: hypothetical protein ZEAMMB73_060175 [Zea mays]
Length = 368
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 62/166 (37%), Positives = 95/166 (57%), Gaps = 1/166 (0%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
+ R + ++ P GYP SV+ YL Y+ +R +Q S VLSTQ+LL+A GL +
Sbjct: 186 WRRYAEVAVHLLLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGLGKGA 245
Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A V+W+LKDG+ ++ K++ S G D PK WR+ AD L ++ GLE+L+P+ P
Sbjct: 246 IPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPH 305
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
LF+ + + A + ATR Y+ FA + N +++ AKGEA
Sbjct: 306 LFVPIGAAAGAGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEA 351
>gi|324514354|gb|ADY45840.1| Unknown [Ascaris suum]
Length = 394
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 77/247 (31%), Positives = 129/247 (52%), Gaps = 3/247 (1%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
F++ F P GYP SV+ Y+ Y + +Q F S+ S L+T+++L AG+ A A
Sbjct: 44 FVDVFLPRGYPHSVSADYMSYQIWDTVQAFASSMTSALATEAVLRGAGVGDENASALAAT 103
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+++DG+ + ++ + L A++D++ KRWR++AD L DL LE+L+ P LF +
Sbjct: 104 LTWLIRDGLGMLSRIGFAWLKSAQLDADCKRWRLVADFLNDLAFSLELLAAALPSLFTLL 163
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
L + A+ I VA ATR + A+ N++D+ AK + TL NV L + L
Sbjct: 164 VCLSSLARSIVAVAGGATRTTVVQHQARCNNVADVSAKDGSQETLVNVTALLCSLILLPL 223
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
+ + L + +H+++ + A TLN + +I FV+TG + S
Sbjct: 224 ISGRTVLIWMFYILFTGVHLFANYAAVTALHFETLNQKLLRIITNHFVETGDVFSVKVAN 283
Query: 303 YREDLLF 309
E LLF
Sbjct: 284 ELEPLLF 290
>gi|345492163|ref|XP_001602603.2| PREDICTED: UPF0420 protein C16orf58 homolog [Nasonia vitripennis]
Length = 466
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 95/344 (27%), Positives = 164/344 (47%), Gaps = 23/344 (6%)
Query: 51 VVDDSRPVY-----HRVVDSFLNK-FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLS 104
VV + +PV V SF + F P G+P SV++ Y+ Y + LQ F S L+
Sbjct: 29 VVTNFKPVRGTKFAESVTYSFFKEVFLPQGFPDSVHKDYVPYQIWDTLQAFASTINGTLT 88
Query: 105 TQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADAL 161
T S++ G+ + A A ++WILKDG +G++I + G +DS+ K+WR+ AD L
Sbjct: 89 THSIMRGVGVGESTATPLAAAITWILKDGTGMIGRIIFAWWQGNNLDSQCKKWRLFADIL 148
Query: 162 YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKG 221
D LEV P L K I +A ATR I A + N++D+ AK
Sbjct: 149 NDAAMALEVTVPYISTFSSYTLCLTTGMKAIVGIAGGATRTAIMQHHAIKDNMADVSAKE 208
Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
+ TL N+ G GI + + + + V LL ++H+ S + + +NTLN R
Sbjct: 209 HSQGTLVNLAGSIVGILILLL--INEKFFIYVCLLLVIVHIVSNYFAVTSLELNTLNEDR 266
Query: 282 TAMIVADFVKTGRISSPADLRYREDLLF----PGRLIEDAGNVKVGRDPHKVVKPSKFHE 337
+I+ D++ +S+ D+ RE ++ P + I ++++G V+K + +
Sbjct: 267 LCLIIEDYILNHTMSNVEDINSRESVIILLEKPAKKIF-GFDIEIGVSFESVLKMNLINT 325
Query: 338 -----LKELFAEEKF--ILCLGSRWTDMVLEQNASGEDALRGWL 374
L LF K+ ++ L ++ ++L ++A + L+ +
Sbjct: 326 KDVRFLLALFQRRKYLPVMDLKAKKIHIILSKDADDDIILKAYF 369
>gi|302753580|ref|XP_002960214.1| hypothetical protein SELMODRAFT_75120 [Selaginella moellendorffii]
gi|300171153|gb|EFJ37753.1| hypothetical protein SELMODRAFT_75120 [Selaginella moellendorffii]
Length = 381
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 94/365 (25%), Positives = 167/365 (45%), Gaps = 25/365 (6%)
Query: 78 SVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV----VSWILKDGMQ 133
SV++ YL+Y ++ + T L T SLL A GL P A + W+ KDG+
Sbjct: 2 SVSQDYLQYMLWQLPTNITGWICRALITSSLLQAVGLSTEPGSAAAASAAIKWVTKDGLG 61
Query: 134 HVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
VG+L I G D +PK+WR+ A+ + G E+ +PL P FL +A LG+ K +
Sbjct: 62 AVGRLFIGGRFGGVFDEDPKQWRLYAEVIGSFGGLFELATPLAPDYFLLLASLGHLTKAV 121
Query: 193 AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST--VCSSMQGK 250
A + I + FA N+ D+ AK E ++GL G+ L ST V +S
Sbjct: 122 ARGLKDPSFRVIQNHFAISENVGDVAAKEEVWEVAGQLIGLSLGVVLLSTPVVATSYSQL 181
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
+ ++ H++ + + A + T+N +R +++ + ++ +S + +E+L+ P
Sbjct: 182 ALSWMIIRSFHLWLRYQTLAALELPTMNFKRLNVLIMEHIQGRPLSGVGECNKKENLILP 241
Query: 311 GRLIEDAGNVKVGRDPHKVV----KPSKFHELKELFAEEKFILCLGSRWTD--------M 358
++ +++G +V+ P + E EL+ EK++L + S++
Sbjct: 242 YTMLMP--QIRLGCSLREVITLGKTPLEIEEAFELYNSEKYVLIVLSQFLSSCSCNIKFR 299
Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWH 418
V + LR + A+ +KSF + L+ + M F L EL A GW+
Sbjct: 300 VCQDGCENTTLLRS-IYQAHVLFHSKSFDTFT---LKQSLTSMQKSFASFLDELVAAGWN 355
Query: 419 TDRFL 423
+ +
Sbjct: 356 PSKLV 360
>gi|387019749|gb|AFJ51992.1| UPF0420 protein C16orf58-like [Crotalus adamanteus]
Length = 437
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 136/284 (47%), Gaps = 24/284 (8%)
Query: 37 RHYQFQPDGHLSVKVVDDSRP----VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRAL 92
R YQ P+G L+ K D S+ +HR+ S F P GYP SV+ YL Y + +
Sbjct: 20 RRYQALPNGKLNSKT-DASKDWHLNSFHRIFKSI---FLPQGYPESVSTDYLAYQFWDTI 75
Query: 93 QHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSE 150
Q F S+ L+TQ++L G+ + A V+WILKDG +G++ +
Sbjct: 76 QAFASSITGTLATQAVLKGVGVGDETSTVAAAAVTWILKDGTGMLGRIAFA--------- 126
Query: 151 PKRWR--ILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSF 208
W + AD L D+ +E+++P P F + F K I VA ATR +
Sbjct: 127 ---WSKGLFADVLNDVAIFMEIMAPAFPACFTLIVCTSGFFKCIVGVAGGATRAALTMHQ 183
Query: 209 AKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEE 268
A+ N++D+ AK + TL N+ GL + L V ++ L +++H+Y+ +
Sbjct: 184 ARRDNMADVSAKDGSQETLVNLAGLLFSLFLIPLVVDNLPLTYAFYALFTILHLYANYQA 243
Query: 269 MRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGR 312
+RA + TLN R +++ ++K + PA + +E LL R
Sbjct: 244 VRAVCMETLNRARLHLVLQHYLKREEVLGPAVINPQEPLLLGFR 287
>gi|260815171|ref|XP_002602347.1| hypothetical protein BRAFLDRAFT_234382 [Branchiostoma floridae]
gi|229287656|gb|EEN58359.1| hypothetical protein BRAFLDRAFT_234382 [Branchiostoma floridae]
Length = 244
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 70/239 (29%), Positives = 125/239 (52%), Gaps = 3/239 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
F P GYP SV++ YL Y + +Q F S+ L+TQ++L G+ A ++WI
Sbjct: 4 FLPQGYPESVSDDYLSYQIWDTVQAFCSSITGTLATQAVLKGVGVGDETATAAGATITWI 63
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
LKDG +G+++ + L G +D + KRWR+ AD + D LE+++P P F + +
Sbjct: 64 LKDGTGMLGRILFAWLQGTSLDCDAKRWRLFADIVNDFSIFLELVAPAFPLFFTGIVCVA 123
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
K I V+ ATR + A+ N++D+ AK + TL N++ L AG+ + V +
Sbjct: 124 GVGKAIVGVSGGATRAALTMHQARRDNMADVSAKDGSQETLVNLMALLAGLVVTPMVADN 183
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
+ + + +H+++ + + TLN R ++V+ +++TG + +PA +E
Sbjct: 184 PLLTWFLFFVFTCLHLFANYNAVTCVVMETLNQSRLHILVSHYLRTGVVLNPAQANVQE 242
>gi|302768148|ref|XP_002967494.1| hypothetical protein SELMODRAFT_86604 [Selaginella moellendorffii]
gi|300165485|gb|EFJ32093.1| hypothetical protein SELMODRAFT_86604 [Selaginella moellendorffii]
Length = 412
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 103/394 (26%), Positives = 180/394 (45%), Gaps = 34/394 (8%)
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT 117
V+ R +SF F +G SV++ YL+Y ++ + T L T SLL A GL
Sbjct: 4 VFWRFSESFGFLQFSAG---SVSQDYLQYMVWQLPTNITGWICRALITSSLLQAVGLSTE 60
Query: 118 PAQATV----VSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLS 172
P A + W+ KDG+ VG+L I G D +PK+WR+ A+ + G E+ +
Sbjct: 61 PGSAAAASAAIKWVTKDGLGAVGRLFIGGRFGGVFDQDPKQWRMYAEVIGSFGGLFELAT 120
Query: 173 PLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG 232
PL P FL +A LG+ K +A + I + FA N+ D+ AK E ++G
Sbjct: 121 PLAPDYFLLLASLGHLTKAVARGLKDPSFRVIQNHFAISENVGDVAAKEEVWEVAGQLIG 180
Query: 233 LGAGIQLAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
L G+ L ST V +S + ++ H++ + + A + T+N +R +++ + +
Sbjct: 181 LSLGVALLSTPVVATSYSQLALSWMIIRSFHLWLRYQTLAALELPTMNFKRLNVLIMEHI 240
Query: 291 KTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV----KPSKFHELKELFAEEK 346
+ +S ++ +E+L+ P ++ +++G +V+ P + E EL+ EK
Sbjct: 241 QGRPLSGVSECNKKENLILPYTMLMP--QIRLGCSLREVITLGKTPLEIEEAFELYNSEK 298
Query: 347 FILCLGSRWTDM-----VLEQ---NASGEDALRGWLVAAYAASMAKS-------FHDPSL 391
++L + S+ ++M VL Q + S R ++ +S FH S
Sbjct: 299 YVLIVFSK-SNMGILLKVLSQFLSSCSCNIKCRVCQDGCENTTLLRSIYQAHVLFHSKSF 357
Query: 392 T--VLQDAYDKMNDVFTPLLSELQAKGWHTDRFL 423
L+ + M F L EL A GW+ + +
Sbjct: 358 DTFTLKQSLTSMQKSFASFLDELVAAGWNPSKLV 391
>gi|302766892|ref|XP_002966866.1| hypothetical protein SELMODRAFT_64357 [Selaginella moellendorffii]
gi|300164857|gb|EFJ31465.1| hypothetical protein SELMODRAFT_64357 [Selaginella moellendorffii]
Length = 265
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 75/242 (30%), Positives = 130/242 (53%), Gaps = 3/242 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV--VSWI 127
F P GYP SV+ YL + + LQ ++ S+LSTQ+LL G+ A A W
Sbjct: 23 FLPEGYPSSVSSDYLAFQTWDTLQGLSTYIRSMLSTQALLSGIGVGAATATAIGATFQWF 82
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L+D VG ++ + G+ +D+ K+WR+ AD + D+G ++++SPL P+ F+ LG
Sbjct: 83 LRDLTGMVGGIVFTLYQGSNLDNRAKQWRLAADFMNDIGMLMDLVSPLFPRAFMFFLCLG 142
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
+ A+ + VA ATR + FA++ N +D+ AK + T +VG+ G+ LA ++
Sbjct: 143 SAARAVTGVAGGATRAALTQHFARKQNAADVSAKEGSQETAATLVGMIVGMFLARLTANN 202
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ HVY+ + + A + ++N +R A+++ F+K G+ SP + +E
Sbjct: 203 IVLMWTSFMSLTAFHVYANYKAVCALCLTSINAERMAIVLQSFLKDGKAPSPEEASAQES 262
Query: 307 LL 308
L
Sbjct: 263 AL 264
>gi|339250912|ref|XP_003374441.1| conserved hypothetical protein [Trichinella spiralis]
gi|316969248|gb|EFV53376.1| conserved hypothetical protein [Trichinella spiralis]
Length = 340
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 67/225 (29%), Positives = 116/225 (51%), Gaps = 3/225 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P GYP SV E Y Y + +LQ F S+ + + ++ A G+ A A V W+
Sbjct: 55 FLPQGYPDSVTEDYFDYQCWDSLQAFCSSVTNAFALHAVFKAVGVGNESASLLAATVVWL 114
Query: 128 LKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L++G+ + +++ + A +DS+ K WR +AD L D L++++PL F+ +A L
Sbjct: 115 LRNGVGMMSQIMFAWFCAGHLDSDCKSWRFVADVLNDCSIFLDLIAPLSESTFVYVACLA 174
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N + I +A ATR I ++ N++D+ AK + T+ N++ LG + L V
Sbjct: 175 NVCRSIVGIAGGATRAAITQHQSRRHNMADVAAKDSSQETMVNLIALGFNLALLPLVAHH 234
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
+ + + + +H+Y+ +RA TLN +R M+V F+K
Sbjct: 235 VLLVWTLFLMFTFMHIYANYRAIRALKFETLNEKRFRMLVQSFIK 279
>gi|74215416|dbj|BAE41911.1| unnamed protein product [Mus musculus]
Length = 337
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 75/243 (30%), Positives = 125/243 (51%), Gaps = 7/243 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--C 244
N AK I VA ATR + A+ N++D+ AK + T+ N+ GL + + V C
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
S+ V LL+ +H+Y+ +RA + TLN R +++ F++ G + PA
Sbjct: 255 PSLSLGCFV--LLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQM 312
Query: 305 EDL 307
E L
Sbjct: 313 EPL 315
>gi|449018513|dbj|BAM81915.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 600
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 80/246 (32%), Positives = 122/246 (49%), Gaps = 23/246 (9%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT------- 122
F P+ YP SV Y Y + L + +A SVL T +LL +AG+ P++A
Sbjct: 158 FLPTNYPHSVTASYAPYAFWHFLHNTLTACNSVLGTTALLVSAGVVADPSRAASTALTRD 217
Query: 123 --------------VVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTG 167
V SWILKDG+ H+ +L S R D + KR R+LAD L+ LGTG
Sbjct: 218 PALEHLPTAASASAVTSWILKDGVGHLVRLAFGSRYAHRFDGDLKRLRVLADVLWHLGTG 277
Query: 168 LEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTL 227
LE++S P F+ +A KG+A V +TR IY + A N+ D+ AKG+A S
Sbjct: 278 LELVSRSFPDSFIGLASGAYALKGMAHVVFNSTRSTIYRTVATRSNIGDVTAKGDAQSIA 337
Query: 228 FNVVGLGAGIQLASTVC-SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
++G+ G+ L +++ + ++ +YS I+ MR + LN QR ++V
Sbjct: 338 AELLGMSIGLSLYRGFAENNLFWTWAIFGIVVAGQMYSRIQSMRVLVLPLLNLQRAVLLV 397
Query: 287 ADFVKT 292
+V +
Sbjct: 398 DAYVAS 403
>gi|408393145|gb|EKJ72412.1| hypothetical protein FPSE_07436 [Fusarium pseudograminearum CS3096]
Length = 427
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 101/364 (27%), Positives = 174/364 (47%), Gaps = 30/364 (8%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT-- 122
S F P GYP SV+ YL Y F ++Q F S S+L+ ++LL G+ + AT
Sbjct: 64 SMHEAFMPVGYPHSVSSDYLNYQFFDSMQAFFSTITSLLANRALLQGLGVGDANSSATFA 123
Query: 123 VVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFL 180
++ +LKD + V + G R++ + KR+R LAD D +E+ SP L P +
Sbjct: 124 MLLTVLKDAISRVATIAFAQQFGLRIEPDAKRFRFLADLFNDTAFFMELYSPYLGPYGKI 183
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
G + + VAA A++ + FAK NL++L AK + T ++GL G +
Sbjct: 184 LALTTGEALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAIGLIGLAVGTIVV 243
Query: 241 STV--CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
V +++ MIV L + H++ +R ++ LN QR ++ +++KTG+I +P
Sbjct: 244 KYVEDHNTVVFLMIV---LVLAHLWMNYLGVRCVCMDNLNRQRATILFDEYLKTGQIMTP 300
Query: 299 ADLRYREDLL--FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWT 356
++ RE +L + IE AG+ +K K ++ + A + L +G
Sbjct: 301 EEVAKRESILLWWSSTRIEMAGSY------YKATK-----DMVNVIATDGSTLFVGPHGI 349
Query: 357 DMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKG 416
++L ++++ A+ W A M S+ S + D ++K + LL ++AKG
Sbjct: 350 KIMLWEDSTAVQAIDAWFTA-----MKISYAKDSDS---DYHEKKGALEESLLEGMRAKG 401
Query: 417 WHTD 420
W D
Sbjct: 402 WRLD 405
>gi|327294507|ref|XP_003231949.1| hypothetical protein TERG_07567 [Trichophyton rubrum CBS 118892]
gi|326465894|gb|EGD91347.1| hypothetical protein TERG_07567 [Trichophyton rubrum CBS 118892]
Length = 483
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 104/420 (24%), Positives = 194/420 (46%), Gaps = 69/420 (16%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
+ S LN F P+GYP SV + Y+ Y + +LQ F+S+ +LS++++L G+ A
Sbjct: 55 LTSILNVFLPAGYPHSVTDDYIEYQIYDSLQAFSSSIAGLLSSRAVLQGVGVGDATASPT 114
Query: 121 ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-L 178
A ++ +L++ M + ++ ++ LG ++ E K +R+ AD L D L+ LSP+ P+ +
Sbjct: 115 AALLLNVLQESMGRIATILFAHRLGTSLEPECKLYRLAADILNDSAMVLDCLSPIFPKPV 174
Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
+ + L + + + VAA +++ + + FA+ GNL +L AK + T+ ++ G+ G
Sbjct: 175 RVGLLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAKDSSQETVISLAGMLVGSV 234
Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
+ S + S ++ I +L ++H+ + +RA + TLN QR ++ + + R+ +P
Sbjct: 235 VVSYITSPLE-TWIALIILLIVHLGTNHAAVRAVKMTTLNRQRANIVFSYLFEDDRVLTP 293
Query: 299 ADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSK------------------------ 334
A+ E R+ E G ++ + ++ +
Sbjct: 294 AEASEEE------RIFERDGVLRWNAESSGILGTCQIGTSLEQLLLLLPEQADRGGALET 347
Query: 335 -------------FHELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGW----LV 375
L ELF EE++IL SR +VL+ A+ + L+ W LV
Sbjct: 348 GQTSSTTMNTSTNLTALLELFREEEYILYFDPLSRRGAIVLKAGATAKAQLKAWSHALLV 407
Query: 376 AAYAA---SMAKSFHD---------PS---LTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
+ + A S A H+ PS L++L+D + F + L+ GW TD
Sbjct: 408 SRHLAGKNSTATCRHEKQDDQGELSPSVSVLSILRDTLQEHTRTFKGRVKRLEHAGWKTD 467
>gi|195996945|ref|XP_002108341.1| hypothetical protein TRIADDRAFT_19061 [Trichoplax adhaerens]
gi|190589117|gb|EDV29139.1| hypothetical protein TRIADDRAFT_19061, partial [Trichoplax
adhaerens]
Length = 272
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 75/248 (30%), Positives = 132/248 (53%), Gaps = 9/248 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P GYP SV+ YL Y + LQ F S+ L+TQ++L G+ A A ++W+
Sbjct: 3 FLPQGYPESVSGDYLTYQLWDTLQAFCSSITGTLATQAILKGVGVGDEKATVAAAAMTWM 62
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKR------WRILADALYDLGTGLEVLSPLCPQLFL 180
LKDG +G+++ + L G +D + K+ +R+ AD L D +E+++P P LF
Sbjct: 63 LKDGTGMIGRILFAWLRGTSLDCDAKKMLIFISYRLFADFLNDGALFIEMIAPNFPHLFT 122
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ + + AK I VA ATR + A+ N++D+ AK + TL N+ L G+ L
Sbjct: 123 LLLCVASVAKSIVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLSALIVGLFLL 182
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
V + + + + +++H+Y+ + + + + TLN +R ++ +F + G I SPA+
Sbjct: 183 PAVTGNARIIWTLFFMFTMLHLYANYKAVSSVVMETLNRERLHILAYNFFRYGSILSPAE 242
Query: 301 LRYREDLL 308
+ RE ++
Sbjct: 243 VCQRESVI 250
>gi|356524511|ref|XP_003530872.1| PREDICTED: uncharacterized protein LOC100783222 [Glycine max]
Length = 513
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 76/300 (25%), Positives = 149/300 (49%), Gaps = 11/300 (3%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFA--AGLRP 116
+ R S + FF P V Y+ Y +++ L S+AL VL+TQ++ A G
Sbjct: 104 WLRAFGSVVWDFF---IPRGVTGNYVEYVKWKLLHRVFSSALQVLATQAMFTAMGVGFSN 160
Query: 117 TPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
+ A ++W+LKDG+ + + I ++L + D+ KR R L+ GLE+L+P
Sbjct: 161 SLPSAAALNWVLKDGLGRLSRCIYTASLASAFDTNLKRVRFTTSVLFVASIGLELLTPAF 220
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG--L 233
P+ FL +A + N +K I++ ATR ++ SFA NL ++ AK + + F+++G L
Sbjct: 221 PRCFLLLATIANISKQISLACYLATRSAVHQSFAIGDNLGEISAKAQIQTVCFDILGLML 280
Query: 234 GAGIQLASTVCSSMQGKM--IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
A + L Q + + P + + ++ + + ++ + TL R +I++ +++
Sbjct: 281 AALVNLWIENHRRQQAGLHYFIYPFFAAMDLFGIYQGLKTVHLQTLTKDRLEIILSTWIE 340
Query: 292 TGRISSPADLRYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILC 350
G + SPA++ +E + F G + +++G +P V ++ + E+ + +C
Sbjct: 341 CGYVPSPAEVSEKEGINFLGVKGKSLWPIRIGCLNPKDQVPKWSMKTIQCITDEDYYFVC 400
>gi|302881823|ref|XP_003039822.1| hypothetical protein NECHADRAFT_49969 [Nectria haematococca mpVI
77-13-4]
gi|256720689|gb|EEU34109.1| hypothetical protein NECHADRAFT_49969 [Nectria haematococca mpVI
77-13-4]
Length = 444
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 157/312 (50%), Gaps = 17/312 (5%)
Query: 22 QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRP-VYHRVVDSFL-NKFFPSGYPFSV 79
+E+ + + S +V R + P S +++ ++P ++ R + L + F P GYP SV
Sbjct: 12 REVEVIDLDKSGNVARRWVHMP----SNRIIRKAKPSLFKRSMRKILMDAFLPVGYPHSV 67
Query: 80 NEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGK 137
++ YL Y F +LQ F S ++L+ ++LL G+ + AT ++ +LKD M V
Sbjct: 68 SDDYLAYQFFDSLQAFFSTITALLANRALLQGLGVGDANSSATFAMLLTVLKDAMSRVAT 127
Query: 138 LI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAKGIAVV 195
+I G R++ + KR+R LAD D +E+ SP P + G + + V
Sbjct: 128 IIFAQQFGLRIEPDAKRYRFLADLFNDTAFFMELYSPYFGPYGKILALTSGEALRALCGV 187
Query: 196 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGP 255
A A++ + FAK NL++L AK + T +VGL G + V ++ V
Sbjct: 188 AGGASKAALSVHFAKHDNLAELNAKEASQETAVGLVGLLVG----TFVVKHVEDHTSVVY 243
Query: 256 LLSVI---HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGR 312
L++V+ H++ +R+ +N LN QR ++ +++K+G + +P ++ RE +LF
Sbjct: 244 LMTVLVLAHLWMNYLGVRSVCMNVLNRQRATILYKEYLKSGNVLTPNEVARRESILFWSP 303
Query: 313 LIEDAGNVKVGR 324
++ + ++ R
Sbjct: 304 VVRNQHGQRIAR 315
>gi|186478458|ref|NP_001117280.1| uncharacterized protein [Arabidopsis thaliana]
gi|332190946|gb|AEE29067.1| uncharacterized protein [Arabidopsis thaliana]
Length = 353
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/340 (25%), Positives = 165/340 (48%), Gaps = 24/340 (7%)
Query: 102 VLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILA 158
+LSTQ+LL A G+ A W L+D +G ++ + G+ +DS K WR++A
Sbjct: 2 MLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVA 61
Query: 159 DALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLF 218
D + D+G +++LSPL P F+ + LG+ ++ VA+ ATR + FA + N +D+
Sbjct: 62 DLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAADIS 121
Query: 219 AKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLN 278
AK + T+ ++G+ G+ LA + + L+V H+Y+ +R +N+LN
Sbjct: 122 AKEGSQETMATMMGMSLGMLLARFTSGNPMAIWLSFLSLTVFHMYANYRAVRCLVLNSLN 181
Query: 279 PQRTAMIVADFVKTGRISSPADLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHE 337
+R+++++ F++TG++ SP + E +L + + + + V+ S
Sbjct: 182 FERSSILLTHFIQTGQVLSPEQVSSMEGVLPLWATSLRSTNSKPLHKRVQLGVRVSSLPR 241
Query: 338 LKEL----------FAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASM---AK 384
L L + K++L ++L +++ D L+ ++ A A++ +
Sbjct: 242 LDMLQLLNGVGASSYKNAKYLLAHIKGNVSVILHKDSKPADVLKSYIHAIVLANLMEKST 301
Query: 385 SFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLD 424
SF+ + YD+ LL +L++ GW T+R L
Sbjct: 302 SFYSEGEAWIDKHYDE-------LLHKLRSGGWKTERLLS 334
>gi|307106356|gb|EFN54602.1| hypothetical protein CHLNCDRAFT_135084 [Chlorella variabilis]
Length = 486
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 76/250 (30%), Positives = 124/250 (49%), Gaps = 29/250 (11%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS 125
F F P G+P SV+ YL Y + ++Q +S +LS+Q++L G
Sbjct: 65 FAACFLPEGFPHSVSSDYLPYQLWDSIQGLSSYLRGMLSSQAMLAGVGF----------- 113
Query: 126 WILKD--GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
LKD GM G L S G+ +D K+WR+LAD + D+G LE+ SPL P FL +A
Sbjct: 114 -FLKDFTGMLG-GILFASTQGSGLDCYAKQWRLLADVMNDIGLALELASPLLPGAFLLLA 171
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
LG+ A+ + VA ATR+ + FA + N +D+ AK + T +VG+ G+
Sbjct: 172 CLGSLARAVTGVAGSATRMALTQHFALQRNAADIAAKEGSQETAVTLVGMVLGMAF---- 227
Query: 244 CSSMQGKMIVGPL-------LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
++ + PL L+ +HV++ + MR + +LN R +++ +++TG
Sbjct: 228 ---IRAAHGLEPLIWSAFWALTALHVWANVRAMRCLRIASLNQARLGLLLRHYLRTGEAL 284
Query: 297 SPADLRYRED 306
+P + E
Sbjct: 285 TPGQMAAEES 294
>gi|449460628|ref|XP_004148047.1| PREDICTED: UPF0420 protein C16orf58-like [Cucumis sativus]
Length = 523
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 74/278 (26%), Positives = 147/278 (52%), Gaps = 11/278 (3%)
Query: 53 DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
D R +Y V + + F P +V+E Y+ Y +++ L S+AL V++TQ++ A
Sbjct: 107 DGFRTLYRVSVLAVKDFFIPK----NVSEHYVIYVKWKLLHRVFSSALQVIATQAMFRAI 162
Query: 113 GL--RPTPAQATVVSWILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLE 169
G+ + A A ++W+LKDG+ + + L +++ + D+ KR R L+ L G+E
Sbjct: 163 GVGHSRSLASAAALNWVLKDGLGRLSRCLFTASIASAFDTNLKRVRFSTAVLFSLSIGVE 222
Query: 170 VLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
+L+P PQ FL +A + N K I++ +T ++ SFA NL ++ AK + S F+
Sbjct: 223 LLTPAFPQYFLLLASIANIVKQISLGCYLSTASAVHRSFAVADNLGEVSAKAQIQSVCFD 282
Query: 230 VVG--LGAGIQLASTVCSSMQGKM--IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
+G L A + S +Q + +V P+ + + ++ + ++ + TL R +I
Sbjct: 283 NLGLVLAAFLNFLSKNDRRLQAALPFVVYPIFAAMDLFGTYQGLKHVHLQTLTKDRLEII 342
Query: 286 VADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG 323
++++++ G + +PA++ RE + R +D+ +++G
Sbjct: 343 LSNWIEQGYVPTPAEVSEREGIDLLCRQGKDSWPIRIG 380
>gi|357506015|ref|XP_003623296.1| hypothetical protein MTR_7g068310 [Medicago truncatula]
gi|355498311|gb|AES79514.1| hypothetical protein MTR_7g068310 [Medicago truncatula]
Length = 492
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 92/350 (26%), Positives = 154/350 (44%), Gaps = 23/350 (6%)
Query: 32 SDSVTRHYQFQPDGHLSVKVVDDSRPVYHR-------------VVDSFLNKFFPSGYPFS 78
S+ + Y D L ++++ R +R + D+ P+G+P S
Sbjct: 60 SNGTAKRYIIGDDSRLRTILIEEDRSTQNRFGVLHSPDKRLSWLPDTVKAFILPAGFPGS 119
Query: 79 VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVSWILKDGMQ 133
V++ YL+Y + + T + T SLL A G+ A A + W+ KDG+
Sbjct: 120 VSDDYLQYMLLQFPTNVTGWICHTIVTSSLLKAVGVGSFSGTTAAASAAAIRWVSKDGIG 179
Query: 134 HVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
VG+L I G D +PK+WR+ AD + G+ ++ +PL P FL +A LGN K I
Sbjct: 180 AVGRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIFDLTTPLYPGYFLPLASLGNLTKAI 239
Query: 193 AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST--VCSSMQGK 250
A + I S FA NL ++ AK E + +VGLG GI + T + S
Sbjct: 240 ARGLKDPSSRVIQSHFAISANLGEIAAKEEVWEVVAQLVGLGLGILILDTPGLVKSYTVL 299
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
++ ++H++ E + NT+N +R +V V + D E++L
Sbjct: 300 SSTWLIVRLLHLWLRYESLSVLQFNTINLKRARTLVKSHVLHSTVPGCMDCNKEENILVW 359
Query: 311 GRLIEDAGNVKVGRDPHKVVKPSKF--HELKELFAEEKFILCLGSRWTDM 358
+ ++ D ++ S F L +L+A EK+IL + + D+
Sbjct: 360 SQFMKPIIIFGSPLDKMDGMERSHFMVEALLKLYANEKYILVVNQQLDDL 409
>gi|397563898|gb|EJK43986.1| hypothetical protein THAOC_37516 [Thalassiosira oceanica]
Length = 542
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 113/410 (27%), Positives = 176/410 (42%), Gaps = 58/410 (14%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QAT 122
SF++ F P+ YP SV Y + + + +A VLSTQ+LL A G+ A A
Sbjct: 110 SFVHHFLPAKYPESVCSSYSTFASYCMCANIAGSAAMVLSTQALLVAVGVGSQSAAPMAA 169
Query: 123 VVSWILKD-GMQHVGKLICSNLG---------------------------------ARMD 148
++W+LKD Q G L S LG D
Sbjct: 170 ALNWLLKDFAGQLGGVLFVSQLGIDHEFWKRKINRLFRENNQGNNTGRPKKGNYQRGTAD 229
Query: 149 SEPKRWRILADALYDLGTGLEVLSPLC-PQLFLEMAGLGNFAKGIAVVAARATRLPI--- 204
+ PKRWR++A DL T LE+ +PL PQ FL A + N K I +AA A+R I
Sbjct: 230 TNPKRWRMVAALALDLSTLLEICTPLLGPQYFLPCASVANVGKNIGWLAASASRAAINQS 289
Query: 205 -----YSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS--SMQGKMIVGPLL 257
S + NL D+ AK + + + ++VG GI + T+ + G M +L
Sbjct: 290 LSAGGSYSSSSGSNLGDVTAKSGSQAIVASLVGTALGIFFSKTLYADHGTVGVMSGFIVL 349
Query: 258 SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDA 317
S +H + + + A P+ +L+ R +++++F +T + SP+ + +E FP + +
Sbjct: 350 SAVHQVATYKALMAVPLRSLDRHRLHIVLSEFSRTSSVPSPSQVATKEK-FFPLITPDHS 408
Query: 318 GNVKVGRDPHKVVKPSKFHELKELF-------AEEKFILCLGSRWTDMVLEQNASGEDAL 370
N P + P EL+ L E++IL +G + +A+ D L
Sbjct: 409 CNWLFVGSPLLEICPKGISELQGLILWRDDAMQHEQYILKVGPASIQLTFLDDANEVDLL 468
Query: 371 RGWLVAAYAASMAKS--FHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWH 418
RG L A A + K+ +D L D + M+ F L KGW
Sbjct: 469 RGQLHAYLAHNRMKTDLAYDEDAVRL-DTHRSMDSQFGEFQEMLLEKGWQ 517
>gi|449509553|ref|XP_004163622.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58-like
[Cucumis sativus]
Length = 523
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 76/278 (27%), Positives = 148/278 (53%), Gaps = 11/278 (3%)
Query: 53 DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
D R +Y RV + FF P +V+E Y+ Y +++ L S+AL V++TQ++ A
Sbjct: 107 DGFRTLY-RVSVLAVKXFF---IPKNVSEHYVIYVKWKLLHRVFSSALQVIATQAMFRAI 162
Query: 113 GL--RPTPAQATVVSWILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLE 169
G+ + A A ++W+LKDG+ + + L +++ + D+ KR R L+ L G+E
Sbjct: 163 GVGHSRSLASAAALNWVLKDGLGRLSRCLFTASIASAFDTNLKRVRFSTAVLFSLSIGVE 222
Query: 170 VLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
+L+P PQ FL +A + N K I++ +T ++ SFA NL ++ AK + S F+
Sbjct: 223 LLTPAFPQYFLLLASIANIVKQISLGCYLSTASAVHRSFAVADNLGEVSAKAQIQSVCFD 282
Query: 230 VVG--LGAGIQLASTVCSSMQGKM--IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
+G L A + S +Q + +V P+ + + ++ + ++ + TL R +I
Sbjct: 283 NLGLVLAAFLNFLSKNDRRLQAALPFVVYPIFAAMDLFGTYQGLKHVHLQTLTKDRLEII 342
Query: 286 VADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG 323
++++++ G + +PA++ RE + R +D+ +++G
Sbjct: 343 LSNWIEQGYVPTPAEVSEREGIDLLCRQGKDSWPIRIG 380
>gi|321455034|gb|EFX66180.1| hypothetical protein DAPPUDRAFT_302922 [Daphnia pulex]
Length = 315
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 76/269 (28%), Positives = 135/269 (50%), Gaps = 16/269 (5%)
Query: 54 DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
DS + + + F P GYP SV++ YL Y + +Q F S+ LST+++L G
Sbjct: 40 DSNGKFRGITHLLSSVFLPQGYPSSVSDDYLEYQIWDTIQAFASSLSGSLSTRAILEGVG 99
Query: 114 L---RPTPAQATVVSWILKDGMQHVGKLI---CSNLGARMDSEPKRWRILADALYDLGTG 167
+ TP AT++ W++KDG VG+++ C+ G ++D++ K+WR AD L D
Sbjct: 100 VGSSTSTPLAATLM-WLIKDGTGMVGRILFAWCN--GTKLDADSKKWRFFADLLNDCALS 156
Query: 168 LEVLSPLCPQ-------LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK 220
LE+ +P + + AK I VA ATR + A++GNL+D+ AK
Sbjct: 157 LELCAPYAVAAVGGSNGTMTSILCVAGVAKSIVGVAGGATRAALTQHQARQGNLADVSAK 216
Query: 221 GEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQ 280
+ TL N+ L + L + + ++ L +++HV++ ++ ++A + TLN
Sbjct: 217 DGSQETLVNLAALITSLWLLPILDGATGLTWLMFILFTLLHVFANLKAVKAVTMETLNRA 276
Query: 281 RTAMIVADFVKTGRISSPADLRYREDLLF 309
R +I+ F T ++S ++ E ++
Sbjct: 277 RYMIILKQFASTKYVASVKEVNRLEPVII 305
>gi|224002080|ref|XP_002290712.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974134|gb|EED92464.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 656
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 94/348 (27%), Positives = 155/348 (44%), Gaps = 67/348 (19%)
Query: 69 KFFPSGY-PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV---- 123
+F S + P V Y R+ ++R LQ F +A + V+ TQSLL GLR +V
Sbjct: 147 QFLHSAFVPEGVTPSYYRFMRWRILQRFINANVHVIGTQSLLM--GLRGMTRSGSVGEAA 204
Query: 124 ---------------VSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTG 167
+W+LKD + + +++ S +G + D + KRWR + +Y LG G
Sbjct: 205 SFAAKEGAALGAAAATNWVLKDTLGKIVRMVWASKMGRKFDPDAKRWRFRSALIYALGNG 264
Query: 168 LEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG----------NLS-- 215
LEV + L PQ FL +A L N K ++++ + ATR +Y+SF + G N S
Sbjct: 265 LEVSTYLHPQYFLLLAMLANSCKQMSMLTSSATRNALYNSFKRVGTTSSTVSTSNNASGS 324
Query: 216 ----------------DLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSV 259
D+ AKGEA + +++G+ +GI L+ V S+Q V +L +
Sbjct: 325 SAIVRRGGNGGIENIGDITAKGEAQIAVVDLLGIASGICLSRAVGVSVQNVFAVWVVLQI 384
Query: 260 IHVYSVIEEMRAAPVNTLNPQRTAMIVADFV-------KTGR-------ISSPADLRYRE 305
++ + EMR+ T N +R M++ + K G I SP + E
Sbjct: 385 GEIFCMYHEMRSIVYRTFNFERMYMVLGKLMLDDAADNKGGDSSSTKNLIPSPEQIAANE 444
Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHEL-KELFAEEKFILCLG 352
+ P + + G + P++ L ++F EK+ L +G
Sbjct: 445 KVFLPPDHLARRA-IAFGSPGRSTLDPAELDVLVNDIFRGEKYFLVVG 491
>gi|348585092|ref|XP_003478306.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cavia porcellus]
Length = 468
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 97/379 (25%), Positives = 170/379 (44%), Gaps = 35/379 (9%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ S L+T +++ G+ A A +W+
Sbjct: 77 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSSALATHAVMLGLGVGNAEASLAAATTTWL 136
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
++D +G+++ + G+++D K+WR+ AD L D LE+ +P P F
Sbjct: 137 VRDSTGMLGRIVFAWWKGSKLDCNVKQWRLFADILNDAAMFLEIAAPTLPFCFTVTVSTS 196
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
N AK I VA ATR + A+ N++D+ AK + TL N+ GL + + V
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLFSLVMLPLVSGC 256
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ LL+ +H+Y+ +RA + TLN R +++ F++ G I E
Sbjct: 257 PSFSLGCFFLLTGLHIYANYRAVRALVMETLNESRLQLVLKHFLQRGEILDTTSANQMEP 316
Query: 307 L---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGS--RWTDMV 359
L +P ++ +G H ++ S EL++L +E ++L +V
Sbjct: 317 LWTGFWPSL------SLSLGVPLHDLI--SSVSELQQLVEGHQEPYLLSWNQSRNKVQVV 368
Query: 360 LEQNASGEDALR----GWLVAAYA--ASMAKSFHD-----------PSLTVLQDAYDKMN 402
L A E LR G L+ A + + + S ++++ Y ++
Sbjct: 369 LSHVAGPEAVLRAATHGLLLGALKEDGPLPRELEELRNRVRAGREKDSQAIVRETYRVLD 428
Query: 403 DVFTPLLSELQAKGWHTDR 421
+F L LQA GW T++
Sbjct: 429 KLFPRFLEGLQATGWKTEK 447
>gi|328869935|gb|EGG18310.1| DUF647 family protein [Dictyostelium fasciculatum]
Length = 407
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 137/279 (49%), Gaps = 15/279 (5%)
Query: 44 DGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVL 103
D +VK+ +R + ++ + F P GYP SV Y Y + LQ S+ VL
Sbjct: 24 DEQTNVKITKQTRK-HGSFIEVLRDMFLPKGYPNSVTSDYTGYQTWDTLQALCSSLTGVL 82
Query: 104 STQSLLFAAG---LRPTPAQATVVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILAD 159
+T+++L G L + A A++ WI +D +G++I + G +D K WR AD
Sbjct: 83 ATRAILKGYGVGNLNASVASASI-QWITRDAAGMLGRIIFAWKWGTDLDCNSKTWRFAAD 141
Query: 160 ALYDLGTGLEVLSPL-CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLF 218
L D G LE++SPL P FL ++ LG +K I VA T+ + FAK NL+D+
Sbjct: 142 LLNDFGMTLEMISPLFGPSYFLPISCLGLISKSICGVAGGCTKASLTQHFAKRDNLADVS 201
Query: 219 AKGEAISTLFNVVGLGAGIQLASTV---CSSMQ---GKMIVGPLLSVIHVYSVIEEMRAA 272
AK + T +VG+ G+ +A+++ S+Q + + +VIH+ +
Sbjct: 202 AKDGSQETAIALVGMLLGMVMAASIPDDDDSIQVYFKTWLFFIIFTVIHLLCNYRAVSLV 261
Query: 273 PVNTLNPQRTAMIVADFVK--TGRISSPADLRYREDLLF 309
+ ++N R +I F + +I +P + +E++LF
Sbjct: 262 QLTSINRYRANLIYQHFNENNNNQIPTPKQISLKENILF 300
>gi|358397603|gb|EHK46971.1| hypothetical protein TRIATDRAFT_291240 [Trichoderma atroviride IMI
206040]
Length = 468
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 83/307 (27%), Positives = 153/307 (49%), Gaps = 8/307 (2%)
Query: 22 QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDD--SRPVYHRVVDSFLNKFFPSGYPFSV 79
++ I + S + + DGH+ K + SR + + ++ F P G+P SV
Sbjct: 12 NDVEIVELDKSGHPQQQWLHAHDGHILPKTSNSLFSRS-FREWRKAIMDSFLPVGFPHSV 70
Query: 80 NEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGK 137
++ YL Y F +LQ F S S+L+ ++LL G+ + AT ++ I+KD M +
Sbjct: 71 SKDYLAYQTFDSLQAFFSTISSLLANRALLQGLGVGDASSSATFALLLTIMKDAMSRIAT 130
Query: 138 LICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA-GLGNFAKGIAVV 195
++ ++ G R++ + KR+R LAD D LE+ SP +A G + + V
Sbjct: 131 IVFAHRFGLRIEPDAKRYRFLADVFNDSAFFLELYSPYLGSWGKVIALSTGQALRALCGV 190
Query: 196 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGP 255
AA A++ + FAK NL++L AK + T ++GL G L + +I+
Sbjct: 191 AAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLLVG-TLVVKMIEDHSSVVILMI 249
Query: 256 LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIE 315
LL + H+ + +R+ + TLN QR ++ +F+++G++ +P + RE+++F ++
Sbjct: 250 LLVIAHLGTNYLGVRSVCMATLNRQRATIVFEEFLRSGKVLTPEQVAERENIIFWNPIMR 309
Query: 316 DAGNVKV 322
+ KV
Sbjct: 310 NLHGKKV 316
>gi|198431163|ref|XP_002119233.1| PREDICTED: similar to UPF0420 protein C16orf58 homolog [Ciona
intestinalis]
Length = 414
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 67/265 (25%), Positives = 132/265 (49%), Gaps = 12/265 (4%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
F + F P GYP SV+E YL Y + +Q F S+ L+T ++L +G+ A +
Sbjct: 49 FKDIFLPKGYPESVSEDYLSYQIWDTVQAFCSSITGTLATHAVLKGSGVGDETANVASAT 108
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+LKDG +G++ + G+ +D K+WR+ AD + D +++++P+ P+ + +
Sbjct: 109 ITWLLKDGTGMLGRIAFAYFKGSSLDCNAKQWRLFADVMNDCAIFIQLVAPVLPKAYFTL 168
Query: 183 A-GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
L A + VA TR + AK N++D+ AK + TL N+ L G+ +
Sbjct: 169 VMCLAGLAFSLVGVAGGCTRAALTMHQAKCNNMADVSAKDGSQETLVNLAALLTGLVIMP 228
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
V +++ ++ + + +HV++ + + + + N R + ++ F ++ +P +
Sbjct: 229 IVATNIPLMWLLFLIFTFLHVFANYKAVASVVMELFNRNRLSKVIKVFCDMNKVLTPKEA 288
Query: 302 RYREDLLFP--------GRLIEDAG 318
+E + FP G +ED G
Sbjct: 289 NEKEAIFFPPSYSWLNLGCKLEDLG 313
>gi|357127303|ref|XP_003565322.1| PREDICTED: UPF0420 protein C16orf58-like [Brachypodium distachyon]
Length = 486
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 103/416 (24%), Positives = 181/416 (43%), Gaps = 40/416 (9%)
Query: 33 DSVTRHYQFQPDGHLSVKV---------VDDSRP--VYHRVVDSFLNKFFPSGYPFSVNE 81
D V + Y D L V++ +D +P R + F+ P+G+P SV++
Sbjct: 54 DGVAKRYMLDGDSKLQVRLEKHEHLLNNAEDGKPDSSIPRAITDFV---LPAGFPGSVSD 110
Query: 82 GYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVSWILKDGMQHVG 136
YL+Y ++ + T L T SLL A G+ A A + W+ KDG+ G
Sbjct: 111 DYLQYMLWQFPTNVTGWICHTLVTSSLLKAVGVGSFTGTSAAASAAAIRWVSKDGIGAFG 170
Query: 137 KL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVV 195
+L I G D++PK+WR+ AD + +G+ E+ +PL P FL +A LGN AK +
Sbjct: 171 RLLIGGRFGTLFDNDPKKWRMYADFIGSVGSIFELTTPLYPGYFLPLASLGNLAKAVGRG 230
Query: 196 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST--VCSSMQGKMIV 253
+ I + FAK GNL ++ AK E ++GL G+ + T V SS +
Sbjct: 231 FRDPSNRVIQNHFAKSGNLGEIAAKEEVWGVGAQLLGLSIGVLILDTSGVQSSYPTLTLT 290
Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRL 313
+ ++H++ + + T+N +R ++V V + E++L R
Sbjct: 291 WLGVRLLHLWFRYQSLIVLKFRTVNLKRARILVRSHVAHHTVPGYVACNEEENILTWERF 350
Query: 314 IEDAGNVKV-------GRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTD-----MVLE 361
++ + V G +P +V L +L+ E +IL + + + +
Sbjct: 351 LQPQISFGVPMERMLGGGEPSDMVS-----RLLKLYKNEMYILFVDQSESKEPAFVVTFK 405
Query: 362 QNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGW 417
+ A+ LR L A+ K D +L+++ + FT + +++ GW
Sbjct: 406 EAATSMSVLRS-LWQAHWLHKNKPKQDDVFALLEESLAALEGGFTDFIEQMERAGW 460
>gi|212274801|ref|NP_001130544.1| uncharacterized protein LOC100191643 [Zea mays]
gi|194689438|gb|ACF78803.1| unknown [Zea mays]
gi|223974155|gb|ACN31265.1| unknown [Zea mays]
gi|413956565|gb|AFW89214.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
Length = 442
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 92/389 (23%), Positives = 179/389 (46%), Gaps = 38/389 (9%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ R+ L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 50 WERIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQS 109
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + DLG +++LSPL
Sbjct: 110 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 169
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK---GEAISTLFNVVG 232
P + + LG+ ++ VA+ ATR + FA N +D+ AK E ++T+ +
Sbjct: 170 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGL 229
Query: 233 LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
Q+ S+ + L++ H+Y+ + +++ + TLN +R ++++ F +
Sbjct: 230 GMLLAQITRGHALSVWASFLS---LTMFHMYANYKAVQSLSLTTLNYERASILLQYFKEC 286
Query: 293 GRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHK-----------------VVKPSKF 335
G + P + +E +L +K+ PH+ V+ +++
Sbjct: 287 GEVLVPRKVSQQEHILPSWSNWRKLNRIKL---PHERVHLGAKASMLTHSDMLVIAKTRY 343
Query: 336 HELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQ 395
H + + L + + + A+ D LR ++ AS ++ L
Sbjct: 344 H-----YENANYFLLDKQGIVYVFIHKEATPADVLRSFVHGLVLASSTQNSKPQHL---- 394
Query: 396 DAYDKMNDVFTPLLSELQAKGWHTDRFLD 424
+A M++++T +S+LQ +G+ T+R L
Sbjct: 395 EARRWMDEMYTSFISKLQTEGYSTERLLS 423
>gi|403362828|gb|EJY81150.1| DUF647 family protein [Oxytricha trifallax]
Length = 507
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 75/248 (30%), Positives = 121/248 (48%), Gaps = 16/248 (6%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
++ +Y+ + ++FL P+GYP +V+ YL++T + + F A+ LS QSL A G
Sbjct: 77 NKNIYNGLFETFL----PAGYPHTVSSSYLKFTIYSNISAFMYTAMGFLSAQSLFVAIGS 132
Query: 115 RPTPAQ--ATVVSWILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLEVL 171
T A A +W+LKDG+ +G L S G D + K+WR +A ++ +E+L
Sbjct: 133 TMTQANLAAAAYTWVLKDGLGQLGGILFASRYGRNFDVDIKKWRFMAIFALNISMYIEIL 192
Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
+ PQ FL +A L N K I + A A+R I AK N+ D+ K + T ++
Sbjct: 193 TLSFPQYFLALASLANVGKNICFLLASASRASINLQLAKNNNIGDISGKSVSQFTASTLI 252
Query: 232 G--LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNT---LNPQRTAMIV 286
G G GI + S Q + P+ V+ V ++ +A V LN QR ++
Sbjct: 253 GGACGLGISKIIDITSIYQ----LYPVFFVLTVMNIAASYYSAKVIDELYLNNQRAFLLF 308
Query: 287 ADFVKTGR 294
+F K+
Sbjct: 309 NEFFKSNE 316
>gi|356566391|ref|XP_003551415.1| PREDICTED: uncharacterized protein LOC100783455 [Glycine max]
Length = 525
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 71/260 (27%), Positives = 132/260 (50%), Gaps = 10/260 (3%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFA--AGLRP 116
+ R S + FF P V Y+ Y +++ L S+AL VL+TQ++ A G
Sbjct: 115 WLRACGSVVWDFF---IPRGVTGNYVEYVKWKLLHRVFSSALQVLATQAMFTAMGVGFSC 171
Query: 117 TPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
+ A ++W+LKDG+ + + I ++L + D+ KR R L+ GLE+L+P
Sbjct: 172 SLPSAAALNWVLKDGLGRLSRCIYTASLASAFDTNLKRVRFTTSVLFVASIGLELLTPAF 231
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG--L 233
P+ FL +A + N +K I++ ATR ++ SFA NL ++ AK + + F+++G L
Sbjct: 232 PRCFLLLATIANISKQISLACYLATRSAVHQSFAIGDNLGEISAKAQIQTVCFDILGLML 291
Query: 234 GAGIQLASTVCSSMQGKM--IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
A + L Q + P + + ++ + + ++ + TL R +I++ +++
Sbjct: 292 AALVNLWIENHRRQQAGFHYFIYPFFAAMDLFGIYQGLKTVHLQTLTKDRLEIILSTWIE 351
Query: 292 TGRISSPADLRYREDLLFPG 311
G + SPA++ +E + F G
Sbjct: 352 CGYVPSPAEVSEKEGINFLG 371
>gi|219127360|ref|XP_002183905.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404628|gb|EEC44574.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 891
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 128/245 (52%), Gaps = 27/245 (11%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV--VSWI 127
F P GYP+SV EGYL Y + ++Q +S +VL + +L AAG+ A A + ++W
Sbjct: 438 FMPVGYPYSVREGYLSYQIYDSIQGLSSYLRAVLCSAQVLQAAGVGDANATAIMAALTWA 497
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL-FLEMAGL 185
LKDG+ +G LI S + + +D+ K +R+ AD + D+G L++L+P P+ + ++
Sbjct: 498 LKDGLGMIGGLIFSYGVASHLDAYVKEFRLFADVINDVGMLLDMLAPYVPRADLIWISSA 557
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K + +AA AT+ I FA +GN++DL AK TL N++G+ G+ LA +
Sbjct: 558 ATLCKVMCGIAAGATKSSITQHFALQGNMADLNAKEATQETLVNLIGMVCGVTLARYLSQ 617
Query: 246 SMQG-----------------------KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRT 282
G + +V +L+++H++S + + + TLN +RT
Sbjct: 618 LEAGNEEGDTDRIGYRSWIPCISHDSIQWMVFTILTLLHIWSNWKGVSLLRLETLNRERT 677
Query: 283 AMIVA 287
+++
Sbjct: 678 KVVLC 682
>gi|298708275|emb|CBJ48338.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 284
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/203 (32%), Positives = 109/203 (53%), Gaps = 3/203 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS--WI 127
F P+GYP SV + Y+ Y + +Q S VL ++L A+G+ A A + W+
Sbjct: 71 FLPTGYPSSVRKEYMEYQTWDTVQALCSYLRGVLCLTAMLHASGVGDENASAIAAAMVWV 130
Query: 128 LKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L+DG +G L+ S + MDS K WR+ AD D+G L++++P+ F ++ L
Sbjct: 131 LRDGFGMIGSLLFSYAASSHMDSNIKEWRLFADIANDVGLTLDLVAPVFRGNFAVVSSLA 190
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
K + VAA T+ I S FA + N++D+ AK A T NV G+ G+++AS + +
Sbjct: 191 TVFKALCGVAAGCTKTSITSHFALKQNMADISAKENAQETAVNVCGMLLGVKVASYLSDA 250
Query: 247 MQGKMIVGPLLSVIHVYSVIEEM 269
G++++ S++HVY+ M
Sbjct: 251 PTGRVLIFVFFSLLHVYANYRYM 273
>gi|218192328|gb|EEC74755.1| hypothetical protein OsI_10514 [Oryza sativa Indica Group]
Length = 479
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 104/409 (25%), Positives = 187/409 (45%), Gaps = 50/409 (12%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ RV L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 59 WGRVGGRLLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKS 118
Query: 119 AQ--ATVVSWILKD--GM---------QHVGKLICSN------------------LGARM 147
A W L+D GM Q L SN LG+ +
Sbjct: 119 ATVIGATFQWFLRDLTGMLGGILFTFYQVSNSLANSNRQCHRIAIDSLFTDEITCLGSNL 178
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
DS K WR++AD + DLG +++LSPL P + + LG+ ++ VA+ ATR +
Sbjct: 179 DSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQH 238
Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIE 267
FA N +D+ AK + TL ++G+G G+ LA G + L++ H+Y+ +
Sbjct: 239 FALANNAADISAKEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYK 298
Query: 268 EMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL-FPG------RLIEDAGNV 320
+++ + TLN +R+++++ F+ G + +P + +E +L F R+ +V
Sbjct: 299 AVQSLSLTTLNYERSSIMLQYFMDNGEVLTPQQVSKQEHILPFWSSWRKFLRIKLPHEHV 358
Query: 321 KVGRDPHKVVKPS--KFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGW---LV 375
+G ++ P + + + + E + L + + + A+ D L + LV
Sbjct: 359 HLGAKASRLTHPDLLQIAKTRPYYRNENYFLLDKEGSVHIFIHKQAAATDILMSFIHGLV 418
Query: 376 AAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLD 424
A+ +KS H +A +++ + +S+LQ +G+ T+R L
Sbjct: 419 LAHLMQKSKSGH-------AEARQWIDEKYNTFISKLQVEGYSTERLLS 460
>gi|222624448|gb|EEE58580.1| hypothetical protein OsJ_09902 [Oryza sativa Japonica Group]
Length = 479
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 104/408 (25%), Positives = 186/408 (45%), Gaps = 52/408 (12%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
RV L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+ A
Sbjct: 61 RVGGRLLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKSAT 120
Query: 121 --ATVVSWILKD--GM---------QHVGKLICSN------------------LGARMDS 149
W L+D GM Q L SN LG+ +DS
Sbjct: 121 VIGATFQWFLRDLTGMLGGILFTFYQVSNSLANSNRQCHLIAIDSLFTDEITCLGSNLDS 180
Query: 150 EPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFA 209
K WR++AD + DLG +++LSPL P + + LG+ ++ VA+ ATR + FA
Sbjct: 181 NAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQHFA 240
Query: 210 KEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEM 269
N +D+ AK + TL ++G+G G+ LA G + L++ H+Y+ + +
Sbjct: 241 LANNAADISAKEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYKAV 300
Query: 270 RAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPG--------RLIEDAGNVK 321
++ + TLN +R+++++ F+ G + +P + +E +L P R+ +V
Sbjct: 301 QSLSLTTLNYERSSILLQYFMDNGEVLTPQQVSKQEHIL-PYWSSWRKFLRIKLPHEHVH 359
Query: 322 VGRDPHKVVKPS--KFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGW---LVA 376
+G ++ P + + + + E + L + + + A+ D L + LV
Sbjct: 360 LGAKASRLTHPDLLQIAKTRPYYRNENYFLLDKEGSVHIFIHKQAAATDILMSFIHGLVL 419
Query: 377 AYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLD 424
A+ +KS H +A +++ + +S+LQ +G+ T+R L
Sbjct: 420 AHLMQKSKSGH-------AEARQWIDEKYNTFISKLQVEGYSTERLLS 460
>gi|400593658|gb|EJP61584.1| DUF647 domain protein [Beauveria bassiana ARSEF 2860]
Length = 467
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 89/315 (28%), Positives = 154/315 (48%), Gaps = 14/315 (4%)
Query: 14 EPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPS 73
E DKS VQ+ W R +P GH + R + DSFL P
Sbjct: 28 ELDKSGHVQK---RWLHEPSG--RIIHIRPQGHTNDLFSRSLREWKKILSDSFL----PV 78
Query: 74 GYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDG 131
G+P SV+ YL Y F +LQ F S S+L+ ++LL G+ + AT ++ IL+D
Sbjct: 79 GFPHSVSSDYLAYQTFDSLQAFFSTITSLLANRALLQGLGVGDANSSATFALLLTILRDA 138
Query: 132 MQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFA 189
V ++ ++ G R++ + KR+R LAD D LE+ SP L P + + +G
Sbjct: 139 TSRVATIVFAHRFGLRIEPDAKRYRFLADLFNDTAFFLELYSPYLSPLAKVVVLSVGEAL 198
Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
+ + V+A A++ + FAK NL++L AK + T ++GL G + V +
Sbjct: 199 RALCGVSAGASKAALSMHFAKHDNLAELNAKEASQETAVGLIGLLVG-TIVVKVVEDHRS 257
Query: 250 KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF 309
+ + LL H++ +R+ + TLN QR ++ ++++TG++ SPA++ RE ++
Sbjct: 258 VVSLMILLVFGHLWMNYLGVRSVQMATLNRQRATILFQEYLRTGKVLSPAEVAKREYIIV 317
Query: 310 PGRLIEDAGNVKVGR 324
++ + +V +
Sbjct: 318 WSSVVSNRHGQEVAK 332
>gi|170118219|ref|XP_001890291.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164634725|gb|EDQ99048.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 471
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 109/406 (26%), Positives = 194/406 (47%), Gaps = 52/406 (12%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ RV+D F P+GYP SV+ YLRY ALQ F ++ + +LS++++L G+
Sbjct: 42 HRRVLDLISKVFLPAGYPDSVSPDYLRYQILNALQAFCNSLVGLLSSRAILEGFGVGDPS 101
Query: 119 AQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A AT ++ +L+D + ++ ++ LG+ + + K++R+LAD L D L+ +SP
Sbjct: 102 ATATNALLLTVLQDVFSRLTTIVAAHILGSSLAPDVKKYRLLADMLNDAAVILDTISPRL 161
Query: 176 PQLF---LEMAGLGNFA--KGIAVVAARATRLPIYSSFAK----EGNLSDLFAKGEAIST 226
LF L +A L A + + ++A ++ I FA G++ DL AK + T
Sbjct: 162 DTLFFPGLRVAALCLSALFRSLCAISAGGSKASISLHFATPLKGTGDVGDLNAKDASKET 221
Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEM--RAAPVNTLNPQRTAM 284
+ ++G+ G + + + I LL ++ ++ I + R + TLN QRT +
Sbjct: 222 VLALLGMLLGTLIVPHLTTPWT---IYTTLLVLVGLHLTINYIGVRGLVLRTLNGQRTWL 278
Query: 285 IVADFVKT--GRISSPADLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHE---- 337
+ +++T GR +P ++ E + PG + V +GR S F E
Sbjct: 279 AWSAYMRTQPGRAPTPDEVASLERIFERPGAFRDPRSGVLMGR----CTIGSSFSEILSG 334
Query: 338 -----LKELFAEEKFIL-----CL---------GSRWTDMVLEQNASGEDALRGWLVAAY 378
L ELF EE+++L CL G + L++ + ED L+ W+ AA
Sbjct: 335 PIPPQLLELFKEERYLLWFDRRCLLRPESPSVEGFLRLHICLKEGYNTEDQLKAWIHAAE 394
Query: 379 AASMAKSFHDPS-----LTVLQDAYDKMNDVFTPLLSELQAKGWHT 419
S D S L VL++ + K+ ++ +++++A GW+T
Sbjct: 395 LCRAISSQRDASGVLDALGVLEETHQKVVELIPDFVNKMRALGWNT 440
>gi|195643336|gb|ACG41136.1| hypothetical protein [Zea mays]
Length = 442
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 92/389 (23%), Positives = 178/389 (45%), Gaps = 38/389 (9%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ R+ L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 50 WERIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQS 109
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + DLG +++LSPL
Sbjct: 110 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 169
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK---GEAISTLFNVVG 232
P + + LG+ ++ VA+ ATR + FA N +D+ AK E ++T+ +
Sbjct: 170 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNATDISAKEGSQETLATMLGMGL 229
Query: 233 LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
Q+ S+ + L++ H+Y+ + +++ + TLN +R ++++ F +
Sbjct: 230 GMLLAQITRGHALSVWASFLS---LTMFHMYANYKAVQSLSLTTLNYERASILLQYFKEC 286
Query: 293 GRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHK-----------------VVKPSKF 335
G + P + +E +L +K+ PH+ V+ ++
Sbjct: 287 GEVLVPRKVSQQEHILPSWSNWRKLNRIKL---PHERVHLGAKASMLTHSDMLVIAKTRC 343
Query: 336 HELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQ 395
H + + L + + + A+ D LR ++ AS ++ L
Sbjct: 344 H-----YENANYFLLDKQGIVYVFIHKEATPADVLRSFVHGLVLASSTQNSKPQHL---- 394
Query: 396 DAYDKMNDVFTPLLSELQAKGWHTDRFLD 424
+A M++++T +S+LQ +G+ T+R L
Sbjct: 395 EARRWMDEMYTSFISKLQTEGYSTERLLS 423
>gi|356523306|ref|XP_003530281.1| PREDICTED: UPF0420 protein-like [Glycine max]
Length = 480
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 95/350 (27%), Positives = 157/350 (44%), Gaps = 23/350 (6%)
Query: 32 SDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVD--------SFLNKF-----FPSGYPFS 78
S+ + Y D L +V++ R +R D S+L + P+G+P S
Sbjct: 47 SNGTAKRYVLGDDSQLQAFLVEEDRSTPNRFQDLHSSDESLSWLPEIIKDFVLPAGFPGS 106
Query: 79 VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVSWILKDGMQ 133
V++ YL Y + + T L T SLL A G+ A A + W+ KDG+
Sbjct: 107 VSDDYLDYMLLQFPTNVTGWICHTLVTSSLLKAVGIGSFTGTTAAASAAAIRWVSKDGIG 166
Query: 134 HVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
VG+L I G+ D +PK+WR+ AD + G+ ++ + L P FL +A LGN K +
Sbjct: 167 AVGRLFIGGRFGSLFDDDPKQWRMYADFIGSAGSIFDLTTQLYPAYFLPLASLGNLTKAV 226
Query: 193 AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMI 252
A + I + FA GNL ++ AK E + +VGL GI + T +I
Sbjct: 227 ARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVVAQLVGLSLGILILDTPGLVKSYGVI 286
Query: 253 VGPLLS--VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
LS ++H++ E + NT+N +R ++V V + D E++L
Sbjct: 287 SLTWLSMRLLHLWLRYESLSVLQFNTINIKRARILVKSHVLHSTVPGCTDCNREENILAW 346
Query: 311 GRLIEDAGNVKVGRDPHKVVKPSKF--HELKELFAEEKFILCLGSRWTDM 358
+ ++ + + V+ S F L +L+A EK+IL + + D+
Sbjct: 347 SQFMKPKIIFGLPLEKMDGVERSYFMVEALIKLYASEKYILMVNQQTEDL 396
>gi|321459639|gb|EFX70690.1| hypothetical protein DAPPUDRAFT_60988 [Daphnia pulex]
Length = 264
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 74/253 (29%), Positives = 130/253 (51%), Gaps = 16/253 (6%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
F P GYP SV++ YL Y + +Q F S+ LST+++L G+ TP AT++ W
Sbjct: 5 FLPQGYPSSVSDDYLEYQIWDTIQAFASSLSGSLSTRAILEGVGVGSSTSTPLAATLM-W 63
Query: 127 ILKDGMQHVGKLI---CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ------ 177
++KDG VG+++ C+ G ++D++ K+WR AD L D LE+ +P
Sbjct: 64 LIKDGTGMVGRILFAWCN--GTKLDADSKKWRFFADLLNDCALSLELCAPYAVAAVGGSN 121
Query: 178 -LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
+ + AK I VA ATR + A++GNL+D+ AK + TL N+ L
Sbjct: 122 GTMTSILCVAGVAKSIVGVAGGATRAALTQHQARQGNLADVSAKDGSQETLVNLAALITS 181
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
+ L + + ++ L +++HV++ ++ ++A + TLN R +I+ F T ++
Sbjct: 182 LWLLPILDGATGLTWLMFFLFTLLHVFANLKAVKAVTMETLNRARYMIILKQFASTKYVA 241
Query: 297 SPADLRYREDLLF 309
S ++ E ++
Sbjct: 242 SVKEVNRLEPVIM 254
>gi|320164887|gb|EFW41786.1| GTP-binding protein [Capsaspora owczarzaki ATCC 30864]
Length = 436
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 74/247 (29%), Positives = 125/247 (50%), Gaps = 4/247 (1%)
Query: 49 VKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSL 108
+++VD+ P F P GYP SV+ Y Y ++ LQ F S+ L+T ++
Sbjct: 31 LRLVDEKHPEASSFAQLLKAVFLPDGYPDSVSADYASYQRWDTLQAFCSSINGSLATLAV 90
Query: 109 LFAAGL--RPTPAQATVVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLG 165
L G+ A A VSW+L+DG+ +G++ + G +D+ KRWR++AD DL
Sbjct: 91 LKGVGVGDETATATAAAVSWMLRDGVGMLGRIFFAWQKGPALDANAKRWRLVADIFNDLA 150
Query: 166 TGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIS 225
+E+LSPL + FL A LG+ + I VA ATR I A+ N D+ AK +
Sbjct: 151 MFVELLSPLVGEWFLAFACLGSVLRSIVGVAGGATRAAITQHQARRNNHGDVSAKDGSQE 210
Query: 226 TLFNVVGLGAGIQLASTVCSSMQGKM-IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAM 284
TL N+ L + L + S + ++ + +H+++ +RA +T N +R +
Sbjct: 211 TLVNLAALLVSLWLLPALGSDHPTLVWLLFCSFTSLHLFANYRGVRATVFDTFNRERLFI 270
Query: 285 IVADFVK 291
+ +D+++
Sbjct: 271 VASDYMQ 277
>gi|358335880|dbj|GAA28290.2| UPF0420 protein C16orf58 [Clonorchis sinensis]
Length = 428
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 83/310 (26%), Positives = 138/310 (44%), Gaps = 19/310 (6%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILK 129
PSGYP SV+ YL Y + +Q S+ L+ QS+L G+ A +SW+LK
Sbjct: 21 PSGYPNSVSPDYLEYQIWDTVQALASSVTGALAAQSVLIGVGVGDVKATILGASLSWMLK 80
Query: 130 DGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
DG +GK++ G+ +D + KRWR+LAD + D LE+ S P +F + L N
Sbjct: 81 DGSGMIGKIVFVGWQGSNLDCDCKRWRLLADIINDCALFLELASSFFPSVFTFIICLANL 140
Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
K + VA ATR + A + N++D+ AK + TL N++ L + + V +
Sbjct: 141 CKSVVSVAGGATRAAVTLHQALDNNVADVAAKDGSQETLSNLLALFFNLSVVYFVTGNWS 200
Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD-------- 300
+L+ IHVY+ ++ ++ N R + V + + P+
Sbjct: 201 IIWTGFVILTFIHVYANYRAVQCLRLSNFNRNRFHIAVQQWFRQRLFHDPSVSDPTATPF 260
Query: 301 -----LRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRW 355
+ RE +L P + P+K + +L +F++EK++L S W
Sbjct: 261 PSIEWVNSREPILLPASDARIHLGCSLQTLPNK--EQYNLRDLINVFSQEKYLL-YCSNW 317
Query: 356 TDMVLEQNAS 365
N+S
Sbjct: 318 RATTHSPNSS 327
>gi|440638734|gb|ELR08653.1| hypothetical protein GMDG_03339 [Geomyces destructans 20631-21]
Length = 714
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 96/352 (27%), Positives = 162/352 (46%), Gaps = 32/352 (9%)
Query: 60 HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RP 116
H SF+ F P+GYP SV E Y+ Y + +LQ F+S+ +++++++L G+
Sbjct: 82 HSWAKSFMEIFLPAGYPNSVTEDYMEYQIYDSLQAFSSSIAGLIASRAVLEGVGVGDATA 141
Query: 117 TPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
+P A ++S +L+D + + + LG ++ E KRWR++AD D L+ LSP
Sbjct: 142 SPTTALLLS-VLQDSTGRIATIFFAARLGLSLEPECKRWRLVADIFNDAAMILDCLSPAF 200
Query: 176 PQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
P+ L + + + + VAA + + + + FAK GNL +L AK + T+ +++G+
Sbjct: 201 PKPLRVSLLSASAALRALCGVAAGSAKASLSAHFAKSGNLGELNAKDSSQETIISLLGML 260
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRT------------ 282
G L + + +Q + LLS IH+ + +RA + TLN QR
Sbjct: 261 VGTALIPYLTTPLQTWTALLLLLS-IHLTTNYFAVRAVSMRTLNRQRASLLLRPLLSPPI 319
Query: 283 AMIVADFVKTGRISSPADLRYREDLLF--PGRLIEDAGNVKVGRDPHKVV--------KP 332
+ + I+SP + E ++ P L D P V P
Sbjct: 320 SPAPLPALPPNPIASPLQISQLETIIHLDPSSL-HDPNPAHASFVPLATVLSHASRGPNP 378
Query: 333 SKFHELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAAYAASM 382
+ H L +FA E +IL +R + L+ +A+ ALR W A A M
Sbjct: 379 IRPHILLSIFAAESYILYFDRYTRHAFIALKSSATASTALRSWFCAMECAHM 430
>gi|358380832|gb|EHK18509.1| hypothetical protein TRIVIDRAFT_14438, partial [Trichoderma virens
Gv29-8]
Length = 451
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 96/386 (24%), Positives = 176/386 (45%), Gaps = 33/386 (8%)
Query: 22 QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSF----LNKFFPSGYPF 77
++ I + S V + + DGH+ +++ + S + R + + F P G+P
Sbjct: 23 NDVEIVELDKSGHVQQQWLHAHDGHV-LRLKNRSNGFFSRSAREWRKVLFDSFLPVGFPH 81
Query: 78 SVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHV 135
SV++ YL Y F +LQ F S S+L+ ++LL G+ + AT ++ ILKD M +
Sbjct: 82 SVSKDYLAYQMFDSLQAFFSTISSLLANRALLQGLGVGDASSSATFALLLTILKDAMSRI 141
Query: 136 GKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA-GLGNFAKGIA 193
++ ++ G R++ + KR+R LAD D LE+ SP +A G + +
Sbjct: 142 ATIVFAHRFGLRIEPDAKRYRFLADIFNDSAFFLELYSPYLGSWGKVIALSTGEALRALC 201
Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIV 253
VAA A++ + FA++ NL++L AK + T ++GL G + T+ +I+
Sbjct: 202 GVAAGASKAALSVHFARQDNLAELNAKEASQETAVGLIGLLVGTLVVKTI-EDHSSVVIL 260
Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRL 313
LL H+ +R+ + TLN QR ++ ++++K + +P + RE ++F +
Sbjct: 261 MILLVFAHLGMNYLGVRSVCMTTLNRQRATILFSEYLKNRVVLTPEQVAERESIIFWSPI 320
Query: 314 I--------------EDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRW---T 356
+ E G+ DPH + + + +FI ++
Sbjct: 321 MRNIRGKKVVYLEFAESYGHAMSCSDPHDQLI------IVDGIKHTRFICSRVPKYLTPI 374
Query: 357 DMVLEQNASGEDALRGWLVAAYAASM 382
M+L + A DA+R W + A M
Sbjct: 375 KMLLWEGAQPIDAIRAWFASMEIAWM 400
>gi|330805147|ref|XP_003290548.1| hypothetical protein DICPUDRAFT_155062 [Dictyostelium purpureum]
gi|325079335|gb|EGC32940.1| hypothetical protein DICPUDRAFT_155062 [Dictyostelium purpureum]
Length = 493
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 72/271 (26%), Positives = 135/271 (49%), Gaps = 17/271 (6%)
Query: 65 SFLNK----FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA-GLRPTPA 119
S +NK F P GYP SV+ Y+ Y ++ Q+ + VLST +LL G+ A
Sbjct: 91 SIINKAKSYFLPKGYPNSVSPDYIHYYKWLFCQNVVGSVTYVLSTHALLTTVVGMSVASA 150
Query: 120 Q--ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
A +SW+LKDG+ + +N +D + K+++ D L++LG LE+ +P
Sbjct: 151 LPFAAAISWVLKDGLGAFALVFFANKFSTLLDFDLKKYKFRGDILHNLGVLLEMCTPFIQ 210
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
FL +A + N +KG+A + +TR + SF+ + N+ D+ AK ++ S ++G+G G
Sbjct: 211 GYFLPLASISNLSKGLAGLIYGSTRASLNKSFSIKDNIGDITAKYQSQSMAAYLMGMGIG 270
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
+ S + + LS +H++ + +++ + +LN QR ++I+ +++ I
Sbjct: 271 STFGLLLSGSTFSGISIVFSLSFVHMFLGYKAVKSINLKSLNKQRLSIIIDNWLLNSEIL 330
Query: 297 SPADLRYREDLLFPGRLIEDAGNVKVGRDPH 327
+ + E+ +F K ++PH
Sbjct: 331 NSKIVNNNEEFIFDS---------KYNKEPH 352
>gi|293336742|ref|NP_001169416.1| hypothetical protein [Zea mays]
gi|224029221|gb|ACN33686.1| unknown [Zea mays]
gi|414876408|tpg|DAA53539.1| TPA: hypothetical protein ZEAMMB73_477539 [Zea mays]
Length = 481
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 111/432 (25%), Positives = 184/432 (42%), Gaps = 33/432 (7%)
Query: 14 EPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVY--------HRVVDS 65
EP+K+ P+ D V + Y L +++ PV + +
Sbjct: 36 EPEKAR-----PLLAERYRDGVVKRYISDGSSKLQLRLEKHESPVNGVEDENADSLIPQA 90
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQ 120
L+ P+G+P SV+ YL Y + + T VL T SLL A G+ A
Sbjct: 91 ILDFVLPAGFPGSVSADYLDYMLLQFPTNVTGWICHVLVTSSLLKAVGVGSFTGTSAAAS 150
Query: 121 ATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
A + W+ KDG+ G+L I GA D +PK+WR+ AD + G+ E+ + L P F
Sbjct: 151 AAAIRWVSKDGIGAFGRLLIGGRFGALFDDDPKKWRMYADFIGSAGSIFELSTTLYPGYF 210
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
L +A LGN AK IA + I + FA+ GNL ++ AK E ++GL G+ +
Sbjct: 211 LPLASLGNLAKAIARGFKDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSIGVLI 270
Query: 240 ASTVCSSMQGKMIVGPLLSV--IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
+V LSV +H++ + + T+N +R ++V V +
Sbjct: 271 MDKTGVKSSYLTLVLTWLSVRLLHLWLRYQSISVLKFRTINLKRGRILVKSHVAQHTVPG 330
Query: 298 PADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSK----FHELKELFAEEKFILC--- 350
E++L R + V G +++ K + L +L+ EK+IL
Sbjct: 331 YVACNEEENILTWERYLRP--QVSFGVPMERMLGGDKSSDMVNRLLKLYKNEKYILFFEQ 388
Query: 351 LGSRWTDMVL--EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPL 408
GSR ++ +++A+ LR L A+ + D L+++ + + FT
Sbjct: 389 FGSREPTFLVTFKESATSMSVLRS-LWQAHWLHKNQLRQDEVFPWLEESIVALENGFTDF 447
Query: 409 LSELQAKGWHTD 420
L +L + GW D
Sbjct: 448 LEQLGSAGWDQD 459
>gi|256077542|ref|XP_002575062.1| hypothetical protein [Schistosoma mansoni]
Length = 410
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 71/229 (31%), Positives = 112/229 (48%), Gaps = 4/229 (1%)
Query: 67 LNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
L KFF P GYP SV++ YL Y + +Q F S+ L++Q++L G+ + A +
Sbjct: 41 LKKFFLPVGYPSSVSDDYLEYQIWDTIQAFASSITGALASQAVLIGVGVGDSSATILSAS 100
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+ KDG +G++I + G ++D + K WR +AD L D LE++SPL LF +
Sbjct: 101 LTWMFKDGSGMIGRIIFAGYHGIKLDCDCKFWRFVADILNDCALFLEIISPLFNYLFTPL 160
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
L N K + VA ATR I A NL+D+ AK + TL N++ L
Sbjct: 161 LCLANVLKSLVGVAGSATRAAIVQHQAINNNLADVSAKDGSQETLSNLMAWLLNFILLYM 220
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
V + + IH+YS ++ + T N R + + + K
Sbjct: 221 VTGNQFLIWFCFICCTSIHLYSNYRAVKCLKLRTFNRTRFHLAIQQWFK 269
>gi|312082381|ref|XP_003143421.1| hypothetical protein LOAG_07840 [Loa loa]
gi|307761414|gb|EFO20648.1| hypothetical protein LOAG_07840 [Loa loa]
Length = 393
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 70/249 (28%), Positives = 130/249 (52%), Gaps = 10/249 (4%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
F + F P GYP SV+ Y+ Y + +Q F S+ S L+T+++L G+ A
Sbjct: 43 FRDIFMPRGYPQSVSPDYMNYQIWDTIQAFASSMSSALATEAILRGVGVGNETASTMAAA 102
Query: 124 VSWILKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
V+W+L+DG+ + +++ + + +D++ K+WR++AD DL L++++P+ P LF+ +
Sbjct: 103 VAWLLRDGIGMLTRILFAWFYSPYLDADCKQWRLIADCFNDLAFCLDLITPIFPNLFMPI 162
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
L + +G+ VA ATR + + A N+ D+ AK + TL N+ L + L
Sbjct: 163 VCLSSMVRGVVGVAGGATRTAVVNHQAILDNVGDVAAKDGSQETLINIFALFCSLLLLPI 222
Query: 243 VCSSMQGKMIVGPLLSV---IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
V G + V L + IH+Y +++ TLN +IV ++ +T +I + +
Sbjct: 223 V----NGNICVWLLFCIFTFIHLYGNYRAVKSLQFRTLNQSLLQIIVKNYAETKKIDTVS 278
Query: 300 DLRYREDLL 308
+ +E +L
Sbjct: 279 KVNNKESML 287
>gi|449017626|dbj|BAM81028.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 518
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 85/285 (29%), Positives = 139/285 (48%), Gaps = 15/285 (5%)
Query: 33 DSVTRHYQFQPDGHLSVKVVDDSRPVY--HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFR 90
D RH++ P+ V +RP R+V + L P+ YP +V+ Y R+ ++
Sbjct: 66 DEEQRHWRVVPEPPAGTAV---TRPENWPQRLVQTLL---LPADYPSAVSSDYDRFWRWN 119
Query: 91 ALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKL-ICSNLGARM 147
L+H A VL TQS+L A G+ A A WILKDG+ + K+ + + L R+
Sbjct: 120 ILRHTLLEAAEVLGTQSMLLALGVGGASALPLAAAWKWILKDGLGYFAKVALATQLAPRV 179
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVA-ARATRLPIYS 206
D++PKR+R D + LGT E+L+ P FL +A GN + A VA A R+ +Y
Sbjct: 180 DNDPKRFRFYGDFVMALGTLFEMLTLAFPPWFLALASTGNLLRKAADVATGPAYRVFLY- 238
Query: 207 SFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVI 266
F+ N D+ +K E+ + + G+GAGI +++ ++ LLS H+Y+
Sbjct: 239 HFSIRNNSGDVSSKSESQVVVGRLSGIGAGIAVSAVTSHDASALVMAYGLLSAGHLYATY 298
Query: 267 EEMRAAPVNTLNPQRTAMIVADFVKTG--RISSPADLRYREDLLF 309
+ + + TLN R ++ + + R+ S A+ E F
Sbjct: 299 QSVIRLELRTLNRTRLEYVLNVYTASSGKRVPSIAEANAAEQKSF 343
>gi|323455525|gb|EGB11393.1| hypothetical protein AURANDRAFT_11851, partial [Aureococcus
anophagefferens]
Length = 248
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 77/248 (31%), Positives = 126/248 (50%), Gaps = 9/248 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR--PTPAQATVVSWI 127
F P+G+P +V GY Y + + S+A VLST+ LL A G+ A +W
Sbjct: 1 FLPAGFPTTVTPGYGTYAALQFAAYACSSACGVLSTRVLLAAVGVGDGAAAPLAAAANWA 60
Query: 128 LKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLS-PLCPQLFLEMAGL 185
+KDG+ +G + + +D+ P++WR+ + A DL + +E+ + P PQ F+ +AGL
Sbjct: 61 IKDGLGMLGGVAFAATWSNSLDARPRQWRLRSSAALDLASLVELAALPAFPQYFVPIAGL 120
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEG---NLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
N AK ++ +AA A+R I+ S + NL+DL AK + + + ++ GL G+ AS
Sbjct: 121 ANVAKNVSYLAASASRAAIHQSLSARSDASNLADLTAKTGSQTIVASLCGLALGVA-ASN 179
Query: 243 VCSSMQGKMIVG-PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
C + + LS H+ ++ TL+ R A +V F +TG +PAD+
Sbjct: 180 ACGAEASDVWPAWAALSACHLGCTYASLKYVNTTTLDDARLAALVDAFRETGACPTPADV 239
Query: 302 RYREDLLF 309
RE L
Sbjct: 240 SERESLFL 247
>gi|219129806|ref|XP_002185071.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403566|gb|EEC43518.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 500
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 80/256 (31%), Positives = 128/256 (50%), Gaps = 5/256 (1%)
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
R + V + F PSGYP GYLRY+ + +Q ++ SVL+TQ +L G+
Sbjct: 71 RVLLRNVQSGLRSTFLPSGYPARTPPGYLRYSVWSWVQDLSTQLRSVLATQRVLEGVGVG 130
Query: 116 PTPAQA--TVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLS 172
A A +++++++DG L S +R S+ KRWRI AD + D+G LEV +
Sbjct: 131 REGATALSALLNFLVRDGCGMAASLAFTSAAASRFQSDVKRWRIFADVMVDIGITLEVAA 190
Query: 173 PLC-PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
C P LFL M LGN K + VAA A I +AK ++SD+ AK A T+ +
Sbjct: 191 SACPPTLFLPMISLGNACKAMCGVAAGACGGAINLHWAKGSDISDIQAKFGAQHTVTGSL 250
Query: 232 GLGAGIQLASTVCSSMQGKM-IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
GL A +V + G++ I+ L+ +HV + + MR + N R +++
Sbjct: 251 GLVFAAFFAKSVSTVPGGRLWILYAALTALHVMANLRCMRIIHFDVFNTTRLRLVLRALF 310
Query: 291 KTGRISSPADLRYRED 306
++G +S+ + E+
Sbjct: 311 ESGTLSNKTAVTTAEN 326
>gi|242056489|ref|XP_002457390.1| hypothetical protein SORBIDRAFT_03g006560 [Sorghum bicolor]
gi|241929365|gb|EES02510.1| hypothetical protein SORBIDRAFT_03g006560 [Sorghum bicolor]
Length = 624
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 165/362 (45%), Gaps = 16/362 (4%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVS 125
P+G+P SV+ YL Y + + T VL T SLL A G+ A A +
Sbjct: 240 LPAGFPGSVSVDYLDYMLLQFPTNVTGWICHVLVTSSLLKAVGVGSFTGTSAAASAAAIR 299
Query: 126 WILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
W+ KDG+ G+L I GA D +PK+WR+ AD + G+ E+++PL P FL +A
Sbjct: 300 WVSKDGIGAFGRLLIGGRFGALFDDDPKKWRMYADIIGSAGSIFELMTPLYPGYFLPLAS 359
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST-- 242
LGN AK +A + I + FA+ GNL ++ AK E ++GL AG+ + T
Sbjct: 360 LGNLAKAVARGFKDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSAGVLIMDTAG 419
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
V SS + + ++H++ + + T+N +R ++V V +
Sbjct: 420 VKSSYLTLALTWLSVRLLHLWLRYQSLSVLKFRTINLKRGRILVRSHVAQHTVPGYVACN 479
Query: 303 YREDLLFPGRLI--EDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILC---LGSRWTD 357
E++L R + + + V + R L +L+ EK+IL GSR
Sbjct: 480 EEENILTWERFLRPQISFGVPMERMLGGDESSDMVDRLLKLYKNEKYILFFEQFGSRDPT 539
Query: 358 MVL--EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAK 415
++ +++A+ LR L A+ + D L+++ + + FT L ++ +
Sbjct: 540 FLVTFKESATSMSVLRS-LWQAHWLHKNQLRQDGVFPWLEESLVALENGFTDFLGQMGSA 598
Query: 416 GW 417
GW
Sbjct: 599 GW 600
>gi|398388772|ref|XP_003847847.1| hypothetical protein MYCGRDRAFT_63982 [Zymoseptoria tritici IPO323]
gi|339467721|gb|EGP82823.1| hypothetical protein MYCGRDRAFT_63982 [Zymoseptoria tritici IPO323]
Length = 457
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 92/390 (23%), Positives = 180/390 (46%), Gaps = 39/390 (10%)
Query: 67 LNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSW 126
++ F P+G+P SV E Y Y + +LQ F+S+ ++LS++++L + G+ T A T +
Sbjct: 54 MDVFLPAGFPHSVTEDYTEYQIYDSLQAFSSSIAAMLSSRAVLSSIGVGDTKASPT-AAL 112
Query: 127 ILKDGMQHVGK----LICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLE 181
+L + VG+ L G ++ E K +R++AD L D L+VLSP+ P+ + +
Sbjct: 113 LLSVLQESVGRLATILFAHRFGTSLEPECKMYRLMADVLNDSAFILDVLSPIFPKPIRIV 172
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+ + + + VAA +++ + + FA+ GNL +L AK + T+ +++G+ AG + S
Sbjct: 173 ILSFSSILRSLCGVAAGSSKASLSAHFARWGNLGELNAKDSSQETVISLMGMLAGSLVIS 232
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
+ + + + LLS IH+ + +RA + TLN QR ++ + + S ++
Sbjct: 233 WITTPLATWTALIGLLS-IHLETNRRAVRAVKMRTLNRQRATLVFHHLQRQQTVPSIKEI 291
Query: 302 RYREDLL-FPG--------------------RLIEDAGNVKVGRDPHKVVKPSKFHELKE 340
E + + G RL+E + ++ ++
Sbjct: 292 SSVEHIFEWDGVLRTSTRDIMGYCDIGTPFLRLLEAVSESQTSTKASH-IQQQTLSQILS 350
Query: 341 LFAEEKFILCLGSRWT------DMVLEQNASGEDALRGWLVAAYAA----SMAKSFHDPS 390
L+ ++IL R T +++L++ A +D L W A + A + + +
Sbjct: 351 LYNSSRYILWHDRRSTKDVPRFNIILKKGAEPKDLLIAWWQALFHAQDDSAAGQDGFEEK 410
Query: 391 LTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
L L+ + + + F L+ KGW D
Sbjct: 411 LAALERSLSRAKEFFERYEKSLREKGWDVD 440
>gi|393227224|gb|EJD34913.1| DUF647-domain-containing protein [Auricularia delicata TFB-10046
SS5]
Length = 377
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 93/360 (25%), Positives = 165/360 (45%), Gaps = 29/360 (8%)
Query: 89 FRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLICS-NLGA 145
F +LQ F S +L+ +++L G+ A AT V+ I++D V ++ + LG
Sbjct: 2 FDSLQAFFSEIAGLLAARAVLTGLGVGDAGATATGAVLLKIIQDTCGRVATILSAWKLGP 61
Query: 146 RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF-LEMAGLGNFAKGIAVVAARATRLPI 204
++ E K+WR+ AD D G LE LSP P + + + + + V+A + +
Sbjct: 62 MLEPECKQWRLAADLFNDAGIILECLSPTLPHTMKISLLCVARVVRSLCGVSAGGAKAAL 121
Query: 205 YSSFAKEG-NLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVY 263
F G +++D+ AK + T+ ++G+ G + + +S +G L +H+Y
Sbjct: 122 SQHFVGPGGSIADINAKESSQETVVALIGMLVGSMIVPRI-TSFEGTWTALLFLIGVHLY 180
Query: 264 SVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF-PGRLIEDA---GN 319
+ +RA + TLN QR +++ + ++ GR+ SP +R RE + RL+ D G
Sbjct: 181 ANWRGVRAVAMLTLNRQRATIVLGELIQRGRVLSPEQVRQRESVFCRSSRLVLDGVTLGQ 240
Query: 320 VKVGRD-PHKVVKPSKFH----------ELKELFAEEKFIL-CLGSRWTD----MVLEQN 363
+G H + + ++ H L + FA E++IL C R + L++
Sbjct: 241 CHIGVSLQHLLAQIARVHGPDRSGSALAGLMDRFASERYILHCCCRRKGKVSIIICLKRG 300
Query: 364 ASGEDALRGW---LVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
D LR W LV A S + +P VL++ + ++F +S L GW D
Sbjct: 301 VESRDMLRAWTHALVLAQRYSAIPASGEPDELVLEETTARARELFLGSISRLSNAGWKLD 360
>gi|307191039|gb|EFN74793.1| UPF0420 protein C16orf58-like protein [Camponotus floridanus]
Length = 825
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 82/339 (24%), Positives = 160/339 (47%), Gaps = 19/339 (5%)
Query: 54 DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
+ + + R+V F P GYP SV+ Y Y + +Q S ++ L+T S++ G
Sbjct: 392 EKQSICSRIVSIVKEIFLPQGYPDSVHSDYTAYQIWDTIQASASTIMNTLATHSIMIGIG 451
Query: 114 LRPTPAQ--ATVVSWILKDGMQHVGKLICS--NLGARMDSEPKRWRILADALYDLGTGLE 169
+ + A A ++WILK+G VG ++ + N G ++D + K+WR+ AD L D+ G+E
Sbjct: 452 VGESNATPLAAAITWILKNGAGMVGSIMFAWWN-GTKLDGQCKKWRLFADILNDMAAGME 510
Query: 170 VLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
+L + + + K + VA ATR+ + A NL+D+ AK + N
Sbjct: 511 LLVSYFSSYSVVILCISTIMKSMVGVAGGATRVALTQHQAIRNNLADVAAKDGSQEMCVN 570
Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
++ GI + S + Q + + L +H+Y+ ++ ++ LN R A+I+ +
Sbjct: 571 LIASFIGIFILS-LFHDRQYLIELYLFLVAVHLYANYSAVKVLCLDILNEDRLALIIKYY 629
Query: 290 VKTGRISSPADLRYREDLLF---PGRLIEDAG-NVKVGRDPHKVVKPSKFHELK-----E 340
+ RI P L E + P + I G N+K+G V++ + ++ +
Sbjct: 630 MINERIPEPKRLNKEESVSLLDNPTKSI--CGFNIKIGVSFASVLRKNVISSMEMEFLLK 687
Query: 341 LFAEEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAA 377
F + K+++ + +R ++L+++ + L + A+
Sbjct: 688 FFEDRKYLIAIDVKNRNIFVILKKDIQATEILEAYFYAS 726
>gi|452005321|gb|EMD97777.1| hypothetical protein COCHEDRAFT_1165025 [Cochliobolus
heterostrophus C5]
Length = 421
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 96/405 (23%), Positives = 183/405 (45%), Gaps = 57/405 (14%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQAT 122
++ F PSG+P SV E Y+ Y F +LQ F+S+ +L+++++L G+ TP A
Sbjct: 1 MIDVFLPSGFPQSVTEDYVEYQIFDSLQAFSSSIAGMLASRAVLQGVGVGDSNATPTAAL 60
Query: 123 VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFL 180
++S +L++ + + + ++ G ++ E K +R+ AD L D G L+ LSP P+ L +
Sbjct: 61 LLS-VLQESVGRMATIAFADRFGTALEPECKMYRLTADILNDSGMILDCLSPGFPKPLRV 119
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ + + + V A +++ + + FA GNL ++ AK + T+ +++G+ AG +
Sbjct: 120 AVLSFASCLRSLCGVCAGSSKASLSAHFATAGNLGEVNAKDSSQETVISLLGMLAGSFVV 179
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
S + S + + LL VIH+ + +RA + +LN QR ++ + ++ G + SP +
Sbjct: 180 SHITSDIATWCTL-ILLLVIHLATNYAAVRAVSMQSLNRQRANILFSHILQHGIVLSPKE 238
Query: 301 LRYREDLLFPGRLI-----EDAGNVKVGRDPHKVV-----------------------KP 332
+ RE + G ++ E G ++G +++
Sbjct: 239 VSARERIFERGGVLRWADDEVLGKCRIGTSLSELLGRIGRRDKLSGSLAFCSSGADGESE 298
Query: 333 SKFHELKELFAEEKFILCLGSRWTD---------MVLEQNASGEDALRGWLVAAYAAS-- 381
+L +F E ++L W +VL+ D LR W A A
Sbjct: 299 VDVFDLLSIFEAEDYVL-----WASGGPAAYEAVIVLKAGCGPIDQLRAWAHALLVAERR 353
Query: 382 ------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
+ + L+D ++ +F E++AKGWH D
Sbjct: 354 QRGGDGGGTRARNGLMGELRDTLYEVRRMFDKYGDEMRAKGWHLD 398
>gi|242019564|ref|XP_002430230.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212515330|gb|EEB17492.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 527
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 136/291 (46%), Gaps = 16/291 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
F P GYP+SV+ YL Y + Q F S+ L+ ++++ G+ TP A + +W
Sbjct: 44 FLPKGYPYSVSSDYLEYQIWDFCQAFCSSICGTLTVRAVMKGIGVGSNVATPMSAAL-TW 102
Query: 127 ILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
I+KDG +G ++ + L G+++DS+ K+WR+ AD L D+ +E+ P+ + M
Sbjct: 103 IIKDGTGMIGSIMFAWLKGSKLDSDCKKWRLFADILNDIAMFIELSIPIIMDYSMYMLCA 162
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
K + A +TR I + A +GN +D+ AK + N++G GI L V
Sbjct: 163 ATIFKALVGTAGSSTRAAIITHQAVKGNTADVSAKDGSQELFTNLLGFIFGIFLLKNVNE 222
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
S Q IV ++ H+ + +R +N N +R + + ++ T + + + RE
Sbjct: 223 SNQ-VWIVMCTATIFHLIFNYKAVRCLKINVFNGERLRLTLLSYLPTSCVPTIEQVNRRE 281
Query: 306 DLLFPGRLIED----AGNVKVGRDPHKVVKPSK-----FHELKELFAEEKF 347
++ G D +K G +K VK K L +F +K+
Sbjct: 282 PVIL-GCGTSDYQLCGFKIKFGVSVNKAVKSEKMTNQQLSTLANVFQSKKY 331
>gi|417401281|gb|JAA47531.1| Hypothetical protein [Desmodus rotundus]
Length = 457
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 95/390 (24%), Positives = 168/390 (43%), Gaps = 54/390 (13%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
+ L F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A
Sbjct: 70 LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWR-ILADALYDLGTGLEVLSPLCPQLF 179
A +WI+KD +G++I + W+ + AD L D+ LE+++P+ P F
Sbjct: 130 AATATWIVKDSTGMLGRIIFA-----------WWKGLFADILNDVAMFLEIMAPIYPMCF 178
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
+ N AK + VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 179 TVTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 238
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
V + LL+ +HVY+ ++A + TLN R +++ F++ G + P
Sbjct: 239 LPLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPT 298
Query: 300 DLRYREDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSR 354
E L +P ++ +G H+++ S EL++L E ++L
Sbjct: 299 SANQMEPLWTGFWPSL------SLSLGAPLHRLI--SSVFELQQLVEGHREPYLL----H 346
Query: 355 W------TDMVLEQNASGEDALRGWLVAAYAASMAKSFHDP-----------------SL 391
W +VL Q A E LR ++ + P S
Sbjct: 347 WDQSQNRVQVVLSQMAGPETILRAATHGLVLGALREDGPLPRELEELRNQIRAGPKKDSW 406
Query: 392 TVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
++++ + ++ +F L LQ GW T++
Sbjct: 407 VIVKETHQVLDKLFPKFLKGLQDVGWKTEK 436
>gi|395332415|gb|EJF64794.1| hypothetical protein DICSQDRAFT_80493 [Dichomitus squalens LYAD-421
SS1]
Length = 506
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 110/439 (25%), Positives = 180/439 (41%), Gaps = 32/439 (7%)
Query: 22 QEIPIYWFETSDSVTRHYQ--FQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSV 79
+E I W + + + ++ P+ V V R+ F P+ YP SV
Sbjct: 70 RECHISWSDGRSGIRKEWRDTASPEATEGNVAVSGRNSVTGRISSWLRQMFLPTNYPQSV 129
Query: 80 NEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGK 137
+ Y + +AL+ + SVL Q+LL + G+ + A V WI+KDG V K
Sbjct: 130 HSSYASFHTLQALETTMATITSVLCNQALLTSVGMSAEGSVFGAVAVQWIIKDGAGEVAK 189
Query: 138 L-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL---CPQLFLEMAGLGNFAKGIA 193
L + DS PK + + + L LG+GL++ + L P FL A GN K +
Sbjct: 190 LFFIRHFSTYFDSHPKTFTLFGEVLGCLGSGLQIATVLIAPSPLNFLLCAAGGNIFKLVG 249
Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIV 253
T + F+ +GN D+ AK E+ +++ + G AGI L + S +
Sbjct: 250 NAIWSTTHIKFIRYFSMQGNDGDVAAKDESQASVAQLAGYAAGISL---LTFSHAPAYLY 306
Query: 254 GPLLSVI--HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPG 311
+ + H+ MR A L R + + +V G + S DL G
Sbjct: 307 AIFFAAVPLHLTMTAYMMRVATFELLTLPRISHLAQTYVNDGVVGSQKDLDREHATGLFG 366
Query: 312 RLIEDAGNV------KVGR--DPHKVVKPSKFHELKELFAEEKFIL--CLGSR--WTDMV 359
++ G+ +VG + V S + ++F +++++L C R ++
Sbjct: 367 EFYKNNGDRWLTLAPRVGDALNTSSDVDRSTWQVCSQVFQDDRYLLLPCDSPRGPLISVL 426
Query: 360 LEQNASGEDALRGWLVAAYAASMAK---SFHDPSL-TVLQDAYDKMNDVFTPLLSELQAK 415
+A+ +D L+ L AA S+ S PSL TVL + + F+ L +
Sbjct: 427 FHPDATTDDMLQSILHAASVRSLLSRDASCPTPSLRTVLSETRVQAQQDFSAFKQALHER 486
Query: 416 GWHTDR--FLDGTGTRFAW 432
GW TD F D G R W
Sbjct: 487 GWRTDELCFAD-LGHRVIW 504
>gi|322707810|gb|EFY99388.1| DUF647 domain-containing protein [Metarhizium anisopliae ARSEF 23]
Length = 462
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 89/353 (25%), Positives = 163/353 (46%), Gaps = 19/353 (5%)
Query: 2 NLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHR 61
+L+++ + E DKS +VQ W SD+ + + + + ++ S + R
Sbjct: 20 DLIKQGDDVEISELDKSGNVQR---RWLHRSDNTIAILESKTEPN---SIIHSSVREWKR 73
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
+ + F P G+P SV+ YL Y + +LQ F S S+L+ ++LL G+ + A
Sbjct: 74 FLS---DAFLPIGFPHSVSNDYLAYQTYDSLQAFFSTITSLLANRALLQGLGVGDADSSA 130
Query: 122 T--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
T ++ IL D + ++ ++ G R++ + KR+R LAD D LE+ SP
Sbjct: 131 TFAMLITILTDATSRIATIVFAHRFGLRIEPDAKRFRFLADIFNDSAFFLELYSPYFGDY 190
Query: 179 FLEMA-GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
+A +G + + VAA A++ + FAK NL++L AK + T ++GL G
Sbjct: 191 GKILALSVGQALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLVVGT 250
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
L + Q + + L H++ +R+ + TLN QR ++ +++ TGR+ +
Sbjct: 251 -LVVKLVQDHQSVVYLMVALVFAHLWMNYLGVRSVCMTTLNRQRATILFREYMTTGRVLT 309
Query: 298 PADLRYREDLLF-----PGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEE 345
P + RE+++ GRL E ++ + +K + EE
Sbjct: 310 PEAVAARENIVLWLPVVGGRLAERVCKIEFAESYQRAAASCNGRRVKVVDGEE 362
>gi|302847291|ref|XP_002955180.1| hypothetical protein VOLCADRAFT_121395 [Volvox carteri f.
nagariensis]
gi|300259472|gb|EFJ43699.1| hypothetical protein VOLCADRAFT_121395 [Volvox carteri f.
nagariensis]
Length = 1055
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/375 (22%), Positives = 175/375 (46%), Gaps = 37/375 (9%)
Query: 5 EKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQP---DGH----LSVKVVDDSRP 57
E ++ P S++ +P+ +V + Q P DG +S + DS
Sbjct: 218 EASERHSNGHPSAPASLERLPV-------AVKKGDQLLPYVWDGRRITAMSEAALQDSDV 270
Query: 58 VYHRVVDSFL-------NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLF 110
R+ + L F P P V Y + ++R Q F S+ + S +LL
Sbjct: 271 TTGRLRNWLLGVQRQLKEAFLPD--PRDVTPDYWEWLRWRLTQRFFSSTMQNFSFSALLM 328
Query: 111 AAGL--RPTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTG 167
A GL + A + ++W+LKDG+ + ++ + ++ G D++ KR R + ++
Sbjct: 329 ATGLGAKKAFAASAAINWLLKDGVSRIVRMSVSTSFGQTFDADLKRMRFITSLIFTACMA 388
Query: 168 LEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTL 227
E +P PQ F+ +A + + + + + A AT+ ++ A GNL+DL +K +A +
Sbjct: 389 GEFATPFYPQHFVALASISSVGRAVGLSAFVATQPAFQAALATGGNLADLTSKNQAQHMV 448
Query: 228 FNVVGLGAGIQLASTVCSSM-QGKMIVG----PLLSVIHVYSVIEEMRAAPVNTLNPQRT 282
+++ LG L + +C M +G +++ PL + + + E++A + TLN +R
Sbjct: 449 MDMLALGVSAGL-TWLCRGMPRGGLLLPAVMYPLCAAGDLTCIWHELKAVQLRTLNRERA 507
Query: 283 AMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELF 342
M++ +++ G + A++ E+L+ P + +++G P + P+ +L++L
Sbjct: 508 EMLIERWMRRGAVPDAAEISAAENLILPSDVWRGLLPLRIG--PLDRIAPANPEQLRKLL 565
Query: 343 AE---EKFILCLGSR 354
E E+++L + R
Sbjct: 566 REYDGEEYVLHVADR 580
>gi|298706870|emb|CBJ25834.1| Unknown protein [Ectocarpus siliculosus]
Length = 673
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 76/242 (31%), Positives = 126/242 (52%), Gaps = 15/242 (6%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTP 118
V +N F P+GYP SV Y+RY + LQ +S VL+TQ++L G+ TP
Sbjct: 196 VRGGLMNTFLPAGYPGSVRPEYVRYRCWDILQDLSSHLRGVLATQAVLEGMGVGRAGSTP 255
Query: 119 AQATVVSWILKDGMQHVGKLICSNLG-ARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
AT + W+ +DG +G L+ +++ A K WR+ AD D+G LE+L+PL P
Sbjct: 256 LAAT-LQWMARDGASMIGGLLFTSVASANFGVNIKTWRLFADTSNDIGLALEMLAPL-PA 313
Query: 178 L-----FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG 232
FL + + + K VA AT I +A + N++D+ AK A T+ +++G
Sbjct: 314 FREGGRFLRLICIASVFKAACGVAGGATGAAITEHWAIDNNIADIGAKNGAQHTVASLLG 373
Query: 233 LGAGIQLAS--TVCSSMQGKMIVG--PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
LG + A +V S G + G +L+++H+ + MR + +LNP R +++V +
Sbjct: 374 LGLSVWFARAVSVGGSASGSRVWGWYAVLTLVHLLANYAAMRTLALRSLNPTRASLLVTE 433
Query: 289 FV 290
++
Sbjct: 434 YL 435
>gi|7327810|emb|CAB82267.1| putative protein [Arabidopsis thaliana]
Length = 447
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 85/311 (27%), Positives = 145/311 (46%), Gaps = 20/311 (6%)
Query: 124 VSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+ W+ KDG+ +G+L I G+ D +PK+WR+ AD + G+ ++ + L P FL +
Sbjct: 120 IRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFLLL 179
Query: 183 AGLGNFAKGIAVVAARATRLP----IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
A GN AK + AR R P I + FA GNL ++ AK E ++GLG GI
Sbjct: 180 ASTGNLAKAV----ARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFGIL 235
Query: 239 LAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
+ T + S ++ + ++H++ + + NT+N +R +IV V +
Sbjct: 236 IIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHSVVP 295
Query: 297 SPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKP--SKFHELKELFAEEKFILCLGSR 354
D RE++L R ++ V + ++ SK L +++ +EK+IL L
Sbjct: 296 GYVDCNKRENILLWQRFMKPRIIFGVSLEELSGLEKSVSKVKALLKMYTKEKYILTLNKL 355
Query: 355 WTD----MVLEQNASGEDALRG-WLVAAYAASMAKSFHDPSLTV--LQDAYDKMNDVFTP 407
D + + NA+ D LR W +M +SF D L+ + +M++ F
Sbjct: 356 NKDTEFSVSFKVNATSRDVLRCLWQAYWLEENMEESFKDKDSVFHWLKQSLSEMDNKFDD 415
Query: 408 LLSELQAKGWH 418
L +L GW+
Sbjct: 416 FLFKLDTAGWN 426
>gi|326427989|gb|EGD73559.1| hypothetical protein PTSG_05267 [Salpingoeca sp. ATCC 50818]
Length = 404
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 72/231 (31%), Positives = 118/231 (51%), Gaps = 3/231 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-ATVVSWIL 128
F P Y SV+ +L Y +++ + T A VLS QSLL+A G+ A ++WIL
Sbjct: 120 FLPRDYRASVSPKFLDYCKWQFVHMTTMTASGVLSMQSLLYAVGVGAGSVPLAAAINWIL 179
Query: 129 KDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
KDG+ +G ++ ++ G R D +PKR R A + +++++PL P FL +A + N
Sbjct: 180 KDGLGQLGGMLYGSIFGTRFDDDPKRQRFNAVLSLQVSGIVDIITPLFPHHFLLLASVSN 239
Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
F K ++ +A+ ATR + SF + NL D+ AK + S +V G G GI ++ +
Sbjct: 240 FGKNVSYLASSATRAQMNLSFTRTSNLGDVTAKMTSQSIAASVFGTGLGILVSKVTGTEA 299
Query: 248 QGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
M LS++ + + + T N QR A I+A T ++ +P
Sbjct: 300 ALLMAAYVPLSLVSIGGNYYSSKFVSLKTFNVQR-AEILARHYFTTKVCTP 349
>gi|225440682|ref|XP_002274737.1| PREDICTED: UPF0420 protein-like [Vitis vinifera]
Length = 503
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 82/304 (26%), Positives = 144/304 (47%), Gaps = 18/304 (5%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---------RPTPAQA 121
P+G+P SV++ YL Y + + T+ L T SLL A G+ A A
Sbjct: 119 LPAGFPGSVSDDYLEYMLLQFPTNVTAWICHTLVTSSLLKAVGVGSFSATTAAASAAASA 178
Query: 122 TVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
+ W+ KDG+ VG+L I G D +PK+WR+ AD + G+ ++ + L P FL
Sbjct: 179 AAIRWVSKDGIGAVGRLFIGGQFGNLFDDDPKQWRMYADLIGSAGSIFDLSTQLYPAYFL 238
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
++A LGN AK +A + I + FA GNL ++ AK E ++GL GI +
Sbjct: 239 QLASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLALGIMVL 298
Query: 241 ST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
++ V +S + ++H++ + + ++N +R ++V V +
Sbjct: 299 TSPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHSIVPGY 358
Query: 299 ADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV---KP-SKFHELKELFAEEKFILCLGSR 354
D +E +L R ++ + G +++ +P SK L +L+++EK+IL + +
Sbjct: 359 VDCNRKEKILSWQRFLKP--RIIFGVSLEELIGGERPVSKVRTLLKLYSQEKYILVVKQQ 416
Query: 355 WTDM 358
TD
Sbjct: 417 GTDF 420
>gi|449301217|gb|EMC97228.1| hypothetical protein BAUCODRAFT_147370 [Baudoinia compniacensis
UAMH 10762]
Length = 454
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 85/310 (27%), Positives = 150/310 (48%), Gaps = 16/310 (5%)
Query: 30 ETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQF 89
+ + +VT Y P V VD P L+ F P+G+P SV Y+ Y +
Sbjct: 12 DEAGNVTATYVASPTSSDGVCRVDVIHPATKAPWQRLLDIFLPAGFPHSVTVDYVEYQIY 71
Query: 90 RALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLICSNL-GAR 146
+LQ F+S + +LS++++L + G+ A T ++ I ++ + +G + L G
Sbjct: 72 DSLQAFSSNIVGLLSSRAVLLSVGVGDADASPTYALLLHIAQESLGRIGTIAFVALYGTA 131
Query: 147 MDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL--GNFAKGIAVVAARATRLPI 204
++ E K +R+LAD D L+ LSP P+ +L + L + K + VAA + + +
Sbjct: 132 LEPECKMYRLLADVFNDFAFVLDCLSPAIPK-YLRVVTLSFSSLLKALCGVAAGSAKASL 190
Query: 205 YSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYS 264
+ FA+ GNL +L AK + T+ +++G+ G + S V + Q LL IH+ +
Sbjct: 191 SAHFAQWGNLGELNAKDSSQETVISLLGMWVGSLVLSWV-RTPQATWTTLILLLAIHLET 249
Query: 265 VIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG- 323
+RA + TLN QR A +V ++ G + +P ++ +E + G ++ A +G
Sbjct: 250 NRRAVRAVVMRTLNRQR-ATLVFHHLQRGHVPTPEEVSKQERIFERGGVLRSADGQTLGF 308
Query: 324 -------RDP 326
RDP
Sbjct: 309 CAVGMPLRDP 318
>gi|345316269|ref|XP_001510147.2| PREDICTED: UPF0420 protein C16orf58 homolog, partial
[Ornithorhynchus anatinus]
Length = 353
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 69/241 (28%), Positives = 120/241 (49%), Gaps = 3/241 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR--PTPAQATVVSWI 127
F P G+P SV+ YL Y + +Q F S+ L+TQ++L G+ + + +
Sbjct: 3 FLPRGFPDSVSPDYLSYQLWDTVQAFASSLTGSLATQAVLRGVGVGDGTMTVDSPTGALL 62
Query: 128 LKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+K G +G++ G+++D + K+WR+ AD L D LE+++P P F +
Sbjct: 63 IKHGTGMLGRIGFAWKKGSKLDCDAKKWRLFADVLNDAAMFLEIVAPAFPAYFTLTVCVS 122
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
+ AK I VA ATR + A+ N++D+ AK + TL N+ GL + L V
Sbjct: 123 SLAKCIVGVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLVSLMLLPLVSDR 182
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ + L + +H+Y+ +RA + TLN R +++ F++ G++ S A +E
Sbjct: 183 PSLCLPLFLLFTSLHLYANFRAVRAVVMETLNESRLRLVLEHFLREGQVLSLASANEKEP 242
Query: 307 L 307
L
Sbjct: 243 L 243
>gi|453080329|gb|EMF08380.1| DUF647-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 464
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 102/405 (25%), Positives = 190/405 (46%), Gaps = 50/405 (12%)
Query: 58 VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT 117
++H+ VD FL P G+P SV + YL Y + +LQ F+S+ ++LS++++L G+
Sbjct: 50 LWHKTVDVFL----PVGFPQSVTDDYLEYQIYDSLQAFSSSIAAMLSSRAVLEGVGVGDA 105
Query: 118 PAQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
A AT ++ IL+D + + ++ ++ G ++ E K +R+LAD L D G L+ LSP+
Sbjct: 106 TANATTALLLSILQDSVGRITTIMFAHRFGMSLEPECKMYRLLADILNDTGFVLDCLSPI 165
Query: 175 CPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
P+ + + + + + VAA + + + + FAK+GNL +L AK + T+ ++ G+
Sbjct: 166 FPKPTRVVILSFSSILRSLCGVAAGSAKASLSAHFAKQGNLGELNAKDSSQETVISLAGM 225
Query: 234 GAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
G + + V + + + LLS IH+ + +RA + TLN QR A ++ ++ G
Sbjct: 226 LFGSWVITWVKTPVATWTALILLLS-IHLETNRRAVRAVKMQTLNRQR-ATLLYHHLRKG 283
Query: 294 RISSP-----ADLRYREDLLFPGRLIEDAGNVKVGR----------DPHKVVKPSK---- 334
+ +P +L + D + R G+ VG HK K +
Sbjct: 284 HLPTPIEVADQELIFERDGILRSREGRAIGHCSVGAPMSRLLAAVGQQHKTTKSIRVDDQ 343
Query: 335 --FHELKELFAEEKFILCLGSRWTD-----------MVLEQNASGEDALRGWLVAAYAAS 381
+ + EL+++ + I+ W D +VL++ A + W A A
Sbjct: 344 QHLNRILELYSDAQCIV-----WCDPSSRPGKASVFIVLKKAADSKALTLAWWQALEFAE 398
Query: 382 MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGW--HTDRFLD 424
P + D ++ +VF + ++ A W H DR ++
Sbjct: 399 CLGVAPTPVADMNPDELSRLPEVFGKVRDQVNAL-WTTHYDRLVE 442
>gi|417401214|gb|JAA47499.1| Hypothetical protein [Desmodus rotundus]
Length = 453
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 86/323 (26%), Positives = 146/323 (45%), Gaps = 37/323 (11%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
+ L F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A
Sbjct: 70 LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129
Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWR-ILADALYDLGTGLEVLSPLCPQLF 179
A +WI+KD +G++I + W+ + AD L D+ LE+++P+ P F
Sbjct: 130 AATATWIVKDSTGMLGRIIFA-----------WWKGLFADILNDVAMFLEIMAPIYPMCF 178
Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
+ N AK + VA ATR + A+ N++D+ AK + TL N+ GL + +
Sbjct: 179 TVTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 238
Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
V + LL+ +HVY+ ++A + TLN R +++ F++ G + P
Sbjct: 239 LPLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPT 298
Query: 300 DLRYREDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSR 354
E L +P ++ +G H+++ S EL++L E ++L
Sbjct: 299 SANQMEPLWTGFWPSL------SLSLGAPLHRLI--SSVFELQQLVEGHREPYLL----H 346
Query: 355 W------TDMVLEQNASGEDALR 371
W +VL Q A E LR
Sbjct: 347 WDQSQNRVQVVLSQMAGPETILR 369
>gi|413956564|gb|AFW89213.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
Length = 485
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 69/256 (26%), Positives = 129/256 (50%), Gaps = 9/256 (3%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ R+ L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 50 WERIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQS 109
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + DLG +++LSPL
Sbjct: 110 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 169
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK---GEAISTLFNVVG 232
P + + LG+ ++ VA+ ATR + FA N +D+ AK E ++T+ +
Sbjct: 170 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGL 229
Query: 233 LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
Q+ S+ + L++ H+Y+ + +++ + TLN +R ++++ F +
Sbjct: 230 GMLLAQITRGHALSVWASFLS---LTMFHMYANYKAVQSLSLTTLNYERASILLQYFKEC 286
Query: 293 GRISSPADLRYREDLL 308
G + P + +E +L
Sbjct: 287 GEVLVPRKVSQQEHIL 302
>gi|346318934|gb|EGX88536.1| DUF647 domain-containing protein [Cordyceps militaris CM01]
Length = 467
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 142/296 (47%), Gaps = 14/296 (4%)
Query: 14 EPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPS 73
E DKS VQ+ +Y + R +P GH + R + DSFL P
Sbjct: 28 ELDKSGHVQQKWLY-----EPSGRIVHIRPKGHTNDLFSRSLREWKKILSDSFL----PV 78
Query: 74 GYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDG 131
G+P SV+ YL Y F +LQ F S S+L+ ++LL G+ + AT ++ IL+D
Sbjct: 79 GFPHSVSSDYLAYQTFDSLQAFFSTITSLLANRALLQGLGVGDANSSATFALLLTILRDA 138
Query: 132 MQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFA 189
V + G R++ + KR+R LAD D LE+ SP L P + +G
Sbjct: 139 TSRVATIAFAHRFGLRIEPDAKRYRFLADLFNDSAFFLELYSPYLSPWGKVAALSIGEAL 198
Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
+ + V+A A++ + FAK NL++L AK + T ++GL + V +
Sbjct: 199 RALCGVSAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGL-LVGTVVVKVVEDHRS 257
Query: 250 KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
+ + LL H++ + + + TLN QR ++ ++++TG++ SPA++ RE
Sbjct: 258 VVFLMILLVFGHLWMNYLGVCSVEMTTLNRQRATILFQEYLQTGKVLSPAEVSKRE 313
>gi|302849057|ref|XP_002956059.1| hypothetical protein VOLCADRAFT_119273 [Volvox carteri f.
nagariensis]
gi|300258564|gb|EFJ42799.1| hypothetical protein VOLCADRAFT_119273 [Volvox carteri f.
nagariensis]
Length = 208
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/190 (33%), Positives = 95/190 (50%), Gaps = 5/190 (2%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
+ DS + F P G+P SV YL Y + H T L+T SLL A G+ +PA
Sbjct: 17 IPDSIRDLFLPPGFPDSVTPDYLPYQLWSLPTHVTGHLSHSLATSSLLTAVGVSTSPAAT 76
Query: 122 TVVS----WILKDGMQHVGKLIC-SNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
+S WI+KDG+ +G+ I S A D +P+RWR++A+ L G GLEV + L P
Sbjct: 77 VALSASIKWIIKDGVGALGRFIVGSRFSAEFDEDPRRWRLVAELLSTAGLGLEVATSLYP 136
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
Q F+ +A G F + + + I + FA+ N+ + AK E +VGL A
Sbjct: 137 QSFMLLASAGKFGQALGKGMGKPVFRVIQTHFARSQNVGAVAAKEEVWEVAAQMVGLMAS 196
Query: 237 IQLASTVCSS 246
+ + T+ S
Sbjct: 197 VAVLRTLEGS 206
>gi|356566668|ref|XP_003551552.1| PREDICTED: uncharacterized protein LOC100787355 [Glycine max]
Length = 1041
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 80/305 (26%), Positives = 139/305 (45%), Gaps = 18/305 (5%)
Query: 64 DSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTP 118
D+ + P+G+P SV++ YL Y + + T L T SLL A G+
Sbjct: 649 DTIKDFILPAGFPGSVSDDYLDYMLLQFPTNVTGWICHTLVTSSLLKAVGIGSFSGTSAT 708
Query: 119 AQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A A+ + W+ KDG+ VG+L + G+ D +PK+WR+ AD + G+ + + + P
Sbjct: 709 ASASAIRWVSKDGIGAVGRLCLGGRFGSLFDDDPKQWRMYADFIGSAGSIFYLTTQVYPD 768
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
FL +A LGN K +A + I + FA GNL ++ AK E + ++GL GI
Sbjct: 769 YFLPLASLGNLTKAVARGLKDPSFCVIQNHFAISGNLGEVAAKEEIWEVVAQLIGLALGI 828
Query: 238 QLASTVCSSMQGKMIVGPLLSV--IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ T ++ P L + +H++ + + NT+N +R V +
Sbjct: 829 LILDTPSLVKSYGVLSLPWLGMQFLHLWLRYKSLSVLQFNTINLKR--------VLHSTV 880
Query: 296 SSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKF--HELKELFAEEKFILCLGS 353
E++L + ++ N + + V+ S F L +L+A EK+IL +
Sbjct: 881 PGCTYCNREENILTRSQFMKPKINFGLPLEKIDGVERSHFMVEALLKLYASEKYILMVNQ 940
Query: 354 RWTDM 358
+ D+
Sbjct: 941 QLEDL 945
>gi|432112420|gb|ELK35212.1| hypothetical protein MDA_GLEAN10003470, partial [Myotis davidii]
Length = 638
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 93/358 (25%), Positives = 160/358 (44%), Gaps = 43/358 (12%)
Query: 95 FTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
F S+ L+T ++L G+ A +W+LKD +G++I + G+++D
Sbjct: 272 FASSLSGSLATHAVLLGIGVGNASASVSAATATWLLKDSTGMLGRIIFAWWKGSKLDCNA 331
Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
K+WR+ AD L D LE+++PL P F + N AK I VA ATR + A+
Sbjct: 332 KQWRLFADILNDTAMFLEIMAPLLPICFTMTICISNLAKCIVSVAGGATRAALTMHQARR 391
Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
N++D+ AK + TL N+ GL + + V + LL+ +HVY+ +RA
Sbjct: 392 NNMADVSAKDGSQETLVNLAGLLVSLLMLPLVSDCPSFSLGCFFLLTALHVYANYRAVRA 451
Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL---LFPGRLIEDAGNVKVGRDPHK 328
+ TLN R +++ F++ G + P E L +P ++ +G H
Sbjct: 452 LVIETLNEGRLWLVLKHFLQRGEVLDPTSANQMEPLWTGFWPSL------SLSLGVPLHH 505
Query: 329 VVKPSKFHELKELFA--EEKFILCLGSRW------TDMVLEQNASGEDALR----GWLVA 376
+V S EL++L +E ++L RW +VL Q A E LR G ++
Sbjct: 506 LV--SSVAELQQLAEGHQEPYLL----RWDQSQNQVQVVLSQEAGPETVLRAATHGLVLG 559
Query: 377 A-------------YAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
A + + + S ++++ + ++ +F L LQ GW T++
Sbjct: 560 ALREDGPLPRELGELRSQVRAGPNTESWVIVKETHQVLDKLFPQFLRGLQDAGWETEK 617
>gi|297825247|ref|XP_002880506.1| hypothetical protein ARALYDRAFT_481217 [Arabidopsis lyrata subsp.
lyrata]
gi|297326345|gb|EFH56765.1| hypothetical protein ARALYDRAFT_481217 [Arabidopsis lyrata subsp.
lyrata]
Length = 519
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 94/377 (24%), Positives = 176/377 (46%), Gaps = 38/377 (10%)
Query: 76 PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQ 133
P V++ Y+ Y +++ L S+AL VL+TQ + A G+ + A + +WILKDG+
Sbjct: 124 PKQVSDNYISYVKWKFLHRVFSSALQVLATQVMFRAIGIGQSRSLASSAAFNWILKDGLG 183
Query: 134 HVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
+ + I ++L + D+ KR R L+ L G+E+++P+ PQ FL +A + N AK I
Sbjct: 184 RLSRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGVELMTPVFPQYFLLLASIANIAKQI 243
Query: 193 AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS--MQGK 250
++ AT ++ SFA NL ++ AK + + F+ +GL + L ++ +Q
Sbjct: 244 SLSCYLATGSAVHRSFAVADNLGEVSAKAQIQTVCFDNLGLLLAVLLNMLFQNNQRLQAS 303
Query: 251 M--IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL- 307
+ ++ P+ S + + + ++ + TL R +I+ +++ ++ SPA++ E +
Sbjct: 304 LPFVLYPIFSTFDLLGIYQGLKHINLQTLTKDRLEIILERWIEFRQVPSPAEVSEEEGID 363
Query: 308 LFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCL-----GSRWTD---- 357
L R + +++G DP + ++ L ++ F + + G R T
Sbjct: 364 LLGSRGSKRVWPIRIGCLDPKAQIPTLSMMAMQSLCNDDSFFITIELSSKGFRRTPKPGI 423
Query: 358 -MVLEQNASGEDALRGWLVAAYAASMAKSF-------HDPSLT--VLQD-------AYDK 400
+ L + A+ D + L Y + K+F D SL+ LQD +
Sbjct: 424 VICLREGANSVDIITSLLQTCY---IRKAFGTNRIKRRDVSLSDLTLQDWSLLTGESKRA 480
Query: 401 MNDVFTPLLSELQAKGW 417
D L +LQ GW
Sbjct: 481 ARDDIITLNRQLQEHGW 497
>gi|224013726|ref|XP_002296527.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968879|gb|EED87223.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 719
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 110/462 (23%), Positives = 183/462 (39%), Gaps = 111/462 (24%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATV 123
++ F P+ YP SV Y Y + L ++ VLSTQ+LL A G+ A A
Sbjct: 239 LIHHFLPANYPQSVCPSYATYASYCFLGSIAGSSAMVLSTQALLVAVGVGTQSAAPMAAA 298
Query: 124 VSWILKDGMQHVGKLI-CSNLG-----------------------------------ARM 147
++W++KDG+ +G ++ S LG
Sbjct: 299 LNWVMKDGVGQLGGVVFASQLGKGGVDVDYWKVKLNKLMGSSSSKSTETSARGNFQRGTA 358
Query: 148 DSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAKGIAVVAARATRLPIYS 206
DS PKRWR++A DL T LE+ +P + PQ FL A + N K + +AA A+R ++
Sbjct: 359 DSNPKRWRMVAALALDLSTLLEICTPWMGPQWFLPCASIANIGKNVGFLAASASRAAVHQ 418
Query: 207 SFAKEG-------------------------------NLSDLFAKGEAISTLFNVVGLGA 235
S G NL D+ AK + + + +++G
Sbjct: 419 SLCTGGSFLPPVPDNTTEKDESYDNPKKGKNVTSSSNNLGDVTAKSGSQAIVASLLGTAL 478
Query: 236 GIQLASTVCSSMQGKMIVGP--LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT- 292
GI L+ T CS I+ +LS +H ++A P+ +++ R +I+ ++
Sbjct: 479 GIWLSRTFCSDYGTAGILAGFVILSAVHQVCTYRALKAVPLRSVDRHRLHIILNAYITAD 538
Query: 293 GR-----ISSPADLRYREDLLFPGRLIEDAGN-VKVGRDPHKVVKPSKFHELKELFAE-- 344
GR + +P+ + E L P +++ + + VG D + PS EL+ L
Sbjct: 539 GRSVQSMVLTPSQVAEEESFL-PMMAPDNSVHWLSVG-DSLIDICPSGLDELEALLIRRD 596
Query: 345 -----EKFILCLGSRWTD-----------MVLEQNASGEDALRGWLVA--AYAASMAKSF 386
EK+I+ + T+ + A+ D L+G A A A ++
Sbjct: 597 DISQYEKYIMKVHLADTNATTAAVDGIVQLTFLDGATDNDLLQGMFHAYIARALMISHQC 656
Query: 387 HDPS----------LTVLQDAYDKMNDVFTPLLSELQAKGWH 418
HD S +++D++ N L LQ GW
Sbjct: 657 HDDSNDNHNHGKRGNQIIEDSHTITNHRMPEFLEHLQQGGWQ 698
>gi|171678219|ref|XP_001904059.1| hypothetical protein [Podospora anserina S mat+]
gi|170937179|emb|CAP61836.1| unnamed protein product [Podospora anserina S mat+]
Length = 448
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 71/254 (27%), Positives = 134/254 (52%), Gaps = 9/254 (3%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT-- 122
+ L F P+GYP +V YL Y + +LQ F S+ S+L+++++L G+ + A T
Sbjct: 41 ALLYAFLPAGYPHTVTTDYLPYQTYDSLQAFASSITSLLASRAVLEGLGVGSSEASPTGA 100
Query: 123 VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL-FL 180
++ I D + + ++ ++ +G ++ E K +R LAD D L++L+P P L L
Sbjct: 101 LILKITGDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDAAQFLDLLTPALPYLPKL 160
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ + + VAA A++ + + FA GNL++L AK + T+ +++G+ G +
Sbjct: 161 GVIVSAGVLRSLCGVAANASKASLSAHFAVTGNLAELNAKEASQETVVSLLGMLVGSLVV 220
Query: 241 STVCSSMQ--GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
V + G MI+ L+ H+ +RA + +LN QR ++ +++++G + +P
Sbjct: 221 RLVEDKHKVWGLMII---LAGCHLAMNYRAVRAVRMTSLNRQRATIVFREWLESGTVLNP 277
Query: 299 ADLRYREDLLFPGR 312
A + RE +L GR
Sbjct: 278 AQVSQRESILMNGR 291
>gi|380471002|emb|CCF47486.1| hypothetical protein CH063_04176 [Colletotrichum higginsianum]
Length = 478
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/258 (29%), Positives = 127/258 (49%), Gaps = 11/258 (4%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--V 123
+ F P GYP SV Y+ Y + +LQ F S +LS +++L G+ + AT +
Sbjct: 64 LFDAFLPIGYPDSVTPDYIGYQAYDSLQAFFSTITGLLSNRAILQGLGVGDPNSSATYAL 123
Query: 124 VSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLE 181
+ ILKDG+ V + G ++ E KR+R LAD D L++ SP P +
Sbjct: 124 LLTILKDGISRVATIAFAYRFGLVIEPECKRYRFLADIFNDSAFFLDLFSPYFDPWTKVA 183
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
L + + VAA A++ + FA+ NLS+L AK + T +VGL G S
Sbjct: 184 AIVLAEALRAMCGVAAGASKAALSKHFARRNNLSELNAKEASQETAIGLVGLLVG----S 239
Query: 242 TVCSSMQGKMIVGPL---LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
V ++G+ V L L +H+ +R ++TLN QR ++ + +T R+ +P
Sbjct: 240 IVVRFVEGREAVFVLMVFLVFVHLAMNYFGVRCVQLDTLNQQRATILFDHYAQTKRVLTP 299
Query: 299 ADLRYREDLLFPGRLIED 316
+ +RE+++F +I++
Sbjct: 300 KQVAWRENIVFWSPVIKN 317
>gi|159467675|ref|XP_001692017.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278744|gb|EDP04507.1| predicted protein [Chlamydomonas reinhardtii]
Length = 216
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 68/211 (32%), Positives = 101/211 (47%), Gaps = 10/211 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS---- 125
F P GYP +V YL Y + H T L T SLL A G+ PA +S
Sbjct: 5 FLPPGYPQTVTPDYLSYQLWSVPTHITGHLSHALVTSSLLSAVGVSAGPAATVALSASIK 64
Query: 126 WILKDGMQHVGKLIC-SNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
WI+KDG+ +G+LI S A D +P+RWR++A+ L G GLEV + L P FL +A
Sbjct: 65 WIVKDGVGALGRLIVGSRFSAEFDEDPRRWRMVAELLSTCGMGLEVATVLYPHYFLLLAC 124
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
G FA+ + + I + FA+ N+ + AK E + GL A + + +
Sbjct: 125 SGKFAQALGKGMGKPVFRVIQTHFARAQNVGAVAAKEEVWEVTAQMAGLAASVAVLRALE 184
Query: 245 SSMQGKMIVGPLLSV--IHV---YSVIEEMR 270
+S +VG ++ +HV Y + +R
Sbjct: 185 ASGAADGVVGTWAAIQAVHVALRYRALATLR 215
>gi|328703155|ref|XP_001949028.2| PREDICTED: UPF0420 protein C16orf58 homolog [Acyrthosiphon pisum]
Length = 457
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 78/325 (24%), Positives = 156/325 (48%), Gaps = 21/325 (6%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV--VSWI 127
F P G+P SV++ Y Y + +Q F S ++L+T ++L G+ A A +++I
Sbjct: 56 FLPRGFPNSVSKDYTEYQIWDTIQAFCSTISNILATNAILKGVGVGDAQATALAASITYI 115
Query: 128 LKDGMQHVGKLI---CSNLGARMDSEPKRWRILADALYDLGTGLEVL--SPLCPQLFLEM 182
LKDG +G++ C G+ +D++ K+WR+ ADAL DL +E+L P Q L +
Sbjct: 116 LKDGAGMIGRITFAWCK--GSELDTQCKKWRLRADALNDLAIFIELLLSIPWIRQFSLII 173
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ AK I VA ATR + A N+ D+ AK + T N++ G+ +
Sbjct: 174 LSFSSCAKSIVSVAGGATRAALTQHQAIRDNMGDVSAKDGSQETCINLIAFLIGLIMLPI 233
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV----KTGRISSP 298
V + + ++ +++ +H+++ + +++ +N N R + + ++ + + P
Sbjct: 234 VENRILLIWMIYLVVTSLHLFANYKAVKSLNINVFNSARYDLTLKYYLSNDTQNHDVQKP 293
Query: 299 ADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELK-----ELFAEEKFILCLGS 353
+ RE F G + +++G H+++ + EL+ + +IL + +
Sbjct: 294 DYINKRE-ACFLGDEKLCSFKIQLGTSIHELLYTNTLTTWDIIDHIELYKDYLYILIVDT 352
Query: 354 R--WTDMVLEQNASGEDALRGWLVA 376
+VL++N + E+ L+ + A
Sbjct: 353 HRDIIRVVLDKNINTENILKAYFHA 377
>gi|355736030|gb|AES11869.1| hypothetical protein [Mustela putorius furo]
Length = 218
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 56/167 (33%), Positives = 90/167 (53%), Gaps = 3/167 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A +W+
Sbjct: 52 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLAGSLATHAVLLGIGVGNAEASVSAATATWL 111
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G ++D K+WR+ AD L D+ LE+++P+ P F +
Sbjct: 112 VKDSAGMLGRIIFAWWKGTKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMFFTMIVCTS 171
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
N AK I VA ATR + A+ N++D+ AK + TL N+ GL
Sbjct: 172 NLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGL 218
>gi|79561141|ref|NP_179928.2| uncharacterized protein [Arabidopsis thaliana]
gi|51970902|dbj|BAD44143.1| unnamed protein product [Arabidopsis thaliana]
gi|330252365|gb|AEC07459.1| uncharacterized protein [Arabidopsis thaliana]
Length = 520
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 74/300 (24%), Positives = 150/300 (50%), Gaps = 12/300 (4%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP-- 118
R+ S FF P V++ Y+ Y +++ L S+AL VL+TQ++ A G+ +
Sbjct: 113 RLTSSAAKDFF---LPKQVSDNYISYVKWKFLHRVFSSALQVLATQAMFRAIGIGQSRSL 169
Query: 119 AQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
A + +WILKDG+ + + I ++L + D+ KR R L+ L G+E+++P+ PQ
Sbjct: 170 ASSAAFNWILKDGLGRLSRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGVELMTPVFPQ 229
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
FL +A + N AK I++ AT ++ SFA NL ++ AK + + F+ +GL +
Sbjct: 230 YFLLLASIANIAKQISLSCYLATGSAVHRSFAVADNLGEVSAKAQIQTVCFDNLGLLLAV 289
Query: 238 QLASTVCSSMQGK----MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
L + + + ++ P+ S + + + ++ + TL R +I+ +++
Sbjct: 290 LLNMLFQHNQRLQACLPFVLYPIFSTFDLLGIYQGLKHINLQTLTKDRLEIILERWIEFR 349
Query: 294 RISSPADLRYREDL-LFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCL 351
++ SPA++ E + L R + +++G DP + ++ L +++ + + +
Sbjct: 350 QVPSPAEVSEEEGIGLLGSRGSKRVWPIRIGCLDPKAQIPTLSMMAMQSLCSDDGYFITM 409
>gi|429848918|gb|ELA24353.1| duf647 domain-containing protein [Colletotrichum gloeosporioides
Nara gc5]
Length = 440
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 153/309 (49%), Gaps = 20/309 (6%)
Query: 13 KEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVV-----DSFL 67
KEP K++ + E ++S + R + DG L+ ++S R++ F
Sbjct: 5 KEP-KAIVIDER-----DSSGLIRRRWLHDHDGRLTEMPTENSPTTISRLLTMNIRQVFS 58
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVS 125
+ FFP GYP SV++ YL Y + +LQ F S S+L+ +++L G+ + AT ++
Sbjct: 59 DAFFPIGYPDSVSKDYLGYQLYDSLQAFFSTITSLLANRAILQGLGVGDADSSATYALLL 118
Query: 126 WILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
ILKDG+ + + G ++ E K++R LAD D L++ SPL + ++A
Sbjct: 119 TILKDGISRLATIGFAYRFGLVIEPECKKYRFLADIFNDSAFFLDLFSPLFGS-WTKVAA 177
Query: 185 L--GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
L + + VAA A++ + FA GNLS+L AK + T ++GL G +
Sbjct: 178 LVVAEALRAMCGVAAGASKAALSKHFALRGNLSELNAKESSQETAVGLIGLIVGSFVVRH 237
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRIS-SPAD 300
V S + + +L +H+ +R +N N QR + +++KT G + +PA+
Sbjct: 238 V-ESREAVFALMIILVFVHLGMNYLGVRCVQLNNFNQQRATIFFEEYMKTMGNVRLTPAE 296
Query: 301 LRYREDLLF 309
+ RE+++F
Sbjct: 297 VAKRENIIF 305
>gi|367049005|ref|XP_003654882.1| hypothetical protein THITE_131994 [Thielavia terrestris NRRL 8126]
gi|347002145|gb|AEO68546.1| hypothetical protein THITE_131994 [Thielavia terrestris NRRL 8126]
Length = 451
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 71/259 (27%), Positives = 138/259 (53%), Gaps = 13/259 (5%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
++ + L F P+GYP +V + YL Y + +LQ F S+ S+L+ +++L G+ + +
Sbjct: 37 LLKAILYAFLPAGYPHTVTDDYLAYQTYDSLQAFASSITSLLANRAVLEGLGVGDSSSSP 96
Query: 122 T--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
T ++ I D + + ++ ++ +G ++ E K +R LAD D L++L+P P
Sbjct: 97 TGALILKITGDTISRIATILFAHRMGQAIEPECKFYRFLADLFNDSAQFLDLLTPALP-Y 155
Query: 179 FLEMAGL--GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
F ++A + + + VAA A++ + + FA GNL++L AK + T+ +++G+ AG
Sbjct: 156 FPKLAVIVSAGVLRSLCGVAAGASKASLSAHFAVTGNLAELNAKEASQETVVSLLGMLAG 215
Query: 237 IQLASTVCSSMQGKMIVGPL---LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
S V ++ K +V L L+ +H+ +R + +LN QR ++ +++ G
Sbjct: 216 ----SLVVRLVEDKQLVWLLMVALAAVHLIMNYRAVRCVRMRSLNRQRATIVFREWLDHG 271
Query: 294 RISSPADLRYREDLLFPGR 312
+ +PA + RE +L GR
Sbjct: 272 AVLTPAQVARRESILRNGR 290
>gi|116205700|ref|XP_001228659.1| hypothetical protein CHGG_02143 [Chaetomium globosum CBS 148.51]
gi|88182740|gb|EAQ90208.1| hypothetical protein CHGG_02143 [Chaetomium globosum CBS 148.51]
Length = 448
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 70/255 (27%), Positives = 132/255 (51%), Gaps = 11/255 (4%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT-- 122
+ L F P+GYP SV + YL Y + +LQ F S+ S+L+ +++L G+ + + T
Sbjct: 39 AILYAFLPAGYPHSVTDDYLPYQTYDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGA 98
Query: 123 VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL-FL 180
++ I D + + ++ ++ +G ++ E K +R LAD D L++L+P P L
Sbjct: 99 LILKITGDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDSAQFLDLLTPALPYFPKL 158
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ + + VAA A++ + + FA GNL++L AK + T+ +++G+ G
Sbjct: 159 GVIVSAGVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLVG---- 214
Query: 241 STVCSSMQGKMIVGPLLSV---IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
S V ++ K +V L+ V +H+ +RA + +LN QR ++ +++ G + +
Sbjct: 215 SLVVKMVEDKQVVWMLMVVLAGVHLAMNYHAVRAVKMRSLNRQRATLVFREWLDRGTVLT 274
Query: 298 PADLRYREDLLFPGR 312
P + RE +L GR
Sbjct: 275 PEQVSQRESILRKGR 289
>gi|322700393|gb|EFY92148.1| DUF647 domain-containing protein [Metarhizium acridum CQMa 102]
Length = 466
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 81/312 (25%), Positives = 149/312 (47%), Gaps = 14/312 (4%)
Query: 2 NLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHR 61
+L++K + E DKS +VQ W SD+ + + + ++ S + R
Sbjct: 20 DLIKKGDDVEISELDKSGNVQR---RWLHRSDNTIAVLESETE---PSSIIHSSVREWKR 73
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
+ + F P G+P SV+ YL Y + +LQ F S S+L+ ++LL G+ + A
Sbjct: 74 FLS---DAFLPIGFPHSVSNDYLAYQTYDSLQAFFSTITSLLANRALLQGLGVGDADSSA 130
Query: 122 T--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
T ++ IL D + ++ ++ G R++ + KR+R LAD D LE+ SP
Sbjct: 131 TFAMLITILTDATSRIATIVFAHRFGLRIEPDAKRFRFLADIFNDTAFFLELYSPYFGDY 190
Query: 179 FLEMA-GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
+A +G + + VAA A++ + FAK NL++L AK + T ++GL G
Sbjct: 191 GKILALSVGQALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLLVGT 250
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
L + Q + + L + H++ +R+ + T N QR ++ +++ TG + +
Sbjct: 251 -LVVKLVQDHQSVVYLMVTLVLAHLWMNYLGVRSVCMTTFNRQRATILFREYMTTGTVLT 309
Query: 298 PADLRYREDLLF 309
P + RE+++
Sbjct: 310 PEAVAARENIVL 321
>gi|350646547|emb|CCD58759.1| hypothetical protein Smp_142110 [Schistosoma mansoni]
Length = 423
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 71/242 (29%), Positives = 111/242 (45%), Gaps = 17/242 (7%)
Query: 67 LNKFF-PSGYPFSVNEGYLRY-------------TQFRALQHFTSAALSVLSTQSLLFAA 112
L KFF P GYP SV++ YL Y T+ F S+ L++Q++L
Sbjct: 41 LKKFFLPVGYPSSVSDDYLEYQIWDTIQAGIFFITKISLFSAFASSITGALASQAVLIGV 100
Query: 113 GLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLE 169
G+ + A + ++W+ KDG +G++I + G ++D + K WR +AD L D LE
Sbjct: 101 GVGDSSATILSASLTWMFKDGSGMIGRIIFAGYHGIKLDCDCKFWRFVADILNDCALFLE 160
Query: 170 VLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
++SPL LF + L N K + VA ATR I A NL+D+ AK + TL N
Sbjct: 161 IISPLFNYLFTPLLCLANVLKSLVGVAGSATRAAIVQHQAINNNLADVSAKDGSQETLSN 220
Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
++ L V + + IH+YS ++ + T N R + + +
Sbjct: 221 LMAWLLNFILLYMVTGNQFLIWFCFICCTSIHLYSNYRAVKCLKLRTFNRTRFHLAIQQW 280
Query: 290 VK 291
K
Sbjct: 281 FK 282
>gi|320593995|gb|EFX06398.1| duf647 domain containing protein [Grosmannia clavigera kw1407]
Length = 458
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 73/253 (28%), Positives = 141/253 (55%), Gaps = 17/253 (6%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
F P+GYP SV + YL Y + +LQ F+S+ S+L+ +++L G+ +P A V+
Sbjct: 47 FLPAGYPQSVTDDYLAYQTYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPTAALVLK- 105
Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLS---PLCPQLFLEM 182
I++D + ++ ++ +G ++ E K +R +AD D L++L+ PL P++ + +
Sbjct: 106 IIQDTFSRMATILFAHRMGQAIEPECKTYRFMADLFNDSALMLDLLTPVLPLYPKVCIMV 165
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
A + + + +AA A++ + + FA+ GNL++L AK + T+ +++G+ AG +
Sbjct: 166 A--SSILRSLCGIAASASKASLSAHFARTGNLAELNAKEASQETVISLLGMLAG----TA 219
Query: 243 VCSSMQGKMIV---GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
V +Q K V +L ++H+ +R + TLN QR ++ ++++TG+I SP
Sbjct: 220 VVHVVQDKTAVLYWMVMLLMVHLVMNYRGVRCVKMLTLNRQRATIVFREYLETGKILSPG 279
Query: 300 DLRYREDLLFPGR 312
++ RE +L GR
Sbjct: 280 EVAERESVLLGGR 292
>gi|239610850|gb|EEQ87837.1| DUF647 domain-containing protein [Ajellomyces dermatitidis ER-3]
Length = 498
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 97/390 (24%), Positives = 183/390 (46%), Gaps = 35/390 (8%)
Query: 23 EIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEG 82
++P +DS+ R + V +V SR + S LN F P GYP SV++
Sbjct: 34 QLPTPGSSENDSIRRERR-------RVDIVPPSRKESLSSLTSLLNVFLPVGYPHSVSDD 86
Query: 83 YLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV--VSWILKDGMQHVGKLIC 140
YL Y + +LQ F+S+ +L+++++L G+ A TV + +L D M + ++
Sbjct: 87 YLEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVLHDSMGRIATILF 146
Query: 141 SN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAAR 198
++ LG ++ E K +R+ AD L D + LSP P+ L + + + + + VAA
Sbjct: 147 AHRLGTSLEPECKMYRLAADVLNDSAMVFDCLSPAFPKHLRIVVLACSSVLRALCGVAAG 206
Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLS 258
+++ + S FA+ GNL +L AK + T+ +++G+ G + S V + L
Sbjct: 207 SSKASLSSHFARWGNLGELNAKDSSQETVISLLGMLCGSLVVSRVSTPF-ATWATLLFLL 265
Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD------LRYREDLLFPGR 312
++H+ + +R+ + TLN QR ++ + + GR+ +P + R+ +L
Sbjct: 266 LVHLSTNYAAVRSVNMTTLNRQRANIVFSTLFEDGRVLTPKQASKCERIFERDGVLRWKA 325
Query: 313 LIEDAGNVKVGRDPHKVVKPSK------------FHELKELFAEEKFILCLGS--RWTDM 358
G+ ++G +++ S+ L LF EE++IL S + +
Sbjct: 326 SAATLGSCRIGVSFQELLCGSRGRVNSIRDINIDIRRLLHLFREEEYILWFNSSDKGGTI 385
Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHD 388
VL+ + + L+ W ++A +AK D
Sbjct: 386 VLKNSVTPLSQLKAW---SHALMVAKRMAD 412
>gi|224094527|ref|XP_002310175.1| predicted protein [Populus trichocarpa]
gi|222853078|gb|EEE90625.1| predicted protein [Populus trichocarpa]
Length = 333
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 65/232 (28%), Positives = 118/232 (50%), Gaps = 18/232 (7%)
Query: 86 YTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLI-CS 141
Y +++ L S+AL VL+TQ++ A G+ R P+ A ++W+LKDG+ + + I +
Sbjct: 3 YVKWKFLHRVFSSALQVLATQAMFRAIGIGYSRSLPS-AAALNWVLKDGLGRLSRCIYTA 61
Query: 142 NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATR 201
+L + D+ KR R L+ L G+E+L+P PQ FL +A L N AK I++ AT
Sbjct: 62 SLASAFDTNLKRVRFTTSVLFSLSIGIELLTPTFPQYFLLLATLANIAKQISLACYLATG 121
Query: 202 LPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM---------- 251
++ SFA N+ ++ AK + + F+ +GL L ++ + +
Sbjct: 122 SAVHRSFAIADNIGEVSAKAQIQTVSFDNLGLMLAALLNMLFKNNQRFSLSFPTMSFCFC 181
Query: 252 ---IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
V P+ S I ++ + + ++ + TL R +I+ ++ G + SPA+
Sbjct: 182 LPFFVYPIFSAIDLFGIYQGLQHVHLQTLTKDRLEIILNSWIDFGHVPSPAE 233
>gi|261206350|ref|XP_002627912.1| DUF647 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|239592971|gb|EEQ75552.1| DUF647 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
Length = 497
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 97/390 (24%), Positives = 183/390 (46%), Gaps = 35/390 (8%)
Query: 23 EIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEG 82
++P +DS+ R + V +V SR + S LN F P GYP SV++
Sbjct: 33 QLPTPGSSENDSIRRERR-------RVDIVPPSRKESLSSLTSLLNVFLPVGYPHSVSDD 85
Query: 83 YLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV--VSWILKDGMQHVGKLIC 140
YL Y + +LQ F+S+ +L+++++L G+ A TV + +L D M + ++
Sbjct: 86 YLEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVLHDSMGRIATILF 145
Query: 141 SN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAAR 198
++ LG ++ E K +R+ AD L D + LSP P+ L + + + + + VAA
Sbjct: 146 AHRLGTSLEPECKMYRLAADVLNDSAMVFDCLSPAFPKHLRIVVLACSSVLRALCGVAAG 205
Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLS 258
+++ + S FA+ GNL +L AK + T+ +++G+ G + S V + L
Sbjct: 206 SSKASLSSHFARWGNLGELNAKDSSQETVISLLGMLCGSLVVSRVSTPF-ATWATLLFLL 264
Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD------LRYREDLLFPGR 312
++H+ + +R+ + TLN QR ++ + + GR+ +P + R+ +L
Sbjct: 265 LVHLSTNYAAVRSVNMTTLNRQRANIVFSTLFEDGRVLTPKQASKCERIFERDGVLRWKA 324
Query: 313 LIEDAGNVKVGRDPHKVVKPSK------------FHELKELFAEEKFILCLGS--RWTDM 358
G+ ++G +++ S+ L LF EE++IL S + +
Sbjct: 325 SAATLGSCRIGVSFQELLCGSRGRVNSIRDINIDIRRLLHLFREEEYILWFNSSDKGGTI 384
Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHD 388
VL+ + + L+ W ++A +AK D
Sbjct: 385 VLKNSVTPLSQLKAW---SHALMVAKRIAD 411
>gi|427789495|gb|JAA60199.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 464
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 121/248 (48%), Gaps = 8/248 (3%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
F F P GYP SV+ Y Y + ++Q F S+ L+TQS+L G+ A A
Sbjct: 45 FSEIFLPVGYPDSVSRDYTEYQIWDSIQAFASSITGTLATQSVLSGVGVGDQGASVLAAT 104
Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+WIL+DG G+++ + G+ +D + K+WR+ AD L D+ LE++ +
Sbjct: 105 TTWILRDGTGMTGRILFAWFQGSNLDYDSKKWRLFADILNDVAIFLELMCQYMKGYVTAV 164
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ + AK I VA ATR + A+ N++D+ AK + TL N+ + L
Sbjct: 165 LCVSSVAKSIVGVAGGATRAALTQHQARNNNMADVSAKDGSQETLVNLAAFLFSLLLLRI 224
Query: 243 VCSSMQGKMIVGPLLS--VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPA 299
V + ++ G +S ++H+++ + + T N R A++V F++T G I+
Sbjct: 225 VAGNPW--LLYGIFISFTLLHIFANYRAVSCVVMETFNRSRYAIVVRRFLETAGDIAPVT 282
Query: 300 DLRYREDL 307
+ RE +
Sbjct: 283 WVNARESV 290
>gi|327350378|gb|EGE79235.1| DUF647 domain-containing protein [Ajellomyces dermatitidis ATCC
18188]
Length = 497
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/364 (25%), Positives = 174/364 (47%), Gaps = 28/364 (7%)
Query: 49 VKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSL 108
V +V SR + S LN F P GYP SV++ YL Y + +LQ F+S+ +L+++++
Sbjct: 52 VDIVPPSRKESLSSLTSLLNVFLPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAV 111
Query: 109 LFAAGLRPTPAQATV--VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLG 165
L G+ A TV + +L D M + ++ ++ LG ++ E K +R+ AD L D
Sbjct: 112 LQGVGVGDASASPTVALLHSVLHDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSA 171
Query: 166 TGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAI 224
+ LSP P+ L + + + + + VAA +++ + S FA+ GNL +L AK +
Sbjct: 172 MVFDCLSPAFPKHLRIVVLACSSVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQ 231
Query: 225 STLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAM 284
T+ +++G+ G + S V S+ L ++H+ + +R+ + TLN QR +
Sbjct: 232 ETVISLLGMLCGSLVVSRV-STPFATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANI 290
Query: 285 IVADFVKTGRISSPADLRYREDLLFPGRLIE------DAGNVKVGRDPHKVVKPSK---- 334
+ + + GR+ +P E + ++ G+ ++G +++ S+
Sbjct: 291 VFSTLFEDGRVLTPKQASKCERIFERDGVLRWKASAATLGSCRIGVSFQELLCGSRGRVN 350
Query: 335 --------FHELKELFAEEKFILCLGS--RWTDMVLEQNASGEDALRGWLVAAYAASMAK 384
L LF EE++IL S + +VL+ + + L+ W ++A +AK
Sbjct: 351 SIRDINIDIRRLLHLFREEEYILWFNSSDKGGTIVLKNSVTPLSQLKAW---SHALMVAK 407
Query: 385 SFHD 388
D
Sbjct: 408 RIAD 411
>gi|281211285|gb|EFA85450.1| DUF647 family protein [Polysphondylium pallidum PN500]
Length = 427
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 56/166 (33%), Positives = 91/166 (54%), Gaps = 3/166 (1%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTP 118
++ ++F F PSGYP SV++ Y Y ++ +Q+ + +LST +LL GL +
Sbjct: 101 KLYETFKTLFLPSGYPESVSQDYASYQRWIFVQNTLGSVTYMLSTHALLTSVGVGLSASL 160
Query: 119 AQATVVSWILKDGMQHVG-KLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
+ +SW+LKDG+ L S +D + KR++ D L++ G LE +P P
Sbjct: 161 PFSAAISWVLKDGLGASALVLFASKYSTSLDFDLKRFKFRGDFLHNFGVFLETCTPFLPG 220
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
FL MA + N AKGIA + +TR ++ SFA + N+ D+ AK ++
Sbjct: 221 YFLLMASISNLAKGIAGLIYGSTRASLHKSFALKENIGDITAKYQS 266
>gi|145347876|ref|XP_001418386.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578615|gb|ABO96679.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 362
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/240 (29%), Positives = 111/240 (46%), Gaps = 20/240 (8%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ---ATVVSW 126
F PSG+P + + Y+ + +++ L L ++S QSLL A G+ TP W
Sbjct: 8 FLPSGFPATTSADYVEWLRWQLLSLLFRDVLEIMSAQSLLVALGMGNTPGALPLTAAAKW 67
Query: 127 ILKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
+LKDG+ LI + G R D +PKRW L++ L D+ +E+++P P LFL +A
Sbjct: 68 VLKDGVGSFATLIAGSFGGQRYDEDPKRWWGLSNTLEDVARAIELVTPAAPGLFLPLAAS 127
Query: 186 GNFAKGIAVVAARATRL--PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
F + A + R + L F + NL D+ AK E ++ L AGIQ+ V
Sbjct: 128 ATFVRA-AALTGRGSLLNGTFMQHFGRNNNLGDIRAKLEVQGRWLALIALPAGIQVFQLV 186
Query: 244 CSSMQ---------GKMIV--GPLLSVI--HVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
++ G + G VI HV++ + ++ +TLN R + FV
Sbjct: 187 SAAAADLAADGDEFGAFVAAFGAYGGVIGVHVFACWQAAKSLKFDTLNRFRLLKLAEAFV 246
>gi|449542984|gb|EMD33961.1| hypothetical protein CERSUDRAFT_117486 [Ceriporiopsis subvermispora
B]
Length = 505
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 106/433 (24%), Positives = 176/433 (40%), Gaps = 27/433 (6%)
Query: 22 QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNE 81
+E + W E S + + + S R+ F F P+ YP SV+
Sbjct: 70 RECHVSWSEDSGVEKEWRELSQPTEAEERAISSSGTFVGRLSAWFRQMFLPTNYPQSVHR 129
Query: 82 GYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKL- 138
Y+ + + ++ +SVL Q+LL + G+ + A V WI+KDG V KL
Sbjct: 130 SYMPFHVLQFVEGTLGTLVSVLCNQALLTSVGVSAEGSIFGAVAVQWIIKDGAGEVAKLF 189
Query: 139 ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC---PQLFLEMAGLGNFAKGIAVV 195
DS PK + + + L LG+GL++ + L P FL A GN K I
Sbjct: 190 FIRKFSPYFDSHPKTFTLSGEVLVALGSGLQMATLLVNPTPLSFLICAAGGNAFKLIGYA 249
Query: 196 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGP 255
T + F+++GN D+ AK E+ +++ + G AGI L T+ +
Sbjct: 250 IWFTTHIKFIRYFSQQGNTGDVAAKDESQASVAQLAGYAAGIGLL-TLSHDPSYLYTIFA 308
Query: 256 LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIE 315
L H+ + MR A L R +++ + + G + + L + G
Sbjct: 309 LAVPTHLVATTLMMRCATFELLTIPRLSLLAETYSRKGDVMPLSRLEAKRATGMFGEFYG 368
Query: 316 DAGNVKVGRDPH--KVVKP-----SKFHELKELFAEEKFIL----CLGSRWTDMVLEQNA 364
V P +VVK +++ E+ +E+++L + + L +A
Sbjct: 369 KGQGKLVSLAPRVAEVVKDEEGERARWALCTEVCRDERYLLYPPAAPQANPASVFLRPDA 428
Query: 365 SGEDALRGWLVAAYAASMAKSFHDPSLTV---LQDAYDKMNDVFTPLLSELQAKGWHTDR 421
S +D LR L +AA + D +T+ L+D+++ + F L KGW TD
Sbjct: 429 SSDDMLRAVL---HAARLRSLLADDIVTLREALKDSHEWTQEHFMTFKGALDDKGWRTDE 485
Query: 422 --FLDGTGTRFAW 432
F D G R W
Sbjct: 486 VAFAD-QGHRLLW 497
>gi|443714847|gb|ELU07084.1| hypothetical protein CAPTEDRAFT_141059, partial [Capitella teleta]
Length = 266
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 123/251 (49%), Gaps = 5/251 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P G+P SV++ Y++Y + +Q F S+ + L+TQ++L G+ A A ++W+
Sbjct: 6 FLPKGFPASVSDDYVQYQIWDTVQAFASSITNTLATQAVLKGVGVGDEQATVLAATLTWL 65
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KDG +G+++ + + G+ +D + KRWR+ AD L D+ +++ +P F + +
Sbjct: 66 MKDGTGMLGRILFTWIQGSYLDCDCKRWRLFADILNDVSILMDICAPFFRVYFTLIVCVA 125
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
+ I VA ATR + A N++D+ AK + TL N+ L + L V
Sbjct: 126 GVCRSIVGVAGGATRAAVTQHQACRSNMADVSAKDGSQETLVNLAALLCNLALVPMVSDR 185
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV--KTGRISSPADLRYR 304
I+ + +H+Y+ + +R+ + T N R + +F+ TG + R
Sbjct: 186 QWLVWILFTCFTTLHIYANFKAVRSLDMPTFNQVRLHLAAQEFISDNTGNFPGVKTVNQR 245
Query: 305 EDLLFPGRLIE 315
E +L L+
Sbjct: 246 EPVLCSESLLS 256
>gi|367028787|ref|XP_003663677.1| hypothetical protein MYCTH_103025 [Myceliophthora thermophila ATCC
42464]
gi|347010947|gb|AEO58432.1| hypothetical protein MYCTH_103025 [Myceliophthora thermophila ATCC
42464]
Length = 448
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 69/258 (26%), Positives = 134/258 (51%), Gaps = 11/258 (4%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
++ + L F P+GYP +V YL Y F +LQ F S+ S+L+ +++L G+ + +
Sbjct: 36 LLKAILYAFLPAGYPHTVTGDYLAYQTFDSLQAFASSITSLLANRAVLEGLGVGDSSSSP 95
Query: 122 T--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
T ++ I D + + ++ ++ +G ++ E K +R LAD D L++L+P P L
Sbjct: 96 TGALILKITGDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDSAQFLDLLTPALPYL 155
Query: 179 -FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
L + + + VAA A++ + + FA GNL++L AK + T+ +++G+ G
Sbjct: 156 PKLGVIVSAGVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLVG- 214
Query: 238 QLASTVCSSMQGKMIVGPLLSV---IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
S V ++ K +V L+ + +H+ +R+ + +LN QR ++ +++ G
Sbjct: 215 ---SLVVRMVEDKQVVWMLMVILAGVHLTMNYRAVRSVKMRSLNRQRATIVFREWLDHGV 271
Query: 295 ISSPADLRYREDLLFPGR 312
+ +P + RE +L GR
Sbjct: 272 VLTPEQVAQRESILGNGR 289
>gi|147807901|emb|CAN62157.1| hypothetical protein VITISV_025492 [Vitis vinifera]
Length = 582
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 50/131 (38%), Positives = 72/131 (54%), Gaps = 1/131 (0%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQATVV 124
FL P G+P SV YL YT +R +Q S VL+TQ+LL+A GL + A V
Sbjct: 216 FLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAV 275
Query: 125 SWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
+W+LKDG+ ++ K++ S G D PK WR+ AD L + GLE+L+P P FL +
Sbjct: 276 NWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGA 335
Query: 185 LGNFAKGIAVV 195
+ + A +
Sbjct: 336 VAGAGRSAAAL 346
>gi|307111734|gb|EFN59968.1| hypothetical protein CHLNCDRAFT_133075 [Chlorella variabilis]
Length = 434
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 97/417 (23%), Positives = 164/417 (39%), Gaps = 85/417 (20%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ----ATVVS 125
F P+G+P SV YLRY H T L+T S++ A G+ PA + V
Sbjct: 39 FLPTGWPHSVTPDYLRYQLASVPAHITGWMSHSLATSSMITALGVDAGPATIAATSAAVK 98
Query: 126 WILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCP-------- 176
W+ KDG+ G+LI L + D +PKRWR++A+ + +G +E+ + + P
Sbjct: 99 WLTKDGLGAAGRLIVGGRLSSVFDEDPKRWRMVAEGVTTVGLAMEIATQIYPGGLRSKFR 158
Query: 177 ------QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
F+ +AG G K +A I + F+ N+ D+ AK E +
Sbjct: 159 VVSVVWSYFIALAGTGTLCKSMAKGMGTPCFRIIQTHFSATNNVGDVAAKEEVWEVAAQL 218
Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+GL A SV+ + ++ E +P +T+ P+R A++ + V
Sbjct: 219 LGLAA----------------------SVVALQAL--EAAGSP-DTVVPKRAALMASSHV 253
Query: 291 KTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDP-------------HKVVKPSKFHE 337
+TG + S + E +L P + + + + D + + E
Sbjct: 254 RTGLVPSVEQVNLSEQVLLPPTMCQPSVHYGCTLDQALGSGGGGFGGGAASGGRGQQLAE 313
Query: 338 LKELFAEEKFILCL--GSRWTDMVLEQNASGEDALRGWLVAAY----------------- 378
L L+ E+++L G+ W +VLE A D LR AA+
Sbjct: 314 LVRLYDGEQYMLTWHRGAAWV-LVLE-GAQRADLLRAMWQAAWLEEHHQQQGEGLEQQQG 371
Query: 379 -------AASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGT 428
+ + P+ VL + M + L + Q +GW D+ + GT
Sbjct: 372 EGPAPSDGSGGSVGSVVPAAEVLAASLAAMQQQWPDFLQQAQEQGWLLDKTVLPQGT 428
>gi|384249834|gb|EIE23315.1| hypothetical protein COCSUDRAFT_15701 [Coccomyxa subellipsoidea
C-169]
Length = 265
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 65/228 (28%), Positives = 112/228 (49%), Gaps = 7/228 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV----VS 125
F P GYP +V YLR+ + H T + L+T SLL A G+ + AT +
Sbjct: 9 FLPRGYPETVTPDYLRFQLWAIPAHITGWMSTSLATSSLLKAVGVGNSAVGATAAAAAIK 68
Query: 126 WILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
WI KDG+ G+ L+ LG D +PKRWR++A+A +G LE+ + P F+ +AG
Sbjct: 69 WITKDGIGAAGRVLVGGRLGNVFDEDPKRWRMVAEAFLTVGLALEIATAFSPANFIVLAG 128
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
GN ++ + I + FA N+ D+ AK E ++GL + L +
Sbjct: 129 AGNLSRAVGRGMTNPCFRIIQTHFAAASNVGDVAAKEEVWEVSAQLLGLAGSVGLLKAIE 188
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVN--TLNPQRTAMIVADFV 290
++ + + ++G +V ++S++ A + ++N +R ++ A V
Sbjct: 189 ATGKPENVLGVWAAVQVLHSLLRYKSLAVLQFPSVNQKRACLLAAAHV 236
>gi|384252100|gb|EIE25577.1| DUF647-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 286
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 111/221 (50%), Gaps = 5/221 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
F P GYP +V Y+ + + ++Q S ++ + ++L G+ + T V +
Sbjct: 49 FLPDGYPDAVTSDYVGFQIWDSIQALCSYVRGMICSNAILTGIGVGKQATSPLVAVFQFF 108
Query: 128 LKD--GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
++D GM G L G+++D+ K+WR+ AD ++G L++ SPL P FL +A L
Sbjct: 109 VRDLAGMCG-GVLFAFTQGSQLDAYAKQWRLFADIFNNVGYALDLASPLFPDQFLLLACL 167
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
G+ A+ + VA ATR + F++ N D+ AK + T ++G+ G+
Sbjct: 168 GSLARAVTGVAGGATRAALTQHFSRADNAGDISAKEGSQETATTLIGMILGMAFLQAASD 227
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
S V LL+ +H+Y+ + +RA + +LN R +++
Sbjct: 228 SPVAVWAVFLLLTFLHIYANVRAVRALQLTSLNCSRLDILL 268
>gi|108706828|gb|ABF94623.1| expressed protein [Oryza sativa Japonica Group]
Length = 497
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 77/266 (28%), Positives = 128/266 (48%), Gaps = 31/266 (11%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ RV L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 136 WGRVGGRLLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKS 195
Query: 119 AQ--ATVVSWILKD--GM---------QHVGKLICSN------------------LGARM 147
A W L+D GM Q L SN LG+ +
Sbjct: 196 ATVIGATFQWFLRDLTGMLGGILFTFYQVSNSLANSNRQCHLIAIDSLFTDEITCLGSNL 255
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
DS K WR++AD + DLG +++LSPL P + + LG+ ++ VA+ ATR +
Sbjct: 256 DSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQH 315
Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIE 267
FA N +D+ AK + TL ++G+G G+ LA G + L++ H+Y+ +
Sbjct: 316 FALANNAADISAKEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYK 375
Query: 268 EMRAAPVNTLNPQRTAMIVADFVKTG 293
+++ + TLN +R+++++ F+ G
Sbjct: 376 AVQSLSLTTLNYERSSILLQYFMDNG 401
>gi|255082448|ref|XP_002504210.1| predicted protein [Micromonas sp. RCC299]
gi|226519478|gb|ACO65468.1| predicted protein [Micromonas sp. RCC299]
Length = 226
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/225 (28%), Positives = 114/225 (50%), Gaps = 5/225 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVV--SWI 127
F P YP SV++ Y + + Q +S L+T++LL G+ T A A +I
Sbjct: 2 FLPEDYPNSVSDDYAAFQAWDTAQGLSSYVRGSLTTKALLEGVGVGATGATAASAAAQFI 61
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ--LFLEMAG 184
+D VG L+ + G +D+E K+WR+ AD ++G +E+ +PL FL +A
Sbjct: 62 ARDMCGMVGGLVFTLAKGRHLDAEAKQWRLFADLANNVGMAMELAAPLANDRWWFLALAC 121
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
G+ A+ + AA ATR + FA+ N +D+ AK + T +VG+ G+ + ++
Sbjct: 122 AGSVARSLCGCAAGATRAALTQHFARARNAADIAAKEGSQETAVTLVGMSLGVAVTTSTD 181
Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
+ V L+V HVY + +R+ +++LN +R ++A +
Sbjct: 182 GKTAWQWAVFVALTVAHVYCNVRAVRSLVIDSLNRERVVRLLARY 226
>gi|340967040|gb|EGS22547.1| diphosphomevalonate decarboxylase-like protein [Chaetomium
thermophilum var. thermophilum DSM 1495]
Length = 425
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 68/255 (26%), Positives = 132/255 (51%), Gaps = 11/255 (4%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT-- 122
+ L F P+G+P +V E YL Y + +LQ F S+ S+L+ +++L G+ + + T
Sbjct: 39 AILYAFLPAGFPHTVTEDYLSYQLYDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGA 98
Query: 123 VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL-FL 180
++ I D + + ++ ++ +G ++ E K +R LAD D L++L+P P L
Sbjct: 99 LILKITGDTISRIATILFAHRMGRAIEPECKFYRFLADIFNDSAQFLDLLTPALPYFPKL 158
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ + + VAA A++ + + FA GNL++L AK + T+ +++G+ AG
Sbjct: 159 SVIVSAGVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLAG---- 214
Query: 241 STVCSSMQGKMIVG---PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
S V ++ K +V LL+ +H+ +R+ + +LN QR ++ +++ G + +
Sbjct: 215 SLVVRLVEDKQVVWFLMILLAGVHLTMNYRAVRSVKMRSLNRQRATIVFREWLDHGTVLT 274
Query: 298 PADLRYREDLLFPGR 312
P + RE +L R
Sbjct: 275 PDQVAKRESILCKSR 289
>gi|326428268|gb|EGD73838.1| hypothetical protein PTSG_05532 [Salpingoeca sp. ATCC 50818]
Length = 376
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 74/259 (28%), Positives = 122/259 (47%), Gaps = 9/259 (3%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWI 127
P GYP SV+ Y Y + Q F+S+ + +ST ++L G+ + T A AT + +
Sbjct: 51 LPEGYPASVSSDYASYQIWDTAQAFSSSVMGTISTYAVLKGVGVGDAQATVAAATYTN-L 109
Query: 128 LKDGMQHVGKLICSNLGAR-MDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L+D + + + + MD K+WR++AD + D+ +E++SPL P FL + +
Sbjct: 110 LRDATGMLAGIWFAWYSSTDMDRNAKQWRLVADVVNDMAMLVELVSPLIPAAFLPLMCMA 169
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
+F + I VA ATR + A+ N++D+ AK + TL N+ L + + S
Sbjct: 170 SFLRAIVGVAGGATRAALVRHQARLNNMADVSAKDGSQETLVNLAALVVSLAITPLAAES 229
Query: 247 --MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
M M + PL+ +H+Y +RA + N R A F++T R+ S
Sbjct: 230 RVMAWAMFL-PLV-FMHLYCNYRAVRALEMPAFNAHRLHHACAVFMRTNRVPSVCAANQA 287
Query: 305 EDLLFPGRLIEDAGNVKVG 323
E L IE V +G
Sbjct: 288 EPLFPRAPAIEGHAKVCLG 306
>gi|156059886|ref|XP_001595866.1| hypothetical protein SS1G_03956 [Sclerotinia sclerotiorum 1980]
gi|154701742|gb|EDO01481.1| hypothetical protein SS1G_03956 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 477
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 103/414 (24%), Positives = 187/414 (45%), Gaps = 60/414 (14%)
Query: 63 VDSFLNKFFPSGYPFSVNEGY---------LRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
V S F P GYP SV E Y L Y L F S +L+++++L G
Sbjct: 49 VSSLTGAFLPVGYPDSVTEDYTACVPCPIHLLYLLCEKLA-FASTIAGLLASRAVLQGLG 107
Query: 114 LRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEV 170
+ + A AT V+ +L++ + ++ ++ LG+ ++ E K++R++AD D L+
Sbjct: 108 VGDSTASATGAVLLNVLQESAGRIATILFAHRLGSALEPECKKYRLMADIFNDGAMILDC 167
Query: 171 LSPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTL 227
+SP P++FL A + K + VAA +++ + + FAK+GNL++L AK + TL
Sbjct: 168 ISPAFPKVPRVFLLSAS--SVCKSLCGVAAGSSKASLSAHFAKQGNLAELNAKDASQETL 225
Query: 228 FNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVA 287
+++G+ G + S + S + + LLS IH+ + +R+ + TLN QR ++++
Sbjct: 226 ISLLGMLVGTFVVSRISSQTATWIALISLLS-IHLGTNYLAVRSVTMRTLNRQRANLVIS 284
Query: 288 DFV------KTG-RISSPADLRYREDLLFPGRLIEDA-----GNVKVGRDPHKVV----- 330
DFV KT + +P D+ +E + I G KVG +++
Sbjct: 285 DFVSNINAEKTRFNLPTPKDISRKERIFERDGAIRHIQGVVLGYCKVGVSLREILSSISS 344
Query: 331 -----------KPSKFHELKELFAEEKFILCLGSRWTD--MVLEQNASGEDALRGWLVAA 377
S L +++ + +++ R ++L++N + L W A
Sbjct: 345 SPTVSGSYKEETSSIITSLFKIYKNQGYVMWYSRRQNTFLVILKENTAPSVQLDAWFHAV 404
Query: 378 YAASMA------KSFHDPSLTVLQDAYDKMNDVF-----TPLLSELQAKGWHTD 420
+A S++ S + +L+ D + D L EL+ KGW D
Sbjct: 405 WALSVSSSPAVGNSTNGSDQDILKWIEDSLRDSSLCKEKANLYEELRNKGWDLD 458
>gi|296411034|ref|XP_002835240.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295628015|emb|CAZ79361.1| unnamed protein product [Tuber melanosporum]
Length = 413
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 96/372 (25%), Positives = 173/372 (46%), Gaps = 46/372 (12%)
Query: 91 ALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARM 147
+LQ F+S+ +LS++++L G+ A AT ++ I +D + + ++ ++ G +
Sbjct: 6 SLQAFSSSIAGLLSSRAVLEGFGVGDANASATGALLLTIAQDCVSRLATILFAHRFGPAL 65
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQ----LFLEMAGLGNFAKGIAVVAARATRLP 203
++E KR+R+ AD D L+ LSP P+ L L +G A G+A ++A+ L
Sbjct: 66 EAETKRYRLAADVFNDSAMILDCLSPAFPRSVRILMLCCSGSLRAACGVAGGGSKAS-LS 124
Query: 204 IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVY 263
++ FA+ GN+ +L AK + T+ ++G+ AG + S + SS I LL +H+
Sbjct: 125 VH--FARSGNVGELNAKDSSQETVIGLMGMLAGSFVVSHI-SSTWATWISLTLLQAVHLG 181
Query: 264 SVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDA-----G 318
+ + + A + TLN QR ++ + F TG I +P E + ++ G
Sbjct: 182 TNFKAVTAVALRTLNRQRANIVCSAFFDTGEILTPRQAARHERIFEKDGVLRWGGSGVLG 241
Query: 319 NVKVGRDPHKVVK-----PSKFH----------ELKELFAEEKFILC--LGSRWTDMVLE 361
N ++G +++ P + EL +F EK++L + R + L+
Sbjct: 242 NAEIGVKLTALIRSLRQSPERKKEGEGEGVSPVELASVFQNEKYLLWYDVQRRLVTICLK 301
Query: 362 QNASGEDALRGWLVAAYAA---------SMAKSFHDPSLTVLQD----AYDKMNDVFTPL 408
+ S E+ L+ W+ A A A + + PS L D + D + FTP+
Sbjct: 302 EKVSPEEQLKSWVHALVLAREIRHHWPKEGADNSNRPSQQALLDTVRRSLDDIGMSFTPM 361
Query: 409 LSELQAKGWHTD 420
L L+ GW D
Sbjct: 362 LGRLKGAGWDLD 373
>gi|336273098|ref|XP_003351304.1| hypothetical protein SMAC_03608 [Sordaria macrospora k-hell]
gi|380092824|emb|CCC09577.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 498
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 71/253 (28%), Positives = 131/253 (51%), Gaps = 11/253 (4%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
+ + L+ F P+GYP SV YL Y + +LQ F S+ S+L+ +++L G+ + + T
Sbjct: 38 LQALLHAFLPAGYPHSVTPDYLPYQTYDSLQAFFSSITSLLANRAVLEGLGVGDSSSSPT 97
Query: 123 --VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL- 178
++ I D + + ++ ++ +G ++ E K +R LAD L D L++L+P P
Sbjct: 98 GALILKIAGDTISRIATILFAHRMGTAIEPECKFYRFLADLLNDSAQLLDLLTPALPYFP 157
Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
L + + + VAA A++ + + FA GNL++L AK + T+ +++ GI
Sbjct: 158 KLGVIVSAGVLRSLCGVAANASKASLSAHFALRGNLAELNAKEASQETVVSLL----GIL 213
Query: 239 LASTVCSSMQGKMIVGPL---LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ S V +Q + +V L L +H+ +RA TLN QR ++V +++ GR+
Sbjct: 214 VGSLVVKVVQDRGVVWALMVGLLGVHLLMNYWAVRAVRCRTLNRQRATVVVRGWLEWGRV 273
Query: 296 SSPADLRYREDLL 308
P ++ E +L
Sbjct: 274 LGPEEVARMESIL 286
>gi|297740212|emb|CBI30394.3| unnamed protein product [Vitis vinifera]
Length = 462
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 79/311 (25%), Positives = 141/311 (45%), Gaps = 35/311 (11%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWIL-- 128
P+G+P SV++ YL Y + + T+ L T SLL +A+ SW
Sbjct: 81 LPAGFPGSVSDDYLEYMLLQFPTNVTAWICHTLVTSSLL----------KASCWSWFFFS 130
Query: 129 --------------KDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP 173
+DG+ VG+L I G D +PK+WR+ AD + G+ ++ +
Sbjct: 131 NHCCCFCCCFCCCHQDGIGAVGRLFIGGQFGNLFDDDPKQWRMYADLIGSAGSIFDLSTQ 190
Query: 174 LCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
L P FL++A LGN AK +A + I + FA GNL ++ AK E ++GL
Sbjct: 191 LYPAYFLQLASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGL 250
Query: 234 GAGIQLAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
GI + ++ V +S + ++H++ + + ++N +R ++V V
Sbjct: 251 ALGIMVLTSPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVL 310
Query: 292 TGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV---KP-SKFHELKELFAEEKF 347
+ D +E +L R ++ + G +++ +P SK L +L+++EK+
Sbjct: 311 HSIVPGYVDCNRKEKILSWQRFLKP--RIIFGVSLEELIGGERPVSKVRTLLKLYSQEKY 368
Query: 348 ILCLGSRWTDM 358
IL + + TD
Sbjct: 369 ILVVKQQGTDF 379
>gi|224090990|ref|XP_002309136.1| predicted protein [Populus trichocarpa]
gi|222855112|gb|EEE92659.1| predicted protein [Populus trichocarpa]
Length = 500
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 98/386 (25%), Positives = 164/386 (42%), Gaps = 44/386 (11%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVS 125
P+G+P SV++ YL+Y + + T L T SLL A G A A +
Sbjct: 107 LPAGFPGSVSDDYLQYMVLQFPTNITGWICHTLVTSSLLKAVGAGSFTGTDAAASAAAIR 166
Query: 126 WILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
W+ KDG+ +G+L I G D +PK+WR+ AD + G+ ++ + + P FL +A
Sbjct: 167 WVSKDGIGALGRLFIGGRFGDLFDDDPKQWRMYADFIGSAGSIFDLTTQVYPAYFLPLAS 226
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
LGN K +A + I + FA GNL ++ AK E V +GA + +
Sbjct: 227 LGNLTKAVARGLKDPSFRVIQNHFAVSGNLGEVAAKEE-------VWEVGAQLLGLALGI 279
Query: 245 SSMQGKMIVG--PLLS-------VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ +V PLL+ ++H++ + + +T+N +R +++ + R+
Sbjct: 280 LILDTPDLVKSFPLLTSTWMSMRLLHLWLRYQSLSVLRFDTINLKRARILIKSHILQSRV 339
Query: 296 SSPADLRYREDLL----------FPGRLIED--AGNVKVGRDPHKVVKPSKFHELKELFA 343
D E++L G +E+ G V + H VV K L +L+A
Sbjct: 340 LGCVDCNQDENILSWETFTRPRIIFGVPLEEMIGGERSVAKIKHGVV--IKIKMLLKLYA 397
Query: 344 EEKFILCLGSRWTD----MVLEQNASGEDALRGWLVAAYAASMAKSF----HDPSLTVLQ 395
+EK L + + TD + + A+ LR + KSF D L
Sbjct: 398 KEKHFLVVNQQKTDFEVLVSFKVGATSMSVLRSVWQTYWLHENWKSFGNRSSDSDYDQLA 457
Query: 396 DAYDKMNDVFTPLLSELQAKGWHTDR 421
+ +M F + +L A GW T +
Sbjct: 458 QSLLEMEVKFEDFMQQLDAVGWDTSQ 483
>gi|167517110|ref|XP_001742896.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163779520|gb|EDQ93134.1| predicted protein [Monosiga brevicollis MX1]
Length = 272
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 98/185 (52%), Gaps = 2/185 (1%)
Query: 99 ALSVLSTQSLLFAAGLRPTPAQ-ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRI 156
A VLS Q+LL+AAGL A ++W+LKDG+ +G ++ L G R D +PKR R
Sbjct: 3 ASGVLSMQALLYAAGLGAGSVPLAAAINWVLKDGLGQLGGVLYGTLFGPRFDRDPKRQRF 62
Query: 157 LADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSD 216
+ E+L PL P LFL +A N AK I+ +A+ ATR +++SF + NL D
Sbjct: 63 WSLVALQGANLTELLIPLVPHLFLPLASAANIAKNISFLASSATRAQMHASFVRRANLGD 122
Query: 217 LFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNT 276
+ AK A S +VVG GI +++ S + + L++ +Y + ++T
Sbjct: 123 ITAKIGAQSIGASVVGTALGIFVSTHTGSDVTNIIPAFVPLALACLYFNHLSSKHVVLST 182
Query: 277 LNPQR 281
LN QR
Sbjct: 183 LNEQR 187
>gi|451846811|gb|EMD60120.1| hypothetical protein COCSADRAFT_151456 [Cochliobolus sativus
ND90Pr]
Length = 482
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 108/460 (23%), Positives = 200/460 (43%), Gaps = 66/460 (14%)
Query: 14 EPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRP--VYHRVVDSFLNKFF 71
E D++ V + Y +S S + +G K VD P + V+ ++ F
Sbjct: 13 EQDEAGKV--VRTYVLSSSSSAVGEHGKHGEG----KRVDVIVPKQAHKSVLQYIIDVFL 66
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWIL 128
PSG+P SV E Y+ Y F +LQ F+S+ +L+++++L G+ TP A ++S +L
Sbjct: 67 PSGFPQSVTEDYIEYQVFDSLQAFSSSIAGMLASRAVLQGVGVGDSNATPTAALLLS-VL 125
Query: 129 KDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLG 186
++ + + + G ++ E K +R+ AD L D L+ LSP P+ L + +
Sbjct: 126 QESVGRMATIAFADRFGTALEPECKMYRLTADILNDSAMILDCLSPGFPKPLRVAVLSFA 185
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
+ + + V A +++ + + FA GNL ++ AK + T+ +++G+ AG + S + S
Sbjct: 186 SCLRSLCGVCAGSSKASLSAHFATAGNLGEVNAKDSSQETVISLLGMLAGSFVVSHITSD 245
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ LL IH+ + +RA + +LN QR ++ + ++ G + +P ++ RE
Sbjct: 246 L-ATWCTLILLLAIHLATNYAAVRAVSMQSLNRQRANILFSHILQHGIVLNPKEVSARER 304
Query: 307 LLFPGRLI-----EDAGNVKVGRDPHKVV-----------------------KPSKFHEL 338
+ G ++ E G ++G +++ +L
Sbjct: 305 IFERGGVLRWADDEVLGKCRIGTSLSELLGRIGRRDKLSGSLAFCSSGGDDGSEVDVFDL 364
Query: 339 KELFAEEKFILCLGSRWTD---------MVLEQNASGEDALRGWLVAAYAASMAKSFHDP 389
+F E ++L W +VL+ D LR W A A + D
Sbjct: 365 LSIFEAEDYVL-----WATGGPAAYEAVIVLKAGCGPIDQLRAWAHALLVAEKRQRGGDS 419
Query: 390 SLT------VLQDAYDKMNDV---FTPLLSELQAKGWHTD 420
T V+ D D + +V F E++AKGW+ D
Sbjct: 420 GGTRAKHGLVVGDLKDTLYEVRRMFDKYGDEMRAKGWNLD 459
>gi|402074762|gb|EJT70271.1| hypothetical protein GGTG_12444 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 438
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 72/252 (28%), Positives = 133/252 (52%), Gaps = 15/252 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
F P+G+P SV + YL Y + +LQ F+S+ S+L+ +++L G+ +P A V+
Sbjct: 48 FLPAGFPHSVTDDYLAYQTYDSLQAFSSSIASLLANRAVLEGLGVGNASQSPTAALVLQ- 106
Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLG---TGLEVLSPLCPQLFLEM 182
IL+D + ++ ++ +G ++ E K +R +AD D L + P P++ + +
Sbjct: 107 ILQDTFSRLATILFAHRMGTAIEPECKTYRFMADLFNDASQLLDLLVPVLPFLPKVSVMV 166
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
+ + + VAA A++ + + FAK GNL++L AK + T+ ++VG+ G +
Sbjct: 167 SAA--ILRSLCGVAAGASKASLSAHFAKMGNLAELNAKEASQETVVSLVGMLVGTAVVHM 224
Query: 243 V--CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
V ++ MIV L IH+ +R + TLN QR ++ +F++TG++ +P
Sbjct: 225 VEDKRAVWCWMIV---LIAIHLMMNYRAVRCVKMLTLNRQRATIVFREFLETGKVLTPGQ 281
Query: 301 LRYREDLLFPGR 312
+ RE +L GR
Sbjct: 282 VAARESILRNGR 293
>gi|403418386|emb|CCM05086.1| predicted protein [Fibroporia radiculosa]
Length = 497
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 100/418 (23%), Positives = 179/418 (42%), Gaps = 67/418 (16%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVS 125
N F P+GYP SV+ YL+Y + ALQ F S+ S+++++++L G+ A AT +
Sbjct: 49 NVFLPAGYPASVSPDYLQYQIYNALQAFCSSLASLIASRAVLEGHGVGNADASATHAIFL 108
Query: 126 WILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE--- 181
+L+D + ++ LG + E K +R+LAD D L+ LSP + L
Sbjct: 109 TVLQDIFSRLTTIVSGYYLGTSLFPEAKTYRLLADIFNDAAIVLDTLSPHLDGVVLSLRY 168
Query: 182 ---MAGLGN-----------FAKGIAVVAARATRLPIYSSFAK----EGNLSDLFAKGEA 223
++G G+ ++ + + A ++ + FA G++ DL AK +
Sbjct: 169 PFVLSGSGSSLRVSALCLSGASRALCGMVAGGSKAALTVHFASPGQGTGDVGDLNAKDGS 228
Query: 224 ISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
T+ ++G+ +G + V S+ + +V +L+ +H++ +R + +LN QR
Sbjct: 229 KETVLALLGMLSGTAVMHYVHSA-RATYLVLFVLTFLHLFGNYLAVRVVVLRSLNRQRAN 287
Query: 284 MIVADFV---------KTGRISSPADLRYREDLLFPGRLIEDA--------GNVKVG--- 323
++ + K + +P + RE + + D G KVG
Sbjct: 288 IVWMAYRAHIKGPMGGKRPLVLTPEAVAARERIFANPSALHDTSSPAHPFLGICKVGSSF 347
Query: 324 -----------RDPHKVVKPSKFHELKELFAEEKFILCLG-----SRWTDMVLEQNASGE 367
R P + H + E+F+EE ++L G S + L+
Sbjct: 348 SSLASRQQSAPRGALAAPSPPQIHTILEIFSEEPYVLWFGPTSGPSCRLHVFLKDGHKPR 407
Query: 368 DALRGWLVAAYAASMAK-----SFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
D L+ W A AA M SF + +L +Q +Y + D+F P L ++ GW D
Sbjct: 408 DHLKAWTHAQEAAIMLNGRQPGSF-EVALAAVQTSYALVEDLFPPFLDAVRGVGWRID 464
>gi|295659580|ref|XP_002790348.1| DUF647 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226281800|gb|EEH37366.1| DUF647 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 485
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 83/337 (24%), Positives = 154/337 (45%), Gaps = 42/337 (12%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
+ S L F P+GYP SV++ Y+ Y + +LQ F+S+ +L+++++L G+ A T
Sbjct: 65 LTSLLIVFLPAGYPHSVSDDYMEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPT 124
Query: 123 V--VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-L 178
V + +L++ M + ++ ++ LG ++ E K +R+ AD L D +E LSP P+ L
Sbjct: 125 VALLHSVLQESMGRIATILFAHRLGTSLEPECKLYRLAADVLNDSAMVVECLSPAFPKHL 184
Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
+ + + + + VAA +++ + S FAK GNL +L A G + + + L G+
Sbjct: 185 RVVLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNACGSLVVSHIS-TPLTTGLT 243
Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
L L +IH+ + +RA + TLN QR ++ + + GR +P
Sbjct: 244 LI---------------FLLLIHLSTNYAAVRAVNMTTLNRQRANIVFSTLFEEGRALTP 288
Query: 299 ADLRYREDLLFPGRLIE------DAGNVKVGRDPHKVVK--------------PSKFHEL 338
E + ++ G ++G ++++ P L
Sbjct: 289 KQTSKCERIFERDGILRWKASSTTLGFCRIGIAFQELLRSSNVSCRANSIRDIPIDIPRL 348
Query: 339 KELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGW 373
LF +E++IL S+ +VL+ N L+ W
Sbjct: 349 LRLFEKEEYILWFNPVSKKGTIVLKNNVKPVSQLKAW 385
>gi|392564112|gb|EIW57290.1| DUF647-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 516
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 98/419 (23%), Positives = 168/419 (40%), Gaps = 37/419 (8%)
Query: 50 KVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL 109
K+ + + R+ F P+ YP SV+ YL + + + + SVL Q+LL
Sbjct: 95 KIAAPTNSLTGRISSWFGQMVLPTNYPHSVHRSYLPFHILQFFETTVATITSVLCNQALL 154
Query: 110 FAAGLRPTPA--QATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGT 166
+ G+ + A V WI+KDG V KL DS PK + + + LG+
Sbjct: 155 TSVGVSAEGSVFGAVAVQWIIKDGAGEVAKLFFIRRFSPYFDSHPKTFTFFGELMGCLGS 214
Query: 167 GLEVLSPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
GL++ + L P FL A GN K + T + F+++GN D+ AK E+
Sbjct: 215 GLQIATLLIAPSPGNFLLCAAGGNIFKIVGNAIWFTTHIKFMRYFSEQGNTGDVAAKDES 274
Query: 224 ISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
S++ +VG +GI L + V + + +H+ MR A L R +
Sbjct: 275 QSSIAQLVGYASGISLLA-VSHTAPYLYAIFAFAVPVHLAMTAYMMRVATFELLTLPRVS 333
Query: 284 MIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNV------KVGR--DPHKVVKPSKF 335
+ ++V G++SS +L G D + +VG D ++ ++
Sbjct: 334 CLAQEYVTRGQVSSLGELDAAHRTGLFGEFYRDKSDRWLTLAPRVGDVLDSGTELERVRW 393
Query: 336 HELKELFAEEKFILCLGSRWTDMVLEQNASGEDAL---------RGWLVAAYAASM---- 382
+F +++++L R + +A+ +D L R L+ A+S
Sbjct: 394 QICAGVFEDDRYLLFPRGRNISVFFHPDAANDDMLCAIYHAALVRDALINREASSTDVPP 453
Query: 383 AKSFHDPSLT--------VLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGT-GTRFAW 432
A+ +P+ T +L D + F L+A+ W TD G R W
Sbjct: 454 AEGLLEPTDTDSQAALKLILADTGARAERDFEVFREALEAREWRTDELCYADHGHRVTW 512
>gi|315056527|ref|XP_003177638.1| DUF647 domain-containing protein [Arthroderma gypseum CBS 118893]
gi|311339484|gb|EFQ98686.1| DUF647 domain-containing protein [Arthroderma gypseum CBS 118893]
Length = 495
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 99/434 (22%), Positives = 184/434 (42%), Gaps = 71/434 (16%)
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQ-FR---------ALQHFTSAALSVLS- 104
+P + S LN F P GYP SV + Y+ + FR + TS+ L +
Sbjct: 48 KPSSSSYLTSILNVFLPVGYPHSVTDDYIESSSLFRDYEKKELTSLISGLTSSILKLNCW 107
Query: 105 ---TQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADA 160
QS + +P A +++ +L++ M + ++ ++ LG ++ E K +R+ AD
Sbjct: 108 SPVIQSSVGVGDATASPTAALLLN-VLQESMGRIATILFAHRLGTSLEPECKMYRLAADL 166
Query: 161 LYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFA 219
L D L+ +SP P+ + + + L + + + VAA +++ + + FA+ GNL +L A
Sbjct: 167 LNDSAIVLDCMSPAFPKPVRVGLLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNA 226
Query: 220 KGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNP 279
K + T+ ++ G+ G + S + S + I LL ++H+ + +RA + TLN
Sbjct: 227 KDSSQETVISLFGMLVGSVVVSYISSPL-ATWIALILLLIVHLGTNHAAVRAVKMTTLNR 285
Query: 280 QRTAMIVADFVKTGRISSPADLRYREDLL-------FPGRLIEDAGNVKVG--------- 323
QR ++ + + R+ +PA E + + + G ++G
Sbjct: 286 QRANIVFSYLFEDDRVLTPAQASEEERIFETDGVLRWKAKATGTLGICQIGVSLEQLLLL 345
Query: 324 ---------------RDPHKVVKPSKFHELKELFAEEKFILCL--GSRWTDMVLEQNASG 366
D V + L ELF EE++IL +R +VL+ A+
Sbjct: 346 LPEQVNTNAQGSSQKPDTTMVGASTGLSALLELFKEEEYILYFSPAARRGAIVLKAGATP 405
Query: 367 EDALRGWLVAAYAAS-MAKSFHDPS-------------------LTVLQDAYDKMNDVFT 406
+ L+ W A + +A+ HD + L++L+D + F
Sbjct: 406 QAQLKAWSHALLVSRHLAREEHDTAACINEKQGGQGELSPPVSILSILRDTLREHTRTFV 465
Query: 407 PLLSELQAKGWHTD 420
+ L+ GW TD
Sbjct: 466 DRVKRLEHAGWQTD 479
>gi|401421841|ref|XP_003875409.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322491646|emb|CBZ26919.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 675
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 64/234 (27%), Positives = 111/234 (47%), Gaps = 7/234 (2%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
+R RVV + F P GYP SV E L + + +Q + LST+++L G+
Sbjct: 57 ARQALRRVVTALERIFLPDGYPSSVTEDLLPFAFWAHVQVLAKNVTATLSTRAVLLGVGV 116
Query: 115 RPTPAQAT--VVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVL 171
+ A T +SW+++DG + +G ++ S + ++ K W+++ D ++D LE+
Sbjct: 117 GESKADLTSSTLSWMMQDGARMIGSVVFASVIPQGLECRAKTWQLVTDFIHDSAHLLELC 176
Query: 172 SPLCP---QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
SP P +F + + K +A V TR P FA N +D+ AK +
Sbjct: 177 SPWLPGGQAIFRVVLVAASVIKALASVCGNGTRAPFTQHFALRNNGADIAAKAATRGHVG 236
Query: 229 NVVGLGAGIQLASTV-CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
++GL GI +A V +S + + V L S +H+++ +R + LN R
Sbjct: 237 GLIGLALGIVIAYIVPATSRRLNLAVFALCSAVHIFANYRGVRGVQLRHLNAPR 290
>gi|389639940|ref|XP_003717603.1| hypothetical protein MGG_09749 [Magnaporthe oryzae 70-15]
gi|351643422|gb|EHA51284.1| hypothetical protein MGG_09749 [Magnaporthe oryzae 70-15]
Length = 479
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 67/258 (25%), Positives = 134/258 (51%), Gaps = 11/258 (4%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTP 118
++ + F P+G+P +V E YL Y + +LQ F+S+ S+L+ +++L G+ +P
Sbjct: 41 LIKAIFYAFLPAGFPHTVTEDYLHYQIYDSLQAFSSSIASLLANRAVLEGLGVGDASQSP 100
Query: 119 AQATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLG-TGLEVLSPLCP 176
A V+ I++D + + ++ ++ +G ++ E K +R LAD D ++ L
Sbjct: 101 TAALVLK-IIQDTISRLATIMFAHRMGTAIEPECKSYRFLADIFNDAAQILDLLVPVLPL 159
Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
+ + + + VAA A++ + + FAK GNL++L AK + T+ ++VG+ G
Sbjct: 160 LPKVSVLVTSGCLRSLCGVAAGASKASLSAHFAKTGNLAELNAKEASQETVVSLVGMLTG 219
Query: 237 IQLASTV--CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
+ V +++ M+V + ++ Y + +R + TLN QR ++ +F++TG+
Sbjct: 220 TAVVHMVEDKTAVWCWMVVLICMHLMMNYRAVRNVR---MLTLNRQRATIVFREFLETGK 276
Query: 295 ISSPADLRYREDLLFPGR 312
+ +PA RE +L R
Sbjct: 277 VLTPAQCSARESILLNSR 294
>gi|303278700|ref|XP_003058643.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459803|gb|EEH57098.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 679
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 56/186 (30%), Positives = 95/186 (51%), Gaps = 16/186 (8%)
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
DS+ KR+R + +YD + +E+++P P FL +A + N K + + A R PI S
Sbjct: 314 DSDVKRFRFTSSVVYDCASLIEMITPFYPSKFLLLATIANVGKSVGITTANVVRAPIQMS 373
Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM------IVGPLLSV-- 259
FA E NL+++ AK A L + +GL + V + GK+ + PLL+
Sbjct: 374 FALEENLAEIAAKTSAQQVLADNIGLALAV-----VATGWTGKIANPRLRLAVPLLAFAP 428
Query: 260 ---IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
+ +Y + E+++ + T+N +R +I A FV+TG I S + E L P R+ E
Sbjct: 429 LASVDLYCIYRELKSVRLRTINKERGEIIAAGFVETGTIPSAVSVAAAERLFIPARMDES 488
Query: 317 AGNVKV 322
+ +K+
Sbjct: 489 SLPLKI 494
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 49/87 (56%), Gaps = 11/87 (12%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTP 118
VVD+FL P +V Y Y +FR Q S+ ++V +TQ +L A GL R P
Sbjct: 170 VVDAFL----PCAN--AVTPDYWAYAKFRFAQRVASSCITVFATQQMLQAVGLGATRRLP 223
Query: 119 AQATVVSWILKDGMQHVGKL-ICSNLG 144
A A V+W+LKDG+ +GKL + +N G
Sbjct: 224 A-AAAVNWVLKDGLGRLGKLSVATNFG 249
>gi|308804794|ref|XP_003079709.1| unnamed protein product [Ostreococcus tauri]
gi|116058166|emb|CAL53355.1| unnamed protein product [Ostreococcus tauri]
Length = 499
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 67/249 (26%), Positives = 109/249 (43%), Gaps = 21/249 (8%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ---ATVVSW 126
F PSG+P +V+ Y + ++ + ++S QSLL A G+ TP W
Sbjct: 120 FLPSGFPQTVSADYADWLRWHLTSLLFRDVIEIMSAQSLLVALGVGSTPGALPLTAAAKW 179
Query: 127 ILKDGMQHVGKLICSNLGAR-MDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
+LKDG+ L+ G R D +PKRW +++AL D+ +E+++P P LFL +A
Sbjct: 180 VLKDGVGSFATLLAGAFGGRWYDEDPKRWWAVSNALEDVARAIELVTPAAPGLFLPLAAS 239
Query: 186 GNFAKGIAVVAARATRL--PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
F + A + R + + F + NL D+ AK E +V L GIQ+ V
Sbjct: 240 ATFVRA-AALTGRGSLMNGTFMQHFGRNNNLGDIRAKLEVQGRWLALVALPVGIQVFQLV 298
Query: 244 CSSMQGKMIVGPLLSVI-------------HVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+ G L H++S + R+ ++LN R + +V
Sbjct: 299 NAQASALHAEGDELGAAAAAFGGYGLVIGAHIFSCWQAARSLKFDSLNGFRLLRLAEAYV 358
Query: 291 KT-GRISSP 298
+ G+I P
Sbjct: 359 ASDGKIELP 367
>gi|115434452|ref|NP_001041984.1| Os01g0141600 [Oryza sativa Japonica Group]
gi|54290177|dbj|BAD61065.1| unknown protein [Oryza sativa Japonica Group]
gi|113531515|dbj|BAF03898.1| Os01g0141600 [Oryza sativa Japonica Group]
Length = 323
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 95/192 (49%), Gaps = 17/192 (8%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
R + F+ P+G+P SV++ YL Y ++ + T L T +LL A G+
Sbjct: 101 RAIKDFV---LPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGT 157
Query: 121 ATV-----VSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
+ + W+ KDG+ G+L I G D +PK+WR+ AD + G+ E+++PL
Sbjct: 158 SAAASAAAIRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPL 217
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLP----IYSSFAKEGNLSDLFAKGEAISTLFNV 230
P FL +A LGN AK + R R P I + FA+ GNL ++ AK E +
Sbjct: 218 YPGYFLPLASLGNLAKAV----GRGFRDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQL 273
Query: 231 VGLGAGIQLAST 242
+GL G+ + T
Sbjct: 274 LGLSIGVFIMDT 285
>gi|393217163|gb|EJD02652.1| DUF647-domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 455
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 92/381 (24%), Positives = 164/381 (43%), Gaps = 36/381 (9%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F PSG+P SV Y+ Y F +LQ F+S+ +LS++++L F G + V+ +
Sbjct: 50 FLPSGFPDSVTPDYIGYQIFDSLQAFSSSIAGLLSSRAVLEGFGVGNENASSTHAVLLTV 109
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP-QLFLEMAGL 185
L+D + ++ + G + E K +R+ AD D L+ LSP P QL + L
Sbjct: 110 LQDATGRLATILFAWKFGPALAPEAKMYRLAADIFNDSAFVLDCLSPALPTQLRIAALCL 169
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
+ I V + + FA+ GN+ +L AK + T+ + G+ G + + + +
Sbjct: 170 AGALRAICGVCGGGAKAALSVHFARTGNVGELNAKDASQETVIGLFGMLCGSFVMAHI-T 228
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV-KTGRISSPADLRYR 304
S + I L H+ + +RA + TL+ QRT + + ++ + +P+++ R
Sbjct: 229 SRRTTWIALIFLLTAHLVTNYLAVRAVTLTTLSRQRTNIAYGVYCEQSNNVLTPSEVARR 288
Query: 305 EDLL-FPGRLIEDA------GNVKVGRDPHKVVKPSKFHELKE------LFAEEKFILCL 351
E + PG I D+ G + P ++++ S+ + F E ++L L
Sbjct: 289 ERIFEVPGA-IRDSLQGFLLGTCTICTSPEELLRRSRQGSMASWKNALVAFEHEAYVL-L 346
Query: 352 GSRW-----TDMVLEQNASGEDALRGWLVAAYAASMAKS----------FHDPSLTVLQD 396
G R + + A ED L+ W+ A +A S S + LQ
Sbjct: 347 GIRADFPCKILICFKHGAKPEDYLKAWVNAVELCRIAGSRLQVGQESNNVTPGSFSALQK 406
Query: 397 AYDKMNDVFTPLLSELQAKGW 417
+ + N F L + GW
Sbjct: 407 SLECTNTKFLDFLDKANKAGW 427
>gi|413949258|gb|AFW81907.1| hypothetical protein ZEAMMB73_025152 [Zea mays]
Length = 253
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 73/126 (57%), Gaps = 3/126 (2%)
Query: 65 SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +T++LL + G+ + A A
Sbjct: 120 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVSQSRAASGA 179
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
++WILKDG VGK+ + G + D + K+ R D L +LG G+E+ + PQ FL
Sbjct: 180 VAINWILKDGAGRVGKMFFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQFFLP 239
Query: 182 MAGLGN 187
MA + N
Sbjct: 240 MACIAN 245
>gi|440465939|gb|ELQ35234.1| hypothetical protein OOU_Y34scaffold00720g28 [Magnaporthe oryzae
Y34]
gi|440485195|gb|ELQ65173.1| hypothetical protein OOW_P131scaffold00517g6 [Magnaporthe oryzae
P131]
Length = 431
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 67/250 (26%), Positives = 131/250 (52%), Gaps = 11/250 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
F P+G+P +V E YL Y + +LQ F+S+ S+L+ +++L G+ +P A V+
Sbjct: 40 FLPAGFPHTVTEDYLHYQIYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPTAALVLK- 98
Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLG-TGLEVLSPLCPQLFLEMAG 184
I++D + + ++ ++ +G ++ E K +R LAD D ++ L + +
Sbjct: 99 IIQDTISRLATIMFAHRMGTAIEPECKSYRFLADIFNDAAQILDLLVPVLPLLPKVSVLV 158
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV- 243
+ + VAA A++ + + FAK GNL++L AK + T+ ++VG+ G + V
Sbjct: 159 TSGCLRSLCGVAAGASKASLSAHFAKTGNLAELNAKEASQETVVSLVGMLTGTAVVHMVE 218
Query: 244 -CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
+++ M+V + ++ Y + +R + TLN QR ++ +F++TG++ +PA
Sbjct: 219 DKTAVWCWMVVLICMHLMMNYRAVRNVR---MLTLNRQRATIVFREFLETGKVLTPAQCS 275
Query: 303 YREDLLFPGR 312
RE +L R
Sbjct: 276 ARESILLNSR 285
>gi|125568971|gb|EAZ10486.1| hypothetical protein OsJ_00318 [Oryza sativa Japonica Group]
Length = 459
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 96/198 (48%), Gaps = 19/198 (9%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----R 115
R + F+ P+G+P SV++ YL Y ++ + T L T +LL A G+
Sbjct: 118 RAIKDFV---LPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGT 174
Query: 116 PTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
A A + W+ KDG+ G+L I G D +PK+WR+ AD + G+ E+++PL
Sbjct: 175 SAAASAAAIRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPL 234
Query: 175 CPQLFLEMAGLGNFAKGI------AVVAARATRLP----IYSSFAKEGNLSDLFAKGEAI 224
P FL +A LGN AK I R R P I + FA+ GNL ++ AK E
Sbjct: 235 YPGYFLPLASLGNLAKKIIDSGLLKQAVGRGFRDPSFRVIQNHFAESGNLGEVAAKEEVW 294
Query: 225 STLFNVVGLGAGIQLAST 242
++GL G+ + T
Sbjct: 295 EVGAQLLGLSIGVFIMDT 312
>gi|67972176|dbj|BAE02430.1| unnamed protein product [Macaca fascicularis]
Length = 326
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 74/299 (24%), Positives = 135/299 (45%), Gaps = 26/299 (8%)
Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLP 203
G+++D K+WR+ AD L D+ LE+++P+ P F N AK I VA ATR
Sbjct: 12 GSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAA 71
Query: 204 IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVY 263
+ A+ N++D+ AK + TL N+ GL + + V + L+ +H+Y
Sbjct: 72 LTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIY 131
Query: 264 SVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG 323
+ +R+ + TLN R +++ +++ G + P R + L+ G A ++ +G
Sbjct: 132 ANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RMEPLWTG--FWPAPSLSLG 188
Query: 324 RDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR----GWLV 375
H++V S EL++L +E ++LC +VL Q A + LR G ++
Sbjct: 189 VPLHRLV--SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTVLRAATHGLML 246
Query: 376 AAYAA-------------SMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
A + S ++++ + ++ +F L LQ GW T++
Sbjct: 247 GALQGDGPLPAELEELRNQVRAGPKKESWVIVKETHQVLDMLFPKFLKGLQDAGWKTEK 305
>gi|225683629|gb|EEH21913.1| DUF647 domain-containing protein [Paracoccidioides brasiliensis
Pb03]
Length = 509
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 83/346 (23%), Positives = 161/346 (46%), Gaps = 40/346 (11%)
Query: 65 SFLNKFFPSGYPFSVNEGYL-----------RYTQFRALQHFTSAALSVLSTQSLLFAAG 113
S L F P+GYP SV++ Y+ +Y +LQ F+S+ +L+++++L G
Sbjct: 67 SLLIVFLPAGYPHSVSDDYMDLLFLALENDRKYKD--SLQAFSSSIAGLLASRAVLQGVG 124
Query: 114 LRPTPAQATV--VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEV 170
+ A TV + +L++ + + ++ ++ LG ++ E K +R AD L D ++
Sbjct: 125 VGDASASPTVALLHSVLQESIGRIATILFAHRLGTSLEPECKLYRFAADVLNDSAMVVDC 184
Query: 171 LSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
LSP P+ L + + + + + VAA +++ + S FAK GNL +L AK + T+ +
Sbjct: 185 LSPAFPKHLRVVLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNAKDSSQETVIS 244
Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
++G+ G + S + + + + + LL +IH+ + +RA + TLN QR ++ +
Sbjct: 245 LMGMLCGSLVVSHISTPLTTGLALIFLL-LIHLSTNYAAVRAVNMTTLNRQRANIVFSTL 303
Query: 290 VKTGRISSPADLRYREDLLFPGRLIE------DAGNVKVGRDPHKVV------------- 330
+ GR +P E + ++ G ++G +++
Sbjct: 304 FEEGRALTPKQTSKCERIFERDGILRWKASSTTLGFCRIGNSFQELLCGSNGSYRANSIR 363
Query: 331 -KPSKFHELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGW 373
P L LF +E++IL SR +VL+ + L+ W
Sbjct: 364 DAPIDIPRLLRLFEKEEYILWFNPVSRRGTIVLKNDVRPVSQLKAW 409
>gi|452842814|gb|EME44750.1| hypothetical protein DOTSEDRAFT_130464 [Dothistroma septosporum
NZE10]
Length = 447
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 95/392 (24%), Positives = 181/392 (46%), Gaps = 48/392 (12%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
F P G+P SV YL Y + +LQ F+S+ S+L+++++L + G+ +P A ++S
Sbjct: 47 FMPVGFPNSVTHDYLEYQIYDSLQAFSSSIASLLASRAVLSSVGVGDADASPTAAILLSI 106
Query: 127 ILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGL 185
+ L LG + E K +R+LAD L D G L+++S P+ + + +
Sbjct: 107 FQTSAGRLATILFAHRLGTSLGPECKMYRLLADILNDTGFLLDMISVAFPKPISVIILSF 166
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
+ + + VAA + + + + FA+ GNL++L AK + T+ +++G+ AG + S +
Sbjct: 167 SSILRSLCGVAAGSAKASLSAHFARWGNLAELNAKDSSQETVISLLGMWAGSVVISYITE 226
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
+ LL+ IH+ + +RA + TLN QR A ++ ++ G + +P ++ +E
Sbjct: 227 PTAAWTTLIVLLA-IHLETNRRAVRAVTMQTLNRQR-ATLMFHHLQRGHVPTPVEIASQE 284
Query: 306 -----DLLFPGRLIEDAGNVKVGRDPHKVVK------------PSK---FHELKELFAEE 345
D + G G+ VG +++K PS+ + ++ +
Sbjct: 285 RIFERDGVLRGNDDRILGHCAVGASTERLLKAVSGAQTSTKAFPSQGPAMQSILTVYEKS 344
Query: 346 KFILCLGSRWTD-----------MVLEQNASGEDALRGW---LVAAYAASMAKSFHD--- 388
+IL W D +VL++ A+ + L W L A A ++ KS
Sbjct: 345 PYIL-----WQDRRSLPGRSSLFIVLKKGATPPEMLTAWWQALAVASANNIKKSTQPDQY 399
Query: 389 PSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
L +++ D+ ++ + L+ KGW D
Sbjct: 400 AELELMRSTMDQAANLLKQHENALKEKGWDLD 431
>gi|167526040|ref|XP_001747354.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774189|gb|EDQ87821.1| predicted protein [Monosiga brevicollis MX1]
Length = 283
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 94/184 (51%), Gaps = 1/184 (0%)
Query: 127 ILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
IL+D +G + + + G+ +D+ K+WR++AD L D LE++SPL P FL + +
Sbjct: 4 ILRDATGMIGSISFAWIQGSDLDNNAKQWRLVADVLNDTAMLLELISPLLPAYFLFLVCV 63
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
+ A+ I VA ATR + A+ N+ D+ AK + TL N+V L + L V
Sbjct: 64 ASLARAIVGVAGGATRAALIQHQARNNNMGDVAAKDGSQETLVNLVALLVNLILVPLVSQ 123
Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
+ Q + L + H++ + +R+ + LN +R +IV + + +I SP RE
Sbjct: 124 NAQVVWSLFGLSTCCHIFFNYKAVRSLCMTRLNGRRAHLIVEHYTRHQQILSPKIANQRE 183
Query: 306 DLLF 309
L F
Sbjct: 184 PLFF 187
>gi|413956563|gb|AFW89212.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
Length = 263
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 55/172 (31%), Positives = 90/172 (52%), Gaps = 3/172 (1%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ R+ L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+
Sbjct: 50 WERIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQS 109
Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A W L+D +G ++ + G+ +DS K WR++AD + DLG +++LSPL
Sbjct: 110 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 169
Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTL 227
P + + LG+ ++ VA+ ATR + FA N +D+ AK + TL
Sbjct: 170 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETL 221
>gi|226292974|gb|EEH48394.1| DUF647 domain-containing protein [Paracoccidioides brasiliensis
Pb18]
Length = 495
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 81/335 (24%), Positives = 151/335 (45%), Gaps = 42/335 (12%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV- 123
S L F P+GYP SV++ Y+ Y + +LQ F+S+ +L+++++L G+ A TV
Sbjct: 67 SLLIVFLPAGYPHSVSDDYMEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVA 126
Query: 124 -VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFL 180
+ +L++ + + ++ ++ LG ++ E K +R AD L D ++ LSP P+ L +
Sbjct: 127 LLHSVLQESIGRIATILFAHRLGTSLEPECKLYRFAADVLNDSAMVVDCLSPAFPKHLRV 186
Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
+ + + + VAA +++ + S FAK GNL +L A G + + + L G+ L
Sbjct: 187 VLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNACGSLVVSHIS-TPLTTGLALI 245
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
L +IH+ + +RA + TLN QR ++ + + GR +P
Sbjct: 246 ---------------FLLLIHLSTNYAAVRAVNMTTLNRQRANIVFSTLFEEGRALTPKQ 290
Query: 301 LRYREDLLFPGRLIE------DAGNVKVGRDPHKVV--------------KPSKFHELKE 340
E + ++ G ++G +++ P L
Sbjct: 291 TSKCERIFERDGILRWKASSTTLGFCRIGNSFQELLCGSNGSYRANSIRDAPIDIPRLLR 350
Query: 341 LFAEEKFILCLG--SRWTDMVLEQNASGEDALRGW 373
LF +E++IL SR +VL+ + L+ W
Sbjct: 351 LFEKEEYILWFNPVSRRGTIVLKNDVRPVSQLKAW 385
>gi|428168470|gb|EKX37414.1| hypothetical protein GUITHDRAFT_89666 [Guillardia theta CCMP2712]
Length = 331
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 55/165 (33%), Positives = 85/165 (51%), Gaps = 3/165 (1%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F PSGYP S+ + Y +Y F LQ +S +LSTQ +L G + A + W+
Sbjct: 130 FLPSGYPASLPDEYTKYQAFNLLQDLSSNLRGILSTQKILEGMGVGKAGVTSLAATMQWM 189
Query: 128 LKDGMQHVGKLICSNLGAR-MDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+DG +G L+ + + + KRWR+ AD + D+ L++L+P P LFL + L
Sbjct: 190 ARDGSSMLGGLLFTAFASSSFGIDIKRWRLFADLINDVALLLDMLAPAYPDLFLPIICLS 249
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
+ K + VAA A I +A GN++D+ AK A T +V+
Sbjct: 250 SVFKAMCGVAAGACNTAIVQHWAARGNIADVGAKTGAQHTFVSVL 294
>gi|452823886|gb|EME30893.1| hypothetical protein Gasu_19060 [Galdieria sulphuraria]
Length = 346
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 65/233 (27%), Positives = 110/233 (47%), Gaps = 17/233 (7%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWIL- 128
F PSGYP SV + Y + ++ +LQ S V+ T+S L G+ A T + +L
Sbjct: 68 FLPSGYPESVTDDYANFEKWDSLQGLCSYLRQVMVTRSTLSGLGVGNNFAHPTAAALMLL 127
Query: 129 -KDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGL 185
++G VG L+ + +R D + K WR+ AD + ++ +E +PL P +F +
Sbjct: 128 WREGFGVVGGLLFTWGKSSRFDKDSKSWRLFADVINNVALTMEWTAPLFPTPVFFALHSS 187
Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC- 244
N + +A AT + FAK N+ D+ +K I ++VGL L ST+C
Sbjct: 188 ANIFHSLCGIAGGATCTAFQAHFAKRNNIGDICSKHGNIGRAVSLVGL-----LLSTLCF 242
Query: 245 -------SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+S+ V L+V H+Y I +R+ + LN +R ++ ++
Sbjct: 243 WSSSLLENSLPFSFGVFCFLTVFHIYFNILALRSLHLAVLNKERGYIVAEHYI 295
>gi|259484769|tpe|CBF81274.1| TPA: DUF647 domain protein (AFU_orthologue; AFUA_6G06850)
[Aspergillus nidulans FGSC A4]
Length = 564
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 149/301 (49%), Gaps = 20/301 (6%)
Query: 64 DSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQ 120
D + F P+GYP SV++ Y Y F +LQ F+S+ +L+++++L G+ +P
Sbjct: 67 DVLIEVFLPAGYPHSVSDDYTAYQIFDSLQAFSSSIAGLLASRAVLQGVGVGNANASPTS 126
Query: 121 ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
A ++ IL+D + ++ ++ +G ++ E K +R AD DL L+ SP+ P F
Sbjct: 127 ALLLH-ILQDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCFSPMVPAGF 185
Query: 180 LEMAGL--GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
+ L + + VA +++ + + FA+ GNL+++ AK + T+ +++G+ G
Sbjct: 186 NRVTVLSAAGVLRALCGVAGGSSKASLSAHFARWGNLAEVNAKDSSQETVISLIGMLVGS 245
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
+ S + S + + LLS +H+ +R+ + TLN QR A+ V + +SS
Sbjct: 246 FVVSHITSFTATWLTLVFLLS-MHLSLNYAAVRSVQMTTLNRQR-----ANIVFSTLLSS 299
Query: 298 PADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELK-ELFAEEKFILCLGS--R 354
DL + + P R E ++K H+ P+ + E A+E+ I G+ +
Sbjct: 300 DPDL--ADFVSSPSRKDERRASLKQQEHTHQ--NPNHIRPITPEQVAKEERIFAAGAALK 355
Query: 355 W 355
W
Sbjct: 356 W 356
>gi|255074355|ref|XP_002500852.1| predicted protein [Micromonas sp. RCC299]
gi|226516115|gb|ACO62110.1| predicted protein [Micromonas sp. RCC299]
Length = 561
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 91/184 (49%), Gaps = 7/184 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ---ATVVSW 126
F P+GYP +V+ Y + ++ L V++ QSLL A G+ TP W
Sbjct: 126 FLPAGYPDTVSPDYAPFIRWHLGSLMFRNVLEVITAQSLLVALGMGSTPGALPLTAATKW 185
Query: 127 ILKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
+LKDG+ + L +LG + D +PKRW ++++A D+ +E+++P P LFL +A
Sbjct: 186 VLKDGIGSLATLAAGSLGGQKCDEDPKRWWMVSNAFEDVARVIELVTPAAPALFLPLAAT 245
Query: 186 GNFAKGIAVVAARATRL--PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
F + A + R + + + FA+ N +D+ A+ E +V L GI + TV
Sbjct: 246 ATFVR-TAALTGRGSLVNGSLMQHFARNENTADVRARLEVQGRWLALVALPVGIGIFRTV 304
Query: 244 CSSM 247
S
Sbjct: 305 SQSF 308
>gi|403412621|emb|CCL99321.1| predicted protein [Fibroporia radiculosa]
Length = 4586
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 114/447 (25%), Positives = 189/447 (42%), Gaps = 60/447 (13%)
Query: 22 QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLN----KFFPSGYPF 77
+E + W E+ +T+ + + G L + + + + +V F + F P+ YP
Sbjct: 68 RECHVSWTESG--ITKEWH-EMSGALQERGEEQTTSTTNSIVGKFTSWLRQMFLPTNYPQ 124
Query: 78 SVNEGYLRYTQFRALQHFTS---AALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGM 132
SV+ YLR F ALQ F + +SVL Q+LL + G+ + A V WI+KDG
Sbjct: 125 SVHPSYLR---FHALQFFETTFGTVVSVLCNQALLTSVGVSAEGSIFGAVAVQWIIKDGA 181
Query: 133 QHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL---CPQLFLEMAGLGNF 188
V KL DS PK + + + + +G+ L++ + L P FL A GN
Sbjct: 182 GEVAKLFFIRRFSPYFDSHPKSFTLFGEGIVAIGSCLQMATLLITPTPGNFLLCAAGGNI 241
Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
K + T + F+ +GN+ D+ AK E+ +++ + G AGI L TV S
Sbjct: 242 FKLVGYAVWFTTHIKWVRYFSLQGNVGDVAAKDESQTSIAQLCGYAAGIGLL-TVSHSPA 300
Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK------TGRISSPADLR 302
+ L+ +H+ + + MR A L R +++ ++ + TG++++ LR
Sbjct: 301 YLYSLFFALTPLHLTATMFMMRDAKFEALTLPRLSLLAREYAREDGEAATGKVNT---LR 357
Query: 303 YREDLLFPG-----------RLIEDAGNVK--VGRDPHKVVKPSKFHELKELFAEEKFIL 349
E+ G R +E A V +G D + +P + F EK++L
Sbjct: 358 EIEESRQTGAFGEFFKRKNDRFVELAPRVTEVIGGDVDR--EPVLWEMCTSEFQTEKYLL 415
Query: 350 CLGSRWTD----MVLEQNASGEDALRGWLVAA-----YAASMAKSFHDP------SLTV- 393
+ ++ + +A+ +D LR L AA S S P L V
Sbjct: 416 YPSASPSEKPITVFYHPDATSDDTLRSILHAARFRHDLLESRLSSLDSPWSSRTGELQVA 475
Query: 394 LQDAYDKMNDVFTPLLSELQAKGWHTD 420
L ++ F EL KGW TD
Sbjct: 476 LASSHAWTRQHFGAFKQELDDKGWRTD 502
>gi|157869108|ref|XP_001683106.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68223989|emb|CAJ04889.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 678
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 62/234 (26%), Positives = 110/234 (47%), Gaps = 7/234 (2%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
+R RVV + P GYP SV E L + + Q + LST+++L G+
Sbjct: 57 ARQARRRVVTTLERILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGV 116
Query: 115 RPTPAQAT--VVSWILKDGMQHVGKLICSNLGAR-MDSEPKRWRILADALYDLGTGLEVL 171
+ A T +SW+++DG + +G ++ + L + ++ K W+++AD D+ LE+
Sbjct: 117 GESKADLTSATLSWMMQDGTRMIGSVVFAPLIPQGLECRAKTWQLVADFTTDIANLLELC 176
Query: 172 SPLCP---QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
+P P F + + K + V TR FA N +D+ AK S++
Sbjct: 177 APWLPGGQTTFRVVLVAASVVKALGSVCRNCTRASFTRHFALRNNAADIAAKASTRSSVG 236
Query: 229 NVVGLGAGIQLASTV-CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
+ +GL G+ +A V +S ++V L SV+H+++ +R + LN R
Sbjct: 237 SFIGLVLGVAVAYIVPATSRSLNLVVFALGSVLHIFASYRSVRGVQLKHLNAPR 290
>gi|444725757|gb|ELW66311.1| hypothetical protein TREES_T100000484 [Tupaia chinensis]
Length = 406
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 87/365 (23%), Positives = 156/365 (42%), Gaps = 46/365 (12%)
Query: 83 YLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLIC 140
+ R+T R F S+ L+TQ++L G+ A A +W++KD +G+++
Sbjct: 41 WWRFT--RPFAAFASSLSGSLATQAVLLGIGVGNARASVSAATATWLVKDSTGMLGRIVF 98
Query: 141 SNLGARMDSEPKRWR-ILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARA 199
+ W+ + AD L D+ LE+++P+ P F N AK + VA A
Sbjct: 99 A-----------WWKGLFADILNDVAMFLEIMAPIYPIFFTMTICTSNLAKCVVSVAGGA 147
Query: 200 TRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSV 259
TR + A+ N++D+ AK + TL N+ GL + + V + LL+
Sbjct: 148 TRAALTMHQARRNNMADVAAKDGSQETLVNLAGLLVSLLMLPLVSDFPSVSLGCFFLLTA 207
Query: 260 IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL--FPGRLIEDA 317
+H+Y+ +RA + TLN R +++ F++TG + P E L F L
Sbjct: 208 LHIYANYRAVRALVMETLNEGRLRLVLKHFLRTGNVLDPTSANQMEPLWTGFCPSL---- 263
Query: 318 GNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALRGW 373
++ +G H+++ S +L++L +E ++LC +VL Q A E LR
Sbjct: 264 -SLSLGVPLHRLI--SSVSDLQQLVEGHQEPYLLCWDQSRNQVQVVLSQKAGPETVLRAA 320
Query: 374 LVAAYAASMAKSFHDP-----------------SLTVLQDAYDKMNDVFTPLLSELQAKG 416
++ P S +++ + ++ +F L LQ G
Sbjct: 321 THGLVLGTLQGDGPLPGELEELRNWVRAGPEKDSRVIVRKMHQVLDTLFPKFLKGLQDAG 380
Query: 417 WHTDR 421
W T++
Sbjct: 381 WKTEK 385
>gi|449690512|ref|XP_002160836.2| PREDICTED: UPF0420 protein-like, partial [Hydra magnipapillata]
Length = 172
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 78/128 (60%), Gaps = 3/128 (2%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
F N F P G+P SV+E YLRY + LQ F S+ +L+TQ++L A G+ A
Sbjct: 45 FQNVFLPQGFPESVSEDYLRYQLWDTLQAFCSSITGMLATQAMLKAYGVGDNTATVTAAT 104
Query: 124 VSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
++W+LKDG +G+++ + G ++D + KRWR++AD L + L++++PL Q F+ +
Sbjct: 105 ITWMLKDGAGMIGRILFAWKKGTQLDCDAKRWRLVADVLNNCAIMLDLVAPLFKQYFVLI 164
Query: 183 AGLGNFAK 190
A L + A+
Sbjct: 165 ACLSSVAR 172
>gi|302755442|ref|XP_002961145.1| hypothetical protein SELMODRAFT_402795 [Selaginella moellendorffii]
gi|300172084|gb|EFJ38684.1| hypothetical protein SELMODRAFT_402795 [Selaginella moellendorffii]
Length = 343
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 77/277 (27%), Positives = 125/277 (45%), Gaps = 38/277 (13%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
F P GYP SV+ YL + + LQ ++ S+LSTQ+LL G+ A W
Sbjct: 54 FLPEGYPSSVSSDYLAFQTWDTLQGLSTYIRSMLSTQALLSGIGVGAATATAIGATFQWF 113
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L+D VG ++ + G+ +D+ K+WR+ AD + D
Sbjct: 114 LRDLTGMVGGIVFTLYQGSNLDNRAKQWRLAADFMND----------------------- 150
Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
IA VA ATR + FA++ N +D+ AK + T +VG+ G+ LA ++
Sbjct: 151 -----IAGVAGGATRAALTQHFARKQNAADVSAKEGSQETAATLVGMIVGMFLARLTANN 205
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
+ L+ H+Y+ + + A + ++N +R A+++ F+K G+ SP + +E
Sbjct: 206 IFLMWTSFMSLTAFHMYANYKAVCALCLTSINAERMAIVLQSFLKDGKAPSPEEASAQES 265
Query: 307 LL---FPGR--LIEDAGNVKVGRDPHKVVK--PSKFH 336
L F R +I A + D KV P KFH
Sbjct: 266 ALPRPFTSRSGIIFGAKISSLCGDGRKVSTSLPKKFH 302
>gi|317037751|ref|XP_001399065.2| hypothetical protein ANI_1_1452164 [Aspergillus niger CBS 513.88]
Length = 525
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/239 (26%), Positives = 123/239 (51%), Gaps = 12/239 (5%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ- 120
V D + F P+GYP SV + Y Y F +LQ F+S+ +LS++++L G+ A
Sbjct: 68 VQDFLVEVFLPAGYPHSVTDDYAPYQLFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASP 127
Query: 121 -ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ- 177
A ++ IL+D + ++ ++ +G ++ E K +R+ AD DL L+ LSP+ P
Sbjct: 128 TAALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMILDCLSPMIPAG 187
Query: 178 ----LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
L AG+ + + VA +++ + + F++ GNL+++ AK + T+ +++G+
Sbjct: 188 APRVTVLSTAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGM 244
Query: 234 GAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
G + S V ++ I +L +H+ +R+ + +LN QR ++ + + T
Sbjct: 245 LVGSFVVSRV-TNFSTTWICLLMLQALHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNT 302
>gi|310800465|gb|EFQ35358.1| hypothetical protein GLRG_10502 [Glomerella graminicola M1.001]
Length = 416
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/256 (30%), Positives = 126/256 (49%), Gaps = 11/256 (4%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVS 125
+ F P GYP SV YL Y F +LQ F S +LS +++L G+ + AT ++
Sbjct: 7 DAFLPIGYPDSVTPDYLGYQTFDSLQAFFSTITGLLSNRAILQGLGVGDPNSSATYALLL 66
Query: 126 WILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMA 183
ILKDG+ V ++ + G ++ E KR+R LAD D L + SP P +
Sbjct: 67 TILKDGISRVATIVFAYRFGLVIEPECKRYRFLADIFNDSAFFLNLFSPYFDPWTKVGAL 126
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
L + VAA A++ + FA+ NLS+L AK + T +VGL G + V
Sbjct: 127 VLAEALGAMCGVAAGASKAALSKHFARRNNLSELNAKEASQETAVGLVGLLVGSVVVRFV 186
Query: 244 CSSMQGKMIVGPLLS---VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
+G+ V L++ +H+ +R + TLN QR ++ + +T ++ +PA
Sbjct: 187 ----EGREAVFALMTFLVFVHLGMNYLGVRCVQLATLNQQRATILFEHYARTKQVLTPAQ 242
Query: 301 LRYREDLLFPGRLIED 316
+ RE++LF +I++
Sbjct: 243 VASRENILFWSPVIKN 258
>gi|328773288|gb|EGF83325.1| hypothetical protein BATDEDRAFT_85860 [Batrachochytrium
dendrobatidis JAM81]
Length = 434
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 104/433 (24%), Positives = 181/433 (41%), Gaps = 53/433 (12%)
Query: 41 FQPDGHLSVKVVDDSRPVYHRVVDS-FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAA 99
+ D H S D S + V+ F N F P GYP SV+ Y + + ++ +A
Sbjct: 10 YASDRHPSTTRSDQSHIRFLGHVNQWFRNMFLPVGYPSSVHACYKKVHMWLFAENVAGSA 69
Query: 100 LSVLSTQSLLFAAG----LRPTPAQATVVSWILKDGMQHVGK-LICSNLGARMDSEPKRW 154
+SVL+ Q++L + G T A A V W+LKDG+ +GK L+ + D+ PKRW
Sbjct: 70 ISVLTAQAMLTSIGASTAAHETAALAIAVDWVLKDGIGELGKMLMIQRFAHQFDTHPKRW 129
Query: 155 RILADALYDLGTGLEVLSPLC-PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG- 212
++ +A G +++ + + P FL A +G + + AT + A
Sbjct: 130 KLYGEACSISGALMQLATCITNPHHFLFFASIGVGLRSMHYSIWAATHTTFTRNMATHNG 189
Query: 213 -NLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVG--PLLSVIHVYSVIEEM 269
N+ D+ AK ++ +L +++G+ +GI + + S Q + +LS I + +
Sbjct: 190 VNVGDIVAKADSQLSLAHLLGMVSGIGMLAV---SFQPAALFAWFSVLSAIQIVCTGLLL 246
Query: 270 RAAPVNTLNPQRTAMIVADFVKTG---RISSPADLRYREDL----LFPGRLIEDAGNVKV 322
R A L+ R +I +F+ RI + + E+ L G+++ D + +
Sbjct: 247 REARFEVLDQYRLVLIPREFMSEKNALRIPTLDQVVRFENWLNEGLSKGQVVTD---LSI 303
Query: 323 GRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTD---------MVLEQNASGEDALRGW 373
G K S+ ++F K++ LG + D +VL Q+ ++
Sbjct: 304 GVTVDDAFKHSRLSHALQIFKNGKYL--LGFKLADSTHSQIKVYVVLHQSIQPVHVIQA- 360
Query: 374 LVAAYAASMAK-------------SFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
A +A K S HD +L+ + D F L L+ + W TD
Sbjct: 361 --AFHAVRFEKEIIRQTQSTGAPISLHDAE-NILESSLAWTLDNFPAYLDGLEEQKWKTD 417
Query: 421 RFLDG-TGTRFAW 432
+ G G R W
Sbjct: 418 LIIWGDRGIRVVW 430
>gi|223974077|gb|ACN31226.1| unknown [Zea mays]
Length = 207
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 87/164 (53%), Gaps = 3/164 (1%)
Query: 67 LNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVV 124
L F P G+P SV Y+ + + LQ ++ ++LSTQ+LL A G+ A
Sbjct: 2 LGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGATF 61
Query: 125 SWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
W L+D +G ++ + G+ +DS K WR++AD + DLG +++LSPL P + +
Sbjct: 62 QWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIM 121
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTL 227
LG+ ++ VA+ ATR + FA N +D+ AK + TL
Sbjct: 122 CLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETL 165
>gi|302837684|ref|XP_002950401.1| hypothetical protein VOLCADRAFT_90737 [Volvox carteri f.
nagariensis]
gi|300264406|gb|EFJ48602.1| hypothetical protein VOLCADRAFT_90737 [Volvox carteri f.
nagariensis]
Length = 500
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 123/235 (52%), Gaps = 16/235 (6%)
Query: 92 LQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLICSNL-GARMD 148
+Q +S VL++Q++L G+ + A V ++ ++D +G ++ S L G+ D
Sbjct: 10 IQALSSYVRGVLTSQAILRGVGVGQEASTPLAAVFTFFVRDLAGMLGGILFSYLEGSSFD 69
Query: 149 SEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
+ K+WR+ AD DLG +E+ SPL P+ +FL +A LG+ A+ + VA ATR +
Sbjct: 70 ACAKQWRLFADITNDLGMTVELASPLFPRVMFLPIACLGSVARSLTGVAGGATRAALTQH 129
Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS--------SMQGKMIVGPL--- 256
FA+ GN +D+ AK ++ T ++G+ G+ + + + + ++ L
Sbjct: 130 FARRGNAADVSAKEQSQETATTILGMVTGMAVTRLLAAGEGPAGGPTGAAGFVIAWLVFL 189
Query: 257 -LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
L+ +HV + + MR + +LN R ++V +++ GR+ SP + EDL P
Sbjct: 190 GLTAVHVVANVAAMRVLLLTSLNTPRLELLVDRYLQDGRVLSPRAVSQLEDLTPP 244
>gi|218187487|gb|EEC69914.1| hypothetical protein OsI_00335 [Oryza sativa Indica Group]
Length = 386
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 90/184 (48%), Gaps = 19/184 (10%)
Query: 53 DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
+D+ R + F+ P+G+P SV++ YL Y ++ + T L T +LL A
Sbjct: 57 EDANSSIPRAIKDFV---LPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAV 113
Query: 113 GL-----RPTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGT 166
G+ A A + W+ KDG+ G+L I G D +PK+WR+ AD + G+
Sbjct: 114 GVGSFTGTSAAASAAAIRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGS 173
Query: 167 GLEVLSPLCPQLFLEMAGLGNFAKGI------AVVAARATRLP----IYSSFAKEGNLSD 216
E+++PL P FL +A LGN AK I R R P I + FA+ GNL +
Sbjct: 174 IFELITPLYPGYFLPLASLGNLAKKIIDSGLLKQAVGRGFRDPSFRVIQNHFAESGNLGE 233
Query: 217 LFAK 220
+ AK
Sbjct: 234 VAAK 237
>gi|428170130|gb|EKX39058.1| hypothetical protein GUITHDRAFT_114933 [Guillardia theta CCMP2712]
Length = 405
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 76/265 (28%), Positives = 109/265 (41%), Gaps = 59/265 (22%)
Query: 50 KVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL 109
K V + V +V SFL P V+E Y +T +R LQ S+ +SV TQSLL
Sbjct: 82 KFVRKASNVAFKVKQSFL--------PRGVSEDYYSFTIWRILQRIVSSTVSVFGTQSLL 133
Query: 110 FAAGLRPTPAQATVVS-WILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTG 167
A G++ S W+ + GK+I + G DS+ KRWR
Sbjct: 134 LALGVKTNKVGVAAASGWVFANAFGKFGKMIFAARWGRDFDSDAKRWR------------ 181
Query: 168 LEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFA-KEGNLSDLFAKGEAIST 226
ATR Y FA + NL D+ AKG+A
Sbjct: 182 ------------------------------SATRGTFYKQFAGRNENLGDITAKGDAQVA 211
Query: 227 LFNVVGLGAGIQLA---STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
+ ++VGL GI+LA T + G +V LS + +++ + LN +R
Sbjct: 212 VADLVGLVVGIELARLAGTNALCVWGSYVV---LSALDFIFILKSLDTIVFRFLNLERAT 268
Query: 284 MIVADFVKTGRISSPADLRYREDLL 308
M+ + FV G I SPAD+ E +L
Sbjct: 269 MLASAFVSEGIIKSPADISRSEPVL 293
>gi|317143185|ref|XP_001819302.2| hypothetical protein AOR_1_114154 [Aspergillus oryzae RIB40]
Length = 510
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 61/231 (26%), Positives = 122/231 (52%), Gaps = 12/231 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P+GYP SV++ Y+ Y F +LQ F+S+ +LS++++L G+ A A ++ I
Sbjct: 75 FLPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHI 134
Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF-----LE 181
L+D + ++ ++ +G ++ E K +R+ AD D L+ LSP+ P F L
Sbjct: 135 LQDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLS 194
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
AG+ + + VA +++ + + F++ GNL+++ AK + T+ +++G+ G + S
Sbjct: 195 TAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLVGSFVVS 251
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
V +S I +L +H+ +R+ + +LN QR ++ + + +
Sbjct: 252 RV-TSYTATWISLVMLLTMHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNS 301
>gi|119187997|ref|XP_001244605.1| hypothetical protein CIMG_04046 [Coccidioides immitis RS]
Length = 472
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 106/450 (23%), Positives = 188/450 (41%), Gaps = 94/450 (20%)
Query: 43 PDGHLSVKVVDDSRPVYHRVVDSF----LNKFFPSGYPFSVNEGYLRYTQF--------- 89
P + K V S P R S+ L+ F P GYP SV+ YL Y +
Sbjct: 26 PAASIGTKPVTISAPTSRRTWSSYFTAILDAFLPVGYPHSVSGDYLEYQVYVRPIFLKLH 85
Query: 90 --------------RALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGM 132
+LQ F+S+ +LS++++L G+ +P A ++S IL+D M
Sbjct: 86 WSFTDVANNRLFVQDSLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLS-ILQDSM 144
Query: 133 QHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAK 190
+ ++ ++ LG ++ E K +R+ AD D ++ LSP+ P+ + + + L + +
Sbjct: 145 GRIATILFAHQLGTSLEPECKMYRLAADIFNDSSMIMDCLSPMFPKSIRVGLLSLSSVLR 204
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
+ VAA +++ + + FAK GNL++L A G + S + S +
Sbjct: 205 ALCGVAAGSSKASLSAHFAKWGNLAELNAVGSLV---------------VSYITSPLATW 249
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF-----VKTGRISSPADLRYRE 305
+ + LL ++H+ + +RA + +LN QR ++ + V T + +S + +
Sbjct: 250 VALISLL-IVHITTNYLAVRAVSMTSLNRQRANIVFSTLFDENRVLTAKQASKQERVFER 308
Query: 306 DLLFPGRLIEDA-GNVKVG---RDPHKVVKPSKFHELKE--------LFAEEKFILCLGS 353
D + + D G ++G +D V S E E +F +E++IL
Sbjct: 309 DGVLRWKASSDTLGFCQIGVPFKDVIDHVSSSSPAERTEATIVAILRVFEQEEYIL---- 364
Query: 354 RWTD-------MVLEQNASGEDALRGWLVAAYAASMAKSFHDPS---------------- 390
W D + L+ NAS L+ W A AA+ K +
Sbjct: 365 -WFDPARKRGVIGLKTNASPTSQLKAWSHALLAANSLKIKESNTRGPLVREQVGTDSVLM 423
Query: 391 LTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
L +LQD + + F + L+ GW D
Sbjct: 424 LGILQDTLAEHSKDFDARIESLRDAGWDID 453
>gi|358373469|dbj|GAA90067.1| DUF647 domain protein [Aspergillus kawachii IFO 4308]
Length = 527
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 64/239 (26%), Positives = 124/239 (51%), Gaps = 12/239 (5%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ- 120
V+D + F P+GYP SV + Y Y F +LQ F+S+ +LS++++L G+ A
Sbjct: 68 VLDFLVEVFLPAGYPHSVTDDYAPYQLFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASP 127
Query: 121 -ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ- 177
A ++ IL+D + ++ ++ +G ++ E K +R+ AD DL L+ LSP+ P
Sbjct: 128 TAALLLHILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMILDCLSPMIPAG 187
Query: 178 ----LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
L AG+ + + VA +++ + + F++ GNL+++ AK + T+ +++G+
Sbjct: 188 APRVTVLSTAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGM 244
Query: 234 GAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
G + S V +S I +L +H+ +R+ + +LN QR ++ + + T
Sbjct: 245 LVGSFVVSRV-TSFSTTWICLLMLLALHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNT 302
>gi|397627073|gb|EJK68340.1| hypothetical protein THAOC_10484 [Thalassiosira oceanica]
Length = 478
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/275 (28%), Positives = 124/275 (45%), Gaps = 27/275 (9%)
Query: 82 GYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLI 139
GYL Y LQ T+A SVL+TQ +L G+ A A + ++++DG + L+
Sbjct: 92 GYLEYVCMDNLQDLTTALRSVLATQRVLEGVGVGREGATALAATLDFLIRDGCGMIASLV 151
Query: 140 -CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP-QLFLEMAGLGNFAKGIAVVAA 197
S + + KRW++ ADA D G LEV++P P + FL + N K + VAA
Sbjct: 152 LTSGFSSTFRRDIKRWKLFADAAVDAGITLEVVAPSLPSRWFLPVLCGANVCKALCGVAA 211
Query: 198 RATRLPIYSSFAK-----EGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMI 252
A P+ +A + LS++ AK A T+ GL L+ M G+
Sbjct: 212 GAVCGPVQLHWAARLGLGDEGLSEVAAKAGAQRTIVGATGL----VLSGLFARWMGGRGA 267
Query: 253 VGP-------LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV-------KTGRISSP 298
P +L+ +H+ + MR +N LN +R +I+ DF+ T + P
Sbjct: 268 NRPVWVGMYAVLTALHLLANYRSMRIVRLNWLNRERMDLILEDFLGRDGGSGDTLGLIDP 327
Query: 299 ADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPS 333
A E L+F R+ V++G + +V+ S
Sbjct: 328 AGAARVESLVFRPRVRRHQCGVRLGVSFNDLVRCS 362
>gi|345563243|gb|EGX46246.1| hypothetical protein AOL_s00110g70 [Arthrobotrys oligospora ATCC
24927]
Length = 445
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 87/394 (22%), Positives = 166/394 (42%), Gaps = 62/394 (15%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
+ + +N F P+GYP SV + YL A G + A T
Sbjct: 54 IQNIINIFLPAGYPKSVTDDYLDGD----------------------LAVGFEDSNASTT 91
Query: 123 --VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-- 177
V+ I++D + V ++ ++ LG+ ++ E KR+R AD D ++ LSP+ P+
Sbjct: 92 AAVLLKIVQDSIGRVATILFAHRLGSSLEPECKRFRFAADIFNDSALIVDCLSPVFPKPL 151
Query: 178 --LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL-- 233
+ L AG + + VA ++ + F+K GN+++L AK + T+ +++G+
Sbjct: 152 RVVLLSTAGT---LRALCGVAGGGSKAQLSVHFSKWGNVAELNAKDSSQETVISLMGMLM 208
Query: 234 ----GAGIQLASTVCSSMQGKMIVGP---LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
++ S V SS+ K L +H+++ ++A + TLN QR ++
Sbjct: 209 RADQTLRVKAGSLVISSVTSKTATWASMLFLIAVHLWTNYRAVKAVTMVTLNRQRANSVM 268
Query: 287 ADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG------RDPHKVVKPS-----KF 335
+ F +TG + +P D+ +E + ++ VG + V + S
Sbjct: 269 STFAETGVMLTPKDVARQERIFERDGVLRWCDGAVVGWGVIGTKLSSIVARKSASGSLSL 328
Query: 336 HELKELFAEEKFILCLGSRWTD----MVLEQNASGEDALRGWLVAAYAASMAKSFHDPSL 391
E+ + E+++L G + D + LE + R W+ +A + ++
Sbjct: 329 AEVAKACPTEQYMLWFGDKLRDIRVCICLENGCTPAGQFRAWMHGLLSAKLLSDRRSQAV 388
Query: 392 T------VLQDAYDKMNDVFTPLLSELQAKGWHT 419
T L DA +++ +F L + GW T
Sbjct: 389 TESDALRALVDAKRQLDGIFEEFLEKAAEAGWDT 422
>gi|392871321|gb|EAS33216.2| hypothetical protein CIMG_04046 [Coccidioides immitis RS]
Length = 407
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/375 (24%), Positives = 174/375 (46%), Gaps = 52/375 (13%)
Query: 91 ALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSN-LGAR 146
+LQ F+S+ +LS++++L G+ +P A ++S IL+D M + ++ ++ LG
Sbjct: 21 SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLS-ILQDSMGRIATILFAHQLGTS 79
Query: 147 MDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIY 205
++ E K +R+ AD D ++ LSP+ P+ + + + L + + + VAA +++ +
Sbjct: 80 LEPECKMYRLAADIFNDSSMIMDCLSPMFPKSIRVGLLSLSSVLRALCGVAAGSSKASLS 139
Query: 206 SSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSV 265
+ FAK GNL++L AK + T+ +++G+ G + S + S + + + LL ++H+ +
Sbjct: 140 AHFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLL-IVHITTN 198
Query: 266 IEEMRAAPVNTLNPQRTAMIVADF-----VKTGRISSPADLRYREDLLFPGRLIEDA-GN 319
+RA + +LN QR ++ + V T + +S + + D + + D G
Sbjct: 199 YLAVRAVSMTSLNRQRANIVFSTLFDENRVLTAKQASKQERVFERDGVLRWKASSDTLGF 258
Query: 320 VKVG---RDPHKVVKPSKFHELKE--------LFAEEKFILCLGSRWTD-------MVLE 361
++G +D V S E E +F +E++IL W D + L+
Sbjct: 259 CQIGVPFKDVIDHVSSSSPAERTEATIVAILRVFEQEEYIL-----WFDPARKRGVIGLK 313
Query: 362 QNASGEDALRGWLVAAYAASMAKSFHDPS----------------LTVLQDAYDKMNDVF 405
NAS L+ W A AA+ K + L +LQD + + F
Sbjct: 314 TNASPTSQLKAWSHALLAANSLKIKESNTRGPLVREQVGTDSVLMLGILQDTLAEHSKDF 373
Query: 406 TPLLSELQAKGWHTD 420
+ L+ GW D
Sbjct: 374 DARIESLRDAGWDID 388
>gi|409048331|gb|EKM57809.1| hypothetical protein PHACADRAFT_251666 [Phanerochaete carnosa
HHB-10118-sp]
Length = 532
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 66/235 (28%), Positives = 105/235 (44%), Gaps = 9/235 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P+ YP SV+ YL++ + ++ + VL Q+LL + G+ + A V W+
Sbjct: 114 FLPTNYPQSVHPSYLKFHLLQFMECIVGTVVGVLCNQALLVSVGVSAEGSILGAVAVQWV 173
Query: 128 LKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL---FLEMA 183
+KDG V KL + DS PK + + LG+GL++ + + P FL A
Sbjct: 174 IKDGAGEVAKLWVIRRFSPYFDSHPKTFTFTGATMGLLGSGLQIAALVVPPTTLNFLLCA 233
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
GN K + T + FA++GN+ D+ AK E+ ++ + G AG+ L TV
Sbjct: 234 AGGNIFKCVGGAVWLTTHIKFVRFFARQGNMGDVAAKSESQGSVGILSGYAAGVGLL-TV 292
Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
S + L IH+ +RAA L R + + +F R S P
Sbjct: 293 SHSAPYLYTIFALAIPIHLLITQWMLRAATFELLTLPRLSFLAREF--ASRASRP 345
>gi|402582701|gb|EJW76646.1| hypothetical protein WUBG_12446, partial [Wuchereria bancrofti]
Length = 184
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 79/135 (58%), Gaps = 3/135 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
F P GYP SV+ Y+ Y + +Q F S+ S LST+++L G+ + A V+W+
Sbjct: 43 FMPRGYPQSVSPDYMNYQIWDTIQAFASSMSSALSTEAILRGVGVGNKAASTMAAAVAWL 102
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
LKDG+ + +++ + L +D++ K+WR++AD DL L++++P+ P LF+ + L
Sbjct: 103 LKDGIGMLARILFAWLYSPYLDADCKQWRLIADCFNDLAYCLDLITPIFPNLFMPIICLS 162
Query: 187 NFAKGIAVVAARATR 201
+ + + VA ATR
Sbjct: 163 SMIRAVVGVAGSATR 177
>gi|378729487|gb|EHY55946.1| hypothetical protein HMPREF1120_04055 [Exophiala dermatitidis
NIH/UT8656]
Length = 485
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 62/230 (26%), Positives = 116/230 (50%), Gaps = 11/230 (4%)
Query: 61 RVVDSFLNK----FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRP 116
R S LNK F P+GYP +V Y Y + +LQ F S +L+++++ G+
Sbjct: 45 RTGTSILNKCMSPFLPAGYPSTVTPDYTPYQIYDSLQAFASTIAGLLASRAVFVGMGVGS 104
Query: 117 TPAQATVVSWILKDGMQHVGK----LICSNLGARMDSEPKRWRILADALYDLGTGLEVLS 172
A + + + +L + VG+ L R+++E K +R AD + D+ L+ +S
Sbjct: 105 EEA-SVLTTMLLYIAQETVGRVATILFAHQFSQRIEAEVKFYRFFADIVNDVAFVLDCMS 163
Query: 173 PLCPQL-FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
P P L + L N + + VA +++ + S FAK GN+ +L AK + T+ +++
Sbjct: 164 PAMPILGRVVTLCLSNACRAVCGVAGGSSKAILSSHFAKAGNIGELNAKDGSQETVISLL 223
Query: 232 GLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
G+ G + S V ++ + PLL+ +H+++ + +R+ +N+ N R
Sbjct: 224 GMWVGGVVVSKVHGTLATWSWLVPLLA-LHLWANWKAVRSVRLNSFNTNR 272
>gi|380480425|emb|CCF42443.1| hypothetical protein CH063_12442 [Colletotrichum higginsianum]
Length = 484
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 71/289 (24%), Positives = 131/289 (45%), Gaps = 25/289 (8%)
Query: 40 QFQPDGHLSVKVVDDSRPVYHRVVD------------SFLNKFFPSGYPFSVNEGYLRYT 87
++ G+L K V+ + + VD L F P GYP SV E Y Y
Sbjct: 12 EYDKAGNLKAKYVESASRIGPSRVDVLLPFKRKKPWLRLLEIFLPDGYPHSVTEDYAAYQ 71
Query: 88 QFRALQHFTSAALSVLSTQSLLFAAGLR---PTPAQATVVSWILKDGMQHVGKLICSNL- 143
+ ++Q F + ++S++++ G+ +P A +V ++++ + ++L
Sbjct: 72 IYDSVQAFAGSIAGMISSRAVWEGLGVGDSLASPTGAMLVQ-VIRESTGRFATIAFAHLF 130
Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ----LFLEMAGLGNFAKGIAVVAARA 199
G +++E K WR+ AD L D+ L+ LSP P L L + L A G VA A
Sbjct: 131 GTSIEAECKAWRLAADVLCDVAMVLDCLSPFFPHGVRFLVLCFSSLLYSASG---VAGNA 187
Query: 200 TRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSV 259
++ + FAK NL +L AK + T +++G+ +G + S + + + L
Sbjct: 188 SKSSLSGHFAKWNNLGELNAKDASQETAISLIGMISGTFVVSLL-TDQRATWATLIFLLA 246
Query: 260 IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
+H++ + +RA +LN QR + + G++ +P + RE +
Sbjct: 247 LHLFLNWKGVRAVKSRSLNRQRANIAFSALFSKGQVLTPHHVAERERIF 295
>gi|390599626|gb|EIN09022.1| DUF647-domain-containing protein [Punctularia strigosozonata
HHB-11173 SS5]
Length = 461
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 105/411 (25%), Positives = 173/411 (42%), Gaps = 52/411 (12%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
SR + H + D FL P+GYP SV+ YLRY + A Q F S+ ++++++++L G+
Sbjct: 26 SRGLKHLLTDVFL----PAGYPSSVSPDYLRYQTYNAFQAFCSSLANLIASRAVLEGFGV 81
Query: 115 RPTPAQAT--VVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVL 171
A AT ++ +L+D + + LG + E K +R+LAD D L+V
Sbjct: 82 GNASASATHAMLLTVLQDVFGRLTTVAGGYYLGTALSPEAKTYRLLADVFNDASIVLDVF 141
Query: 172 SPLCPQL------FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIS 225
P + L + G + + V A ++ + FA G+L DL AK +
Sbjct: 142 QPFYASVPVAGARVLALCSSGCL-RALCGVCAGGSKAALAVHFASAGDLGDLNAKDGSRE 200
Query: 226 TLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
T+ +VGL +G L S + + + LL H+ +RA + TL QR ++
Sbjct: 201 TVIGLVGLLSGTILLSYIGTPGATYAALCALLGA-HLALNYLAVRAVAMRTLTRQRASIA 259
Query: 286 VADFVK--TGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPH----------KVVKPS 333
+++ G +P ++ RE LL LI D ++GR K S
Sbjct: 260 WRVWMERGAGAAPTPEEVAEREGLLADPTLIADGTGRRLGRCSFGGARSQGFIATTGKDS 319
Query: 334 KFHELKELFAEEKFILCL----GSRWTDMVLEQNASG-EDALRGWLVAAYA--------- 379
L ++ L + GS T +L + G ED L+ W+ A A
Sbjct: 320 AMVSTSRLIEAARYWLLIDSKPGSIPTATILLKRVYGPEDLLKAWVHALRAVQDRRAESS 379
Query: 380 ----ASMAKSF---HDPSLTVLQDAYDKMND---VFTPLLSELQAKGWHTD 420
A ++F H P D +D+ ND +F + ++ GW D
Sbjct: 380 LQSGARGPQTFNHGHGPDSAGHTD-HDEGNDSERLFERFAARMRDAGWRVD 429
>gi|164423593|ref|XP_001728067.1| hypothetical protein NCU11380 [Neurospora crassa OR74A]
gi|157070159|gb|EDO64976.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 357
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 78/288 (27%), Positives = 133/288 (46%), Gaps = 37/288 (12%)
Query: 39 YQFQPDGHLSVKVVDDSRPVY--HRV----VDSFLNKFFPSGYPFSVNEGYLRYTQFRAL 92
Y P G++ +SR H+ + + LN F P+GYP SV YL Y + +L
Sbjct: 6 YDIDPAGNVLCTYNGESRESLKAHQTKVAPLHALLNAFLPAGYPHSVTPDYLPYQTYDSL 65
Query: 93 QHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGK----LICSNLGARMD 148
Q F S+ S+L+ +++L G+ + T + ILK + + L LG ++
Sbjct: 66 QAFFSSISSLLANRAVLEGLGVGDASSSPT-AALILKITSDTISRFATILFAHRLGTAIE 124
Query: 149 SEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAV------------VA 196
E K +R LAD L D L++L P+ M G + + V VA
Sbjct: 125 PECKFFRFLADVLNDAAQLLDLLMPVM------MMGETRYGLKLGVVVGSGVLRSWCGVA 178
Query: 197 ARATRLPIYSSF-AKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGP 255
A A++ + + F + +L +L AK + T+ +++G+ G S V +Q +++V
Sbjct: 179 AGASKASLSAHFVTQRSSLGELNAKEASQETVVSLLGMLVG----SLVVKVVQDRLVVWV 234
Query: 256 L---LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
L L +H+ +R + TLN QR M+V +++TGR+ P +
Sbjct: 235 LMVGLLGVHLLMNYRGVRCVRLRTLNRQRATMVVRSWLETGRVLGPEE 282
>gi|452984686|gb|EME84443.1| hypothetical protein MYCFIDRAFT_187449 [Pseudocercospora fijiensis
CIRAD86]
Length = 435
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 104/391 (26%), Positives = 180/391 (46%), Gaps = 67/391 (17%)
Query: 19 MSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFS 78
M+ Q + I ++ +D+VT Y Q +G D + R VD FL P G+P S
Sbjct: 1 MADQPLRITEYDEADTVTATY-VQSEGKSQGLSRVDVITAWQRAVDVFL----PVGFPHS 55
Query: 79 VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHV 135
V+E YL Y + +LQ F+S+ S+LS++++L + G+ +P A ++S IL+D + +
Sbjct: 56 VSEDYLEYQIYDSLQAFSSSIASLLSSRAVLSSVGVGDSTASPTAAIILS-ILQDSVGRL 114
Query: 136 GK-LICSNLGARMDSEPKRWRILADAL-YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIA 193
L + G ++ E K +R+LAD + + + VL LC
Sbjct: 115 ATILFAHSFGTSLEPECKMYRLLADPIRVVVLSLSSVLRSLCG----------------- 157
Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIV 253
VAA +++ + + FAK+GNL +L AK + T+ ++VG+ AG + S + + + +
Sbjct: 158 -VAAGSSKASLSAHFAKQGNLGELNAKDSSQETVISLVGMMAGSLVISWITTPIATWTAL 216
Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRL 313
LLS IH+ + +RA + TL QR A+I V+ + +P + +E + +
Sbjct: 217 ILLLS-IHLETNRRAVRAVKMRTLTRQRAALIYYQ-VQQNKTPTPDFIASQERIFERDGV 274
Query: 314 IEDA-----GNVKVG----------RDPHKVVKPS-----KFHELKELFAEEKFILCLGS 353
+ +A G VG + H K S KF + L+ FIL
Sbjct: 275 LRNAKAEVLGFCSVGVPLARLLNSIGEQHSQTKSSTMASEKFTRILHLYQTSAFIL---- 330
Query: 354 RWTD-----------MVLEQNASGEDALRGW 373
W+D +V++++A D + W
Sbjct: 331 -WSDRHSRSRRNSLFIVVKKSAVSRDLILAW 360
>gi|255574377|ref|XP_002528102.1| conserved hypothetical protein [Ricinus communis]
gi|223532491|gb|EEF34281.1| conserved hypothetical protein [Ricinus communis]
Length = 485
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 84/379 (22%), Positives = 156/379 (41%), Gaps = 41/379 (10%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV------- 123
P+G+P SV++ Y +Y + + T L T SLL A G+
Sbjct: 103 LPAGFPGSVSDDYFQYMLLQFPTNVTGWICHTLVTSSLLKAVGVGSFTGSTAAAAASAAA 162
Query: 124 VSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+ W+ KDG+ +G+L I G+ D +PK+WR+ AD + G+ ++++ + P FL +
Sbjct: 163 IRWVSKDGIGALGRLFIGGRFGSLFDDDPKQWRMYADFIGSAGSIFDLITQVYPAYFLPL 222
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
A LGN K +A + I + FA GNL ++ AK E VG
Sbjct: 223 ASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWE-----VGAQLLGLALGI 277
Query: 243 VCSSMQGKMIVGPLLS-------VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ G + P L+ ++H++ + +T+N +R ++V + +
Sbjct: 278 LILDTPGLVKSFPALASTWLSMRLLHLWLRYLSLSVLQFDTINLKRARLLVKSHILHSVV 337
Query: 296 SSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV----KPSKFHELKELFAEEKFILCL 351
D E++L + + G ++V ++ L +L+ +E ++L +
Sbjct: 338 LGCVDCNREENILLWETFTKP--QITFGVPLQEMVGGERSAAQVKTLLKLYTKENYLLVV 395
Query: 352 GSRWTD----MVLEQNASGEDALRG-----WLVAAYAASMAKSFHDPSLTVLQDAYDKMN 402
+ D + + A+ ALR WL + +S L L + +M
Sbjct: 396 NQQKGDFEVFVSFKVGATSMSALRSVWQTYWLHENWKSSTN------VLDKLAQSLLEME 449
Query: 403 DVFTPLLSELQAKGWHTDR 421
F + +L+ GW+T +
Sbjct: 450 VKFEDFVQQLEVAGWNTHQ 468
>gi|67539168|ref|XP_663358.1| hypothetical protein AN5754.2 [Aspergillus nidulans FGSC A4]
gi|40743657|gb|EAA62847.1| hypothetical protein AN5754.2 [Aspergillus nidulans FGSC A4]
Length = 559
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 78/301 (25%), Positives = 148/301 (49%), Gaps = 25/301 (8%)
Query: 64 DSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQ 120
D + F P+GYP SV++ Y Y F +LQ F+S+ +L+++++L G+ +P
Sbjct: 67 DVLIEVFLPAGYPHSVSDDYTAYQIFDSLQAFSSSIAGLLASRAVLQGVGVGNANASPTS 126
Query: 121 ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
A ++ IL+D + ++ ++ +G ++ E K +R AD DL L+ SP+ P F
Sbjct: 127 ALLLH-ILQDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCFSPMVPAGF 185
Query: 180 LEMAGL--GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
+ L + + VA +++ + + FA+ GNL+++ A+ T+ +++G+ G
Sbjct: 186 NRVTVLSAAGVLRALCGVAGGSSKASLSAHFARWGNLAEVNAE-----TVISLIGMLVGS 240
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
+ S + S + + LLS +H+ +R+ + TLN QR A+ V + +SS
Sbjct: 241 FVVSHITSFTATWLTLVFLLS-MHLSLNYAAVRSVQMTTLNRQR-----ANIVFSTLLSS 294
Query: 298 PADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELK-ELFAEEKFILCLGS--R 354
DL + + P R E ++K H+ P+ + E A+E+ I G+ +
Sbjct: 295 DPDL--ADFVSSPSRKDERRASLKQQEHTHQ--NPNHIRPITPEQVAKEERIFAAGAALK 350
Query: 355 W 355
W
Sbjct: 351 W 351
>gi|170102118|ref|XP_001882275.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164642647|gb|EDR06902.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 285
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 79/288 (27%), Positives = 141/288 (48%), Gaps = 28/288 (9%)
Query: 59 YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
+ RV+D F P+GYP SV+ YLRY ALQ F ++ +LS++++L G+
Sbjct: 1 HRRVLDLISKVFLPAGYPDSVSPDYLRYQILNALQAFCNSLAGLLSSRAILEGFGVGDPS 60
Query: 119 AQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A AT ++ +L+D + ++ ++ LG+ + + K++R+LAD L D L+ +SP
Sbjct: 61 ATATNALLLTVLQDVFSRLTTIVAAHILGSSLAPDAKKYRLLADMLNDAAVILDTISPRL 120
Query: 176 PQLF---LEMAGLGNFA--KGIAVVAARATRLPIYSSFAK----EGNLSDLFAKGEAIST 226
LF L +A L A + + ++A ++ I FA G++ DL AK + T
Sbjct: 121 DTLFFPGLRVAALCLSALFRSLCAISAGGSKASISLHFATPLKGTGDVGDLNAKDASKET 180
Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEM--RAAPVNTLNPQRTAM 284
+ ++G+ G + + + I LL ++ ++ I + R + TLN QRT +
Sbjct: 181 VLALLGMLLGTLIVPHLTTPWT---IYATLLVLVGLHLTINYIGVRGLVLRTLNGQRTWI 237
Query: 285 IVADFVKTG--RISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV 330
+ +++T R +P ++ E R+ E G RDP + V
Sbjct: 238 AWSAYMRTQPRRAPTPVEVASLE------RIFERPGGF---RDPRRGV 276
>gi|414586422|tpg|DAA36993.1| TPA: hypothetical protein ZEAMMB73_408601, partial [Zea mays]
Length = 72
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 53/70 (75%), Gaps = 1/70 (1%)
Query: 358 MVLEQNASGEDALRGWLVAAYAASMAKS-FHDPSLTVLQDAYDKMNDVFTPLLSELQAKG 416
MVLE++ASGEDALRGWLVAA+A+ M +S VL +AY+KM VF +SE++++G
Sbjct: 1 MVLERSASGEDALRGWLVAAFASEMERSGAGSRDDAVLSEAYEKMERVFPTFVSEVRSRG 60
Query: 417 WHTDRFLDGT 426
W+TD+FLDG
Sbjct: 61 WYTDQFLDGN 70
>gi|296826902|ref|XP_002851048.1| DUF647 domain-containing protein [Arthroderma otae CBS 113480]
gi|238838602|gb|EEQ28264.1| DUF647 domain-containing protein [Arthroderma otae CBS 113480]
Length = 479
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 104/439 (23%), Positives = 188/439 (42%), Gaps = 96/439 (21%)
Query: 56 RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQF---------------------RALQH 94
+P + S LN F PSGYP SV + YL Y + +LQ
Sbjct: 47 QPSSTSYLTSILNVFLPSGYPHSVTDDYLEYQIYVSQQPFPSSDRKEKKLTVSYQDSLQA 106
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNLGARMDSEPK 152
F+S+ +LS++++L G+ A A ++ +L++ M + ++ +
Sbjct: 107 FSSSIAGLLSSRAVLQGVGVGDATASPTAALLLNVLQESMGRIATILFA----------- 155
Query: 153 RWRILADALYDLGTGLE---VLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSF 208
+ LGT LE LSP+ P+ + + L + + + VAA +++ + + F
Sbjct: 156 ---------HRLGTSLEPEYCLSPVFPKPVRVGFLSLSSVLRALCGVAAGSSKASLSAHF 206
Query: 209 AKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEE 268
A+ GNL +L AK + T+ ++VG+ G + + V S + + + LL+ IH+ +
Sbjct: 207 ARWGNLGELNAKDSSQETVISLVGMLVGSVVVAYVSSPLATWVALLFLLA-IHLATNHAA 265
Query: 269 MRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP--GRLIEDA------GNV 320
+RA + TLN QR ++ + R+ +PA + +E+ +F G L A G
Sbjct: 266 VRAVNMTTLNRQRANIVFSSLFDDDRVLTPAQVS-KEERIFERDGVLRWKAEATGVLGTC 324
Query: 321 KVG--------RDPHK----VVKPSKFHE-----------LKELFAEEKFILCLGSR--W 355
+VG PH K S+ E L +LF EE+++L R
Sbjct: 325 RVGVSLEQLLLLLPHSQTNDASKTSRRSEAGADFGVSLPVLVDLFKEEEYLLYFSPRDHA 384
Query: 356 TDMVLEQNASGEDALRGW----LVAAYAA--SMAKSFHDPS--------LTVLQDAYDKM 401
++L+ + + L+ W LV+ A S + ++ L++L+D +
Sbjct: 385 AVIILKAGVTAQAQLKAWSHALLVSRRLAGRSTSACINEKQGGQNELSVLSILRDTLREH 444
Query: 402 NDVFTPLLSELQAKGWHTD 420
+ F + L+ GW TD
Sbjct: 445 SKAFDDRVERLKQAGWETD 463
>gi|303316636|ref|XP_003068320.1| hypothetical protein CPC735_003440 [Coccidioides posadasii C735
delta SOWgp]
gi|240108001|gb|EER26175.1| hypothetical protein CPC735_003440 [Coccidioides posadasii C735
delta SOWgp]
Length = 478
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 174/375 (46%), Gaps = 52/375 (13%)
Query: 91 ALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSN-LGAR 146
+LQ F+S+ +LS++++L G+ +P A ++S IL+D M + ++ ++ LG
Sbjct: 92 SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLS-ILQDSMGRIATILFAHQLGTS 150
Query: 147 MDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIY 205
++ E K +R+ AD D ++ LSP P+ + + + L + + + VAA +++ +
Sbjct: 151 LEPECKMYRLAADIFNDSSMIMDCLSPTFPKSIRVGLLSLSSVLRALCGVAAGSSKASLS 210
Query: 206 SSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSV 265
+ FAK GNL++L AK + T+ +++G+ G + S + S + + + LL ++H+ +
Sbjct: 211 AHFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLL-IVHITTN 269
Query: 266 IEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE-----DLLFPGRLIEDA-GN 319
+RA + +LN QR ++ + R+ +P +E D + + D G
Sbjct: 270 YLAVRAVSMTSLNRQRANIVFSTLFDENRVLTPKQASKQERVFERDGVLRWKASSDTLGF 329
Query: 320 VKVGRDPHKVV------KPSKFHE-----LKELFAEEKFILCLGSRWTD-------MVLE 361
++G V+ P++ E + +F +E++IL W D + L+
Sbjct: 330 CQIGVPFKDVIDHISSSSPAERTEATVVAILRVFEQEEYIL-----WFDPARKRGVIGLK 384
Query: 362 QNASGEDALRGWLVAAYAASMAK----SFHDP------------SLTVLQDAYDKMNDVF 405
NAS L+ W A AA+ K + +P L +LQ + + F
Sbjct: 385 TNASPTSQLKAWSHALLAANSLKIKESNTREPLVREQVGTDSVLMLGILQGTLAEHSKDF 444
Query: 406 TPLLSELQAKGWHTD 420
+ L+ GW D
Sbjct: 445 DARIESLRDAGWDID 459
>gi|320038116|gb|EFW20052.1| hypothetical protein CPSG_03227 [Coccidioides posadasii str.
Silveira]
Length = 478
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 174/375 (46%), Gaps = 52/375 (13%)
Query: 91 ALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSN-LGAR 146
+LQ F+S+ +LS++++L G+ +P A ++S IL+D M + ++ ++ LG
Sbjct: 92 SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLS-ILQDSMGRIATILFAHQLGTS 150
Query: 147 MDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIY 205
++ E K +R+ AD D ++ LSP P+ + + + L + + + VAA +++ +
Sbjct: 151 LEPECKMYRLAADIFNDSSMIMDCLSPTFPKSIRVGLLSLSSVLRALCGVAAGSSKASLS 210
Query: 206 SSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSV 265
+ FAK GNL++L AK + T+ +++G+ G + S + S + + + LL ++H+ +
Sbjct: 211 AHFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLL-IVHITTN 269
Query: 266 IEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE-----DLLFPGRLIEDA-GN 319
+RA + +LN QR ++ + R+ +P +E D + + D G
Sbjct: 270 YLAVRAVSMTSLNRQRANIVFSTLFDENRVLTPKQASKQERVFERDGVLRWKASSDTLGF 329
Query: 320 VKVGRDPHKVV------KPSKFHE-----LKELFAEEKFILCLGSRWTD-------MVLE 361
++G V+ P++ E + +F +E++IL W D + L+
Sbjct: 330 CQIGVPFKDVIDHISSSSPAERTEATVVAILRVFEQEEYIL-----WFDPARKRGVIGLK 384
Query: 362 QNASGEDALRGWLVAAYAASMAK----SFHDP------------SLTVLQDAYDKMNDVF 405
NAS L+ W A AA+ K + +P L +LQ + + F
Sbjct: 385 TNASPTSQLKAWSHALLAANSFKIKESNTREPLVREQVGTDSVLMLGILQGTLAEHSKDF 444
Query: 406 TPLLSELQAKGWHTD 420
+ L+ GW D
Sbjct: 445 DARIESLRDAGWDID 459
>gi|326479045|gb|EGE03055.1| DUF647 domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 494
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 92/411 (22%), Positives = 162/411 (39%), Gaps = 108/411 (26%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
+ S LN F P+GYP SV + Y+ Y + ++ + +LFA
Sbjct: 123 LTSILNVFLPAGYPHSVTDDYIEYQIYESMGRIAT----------ILFA----------- 161
Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLE 181
LG ++ E K +R+ AD L D L+ LSP+ P+ + +
Sbjct: 162 ------------------HRLGTSLEPECKMYRLAADILNDSAMVLDCLSPIFPKPVRVG 203
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+ L + + + VAA +++ + + FA+ GNL +L A G + S
Sbjct: 204 LLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAVGSVV---------------VS 248
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
+ S ++ + + LL ++H+ + +RA + TLN QR ++ + + I +PA+
Sbjct: 249 YITSPLETWIALIVLL-IVHLGTNHAAVRAVKMTTLNRQRANIVFSHLFEDNLILTPAET 307
Query: 302 RYREDLLFPGRLIE-----------------------------DAGNVKVGRDPHKVVKP 332
E + ++ D G ++ R +
Sbjct: 308 SKEERIFERDGVLRWKAESSGILGTCQIGTSLEQLLLLLPEQADRGALETSRTTSTTMDA 367
Query: 333 S-KFHELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGW----LVAAYAA----S 381
S L ELF EE++IL G SR +VL+ A+ + L+ W LV+ + A +
Sbjct: 368 STNLTALLELFREEEYILYFGPLSRRGAIVLKAGATAQAQLKAWSHALLVSRHLARKNGT 427
Query: 382 MAKSFHD---------PS---LTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
H+ PS L++L+D + F + L+ GW TD
Sbjct: 428 ATSCIHEKQGGQGELSPSVSVLSILRDTLQEHTRTFKDRVKRLEHAGWQTD 478
>gi|303280291|ref|XP_003059438.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459274|gb|EEH56570.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 568
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 79/161 (49%), Gaps = 7/161 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ---ATVVSW 126
F P+GYP +V+ Y + ++ L VL++QSLL A G+ TP A W
Sbjct: 150 FLPAGYPDTVSSDYATFIRWHLASLMFRNVLEVLTSQSLLVALGMGSTPGALPLAAATKW 209
Query: 127 ILKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
+LKDG+ L+ G + D +PKRW + ++L D+ LE+++P P LFL +A
Sbjct: 210 VLKDGVGSFATLLAGVFGGQKYDEDPKRWWAVTNSLEDVARVLELVTPAYPALFLPLAAS 269
Query: 186 GNFAKGIAVVAARATRL--PIYSSFAKEGNLSDLFAKGEAI 224
F + A+ R + + + N D+ AK EA+
Sbjct: 270 ATFVRAGALT-GRGSLINGTFMQHLGRNQNFGDVRAKMEAL 309
>gi|346978906|gb|EGY22358.1| hypothetical protein VDAG_03796 [Verticillium dahliae VdLs.17]
Length = 455
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 60/262 (22%), Positives = 125/262 (47%), Gaps = 9/262 (3%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVS 125
+ F P YP SV+ Y++Y + + Q F S +L+ +++L G+ ++
Sbjct: 79 DAFLPVDYPDSVSPDYIKYQMYDSFQAFFSTITGMLANRAILEGLGVGDASGSVNHALLL 138
Query: 126 WILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLC---PQLFLE 181
LKDG+ + + + G+ +++ K++R LAD D LE+ +P ++ +
Sbjct: 139 TCLKDGVSRLATITFAYRFGSVIEANAKKYRFLADLFNDTAFFLELANPFFAGWAKILIL 198
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
++ + + +AA A++ + + FA+ NL++L K + T ++GL AG +
Sbjct: 199 VSAES--LRALCGIAAGASKAALSTHFARRNNLAELNTKEASQETAIGLIGLLAGTAIIH 256
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
V + + L IH+ + A ++T+N QR +++ +++T + +PA+
Sbjct: 257 YV-QNRGAVFFLTIFLVFIHLGMNYLGVCALQLDTVNRQRATIVMQHYLRTQEVLTPAEA 315
Query: 302 RYREDLLFPGRLIEDAGNVKVG 323
+E+++F I D N V
Sbjct: 316 SKKENIIFWRPHIADEHNKTVA 337
>gi|238488016|ref|XP_002375246.1| DUF647 domain protein [Aspergillus flavus NRRL3357]
gi|220700125|gb|EED56464.1| DUF647 domain protein [Aspergillus flavus NRRL3357]
Length = 513
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 95/425 (22%), Positives = 182/425 (42%), Gaps = 81/425 (19%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P+GYP SV++ Y+ Y F +LQ F+S+ +LS++++L G+ A A ++ I
Sbjct: 75 FLPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHI 134
Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF-----LE 181
L+D + ++ ++ +G ++ E K +R+ AD D L+ LSP+ P F L
Sbjct: 135 LQDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLS 194
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL---GAGIQ 238
AG+ + + VA +++ + + F++ GNL+++ AK + T+ +++G+ G
Sbjct: 195 TAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLVRFVGSF 251
Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI------------V 286
+ S V +S I +L +H+ +R+ + +LN QR ++ +
Sbjct: 252 VVSRV-TSYTATWISLVMLLTMHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNSDTDLDI 310
Query: 287 ADFVKTG------------------RISSPADLRYREDLLFPGRLI---------EDAGN 319
A+F T +I +PA + +E + ++ G
Sbjct: 311 ANFNPTHETHPTPKHSKATQSQKQWQIPTPAQVSKQEKIFETDGILRWVSAPSTQHKLGT 370
Query: 320 VKVGRDPHKVVKPSK-------------FHELKELFAEEKFILCLG---SRW-TDMVLEQ 362
++G + + PS +L LF E ++L L W ++L+
Sbjct: 371 CRIGVSLEQFLAPSSTRTGSGSLKTSTPMSDLSSLFKSEDYLLFLHRNRQSWDARILLKT 430
Query: 363 NASGEDALRGWLVAAYAA----SMAKSFHDPSLTVLQDAYDK----MND--VFTPLLSEL 412
+++ + L+ W+ AA S AK + + D K +ND +S L
Sbjct: 431 SSTTQTQLKAWMHVLLAARVLCSSAKEMRTQEIEYIMDTISKTLTFLNDGSRTDQYMSAL 490
Query: 413 QAKGW 417
GW
Sbjct: 491 TEAGW 495
>gi|241595406|ref|XP_002404468.1| conserved hypothetical protein [Ixodes scapularis]
gi|215502351|gb|EEC11845.1| conserved hypothetical protein [Ixodes scapularis]
Length = 472
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 66/271 (24%), Positives = 119/271 (43%), Gaps = 24/271 (8%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
F F PSG+P +V+ Y Y + ++Q F S+ L+TQ++L G+ A A
Sbjct: 45 FSEIFLPSGFPDTVSRDYKEYQIWDSIQAFASSITGTLATQAVLTGVGVGDQTASVLAAT 104
Query: 124 VSWILKDGMQHVGKLI------------------C--SNLGARMDSEPKRWRILADALYD 163
+WILKDG +G+++ C SN ++ + AD L D
Sbjct: 105 TTWILKDGTGMIGRILFALVSRYLGMCCATHNKYCAHSNDSTTLNHADNYCSLFADILND 164
Query: 164 LGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
LE++S + + + AK + VA ATR + A+ N++D+ AK +
Sbjct: 165 AAILLELMSQHLKGYITLVLCISSVAKSVVGVAGGATRAALTQHQARSNNMADVSAKDGS 224
Query: 224 ISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
TL N+ + L + S + + +++H+++ + + T N R +
Sbjct: 225 QETLVNLAAFLCSLLLLRLLAGSQILVYGIFTIFTLLHIFANFRAVSCVVMETFNRSRYS 284
Query: 284 MIVADFV-KTGRISSPADLRYREDL-LFPGR 312
++V F+ + G I+S + +E + L GR
Sbjct: 285 LLVKHFLEEAGDIASVEKVNKQESVWLRCGR 315
>gi|396461797|ref|XP_003835510.1| hypothetical protein LEMA_P048510.1 [Leptosphaeria maculans JN3]
gi|312212061|emb|CBX92145.1| hypothetical protein LEMA_P048510.1 [Leptosphaeria maculans JN3]
Length = 543
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 76/316 (24%), Positives = 151/316 (47%), Gaps = 33/316 (10%)
Query: 117 TPAQATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
+P A ++S +L+D M ++ ++ LG + E K +R AD L D G LE LSP
Sbjct: 187 SPTAALLLS-VLQDSMGRTATILFADRLGTAFEPECKMYRFTADILNDTGMVLECLSPAF 245
Query: 176 PQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
P+ + + + + + + V A + + + + FA++GNL ++ AK + T+ +++G+
Sbjct: 246 PKPIRVCVLSCSSVLRSLCGVCAGSAKASLSAHFARKGNLGEVNAKDSSQETVISLLGML 305
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
AG + S + S M + LL+ IH+ + +RA + LN QR ++ + ++ G+
Sbjct: 306 AGSVVVSWISSPMATWSTLIGLLA-IHLATNYAAVRAVSMRCLNRQRANILFSRLLEHGQ 364
Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKVG--------------------RDPHKVVKPSK 334
+ SP ++ RE + ++ + ++ +G R ++ ++
Sbjct: 365 LLSPLEVSRRERVFEHDGVLRWSDDMAIGYCRIGVSLQTLLSHMGLRNARTGSLHLRGAE 424
Query: 335 FHELKELFAEEKFILCLGSRWTD--MVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLT 392
EL +F +E +IL S +VL++ + D L+ W A+A +AK + P
Sbjct: 425 ISELLHVFVDEAYILWPASSAGHFLIVLKEGCTPMDQLKAW---AHALLLAKGYSLPGAK 481
Query: 393 VLQD----AYDKMNDV 404
+ D A +++ DV
Sbjct: 482 SVVDQTSLAGNRLADV 497
Score = 45.8 bits (107), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 38/56 (67%), Gaps = 4/56 (7%)
Query: 54 DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL 109
D++ RV+D FL P+GYP SV E Y++Y + +LQ F+S+ S+L+++++L
Sbjct: 45 DAKGYAQRVLDVFL----PAGYPHSVTEDYIQYQIYDSLQAFSSSIASMLASRAVL 96
>gi|440794597|gb|ELR15757.1| hypothetical protein ACA1_379390, partial [Acanthamoeba castellanii
str. Neff]
Length = 449
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 93/218 (42%), Gaps = 30/218 (13%)
Query: 63 VDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
VD L F P+GYP SV + Y+ + ++ + S+A V L G
Sbjct: 69 VDLMLKSLFLPTGYPASVRQDYIHFQKWEVAKGIVSSAGFV----GHLVGLGCL------ 118
Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
GM L+ G D+E KR R AD ++ G LE+ +PL P FL
Sbjct: 119 ---------GMM----LVAGAFGKFFDTETKRIRWAADCIHVAGVALELATPLFPAYFLP 165
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
+A L N AKGIA + AT+ I FA NL D+ AKG + + ++G+G GI
Sbjct: 166 LASLANSAKGIAGLTTGATKAAINQGFALRDNLGDITAKGHSQGIVAYLMGMGIGIGTTY 225
Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNP 279
++G L +V V + + P T P
Sbjct: 226 VTGG------VLGALFAVYGVLAGGSTSSSGPFTTAAP 257
>gi|194382298|dbj|BAG58904.1| unnamed protein product [Homo sapiens]
Length = 194
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 80/164 (48%)
Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLP 203
G+++D K+WR+ AD L D+ LE+++P+ P F N AK I VA ATR
Sbjct: 12 GSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAA 71
Query: 204 IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVY 263
+ A+ N++D+ AK + TL N+ GL + + V + L+ +H+Y
Sbjct: 72 LTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIY 131
Query: 264 SVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
+ +RA + TLN R +++ +++ G + P E L
Sbjct: 132 ANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 175
>gi|412993086|emb|CCO16619.1| predicted protein [Bathycoccus prasinos]
Length = 454
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 55/187 (29%), Positives = 87/187 (46%), Gaps = 9/187 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP-----AQATVV 124
F P G+P +V+ YL + +++ L F L VLS QSLL A G+ A
Sbjct: 40 FLPDGFPNTVSRDYLPWLKWQMLSLFFRDVLEVLSAQSLLVAVGMDVNQANAAAPVAGAA 99
Query: 125 SWILKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
W+LKDG + L L + D PK + A++L D+ +E+L+PL P FL +A
Sbjct: 100 KWVLKDGTGGLATLALGALNTKKFDENPKFFWSTANSLEDVSRAMELLTPLFPSHFLLIA 159
Query: 184 GLGNFAKGIAVVAARATRL--PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
G F + A + R++ + + F++ N D+ K E + L GI L
Sbjct: 160 GTATFVRS-AALTGRSSLINGTFMTHFSRNENAGDIRTKLEVQGRWLQIAALPVGIALFR 218
Query: 242 TVCSSMQ 248
V + ++
Sbjct: 219 AVTNGLE 225
>gi|422292654|gb|EKU19956.1| hypothetical protein NGA_2057300, partial [Nannochloropsis gaditana
CCMP526]
Length = 274
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 5/134 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR---PTPAQATVVSW 126
F P G+P SV YLRY Q+ +Q ++ +L+TQ++L G+ TP AT+ W
Sbjct: 142 FLPVGFPSSVAPEYLRYQQWNVVQDLSTYLRGILATQAILEGVGVGREGATPLAATL-QW 200
Query: 127 ILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
I +DG + L+ S L K WR+ AD + D+G L++L+PL P+ FL L
Sbjct: 201 ITRDGASMMSGLVFTSFLSTNFGVNAKSWRLFADFMVDIGITLDMLAPLFPKNFLLCICL 260
Query: 186 GNFAKGIAVVAARA 199
+ K + +AA A
Sbjct: 261 ASVCKSLCGIAAGA 274
>gi|387198702|gb|AFJ68865.1| hypothetical protein NGATSA_2057300, partial [Nannochloropsis
gaditana CCMP526]
Length = 277
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 5/134 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR---PTPAQATVVSW 126
F P G+P SV YLRY Q+ +Q ++ +L+TQ++L G+ TP AT+ W
Sbjct: 145 FLPVGFPSSVAPEYLRYQQWNVVQDLSTYLRGILATQAILEGVGVGREGATPLAATL-QW 203
Query: 127 ILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
I +DG + L+ S L K WR+ AD + D+G L++L+PL P+ FL L
Sbjct: 204 ITRDGASMMSGLVFTSFLSTNFGVNAKSWRLFADFMVDIGITLDMLAPLFPKNFLLCICL 263
Query: 186 GNFAKGIAVVAARA 199
+ K + +AA A
Sbjct: 264 ASVCKSLCGIAAGA 277
>gi|409076043|gb|EKM76417.1| hypothetical protein AGABI1DRAFT_131245 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 466
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 94/411 (22%), Positives = 170/411 (41%), Gaps = 45/411 (10%)
Query: 60 HRVVDSFLNK-FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
H+ L K F P+GYP +V YL Y A Q F ++ S+L++++ L G+
Sbjct: 44 HQDTKELLTKIFLPAGYPNTVTPDYLLYQILNAAQAFCNSLASLLASRAALEGIGVGDPS 103
Query: 119 AQAT--VVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A +T ++ +L+D + +I + LG R+ E K++R LAD L D+ ++V +P+
Sbjct: 104 ATSTGAMLISVLQDIFGRIATIIGAYYLGTRLVPEAKKYRFLADVLNDIAVVVDVFNPVF 163
Query: 176 PQLFLEMA---GLGNFA--KGIAVVAARATRLPIYSSFAKE----GNLSDLFAKGEAIST 226
A GL A + + V A + I FA G++ DL AK + T
Sbjct: 164 SAFMFPGARVIGLCTSAALRAVCGVLAGGAKAAITLHFATPRGGVGDVGDLNAKDSSKET 223
Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGP-LLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
+ ++G+ L S + + + P LL +I V+ VI + + +P R
Sbjct: 224 VLALLGM----LLGSVMIPHITSPWVTYPVLLLLISVHLVINYIAVRGLAIRSPNRHRAT 279
Query: 286 VADF---VKTGRISSPADLRYREDLLFPGRLIEDA------GNVKVGRDPHKVVK-PSKF 335
+A K SP + RE + I D G VG P V++ P+
Sbjct: 280 IAWLWYRSKDECAPSPQAVASRELIFERPENIRDPTTSLVLGKCHVGSAPLDVMRGPALS 339
Query: 336 HELKELFAEEKFILCLGSRW-------TDMVLEQNASGEDALRGWL--------VAAYAA 380
E ++F +K+I+C + ++ + D L+ W+ +A
Sbjct: 340 QEYLDIFHADKYIVCFNPSHETGHLPIIHICFKEGFTRADELKSWIHAVEICRSIALNQV 399
Query: 381 SMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD--RFLDGTGTR 429
+ K L ++ + + + + + ++ GW+ D G+G +
Sbjct: 400 PIQKGSKAVELEHIRSTFQVVKEKYDHFVDHMKKAGWNFDESHLASGSGPK 450
>gi|154292218|ref|XP_001546685.1| hypothetical protein BC1G_14192 [Botryotinia fuckeliana B05.10]
Length = 449
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/249 (26%), Positives = 131/249 (52%), Gaps = 25/249 (10%)
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEP 151
F S +L+++++L G+ + A AT V+ IL++ + ++ ++ LG+ ++ E
Sbjct: 59 FASTIAGLLASRAVLQGLGVGDSTASATHAVLLSILQESAGRISTILFAHRLGSALEPEC 118
Query: 152 KRWRILADALYDLGTGLEVLSPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSF 208
K++R++AD D L+ +SP P++FL A + K + VAA +++ + + F
Sbjct: 119 KKYRLMADIFNDAAMILDCISPAFPKIPRVFLLSAS--SVCKSLCGVAAGSSKASLSAHF 176
Query: 209 AKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEE 268
AK GNL++L AK + TL +++G+ G + S + S + + + LLS IH+ +
Sbjct: 177 AKMGNLAELNAKDASQETLISLLGMLVGTFVVSKISSQVATWIALLALLS-IHLGTNYMA 235
Query: 269 MRAAPVNTLNPQRTAMIVADFVKT-----GRIS--SPADLRYREDLLFPGRLIEDAGNVK 321
+R+ + TLN QR ++++ + + +IS SP D+ +E R+ E G +
Sbjct: 236 VRSVTMRTLNRQRANLVISSLLSSPEHEKNKISLPSPRDISLQE------RIFERDGAI- 288
Query: 322 VGRDPHKVV 330
R+ H +
Sbjct: 289 --RNTHGTI 295
>gi|336467636|gb|EGO55800.1| hypothetical protein NEUTE1DRAFT_103211 [Neurospora tetrasperma
FGSC 2508]
gi|350287710|gb|EGZ68946.1| hypothetical protein NEUTE2DRAFT_72230 [Neurospora tetrasperma FGSC
2509]
Length = 361
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/254 (28%), Positives = 127/254 (50%), Gaps = 23/254 (9%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQA 121
+ L+ F P+GYP SV YL Y + +LQ F S+ S+L+ +++L G+ +P A
Sbjct: 38 ALLHAFLPAGYPHSVTPDYLPYQTYDSLQAFFSSISSLLANRAVLEGLGVGDASSSPTGA 97
Query: 122 TVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF- 179
++ I D + V ++ ++ LG ++ E K +R LAD L D L++L P + +
Sbjct: 98 LILK-IASDTISRVATILFAHRLGTAIEPECKFFRFLADCLNDAAQLLDLLVPAMMERYG 156
Query: 180 LEMA---GLGNFAKGIAV-----VAARATRLPIYSSFAKEG--NLSDLFAKGEAISTLFN 229
L++ G G V A+ + + +S K G NL++L AK + T+ +
Sbjct: 157 LKLGVIVGAGVLRSWCGVAASASKASLSAHFIVVASQQKGGGNNLAELNAKEASQETVVS 216
Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPL---LSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
++G+ G S V +Q +++V L L +H+ +R TLN QR M+V
Sbjct: 217 LLGMLVG----SLVVQVVQDRLVVWVLMVGLLGVHLLMNYWGVRCVRCRTLNRQRATMVV 272
Query: 287 ADFVKTGRISSPAD 300
++++TGR+ P +
Sbjct: 273 REWLETGRVLGPEE 286
>gi|116004169|ref|NP_001070441.1| UPF0420 protein C16orf58 homolog [Bos taurus]
gi|61554841|gb|AAX46623.1| hypothetical protein FLJ13868 [Bos taurus]
Length = 232
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 91/184 (49%), Gaps = 13/184 (7%)
Query: 12 KKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVV--DDSRPVYHRVVDSFLNK 69
+ PD S+ + W+ S + T +P G V + P + R++ FL
Sbjct: 19 RAAPDGSLQWEAWGRRWWGFSGAFTA----KPGGRDGGGGVAPGTASPPFSRLLAVFL-- 72
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A +W+
Sbjct: 73 --PQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWL 130
Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + +G++MD K+WR+ AD L D+ LE+++P+ P F +
Sbjct: 131 VKDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTITVCIS 190
Query: 187 NFAK 190
N AK
Sbjct: 191 NLAK 194
>gi|189196610|ref|XP_001934643.1| hypothetical protein PTRG_04310 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187980522|gb|EDU47148.1| hypothetical protein PTRG_04310 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 333
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 71/296 (23%), Positives = 139/296 (46%), Gaps = 29/296 (9%)
Query: 85 RYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICS 141
R +LQ F+S+ +L+++++L G+ TP A ++S I Q L
Sbjct: 36 RVDVIDSLQAFSSSIAGMLASRAILEGVGVGDSTATPTTALLLSVIQTSVGQIATILFAH 95
Query: 142 NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARAT 200
LG ++ E K +R+ AD D L+ LSP P+ + + + + + + V A ++
Sbjct: 96 RLGTALEPECKMYRLAADVFNDTAMVLDCLSPAFPKSIRVAVLSFSSCLRALCGVCAGSS 155
Query: 201 RLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ--GKMIVGPLLS 258
+ + + FA +GNL ++ AK + T+ +++G+ AG + S + S + G +I+ +
Sbjct: 156 KASLSAHFAHKGNLGEVNAKDSSQETVISLLGMLAGSLVVSQITSPLATWGTLILLLTIH 215
Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAG 318
+ Y+ + RA ++ LN QR ++ ++ + G + SP D+ RE + G ++ +
Sbjct: 216 LATNYAAV---RAVNMHCLNRQRANILFSNMFQHGLVLSPRDVSQRERVFERGGVLRWSD 272
Query: 319 NVKVGR---------------DPHK-----VVKPSKFHELKELFAEEKFILCLGSR 354
+ +GR PH+ +K + +L E+FA E + G R
Sbjct: 273 DKVLGRCSIGVPLQRLLNRLGTPHQQTGSLTLKSMQISDLLEVFANESLHIIAGPR 328
>gi|296473307|tpg|DAA15422.1| TPA: hypothetical protein LOC767862 [Bos taurus]
Length = 232
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 91/184 (49%), Gaps = 13/184 (7%)
Query: 12 KKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVV--DDSRPVYHRVVDSFLNK 69
+ PD S+ + W+ S + T +P G V + P + R++ FL
Sbjct: 19 RAAPDGSLQREAWGRRWWGFSGAFTA----KPGGRDGGGGVAPGTASPPFSRLLAVFL-- 72
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A +W+
Sbjct: 73 --PQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWL 130
Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G+++ + +G++MD K+WR+ AD L D+ LE+++P+ P F +
Sbjct: 131 VKDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTITVCIS 190
Query: 187 NFAK 190
N AK
Sbjct: 191 NLAK 194
>gi|426194386|gb|EKV44317.1| hypothetical protein AGABI2DRAFT_120451 [Agaricus bisporus var.
bisporus H97]
Length = 462
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 95/410 (23%), Positives = 171/410 (41%), Gaps = 47/410 (11%)
Query: 60 HRVVDSFLNK-FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
H+ L K F P+GYP +V YL Y A Q F ++ S+L++++ L G+
Sbjct: 44 HQDTKELLTKIFLPAGYPNTVTPDYLLYQILNAAQAFCNSLASLLASRAALEGIGVGDPS 103
Query: 119 AQAT--VVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
A +T ++ +L+D + +I + LG R+ E K++R LAD L D+ ++V +P+
Sbjct: 104 ATSTGAMLISVLQDIFGRIATIIGAYYLGTRLVPEAKKYRFLADVLNDIAVVVDVFNPVF 163
Query: 176 PQLFLEMA---GLGNFA--KGIAVVAARATRLPIYSSFAKE----GNLSDLFAKGEAIST 226
A GL A + + V A + I FA G++ DL AK + T
Sbjct: 164 SAFMFPGARVIGLCTSAALRAVCGVLAGGAKAAITLHFATPRGGVGDVGDLNAKDSSKET 223
Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGP-LLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
+ ++G+ L S + + + P LL +I V+ VI + + +P R
Sbjct: 224 VLALLGM----LLGSVLIPHITSPWVTYPVLLLLISVHLVINYIAVRGLAIRSPNRHRAT 279
Query: 286 VADF---VKTGRISSPADLRYREDLLFPGRLIEDA------GNVKVGRDPHKVVK-PSKF 335
+A K SP + E + I D G VG P V++ P+
Sbjct: 280 IAWLWYRSKDECAPSPQAVASHELIFERPENIRDPTTSLVLGKCHVGSAPLDVMRGPALS 339
Query: 336 HELKELFAEEKFILCLGSRW-------TDMVLEQNASGEDALRGWLVAAYAASMAKSFHD 388
E ++F +K+I+C + ++ + D L+ W+ +A + +S
Sbjct: 340 KEYLDIFHADKYIVCFNPSHETGHLPIIHICFKEGFTRADELKSWI---HAVEICRSIAL 396
Query: 389 PSLTVLQDAYDKMNDVFTPL-------LSELQAKGWHTD--RFLDGTGTR 429
+ V + A + + F + + ++ GW+ D G+G +
Sbjct: 397 NQVPVQKGALEHIRSTFQVVKEKYDHFVDHMKKAGWNFDESHLASGSGPK 446
>gi|299740308|ref|XP_001838896.2| aldo-keto reductase [Coprinopsis cinerea okayama7#130]
gi|298404184|gb|EAU82827.2| aldo-keto reductase [Coprinopsis cinerea okayama7#130]
Length = 768
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 108/224 (48%), Gaps = 13/224 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
F P+GYP SV YL Y ALQ F S+ S++S+++ L F G A ++ I
Sbjct: 331 FLPAGYPNSVTPDYLTYQVLNALQAFCSSLASLISSRATLQGFGVGSASASANDALLLTI 390
Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPL-----CPQLFLE 181
L D + ++ ++ LG+ + +E K++R +AD L D+ L+ LSP+ PQL +
Sbjct: 391 LLDIYSRLTTIVSAHLLGSSLATEAKKYRFVADILNDVAVVLDTLSPVFKAQGYPQLRVV 450
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAK----EGNLSDLFAKGEAISTLFNVVGLGAGI 237
L + + +AA ++ I FA +G+L DL AK + T+ + G+ G
Sbjct: 451 ALCLSASCRALCGIAAGGSKAAISVHFATPVKGKGDLGDLNAKDSSKETVLALFGMLLGS 510
Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
+ + ++ + LLS +H+ + + T+N QR
Sbjct: 511 LIVPRLQTAWSTYTTLFILLS-LHIAINYVAVTGLVLRTMNWQR 553
>gi|62320358|dbj|BAD94740.1| putative protein [Arabidopsis thaliana]
Length = 295
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 72/276 (26%), Positives = 123/276 (44%), Gaps = 19/276 (6%)
Query: 158 ADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLP----IYSSFAKEGN 213
AD + G+ ++ + L P FL +A GN AK +A R R P I + FA GN
Sbjct: 3 ADFIGSAGSFFDLATQLYPSQFLLLASTGNLAKAVA----RGLRDPSFRVIQNHFAISGN 58
Query: 214 LSDLFAKGEAISTLFNVVGLGAGIQLAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
L ++ AK E ++GLG GI + T + S ++ + ++H++ + +
Sbjct: 59 LGEVAAKEEVWEVAAQLIGLGFGILIIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAV 118
Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
NT+N +R +IV V + D RE++L R ++ V + ++
Sbjct: 119 LQFNTVNLKRARIIVESHVVHSVVPGYVDCNKRENILLWQRFMKPRIIFGVSLEELSGLE 178
Query: 332 P--SKFHELKELFAEEKFILCLGSRWTD----MVLEQNASGEDALRG-WLVAAYAASMAK 384
SK L +++ +EK+IL L D + + NA+ D LR W +M +
Sbjct: 179 KSVSKVKALLKMYTKEKYILTLNKLNKDTEFSVSFKVNATSRDVLRCLWQAYWLEENMEE 238
Query: 385 SFHDPSLTV--LQDAYDKMNDVFTPLLSELQAKGWH 418
SF D L+ + +M++ F L +L GW+
Sbjct: 239 SFKDKDSVFHWLKQSLSEMDNKFDDFLFKLDTAGWN 274
>gi|429856893|gb|ELA31783.1| duf647 domain-containing protein [Colletotrichum gloeosporioides
Nara gc5]
Length = 500
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 67/273 (24%), Positives = 135/273 (49%), Gaps = 18/273 (6%)
Query: 48 SVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQS 107
SV++ + + +V+D FL P GYP SV + Y Y + + Q F ++S+++
Sbjct: 63 SVQLPTIRKKPWRKVLDIFL----PDGYPHSVTDDYAAYQIYDSFQAFAGTIAGMISSRA 118
Query: 108 LLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYD 163
+ G+ +P A ++ ++++ M +G + ++L G +++E K +R+ +D L
Sbjct: 119 VWEGMGVGDSMASPIGAMMIQ-VIRESMGRLGTIAFAHLMGTSIEAECKAYRMASDVLCS 177
Query: 164 LGTGLEVLSPLCPQ----LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFA 219
+ L+ LSPL P+ L L + + A G VA A++ + FAK NL +L A
Sbjct: 178 IAFILDCLSPLFPRSIRFLILCCSSILFAASG---VAGNASKSSLSGHFAKWNNLGELNA 234
Query: 220 KGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNP 279
K + T +++G+ G + S + + + L++ I+++ + +RA +LN
Sbjct: 235 KDASQETAISLMGMLTGTLVVSCLTTPTTTWTTLFILIA-IYLFLNWKGVRAVKSRSLNR 293
Query: 280 QRTAMIVADFVKTGRISSPADLRYREDLLFPGR 312
QR + + + ++ +P + +E L+F R
Sbjct: 294 QRANIAFSALLSKDQVLTPGKISEKE-LIFERR 325
>gi|310789554|gb|EFQ25087.1| hypothetical protein GLRG_00231 [Glomerella graminicola M1.001]
Length = 484
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 131/285 (45%), Gaps = 23/285 (8%)
Query: 40 QFQPDGHLSVKVVDDSRPVYHRVVD------------SFLNKFFPSGYPFSVNEGYLRYT 87
++ G L K V+ + H VD L F P GYP SV++ Y Y
Sbjct: 27 EYDKAGSLKAKYVESIPIIGHPRVDVLLPVRRKKIWLQLLEIFLPDGYPHSVSDDYAAYQ 86
Query: 88 QFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLICSNL-G 144
+ ++Q F + ++S++++ G+ + A T ++ ++++ + ++L G
Sbjct: 87 IYDSIQAFAGSIAGMISSRAVWEGLGVGDSLASPTGAMLIQVIRESTGRFATITFAHLFG 146
Query: 145 ARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ----LFLEMAGLGNFAKGIAVVAARAT 200
+++E K +R+ +D L D+ L+ LSP P+ L L + L A G VA A+
Sbjct: 147 TSIEAECKAYRLASDILCDVAMVLDCLSPFFPRSIRFLVLCFSSLLYSASG---VAGNAS 203
Query: 201 RLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVI 260
+ + FAK NL +L AK + T +++G+ G + S + + + LL+ +
Sbjct: 204 KSSLSGHFAKWNNLGELNAKDASQETAISLMGMITGTFMVSLLTGPKATWVALISLLA-L 262
Query: 261 HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
H++ + +RA +LN QR + + ++ +P + RE
Sbjct: 263 HLFLNWKGVRAVKSRSLNRQRANIAFSALFSKDQVLTPVIVSERE 307
>gi|330932017|ref|XP_003303615.1| hypothetical protein PTT_15911 [Pyrenophora teres f. teres 0-1]
gi|311320239|gb|EFQ88270.1| hypothetical protein PTT_15911 [Pyrenophora teres f. teres 0-1]
Length = 391
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 157/346 (45%), Gaps = 51/346 (14%)
Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
TP A ++S I + L LG ++ E K +R AD D L+ LSP P
Sbjct: 34 TPTTALLLSVIHSSTGRIATILFAHRLGTALEPECKMYRFAADIFNDTAMVLDCLSPAFP 93
Query: 177 Q-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
+ + + + + + V A +++ + + FA++GNL ++ AK + T+ +++G+ A
Sbjct: 94 KPIRVAVLSFSSCLSALCGVCAGSSKASLSAHFARKGNLGEVNAKDSSQETVISLLGMLA 153
Query: 236 GIQLASTVCSSMQ--GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
G + S + S + G +I+ + + Y+ + RA ++ LN QR ++ ++ + G
Sbjct: 154 GSVVISYITSPLATWGTLILLLTIHLATNYAAV---RAVIMHCLNRQRANILFSNMFQHG 210
Query: 294 RISSPADLRYREDLLFPGRLIEDAGNVKVGR---------------DPHK-----VVKPS 333
+ SP D+ RE + G ++ + + +GR PHK +K
Sbjct: 211 LVLSPRDVSQRERVFERGGVLRWSDDKVLGRCSIGVPLQRLLNRLGTPHKQTGSLTLKSM 270
Query: 334 KFHELKELFAEEKFILCLGSRWTD--MVLEQNASGEDALRGWLVAAYAASMAK------- 384
+ +L ++FA E +IL S + +VL++ D ++ W A+A +AK
Sbjct: 271 EISDLLDVFANEAYILLPASTVEEGLIVLKRTCEPIDQIKAW---AHALLLAKRREGLEA 327
Query: 385 ---------SFHDPSL----TVLQDAYDKMNDVFTPLLSELQAKGW 417
+ H P++ + L+++ ++ +F E++ KGW
Sbjct: 328 AGLGRKRDDASHGPAMEGLISELRESLKEVQAMFARYGDEMRDKGW 373
>gi|326469821|gb|EGD93830.1| hypothetical protein TESG_01362 [Trichophyton tonsurans CBS 112818]
Length = 413
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 84/350 (24%), Positives = 152/350 (43%), Gaps = 65/350 (18%)
Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAG 184
+L++ M + ++ ++ LG ++ E K +R+ AD L D L+ LSP+ P+ + + +
Sbjct: 57 VLQESMGRIATILFAHRLGTSLEPECKMYRLAADILNDSAMVLDCLSPIFPKPVRVGLLS 116
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAI--STLFNVVGLGAGIQLAST 242
L + + + VAA +++ + + FA+ GNL +L A G I S L VV S
Sbjct: 117 LSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAVGLNIPPSQLTGVV--------LSY 168
Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
+ S ++ + + LL ++H+ + +RA + TLN QR ++ + + I +PA+
Sbjct: 169 ITSPLETWIALIVLL-IVHLGTNHAAVRAVKMTTLNRQRANIVFSHLFEDNLILTPAETS 227
Query: 303 YREDLLFPGRLIE-----------------------------DAGNVKVGRDPHKVVKPS 333
E + ++ D G ++ R + S
Sbjct: 228 KEERIFERDGVLRWKAESSGILGTCQIGTSLEQLLLLLPEQADRGALETSRTTSTTMDAS 287
Query: 334 -KFHELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGW----LVAAYAA----SM 382
L ELF EE++IL G SR +VL+ A+ + L+ W LV+ + A +
Sbjct: 288 TNLTALLELFREEEYILYFGPLSRRGAIVLKAGATAQAQLKAWSHALLVSRHLARKNGTA 347
Query: 383 AKSFHD---------PS---LTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
H+ PS L++L+D + F + L+ GW TD
Sbjct: 348 TSCIHEKQGGQGELSPSVSVLSILRDTLQEHTRTFKDRVKRLEHAGWQTD 397
>gi|406860777|gb|EKD13834.1| DUF647 domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 501
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 120/248 (48%), Gaps = 26/248 (10%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRY-----------TQFRALQHFTSAALSVLST-----QSL 108
S L+ F P+GYP SV E YL Y TQ + S ++ L + +S
Sbjct: 52 SVLDAFLPAGYPNSVTEDYLEYQIYVRAANKPWTQSELISEGLSTSVLKLHSWDAVLESR 111
Query: 109 LFAAGLRPTPAQATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTG 167
+ +P A ++S +L+ M + ++ ++ LG ++ E K +R+ AD D
Sbjct: 112 IGVGDSHASPTAALLLS-VLQQSMGRIATIVFAHRLGTALEPECKMYRLAADIFNDAAMI 170
Query: 168 LEVLSPLCPQLF-LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIST 226
L+ LSP P+ + + L + + + V A +++ + + FA +GNL +L AK + T
Sbjct: 171 LDCLSPALPKASRVALLSLSSVLRSMCGVTAGSSKASLSAHFATQGNLGELNAKDSSQET 230
Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVG---PLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
+ +++G+ AG S V S + K I LL H+ + +RA + TLN QR
Sbjct: 231 VISLLGMLAG----SVVVSYITSKTITWIALILLLATHLGTNYLAVRAVCMQTLNRQRAN 286
Query: 284 MIVADFVK 291
+++++ ++
Sbjct: 287 LVLSNILQ 294
>gi|347842141|emb|CCD56713.1| hypothetical protein [Botryotinia fuckeliana]
Length = 376
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 63/231 (27%), Positives = 121/231 (52%), Gaps = 25/231 (10%)
Query: 113 GLRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLE 169
G+ + A AT V+ IL++ + ++ ++ LG+ ++ E K++R++AD D L+
Sbjct: 4 GVGDSTASATHAVLLSILQESAGRISTILFAHRLGSALEPECKKYRLMADIFNDAAMILD 63
Query: 170 VLSPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIST 226
+SP P++FL A + K + VAA +++ + + FAK GNL++L AK + T
Sbjct: 64 CISPAFPKIPRVFLLSAS--SVCKSLCGVAAGSSKASLSAHFAKMGNLAELNAKDASQET 121
Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
L +++G+ G + S + S + + + LLS IH+ + +R+ + TLN QR +++
Sbjct: 122 LISLLGMLVGTFVVSKISSQVATWIALLALLS-IHLGTNYMAVRSVTMRTLNRQRANLVI 180
Query: 287 ADFVKT-----GRIS--SPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV 330
+ + + +IS SP D+ +E R+ E G + R+ H +
Sbjct: 181 SSLLSSPEHEKNKISLPSPRDISLQE------RIFERDGAI---RNTHGTI 222
>gi|339898153|ref|XP_001465441.2| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|321399425|emb|CAM67862.2| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 675
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 107/234 (45%), Gaps = 7/234 (2%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
+R RVV + P GYP SV E L + + Q + LST+++L G+
Sbjct: 57 ARQARRRVVTALERILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGV 116
Query: 115 RPTPAQAT--VVSWILKDGMQHVGKLICSNLGAR-MDSEPKRWRILADALYDLGTGLEVL 171
+ A T +SW+++DG + +G ++ ++L + ++ K W+++AD D+ LE+
Sbjct: 117 GESKADLTSSTLSWMMQDGTRMIGSIVFASLIPQGLECRAKTWQLVADFTTDIANLLELC 176
Query: 172 SPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
+P C +F + + K + V TR FA N +D+ AK S++
Sbjct: 177 APWLPGCQTIFRVVLVAASAIKALGGVCDSGTRASFAQHFALRNNAADIAAKAATRSSVG 236
Query: 229 NVVGLGAGIQLASTV-CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
+ +GL G +A +V +S + L S +++ +R + LN R
Sbjct: 237 SFIGLALGTAVAYSVPATSRSLNLAAFALCSAFRIFAHYRGVRGVQLRHLNAPR 290
>gi|398015008|ref|XP_003860694.1| hypothetical protein, conserved [Leishmania donovani]
gi|322498916|emb|CBZ33989.1| hypothetical protein, conserved [Leishmania donovani]
Length = 675
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 107/234 (45%), Gaps = 7/234 (2%)
Query: 55 SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
+R RVV + P GYP SV E L + + Q + LST+++L G+
Sbjct: 57 ARQARRRVVTALERILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGV 116
Query: 115 RPTPAQAT--VVSWILKDGMQHVGKLICSNLGAR-MDSEPKRWRILADALYDLGTGLEVL 171
+ A T +SW+++DG + +G ++ ++L + ++ K W+++AD D+ LE+
Sbjct: 117 GESKADLTSSTLSWMMQDGTRMIGSIVFASLIPQGLECRAKTWQLVADFTTDIANLLELC 176
Query: 172 SPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
+P C +F + + K + V TR FA N +D+ AK S++
Sbjct: 177 APWLPGCQTIFRVVLVAASAIKALGGVCDSGTRASFTQHFALRNNAADIAAKAATRSSVG 236
Query: 229 NVVGLGAGIQLASTV-CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
+ +GL G +A +V +S + L S +++ +R + LN R
Sbjct: 237 SFIGLALGTAVAYSVPATSRSLNLAAFALCSAFRIFAHYRGVRGVQLRHLNAPR 290
>gi|392585079|gb|EIW74420.1| DUF647-domain-containing protein [Coniophora puteana RWD-64-598
SS2]
Length = 515
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 104/438 (23%), Positives = 181/438 (41%), Gaps = 82/438 (18%)
Query: 57 PVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRP 116
P R+ ++ F P+GYP SV YL Y + ALQ F S+ +++++ L G+
Sbjct: 50 PSLDRIRRVLVDLFLPAGYPASVTPDYLPYHAYNALQAFFSSLAGLIASRGALQGVGV-- 107
Query: 117 TPAQATVVSWILKDGMQHV-GKLICSN----LGARMDSEPKRWRILADALYDLGTGLEVL 171
A AT S +L +Q V G++ + G + E K +R++AD + D VL
Sbjct: 108 GDASATATSALLLTVLQDVFGRITTISAAYVFGTSLYPETKTFRLVADLVNDASI---VL 164
Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATR---------------------LPIYSSFAK 210
LCP+L L L F I++ A+ A R +P ++
Sbjct: 165 DTLCPRLSLSF--LHTFLLAISLCASAALRALCGLVAGGSKAALTNHFACTVPASTAAGG 222
Query: 211 E----GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVI 266
E G++ +L AK + T+ + G+ G L + +S++ + L H+ +
Sbjct: 223 EAVVAGDVGELNAKDASRETVVGLFGMLLGTLLVPRL-TSVRATWLTLVFLIAGHLLANY 281
Query: 267 EEMRAAPVNTLNPQRTAMI--------------------VADFVKTGR--ISSPADLRYR 304
+R+ + TLN QR ++ ++ K G +PA++ R
Sbjct: 282 LAVRSVSLRTLNRQRAGILWTAWAARMREREDADDVSPSISSLAKAGGQLALTPAEVSRR 341
Query: 305 EDLL-FPGRLIEDAGN----VKVGRDPHKVVKPSKFHELKEL--------FAEEKFIL-C 350
E +L PG L + G+ ++G +VV ++ FA+E+++L
Sbjct: 342 ERILALPGTLRDARGHSIGYCELGASVSRVVARGGAENVRRALQSSSDAKFAKERYMLFA 401
Query: 351 LGSRWTDMVLE------QNASGEDALR--GWLVAAYAASMAKSFHDPSLTVLQDAYDKMN 402
L R + +E + A+ D L+ + A A A S + TV+++AY +
Sbjct: 402 LPRRRVESKVELTVCFKEGATPRDQLKAWAHALEAAARLRAASVSPDTNTVVEEAYAAVE 461
Query: 403 DVFTPLLSELQAKGWHTD 420
F P + L GW D
Sbjct: 462 AWFAPFMEALSGAGWRVD 479
>gi|224010239|ref|XP_002294077.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220970094|gb|EED88432.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 354
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 91/180 (50%), Gaps = 21/180 (11%)
Query: 75 YPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGM 132
YP +V GYL Y + +LQ +S V+ST ++L AAG+ A + ++W ++DG+
Sbjct: 19 YPHTVAPGYLEYQFYDSLQGLSSYLRGVVSTTAVLSAAGVGDAAATAMSAAMTWAIRDGL 78
Query: 133 QHVGKLICSNLG-ARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
VG L+ S + + D+ K +R++AD L D+G L++ P
Sbjct: 79 GMVGGLLFSYVASSHFDAHVKEFRLVADVLNDVGLTLDMALP------------------ 120
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
+ + RAT+ I F+ GN +D+ AK TL +++G+ G+ L++ S G++
Sbjct: 121 VVLSWTRATKGNITDHFSISGNRADVSAKESTQETLVSLIGMALGVWLSNLDSHSRVGEL 180
>gi|346467075|gb|AEO33382.1| hypothetical protein [Amblyomma maculatum]
Length = 388
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 94/199 (47%), Gaps = 3/199 (1%)
Query: 103 LSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILAD 159
L+ QS+L G+ A A +WIL+DG G+++ + G+ +D + K+WR+ AD
Sbjct: 4 LAIQSVLTGVGVGDQGASVLAATTTWILRDGAGMTGRILFAWFQGSNLDYDSKKWRLFAD 63
Query: 160 ALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFA 219
L D LE++ + + + AK + VA ATR + A+ N++D+ A
Sbjct: 64 ILNDAAIFLELMCQYFKGYVTPILCISSVAKSVVGVAGGATRAALTQHQARSNNMADVSA 123
Query: 220 KGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNP 279
K + TL N+ + L V + + L +++H+++ + + T N
Sbjct: 124 KDGSQETLVNLAAFLFSLLLLQLVAGNTWLVSTIFILFTLLHIFANYRAVSCVVMETFNR 183
Query: 280 QRTAMIVADFVKTGRISSP 298
R +++V F++ G +P
Sbjct: 184 SRYSVVVRHFLQKGGGIAP 202
>gi|258576451|ref|XP_002542407.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902673|gb|EEP77074.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 442
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 83/391 (21%), Positives = 170/391 (43%), Gaps = 67/391 (17%)
Query: 91 ALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSNLGARM 147
+LQ F+S+ +LS++++L G+ +P A +++ +L++ M + ++ ++ R+
Sbjct: 40 SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTAALLLN-VLQESMGRIATILFAH---RL 95
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYS 206
E K +R+ AD D ++ LSP+ P+ + + + L + + + VAA +++ + +
Sbjct: 96 --ECKMYRLAADIFNDSAIVMDCLSPMFPKPMRVGVLSLSSVLRAMCGVAAGSSKASLSA 153
Query: 207 SFAKEGNLSDLFAKGEAISTLFNVVG------LGAGIQLASTVCSSMQGKMIVGP----- 255
FA+ GNL++L AK + T+ +++G LG ++ + + G ++V
Sbjct: 154 HFARWGNLAELNAKDSSQETVISLMGMLVRPILGFLESISGLILTQQVGSLVVSRVTSTL 213
Query: 256 -------LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
LL IH+ + +RA + TLN QR ++ + R +P+ ++E +
Sbjct: 214 ATWAALILLLTIHITTNYFAVRAVNMTTLNRQRANIVFSTIFDENRTLTPSQAAHQERIF 273
Query: 309 FPGRLIE------DAGNVKVGRDPHKVVK------------PSKFHELKELFAEEKFILC 350
++ G+ ++G +V+ S L LF E++IL
Sbjct: 274 ERDGVLRWKASPASLGSCRIGVSFQYMVQLIASSESCTSDGQSIVAALLHLFEREEYILW 333
Query: 351 LG--SRWTDMVLEQNASGEDALRGWLVAAYAASM-------------------AKSFHDP 389
+ +VL+ NA+ L+ W A A + A H
Sbjct: 334 FNPTRKRGAIVLKTNATAASQLKAWSHALIVAQILRLGEWKSKTGIDQFILQPATRAHGL 393
Query: 390 SLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
L +L++ + ++ F L L+ GW D
Sbjct: 394 VLLILRETLTEHSENFDARLERLRNAGWDVD 424
>gi|149067629|gb|EDM17181.1| similar to cDNA sequence BC017158, isoform CRA_a [Rattus
norvegicus]
Length = 246
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 67/124 (54%), Gaps = 3/124 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL+Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D LE+++P+ P F
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTVSTS 194
Query: 187 NFAK 190
N AK
Sbjct: 195 NLAK 198
>gi|300175334|emb|CBK20645.2| unnamed protein product [Blastocystis hominis]
Length = 290
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 64/287 (22%), Positives = 123/287 (42%), Gaps = 28/287 (9%)
Query: 170 VLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
+++PL P FL +A + N K I+ +A AT I FA N+ D+ AK + ++
Sbjct: 1 MITPLFPNCFLALATISNIGKSISFLANGATGAGIRYGFANAMNIGDVTAKEGSQTSAVY 60
Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPL-LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
++G+ G+ ++S + + + +S + +Y++ + +R + TLN QR+ + D
Sbjct: 61 LLGMLFGVSVSSLLGDQSMSTIFASVICMSCLSLYAMNKSLRCVTLPTLNTQRSELACVD 120
Query: 289 FVKTGRISSPADLRYREDLLFPGRLIEDAG-NVKVGRDPHKVVKPSKFHELKE------L 341
+ G I P + E P + D G ++++G H V+ S + L+E L
Sbjct: 121 YCSQGIIYQPEAVAKLERYYLPYLHVADCGTDIRLGMSLHSVL--SAYRSLEEADDVIQL 178
Query: 342 FAEEKFILCLGSRWTDMV-----------------LEQNASGEDALRGWLVAAYAASMAK 384
F + ++L M L +AS D L+ +L A +
Sbjct: 179 FGDCNYMLSYCDLKQGMYRSIINRTEKTKCTIAISLNSSASTRDQLQAYLHALLLRHASG 238
Query: 385 SFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR-FLDGTGTRF 430
L +L++ ++ + + +L+ GW FL+ T R+
Sbjct: 239 GRFTTDLQLLKETKERSKLLLPFFVRDLEQNGWDCIHLFLEETKKRY 285
>gi|148685696|gb|EDL17643.1| cDNA sequence BC017158, isoform CRA_a [Mus musculus]
Length = 226
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 67/124 (54%), Gaps = 3/124 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P G+P SV+ YL Y + ++Q F S+ L+TQ++L G+ A A +W+
Sbjct: 75 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
+KD +G++I + G+++D K+WR+ AD L D+ LE+++P+ P F
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194
Query: 187 NFAK 190
N AK
Sbjct: 195 NLAK 198
>gi|240280131|gb|EER43635.1| DUF647 domain-containing protein [Ajellomyces capsulatus H143]
Length = 521
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 95/416 (22%), Positives = 168/416 (40%), Gaps = 71/416 (17%)
Query: 29 FETSDSV-TRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYT 87
F T DS RH + + D +V SR + S LN F P GYP SV++ YL
Sbjct: 34 FRTPDSSGVRHERRRVD------IVQPSRKDSSSYLASLLNVFLPVGYPHSVSDDYLD-- 85
Query: 88 QFRALQHFTSAALSV-LSTQSLLFAAGLRPTPAQATV--VSWILKDGMQHVGKLICSN-L 143
F S L T + G+ A TV + +L+D M + ++ ++ L
Sbjct: 86 ----CLGFASGVLKFNCRTAGVPGWVGVGDASASPTVALLHSVLQDSMGRIATILFAHRL 141
Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRL 202
G ++ E K +R+ AD L D + LSP P+ L + + + + + VAA +++
Sbjct: 142 GTSLEPECKMYRLAADVLNDSALIFDCLSPALPKRLRVVVLAFSSVLRALCGVAAGSSKA 201
Query: 203 PIYSSFAKEGNLSDL----------------------FAKGEAISTLFNVVGLGAGIQLA 240
+ S FAK GNL +L F +A L I L
Sbjct: 202 SLSSHFAKCGNLGELNAVRFPNVLVIAISGAAQRGFPFLFSDADQKLQKDSSQETVISLL 261
Query: 241 STVCSSMQGKMIVGPLLS--------VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
+C S+ I P + ++H+ + +R+ + TLN QRT ++ + +
Sbjct: 262 GILCGSVVVSHISTPFATWTTLLLLLLVHLSTNYAAVRSVNMTTLNRQRTNIVFSTLFEK 321
Query: 293 GRISSPADLRYREDLLFPGRLIE------DAGNVKVGRDPHKVVKPS------------K 334
G + +P + E + ++ G ++G ++++ S
Sbjct: 322 GSVLTPTEASKCERIFERDGILRWKASSATLGYCQIGGSFQELLRGSIHGANSIRDVGID 381
Query: 335 FHELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHD 388
++L LF +E+++L + +VL+ N + L+ W ++A +AK D
Sbjct: 382 IYKLLRLFEKEEYVLWFNPIHKKGTIVLKNNVTPISQLQAW---SHALRVAKRLTD 434
>gi|391863517|gb|EIT72825.1| hypothetical protein Ao3042_00909 [Aspergillus oryzae 3.042]
Length = 236
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 87/158 (55%), Gaps = 11/158 (6%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P+GYP SV++ Y+ Y F +LQ F+S+ +LS++++L G+ A A ++ I
Sbjct: 75 FLPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHI 134
Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF-----LE 181
L+D + ++ ++ +G ++ E K +R+ AD D L+ LSP+ P F L
Sbjct: 135 LQDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLS 194
Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFA 219
AG+ + + VA +++ + + F++ GNL+++ A
Sbjct: 195 TAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNA 229
>gi|392558453|gb|EIW51641.1| DUF647-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 450
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 97/408 (23%), Positives = 168/408 (41%), Gaps = 68/408 (16%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWI 127
F P GYPF+V+ +RY + A+Q F S+ + +++++L G+ A AT + I
Sbjct: 28 FLPEGYPFTVSPQCVRYQIWNAVQAFCSSLAGLFASRAVLQGHGVGNASASATDAIFLTI 87
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSP------------- 173
L+D + ++ LG + E K +R+LAD L D + LSP
Sbjct: 88 LQDVFSRLTTIVSGYYLGTSLYPEAKTYRLLADILNDAAMVTDTLSPHLAHFSFSLSYPF 147
Query: 174 LCPQ-------LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE--GNLSDLFAKGEAI 224
+ P + L M+G G AV L ++ + A E G++ DL AK +
Sbjct: 148 VVPASNSSLRVVALCMSGAFRALCG-AVAGGSKAALTVHFATAGERPGDVGDLSAKDGSK 206
Query: 225 STLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAM 284
T+ ++G+ +G + V + Q +V L H+ + +R + N QR +
Sbjct: 207 ETVLALLGMLSGSVVLRYV-HTAQATYVVLFGLIFCHLAANFIAVRVIAMRAFNRQRAGI 265
Query: 285 IVADFVKTGRISS--PADLRY-REDLLFPGRLIEDAGNVKV------------------- 322
+F ++ + + P L R++ R+ D+ + +
Sbjct: 266 AWCEFRRSFDVVTVVPDYLHVARQE-----RIFTDSARIPISQHARGTYAHCALGVSFLS 320
Query: 323 --GRDPHKVVKPSKFHELKELFAEEKFILCLGSR--WTDMVLEQNASGEDALRGWLVAAY 378
GR+P + L E+FA EK+IL S +VL+ D L+ W A
Sbjct: 321 LSGREP---FSDEQIATLLEVFAHEKYILWPASNPPQIAVVLKDGHGPHDHLKAWAHAHE 377
Query: 379 AA------SMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
A S A F++ L ++ A + ++ +F L Q GW T+
Sbjct: 378 IANICDRESHAHGFNE-CLVAIRYALENVSRMFPLFLRAAQEVGWKTE 424
>gi|449543750|gb|EMD34725.1| hypothetical protein CERSUDRAFT_116920 [Ceriporiopsis subvermispora
B]
Length = 507
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 101/433 (23%), Positives = 176/433 (40%), Gaps = 84/433 (19%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--V 123
F F P+GYP +V+ YL Y F LQ S+ S+L+++++L G+ A AT +
Sbjct: 53 FSGIFLPAGYPNTVSPDYLHYQIFNGLQAGCSSIASLLASRAVLEGHGVGEASASATNAM 112
Query: 124 VSWILKDGMQHVGKLICSN----LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-- 177
+ I++D V +LI LG + E K++R+LAD D L+V+SPL
Sbjct: 113 LVNIIQDV---VSRLITIGSGYYLGTSLFPEAKKYRLLADVYNDAAVVLDVISPLFTHWS 169
Query: 178 -------------LFLEMAGL---GNFAKGIAVVAARATRLPIYSSFAKE----GNLSDL 217
+L + L G+F + + + A ++ + FA G++ DL
Sbjct: 170 ISLVYPFIRREQGFYLRVLALCLSGSF-RALCGMVAGGSKAALTMHFATAGPVPGDIGDL 228
Query: 218 FAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTL 277
AK + T+ ++G+ G + V S ++ +L + H+ +R + T
Sbjct: 229 NAKDGSKETVLALIGMLLGTFVIKHVDSGFATYAVLS-MLILTHLAINFVAVRGVVMRTF 287
Query: 278 NPQRTAMIVADFVK--------TGRISSPADLRYRED-------LLFPGRLIEDAGNVKV 322
N R A A + + G I +P + +E L+ P R + G+ +
Sbjct: 288 NRHRAAAAWASYRRYEQGIAQYEGAILTPELMAVKERIFADPTLLMHPLRPEKRLGHCHL 347
Query: 323 GR-------DPHKVVKPSKFHELKEL-----------FAEEKFILCL--GSRWTDMVL-- 360
G + H+ PS+ + L F E++IL L SR + L
Sbjct: 348 GASLSTIFLETHRPCPPSRTSTVPNLTSAQLNVVLQTFENERYILWLRHDSRSESLQLVV 407
Query: 361 --EQNASGEDALRGWLVAAYAA--------SMAKSFHDPSLTVLQDAYDKMNDVFTPLLS 410
+ D L+GW+ A A M + H+ V+Q + ++ +F L
Sbjct: 408 CFKTGHQPIDRLKGWIHAQELAIALEDGVHPMFEKLHE----VVQTSLARVQKLFPLFLD 463
Query: 411 ELQAKGWHTDRFL 423
+ +GW + L
Sbjct: 464 AARGRGWMVEEGL 476
>gi|407259493|gb|AFT91377.1| hypothetical protein [Emericella rugulosa]
Length = 549
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 60/241 (24%), Positives = 114/241 (47%), Gaps = 23/241 (9%)
Query: 64 DSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV 123
D ++ F P+GYP SV++ Y + AL+ + + + +P A +
Sbjct: 67 DVLIDVFLPAGYPHSVSDDY-------------TPALAQPAIHAGVGVGNANASPTSALL 113
Query: 124 VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
+ IL+D + ++ ++ +G ++ E K +R AD DL L+ LSP+ P F +
Sbjct: 114 LH-ILQDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCLSPMIPAGFNRV 172
Query: 183 AGL--GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
L + + VA +++ + + FA+ GNL+D+ AK + T+ +++G+ G +
Sbjct: 173 TVLSAAGVLRALCGVAGGSSKASLSAHFARWGNLADVNAKDSSQETVISLIGMLVGSFVV 232
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
S + +S + L +H+ +R+ + TLN QR A+ V + +SS D
Sbjct: 233 SHI-TSFTATWLTLVFLVSMHLSLNYAAVRSVQMTTLNRQR-----ANIVFSTLLSSDPD 286
Query: 301 L 301
L
Sbjct: 287 L 287
>gi|452981822|gb|EME81581.1| hypothetical protein MYCFIDRAFT_100197, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 272
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 64/243 (26%), Positives = 115/243 (47%), Gaps = 6/243 (2%)
Query: 85 RYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICS 141
R ++Q F S+ +LS++++L + G+ +P A ++S + + +
Sbjct: 28 RVDAIDSIQAFVSSIAGLLSSRAVLLSVGVGDSSASPTSALLLSMLQQSLSRIAKVAFAY 87
Query: 142 NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL-EMAGLGNFAKGIAVVAARAT 200
LG ++ E K +R AD L+D L LSP+ P+ F + L + + V A +T
Sbjct: 88 RLGKAIEPECKMYRFCADLLHDSSFVLNCLSPMLPKAFRPPILALSSICFAVCDVVASST 147
Query: 201 RLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVI 260
+ + + FAK NL +L AK + T+ +++G+ G L + + + +LSV
Sbjct: 148 KASLSAHFAKWENLGELNAKDSSQETIISLLGMLTGSLLVNWISGHWATWTALILMLSV- 206
Query: 261 HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNV 320
H+ + +RA + TLN QR A +V + G I PA++ E + + DA N
Sbjct: 207 HLEANRRAVRAIVMQTLNRQR-ATVVYHRLCVGHIPRPAEVSRVERIFERNGAVRDAENR 265
Query: 321 KVG 323
+G
Sbjct: 266 LIG 268
>gi|388520739|gb|AFK48431.1| unknown [Medicago truncatula]
Length = 236
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 55/98 (56%), Gaps = 4/98 (4%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT----PAQATVVSWI 127
P G+P SV Y+ Y +RAL+HF A+ V +TQ+LL + G+ A A ++WI
Sbjct: 118 PEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLTSVGVSRNRATPAAAAVAINWI 177
Query: 128 LKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLG 165
L DG VGK++ + G + D + K+ R D L +LG
Sbjct: 178 LNDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELG 215
>gi|167378255|ref|XP_001734737.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165903647|gb|EDR29104.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 400
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 62/267 (23%), Positives = 117/267 (43%), Gaps = 12/267 (4%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL-FAAGLRPTPAQATVVSWILKD 130
P GYP SV GY R+ F + + + V+S ++L F +G A VV +
Sbjct: 51 PVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSVSAILRFNSG---HSAAFFVVFREVMT 107
Query: 131 GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAK 190
GM H+ +I G + K+WR+ D + +L +E+ P +F M+ + +
Sbjct: 108 GMMHL--IITERWGTSIVFFAKQWRMRIDLISELLRIIEIAFISTP-IFYSMSVTCSVIE 164
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
G+ + + + I +++AK N+++L K + + TLF VV L G + + +
Sbjct: 165 GVLQASRQVIKTRILNNYAKGNNVAELTEKVQNLETLFRVVSLLIGWLILHVLGDFTESG 224
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPADLRYREDLLF 309
+IV + H TLN +R ++++ F+ I SP + E +
Sbjct: 225 IIVFICALISHCVCNFLLSNVIVFKTLNYERLTILMSYFMNIKNEILSPVVVSRMESKIK 284
Query: 310 PGRLIEDAGNVKVGRDPHKVVKPSKFH 336
++ N+++G + + K S +
Sbjct: 285 VNEMV----NIRMGISLNSIPKSSNLY 307
>gi|409046959|gb|EKM56438.1| hypothetical protein PHACADRAFT_253558 [Phanerochaete carnosa
HHB-10118-sp]
Length = 464
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 71/259 (27%), Positives = 116/259 (44%), Gaps = 40/259 (15%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWI 127
F P+GYP SV+ Y RY ALQ F S+ +++ +++L G+ A AT ++ I
Sbjct: 2 FLPAGYPASVSGDYTRYQVLNALQGFFSSLGGLVAARAVLEGHGVGSADASATNALLLTI 61
Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
L+D V ++ LG + E K +R LAD D+ L+ LSPL + L+ L
Sbjct: 62 LQDIFSRVTAILSGYYLGTALFPEAKTYRFLADIWNDIPLVLDTLSPLLSHVSLDFVWLA 121
Query: 187 -------------NFAKGIAVVAARATR-------------LPIYSS----FAKEGNLSD 216
+GIA+ + R L ++ S A G++SD
Sbjct: 122 FPPFLLPTLVHRPGPWRGIALCFSGIFRSMCAVAAAGSKAALTLHFSQPDHGAASGDISD 181
Query: 217 LFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVI---HVYSVIEEMRAAP 273
L AK + T+ +VG+ G S + +Q + LL+VI H+Y +R+
Sbjct: 182 LSAKDGSKETVLALVGMLCG----SIIMPLIQTQTQTCGLLAVIIVFHIYINYYAVRSVV 237
Query: 274 VNTLNPQRTAMIVADFVKT 292
+ N QRT ++ + F ++
Sbjct: 238 FKSYNRQRTCILWSAFRRS 256
>gi|147803314|emb|CAN66551.1| hypothetical protein VITISV_039658 [Vitis vinifera]
Length = 470
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 72/313 (23%), Positives = 138/313 (44%), Gaps = 36/313 (11%)
Query: 124 VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
+ W+ KDG+ VG+L +G + K ++ +++DL T L P FL++A
Sbjct: 162 IRWVSKDGIGAVGRLF---IGLSVIKRAKGYK----SIFDLST------QLYPAYFLQLA 208
Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST- 242
LGN AK +A + I + FA GNL ++ AK E ++GL GI + ++
Sbjct: 209 SLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLALGIMVLTSP 268
Query: 243 -VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
V +S + ++H++ + + ++N +R ++V V + D
Sbjct: 269 LVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHSIVPGYVDC 328
Query: 302 RYREDLLFPGRLIEDAGNVKVGRDPHKVV---KP-SKFHELKELFAEEKFILCLGSRWTD 357
+E +L R ++ + G +++ +P SK L +L+++EK+IL + + TD
Sbjct: 329 NRKEKILSWQRFLKP--RIIFGVSLEELIGGERPVSKVRTLLKLYSQEKYILVVKQQGTD 386
Query: 358 ----MVLEQNASGEDALRG-----WLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPL 408
+ + A+ LR WL + S D L L+ + ++ + F
Sbjct: 387 FEVFVSFKVGATSLSVLRSIWQSYWLQENWCGS------DNILVQLEQSLLELEESFGDF 440
Query: 409 LSELQAKGWHTDR 421
+L++ GW +
Sbjct: 441 TQQLKSSGWDLQK 453
>gi|121709968|ref|XP_001272600.1| DUF647 domain protein [Aspergillus clavatus NRRL 1]
gi|119400750|gb|EAW11174.1| DUF647 domain protein [Aspergillus clavatus NRRL 1]
Length = 512
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 92/189 (48%), Gaps = 18/189 (9%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFR-ALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
+VD ++ F PSGYP SV + YL F A T+ L V TP
Sbjct: 65 LVDLLVDVFLPSGYPHSVTDDYLPRVYFHLATPQPTNNPLGV-------GVGNADATPTS 117
Query: 121 ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-- 177
A ++ IL+D + ++ ++ +G ++ E K +R+ AD DL L+ LSP+ P
Sbjct: 118 ALLLH-ILQDTSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMILDCLSPMIPAGV 176
Query: 178 ---LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
L AG+ + + VA +++ + + FA+ GNL+++ AK + T+ ++VG+
Sbjct: 177 SRITVLSTAGV---LRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLVGML 233
Query: 235 AGIQLASTV 243
G + S V
Sbjct: 234 VGSLVVSHV 242
>gi|169604674|ref|XP_001795758.1| hypothetical protein SNOG_05352 [Phaeosphaeria nodorum SN15]
gi|111066623|gb|EAT87743.1| hypothetical protein SNOG_05352 [Phaeosphaeria nodorum SN15]
Length = 473
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 84/391 (21%), Positives = 162/391 (41%), Gaps = 63/391 (16%)
Query: 52 VDDSRPVYHR-VVDSFLNKFFPSGYPFS------------------VNEGYLRYTQFRAL 92
VD +P + V + F P+GYP S V EG+L L
Sbjct: 28 VDVVKPAERKSYVQQVFDVFLPAGYPQSDSLQAFSSSIAGMLASRAVLEGHLDAEDTLRL 87
Query: 93 QHFTSAALS--------VLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSN-L 143
Q+ + + + S QS G A ++ +L++ M + ++ ++ L
Sbjct: 88 QNLVPSVYAASIGDQRHMSSHQSRCVGVGDSSASPTAALLLSVLQESMGRIATILFAHRL 147
Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA-GLGNFAKGIAVVAARATRL 202
G ++ E K +R+ AD D L+ LSP P+ F + + + + V A + +
Sbjct: 148 GTALEPECKMYRLAADVFNDTAMILDCLSPAFPKPFRVLVLSFSSVLRSLCGVCAGSAKA 207
Query: 203 PIYSSFAKEGNLSDLFAKGEAI--STLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVI 260
+ + FA++GNL ++ A + + F++ GL G +T I+ LLS I
Sbjct: 208 SLSAHFARKGNLGEVNAHITHMNPNVDFDIEGLEPGNGHIAT------WNAILIALLS-I 260
Query: 261 HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPG---RLIEDA 317
H+ + ++A + LN QR ++ + + G + +P ++ RE + R ++D
Sbjct: 261 HLATNYAAVKAVSMRCLNRQRANIVFGNLMHHGSVLTPTEVSQRERIFERDGVLRWVDDC 320
Query: 318 --GNVKVG---------------RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTD--M 358
G+ +G R + +L ++FA++K++L + +
Sbjct: 321 VIGHCTIGVPLGVLLASMSQRYRRTGSLELSDINLPDLMDIFADQKYVLWFTDSEYEAIV 380
Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHDP 389
VL++ + D L+ W A+A A+ +P
Sbjct: 381 VLKEGCTPIDQLKAW---AHALLFAQDLRNP 408
>gi|159474944|ref|XP_001695583.1| hypothetical protein CHLREDRAFT_191958 [Chlamydomonas reinhardtii]
gi|158275594|gb|EDP01370.1| predicted protein [Chlamydomonas reinhardtii]
Length = 487
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/138 (36%), Positives = 74/138 (53%), Gaps = 8/138 (5%)
Query: 91 ALQHFTSAALSVLSTQSLLFAAGLR----PTPAQATVVSWILKDGMQHVGKLI-CSNLGA 145
L + A SVL++ +L+A GL PT A ++W+LKDGM +G L+ +
Sbjct: 142 GLNNIAVTANSVLASTFMLYAVGLGAGAIPT---AGALNWVLKDGMGQLGTLVFGKTIAH 198
Query: 146 RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIY 205
D K W L+ L LE+L+ L P FL M L N KG+A +AA +TR +
Sbjct: 199 NFDVHSKTWFFLSAVLLQAAAALEMLTVLVPGHFLLMGSLANMLKGLAWMAAGSTRSVFH 258
Query: 206 SSFAKEGNLSDLFAKGEA 223
SFA++ N++D+ AKG +
Sbjct: 259 LSFARDNNIADVTAKGTS 276
>gi|67467345|ref|XP_649789.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56466294|gb|EAL44403.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449707066|gb|EMD46790.1| Hypothetical protein EHI5A_044670 [Entamoeba histolytica KU27]
Length = 400
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 58/269 (21%), Positives = 116/269 (43%), Gaps = 16/269 (5%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKD- 130
P GYP SV GY R+ F + + + V+S +A LR P + + ++
Sbjct: 51 PVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSV-----SAILRINPGHSAAFFVVFREV 105
Query: 131 --GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
GM H+ +I G + K+WR+ + + +L +E+ P +F ++ +
Sbjct: 106 MTGMMHL--IITERWGTSIVFFAKQWRMRIELISELLRIIEIAFISTP-IFYSISVTCSV 162
Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
+G+ + + + I +++AK N+++L K + + TLF VV L G + + +
Sbjct: 163 IEGVLQASRQVIKTRILNNYAKGNNVAELTEKVQNLETLFRVVSLLIGWLILHVLGDFTE 222
Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPADLRYREDL 307
+IV + H TLN +R ++++ F+ I SP + E
Sbjct: 223 SGIIVFICALISHCICNFLLSNVIVFKTLNYERLTILMSYFMNIKNEILSPVVVSRIESK 282
Query: 308 LFPGRLIEDAGNVKVGRDPHKVVKPSKFH 336
+ ++ N+++G + + K S +
Sbjct: 283 IKVNEMV----NIRMGISLNSIPKSSDLY 307
>gi|342186523|emb|CCC96010.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 596
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 62/249 (24%), Positives = 102/249 (40%), Gaps = 7/249 (2%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
P+GYP S EG+ RY F S+ S ++ QS+L L +P W+LKD
Sbjct: 143 PTGYPDSCAEGFRRYFLLSLCSTFVSSFASSIAYQSILNGFLLVSSPQL-----WMLKDL 197
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ +N ++ PK W +++ L++ LE++ P + P L A L + +
Sbjct: 198 APALAAAYLANRVVSYENRPKFWFVVSVVLHNTSIVLEMIIPSVVPHNLLTAAVLTSCVR 257
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
A + TR FA GNL++L K + + V GI V +S+ +
Sbjct: 258 QSASLMFLVTRASALQHFAVSGNLAELTKKFNSFGIVIYTVSTALGIAYTYYV-ASLTAQ 316
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
++ ++ M LN ++I+ +VK G + L RE F
Sbjct: 317 LVTVLFCCGANLVLSYMSMCNIAFRVLNGTTISVILRAYVKEGCLGPGHVLSPREVSDFI 376
Query: 311 GRLIEDAGN 319
G + D G
Sbjct: 377 GLRMLDVGT 385
>gi|407034154|gb|EKE37096.1| hypothetical protein ENU1_209440 [Entamoeba nuttalli P19]
Length = 400
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 58/269 (21%), Positives = 116/269 (43%), Gaps = 16/269 (5%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKD- 130
P GYP SV GY R+ F + + + V+S +A LR P + + ++
Sbjct: 51 PVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSV-----SAILRINPGHSAAFFVVFREV 105
Query: 131 --GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
GM H+ +I G + K+WR+ + + +L +E+ P +F ++ +
Sbjct: 106 MTGMMHL--IITERWGTSIVFFAKQWRMRIELISELLRIIEIAFISTP-IFYSISVTCSV 162
Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
+G+ + + + I +++AK N+++L K + + TLF VV L G + + +
Sbjct: 163 IEGVLQASRQVIKTRILNNYAKGNNVAELTEKVQNLETLFRVVSLLIGWVILHVLGDFTE 222
Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPADLRYREDL 307
+IV + H TLN +R ++++ F+ I SP + E
Sbjct: 223 SGIIVFICALISHCICNFLLSNVIVFKTLNYERLTILMSYFMNIKNEILSPVVVSRIESK 282
Query: 308 LFPGRLIEDAGNVKVGRDPHKVVKPSKFH 336
+ ++ N+++G + + K S +
Sbjct: 283 IKVNEMV----NIRMGISLNSIPKSSDLY 307
>gi|302678497|ref|XP_003028931.1| hypothetical protein SCHCODRAFT_40050 [Schizophyllum commune H4-8]
gi|300102620|gb|EFI94028.1| hypothetical protein SCHCODRAFT_40050 [Schizophyllum commune H4-8]
Length = 278
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/273 (27%), Positives = 121/273 (44%), Gaps = 16/273 (5%)
Query: 63 VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
VD F P+G+P SV+ YLRY + A+Q F S+ +LS+++LL G+ A AT
Sbjct: 1 VDVLAKVFLPAGFPNSVSPDYLRYQIYNAIQAFCSSLAGLLSSRALLEGFGVGDANATAT 60
Query: 123 --VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-- 177
V+ + +D + +I + LG + E K +R LAD D + +P+
Sbjct: 61 NAVLLTVAQDVFSRLTTIIAAYLLGPSLFPEAKTFRFLADVFNDTAIIFDASTPILRASG 120
Query: 178 ------LFLEMAGLGNFAKGIAVVAARAT-RLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
L +AG G A+A L + G++ D+ AK + TL +
Sbjct: 121 HANLRVAALCLAGASRAICGTICGGAKAALTLHFATPLRGTGDIGDVNAKDASKETLLAL 180
Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
+G+ AG + + SS Q + L H+ +R + TLN QR ++ F
Sbjct: 181 MGMLAGSFVVHHL-SSTQATHALLLALLAAHLTVNYLGVRGVCLRTLNRQRASLAWTSFR 239
Query: 291 KTGRISSPADLRYREDLLF-PGRLIE--DAGNV 320
+ G SP + E +L PG ++ D G++
Sbjct: 240 EYGSAPSPTIVAGMEGILARPGAIVHAGDGGHI 272
>gi|303288586|ref|XP_003063581.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454649|gb|EEH51954.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 290
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 49/88 (55%), Gaps = 4/88 (4%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ---ATVVSW 126
F P+GYP +V+ Y R+ ++ +F L VL++QSLL A G+ TP A W
Sbjct: 2 FLPAGYPDTVSSDYARWLRWHLFSNFFRNVLEVLTSQSLLVALGMGSTPGALPLAAATKW 61
Query: 127 ILKDGMQHVGKLICSNLGA-RMDSEPKR 153
+LKDG+ L+ +GA R D +PKR
Sbjct: 62 VLKDGVGSFATLLSGVVGAQRYDEDPKR 89
>gi|159124134|gb|EDP49252.1| DUF647 domain protein [Aspergillus fumigatus A1163]
Length = 542
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 111/251 (44%), Gaps = 36/251 (14%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS 125
++ F PSGYP SV + YL LQ + T +LL
Sbjct: 70 LVDVFLPSGYPNSVTDDYLPAV----LQGVGVGNANASPTSALLL--------------- 110
Query: 126 WILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
IL+D + ++ ++ +G ++ E K +R+ AD DL L+ LSP+ P AG
Sbjct: 111 HILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMVLDCLSPMIP------AG 164
Query: 185 LGNFA--------KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
+G + + VA +++ + + FA+ GNL+++ AK + T+ +++G+ G
Sbjct: 165 VGRVGVLSAAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLLGMLVG 224
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
+ S V +S L +H+ +R+ + +LN QR ++ + +++
Sbjct: 225 SFIVSHV-TSFAATWTSLVFLLTVHLAMNYAAVRSVQMTSLNRQRANIVFSTLLESDPAL 283
Query: 297 SPADL-RYRED 306
P L R R D
Sbjct: 284 DPHRLSRSRVD 294
>gi|70991459|ref|XP_750578.1| DUF647 domain protein [Aspergillus fumigatus Af293]
gi|66848211|gb|EAL88540.1| DUF647 domain protein [Aspergillus fumigatus Af293]
Length = 542
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 111/251 (44%), Gaps = 36/251 (14%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS 125
++ F PSGYP SV + YL LQ + T +LL
Sbjct: 70 LVDVFLPSGYPNSVTDDYLPAV----LQGVGVGNANASPTSALLL--------------- 110
Query: 126 WILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
IL+D + ++ ++ +G ++ E K +R+ AD DL L+ LSP+ P AG
Sbjct: 111 HILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMVLDCLSPMIP------AG 164
Query: 185 LGNFA--------KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
+G + + VA +++ + + FA+ GNL+++ AK + T+ +++G+ G
Sbjct: 165 VGRVGVLSAAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLLGMLVG 224
Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
+ S V +S L +H+ +R+ + +LN QR ++ + +++
Sbjct: 225 SFIVSHV-TSFAATWTSLVFLLTVHLAMNYAAVRSVQMTSLNRQRANIVFSTLLESDPAL 283
Query: 297 SPADL-RYRED 306
P L R R D
Sbjct: 284 DPHRLSRSRVD 294
>gi|303281718|ref|XP_003060151.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458806|gb|EEH56103.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 814
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 60/120 (50%), Gaps = 3/120 (2%)
Query: 57 PVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-- 114
P+ + + F P GYP SV+ YL + A Q S A +L + +LL AG+
Sbjct: 224 PLLRACLRALRRAFLPEGYPRSVSSDYLAFQLCDAAQGVCSYARGILCSTALLRGAGVGS 283
Query: 115 RPTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP 173
A + ++ +D +G + + G MD++ KRWR+ ADA+ DLG +E+ SP
Sbjct: 284 STATAASATAQFVARDLTSMLGGVAFAATRGRAMDADAKRWRLFADAMNDLGMAIELASP 343
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 31/117 (26%), Positives = 51/117 (43%), Gaps = 17/117 (14%)
Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAK-----------------EGNLSDLFAKG 221
FL A LG A+ + V +RATR + FA G+++D+ AK
Sbjct: 410 FLVAACLGGLARSLCGVTSRATRAALTQHFAAGDDAGRSFTRGRTGGSTTGSVADVDAKE 469
Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLN 278
T ++G+GAG+ + LL+ +HV+ + MR+ +NT+N
Sbjct: 470 GTQETAATLIGMGAGVLVTRLAADDAAAIWFWFALLTALHVFFNVRAMRSLRLNTMN 526
>gi|350630832|gb|EHA19204.1| hypothetical protein ASPNIDRAFT_186948 [Aspergillus niger ATCC
1015]
Length = 504
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 58/237 (24%), Positives = 111/237 (46%), Gaps = 29/237 (12%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
V D + F P+GYP SV + Y A +VL + A PT A
Sbjct: 68 VQDFLVEVFLPAGYPHSVTDDY---------------APAVLQGVGV-GNADASPT---A 108
Query: 122 TVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ--- 177
++ IL+D + ++ ++ +G ++ E K +R+ AD DL L+ LSP+ P
Sbjct: 109 ALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMTLDCLSPMIPAGAP 168
Query: 178 --LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
L AG+ + + VA +++ + + F++ GNL+++ AK + T+ +++G+
Sbjct: 169 RVTVLSTAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLV 225
Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
G + S V ++ I +L +H+ +R+ + +LN QR ++ + + T
Sbjct: 226 GSFVVSRV-TNFSTTWICLLMLQALHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNT 281
>gi|390341857|ref|XP_003725545.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1
[Strongylocentrotus purpuratus]
Length = 311
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 53/233 (22%), Positives = 107/233 (45%), Gaps = 8/233 (3%)
Query: 161 LYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK 220
+ D +E++S PQ F+ +A + + K + VA ATR + A+ N++D+ AK
Sbjct: 1 MNDAAMCIELISVFFPQYFVIIACVSSLFKSMVGVAGGATRAALTMHQARRNNMADVSAK 60
Query: 221 GEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQ 280
+ TL N+ L G+ + ++ + L + +H+Y+ + + TLN
Sbjct: 61 DGSQETLVNLAALFVGLIITPIASKNILLTWSLFFLCTCLHLYANYRAVTCVVMETLNQN 120
Query: 281 RTAMIVADFVKTGR--ISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHEL 338
R ++V D++ + + +S P + RE +++ + + +G V K + E+
Sbjct: 121 RFHILVQDYLHSSQFSMSGPDAVNSREPVIWR---LSRPLKLNLGVPFRTVAKSMRDIEM 177
Query: 339 K-ELFAEEKFILCLGSR--WTDMVLEQNASGEDALRGWLVAAYAASMAKSFHD 388
+ +EKF+L L + D+V+ +++ ED ++ L A HD
Sbjct: 178 CLQSNGQEKFLLSLDVKKGCIDIVIHHDSTSEDFIKACLQAEIIMYTTFHLHD 230
>gi|384494915|gb|EIE85406.1| hypothetical protein RO3G_10116 [Rhizopus delemar RA 99-880]
Length = 188
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 51/92 (55%), Gaps = 3/92 (3%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P GYP SV+ Y ++ + L+ + +A+ VL +Q++L + GL A A + W+
Sbjct: 58 FLPVGYPESVHSCYKKFHSWLFLETYVGSAVGVLCSQAMLASLGLGTVEATGGAVAIQWV 117
Query: 128 LKDGMQHVGKL-ICSNLGARMDSEPKRWRILA 158
LKDG+ +GKL + DS PK W+ ++
Sbjct: 118 LKDGIGEIGKLFFIKKYASTFDSHPKTWKFVS 149
>gi|340059715|emb|CCC54109.1| conserved hypothetical protein, fragment, partial [Trypanosoma
vivax Y486]
Length = 618
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 54/227 (23%), Positives = 96/227 (42%), Gaps = 7/227 (3%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
PSGYP S G+ RY S+ S + QS+L L +P W+LKD
Sbjct: 183 PSGYPHSCGVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLASSPQL-----WMLKDL 237
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ + +N ++ PK W +++ +++L E++ P L PQ L A + +
Sbjct: 238 VPALAAAYLANRIVSYENRPKFWFVVSVGMHNLSVIAEMIVPSLVPQHLLFAAVTTSCIR 297
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
A + TR FA NL++L K + + V GI S V +S+ +
Sbjct: 298 QSASLMFLVTRASALQHFAISNNLAELTKKFNSFGIVIYTVATALGIAFTSVV-TSLTAQ 356
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
+I + +++ + M LN ++I+ ++++ G S
Sbjct: 357 LITVLMCCAVNLIISHQSMCKIAFRMLNGTTISVILHEYIQNGSTCS 403
>gi|119467992|ref|XP_001257802.1| hypothetical protein NFIA_052500 [Neosartorya fischeri NRRL 181]
gi|119405954|gb|EAW15905.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 515
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 52/223 (23%), Positives = 111/223 (49%), Gaps = 20/223 (8%)
Query: 91 ALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARM 147
+LQ F S+ +LS++++L G+ A T ++ IL+D + ++ ++ +G +
Sbjct: 86 SLQAFCSSIAGLLSSRAVLQGVGVGNANASPTSALLLHILQDSSGRIATILFAHRVGTAL 145
Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFA--------KGIAVVAARA 199
+ E K +R+ AD D+ L+ LSP+ P AG+G + + VA +
Sbjct: 146 EPECKMYRLAADVFNDVAMVLDCLSPMIP------AGVGRVGVLSAAGVLRALCGVAGGS 199
Query: 200 TRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV-CSSMQGKMIVGPLLS 258
++ + + FA+ GNL+++ AK + T+ +++G+ G + S V C + ++ L
Sbjct: 200 SKASLSAHFARGGNLAEVNAKDSSQETIISLIGMLVGSFIVSHVTCFAATWASLL--FLL 257
Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
+H+ +R+ + +LN QR ++++ +++ P L
Sbjct: 258 TVHLGMNYAAVRSVQMTSLNRQRANIVLSTLLESDLALDPRSL 300
>gi|414876406|tpg|DAA53537.1| TPA: hypothetical protein ZEAMMB73_477539 [Zea mays]
Length = 295
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 67/274 (24%), Positives = 117/274 (42%), Gaps = 14/274 (5%)
Query: 158 ADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDL 217
AD + G+ E+ + L P FL +A LGN AK IA + I + FA+ GNL ++
Sbjct: 3 ADFIGSAGSIFELSTTLYPGYFLPLASLGNLAKAIARGFKDPSFRVIQNHFAESGNLGEV 62
Query: 218 FAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSV--IHVYSVIEEMRAAPVN 275
AK E ++GL G+ + +V LSV +H++ + +
Sbjct: 63 AAKEEVWEVGAQLLGLSIGVLIMDKTGVKSSYLTLVLTWLSVRLLHLWLRYQSISVLKFR 122
Query: 276 TLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSK- 334
T+N +R ++V V + E++L R + V G +++ K
Sbjct: 123 TINLKRGRILVKSHVAQHTVPGYVACNEEENILTWERYLRP--QVSFGVPMERMLGGDKS 180
Query: 335 ---FHELKELFAEEKFILC---LGSRWTDMVL--EQNASGEDALRGWLVAAYAASMAKSF 386
+ L +L+ EK+IL GSR ++ +++A+ LR L A+ +
Sbjct: 181 SDMVNRLLKLYKNEKYILFFEQFGSREPTFLVTFKESATSMSVLRS-LWQAHWLHKNQLR 239
Query: 387 HDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
D L+++ + + FT L +L + GW D
Sbjct: 240 QDEVFPWLEESIVALENGFTDFLEQLGSAGWDQD 273
>gi|17945727|gb|AAL48912.1| RE31721p [Drosophila melanogaster]
Length = 266
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 57/257 (22%), Positives = 111/257 (43%), Gaps = 16/257 (6%)
Query: 167 GLEV-LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIS 225
G+E+ + P P ++ K I VA ATR + A GNL+D+ +K +
Sbjct: 2 GIEIYVLPKYPHFSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQE 61
Query: 226 TLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
T N+V G+ L S + S I ++S +H+Y+ ++ +RA + + N R +
Sbjct: 62 TCVNLVASFVGLYLLSLIKSQAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIA 120
Query: 286 VADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-- 343
+ +F ++ R+ SP + E + G+ + + N+K+G ++ K + E
Sbjct: 121 LEEFFRSSRMLSPQQVNAMERVTV-GQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSS 179
Query: 344 ---EEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDK 400
E FI+ + + L +D L+ + +A S + + L++ Y
Sbjct: 180 FDPHEHFIIAQTKKCLGVYLHFETRPQDVLKAYF---FAVSYLQDRNQ-----LKEKYWD 231
Query: 401 MNDVFTPLLSELQAKGW 417
+ + L+ Q +GW
Sbjct: 232 IQTKWQEFLALAQQEGW 248
>gi|242208267|ref|XP_002469985.1| predicted protein [Postia placenta Mad-698-R]
gi|242226469|ref|XP_002477622.1| predicted protein [Postia placenta Mad-698-R]
gi|220722555|gb|EED77179.1| predicted protein [Postia placenta Mad-698-R]
gi|220731047|gb|EED84896.1| predicted protein [Postia placenta Mad-698-R]
Length = 226
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 109/225 (48%), Gaps = 12/225 (5%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQAT--VVS 125
F P+G+P +V+ G +Y + ALQ F S+ +++++++L F G+ A AT +
Sbjct: 5 FLPAGFPATVSPG--KYQIYNALQAFCSSLAGLIASRAVLQGFRHGVGKAGASATHAIFL 62
Query: 126 WILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
+L+D + ++ LG + E K +R++AD D L+ LSP +
Sbjct: 63 TVLQDIFSRLTTIVAGYYLGTSLFPEAKAYRLMADIFNDAAIILDTLSPHLGSFRVASLC 122
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAK----EGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
L + + V A ++ + FA G++ DL AK + T+ ++G+ +G +
Sbjct: 123 LSGAFRALCGVVAGGSKAALTVHFATTAKGAGDVGDLSAKDGSKETVLALLGMLSGTAVM 182
Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
V S+ +++ L++ +H++ +R +LN QR +++
Sbjct: 183 HYVHSTRATYVVLFALIA-LHLFCNYLAVRVVVFRSLNRQRASIV 226
>gi|390341859|ref|XP_795645.3| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2
[Strongylocentrotus purpuratus]
Length = 311
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 108/239 (45%), Gaps = 20/239 (8%)
Query: 161 LYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK 220
+ D +E++S PQ F+ +A + + K + VA ATR + A+ N++D+ AK
Sbjct: 1 MNDAAMCIELISVFFPQYFVIIACVSSLFKSMVGVAGGATRAALTMHQARRNNMADVSAK 60
Query: 221 GEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQ 280
+ TL N+ L G+ + ++ + L + +H+Y+ + + TLN
Sbjct: 61 DGSQETLVNLAALFVGLIITPIASKNILLTWSLFFLCTCLHLYANYRAVTCVVMETLNQN 120
Query: 281 RTAMIVADFVKTGR--ISSPADLRYREDLLF----PGRLIEDAGNVKVGRDPHKVVKPSK 334
R ++V D++ + + +S P + RE +++ P +L N+ V P + V S
Sbjct: 121 RFHILVQDYLHSSQFSMSGPDAVNSREPVIWRLSRPLKL-----NLGV---PFRTVAQS- 171
Query: 335 FHELKELF---AEEKFILCLGSR--WTDMVLEQNASGEDALRGWLVAAYAASMAKSFHD 388
+L+ +EKF+L + D+ + +++ ED ++ L A HD
Sbjct: 172 MRDLEMCLQSNGQEKFLLSFDVKKGCIDIAIHHDSTSEDFIKACLQAEIIMYTTFHLHD 230
>gi|115389162|ref|XP_001212086.1| predicted protein [Aspergillus terreus NIH2624]
gi|114194482|gb|EAU36182.1| predicted protein [Aspergillus terreus NIH2624]
Length = 411
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 62/115 (53%), Gaps = 5/115 (4%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQAT 122
+ F P+GYP SV + Y Y F +LQ F+S+ +LS++++L G+ +P A
Sbjct: 57 LIEVFLPAGYPHSVTDDYTAYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNPNASPTSAL 116
Query: 123 VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
++ IL+D + ++ ++ +G + E K +R+ AD DL L+ LSP P
Sbjct: 117 LLH-ILQDTSGRLATILFAHRVGTALQPECKSYRLAADLFNDLAMILDCLSPAAP 170
>gi|429329557|gb|AFZ81316.1| hypothetical protein BEWA_007250 [Babesia equi]
Length = 743
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 57/299 (19%), Positives = 125/299 (41%), Gaps = 48/299 (16%)
Query: 32 SDSVTRHYQFQPDGHLSVKVVDDS-------RPVYHRVVDSFLNKFFPSG-----YPFSV 79
S S +R+ +PD + + ++ + R ++R++ +F F +P
Sbjct: 122 SSSDSRNNSAKPDPTIVSRGINQNYGRKLSVRSFFNRIIGAF--NFLKRCINIIIHPSVA 179
Query: 80 NEGYLRYTQFRALQHFTSAALSVLSTQS------------------LLFAAGLRPTPAQA 121
+ Y +Y +R + F+ + L +++ L+ + + ++
Sbjct: 180 TKQYYKYAHWRMFERFSFSVLQTMASNIKSAPTNLQQNKVENTNIILMLKNKISESKMKS 239
Query: 122 TVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC---PQ 177
+ + I KD + + S +G D+ PK +R+L L T ++ + + P+
Sbjct: 240 ILATMIFKDAFSRIFHFLWFSEIGVGFDNNPKAFRLLGSILCSTATLIDFVCNVFTFGPK 299
Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG---NLSDLFAKGEAISTLFNVVGLG 234
+ M N + I ++ A++ P Y+SF + N+ ++ AK EA + + + G+
Sbjct: 300 MI--MGACTNAVRQIGLLTMSASQGPFYNSFHVKNTATNIGEITAKLEAQNPICDFSGIA 357
Query: 235 AGIQLASTVCSS----MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
GI L + + +I+ L+S + Y + ++ LNPQR +++ DF
Sbjct: 358 LGIYLTNLLNEQPFVIQATTVILTSLISNVATYMCV---KSVSFKNLNPQRCFVVLEDF 413
>gi|255945167|ref|XP_002563351.1| Pc20g08280 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588086|emb|CAP86157.1| Pc20g08280 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 504
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 88/199 (44%), Gaps = 16/199 (8%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYL-----------RYTQFRALQHFTSAALSVLSTQSLLF 110
++ S + F PSGYP SV++ YL + L T A + TQ
Sbjct: 61 IISSLSDVFLPSGYPQSVSDDYLPPPLLPSGSSRSISHPAQLHRPTQRAKQLTPTQHTGV 120
Query: 111 AAGLRPTPAQATVVSWILKDGMQHVGKLIC--SNLGARMDSEPKRWRILADALYDLGTGL 168
G + ++ IL+D + IC +G ++ E K +R+ AD D+ L
Sbjct: 121 GVGNANASPTSALLLHILQDISGRIAT-ICFAHRIGTALEPECKTYRLAADVFNDIAMIL 179
Query: 169 EVLSPLCPQLFLEMAGLGNFA--KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIST 226
+ LSP P +A L + + VA +++ + + FAK GNL++L AK + T
Sbjct: 180 DCLSPGVPAGPARVAVLSTAGVLRALCGVAGGSSKASLSAHFAKWGNLAELNAKDSSQET 239
Query: 227 LFNVVGLGAGIQLASTVCS 245
+ ++ G+ G + S + S
Sbjct: 240 VISLFGMLVGSVVISHITS 258
>gi|159490370|ref|XP_001703152.1| hypothetical protein CHLREDRAFT_195109 [Chlamydomonas reinhardtii]
gi|158270782|gb|EDO96617.1| predicted protein [Chlamydomonas reinhardtii]
Length = 280
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 48/77 (62%), Gaps = 2/77 (2%)
Query: 83 YLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLIC 140
Y Y +R +Q+F A+SV +TQSLL A G+ R + A ++W++KDG +G+L+
Sbjct: 11 YAPYMAWRGVQYFFGGAISVFTTQSLLGALGVAGRYSGEAAAAINWVIKDGAGRLGRLLF 70
Query: 141 SNLGARMDSEPKRWRIL 157
+ G +D E K++R++
Sbjct: 71 ARWGRELDCELKQFRLM 87
>gi|301123209|ref|XP_002909331.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100093|gb|EEY58145.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 122
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 60/119 (50%), Gaps = 26/119 (21%)
Query: 106 QSLLFAAGLR----PTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADAL 161
QSLL+ AGL PT A V+ +L+DG+ +WRI +
Sbjct: 2 QSLLYDAGLVAGAIPT---AAAVNRVLRDGL-------------------GQWRITSSVA 39
Query: 162 YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK 220
D+ E+L+PL P FL +A L N AK +A ++A ATR ++SF NL+D+ AK
Sbjct: 40 LDVAVLGEILTPLAPGSFLVIASLTNVAKNVAWLSASATRAGFHNSFVIRENLADVTAK 98
>gi|440299036|gb|ELP91648.1| hypothetical protein EIN_206450 [Entamoeba invadens IP1]
Length = 360
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 59/271 (21%), Positives = 111/271 (40%), Gaps = 17/271 (6%)
Query: 30 ETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQF 89
E+ S +R Y F D + +P + + + P GYP +V GY R+ F
Sbjct: 18 ESKGSWSRRYIFDEDYN---------QPPFLTTRNHLQDIIVPVGYPTTVERGYFRFVAF 68
Query: 90 RALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNL-GARMD 148
+ L V+S +A LR + + I ++ M LI + G+ +
Sbjct: 69 SVAMESCNMMLLVMS-----ISASLRIDSSHSAAFFVIFREVMAGTMHLIITERWGSSIV 123
Query: 149 SEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSF 208
K+WR+ + + +L +++ P +F + + + + + + + I S++
Sbjct: 124 FYAKQWRMRIEIISELLRLIQIAFIASP-IFYSINVTCSTIESVLNASRQVIKTKILSNY 182
Query: 209 AKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEE 268
AKE N+++L K + I TL V+ L G + + + +IV L + HV
Sbjct: 183 AKENNVAELTEKAQNIETLARVISLLIGFIVLHLLGDFTEHGIIVFMLALIAHVVCNFLM 242
Query: 269 MRAAPVNTLNPQRTAMIVADFVKT-GRISSP 298
R +N +R +++ F I SP
Sbjct: 243 GRVIVFKAINYERLMILMKYFTTVKNEILSP 273
>gi|215259607|gb|ACJ64295.1| conserved hypothetical protein [Culex tarsalis]
Length = 234
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 55/242 (22%), Positives = 110/242 (45%), Gaps = 20/242 (8%)
Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLL- 257
ATR + A GNL+D+ +K A T N++ G+ L + V +++ G L
Sbjct: 3 ATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLGLLTHV---QNQRVLYGLFLF 59
Query: 258 -SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
+++H+Y+ I+ ++A + T N R + + ++ K+G + SP + E + G+ +
Sbjct: 60 VTLLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGSMLSPQQVNKLERVTI-GQTVSL 118
Query: 317 AGNVKVGRDPHKVVKPSK-FHELKELFA----EEKFILCLGSRWTDMVLEQNASGEDALR 371
VK+G ++ + + ++L+ + A EKF++ + + L D ++
Sbjct: 119 TARVKMGCSARELAQYYRNCYDLENVIACFDSREKFLIAETRNYVGVYLHFTVKPLDIIK 178
Query: 372 GWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGT-RF 430
+ Y AS D S L+D + ++ + + L+ Q +GW+ L T R
Sbjct: 179 SYF---YVASY---LQDKS--QLRDRFWEIQNKWNEFLNLAQREGWNVQAHLLKTDEYRL 230
Query: 431 AW 432
W
Sbjct: 231 DW 232
>gi|215259609|gb|ACJ64296.1| conserved hypothetical protein [Culex tarsalis]
Length = 234
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 56/248 (22%), Positives = 108/248 (43%), Gaps = 32/248 (12%)
Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLL- 257
ATR + A GNL+D+ +K A T N++ G+ L + V +++ G L
Sbjct: 3 ATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLGLLTHV---QNHRVLYGLFLF 59
Query: 258 -SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
+++H+Y I+ ++A + T N R + + ++ K+G + SP + E + G+ +
Sbjct: 60 LTLLHIYPNIKAVQAVCLRTFNEARYLIALEEYFKSGSMLSPQQVNKLERVTI-GQTVSL 118
Query: 317 AGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLE-QNASGE-------- 367
VK+G ++ H + + E I C SR ++ E +N G
Sbjct: 119 TARVKMGCSARELA-----HYYRNCYDLENVIACFDSREKFLIAETRNYIGVYLHFTVKP 173
Query: 368 -DALRGWL-VAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDG 425
D ++ + VA+Y ++ L+D + ++ + + L+ Q +GW+ L
Sbjct: 174 LDIIKSYFYVASYLQDKSQ---------LRDRFWEIQNKWNEFLNLAQREGWNVQAHLLK 224
Query: 426 TGT-RFAW 432
T R W
Sbjct: 225 TDEYRLDW 232
>gi|422292653|gb|EKU19955.1| hypothetical protein NGA_2040500, partial [Nannochloropsis gaditana
CCMP526]
Length = 181
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 58/114 (50%)
Query: 195 VAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVG 254
+AA A+ I A+ NL+++ AKG A T ++ GLG G+ A S +
Sbjct: 2 IAAGASNGNIVEHMARTNNLAEVLAKGGAQHTAVSLFGLGFGMWFARVANQSPRRVWTAY 61
Query: 255 PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
LL+VIH+ + MR ++N +R +++ F + + SPA++ RE ++
Sbjct: 62 TLLTVIHLTANYAAMRVLAFTSINRRRLNVLLTAFRENASVLSPAEVARRETII 115
>gi|325088852|gb|EGC42162.1| DUF647 domain-containing protein [Ajellomyces capsulatus H88]
Length = 507
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 65/314 (20%), Positives = 129/314 (41%), Gaps = 55/314 (17%)
Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAG 184
+L+D M + ++ ++ LG ++ E K +R+ AD L D + LSP P+ L + +
Sbjct: 48 VLQDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSALIFDCLSPALPKRLRVVVLA 107
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDL----------------------FAKGE 222
+ + + VAA +++ + S FAK GNL +L F +
Sbjct: 108 FSSVLRALCGVAAGSSKASLSSHFAKCGNLGELNAVRFPNVLVIAISGAAQRGFPFLFSD 167
Query: 223 AISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLS--------VIHVYSVIEEMRAAPV 274
A L I L +C S+ I P + ++H+ + +R+ +
Sbjct: 168 ADQKLQKDSSQETVISLLGILCGSVVVSHISTPFATWTTLLLLLLVHLSTNYAAVRSVNM 227
Query: 275 NTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIE------DAGNVKVGRDPHK 328
TLN QRT ++ + + G + +P + E + ++ G ++G +
Sbjct: 228 TTLNRQRTNIVFSTLFEKGSVLTPTEASKCERIFERDGILRWKASSATLGYCQIGGSFQE 287
Query: 329 VVKPS------------KFHELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGWL 374
+++ S ++L LF +E+++L + +VL+ N + L+ W
Sbjct: 288 LLRGSIHGANSIRDVGIDIYKLLRLFEKEEYVLWFNPIHKKGTIVLKNNVTPISQLQAW- 346
Query: 375 VAAYAASMAKSFHD 388
++A +AK D
Sbjct: 347 --SHALRVAKRLTD 358
>gi|407850424|gb|EKG04824.1| hypothetical protein TCSYLVIO_004113 [Trypanosoma cruzi]
Length = 631
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 62/270 (22%), Positives = 112/270 (41%), Gaps = 29/270 (10%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
P GYP S G+ RY S+ S + QS+L L +P W++KD
Sbjct: 164 PRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLGSSPQL-----WMMKDL 218
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ + +N ++ PK W ++ L +L E+L P L PQ L A + + A+
Sbjct: 219 LPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLLLGAIMTSCAR 278
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAK----GEAISTLFNVVGLGAGIQLASTVCSS 246
A + +R FA NL++L K G I T+F G+ + ++V +S
Sbjct: 279 QSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMMIYTIFTAFGI-----MYTSVVTS 333
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG-----RISSPADL 301
+ ++ +++ + M LN ++++ D+V G I +P ++
Sbjct: 334 LVAQLATVLFCCALNLVLSYQSMSKIAFRILNATTLSVVLRDYVGRGGIHRRHIPTPQEV 393
Query: 302 RYREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
R L R++++ RDP+ +
Sbjct: 394 SDRIGL----RMLDEK-----ARDPNDCTR 414
>gi|74026006|ref|XP_829569.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70834955|gb|EAN80457.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 595
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 67/263 (25%), Positives = 105/263 (39%), Gaps = 21/263 (7%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKD 130
P+GYP S EG+ RY F S+ S + QS+L L +P W+LKD
Sbjct: 166 MPAGYPDSCGEGFRRYFLLSLCSSFVSSFASSIGYQSILNGFLLASSPQL-----WMLKD 220
Query: 131 GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFA 189
+ +N ++ PK W + + AL++L E++ P P L A L +
Sbjct: 221 LAPALAAAYLANRVISYENRPKFWFVFSVALHNLSVIAEMIIPTTVPNHLLAAAILTSCV 280
Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
+ A + TR FA NL++L K + + V GI S V S
Sbjct: 281 RQSASLMFLVTRASALQHFAISNNLAELTKKFNSFGMVIYTVSTALGIAYTSLVPSVTAQ 340
Query: 250 ----KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-----GRISSPAD 300
M G L++ H+ M LN ++I+ +++ GR+ SP +
Sbjct: 341 LATVLMCCGANLAISHL-----SMCNIAFRILNETTLSVILRFYMREEGHQRGRVLSPRE 395
Query: 301 LRYREDLLFPGRLIEDAGNVKVG 323
+ L R D GN ++G
Sbjct: 396 VSDAIGLRMIDRGRSD-GNDRIG 417
>gi|261335583|emb|CBH18577.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 595
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 67/263 (25%), Positives = 105/263 (39%), Gaps = 21/263 (7%)
Query: 71 FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKD 130
P+GYP S EG+ RY F S+ S + QS+L L +P W+LKD
Sbjct: 166 MPAGYPDSCGEGFRRYFLLSLCSSFVSSFASSIGYQSILNGFLLASSPQL-----WMLKD 220
Query: 131 GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFA 189
+ +N ++ PK W + + AL++L E++ P P L A L +
Sbjct: 221 LAPALAAAYLANRVISYENRPKFWFVFSVALHNLSVIAEMIIPTTVPNHLLAAAILTSCV 280
Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
+ A + TR FA NL++L K + + V GI S V S
Sbjct: 281 RQSASLMFLVTRASALQHFAISNNLAELTKKFNSFGMVIYTVSTALGIAYTSLVPSVTAQ 340
Query: 250 ----KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-----GRISSPAD 300
M G L++ H+ M LN ++I+ +++ GR+ SP +
Sbjct: 341 LATVLMCCGANLAISHL-----SMCNIAFRILNETTLSVILRFYMREEGHQRGRVLSPRE 395
Query: 301 LRYREDLLFPGRLIEDAGNVKVG 323
+ L R D GN ++G
Sbjct: 396 VSDAIGLRMIDRGRSD-GNDRIG 417
>gi|397621626|gb|EJK66381.1| hypothetical protein THAOC_12704 [Thalassiosira oceanica]
Length = 480
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 72/161 (44%), Gaps = 16/161 (9%)
Query: 66 FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA--TV 123
F + F P+G+P + GYL Y +Q S V+ST + L AAG+ + A A
Sbjct: 57 FGDLFLPAGHPSTTAGGYLHYQVCDGVQGLCSYLRGVVSTGAALAAAGVGDSDATAMGAA 116
Query: 124 VSWILKDGMQHVGKLICSNLG-ARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
V+W ++DG+ +G L S ++ DS K +R+LAD D G L++ P
Sbjct: 117 VTWAVRDGLGMLGGLAFSYYASSKFDSRVKEYRLLADVANDAGMILDMARP--------- 167
Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
+ A+G V A P A +G L D+ E
Sbjct: 168 ----SDARGGLVAATPGVPRPRLGQRALQGRLRDVRGSNEG 204
>gi|407411065|gb|EKF33281.1| hypothetical protein MOQ_002854, partial [Trypanosoma cruzi
marinkellei]
Length = 636
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 96/229 (41%), Gaps = 15/229 (6%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
P GYP S G+ RY S+ S + QS+L L +P W++KD
Sbjct: 170 PRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLGSSPQL-----WMMKDL 224
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ + +N ++ PK W ++ L +L E+L P L PQ L A + + A+
Sbjct: 225 LPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLLLGAIMTSCAR 284
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAK----GEAISTLFNVVGLGAGIQLASTVCSS 246
A + +R FA NL++L K G I T+F G+ + ++V +S
Sbjct: 285 QSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMVIYTIFTAFGI-----MYTSVVTS 339
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ ++ +++ + M LN ++++ D+V G I
Sbjct: 340 LVAQLATVLFCCALNLVLSYQSMSKIAFRILNVTTLSVVLRDYVGRGGI 388
>gi|71661222|ref|XP_817635.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70882839|gb|EAN95784.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 630
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 96/229 (41%), Gaps = 15/229 (6%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
P GYP S G+ RY S+ S + QS+L L +P W++KD
Sbjct: 164 PRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLGSSPQL-----WMMKDL 218
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ + +N ++ PK W ++ L +L E+L P L PQ L A + + A+
Sbjct: 219 LPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLLLGAIMTSCAR 278
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAK----GEAISTLFNVVGLGAGIQLASTVCSS 246
A + +R FA NL++L K G I T+F G+ + ++V +S
Sbjct: 279 QSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMVIYTIFTAFGI-----MYTSVVTS 333
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ ++ +++ + M LN ++++ D+V G I
Sbjct: 334 LVAQLATVLFCCALNLVLSYQSMSKIAFRILNVTTLSVVLRDYVGRGGI 382
>gi|71414333|ref|XP_809272.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70873629|gb|EAN87421.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 630
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 96/229 (41%), Gaps = 15/229 (6%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
P GYP S G+ RY S+ S + QS+L L +P W++KD
Sbjct: 164 PRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLGSSPQL-----WMMKDL 218
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ + +N ++ PK W ++ L +L E+L P L PQ L A + + A+
Sbjct: 219 LPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLLLGAIMTSCAR 278
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAK----GEAISTLFNVVGLGAGIQLASTVCSS 246
A + +R FA NL++L K G I T+F G+ + ++V +S
Sbjct: 279 QSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMVIYTIFTAFGI-----MYTSVVTS 333
Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
+ ++ +++ + M LN ++++ D+V G I
Sbjct: 334 LVAQLATVLFCCALNLVLSYQSMSKIAFRILNVTTLSVVLRDYVGRGGI 382
>gi|387203446|gb|AFJ68984.1| hypothetical protein NGATSA_2040500, partial [Nannochloropsis
gaditana CCMP526]
Length = 181
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 57/114 (50%)
Query: 195 VAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVG 254
+AA A+ I A+ NL+++ AKG A T ++ GLG + A S +
Sbjct: 2 IAAGASNGNIVEHMARTNNLAEVLAKGGAQHTAVSLFGLGFRMWFARVANQSPRRVWTAY 61
Query: 255 PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
LL+VIH+ + MR ++N +R +++ F + + SPA++ RE ++
Sbjct: 62 TLLTVIHLTANYAAMRVLAFTSINRRRLNVLLTAFRENASVLSPAEVARRETII 115
>gi|56752623|gb|AAW24525.1| SJCHGC08554 protein [Schistosoma japonicum]
Length = 127
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 41/72 (56%), Gaps = 2/72 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
F P GYP SV++ YL Y + +Q F S+ L++Q++L G+ + A ++W+
Sbjct: 45 FLPVGYPSSVSDDYLEYQIWDTIQAFASSITGALASQAVLIGVGVGNSSATILGASLTWM 104
Query: 128 LKDGMQHVGKLI 139
KDG +G+++
Sbjct: 105 FKDGSGMIGRIL 116
>gi|242208074|ref|XP_002469889.1| predicted protein [Postia placenta Mad-698-R]
gi|220731120|gb|EED84968.1| predicted protein [Postia placenta Mad-698-R]
Length = 214
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 50/215 (23%), Positives = 101/215 (46%), Gaps = 10/215 (4%)
Query: 84 LRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQAT--VVSWILKDGMQHVGKLI 139
+RY + ALQ F S+ +++++++L F G+ A AT + +L+D + ++
Sbjct: 1 VRYQIYNALQAFCSSLAGLIASRAVLQGFRHGVGKAGASATHAIFLTVLQDIFSRLTTIV 60
Query: 140 CS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAAR 198
LG + E K +R+ AD D L+ LSP + L + + V A
Sbjct: 61 AGYYLGTSLFPEAKAYRLTADIFNDAAIILDTLSPHLGSFRVASLCLSGAFRALCGVVAG 120
Query: 199 ATRLPIYSSFAKE----GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVG 254
++ + FA G++ DL AK + T+ ++G+ +G + V S+ +++
Sbjct: 121 GSKAALTVHFATTAKGAGDVGDLSAKDGSKETVLALLGMLSGTAVMHYVHSTRATYVVLF 180
Query: 255 PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
L++ +H++ +R +LN QR +++ A +
Sbjct: 181 ALIA-LHLFCNYLAVRVVVFRSLNRQRASIVWAAY 214
>gi|134084659|emb|CAK43337.1| unnamed protein product [Aspergillus niger]
Length = 484
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 90/206 (43%), Gaps = 26/206 (12%)
Query: 62 VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH-FTSAALSVL---------STQSLLFA 111
V D + F P+GYP SV + Y Q R L H S +L L S L
Sbjct: 68 VQDFLVEVFLPAGYPHSVTDDYAPPYQTRRLIHEGLSPSLQQLDCWPPILESSAARLPLG 127
Query: 112 AGLRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGL 168
G+ A T ++ IL+D + ++ ++ +G ++ E K +R+ AD DL L
Sbjct: 128 VGVGNADASPTAALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMIL 187
Query: 169 EVLSPLCPQ-----LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNL-----SDLF 218
+ LSP+ P L AG+ G+A +++A+ + + F++ GNL +D
Sbjct: 188 DCLSPMIPAGAPRVTVLSTAGVLRALCGVAGGSSKAS---LSAHFSRWGNLAEAAITDFL 244
Query: 219 AKGEAISTLFNVVGLGAGIQLASTVC 244
+ +S ++ AG S +C
Sbjct: 245 STERFVSRDNHITHRNAGPAPQSQLC 270
>gi|332372572|gb|AEE61428.1| unknown [Dendroctonus ponderosae]
Length = 328
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 43/191 (22%), Positives = 89/191 (46%), Gaps = 9/191 (4%)
Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIV 253
V ATR I A N++++ AK T+ N++G I L + SS +++
Sbjct: 37 VCYCGATRASITHHQAIRDNMAEISAKDGTQETIVNLIGSFGSIFLLNYFSSSRAEWILI 96
Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL--FPG 311
L+ V+H+Y+ ++A NT N +R A+++ + G + +PA + E +L F
Sbjct: 97 ISLM-VLHLYTNFLAVKALIFNTFNRERLALVLKSYFTIGSVLNPAKVNVNESVLLGFGL 155
Query: 312 RLIEDAG-NVKVGRDPHKVVK---PSKFHELKELFAEEKFILCLG--SRWTDMVLEQNAS 365
+ G ++ +G K + + L+ ++ ++K++L + + + LE+ +
Sbjct: 156 NVKRMCGFDIVLGESIKKSLNDYAAADIKLLQSVYKDKKYMLLVNGTKKKIYVCLEKGET 215
Query: 366 GEDALRGWLVA 376
D + G+ A
Sbjct: 216 AADVIAGYFHA 226
>gi|225560567|gb|EEH08848.1| DUF647 domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 472
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 64/316 (20%), Positives = 126/316 (39%), Gaps = 58/316 (18%)
Query: 138 LICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVA 196
L LG ++ E K +R+ AD L D + LSP P+ L + + + + + VA
Sbjct: 8 LFAHRLGTSLEPECKMYRLAADVLNDSALIFDCLSPALPKRLRVVVLAFSSVLRALCGVA 67
Query: 197 ARATRLPIYSSFAKEGNLSDL----------------------FAKGEAISTLFNVVGLG 234
A +++ + S FAK GNL +L F +A L
Sbjct: 68 AGSSKASLSSHFAKWGNLGELNAVRFPNVLVIAISGAAQRGFPFLFSDADQKLQKDSSQE 127
Query: 235 AGIQLASTVCSSMQGKMIVGPLLS--------VIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
I L +C S+ I P + ++H+ + +R+ + TLN QR ++
Sbjct: 128 TVISLLGMLCGSVVVSHISTPFATWTTLLLLLLVHLSTNYAAVRSVNMTTLNRQRANIVF 187
Query: 287 ADFVKTGRISSPADLRYREDLLFPGRLIE------DAGNVKVGRDPHKVVKPSK------ 334
+ + G + +P + E + ++ G ++G ++++ S
Sbjct: 188 STLFEKGSVLTPTEASKCERIFERDGILRWKASPAALGYCQIGGSFQELLRGSTHGANSI 247
Query: 335 ------FHELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAAYAASMAKSF 386
++L LF +E+++L + +VL+ N + L+ W ++A +AK
Sbjct: 248 RDVGIDIYKLLRLFEKEEYVLWFNPIHKKGTIVLKNNVTPISQLKAW---SHALIVAKRL 304
Query: 387 HDPSLTVLQDAYDKMN 402
D + ++ D++N
Sbjct: 305 TD----LRREEADRIN 316
>gi|194383144|dbj|BAG59128.1| unnamed protein product [Homo sapiens]
Length = 156
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 4/107 (3%)
Query: 35 VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
++R + +P+G + +V P + F P G+P SV+ YL Y + ++Q
Sbjct: 44 LSRAFTVKPEGRDAGEVGASGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101
Query: 95 FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLI 139
F S+ L+TQ++L G+ A A +W++KD +G+++
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIV 148
>gi|302417830|ref|XP_003006746.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
gi|261354348|gb|EEY16776.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
Length = 228
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 36/153 (23%), Positives = 71/153 (46%), Gaps = 13/153 (8%)
Query: 68 NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVS 125
+ F P YP SV+ Y++Y + + Q F S +L+ +++L G+ ++
Sbjct: 79 DAFLPVDYPDSVSPDYIKYQMYDSFQAFFSTITGMLANRAILEGLGVGDASGSVNHALLL 138
Query: 126 WILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
LKDG+ + + + G+ +++ K++R LAD D LE+ +P
Sbjct: 139 TCLKDGVSRLAMITFAYRFGSVIEANAKKYRFLADLFNDTAFFLELANPF---------- 188
Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDL 217
+AK + +V+A + R + A++ LS +
Sbjct: 189 FSGWAKILILVSAESLRALCGIALAQQAALSTI 221
>gi|146096657|ref|XP_001467882.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398020910|ref|XP_003863618.1| hypothetical protein, conserved [Leishmania donovani]
gi|134072248|emb|CAM70952.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322501851|emb|CBZ36933.1| hypothetical protein, conserved [Leishmania donovani]
Length = 468
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 57/248 (22%), Positives = 100/248 (40%), Gaps = 10/248 (4%)
Query: 58 VYHRVVDSFLNKFF--PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
V HR S + F P G+P + G+ R+ F S S + QSLL L
Sbjct: 20 VQHRETVSNRIRLFGMPKGFPDTTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLG 79
Query: 116 PTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-L 174
+P W+LKD + + +N +S PK W +++ + +L ++L P +
Sbjct: 80 SSPQL-----WMLKDLVPALFAAYMANKVVSYESRPKFWFVVSVFINNLTVISDMLIPSV 134
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
P L A + + K + + TR +A NL++L K + + V
Sbjct: 135 LPNHLLAAAIITSTVKQSSALMYFVTRASALQHYAINNNLAELTKKFNSFGMVSYTVATA 194
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV-KTG 293
GI L T +S ++I V +++ M LN +++ F+ +
Sbjct: 195 LGI-LYCTYIASFTVQLITVAACCVTNMFLSSMSMMPITFRLLNFDTMKLLLRAFILEKR 253
Query: 294 RISSPADL 301
RI +P ++
Sbjct: 254 RILTPQEI 261
>gi|401427109|ref|XP_003878038.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494285|emb|CBZ29584.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 468
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 101/248 (40%), Gaps = 10/248 (4%)
Query: 58 VYHRVVDSFLNKFF--PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
V HR S + F P G+P + G+ R+ F S S + QSLL L
Sbjct: 20 VQHRETVSNRIRLFGMPKGFPETTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLG 79
Query: 116 PTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-L 174
+P W+LKD + + +N ++ PK W +++ + +L ++L P +
Sbjct: 80 SSPQL-----WMLKDLVPALFAAYMANKVVSYENRPKFWFVVSVFMSNLMVISDMLIPSV 134
Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
P L A + + K + + TR +A NL++L K + + V
Sbjct: 135 LPNHLLAAAIITSTVKQSSALMYFVTRASALQHYAINNNLAELTKKFNSFGMVSYTVATA 194
Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV-KTG 293
GI L T +S ++I V++++ M LN +++ F+ +
Sbjct: 195 LGI-LYCTYIASFTVQLITVAACCVMNMFLSSMSMMPITFRLLNFDTMKLLLRAFILEKR 253
Query: 294 RISSPADL 301
R+ +P ++
Sbjct: 254 RVMTPQEI 261
>gi|157874033|ref|XP_001685512.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68128584|emb|CAJ08716.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 468
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 54/250 (21%), Positives = 102/250 (40%), Gaps = 9/250 (3%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
P G+P + G+ R+ F S S + QSLL L +P W+LKD
Sbjct: 36 PKGFPDTTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLGSSPQL-----WMLKDL 90
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ + +N +S PK W +++ + +L ++L P + P L A + + K
Sbjct: 91 VPALFAAYMANKVVSYESRPKFWFVVSVFVNNLTVISDMLIPSVLPNHLLAAAIITSTVK 150
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
+ + TR +A NL++L K + + V GI L T +S +
Sbjct: 151 QSSALMYFVTRASALQHYAINNNLAELTKKFNSFGMVSYTVATALGI-LYCTYIASFTVQ 209
Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV-KTGRISSPADLRYREDL-L 308
+I V +++ M LN +++ F+ + R+ +P ++ + +
Sbjct: 210 LITVAACCVTNMFLSSMSMMPITFRLLNVDTMKLLLGAFILEERRVMTPQEISTLLGVRM 269
Query: 309 FPGRLIEDAG 318
P + ++ AG
Sbjct: 270 LPTQALKKAG 279
>gi|156088815|ref|XP_001611814.1| hypothetical protein [Babesia bovis T2Bo]
gi|154799068|gb|EDO08246.1| hypothetical protein BBOV_III006850 [Babesia bovis]
Length = 540
Score = 45.4 bits (106), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 88/193 (45%), Gaps = 19/193 (9%)
Query: 108 LLFAAGLRPTPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYD--- 163
L +A + + A T+V LKD + + +GA D PK +R+L L
Sbjct: 42 LTWATSGKVSSALTTIV---LKDIFSRILNFVWFGQIGAGFDDNPKAFRLLGGLLCSAAG 98
Query: 164 LGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG---NLSDLFAK 220
+ + + L P++ L + L N K + V A++ +S+F + N+ ++ AK
Sbjct: 99 VACFIGNMMQLEPKVVLTV--LVNVIKHVGAVTMVASQATFHSTFCVKNAATNVGEITAK 156
Query: 221 GEAISTLFNVVGLGAGIQLA---STVCSSMQGKMIVGPLL-SVIHVYSVIEEMRAAPVNT 276
E S + + +G+ AG L + + +SMQ + G L S I Y ++ T
Sbjct: 157 LETQSPICDFLGMAAGTSLGALTANMAASMQTVLFTGMCLTSNIATYMC---AKSVCFRT 213
Query: 277 LNPQRTAMIVADF 289
LNPQR +++ +F
Sbjct: 214 LNPQRCFVLLENF 226
>gi|399218598|emb|CCF75485.1| unnamed protein product [Babesia microti strain RI]
Length = 635
Score = 45.4 bits (106), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 44/197 (22%), Positives = 81/197 (41%), Gaps = 25/197 (12%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
++ FLN +PSG V Y +Y +R L+ F+ + +S L L +
Sbjct: 106 KIKKRFLNIVYPSG----VTSNYYKYANWRLLERFSHSMSQSISNNPLAAKILLNQSTYS 161
Query: 121 --------ATVVSWILKDGMQHVGKLICS---------NLGARMDSEPKRWRILADALYD 163
A + +I + M + K I S N+G D P+ + +
Sbjct: 162 TNDGVINIAYKIDFIATNLMTQIFKEIVSRIVKTMWVGNIGIGFDMNPRAFCFWGSLICT 221
Query: 164 LGTGLEVLSPLCPQLFLEMA-GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSD---LFA 219
+ + LS L ++ + + + I ++ A+ PIY SF + N+S+ + A
Sbjct: 222 ITNTADFLSTLFNFKYISITRHITTLFRQIGMLTTSASIGPIYGSFKSQNNVSNIGEITA 281
Query: 220 KGEAISTLFNVVGLGAG 236
K EA++ L +++G G
Sbjct: 282 KMEAVTPLCDLLGSATG 298
>gi|425769885|gb|EKV08364.1| hypothetical protein PDIP_68690 [Penicillium digitatum Pd1]
gi|425771531|gb|EKV09972.1| hypothetical protein PDIG_59240 [Penicillium digitatum PHI26]
Length = 406
Score = 44.7 bits (104), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 58/114 (50%), Gaps = 10/114 (8%)
Query: 139 IC--SNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-----LFLEMAGLGNFAKG 191
IC +G ++ E K +R+ AD D+ L+ LSP P + L AG+ +
Sbjct: 34 ICFAHRVGTALEPECKTYRLAADVFNDIAMILDCLSPGVPAGPARVIVLSTAGV---LRA 90
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
+ VA +++ + + FAK GNL++L AK + T+ ++ G+ G + S + S
Sbjct: 91 LCGVAGGSSKASLSAHFAKWGNLAELNAKDSSQETVISLFGMLVGSVVISHITS 144
>gi|397575094|gb|EJK49531.1| hypothetical protein THAOC_31586, partial [Thalassiosira oceanica]
Length = 194
Score = 44.7 bits (104), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 40/77 (51%), Gaps = 2/77 (2%)
Query: 65 SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QAT 122
S ++ F P+ YP SV Y + + + +A VLSTQ+LL A G+ A A
Sbjct: 113 SIVHHFLPAKYPESVCSSYSTFASYCMCANIAGSAAMVLSTQALLVAVGVGSQSAAPMAA 172
Query: 123 VVSWILKDGMQHVGKLI 139
++W+LKD +G ++
Sbjct: 173 ALNWLLKDFAGQLGGVL 189
>gi|151555803|gb|AAI49327.1| FLJ13868 protein [Bos taurus]
Length = 158
Score = 44.7 bits (104), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 34/132 (25%), Positives = 60/132 (45%), Gaps = 12/132 (9%)
Query: 12 KKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVV--DDSRPVYHRVVDSFLNK 69
+ PD S+ + W+ S + T +P G V + P + R+ L
Sbjct: 29 RAAPDGSLQWEAWGRRWWGFSGAFTA----KPGGRDGGGGVAPGTASPPFSRL----LAV 80
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
F P G+P SV+ YL Y + ++Q F S+ L+T ++L G+ A A +W+
Sbjct: 81 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWL 140
Query: 128 LKDGMQHVGKLI 139
+KD +G+++
Sbjct: 141 VKDSTGMLGRIV 152
>gi|159467581|ref|XP_001691970.1| hypothetical protein CHLREDRAFT_170864 [Chlamydomonas reinhardtii]
gi|158278697|gb|EDP04460.1| predicted protein [Chlamydomonas reinhardtii]
Length = 131
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 35/62 (56%), Gaps = 1/62 (1%)
Query: 136 GKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAV 194
G L G+ D+ K+WR+ AD DLG +E+ SPL P+ LFL A G+ A+ + V
Sbjct: 7 GILFAYAEGSSFDACAKQWRLFADIANDLGMTVELASPLLPRALFLPCACFGSIARSVTV 66
Query: 195 VA 196
A
Sbjct: 67 FA 68
>gi|321459665|gb|EFX70716.1| hypothetical protein DAPPUDRAFT_327911 [Daphnia pulex]
Length = 269
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 72/142 (50%), Gaps = 29/142 (20%)
Query: 89 FRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDG-MQHVGKLI---CS 141
A+Q F + LST+++L G+ TP AT++ W++KDG + VG+++ C+
Sbjct: 9 IHAIQAFVGSLSGSLSTRAILEGVGVGSSTSTPLAATLM-WLIKDGSTRMVGRILFAWCN 67
Query: 142 NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATR 201
G ++D++ G+ + S LC +AG+ AK I VA ATR
Sbjct: 68 --GTKLDADTAVG----------GSNGTMTSILC------VAGV---AKSIVGVAGGATR 106
Query: 202 LPIYSSFAKEGNLSDLFAKGEA 223
+ A++GNL+D+ AK EA
Sbjct: 107 AALTQHQARQGNLADVSAKVEA 128
>gi|440799746|gb|ELR20790.1| protein C16orf58 family protein [Acanthamoeba castellanii str.
Neff]
Length = 101
Score = 42.7 bits (99), Expect = 0.33, Method: Composition-based stats.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 2/71 (2%)
Query: 70 FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
P GYP SV+ Y Y + +Q S+ +L T++LL G+ A A V W
Sbjct: 21 LLPEGYPESVSGDYATYQTWDTVQALCSSITGLLCTRALLQGVGVGDVGATPGAAAVQWA 80
Query: 128 LKDGMQHVGKL 138
L+DG VG++
Sbjct: 81 LRDGAGMVGRI 91
>gi|3242710|gb|AAC23762.1| hypothetical protein [Arabidopsis thaliana]
Length = 405
Score = 42.7 bits (99), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 5/74 (6%)
Query: 61 RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP-- 118
R+ S FF P V++ Y+ Y +++ L S+AL VL+TQ++ A G+ +
Sbjct: 113 RLTSSAAKDFF---LPKQVSDNYISYVKWKFLHRVFSSALQVLATQAMFRAIGIGQSRSL 169
Query: 119 AQATVVSWILKDGM 132
A + +WILKDG+
Sbjct: 170 ASSAAFNWILKDGL 183
>gi|12654037|gb|AAH00822.1| C16orf58 protein [Homo sapiens]
Length = 231
Score = 42.0 bits (97), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 82/186 (44%), Gaps = 26/186 (13%)
Query: 257 LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
L+ +H+Y+ +RA + TLN R +++ +++ G + P R + L+ G
Sbjct: 30 LTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RMEPLWTG--FWP 86
Query: 317 AGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR- 371
A ++ +G H++V S EL++L +E ++LC +VL Q A + LR
Sbjct: 87 APSLSLGVPLHRLV--SSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQKAGPKTILRA 144
Query: 372 ---GWLVAAYAAS-------------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAK 415
G ++ A + S V+++ ++ ++ +F L LQ
Sbjct: 145 ATHGLMLGALQGDGPLPAELEELRNRVRAGPKKESWVVVKETHEVLDMLFPKFLKGLQDA 204
Query: 416 GWHTDR 421
GW T++
Sbjct: 205 GWKTEK 210
>gi|224044670|ref|XP_002188933.1| PREDICTED: condensin-2 complex subunit G2 [Taeniopygia guttata]
Length = 1150
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 24/43 (55%)
Query: 37 RHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSV 79
+ + P HL ++ DSRPV R+V+ LN FFPS P V
Sbjct: 494 KFWNISPMEHLLSRLESDSRPVSRRIVNLLLNSFFPSAQPEDV 536
>gi|268318089|ref|YP_003291808.1| polynucleotide adenylyltransferase/metal dependent phosphohydrolase
[Rhodothermus marinus DSM 4252]
gi|262335623|gb|ACY49420.1| polynucleotide adenylyltransferase/metal dependent phosphohydrolase
[Rhodothermus marinus DSM 4252]
Length = 561
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 50/115 (43%), Gaps = 22/115 (19%)
Query: 202 LPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIH 261
+P+Y A G + DLF F VG G GI+LA V ++ G+ +H
Sbjct: 46 IPVY---AVGGVVRDLFLDRPTTDIDFVTVGPGTGIRLARLVARALGGR--------TVH 94
Query: 262 VYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
VY E A + P R+ + V +FV R S YR+D P ++ED
Sbjct: 95 VY---ENFGTAAIRVPTPDRSGVFVLEFVAARRES------YRKDSRKP--IVED 138
>gi|345304379|ref|YP_004826281.1| polynucleotide adenylyltransferase/metal dependent phosphohydrolase
[Rhodothermus marinus SG0.5JP17-172]
gi|345113612|gb|AEN74444.1| polynucleotide adenylyltransferase/metal dependent phosphohydrolase
[Rhodothermus marinus SG0.5JP17-172]
Length = 552
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 50/115 (43%), Gaps = 22/115 (19%)
Query: 202 LPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIH 261
+P+Y A G + DLF F VG G GI+LA V ++ G+ +H
Sbjct: 37 IPVY---AVGGVVRDLFLDRPTTDIDFVTVGPGTGIRLARLVARALGGR--------TVH 85
Query: 262 VYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
VY E A + P R+ + V +FV R S YR+D P ++ED
Sbjct: 86 VY---ENFGTAAIRVPTPDRSGVFVLEFVAARRES------YRKDSRKP--IVED 129
>gi|449682081|ref|XP_002168725.2| PREDICTED: UPF0420 protein C16orf58 homolog [Hydra magnipapillata]
Length = 237
Score = 39.7 bits (91), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 25/111 (22%), Positives = 53/111 (47%)
Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
I VA ATR + A+ N++D+ AK + TL N++ L G + V ++
Sbjct: 7 IVGVAGGATRAALTLHQARRNNMADVSAKDGSQETLVNLMALFTGFLITPLVVDNLFLTW 66
Query: 252 IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
+ + +H+Y+ + + + T+N R ++V + G++ +P ++
Sbjct: 67 YFFLVFTFLHLYANYQAVSTVVMETINLPRLHILVKHYFIYGKVLTPKEVN 117
>gi|402580829|gb|EJW74778.1| hypothetical protein WUBG_14315 [Wuchereria bancrofti]
Length = 191
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 54/115 (46%)
Query: 195 VAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVG 254
VA ATR + + A N+ D+ AK + TL NV L + L V + ++
Sbjct: 8 VAGSATRTTVINHQAILDNVGDVAAKDGSQETLINVFALLCSLLLLPVVSKNAVFVWLLF 67
Query: 255 PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF 309
L + IH+Y ++ TLN ++V D+++T +I + ++ +E +L
Sbjct: 68 CLFTFIHLYGNYRAVKMLQFRTLNQSLLRIVVKDYIQTRKIGTVNEMNNKEPILL 122
>gi|297602390|ref|NP_001052390.2| Os04g0290800 [Oryza sativa Japonica Group]
gi|255675293|dbj|BAF14304.2| Os04g0290800, partial [Oryza sativa Japonica Group]
Length = 107
Score = 39.3 bits (90), Expect = 4.1, Method: Composition-based stats.
Identities = 20/71 (28%), Positives = 41/71 (57%)
Query: 217 LFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNT 276
+ AKGEA + +G+ GI LA+ + SS+ +I ++ +H+Y ++ ++ + T
Sbjct: 30 VIAKGEAQGMVSKFLGIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRT 89
Query: 277 LNPQRTAMIVA 287
LNP R + +++
Sbjct: 90 LNPYRASNVLS 100
>gi|154343241|ref|XP_001567566.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134064898|emb|CAM43006.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 439
Score = 38.9 bits (89), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 39/179 (21%), Positives = 72/179 (40%), Gaps = 7/179 (3%)
Query: 72 PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
P G+P + G+ R+ F S S + QSLL L +P W+LKD
Sbjct: 7 PKGFPDTTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLGSSPQL-----WMLKDL 61
Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
+ +N ++ PK W +++ + ++ ++L P + P L A + + K
Sbjct: 62 APALLAAYMANQVVSYENRPKFWFVVSVLMNNIMVMSDMLIPSVVPNHLLAAAIVTSTVK 121
Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS-SMQ 248
+ + R +A NL++L K + + + GI + + S +MQ
Sbjct: 122 QSSALMYFVARASTLQHYAINNNLAELTKKFNSFGMVSYTIATALGILYCTYIASFTMQ 180
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.135 0.403
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,715,635,325
Number of Sequences: 23463169
Number of extensions: 280363585
Number of successful extensions: 661667
Number of sequences better than 100.0: 503
Number of HSP's better than 100.0 without gapping: 387
Number of HSP's successfully gapped in prelim test: 116
Number of HSP's that attempted gapping in prelim test: 660305
Number of HSP's gapped (non-prelim): 582
length of query: 432
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 287
effective length of database: 8,957,035,862
effective search space: 2570669292394
effective search space used: 2570669292394
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)