BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 013984
         (432 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224082810|ref|XP_002306848.1| predicted protein [Populus trichocarpa]
 gi|222856297|gb|EEE93844.1| predicted protein [Populus trichocarpa]
          Length = 428

 Score =  754 bits (1947), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/432 (83%), Positives = 393/432 (90%), Gaps = 4/432 (0%)

Query: 1   MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
           MNLL+KIK QKK EPDK+    EIP+YW ETSDSV+RH+QF+PDG LS+KVVDD+RPVY 
Sbjct: 1   MNLLDKIKMQKK-EPDKT---PEIPVYWIETSDSVSRHFQFEPDGQLSMKVVDDARPVYR 56

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
           RVV+SFLNKFFPSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQ
Sbjct: 57  RVVESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQ 116

Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
           AT VSWILKDGMQH GKLICSNLGARMDSEPKRWRILAD LYDLGTGLEVLSPLCP LFL
Sbjct: 117 ATAVSWILKDGMQHAGKLICSNLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCPHLFL 176

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
           E+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+GLG GIQLA
Sbjct: 177 EVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVLGLGVGIQLA 236

Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
           STVCSSMQGK + GPLLS++HV  VIEEMRA PVNTLNPQRTAM+VADFVKTG+ISSPAD
Sbjct: 237 STVCSSMQGKFVAGPLLSIVHVCCVIEEMRATPVNTLNPQRTAMVVADFVKTGKISSPAD 296

Query: 301 LRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVL 360
           LRY EDLLFPGRLIE+AGNVKVG+  H+ V+PSK  ELKE+F  EKFIL  G++WTD+VL
Sbjct: 297 LRYHEDLLFPGRLIENAGNVKVGQALHRAVRPSKLRELKEIFPGEKFILSPGNKWTDLVL 356

Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
           EQNASGEDALR WLVAAYA+SM KS H+ +   LQDAY+KMN VF P LSELQAKGWHTD
Sbjct: 357 EQNASGEDALRAWLVAAYASSMKKSSHESTSVTLQDAYEKMNSVFDPFLSELQAKGWHTD 416

Query: 421 RFLDGTGTRFAW 432
           RFLDGTG+RF+W
Sbjct: 417 RFLDGTGSRFSW 428


>gi|297744143|emb|CBI37113.3| unnamed protein product [Vitis vinifera]
          Length = 424

 Score =  747 bits (1929), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/426 (83%), Positives = 393/426 (92%), Gaps = 2/426 (0%)

Query: 7   IKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSF 66
           +KK+ KKE +++  V +  ++W ETSDSV+RHYQF  +G LS+KVVDDSR V HRVV+SF
Sbjct: 1   MKKEMKKEQERA--VAQGAVFWIETSDSVSRHYQFHSNGDLSMKVVDDSRTVVHRVVESF 58

Query: 67  LNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSW 126
           LNKFFPSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQAT VSW
Sbjct: 59  LNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQATAVSW 118

Query: 127 ILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +LKDGMQHVGKLICSNLGARMDSEPK WRILADALYDLGTGLEVLSPLCP LFLEMAGLG
Sbjct: 119 VLKDGMQHVGKLICSNLGARMDSEPKLWRILADALYDLGTGLEVLSPLCPHLFLEMAGLG 178

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           NFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG+GAGIQLAST+CSS
Sbjct: 179 NFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGMGAGIQLASTICSS 238

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
           MQGKMI GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF+KTG+ISSPADLRY+ED
Sbjct: 239 MQGKMIAGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFIKTGKISSPADLRYQED 298

Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASG 366
           L+F GRLIEDAGNVKVGRD HKVVKPSK  +LK +F EEKF+LC G +WTDMVLE +A+G
Sbjct: 299 LIFTGRLIEDAGNVKVGRDLHKVVKPSKLCKLKGVFPEEKFLLCHGKKWTDMVLEHDATG 358

Query: 367 EDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGT 426
           EDALRGWLVAAY A++ K+ H+PS+TVLQ+AYDKMN VF+  +S+LQAKGWHTD FLDGT
Sbjct: 359 EDALRGWLVAAYVANLEKTDHEPSMTVLQEAYDKMNSVFSTFVSKLQAKGWHTDCFLDGT 418

Query: 427 GTRFAW 432
           G+RFAW
Sbjct: 419 GSRFAW 424


>gi|225438149|ref|XP_002273202.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
          Length = 420

 Score =  742 bits (1916), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/421 (83%), Positives = 389/421 (92%), Gaps = 2/421 (0%)

Query: 12  KKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFF 71
           KKE +++  V +  ++W ETSDSV+RHYQF  +G LS+KVVDDSR V HRVV+SFLNKFF
Sbjct: 2   KKEQERA--VAQGAVFWIETSDSVSRHYQFHSNGDLSMKVVDDSRTVVHRVVESFLNKFF 59

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
           PSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQAT VSW+LKDG
Sbjct: 60  PSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQATAVSWVLKDG 119

Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
           MQHVGKLICSNLGARMDSEPK WRILADALYDLGTGLEVLSPLCP LFLEMAGLGNFAKG
Sbjct: 120 MQHVGKLICSNLGARMDSEPKLWRILADALYDLGTGLEVLSPLCPHLFLEMAGLGNFAKG 179

Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
           +AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG+GAGIQLAST+CSSMQGKM
Sbjct: 180 MAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGMGAGIQLASTICSSMQGKM 239

Query: 252 IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPG 311
           I GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF+KTG+ISSPADLRY+EDL+F G
Sbjct: 240 IAGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFIKTGKISSPADLRYQEDLIFTG 299

Query: 312 RLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
           RLIEDAGNVKVGRD HKVVKPSK  +LK +F EEKF+LC G +WTDMVLE +A+GEDALR
Sbjct: 300 RLIEDAGNVKVGRDLHKVVKPSKLCKLKGVFPEEKFLLCHGKKWTDMVLEHDATGEDALR 359

Query: 372 GWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTRFA 431
           GWLVAAY A++ K+ H+PS+TVLQ+AYDKMN VF+  +S+LQAKGWHTD FLDGTG+RFA
Sbjct: 360 GWLVAAYVANLEKTDHEPSMTVLQEAYDKMNSVFSTFVSKLQAKGWHTDCFLDGTGSRFA 419

Query: 432 W 432
           W
Sbjct: 420 W 420


>gi|255567274|ref|XP_002524618.1| conserved hypothetical protein [Ricinus communis]
 gi|223536171|gb|EEF37826.1| conserved hypothetical protein [Ricinus communis]
          Length = 436

 Score =  740 bits (1911), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/437 (81%), Positives = 391/437 (89%), Gaps = 6/437 (1%)

Query: 1   MNLLEKIKKQKKKEPDKSMSVQEIP-----IYWFETSDSVTRHYQFQPDGHLSVKVVDDS 55
           MNLL  IK QKK EPDK+  +   P     ++W ETSDSV+RH+QFQPDG +S+KV+DDS
Sbjct: 1   MNLLNTIKMQKK-EPDKAPLLPPPPLPELPVHWIETSDSVSRHFQFQPDGRISMKVIDDS 59

Query: 56  RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
           R ++HRVV+SF NKFFPSGYP+SVNEGYLRYTQFRALQH +SAALSVLSTQSLLFAAGLR
Sbjct: 60  RSMFHRVVESFQNKFFPSGYPYSVNEGYLRYTQFRALQHLSSAALSVLSTQSLLFAAGLR 119

Query: 116 PTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           PTPAQAT VSW+LKDGMQHVGKLICSNLGARMDSEPKRWRILAD LYDLGTGLEVLSPLC
Sbjct: 120 PTPAQATAVSWVLKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLEVLSPLC 179

Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
           P LFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 
Sbjct: 180 PHLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGV 239

Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
           GIQLASTVCSS+QGK++VGPLLS++HV+SV+EEMRAAPVNTLNPQRTAMIVAD+VK G+I
Sbjct: 240 GIQLASTVCSSIQGKLVVGPLLSILHVFSVVEEMRAAPVNTLNPQRTAMIVADYVKAGKI 299

Query: 296 SSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRW 355
           SSPADLRYREDL+FPGRLIEDAGNVKVGR  HK   PSK HELK++F EEKF+L  G+ W
Sbjct: 300 SSPADLRYREDLIFPGRLIEDAGNVKVGRALHKAFTPSKVHELKDIFPEEKFLLNRGNTW 359

Query: 356 TDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAK 415
           TDMVLEQNASGEDALRGWLVAAYA SM KS H  + +VLQDAY+KMN  F   LSELQAK
Sbjct: 360 TDMVLEQNASGEDALRGWLVAAYATSMEKSSHMCTPSVLQDAYEKMNSTFDSFLSELQAK 419

Query: 416 GWHTDRFLDGTGTRFAW 432
           GWHTDRFLDG G+RFAW
Sbjct: 420 GWHTDRFLDGVGSRFAW 436


>gi|334184613|ref|NP_001189649.1| uncharacterized protein [Arabidopsis thaliana]
 gi|330253412|gb|AEC08506.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 432

 Score =  721 bits (1861), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/432 (77%), Positives = 385/432 (89%)

Query: 1   MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
           M  LEK+K  KK++P    S ++ P+YWFETSDSV+  YQFQ DGHLS+KVVDD+RPV  
Sbjct: 1   MQFLEKVKLIKKEDPVMLKSPEDFPVYWFETSDSVSHRYQFQSDGHLSMKVVDDARPVPQ 60

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
           ++V+SFLNKFFPSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQ
Sbjct: 61  KMVESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQ 120

Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
           ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILAD LYDLGTGLE++SPLCP LFL
Sbjct: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFL 180

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
           EMAGLGNFAKG+A VAARATRLPIYSSFAKEGNLSD+FAKGEAISTLFNV G+GAGIQLA
Sbjct: 181 EMAGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLA 240

Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
           ST+CSSM+GK++VG +LSV+HVYSV+E+MR  P+NTLNPQRTA+IVA+F+KTG++ SP D
Sbjct: 241 STICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGKVPSPPD 300

Query: 301 LRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVL 360
           LR++EDL+FP R I+DAGNVKVGR  HK VKPS+   LK++F EEKF+L  G  WTDMVL
Sbjct: 301 LRFQEDLMFPERPIQDAGNVKVGRALHKAVKPSEVQRLKQVFVEEKFLLSHGKSWTDMVL 360

Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
           E +A+GEDALRGWLVAAY  SM K ++DP   +LQDAYDKMNDVF P LS++QAKGW+TD
Sbjct: 361 EHDATGEDALRGWLVAAYVKSMTKIYNDPDDIILQDAYDKMNDVFNPFLSQVQAKGWYTD 420

Query: 421 RFLDGTGTRFAW 432
           RFLDGTGTRFAW
Sbjct: 421 RFLDGTGTRFAW 432


>gi|18402585|ref|NP_565718.1| uncharacterized protein [Arabidopsis thaliana]
 gi|20197897|gb|AAD20664.2| expressed protein [Arabidopsis thaliana]
 gi|21553562|gb|AAM62655.1| unknown [Arabidopsis thaliana]
 gi|109134157|gb|ABG25076.1| At2g31190 [Arabidopsis thaliana]
 gi|330253411|gb|AEC08505.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 433

 Score =  718 bits (1854), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 335/430 (77%), Positives = 384/430 (89%)

Query: 3   LLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRV 62
           L EK+K  KK++P    S ++ P+YWFETSDSV+  YQFQ DGHLS+KVVDD+RPV  ++
Sbjct: 4   LQEKVKLIKKEDPVMLKSPEDFPVYWFETSDSVSHRYQFQSDGHLSMKVVDDARPVPQKM 63

Query: 63  VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
           V+SFLNKFFPSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQAT
Sbjct: 64  VESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQAT 123

Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
           VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILAD LYDLGTGLE++SPLCP LFLEM
Sbjct: 124 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFLEM 183

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
           AGLGNFAKG+A VAARATRLPIYSSFAKEGNLSD+FAKGEAISTLFNV G+GAGIQLAST
Sbjct: 184 AGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLAST 243

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
           +CSSM+GK++VG +LSV+HVYSV+E+MR  P+NTLNPQRTA+IVA+F+KTG++ SP DLR
Sbjct: 244 ICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGKVPSPPDLR 303

Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQ 362
           ++EDL+FP R I+DAGNVKVGR  HK VKPS+   LK++F EEKF+L  G  WTDMVLE 
Sbjct: 304 FQEDLMFPERPIQDAGNVKVGRALHKAVKPSEVQRLKQVFVEEKFLLSHGKSWTDMVLEH 363

Query: 363 NASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRF 422
           +A+GEDALRGWLVAAY  SM K ++DP   +LQDAYDKMNDVF P LS++QAKGW+TDRF
Sbjct: 364 DATGEDALRGWLVAAYVKSMTKIYNDPDDIILQDAYDKMNDVFNPFLSQVQAKGWYTDRF 423

Query: 423 LDGTGTRFAW 432
           LDGTGTRFAW
Sbjct: 424 LDGTGTRFAW 433


>gi|449462449|ref|XP_004148953.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 428

 Score =  708 bits (1827), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/432 (76%), Positives = 381/432 (88%), Gaps = 4/432 (0%)

Query: 1   MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
           M+LL K    + K+P+KS S   +P+ W E SDSV+R  QFQPDGHLSVK++DDSRP   
Sbjct: 1   MDLLNKFSA-RNKDPEKSPS---LPVSWIEVSDSVSRRCQFQPDGHLSVKIIDDSRPAIQ 56

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
           R+VDSFLN FFPSGYP+SV+EGYLRYTQFRALQH TSAALSVLSTQSLLFAAGLRPT AQ
Sbjct: 57  RIVDSFLNTFFPSGYPYSVSEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQ 116

Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
           ATVVSW+LKDGMQHVGKLICSNLGARMDSEPKRWR++AD LYDLG GLEV+SPLCP LFL
Sbjct: 117 ATVVSWVLKDGMQHVGKLICSNLGARMDSEPKRWRVIADVLYDLGAGLEVISPLCPHLFL 176

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
           EMAGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG+GAG+QLA
Sbjct: 177 EMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLA 236

Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
           ST+CSS+QGK++  PLLS++HVY V+E+MRA P+NTLNPQRTAMIVADFVK GRI SPAD
Sbjct: 237 STICSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKAGRIPSPAD 296

Query: 301 LRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVL 360
           +RY+EDL+FPGRLIE+AGNVKVGR  H+V+KPSK  E+K++F  EKF+L    +W DMVL
Sbjct: 297 IRYQEDLVFPGRLIEEAGNVKVGRALHEVIKPSKLVEMKQIFPGEKFVLNQSKKWVDMVL 356

Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
           E +ASGEDALRGWLVAAY  ++ +  H+P+ +VL +AY+KMNDVFTP LSELQAKGW+TD
Sbjct: 357 EHDASGEDALRGWLVAAYTTNIKEPSHEPTASVLLEAYEKMNDVFTPFLSELQAKGWYTD 416

Query: 421 RFLDGTGTRFAW 432
           RFLDG G+RFAW
Sbjct: 417 RFLDGAGSRFAW 428


>gi|449502048|ref|XP_004161529.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 428

 Score =  705 bits (1820), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 330/432 (76%), Positives = 379/432 (87%), Gaps = 4/432 (0%)

Query: 1   MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
           M+LL K    + K+P+KS S   +P+ W E SDSV+R  QFQPDGHLSVK++DDSRP   
Sbjct: 1   MDLLNKFSA-RNKDPEKSPS---LPVSWIEVSDSVSRRCQFQPDGHLSVKIIDDSRPAIQ 56

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
           R+VDSFLN FFPSGYP+SV+EGYLRYTQFRALQH TSAALSVLSTQSLLFAAGLRPT AQ
Sbjct: 57  RIVDSFLNTFFPSGYPYSVSEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTAAQ 116

Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
           ATVVSW+LKDGMQHVGKLICSNLGARMDSEPKRWR++AD LYDLG GLEV+SPLCP LFL
Sbjct: 117 ATVVSWVLKDGMQHVGKLICSNLGARMDSEPKRWRVIADVLYDLGAGLEVISPLCPHLFL 176

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
           EMAGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG+GAG+QLA
Sbjct: 177 EMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLA 236

Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
           ST+CSS+QGK++  PLLS++HVY V+E+MRA P+NTLNPQRTAMIVADFVK GRI SPAD
Sbjct: 237 STICSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKAGRIPSPAD 296

Query: 301 LRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVL 360
           +RY+EDL+FPGRLIE+AGNVKVGR  H+V+KPSK  E+K++F  EKF+L    +W DMVL
Sbjct: 297 IRYQEDLVFPGRLIEEAGNVKVGRALHEVIKPSKLVEMKQIFPGEKFVLNQSKKWVDMVL 356

Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
           E +ASGEDALRGWLVAAY  ++    H+P+ +VL +AY+KMND FTP LSELQAKGW+TD
Sbjct: 357 EHDASGEDALRGWLVAAYTTNIKGPSHEPTASVLLEAYEKMNDAFTPFLSELQAKGWYTD 416

Query: 421 RFLDGTGTRFAW 432
           RFLDG G+RFAW
Sbjct: 417 RFLDGAGSRFAW 428


>gi|297826561|ref|XP_002881163.1| hypothetical protein ARALYDRAFT_482047 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327002|gb|EFH57422.1| hypothetical protein ARALYDRAFT_482047 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 419

 Score =  682 bits (1761), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/432 (75%), Positives = 372/432 (86%), Gaps = 13/432 (3%)

Query: 1   MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
           M  LEK+K  KK+EP    + ++ P+YWFETSDSV+  YQFQ DGHLS+KVVDD+RPV  
Sbjct: 1   MQFLEKVKLIKKEEPLMVKNPEDFPVYWFETSDSVSHRYQFQSDGHLSMKVVDDARPVPQ 60

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
           ++V+SFLNKFFPSGYP+SVNEGYLRYTQFRALQHF+SAALSVLSTQSLLFAAGLRPTPAQ
Sbjct: 61  KMVESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGLRPTPAQ 120

Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
           ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILAD LYDLGTGLE++SPLCP LFL
Sbjct: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFL 180

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
           EMAGLGNFAKG+A VAARATRLPIYSSFAKEGNLSD+FAKGEAISTLFNV G+GAGIQLA
Sbjct: 181 EMAGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLA 240

Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
           ST+CSSM+GK++VG +LSV+HVYSV+E+MR  P+NTLNPQRTA+IVA+++K   I     
Sbjct: 241 STICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANYLKVVSI----- 295

Query: 301 LRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVL 360
                  +F  RLI+DAGNVKVGR  HK VKPS+   LK++  EEKF+L  G  WTD+VL
Sbjct: 296 -------IFK-RLIQDAGNVKVGRALHKAVKPSEVQRLKQVSVEEKFLLSHGKSWTDVVL 347

Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
           E +A+GEDALRGWLVAAYA SM K ++DP   +LQDAYDKMNDVF P LS++QAKGW+TD
Sbjct: 348 EHDATGEDALRGWLVAAYAKSMTKIYNDPDDIILQDAYDKMNDVFNPFLSQVQAKGWYTD 407

Query: 421 RFLDGTGTRFAW 432
           RFLDGTGTRFAW
Sbjct: 408 RFLDGTGTRFAW 419


>gi|356545865|ref|XP_003541354.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 431

 Score =  680 bits (1755), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/429 (75%), Positives = 370/429 (86%), Gaps = 3/429 (0%)

Query: 3   LLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRV 62
            LEK+K  KKK   +    +  P+ WFETS +V+R +QF+P+G LSV +V DSRP+YHR+
Sbjct: 5   FLEKLKIWKKKAEQEE---EGPPLVWFETSATVSRRFQFEPNGQLSVTLVGDSRPLYHRI 61

Query: 63  VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
           VDSFLNKFFPSGYP+SVNEGYLRYTQFRALQH  SAALSVLSTQSLLFAAGLRPTPAQAT
Sbjct: 62  VDSFLNKFFPSGYPYSVNEGYLRYTQFRALQHTASAALSVLSTQSLLFAAGLRPTPAQAT 121

Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
            VSW+LKDGMQH+G +ICS LGARMDSEPKRWRILAD LYDLGTGLEVLSPLCPQ FLEM
Sbjct: 122 AVSWVLKDGMQHLGNIICSKLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCPQFFLEM 181

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
           AG+GN AKG+++VA+RATRLPIYSSFAKEGNLSDLFA+GEAISTLFNVVG+G GIQLAST
Sbjct: 182 AGIGNLAKGMSIVASRATRLPIYSSFAKEGNLSDLFARGEAISTLFNVVGIGIGIQLAST 241

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
           VC+SMQGK++VGPLLS+IH+YSV EEMRA PVNTLNPQRTAMIVADF+K G +SSPADLR
Sbjct: 242 VCASMQGKLVVGPLLSIIHIYSVSEEMRATPVNTLNPQRTAMIVADFLKAGSVSSPADLR 301

Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQ 362
           YREDLLFP RLIEDAGNV+VGRD HKV+KPS+  E K++F  EKFIL   +R  DMVLEQ
Sbjct: 302 YREDLLFPRRLIEDAGNVRVGRDLHKVIKPSRLLESKQVFPGEKFILNGDNRCIDMVLEQ 361

Query: 363 NASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRF 422
           +A G+DALRGWLVA+YA  + KS H+ S + L  AY+KMN+VF   + ELQ KGWHTDRF
Sbjct: 362 DAIGKDALRGWLVASYAVQIGKSSHELSTSTLLQAYEKMNEVFPAFIKELQCKGWHTDRF 421

Query: 423 LDGTGTRFA 431
           LDG+G RFA
Sbjct: 422 LDGSGCRFA 430


>gi|356505015|ref|XP_003521288.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 415

 Score =  677 bits (1748), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 323/415 (77%), Positives = 364/415 (87%), Gaps = 1/415 (0%)

Query: 19  MSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFS 78
           M   E P+ WFETSDSV R +QF+PDG LSV +VDDSRP+Y RV  SF+NKFFPSGYP+S
Sbjct: 1   MKEGEAPVLWFETSDSVCRRHQFEPDGQLSVVIVDDSRPLYQRVAGSFMNKFFPSGYPYS 60

Query: 79  VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKL 138
           VNEGYLRYTQFRA+QH TSAALSVLSTQSLLFAAGLRPTPAQAT VSWILKDGMQHVGKL
Sbjct: 61  VNEGYLRYTQFRAVQHVTSAALSVLSTQSLLFAAGLRPTPAQATAVSWILKDGMQHVGKL 120

Query: 139 ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAAR 198
           ICSN GARMDSEPKRWR+LADALYD+GTGLEVLSPLCP LFLEMAGLGNFAKG+AVVAAR
Sbjct: 121 ICSNWGARMDSEPKRWRLLADALYDIGTGLEVLSPLCPHLFLEMAGLGNFAKGMAVVAAR 180

Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLS 258
           ATRLPIYSSFAKEGNLSDLFAKGEA STLFNV+G+G GIQLAST+C+SMQGK++ GPLLS
Sbjct: 181 ATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLASTICASMQGKLVAGPLLS 240

Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAG 318
           +IH+YSV EEMRA P+NTLNP+RTAM+VADF+K G +SSPADLRYRE+LLF   + EDAG
Sbjct: 241 IIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAGIVSSPADLRYRENLLFNVHVKEDAG 300

Query: 319 NVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAY 378
           NV+VG+D HKV+KPS+  ELK++F EEKF+L  G++  DMVLEQ+ASGEDALRGWLVAAY
Sbjct: 301 NVRVGKDVHKVIKPSRLLELKQVFPEEKFLLNFGNKCIDMVLEQDASGEDALRGWLVAAY 360

Query: 379 AASMAKSFHDPSLT-VLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTRFAW 432
           AA      H+ S T VL +AY+KMN VF   L ELQ KGWHTDRFLDGTG+RFA+
Sbjct: 361 AAQTHSFSHELSATSVLHEAYEKMNGVFPVFLRELQNKGWHTDRFLDGTGSRFAF 415


>gi|356572258|ref|XP_003554286.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 419

 Score =  666 bits (1719), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 316/411 (76%), Positives = 361/411 (87%), Gaps = 1/411 (0%)

Query: 23  EIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEG 82
           E P+ WFETSDSV   +QF+PDG LSV +VDDSRP+Y R+  SF+NKFFPSGYP+SVNEG
Sbjct: 9   EAPVLWFETSDSVCCQHQFEPDGRLSVVIVDDSRPLYQRMAGSFMNKFFPSGYPYSVNEG 68

Query: 83  YLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSN 142
           YLRYTQFRALQH TSAALSVLSTQSLLFAAGLRPTPAQAT VSWILKDGMQHVGKLICSN
Sbjct: 69  YLRYTQFRALQHVTSAALSVLSTQSLLFAAGLRPTPAQATAVSWILKDGMQHVGKLICSN 128

Query: 143 LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRL 202
            GARMDSEPKRWR+LAD LYD+G GLEVLSPLCP LFLEMAGLGNFAKG+AVVAARATRL
Sbjct: 129 WGARMDSEPKRWRLLADVLYDIGIGLEVLSPLCPHLFLEMAGLGNFAKGMAVVAARATRL 188

Query: 203 PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHV 262
           PIYSSFAKEGNLSDLFAKGEA STLFNVVG+G GIQLAST+C+S+QGK++ GPLLS+IH+
Sbjct: 189 PIYSSFAKEGNLSDLFAKGEAFSTLFNVVGIGVGIQLASTICASIQGKLVAGPLLSIIHL 248

Query: 263 YSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKV 322
           YSV EEMRA P+NTLNP+RTAM+VADF+K G +SSPADLRYR++LLF  ++ ED GNV+V
Sbjct: 249 YSVSEEMRATPINTLNPRRTAMVVADFLKAGIVSSPADLRYRDNLLFNVQVKEDTGNVRV 308

Query: 323 GRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASM 382
           G++ HKV+KPS+  ELK++F EEKF+L  G++  DMVLEQ+ASGEDALRGWLVAAYAA  
Sbjct: 309 GKNVHKVIKPSRLLELKQVFPEEKFLLNFGNKCIDMVLEQDASGEDALRGWLVAAYAART 368

Query: 383 AKSFHDPSLT-VLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTRFAW 432
             S H+ S T VL +AY+KMN VF   L ELQ KGWHTDRFLDGTG+RF++
Sbjct: 369 ESSSHELSATSVLHEAYEKMNGVFPVFLRELQNKGWHTDRFLDGTGSRFSF 419


>gi|388508448|gb|AFK42290.1| unknown [Lotus japonicus]
          Length = 414

 Score =  660 bits (1702), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/414 (75%), Positives = 357/414 (86%)

Query: 19  MSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFS 78
           M   E P++WFETS SV   + FQ +G LSV +VDDSRP+Y +V  SF+NKFFPSGYP+S
Sbjct: 1   MEKGEGPVFWFETSGSVCHRHHFQSNGQLSVMLVDDSRPLYQKVAGSFMNKFFPSGYPYS 60

Query: 79  VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKL 138
           VNEGYLRYTQFR++QH TSAALSVLSTQSLLFAAGLRPTPAQAT VSWILKDGMQHVGKL
Sbjct: 61  VNEGYLRYTQFRSVQHVTSAALSVLSTQSLLFAAGLRPTPAQATAVSWILKDGMQHVGKL 120

Query: 139 ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAAR 198
           ICSN GARMDSEPKRWR++AD LYDLGTGLEVLSPLCP LFL+MAGLGNFAKG+AVVAAR
Sbjct: 121 ICSNWGARMDSEPKRWRLIADVLYDLGTGLEVLSPLCPHLFLQMAGLGNFAKGMAVVAAR 180

Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLS 258
           ATRLPIYSSFAKEGNLSDLFAKGEA STL NV+G+G GIQLAST+C+S+QGKM+VGPLLS
Sbjct: 181 ATRLPIYSSFAKEGNLSDLFAKGEAFSTLCNVIGIGVGIQLASTICASVQGKMVVGPLLS 240

Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAG 318
           ++H+YSV EEMRA P+NTLNP+RTAMIV DF+K G +SSPADLRYREDLLF  RL EDAG
Sbjct: 241 ILHIYSVSEEMRATPINTLNPRRTAMIVTDFLKAGVVSSPADLRYREDLLFRVRLTEDAG 300

Query: 319 NVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAY 378
           NV+VGR  H+V+KPS+  EL+++   EKF+L  G +  DMVLEQ+ASGEDALRGWLVAAY
Sbjct: 301 NVRVGRALHEVIKPSRLLELEQVLPGEKFLLNRGGKCVDMVLEQDASGEDALRGWLVAAY 360

Query: 379 AASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTRFAW 432
           AA +  S H+ S +VL +AY KM  VF   L ELQ+KGWHTDRFLDGTG+RFA+
Sbjct: 361 AAQIENSSHELSASVLHEAYKKMTGVFPVFLKELQSKGWHTDRFLDGTGSRFAF 414


>gi|242076506|ref|XP_002448189.1| hypothetical protein SORBIDRAFT_06g022670 [Sorghum bicolor]
 gi|241939372|gb|EES12517.1| hypothetical protein SORBIDRAFT_06g022670 [Sorghum bicolor]
          Length = 439

 Score =  613 bits (1580), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 298/440 (67%), Positives = 358/440 (81%), Gaps = 13/440 (2%)

Query: 1   MNLLEKIK----KQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQP----DGHLSVKVV 52
           MN+L +I     +    EP         P+ W E S+SV+R   F       G +SVKV+
Sbjct: 1   MNILARISGGGDRAAATEPPPPQ-----PMCWVEISESVSRLCSFDAAGSGGGSISVKVI 55

Query: 53  DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
            D+RP++ +VVDSFLNKFFPSGYP+SVNEGYL YT+FRALQHF+SA L VLSTQSLLFAA
Sbjct: 56  QDARPIHDKVVDSFLNKFFPSGYPYSVNEGYLTYTRFRALQHFSSAMLHVLSTQSLLFAA 115

Query: 113 GLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLS 172
           GLRPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+S
Sbjct: 116 GLRPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVS 175

Query: 173 PLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG 232
           PLCPQLFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G
Sbjct: 176 PLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMG 235

Query: 233 LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
           +GAGI LASTVCS+ QGK+I GPLLS +H+Y V++EMRA PVNTLNPQRTAMIVADF+K+
Sbjct: 236 IGAGIGLASTVCSTTQGKLIAGPLLSAVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKS 295

Query: 293 GRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLG 352
           G++SSPA+LRYREDLLFP R+IE+AG+VK+G+   +V+ PS   +L+  F +EKF+L   
Sbjct: 296 GKVSSPAELRYREDLLFPNRVIEEAGSVKIGQPLRRVLSPSLVEQLRATFPDEKFLLTQK 355

Query: 353 SRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSEL 412
           S  T MVLEQ+ASGEDALRGWLVAA+A+ M +S      TVL++AY+K   VF   +SE+
Sbjct: 356 SNKTYMVLEQSASGEDALRGWLVAAFASEMERSGIVSRDTVLKEAYEKTKMVFPSFVSEV 415

Query: 413 QAKGWHTDRFLDGTGTRFAW 432
           +++GW+TD+FLDG G+R A+
Sbjct: 416 RSRGWYTDQFLDGNGSRVAF 435


>gi|115459438|ref|NP_001053319.1| Os04g0517300 [Oryza sativa Japonica Group]
 gi|113564890|dbj|BAF15233.1| Os04g0517300 [Oryza sativa Japonica Group]
 gi|116310213|emb|CAH67223.1| OSIGBa0145M07.5 [Oryza sativa Indica Group]
 gi|215697660|dbj|BAG91654.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score =  608 bits (1568), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 290/432 (67%), Positives = 354/432 (81%), Gaps = 3/432 (0%)

Query: 1   MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYH 60
           MN+LE+I+    +        +  P  W E S+S++R   F   G +SVKV+ DSRP++ 
Sbjct: 1   MNILERIRGGGDRAAVGEGPREPEP--WVEISESISRLCSFDA-GRVSVKVIQDSRPIHD 57

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
           +++DSFLNKFFPSGYP+SVNEGYL YT+FRALQHF+SA L VLSTQSLLFAAGLRPTPAQ
Sbjct: 58  KMIDSFLNKFFPSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGLRPTPAQ 117

Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
           AT VSWILKDGMQH GKLICS +GARMDSEPK WRILAD LYD GT LEV+SPLCPQLFL
Sbjct: 118 ATAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRILADVLYDFGTALEVISPLCPQLFL 177

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
           E+AG GNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LA
Sbjct: 178 EVAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLA 237

Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
           STVCS+ QGK+I GPLLSV+H+Y V++EMRA PVNTLNPQRTAMIVADF+K+G++SSPA+
Sbjct: 238 STVCSTTQGKLIAGPLLSVVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAE 297

Query: 301 LRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVL 360
           LRYREDLLFP RLIE+AG+VK+G+   +V+ P +  +LK  F++EKF+L        MVL
Sbjct: 298 LRYREDLLFPNRLIEEAGSVKIGQPVRRVLSPQRIEQLKATFSKEKFLLSRKDNSAYMVL 357

Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
           EQ+A+GEDALRGWLVAA+A+ M +S      TVL  AY++M +VF   ++E++++GW+TD
Sbjct: 358 EQSATGEDALRGWLVAAFASEMERSGVGSGDTVLNVAYERMENVFPMFVAEVKSRGWYTD 417

Query: 421 RFLDGTGTRFAW 432
           +FLDG  +R A+
Sbjct: 418 QFLDGNRSRIAY 429


>gi|357164706|ref|XP_003580140.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
           distachyon]
          Length = 437

 Score =  600 bits (1548), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 288/435 (66%), Positives = 353/435 (81%), Gaps = 5/435 (1%)

Query: 1   MNLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQP---DGHLSVKVVDDSRP 57
           MN+LE+I+    K        +  P  W E S+SV+R   F      G +SVKV+ DSRP
Sbjct: 1   MNILERIRGGGDKTTGTETPRKLEP--WVEISESVSRLCCFDARSRGGGISVKVIQDSRP 58

Query: 58  VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT 117
           ++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FRALQHF+SA L VLSTQSLLFAAGLRPT
Sbjct: 59  IHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGLRPT 118

Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
           PAQAT VSWILKDGMQH GKLICS +GARMDSEPK WRI AD LYD GT L+ +SPLCPQ
Sbjct: 119 PAQATAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRIFADVLYDFGTALDFISPLCPQ 178

Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
           LFLE+AGLGNFAKG+AVVAARATRLPIYS+FAKEGNLSDLFAKGEAISTL NV+G+GAGI
Sbjct: 179 LFLEVAGLGNFAKGMAVVAARATRLPIYSAFAKEGNLSDLFAKGEAISTLSNVMGIGAGI 238

Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
            L+STVCS+ QGK+I+GPLLS +H+Y V++EMRA P+N+LNPQRTAM+VADF+K+G++SS
Sbjct: 239 GLSSTVCSTTQGKLIIGPLLSAVHLYGVVQEMRAIPINSLNPQRTAMVVADFIKSGKVSS 298

Query: 298 PADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTD 357
           PA+LRYREDLLFP RLIE+AG+VKVG+   +++ P +  +L+  +  EKF+L L S  T 
Sbjct: 299 PAELRYREDLLFPNRLIEEAGSVKVGQPLCRILSPQRVEQLRATYPNEKFLLSLKSNKTY 358

Query: 358 MVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGW 417
           MVLEQ+A+GEDALRGWLVAA+A+ M KS      +VL +AY KM DVF   +SE++++GW
Sbjct: 359 MVLEQSATGEDALRGWLVAAFASEMEKSGVGSRDSVLNEAYQKMEDVFPMFVSEVRSRGW 418

Query: 418 HTDRFLDGTGTRFAW 432
           +TD+FLDG  +R A+
Sbjct: 419 YTDQFLDGNRSRIAY 433


>gi|414586417|tpg|DAA36988.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
          Length = 439

 Score =  600 bits (1546), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 294/439 (66%), Positives = 355/439 (80%), Gaps = 11/439 (2%)

Query: 1   MNLLEKIKKQKKKEPDKSMSVQEIPI--YWFETSDSVTRHYQFQPDGH----LSVKVVDD 54
           MN+LE+I+       D++ + +  P    W E S+SV+R   F   G     +SVKV+ D
Sbjct: 1   MNILERIRGGG----DRAAATERPPRPECWVEISESVSRLCSFDAAGSGSRSISVKVIQD 56

Query: 55  SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
           SRP++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQSLLFAAGL
Sbjct: 57  SRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL 116

Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
           RPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPL
Sbjct: 117 RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPL 176

Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
           CPQLFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+G
Sbjct: 177 CPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIG 236

Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
           AGI LASTVCS+ QGK+I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K+G+
Sbjct: 237 AGIGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGK 296

Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSR 354
           +SSPA+LRY+EDLLFP R+IE+AG+VK+G+   KV  P    +L+  F  EKF+L   S 
Sbjct: 297 VSSPAELRYKEDLLFPNRVIEEAGSVKIGQPLRKVFSPRLVEQLRATFPGEKFLLSQKSN 356

Query: 355 WTDMVLEQNASGEDALRGWLVAAYAASMAKS-FHDPSLTVLQDAYDKMNDVFTPLLSELQ 413
            T MVLE++ASGEDALRGWLVAA+A+ M +S        VL +AY+KM  VF   +SE++
Sbjct: 357 RTYMVLERSASGEDALRGWLVAAFASEMERSGAGSRDDAVLSEAYEKMERVFPTFVSEVR 416

Query: 414 AKGWHTDRFLDGTGTRFAW 432
           ++GW+TD+FLDG  +R ++
Sbjct: 417 SRGWYTDQFLDGNCSRVSF 435


>gi|226494744|ref|NP_001146435.1| uncharacterized protein LOC100280018 [Zea mays]
 gi|219887187|gb|ACL53968.1| unknown [Zea mays]
          Length = 439

 Score =  597 bits (1539), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 293/439 (66%), Positives = 354/439 (80%), Gaps = 11/439 (2%)

Query: 1   MNLLEKIKKQKKKEPDKSMSVQEIPI--YWFETSDSVTRHYQFQPDGH----LSVKVVDD 54
           MN+LE+I+       D++ + +  P    W E S+SV+R   F   G     +SVKV+ D
Sbjct: 1   MNILERIRGGG----DRAAATERPPRPECWVEISESVSRLCSFDAAGSGSRSISVKVIQD 56

Query: 55  SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
           SRP++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQSLLFAAGL
Sbjct: 57  SRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL 116

Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
           RPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK W ILAD LYDLGT LEV+SPL
Sbjct: 117 RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWTILADVLYDLGTALEVVSPL 176

Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
           CPQLFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+G
Sbjct: 177 CPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIG 236

Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
           AGI LASTVCS+ QGK+I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K+G+
Sbjct: 237 AGIGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGK 296

Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSR 354
           +SSPA+LRY+EDLLFP R+IE+AG+VK+G+   KV  P    +L+  F  EKF+L   S 
Sbjct: 297 VSSPAELRYKEDLLFPNRVIEEAGSVKIGQPLRKVFSPRLVEQLRATFPGEKFLLSQKSN 356

Query: 355 WTDMVLEQNASGEDALRGWLVAAYAASMAKS-FHDPSLTVLQDAYDKMNDVFTPLLSELQ 413
            T MVLE++ASGEDALRGWLVAA+A+ M +S        VL +AY+KM  VF   +SE++
Sbjct: 357 RTYMVLERSASGEDALRGWLVAAFASEMERSGAGSRDDAVLSEAYEKMERVFPTFVSEVR 416

Query: 414 AKGWHTDRFLDGTGTRFAW 432
           ++GW+TD+FLDG  +R ++
Sbjct: 417 SRGWYTDQFLDGNCSRVSF 435


>gi|38568058|emb|CAE05447.3| OSJNBa0073E02.7 [Oryza sativa Japonica Group]
 gi|125549033|gb|EAY94855.1| hypothetical protein OsI_16651 [Oryza sativa Indica Group]
 gi|125590999|gb|EAZ31349.1| hypothetical protein OsJ_15471 [Oryza sativa Japonica Group]
          Length = 380

 Score =  576 bits (1484), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 269/371 (72%), Positives = 320/371 (86%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
           ++DSFLNKFFPSGYP+SVNEGYL YT+FRALQHF+SA L VLSTQSLLFAAGLRPTPAQA
Sbjct: 1   MIDSFLNKFFPSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGLRPTPAQA 60

Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
           T VSWILKDGMQH GKLICS +GARMDSEPK WRILAD LYD GT LEV+SPLCPQLFLE
Sbjct: 61  TAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRILADVLYDFGTALEVISPLCPQLFLE 120

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
           +AG GNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LAS
Sbjct: 121 VAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLAS 180

Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
           TVCS+ QGK+I GPLLSV+H+Y V++EMRA PVNTLNPQRTAMIVADF+K+G++SSPA+L
Sbjct: 181 TVCSTTQGKLIAGPLLSVVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAEL 240

Query: 302 RYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLE 361
           RYREDLLFP RLIE+AG+VK+G+   +V+ P +  +LK  F++EKF+L        MVLE
Sbjct: 241 RYREDLLFPNRLIEEAGSVKIGQPVRRVLSPQRIEQLKATFSKEKFLLSRKDNSAYMVLE 300

Query: 362 QNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
           Q+A+GEDALRGWLVAA+A+ M +S      TVL  AY++M +VF   ++E++++GW+TD+
Sbjct: 301 QSATGEDALRGWLVAAFASEMERSGVGSGDTVLNVAYERMENVFPMFVAEVKSRGWYTDQ 360

Query: 422 FLDGTGTRFAW 432
           FLDG  +R A+
Sbjct: 361 FLDGNRSRIAY 371


>gi|255639094|gb|ACU19847.1| unknown [Glycine max]
          Length = 302

 Score =  530 bits (1364), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 249/302 (82%), Positives = 275/302 (91%)

Query: 19  MSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFS 78
           M   E P+ WFETSDSV R +QF+PDG LSV +VDDSRP+Y RV  SF+NKFFPSGYP+S
Sbjct: 1   MKEGEAPVLWFETSDSVCRRHQFEPDGQLSVVIVDDSRPLYQRVAGSFMNKFFPSGYPYS 60

Query: 79  VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKL 138
           VNEGYLRYTQFRA+QH TSAALSVLSTQSLLFAAGLRPTPAQAT VSWILKDGMQHVGKL
Sbjct: 61  VNEGYLRYTQFRAVQHVTSAALSVLSTQSLLFAAGLRPTPAQATAVSWILKDGMQHVGKL 120

Query: 139 ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAAR 198
           ICSN GARMDSEPKRWR+LADALYD+GTGLEVLSPLCP LFLEMAGLGNFAKG+AVVAAR
Sbjct: 121 ICSNWGARMDSEPKRWRLLADALYDIGTGLEVLSPLCPHLFLEMAGLGNFAKGMAVVAAR 180

Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLS 258
           ATRLPIYSSFAKEGNLSDLFAKGEA STLFNV+G+G GIQLAST+C+SMQGK++ GPLLS
Sbjct: 181 ATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLASTICASMQGKLVAGPLLS 240

Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAG 318
           +IH+YSV EEMRA P+NTLNP+RTAM+VADF+K G +SSPADLRYRE+LLF   + EDAG
Sbjct: 241 IIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAGIVSSPADLRYRENLLFNVHVKEDAG 300

Query: 319 NV 320
           NV
Sbjct: 301 NV 302


>gi|168027023|ref|XP_001766030.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682673|gb|EDQ69089.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 397

 Score =  498 bits (1282), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 244/389 (62%), Positives = 300/389 (77%), Gaps = 5/389 (1%)

Query: 49  VKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSL 108
           VKV+ ++RP     VDSFL++FFP GYP SV EGYL Y+++RA+QHF SA + VLSTQSL
Sbjct: 9   VKVLKETRPPVQIAVDSFLSRFFPLGYPNSVAEGYLDYSKYRAVQHFASAVMHVLSTQSL 68

Query: 109 LFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGL 168
           LFAAGLRPTPAQAT+VSW+LKDGMQHVGKL+CS +GARMDSEPKRWRI AD +YD+G GL
Sbjct: 69  LFAAGLRPTPAQATIVSWVLKDGMQHVGKLLCSRMGARMDSEPKRWRIFADFMYDVGAGL 128

Query: 169 EVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
           EV+SPLCPQ FL +AG+ N AKG+A+V+ARATRLP+YSSF KEGNLSDL+AKGEAISTL 
Sbjct: 129 EVISPLCPQHFLPVAGMANLAKGMALVSARATRLPVYSSFVKEGNLSDLYAKGEAISTLS 188

Query: 229 NVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
           NV+GLG GI LAS   S++QGK+I  P+LS IH+Y V +EMRAAP+NTLN QRTAM+VAD
Sbjct: 189 NVLGLGVGIHLASNFASTIQGKLIWAPILSAIHLYCVQQEMRAAPINTLNSQRTAMLVAD 248

Query: 289 FVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHK-VVKPSKFHELKELFAEEKF 347
           FVKTG +SSPA+LRYRE L+ P  L +DAGNV+VG      + KPS   ELKE F  E+F
Sbjct: 249 FVKTGSVSSPAELRYRERLILPVGLEKDAGNVRVGAPLQSGIGKPSVMAELKEKFQGEQF 308

Query: 348 ILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKS--FHDPSL--TVLQDAYDKMND 403
           +L  G + TDMVL Q ASGEDA+RGWL+A + + + K   F D  +   +L+ AYD+   
Sbjct: 309 LLHFGEQATDMVLHQKASGEDAVRGWLLATHVSRLCKGRKFQDVDVYGKILEKAYDETER 368

Query: 404 VFTPLLSELQAKGWHTDRFLDGTGTRFAW 432
                +S L+  GWH+  FL+G+G R  W
Sbjct: 369 CLPAFISGLKDHGWHSHLFLEGSGVRAVW 397


>gi|302756337|ref|XP_002961592.1| hypothetical protein SELMODRAFT_77235 [Selaginella moellendorffii]
 gi|300170251|gb|EFJ36852.1| hypothetical protein SELMODRAFT_77235 [Selaginella moellendorffii]
          Length = 399

 Score =  498 bits (1281), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 238/406 (58%), Positives = 309/406 (76%), Gaps = 12/406 (2%)

Query: 28  WFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYT 87
           W E S++  R Y  + DG L  +++DD+RP+ H++VD FL++FFP GYP SV EGYL Y+
Sbjct: 5   WRERSENARREYVSKGDGTLQARLLDDNRPLKHKLVDGFLDQFFPVGYPQSVGEGYLTYS 64

Query: 88  QFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARM 147
           QFRALQHF+SA LSV STQSL+FAAGLRPTPAQATV+SW+LKDGMQH+GKL+ S +GARM
Sbjct: 65  QFRALQHFSSAILSVFSTQSLVFAAGLRPTPAQATVISWVLKDGMQHIGKLVGSRMGARM 124

Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
           DSEPKRWRI AD LYD GT LE++SP CPQ FL +AGL N AKG+A+V ARATRLP+Y++
Sbjct: 125 DSEPKRWRIFADVLYDAGTALEIISPACPQYFLGVAGLANLAKGVAMVTARATRLPLYTA 184

Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIE 267
           FAKEGNLSDL+AKGEA+STL NV GLG GI+LAS+ C+S+QGK++V PLLSVIH+YSV+E
Sbjct: 185 FAKEGNLSDLYAKGEAMSTLANVAGLGVGIRLASSACTSLQGKLLVAPLLSVIHLYSVME 244

Query: 268 EMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPH 327
           EMRA P++TLN QRTAM+VAD+++ GR+ SPA++RYRE L+ P +    AG+V++  D  
Sbjct: 245 EMRAVPLDTLNAQRTAMLVADYLELGRVPSPAEIRYRERLILPVKPNVHAGSVRIANDIL 304

Query: 328 KVVKPS-KFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSF 386
           ++  PS K  ELK  F  E+F+L      T+MVL  +ASGED +RGWL+AA ++  A   
Sbjct: 305 QI--PSGKIRELKHRFQSERFLLDFHETSTEMVLHHSASGEDVVRGWLLAACSSKEAS-- 360

Query: 387 HDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTRFAW 432
                  L DAY +++ +    LS+L+++GW T  FL G+  R  W
Sbjct: 361 -------LSDAYSRVDKMLPEFLSQLKSRGWQTQLFLHGSAARAFW 399


>gi|302775588|ref|XP_002971211.1| hypothetical protein SELMODRAFT_94812 [Selaginella moellendorffii]
 gi|300161193|gb|EFJ27809.1| hypothetical protein SELMODRAFT_94812 [Selaginella moellendorffii]
          Length = 399

 Score =  497 bits (1280), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 238/406 (58%), Positives = 309/406 (76%), Gaps = 12/406 (2%)

Query: 28  WFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYT 87
           W E S++  R Y  + DG L  +++DD+RP+ H++VD FL++FFP GYP SV EGYL Y+
Sbjct: 5   WRERSENARREYVSKGDGTLQARLLDDNRPLKHKLVDGFLDQFFPVGYPQSVGEGYLTYS 64

Query: 88  QFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARM 147
           QFRALQHF+SA LSV STQSL+FAAGLRPTPAQATV+SW+LKDGMQH+GKL+ S +GARM
Sbjct: 65  QFRALQHFSSAILSVFSTQSLVFAAGLRPTPAQATVISWVLKDGMQHIGKLVGSRMGARM 124

Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
           DSEPKRWRI AD LYD GT LE++SP CPQ FL +AGL N AKG+A+V ARATRLP+Y++
Sbjct: 125 DSEPKRWRIFADVLYDAGTALEMISPACPQYFLGVAGLANLAKGVAMVTARATRLPLYTA 184

Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIE 267
           FAKEGNLSDL+AKGEA+STL NV GLG GI+LAS+ C+S+QGK++V PLLSVIH+YSV+E
Sbjct: 185 FAKEGNLSDLYAKGEAMSTLANVAGLGVGIRLASSACTSLQGKLLVAPLLSVIHLYSVME 244

Query: 268 EMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPH 327
           EMRA P++TLN QRTAM+VAD+++ GR+ SPA++RYRE L+ P +    AG+V++  D  
Sbjct: 245 EMRAVPLDTLNAQRTAMLVADYLELGRVPSPAEIRYRERLILPVKPNVHAGSVRIANDIL 304

Query: 328 KVVKPS-KFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSF 386
           ++  PS K  ELK  F  E+F+L      T+MVL  +ASGED +RGWL+AA ++  A   
Sbjct: 305 QI--PSGKIRELKHRFQSERFLLDFHETSTEMVLHHSASGEDVVRGWLLAACSSKEAS-- 360

Query: 387 HDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTRFAW 432
                  L DAY +++ +    LS+L+++GW T  FL G+  R  W
Sbjct: 361 -------LSDAYSRVDKMLPEFLSQLKSRGWQTQLFLHGSAARAFW 399


>gi|414586418|tpg|DAA36989.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
          Length = 297

 Score =  455 bits (1170), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 221/297 (74%), Positives = 252/297 (84%), Gaps = 10/297 (3%)

Query: 1   MNLLEKIKKQKKKEPDKSMSVQEIPI--YWFETSDSVTRHYQFQPDGH----LSVKVVDD 54
           MN+LE+I+       D++ + +  P    W E S+SV+R   F   G     +SVKV+ D
Sbjct: 1   MNILERIRGGG----DRAAATERPPRPECWVEISESVSRLCSFDAAGSGSRSISVKVIQD 56

Query: 55  SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
           SRP++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQSLLFAAGL
Sbjct: 57  SRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL 116

Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
           RPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPL
Sbjct: 117 RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPL 176

Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
           CPQLFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+G
Sbjct: 177 CPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIG 236

Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
           AGI LASTVCS+ QGK+I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K
Sbjct: 237 AGIGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIK 293


>gi|414586420|tpg|DAA36991.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
          Length = 306

 Score =  439 bits (1130), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 210/302 (69%), Positives = 253/302 (83%), Gaps = 1/302 (0%)

Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
           MQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPLCPQLFLE+AGLGNFAKG
Sbjct: 1   MQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKG 60

Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
           +AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LASTVCS+ QGK+
Sbjct: 61  MAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKL 120

Query: 252 IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPG 311
           I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K+G++SSPA+LRY+EDLLFP 
Sbjct: 121 IAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAELRYKEDLLFPN 180

Query: 312 RLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
           R+IE+AG+VK+G+   KV  P    +L+  F  EKF+L   S  T MVLE++ASGEDALR
Sbjct: 181 RVIEEAGSVKIGQPLRKVFSPRLVEQLRATFPGEKFLLSQKSNRTYMVLERSASGEDALR 240

Query: 372 GWLVAAYAASMAKS-FHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTRF 430
           GWLVAA+A+ M +S        VL +AY+KM  VF   +SE++++GW+TD+FLDG  +R 
Sbjct: 241 GWLVAAFASEMERSGAGSRDDAVLSEAYEKMERVFPTFVSEVRSRGWYTDQFLDGNCSRV 300

Query: 431 AW 432
           ++
Sbjct: 301 SF 302


>gi|414586414|tpg|DAA36985.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
          Length = 288

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/269 (72%), Positives = 225/269 (83%), Gaps = 10/269 (3%)

Query: 1   MNLLEKIKKQKKKEPDKSMSVQEIPI--YWFETSDSVTRHYQFQPDGH----LSVKVVDD 54
           MN+LE+I+       D++ + +  P    W E S+SV+R   F   G     +SVKV+ D
Sbjct: 1   MNILERIRGGG----DRAAATERPPRPECWVEISESVSRLCSFDAAGSGSRSISVKVIQD 56

Query: 55  SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
           SRP++ ++VDSFLNK FPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQSLLFAAGL
Sbjct: 57  SRPIHDKLVDSFLNKIFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL 116

Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
           RPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPL
Sbjct: 117 RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPL 176

Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
           CPQLFLE+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+G
Sbjct: 177 CPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIG 236

Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVY 263
           AGI LASTVCS+ QGK+I GPLLS +H++
Sbjct: 237 AGIGLASTVCSTTQGKLIAGPLLSAVHIW 265


>gi|149392593|gb|ABR26099.1| unknown [Oryza sativa Indica Group]
          Length = 282

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 185/274 (67%), Positives = 230/274 (83%)

Query: 159 DALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLF 218
           D LYD GT LEV+SPLCPQLFLE+AG GNFAKG+AVVAARATRLPIYSSFAKEGNLSDLF
Sbjct: 1   DVLYDFGTALEVISPLCPQLFLEVAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLF 60

Query: 219 AKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLN 278
           AKGEAISTLFNV+G+GAGI LASTVCS+ QGK+I GPLLSV+H+Y V++EMRA PVNTLN
Sbjct: 61  AKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAGPLLSVVHIYGVVQEMRATPVNTLN 120

Query: 279 PQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHEL 338
           PQRTAMIVADF+K+G++SSPA+LRYREDLLFP RLIE+AG+VK+G+   +V+ P +  +L
Sbjct: 121 PQRTAMIVADFIKSGKVSSPAELRYREDLLFPNRLIEEAGSVKIGQPVRRVLSPQRIEQL 180

Query: 339 KELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAY 398
           K  F++EKF+L        MVLEQ+A+GEDALRGWLVAA+A+ M +S      TVL  AY
Sbjct: 181 KATFSKEKFLLSRKDNSAYMVLEQSATGEDALRGWLVAAFASEMERSGVGSGDTVLNVAY 240

Query: 399 DKMNDVFTPLLSELQAKGWHTDRFLDGTGTRFAW 432
           ++M +VF   ++E++++GW+TD+FLDG  +R A+
Sbjct: 241 ERMENVFPMFVAEVKSRGWYTDQFLDGNRSRIAY 274


>gi|414586419|tpg|DAA36990.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
          Length = 311

 Score =  390 bits (1001), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 197/313 (62%), Positives = 239/313 (76%), Gaps = 19/313 (6%)

Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
           A V  WIL    Q VG  +  N               AD LYDLGT LEV+SPLCPQLFL
Sbjct: 13  AWVQEWILN---QRVGGYLVRN---------------ADVLYDLGTALEVVSPLCPQLFL 54

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
           E+AGLGNFAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LA
Sbjct: 55  EVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLA 114

Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
           STVCS+ QGK+I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K+G++SSPA+
Sbjct: 115 STVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAE 174

Query: 301 LRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVL 360
           LRY+EDLLFP R+IE+AG+VK+G+   KV  P    +L+  F  EKF+L   S  T MVL
Sbjct: 175 LRYKEDLLFPNRVIEEAGSVKIGQPLRKVFSPRLVEQLRATFPGEKFLLSQKSNRTYMVL 234

Query: 361 EQNASGEDALRGWLVAAYAASMAKS-FHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHT 419
           E++ASGEDALRGWLVAA+A+ M +S        VL +AY+KM  VF   +SE++++GW+T
Sbjct: 235 ERSASGEDALRGWLVAAFASEMERSGAGSRDDAVLSEAYEKMERVFPTFVSEVRSRGWYT 294

Query: 420 DRFLDGTGTRFAW 432
           D+FLDG  +R ++
Sbjct: 295 DQFLDGNCSRVSF 307


>gi|414586421|tpg|DAA36992.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
          Length = 164

 Score =  293 bits (750), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 137/160 (85%), Positives = 150/160 (93%)

Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
           MQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPLCPQLFLE+AGLGNFAKG
Sbjct: 1   MQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKG 60

Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
           +AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LASTVCS+ QGK+
Sbjct: 61  MAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKL 120

Query: 252 IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
           I GPLLS +H+Y VI+EMRA PVNTLNPQRTAMIVADF+K
Sbjct: 121 IAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIK 160


>gi|413942077|gb|AFW74726.1| hypothetical protein ZEAMMB73_469739 [Zea mays]
          Length = 425

 Score =  286 bits (733), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 162/302 (53%), Positives = 186/302 (61%), Gaps = 79/302 (26%)

Query: 48  SVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQS 107
           +VKV+ DSRP++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQ 
Sbjct: 57  TVKVIQDSRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQL 116

Query: 108 LL-----------------FAAGLRPTPAQATVVS------------------------- 125
           +                  +A  L       +V++                         
Sbjct: 117 VFQNRLLSVLSSLTSAHFNYARQLTAQVGCGSVLAVCCRFATYPDASNCCKLGKHMMTEM 176

Query: 126 ----WILKDGMQHVGKLICSNLGARMDSEPKRWRIL------------------------ 157
                ILKDGMQH  KLICS++GARMDSEPK WRIL                        
Sbjct: 177 LKSKQILKDGMQHTRKLICSSMGARMDSEPKSWRILGLTAYVLYGCHCLGSHFTIFPIIC 236

Query: 158 -------ADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAK 210
                  AD LYDLGT LEV+SPLCPQLFLE+AGLGNFAKG+ VVAARATRLPIYSSFAK
Sbjct: 237 LYCSILTADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMVVVAARATRLPIYSSFAK 296

Query: 211 EGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMR 270
           EGNLSDLFAKGEAISTLFNV+G+GAGI LASTVCS+ QGK +  PLL      SV E + 
Sbjct: 297 EGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKEV--PLLEKYKCRSVNEFVD 354

Query: 271 AA 272
           A 
Sbjct: 355 AC 356


>gi|388492306|gb|AFK34219.1| unknown [Lotus japonicus]
          Length = 182

 Score =  261 bits (668), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 123/182 (67%), Positives = 148/182 (81%)

Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
           M+VGPLLS++H+YSV EEMRA P+NTLNP+RTAMIV DF+K G +SSPADLRYREDLLF 
Sbjct: 1   MVVGPLLSILHIYSVSEEMRATPINTLNPRRTAMIVTDFLKAGVVSSPADLRYREDLLFR 60

Query: 311 GRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDAL 370
            RL EDAGNV+VGR  H+V+KPS+  EL+++   EKF+L  G +  DMVLEQ+ASGEDAL
Sbjct: 61  VRLTEDAGNVRVGRALHEVIKPSRLLELEQVLPGEKFLLNRGGKCVDMVLEQDASGEDAL 120

Query: 371 RGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTRF 430
           RGWLVAAYAA +  S H+ S +VL +AY KM  VF   L ELQ+KGWHTDRFLDGTG+RF
Sbjct: 121 RGWLVAAYAAQIENSSHELSASVLHEAYKKMTGVFPVFLKELQSKGWHTDRFLDGTGSRF 180

Query: 431 AW 432
           A+
Sbjct: 181 AF 182


>gi|413942078|gb|AFW74727.1| hypothetical protein ZEAMMB73_469739, partial [Zea mays]
          Length = 199

 Score =  254 bits (649), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 116/143 (81%), Positives = 129/143 (90%)

Query: 48  SVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQS 107
           +VKV+ DSRP++ ++VDSFLNKFFPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQS
Sbjct: 57  TVKVIQDSRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQS 116

Query: 108 LLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTG 167
           LLFA GLRPTP QAT VSWILKDGMQH  KLICS++GARMDSEPK WRILAD LYDLGT 
Sbjct: 117 LLFAVGLRPTPTQATAVSWILKDGMQHTRKLICSSMGARMDSEPKSWRILADVLYDLGTA 176

Query: 168 LEVLSPLCPQLFLEMAGLGNFAK 190
           LEV+SPLCPQLFLE+AGLGNFAK
Sbjct: 177 LEVVSPLCPQLFLEVAGLGNFAK 199


>gi|414586416|tpg|DAA36987.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
          Length = 155

 Score =  239 bits (609), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 112/132 (84%), Positives = 124/132 (93%)

Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
           MQH GKLICS++GARMDSEPK WRILAD LYDLGT LEV+SPLCPQLFLE+AGLGNFAKG
Sbjct: 1   MQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKG 60

Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
           +AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV+G+GAGI LASTVCS+ QGK+
Sbjct: 61  MAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKL 120

Query: 252 IVGPLLSVIHVY 263
           I GPLLS +H++
Sbjct: 121 IAGPLLSAVHIW 132


>gi|356510851|ref|XP_003524147.1| PREDICTED: UPF0420 protein-like, partial [Glycine max]
          Length = 158

 Score =  227 bits (579), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 107/123 (86%), Positives = 115/123 (93%)

Query: 128 LKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
           LKDGMQHVGKLICSN G  MDSEPKRWR+LADALYD+GTGLEVLSP CP LFLEMAGLGN
Sbjct: 1   LKDGMQHVGKLICSNWGGTMDSEPKRWRLLADALYDIGTGLEVLSPRCPHLFLEMAGLGN 60

Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
           FAKG++VVAARATRLPIYSSFAKEGNLSDL AKGEA STLFNV+G+G GIQLAST+C+SM
Sbjct: 61  FAKGMSVVAARATRLPIYSSFAKEGNLSDLLAKGEAFSTLFNVIGIGVGIQLASTICASM 120

Query: 248 QGK 250
           QGK
Sbjct: 121 QGK 123


>gi|356498113|ref|XP_003517898.1| PREDICTED: UPF0420 protein-like, partial [Glycine max]
          Length = 171

 Score =  224 bits (571), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 106/126 (84%), Positives = 117/126 (92%)

Query: 128 LKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
           LK+GMQHVGKLI SN G  MD EPKRWR+LADALYD+GTGLEVLSP CP LFLEMAGLGN
Sbjct: 1   LKNGMQHVGKLIGSNWGGTMDFEPKRWRLLADALYDIGTGLEVLSPWCPHLFLEMAGLGN 60

Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
           FAKG+AVVAARATRLPIYSSFAKEGNLSDLFAKGEA STLFNV+G+G GIQLAST+C+SM
Sbjct: 61  FAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLASTICASM 120

Query: 248 QGKMIV 253
           QGK+++
Sbjct: 121 QGKVVI 126


>gi|414586415|tpg|DAA36986.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
          Length = 163

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 105/165 (63%), Positives = 126/165 (76%), Gaps = 10/165 (6%)

Query: 1   MNLLEKIKKQKKKEPDKSMSVQEIPI--YWFETSDSVTRHYQFQPDGH----LSVKVVDD 54
           MN+LE+I+       D++ + +  P    W E S+SV+R   F   G     +SVKV+ D
Sbjct: 1   MNILERIRGGG----DRAAATERPPRPECWVEISESVSRLCSFDAAGSGSRSISVKVIQD 56

Query: 55  SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
           SRP++ ++VDSFLNK FPSGYP+SVNEGYL YT+FR LQHF+SA L VLSTQSLLFAAGL
Sbjct: 57  SRPIHDKLVDSFLNKIFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL 116

Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILAD 159
           RPTPAQAT VSWILKDGMQH GKLICS++GARMDSEPK WRIL +
Sbjct: 117 RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILGE 161


>gi|449016609|dbj|BAM80011.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 496

 Score =  191 bits (484), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 125/409 (30%), Positives = 205/409 (50%), Gaps = 12/409 (2%)

Query: 22  QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNE 81
           Q  P+   E      R Y+    GHL V  V   R    +  D F   F P GYP SV  
Sbjct: 76  QVPPVLAVEQIRYGQREYRINSAGHLEVAEVRTRRAWGLKAADRFSRMFLPDGYPDSVGP 135

Query: 82  GYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQ-ATVVSWILKDGMQHVGKLI 139
            Y  YT++R    F    + V STQSLL A G+ RP+ A  A  + W+++DG+   G+++
Sbjct: 136 DYAEYTKWRGRAFFFGGMVGVFSTQSLLLAVGVGRPSAAPIAAALQWVIRDGLGRAGRML 195

Query: 140 CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARA 199
            S +G   D+E K++R+ A  + ++   LE L+PL P LFL +A L N AKG + VAA +
Sbjct: 196 FSQVGTGFDAETKQYRLAAAFVLNVSCALEALTPLFPHLFLPLASLANMAKGASTVAAAS 255

Query: 200 TRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSV 259
           TR  IY SF +  NL D+ AK E +    +++G   G+ L+     S +  +I     S+
Sbjct: 256 TRGAIYRSFMRRENLGDITAKQETVGVAGDLLGTAVGVILSRASSHSHRLSVIAFIGASL 315

Query: 260 IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK-TGRISSPADLRYREDLLFPGRLIE-DA 317
            H+ S   E+++  + TLN QR  M++ ++++   R+    D+  RE L++   L    A
Sbjct: 316 GHLISAYYEVKSVELRTLNRQRAHMLIWNYLEDQDRVLPIRDVNRRERLVYRPWLDSLHA 375

Query: 318 GNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVA 376
            NV++G R             L  L+ +EK+++        +VL  +A+ +D L+ +  +
Sbjct: 376 PNVEIGARLSDAAPDADALKFLLRLYRDEKYMMNYAGGRVRIVLRNDATHQDQLKAYFQS 435

Query: 377 A-----YAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
                 Y  S  +    P+  +++D+Y   N +F   + + +A+GW+ D
Sbjct: 436 RAFWRLYMTSGER--FSPNRALIEDSYRATNQLFPSFVRDCRAEGWNVD 482


>gi|452824810|gb|EME31810.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
          Length = 636

 Score =  184 bits (468), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 110/347 (31%), Positives = 190/347 (54%), Gaps = 16/347 (4%)

Query: 39  YQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSA 98
           YQF  D     +  DD RP++ +++D     + P GYP SV + YL Y+ +RA+Q+  ++
Sbjct: 249 YQFSDDMIQEGRFKDD-RPIFEKLIDWLSRVYLPDGYPNSVTKDYLSYSIWRAIQNLAAS 307

Query: 99  ALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWR 155
            + V +T++LLF  GL  + TPA +  +SW+LKDG+ ++GK+   ++ G + D +PK WR
Sbjct: 308 VMGVFATEALLFGLGLGRKSTPATSAAISWVLKDGLGYIGKVFYGSIAGNQFDVDPKSWR 367

Query: 156 ILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLS 215
           + ADA+ D G  LE+++PL P  FL +A + N  KG+A +   ATR  IY S A   N  
Sbjct: 368 LAADAVEDAGGVLEIITPLFPGHFLLLASVANAMKGVAAMTGTATRHAIYKSLALHENQG 427

Query: 216 DLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVN 275
           D+  KGE+      +VGLG GI  +S +  +    +    L   +H+++  + +      
Sbjct: 428 DIATKGESQGVTCKMVGLGLGIVASSKMGQNYFALLSAYALGCFVHLFANWKSLSCVQFA 487

Query: 276 TLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP---GRLIEDAGNVKVGRDPHKVVKP 332
           TLN QR ++ + +F++TG + SP ++  RE ++ P   G L      + VG +  +  + 
Sbjct: 488 TLNRQRCSLALQEFLQTGNVPSPYEVSRREKVVVPPWKGYL----DRIVVGANLKECFQN 543

Query: 333 SKFHELKE---LFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVA 376
            +  EL+E   L+ + ++IL        +VL ++   +D  + ++ A
Sbjct: 544 GR--ELREAVALYRDCRYILTQRDNKILIVLHESCRPQDIFQAFMQA 588


>gi|449018781|dbj|BAM82183.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 631

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 94/225 (41%), Positives = 137/225 (60%), Gaps = 5/225 (2%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILK 129
           F P G+P SV   YLRY+ +R  Q+  SA  +VLST +LL A GL  TP  A   SW+LK
Sbjct: 242 FLPEGFPESVGPSYLRYSLWRGAQNVVSAMTAVLSTHALLAAVGLSSTPV-AAATSWVLK 300

Query: 130 DGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFA 189
           DG+  +GKL+ +  G   D++PKR+R+ +D LYD G  LE+L+PL PQ FL +A LGNF 
Sbjct: 301 DGIGQLGKLLTARRGRDFDADPKRYRLSSDLLYDAGLSLEILTPLFPQYFLALAALGNFT 360

Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
           K +A+    A R  + SSF   GNL D+ AK +A + + N+ G+G GI  A  +    + 
Sbjct: 361 KSVAITVGMACRNSVLSSFVLRGNLGDISAKNDAQNVVTNLTGMGLGIVTARAL--PPKP 418

Query: 250 KMIVGPLLSVIHVYSVI--EEMRAAPVNTLNPQRTAMIVADFVKT 292
            + +G   ++  +YSV+  + M+A  +N +N QR A+I   FV++
Sbjct: 419 SVRLGAFAALTALYSVLNYQSMKAVKLNVVNRQRGAIIADSFVRS 463


>gi|452823212|gb|EME30224.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
          Length = 849

 Score =  175 bits (443), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 105/315 (33%), Positives = 173/315 (54%), Gaps = 15/315 (4%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRP-TPAQATVVSWIL 128
           FFP G    V++ Y  +T +R  Q   S+ + V  TQ+LL A G++     +A   SW+L
Sbjct: 422 FFPEG----VSQDYYSFTMWRVFQRLVSSTVGVFGTQALLLALGIKAGRIGRAAATSWVL 477

Query: 129 KDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
           KDG+  +GK++  S +G   D++PKRWR  +  LY LG GLE+++ + P  FL  A L N
Sbjct: 478 KDGLGRLGKMVWASQMGRDFDADPKRWRFRSALLYALGNGLEIVTQIFPASFLLFATLAN 537

Query: 188 FAKGIAVVAARATRLPIYSSFA-KEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
             K I+++ A ATR  +Y +FA +  N++D+ AKGEA   + +++G+  GIQL+  + +S
Sbjct: 538 SMKQISMLTASATRNAMYKNFAGRSENIADITAKGEAQIVVADLLGMALGIQLSKLIGTS 597

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
               +     LSV+ +Y+  +E+RA    TLN +R++MIV  FV+ G +  P ++  RE+
Sbjct: 598 RPNVLTAYSFLSVLDIYAFYKELRAVQFRTLNYERSSMIVDYFVRRGYVPRPDEVSQREN 657

Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWT---DMVLEQN 363
           +    R    +    +      V  P + + L ++F  E+F++  G  W     +VL ++
Sbjct: 658 IFLGPRYDVRSMFASLS---DAVSCPQELNSLIKIFKGEQFMVTRG--WNGQYRIVLREH 712

Query: 364 ASGEDALRGWLVAAY 378
           A   D LR  L   +
Sbjct: 713 ARNGDILRALLTLGF 727


>gi|302771033|ref|XP_002968935.1| hypothetical protein SELMODRAFT_31929 [Selaginella moellendorffii]
 gi|300163440|gb|EFJ30051.1| hypothetical protein SELMODRAFT_31929 [Selaginella moellendorffii]
          Length = 389

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 117/387 (30%), Positives = 187/387 (48%), Gaps = 35/387 (9%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQ 120
            +D+      P GYP +V++ Y  YT +R  Q   S    VL+TQ+LL+A GL +     
Sbjct: 4   CLDALKTTMLPIGYPKTVSDDYTEYTLWRMGQIIASQISGVLTTQALLYAVGLGKGAIPT 63

Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
           A  ++W+LKDG+ ++ K++ S  G   D  PK WR+L+D + D   GLE+L+P  P LF+
Sbjct: 64  AAAINWVLKDGIGYLSKIVLSKYGRHFDVHPKGWRLLSDLIEDGACGLELLTPTFPHLFV 123

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
            +  +    K  A +   ATR   Y+ FA + N +++ AKGEA   +   +G+  GI ++
Sbjct: 124 PLTAVAGAGKSAAGLIQAATRSCFYAGFATQNNFAEVIAKGEAQGMVSKSLGIALGIGIS 183

Query: 241 STVCSSMQGKMIVGPLL-------SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
           S V SS       GPLL       + +HV   ++  +A  ++TLNP RT +I+A+++ T 
Sbjct: 184 SKVGSS-------GPLLVLSFGTVTAVHVLCNLKSYQAVQLHTLNPYRTGLIMAEYLTTQ 236

Query: 294 RISSPADLRYREDLLFPGRLIEDA------------------GNVKVGRD-PHKVVKPSK 334
           RI S  D+   E +                             N+K+G      V     
Sbjct: 237 RIPSVRDVNSVEPIFTVADFFSSKKAPLSISCESKCAAADIEKNLKLGVSFGEAVASRGA 296

Query: 335 FHELKELFAEEKFILCL-GSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTV 393
              L +L++ E+++L    S    +VL+++AS  D LR  + A+Y   +          +
Sbjct: 297 AQALLDLYSGEEYLLTYDDSHCFKVVLKKSASPHDMLRAVVQASYLRHIHNQQSLSKDEL 356

Query: 394 LQDAYDKMNDVFTPLLSELQAKGWHTD 420
           L+ +Y  M   F  L+  LQA GW T 
Sbjct: 357 LKLSYQSMQQSFNELIQGLQAAGWSTS 383


>gi|302816607|ref|XP_002989982.1| hypothetical protein SELMODRAFT_130761 [Selaginella moellendorffii]
 gi|300142293|gb|EFJ08995.1| hypothetical protein SELMODRAFT_130761 [Selaginella moellendorffii]
          Length = 440

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 118/400 (29%), Positives = 190/400 (47%), Gaps = 39/400 (9%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQ 120
            +D+      P GYP +V++ Y  YT +R  Q   S    VL+TQ+LL+A GL +     
Sbjct: 45  CLDALKTTMLPIGYPKTVSDDYTEYTLWRMGQIIASQISGVLTTQALLYAVGLGKGAIPT 104

Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
           A  ++W+LKDG+ ++ K++ S  G   D  PK WR+L+D + D   GLE+L+P  P LF+
Sbjct: 105 AAAINWVLKDGIGYLSKIVLSKYGRHFDVHPKGWRLLSDLIEDGACGLELLTPTFPHLFV 164

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
            +  +    K  A +   ATR   Y+ FA + N +++ AKGEA   +   +G+  GI ++
Sbjct: 165 PLTAVAGAGKSAAGLIQAATRSCFYAGFATQNNFAEVIAKGEAQGMVSKSLGIALGIGIS 224

Query: 241 STVCSSMQGKMIVGPLL-------SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
           S V SS       GPLL       + +HV   ++  +A  ++TLNP RT +I+A+++ T 
Sbjct: 225 SKVGSS-------GPLLVLSFGTVTAVHVLCNLKSYQAVQLHTLNPYRTGLIMAEYLTTQ 277

Query: 294 RISSPADLRYREDLLFPGRLIEDA----------------------GNVKVGRD-PHKVV 330
           RI S  D+   E +                                 N+K+G      V 
Sbjct: 278 RIPSVRDVNSVEPIFTVADFFSSKKAPVSTHLSISCESKCAAADIEKNLKLGVSFGEAVA 337

Query: 331 KPSKFHELKELFAEEKFILCL-GSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDP 389
                  L +L++ E+++L    S    +VL+++AS  D LR  + A+Y   +       
Sbjct: 338 SRGAAQALLDLYSGEEYLLTYDDSHCFKVVLKKSASPHDMLRAVVQASYLRHIHNQQSLS 397

Query: 390 SLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGTR 429
              +L+ +Y  M   F  L+  LQA GW T   +   G+ 
Sbjct: 398 KDELLKLSYQSMQQSFNELIQGLQAAGWSTSGLVARPGSN 437


>gi|449018940|dbj|BAM82342.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 738

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 111/363 (30%), Positives = 190/363 (52%), Gaps = 35/363 (9%)

Query: 76  PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRP--TPAQATVVSWILKDGMQ 133
           P  V   Y  + ++R  Q   S+ + V  TQ+LL A G++     +QA  +SW+LKDG+ 
Sbjct: 380 PEDVTADYYEFAKWRVFQRLMSSTVGVFGTQALLLALGIKSGNKISQAATISWVLKDGLS 439

Query: 134 HVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
            VGK++  S LG  MD++PKR+R  +  LY LG GLE+ + + PQ FL +A L N AK  
Sbjct: 440 RVGKMLWASQLGRDMDADPKRFRFGSALLYSLGNGLEIATRIVPQSFLLVATLANTAKLC 499

Query: 193 AVVAARATRLPIYSSFA-KEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
           +++ A ATR  +Y SFA +  N++D+ AKGEA  T+ ++ G+  GIQL+  + +S    +
Sbjct: 500 SMLTASATRNAMYRSFADRNENIADITAKGEAQITIADLGGMALGIQLSKIIGTSRPKVV 559

Query: 252 IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPG 311
               +LS I +Y++ +E+R     TLN QR+++++  ++KTG + +P  +   E ++ P 
Sbjct: 560 ATYLILSTIDLYAIWKELRVIEFRTLNLQRSSLVIEHYLKTGCVPTPKQVSAAERVVLPE 619

Query: 312 RLIEDA--GNVKVGRDPHKVVKPSKFHELKELFAEEKFIL----------CLGSRWTDMV 359
            +   A     K+ RD  +        ++ + +   +F++          CL S    +V
Sbjct: 620 PIPRHAFVAFSKLVRDEQEA------EQMIKRYGSARFLMRYGEVHWPWTCLHSSRGGIV 673

Query: 360 LEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYD---KMNDVFTPLLSELQAKG 416
           L ++A  +D  R  L           ++   L ++Q+  +   + + V  PLL   +  G
Sbjct: 674 LHRDADQQDIFRAILC----------WNLRRLGIVQEEDEVLARADVVMEPLLKAAREAG 723

Query: 417 WHT 419
           W T
Sbjct: 724 WRT 726


>gi|8978258|dbj|BAA98149.1| unnamed protein product [Arabidopsis thaliana]
          Length = 407

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 113/359 (31%), Positives = 190/359 (52%), Gaps = 16/359 (4%)

Query: 63  VDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
           V SFL  +  P G+P SVNE Y+ Y  +RAL+HF   A+ V +TQ+LL + G     + +
Sbjct: 14  VGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSSAS 73

Query: 122 TV--VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
               ++WILKDG   VGK++ +  G + D + K+ R   D L +LG G+E+ +   P LF
Sbjct: 74  AAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 133

Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
           L +A   N  K +A V + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  I L
Sbjct: 134 LPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSI-L 192

Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
            S    S+   +    LLS  ++ S  +E+R+  ++TLN  R  + V  F+KTGR+ S  
Sbjct: 193 ISKRNPSL---VTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQ 249

Query: 300 DLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM 358
           +   +E +  FP   ++D   +   R       PS +  +K  F +E++++        +
Sbjct: 250 EGNIQEKIFTFP--WVDDRPVMLGARFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKV 307

Query: 359 --VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAK 415
             +L+  A+ +D L+    AA+ A +   F + S      + ++++  F P   EL+++
Sbjct: 308 YALLKHQANSDDILK----AAFHAHVLLHFMNQSKDGNPRSVEQLDPAFAPTEYELESR 362


>gi|18423070|ref|NP_568713.1| uncharacterized protein [Arabidopsis thaliana]
 gi|16604643|gb|AAL24114.1| unknown protein [Arabidopsis thaliana]
 gi|23296703|gb|AAN13151.1| unknown protein [Arabidopsis thaliana]
 gi|332008476|gb|AED95859.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 497

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 113/359 (31%), Positives = 190/359 (52%), Gaps = 16/359 (4%)

Query: 63  VDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
           V SFL  +  P G+P SVNE Y+ Y  +RAL+HF   A+ V +TQ+LL + G     + +
Sbjct: 104 VGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSSAS 163

Query: 122 TV--VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
               ++WILKDG   VGK++ +  G + D + K+ R   D L +LG G+E+ +   P LF
Sbjct: 164 AAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 223

Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
           L +A   N  K +A V + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  I L
Sbjct: 224 LPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSI-L 282

Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
            S    S+   +    LLS  ++ S  +E+R+  ++TLN  R  + V  F+KTGR+ S  
Sbjct: 283 ISKRNPSL---VTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQ 339

Query: 300 DLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM 358
           +   +E +  FP   ++D   +   R       PS +  +K  F +E++++        +
Sbjct: 340 EGNIQEKIFTFP--WVDDRPVMLGARFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKV 397

Query: 359 --VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAK 415
             +L+  A+ +D L+    AA+ A +   F + S      + ++++  F P   EL+++
Sbjct: 398 YALLKHQANSDDILK----AAFHAHVLLHFMNQSKDGNPRSVEQLDPAFAPTEYELESR 452


>gi|297795749|ref|XP_002865759.1| EMB1879 [Arabidopsis lyrata subsp. lyrata]
 gi|297311594|gb|EFH42018.1| EMB1879 [Arabidopsis lyrata subsp. lyrata]
          Length = 499

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 113/359 (31%), Positives = 189/359 (52%), Gaps = 16/359 (4%)

Query: 63  VDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
           V SFL  +  P G+P SVNE Y+ Y  +RAL+HF   A+ V +TQ+LL + G     + +
Sbjct: 107 VGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSSAS 166

Query: 122 TV--VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
               ++WILKDG   VGK++ +  G + D + K+ R   D L +LG G+E+ +   P LF
Sbjct: 167 AAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 226

Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
           L +A   N  K +A V + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  I L
Sbjct: 227 LPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSI-L 285

Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
            S    S+   +    LLS  ++ S  +E+R+  ++TLN  R  + V  F+KTGR+ S  
Sbjct: 286 ISKRNPSL---VTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQ 342

Query: 300 DLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM 358
           +   +E +  FP   ++D   +   R       PS +  +K  F +E++++        +
Sbjct: 343 EGNIQEKIFTFP--WVDDRPVMLGARFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKV 400

Query: 359 --VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAK 415
             +L+  A+ +D L+    AA+ A +   F + S        ++++  F P   EL+++
Sbjct: 401 YALLKHQANSDDILK----AAFHAHVLLHFMNQSKDGTPRLVEQLDPAFAPTEYELESR 455


>gi|302764074|ref|XP_002965458.1| hypothetical protein SELMODRAFT_230646 [Selaginella moellendorffii]
 gi|300166272|gb|EFJ32878.1| hypothetical protein SELMODRAFT_230646 [Selaginella moellendorffii]
          Length = 430

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 108/359 (30%), Positives = 181/359 (50%), Gaps = 22/359 (6%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRP-TPAQ 120
           ++D   +   P G+P SV   Y  Y Q+R LQ+F   A+ V +T+SLL A G+R    + 
Sbjct: 63  IMDFVRSHVVPEGFPDSVLPCYTPYMQWRCLQYFFGGAMGVFTTRSLLHALGVRNGASSS 122

Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
           A  V+W++KDG   +GK+I +  G + D   K+ R   D L  L   LE+ +   PQ FL
Sbjct: 123 AIAVNWVVKDGAGRIGKMIFARHGKKFDCHLKQIRFAGDVLMQLAAALELATSATPQFFL 182

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
            +A + N  K +A VA+ +TR PIY +FA+  N+ D+ AKGE I  + +++G G GI ++
Sbjct: 183 PLACIANIGKNVAAVASTSTRAPIYKAFARRENIGDITAKGECIGNIADLLGTGMGILMS 242

Query: 241 STVCSSMQGKMIVGPL--LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
                      +  P   L+  +VYS   E++A  + TLN  R  + V  F++TG++   
Sbjct: 243 KNF-------PVFAPFSALAFGYVYSSFREVKAIQLPTLNRHRFGIAVDTFLETGKVPGL 295

Query: 299 ADLRYREDLLF-PGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSR-- 354
            +   RE ++  P   +E    +++G R       P  +  +  LF  E +++    +  
Sbjct: 296 VEANERERIIVGPQWSLE---KLELGARVSDAFSAPQDYQYIDSLFKGENYLVSYNPKRQ 352

Query: 355 WTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQ 413
            T +VL++ A+G+D LR    AA+   +  +  D   +  + AY +     +PL    +
Sbjct: 353 RTYVVLKERANGDDVLR----AAFHGQLFVNIFDKCSSA-EGAYQESCKYISPLYEHFK 406


>gi|219129733|ref|XP_002185036.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403531|gb|EEC43483.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 627

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 118/416 (28%), Positives = 199/416 (47%), Gaps = 65/416 (15%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ----ATVVS 125
           F PSG    V   Y R+ ++R LQ F +A L+V  TQSLL A  ++ + +Q    +  ++
Sbjct: 195 FLPSG----VTNDYYRFVRWRILQRFVNANLNVFGTQSLLLALRIKSSASQLGALSAALN 250

Query: 126 WILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
           W+LKD +  + +++  S +G R DS+ KRWR  +  ++  G GLE+++ + P LFL  A 
Sbjct: 251 WVLKDALGKIVRMLWASRMGRRFDSDAKRWRFRSSFVFAAGNGLEIITYVFPSLFLLWAT 310

Query: 185 LGNFAKGIAVVAARATRLPIYSSF--AKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
           L N  K I+++ + +TR  IY+SF      N+ D+ AKGEA   + +++G+ +G+ L+ T
Sbjct: 311 LANCCKQISMLTSSSTRTSIYNSFRDGSRENIGDITAKGEAQIAIVDLLGIASGVTLSRT 370

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR-------- 294
           V +S++  + V   L  I +  V  ++RA     +N +R   +VADF +  +        
Sbjct: 371 VGTSIRAVLAVYVTLQAIEIVCVYHQLRAVTYRVMNFERMISVVADFCQARQGPKDGLEG 430

Query: 295 ------------ISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELF 342
                       I +P  L  +E +  P + +     +  G      + P +   L E+F
Sbjct: 431 LAASCTTPTPAGIPTPQTLASQERIFLPPKHLTRRA-IAFGSIGRARLSPDELGTLLEIF 489

Query: 343 AEEKFILCLG---------------------SRWTD---MVLEQNASGEDALRGWL-VAA 377
             E+FIL +G                     SR  +   +VL + A+  D ++  L +  
Sbjct: 490 KRERFILVVGKNVKHPRPFMAKTAKQNEDPVSRIQENCHIVLHEAATNMDIVKSTLALTL 549

Query: 378 YAASMAKSFHDPSLTVLQDAYDKM-------NDVFTPLLSELQAKGWHTD-RFLDG 425
               +A S  DPS     + +D M       ND+F  LL E+  +GW +  RF+ G
Sbjct: 550 LRRKLALSKFDPSQVRSSNCFDIMKVTQEETNDLFPLLLREMNTQGWESPARFMFG 605


>gi|226492898|ref|NP_001145431.1| uncharacterized protein LOC100278802 [Zea mays]
 gi|195656119|gb|ACG47527.1| hypothetical protein [Zea mays]
 gi|224035549|gb|ACN36850.1| unknown [Zea mays]
 gi|413945393|gb|AFW78042.1| hypothetical protein ZEAMMB73_132907 [Zea mays]
          Length = 511

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 114/363 (31%), Positives = 186/363 (51%), Gaps = 27/363 (7%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
           SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+ V +T++LL + G+  + A   A
Sbjct: 119 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVSQSRAASGA 178

Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
             ++WILKDG   VGK++ +  G + D + K+ R   D L +LG G+E+ +   PQLFL 
Sbjct: 179 VAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQLFLP 238

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
           MA + N  K +A V + +TR PIY ++AK  N+ D+ AKGE++  + +++G G  I ++ 
Sbjct: 239 MACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSILISK 298

Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
           +  S +         LS  ++ S   E+R+  +NTLN  R  + V  F++TG + S  D 
Sbjct: 299 SNPSLVTSFA----FLSCGYLLSSYHEVRSVVLNTLNRARFTVAVDSFIRTGYVPSLKDG 354

Query: 302 RYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM-- 358
             +E +  P    E    V +G R      +P+ F  +K LF +E++I+        +  
Sbjct: 355 NSQETIFNPPWRHEP---VAIGSRFGDAFQEPASFLAIKPLFEDERYIVTYNPTKDKVYA 411

Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQD----------AYDKMNDVFTPL 408
           +L+  A  +D L+    AA+ A +   F + S   L             YD +N  F P 
Sbjct: 412 LLKDQAKPDDILK----AAFHAHVLLHFINASHANLNARKRMNSNRSYQYDPLNMDFVPH 467

Query: 409 LSE 411
           + E
Sbjct: 468 IEE 470


>gi|242055389|ref|XP_002456840.1| hypothetical protein SORBIDRAFT_03g043876 [Sorghum bicolor]
 gi|241928815|gb|EES01960.1| hypothetical protein SORBIDRAFT_03g043876 [Sorghum bicolor]
          Length = 569

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 101/333 (30%), Positives = 173/333 (51%), Gaps = 20/333 (6%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
           + R  D  +    P GYP SV+  Y+ Y+ +R +Q   S    VLSTQ+LL+A GL R  
Sbjct: 161 WRRCADVAVQLLLPDGYPHSVSSDYMNYSLWRGVQGVASQISGVLSTQALLYAVGLGRGA 220

Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
              A  V+W+LKDG+ ++ K++ S  G   D  PK WR+ AD L ++  GLE+L+P+ P 
Sbjct: 221 IPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPH 280

Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
           LF+ +      ++  A +   ATR   Y+ FA + N +++ AKGEA   +   VG+G GI
Sbjct: 281 LFVPIGAAAGASRSAASLIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGIGLGI 340

Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
            LA+ + SS+   +I    ++ +H+Y  ++  ++  + TLNP R +++ ++++ +G++ S
Sbjct: 341 ALANQIGSSVPLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPS 400

Query: 298 PADLRYREDLLFPGRLIEDAGNVKV---------GRDPHKVVKPSKFHELKE-------- 340
             ++   E L F   L       K+          R   ++   SK  E+ E        
Sbjct: 401 VKEVNDEEPLFFNLSLGASCQQSKILSAEAKDAADRICRRLQMGSKLSEIIESKEDACAL 460

Query: 341 --LFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
             L+  E+++L        +VL++ +S ED L+
Sbjct: 461 FDLYKNEQYLLMDYKGKFCIVLKEGSSPEDMLK 493


>gi|357167308|ref|XP_003581100.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
           distachyon]
          Length = 604

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 102/333 (30%), Positives = 175/333 (52%), Gaps = 20/333 (6%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
           + R  D  L    P GYP SV+  YL Y+ +R LQ   S    VLSTQ+LL+A GL +  
Sbjct: 195 WRRCTDVALQLLLPDGYPNSVSSDYLHYSLWRGLQGVASQISGVLSTQALLYAVGLGKGV 254

Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
              A  ++W+LKDG+ ++ K++ S  G   D  PKRWR+ AD L +   GLE+L+P+ P 
Sbjct: 255 IPTAAAINWVLKDGLGYLSKILLSKFGRHFDVNPKRWRLFADLLENTAYGLEILTPVFPH 314

Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
           LF+ +      A+  A +   ATR   Y+ FA + N +++ AKGEA   +   +G+  GI
Sbjct: 315 LFVPIGAAAGAARSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFLGIMLGI 374

Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
            LA+ + SS+   +I    ++V+H+Y  ++  ++  + TLNP R +++ ++++ +G++SS
Sbjct: 375 ALANQIGSSVPLALISFAGVTVVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVSS 434

Query: 298 PADLRYREDL---------LFPGRLIEDAGNVKVGRDPHKVVKPSKFHE----------L 338
             ++   E L         L   +++         R   ++   SK  E          L
Sbjct: 435 VKEVNDEEPLFLNSSMGAKLKQNKILSAEAKDAADRICRRLQLGSKLSEIIDNEEDACAL 494

Query: 339 KELFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
            +L+ +E+++L        +VL++ +S ED L+
Sbjct: 495 FDLYKDEQYLLLEYKGKFCVVLKEGSSPEDMLK 527


>gi|226491780|ref|NP_001140263.1| uncharacterized protein LOC100272306 [Zea mays]
 gi|194698746|gb|ACF83457.1| unknown [Zea mays]
          Length = 451

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 98/332 (29%), Positives = 173/332 (52%), Gaps = 19/332 (5%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
           + R  +  ++   P GYP SV+  YL Y+ +R +Q   S    VLSTQ+LL+A GL +  
Sbjct: 44  WRRYAEVAVHLLLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGLGKGA 103

Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
              A  V+W+LKDG+ ++ K++ S  G   D  PK WR+ AD L ++  GLE+L+P+ P 
Sbjct: 104 IPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPH 163

Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
           LF+ +       +  A +   ATR   Y+ FA + N +++ AKGEA   +   VG+G GI
Sbjct: 164 LFVPIGAAAGAGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGIGLGI 223

Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
            LA+ + SS+   +I    ++ +H+Y  ++  ++  + TLNP R +++ ++++ +G++ S
Sbjct: 224 ALANHIGSSVPLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPS 283

Query: 298 PADLRYREDLLF--------PGRLIEDAGNVKVGRDPHKVVKPSKFHELKE--------- 340
             ++   E L F          +++         R   ++   SK  E+ E         
Sbjct: 284 VKEVNDEEPLFFNLSLGASRQSKILSAEAKDAADRICRRLQMGSKLSEIIESKEDACALF 343

Query: 341 -LFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
            L+  E+++L        +VL++ +S ED L+
Sbjct: 344 DLYKNEQYLLMDYKGKFCIVLKEGSSPEDMLK 375


>gi|242090561|ref|XP_002441113.1| hypothetical protein SORBIDRAFT_09g020620 [Sorghum bicolor]
 gi|241946398|gb|EES19543.1| hypothetical protein SORBIDRAFT_09g020620 [Sorghum bicolor]
          Length = 517

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 110/344 (31%), Positives = 184/344 (53%), Gaps = 18/344 (5%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
           SF+  +  P G+P SV   Y+ Y  +RA++HF   A+ V +T++LL + G+  + A   A
Sbjct: 125 SFIRSYVVPEGFPESVTPSYVPYMTWRAMKHFFGGAMGVFTTRTLLNSVGVSQSRAASGA 184

Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
             ++WILKDG   VGK++ +  G + D + K+ R   D L +LG G+E+ +   PQLFL 
Sbjct: 185 VAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQLFLP 244

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
           MA + N  K +A V + +TR PIY ++AK  N+ D+ AKGE++  + +++G G  I ++ 
Sbjct: 245 MACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSILISK 304

Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
           +  S +    +    LS  ++ S   E+R+  +NTLN  R  + V  F++TG + S  D 
Sbjct: 305 SNPSLVTSFAV----LSCGYLLSSYYEVRSVVLNTLNRARFTVAVDSFIRTGYVPSLKDG 360

Query: 302 RYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM-- 358
             +E +  P    E    V +G R      +P+ F  +K LF +E++I+        +  
Sbjct: 361 NSQETIFNPPWRHEP---VAIGSRFGEAFQEPASFIAIKPLFEDERYIVTYNPTKDKVYA 417

Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMN 402
           +L+  A  +D L+    AA+ A +   F + S   L +A  +MN
Sbjct: 418 LLKDQAKPDDILK----AAFHAHVLLHFINASHANL-NARKRMN 456


>gi|414879009|tpg|DAA56140.1| TPA: hypothetical protein ZEAMMB73_060175 [Zea mays]
          Length = 593

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 98/332 (29%), Positives = 173/332 (52%), Gaps = 19/332 (5%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
           + R  +  ++   P GYP SV+  YL Y+ +R +Q   S    VLSTQ+LL+A GL +  
Sbjct: 186 WRRYAEVAVHLLLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGLGKGA 245

Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
              A  V+W+LKDG+ ++ K++ S  G   D  PK WR+ AD L ++  GLE+L+P+ P 
Sbjct: 246 IPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPH 305

Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
           LF+ +       +  A +   ATR   Y+ FA + N +++ AKGEA   +   VG+G GI
Sbjct: 306 LFVPIGAAAGAGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGIGLGI 365

Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
            LA+ + SS+   +I    ++ +H+Y  ++  ++  + TLNP R +++ ++++ +G++ S
Sbjct: 366 ALANHIGSSVPLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPS 425

Query: 298 PADLRYREDLLF--------PGRLIEDAGNVKVGRDPHKVVKPSKFHELKE--------- 340
             ++   E L F          +++         R   ++   SK  E+ E         
Sbjct: 426 VKEVNDEEPLFFNLSLGASRQSKILSAEAKDAADRICRRLQMGSKLSEIIESKEDACALF 485

Query: 341 -LFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
            L+  E+++L        +VL++ +S ED L+
Sbjct: 486 DLYKNEQYLLMDYKGKFCIVLKEGSSPEDMLK 517


>gi|357474239|ref|XP_003607404.1| hypothetical protein MTR_4g077650 [Medicago truncatula]
 gi|355508459|gb|AES89601.1| hypothetical protein MTR_4g077650 [Medicago truncatula]
          Length = 490

 Score =  157 bits (398), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 103/332 (31%), Positives = 174/332 (52%), Gaps = 15/332 (4%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR----PTPA 119
           SF+  +  P G+P +V   Y+ Y  +RAL+HF   A+ V +TQ+LL + G+        A
Sbjct: 99  SFITSYVVPEGFPHTVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLSSVGVSRNRATPAA 158

Query: 120 QATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
            A  ++WILKDG   VGK++ +  G + D + K+ R   D L +LG G+E+ +   P LF
Sbjct: 159 AAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFTGDLLMELGAGVELATAAMPHLF 218

Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
           L +A   N  K +  V + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  I +
Sbjct: 219 LPLACAANVLKNVGAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGFSIMI 278

Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
           A     S    +    LLS  ++ S  +E+++  ++TLN  R ++ V  F+KTG++ +  
Sbjct: 279 A----KSNPSLITTFSLLSCGYILSSYKEVKSVVLHTLNSARFSVAVDSFLKTGQVPTLR 334

Query: 300 DLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM 358
           +    ED+  FP +   D   V   R       PS +  ++ LF +E++I+      + +
Sbjct: 335 EGNLNEDIFSFPWK---DRPVVLGSRIKDAFQSPSAYVAIEPLFDKERYIVTYNPSKSKV 391

Query: 359 --VLEQNASGEDALRGWLVAAYAASMAKSFHD 388
             VL+  A  +D L+    A   ++  KS ++
Sbjct: 392 HAVLKDQAKSDDILKAAFHAHVLSNFIKSLNE 423


>gi|326490149|dbj|BAJ94148.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 519

 Score =  157 bits (398), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 110/344 (31%), Positives = 183/344 (53%), Gaps = 18/344 (5%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
           SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+SV +T++LL + G+  + A   A
Sbjct: 123 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQSRATSGA 182

Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
             ++WIL+DG   VGK++ +  G + D + K+ R   D + +LG G+E+ +   PQLFL 
Sbjct: 183 VAINWILRDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLMMELGAGIELATAAFPQLFLP 242

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
           +A + N  K +A V + +TR PIY ++AK  N+ D+ AKGE++  + +++G G  I L S
Sbjct: 243 LACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSI-LMS 301

Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
               S+     V   LS  ++ S  +E+R+  +NTLN  R  + V  F+KTG + S  + 
Sbjct: 302 KRNPSLVASFAV---LSCGYLLSSYQEVRSVVLNTLNTARFTVAVDSFIKTGHVPSLKEG 358

Query: 302 RYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM-- 358
              E +  P    +    V +G R      +P+ F   + LF +E++I+        +  
Sbjct: 359 NLEETIFNPPWRHQP---VAIGSRFGEAFQEPASFVATRPLFEDERYIVTYNPAKDKVYA 415

Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMN 402
           +L+  A  +D L+    AA+ A +   F + S   L+ A  +MN
Sbjct: 416 LLKDQAKQDDILK----AAFHAHVLLHFINASHANLK-ARKRMN 454


>gi|218196816|gb|EEC79243.1| hypothetical protein OsI_19997 [Oryza sativa Indica Group]
          Length = 401

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 108/346 (31%), Positives = 182/346 (52%), Gaps = 20/346 (5%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
           SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+ V +T++LL + G+  + A   A
Sbjct: 8   SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVAQSRATSGA 67

Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
             ++WILKDG   VGK++ +  G + D + K+ R   D L +LG G+E+ +   PQLFL 
Sbjct: 68  VAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQLFLP 127

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
           MA + N  K +A V + +TR PIY ++AK  N+ D+ AKGE++  + +++G G  I    
Sbjct: 128 MACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGLSI---- 183

Query: 242 TVCSSMQGKMIVG-PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
            + S     ++     LS  ++ S   E+R+  +NTLN  R  + V  F+K+G + S  +
Sbjct: 184 -LISKRNPSLVTSFAFLSCGYLLSSYHEVRSVVLNTLNTARFTVAVDSFIKSGHVPSLKE 242

Query: 301 LRYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM- 358
              +E +  P    +    V +G R      +P+ F  ++ LF +E++I+        + 
Sbjct: 243 GNSQETIFNPPWRHQP---VAIGSRFGEAFQEPASFVAIRPLFEDERYIVTYNPTKDKVY 299

Query: 359 -VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMND 403
            +L+  A  +D L+    AA+ A +   F + S   L +A  +MN 
Sbjct: 300 ALLKDQAKPDDILK----AAFHAHVLLHFINASHANL-NARKRMNS 340


>gi|356522111|ref|XP_003529693.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 488

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 116/387 (29%), Positives = 196/387 (50%), Gaps = 26/387 (6%)

Query: 41  FQPDGHLSVKVVDDS-RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAA 99
           F+ +  L V +  +  R   H V+    +   P G+P SV   Y+ Y  +RAL+HF   A
Sbjct: 72  FKKNSFLPVSLNSNHPRDPLHEVLSFVTSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGA 131

Query: 100 LSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRI 156
           + V +TQ+LL + G+   R  P  A  ++WILKDG   VGK++ +  G + D + K+ R 
Sbjct: 132 MGVFTTQTLLSSVGVCRNRAAPG-AVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRF 190

Query: 157 LADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSD 216
             D L +LG G+E+ +   P LFL +A   N  K +A V + +TR PIY +FAK  N+ D
Sbjct: 191 TGDLLMELGAGVELATAAVPHLFLPLACAANVLKNVAAVTSTSTRTPIYKAFAKGENIGD 250

Query: 217 LFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNT 276
           + AKGE +  + +++G G  I +A    S     +    LLS  ++ S   E+++  ++T
Sbjct: 251 VTAKGECVGNIADLLGTGLSILIAKRNPS----LVTTFSLLSCGYILSSYREVKSVVLHT 306

Query: 277 LNPQRTAMIVADFVKTGRISSPADLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKF 335
           LN  R ++ V  F+ TG++ +  +    E++  FP +   D   V   R       PS +
Sbjct: 307 LNCGRFSVAVEHFLMTGQVPTLQEGNMNENIFSFPWK---DRPVVLGSRIKEAFQDPSAY 363

Query: 336 HELKELFAEEKFILCLGSRWTDM--VLEQNASGEDALRGWLVAAYAA-----SMAKSFHD 388
             ++ LF  E++I+        +  VL+  A  +D L+    AA+ A     S+ KS ++
Sbjct: 364 VAIEPLFDRERYIVTYNPSKHKVYAVLKDQAKSDDILK----AAFHAHVLFFSLMKSLNE 419

Query: 389 PSLTVLQDAYDKMNDVFTPLLSELQAK 415
              + L+   D  N   T  +++++A+
Sbjct: 420 NKASSLKQREDLSN--MTHTVADIEAR 444


>gi|297604521|ref|NP_001055571.2| Os05g0419200 [Oryza sativa Japonica Group]
 gi|53982666|gb|AAV25645.1| unknown protein [Oryza sativa Japonica Group]
 gi|222631625|gb|EEE63757.1| hypothetical protein OsJ_18576 [Oryza sativa Japonica Group]
 gi|255676375|dbj|BAF17485.2| Os05g0419200 [Oryza sativa Japonica Group]
          Length = 415

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 108/345 (31%), Positives = 182/345 (52%), Gaps = 18/345 (5%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
           SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+ V +T++LL + G+  + A   A
Sbjct: 22  SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVAQSRATSGA 81

Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
             ++WILKDG   VGK++ +  G + D + K+ R   D L +LG G+E+ +   PQLFL 
Sbjct: 82  VAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQLFLP 141

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
           MA + N  K +A V + +TR PIY ++AK  N+ D+ AKGE++  + +++G G  I ++ 
Sbjct: 142 MACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGLSILISK 201

Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
              S +         LS  ++ S   E+R+  +NTLN  R  + V  F+K+G + S  + 
Sbjct: 202 RNPSLVTSFA----FLSCGYLLSSYHEVRSVVLNTLNTARFTVAVDSFIKSGHVPSLKEG 257

Query: 302 RYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM-- 358
             +E +  P    +    V +G R      +P+ F  ++ LF +E++I+        +  
Sbjct: 258 NSQETIFNPPWRHQP---VAIGSRFGEAFQEPASFVAIRPLFEDERYIVTYNPTKDKVYA 314

Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMND 403
           +L+  A  +D L+    AA+ A +   F + S   L +A  +MN 
Sbjct: 315 LLKDQAKPDDILK----AAFHAHVLLHFINASHANL-NARKRMNS 354


>gi|356563749|ref|XP_003550122.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 497

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 110/358 (30%), Positives = 184/358 (51%), Gaps = 23/358 (6%)

Query: 60  HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RP 116
           H V+    +   P G+P SV   Y+ Y  +RAL+HF   A+ V +TQ+LL + G+   R 
Sbjct: 100 HEVLAFVTSYVVPEGFPDSVIPSYVPYMTWRALKHFFGGAMGVFTTQTLLSSVGVSRNRA 159

Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
            P  A  ++WILKDG   VGK++ +  G + D + K+ R   D L +LG G+E+ +   P
Sbjct: 160 APG-AVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 218

Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
            LFL +A   N  K +A V + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  
Sbjct: 219 HLFLPLACAANVLKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 278

Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
           I +A    S     +    LLS  ++ S   E+++  ++TLN  R ++ V  F++TG++ 
Sbjct: 279 ILIAKRNPS----LVTTFSLLSCGYILSSYREVKSVVLHTLNCGRFSVAVESFLRTGQVP 334

Query: 297 SPADLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRW 355
           +  +    E++  FP +   D   V   R       PS +  ++ LF  E++I+      
Sbjct: 335 TLQEGNMNENIFSFPWK---DRPVVLGSRIKEAFQDPSAYFAIEPLFDRERYIVTYNPSK 391

Query: 356 TDM--VLEQNASGEDALRGWLVAAYAA-----SMAKSFHDPSLTVLQDAYDKMNDVFT 406
             +  VL+  A  +D L+    AA+ A     ++ KS+++ + + L+   D  N   T
Sbjct: 392 HKVYAVLKDQAKSDDILK----AAFHAHVLLFNLIKSWNESNASSLKQREDLSNMTHT 445


>gi|449431896|ref|XP_004133736.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
 gi|449478104|ref|XP_004155224.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 507

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 114/363 (31%), Positives = 189/363 (52%), Gaps = 26/363 (7%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQ 120
           SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+ V +TQ+LL + G+   + TP  
Sbjct: 118 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVARNKATPG- 176

Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
           A  ++WILKDG   VGK++ +  G + D + K+ R   D L +LG G+E+ +   P LFL
Sbjct: 177 AIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAAPHLFL 236

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
            +A   N AK ++ V + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  I + 
Sbjct: 237 PLACAANVAKNVSAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI- 295

Query: 241 STVCSSMQGKMIVGP--LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
                S +   +V    LLS  +++S  +E+R+  ++TLN  R  + V  F+KTGR+ S 
Sbjct: 296 -----SKRNPSLVTTFGLLSCGYIFSSYQEVRSVVLHTLNRARFNVAVETFLKTGRVPSL 350

Query: 299 ADLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTD 357
                 E +L FP   +++   V   R          F  ++ LF  EK+++        
Sbjct: 351 QKGNMNERILSFP--WLKENPIVLGPRFKDAFQDAGSFLAIEPLFDREKYLVTYNQTKGK 408

Query: 358 M--VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFT---PLLSEL 412
           +  +L+  A  +D ++    AA+ A +   F   S +  Q++  K  D F+   P  + L
Sbjct: 409 VYALLKDQAKSDDIIK----AAFHAHVLLHFIRSS-SGAQNSNQKQVDAFSSSVPTTTNL 463

Query: 413 QAK 415
           +A+
Sbjct: 464 EAE 466


>gi|168019221|ref|XP_001762143.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686547|gb|EDQ72935.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 390

 Score =  154 bits (389), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 108/371 (29%), Positives = 191/371 (51%), Gaps = 32/371 (8%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR--PTPAQA 121
           SF+  +  P  YP SV   Y  Y Q+RA+Q+    A+SV +T+SL+ A G+    + + A
Sbjct: 18  SFVKSYIVPENYPDSVAPSYTPYMQWRAVQYLFGGAMSVFTTRSLMHALGVSRGASASSA 77

Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
             V+W++KDG   +GK+I +  G + D + K+ R     L  LG G+E+ +   P LFL 
Sbjct: 78  VAVNWVIKDGAGRIGKMIFARHGKKFDVDLKQLRFKGAMLMQLGAGVELATMAVPHLFLP 137

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI---- 237
           +A   N AK +A V + +TR PIY +FA+  N+ D+ AKGE IS + +++G G GI    
Sbjct: 138 LACAANVAKNVAAVTSSSTRAPIYKAFARRENIGDVTAKGECISNIADLMGTGLGIFISK 197

Query: 238 ---QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
               L +T C           +LS  ++ S   E+++  + TLN  R  ++V  F++TG+
Sbjct: 198 KSPSLVATFC-----------VLSCGYMISSFNEIKSVRLATLNRARFGVVVQSFLETGK 246

Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILC--L 351
           + S  D   RE ++    L      +++G R       P +F   +E F +EK+++   L
Sbjct: 247 VPSVEDANRRESVI---TLPWQEKPLELGSRVAQAYSNPKEFLRAREQFQKEKYLVTYSL 303

Query: 352 GSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLT-----VLQDAYDKMNDVFT 406
             +   +V++++AS +D +R    A     + +      +T     +++++    +D++ 
Sbjct: 304 KKKRAYVVIKESASSDDVVRATFQAHVMLHILRGLRSTLITSNVGAIVEESTKLTHDLYD 363

Query: 407 PLLSELQAKGW 417
               E +++GW
Sbjct: 364 SFKEEAKSQGW 374


>gi|222619643|gb|EEE55775.1| hypothetical protein OsJ_04341 [Oryza sativa Japonica Group]
          Length = 508

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 95/292 (32%), Positives = 162/292 (55%), Gaps = 13/292 (4%)

Query: 63  VDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTP 118
           V +F+  +  P G+P SV   Y+ Y  +RAL+HF   A+ V +T++LL + G+   + TP
Sbjct: 118 VMAFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTP 177

Query: 119 AQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
             A  ++WILKDG   VGK++ +  G + D + K+ R  +D L ++G G+E+ +   PQ 
Sbjct: 178 G-AIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELATAAFPQF 236

Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
           FL MA + N  K +A V + +TR PIY ++A+  N+ D+ AKGE++  + +++G G  I 
Sbjct: 237 FLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIF 296

Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
           ++    S +         LS  +++S   E+R+  +NTLN  R  + V  F+KTG I S 
Sbjct: 297 ISKRNPSLVTSFA----FLSCGYLWSSYHEVRSVVLNTLNRARFTVAVDSFIKTGHIPSL 352

Query: 299 ADLRYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFIL 349
            +   +E +  P    E    V +G R      +P+ F  ++ LF +E++++
Sbjct: 353 KEGNSQETIFNPPWRHEP---VAIGSRFGEAFQEPASFVAIRPLFEDERYMV 401


>gi|56784316|dbj|BAD82242.1| unknown protein [Oryza sativa Japonica Group]
 gi|56785239|dbj|BAD82127.1| unknown protein [Oryza sativa Japonica Group]
          Length = 508

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 95/292 (32%), Positives = 162/292 (55%), Gaps = 13/292 (4%)

Query: 63  VDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTP 118
           V +F+  +  P G+P SV   Y+ Y  +RAL+HF   A+ V +T++LL + G+   + TP
Sbjct: 118 VMAFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTP 177

Query: 119 AQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
             A  ++WILKDG   VGK++ +  G + D + K+ R  +D L ++G G+E+ +   PQ 
Sbjct: 178 G-AIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELATAAFPQF 236

Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
           FL MA + N  K +A V + +TR PIY ++A+  N+ D+ AKGE++  + +++G G  I 
Sbjct: 237 FLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIF 296

Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
           ++    S +         LS  +++S   E+R+  +NTLN  R  + V  F+KTG I S 
Sbjct: 297 ISKRNPSLVTSFA----FLSCGYLWSSYHEVRSVVLNTLNRARFTVAVDSFIKTGHIPSL 352

Query: 299 ADLRYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFIL 349
            +   +E +  P    E    V +G R      +P+ F  ++ LF +E++++
Sbjct: 353 KEGNSQETIFNPPWRHEP---VAIGSRFGEAFQEPASFVAIRPLFEDERYMV 401


>gi|414879331|tpg|DAA56462.1| TPA: hypothetical protein ZEAMMB73_966463 [Zea mays]
 gi|414879332|tpg|DAA56463.1| TPA: hypothetical protein ZEAMMB73_966463 [Zea mays]
          Length = 503

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 93/282 (32%), Positives = 157/282 (55%), Gaps = 12/282 (4%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWIL 128
           P G+P SV   Y+ Y  +RAL+HF   A+ V +T++LL + G+   + TP  A  ++WIL
Sbjct: 117 PEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTPG-AIAINWIL 175

Query: 129 KDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
           KDG   VGK++ +  G + D + K+ R  +D L ++G G+E+ +   PQ FL MA + N 
Sbjct: 176 KDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELTTAAFPQFFLPMACVANV 235

Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
            K +A V + +TR PIY ++A+  N+ D+ AKGE++  + +++G G  I ++    S + 
Sbjct: 236 VKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIFISKRNPSLVT 295

Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
                   LS  ++ S  +E+R+  +NTLN  R  + V  F+KTG I S  +   +E + 
Sbjct: 296 SFA----FLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGHIPSLKEGNSQETIF 351

Query: 309 FPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFIL 349
            P    E    V +G R      +P+ F  ++ LF +E++++
Sbjct: 352 NPPWRHEP---VAIGSRFGEAFQEPASFLAIRPLFEDERYMV 390


>gi|357133637|ref|XP_003568430.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1 [Brachypodium
           distachyon]
          Length = 521

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 97/289 (33%), Positives = 156/289 (53%), Gaps = 11/289 (3%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
           SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+SV +T++LL + G+  + A   A
Sbjct: 125 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQSRATSGA 184

Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
             ++WILKDG   VGK++ +  G + D + K+ R   D L +LG G+E+ +   P LFL 
Sbjct: 185 VAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPHLFLP 244

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
           +A + N  K +A V + +TR PIY ++AK  N+ D+ AKGE++  + +++G G  I L S
Sbjct: 245 LACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSI-LIS 303

Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
               S+         LS  ++ S   E+R+  +NTLN  R  + V  F+KTG + S  + 
Sbjct: 304 KRNPSLVASFA---FLSCGYLLSSYREVRSVVLNTLNTARFTVAVDSFIKTGHVPSLKEG 360

Query: 302 RYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFIL 349
              E +  P    +    V +G R      +P+ F     LF +E++I+
Sbjct: 361 NLEETIFNPPWRHQP---VAIGSRFGEAFQEPASFVSTSPLFEDERYIV 406


>gi|168024902|ref|XP_001764974.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683783|gb|EDQ70190.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 113/371 (30%), Positives = 191/371 (51%), Gaps = 32/371 (8%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQAT 122
           SF+  +  P GYP SV   Y  Y Q+RA+Q+F   A+SV +T+SL+ A G+ R T A + 
Sbjct: 34  SFVKAYIVPEGYPDSVAPSYTPYMQWRAVQYFFGGAMSVFTTRSLMHALGVSRGTSASSA 93

Query: 123 V-VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
           V V+W++KDG   +GK++ +  G + D + K+ R     L  LG G+E+ +   P LFL 
Sbjct: 94  VAVNWVIKDGAGRIGKMLFARHGKKFDVDLKQLRFKGAMLMQLGAGVELTTMAVPHLFLP 153

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI---- 237
           +A   N AK +A V + +TR PIY +FA+  N+ D+ AKGE IS + +++G G GI    
Sbjct: 154 LACAANVAKNVAAVTSSSTRAPIYKAFARRENIGDVTAKGECISNIADLLGTGLGIFISK 213

Query: 238 ---QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
               L +T C           +LS  ++ S   E+++  + TLN  R  ++V  F++TG+
Sbjct: 214 RSPSLVATFC-----------VLSCGYMISSFNEIKSVRLATLNRARFGVVVQSFLETGQ 262

Query: 295 ISSPADLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCL-- 351
           + S  D   RE ++  P +        +VG        P +F   +E F +E +++    
Sbjct: 263 VPSIEDANRRESVITLPWQEKPLELGSRVG---QAFSNPREFIRTQEQFQKENYLVTYRP 319

Query: 352 GSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDP-SLTVLQDAY----DKMNDVFT 406
             +   +V++++AS +D +R    A     M +      + + ++DA      +  D++ 
Sbjct: 320 NKKRAYVVIKESASSDDVVRATFQAHVMLHMLRGLEGTITNSNVEDAVAESTKRTYDLYD 379

Query: 407 PLLSELQAKGW 417
               E ++KGW
Sbjct: 380 SFKEEAKSKGW 390


>gi|226502346|ref|NP_001144348.1| uncharacterized protein LOC100277253 [Zea mays]
 gi|195640526|gb|ACG39731.1| hypothetical protein [Zea mays]
          Length = 503

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 93/282 (32%), Positives = 157/282 (55%), Gaps = 12/282 (4%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWIL 128
           P G+P SV   Y+ Y  +RAL+HF   A+ V +T++LL + G+   + TP  A  ++WIL
Sbjct: 117 PEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTPG-AIAINWIL 175

Query: 129 KDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
           KDG   VGK++ +  G + D + K+ R  +D L ++G G+E+ +   PQ FL MA + N 
Sbjct: 176 KDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELTTAAFPQFFLPMACVANV 235

Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
            K +A V + +TR PIY ++A+  N+ D+ AKGE++  + +++G G  I ++    S + 
Sbjct: 236 VKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIFISKRNPSLVT 295

Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
                   LS  ++ S  +E+R+  +NTLN  R  + V  F+KTG I S  +   +E + 
Sbjct: 296 SFA----FLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGHIPSLKEGNSQETIF 351

Query: 309 FPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFIL 349
            P    E    V +G R      +P+ F  ++ LF +E++++
Sbjct: 352 NPPWRHEP---VAIGSRFGEAFQEPASFLAIRPLFEDERYMV 390


>gi|357126248|ref|XP_003564800.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
           distachyon]
          Length = 506

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 97/307 (31%), Positives = 165/307 (53%), Gaps = 14/307 (4%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWIL 128
           P G+P SV   Y+ Y  +RAL+HF   A+ V +T++LL + G+   + TP  A  ++WIL
Sbjct: 123 PEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTPG-AIAINWIL 181

Query: 129 KDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
           KDG   VGK++ +  G + D++ K+ R  +D L ++G G+E+ +   PQ FL MA + N 
Sbjct: 182 KDGAGRVGKMLFARQGKKFDNDLKQLRFSSDLLLEIGAGIELATAAFPQFFLPMACVANV 241

Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
            K +A V + +TR PIY ++A+  N+ D+ AKGE++  + +++G G  I +     S + 
Sbjct: 242 VKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIYITKRNPSLVT 301

Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL- 307
                  LLS  ++ S  +E+R+  +NTLN  R  + V  F+KTG + S  +   +E L 
Sbjct: 302 SFA----LLSCGYLMSSYQEVRSVVLNTLNRARFTVAVDSFIKTGYVPSLKEGNSQETLF 357

Query: 308 LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM--VLEQNAS 365
           + P R    A   + G       +P  F   + LF +E++++        +  +L+  A 
Sbjct: 358 IHPWRHEPVAIGSRFG---EAFQEPVSFVATRPLFEDERYMVTYNPTKDKVYALLKDQAK 414

Query: 366 GEDALRG 372
            +D L+ 
Sbjct: 415 PDDVLKA 421


>gi|225462223|ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257731 [Vitis vinifera]
          Length = 713

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 95/306 (31%), Positives = 163/306 (53%), Gaps = 12/306 (3%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQATVV 124
           FL    P G+P SV   YL YT +R +Q   S    VL+TQ+LL+A GL +     A  V
Sbjct: 285 FLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAV 344

Query: 125 SWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
           +W+LKDG+ ++ K++ S  G   D  PK WR+ AD L +   GLE+L+P  P  FL +  
Sbjct: 345 NWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGA 404

Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
           +    +  A +   +TR   Y+ FA + N +++ AKGEA   +   +G+  GI LA+ + 
Sbjct: 405 VAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIG 464

Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
           SS         +++ +H++  ++  ++  + TLNP R +++ ++++ +G++ S  ++   
Sbjct: 465 SSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVN-E 523

Query: 305 EDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNA 364
           E+ LFP   + +A      +  +K        E K+  AE +  L LGS+ +++V    +
Sbjct: 524 EEPLFPVVPLLNA------KPTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVV----S 573

Query: 365 SGEDAL 370
           S ED L
Sbjct: 574 SKEDVL 579


>gi|255569810|ref|XP_002525869.1| conserved hypothetical protein [Ricinus communis]
 gi|223534874|gb|EEF36563.1| conserved hypothetical protein [Ricinus communis]
          Length = 520

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 108/339 (31%), Positives = 183/339 (53%), Gaps = 14/339 (4%)

Query: 40  QFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSA 98
           +F+ +   SV + +   PV   +  SF+  +  P G+P SV   Y+ Y  +RAL+HF   
Sbjct: 106 RFKKNSFRSVSLQNPQAPVEELM--SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGG 163

Query: 99  ALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRI 156
           A+ V +T++LL + G+  + A   A  ++WILKDG   VGK++ +  G + D + K+ R 
Sbjct: 164 AMGVFTTKTLLSSVGVSKSTAIPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRF 223

Query: 157 LADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSD 216
             D L +LG G+E+ +   P LFL +A   N AK +A V + +TR PIY +FAK  N+ D
Sbjct: 224 AGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGD 283

Query: 217 LFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNT 276
           + AKGE +  + +++G G  I ++    S     +    LLS  +V+S  +E+++  ++T
Sbjct: 284 VTAKGECVGNVADLLGTGLSIMISKRNPS----LVTTFALLSCGYVFSSYQEVKSVVLHT 339

Query: 277 LNPQRTAMIVADFVKTGRISSPADLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKF 335
           LN  R ++ V  F+KTGR+ S  +   RE++  FP   +++   V   R       PS F
Sbjct: 340 LNRARFSVAVESFLKTGRVPSLQEGNTRENIFSFP--WLKERPIVIGSRFKDAFQDPSAF 397

Query: 336 HELKELFAEEKFILCLGSRWTDM--VLEQNASGEDALRG 372
             ++ LF +E++I+        +  +L+  A  +D L+ 
Sbjct: 398 LAIEPLFEKERYIVTYNPTKDKVYALLKDQAKADDILKA 436


>gi|296082804|emb|CBI21809.3| unnamed protein product [Vitis vinifera]
          Length = 537

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 95/306 (31%), Positives = 163/306 (53%), Gaps = 12/306 (3%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQATVV 124
           FL    P G+P SV   YL YT +R +Q   S    VL+TQ+LL+A GL +     A  V
Sbjct: 83  FLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAV 142

Query: 125 SWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
           +W+LKDG+ ++ K++ S  G   D  PK WR+ AD L +   GLE+L+P  P  FL +  
Sbjct: 143 NWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGA 202

Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
           +    +  A +   +TR   Y+ FA + N +++ AKGEA   +   +G+  GI LA+ + 
Sbjct: 203 VAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIG 262

Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
           SS         +++ +H++  ++  ++  + TLNP R +++ ++++ +G++ S  ++   
Sbjct: 263 SSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVN-E 321

Query: 305 EDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNA 364
           E+ LFP   + +A      +  +K        E K+  AE +  L LGS+ +++V    +
Sbjct: 322 EEPLFPVVPLLNA------KPTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVV----S 371

Query: 365 SGEDAL 370
           S ED L
Sbjct: 372 SKEDVL 377


>gi|356540900|ref|XP_003538922.1| PREDICTED: uncharacterized protein LOC100786144 [Glycine max]
          Length = 592

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 97/314 (30%), Positives = 163/314 (51%), Gaps = 12/314 (3%)

Query: 58  VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RP 116
           V+ +  D F     P G+P SV   YL Y+ +RA+Q        VL+TQSLL+A GL + 
Sbjct: 173 VWLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKG 232

Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
               A  ++W+LKDG+ ++ K++ SN G   D  PK WR+ AD L +   GLE+ +P CP
Sbjct: 233 AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMSTPACP 292

Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
           Q F+ +  +   ++  A +   +TR   ++ FA + N +++ AKGE        +G+  G
Sbjct: 293 QFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIVLG 352

Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
           I L + + SS    +    +L+ IH+Y  ++  ++  + TLNP R +++ ++++ +G+ +
Sbjct: 353 IGLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQ-A 411

Query: 297 SPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWT 356
            P      E+ LFP   I +A          K    +   E K+  AE +  L LGS+ +
Sbjct: 412 PPVKEVNDEEPLFPAVPILNATFAS------KAQSFALSSEAKDAAAEIEHRLQLGSKLS 465

Query: 357 DMVLEQNASGEDAL 370
           ++V     S ED L
Sbjct: 466 EIV----NSKEDVL 475


>gi|38344695|emb|CAE02373.2| OSJNBb0096E05.17 [Oryza sativa Japonica Group]
          Length = 593

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 98/342 (28%), Positives = 172/342 (50%), Gaps = 38/342 (11%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
           + R     +    P GYP SV+  YL+Y+ +R +Q   S    VLSTQ+LL+A GL +  
Sbjct: 185 WRRCATVAVQLLLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGLGKGA 244

Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
              A  V+W+LKDG+ ++ K++ S  G   D  PK WR+ AD L +   GLE+L+P+ P 
Sbjct: 245 IPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTPVFPH 304

Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
           LF+ +       +  A +   ATR   Y+ FA + N +++ AKGEA   +   +G+  GI
Sbjct: 305 LFVPIGAAAGAGRSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFLGIMLGI 364

Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
            LA+ + SS+   +I    ++ +H+Y  ++  ++  + TLNP R +++ ++++ +G++ S
Sbjct: 365 ALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPS 424

Query: 298 PADLRYREDLLFPGRLIEDAGNVKVG--RDPHKVVKP----------------SKFHELK 339
             ++   E L           N+ +G  R   K++ P                SK  E+ 
Sbjct: 425 VKEVNDEEPLFL---------NLSIGTSRKESKILSPQAKDAAEIICRRLQLGSKLSEII 475

Query: 340 E----------LFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
           E          L+  E+++L        ++L++ +S ED L+
Sbjct: 476 ENKEDACALFDLYKNEQYLLTEYKGKFCVILKEGSSPEDMLK 517


>gi|452823170|gb|EME30183.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
          Length = 519

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 103/361 (28%), Positives = 178/361 (49%), Gaps = 14/361 (3%)

Query: 29  FETSDSVTRHYQFQP-DGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYT 87
           +ET       Y   P +G L +    DSR V+ +++D   + F P     +V++ Y  +T
Sbjct: 128 YETCTHQRIDYFLNPSNGDLLIAPSIDSRTVWRKILDLLNSSFLPENCKKTVSKDYFWFT 187

Query: 88  QFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLIC-SNLG 144
            F  LQ+  +  + V+STQ LL A GL  +    ++  ++W+LKDG+  +G ++  S +G
Sbjct: 188 TFNFLQNVVNCCIGVISTQQLLRAVGLSASQGIGKSASLNWVLKDGIGRLGAIVFGSFIG 247

Query: 145 ARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPI 204
            R D++PKRWR+  D LY  G G E++SPL P+ FL +A L N  K  + +    T   I
Sbjct: 248 NRFDADPKRWRLWGDILYAFGIGTEIISPLLPRYFLLVASLANMVKATSYMLRLPTTAAI 307

Query: 205 YSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYS 264
             SFAK  N  D+ AK  +   L N+ G   GI  +  +  S     +   +  ++    
Sbjct: 308 RRSFAKRENFGDISAKANSQEVLSNLFGTFLGIGFSIILGHSWWQLSVAYFVYFIMFFLL 367

Query: 265 VIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP-----GRLIEDAGN 319
               ++   + TLN QR+ ++   + K+ ++S+  ++ YRE+++ P     GR I    +
Sbjct: 368 NYLGVKGLHLRTLNLQRSLLLGWSYWKSSQVSTVEEVNYRENVILPSIMSFGRYIRFGVS 427

Query: 320 VKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYA 379
           +   RD H  +  S+   L + + + K++L +      +V    A  +D L+  L A + 
Sbjct: 428 L---RDIH--LTGSELAALLDKYRQAKYLLTVRHGQVYIVFHMEARVKDELQAILQATFL 482

Query: 380 A 380
           A
Sbjct: 483 A 483


>gi|301622642|ref|XP_002940632.1| PREDICTED: UPF0420 protein C16orf58 [Xenopus (Silurana) tropicalis]
          Length = 473

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 121/412 (29%), Positives = 195/412 (47%), Gaps = 44/412 (10%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
           Y  + D F++   P GYP SV++ YL Y  +  +Q F+S+    L+TQSLL  +G+  + 
Sbjct: 62  YRSLRDFFVSLLLPHGYPDSVSKDYLAYQLWDTIQAFSSSVTGSLATQSLLRGSGVGDST 121

Query: 119 AQATVVS--WILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           A     +  WIL+DG   VG+++ + + G+R+D + KRWR+ AD L D+   +E+++P+ 
Sbjct: 122 ATVAAATITWILRDGTGMVGRILFAWMKGSRLDCDAKRWRLFADLLNDIAIFMEIMAPVF 181

Query: 176 PQLF-LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
           P  F L +   G F K I  VA  ATR  I    AK  N++D+ AK  +  TL N+ GL 
Sbjct: 182 PSCFTLTVCAAGVF-KCIVGVAGGATRAAITVHQAKRDNMADVCAKDGSQETLVNLAGLL 240

Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
             + L   V  SM    ++  LL+++H+Y+    +R+  + TLN  R ++++  ++K G+
Sbjct: 241 VSLFLVPLVSDSMMATYLLFFLLTILHLYANYRAVRSVIMETLNQSRLSIVLHHYLKEGK 300

Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSR 354
           I  P     RE LL PG  +     + +G     +   S   EL++L  +   +  LG R
Sbjct: 301 ILDPEAANLREPLL-PG--LGSQAPISLGVPLSYLA--SSVPELEQLKKDNHSLYLLGWR 355

Query: 355 WTD----MVLEQNASGEDALRGWL----------------------------VAAYAASM 382
                  +VL + AS  D +R  +                            V    +  
Sbjct: 356 EDTGSLAVVLHERASSVDMIRACVHAEMLHRKTSSSDHHKSSSFPILSALQCVRGRGSRP 415

Query: 383 AKSFHDPS-LTVLQDAYDKMNDVFTPLLSELQAKGWHTDR-FLDGTGTRFAW 432
              F      T L+  ++++N +F   L  L A GW T+R  LD    R  W
Sbjct: 416 CTGFEQHGRFTALEQTHNQINSMFPTFLQGLTASGWVTNRNLLDSDDWRANW 467


>gi|452819618|gb|EME26673.1| glutamate N-acetyltransferase isoform 2 [Galdieria sulphuraria]
 gi|452819619|gb|EME26674.1| glutamate N-acetyltransferase isoform 1 [Galdieria sulphuraria]
          Length = 540

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 97/345 (28%), Positives = 177/345 (51%), Gaps = 7/345 (2%)

Query: 54  DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
           + + ++ ++  S    F P G+P SV + YL+Y+ +R +Q+  S    VLSTQ+LL AAG
Sbjct: 128 NRQNLWEQITSSLSTFFLPEGFPESVGDSYLQYSIWRGIQNIISNISQVLSTQALLLAAG 187

Query: 114 LRPTPAQATVVSWIL--KDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVL 171
           +  T ++    +     KDG+  VG+L  + LG + D++PKRWR+++D LYD+G  LE+L
Sbjct: 188 VGRTHSETLAAATAWAWKDGIGQVGRLFTAVLGNQYDADPKRWRLVSDLLYDIGLSLEIL 247

Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
           SP   + FL MA LGN +K IA     + R  +  +F +  NL ++ AK +A + +  ++
Sbjct: 248 SPAFSKYFLFMAALGNISKSIAFAIGISCRYSVLKTFVRRENLGEISAKNDAQNVVTGML 307

Query: 232 GLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
           G   G+ ++  + ++ + ++ V    + I+ +   + M+ A + T+N QR  ++   ++ 
Sbjct: 308 GTLIGVWMSKCLPAAPKVRLGVFLFFTAIYSFFNYKSMQVAELTTINRQRGNILAHSYLM 367

Query: 292 TGRISSPADLRYREDL--LFPGRLIEDAGNVKVGRDPHKV-VKPSKFHELKELFAEEKFI 348
              + S +     E    LFP   +    +V++G    ++ +   K       F++EK+I
Sbjct: 368 FQNVPSVSMSNSYERFVPLFP--TVFQYPHVRLGVGLTELSISIEKIANYTRQFSKEKYI 425

Query: 349 LCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTV 393
           L L      + L +N    D L+  L  A+     +    P  T+
Sbjct: 426 LSLSDGEIRVALHRNHESVDLLQLLLHIAFIRDEIQRRLGPQCTI 470


>gi|432867528|ref|XP_004071227.1| PREDICTED: UPF0420 protein C16orf58 homolog [Oryzias latipes]
          Length = 423

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 110/372 (29%), Positives = 180/372 (48%), Gaps = 14/372 (3%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPA 119
           +V +F + F P GYP SV+E YL+Y  +  LQ F S     L TQ+ L   G+  +    
Sbjct: 41  LVGAFKSVFLPQGYPESVSEDYLQYQFWDTLQAFASTLSWTLVTQASLKGVGVGNQDATV 100

Query: 120 QATVVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
            A  V+WILKDG    G+++ +   G+R+DSE K+WR+ AD L D+   +E+L+P  P  
Sbjct: 101 AAATVTWILKDGTGMFGRILFAWQKGSRLDSEAKKWRLFADVLNDIAMFIEILAPFFPSF 160

Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
           F  +       K I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + 
Sbjct: 161 FTLIVCTAGVFKAIVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLVVSLI 220

Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
           L   +  +    + +  L +++H+++  + +R+  + T N  R ++++  ++K  RI SP
Sbjct: 221 LIPLITDNSSLSLGLFFLFTILHLFANYKAVRSVVMETFNQARLSIVLQQYLKDKRILSP 280

Query: 299 ADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK-PSKFHELKELFAEEKFILCLGSRWTD 357
            +   RE +    ++      +K+G     VV+ P  FH L        ++L + +    
Sbjct: 281 FEANRREPVFLEFKITTP---IKLGVKLQDVVQSPEDFH-LALRNNSMAYLLGVRNGCIC 336

Query: 358 MVLEQNASGEDALRGWLVAAYAASMAK--SFHDPSLTVLQDAYDKMNDV----FTPLLSE 411
           + L   AS  D +R    A   ++M K  SF   S   L +   K  +V    F   L  
Sbjct: 337 VCLGSGASVHDEIRAACQALCLSNMLKSPSFSQTSKQSLWEMVQKSQEVIDANFHTFLKG 396

Query: 412 LQAKGWHTDRFL 423
           ++A GW T R L
Sbjct: 397 VEAAGWDTKRTL 408


>gi|307105811|gb|EFN54059.1| hypothetical protein CHLNCDRAFT_8919, partial [Chlorella
           variabilis]
          Length = 325

 Score =  147 bits (372), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 98/314 (31%), Positives = 164/314 (52%), Gaps = 17/314 (5%)

Query: 60  HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPT 117
            R+ D   +   P  +P SV   Y  Y + R++Q+F   A+SV +T+SLL + G+  + +
Sbjct: 1   QRLGDLLADYLLPQDFPHSVAPQYSDYMRARSVQYFFGGAMSVFTTRSLLASLGVANKHS 60

Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
              A  ++W++KDG   +G+ + +  G  +D E K++R++ D L + G  LE+ + L P+
Sbjct: 61  SEAAAAINWVVKDGAGRLGRFLFARWGRELDCELKQFRLMGDVLMETGAALELSTVLMPR 120

Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
           +FL +A   N AK +A V A +TR PIY +FAK+ NL+D+ AKGE+++ L +VVG   GI
Sbjct: 121 MFLPLACTANLAKNLAAVTASSTRAPIYRTFAKQNNLADVTAKGESVANLADVVGTAFGI 180

Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVI---EEMRAAPVNTLNPQRTAMIVADFVKTGR 294
            LA       +  + V P  + + V  +I    E+ +  +  LN  R +     F  TGR
Sbjct: 181 ALA-------KANLPVLPTFAALSVGYLIASRREVDSVVLPYLNRARLSYTTRAFYSTGR 233

Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK-PSKFHELKELFAEEKFILCL-- 351
           +    +  YRE L+ P     + G V +G    +    P + H+    F+  ++ L    
Sbjct: 234 VPETLEGNYREPLM-PWSDPHN-GRVVLGATVEEACAGPQQLHDALAAFSGRQYALTYRP 291

Query: 352 GSRWTDMVLEQNAS 365
            +R    +L+Q AS
Sbjct: 292 DTRKCYALLKQGAS 305


>gi|449455818|ref|XP_004145647.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 611

 Score =  147 bits (372), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 102/354 (28%), Positives = 170/354 (48%), Gaps = 28/354 (7%)

Query: 43  PDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSV 102
           P   LS   V+    V+ R  D F     P G+P SV   YL Y+ +R +Q   S    V
Sbjct: 188 PSSSLSFSFVN----VWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGV 243

Query: 103 LSTQSLLFAAGL-RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADAL 161
           L+TQ+LL+A GL +     A  V+W+LKDG  ++ K+  S  G   D  PK WR+ AD L
Sbjct: 244 LATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLL 303

Query: 162 YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKG 221
            +   G+E+L+P  P  F+ +       +  A +   ATR   Y+ FA + N +++ AKG
Sbjct: 304 ENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKG 363

Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
           EA   +   +G+  GI LA+ + SS    +    ++++IH++  ++  ++  + TLNP R
Sbjct: 364 EAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYR 423

Query: 282 TAMIVADFVKTGRISSPADLRYRE------------------DLLFPGRLIEDAGNV--- 320
            +++ ++++ +G + S  D+   E                  D L      E A N+   
Sbjct: 424 ASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKAPDWSRDFLLSAEAKESAANIEKR 483

Query: 321 -KVGRDPHKVVKPSK-FHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRG 372
            ++G     V    +   EL  LF +E +IL        ++L+++AS  D L+ 
Sbjct: 484 LQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKA 537


>gi|359486217|ref|XP_003633414.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
 gi|297739449|emb|CBI29631.3| unnamed protein product [Vitis vinifera]
          Length = 504

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 106/329 (32%), Positives = 176/329 (53%), Gaps = 18/329 (5%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQ 120
           SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+ V +TQ+LL + G+   R  P  
Sbjct: 113 SFIRSYVVPEGFPDSVTPSYVPYMSWRALKHFFGGAMGVFTTQTLLNSVGVSRNRSAPG- 171

Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
           A  ++WILKDG   VGK++ +  G + D + K+ R+  D L +LG G+E+ +   P LFL
Sbjct: 172 AVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRLAGDLLMELGAGVELATAAVPHLFL 231

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
            +A   N AK +A V + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  I ++
Sbjct: 232 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 291

Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
               S     +    LLS  +V S  +E+++  ++TLN  R ++ +  F+KTGR+ S  +
Sbjct: 292 KRNPS----LVTTFALLSCGYVLSSYQEVKSVVLHTLNRARFSVAIDSFLKTGRVPSLQE 347

Query: 301 LRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM- 358
              +E++  FP   +ED   V   R          +  ++ LF +E++++        + 
Sbjct: 348 GNMKENIFSFP--WLEDKPIVLGSRFKDAFQDAGSYLSIEPLFEKERYLVTYNPSKGKVY 405

Query: 359 -VLEQNASGEDALRGWLVAAYAASMAKSF 386
            +L+  A  +D L+    AA+ A M   F
Sbjct: 406 ALLKDQAKSDDILK----AAFHAHMLLHF 430


>gi|301096265|ref|XP_002897230.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262107315|gb|EEY65367.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 474

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 111/391 (28%), Positives = 183/391 (46%), Gaps = 36/391 (9%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
           F P+GYP SV+E YL +  +  LQ   S    VL+TQS+L + G+   + TP  A  + W
Sbjct: 79  FLPAGYPDSVSEDYLGFQFWDTLQAMCSYLRGVLATQSVLQSVGVGDDKATPL-AAALQW 137

Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
           +L+DG   +G L  +  +G + D   KRWR+ AD + D+G  L++++P  P L  E+  +
Sbjct: 138 VLRDGSGMIGGLTFAYFVGPKFDVNVKRWRLFADVVNDVGLTLDMVAPYFPTLVTEVLCV 197

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
            +  K +  VAA ATR  + + FAK  N++D  AK  +  T   + GL  G+  A+ V S
Sbjct: 198 SSVCKAMCGVAAGATRSSLMTHFAKRDNMADCAAKEGSQETAVKLFGLMFGMYFANAVNS 257

Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL---- 301
           S     +   +L+++HVY+    +    + T+N  R  ++V  F+ +G      ++    
Sbjct: 258 SPHAVWMAFLVLTLVHVYANYNAVSCLCIPTVNCSRGLILVQRFLMSGNKPGKENVADHQ 317

Query: 302 -RY------REDLLFPGRLIEDAGNVKVGRDPHKVVKPSK-FHELKELFAEEKFILCLGS 353
            RY      +E+ +F   ++    ++ +G + H+ V  S     L +++ +E ++L +  
Sbjct: 318 GRYSIRSVNQEEPIFKDPVLPVTSHLMMGSELHQAVPTSTDLDRLLQIYEDETYLLTVVG 377

Query: 354 RWTDMVLEQNASGEDALRGWLVAAYAASMAKSF----------------HDPSLTVLQDA 397
               ++ E  A   D LR +  A        S                 H P   +L+  
Sbjct: 378 DRVHVMFEAEAKPTDELRAFFQAVLVLQALASVPPTTSTPSSSNEGSTKHQP---LLEST 434

Query: 398 YDKMNDVFTPLLSELQAKGWHTDRFLDGTGT 428
           Y KM   F   L  LQ  GW T  FL  T T
Sbjct: 435 YRKMTSDFPLFLQVLQNSGWRTHMFLLNTST 465


>gi|449484579|ref|XP_004156921.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 443

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 86/270 (31%), Positives = 143/270 (52%), Gaps = 6/270 (2%)

Query: 43  PDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSV 102
           P   LS   V+    V+ R  D F     P G+P SV   YL Y+ +R +Q   S    V
Sbjct: 158 PSSSLSFSFVN----VWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGV 213

Query: 103 LSTQSLLFAAGL-RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADAL 161
           L+TQ+LL+A GL +     A  V+W+LKDG  ++ K+  S  G   D  PK WR+ AD L
Sbjct: 214 LATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLL 273

Query: 162 YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKG 221
            +   G+E+L+P  P  F+ +       +  A +   ATR   Y+ FA + N +++ AKG
Sbjct: 274 ENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKG 333

Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
           EA   +   +G+  GI LA+ + SS    +    ++++IH++  ++  ++  + TLNP R
Sbjct: 334 EAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYR 393

Query: 282 TAMIVADFVKTGRISSPADLRYREDLLFPG 311
            +++ ++++ +G + S  D+   E+ LFP 
Sbjct: 394 ASLVFSEYLLSGEVPSIKDVN-NEEPLFPA 422


>gi|340372549|ref|XP_003384806.1| PREDICTED: UPF0420 protein C16orf58-like [Amphimedon queenslandica]
          Length = 432

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 115/418 (27%), Positives = 190/418 (45%), Gaps = 34/418 (8%)

Query: 34  SVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQ 93
           S  R Y    D   +V+V   +R    R+ D+    F P GYP SV+E YL+Y  +  LQ
Sbjct: 14  SSARSYMMIGD---TVRVTSKARNT--RLADTLKEVFLPQGYPNSVSEDYLQYQLWDTLQ 68

Query: 94  HFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSE 150
            F S     L+T ++L   G+  + A   A  ++WILKDG+   G+++ + L G  +D  
Sbjct: 69  AFCSYITGTLATHAMLKGVGVGDSSASPVAATLTWILKDGLGMTGRILFAWLRGYDLDCN 128

Query: 151 PKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAK 210
            K+WR++AD L D+   ++++SP  P  FL +A L +  + +  VA  ATR  +    A+
Sbjct: 129 AKKWRLIADILNDIAICMQLVSPFFPSCFLLIACLASITQSVVGVAGGATRAALVQHQAR 188

Query: 211 EGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMR 270
             N++D+ AK  +  TL N+ GL  G+ +   +         +    +++H+YS  + + 
Sbjct: 189 RDNMADVSAKDGSQETLVNLCGLLIGLIITPLIAGQTVFVWSLFFSFTLLHLYSNYKAVS 248

Query: 271 AAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV 330
              + TLN  R  +++ +    G IS P  +   E LLF     E    V+ G     V+
Sbjct: 249 VVSMETLNCNRLHLLMRNLFLNGTISEPNIVNREEPLLFRQ---ERFFTVEYGSSLSSVL 305

Query: 331 KPSKFH--ELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAA--------- 377
             S  +   + +    +KFI+ L    R   +     +S  D L+  L            
Sbjct: 306 IHSSDYSRSILQFNKGQKFIIKLSKTKRQIRVAFHCGSSSSDQLKAGLTVELIEFVCGGC 365

Query: 378 ---------YAASMAKSFHDPSLTVLQD-AYDKMNDVFTPLLSELQAKGWHTDRFLDG 425
                    YA  + K       +VL D   D +ND+F+ +L +L+ +GW     L G
Sbjct: 366 YGDSNYLKDYALLIRKGVESTDESVLVDVCQDIINDLFSSILDQLRKEGWTVSHHLLG 423


>gi|255557849|ref|XP_002519954.1| conserved hypothetical protein [Ricinus communis]
 gi|223541000|gb|EEF42558.1| conserved hypothetical protein [Ricinus communis]
          Length = 541

 Score =  144 bits (363), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 96/306 (31%), Positives = 160/306 (52%), Gaps = 13/306 (4%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQATVV 124
           F+    P GYP SV   YL Y+ +R +Q   S    VL+TQ+LL+A GL +     A  +
Sbjct: 132 FVRLMLPEGYPHSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 191

Query: 125 SWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
           +W+LKDG+ ++ K++ S  G   D  PK WR+ AD L +   GLE+L+P  P LF+ +  
Sbjct: 192 NWVLKDGIGYLSKIVLSKYGRHFDVNPKGWRLFADLLENAAFGLEILTPAFPHLFVFIGA 251

Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
                +  A +   ATR   Y+ FA + N +++ AKGEA   +   +G+  GI LA+ + 
Sbjct: 252 AAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLGIGLANCIG 311

Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
           SS+   +    +++ IH++  ++  ++  + TLNP R +++ ++++ +G+     D+   
Sbjct: 312 SSIPLALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPIKDVN-D 370

Query: 305 EDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNA 364
           E+ LFP        + K    P  VV      E ++   E +  L LGS+ +D+V     
Sbjct: 371 EEPLFPAVF----PHFKSADKPSLVVLS---LEARDAATEIERRLQLGSKLSDVV----N 419

Query: 365 SGEDAL 370
           S ED L
Sbjct: 420 SKEDVL 425


>gi|348671592|gb|EGZ11413.1| hypothetical protein PHYSODRAFT_338124 [Phytophthora sojae]
          Length = 478

 Score =  144 bits (363), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 186/391 (47%), Gaps = 36/391 (9%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
           F P+GYP SV+E YL +  +  LQ   S    VL+TQS+L + G+   + TP  A  + W
Sbjct: 83  FLPAGYPDSVSEDYLSFQFWDTLQAMCSYLRGVLATQSVLQSVGVGDDKATPL-AAALQW 141

Query: 127 ILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
           +L+DG   +G L  + L G + D   KRWR+ AD + D+G  L++++P  P L  E+  +
Sbjct: 142 VLRDGSGMIGGLTFAYLVGPKFDVNVKRWRLFADVINDVGLTLDMVAPYFPTLVTEVLCV 201

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
            +  K +  VAA ATR  + + FAK+ N++D  AK  +  T   + GL  G+  A+ V S
Sbjct: 202 SSVCKTMCGVAAGATRSSLMTHFAKKDNMADCAAKEGSQETAVKLFGLMFGMYFANAVNS 261

Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL---- 301
           S Q   +   LL+VIHVY+    +    + T+N  R  ++V  F+ +G      ++    
Sbjct: 262 SPQAVWVAFLLLTVIHVYANYNAVSCLCIPTVNCSRGLILVRRFLASGNKPGKENVADHN 321

Query: 302 -RY------REDLLFPGRLIEDAGNVKVGRDPHKVVKPSK-FHELKELFAEEKFILCLGS 353
            RY      +E+ +F   ++    ++ +G + H+ V  S     L  ++++E ++L +  
Sbjct: 322 GRYSIRSVNQEEPIFKDPVLPVTSHLVMGAELHQAVPSSHDLDRLLHIYSDEAYLLTVVG 381

Query: 354 RWTDMVLEQNASGEDALRGWLVAAYA----------------ASMAKSFHDPSLTVLQDA 397
               ++ +  A  +D LR +  A                   ++     H P   +L+  
Sbjct: 382 DRVLVMFQAEAKPKDELRAFFQAVLVLEALASVSPTASPPPSSNGGTITHQP---LLEST 438

Query: 398 YDKMNDVFTPLLSELQAKGWHTDRFLDGTGT 428
             KM   F+     LQ  GW T  FL  T T
Sbjct: 439 LRKMTADFSLFHQVLQNSGWRTHMFLLNTST 469


>gi|160333925|ref|NP_001103923.1| uncharacterized protein LOC555936 [Danio rerio]
 gi|126631351|gb|AAI34047.1| Zgc:162613 protein [Danio rerio]
          Length = 432

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 107/397 (26%), Positives = 184/397 (46%), Gaps = 47/397 (11%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
           F + F P GYP SV+E YL+Y  +  +Q F+S+    L+TQ+ L   G+  +     A  
Sbjct: 48  FKSVFLPQGYPESVSEDYLQYQLWDTVQAFSSSLSGTLATQASLRGVGVGNQEATVAAAT 107

Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
           ++W+L+DG   +G+++ + L G+++DSE K+WR+ AD L D+   +E+ +P  P  F  +
Sbjct: 108 ITWLLRDGTGMLGRILFAWLKGSKLDSEAKKWRLFADILNDVAMFIEIAAPHFPPFFTII 167

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
             +    K I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + L   
Sbjct: 168 VCIAGIFKSIVGVAGGATRAALTVHQARRNNMADISAKDGSQETLVNLAGLLVSLALIPL 227

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
           V  +     I+  L +V+H+++    +R+  + TLN  R ++++  ++  G++ SPA+  
Sbjct: 228 VTDNPLLTFILFYLFTVLHLFANYRAVRSVVMETLNEARLSIVLHQYLLHGQVLSPAEAN 287

Query: 303 YREDLLFP-------------GRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFIL 349
            RE +                G LI + G++++    +   KP                +
Sbjct: 288 QREPVFLSFRRTVPIKLGVRLGDLIHEPGDLQLALKNNS--KPYLIG-----IKNGAVCV 340

Query: 350 CLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQ-------------D 396
           CLGS         +AS +D ++  +  A   S       P   VLQ              
Sbjct: 341 CLGS---------DASVDDEIKA-VCQALCVSAVLHNESPLPAVLQQLSNTKDPWALISQ 390

Query: 397 AYDKMNDVFTPLLSELQAKGWHTDR-FLDGTGTRFAW 432
           ++  ++++F   L EL    W TDR  LD    R  W
Sbjct: 391 SHKMIDEIFQTFLKELDRAKWQTDRTLLDWNEWRVEW 427


>gi|168005499|ref|XP_001755448.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693576|gb|EDQ79928.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 503

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 104/395 (26%), Positives = 190/395 (48%), Gaps = 37/395 (9%)

Query: 58  VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RP 116
           V  R V+       P GYP SV + Y  YT +R  Q   S    VL+TQ+LL+A GL + 
Sbjct: 85  VITRFVELGKQIVLPDGYPQSVTDDYFEYTLWRMGQVIASQISGVLTTQALLYAVGLGKG 144

Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
               A  V+W+L+DG+ ++ K++ S  G   D  PK WR+++D + ++  GLE+L+P  P
Sbjct: 145 AIPTAAAVNWVLRDGIGYLSKIVLSKYGRHFDVHPKGWRLVSDLIENMSYGLELLTPAFP 204

Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
            LF+ +A      +  + +   AT+    +  A   N +++ AKGEA   +   +G+  G
Sbjct: 205 HLFVYLAAAAGAGRSASGLIQAATKSCFNAGMAANRNFAEIIAKGEAQGMVSKSIGIALG 264

Query: 237 IQLASTVCSSMQGKMIVGPLLSV--IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
           I +++ V S+  G ++VG    V  +H++  ++  +A  + TLNP R ++++A++ ++G 
Sbjct: 265 IAVSAYVGST--GPLLVGTFFGVTALHIFCNLKSYQAVQLRTLNPYRASLVLAEYFRSGN 322

Query: 295 ISSPADLRYREDL----LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELK----------- 339
           + S  ++   E +     F  RL    G+  +  +  K V  S  + L+           
Sbjct: 323 VVSVREVNGEEPIFPFSFFNFRLRGSKGDGTLLSEESKAVAHSIENHLEFGATFGAAVKT 382

Query: 340 --------ELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYA--------ASMA 383
                   +++  E+++L        +VL++ A+  D LR  + A Y          S  
Sbjct: 383 REEADALLDIYKSERYLLSQHDDRLLVVLKEGATPRDMLRAMMQAIYLDYLQGTGFTSHR 442

Query: 384 KSFHDP-SLTVLQDAYDKMNDVFTPLLSELQAKGW 417
              HD  +  VL+ ++D + + F  +  ++   GW
Sbjct: 443 NLMHDSGNGGVLRVSHDFVVNQFEQIRQDIAGVGW 477


>gi|226505832|ref|NP_001140509.1| uncharacterized protein LOC100272571 [Zea mays]
 gi|194699768|gb|ACF83968.1| unknown [Zea mays]
          Length = 491

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 89/287 (31%), Positives = 154/287 (53%), Gaps = 20/287 (6%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV 123
           SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+ V +T++LL + G+  +      
Sbjct: 110 SFIRSYVVPEGFPDSVTPSYVPYMNWRALKHFFGGAMGVFTTRTLLSSVGVSQS------ 163

Query: 124 VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
                KDG   VGK++ +  G + D++ K+ R  +D L ++G G+E+ +   PQ FL MA
Sbjct: 164 -----KDGAGRVGKMLFARQGKKFDNDLKQLRFSSDLLLEIGAGIELTTAAFPQFFLPMA 218

Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
            + N  K +A V + +TR PIY ++A+  N+ D+ AKGE++  + +++G G  I ++   
Sbjct: 219 CVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIFISKRN 278

Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRY 303
            S +         LS  ++ S  +E+R+  +NTLN  R  + V  F+KTG I S  +   
Sbjct: 279 PSLVTSFA----FLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGHIPSLKEGNS 334

Query: 304 REDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFIL 349
           +E +  P    E    V +G R      +P+ F  ++ LF +E++++
Sbjct: 335 QETIFNPPWRHE---TVAIGSRFGEAFQEPASFLAIRPLFEDERYMV 378


>gi|168022981|ref|XP_001764017.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162684756|gb|EDQ71156.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 437

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 110/392 (28%), Positives = 198/392 (50%), Gaps = 28/392 (7%)

Query: 55  SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
           SR V++ ++ +FL    P GYP SV   YL +  +  LQ  ++   S+LSTQ+LL   G+
Sbjct: 42  SRAVWNNILSAFL----PEGYPNSVTPDYLPFQTWDTLQGLSTYIRSMLSTQALLGGIGV 97

Query: 115 RPTPAQATVVS----WILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLE 169
             T   ATVV     W ++D    +G ++ + + G+ +DS  K+WR+ AD L D+G  ++
Sbjct: 98  GET--TATVVGATFLWFMRDFTGMLGSILFTLHQGSNLDSSAKQWRLTADLLNDVGMLMD 155

Query: 170 VLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
           ++SPL P  F+ +  +G+ A+ +  VA+ ATR  +   FA   N +D+ AK  +  T   
Sbjct: 156 LVSPLFPGAFVTILCIGSMARSVTGVASGATRAALTQHFALRKNAADVSAKEGSQETAAT 215

Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
           +VG+  G+ LA     ++    +   LL+  H+Y+    +R+  + +LN +RT+++++ F
Sbjct: 216 MVGMLLGMVLARLTAENVVALWVSFLLLTAFHMYANYRAVRSLCLTSLNAERTSIVLSAF 275

Query: 290 VKTGRISSPADLRYREDLL-----FPGRLIEDAGNVKVGRDPHKVVKPS---------KF 335
            +  ++ +P ++  +E LL      P +  + AG+  + +     V+ S          F
Sbjct: 276 KEGRKVPTPREVADQEHLLPRIPRLPWQ-KQRAGDAWLPKSIRFGVQISSLKLDDGWKSF 334

Query: 336 HELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQ 395
             L E ++ + +++        +VL + A+ +D L G++ A   A +      P+    +
Sbjct: 335 LSLVERYSRDPYLMVPKGAIVHVVLHKLATPQDFLHGYVHALCLADL--QLRSPADQSEE 392

Query: 396 DAYDKMNDVFTPLLSELQAKGWHTDRFLDGTG 427
            A   M + +T  L  L   GW TDR L  TG
Sbjct: 393 LAISWMKNNYTAFLDSLGKSGWATDRILVETG 424


>gi|356497988|ref|XP_003517837.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 473

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 116/438 (26%), Positives = 208/438 (47%), Gaps = 29/438 (6%)

Query: 14  EPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPS 73
           E   S+S + I +  +  S        F      S  ++  S   +  V    L  F P 
Sbjct: 36  ETSSSVSNKTITLEEWNGSSPTKLSKTFTIKASSSSVLITRSGARFTHVWRRLLQAFVPE 95

Query: 74  GYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDG 131
           G+P SV   Y+ +  +  LQ  ++   ++LSTQ+LL A G+    A        W L+D 
Sbjct: 96  GFPSSVTADYVPFQIWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGATFQWFLRDL 155

Query: 132 MQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAK 190
              +G ++ +   G+ +DS  K WR++AD + DLG  ++++SPL P  F+ +  LG+ ++
Sbjct: 156 TGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLISPLFPSAFVFVVCLGSISR 215

Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
               VA+ ATR  +   FA + N +D+ AK  +  T+  ++G+  G+ +A          
Sbjct: 216 SFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVARLTIGHPLAI 275

Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
                 L+V H+Y+    +R   +N+LNP+R+++++  F++TG++ SP  +  +E +L P
Sbjct: 276 WFSFLSLTVFHMYANYRAVRCLALNSLNPERSSILLQHFMETGQVLSPKQVSSQEHVL-P 334

Query: 311 GRLIEDAGNVKVGRDPHKVV----KPSKFH--ELKE-------LFAEEKFILCLGSRWTD 357
            + I  + + K     HK V    + S F   E+KE        + + K++L       D
Sbjct: 335 IQFI--SWSSKKANCLHKKVCLGMRISSFDNMEIKEHLLCAAPYYTKAKYLLVERKGIID 392

Query: 358 MVLEQNASGEDALRGW---LVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQA 414
           +++ ++++  D L+ +   LV A     +KS H       Q   D   +VF   + +L++
Sbjct: 393 VIVHKDSNAADVLKSFFHALVLAKNVHKSKSLHSEG----QKWIDYQYEVF---IQKLKS 445

Query: 415 KGWHTDRFLDGTGTRFAW 432
            GW T+R L     R  W
Sbjct: 446 LGWKTERLLSPIIWRAYW 463


>gi|410903033|ref|XP_003964998.1| PREDICTED: UPF0420 protein C16orf58 homolog [Takifugu rubripes]
          Length = 442

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 103/399 (25%), Positives = 194/399 (48%), Gaps = 29/399 (7%)

Query: 60  HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPT 117
           + VV +F + F P GYP SV+  YLRY  +  +Q F S+    L+T++ L   G+  +  
Sbjct: 42  NSVVGAFKSVFLPQGYPESVSGDYLRYQFWDTVQAFFSSLSGTLATRASLRGVGVGNQEA 101

Query: 118 PAQATVVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
              A  ++W+LKDG    G+++ +   G ++DSE K+WR+ AD L D+   +E+L+P  P
Sbjct: 102 TVAAATMTWLLKDGTGMCGRILFAWRKGTKLDSEAKKWRLFADILNDVAMFMEILAPYFP 161

Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
             F  +  +    K +  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   
Sbjct: 162 AFFTLVMCVSGIFKSLVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVS 221

Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
           + L   V  +    + +  L +V+H+++  + +R+  + TLN  R ++++  +++ G+I 
Sbjct: 222 LVLIRFVSDNAALTLGLFFLFAVLHLFANYKAVRSVVMETLNEARLSVVLQQYLENGQIL 281

Query: 297 SPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWT 356
           SP +   RE +    + +     +++G    +V++ S+  +     + + F+L + +   
Sbjct: 282 SPLEANEREPVFMDFKKVVP---IRLGVRLQEVIQSSEELKFALKLSNKPFLLGVKNGGV 338

Query: 357 DMVLEQNASGEDALRGWLVAAYAASMAKS---------------FHDPSL-------TVL 394
            + L   AS +D +R    A +   M  S               F+ P          ++
Sbjct: 339 CVCLGPEASVQDEIRVMCQALWLRRMLSSQPSRGASAEQQVLFLFNIPLFPSSKGHWDLV 398

Query: 395 QDAYDKMNDVFTPLLSELQAKGWHTDR-FLDGTGTRFAW 432
            ++++ ++++F+P L  L A GW   R  LD    R  W
Sbjct: 399 HESHNVIDNIFSPFLKGLVAAGWDLKRNLLDWDEWRVEW 437


>gi|348510102|ref|XP_003442585.1| PREDICTED: UPF0420 protein C16orf58 homolog [Oreochromis niloticus]
          Length = 433

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 103/393 (26%), Positives = 180/393 (45%), Gaps = 27/393 (6%)

Query: 60  HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPT 117
           + V   F + F P GYP SV++ YL+Y  +  +Q F+S+    L+TQ+ L  F  G +  
Sbjct: 43  NSVFGVFKSVFLPQGYPESVSDDYLQYQFWDTMQAFSSSLSGTLATQASLRGFGVGNQEA 102

Query: 118 PAQATVVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
              A  V+W+ KDG   +G+++ +   G+++DSE K+WR+ AD L D+   +E+L+P  P
Sbjct: 103 TVAAATVTWLFKDGTGMLGRILFAWRKGSKLDSEAKKWRLFADVLNDIAMFMEILAPYFP 162

Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
             F  +       K I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   
Sbjct: 163 ACFTFIVCTAGIFKSIVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVS 222

Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
           + L   V  +    + +  L +V+H+++  + +R+  + T N  R ++++  ++K  RI 
Sbjct: 223 LILIPLVTDNPVLTLCLFFLFTVLHLFANYKAVRSVVMETFNEARLSIVLQQYLKDKRIL 282

Query: 297 SPADLRYREDLLFP-GRLIEDAGNVK---VGRDPHKVVKPSKFHELKELFAEEKFILCLG 352
           SP +   RE + F  G+ +     V+   + + P  +    K + +  L       +C+ 
Sbjct: 283 SPPEANQREPVFFEFGQTVPMKLGVRLQEIAQSPEDLQLALKENSMPYLLGIRNGCVCV- 341

Query: 353 SRWTDMVLEQNASGEDALRGWLVAAY------------AASMAKSFHDPSLTVLQDAYDK 400
                  L   AS  D +R    A              A   A         ++ +++  
Sbjct: 342 ------CLGPEASASDEIRAMCQAVVLSNMLSSSNSREATGAALKQQKGHWEMVHESHKL 395

Query: 401 MNDVFTPLLSELQAKGWHTDR-FLDGTGTRFAW 432
           M+ +F P L   +A GW   R  +D    R  W
Sbjct: 396 MDAIFNPFLKGAEAAGWDMKRTMIDWDEWRVEW 428


>gi|222628554|gb|EEE60686.1| hypothetical protein OsJ_14164 [Oryza sativa Japonica Group]
          Length = 609

 Score =  140 bits (354), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 98/358 (27%), Positives = 172/358 (48%), Gaps = 54/358 (15%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
           + R     +    P GYP SV+  YL+Y+ +R +Q   S    VLSTQ+LL+A GL +  
Sbjct: 185 WRRCATVAVQLLLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGLGKGA 244

Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
              A  V+W+LKDG+ ++ K++ S  G   D  PK WR+ AD L +   GLE+L+P+ P 
Sbjct: 245 IPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTPVFPH 304

Query: 178 LFLEMAGLGNFAKGIAVVAAR----------------ATRLPIYSSFAKEGNLSDLFAKG 221
           LF+ +       +  A +                   ATR   Y+ FA + N +++ AKG
Sbjct: 305 LFVPIGAAAGAGRSAAALIQTIFTHFMLENANHMRKAATRSCFYAGFAVQRNFAEVIAKG 364

Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
           EA   +   +G+  GI LA+ + SS+   +I    ++ +H+Y  ++  ++  + TLNP R
Sbjct: 365 EAQGMVSKFLGIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYR 424

Query: 282 TAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG--RDPHKVVKP------- 332
            +++ ++++ +G++ S  ++   E L           N+ +G  R   K++ P       
Sbjct: 425 ASLVFSEYLLSGQVPSVKEVNDEEPLFL---------NLSIGTSRKESKILSPQAKDAAE 475

Query: 333 ---------SKFHELKE----------LFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
                    SK  E+ E          L+  E+++L        ++L++ +S ED L+
Sbjct: 476 IICRRLQLGSKLSEIIENKEDACALFDLYKNEQYLLTEYKGKFCVILKEGSSPEDMLK 533


>gi|449016577|dbj|BAM79979.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 719

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 106/384 (27%), Positives = 181/384 (47%), Gaps = 33/384 (8%)

Query: 56  RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL- 114
           R +  R+ +  +  + P GYP +V + YL +T +R LQ+  ++ + V ST++LL   GL 
Sbjct: 328 RSLVERLRNWLIQVYLPHGYPHTVTQDYLAFTLWRLLQNTAASVMGVFSTEALLLGLGLG 387

Query: 115 RPTPAQATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSP 173
           +     A  V WILKDG  H GK+  + + G + D +P+ WRI++D L DL   LE+++P
Sbjct: 388 KNVSGTAQAVQWILKDGFGHAGKIAYAAVAGKQFDMDPRSWRIMSDLLEDLAGVLEIITP 447

Query: 174 LCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
           +    FL +A +    + IA +   ATR  I+ S A   N  DL  KGE+      ++GL
Sbjct: 448 IFSGQFLVLASIATTMRAIAAMTGTATRHAIFKSMALAENQGDLATKGESQGVTTKLIGL 507

Query: 234 GAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV--- 290
           G GI L+  +       +I   + +VIH+ +  + M+    +TLN QR A+++  F+   
Sbjct: 508 GLGITLSKRIGQDYTNLLIAYGIAAVIHLAANFKAMQCVEFSTLNRQRMALLLDAFMSAR 567

Query: 291 -KTGRISS-------PADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELF 342
            +T R +S       P ++  +E  + P    +    +  G    +V     F     L+
Sbjct: 568 AQTSRQASRQLTLPTPREICAKERFMIPP-WRDYRPEIVFGASLQRVKHARHFLHCAALY 626

Query: 343 AEEKFILCL--GSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVL----QD 396
             E++I+     ++   + L   A+ +D L             K+F +    +L    + 
Sbjct: 627 RNERYIIYAEPSTKHVLVFLRDGATVKDQL-------------KAFMNAQRYLLDRDAER 673

Query: 397 AYDKMNDVFTPLLSELQAKGWHTD 420
           AY+     F+  L    A GW T+
Sbjct: 674 AYNFTQREFSTFLQATVASGWSTN 697


>gi|356495287|ref|XP_003516510.1| PREDICTED: uncharacterized protein LOC100778944 [Glycine max]
          Length = 649

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 98/314 (31%), Positives = 164/314 (52%), Gaps = 12/314 (3%)

Query: 58  VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RP 116
           V+ +  D F     P G+P SV   YL Y+ +RA+Q        VL+TQSLL+A GL + 
Sbjct: 174 VWLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKG 233

Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
               A  ++W+LKDG+ ++ K++ SN G   D +PK WR+ AD L +   GLE+ +P  P
Sbjct: 234 AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVDPKGWRLFADLLENAAFGLEMCTPAFP 293

Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
           Q F+ +  +   ++  A +   +TR   ++ FA + N +++ AKGE        +G+G G
Sbjct: 294 QFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIGLG 353

Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
           I L + + SS    +    +L+ IH+Y  ++  ++  + TLNP R +++ ++++ +G+ +
Sbjct: 354 IGLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQ-A 412

Query: 297 SPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWT 356
            P      E+ LFP   I    N         +V  S   E K+  AE +  L LGS+ +
Sbjct: 413 PPVKEVNDEEPLFPAVPIL---NATFANKAQSIVLSS---EAKDAAAEIEHRLQLGSKLS 466

Query: 357 DMVLEQNASGEDAL 370
           ++V     S ED L
Sbjct: 467 EIV----NSKEDVL 476


>gi|357487621|ref|XP_003614098.1| hypothetical protein MTR_5g044720 [Medicago truncatula]
 gi|355515433|gb|AES97056.1| hypothetical protein MTR_5g044720 [Medicago truncatula]
          Length = 749

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 106/378 (28%), Positives = 192/378 (50%), Gaps = 31/378 (8%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVV- 124
           FL  F P G+P SV   Y+ +  +  LQ  ++   ++LSTQ+LL A G+      ATV+ 
Sbjct: 363 FLQAFVPEGFPSSVTPDYVPFQIWDLLQGLSTYIRTMLSTQALLSAIGV--GEKSATVIG 420

Query: 125 ---SWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
               W L+D    +G ++ +   G+ +DS  K WR++AD + DLG  ++++SPL P  F+
Sbjct: 421 ATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLISPLFPSAFV 480

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
            +  LG+ ++    VA+ ATR  +   FA + N +D+ AK  +  T+  ++G+  G+ +A
Sbjct: 481 FIVCLGSISRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVA 540

Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
                           L++ H+Y+    ++   +N+LNP+R+++++  F+++ ++ SP  
Sbjct: 541 RITTGHPLAIWFSFLSLTLFHMYANYRAVQCLSLNSLNPERSSILLQRFMESRQVLSPKQ 600

Query: 301 LRYREDLLFPGRLIEDAGNVKVGRDPHKV---VKPSKFHE---------LKELFAEEKFI 348
           +   E +L P +L   +   KV     KV    + S F E         L   +A+ K+I
Sbjct: 601 VSPLEHVL-PIQLTSMSSK-KVNSLDSKVRLGTRISSFDEMEIKEHLLSLASYYAKAKYI 658

Query: 349 LCLGSRWTDMVLEQNASGEDALRGW---LVAAYAASMAKSFHDPSLTVLQDAYDKMNDVF 405
           L       D+++ +++S  D L+ +   LV A      KS H  S   +++ Y    +VF
Sbjct: 659 LVEKKGIIDVIVHKDSSSADVLKSFIHALVLANNVYKGKSLHSDSQMWMENQY----EVF 714

Query: 406 TPLLSELQAKGWHTDRFL 423
              + +L++ GW T+R L
Sbjct: 715 ---IQKLKSLGWKTERLL 729


>gi|157422883|gb|AAI53381.1| Zgc:162613 protein [Danio rerio]
          Length = 432

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 102/384 (26%), Positives = 185/384 (48%), Gaps = 21/384 (5%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
           F + F P GYP SV+E YL+Y  +  +Q F+S+    L+TQ+ L   G+  +     A  
Sbjct: 48  FKSVFLPQGYPESVSEDYLQYQLWDTVQAFSSSLSGTLATQASLRGVGVGNQEATVAAAT 107

Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
           ++W+L+DG   +G+++ + L G+++DSE K+WR+ AD L D+   +E+ +P  P  F  +
Sbjct: 108 ITWLLRDGTGMLGRILFAWLKGSKLDSEAKKWRLFADILNDVAMFIEIAAPHFPPFFTLI 167

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
             +    K I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + L   
Sbjct: 168 VCIAGIFKSIVGVAGGATRAALTVHQARRNNMADISAKDGSQETLVNLAGLLVSLALIPL 227

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
           V  +     I+  L +V+H+++  + +R+  + TLN  R ++++  ++  G++ SPA+  
Sbjct: 228 VTDNPLLTFILFFLFTVLHLFANYKAVRSVVMETLNEARLSIVLHQYLLHGQVLSPAEAN 287

Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQ 362
            RE +    R       +K+G     ++      +L      + +++ + +    + L  
Sbjct: 288 QREPVFLSFR---RTVPIKLGVRLGDIIHEPGDLQLALKNNSKPYLIGIKNGAVCVCLGS 344

Query: 363 NASGEDALRGWLVAAYAASMAKSFHDPSLTVLQ-------------DAYDKMNDVFTPLL 409
           +AS +D ++  +  A   S       P   VLQ              ++  ++++F   L
Sbjct: 345 DASVDDEIKA-VCQALCVSAVLHNESPLPAVLQQLSNTKDPWALISQSHKMIDEIFQTFL 403

Query: 410 SELQAKGWHTDR-FLDGTGTRFAW 432
            EL    W TDR  LD    R  W
Sbjct: 404 KELDRAKWQTDRTLLDWNEWRVEW 427


>gi|412993492|emb|CCO14003.1| predicted protein [Bathycoccus prasinos]
          Length = 584

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 83/263 (31%), Positives = 141/263 (53%), Gaps = 13/263 (4%)

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPT 117
           R +D+  + FFP     SV   Y  Y ++R  Q   S+ ++VL+TQ +L A GL   R  
Sbjct: 163 RQIDNIRDSFFPQKT--SVTSDYWDYAKYRFTQRIASSCMTVLATQQMLAAIGLGASRSI 220

Query: 118 PAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
           PA A  ++W+LKDG+  +GKL + +N G   DS+ KR R  +  +Y     +E ++P  P
Sbjct: 221 PA-AAALNWVLKDGLGRLGKLSVATNFGRSFDSDVKRLRFSSSLVYTGAVFIETITPFFP 279

Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
           + FL +A + N  K I +      R PI  +FA E NL ++ AK  A   L + +GL  G
Sbjct: 280 KHFLAIATVANVGKSIGITTGNVVRPPIQKTFALEENLGEIAAKTSAQQVLADNIGLALG 339

Query: 237 IQL----ASTVCSSMQGKMIVG--PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
           + +       V  ++Q  + +   P+L+ I + S+ +++++  + T+N +R  +I   FV
Sbjct: 340 VSVMKFQTKFVSLALQRAVPLALFPILASIDLVSIHKQLKSVQLRTVNKERAEIIAEAFV 399

Query: 291 KTGRISSPADLRYREDLLFPGRL 313
           +  ++++   +  +E LLFP RL
Sbjct: 400 REKKLATQLQVAEKERLLFPARL 422


>gi|302835848|ref|XP_002949485.1| hypothetical protein VOLCADRAFT_80645 [Volvox carteri f.
           nagariensis]
 gi|300265312|gb|EFJ49504.1| hypothetical protein VOLCADRAFT_80645 [Volvox carteri f.
           nagariensis]
          Length = 495

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 84/245 (34%), Positives = 134/245 (54%), Gaps = 6/245 (2%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
             P+G+P SV   Y  Y  +R +Q+F   A+SV +TQSLL A G+  R     A  ++W+
Sbjct: 113 LLPNGFPDSVGPQYAPYMAWRGVQYFFGGAISVFTTQSLLGALGVAGRFQGEAAAAINWV 172

Query: 128 LKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
           +KDG   +G+L+ +  G  +D E K++R+  D L + G  LE+ +   P  FL +A   N
Sbjct: 173 IKDGAGRLGRLLFARWGRELDCELKQFRLAGDLLMEAGAALELATVYAPPAFLPLACTAN 232

Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
            +K +A VAA +TR PIY +FA + NL+D+ AKGE+++ L +++G  AGI L+      M
Sbjct: 233 LSKNLAAVAASSTRAPIYRTFALQNNLADITAKGESVANLADILGTVAGIALSRMKLPRM 292

Query: 248 QGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
               +    LS  ++ S  +E+ +  +  +N  R A     ++  G I   A+  + E L
Sbjct: 293 PTFCV----LSAGYLLSSRKEVDSVELPYMNRARLAYATQRYLSDGFIPGVAEANHNEPL 348

Query: 308 LFPGR 312
           L  GR
Sbjct: 349 LPWGR 353


>gi|357133639|ref|XP_003568431.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2 [Brachypodium
           distachyon]
          Length = 508

 Score =  138 bits (347), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 104/343 (30%), Positives = 171/343 (49%), Gaps = 27/343 (7%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV 123
           SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+SV +T++LL + G+  +      
Sbjct: 125 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQS------ 178

Query: 124 VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
                +DG   VGK++ +  G + D + K+ R   D L +LG G+E+ +   P LFL +A
Sbjct: 179 -----RDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPHLFLPLA 233

Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
            + N  K +A V + +TR PIY ++AK  N+ D+ AKGE++  + +++G G  I L S  
Sbjct: 234 CMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSI-LISKR 292

Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRY 303
             S+         LS  ++ S   E+R+  +NTLN  R  + V  F+KTG + S  +   
Sbjct: 293 NPSLVASFA---FLSCGYLLSSYREVRSVVLNTLNTARFTVAVDSFIKTGHVPSLKEGNL 349

Query: 304 REDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM--VL 360
            E +  P    +    V +G R      +P+ F     LF +E++I+        +  +L
Sbjct: 350 EETIFNPPWRHQP---VAIGSRFGEAFQEPASFVSTSPLFEDERYIVTYNPTKDKVYALL 406

Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMND 403
           +  A  +D L+    AA+ A +   F + S   L +A   MN 
Sbjct: 407 KDQAKPDDILK----AAFHAHVLLHFINASHANL-NARKHMNS 444


>gi|225423472|ref|XP_002274189.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
 gi|297738087|emb|CBI27288.3| unnamed protein product [Vitis vinifera]
          Length = 437

 Score =  138 bits (347), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 97/382 (25%), Positives = 187/382 (48%), Gaps = 23/382 (6%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
           +  V    L  F P G+P SV   Y  +  + +LQ  ++   ++LSTQ+LL A G+    
Sbjct: 44  FSHVWRRILQAFVPEGFPSSVTPDYAPFQLWDSLQGLSTYIRTMLSTQALLSAIGVGEKS 103

Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           A        W L+D    +G ++ +   G+ +DS  K WR++AD + DLG  ++++SPL 
Sbjct: 104 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLVSPLF 163

Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
           P  F+ +  LG+ ++    VA+ ATR  +   FA + N +D+ AK  +  T+  +VG+  
Sbjct: 164 PSAFVFIVCLGSLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMVGMAF 223

Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
           G+ LA                L++ H+Y+    +R   + TLN +R+++++  F++TG++
Sbjct: 224 GMLLARITMGHSVAIWFSFLSLTMFHMYANYRAVRCLILTTLNSERSSILIEHFMETGQV 283

Query: 296 SSPADLRYREDLLFPGRLIEDAGNVKVGRDPH-------------KVVKPSKFHELKELF 342
            SP ++   E +L P  L   + ++K  +  H             K+      H  +  +
Sbjct: 284 LSPQEVSKMEHVL-P--LWASSWSLKNAKSLHTRIHLGVRVSSLDKLEVTDLLHSAESHY 340

Query: 343 AEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMN 402
            + K++L        +++ ++++  D L+ ++ A   A++     D S+ +  ++   M+
Sbjct: 341 KKAKYLLLERKGIISVIMHKDSTASDVLQSFIHALVMANLV----DKSIYIHLESRSWMD 396

Query: 403 DVFTPLLSELQAKGWHTDRFLD 424
             +   L +LQ+ GW T+R L 
Sbjct: 397 KQYENFLLKLQSSGWKTERLLS 418


>gi|299115992|emb|CBN75993.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 479

 Score =  138 bits (347), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 87/249 (34%), Positives = 141/249 (56%), Gaps = 7/249 (2%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT--PAQATV 123
            L    PSG+P SV+  Y+ Y  ++      SAA  V+STQ+LL+A GL     P  AT+
Sbjct: 1   MLGLVLPSGFPASVHPPYMAYAGWQFTGMAASAAAGVMSTQALLYAMGLGAGALPLAATL 60

Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
            +W++KDG+  +G +  S+L   R D+ PK WR++A    D    LE+LSPL P  FL +
Sbjct: 61  -NWVIKDGLGQLGGVAFSSLVSTRFDANPKLWRVVAAVSLDASMVLELLSPLAPAYFLPI 119

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
           A + N  K ++ +AA A+R  I++  + +GNL+D+ AK  A + L  + G GAG+ L+++
Sbjct: 120 ASVANIGKNVSFLAASASRAAIHNVLSLKGNLADVTAKSGAQTILACMAGTGAGVTLSAS 179

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV---ADFVKTGRISSPA 299
           + S     +     LSV+H+ +    MR   + T + QR  ++V   AD  ++  + SP 
Sbjct: 180 LGSEFHAIVPACLCLSVVHLGANHMSMRRLGIPTFDHQRLELVVNGMADGDRSRGMPSPQ 239

Query: 300 DLRYREDLL 308
           D+  +ED++
Sbjct: 240 DIAAQEDVV 248


>gi|42565615|ref|NP_190175.2| uncharacterized protein [Arabidopsis thaliana]
 gi|30793915|gb|AAP40410.1| unknown protein [Arabidopsis thaliana]
 gi|30794095|gb|AAP40490.1| unknown protein [Arabidopsis thaliana]
 gi|110739240|dbj|BAF01534.1| hypothetical protein [Arabidopsis thaliana]
 gi|332644566|gb|AEE78087.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 608

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 88/297 (29%), Positives = 157/297 (52%), Gaps = 9/297 (3%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQAT 122
           + L +F  P G+P SV   YL Y+ +R +Q   S    VL+TQSLL+A GL +     A 
Sbjct: 192 NLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAA 251

Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
            ++W+LKDG+ ++ K++ S  G   D  PK WR+ AD L +   G+E+L+P+ PQ F+ +
Sbjct: 252 AINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMI 311

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
                  +  A +   ATR    + FA + N +++ AKGEA   +   VG+  GI +A+ 
Sbjct: 312 GAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSVGILLGIVVANC 371

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
           + +S    +    +++ IH+Y+ ++  +   + TLNP R +++ ++++ +G+     ++ 
Sbjct: 372 IGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVN 431

Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMV 359
             E+ LFP     +       + P K+       E K   A+ +  L LGS+ +D++
Sbjct: 432 -DEEPLFPTVRFSNM------KSPEKLQDFVLSSEAKAAAADIEERLQLGSKLSDVI 481


>gi|327286048|ref|XP_003227743.1| PREDICTED: UPF0420 protein C16orf58-like [Anolis carolinensis]
          Length = 420

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 112/404 (27%), Positives = 185/404 (45%), Gaps = 32/404 (7%)

Query: 36  TRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHF 95
           T HY+  PDG LS  +       Y  + D F   F P GYP SV+  YL Y  +  +Q F
Sbjct: 19  TCHYKSLPDGRLSCVLDRLEDEGYRSIRDFFTAVFLPQGYPESVSPDYLAYQIWDTVQAF 78

Query: 96  TSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLICS-NLGARMDSEPK 152
            S+    L+TQ++L   G+    +   A  V+WILKDG   +G+++ + + G+++D + K
Sbjct: 79  ASSITGTLATQAVLKGVGVGDETSTVTAATVTWILKDGTGMLGRILFAWSKGSKLDCDAK 138

Query: 153 RWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG 212
           +WR+ AD L D+   +E+L+P  P  F  +  +  F K I  VA  ATR  +    A+  
Sbjct: 139 QWRLFADVLNDVAILMEILAPAFPACFTLIVCISGFFKCIVGVAGGATRAALTMHQARRD 198

Query: 213 NLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAA 272
           N++D+ AK  +  TL N+ GL   + L   V  +++    +  L +++H+Y+    +RA 
Sbjct: 199 NMADVSAKDGSQETLVNLAGLLFSLLLIPLVADNLRLTYSLYGLFTILHLYANYRAVRAV 258

Query: 273 PVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLI-------EDAGNVKV--- 322
            + TLN  R  +++  F+K+G +  P     +E LL     I          G V +   
Sbjct: 259 CMETLNRARLCLVLQSFLKSGEVPGPDVTNPQEPLLLVDFSITIYVPWSSSTGTVSILLH 318

Query: 323 -GRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAAS 381
             R    V+K      L E          L      +VLE+ +          + A    
Sbjct: 319 QQRRSVDVIKACSHALLLEAL--------LSWDVASLVLEKES----------LLAVQKQ 360

Query: 382 MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDG 425
           + +        V+ + +  ++ +F   L+ L A GW T+R L G
Sbjct: 361 LREGSKSKHCGVVSETHRILDRIFPKFLAGLMAAGWVTERNLLG 404


>gi|224129512|ref|XP_002328735.1| predicted protein [Populus trichocarpa]
 gi|222839033|gb|EEE77384.1| predicted protein [Populus trichocarpa]
          Length = 398

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 92/281 (32%), Positives = 154/281 (54%), Gaps = 9/281 (3%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILK 129
           P G+P SV   Y+ Y  +RAL+HF   A+ V +T++LL + G+  + A   A  ++WILK
Sbjct: 15  PEGFPDSVIPSYVPYMTWRALKHFFGGAMGVFTTKTLLNSVGVSKSQAIPGAVAINWILK 74

Query: 130 DGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFA 189
           DG   VGK++ +  G + D + K+ R   D L +LG  +E+ +   P LFL +A   N A
Sbjct: 75  DGAGRVGKMLYARQGKKFDYDLKQLRFAGDLLMELGAAVELATAAVPHLFLPLACAANVA 134

Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
           K +A V + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  I ++    S    
Sbjct: 135 KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNVADLLGTGLSIMISKRNPS---- 190

Query: 250 KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL- 308
            +    LLS  +V S  +E+++  ++TLN  R ++ V  F+KTG++ S  +   +E++  
Sbjct: 191 LVTTFALLSCGYVLSSYQEVKSVVLHTLNGARFSVAVESFLKTGQVPSLHEGNVKENIFN 250

Query: 309 FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFIL 349
           FP   ++D       R       P  +  ++ LF  E++I+
Sbjct: 251 FP--WLKDRSISLGPRFKDAFQDPRAYLAIEPLFERERYIV 289


>gi|148905902|gb|ABR16113.1| unknown [Picea sitchensis]
          Length = 446

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 104/374 (27%), Positives = 188/374 (50%), Gaps = 26/374 (6%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P G+P SV   YL +  + +LQ  ++    +LSTQ+LL A G+  T A        W 
Sbjct: 59  FLPEGFPNSVTSDYLPFQIWDSLQGLSTYIRCMLSTQALLSAIGVGETTATVVGATFQWF 118

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           L+D    +G ++ +   G+ +DS  K WR++AD + DLG  ++++SPL P  F+ +  LG
Sbjct: 119 LRDFTGMLGGILFTLYQGSNLDSNAKMWRLIADLMNDLGMLMDLVSPLYPASFVIILCLG 178

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           + A+    VA+ ATR  +   FA + N +D+ AK  +  T+  ++G+  G+ LA  V + 
Sbjct: 179 SIARSFTGVASGATRAALTQHFALQHNAADISAKEGSQETMATMIGMLLGMLLAH-VTAG 237

Query: 247 MQGKMIVG-PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
             G M     LL+  H+++  + +R   +N+LN  RT++++  F+  G++ +P D+   E
Sbjct: 238 HPGAMWSSFLLLTFFHMFANYQAVRCLCLNSLNEGRTSVLLKCFITNGQVLNPKDVSSME 297

Query: 306 DLL---FPGRLIEDA----GNVKVGRDPHKVVKPSKFHELK---ELFAEEKFILCLGSRW 355
            LL      R++          K+G     + +      LK     + + K++L   +  
Sbjct: 298 YLLPNWMQLRMLTKGRPFNATFKLGAKISSLSQSDMSKLLKLSASCYGKGKYLLLCRNGT 357

Query: 356 TDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDK-----MNDVFTPLLS 410
            +++L +++   D L+     +Y   +  S+H  S + + DA +      M   ++  +S
Sbjct: 358 IEVILNKDSVSADILQ-----SYIHGLVLSYHMGSSSSVNDANNADSCLWMEKYYSMFIS 412

Query: 411 -ELQAKGWHTDRFL 423
            +LQA GW+T R L
Sbjct: 413 KQLQASGWNTGRLL 426


>gi|297815746|ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321594|gb|EFH52015.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 613

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 87/297 (29%), Positives = 154/297 (51%), Gaps = 9/297 (3%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQAT 122
           + L +F  P G+P SV   YL Y+ +R +Q   S    VL+TQSLL+A GL +     A 
Sbjct: 198 NLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQVSGVLATQSLLYAVGLGKGAIPTAA 257

Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
            ++W+LKDG+ ++ K++ S  G   D  PK WR+ AD L +   G+E+L+P+ PQ F+ +
Sbjct: 258 AINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMI 317

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
                  +  A +   ATR    + FA + N +++ AKGEA   +   +G+  GI +A+ 
Sbjct: 318 GAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSMGILLGIVVANC 377

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
           + +S    +    +++ IH+Y+ ++  +   + TLNP R +++ ++++ +G+     ++ 
Sbjct: 378 IGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVN 437

Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMV 359
             E L    R +    N+K        V  S+     E   E    L LGS+ +D++
Sbjct: 438 DEEPLFPTVRFL----NMKSPEKLQDFVLSSEAKAAAEDIEER---LQLGSKLSDVI 487


>gi|308807048|ref|XP_003080835.1| putative glutamate/ornithine acetyltransferase (ISS) [Ostreococcus
           tauri]
 gi|116059296|emb|CAL55003.1| putative glutamate/ornithine acetyltransferase (ISS) [Ostreococcus
           tauri]
          Length = 1023

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 89/267 (33%), Positives = 142/267 (53%), Gaps = 17/267 (6%)

Query: 60  HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RP 116
            R V+   +   P   P SV   Y  Y ++R  Q   S+ +SV +TQ +L A GL   R 
Sbjct: 573 RRCVERASHALLPE--PTSVTSDYWEYARYRFAQRIASSCISVFATQQMLTAVGLGASRT 630

Query: 117 TPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
            PA A  V+W+LKDG+  +GKL + +N G   DS+ KR+R  +  +YD  + +E+++P  
Sbjct: 631 LPA-AAAVNWVLKDGLGRLGKLSVAANFGRTFDSDVKRFRFTSSVVYDASSFIEIITPYF 689

Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL-- 233
           PQ FL +A + N  K + +  A   R PI  +F  E NL+++ AK  A   L + +GL  
Sbjct: 690 PQHFLPLATVANIGKSVGITTANVVRAPIQRTFILEENLAEVAAKTSAQQVLADNIGLAL 749

Query: 234 --GA--GIQLASTVCSSMQGKMIV---GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
             GA  G+   +TV   ++  + +   GP L+V+ +  + +E++A  + T+N +R  +I 
Sbjct: 750 AVGAAKGMSKVATVRPEIRRALPLFAFGP-LAVLDLICIYKELKAVQLRTINKERAEIIA 808

Query: 287 ADFVKTGRISSPADLRYREDLLFPGRL 313
             +VK   I + A +   E LL P RL
Sbjct: 809 EAYVKEREIPTRARVANAERLLIPARL 835


>gi|145349487|ref|XP_001419164.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579395|gb|ABO97457.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 269

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 91/270 (33%), Positives = 138/270 (51%), Gaps = 21/270 (7%)

Query: 56  RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL- 114
           R    R V+   +   P   P SV   Y  Y +FR  Q   S+ +SV +TQ +L A GL 
Sbjct: 6   RDAARRCVERAQHALLPE--PSSVTNDYWEYAKFRFAQRIASSCISVFATQQMLTAVGLG 63

Query: 115 --RPTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVL 171
             R  PA A  V+W+LKDG+  +GKL + +N G   DS+ KR+R  +  +YD  + +E++
Sbjct: 64  ASRTLPA-AAAVNWVLKDGLGRLGKLSVAANFGRTFDSDVKRFRFTSSVVYDASSFIEII 122

Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
           +P  PQ FL +A + N  K + +  A   R PI  +F  E NL+++ AK  A   L + +
Sbjct: 123 TPYFPQHFLPLATVANIGKSVGITTANVVRAPIQRTFILEENLAEVAAKTSAQQVLADNI 182

Query: 232 GLGAGIQLA------STVCSSMQGKMIV---GPL--LSVIHVYSVIEEMRAAPVNTLNPQ 280
           GL   +  A      +TV   ++  + V   GPL  L +I +Y   +E++A  + T+N +
Sbjct: 183 GLALAVGAAKAVSKVATVRPEIRRALPVFAFGPLAALDLICIY---KELKAVQLRTINKE 239

Query: 281 RTAMIVADFVKTGRISSPADLRYREDLLFP 310
           R  +I   FVK   I S A +   E L  P
Sbjct: 240 RAEIIAEAFVKEREIPSRAHVADAERLFIP 269


>gi|298709448|emb|CBJ31354.1| putative glutamate/ornithine acetyltransferase (ISS) [Ectocarpus
           siliculosus]
          Length = 789

 Score =  134 bits (337), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 79/235 (33%), Positives = 130/235 (55%), Gaps = 17/235 (7%)

Query: 76  PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP-AQATVVSWILKDGMQH 134
           P  V   Y  +T +R  Q F +  ++V  TQ+LL A G++      A  +SW+LKD +  
Sbjct: 187 PEDVTPDYYHFTFWRMFQRFVAGTINVFGTQALLLALGIKAKRLGAAAAMSWVLKDALGK 246

Query: 135 VGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIA 193
            G+++  S +G R DS+ KRWR  +  LY  G GLE+++ + P  FL +A + N  K ++
Sbjct: 247 FGRILWASKMGRRFDSDAKRWRFRSSLLYAAGNGLEIVTYVFPASFLVLAAMANSFKQMS 306

Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAK---------------GEAISTLFNVVGLGAGIQ 238
           ++ + ATR  IY SFA+  N+ D+ AK               GEA   + +V+G+ +GI 
Sbjct: 307 MLTSSATRNTIYRSFARGENIGDITAKACPPAAAMPNAKKHEGEAQIAVVDVLGMLSGIG 366

Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
           L + V +S Q  +    +LS++ + ++  E+RA   + LN +RT ++V DFV +G
Sbjct: 367 LCTLVGTSRQTIVAAYVILSLVDISAIYNEIRAVCFSVLNHERTHLLVRDFVASG 421


>gi|170037702|ref|XP_001846695.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167880979|gb|EDS44362.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 417

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 103/381 (27%), Positives = 185/381 (48%), Gaps = 28/381 (7%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR---PTPAQAT 122
           F N F P+GYP SV+  YL Y ++  +Q F S     L+T ++L   G+      P  AT
Sbjct: 49  FQNLFLPAGYPDSVSSDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDVANPLSAT 108

Query: 123 VVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLS-PLCPQLFL 180
            ++W+LKDGM H G+++ +   G  +D + K+WRI AD L D+  G+++   PL P++  
Sbjct: 109 -ITWVLKDGMGHFGRILFAWWKGTELDIDSKKWRIRADILNDIAMGVDLFVLPLYPKVST 167

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
            +    +  K I  VA  ATR  +    A  GNL+D+ +K  A  T  N++    G+ L 
Sbjct: 168 YILCATSTMKAIVGVAGGATRSALTQHHALRGNLADVASKDSAQETCVNLIASFVGLGLL 227

Query: 241 STVCSSMQGKMIVGPLL--SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
           + V      +++ G  L  +++H+Y+ I+ ++A  + T N  R  + + ++ K+G + SP
Sbjct: 228 THV---QNQRVLYGLFLFVTLLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGTMLSP 284

Query: 299 ADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSK-FHELKELFA----EEKFILCLGS 353
             +   E +   G+ +     VK+G    ++ +  +  ++L+ + A     EKF+L    
Sbjct: 285 QQVNKLERVTI-GQTVSLTARVKIGCSARELAQYYRNCYDLENVIACFDSREKFLLAETR 343

Query: 354 RWTDMVLEQNASGEDALRGWL-VAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSEL 412
            +  + L       D ++ +  VA+Y    ++         L+D + +  + +   LS  
Sbjct: 344 HYVGVYLHFTVKPLDIIKSYFYVASYLQDKSQ---------LRDRFWETQNKWNEFLSMA 394

Query: 413 QAKGWHTDRFLDGTGT-RFAW 432
           Q +GW+    L  T   R  W
Sbjct: 395 QREGWNVQAHLLKTDEYRLDW 415


>gi|356502614|ref|XP_003520113.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
           [Glycine max]
          Length = 486

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 101/390 (25%), Positives = 193/390 (49%), Gaps = 23/390 (5%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
           +  V    L  F P G+P SV   Y+ +  +  L   ++   ++LSTQ+LL A G+    
Sbjct: 94  FTHVWRRLLQAFVPEGFPSSVTADYVPFQIWDLLXGLSTYIRTMLSTQALLIAIGVEEKS 153

Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           A        W L+D    +G ++ +   G+ +DS  K WR++AD + DLG  ++++SPL 
Sbjct: 154 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSYAKMWRLVADLMNDLGMLMDLISPLF 213

Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
           P  F+ +  LG+ ++    VA+ ATR  +   FA + N +D+ AK  +  T+  ++G+  
Sbjct: 214 PSAFVFVVCLGSISRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGMAL 273

Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
           G+ +A                L+V H+Y+    ++   +N+LNP+R++++   F++TG++
Sbjct: 274 GMLVARLTIGHPLAIWFSFLSLTVFHMYANYRAVQXLTLNSLNPERSSILFQHFMETGQV 333

Query: 296 SSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV----KPSKFH--ELKE-------LF 342
            SP  +  +E +L P + I  + + K     HK V    + S F   E+KE        +
Sbjct: 334 LSPEQVSSQEHIL-PIQFI--SWSSKKANCLHKKVNLGMRISSFDNMEIKEHLLSAAPYY 390

Query: 343 AEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMN 402
            + K++L       D+++ ++++  D L+   +  +A  +AK+ H  S +V  +    ++
Sbjct: 391 TKAKYLLVERKGIIDVIVHKDSNAADVLK---LFFHALVLAKNVHK-SKSVHSEGQKWID 446

Query: 403 DVFTPLLSELQAKGWHTDRFLDGTGTRFAW 432
             +   + +L++ GW  +R L     R  W
Sbjct: 447 YQYEVFIQKLKSLGWKIERLLSPIIWRAYW 476


>gi|297844278|ref|XP_002890020.1| hypothetical protein ARALYDRAFT_888749 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335862|gb|EFH66279.1| hypothetical protein ARALYDRAFT_888749 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 440

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 100/390 (25%), Positives = 188/390 (48%), Gaps = 24/390 (6%)

Query: 52  VDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFA 111
           +  S   ++ V    L  F P G+P SV   Y+ +  +  LQ  ++    +LSTQ+LL A
Sbjct: 39  IQRSANRFNHVWRRVLQAFVPEGFPGSVTPDYVGFQFWDTLQGLSTYIKMMLSTQALLSA 98

Query: 112 AGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGL 168
            G+    A        W L+D    +G ++ +   G+ +DS  K WR++AD + D+G  +
Sbjct: 99  IGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLM 158

Query: 169 EVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
           ++LSPL P  F+ +  LG+ ++    VA+ ATR  +   FA + N +D+ AK  +  T+ 
Sbjct: 159 DLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETMA 218

Query: 229 NVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
            ++G+  G+ LA     +          L+V H+Y+    +R   +N+LN +R+++++  
Sbjct: 219 TMMGMSLGMLLARFTSGNPMAIWFSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTH 278

Query: 289 FVKTGRISSPADLRYREDLLFPGRLIEDAGNVK-VGRDPHKVVKPSKFHELKEL------ 341
           F++TG++ SP  +   E +L        + N K + +  H  V+ S    L+ +      
Sbjct: 279 FMQTGQVLSPEQVSSMEGVLPVWATSLRSTNSKLLHKRVHLGVRVSSLPRLEMMQLLNGV 338

Query: 342 ----FAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASM---AKSFHDPSLTVL 394
               +   K++L        ++L +++S  D L+ ++ A   A++   + SF+      +
Sbjct: 339 GASSYKNAKYLLAHRKGNVSVILHKDSSAADVLKSYIHAIVLANLMEKSTSFYSEGEAWI 398

Query: 395 QDAYDKMNDVFTPLLSELQAKGWHTDRFLD 424
              YD+        L +L++ GW T+R L 
Sbjct: 399 DKHYDE-------FLHKLRSGGWKTERLLS 421


>gi|47217805|emb|CAG07219.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 412

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 77/247 (31%), Positives = 137/247 (55%), Gaps = 3/247 (1%)

Query: 65  SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQAT 122
           +F + F P GYP SV+  YL+Y  +  LQ F+S+    L+T++ L   G+  +     A 
Sbjct: 74  TFQSVFLPQGYPDSVSSDYLQYQFWDTLQAFSSSLSGTLATRASLRGVGVGNQQATVAAA 133

Query: 123 VVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
             +W+LKDG    G+++ + L G ++DSE K+WR+ AD L D+   +++L+P  P  F  
Sbjct: 134 TTTWLLKDGTGMCGRILFAWLKGTKLDSEAKKWRLFADILNDIAMFMDILAPYFPAFFTL 193

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
           +  +    K +  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + L S
Sbjct: 194 IMCVSGVFKSLVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVSLALVS 253

Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
            V  +    + +  L +V+H+++  + +R+  + TLN  R ++++ +++K GR+ SP + 
Sbjct: 254 FVADNAALTLSLFFLFTVLHLFANYKAVRSVVMETLNEARLSIVLQEYLKDGRMLSPLEA 313

Query: 302 RYREDLL 308
             RE +L
Sbjct: 314 NQREPVL 320


>gi|157113953|ref|XP_001652155.1| hypothetical protein AaeL_AAEL006646 [Aedes aegypti]
 gi|108877519|gb|EAT41744.1| AAEL006646-PA [Aedes aegypti]
          Length = 417

 Score =  131 bits (329), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 101/370 (27%), Positives = 179/370 (48%), Gaps = 25/370 (6%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR---PTPAQAT 122
           F N F P+GYP SV+  YL Y ++  +Q F S     L+T ++L   G+      P  AT
Sbjct: 49  FQNLFLPAGYPDSVSGDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDVANPLSAT 108

Query: 123 VVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLS-PLCPQLFL 180
            ++W+LKDGM H G++I +   G  +D + K+WRI AD L DL   +++   P  P+   
Sbjct: 109 -ITWVLKDGMGHFGRIIFAWWKGTELDIDSKKWRIRADVLNDLAMSIDLFVLPYYPKAST 167

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
            +       K I  VA  ATR  +    A  GNL+D+ +K  A  T  N++    G+ L 
Sbjct: 168 FILCATTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLFLL 227

Query: 241 STVCSSMQGKMIVGPLL--SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
           + V      +++ G  L  +++H+Y+ I+ ++A  + T N  R  + + ++ KTG + SP
Sbjct: 228 THV---QDQRVLYGLFLFVTLMHIYANIKAVKAVCLRTFNEARYLIALEEYFKTGTMLSP 284

Query: 299 ADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKF-HELKELFA----EEKFILCLGS 353
             +   E +   G+ +     VK+G    ++ +  +  ++L+ L A     +KF++    
Sbjct: 285 EQVNKLERVTI-GQTVTLTARVKIGCSARELAQYYRVCYDLENLMACFDSRDKFLIAETR 343

Query: 354 RWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQ 413
            +  + L   A   D ++ +    Y AS  +  +      L+D Y ++ + +   L+  Q
Sbjct: 344 NYVGVYLHFTAKPLDIIKSYF---YVASYLQDKNQ-----LRDRYWEIQNKWNEFLNLAQ 395

Query: 414 AKGWHTDRFL 423
            +GW+    L
Sbjct: 396 CEGWNVQAHL 405


>gi|320170693|gb|EFW47592.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 710

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 82/256 (32%), Positives = 140/256 (54%), Gaps = 9/256 (3%)

Query: 41  FQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAAL 100
            Q  G  + ++ +D+R   HR++++FL    P GYP SV   Y+ Y++++A+Q  T    
Sbjct: 139 LQNAGQRAWRLREDTR---HRLIEAFL----PKGYPHSVTPNYMGYSRWQAVQSVTGTMT 191

Query: 101 SVLSTQSLLFAAGLRPTPAQ-ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILA 158
            VLSTQ+LL+A GL       A  ++WI+KDG+  +G ++ S  + ++ DS+PKR R  +
Sbjct: 192 GVLSTQALLYAVGLGAGAIPLAGALNWIVKDGLGQLGGVVYSTFISSKFDSDPKRHRFWS 251

Query: 159 DALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLF 218
           +A     T LE+L+PL P +FL +A + N  K I+ +AA +TR  +++S     NL D+ 
Sbjct: 252 NAALQASTLLEILTPLAPGMFLFLASVSNIGKNISWLAASSTRAQMHNSLTLRDNLGDVT 311

Query: 219 AKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLN 278
            K  + +   +++G G GI +A  V +     ++    LS++++ S         + TLN
Sbjct: 312 GKAGSQAIATSLIGTGLGIAIAPFVGTDPIAVLLAFLPLSIVNMVSNYRSNTIVHMRTLN 371

Query: 279 PQRTAMIVADFVKTGR 294
            QR   +   F+   R
Sbjct: 372 VQRAERLFMHFLLHNR 387


>gi|81879541|sp|Q91W34.1|CP058_MOUSE RecName: Full=UPF0420 protein C16orf58 homolog
 gi|16877856|gb|AAH17158.1| CDNA sequence BC017158 [Mus musculus]
          Length = 466

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 108/381 (28%), Positives = 181/381 (47%), Gaps = 39/381 (10%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
             P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 75  LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G++I +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       
Sbjct: 135 VKDSTGMLGRIILAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--C 244
           N AK I  VA  ATR  +    A+  N++D+ AK  +  T+ N+ GL   + +   V  C
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254

Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
            S+     V  LL+ +H+Y+    +RA  + TLN  R  +++  F++ G +  PA     
Sbjct: 255 PSLSLGCFV--LLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQM 312

Query: 305 EDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGS--RWTD 357
           E L    +P        ++ +G   H +V  S   ELK+L     E ++LC         
Sbjct: 313 EPLWTGFWPSL------SLSLGVPLHHLV--SSVSELKQLVEGHHEPYLLCWNKSRNQVQ 364

Query: 358 MVLEQNASGEDALR----GWLVAAYA------ASMAKSFH----DP---SLTVLQDAYDK 400
           + L Q A  E  LR    G ++ A          +A+  H    DP   S  ++++ +  
Sbjct: 365 VALSQEAGPETVLRAATHGLILGALQEDGPLPGELAELRHQVQADPKKESWILVRETHQV 424

Query: 401 MNDVFTPLLSELQAKGWHTDR 421
           ++ +F   L  LQA GW T++
Sbjct: 425 LDTLFPKFLKGLQAAGWKTEK 445


>gi|42562046|ref|NP_172832.3| uncharacterized protein [Arabidopsis thaliana]
 gi|28393739|gb|AAO42280.1| unknown protein [Arabidopsis thaliana]
 gi|28973405|gb|AAO64027.1| unknown protein [Arabidopsis thaliana]
 gi|332190945|gb|AEE29066.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 440

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 97/383 (25%), Positives = 184/383 (48%), Gaps = 24/383 (6%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
           ++ V    L  F P G+P SV   Y+ +  +  LQ  ++    +LSTQ+LL A G+    
Sbjct: 46  FNHVWRRVLQAFVPEGFPGSVTPDYVGFQLWDTLQGLSTYTKMMLSTQALLSAIGVGEKS 105

Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           A        W L+D    +G ++ +   G+ +DS  K WR++AD + D+G  +++LSPL 
Sbjct: 106 ATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLF 165

Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
           P  F+ +  LG+ ++    VA+ ATR  +   FA + N +D+ AK  +  T+  ++G+  
Sbjct: 166 PSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETMATMMGMSL 225

Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
           G+ LA     +     +    L+V H+Y+    +R   +N+LN +R+++++  F++TG++
Sbjct: 226 GMLLARFTSGNPMAIWLSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTHFIQTGQV 285

Query: 296 SSPADLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKEL----------FAE 344
            SP  +   E +L      +    +  + +     V+ S    L  L          +  
Sbjct: 286 LSPEQVSSMEGVLPLWATSLRSTNSKPLHKRVQLGVRVSSLPRLDMLQLLNGVGASSYKN 345

Query: 345 EKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASM---AKSFHDPSLTVLQDAYDKM 401
            K++L        ++L +++   D L+ ++ A   A++   + SF+      +   YD+ 
Sbjct: 346 AKYLLAHIKGNVSVILHKDSKPADVLKSYIHAIVLANLMEKSTSFYSEGEAWIDKHYDE- 404

Query: 402 NDVFTPLLSELQAKGWHTDRFLD 424
                 LL +L++ GW T+R L 
Sbjct: 405 ------LLHKLRSGGWKTERLLS 421


>gi|255081626|ref|XP_002508035.1| predicted protein [Micromonas sp. RCC299]
 gi|226523311|gb|ACO69293.1| predicted protein [Micromonas sp. RCC299]
          Length = 665

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 81/260 (31%), Positives = 138/260 (53%), Gaps = 17/260 (6%)

Query: 68  NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVV 124
           N  FP+    SV E Y  Y +FR LQ   S+ ++V +TQ +L A G+   R  PA A  +
Sbjct: 196 NSLFPARE--SVTEDYWEYAKFRFLQRMASSCITVFATQQMLAAIGMGASRRLPA-AAAI 252

Query: 125 SWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
           +W+LKDG+  +GKL + +N G   DS+ KR+R  +  +YD  + +E+++P  P+ FL +A
Sbjct: 253 NWVLKDGLGRLGKLGVAANFGREFDSDVKRFRFTSSVVYDASSLVEMITPFFPKKFLALA 312

Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
            L N  K + +  A   R PI  SF  E NL+++ AK  A   + + +GL   + LA+ +
Sbjct: 313 TLANIGKSVGITTANVVRAPIQRSFVLEENLAEVAAKTSAQQVVADNLGLAVAV-LATGL 371

Query: 244 CSSM---QGKMIVGPL-----LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
              +   + ++I+ P      L+ + +Y +  E++A  + T+N +R  +I   ++  G +
Sbjct: 372 TGKVVNDRARLII-PFVAFIPLATMDLYCIYRELKAVQLTTINKERAEIICDHWINVGTV 430

Query: 296 SSPADLRYREDLLFPGRLIE 315
                +   E L  P R+ E
Sbjct: 431 PGTKYVSDAERLFIPARMDE 450


>gi|14424560|gb|AAH09308.1| C16orf58 protein [Homo sapiens]
          Length = 468

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 105/411 (25%), Positives = 191/411 (46%), Gaps = 31/411 (7%)

Query: 35  VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
           ++R +  +P+G  + +V     P     +      F P G+P SV+  YL Y  + ++Q 
Sbjct: 44  LSRAFTVKPEGRDAGEVGASGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101

Query: 95  FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
           F S+    L+TQ++L   G+    A   A   +W++KD    +G+++ +   G+++D   
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161

Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
           K+WR+ AD L D+   LE+++P+CP  F       N AK I  VA  ATR  +    A+ 
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVCPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221

Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
            N++D+ AK  +  TL N+ GL   + +   V       +     L+ +H+Y+    +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281

Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
             + TLN  R  +++  +++ G +  P     R + L+ G     A ++ +G   H++V 
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RMEPLWTG--FWPAPSLSLGVPLHRLV- 337

Query: 332 PSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR----GWLVAAYAAS-- 381
            S   EL++L    +E ++LC         +VL Q A  +  LR    G ++ A      
Sbjct: 338 -SSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQKAGPKTILRAATHGLMLGALQGDGP 396

Query: 382 -----------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
                      +       S  V+++ ++ ++ +F   L  LQ  GW T++
Sbjct: 397 LPAELEELRNRVRAGPKKESWVVVKETHEVLDMLFPKFLKGLQDAGWKTEK 447


>gi|225579041|ref|NP_663565.2| UPF0420 protein C16orf58 homolog [Mus musculus]
 gi|74204001|dbj|BAE29003.1| unnamed protein product [Mus musculus]
 gi|148685697|gb|EDL17644.1| cDNA sequence BC017158, isoform CRA_b [Mus musculus]
          Length = 466

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 108/381 (28%), Positives = 181/381 (47%), Gaps = 39/381 (10%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
             P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 75  LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G++I +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--C 244
           N AK I  VA  ATR  +    A+  N++D+ AK  +  T+ N+ GL   + +   V  C
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254

Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
            S+     V  LL+ +H+Y+    +RA  + TLN  R  +++  F++ G +  PA     
Sbjct: 255 PSLSLGCFV--LLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQM 312

Query: 305 EDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSRWTD-- 357
           E L    +P        ++ +G   H +V  S   ELK+L     E ++LC         
Sbjct: 313 EPLWTGFWPSL------SLSLGVPLHHLV--SSVSELKQLVEGHHEPYLLCWNKSRNQVR 364

Query: 358 MVLEQNASGEDALR----GWLVAAYA------ASMAKSFH----DP---SLTVLQDAYDK 400
           + L Q A  E  LR    G ++ A          +A+  H    DP   S  ++++ +  
Sbjct: 365 VALSQEAGPETVLRAATHGLILGALQEDGPLPGELAQLRHQVQADPKKESWILVRETHQV 424

Query: 401 MNDVFTPLLSELQAKGWHTDR 421
           ++ +F   L  LQA GW T++
Sbjct: 425 LDTLFPKFLKGLQAAGWKTEK 445


>gi|357612137|gb|EHJ67828.1| hypothetical protein KGM_02553 [Danaus plexippus]
          Length = 525

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 95/367 (25%), Positives = 176/367 (47%), Gaps = 21/367 (5%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
           F   F P GYP SV++ Y+ Y  +   Q F S    +L+TQ +    G+  T A   A  
Sbjct: 47  FAKIFLPQGYPNSVSKDYIFYQIWDTAQAFCSTITGILATQEVFRGVGVGDTNASPLAAT 106

Query: 124 VSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
           V+W+ KDG  H+GK++ +   G  +D+  K+WR+ AD L D    +E+  PL        
Sbjct: 107 VTWVFKDGCGHIGKILFAYTHGTYLDAYSKKWRLYADTLNDAAMCIEIALPLFKNYITFA 166

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
             +    K I  VA  ATR+ +    A  GNL+D+ AK  A  T  N++   A + L S 
Sbjct: 167 LCVSTCMKAIVGVAGGATRVAMTQHHALRGNLADVSAKDSAQETAVNLIASFAALFLISL 226

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
           + +S+     +  +L ++H+      +RA  + TLN  R   ++  +++   I++P ++ 
Sbjct: 227 IGNSVT----IFIILLIMHIVFNYMAVRAVCLRTLNEPRFLQVIDTYLRKEVIANPCEIN 282

Query: 303 YREDLLF----PGRL-IEDAG-NVKVGRDPHKVVKPSK----FHELKELFAEEKFILC-- 350
             E ++F    P  L ++  G ++ +G    K++ P       +++K+++ ++K+I+   
Sbjct: 283 RNEPIIFYQLGPNLLDLKICGFHIIIGDSISKILNPRTNAVYINKVKDIYNDKKYIIHPD 342

Query: 351 LGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLS 410
            G+R   +  +++AS +D L  +  +   A +  + +D  L +     +     F  +  
Sbjct: 343 TGNRVMYVFPKEDASVDDMLCAYFQSVLLAIITCAINDHQLAIFSS--NNNTKPFAQVCV 400

Query: 411 ELQAKGW 417
            LQ+  W
Sbjct: 401 TLQSAEW 407


>gi|74201602|dbj|BAE28428.1| unnamed protein product [Mus musculus]
          Length = 466

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 108/381 (28%), Positives = 181/381 (47%), Gaps = 39/381 (10%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
             P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 75  LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G++I +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--C 244
           N AK I  VA  ATR  +    A+  N++D+ AK  +  T+ N+ GL   + +   V  C
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254

Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
            S+     V  LL+ +H+Y+    +RA  + TLN  R  +++  F++ G +  PA     
Sbjct: 255 PSLSLGCFV--LLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQM 312

Query: 305 EDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSRWTD-- 357
           E L    +P        ++ +G   H +V  S   ELK+L     E ++LC         
Sbjct: 313 EPLWTGFWPSL------SLSLGVPLHHLV--SSVSELKQLVEGHHEPYLLCWNKSRNQVR 364

Query: 358 MVLEQNASGEDALR----GWLVAAYA------ASMAKSFH----DP---SLTVLQDAYDK 400
           + L Q A  E  LR    G ++ A          +A+  H    DP   S  ++++ +  
Sbjct: 365 VALSQEAGPETVLRAATHGLILGALQEDGPLPGELAQLRHQVQADPKKESWILVRETHQV 424

Query: 401 MNDVFTPLLSELQAKGWHTDR 421
           ++ +F   L  LQA GW T++
Sbjct: 425 LDTLFPKFLKGLQAAGWKTEK 445


>gi|291410995|ref|XP_002721793.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
          Length = 468

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 102/376 (27%), Positives = 174/376 (46%), Gaps = 29/376 (7%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
           F P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 77  FLPQGFPESVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGLGVGNAKATVSAATATWL 136

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G++I +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       
Sbjct: 137 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDIAMFLEIMAPVFPVCFTMTVSTS 196

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   V   
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
               +    LL+ +H+Y+    +RA  + TLN  R  +++  F++ G +  P      E 
Sbjct: 257 PSVSLGCFFLLTALHIYANYRAVRALVMETLNESRLRLVLKHFLQQGEVLDPTPANQMEP 316

Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELF--AEEKFILCL--GSRWTDMVLEQ 362
            L+ G     +  + +G   H +V  S   EL++L    +E ++LC         +VL Q
Sbjct: 317 -LWTG--FSPSPALSLGVPLHCLV--SSMSELQQLIDGHQEPYLLCWDQSRNQVQVVLSQ 371

Query: 363 NASGEDALRGWLVAAYAASMAKSFHDP-----------------SLTVLQDAYDKMNDVF 405
            A  E  LR         ++ +    P                 S  V+++ +  ++ +F
Sbjct: 372 MAGPETVLRAATHGLVLGALREDGPLPGELEELRRWVRAGPEKDSWVVVKETHKVLDKLF 431

Query: 406 TPLLSELQAKGWHTDR 421
              L  LQA GW T++
Sbjct: 432 PKFLKGLQAAGWKTEK 447


>gi|395514351|ref|XP_003761381.1| PREDICTED: UPF0420 protein C16orf58 homolog [Sarcophilus harrisii]
          Length = 455

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 115/410 (28%), Positives = 195/410 (47%), Gaps = 41/410 (10%)

Query: 57  PVYHRVVDSFLNK-------FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL 109
           P  HR + S L+        F P G+P SV+  YL Y  +  +Q F S+    L+TQ++L
Sbjct: 45  PRLHRPLTSLLSSLPPSQAVFLPHGFPDSVSPDYLPYQLWDTVQAFASSLTGSLATQAVL 104

Query: 110 FAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGT 166
              G+    A   A  ++W+LKDG   +G++  + L G+++D + K+WR+ AD L D   
Sbjct: 105 QGVGVGDAAASVSAATITWLLKDGTGMLGRIGFAWLMGSKLDCDAKKWRLFADVLNDAAM 164

Query: 167 GLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIST 226
            LE+++P  P  F  +  + + AK I  VA  ATR  +    A+  N++D+ AK  +  T
Sbjct: 165 FLEIVAPAFPSCFTLIVCVSSLAKCIVGVAGGATRAALTMHQARRDNMADVSAKDGSQET 224

Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
           L N+ GL   + L   V    +  +    L + +H+Y+    +RA  + T N  R  +++
Sbjct: 225 LVNLAGLLVSLLLLPLVSERPRLSLAAFFLFTALHLYANYRAVRAVIMETFNETRLYLVL 284

Query: 287 ADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--E 344
             F++ G + +P     R + L+ G L   A ++ +G   H +V  S   EL+++    +
Sbjct: 285 KHFIQWGEVLTPPSAN-RMEPLWTGCL--PALSLSLGVALHSLV--SSAFELQQVSEGHQ 339

Query: 345 EKFIL----CLGSRWTDMVLEQNASGEDALR----GWLVAA-------------YAASMA 383
           + ++L    C G     + L Q AS    LR    G L+AA                 + 
Sbjct: 340 QPYLLRWNQCRGK--IQVTLGQEASAGVILRAATHGLLLAALLEDVPLTEELETLRGQIR 397

Query: 384 KSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD-RFLDGTGTRFAW 432
            +    S T++++ Y  ++ ++   L ELQA GW T+   L+    R AW
Sbjct: 398 AAPEGDSWTIVKETYQILDKLYPKFLKELQAVGWETEHHLLEVDEWRAAW 447


>gi|219119716|ref|XP_002180612.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217408085|gb|EEC48020.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 408

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 98/366 (26%), Positives = 175/366 (47%), Gaps = 16/366 (4%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS 125
            +  F P+ +P SV  GY ++  F  L   + +A  VLSTQ+LL A G+      A+V++
Sbjct: 26  LIQHFLPAQFPASVAPGYAQFCGFCFLASVSGSAGMVLSTQTLLLAIGVVGNAHSASVMA 85

Query: 126 ----WILKDGMQHVGKLI-CSNLGA--RMDSEPKRWRILADALYDLGTGLEVLSPLCPQ- 177
               W+LKDG+  +G +I  S +G   R D+ PK+WR++     D  + L++LSP  P  
Sbjct: 86  GALNWVLKDGIGQLGGVIFASRMGETRRFDANPKKWRMMGALCLDSASLLDILSPFAPSA 145

Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
           + L +A   N  K I  +   A+R  ++ +  + GNL D+ AK  + S    ++G G GI
Sbjct: 146 VVLPLACFANALKNIGYLTVGASRAALHQALTRAGNLGDVTAKAGSQSIAAGLLGTGVGI 205

Query: 238 QLASTVCSSMQGKMIVGPL-LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
            L++ +     G  ++G   LS+IH       ++   ++  N  R  +++  +++ G++ 
Sbjct: 206 GLSAAMGHD-AGNFVLGFCGLSLIHQLCNFAALQHVDLHHFNRHRLYLVLDHYLRHGKVL 264

Query: 297 SPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA---EEKFIL-CLG 352
           SP ++  +E      R  + A  + +G      + P   +EL +L     +E +I+ C  
Sbjct: 265 SPREIATKESFFPLLREDDTASWLSIG-SALTAISPGGPNELTQLLKACPDESYIINCTN 323

Query: 353 SRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSEL 412
                +V  Q+A GED ++G   A +         D    V+   +++    F   L EL
Sbjct: 324 DGKIHVVFLQDAQGEDLVKGMFHACFLRKKTGLSIDGVEAVVA-THEETKVRFPHFLEEL 382

Query: 413 QAKGWH 418
             + W+
Sbjct: 383 NRQAWN 388


>gi|334332835|ref|XP_001370394.2| PREDICTED: UPF0420 protein C16orf58 homolog [Monodelphis domestica]
          Length = 442

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 116/437 (26%), Positives = 198/437 (45%), Gaps = 45/437 (10%)

Query: 19  MSVQEIPI---YWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGY 75
           MS  E P+    W      + R     PDG LS +            + +    F P G+
Sbjct: 1   MSGSEGPVAIKCWERFGSRLPRSCTVAPDGSLSWEAGGGPGQGLRAALTAV---FLPHGF 57

Query: 76  PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQ 133
           P SV+  YL Y  +  +Q F S+    L+TQ++L   G+    A   A  ++W+LKDG  
Sbjct: 58  PDSVSPDYLPYQLWDTVQAFASSLTGSLATQAVLQGVGVGDAAASVSAATITWLLKDGTG 117

Query: 134 HVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
            +G++  + L G+R+D E K+WR+ AD L D    LE+++P  P  F  +  + + AK I
Sbjct: 118 MLGRIGFAWLMGSRLDCEAKKWRLFADVLNDAAMFLEIVAPGFPSCFTLIVCVSSLAKCI 177

Query: 193 AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMI 252
             VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + L   V +     + 
Sbjct: 178 VGVAGGATRAALTMHQARRDNMADVSAKDGSQETLVNLAGLLVSLLLLPLVSNRPSLSLA 237

Query: 253 VGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGR 312
              L +++H+Y+    +RA  + T N  R  +++  F++ G++ +P     R + L+ G 
Sbjct: 238 AFFLFTILHLYANFRAVRAVIMETFNEARLYLVLKHFLQWGKVLTPPSAN-RMEPLWTG- 295

Query: 313 LIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFIL----CLGSRWTDMVLEQNASG 366
               A ++ +G   H ++  S   EL+++    ++ ++L    C G     + L Q  S 
Sbjct: 296 -CHSALSLSLGVPLHSLI--SSVFELQQVSEGHQQPYLLRWDRCRGQ--IQVTLGQEVST 350

Query: 367 EDALRGWLVAAYAASMAKSFHD--------------------PSLTVLQDAYDKMNDVFT 406
              LR    A +   +A    D                     S T++++ Y  ++ ++ 
Sbjct: 351 GVILRA---ATHGLLLATLLEDVPLMEELETLRGQIRAAPEGDSWTIVKETYQILDKLYP 407

Query: 407 PLLSELQAKGWHTDRFL 423
             L ELQA GW T+  L
Sbjct: 408 KFLKELQAVGWETEHHL 424


>gi|440913323|gb|ELR62787.1| hypothetical protein M91_19503, partial [Bos grunniens mutus]
          Length = 437

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 104/384 (27%), Positives = 182/384 (47%), Gaps = 37/384 (9%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
            L  F P G+P SV+  YL Y  + ++Q F S+    L+T ++L   G+    A   A  
Sbjct: 42  LLAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAAT 101

Query: 124 VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
            +W++KD    +G+++ +  +G++MD   K+WR+ AD L D+   LE+++P+ P  F   
Sbjct: 102 ATWLVKDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTIT 161

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
             + N AK +  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   
Sbjct: 162 VCISNLAKCLVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLAGLLVSLLMLPM 221

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
           V  S    +     L+ +H+Y+    +RA  + TLN QR  +++  F++ G +  P    
Sbjct: 222 VSDSPSLSLGCFFFLTALHIYANYRAVRALVIETLNEQRLWLVLRHFLQRGEVLGPTSAN 281

Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSRW----- 355
             E  L+ G     + ++ +G   H V   S   EL++L    +E ++L    RW     
Sbjct: 282 QMEP-LWTG--FWSSVSLSLGAPLHHVT--SSVSELQQLVEGHQEPYLL----RWDQSRN 332

Query: 356 -TDMVLEQNASGEDALR----GWLVAAYAA------------SMAKSFHD-PSLTVLQDA 397
              +VL Q A  E  LR    G ++ A               + A++  +  S  V+++ 
Sbjct: 333 QVQVVLSQMAGPEAILRAATHGLVLGALRGDGPLPEELEELRNQARAGPEKESWVVVRET 392

Query: 398 YDKMNDVFTPLLSELQAKGWHTDR 421
           +  ++ +F   L  LQ  GW T++
Sbjct: 393 HQVLDKLFPKFLKGLQDAGWSTEK 416


>gi|281201954|gb|EFA76161.1| DUF647 family protein [Polysphondylium pallidum PN500]
          Length = 430

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 103/395 (26%), Positives = 182/395 (46%), Gaps = 51/395 (12%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P+GYP SV   Y+ Y  +   Q  +S+   VL+T+++L   G+    A   + V+ WI
Sbjct: 51  FLPAGYPDSVTSDYMSYQVWDTAQALSSSLSGVLATRAMLRGYGVGNAAASVGSAVLQWI 110

Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           L+DG   +G++  +   G ++D   K WR +AD L D+G  LE+LSPL   LF+ ++  G
Sbjct: 111 LRDGAGMIGRIYFAWKTGRQLDCNAKTWRFMADILNDIGMTLEMLSPLSNVLFMPLSVCG 170

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--- 243
             +K I  VA   T+  +   FAK  N+ D+ AK  +  T   +VG+  G  L++++   
Sbjct: 171 IVSKSICGVAGGCTKASLTMHFAKNDNMGDVSAKDGSQETAVGLVGMFLGAALSASIPDE 230

Query: 244 ----CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
                ++     ++  +L   H+Y     +    + ++N  R  +I   +++ G I +P 
Sbjct: 231 SEQTTTATVAIWMLFFILIAFHLYCNYRAVTGVVLTSINRYRATLIYKHYLQYGEIPTPI 290

Query: 300 DLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKF-------HELKELFAEEKFILCLG 352
            +  RE++LF     +D   + +G     ++  +K         E++E          L 
Sbjct: 291 QVSKRENILFR----DDKQPIYLGASLLNIINLNKQQQQQQQSKEIRESKTNSIIRHKLE 346

Query: 353 SRWTD---------MVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMND 403
           +R+ D         + L+ + +  D     L+ AY  S+A         +L     + N 
Sbjct: 347 NRYIDIWQFQQEIYISLKVHCTAVD-----LIQAYLYSVA---------ILHQTKQQNNR 392

Query: 404 VFTPLLS------ELQAKGWHTDR-FLDGTGTRFA 431
            F  L++       L +KGW+TDR  LD  G R++
Sbjct: 393 AFIDLVTSKSFNDRLLSKGWNTDRQALDAEGWRYS 427


>gi|354505561|ref|XP_003514836.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cricetulus griseus]
 gi|344256981|gb|EGW13085.1| UPF0420 protein C16orf58-like [Cricetulus griseus]
          Length = 466

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 105/381 (27%), Positives = 180/381 (47%), Gaps = 39/381 (10%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
             P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 75  LLPQGFPDSVSTDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGDAKASVSAATSTWL 134

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           LKD    +G++I +   G+++D   K+WR+ AD L D+   LE+++P+ P  F     + 
Sbjct: 135 LKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADVLNDIAMFLEIMAPMYPIFFTMTVSIS 194

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--C 244
           N AK I  VA  ATR  +    A+  N++D+ AK  +  T+ N+ GL   + +   V  C
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254

Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
            S+     +  LL+ +H+Y+    +RA  + TLN  R  +++  F++ G + +PA     
Sbjct: 255 PSLSLGCFI--LLTALHIYANYRAVRALVLETLNESRLQLLLKHFLQRGEVLAPASANQM 312

Query: 305 EDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLG--SRWTD 357
           E L    +P        ++ +G   H +V  S   EL++L    +E ++LC         
Sbjct: 313 EPLWTGFWPSL------SLSLGVPLHHLV--SSVFELEQLVEGHQEPYLLCWNQSQNQVQ 364

Query: 358 MVLEQNASGEDALR--------------GWLVAAYAASMAKSFHDP---SLTVLQDAYDK 400
           + L Q A  E  LR              G L    A    ++   P   S  ++++ +  
Sbjct: 365 VALSQVAGPEAVLRAATHGLVLGALQEDGPLPKELAELREQARAGPKKESWILVRETHQV 424

Query: 401 MNDVFTPLLSELQAKGWHTDR 421
           ++ +F   L  LQA GW T++
Sbjct: 425 LDTLFPKFLKGLQAAGWKTEK 445


>gi|78486556|ref|NP_001030594.1| UPF0420 protein C16orf58 homolog [Rattus norvegicus]
 gi|123781627|sp|Q499P8.1|CP058_RAT RecName: Full=UPF0420 protein C16orf58 homolog
 gi|71682994|gb|AAH99813.1| Similar to cDNA sequence BC017158 [Rattus norvegicus]
          Length = 466

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 101/379 (26%), Positives = 176/379 (46%), Gaps = 35/379 (9%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
             P G+P SV+  YL+Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 75  LLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G++I +   G+++D   K+WR+ AD L D    LE+++P+ P  F       
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTVSTS 194

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N AK I  VA  ATR  +    A+  N++D+ AK  +  T+ N+ GL   + +   V   
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
           +   +    LL+ +H+Y+    +RA  + TLN  R  +++  F++ G +  PA     E 
Sbjct: 255 LSLSLGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHFLQRGEVLEPASANQMEP 314

Query: 307 L---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLG--SRWTDMV 359
           L    +P        ++ +G   H +V  S   ELK+L    +E ++LC         + 
Sbjct: 315 LWTGFWPSL------SLSLGVPLHHLV--SSVSELKQLVEGHQEPYLLCWNQSQNQVQVA 366

Query: 360 LEQNASGEDALR----GWLVAAYAAS-------------MAKSFHDPSLTVLQDAYDKMN 402
           L Q A  E  LR    G ++ A                 +     + S  ++++ +  ++
Sbjct: 367 LSQVAGPETVLRAATHGLILGALQEDGPLPGELAELRDMVQAGPKNESWILVRETHQVLD 426

Query: 403 DVFTPLLSELQAKGWHTDR 421
            +F   L  LQA GW T++
Sbjct: 427 TLFPKFLKGLQAAGWKTEK 445


>gi|357484331|ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
 gi|355513788|gb|AES95411.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
          Length = 630

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 85/303 (28%), Positives = 161/303 (53%), Gaps = 7/303 (2%)

Query: 58  VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RP 116
           +Y++  +  +    P G+P SV   YL Y+ +R +Q        VL+TQ+LL+A GL + 
Sbjct: 167 LYNKCREFVVRLMLPEGFPNSVTSDYLEYSLWRGVQGVACQVSGVLATQALLYAVGLGKG 226

Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
               A  ++W+LKDG+ ++ K++ S+ G   D  PK WR+ AD L +   GLE+ +P  P
Sbjct: 227 AIPTAAAINWVLKDGIGYLSKILLSDFGRHFDVNPKGWRLFADLLENAAFGLEMCTPAFP 286

Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
            LF+ +      ++  A +   +TR   ++ FA + N +++ AKGE    +   +G+G G
Sbjct: 287 HLFVPIGAFAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMVSRFIGIGIG 346

Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
           I L + + SS    +    +++ +H+Y  ++  ++  + TLNP R +++ ++++ +G+ +
Sbjct: 347 IGLGNCIGSSTPLVLASFCVVTWVHMYCNLKSYQSIQLRTLNPHRASLVFSEYLLSGQ-A 405

Query: 297 SPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWT 356
            P      E+ LFP   I +A      ++   +V  S   E K+   E +  L LGS+ +
Sbjct: 406 PPVKEVNAEEPLFPAVPILNAPFAN--KETQSIVLSS---EAKDAAVEIESRLQLGSKLS 460

Query: 357 DMV 359
           +++
Sbjct: 461 EII 463


>gi|168013684|ref|XP_001759421.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689351|gb|EDQ75723.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 712

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 91/302 (30%), Positives = 158/302 (52%), Gaps = 27/302 (8%)

Query: 76  PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGM 132
           P  V   YL Y +++ +  F S+ L    TQ++L+A G+   R  PA A  ++W++KDG+
Sbjct: 283 PDQVWPHYLVYLKWKLVHRFLSSVLHFQCTQAMLWAVGVGAKRRLPA-AAALNWVMKDGV 341

Query: 133 QHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
             +GKL+  +N G   DS+ KR R     L+    GLE+L+PL PQ FL +A L N AK 
Sbjct: 342 GRLGKLLFTANFGRTFDSDLKRVRFWTSVLFSSSVGLEILTPLFPQHFLLLATLANIAKS 401

Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK- 250
           IA  A  AT   I+ SFA   NL+D+ AKG A + + + +GL A + L+  + +  + + 
Sbjct: 402 IAYAAYLATSSAIHRSFALGDNLADISAKGMAQTVVADNLGLAAAVCLSQFIHNFPKIEK 461

Query: 251 ---MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
              + + P+L+   ++++ ++++A  + TLN +R  +I+  +VK  RI S  ++   E++
Sbjct: 462 FLPLAMYPVLASAELFAIYQQLQAVHLQTLNKERLEIIINIWVKDRRIPSFEEVSKIENV 521

Query: 308 ----LFP--------------GRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFIL 349
               LF               G   ++  ++++G    + + P     +  L   +K+ L
Sbjct: 522 ALLDLFRVFGVNMVFANGGEGGSGFKNQLSLRIGAVSLRYLTPEGLMNIVNLRETQKYTL 581

Query: 350 CL 351
           CL
Sbjct: 582 CL 583


>gi|197101105|ref|NP_001126133.1| UPF0420 protein C16orf58 homolog [Pongo abelii]
 gi|75041556|sp|Q5R8F6.1|CP058_PONAB RecName: Full=UPF0420 protein C16orf58 homolog
 gi|55730464|emb|CAH91954.1| hypothetical protein [Pongo abelii]
          Length = 468

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 105/411 (25%), Positives = 191/411 (46%), Gaps = 31/411 (7%)

Query: 35  VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
           ++R +  +P+G  S +V     P     +      F P G+P SV+  YL Y  + ++Q 
Sbjct: 44  LSRAFTVKPEGRDSGEVGAPGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101

Query: 95  FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
           F S     L+TQ++L   G+    A   A   +W++KD    +G+++ +   G+++D   
Sbjct: 102 FASGLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161

Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
           K+WR+ AD L D+   LE+++P+ P  F       N AK I  VA  ATR  +    A+ 
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221

Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
            N++D+ AK  +  TL N+VGL   + +   V       +     L+ +H+Y+    +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLVGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281

Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
             + TLN  R  +++  +++ G + +P     R + L+ G     A ++ +G   H++V 
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLNPTAAN-RMEPLWTG--FWPAPSLSLGVPLHRLV- 337

Query: 332 PSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR----GWLVAAYAAS-- 381
            S   EL++L    +E ++LC         +VL Q A  +  LR    G ++ A      
Sbjct: 338 -SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTILRAATHGLMLGALQGDGP 396

Query: 382 -----------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
                      +       S  ++++ ++ ++ +F   L  LQ  GW T++
Sbjct: 397 LPAELEELRNRVQAGPKKESWVIVKETHEVLDMLFPKFLKGLQDAGWKTEK 447


>gi|149067630|gb|EDM17182.1| similar to cDNA sequence BC017158, isoform CRA_b [Rattus
           norvegicus]
          Length = 466

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 101/379 (26%), Positives = 175/379 (46%), Gaps = 35/379 (9%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
             P G+P SV+  YL+Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 75  LLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G++I +   G+++D   K+WR+ AD L D    LE+++P+ P  F       
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTVSTS 194

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N AK I  VA  ATR  +    A+  N++D+ AK  +  T+ N+ GL   + +   V   
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
           +        LL+ +H+Y+    +RA  + TLN  R  +++  F++ G +  PA     E 
Sbjct: 255 LSLSFGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHFLQRGEVLEPASANQMEP 314

Query: 307 L---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLG--SRWTDMV 359
           L    +P        ++ +G   H +V  S   ELK+L    +E ++LC         + 
Sbjct: 315 LWTGFWPSL------SLSLGVPLHHLV--SSVSELKQLVEGHQEPYLLCWNQSQNQVQVA 366

Query: 360 LEQNASGEDALR----GWLVAAYAAS-------------MAKSFHDPSLTVLQDAYDKMN 402
           L Q A  E  LR    G ++ A                 +     + S  ++++ +  ++
Sbjct: 367 LSQVAGPETVLRAATHGLILGALQEDGPLPGELAELRDRVQAGPKNESWILVRETHQVLD 426

Query: 403 DVFTPLLSELQAKGWHTDR 421
            +F   L  LQA GW T++
Sbjct: 427 TLFPKFLKGLQAAGWKTEK 445


>gi|449452843|ref|XP_004144168.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
           [Cucumis sativus]
          Length = 437

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 97/393 (24%), Positives = 185/393 (47%), Gaps = 39/393 (9%)

Query: 56  RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
           R V+ R  D+F+    P G+P SV   Y+ +  + +LQ  ++   ++LSTQ+LL A G+ 
Sbjct: 45  RLVWRRFFDAFV----PEGFPSSVTPDYVPFQIWDSLQGLSTYIRTMLSTQALLSAIGVG 100

Query: 116 PTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLS 172
              A        W L+D    +G ++ +   G+ +DS  K WR++AD + DLG  ++++S
Sbjct: 101 EQSATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLVS 160

Query: 173 PLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG 232
           PL P  F+ +  LG+ ++    VA+ ATR  +   FA + N +D+ AK  +  T+  ++G
Sbjct: 161 PLFPSAFMLIVCLGSLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMIG 220

Query: 233 LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
           +  G+ LA           +    L++ H+Y+  + +    + +LNPQR ++ +  +++T
Sbjct: 221 MAIGMLLARITMGCPLAIWLSFLSLTMFHMYANYKAVCCLSLTSLNPQRCSIALQHYIQT 280

Query: 293 GRISSPADLRYREDLLFPG----------RLIEDAGNVKVG---RDPHKVVKPSKFHELK 339
             + SP  +   E +L P           +LI     V++G      H     + F++L 
Sbjct: 281 AEVLSPQQVSMMEHVL-PAWASSWSLRSIKLIHT--RVRLGARISSLHHTEMXAFFNQLM 337

Query: 340 EL-----FAEEKFILCLGSRWTDMVLEQNASGEDALRGW---LVAAYAASMAKSFHDPSL 391
                  +   K++L        ++++++A+  D  + +   LV AY       + +   
Sbjct: 338 NFVKSSGYKGAKYLLVERKETISVIVQKDATANDIFQSFVHALVMAYVPDQESRYFE--- 394

Query: 392 TVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLD 424
                +   M+  +   + +L+  GW TDR L 
Sbjct: 395 -----SMSWMDKHYECFIEKLKISGWKTDRLLS 422


>gi|328778772|ref|XP_394441.3| PREDICTED: UPF0420 protein C16orf58 homolog [Apis mellifera]
          Length = 464

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 104/370 (28%), Positives = 178/370 (48%), Gaps = 21/370 (5%)

Query: 55  SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
           ++  Y  ++      F P GYP SV+  Y  Y  +  +Q F S  +  L+T S++   G+
Sbjct: 38  TKSFYSGLISIIKEVFLPQGYPDSVHPDYTSYQIWDTVQAFASTIMGTLTTHSIMQGVGV 97

Query: 115 --RPTPAQATVVSWILKDGMQHVGKLICS--NLGARMDSEPKRWRILADALYDLGTGLEV 170
                   A  ++WILKDG   VG+++ +  N G  +D + K+WR+ AD L DL  GLE+
Sbjct: 98  GEAAATPLAAAITWILKDGTGMVGRIMFAWWN-GTDLDGQCKKWRLFADILNDLAMGLEL 156

Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
           L P      L +  +    K I  VA  ATR  +    A + NL+D+ AK  +  T  N+
Sbjct: 157 LLPYFSSYSLGILCISTAMKSIVGVAGGATRAALTQHQALQNNLADVSAKDGSQETCVNL 216

Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
           +    GI + S +  + Q  M +   L +IH+Y+    ++A  +N+LN  R A+IV  ++
Sbjct: 217 IASFVGILILS-IFHNGQYIMELYLFLVIIHLYANYSAVKALCLNSLNEDRLALIVKSYI 275

Query: 291 KTGRISSPADLRYRED-LLFPGRLIEDAG-NVKVGRDPHKVVKPSKF----HELK-ELFA 343
               I  P ++  +E  LL     +   G N+K+G     ++K         EL  +LF 
Sbjct: 276 SNEVIPKPEEVNKKESVLLLTKSTMNICGFNIKIGVSLAALIKKDLILTSDTELSLKLFL 335

Query: 344 EEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDK- 400
           + K+++ +   ++   + L+++A   D L  +    + A +   F   SL V  D + K 
Sbjct: 336 DRKYLISIDVQNKTIFICLKKDAQPYDVLEAY----FHACLCGLFICMSLKVPLDVFLKP 391

Query: 401 -MNDVFTPLL 409
            ++D+  PL+
Sbjct: 392 EVSDLSYPLM 401


>gi|207080128|ref|NP_001128778.1| DKFZP468D197 protein [Pongo abelii]
 gi|55727564|emb|CAH90537.1| hypothetical protein [Pongo abelii]
          Length = 466

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 106/413 (25%), Positives = 193/413 (46%), Gaps = 37/413 (8%)

Query: 35  VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
           ++R +  +P+G  S +V     P     +      F P G+P SV+  YL Y  + ++Q 
Sbjct: 44  LSRAFTVKPEGRDSGEVGAPGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101

Query: 95  FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
           F S+    L+TQ++L   G+    A   A   +W++KD    +G+++ +   G+++D   
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161

Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
           K+WR+ AD L D+   LE+++P+ P  F       N AK I  VA  ATR  +    A+ 
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221

Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--CSSMQGKMIVGPLLSVIHVYSVIEEM 269
            N++D+ AK  +  TL N+VGL   + +   V  C  +         L+ +H+Y+    +
Sbjct: 222 NNMADVSAKDSSQETLVNLVGLLVSLLMLPLVSGCPGLGCFF----FLTALHIYANYRAV 277

Query: 270 RAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKV 329
           RA  + TLN  R  +++  +++ G + +P     R + L+ G     A ++ +G   H++
Sbjct: 278 RALVMETLNEGRLRLVLKHYLQRGEVLNPTAAN-RMEPLWTG--FWPAPSLSLGVPLHRL 334

Query: 330 VKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR----GWLVAAYAAS 381
           V  S   EL++L    +E ++LC         +VL Q A  +  LR    G ++ A    
Sbjct: 335 V--SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTILRAATHGLMLGALQGD 392

Query: 382 -------------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
                        +       S  ++++ ++ ++ +F   L  LQ  GW T++
Sbjct: 393 GPLPAELEELRNRVQAGPKKESWVIVKETHEVLDMLFPKFLKGLQDAGWKTEK 445


>gi|312374874|gb|EFR22346.1| hypothetical protein AND_15402 [Anopheles darlingi]
          Length = 417

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 99/381 (25%), Positives = 181/381 (47%), Gaps = 28/381 (7%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
           F + F P GYP SV+  YL Y ++  +Q F S     L+T ++L   G+      A +  
Sbjct: 49  FQHLFLPVGYPDSVSSDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDSVTALSAT 108

Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLS-PLCPQLFLE 181
           V+W+LKDGM H+G+++ +   G  +D + K+WRI AD L D+   +++   P  P+    
Sbjct: 109 VTWVLKDGMGHLGRILFAWWKGTELDIDSKKWRIRADVLNDIAMAIDLFVLPYYPKASTY 168

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
           +       K I  VA  ATR  +    A  GNL+D+ +K  A  T  N++    G+ L  
Sbjct: 169 ILCGTTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLLLL- 227

Query: 242 TVCSSMQGKMIVGPLLSVI---HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
              + +Q + I+  L + +   H+Y+ I+ ++A  + T N  R  + + ++ K+G + SP
Sbjct: 228 ---TYLQNQRILYGLFAFVTFLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGMMLSP 284

Query: 299 ADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSK-FHELKELFA----EEKFILCLGS 353
             +   E +   G+ +     VK+G    ++ +  +  ++L+ L A     EKF++    
Sbjct: 285 RQVNKLERVTV-GQTVSLTARVKIGCSVRELTEFYRNCYDLENLIACFDSREKFLIAETR 343

Query: 354 RWTDMVLEQNASGEDALRGWL-VAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSEL 412
            +  + L  +    D ++ +  VA+Y    ++         L+D Y ++ + +   L+  
Sbjct: 344 HYVGVYLHFSVKPLDIIKSYFYVASYLQDKSQ---------LRDRYWEIQNKWNEFLNLA 394

Query: 413 QAKGWHTDRFLDGTGT-RFAW 432
           Q +GW+    L  T   R  W
Sbjct: 395 QCEGWNVQAHLLKTDEYRLDW 415


>gi|344294431|ref|XP_003418921.1| PREDICTED: UPF0420 protein C16orf58 homolog [Loxodonta africana]
          Length = 468

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 103/390 (26%), Positives = 176/390 (45%), Gaps = 43/390 (11%)

Query: 63  VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
           +   L  F P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   
Sbjct: 70  LSGLLAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGALATQAVLLGIGVGDAKASVS 129

Query: 121 ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
           A   +W++KD    +G++I +   G+++D   K+WR+ AD L D+   LE+++P+ P  F
Sbjct: 130 AATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVLPICF 189

Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
                  N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +
Sbjct: 190 TITVCTSNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249

Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
              V       +     L+ +H+Y+    +RA  + TLN  R  +I+  F++ G + SP 
Sbjct: 250 LPLVSDCPSVSLGCFFFLTALHIYANYRAVRALVMETLNEGRLWLILKHFLQKGEVLSPT 309

Query: 300 DLRYREDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSR 354
                E L    +P        ++ +G   H ++  S   EL++L    +E ++L     
Sbjct: 310 SANQMEPLWTGFWPSL------SLSLGVPLHCLI--STVSELQQLIGGHQEPYLL----H 357

Query: 355 W------TDMVLEQNASGEDALR----GWLVAAYAA-------------SMAKSFHDPSL 391
           W        +VL Q A  E  LR    G ++ A                 M       S 
Sbjct: 358 WDQSRNQVQVVLSQTAGPETVLRAATHGLVLGALQGDGPLPGELEELRNQMWAGPEKDSW 417

Query: 392 TVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
            ++++ +  ++ +F   L  LQ  GW T++
Sbjct: 418 AIVKETHQVLDKLFPKFLKGLQDAGWKTEK 447


>gi|257900512|ref|NP_073581.2| UPF0420 protein C16orf58 [Homo sapiens]
 gi|143955305|sp|Q96GQ5.2|CP058_HUMAN RecName: Full=UPF0420 protein C16orf58
 gi|62897385|dbj|BAD96633.1| hypothetical protein FLJ13868 variant [Homo sapiens]
 gi|119572504|gb|EAW52119.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
 gi|119572505|gb|EAW52120.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
 gi|119572506|gb|EAW52121.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
          Length = 468

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 104/411 (25%), Positives = 190/411 (46%), Gaps = 31/411 (7%)

Query: 35  VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
           ++R +  +P+G  + +V     P     +      F P G+P SV+  YL Y  + ++Q 
Sbjct: 44  LSRAFTVKPEGRDAGEVGASGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101

Query: 95  FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
           F S+    L+TQ++L   G+    A   A   +W++KD    +G+++ +   G+++D   
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161

Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
           K+WR+ AD L D+   LE+++P+ P  F       N AK I  VA  ATR  +    A+ 
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221

Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
            N++D+ AK  +  TL N+ GL   + +   V       +     L+ +H+Y+    +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281

Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
             + TLN  R  +++  +++ G +  P     R + L+ G     A ++ +G   H++V 
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RMEPLWTG--FWPAPSLSLGVPLHRLV- 337

Query: 332 PSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR----GWLVAAYAAS-- 381
            S   EL++L    +E ++LC         +VL Q A  +  LR    G ++ A      
Sbjct: 338 -SSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQKAGPKTILRAATHGLMLGALQGDGP 396

Query: 382 -----------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
                      +       S  V+++ ++ ++ +F   L  LQ  GW T++
Sbjct: 397 LPAELEELRNRVRAGPKKESWVVVKETHEVLDMLFPKFLKGLQDAGWKTEK 447


>gi|156379270|ref|XP_001631381.1| predicted protein [Nematostella vectensis]
 gi|156218420|gb|EDO39318.1| predicted protein [Nematostella vectensis]
          Length = 259

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 73/245 (29%), Positives = 131/245 (53%), Gaps = 3/245 (1%)

Query: 68  NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVS 125
           + F P GYP SV++ YL Y  +  +Q F S+    L+TQ++L  +  G     A A  ++
Sbjct: 3   SAFLPQGYPESVSKDYLTYQFWDTVQAFCSSITGTLATQAMLKGYGVGNESATALAATMT 62

Query: 126 WILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
           WIL+ G   VG ++ + + G+ +D   K+WR+ AD + DL   +E++SP     F  +A 
Sbjct: 63  WILRSGTGMVGSILFAWMQGSNLDCNAKKWRLFADIMNDLAILVELISPYFKAYFTLLAC 122

Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
           L + AK I  VA  +TR  +    A+  N++D+ AK  +  TL N++ L AG+ +   V 
Sbjct: 123 LSSVAKSIVGVAGGSTRAALTQHQARRDNMADVAAKDGSQETLVNLMALLAGLVITPLVT 182

Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
                  ++  + + +H+Y+    + +  + TLN  R  ++  +F++TG + SP ++   
Sbjct: 183 GHTLLTWVLFIIFTFLHLYANYCAVSSVVMETLNTTRLQIVTNEFLRTGHVLSPDEVGKM 242

Query: 305 EDLLF 309
           E ++F
Sbjct: 243 EPVIF 247


>gi|10436021|dbj|BAB14730.1| unnamed protein product [Homo sapiens]
          Length = 468

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 104/411 (25%), Positives = 190/411 (46%), Gaps = 31/411 (7%)

Query: 35  VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
           ++R +  +P+G  + +V     P     +      F P G+P SV+  YL Y  + ++Q 
Sbjct: 44  LSRAFTVKPEGRDAGEVGASGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101

Query: 95  FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
           F S+    L+TQ++L   G+    A   A   +W++KD    +G+++ +   G+++D   
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161

Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
           K+WR+ AD L D+   LE+++P+ P  F       N AK I  VA  ATR  +    A+ 
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221

Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
            N++D+ AK  +  TL N+ GL   + +   V       +     L+ +H+Y+    +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281

Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
             + TLN  R  +++  +++ G +  P     R + L+ G     A ++ +G   H++V 
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RMEPLWTG--FWPAPSLSLGVPLHRLV- 337

Query: 332 PSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR----GWLVAAYAAS-- 381
            S   EL++L    +E ++LC         +VL Q A  +  LR    G ++ A      
Sbjct: 338 -SSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQKAGPKTILRAATHGLMLGALQGDGP 396

Query: 382 -----------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
                      +       S  V+++ ++ ++ +F   L  LQ  GW T++
Sbjct: 397 LPAELEELRNRVRAGPKKESWVVVKETHEVLDMLFPKFLKGLQDAGWKTEK 447


>gi|255562731|ref|XP_002522371.1| mRNA, complete cds, clone: rafl24-09-C04, putative [Ricinus
           communis]
 gi|223538449|gb|EEF40055.1| mRNA, complete cds, clone: rafl24-09-C04, putative [Ricinus
           communis]
          Length = 523

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 99/390 (25%), Positives = 184/390 (47%), Gaps = 34/390 (8%)

Query: 76  PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGM 132
           P  V E Y+ Y +++ L    S+AL VL+TQ++  A G+   R  P+ A  ++W+LKDG+
Sbjct: 132 PKQVTENYMDYVKWKLLHRVFSSALQVLATQAMFRAVGIGYSRSLPS-AAALNWVLKDGL 190

Query: 133 QHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
             + + I  ++L +  D+  KR R     L+ LG GLE+L+P  P+ FL +A + N AK 
Sbjct: 191 GRLSRCIYTASLASAFDTNLKRVRFSTSILFSLGIGLELLTPAFPKQFLLLATIANIAKQ 250

Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ--- 248
           I++    AT   I+ SFA   NL ++ AK +  +  F+ +GL     L     ++ +   
Sbjct: 251 ISLACHLATGSAIHRSFAVADNLGEVSAKAQIQTVCFDNLGLMLAALLNMLFKNNQRLLA 310

Query: 249 -GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
               +V P+ S I ++ + + ++   + TL   R  MI+  ++  GR+ +PA++  +E  
Sbjct: 311 GLPFVVYPIFSAIDLFGIYQGLKHVHLQTLTKDRLEMILDSWIALGRVPTPAEVSKKEGF 370

Query: 308 LFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMV------- 359
            F     ++   +++G  +    +       ++ L +E+ + +C+ + +  +        
Sbjct: 371 EFLRTKGKELWPIRIGCLNAKDQISKLSVIAMQSLSSEDYYFICVENSYRGLTRTKQQGI 430

Query: 360 ---LEQNASGEDALRGWLVAAYAA-----SMAKSFHDPSLTV-------LQDAYDKMNDV 404
              L + A   D + G L A Y       S  ++  DP  TV       ++D+       
Sbjct: 431 LLSLREGAGTADVIMGLLQACYIRKALLFSRWETISDPGSTVHKECIKLIEDSKQSAEKD 490

Query: 405 FTPLLSELQAKGWHTDRFLDGT--GTRFAW 432
            + L  ++ A GW     L  T   TR+++
Sbjct: 491 LSMLGDQMTALGWAVKNILLSTEEQTRYSF 520


>gi|307205774|gb|EFN84004.1| UPF0420 protein C16orf58 [Harpegnathos saltator]
          Length = 466

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 87/337 (25%), Positives = 160/337 (47%), Gaps = 17/337 (5%)

Query: 54  DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
           +S+ +Y R V      F P GYP SV   YL Y  +  +Q F S  +  L+T S++   G
Sbjct: 35  ESQDIYFRFVSFVKEVFLPQGYPDSVRADYLAYQIWDTVQAFASTIMGTLTTHSVMQGIG 94

Query: 114 LRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV 170
           +  + A   A  ++WILK G   +G ++ +   G ++D + K+WR+ AD L D+  GLE+
Sbjct: 95  VGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTQLDGQCKKWRLFADILNDIAMGLEL 154

Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
           + P      + +       K +  VA  ATR  +    A   NL+D+ AK  +  T  N+
Sbjct: 155 MVPYLSPYSVPILCTSTIMKSVVGVAGGATRAALTQHQAIRNNLADVSAKDSSQETCVNL 214

Query: 231 VGLGAGIQLASTVCSSMQGKMIVG--PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
           +    GI + S       G+ ++     L  +H+Y+    ++A  ++TLN  R A+I+ +
Sbjct: 215 IASFVGIFVLSLF---HDGRYLIELYLFLVAVHLYANYSAVKALCLDTLNEDRLALIIKN 271

Query: 289 FVKTGRISSPADLRYREDLLFPGRLIEDAG--NVKVGRD-----PHKVVKPSKFHELKEL 341
           ++   RIS P  +   E +   G   ++    N+K+G          V+   +   L + 
Sbjct: 272 YMMNERISEPQKINKEESVFLLGNPSKEICGFNIKIGASFASLLKRNVISSMEMEFLLKF 331

Query: 342 FAEEKFILCLG--SRWTDMVLEQNASGEDALRGWLVA 376
           F + ++++ +   +R   ++L+++    + L G+  A
Sbjct: 332 FEDRRYLITIDIETRTIFIILKKDIKPIEVLEGYFYA 368


>gi|58392188|ref|XP_319167.2| AGAP010020-PA [Anopheles gambiae str. PEST]
 gi|55236239|gb|EAA13864.2| AGAP010020-PA [Anopheles gambiae str. PEST]
          Length = 417

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 97/380 (25%), Positives = 181/380 (47%), Gaps = 26/380 (6%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
           F + F P GYP SV+  YL Y ++  LQ F S     L+T ++L   G+    A   +  
Sbjct: 49  FQHLFLPVGYPDSVSSDYLSYQKWDTLQAFCSTISGTLTTHAILKGVGVGSDAANPLSAT 108

Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLS-PLCPQLFLE 181
           ++W+LKDG  H+G+++ +   G+ +D + K+WRI AD L D+   +++   P  P+    
Sbjct: 109 ITWVLKDGTGHLGRILFAWWKGSELDIDSKKWRIRADILNDVAMAIDLFVLPYYPKASTY 168

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
           +       K I  VA  ATR  +    A  GNL+D+ +K  A  T  N++    G+ L +
Sbjct: 169 ILCATTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLFLLT 228

Query: 242 TVCSSMQGKMIVG--PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
            +      K++ G    +++IH+Y+ I+ ++A  + T N  R  + + ++ K+G + SP 
Sbjct: 229 YL---QNQKVLYGLFAFVTLIHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGMMLSPQ 285

Query: 300 DLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSK-FHELKELFA----EEKFILCLGSR 354
            +   E +   G+ +     VK+G    ++ +  +  ++L+ L A     +KF++     
Sbjct: 286 QVNKLERVTV-GQTVTLTARVKIGCSVRELTEFYRNCYDLENLIACFDSRDKFLIAETRH 344

Query: 355 WTDMVLEQNASGEDALRGWL-VAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQ 413
           +  + L       D ++ +  VA+Y    ++         L+D Y ++ + +   L+  Q
Sbjct: 345 YVGVYLHFTVKPLDIIKSYFYVASYLQDKSQ---------LRDRYWEIQNKWNEFLNMAQ 395

Query: 414 AKGWHTDRFLDGTGT-RFAW 432
            +GW+    L  T   R  W
Sbjct: 396 CEGWNVHAHLLKTDEYRLDW 415


>gi|194758785|ref|XP_001961639.1| GF15067 [Drosophila ananassae]
 gi|190615336|gb|EDV30860.1| GF15067 [Drosophila ananassae]
          Length = 417

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 96/357 (26%), Positives = 164/357 (45%), Gaps = 19/357 (5%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
           F P GYP SV++ Y  Y  +   Q F S     L T ++L     G     A +  V+WI
Sbjct: 53  FLPKGYPDSVSDDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAYSATVTWI 112

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
           LK+G  HVG++  +   G+++D + K+WR+ AD L DL  G+E+ + P  P L  ++   
Sbjct: 113 LKEGSGHVGRIFFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQILCG 172

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
               K I  VA  ATR  +    A  GNL+D+ +K  +  T  N+V    G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232

Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
                 I   ++S +H+YS ++ +RA  + T N  R  + + +F ++ R+ SP  +   E
Sbjct: 233 QAVLYTIFYVVVS-LHLYSNLKAVRAVCLRTFNESRYLIALEEFFRSSRMLSPQQVNAME 291

Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDMVL 360
            +   G+ +  + N+K+G     ++   K   + +         E FI+  G +   + L
Sbjct: 292 RVTI-GQTVSVSLNIKLGLSVKNLIDEYKSSSVIDNIVSSFDPHEHFIIAEGKKCLGVYL 350

Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGW 417
             +    D L+ +  A           D S   ++D Y  +   +   ++  Q +GW
Sbjct: 351 HFDTRPLDVLKAYFFA------VSYLQDRSQ--IKDNYWDIQSKWQEFMALAQKEGW 399


>gi|402908238|ref|XP_003916859.1| PREDICTED: UPF0420 protein C16orf58 homolog [Papio anubis]
          Length = 468

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 98/376 (26%), Positives = 177/376 (47%), Gaps = 29/376 (7%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
           F P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 77  FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G+++ +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   V   
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
               +     L+ +H+Y+    +R+  + TLN  R  +++  +++ G +  P     R +
Sbjct: 257 PGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RME 315

Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGS--RWTDMVLEQ 362
            L+ G     A ++ +G   H++V  S   EL++L    +E ++LC         +VL Q
Sbjct: 316 PLWTG--FWPAPSLSLGVPLHRLV--SSVFELQQLVEGHQEPYLLCWNQSRNQVQVVLNQ 371

Query: 363 NASGEDALR----GWLVAAYAA-------------SMAKSFHDPSLTVLQDAYDKMNDVF 405
           NA  +  LR    G ++ A                 +       S  ++++ +  ++ +F
Sbjct: 372 NAGPKTVLRAATHGLMLGALQGDGPLPAELEELRNQVRAGPKKESWVIVKETHQVLDMLF 431

Query: 406 TPLLSELQAKGWHTDR 421
              L  LQ  GW T++
Sbjct: 432 PKFLKGLQDAGWKTEK 447


>gi|357505287|ref|XP_003622932.1| hypothetical protein MTR_7g058500 [Medicago truncatula]
 gi|355497947|gb|AES79150.1| hypothetical protein MTR_7g058500 [Medicago truncatula]
          Length = 590

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 86/335 (25%), Positives = 165/335 (49%), Gaps = 20/335 (5%)

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA- 119
           +V  S +  FF    P  VNE Y+ + +++ L    S+AL VL+TQ++  A G+  + + 
Sbjct: 105 KVFGSVVRDFF---IPREVNENYVDFVKWKLLHRVFSSALQVLATQAMFTAIGVGYSSSL 161

Query: 120 -QATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
             A  ++W+LKDG+  + K I  ++L +  D+  KR R     L+    GLE+L+P  P+
Sbjct: 162 PSAAALNWVLKDGLGRLSKCIYTASLASAFDTNLKRVRFTTSVLFVGSIGLELLTPTFPR 221

Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
            FL +A + N AK I++    ATR  ++ SFAK  NL ++ AK +  +  F+++GL    
Sbjct: 222 FFLLLATIANIAKQISLACYLATRSAVHQSFAKADNLGEISAKAQIQTVCFDILGLMLAA 281

Query: 238 QLASTVCSSMQGK----MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
            +   + +  + +      V PL + + ++ + + ++   + TL   R  +I+  +++ G
Sbjct: 282 LVNMWIGNHQRPQAGLHFFVYPLFAFMDLFGIYQGLKHVHLKTLTKDRLEIILKTWIEFG 341

Query: 294 RISSPADLRYREDLLFPGRLIEDAGNVKVGR-DPHKVVKPSKFHELKELFAEEKFILCL- 351
            + SPAD+  +E + F G   +    V++G   P   +       ++ +  E+ + +C+ 
Sbjct: 342 YVPSPADVSEKEVINFMGVKGKSLWPVRIGCISPKDQIPEWSMKTIQSITDEDYYFVCME 401

Query: 352 --------GSRWTDMVLEQNASGEDALRGWLVAAY 378
                   G     + + + A  +  + G L A Y
Sbjct: 402 IFEGLKRTGQLRILLSIREGAESKHIIMGLLQACY 436


>gi|114662190|ref|XP_001159530.1| PREDICTED: UPF0420 protein C16orf58 homolog [Pan troglodytes]
 gi|397472000|ref|XP_003807550.1| PREDICTED: UPF0420 protein C16orf58 homolog [Pan paniscus]
 gi|410212432|gb|JAA03435.1| chromosome 16 open reading frame 58 [Pan troglodytes]
 gi|410264128|gb|JAA20030.1| chromosome 16 open reading frame 58 [Pan troglodytes]
 gi|410292910|gb|JAA25055.1| chromosome 16 open reading frame 58 [Pan troglodytes]
 gi|410340381|gb|JAA39137.1| chromosome 16 open reading frame 58 [Pan troglodytes]
          Length = 468

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 104/411 (25%), Positives = 190/411 (46%), Gaps = 31/411 (7%)

Query: 35  VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
           ++R +  +P+G  + +V     P     +      F P G+P SV+  YL Y  + ++Q 
Sbjct: 44  LSRAFTVKPEGRDAGEVGAPGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101

Query: 95  FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
           F S+    L+TQ++L   G+    A   A   +W++KD    +G+++ +   G+++D   
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161

Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
           K+WR+ AD L D+   LE+++P+ P  F       N AK I  VA  ATR  +    A+ 
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221

Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
            N++D+ AK  +  TL N+ GL   + +   V       +     L+ +H+Y+    +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281

Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
             + TLN  R  +++  +++ G +  P     R + L+ G     A ++ +G   H++V 
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RMEPLWTG--FWPAPSLSLGVPLHRLV- 337

Query: 332 PSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR----GWLVAAYAAS-- 381
            S   EL++L    +E ++LC         +VL Q A  +  LR    G ++ A      
Sbjct: 338 -SSVFELQQLVEGHQESYLLCWDQSRNQVQVVLNQKAGPKTILRAATHGLMLGALQGDGP 396

Query: 382 -----------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
                      +       S  V+++ ++ ++ +F   L  LQ  GW T++
Sbjct: 397 LPAELEELRNRVRAGPKKESWVVVKETHEVLDMLFPKFLKGLQDAGWKTEK 447


>gi|426381987|ref|XP_004057610.1| PREDICTED: UPF0420 protein C16orf58 homolog [Gorilla gorilla
           gorilla]
          Length = 468

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 104/411 (25%), Positives = 190/411 (46%), Gaps = 31/411 (7%)

Query: 35  VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
           ++R +  +P+G  + +V     P     +      F P G+P SV+  YL Y  + ++Q 
Sbjct: 44  LSRAFTVKPEGRDAGEVGAPGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101

Query: 95  FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
           F S+    L+TQ++L   G+    A   A   +W++KD    +G+++ +   G+++D   
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNA 161

Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
           K+WR+ AD L D+   LE+++P+ P  F       N AK I  VA  ATR  +    A+ 
Sbjct: 162 KQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARR 221

Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
            N++D+ AK  +  TL N+ GL   + +   V       +     L+ +H+Y+    +RA
Sbjct: 222 NNMADVSAKDSSQETLVNLAGLLVSLLILPLVSGCPGFSLGCFFFLTALHIYANYRAVRA 281

Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
             + TLN  R  +++  +++ G +  P     R + L+ G     A ++ +G   H++V 
Sbjct: 282 LVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RMEPLWTG--FWPAPSLSLGVPLHRLV- 337

Query: 332 PSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR----GWLVAAYAAS-- 381
            S   EL++L    +E ++LC         +VL Q A  +  LR    G ++ A      
Sbjct: 338 -SSVFELQQLVEGHQESYLLCWDQSRNQVQVVLNQKAGPKTILRAATHGLMLGALQGDGP 396

Query: 382 -----------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
                      +       S  V+++ ++ ++ +F   L  LQ  GW T++
Sbjct: 397 LPAELEELRNRVRAGPKKESWVVVKETHEVLDMLFPKFLKGLQDAGWKTEK 447


>gi|8920582|gb|AAF81304.1|AC027656_21 Contains similarity to CG10338 gene product from Drosophila
           melanogaster gb|AE003660 [Arabidopsis thaliana]
          Length = 438

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 94/372 (25%), Positives = 179/372 (48%), Gaps = 24/372 (6%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F   G+P SV   Y+ +  +  LQ  ++    +LSTQ+LL A G+    A        W 
Sbjct: 55  FSSPGFPGSVTPDYVGFQLWDTLQGLSTYTKMMLSTQALLSAIGVGEKSATVIGATFQWF 114

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           L+D    +G ++ +   G+ +DS  K WR++AD + D+G  +++LSPL P  F+ +  LG
Sbjct: 115 LRDFTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLFPSAFIVVVCLG 174

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           + ++    VA+ ATR  +   FA + N +D+ AK  +  T+  ++G+  G+ LA     +
Sbjct: 175 SLSRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETMATMMGMSLGMLLARFTSGN 234

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
                +    L+V H+Y+    +R   +N+LN +R+++++  F++TG++ SP  +   E 
Sbjct: 235 PMAIWLSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTHFIQTGQVLSPEQVSSMEG 294

Query: 307 LL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKEL----------FAEEKFILCLGSRW 355
           +L      +    +  + +     V+ S    L  L          +   K++L      
Sbjct: 295 VLPLWATSLRSTNSKPLHKRVQLGVRVSSLPRLDMLQLLNGVGASSYKNAKYLLAHIKGN 354

Query: 356 TDMVLEQNASGEDALRGWLVAAYAASM---AKSFHDPSLTVLQDAYDKMNDVFTPLLSEL 412
             ++L +++   D L+ ++ A   A++   + SF+      +   YD+       LL +L
Sbjct: 355 VSVILHKDSKPADVLKSYIHAIVLANLMEKSTSFYSEGEAWIDKHYDE-------LLHKL 407

Query: 413 QAKGWHTDRFLD 424
           ++ GW T+R L 
Sbjct: 408 RSGGWKTERLLS 419


>gi|348675633|gb|EGZ15451.1| hypothetical protein PHYSODRAFT_546234 [Phytophthora sojae]
          Length = 408

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 80/223 (35%), Positives = 118/223 (52%), Gaps = 8/223 (3%)

Query: 64  DSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR----PTPA 119
           D   + F P     S+ + Y  Y ++  +    S A SVLS QSLL+A GL     PT  
Sbjct: 109 DVLWDLFLPKDARASLTKDYFPYAKWSFVGSVASCAASVLSMQSLLYAIGLGAGAIPT-- 166

Query: 120 QATVVSWILKDGM-QHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
            A  V+W+LKDG+ Q  G L  S +  R D++PKRWR+ +    D+    E+L+PL P  
Sbjct: 167 -AAAVNWVLKDGLGQFGGVLFASIVNNRYDADPKRWRVASSVALDVSVLGEILTPLAPGS 225

Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
           FL +A L N AK +A ++A ATR   ++SFA   NL+D+ AK  + +   ++ G G GI 
Sbjct: 226 FLAIASLANVAKNVAWLSASATRAGFHNSFAIRENLADVTAKAGSQAIASSIFGTGLGIL 285

Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
           ++    SS         +LS +H+ S  + +    + TLN QR
Sbjct: 286 ISQFTGSSTLNVAAAFAVLSTVHLISTYKSVDCVQLRTLNCQR 328


>gi|149725803|ref|XP_001495510.1| PREDICTED: UPF0420 protein C16orf58 homolog [Equus caballus]
          Length = 468

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 103/388 (26%), Positives = 181/388 (46%), Gaps = 45/388 (11%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
            L  F P G+P SV+  YL Y  + ++Q F S+    L+T ++L   G+    A   A  
Sbjct: 73  LLALFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGLGVGDAKASVSAAT 132

Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
            +W++KD    +G++I +   G+++D   K WR+ AD L D+   LE+++P+ P  F  +
Sbjct: 133 ATWLVKDSTGMLGRIIFAWWKGSKLDCNAKHWRLFADILNDVAMFLEIMAPVYPICFTMI 192

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
             + N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   
Sbjct: 193 VCISNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPL 252

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
           V       +    L + +H+Y+    +RA  + TLN  R  +++  F++ G +  P    
Sbjct: 253 VSGCPSFSLGCFFLFTALHIYANYRAVRALVIETLNEGRLWLVLKHFLQRGEVLGPTSAN 312

Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSRWT---- 356
            R + L+ G     + ++ +G   H+++  S   EL++L    +E ++L    RW     
Sbjct: 313 -RMEPLWTG--FWPSVSLSLGVPLHRLI--SSVSELQQLVEGHQEPYLL----RWNQSQN 363

Query: 357 --DMVLEQNASGEDALR----GWLVAAY-----------------AASMAKSFHDPSLTV 393
              +VL Q A  E  LR    G ++ A                   A +AK     S  +
Sbjct: 364 QVQVVLSQMAGPETVLRAATHGLVLGALRRDGPLPRELEELRNRVQAGLAKE----SWVI 419

Query: 394 LQDAYDKMNDVFTPLLSELQAKGWHTDR 421
           +++ +  ++ +F   L  LQ  GW T++
Sbjct: 420 VRETHHVLDKLFPKFLKGLQDAGWKTEK 447


>gi|42567566|ref|NP_195771.2| uncharacterized protein [Arabidopsis thaliana]
 gi|209863158|gb|ACI88737.1| At5g01510 [Arabidopsis thaliana]
 gi|332002971|gb|AED90354.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 509

 Score =  124 bits (311), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 106/373 (28%), Positives = 174/373 (46%), Gaps = 29/373 (7%)

Query: 71  FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---------RPTPAQA 121
           FPSG+P SV++ YL Y  ++   + T    +VL T SLL A G+             A A
Sbjct: 120 FPSGFPGSVSDDYLDYMLWQFPTNITGWICNVLVTSSLLKAVGVGSFSGTSAAATAAASA 179

Query: 122 TVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
             + W+ KDG+  +G+L I    G+  D +PK+WR+ AD +   G+  ++ + L P  FL
Sbjct: 180 AAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFL 239

Query: 181 EMAGLGNFAKGIAVVAARATRLP----IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
            +A  GN AK +    AR  R P    I + FA  GNL ++ AK E       ++GLG G
Sbjct: 240 LLASTGNLAKAV----ARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFG 295

Query: 237 IQLAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
           I +  T  +  S    ++    + ++H++   + +     NT+N +R  +IV   V    
Sbjct: 296 ILIIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHSV 355

Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKP--SKFHELKELFAEEKFILCLG 352
           +    D   RE++L   R ++      V  +    ++   SK   L +++ +EK+IL L 
Sbjct: 356 VPGYVDCNKRENILLWQRFMKPRIIFGVSLEELSGLEKSVSKVKALLKMYTKEKYILTLN 415

Query: 353 SRWTD----MVLEQNASGEDALRG-WLVAAYAASMAKSFHDPSLTV--LQDAYDKMNDVF 405
               D    +  + NA+  D LR  W       +M +SF D       L+ +  +M++ F
Sbjct: 416 KLNKDTEFSVSFKVNATSRDVLRCLWQAYWLEENMEESFKDKDSVFHWLKQSLSEMDNKF 475

Query: 406 TPLLSELQAKGWH 418
              L +L   GW+
Sbjct: 476 DDFLFKLDTAGWN 488


>gi|449489204|ref|XP_004158245.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 435

 Score =  124 bits (311), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 180/378 (47%), Gaps = 27/378 (7%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
           F + F P G+P SV   Y+ +  + +LQ  ++   ++LSTQ+LL A G+    A      
Sbjct: 51  FFDAFVPEGFPSSVTPDYVPFQIWDSLQGLSTYIRTMLSTQALLSAIGVGEQSATVIGAT 110

Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLE-VLSPLCPQLFLE 181
             W L+D    +G ++ +   G+ +DS  K WR++AD + DLG  ++  +SPL P  F+ 
Sbjct: 111 FQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMYVDGPVSPLFPSAFML 170

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
           +  LG+ ++    VA+ ATR  +   FA + N +D+ AK  +  T+  ++G+  G+ LA 
Sbjct: 171 IVCLGSLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMIGMAIGMLLAR 230

Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
                     +    L++ H+Y+  + +    + +LNPQR ++ +  +++T  + SP  +
Sbjct: 231 ITMGCPLAIWLSFLSLTMFHMYANYKAVCCLSLTSLNPQRCSIALQHYIQTAEVLSPQQV 290

Query: 302 RYREDLLFPGRLIEDAGNVKVGRDPHKV-VKPSKFH--ELKELF---------AEEKFIL 349
              E +L          ++K+     ++  + S  H  E+KEL           +EK++L
Sbjct: 291 SMMEHVLPAWASSWSLRSIKLIHTRVRLGARISSLHHTEIKELLHLAGANNNDKQEKYLL 350

Query: 350 CLGSRWTDMVLEQNASGEDALRGW---LVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFT 406
                   ++++++A+  D  + +   LV AY       +         ++   M+  + 
Sbjct: 351 VERKETISVIVQKDATANDIFQSFVHALVMAYVPDQESRYF--------ESMSWMDKHYE 402

Query: 407 PLLSELQAKGWHTDRFLD 424
             + +L+  GW TDR L 
Sbjct: 403 CFIEKLKISGWKTDRLLS 420


>gi|195387984|ref|XP_002052672.1| GJ17681 [Drosophila virilis]
 gi|194149129|gb|EDW64827.1| GJ17681 [Drosophila virilis]
          Length = 417

 Score =  124 bits (311), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 90/365 (24%), Positives = 173/365 (47%), Gaps = 19/365 (5%)

Query: 68  NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVS 125
           N F P GYP SV++ Y  Y  +  +Q F S     L T ++L     G     A +  V+
Sbjct: 51  NIFLPKGYPDSVSDDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENVNAYSATVT 110

Query: 126 WILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMA 183
           WILK+G  H+G+++ +   G+++D + K+WR+ AD L DL  G+E+ + P  P L  ++ 
Sbjct: 111 WILKEGSGHLGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQIL 170

Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
                 K I  VA  ATR  +    A  GNL+D+ +K  +  T  N++    G+ L  T+
Sbjct: 171 CGSTMLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLIASFVGLYLL-TM 229

Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRY 303
             S      V  ++  +H+Y+ ++ +RA  + T N  R  + + +F ++  + +P  +  
Sbjct: 230 IKSQAMLYTVFYIVVTLHLYANLKAVRAVCLRTFNESRYLIALEEFFRSSHMLTPQQVNK 289

Query: 304 REDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDM 358
            E +   G+++  + N+++G     ++   K   + E         E+FI+    ++  +
Sbjct: 290 LERVTV-GQMVSVSLNIRLGLSVKHLIDEYKSSNVIENIVSSFDPHERFIIAESKKYLGV 348

Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWH 418
            L  +   +D L+ +    +A S  +  +      +++ Y  +   +   L+  Q +GW 
Sbjct: 349 YLHFDTRPQDVLKAYF---FAVSYLQDRNQ-----IKEQYWDIQGKWQEFLNLAQKEGWL 400

Query: 419 TDRFL 423
            ++ L
Sbjct: 401 INQHL 405


>gi|431902902|gb|ELK09111.1| hypothetical protein PAL_GLEAN10000828 [Pteropus alecto]
          Length = 468

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 101/387 (26%), Positives = 182/387 (47%), Gaps = 43/387 (11%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
            L  F P G+P SV+  YL Y  + ++Q F S+    L+T ++L   G+    A   A  
Sbjct: 73  LLTVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNANASVSAAT 132

Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
            +W++KD    +G+++ +   G+++D   K+WR+ AD L D+   LE+++P+ P  F  +
Sbjct: 133 ATWLVKDSAGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPICFTMI 192

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
             + N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   
Sbjct: 193 VCISNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPL 252

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
           V       +    LL+ +H+Y+    +RA  + TLN  R  +++  F+++G +  P    
Sbjct: 253 VSECPSFSLGCFFLLTALHIYANYRAVRALVIETLNEGRLWLVLKHFLQSGEVLDPTSAN 312

Query: 303 YREDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSRW-- 355
             E L    +P      + ++ +G   H+++  S+  EL++L    +E ++L     W  
Sbjct: 313 KMEPLWTGFWP------SPSLFLGVPLHRLI--SRVFELQQLIEGHQEPYLL----HWDQ 360

Query: 356 ----TDMVLEQNASGEDALR----GWLVAAYA------ASMAK-------SFHDPSLTVL 394
                 +VL Q A  E  LR    G ++ A        A + K            S  ++
Sbjct: 361 SQNQVQVVLSQMAGTETILRAATHGLVLGALQEDGPLPAELEKLRNRVRAGPEKESWVIV 420

Query: 395 QDAYDKMNDVFTPLLSELQAKGWHTDR 421
           ++ +  ++ +F   L  LQ  GW T++
Sbjct: 421 KETHQVLDKLFPKFLKGLQDVGWKTEK 447


>gi|380030107|ref|XP_003698699.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
           [Apis florea]
          Length = 464

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 102/370 (27%), Positives = 180/370 (48%), Gaps = 21/370 (5%)

Query: 55  SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
           ++  Y  ++      F P G+P SV+  Y  Y  +  +Q F S  +  L+T S++   G+
Sbjct: 38  TKSFYSGLISIIKEVFLPQGFPDSVHPDYTSYQVWDTVQAFASTIMGTLTTHSIMQGVGV 97

Query: 115 --RPTPAQATVVSWILKDGMQHVGKLICS--NLGARMDSEPKRWRILADALYDLGTGLEV 170
                   A  ++WILKDG   VG+++ +  N G  +D + K+WR+ AD L DL  GLE+
Sbjct: 98  GEAAATPLAAAITWILKDGTGMVGRIMFAWWN-GTDLDGQCKKWRLFADILNDLAMGLEL 156

Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
           L P      L +  +    K I  VA  ATR  + +  A + NL+D+ AK  +  T  N+
Sbjct: 157 LLPYFSSYSLGILCISTAMKSIVGVAGGATRAALTNIKALQNNLADVSAKDGSQETCVNL 216

Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
           +    GI + S +  + Q  M +   L +IH+Y+    ++A  +N+LN  R A+IV  ++
Sbjct: 217 IASFVGILILS-IFHNGQYIMELYLFLVIIHLYANYSAVKALCLNSLNEDRLALIVKSYI 275

Query: 291 KTGRISSPADLRYREDLLFPGR-LIEDAG-NVKVGRDPHKVVKPSKF----HELK-ELFA 343
               I  P ++  +E +L   +  +   G N+K+G     ++K         EL  +LF 
Sbjct: 276 SNEVIPKPEEVNKKESVLLLTKPTMNICGFNIKIGVSLAALIKKDLILTSDTELSLKLFL 335

Query: 344 EEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDK- 400
           + K+++ +   ++   + L+++A   D L  +    + A +   F   SL V  D + K 
Sbjct: 336 DRKYLISIDVQNKTIFICLKKDAQPYDVLEAY----FHACLCGLFICMSLKVPLDVFLKP 391

Query: 401 -MNDVFTPLL 409
            ++D+  PL+
Sbjct: 392 EVSDLSYPLM 401


>gi|332262940|ref|XP_003280516.1| PREDICTED: UPF0420 protein C16orf58 homolog [Nomascus leucogenys]
          Length = 468

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 98/376 (26%), Positives = 177/376 (47%), Gaps = 29/376 (7%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
           F P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 77  FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G+++ +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   V   
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
               +     L+ +H+Y+    +RA  + TLN  R  +++  +++ G +  P     R +
Sbjct: 257 PGFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RME 315

Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQ 362
            L+ G     A ++ +G   H++V  S   EL++L    +E ++LC         +VL Q
Sbjct: 316 PLWTG--FWPAPSLSLGVPLHRLV--SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQ 371

Query: 363 NASGEDALR----GWLVAAYAAS-------------MAKSFHDPSLTVLQDAYDKMNDVF 405
            A  ++ LR    G ++ A                 +       S  ++++ +  ++ +F
Sbjct: 372 KAGPKNILRAATHGLMLGALQGDGPLPAELEELRNRVQAGPKKESWVIVKETHQVLDMLF 431

Query: 406 TPLLSELQAKGWHTDR 421
              L  LQ  GW T++
Sbjct: 432 PKFLKGLQDAGWKTEK 447


>gi|255542032|ref|XP_002512080.1| conserved hypothetical protein [Ricinus communis]
 gi|223549260|gb|EEF50749.1| conserved hypothetical protein [Ricinus communis]
          Length = 380

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 98/365 (26%), Positives = 179/365 (49%), Gaps = 20/365 (5%)

Query: 74  GYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDG 131
           G+P SV   Y+ +  +  LQ  ++   ++LSTQ+LL A G+    A        W L+D 
Sbjct: 3   GFPSSVTPDYVPFQVWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGATFQWFLRDL 62

Query: 132 MQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAK 190
              +G ++ +   G+ +DS  K WR++AD + DLG  ++++SPL P  F+ +  LG+ ++
Sbjct: 63  TGMLGGILFTFYQGSDLDSNAKMWRLVADLMNDLGMLMDLVSPLFPSAFVVVVCLGSLSR 122

Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
               VA+ ATR  +   FA + N +D+ AK  +  T+  + G+  G+ LA          
Sbjct: 123 SFTGVASGATRAALTQHFALQSNAADISAKEGSQETVATMTGMALGMLLARITIGHPLAI 182

Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
                 L++ H+Y+    +R   + +LN QR+++++  F++TG++ SP  +   E +L  
Sbjct: 183 WFSFLSLTMFHMYANYRAVRCLVLTSLNMQRSSILLQHFMETGQVLSPEQVSGMEHILPR 242

Query: 311 GRLIEDAGNVKVGRDPHKV---VKPSKFHELKELF------AEEKFILCLGSRWTDMVLE 361
                   +VK+      +   V      EL EL        + K++L       ++++ 
Sbjct: 243 WTNFWRTKHVKLLHSRMCLGVRVSSLDHQELMELLHSAGSHKKAKYLLMERKGTINILIH 302

Query: 362 QNASGEDALRGWLVAAYAASMAKSFHDPSLTVL--QDAYDKMNDVFTPLLSELQAKGWHT 419
           +N++  D L+ ++   +A  MAK   + + T L  Q   DK  ++F   L +L + GW T
Sbjct: 303 KNSTTSDVLQSFI---HALVMAKLMEENNSTYLESQSWMDKHYELF---LGKLSSSGWKT 356

Query: 420 DRFLD 424
           +R L 
Sbjct: 357 ERLLS 361


>gi|351711359|gb|EHB14278.1| hypothetical protein GW7_05117 [Heterocephalus glaber]
          Length = 469

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 98/376 (26%), Positives = 175/376 (46%), Gaps = 29/376 (7%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
           F P G+P SV+  YL Y Q+ ++Q F S+  S L+T S++   G+        A   +W+
Sbjct: 78  FLPQGFPESVSPDYLPYQQWDSVQAFASSLCSALATHSVMLGLGVGNAEASLAAATATWL 137

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           ++D    +G++I +   G+++D   K+WR+ AD L D    LE+++P+    F  +  + 
Sbjct: 138 VRDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDAAMLLELVAPVLSVSFTMIVSIS 197

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   + + 
Sbjct: 198 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLVSLIMLPLMSNC 257

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
            +  +    LL+ +H+Y+    +R   + TLN  R  +++  F+++G +         E 
Sbjct: 258 PRLSLGCFFLLTALHIYANYRAVRTLVMETLNESRLRLVLKHFLQSGEVLDTTSANQMEP 317

Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLG--SRWTDMVLEQ 362
            L+ G  I  + ++ +G   H VV  S   EL++L    +E ++LC         +VL Q
Sbjct: 318 -LWTG--IWPSPSLSLGVPLHCVV--SSVSELQQLVEGHQEPYLLCWNQSENQVQVVLSQ 372

Query: 363 NASGEDALRGWLVAAYAASMAKSFHDP-----------------SLTVLQDAYDKMNDVF 405
            A  E  LR         ++ K    P                    ++++    ++ +F
Sbjct: 373 MAGPEAVLRAATHGLLLGALKKDGPLPRELEELRNQVQAGPEKDDWAIVRETNQVLDKLF 432

Query: 406 TPLLSELQAKGWHTDR 421
              L  LQA GW T++
Sbjct: 433 PRFLEGLQAAGWQTEK 448


>gi|357120354|ref|XP_003561892.1| PREDICTED: UPF0420 protein C16orf58-like [Brachypodium distachyon]
          Length = 452

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 100/389 (25%), Positives = 186/389 (47%), Gaps = 38/389 (9%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
           + R+    L  F P G+P SV   Y+ +  +  LQ  ++   ++LSTQ+LL A G+    
Sbjct: 60  WGRIGGRVLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLSAIGVGEKS 119

Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           A        W L+D    +G ++ +   G+ +DS  K WR++AD + DLG  +++LSPL 
Sbjct: 120 ATVIGATFQWFLRDLTGMLGGILFAFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 179

Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
           P   + +  LG+ ++    VA+ ATR  +   FA   N +D+ AK  +  TL  + G+G 
Sbjct: 180 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALAKNAADISAKEGSQETLATMSGMGL 239

Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
           G+ LA           +    L++ H+Y+  + +++  + TLN +RT++++  F++ G +
Sbjct: 240 GMLLAHVTRGHDLVVWVSFLSLTIFHMYANYKAVQSLSLVTLNYERTSILLQYFLEDGEV 299

Query: 296 SSPADLRYREDLLFP-----GRLIEDAGNVKVGRDPHKVV----KPSKF--------HEL 338
            +P  +  +E +L P      RL+         + PH++V    K S           + 
Sbjct: 300 RTPQQVSKQEHIL-PFWSSWRRLLRT-------KLPHELVHLGAKASMLTHSDMLLISKT 351

Query: 339 KELFAEEKFILCLGSRWTDMVLEQNASGEDALRGW---LVAAYAASMAKSFHDPSLTVLQ 395
           +  + +  + L        + + + A+  D L+ +   LV A     +KS H        
Sbjct: 352 RSYYTDANYFLLDKEGSVHIFIHKQAAATDVLKSFVHGLVLARLTRKSKSCH-------A 404

Query: 396 DAYDKMNDVFTPLLSELQAKGWHTDRFLD 424
           +A+  M+D +   +S+L+ +G+ T+R L 
Sbjct: 405 EAHQWMDDKYNIFISKLKMEGYSTERLLS 433


>gi|147768659|emb|CAN60615.1| hypothetical protein VITISV_003254 [Vitis vinifera]
          Length = 470

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 99/329 (30%), Positives = 161/329 (48%), Gaps = 52/329 (15%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQ 120
           SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+ V +TQ+LL + G+   R  P  
Sbjct: 113 SFIRSYVVPEGFPDSVTPSYVPYMSWRALKHFFGGAMGVFTTQTLLNSVGVSRNRSAPG- 171

Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
           A  ++WILKDG   VGK++ +  G + D + K+ R+  D L +LG G+E+ +   P LFL
Sbjct: 172 AVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRLAGDLLMELGAGVELATAAVPHLFL 231

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
            +A   N AK +A V + +TR PIY +FAK  N+ D+ AKGE +  + ++V         
Sbjct: 232 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLV--------- 282

Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
                    K +V                    ++TLN  R ++ +  F+KTGR+ S  +
Sbjct: 283 ---------KSVV--------------------LHTLNRARYSVAIDCFLKTGRVPSLQE 313

Query: 301 LRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDM- 358
              +E++  FP   +ED   V   R          +  ++ LF +E++++        + 
Sbjct: 314 GNMKENIFSFP--WLEDKPIVLGSRFKDAFQDAGSYLSIEPLFEKERYLVTYNPSKGKVY 371

Query: 359 -VLEQNASGEDALRGWLVAAYAASMAKSF 386
            +L+  A  +D L+    AA+ A M   F
Sbjct: 372 ALLKDQAKSDDILK----AAFHAHMLLHF 396


>gi|195437813|ref|XP_002066834.1| GK24688 [Drosophila willistoni]
 gi|194162919|gb|EDW77820.1| GK24688 [Drosophila willistoni]
          Length = 417

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 90/363 (24%), Positives = 172/363 (47%), Gaps = 19/363 (5%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
           F P GYP SV++ Y  Y  +  +Q F S     L T ++L + G+      A +  V+WI
Sbjct: 53  FLPKGYPDSVSDDYAAYQIYDTIQAFCSTICGTLCTHAILKSIGVGSENINAYSATVTWI 112

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
           LK+G  HVG+++ +   G+++D + K+WR+ AD L DL  G+E+ + P  P L  ++   
Sbjct: 113 LKEGSGHVGRILFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQILCG 172

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
               K +  VA  ATR  +    A  GNL+D+ +K  +  T  N++    G+ +  T+  
Sbjct: 173 STMLKAVVSVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLLASCVGLYVL-TLIK 231

Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
           S      V  L+  +H+Y+ ++ +RA  + T N  R  + + +F ++ R+ +P  +   E
Sbjct: 232 SPAVLYTVFYLVVTLHLYANLKAVRAVCLRTFNESRYLIALEEFFRSSRMLTPQQVNGME 291

Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHE-----LKELFAEEKFILCLGSRWTDMVL 360
            +   G+ +  + N+++G     +V   K        +      E+FI+    ++  + L
Sbjct: 292 RVTL-GQTVSVSLNIRLGLSVKNLVDEYKSSSDIENIVSSFDTHERFIIAESKKYLGVYL 350

Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
             +   +D L+ +    +A S  +  +      L++ Y  +   +    ++ Q +GW+  
Sbjct: 351 HFDTRPQDVLKAYF---FAVSYLQDRNQ-----LKEKYWDIQAKWQEFFTQAQKEGWNIH 402

Query: 421 RFL 423
             L
Sbjct: 403 THL 405


>gi|340711243|ref|XP_003394188.1| PREDICTED: UPF0420 protein C16orf58 homolog [Bombus terrestris]
          Length = 464

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 98/354 (27%), Positives = 172/354 (48%), Gaps = 19/354 (5%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P GYP SV+  Y  Y  +  +Q F S  +  L+T S++   G+    A   A  ++WI
Sbjct: 53  FLPQGYPDSVHPDYTSYQIWDTVQAFASTIMGTLTTHSIMQGVGVGEASATPLAAAITWI 112

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           LKDG   +G+++ +   G  +D + K+WR+ AD L DL  GLE+  P      L +  + 
Sbjct: 113 LKDGTGMIGRIVFAWWNGTDLDGQCKKWRLFADILNDLAMGLELFLPYFSSYSLGILCIS 172

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
              K I  VA  ATR  +    A + NL+D+ AK  +  T  N++    GI + S +  +
Sbjct: 173 TAMKSIVGVAGGATRAALTQHQALQNNLADVSAKDGSQETCVNLIASFVGILILS-IFHN 231

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
            +  M +   L +IH+Y+    ++A  +N+LN  R A+IV  ++    I  P ++  +E 
Sbjct: 232 GRYIMELYLFLVMIHLYANYMAVKALCLNSLNEDRLALIVKSYISNEVIPKPEEVNKKES 291

Query: 307 LLFPGR-LIEDAG-NVKVGRDPHKVVK-----PSKFHELKELFAEEKFILCLG--SRWTD 357
           +L   +  +   G N+K+G     ++K      S      +LF + K+++ +   ++   
Sbjct: 292 VLLLTKPTMSICGFNIKIGVSLAALIKEDIILTSDTELALKLFLDRKYLISIDVQNKTIF 351

Query: 358 MVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDK--MNDVFTPLL 409
           + L+++A   D L  +    + A +   F   SL V  D + K  ++D+  PL+
Sbjct: 352 ICLKKDAQPYDVLEAY----FHACLCGLFICMSLKVPLDVFVKPEVSDLSYPLM 401


>gi|345801540|ref|XP_547054.3| PREDICTED: UPF0420 protein C16orf58 homolog [Canis lupus
           familiaris]
          Length = 468

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 98/383 (25%), Positives = 177/383 (46%), Gaps = 43/383 (11%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P G+P SV+  YL Y  + ++Q F S+    L+T ++L   G+    A   A   +W+
Sbjct: 77  FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAEASVSAATATWL 136

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G++I +   G+++D   K+WR+ AD L D+   LE+++P+ P  F     + 
Sbjct: 137 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFFTMTVCIS 196

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   V + 
Sbjct: 197 NLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSAC 256

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
               +     L+ +H+Y+    +RA  + TLN  R  +++  F++ G +  P      E 
Sbjct: 257 PSFSLSCFFFLTALHIYANYRAVRALVLETLNEDRLRLVLKHFLQRGEVLDPTSANQMEP 316

Query: 307 L---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSRW------ 355
           L    +P        ++ +G   H+++  S   EL++L    +E ++L     W      
Sbjct: 317 LWTGFWPSL------SLSLGVPLHRLI--SSVFELQQLVEGHQEPYLL----HWDKSKSQ 364

Query: 356 TDMVLEQNASGEDALR----GWLVAAYAAS--MAKSFHD-----------PSLTVLQDAY 398
             +VL Q A  E  LR    G ++ A      + +   +            S  ++++ +
Sbjct: 365 VQVVLSQKAGPETILRAATHGLVLEALQGDGPLPRELEELRNRVLAGADKESWVIIKETH 424

Query: 399 DKMNDVFTPLLSELQAKGWHTDR 421
             ++ +F   L  LQ  GW T++
Sbjct: 425 QVLDKLFPKFLKGLQDAGWKTEK 447


>gi|225452488|ref|XP_002278851.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
          Length = 514

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 93/345 (26%), Positives = 168/345 (48%), Gaps = 24/345 (6%)

Query: 53  DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
           DD       +  S +  FF    P  V+  Y+ Y +++ L    S+AL VL+TQ++  A 
Sbjct: 99  DDGLNNLFALCSSAVRDFF---IPQKVSGNYMDYVKWKFLHRVFSSALQVLATQAMFRAI 155

Query: 113 GL---RPTPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGL 168
           G+   R  P+ A  ++W+LKDG+  + + I  ++L +  D+  KR R     L+ L  G+
Sbjct: 156 GIGYSRSLPS-AAALNWVLKDGLGRLCRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGV 214

Query: 169 EVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
           E+L+P  P  FL +A + N AK + V    AT   ++ SFA   NL ++ AK +     F
Sbjct: 215 ELLTPAFPNYFLLLATIANIAKQVTVACYLATGSAVHRSFAIADNLGEVSAKAQIQMVCF 274

Query: 229 NVVGLGAGIQLASTVCSSMQGKM-----IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
           + +GL     L + +C + Q  +     +V P+ S I ++ + + ++   + TL   R  
Sbjct: 275 DNLGLMLAAVL-NILCKNNQRLLAGLPFVVYPIFSAIDLFGIYQGLKHVHLQTLTKDRLE 333

Query: 284 MIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELF 342
           +I+  +++ G + SPA++   E + FP     +   +++G  DP   +       ++ L 
Sbjct: 334 IILNMWIQLGYVPSPAEVSKEEGISFPWSKGRELWPIRIGCLDPKGRLPKLSMLMMQSLS 393

Query: 343 AEEKFILCLGS--RWTD-------MVLEQNASGEDALRGWLVAAY 378
           +E+ + +C+ +  R T        + L + AS  D + G L A Y
Sbjct: 394 SEDFYFICMEACRRLTRPEQQGILLCLREGASTTDVISGLLQACY 438


>gi|296087683|emb|CBI34939.3| unnamed protein product [Vitis vinifera]
          Length = 559

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 93/345 (26%), Positives = 168/345 (48%), Gaps = 24/345 (6%)

Query: 53  DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
           DD       +  S +  FF    P  V+  Y+ Y +++ L    S+AL VL+TQ++  A 
Sbjct: 144 DDGLNNLFALCSSAVRDFF---IPQKVSGNYMDYVKWKFLHRVFSSALQVLATQAMFRAI 200

Query: 113 GL---RPTPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGL 168
           G+   R  P+ A  ++W+LKDG+  + + I  ++L +  D+  KR R     L+ L  G+
Sbjct: 201 GIGYSRSLPS-AAALNWVLKDGLGRLCRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGV 259

Query: 169 EVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
           E+L+P  P  FL +A + N AK + V    AT   ++ SFA   NL ++ AK +     F
Sbjct: 260 ELLTPAFPNYFLLLATIANIAKQVTVACYLATGSAVHRSFAIADNLGEVSAKAQIQMVCF 319

Query: 229 NVVGLGAGIQLASTVCSSMQGKM-----IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
           + +GL     L + +C + Q  +     +V P+ S I ++ + + ++   + TL   R  
Sbjct: 320 DNLGLMLAAVL-NILCKNNQRLLAGLPFVVYPIFSAIDLFGIYQGLKHVHLQTLTKDRLE 378

Query: 284 MIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELF 342
           +I+  +++ G + SPA++   E + FP     +   +++G  DP   +       ++ L 
Sbjct: 379 IILNMWIQLGYVPSPAEVSKEEGISFPWSKGRELWPIRIGCLDPKGRLPKLSMLMMQSLS 438

Query: 343 AEEKFILCLGS--RWTD-------MVLEQNASGEDALRGWLVAAY 378
           +E+ + +C+ +  R T        + L + AS  D + G L A Y
Sbjct: 439 SEDFYFICMEACRRLTRPEQQGILLCLREGASTTDVISGLLQACY 483


>gi|326527127|dbj|BAK04505.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 97/384 (25%), Positives = 181/384 (47%), Gaps = 28/384 (7%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
           + R+    L  F P G+P SV   Y+ +  +  LQ  ++   ++LSTQ+LL A G+    
Sbjct: 55  WGRIGSRVLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKS 114

Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           A        W L+D    +G ++ +   G+ +DS  K WR++AD + DLG  +++LSPL 
Sbjct: 115 ATVIGATFQWFLRDLTGMLGGILFAFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 174

Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
           P   + +  LG+ ++    VA+ ATR  +   FA   N +D+ AK  +  TL  + G+G 
Sbjct: 175 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMSGMGL 234

Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
           G+ LA           +    L+V H+Y+  + +++  ++TLN +RT++++  F++ G +
Sbjct: 235 GMLLAHVTRGHDLVVWVSFLSLTVFHMYANYKAVQSLSLSTLNYERTSILLQYFMEHGEV 294

Query: 296 SSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSK------------FHELKELFA 343
            +P  +  +E +L           VK+   PH++V                  + +  + 
Sbjct: 295 LTPQQVSKQEHILPFWSSWRKLLRVKL---PHELVNLGAKASMLTHSDMLLIAKTRSYYT 351

Query: 344 EEKFILCLGSRWTDMVLEQNASGEDALRGW---LVAAYAASMAKSFHDPSLTVLQDAYDK 400
              + L        + + + A   D L+ +   LV A     +KS H        +A+  
Sbjct: 352 NANYFLLDKDGSVCIFIHKQAVATDVLKSFIHGLVLARLVPKSKSCH-------MEAHQW 404

Query: 401 MNDVFTPLLSELQAKGWHTDRFLD 424
           M++ +   +S+L+ +G+ T+R L 
Sbjct: 405 MDEKYNTFISKLKVEGYSTERLLS 428


>gi|194388018|dbj|BAG65393.1| unnamed protein product [Homo sapiens]
          Length = 422

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 98/375 (26%), Positives = 176/375 (46%), Gaps = 29/375 (7%)

Query: 71  FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWIL 128
            P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   A   +W++
Sbjct: 32  LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLV 91

Query: 129 KDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
           KD    +G+++ +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       N
Sbjct: 92  KDSTDMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSN 151

Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
            AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   V    
Sbjct: 152 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCP 211

Query: 248 QGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
              +     L+ +H+Y+    +RA  + TLN  R  +++  +++ G +  P     R + 
Sbjct: 212 GFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RMEP 270

Query: 308 LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQN 363
           L+ G     A ++ +G   H++V  S   EL++L    +E ++LC         +VL Q 
Sbjct: 271 LWTG--FWPAPSLSLGVPLHRLV--SSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQK 326

Query: 364 ASGEDALR----GWLVAAYAAS-------------MAKSFHDPSLTVLQDAYDKMNDVFT 406
           A  +  LR    G ++ A                 +       S  V+++ ++ ++ +F 
Sbjct: 327 AGPKTILRAATHGLMLGALQGDGPLPAELEELRNRVRAGPKKESWVVVKETHEVLDMLFP 386

Query: 407 PLLSELQAKGWHTDR 421
             L  LQ  GW T++
Sbjct: 387 KFLKGLQDAGWKTEK 401


>gi|297283910|ref|XP_002802508.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2 [Macaca
           mulatta]
          Length = 429

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 97/376 (25%), Positives = 176/376 (46%), Gaps = 29/376 (7%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
           F P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 38  FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 97

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G+++ +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       
Sbjct: 98  VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 157

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   V   
Sbjct: 158 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 217

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
               +     L+ +H+Y+    +R+  + TLN  R  +++  +++ G +  P     R +
Sbjct: 218 PGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RME 276

Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQ 362
            L+ G     A ++ +G   H++V  S   EL++L    +E ++LC         +VL Q
Sbjct: 277 PLWTG--FWPAPSLSLGVPLHRLV--SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQ 332

Query: 363 NASGEDALR----GWLVAAYAA-------------SMAKSFHDPSLTVLQDAYDKMNDVF 405
            A  +  LR    G ++ A                 +       S  ++++ +  ++ +F
Sbjct: 333 KAGPKTVLRAATHGLMLGALQGDGPLPAELEELRNQVRAGPKKESWVIVKETHQVLDMLF 392

Query: 406 TPLLSELQAKGWHTDR 421
              L  LQ  GW T++
Sbjct: 393 PKFLKGLQDAGWKTEK 408


>gi|383859063|ref|XP_003705017.1| PREDICTED: UPF0420 protein C16orf58-like [Megachile rotundata]
          Length = 464

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 175/372 (47%), Gaps = 25/372 (6%)

Query: 55  SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
           ++P Y   V      F P GYP SV+  Y  Y  +  +Q F S  +  L+T S++   G+
Sbjct: 38  TKPFYSGFVSILKEVFLPQGYPDSVHPDYTAYQIWDTVQAFASTIMGTLTTHSIMQGVGV 97

Query: 115 RPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVL 171
               A   A  ++WILKDG   +G+++ +   G  +D + K+WR+ AD L D+  GLE+L
Sbjct: 98  GEPAATPLAAAITWILKDGTGMIGRIVFAWWNGTDLDGQCKKWRLFADILNDVAMGLELL 157

Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
            P      L +  +    K I  VA  ATR  +    A + NL+D+ AK  +  T  N++
Sbjct: 158 LPYYSSYSLAILCVSTAMKSIVGVAGGATRAAVTQHQALQNNLADVSAKDGSQETCVNLI 217

Query: 232 GLGAGIQLASTVCSSMQGKMIVGP--LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
               GI + S       G+ I+     L ++H+Y+    +++  +N+LN  R A+IV  +
Sbjct: 218 ASLVGILILSIF---HNGRYIMELYFFLIIVHLYANYSAVKSLCLNSLNEDRLALIVKSY 274

Query: 290 VKTGRISSPADLRYREDLLF---PGRLIEDAGNVKVGRD-----PHKVVKPSKFHELKEL 341
           +    I  P ++  +E +L    P   I    N+K+G        + ++  S      +L
Sbjct: 275 ITNEVIPEPEEVNKKESVLLLTKPTMSIY-GFNIKMGVSLAALIKNNIISTSDTELSLKL 333

Query: 342 FAEEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYD 399
           F + K+++ +   ++   +  +++A   D L  +    + A +   F   SL V  D + 
Sbjct: 334 FLDRKYLISIDVQNKTIFICFKKDAQPCDVLEAY----FHACLCGLFICMSLRVPLDLFL 389

Query: 400 K--MNDVFTPLL 409
           K  ++D+  PL+
Sbjct: 390 KPEVSDMSYPLM 401


>gi|297810247|ref|XP_002873007.1| hypothetical protein ARALYDRAFT_907999 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318844|gb|EFH49266.1| hypothetical protein ARALYDRAFT_907999 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 510

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 105/373 (28%), Positives = 174/373 (46%), Gaps = 29/373 (7%)

Query: 71  FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---------RPTPAQA 121
           FP+G+P SV++ YL Y  ++   + T    +VL T SLL A G+             A A
Sbjct: 121 FPTGFPASVSDDYLDYMLWQFPTNVTGWICNVLVTSSLLKAVGVGSFSGTSAAATAAASA 180

Query: 122 TVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
             + W+ KDG+  +G+L I    G+  D +PK+WR+ AD +   G+  ++ + L P  FL
Sbjct: 181 AAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFL 240

Query: 181 EMAGLGNFAKGIAVVAARATRLP----IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
            +A  GN AK +    AR  R P    I + FA  GNL ++ AK E       ++GLG G
Sbjct: 241 LLASTGNLAKAV----ARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFG 296

Query: 237 IQLAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
           I +  T  +  S     +    + ++H++   + +     NT+N +R  +IV   V    
Sbjct: 297 ILIIDTPGLVKSFPFVSLTWTSVRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHSV 356

Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKP--SKFHELKELFAEEKFILCLG 352
           +    D   RE++L   R ++      V  +    ++   SK   L +++ +EK+IL L 
Sbjct: 357 VPGYVDCNKRENILLGQRFMKPRIIFGVSLEEVSGLEKSVSKVKALLKMYTKEKYILTLN 416

Query: 353 SRWTD----MVLEQNASGEDALRG-WLVAAYAASMAKSFHDPSLTV--LQDAYDKMNDVF 405
              +D    +  + NA+  D LR  W       +M +SF D       L+ +  +M++ F
Sbjct: 417 KLNSDTEFSVSFKVNATSRDVLRCLWQAYWLEENMEESFKDKDSVFHWLKQSLSEMDNKF 476

Query: 406 TPLLSELQAKGWH 418
              L +L   GW+
Sbjct: 477 DDFLFKLDTAGWN 489


>gi|355756739|gb|EHH60347.1| hypothetical protein EGM_11687 [Macaca fascicularis]
          Length = 468

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 97/376 (25%), Positives = 176/376 (46%), Gaps = 29/376 (7%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
           F P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 77  FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G+++ +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   V   
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
               +     L+ +H+Y+    +R+  + TLN  R  +++  +++ G +  P     R +
Sbjct: 257 PGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RME 315

Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQ 362
            L+ G     A ++ +G   H++V  S   EL++L    +E ++LC         +VL Q
Sbjct: 316 PLWTG--FWPAPSLSLGVPLHRLV--SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQ 371

Query: 363 NASGEDALR----GWLVAAYAA-------------SMAKSFHDPSLTVLQDAYDKMNDVF 405
            A  +  LR    G ++ A                 +       S  ++++ +  ++ +F
Sbjct: 372 KAGPKTVLRAATHGLILGALQGDGPLPAELEELRNQVRAGPKKESWVIVKETHQVLDMLF 431

Query: 406 TPLLSELQAKGWHTDR 421
              L  LQ  GW T++
Sbjct: 432 PKFLKGLQDAGWKTEK 447


>gi|302564091|ref|NP_001181525.1| UPF0420 protein C16orf58 [Macaca mulatta]
 gi|380815644|gb|AFE79696.1| hypothetical protein LOC64755 [Macaca mulatta]
          Length = 468

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 97/376 (25%), Positives = 176/376 (46%), Gaps = 29/376 (7%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
           F P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 77  FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G+++ +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   V   
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
               +     L+ +H+Y+    +R+  + TLN  R  +++  +++ G +  P     R +
Sbjct: 257 PGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RME 315

Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQ 362
            L+ G     A ++ +G   H++V  S   EL++L    +E ++LC         +VL Q
Sbjct: 316 PLWTG--FWPAPSLSLGVPLHRLV--SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQ 371

Query: 363 NASGEDALR----GWLVAAYAA-------------SMAKSFHDPSLTVLQDAYDKMNDVF 405
            A  +  LR    G ++ A                 +       S  ++++ +  ++ +F
Sbjct: 372 KAGPKTVLRAATHGLMLGALQGDGPLPAELEELRNQVRAGPKKESWVIVKETHQVLDMLF 431

Query: 406 TPLLSELQAKGWHTDR 421
              L  LQ  GW T++
Sbjct: 432 PKFLKGLQDAGWKTEK 447


>gi|355710161|gb|EHH31625.1| hypothetical protein EGK_12728 [Macaca mulatta]
          Length = 468

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 97/376 (25%), Positives = 176/376 (46%), Gaps = 29/376 (7%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
           F P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 77  FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G+++ +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   V   
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGC 256

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
               +     L+ +H+Y+    +R+  + TLN  R  +++  +++ G +  P     R +
Sbjct: 257 PGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RME 315

Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQ 362
            L+ G     A ++ +G   H++V  S   EL++L    +E ++LC         +VL Q
Sbjct: 316 PLWTG--FWPAPSLSLGVPLHRLV--SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQ 371

Query: 363 NASGEDALR----GWLVAAYAA-------------SMAKSFHDPSLTVLQDAYDKMNDVF 405
            A  +  LR    G ++ A                 +       S  ++++ +  ++ +F
Sbjct: 372 KAGPKTVLRAATHGLMLGALQGDGPLPAELEELRNQVRAGPKKESWVIVRETHQVLDMLF 431

Query: 406 TPLLSELQAKGWHTDR 421
              L  LQ  GW T++
Sbjct: 432 PKFLKGLQDAGWKTEK 447


>gi|7339490|emb|CAB82813.1| putative protein [Arabidopsis thaliana]
          Length = 631

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 86/320 (26%), Positives = 156/320 (48%), Gaps = 32/320 (10%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQAT 122
           + L +F  P G+P SV   YL Y+ +R +Q   S    VL+TQSLL+A GL +     A 
Sbjct: 192 NLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAA 251

Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
            ++W+LKDG+ ++ K++ S  G   D  PK WR+ AD L +   G+E+L+P+ PQ F+ +
Sbjct: 252 AINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMI 311

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK---------------------- 220
                  +  A +   ATR    + FA + N ++++                        
Sbjct: 312 GAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVYNNFYMALVLITYQQLFVFLNYSGS 371

Query: 221 -GEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNP 279
            GEA   +   VG+  GI +A+ + +S    +    +++ IH+Y+ ++  +   + TLNP
Sbjct: 372 LGEAQGMVSKSVGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNP 431

Query: 280 QRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELK 339
            R +++ ++++ +G+     ++   E+ LFP     +       + P K+       E K
Sbjct: 432 YRASLVFSEYLISGQAPLIKEVN-DEEPLFPTVRFSNM------KSPEKLQDFVLSSEAK 484

Query: 340 ELFAEEKFILCLGSRWTDMV 359
              A+ +  L LGS+ +D++
Sbjct: 485 AAAADIEERLQLGSKLSDVI 504


>gi|417401458|gb|JAA47615.1| Hypothetical protein [Desmodus rotundus]
          Length = 468

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 99/390 (25%), Positives = 175/390 (44%), Gaps = 43/390 (11%)

Query: 63  VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
           +   L  F P G+P SV+  YL Y  + ++Q F S+    L+T ++L   G+    A   
Sbjct: 70  LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129

Query: 121 ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
           A   +WI+KD    +G++I +   G+++D   K+WR+ AD L D+   LE+++P+ P  F
Sbjct: 130 AATATWIVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMCF 189

Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
                + N AK +  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +
Sbjct: 190 TVTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249

Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
              V       +    LL+ +HVY+    ++A  + TLN  R  +++  F++ G +  P 
Sbjct: 250 LPLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPT 309

Query: 300 DLRYREDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSR 354
                E L    +P        ++ +G   H+++  S   EL++L     E ++L     
Sbjct: 310 SANQMEPLWTGFWPSL------SLSLGAPLHRLI--SSVFELQQLVEGHREPYLL----H 357

Query: 355 W------TDMVLEQNASGEDALRGWLVAAYAASMAKSFHDP-----------------SL 391
           W        +VL Q A  E  LR         ++ +    P                 S 
Sbjct: 358 WDQSQNRVQVVLSQMAGPETILRAATHGLVLGALREDGPLPRELEELRNQIRAGPKKDSW 417

Query: 392 TVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
            ++++ +  ++ +F   L  LQ  GW T++
Sbjct: 418 VIVKETHQVLDKLFPKFLKGLQDVGWKTEK 447


>gi|328872765|gb|EGG21132.1| DUF647 family protein [Dictyostelium fasciculatum]
          Length = 534

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 78/277 (28%), Positives = 137/277 (49%), Gaps = 6/277 (2%)

Query: 37  RHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFT 96
           +H       H S  V+D       ++   F   F P GYP SV   Y +Y ++  LQ+  
Sbjct: 88  KHVNNNDTTHTSSLVIDTDNQ--QKLTKMFQKLFLPKGYPQSVTHDYSQYMKWMFLQNTL 145

Query: 97  SAALSVLSTQSLL--FAAGLRPTPAQATVVSWILKDGMQHVG-KLICSNLGARMDSEPKR 153
            +   VLST +LL     GL  +   A  +SW+LKDG+      L  S     +D   K 
Sbjct: 146 GSTTYVLSTHALLTSVGVGLSTSLPFAAAISWVLKDGLGASALVLFASKYSTSLDYNLKS 205

Query: 154 WRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGN 213
           ++   D L+++G  LE+L+P  P  FL +A + N +KG+A +   ATR  +  SF+ + N
Sbjct: 206 FKFRGDILHNVGVMLEMLTPFVPGYFLPLASVSNLSKGLAGLIYGATRASLNRSFSLQEN 265

Query: 214 LSDLFAKGEAISTLFNVVGLGAGIQLASTVC-SSMQGKMIVGPLLSVIHVYSVIEEMRAA 272
           L D+ AK ++ S    + G+G G  +   +   +    +    ++S++H+   ++ +++ 
Sbjct: 266 LGDITAKYQSQSMASYLTGMGLGTTIGLLMSPMTTTANLCAVFIISLLHMSCSVKSLKSI 325

Query: 273 PVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF 309
            + TLN QR+ +I+  +++   I SP  +   E+ +F
Sbjct: 326 CLKTLNSQRSKIIIDHWLREKIILSPEIVNKNENFMF 362


>gi|195344882|ref|XP_002039005.1| GM17287 [Drosophila sechellia]
 gi|194134135|gb|EDW55651.1| GM17287 [Drosophila sechellia]
          Length = 417

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 93/357 (26%), Positives = 165/357 (46%), Gaps = 19/357 (5%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
           F P GYP SV+E Y  Y  +   Q F S     L T ++L     G     A +   +WI
Sbjct: 53  FLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATATWI 112

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
           LK+G  HVG+++ +   G+++D + K+WR+ AD L DL  G+E+ + P  P L  ++   
Sbjct: 113 LKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQILCC 172

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
               K I  VA  ATR  +    A  GNL+D+ +K  +  T  N+V    G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232

Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
                 I   ++S +H+Y+ ++ +RA  + + N  R  + + +F ++ R+ SP  +   E
Sbjct: 233 QAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNAME 291

Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDMVL 360
            +   G+ +  + N+K+G     ++   K   + E         E FI+    +   + L
Sbjct: 292 RVTV-GQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSSFDPHEHFIIAQTKKCLGVYL 350

Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGW 417
                 +D L+ +    +A S  +  +      +++ Y  +   +   L+  Q +GW
Sbjct: 351 HFETRPQDVLKAYF---FAVSYLQDRNQ-----IKEKYWDIQTKWQEFLALAQQEGW 399


>gi|342881518|gb|EGU82408.1| hypothetical protein FOXB_07090 [Fusarium oxysporum Fo5176]
          Length = 448

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 109/425 (25%), Positives = 196/425 (46%), Gaps = 36/425 (8%)

Query: 22  QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNE 81
           + + I   + S +V+R +    D H    VV  S+P    ++ S    F P GYP SV+ 
Sbjct: 12  EPVEIVELDKSGNVSRTWNHTVDNH----VVAKSKPSI-PLIPSLREAFMPVGYPHSVSS 66

Query: 82  GYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLI 139
            YL Y  F +LQ F S   S+L+ ++LL   G+    + AT  ++  +LKD M  +  ++
Sbjct: 67  DYLNYQFFDSLQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVLKDAMSRIATIV 126

Query: 140 CSN-LGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAKGIAVVAA 197
            ++  G R++ + KR+R LAD   D    LE+ SP   P   +     G   + +  VAA
Sbjct: 127 FAHQFGLRIEPDAKRFRFLADLFNDTAFFLELYSPYFGPYGKILALTSGETLRALCGVAA 186

Query: 198 RATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLL 257
            A++  +   FAK  NLS+L AK  +  T   ++GL  G  +   V       + +  +L
Sbjct: 187 GASKAALSVHFAKHDNLSELNAKEASQETAVGLIGLLVGTIVVKYV-EDHHSVLFLMTVL 245

Query: 258 SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDA 317
              H++     +RA  ++ LN QR  ++  +++KTG + SP ++ +RE++L    ++ + 
Sbjct: 246 VFAHLWMNYLGVRAVCMDNLNQQRATILFEEYLKTGNVLSPEEVAHRENILLWRPVVRNQ 305

Query: 318 GNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSR-----W---------TDMVLEQN 363
              K+ R     +  S +  +         ++  G+R     W           ++L ++
Sbjct: 306 HGQKIARIE---MAKSYWDAMGAKAPNSNVVIIDGARSSLFVWYHSKRSYTPIKIMLWKD 362

Query: 364 ASGEDALRGWLVA---AYAASMAKSFHDPSLTVLQ-----DAYDKMNDVFTPLLSELQAK 415
           AS   A+  W +A   A+    ++ + D    + +     D Y++  D    L  E+ AK
Sbjct: 363 ASAIHAISAWFMAIEMAWLLEKSQGYTDKLDKIFRVKETVDGYEQKYDK-RDLWDEMLAK 421

Query: 416 GWHTD 420
           GW+ +
Sbjct: 422 GWNIE 426


>gi|24585018|ref|NP_609897.2| CG10338 [Drosophila melanogaster]
 gi|22946770|gb|AAF53692.2| CG10338 [Drosophila melanogaster]
 gi|384475974|gb|AFH89820.1| FI20169p1 [Drosophila melanogaster]
          Length = 417

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 93/357 (26%), Positives = 164/357 (45%), Gaps = 19/357 (5%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
           F P GYP SV+E Y  Y  +   Q F S     L T ++L     G     A +   +WI
Sbjct: 53  FLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATATWI 112

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
           LK+G  HVG+++ +   G+++D + K+WR+ AD L DL  G+E+ + P  P    ++   
Sbjct: 113 LKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQILCC 172

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
               K I  VA  ATR  +    A  GNL+D+ +K  +  T  N+V    G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232

Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
                 I   ++S +H+Y+ ++ +RA  + + N  R  + + +F ++ R+ SP  +   E
Sbjct: 233 QAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNAME 291

Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDMVL 360
            +   G+ +  + N+K+G     ++   K   + E         E FI+    +   + L
Sbjct: 292 RVTV-GQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSSFDPHEHFIIAQTKKCLGVYL 350

Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGW 417
                 +D L+ +    +A S  +  +      L++ Y  +   +   L+  Q +GW
Sbjct: 351 HFETRPQDVLKAYF---FAVSYLQDRNQ-----LKEKYWDIQTKWQEFLALAQQEGW 399


>gi|410984724|ref|XP_003998676.1| PREDICTED: UPF0420 protein C16orf58 homolog [Felis catus]
          Length = 468

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 99/390 (25%), Positives = 179/390 (45%), Gaps = 43/390 (11%)

Query: 63  VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
           + S +  F P G+P SV+  YL Y  + ++Q F S+    L+T ++L   G+    A   
Sbjct: 70  LSSLIAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129

Query: 121 ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
           A   +W++KD    +G++I +   G+++D   K+WR+ AD L D+   LE+++P+ P  F
Sbjct: 130 AATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFF 189

Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
                  N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +
Sbjct: 190 TITICTSNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249

Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
              V       +     L+ +H+Y+    +RA  + TLN  R  +++  F++ G +  P 
Sbjct: 250 LPLVSDCPGFSLGCFFFLTALHIYANYRAVRALVIETLNEGRLRLVLKHFLQRGEVLGPT 309

Query: 300 DLRYREDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSR 354
                E L    +P +      ++ +G   H++   S   +L++L    +E ++L    R
Sbjct: 310 SANQMEPLWTGFWPSQ------SLSLGVPLHRLT--SSVFDLQQLVEGHQEPYLL----R 357

Query: 355 W------TDMVLEQNASGEDALR----GWLVAAYAAS--MAKSFHD-----------PSL 391
           W        +VL Q A  E  LR    G ++ A      + +   +            + 
Sbjct: 358 WDLSQNQVQVVLSQMAGPETILRAATHGLVLEALQGDGPLPRGLEELRNQVRAGPEKDNW 417

Query: 392 TVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
            +++D +  ++ +F   L  LQ  GW T++
Sbjct: 418 VIVRDTHQVLDKLFPKFLKGLQDAGWKTEK 447


>gi|403368796|gb|EJY84239.1| hypothetical protein OXYTRI_18022 [Oxytricha trifallax]
          Length = 504

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 73/241 (30%), Positives = 121/241 (50%), Gaps = 4/241 (1%)

Query: 58  VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT 117
           V +++       F P GYP SV EGYL+++ F  L   +  A+S LS QSL  A G   T
Sbjct: 85  VKNKIYIGLFETFLPVGYPNSVAEGYLKFSIFNNLSALSITAMSFLSAQSLFVAIGSTMT 144

Query: 118 PAQ--ATVVSWILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
            A   A   +W+LKDG+  +G  L  S  G   D + K+WR +A    ++   +E+L+  
Sbjct: 145 QANLAAAAYTWVLKDGIGQLGAILFASRYGRNFDEDIKKWRFMAMIALNISIYIEILTLR 204

Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
            P  FL +A + N  K I  + + A+R  I   FAK  N+ D+  K  +  T   + G+G
Sbjct: 205 FPNHFLALASIANIGKNICFLLSAASRASINVQFAKRNNIGDISGKSVSQFTASTLCGVG 264

Query: 235 AGIQLASTV-CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
            G+ L+  +  SS+     V  +L+++++ +  +  +      LN QR  +I  ++ K+ 
Sbjct: 265 IGLMLSKLIDISSISQLYPVFMILTILNIATSYQASKVIDEVYLNNQRAFIIFNEYFKSN 324

Query: 294 R 294
           +
Sbjct: 325 K 325


>gi|195115010|ref|XP_002002060.1| GI17175 [Drosophila mojavensis]
 gi|193912635|gb|EDW11502.1| GI17175 [Drosophila mojavensis]
          Length = 417

 Score =  120 bits (302), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 89/365 (24%), Positives = 168/365 (46%), Gaps = 19/365 (5%)

Query: 68  NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVS 125
           N F P GYP SV++ Y  Y  +  +Q F S     L T ++L     G     A +  V+
Sbjct: 51  NIFLPKGYPDSVSDDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENVNAYSATVT 110

Query: 126 WILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMA 183
           WILK+G  H+G++I +   G+++D + K+WR+ AD L DL  G+E+ + P  P    ++ 
Sbjct: 111 WILKEGSGHMGRIIFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQIL 170

Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
                 K I  VA  ATR  +    A  GNL+D+ +K  +  T  N++    G+ L  T+
Sbjct: 171 CGSTLLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLIASFMGLYLL-TM 229

Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRY 303
             S      V  ++  +H+Y+ ++ +RA  + T N  R  + + +F ++ ++ +P  +  
Sbjct: 230 IKSQAMLYTVFYIVVTLHLYANLKAVRAVCLRTFNESRYLITLEEFFRSSQMLTPQQVNK 289

Query: 304 REDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDM 358
            E +   G+++  +  +++G     ++   K  ++ E         E FI+    ++  +
Sbjct: 290 LERVTL-GQMVSVSLKIRLGLSVKHLIDEYKSSDVIEDIVSSFDPHEHFIIAESKKFLGV 348

Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWH 418
            L  +   +D L+ +          K         +++ Y  +   +   LS  Q +GW 
Sbjct: 349 YLHFDTRPQDVLKAYFFGVSYLQDRKH--------IKEKYWDIQGKWQEFLSLAQKEGWI 400

Query: 419 TDRFL 423
            ++ L
Sbjct: 401 INQHL 405


>gi|426254633|ref|XP_004020981.1| PREDICTED: UPF0420 protein C16orf58 homolog [Ovis aries]
          Length = 462

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 101/384 (26%), Positives = 176/384 (45%), Gaps = 37/384 (9%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
            L  F P G+P SV+  YL Y  + ++Q F S+    L+T ++L   G+    A   A  
Sbjct: 67  LLAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAAT 126

Query: 124 VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
            +W++KD    +G+++ +  +G++MD   K+WR+ AD L D+   LE+++P+ P  F   
Sbjct: 127 ATWLVKDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTVT 186

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
             + N AK +  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   
Sbjct: 187 VCISNLAKCLVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLAGLLVSLLMLPL 246

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
           V  S    +     L+ +H+Y+    +RA  + TLN QR  +++  F++ G +  P    
Sbjct: 247 VSDSPSLSLGCFFFLTALHIYANYRAVRALVIETLNEQRLWLVLRHFLQWGEVLGPTSAN 306

Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSRW----- 355
            R + L+ G     + ++ +G   H V   S   EL+ L    +E ++L     W     
Sbjct: 307 -RMEPLWTG--FWSSVSLSLGAPLHCVT--SSVSELQHLVEGHQEPYLL----HWDQSRN 357

Query: 356 -TDMVLEQNASGEDALR----GWLVAAYAA-------------SMAKSFHDPSLTVLQDA 397
              +VL   A  E  LR    G ++ A                 +       S  V+++ 
Sbjct: 358 QVQVVLSPMAGPEAVLRAATHGLVLGALRGDGPLPEELEELRNQVRAGPEKESWVVVRET 417

Query: 398 YDKMNDVFTPLLSELQAKGWHTDR 421
           +  ++ +F   L  LQ  GW T++
Sbjct: 418 HQVLDKLFPKFLKGLQDGGWSTEK 441


>gi|449017769|dbj|BAM81171.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 827

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 116/425 (27%), Positives = 187/425 (44%), Gaps = 61/425 (14%)

Query: 56  RPVYHRVVDSFLNKFF--PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
           R V+  +V S L + F   + +P  V++ YL YT    +Q+   AA SV++TQS+L A G
Sbjct: 328 RNVWRALVGSLLPEGFGDKTRFPPVVSDDYLPYTVLHFMQNLVHAAASVIATQSMLIAIG 387

Query: 114 L--RPTPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEV 170
           +  + T   +  V W+LKDG+  +  +I  S LG R D++PKRWR+  D LY LG G E+
Sbjct: 388 VAKQATIGVSAAVQWVLKDGLGRIAAIIFGSLLGNRYDADPKRWRMRGDMLYTLGVGTEI 447

Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
            + L PQ FL +  + N  K ++ +    T   I   F+   N  DL AK  +   L  +
Sbjct: 448 ATRLVPQYFLLVGTMANAIKSVSYMMRLPTAAAIRRHFSLRENFGDLSAKANSQEVLSML 507

Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
           +G   GI L+  + +S++       +   + +      +R   + TLN QRT +++  + 
Sbjct: 508 LGTFVGIGLSYMIDNSLKALFAFYIIYVQMFILFNYWSLRVLRMRTLNLQRTLIVMRRYW 567

Query: 291 KTGRI--SSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFI 348
            +G +   S A     E+ L P RL   A  V+ G    +  +     E +E  A ++  
Sbjct: 568 NSGGVETCSVAMANQAENFLLPSRL-GPARRVRFGCRLGEAFR--TVGEFEEAIARQRL- 623

Query: 349 LCLGSRWTDMVLEQ--------NAS-----------------GEDALRGWLVA-----AY 378
                 W    L+Q        NA                  G DA RG +       A 
Sbjct: 624 ------WRGQYLQQTDVSKRNGNADADFTSVPDDVLDERYLLGVDAKRGRVFVVLHRDAK 677

Query: 379 AASMAKSFHDPSLTVLQD-------AYDKMNDVF------TPLLSE-LQAKGWHTDRFLD 424
            A   ++    +  V  D       + D++ D +      TP+L + ++A GW+ D F+ 
Sbjct: 678 VADELQALMHATYLVRADWGPPPDMSIDRVRDGYLLACAETPVLVQRMRAVGWNVDEFVH 737

Query: 425 GTGTR 429
             G R
Sbjct: 738 AGGER 742


>gi|403276814|ref|XP_003930078.1| PREDICTED: UPF0420 protein C16orf58 homolog [Saimiri boliviensis
           boliviensis]
          Length = 468

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 98/376 (26%), Positives = 176/376 (46%), Gaps = 29/376 (7%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
           F P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 77  FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWL 136

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G+++ +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   V   
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGF 256

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
                     L+ +H+Y+    +RA  + TLN  R  +++  +++ G +  P     R +
Sbjct: 257 PGFSFGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTTAN-RME 315

Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQ 362
            L+ G       ++ +G   H++V  S   EL++L    +E ++LC         +VL Q
Sbjct: 316 PLWTG--FWPTPSLSLGVPLHRLV--SSVSELQQLVEGHQEPYLLCWDQSQNQVQVVLNQ 371

Query: 363 NASGEDALR----GWLVAAYAA-----SMAKSFHD--------PSLTVLQDAYDKMNDVF 405
            A  +  LR    G ++ A        +  K   +         S  ++++ +  ++ +F
Sbjct: 372 KADPKTVLRAATHGLMLGALQGDGPLPAELKELRNRVRAGPKKESWVIVKETHQVLDMLF 431

Query: 406 TPLLSELQAKGWHTDR 421
              L  LQ  GW T++
Sbjct: 432 PKFLKGLQDAGWKTEK 447


>gi|300176775|emb|CBK25344.2| unnamed protein product [Blastocystis hominis]
          Length = 409

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 74/214 (34%), Positives = 116/214 (54%), Gaps = 3/214 (1%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA-QATVVSWIL 128
           F P  YP SV   YL Y  + A     SAA  VLSTQS+LFA G+       A  ++W+L
Sbjct: 105 FLPRNYPSSVKGKYLNYCSWAAFGMGLSAAGGVLSTQSMLFAIGMGAGAVPMAAALNWVL 164

Query: 129 KDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
           KDG+  +G +  + L  +R+D++ + WRIL+  L ++ T LEV++PL P  FL +A L N
Sbjct: 165 KDGLGQLGGMAFTALVNSRLDADSRGWRILSAWLLEISTWLEVMTPLFPHSFLLLATLAN 224

Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC-SS 246
             K I+ +A  ATR  I   F    N+ D+ AK  + +    V G   GI +++ +    
Sbjct: 225 VGKNISWLAGSATRAGIRYGFVNAHNMGDITAKEGSQTVAITVFGTFLGIVISNLIGHGH 284

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQ 280
           M+  ++    +S + ++S+ + +R   + TLN Q
Sbjct: 285 MEYVLMSSMCISAVSLFSIYQSLRCVSLPTLNYQ 318


>gi|195579948|ref|XP_002079818.1| GD24150 [Drosophila simulans]
 gi|194191827|gb|EDX05403.1| GD24150 [Drosophila simulans]
          Length = 417

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 92/357 (25%), Positives = 164/357 (45%), Gaps = 19/357 (5%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
           F P GYP SV+E Y  Y  +   Q F S     L T ++L     G     A +   +WI
Sbjct: 53  FLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATATWI 112

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
           LK+G  HVG+++ +   G+++D + K+WR+ AD L DL  G+E+ + P  P    ++   
Sbjct: 113 LKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQILCC 172

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
               K I  VA  ATR  +    A  GNL+D+ +K  +  T  N+V    G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232

Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
                 I   ++S +H+Y+ ++ +RA  + + N  R  + + +F ++ R+ SP  +   E
Sbjct: 233 QAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNAME 291

Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDMVL 360
            +   G+ +  + N+K+G     ++   K   + E         E FI+    +   + L
Sbjct: 292 RVTV-GQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSSFDPHEHFIIAQTKKCLGVYL 350

Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGW 417
                 +D L+ +    +A S  +  +      +++ Y  +   +   L+  Q +GW
Sbjct: 351 HFETRPQDVLKAYF---FAVSYLQDRNQ-----IKEKYWDIQTKWQEFLALAQQEGW 399


>gi|125986275|ref|XP_001356901.1| GA10253 [Drosophila pseudoobscura pseudoobscura]
 gi|195148693|ref|XP_002015302.1| GL19630 [Drosophila persimilis]
 gi|54645227|gb|EAL33967.1| GA10253 [Drosophila pseudoobscura pseudoobscura]
 gi|194107255|gb|EDW29298.1| GL19630 [Drosophila persimilis]
          Length = 417

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 84/317 (26%), Positives = 149/317 (47%), Gaps = 11/317 (3%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
           F P GYP SV+E Y  Y  +  +Q F S     L T ++L     G     A +  V+WI
Sbjct: 53  FLPKGYPDSVSEDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENINAYSATVTWI 112

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
           +K+G  HVG+++ +   G ++D + K+WR+ AD L DL  G+E+ + P  P L  ++   
Sbjct: 113 MKEGCGHVGRILFAWWKGCQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQILCG 172

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
               K I  VA  ATR  +    A  GNL+D+ +K  +  T  N+V    G+ L  T+  
Sbjct: 173 STMLKAIVGVAGGATRSALTQHHAVRGNLADVASKDSSQETCVNLVASFVGLYLL-TLIK 231

Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
           S      V  ++  +H+Y+ ++ +R+  + T N  R  + + ++ ++ R+ +P  +   E
Sbjct: 232 SQAVLYTVFYVVVALHLYANLKAVRSVCLRTFNESRYLIALEEYFRSSRMLTPQQVNAME 291

Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDMVL 360
            +   G+ +  + NV++G     ++   K     E         E FI+    ++  + L
Sbjct: 292 RVTL-GQTVSVSLNVRLGLSVKNLIDEYKTSSSIENIVSSFDPHEHFIIAESKKYLGVYL 350

Query: 361 EQNASGEDALRGWLVAA 377
                 +D L+ +  A 
Sbjct: 351 HFETRQQDVLKAYFFAV 367


>gi|195035521|ref|XP_001989226.1| GH11606 [Drosophila grimshawi]
 gi|193905226|gb|EDW04093.1| GH11606 [Drosophila grimshawi]
          Length = 417

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 83/319 (26%), Positives = 153/319 (47%), Gaps = 11/319 (3%)

Query: 68  NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVS 125
           N F P GYP SV++ Y+ Y  +  LQ F S     L T ++L     G     A +  V+
Sbjct: 51  NIFLPKGYPDSVSDDYVVYQIWDTLQAFCSTICGTLCTHAILKGIGVGSENINAYSATVT 110

Query: 126 WILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMA 183
           WILK+G  HVG+++ +   G+++D + K+WR+ AD L DL  G+E+ + P  P    ++ 
Sbjct: 111 WILKEGSGHVGRILFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQIL 170

Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
                 K I  VA  ATR  +    A  GNL+D+ +K  +  T  N+V    G+ L + +
Sbjct: 171 CGSTMLKAIVGVAGGATRSALTQHHAVRGNLADVASKDSSQETCVNLVASFVGLYLLTMI 230

Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRY 303
            S      I   +++ +H+Y+ ++ +R+  + T N  R  + + +F ++  + +P  +  
Sbjct: 231 KSQAMLYTIFYIVVT-LHLYANLKAVRSICLRTFNESRYLIALEEFFRSSHMLTPQQVNK 289

Query: 304 REDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDM 358
            E +   G+ +  + N+++G     ++   K   + E         E+FI+    +   +
Sbjct: 290 MERVTV-GQTVSVSLNIRLGLSVKHLIDEYKSSSVIENIVSSFDPHERFIIAESKKHLGV 348

Query: 359 VLEQNASGEDALRGWLVAA 377
            L  +   +D L+ +  A 
Sbjct: 349 YLHFDTRPQDVLKAYFFAV 367


>gi|189241392|ref|XP_968556.2| PREDICTED: similar to UPF0420 protein C16orf58 homolog [Tribolium
           castaneum]
          Length = 472

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 95/338 (28%), Positives = 161/338 (47%), Gaps = 14/338 (4%)

Query: 47  LSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQ 106
           +S + V DS  V   V+  F     P GYP SV+E Y  Y  +  +Q F S    + +TQ
Sbjct: 35  ISQESVADS--VLQSVLGFFREVLLPHGYPDSVSEDYFEYQLWDTVQAFCSTITGIFTTQ 92

Query: 107 SLLFAAGLRPTPAQA--TVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYD 163
           ++L   G+    A A    ++WI+KDG   VG++I +   G  +DS  K+WR+ AD L D
Sbjct: 93  AILKGVGVGSAEASALSATITWIMKDGTGMVGRIIFAWWKGKSLDSNCKKWRLFADLLND 152

Query: 164 LGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
               LE+  P    L  E+  L    K +  VA  ATR  I +  A +GN++++ AK  +
Sbjct: 153 GAMILELSIPHVTHLSTEILCLSTLMKAMVGVAGGATRASITNHQAIKGNMAEIAAKDGS 212

Query: 224 ISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
             TL N++    GI L +    +   + IV  LL  +H+YS  + + A     LN  R  
Sbjct: 213 QETLVNLIASFTGIFLINYFTDATT-QWIVTFLLIFLHLYSNYQAVTALIFKNLNNLRMT 271

Query: 284 MIVADFVKTGRISSPADLRYREDLLFP-GRLIEDAGNVKV--GRDPHKVVKPSKFHELK- 339
           +I+  ++    +  P  +   E +    G  +E     K+  G+   K ++     ELK 
Sbjct: 272 LILKSYLAVNAVLGPECINKTESVFLGFGTSVEKFCGFKIVLGQALGKTLRRYNSDELKS 331

Query: 340 --ELFAEEKFILC--LGSRWTDMVLEQNASGEDALRGW 373
             E++ ++K+++   L +R   +  E+  + ++ + G+
Sbjct: 332 LTEIYKDKKYLIVPDLKTRTIYVTFEKGETPQEVIAGF 369


>gi|195484210|ref|XP_002090597.1| GE13200 [Drosophila yakuba]
 gi|194176698|gb|EDW90309.1| GE13200 [Drosophila yakuba]
          Length = 417

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 92/357 (25%), Positives = 164/357 (45%), Gaps = 19/357 (5%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
           F P GYP SV+E Y  Y  +   Q F S     L T ++L     G     A +   +WI
Sbjct: 53  FLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATATWI 112

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
           LK+G  HVG+++ +   G+++D + K+WR+ AD L DL  G+E+ + P  P    ++   
Sbjct: 113 LKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQILCC 172

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
               K I  VA  ATR  +    A  GNL+D+ +K  +  T  N+V    G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232

Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
                 I   ++S +H+Y+ ++ +RA  + + N  R  + + +F ++ R+ SP  +   E
Sbjct: 233 QAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNAME 291

Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDMVL 360
            +   G+ +  + N+K+G     ++   K   + E         E FI+    +   + L
Sbjct: 292 RVTV-GQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSSFDPHEHFIIAETKKCLGVYL 350

Query: 361 EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGW 417
                 +D L+ +    +A S  +  +      +++ Y  +   +   L+  Q +GW
Sbjct: 351 HFETRPQDVLKAYF---FAVSYLQDRNQ-----IKEKYWDIQTKWQEFLALAQQEGW 399


>gi|322787029|gb|EFZ13253.1| hypothetical protein SINV_10096 [Solenopsis invicta]
          Length = 468

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 87/342 (25%), Positives = 161/342 (47%), Gaps = 19/342 (5%)

Query: 54  DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
           +S  +  R+   F   F P GYP SV+  Y  Y  +  +Q F S     L+T S++   G
Sbjct: 35  ESLAIRSRITSVFKQIFLPQGYPDSVHPDYTAYQIWDTVQAFASTITGTLTTHSIMKGIG 94

Query: 114 LRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV 170
           +  + A   A  ++WILK G   +G ++ +   G ++D + K+WR+ AD L D+  G+E+
Sbjct: 95  VGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTKLDGQCKKWRLFADILDDIAKGIEL 154

Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
           L P        +  +    K I  VA  ATR  + +  A + N++D+ AK  +  T  N+
Sbjct: 155 LVPFFSSYSTVILCISTTMKSIVGVAGGATRTALINHQAIQNNVADVSAKDGSQETCVNL 214

Query: 231 VGLGAGIQLASTVCSSMQGKMIVG--PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
           +    GI + S       G+ ++     L  +H+Y+    ++A  ++TLN  R A+IV +
Sbjct: 215 IASFFGIFILSFF---HDGRYLLELYLFLVAVHLYANYSAVKALCLDTLNEDRLAIIVKN 271

Query: 289 FVKTGRISSPADLRYREDLLF---PGRLIEDAGNVKVGRD-----PHKVVKPSKFHELKE 340
           ++   +I  P  +   E +     PGR +    N+K+G          ++  ++   L +
Sbjct: 272 YMINEQIPKPGKVNKEESIFLLTNPGRNVH-GFNIKIGVSFTSILKRNIISLTEMEFLLK 330

Query: 341 LFAEEKFILCLGSRWTD--MVLEQNASGEDALRGWLVAAYAA 380
            F   K+I+ +     +  ++L+++    + L G+  A+  A
Sbjct: 331 FFENRKYIITIDIEKKNIFIILKKDIQSIEILEGYFYASMCA 372


>gi|330802843|ref|XP_003289422.1| hypothetical protein DICPUDRAFT_153796 [Dictyostelium purpureum]
 gi|325080505|gb|EGC34058.1| hypothetical protein DICPUDRAFT_153796 [Dictyostelium purpureum]
          Length = 552

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 71/250 (28%), Positives = 131/250 (52%), Gaps = 7/250 (2%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
           F PSGYP SV   Y RY  + ++Q   S+    L+T+++L  +  G       +    W+
Sbjct: 60  FLPSGYPDSVTSDYFRYQFWDSIQALCSSITGTLATRAILKGYGVGNSSATVASATTQWV 119

Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGL 185
           ++     +G++I +   G  +D   K+WR +AD L D+G   E++SPL  + LFL ++ +
Sbjct: 120 IQSATGMIGRIIFAWANGTDLDCNSKKWRFMADILNDIGMTFEMISPLFNEALFLPLSCI 179

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC- 244
           G FAK I  VA   T+  +   FAK  NL+D+ AK  +  T  +++G+   + ++S +  
Sbjct: 180 GLFAKSICGVAGGCTKASLTQHFAKNDNLADVSAKDGSQETAVSLLGMILSVFVSSVLND 239

Query: 245 -SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPADLR 302
            +S+     +  + +  H+Y   + + A  +N++N  R  +I   +++  G I SP ++ 
Sbjct: 240 DTSILVTWFIFFIFTGFHLYCNYKAVSAVQLNSINKYRAHLIYEYYIRNQGSIPSPKEIS 299

Query: 303 YREDLLFPGR 312
             E++LF  +
Sbjct: 300 NFENILFNSK 309


>gi|194880075|ref|XP_001974360.1| GG21126 [Drosophila erecta]
 gi|190657547|gb|EDV54760.1| GG21126 [Drosophila erecta]
          Length = 417

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 86/316 (27%), Positives = 148/316 (46%), Gaps = 11/316 (3%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
           F P GYP SV+E Y  Y  +   Q F S     L T ++L     G     A +   +WI
Sbjct: 53  FLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATATWI 112

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV-LSPLCPQLFLEMAGL 185
           LK+G  HVG+++ +   G+++D + K+WR+ AD L DL  G+E+ + P  P    ++   
Sbjct: 113 LKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQILCC 172

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
               K I  VA  ATR  +    A  GNL+D+ +K  +  T  N+V    G+ L S + S
Sbjct: 173 STLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLIKS 232

Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
                 I   ++S +H+Y+ ++ +RA  + + N  R  + + +F ++ R+ SP  +   E
Sbjct: 233 QAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNAME 291

Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-----EEKFILCLGSRWTDMVL 360
            +   G+ +  + N+K+G     ++   K   + E         E FI+    +   + L
Sbjct: 292 RVTV-GQTVSVSLNIKLGLSVKNLIDEYKSSSVIEDIVSSFDPHEHFIIAETKKCLGVYL 350

Query: 361 EQNASGEDALRGWLVA 376
                 +D L+ +  A
Sbjct: 351 HFETRPQDVLKAYFFA 366


>gi|270013094|gb|EFA09542.1| hypothetical protein TcasGA2_TC011650 [Tribolium castaneum]
          Length = 450

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 95/339 (28%), Positives = 161/339 (47%), Gaps = 14/339 (4%)

Query: 46  HLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLST 105
            +S + V DS  V   V+  F     P GYP SV+E Y  Y  +  +Q F S    + +T
Sbjct: 34  EISQESVADS--VLQSVLGFFREVLLPHGYPDSVSEDYFEYQLWDTVQAFCSTITGIFTT 91

Query: 106 QSLLFAAGLRPTPAQA--TVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALY 162
           Q++L   G+    A A    ++WI+KDG   VG++I +   G  +DS  K+WR+ AD L 
Sbjct: 92  QAILKGVGVGSAEASALSATITWIMKDGTGMVGRIIFAWWKGKSLDSNCKKWRLFADLLN 151

Query: 163 DLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGE 222
           D    LE+  P    L  E+  L    K +  VA  ATR  I +  A +GN++++ AK  
Sbjct: 152 DGAMILELSIPHVTHLSTEILCLSTLMKAMVGVAGGATRASITNHQAIKGNMAEIAAKDG 211

Query: 223 AISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRT 282
           +  TL N++    GI L +    +   + IV  LL  +H+YS  + + A     LN  R 
Sbjct: 212 SQETLVNLIASFTGIFLINYFTDATT-QWIVTFLLIFLHLYSNYQAVTALIFKNLNNLRM 270

Query: 283 AMIVADFVKTGRISSPADLRYREDLLFP-GRLIEDAGNVKV--GRDPHKVVKPSKFHELK 339
            +I+  ++    +  P  +   E +    G  +E     K+  G+   K ++     ELK
Sbjct: 271 TLILKSYLAVNAVLGPECINKTESVFLGFGTSVEKFCGFKIVLGQALGKTLRRYNSDELK 330

Query: 340 ---ELFAEEKFILC--LGSRWTDMVLEQNASGEDALRGW 373
              E++ ++K+++   L +R   +  E+  + ++ + G+
Sbjct: 331 SLTEIYKDKKYLIVPDLKTRTIYVTFEKGETPQEVIAGF 369


>gi|307111229|gb|EFN59464.1| hypothetical protein CHLNCDRAFT_138052 [Chlorella variabilis]
          Length = 657

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 74/230 (32%), Positives = 124/230 (53%), Gaps = 18/230 (7%)

Query: 106 QSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADAL 161
           +SLL A G+   R  PA AT+ +W+LKDG+  +G+L + +  G   DS+ KR+R     +
Sbjct: 78  ESLLQAVGVGARRSLPAAATI-NWVLKDGLGRLGRLTVATRFGESFDSDLKRFRYATSII 136

Query: 162 YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKG 221
           Y +   LE L+PL PQ FL MA L N  K I +    AT+   + SF    NL+D+ AK 
Sbjct: 137 YAVSLSLEFLTPLAPQHFLVMASLANVGKSIGLTTFIATQPAFHRSFCLRENLADISAKT 196

Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGK-----MIVGPLLSVIHVYSVIEEMRAAPVNT 276
           +A   + + +GL A + L + +C   +       +++ PLL+   ++++  E+R+  + T
Sbjct: 197 QAQQMVMDNIGLAAAVGL-TYLCRHTEAARRALPLVMFPLLAAGDLWAIYSELRSIHLRT 255

Query: 277 LNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDP 326
           LN +R  +I   +++ GR+ SP  +   E  + P        +++VGR P
Sbjct: 256 LNKERAEIIAQHWLREGRVPSPRQVSEEERFVLP-------PHIEVGRMP 298


>gi|405974674|gb|EKC39300.1| UPF0420 protein C16orf58-like protein [Crassostrea gigas]
          Length = 444

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 91/312 (29%), Positives = 147/312 (47%), Gaps = 22/312 (7%)

Query: 49  VKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSL 108
           +K VD S P +  +   F   F P GYP SV+  YL+Y  +  +Q F S+    L+ Q+L
Sbjct: 24  LKTVDLS-PKWTSLSQFFTVVFLPQGYPESVSSDYLQYQIWDTIQAFASSITGTLAAQAL 82

Query: 109 LFAAGLRPTPA--QATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLG 165
           L   G+    A   A  ++WILKDG   +G+++ + + G  +D + KRWR+ AD L D+ 
Sbjct: 83  LKGVGVGDESATVMAATLTWILKDGTGMIGRILFAWMQGTNLDCDAKRWRLFADILNDMA 142

Query: 166 TGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIS 225
             +E+L+P     F  +       K I  VA  ATR  +    A+  N++D+ AK  +  
Sbjct: 143 IFMEILAPYFKAYFTPIVCTAGVCKSIVGVAGGATRAALTQHQARRNNMADVSAKDGSQE 202

Query: 226 TLFNVVGLGAGIQLASTVC-------SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLN 278
           TL N+  L   + L   V        S      ++    +   V SV+ E       TLN
Sbjct: 203 TLVNLAALVCSLTLVPLVTGKDVLIWSLFLLFTLLHLFANYSAVTSVVME-------TLN 255

Query: 279 PQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV-KPSKFHE 337
             R  ++V  +++TG + S  ++ Y+E +L+  R      NV +G    KV  + S   E
Sbjct: 256 QARLHILVHHYLQTGEVLSLQEVNYKEPVLWKTR---RKLNVHLGTSIGKVFSRASILEE 312

Query: 338 LKELFAEEKFIL 349
              L+ +  ++L
Sbjct: 313 SIALYKDSNYLL 324


>gi|417401377|gb|JAA47577.1| Hypothetical protein [Desmodus rotundus]
          Length = 464

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 90/323 (27%), Positives = 153/323 (47%), Gaps = 26/323 (8%)

Query: 63  VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
           +   L  F P G+P SV+  YL Y  + ++Q F S+    L+T ++L   G+    A   
Sbjct: 70  LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129

Query: 121 ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
           A   +WI+KD    +G++I +   G+++D   K+WR+ AD L D+   LE+++P+ P  F
Sbjct: 130 AATATWIVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMCF 189

Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
                + N AK +  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +
Sbjct: 190 TVTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249

Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
              V       +    LL+ +HVY+    ++A  + TLN  R  +++  F++ G +  P 
Sbjct: 250 LPLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPT 309

Query: 300 DLRYREDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSR 354
                E L    +P        ++ +G   H+++  S   EL++L     E ++L     
Sbjct: 310 SANQMEPLWTGFWPSL------SLSLGAPLHRLI--SSVFELQQLVEGHREPYLL----H 357

Query: 355 W------TDMVLEQNASGEDALR 371
           W        +VL Q A  E  LR
Sbjct: 358 WDQSQNRVQVVLSQMAGPETILR 380


>gi|397615116|gb|EJK63231.1| hypothetical protein THAOC_16128 [Thalassiosira oceanica]
          Length = 621

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 101/365 (27%), Positives = 166/365 (45%), Gaps = 82/365 (22%)

Query: 56  RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
           + +  +   S    F P G    V   Y R+ ++R LQ F +A + V+ TQSLL   GLR
Sbjct: 141 KSIIRKCRQSLHKNFVPEG----VTPSYYRFMRWRILQRFMNANVHVIGTQSLLM--GLR 194

Query: 116 PTPAQATVVS----------------WILKDGMQHVGKLI----CSNLGARMDSEPKRWR 155
               ++TVVS                W+LKD +   GKL+     S +G + DS+ KRWR
Sbjct: 195 GM-QRSTVVSAAAKGGAALGAVAATNWVLKDTL---GKLVRMAWASKMGRKFDSDAKRWR 250

Query: 156 ILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG--- 212
             A  +Y LG GLEV + L P+ FL +A L N  K ++++ + ATR  +Y+SF +     
Sbjct: 251 YRACFIYALGNGLEVSTYLHPRFFLVLAMLANSCKQVSMLTSSATRNALYNSFRRMDDDV 310

Query: 213 -------------------------NLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
                                    N+ D+ AKGEA   + +++G+ +GI L+  +  S+
Sbjct: 311 DTVGTGSNSTLKSSINGGRRRGVVENIGDITAKGEAQIAVVDLLGIASGICLSRAIGVSV 370

Query: 248 QGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV----KTG---------- 293
           Q  + V  +L +  +Y +  E+R+    TLN +R   ++   +    ++G          
Sbjct: 371 QNVLTVWSVLQLGEIYCMYHEIRSIVYQTLNFERMYTVLGRLLPVDDESGSANGESCCCD 430

Query: 294 RISSPADLRYREDLLFP----GRLIEDAGNVKVGRDPHKVVKPSKFHEL-KELFAEEKFI 348
           RI +P  +   E +  P    GR       +  G     V+ P +   L +++F  E+F 
Sbjct: 431 RIPTPDQIAAEEKVFLPPDHLGR-----KEIAFGSAGRSVLSPGELDVLIRDVFRGERFF 485

Query: 349 LCLGS 353
           L +G+
Sbjct: 486 LAVGA 490


>gi|350411769|ref|XP_003489447.1| PREDICTED: UPF0420 protein C16orf58 homolog [Bombus impatiens]
          Length = 464

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 97/354 (27%), Positives = 171/354 (48%), Gaps = 19/354 (5%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
           F P GYP SV+  Y  Y  +  +Q F S  +  L+T S++   G+        A  ++WI
Sbjct: 53  FLPQGYPDSVHPDYTPYQIWDTVQAFASTIMGTLTTHSIMQGVGVGEAAATPLAAAITWI 112

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           LKDG   +G+++ +   G  +D + K+WR+ AD L DL  GLE+  P      L +  + 
Sbjct: 113 LKDGTGMIGRIVFAWWNGTDLDGQCKKWRLFADILNDLAMGLELFLPYFSSYSLGILCIS 172

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
              K I  VA  ATR  +    A + NL+D+ AK  +  T  N++    GI + S +  +
Sbjct: 173 TAMKSIVGVAGGATRAALTQHQALQNNLADVSAKDGSQETCVNLIASFVGILILS-IFHN 231

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
            +  M +   L +IH+Y+    ++A  +N+LN  R A+IV  ++    I  P ++  +E 
Sbjct: 232 GRYIMELYLFLVMIHLYANYMAVKALCLNSLNEDRLALIVKSYISNEVIPKPEEVNKKES 291

Query: 307 LLFPGR-LIEDAG-NVKVGRDPHKVVK-----PSKFHELKELFAEEKFILCLG--SRWTD 357
           +L   +  +   G N+K+G     ++K      S      +LF + K+++ +   ++   
Sbjct: 292 VLLLTKPTMSICGFNIKIGVSLAALIKEDIILTSDTELALKLFLDRKYLISIDVQNKTIF 351

Query: 358 MVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDK--MNDVFTPLL 409
           + L+++A   D L  +    + A +   F   SL V  D + K  ++D+  PL+
Sbjct: 352 ICLKKDAQPYDVLEAY----FHACLCGLFICMSLKVPLDVFVKPEVSDLSYPLM 401


>gi|242036507|ref|XP_002465648.1| hypothetical protein SORBIDRAFT_01g042950 [Sorghum bicolor]
 gi|241919502|gb|EER92646.1| hypothetical protein SORBIDRAFT_01g042950 [Sorghum bicolor]
          Length = 440

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 96/379 (25%), Positives = 180/379 (47%), Gaps = 26/379 (6%)

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
           ++    L  F P G+P SV   Y+ +  +  LQ  ++   ++LSTQ+LL A G+    A 
Sbjct: 54  QIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSAT 113

Query: 121 --ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
                  W L+D    +G ++ +   G+ +DS  K WR++AD + DLG  +++LSPL P 
Sbjct: 114 VIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPS 173

Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
             + +  LG+ ++    VA+ ATR  +   FA   N +D+ AK  +  TL  ++G+G G+
Sbjct: 174 SLIVIMCLGSLSRSFTGVASGATRAALTQHFALAKNAADISAKEGSQETLATMIGMGLGM 233

Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
            LA                L+V H+Y+  + +++  + TLN +R ++++  F + G + +
Sbjct: 234 LLAHITRGHALSVWTSFLSLTVFHMYANYKAVQSLSLTTLNYERASILLQYFKECGDVLA 293

Query: 298 PADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV----KPSKFH--------ELKELFAEE 345
           P  +  +E +L           +K+   PH+ V    K S           + +  +   
Sbjct: 294 PQKVSQQEHILPFWSTWRKLNKIKL---PHERVHLGAKASMLTHSDMLVIAKTRSHYENT 350

Query: 346 KFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVF 405
            ++L        + + + A+  D LR ++     AS        S +   +A   M++++
Sbjct: 351 NYLLLNKQGSVYIFIHKLATPADVLRSFVHGLVLAS--------STSQHLEARRWMDEMY 402

Query: 406 TPLLSELQAKGWHTDRFLD 424
           T  +S+LQ++G+ T+R L 
Sbjct: 403 TNFISKLQSEGYSTERLLS 421


>gi|66818028|ref|XP_642707.1| DUF647 family protein [Dictyostelium discoideum AX4]
 gi|74861464|sp|Q86K80.1|U420_DICDI RecName: Full=UPF0420 protein
 gi|60470803|gb|EAL68775.1| DUF647 family protein [Dictyostelium discoideum AX4]
          Length = 527

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 76/269 (28%), Positives = 139/269 (51%), Gaps = 11/269 (4%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P+GYP SV   Y  Y  + ++Q   S     L+T+++L   G+  + A   +    W+
Sbjct: 63  FLPNGYPDSVTTDYFGYQFWDSIQALCSTITGTLATRAILKGYGVGDSSATVASATTQWL 122

Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPL-CPQLFLEMAGL 185
           ++DGM  +G+++ +   G  +D   K+WR  AD L ++G   E++SPL   QLFL ++ +
Sbjct: 123 IRDGMGMIGRIVFAWRKGTDLDCNSKKWRYTADILNNIGMAFEMISPLFSSQLFLPLSCI 182

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV-- 243
           G  AK I  VA   T+  +   FAK  NL+D+ AK  +  T  N+VG+   + ++S +  
Sbjct: 183 GLIAKSICGVAGGCTKASLTQHFAKRDNLADVSAKDGSQETAVNLVGMLLSVIVSSFIND 242

Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPADLR 302
            +S+    +V    + +H++     + A  + ++N  R  +I   F+   G I SP+++ 
Sbjct: 243 NTSLIVTWLVFLFFTSLHLFCNYRAVSAVQLKSINRYRAYLIYDYFIHNQGSIPSPSEIS 302

Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
             E++LF  + ++    ++VG     + K
Sbjct: 303 KLENILFSIKELD----IRVGVSLCNIYK 327


>gi|301780856|ref|XP_002925848.1| PREDICTED: UPF0420 protein C16orf58 homolog [Ailuropoda
           melanoleuca]
 gi|281353232|gb|EFB28816.1| hypothetical protein PANDA_015415 [Ailuropoda melanoleuca]
          Length = 468

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 98/383 (25%), Positives = 177/383 (46%), Gaps = 43/383 (11%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P G+P SV+  YL Y  + ++Q F S+    L+T ++L   G+    A   A   +W+
Sbjct: 77  FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGLGVGNAKASVSAATATWL 136

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G+++ +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       
Sbjct: 137 VKDSTGMLGRILFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFFTMTVCTS 196

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   V + 
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSAC 256

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
               +     L+ +H+Y+    +RA  + TLN  R  +++  F++ G +  P      E 
Sbjct: 257 PSFSLGCFFFLTALHIYANYRAVRALVLETLNEGRLRLVLKHFLQRGEVLGPTSANQMEP 316

Query: 307 L---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSRW------ 355
           L    +P        ++ +G   H ++  S   EL++L    +E ++L    RW      
Sbjct: 317 LWTGFWPSL------SLSLGVPLHCMI--SSVFELQQLVEGHQEPYLL----RWDKSQRQ 364

Query: 356 TDMVLEQNASGEDALR----GWLVAAYAAS--MAKSFHD-----------PSLTVLQDAY 398
             +VL Q A  E  LR    G ++ A   +  + +   +            S  ++++ +
Sbjct: 365 VQVVLSQMAGPETILRAATHGLVLEALQGNGPLPRELEELRNQVRAGPEKDSWVIVRETH 424

Query: 399 DKMNDVFTPLLSELQAKGWHTDR 421
             ++ +F   L  LQ  GW T++
Sbjct: 425 QVLDKLFPKFLKGLQDAGWKTEK 447


>gi|218194530|gb|EEC76957.1| hypothetical protein OsI_15243 [Oryza sativa Indica Group]
          Length = 836

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 81/294 (27%), Positives = 147/294 (50%), Gaps = 38/294 (12%)

Query: 107 SLLFAAGL-RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLG 165
           +LL+A GL +     A  V+W+LKDG+ ++ K++ S  G   D  PK WR+ AD L +  
Sbjct: 476 ALLYAVGLGKGAIPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTA 535

Query: 166 TGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIS 225
            GLE+L+P+ P LF+ +       +  A +   ATR   Y+ FA + N +++ AKGEA  
Sbjct: 536 YGLEILTPVFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQG 595

Query: 226 TLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
            +   +G+  GI LA+ + SS+   +I    ++ +H+Y  ++  ++  + TLNP R +++
Sbjct: 596 MVSKFLGIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYRASLV 655

Query: 286 VADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG--RDPHKVVKP----------- 332
            ++++ +G++ S  ++   E L           N+ VG  R   K++ P           
Sbjct: 656 FSEYLLSGQVPSVKEVNDEEPLFL---------NLSVGTSRKESKILSPQAKDAAEIICR 706

Query: 333 -----SKFHELKE----------LFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
                SK  E+ E          L+  E+++L        ++L++ +S ED L+
Sbjct: 707 RLQLGSKLSEIIENKEDACALFDLYKNEQYLLTEYKGKFCVILKEGSSPEDMLK 760



 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 64/182 (35%), Positives = 95/182 (52%), Gaps = 9/182 (4%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
           + R     +    P GYP SV+  YL+Y+ +R +Q   S    VLSTQ+LL+A GL +  
Sbjct: 310 WRRCATVAVQLLLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGLGKGA 369

Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
              A  V+W+LKDG+ ++ K++ S  G   D  PK WR+ AD L +   GLE+L+P+ P 
Sbjct: 370 IPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTPVFPH 429

Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE--GNLSDLFAKGEAIS-TLFNVVGLG 234
           LF+ +       +  A +        I++ F  E   ++  + AKGEA    L   VGLG
Sbjct: 430 LFVPIGAAAGAGRSAAALIQT-----IFAHFMLENANHMRKVIAKGEAQGMALLYAVGLG 484

Query: 235 AG 236
            G
Sbjct: 485 KG 486


>gi|296233362|ref|XP_002761979.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1 [Callithrix
           jacchus]
          Length = 468

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 96/376 (25%), Positives = 173/376 (46%), Gaps = 29/376 (7%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
           F P G+P SV+  YL Y  + ++Q F S+    L+T ++L   G+    A   A   +W+
Sbjct: 77  FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKATVSAATATWL 136

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G++I +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       
Sbjct: 137 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTS 196

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   V   
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGF 256

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
                     L+ +H+Y+    +RA  + TLN  R  +++  +++ G +  P     R +
Sbjct: 257 PGFSFGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTTAN-RME 315

Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQ 362
            L+ G       ++ +G   H++V  S   EL++L    +E ++LC         +VL Q
Sbjct: 316 PLWTG--FWPTPSLSLGVPLHRLV--SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQ 371

Query: 363 NASGEDALR----GWLVAAYAAS-------------MAKSFHDPSLTVLQDAYDKMNDVF 405
            A  +  LR    G ++ A                 +       +  ++++ +  ++ +F
Sbjct: 372 KADPKTVLRAATHGLMLGALQGDGPLPAELEELRNHVQAGPKKETWVIVKETHQVLDMLF 431

Query: 406 TPLLSELQAKGWHTDR 421
              L  LQ  GW T++
Sbjct: 432 PKFLKGLQDAGWKTEK 447


>gi|66816549|ref|XP_642284.1| DUF647 family protein [Dictyostelium discoideum AX4]
 gi|60470350|gb|EAL68330.1| DUF647 family protein [Dictyostelium discoideum AX4]
          Length = 567

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 75/273 (27%), Positives = 146/273 (53%), Gaps = 13/273 (4%)

Query: 60  HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA-GLRPTP 118
           +++ DS    F P GYP +V   Y+ Y ++  +Q+   +   VLST +LL +  G+  T 
Sbjct: 117 NKLFDSIKKLFLPKGYPSTVTSDYIGYQKWMFIQNVIGSTTYVLSTHALLTSVIGISSTS 176

Query: 119 AQ--ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           A   A  +SW+LKDG+     ++ +N      D + K+++   D L++ G  LE+ +P  
Sbjct: 177 ALPFAAAISWVLKDGLGATALVLFANKYSTLFDYDIKKYKFRGDLLHNFGVLLEMFTPFF 236

Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK--GEAISTLFNVVGL 233
            Q FL +A + N +KG+A +   +TR  +  SF+ + N+ D+ AK   ++++   + + +
Sbjct: 237 QQFFLPLASISNLSKGLAGLIYGSTRASLNRSFSLKENIGDITAKYQSQSMAAYLSGMAI 296

Query: 234 GAGIQLASTV---CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
           G+ I +  +V    S     M+ G  LS IH++  I  +++  + TLN QRT++++ +++
Sbjct: 297 GSSIGMILSVFPLTSVSNLSMVFG--LSFIHIFCGINAIKSIDLKTLNQQRTSILIDNWL 354

Query: 291 KTGRISSPADLRYREDLLFPGRLIEDAGNVKVG 323
           K+  I +   +   E  +   + ++   N+K+G
Sbjct: 355 KSNLILTSKIVNQEEKFITNDKYLK--PNIKLG 385


>gi|332030173|gb|EGI69967.1| UPF0420 protein C16orf58-like protein [Acromyrmex echinatior]
          Length = 471

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 97/379 (25%), Positives = 178/379 (46%), Gaps = 17/379 (4%)

Query: 54  DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
           +S+ V+ R+   F   F P GYP SV+  Y  Y  +  +Q F S     L+T S++   G
Sbjct: 41  ESQTVHSRITSVFKQIFLPQGYPNSVHPDYTAYQIWDTVQAFASTITGTLTTHSIMKGIG 100

Query: 114 LRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEV 170
           +  + A   A  ++WILK G   +G ++ +   G  +D + K+WR+ AD L D+  G+E+
Sbjct: 101 VGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTELDGQCKKWRLFADILDDIAKGIEL 160

Query: 171 LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
           L P      + +  +    K I  VA  ATR  +    A + N++D+ AK  +  T  N+
Sbjct: 161 LVPYFSSYSVIILCISTTMKSIVGVAGGATRTALIHHQAIQNNVADVSAKDGSQETCVNL 220

Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
           +    GI + S +    Q  + +   L  +H+Y+    ++A  ++TLN  R A+IV  ++
Sbjct: 221 IASFFGIFILS-LFHDGQYVLELYLFLVAVHLYANYSAVKALCLDTLNEDRLAIIVKKYM 279

Query: 291 KTGRISSPADLRYREDLLF---PGRLIEDAGNVKVGRDPHKVVKPS--KFHELK---ELF 342
              +I  P  +   E +     P R I    ++K+G     ++K +   F E++   + F
Sbjct: 280 MNEQIPEPWKVNKEESVFLLKNPTRDIH-GFDIKIGVSFANILKRNIISFTEMEFLLKFF 338

Query: 343 AEEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDK 400
              K+I+ +    R   ++L+++    + L  +  A+  A         S+ VL  +  +
Sbjct: 339 ENRKYIILIDIEKRNIFVILKKDIQSVEILEAYFYASMCAFYLCIARKISIDVLLKS--E 396

Query: 401 MNDVFTPLLSELQAKGWHT 419
            +++  PLL+    +  HT
Sbjct: 397 TSEISYPLLNTYILQRIHT 415


>gi|290999547|ref|XP_002682341.1| predicted protein [Naegleria gruberi]
 gi|284095968|gb|EFC49597.1| predicted protein [Naegleria gruberi]
          Length = 261

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 77/260 (29%), Positives = 132/260 (50%), Gaps = 16/260 (6%)

Query: 67  LNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT---- 122
           ++ F P  YP SV+  Y+ Y +++ALQ+   +   VL+ QSLL AAG+    A       
Sbjct: 2   ISVFLPKNYPLSVSSNYINYCKWQALQYTAGSFSGVLTMQSLLHAAGITSQLALGGNEYF 61

Query: 123 ------VVSWILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
                  ++W++KDG+  +G  L  S +    D++PK WR+  +        LEV +P+ 
Sbjct: 62  LAFLGGALAWVIKDGLGQLGGILFASKVNTNFDADPKFWRMGGEYALVASALLEVTTPIT 121

Query: 176 -PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
            P  F+  A L N  K ++  AA ATR  +  SFAK  NL+D+ AK  + +   +++G  
Sbjct: 122 GPSWFIAQASLANIGKNVSCFAASATRAAMNQSFAKSDNLADVTAKATSQALACSLIGTT 181

Query: 235 AGIQLASTVCSSMQ---GKMI-VGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
            GI ++S +    Q    K+  +  +LSVI ++S+ + +    +  LN QR  +  + ++
Sbjct: 182 LGIFVSSALSLGTQFVFAKVFPIFCVLSVIQLFSLYKAVSNVKLKVLNKQRFLIACSQYL 241

Query: 291 KTGRISSPADLRYREDLLFP 310
           K   +  P+ +  RE  + P
Sbjct: 242 KDRAVLDPSKVSEREKFVLP 261


>gi|170582654|ref|XP_001896227.1| cDNA sequence BC017158 [Brugia malayi]
 gi|158596613|gb|EDP34929.1| cDNA sequence BC017158, putative [Brugia malayi]
          Length = 389

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 133/243 (54%), Gaps = 3/243 (1%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
           F P GYP SV+  Y+ Y  +  +Q F S+  S LST+++L   G+  +     A  V+W+
Sbjct: 42  FMPRGYPQSVSPDYMNYQMWDTIQAFASSMSSALSTEAILRGVGVGNKAASTMAAAVAWL 101

Query: 128 LKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           LKDG+  + +++ + L +  +D++ K+WR++AD   DL   L++++P+ P LF+ +  L 
Sbjct: 102 LKDGIGMLARILFAWLYSPYLDADCKQWRLIADCFNDLAFCLDLITPIFPSLFMPIICLS 161

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           +  + +  VA  ATR  + +  A   N+ D+ AK  +  TL NV  L   + L   V  +
Sbjct: 162 SMIRAVVGVAGSATRTTVTNHQAILDNVGDVAAKDGSQETLINVFALLCSLLLLPVVSRN 221

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
           +    ++  L + IH+Y     +++    TLN     ++V D+++T +IS+ +++  +E 
Sbjct: 222 IVSVWLLFSLFTFIHLYGNYRAVKSLQFRTLNQSLLRIVVKDYIETRKISTVSEMNSKEP 281

Query: 307 LLF 309
           +L 
Sbjct: 282 ILL 284


>gi|395842960|ref|XP_003794274.1| PREDICTED: UPF0420 protein C16orf58 homolog [Otolemur garnettii]
          Length = 468

 Score =  117 bits (294), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 96/375 (25%), Positives = 172/375 (45%), Gaps = 29/375 (7%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 77  FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGLGVGNAKASVSAATATWL 136

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G+++ +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       
Sbjct: 137 VKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPICFTMTVSTS 196

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   V   
Sbjct: 197 NLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLILPLVSDC 256

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
               +     L+ +H+Y+    +RA  + TLN  R  +++  F++ G +  P      E 
Sbjct: 257 PSFSLGCFFFLTALHIYANYRAVRAIVMETLNQGRLRLVLKHFLQRGEVLDPTSANKMEP 316

Query: 307 LLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQ 362
           L    R    + ++ +G   H ++  S   +L++L    +E ++L          +VL Q
Sbjct: 317 LWTGFR---SSPSLSLGVPLHHLI--SSVFDLQQLVEGHQEPYLLYWDQSRNQVQVVLSQ 371

Query: 363 NASGEDALR----GWLVAAYAA-------------SMAKSFHDPSLTVLQDAYDKMNDVF 405
            A+ E  LR    G ++ A                 + +     S  ++++ +  ++  F
Sbjct: 372 MAASETILRAATHGLVLGALQGDGPLPRELEELRNQVREGPEKNSWVIVREVHRVLDVFF 431

Query: 406 TPLLSELQAKGWHTD 420
              L  LQ  GW T+
Sbjct: 432 PKFLKGLQDAGWKTE 446


>gi|449461779|ref|XP_004148619.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
 gi|449518467|ref|XP_004166263.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 495

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 97/367 (26%), Positives = 165/367 (44%), Gaps = 18/367 (4%)

Query: 71  FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVS 125
            P+G+P SV++ YL+Y   +   + T      L T SLL A G+       T A A  + 
Sbjct: 119 LPTGFPESVSDDYLQYMIRQFPTNVTGWICHTLVTSSLLKAVGIGSFSGTTTAASAVAIR 178

Query: 126 WILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
           W+ KDG+  VG+L I    G   D +PK+WR+ AD +   G+  ++ +PL P  FL +A 
Sbjct: 179 WVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIFDLATPLYPSYFLPLAS 238

Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
           LGN  K +A      +   I + FA  GNL ++ AK E    +  ++GL  GI +  T  
Sbjct: 239 LGNLTKAVARGLKDPSFRVIQNHFAVSGNLGEIAAKEEVWEVVAQLLGLAIGILILDTPG 298

Query: 245 SSMQGKMIVGPLLS--VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
                 ++    LS  ++H++   + +     NT+N +R  ++    V   ++    +  
Sbjct: 299 LVNSYSVLSATWLSMRLLHLWLRYQSLAVLHFNTINLKRARILARAHVLHNKVPGTVECN 358

Query: 303 YREDLLFPGRLIEDAGNVKVGRDPHKVV---KPSKFHELKELFAEEKFILCLGSRWTDM- 358
             E +L   R    +  +  G    +++   + S   +L +L+A EK+IL L S+  D+ 
Sbjct: 359 SEESILVWERFTRPS--IIFGVPLEEMMGSERSSTVMKLLKLYANEKYILMLDSQDKDLK 416

Query: 359 ---VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAK 415
                +  AS    LR  +   Y  +      +  +  L  +  +M D F   +  L+  
Sbjct: 417 VFVSFKVGASSMTVLRS-IWQTYWLNKHLDTTENVIHQLAQSLSEMEDKFNEFVQLLEGA 475

Query: 416 GWHTDRF 422
           GW T + 
Sbjct: 476 GWDTHQL 482


>gi|46137815|ref|XP_390599.1| hypothetical protein FG10423.1 [Gibberella zeae PH-1]
          Length = 425

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 109/409 (26%), Positives = 186/409 (45%), Gaps = 41/409 (10%)

Query: 22  QEIPIYWFETSDSVTRHYQFQPDGH--LSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSV 79
           +   I  ++ S +  R +   PD    L  KV     P  H         F P GYP SV
Sbjct: 26  ERTEIIEYDKSGNFVRSWTHGPDNIVVLKSKVCTSWLPSMHEA-------FMPVGYPHSV 78

Query: 80  NEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGK 137
           +  YL Y  F ++Q F S   S+L+ ++LL   G+    + AT  ++  +LKD +  V  
Sbjct: 79  SSDYLNYQFFDSMQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVLKDAISRVAT 138

Query: 138 L-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAKGIAVV 195
           +      G R++ + KR+R LAD   D    +E+ SP L P   +     G   + +  V
Sbjct: 139 IAFAQQFGLRIEPDAKRFRFLADLFNDTAFFMELYSPYLGPYGKILALTTGEALRALCGV 198

Query: 196 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--CSSMQGKMIV 253
           AA A++  +   FAK  NL++L AK  +  T   ++GL  G  +   V   +++   MIV
Sbjct: 199 AAGASKAALSVHFAKHDNLAELNAKEASQETAIGLIGLAVGTLVVKYVEDHNTVVFLMIV 258

Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL--FPG 311
              L + H++     +R   ++ LN QR  ++  +++KTG+I +P ++  RE +L  +  
Sbjct: 259 ---LVLAHLWMNYLGVRCVCMDNLNRQRATILFDEYLKTGKIMTPEEVATRESILIWWSS 315

Query: 312 RLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALR 371
             IE AG+       +K  K     ++  + A +   L +G     ++L ++++   A+ 
Sbjct: 316 TRIEMAGSY------YKATK-----DMVNVIATDGSTLFVGPHGIKIMLWEDSTAVQAID 364

Query: 372 GWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
            W  A     M  S+   S     D ++K   +   LL  ++ KGW  D
Sbjct: 365 AWFTA-----MKISYSKDS-----DYHEKKGALEESLLEGMRVKGWRLD 403


>gi|340516831|gb|EGR47078.1| predicted protein [Trichoderma reesei QM6a]
          Length = 448

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 111/421 (26%), Positives = 193/421 (45%), Gaps = 36/421 (8%)

Query: 22  QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVV----DSFLNKFFPSGYPF 77
            ++ I   + S  V + +    DGH+ + +   S   + R V     +  + F P G+P 
Sbjct: 23  NDVDIIELDKSGHVVQRWLHAHDGHV-LPLRRRSNSFFSRSVREWRKALFDSFLPVGFPH 81

Query: 78  SVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHV 135
           SV++ YL Y  F +LQ F S   S+L+ ++LL   G+    + AT  ++  ILKD M  +
Sbjct: 82  SVSKDYLSYQFFDSLQAFFSTISSLLANRALLQGLGVGDASSSATFALLLTILKDAMSRI 141

Query: 136 GKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA-GLGNFAKGIA 193
             ++ ++  G R++ + KR+R LAD   D    LE+ SP        +A   G   + + 
Sbjct: 142 ATIVFAHRFGLRIEPDAKRYRFLADLFNDTAFFLELYSPYLGSWGKVVALSTGEALRALC 201

Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIV 253
            VAA A++  +   FA++ NL++L AK  +  T   ++GL  G  +  TV  +    +++
Sbjct: 202 GVAAGASKAALSVHFARQDNLAELNAKEASQETAVGLIGLLVGTLVVKTV-ENHSAVVLL 260

Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRL 313
             +L + H+      +R+  +NT N QR  ++  +++K+G++ SP  +  RE ++F    
Sbjct: 261 MIVLVLAHLGMNYLGVRSVCMNTFNRQRATILFEEYLKSGKVLSPEQVANRESIIFWSPA 320

Query: 314 IEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWT--------------DMV 359
           I +A   +V    H     S  H +    A EK I+  G+  T               ++
Sbjct: 321 IRNAQGKEV---VHLEFAESYSHAMSCSDAHEKLIIVDGAMHTRFICSRVPGRLTRIKLL 377

Query: 360 LEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHT 419
           L   AS  DA+  W  AA   ++A    D        + D        LL +L+ +GW  
Sbjct: 378 LWDGASPMDAVLAWF-AAMEIAIAGKDRDSEFVPRYKSED--------LLEQLKRQGWSA 428

Query: 420 D 420
           D
Sbjct: 429 D 429


>gi|414879010|tpg|DAA56141.1| TPA: hypothetical protein ZEAMMB73_060175 [Zea mays]
          Length = 368

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 62/166 (37%), Positives = 95/166 (57%), Gaps = 1/166 (0%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPT 117
           + R  +  ++   P GYP SV+  YL Y+ +R +Q   S    VLSTQ+LL+A GL +  
Sbjct: 186 WRRYAEVAVHLLLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGLGKGA 245

Query: 118 PAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
              A  V+W+LKDG+ ++ K++ S  G   D  PK WR+ AD L ++  GLE+L+P+ P 
Sbjct: 246 IPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPH 305

Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
           LF+ +       +  A +   ATR   Y+ FA + N +++ AKGEA
Sbjct: 306 LFVPIGAAAGAGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEA 351


>gi|324514354|gb|ADY45840.1| Unknown [Ascaris suum]
          Length = 394

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 77/247 (31%), Positives = 129/247 (52%), Gaps = 3/247 (1%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
           F++ F P GYP SV+  Y+ Y  +  +Q F S+  S L+T+++L  AG+      A A  
Sbjct: 44  FVDVFLPRGYPHSVSADYMSYQIWDTVQAFASSMTSALATEAVLRGAGVGDENASALAAT 103

Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
           ++W+++DG+  + ++  + L  A++D++ KRWR++AD L DL   LE+L+   P LF  +
Sbjct: 104 LTWLIRDGLGMLSRIGFAWLKSAQLDADCKRWRLVADFLNDLAFSLELLAAALPSLFTLL 163

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
             L + A+ I  VA  ATR  +    A+  N++D+ AK  +  TL NV  L   + L   
Sbjct: 164 VCLSSLARSIVAVAGGATRTTVVQHQARCNNVADVSAKDGSQETLVNVTALLCSLILLPL 223

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
           +        +   L + +H+++    + A    TLN +   +I   FV+TG + S     
Sbjct: 224 ISGRTVLIWMFYILFTGVHLFANYAAVTALHFETLNQKLLRIITNHFVETGDVFSVKVAN 283

Query: 303 YREDLLF 309
             E LLF
Sbjct: 284 ELEPLLF 290


>gi|345492163|ref|XP_001602603.2| PREDICTED: UPF0420 protein C16orf58 homolog [Nasonia vitripennis]
          Length = 466

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 95/344 (27%), Positives = 164/344 (47%), Gaps = 23/344 (6%)

Query: 51  VVDDSRPVY-----HRVVDSFLNK-FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLS 104
           VV + +PV        V  SF  + F P G+P SV++ Y+ Y  +  LQ F S     L+
Sbjct: 29  VVTNFKPVRGTKFAESVTYSFFKEVFLPQGFPDSVHKDYVPYQIWDTLQAFASTINGTLT 88

Query: 105 TQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADAL 161
           T S++   G+  + A   A  ++WILKDG   +G++I +   G  +DS+ K+WR+ AD L
Sbjct: 89  THSIMRGVGVGESTATPLAAAITWILKDGTGMIGRIIFAWWQGNNLDSQCKKWRLFADIL 148

Query: 162 YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKG 221
            D    LEV  P           L    K I  +A  ATR  I    A + N++D+ AK 
Sbjct: 149 NDAAMALEVTVPYISTFSSYTLCLTTGMKAIVGIAGGATRTAIMQHHAIKDNMADVSAKE 208

Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
            +  TL N+ G   GI +      + +  + V  LL ++H+ S    + +  +NTLN  R
Sbjct: 209 HSQGTLVNLAGSIVGILILLL--INEKFFIYVCLLLVIVHIVSNYFAVTSLELNTLNEDR 266

Query: 282 TAMIVADFVKTGRISSPADLRYREDLLF----PGRLIEDAGNVKVGRDPHKVVKPSKFHE 337
             +I+ D++    +S+  D+  RE ++     P + I    ++++G     V+K +  + 
Sbjct: 267 LCLIIEDYILNHTMSNVEDINSRESVIILLEKPAKKIF-GFDIEIGVSFESVLKMNLINT 325

Query: 338 -----LKELFAEEKF--ILCLGSRWTDMVLEQNASGEDALRGWL 374
                L  LF   K+  ++ L ++   ++L ++A  +  L+ + 
Sbjct: 326 KDVRFLLALFQRRKYLPVMDLKAKKIHIILSKDADDDIILKAYF 369


>gi|302753580|ref|XP_002960214.1| hypothetical protein SELMODRAFT_75120 [Selaginella moellendorffii]
 gi|300171153|gb|EFJ37753.1| hypothetical protein SELMODRAFT_75120 [Selaginella moellendorffii]
          Length = 381

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 94/365 (25%), Positives = 167/365 (45%), Gaps = 25/365 (6%)

Query: 78  SVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV----VSWILKDGMQ 133
           SV++ YL+Y  ++   + T      L T SLL A GL   P  A      + W+ KDG+ 
Sbjct: 2   SVSQDYLQYMLWQLPTNITGWICRALITSSLLQAVGLSTEPGSAAAASAAIKWVTKDGLG 61

Query: 134 HVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
            VG+L I    G   D +PK+WR+ A+ +   G   E+ +PL P  FL +A LG+  K +
Sbjct: 62  AVGRLFIGGRFGGVFDEDPKQWRLYAEVIGSFGGLFELATPLAPDYFLLLASLGHLTKAV 121

Query: 193 AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST--VCSSMQGK 250
           A      +   I + FA   N+ D+ AK E       ++GL  G+ L ST  V +S    
Sbjct: 122 ARGLKDPSFRVIQNHFAISENVGDVAAKEEVWEVAGQLIGLSLGVVLLSTPVVATSYSQL 181

Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
            +   ++   H++   + + A  + T+N +R  +++ + ++   +S   +   +E+L+ P
Sbjct: 182 ALSWMIIRSFHLWLRYQTLAALELPTMNFKRLNVLIMEHIQGRPLSGVGECNKKENLILP 241

Query: 311 GRLIEDAGNVKVGRDPHKVV----KPSKFHELKELFAEEKFILCLGSRWTD--------M 358
             ++     +++G    +V+     P +  E  EL+  EK++L + S++           
Sbjct: 242 YTMLMP--QIRLGCSLREVITLGKTPLEIEEAFELYNSEKYVLIVLSQFLSSCSCNIKFR 299

Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWH 418
           V +        LR  +  A+    +KSF   +   L+ +   M   F   L EL A GW+
Sbjct: 300 VCQDGCENTTLLRS-IYQAHVLFHSKSFDTFT---LKQSLTSMQKSFASFLDELVAAGWN 355

Query: 419 TDRFL 423
             + +
Sbjct: 356 PSKLV 360


>gi|387019749|gb|AFJ51992.1| UPF0420 protein C16orf58-like [Crotalus adamanteus]
          Length = 437

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 84/284 (29%), Positives = 136/284 (47%), Gaps = 24/284 (8%)

Query: 37  RHYQFQPDGHLSVKVVDDSRP----VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRAL 92
           R YQ  P+G L+ K  D S+      +HR+  S    F P GYP SV+  YL Y  +  +
Sbjct: 20  RRYQALPNGKLNSKT-DASKDWHLNSFHRIFKSI---FLPQGYPESVSTDYLAYQFWDTI 75

Query: 93  QHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSE 150
           Q F S+    L+TQ++L   G+    +   A  V+WILKDG   +G++  +         
Sbjct: 76  QAFASSITGTLATQAVLKGVGVGDETSTVAAAAVTWILKDGTGMLGRIAFA--------- 126

Query: 151 PKRWR--ILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSF 208
              W   + AD L D+   +E+++P  P  F  +     F K I  VA  ATR  +    
Sbjct: 127 ---WSKGLFADVLNDVAIFMEIMAPAFPACFTLIVCTSGFFKCIVGVAGGATRAALTMHQ 183

Query: 209 AKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEE 268
           A+  N++D+ AK  +  TL N+ GL   + L   V  ++        L +++H+Y+  + 
Sbjct: 184 ARRDNMADVSAKDGSQETLVNLAGLLFSLFLIPLVVDNLPLTYAFYALFTILHLYANYQA 243

Query: 269 MRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGR 312
           +RA  + TLN  R  +++  ++K   +  PA +  +E LL   R
Sbjct: 244 VRAVCMETLNRARLHLVLQHYLKREEVLGPAVINPQEPLLLGFR 287


>gi|260815171|ref|XP_002602347.1| hypothetical protein BRAFLDRAFT_234382 [Branchiostoma floridae]
 gi|229287656|gb|EEN58359.1| hypothetical protein BRAFLDRAFT_234382 [Branchiostoma floridae]
          Length = 244

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 70/239 (29%), Positives = 125/239 (52%), Gaps = 3/239 (1%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
           F P GYP SV++ YL Y  +  +Q F S+    L+TQ++L   G+      A    ++WI
Sbjct: 4   FLPQGYPESVSDDYLSYQIWDTVQAFCSSITGTLATQAVLKGVGVGDETATAAGATITWI 63

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           LKDG   +G+++ + L G  +D + KRWR+ AD + D    LE+++P  P  F  +  + 
Sbjct: 64  LKDGTGMLGRILFAWLQGTSLDCDAKRWRLFADIVNDFSIFLELVAPAFPLFFTGIVCVA 123

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
              K I  V+  ATR  +    A+  N++D+ AK  +  TL N++ L AG+ +   V  +
Sbjct: 124 GVGKAIVGVSGGATRAALTMHQARRDNMADVSAKDGSQETLVNLMALLAGLVVTPMVADN 183

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
                 +  + + +H+++    +    + TLN  R  ++V+ +++TG + +PA    +E
Sbjct: 184 PLLTWFLFFVFTCLHLFANYNAVTCVVMETLNQSRLHILVSHYLRTGVVLNPAQANVQE 242


>gi|302768148|ref|XP_002967494.1| hypothetical protein SELMODRAFT_86604 [Selaginella moellendorffii]
 gi|300165485|gb|EFJ32093.1| hypothetical protein SELMODRAFT_86604 [Selaginella moellendorffii]
          Length = 412

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 103/394 (26%), Positives = 180/394 (45%), Gaps = 34/394 (8%)

Query: 58  VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT 117
           V+ R  +SF    F +G   SV++ YL+Y  ++   + T      L T SLL A GL   
Sbjct: 4   VFWRFSESFGFLQFSAG---SVSQDYLQYMVWQLPTNITGWICRALITSSLLQAVGLSTE 60

Query: 118 PAQATV----VSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLS 172
           P  A      + W+ KDG+  VG+L I    G   D +PK+WR+ A+ +   G   E+ +
Sbjct: 61  PGSAAAASAAIKWVTKDGLGAVGRLFIGGRFGGVFDQDPKQWRMYAEVIGSFGGLFELAT 120

Query: 173 PLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG 232
           PL P  FL +A LG+  K +A      +   I + FA   N+ D+ AK E       ++G
Sbjct: 121 PLAPDYFLLLASLGHLTKAVARGLKDPSFRVIQNHFAISENVGDVAAKEEVWEVAGQLIG 180

Query: 233 LGAGIQLAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
           L  G+ L ST  V +S     +   ++   H++   + + A  + T+N +R  +++ + +
Sbjct: 181 LSLGVALLSTPVVATSYSQLALSWMIIRSFHLWLRYQTLAALELPTMNFKRLNVLIMEHI 240

Query: 291 KTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV----KPSKFHELKELFAEEK 346
           +   +S  ++   +E+L+ P  ++     +++G    +V+     P +  E  EL+  EK
Sbjct: 241 QGRPLSGVSECNKKENLILPYTMLMP--QIRLGCSLREVITLGKTPLEIEEAFELYNSEK 298

Query: 347 FILCLGSRWTDM-----VLEQ---NASGEDALRGWLVAAYAASMAKS-------FHDPSL 391
           ++L + S+ ++M     VL Q   + S     R         ++ +S       FH  S 
Sbjct: 299 YVLIVFSK-SNMGILLKVLSQFLSSCSCNIKCRVCQDGCENTTLLRSIYQAHVLFHSKSF 357

Query: 392 T--VLQDAYDKMNDVFTPLLSELQAKGWHTDRFL 423
               L+ +   M   F   L EL A GW+  + +
Sbjct: 358 DTFTLKQSLTSMQKSFASFLDELVAAGWNPSKLV 391


>gi|302766892|ref|XP_002966866.1| hypothetical protein SELMODRAFT_64357 [Selaginella moellendorffii]
 gi|300164857|gb|EFJ31465.1| hypothetical protein SELMODRAFT_64357 [Selaginella moellendorffii]
          Length = 265

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 75/242 (30%), Positives = 130/242 (53%), Gaps = 3/242 (1%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV--VSWI 127
           F P GYP SV+  YL +  +  LQ  ++   S+LSTQ+LL   G+    A A      W 
Sbjct: 23  FLPEGYPSSVSSDYLAFQTWDTLQGLSTYIRSMLSTQALLSGIGVGAATATAIGATFQWF 82

Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           L+D    VG ++ +   G+ +D+  K+WR+ AD + D+G  ++++SPL P+ F+    LG
Sbjct: 83  LRDLTGMVGGIVFTLYQGSNLDNRAKQWRLAADFMNDIGMLMDLVSPLFPRAFMFFLCLG 142

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           + A+ +  VA  ATR  +   FA++ N +D+ AK  +  T   +VG+  G+ LA    ++
Sbjct: 143 SAARAVTGVAGGATRAALTQHFARKQNAADVSAKEGSQETAATLVGMIVGMFLARLTANN 202

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
           +         L+  HVY+  + + A  + ++N +R A+++  F+K G+  SP +   +E 
Sbjct: 203 IVLMWTSFMSLTAFHVYANYKAVCALCLTSINAERMAIVLQSFLKDGKAPSPEEASAQES 262

Query: 307 LL 308
            L
Sbjct: 263 AL 264


>gi|339250912|ref|XP_003374441.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316969248|gb|EFV53376.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 340

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 67/225 (29%), Positives = 116/225 (51%), Gaps = 3/225 (1%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P GYP SV E Y  Y  + +LQ F S+  +  +  ++  A G+    A   A  V W+
Sbjct: 55  FLPQGYPDSVTEDYFDYQCWDSLQAFCSSVTNAFALHAVFKAVGVGNESASLLAATVVWL 114

Query: 128 LKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           L++G+  + +++ +   A  +DS+ K WR +AD L D    L++++PL    F+ +A L 
Sbjct: 115 LRNGVGMMSQIMFAWFCAGHLDSDCKSWRFVADVLNDCSIFLDLIAPLSESTFVYVACLA 174

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N  + I  +A  ATR  I    ++  N++D+ AK  +  T+ N++ LG  + L   V   
Sbjct: 175 NVCRSIVGIAGGATRAAITQHQSRRHNMADVAAKDSSQETMVNLIALGFNLALLPLVAHH 234

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
           +     +  + + +H+Y+    +RA    TLN +R  M+V  F+K
Sbjct: 235 VLLVWTLFLMFTFMHIYANYRAIRALKFETLNEKRFRMLVQSFIK 279


>gi|74215416|dbj|BAE41911.1| unnamed protein product [Mus musculus]
          Length = 337

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 75/243 (30%), Positives = 125/243 (51%), Gaps = 7/243 (2%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
             P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 75  LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G++I +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV--C 244
           N AK I  VA  ATR  +    A+  N++D+ AK  +  T+ N+ GL   + +   V  C
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254

Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
            S+     V  LL+ +H+Y+    +RA  + TLN  R  +++  F++ G +  PA     
Sbjct: 255 PSLSLGCFV--LLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQM 312

Query: 305 EDL 307
           E L
Sbjct: 313 EPL 315


>gi|449018513|dbj|BAM81915.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 600

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 80/246 (32%), Positives = 122/246 (49%), Gaps = 23/246 (9%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT------- 122
           F P+ YP SV   Y  Y  +  L +  +A  SVL T +LL +AG+   P++A        
Sbjct: 158 FLPTNYPHSVTASYAPYAFWHFLHNTLTACNSVLGTTALLVSAGVVADPSRAASTALTRD 217

Query: 123 --------------VVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTG 167
                         V SWILKDG+ H+ +L   S    R D + KR R+LAD L+ LGTG
Sbjct: 218 PALEHLPTAASASAVTSWILKDGVGHLVRLAFGSRYAHRFDGDLKRLRVLADVLWHLGTG 277

Query: 168 LEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTL 227
           LE++S   P  F+ +A      KG+A V   +TR  IY + A   N+ D+ AKG+A S  
Sbjct: 278 LELVSRSFPDSFIGLASGAYALKGMAHVVFNSTRSTIYRTVATRSNIGDVTAKGDAQSIA 337

Query: 228 FNVVGLGAGIQLASTVC-SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
             ++G+  G+ L      +++     +  ++    +YS I+ MR   +  LN QR  ++V
Sbjct: 338 AELLGMSIGLSLYRGFAENNLFWTWAIFGIVVAGQMYSRIQSMRVLVLPLLNLQRAVLLV 397

Query: 287 ADFVKT 292
             +V +
Sbjct: 398 DAYVAS 403


>gi|408393145|gb|EKJ72412.1| hypothetical protein FPSE_07436 [Fusarium pseudograminearum CS3096]
          Length = 427

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 101/364 (27%), Positives = 174/364 (47%), Gaps = 30/364 (8%)

Query: 65  SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT-- 122
           S    F P GYP SV+  YL Y  F ++Q F S   S+L+ ++LL   G+    + AT  
Sbjct: 64  SMHEAFMPVGYPHSVSSDYLNYQFFDSMQAFFSTITSLLANRALLQGLGVGDANSSATFA 123

Query: 123 VVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFL 180
           ++  +LKD +  V  +      G R++ + KR+R LAD   D    +E+ SP L P   +
Sbjct: 124 MLLTVLKDAISRVATIAFAQQFGLRIEPDAKRFRFLADLFNDTAFFMELYSPYLGPYGKI 183

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
                G   + +  VAA A++  +   FAK  NL++L AK  +  T   ++GL  G  + 
Sbjct: 184 LALTTGEALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAIGLIGLAVGTIVV 243

Query: 241 STV--CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
             V   +++   MIV   L + H++     +R   ++ LN QR  ++  +++KTG+I +P
Sbjct: 244 KYVEDHNTVVFLMIV---LVLAHLWMNYLGVRCVCMDNLNRQRATILFDEYLKTGQIMTP 300

Query: 299 ADLRYREDLL--FPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWT 356
            ++  RE +L  +    IE AG+       +K  K     ++  + A +   L +G    
Sbjct: 301 EEVAKRESILLWWSSTRIEMAGSY------YKATK-----DMVNVIATDGSTLFVGPHGI 349

Query: 357 DMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKG 416
            ++L ++++   A+  W  A     M  S+   S +   D ++K   +   LL  ++AKG
Sbjct: 350 KIMLWEDSTAVQAIDAWFTA-----MKISYAKDSDS---DYHEKKGALEESLLEGMRAKG 401

Query: 417 WHTD 420
           W  D
Sbjct: 402 WRLD 405


>gi|327294507|ref|XP_003231949.1| hypothetical protein TERG_07567 [Trichophyton rubrum CBS 118892]
 gi|326465894|gb|EGD91347.1| hypothetical protein TERG_07567 [Trichophyton rubrum CBS 118892]
          Length = 483

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 104/420 (24%), Positives = 194/420 (46%), Gaps = 69/420 (16%)

Query: 63  VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
           + S LN F P+GYP SV + Y+ Y  + +LQ F+S+   +LS++++L   G+    A   
Sbjct: 55  LTSILNVFLPAGYPHSVTDDYIEYQIYDSLQAFSSSIAGLLSSRAVLQGVGVGDATASPT 114

Query: 121 ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-L 178
           A ++  +L++ M  +  ++ ++ LG  ++ E K +R+ AD L D    L+ LSP+ P+ +
Sbjct: 115 AALLLNVLQESMGRIATILFAHRLGTSLEPECKLYRLAADILNDSAMVLDCLSPIFPKPV 174

Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
            + +  L +  + +  VAA +++  + + FA+ GNL +L AK  +  T+ ++ G+  G  
Sbjct: 175 RVGLLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAKDSSQETVISLAGMLVGSV 234

Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
           + S + S ++   I   +L ++H+ +    +RA  + TLN QR  ++ +   +  R+ +P
Sbjct: 235 VVSYITSPLE-TWIALIILLIVHLGTNHAAVRAVKMTTLNRQRANIVFSYLFEDDRVLTP 293

Query: 299 ADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSK------------------------ 334
           A+    E      R+ E  G ++   +   ++   +                        
Sbjct: 294 AEASEEE------RIFERDGVLRWNAESSGILGTCQIGTSLEQLLLLLPEQADRGGALET 347

Query: 335 -------------FHELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGW----LV 375
                           L ELF EE++IL     SR   +VL+  A+ +  L+ W    LV
Sbjct: 348 GQTSSTTMNTSTNLTALLELFREEEYILYFDPLSRRGAIVLKAGATAKAQLKAWSHALLV 407

Query: 376 AAYAA---SMAKSFHD---------PS---LTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
           + + A   S A   H+         PS   L++L+D   +    F   +  L+  GW TD
Sbjct: 408 SRHLAGKNSTATCRHEKQDDQGELSPSVSVLSILRDTLQEHTRTFKGRVKRLEHAGWKTD 467


>gi|195996945|ref|XP_002108341.1| hypothetical protein TRIADDRAFT_19061 [Trichoplax adhaerens]
 gi|190589117|gb|EDV29139.1| hypothetical protein TRIADDRAFT_19061, partial [Trichoplax
           adhaerens]
          Length = 272

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 75/248 (30%), Positives = 132/248 (53%), Gaps = 9/248 (3%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P GYP SV+  YL Y  +  LQ F S+    L+TQ++L   G+    A   A  ++W+
Sbjct: 3   FLPQGYPESVSGDYLTYQLWDTLQAFCSSITGTLATQAILKGVGVGDEKATVAAAAMTWM 62

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKR------WRILADALYDLGTGLEVLSPLCPQLFL 180
           LKDG   +G+++ + L G  +D + K+      +R+ AD L D    +E+++P  P LF 
Sbjct: 63  LKDGTGMIGRILFAWLRGTSLDCDAKKMLIFISYRLFADFLNDGALFIEMIAPNFPHLFT 122

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
            +  + + AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+  L  G+ L 
Sbjct: 123 LLLCVASVAKSIVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLSALIVGLFLL 182

Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
             V  + +    +  + +++H+Y+  + + +  + TLN +R  ++  +F + G I SPA+
Sbjct: 183 PAVTGNARIIWTLFFMFTMLHLYANYKAVSSVVMETLNRERLHILAYNFFRYGSILSPAE 242

Query: 301 LRYREDLL 308
           +  RE ++
Sbjct: 243 VCQRESVI 250


>gi|356524511|ref|XP_003530872.1| PREDICTED: uncharacterized protein LOC100783222 [Glycine max]
          Length = 513

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 76/300 (25%), Positives = 149/300 (49%), Gaps = 11/300 (3%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFA--AGLRP 116
           + R   S +  FF    P  V   Y+ Y +++ L    S+AL VL+TQ++  A   G   
Sbjct: 104 WLRAFGSVVWDFF---IPRGVTGNYVEYVKWKLLHRVFSSALQVLATQAMFTAMGVGFSN 160

Query: 117 TPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           +   A  ++W+LKDG+  + + I  ++L +  D+  KR R     L+    GLE+L+P  
Sbjct: 161 SLPSAAALNWVLKDGLGRLSRCIYTASLASAFDTNLKRVRFTTSVLFVASIGLELLTPAF 220

Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG--L 233
           P+ FL +A + N +K I++    ATR  ++ SFA   NL ++ AK +  +  F+++G  L
Sbjct: 221 PRCFLLLATIANISKQISLACYLATRSAVHQSFAIGDNLGEISAKAQIQTVCFDILGLML 280

Query: 234 GAGIQLASTVCSSMQGKM--IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
            A + L        Q  +   + P  + + ++ + + ++   + TL   R  +I++ +++
Sbjct: 281 AALVNLWIENHRRQQAGLHYFIYPFFAAMDLFGIYQGLKTVHLQTLTKDRLEIILSTWIE 340

Query: 292 TGRISSPADLRYREDLLFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILC 350
            G + SPA++  +E + F G   +    +++G  +P   V       ++ +  E+ + +C
Sbjct: 341 CGYVPSPAEVSEKEGINFLGVKGKSLWPIRIGCLNPKDQVPKWSMKTIQCITDEDYYFVC 400


>gi|302881823|ref|XP_003039822.1| hypothetical protein NECHADRAFT_49969 [Nectria haematococca mpVI
           77-13-4]
 gi|256720689|gb|EEU34109.1| hypothetical protein NECHADRAFT_49969 [Nectria haematococca mpVI
           77-13-4]
          Length = 444

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 157/312 (50%), Gaps = 17/312 (5%)

Query: 22  QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRP-VYHRVVDSFL-NKFFPSGYPFSV 79
           +E+ +   + S +V R +   P    S +++  ++P ++ R +   L + F P GYP SV
Sbjct: 12  REVEVIDLDKSGNVARRWVHMP----SNRIIRKAKPSLFKRSMRKILMDAFLPVGYPHSV 67

Query: 80  NEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGK 137
           ++ YL Y  F +LQ F S   ++L+ ++LL   G+    + AT  ++  +LKD M  V  
Sbjct: 68  SDDYLAYQFFDSLQAFFSTITALLANRALLQGLGVGDANSSATFAMLLTVLKDAMSRVAT 127

Query: 138 LI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAKGIAVV 195
           +I     G R++ + KR+R LAD   D    +E+ SP   P   +     G   + +  V
Sbjct: 128 IIFAQQFGLRIEPDAKRYRFLADLFNDTAFFMELYSPYFGPYGKILALTSGEALRALCGV 187

Query: 196 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGP 255
           A  A++  +   FAK  NL++L AK  +  T   +VGL  G    + V   ++    V  
Sbjct: 188 AGGASKAALSVHFAKHDNLAELNAKEASQETAVGLVGLLVG----TFVVKHVEDHTSVVY 243

Query: 256 LLSVI---HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGR 312
           L++V+   H++     +R+  +N LN QR  ++  +++K+G + +P ++  RE +LF   
Sbjct: 244 LMTVLVLAHLWMNYLGVRSVCMNVLNRQRATILYKEYLKSGNVLTPNEVARRESILFWSP 303

Query: 313 LIEDAGNVKVGR 324
           ++ +    ++ R
Sbjct: 304 VVRNQHGQRIAR 315


>gi|186478458|ref|NP_001117280.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332190946|gb|AEE29067.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 353

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 86/340 (25%), Positives = 165/340 (48%), Gaps = 24/340 (7%)

Query: 102 VLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILA 158
           +LSTQ+LL A G+    A        W L+D    +G ++ +   G+ +DS  K WR++A
Sbjct: 2   MLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVA 61

Query: 159 DALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLF 218
           D + D+G  +++LSPL P  F+ +  LG+ ++    VA+ ATR  +   FA + N +D+ 
Sbjct: 62  DLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAADIS 121

Query: 219 AKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLN 278
           AK  +  T+  ++G+  G+ LA     +     +    L+V H+Y+    +R   +N+LN
Sbjct: 122 AKEGSQETMATMMGMSLGMLLARFTSGNPMAIWLSFLSLTVFHMYANYRAVRCLVLNSLN 181

Query: 279 PQRTAMIVADFVKTGRISSPADLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHE 337
            +R+++++  F++TG++ SP  +   E +L      +    +  + +     V+ S    
Sbjct: 182 FERSSILLTHFIQTGQVLSPEQVSSMEGVLPLWATSLRSTNSKPLHKRVQLGVRVSSLPR 241

Query: 338 LKEL----------FAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASM---AK 384
           L  L          +   K++L        ++L +++   D L+ ++ A   A++   + 
Sbjct: 242 LDMLQLLNGVGASSYKNAKYLLAHIKGNVSVILHKDSKPADVLKSYIHAIVLANLMEKST 301

Query: 385 SFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLD 424
           SF+      +   YD+       LL +L++ GW T+R L 
Sbjct: 302 SFYSEGEAWIDKHYDE-------LLHKLRSGGWKTERLLS 334


>gi|307106356|gb|EFN54602.1| hypothetical protein CHLNCDRAFT_135084 [Chlorella variabilis]
          Length = 486

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 76/250 (30%), Positives = 124/250 (49%), Gaps = 29/250 (11%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS 125
           F   F P G+P SV+  YL Y  + ++Q  +S    +LS+Q++L   G            
Sbjct: 65  FAACFLPEGFPHSVSSDYLPYQLWDSIQGLSSYLRGMLSSQAMLAGVGF----------- 113

Query: 126 WILKD--GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
             LKD  GM   G L  S  G+ +D   K+WR+LAD + D+G  LE+ SPL P  FL +A
Sbjct: 114 -FLKDFTGMLG-GILFASTQGSGLDCYAKQWRLLADVMNDIGLALELASPLLPGAFLLLA 171

Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
            LG+ A+ +  VA  ATR+ +   FA + N +D+ AK  +  T   +VG+  G+      
Sbjct: 172 CLGSLARAVTGVAGSATRMALTQHFALQRNAADIAAKEGSQETAVTLVGMVLGMAF---- 227

Query: 244 CSSMQGKMIVGPL-------LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
              ++    + PL       L+ +HV++ +  MR   + +LN  R  +++  +++TG   
Sbjct: 228 ---IRAAHGLEPLIWSAFWALTALHVWANVRAMRCLRIASLNQARLGLLLRHYLRTGEAL 284

Query: 297 SPADLRYRED 306
           +P  +   E 
Sbjct: 285 TPGQMAAEES 294


>gi|449460628|ref|XP_004148047.1| PREDICTED: UPF0420 protein C16orf58-like [Cucumis sativus]
          Length = 523

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 74/278 (26%), Positives = 147/278 (52%), Gaps = 11/278 (3%)

Query: 53  DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
           D  R +Y   V +  + F P     +V+E Y+ Y +++ L    S+AL V++TQ++  A 
Sbjct: 107 DGFRTLYRVSVLAVKDFFIPK----NVSEHYVIYVKWKLLHRVFSSALQVIATQAMFRAI 162

Query: 113 GL--RPTPAQATVVSWILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLE 169
           G+    + A A  ++W+LKDG+  + + L  +++ +  D+  KR R     L+ L  G+E
Sbjct: 163 GVGHSRSLASAAALNWVLKDGLGRLSRCLFTASIASAFDTNLKRVRFSTAVLFSLSIGVE 222

Query: 170 VLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
           +L+P  PQ FL +A + N  K I++    +T   ++ SFA   NL ++ AK +  S  F+
Sbjct: 223 LLTPAFPQYFLLLASIANIVKQISLGCYLSTASAVHRSFAVADNLGEVSAKAQIQSVCFD 282

Query: 230 VVG--LGAGIQLASTVCSSMQGKM--IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
            +G  L A +   S     +Q  +  +V P+ + + ++   + ++   + TL   R  +I
Sbjct: 283 NLGLVLAAFLNFLSKNDRRLQAALPFVVYPIFAAMDLFGTYQGLKHVHLQTLTKDRLEII 342

Query: 286 VADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG 323
           ++++++ G + +PA++  RE +    R  +D+  +++G
Sbjct: 343 LSNWIEQGYVPTPAEVSEREGIDLLCRQGKDSWPIRIG 380


>gi|357506015|ref|XP_003623296.1| hypothetical protein MTR_7g068310 [Medicago truncatula]
 gi|355498311|gb|AES79514.1| hypothetical protein MTR_7g068310 [Medicago truncatula]
          Length = 492

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 92/350 (26%), Positives = 154/350 (44%), Gaps = 23/350 (6%)

Query: 32  SDSVTRHYQFQPDGHLSVKVVDDSRPVYHR-------------VVDSFLNKFFPSGYPFS 78
           S+   + Y    D  L   ++++ R   +R             + D+      P+G+P S
Sbjct: 60  SNGTAKRYIIGDDSRLRTILIEEDRSTQNRFGVLHSPDKRLSWLPDTVKAFILPAGFPGS 119

Query: 79  VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVSWILKDGMQ 133
           V++ YL+Y   +   + T      + T SLL A G+         A A  + W+ KDG+ 
Sbjct: 120 VSDDYLQYMLLQFPTNVTGWICHTIVTSSLLKAVGVGSFSGTTAAASAAAIRWVSKDGIG 179

Query: 134 HVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
            VG+L I    G   D +PK+WR+ AD +   G+  ++ +PL P  FL +A LGN  K I
Sbjct: 180 AVGRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIFDLTTPLYPGYFLPLASLGNLTKAI 239

Query: 193 AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST--VCSSMQGK 250
           A      +   I S FA   NL ++ AK E    +  +VGLG GI +  T  +  S    
Sbjct: 240 ARGLKDPSSRVIQSHFAISANLGEIAAKEEVWEVVAQLVGLGLGILILDTPGLVKSYTVL 299

Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
                ++ ++H++   E +     NT+N +R   +V   V    +    D    E++L  
Sbjct: 300 SSTWLIVRLLHLWLRYESLSVLQFNTINLKRARTLVKSHVLHSTVPGCMDCNKEENILVW 359

Query: 311 GRLIEDAGNVKVGRDPHKVVKPSKF--HELKELFAEEKFILCLGSRWTDM 358
            + ++         D    ++ S F    L +L+A EK+IL +  +  D+
Sbjct: 360 SQFMKPIIIFGSPLDKMDGMERSHFMVEALLKLYANEKYILVVNQQLDDL 409


>gi|397563898|gb|EJK43986.1| hypothetical protein THAOC_37516 [Thalassiosira oceanica]
          Length = 542

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 113/410 (27%), Positives = 176/410 (42%), Gaps = 58/410 (14%)

Query: 65  SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QAT 122
           SF++ F P+ YP SV   Y  +  +    +   +A  VLSTQ+LL A G+    A   A 
Sbjct: 110 SFVHHFLPAKYPESVCSSYSTFASYCMCANIAGSAAMVLSTQALLVAVGVGSQSAAPMAA 169

Query: 123 VVSWILKD-GMQHVGKLICSNLG---------------------------------ARMD 148
            ++W+LKD   Q  G L  S LG                                    D
Sbjct: 170 ALNWLLKDFAGQLGGVLFVSQLGIDHEFWKRKINRLFRENNQGNNTGRPKKGNYQRGTAD 229

Query: 149 SEPKRWRILADALYDLGTGLEVLSPLC-PQLFLEMAGLGNFAKGIAVVAARATRLPI--- 204
           + PKRWR++A    DL T LE+ +PL  PQ FL  A + N  K I  +AA A+R  I   
Sbjct: 230 TNPKRWRMVAALALDLSTLLEICTPLLGPQYFLPCASVANVGKNIGWLAASASRAAINQS 289

Query: 205 -----YSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS--SMQGKMIVGPLL 257
                  S +   NL D+ AK  + + + ++VG   GI  + T+ +     G M    +L
Sbjct: 290 LSAGGSYSSSSGSNLGDVTAKSGSQAIVASLVGTALGIFFSKTLYADHGTVGVMSGFIVL 349

Query: 258 SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDA 317
           S +H  +  + + A P+ +L+  R  +++++F +T  + SP+ +  +E   FP    + +
Sbjct: 350 SAVHQVATYKALMAVPLRSLDRHRLHIVLSEFSRTSSVPSPSQVATKEK-FFPLITPDHS 408

Query: 318 GNVKVGRDPHKVVKPSKFHELKELF-------AEEKFILCLGSRWTDMVLEQNASGEDAL 370
            N      P   + P    EL+ L          E++IL +G     +    +A+  D L
Sbjct: 409 CNWLFVGSPLLEICPKGISELQGLILWRDDAMQHEQYILKVGPASIQLTFLDDANEVDLL 468

Query: 371 RGWLVAAYAASMAKS--FHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWH 418
           RG L A  A +  K+   +D     L D +  M+  F      L  KGW 
Sbjct: 469 RGQLHAYLAHNRMKTDLAYDEDAVRL-DTHRSMDSQFGEFQEMLLEKGWQ 517


>gi|449509553|ref|XP_004163622.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58-like
           [Cucumis sativus]
          Length = 523

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 76/278 (27%), Positives = 148/278 (53%), Gaps = 11/278 (3%)

Query: 53  DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
           D  R +Y RV    +  FF    P +V+E Y+ Y +++ L    S+AL V++TQ++  A 
Sbjct: 107 DGFRTLY-RVSVLAVKXFF---IPKNVSEHYVIYVKWKLLHRVFSSALQVIATQAMFRAI 162

Query: 113 GL--RPTPAQATVVSWILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLE 169
           G+    + A A  ++W+LKDG+  + + L  +++ +  D+  KR R     L+ L  G+E
Sbjct: 163 GVGHSRSLASAAALNWVLKDGLGRLSRCLFTASIASAFDTNLKRVRFSTAVLFSLSIGVE 222

Query: 170 VLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
           +L+P  PQ FL +A + N  K I++    +T   ++ SFA   NL ++ AK +  S  F+
Sbjct: 223 LLTPAFPQYFLLLASIANIVKQISLGCYLSTASAVHRSFAVADNLGEVSAKAQIQSVCFD 282

Query: 230 VVG--LGAGIQLASTVCSSMQGKM--IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
            +G  L A +   S     +Q  +  +V P+ + + ++   + ++   + TL   R  +I
Sbjct: 283 NLGLVLAAFLNFLSKNDRRLQAALPFVVYPIFAAMDLFGTYQGLKHVHLQTLTKDRLEII 342

Query: 286 VADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG 323
           ++++++ G + +PA++  RE +    R  +D+  +++G
Sbjct: 343 LSNWIEQGYVPTPAEVSEREGIDLLCRQGKDSWPIRIG 380


>gi|321455034|gb|EFX66180.1| hypothetical protein DAPPUDRAFT_302922 [Daphnia pulex]
          Length = 315

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 76/269 (28%), Positives = 135/269 (50%), Gaps = 16/269 (5%)

Query: 54  DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
           DS   +  +     + F P GYP SV++ YL Y  +  +Q F S+    LST+++L   G
Sbjct: 40  DSNGKFRGITHLLSSVFLPQGYPSSVSDDYLEYQIWDTIQAFASSLSGSLSTRAILEGVG 99

Query: 114 L---RPTPAQATVVSWILKDGMQHVGKLI---CSNLGARMDSEPKRWRILADALYDLGTG 167
           +     TP  AT++ W++KDG   VG+++   C+  G ++D++ K+WR  AD L D    
Sbjct: 100 VGSSTSTPLAATLM-WLIKDGTGMVGRILFAWCN--GTKLDADSKKWRFFADLLNDCALS 156

Query: 168 LEVLSPLCPQ-------LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK 220
           LE+ +P               +  +   AK I  VA  ATR  +    A++GNL+D+ AK
Sbjct: 157 LELCAPYAVAAVGGSNGTMTSILCVAGVAKSIVGVAGGATRAALTQHQARQGNLADVSAK 216

Query: 221 GEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQ 280
             +  TL N+  L   + L   +  +     ++  L +++HV++ ++ ++A  + TLN  
Sbjct: 217 DGSQETLVNLAALITSLWLLPILDGATGLTWLMFILFTLLHVFANLKAVKAVTMETLNRA 276

Query: 281 RTAMIVADFVKTGRISSPADLRYREDLLF 309
           R  +I+  F  T  ++S  ++   E ++ 
Sbjct: 277 RYMIILKQFASTKYVASVKEVNRLEPVII 305


>gi|224002080|ref|XP_002290712.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974134|gb|EED92464.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 656

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 94/348 (27%), Positives = 155/348 (44%), Gaps = 67/348 (19%)

Query: 69  KFFPSGY-PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV---- 123
           +F  S + P  V   Y R+ ++R LQ F +A + V+ TQSLL   GLR      +V    
Sbjct: 147 QFLHSAFVPEGVTPSYYRFMRWRILQRFINANVHVIGTQSLLM--GLRGMTRSGSVGEAA 204

Query: 124 ---------------VSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTG 167
                           +W+LKD +  + +++  S +G + D + KRWR  +  +Y LG G
Sbjct: 205 SFAAKEGAALGAAAATNWVLKDTLGKIVRMVWASKMGRKFDPDAKRWRFRSALIYALGNG 264

Query: 168 LEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG----------NLS-- 215
           LEV + L PQ FL +A L N  K ++++ + ATR  +Y+SF + G          N S  
Sbjct: 265 LEVSTYLHPQYFLLLAMLANSCKQMSMLTSSATRNALYNSFKRVGTTSSTVSTSNNASGS 324

Query: 216 ----------------DLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSV 259
                           D+ AKGEA   + +++G+ +GI L+  V  S+Q    V  +L +
Sbjct: 325 SAIVRRGGNGGIENIGDITAKGEAQIAVVDLLGIASGICLSRAVGVSVQNVFAVWVVLQI 384

Query: 260 IHVYSVIEEMRAAPVNTLNPQRTAMIVADFV-------KTGR-------ISSPADLRYRE 305
             ++ +  EMR+    T N +R  M++   +       K G        I SP  +   E
Sbjct: 385 GEIFCMYHEMRSIVYRTFNFERMYMVLGKLMLDDAADNKGGDSSSTKNLIPSPEQIAANE 444

Query: 306 DLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHEL-KELFAEEKFILCLG 352
            +  P   +     +  G      + P++   L  ++F  EK+ L +G
Sbjct: 445 KVFLPPDHLARRA-IAFGSPGRSTLDPAELDVLVNDIFRGEKYFLVVG 491


>gi|348585092|ref|XP_003478306.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cavia porcellus]
          Length = 468

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 97/379 (25%), Positives = 170/379 (44%), Gaps = 35/379 (9%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
             P G+P SV+  YL Y  + ++Q F S+  S L+T +++   G+    A   A   +W+
Sbjct: 77  LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSSALATHAVMLGLGVGNAEASLAAATTTWL 136

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           ++D    +G+++ +   G+++D   K+WR+ AD L D    LE+ +P  P  F       
Sbjct: 137 VRDSTGMLGRIVFAWWKGSKLDCNVKQWRLFADILNDAAMFLEIAAPTLPFCFTVTVSTS 196

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +   V   
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLFSLVMLPLVSGC 256

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
               +    LL+ +H+Y+    +RA  + TLN  R  +++  F++ G I         E 
Sbjct: 257 PSFSLGCFFLLTGLHIYANYRAVRALVMETLNESRLQLVLKHFLQRGEILDTTSANQMEP 316

Query: 307 L---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGS--RWTDMV 359
           L    +P        ++ +G   H ++  S   EL++L    +E ++L          +V
Sbjct: 317 LWTGFWPSL------SLSLGVPLHDLI--SSVSELQQLVEGHQEPYLLSWNQSRNKVQVV 368

Query: 360 LEQNASGEDALR----GWLVAAYA--ASMAKSFHD-----------PSLTVLQDAYDKMN 402
           L   A  E  LR    G L+ A      + +   +            S  ++++ Y  ++
Sbjct: 369 LSHVAGPEAVLRAATHGLLLGALKEDGPLPRELEELRNRVRAGREKDSQAIVRETYRVLD 428

Query: 403 DVFTPLLSELQAKGWHTDR 421
            +F   L  LQA GW T++
Sbjct: 429 KLFPRFLEGLQATGWKTEK 447


>gi|328869935|gb|EGG18310.1| DUF647 family protein [Dictyostelium fasciculatum]
          Length = 407

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 81/279 (29%), Positives = 137/279 (49%), Gaps = 15/279 (5%)

Query: 44  DGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVL 103
           D   +VK+   +R  +   ++   + F P GYP SV   Y  Y  +  LQ   S+   VL
Sbjct: 24  DEQTNVKITKQTRK-HGSFIEVLRDMFLPKGYPNSVTSDYTGYQTWDTLQALCSSLTGVL 82

Query: 104 STQSLLFAAG---LRPTPAQATVVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILAD 159
           +T+++L   G   L  + A A++  WI +D    +G++I +   G  +D   K WR  AD
Sbjct: 83  ATRAILKGYGVGNLNASVASASI-QWITRDAAGMLGRIIFAWKWGTDLDCNSKTWRFAAD 141

Query: 160 ALYDLGTGLEVLSPL-CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLF 218
            L D G  LE++SPL  P  FL ++ LG  +K I  VA   T+  +   FAK  NL+D+ 
Sbjct: 142 LLNDFGMTLEMISPLFGPSYFLPISCLGLISKSICGVAGGCTKASLTQHFAKRDNLADVS 201

Query: 219 AKGEAISTLFNVVGLGAGIQLASTV---CSSMQ---GKMIVGPLLSVIHVYSVIEEMRAA 272
           AK  +  T   +VG+  G+ +A+++     S+Q      +   + +VIH+      +   
Sbjct: 202 AKDGSQETAIALVGMLLGMVMAASIPDDDDSIQVYFKTWLFFIIFTVIHLLCNYRAVSLV 261

Query: 273 PVNTLNPQRTAMIVADFVK--TGRISSPADLRYREDLLF 309
            + ++N  R  +I   F +    +I +P  +  +E++LF
Sbjct: 262 QLTSINRYRANLIYQHFNENNNNQIPTPKQISLKENILF 300


>gi|358397603|gb|EHK46971.1| hypothetical protein TRIATDRAFT_291240 [Trichoderma atroviride IMI
           206040]
          Length = 468

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 83/307 (27%), Positives = 153/307 (49%), Gaps = 8/307 (2%)

Query: 22  QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDD--SRPVYHRVVDSFLNKFFPSGYPFSV 79
            ++ I   + S    + +    DGH+  K  +   SR  +     + ++ F P G+P SV
Sbjct: 12  NDVEIVELDKSGHPQQQWLHAHDGHILPKTSNSLFSRS-FREWRKAIMDSFLPVGFPHSV 70

Query: 80  NEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGK 137
           ++ YL Y  F +LQ F S   S+L+ ++LL   G+    + AT  ++  I+KD M  +  
Sbjct: 71  SKDYLAYQTFDSLQAFFSTISSLLANRALLQGLGVGDASSSATFALLLTIMKDAMSRIAT 130

Query: 138 LICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA-GLGNFAKGIAVV 195
           ++ ++  G R++ + KR+R LAD   D    LE+ SP        +A   G   + +  V
Sbjct: 131 IVFAHRFGLRIEPDAKRYRFLADVFNDSAFFLELYSPYLGSWGKVIALSTGQALRALCGV 190

Query: 196 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGP 255
           AA A++  +   FAK  NL++L AK  +  T   ++GL  G  L   +       +I+  
Sbjct: 191 AAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLLVG-TLVVKMIEDHSSVVILMI 249

Query: 256 LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIE 315
           LL + H+ +    +R+  + TLN QR  ++  +F+++G++ +P  +  RE+++F   ++ 
Sbjct: 250 LLVIAHLGTNYLGVRSVCMATLNRQRATIVFEEFLRSGKVLTPEQVAERENIIFWNPIMR 309

Query: 316 DAGNVKV 322
           +    KV
Sbjct: 310 NLHGKKV 316


>gi|198431163|ref|XP_002119233.1| PREDICTED: similar to UPF0420 protein C16orf58 homolog [Ciona
           intestinalis]
          Length = 414

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 67/265 (25%), Positives = 132/265 (49%), Gaps = 12/265 (4%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
           F + F P GYP SV+E YL Y  +  +Q F S+    L+T ++L  +G+    A   +  
Sbjct: 49  FKDIFLPKGYPESVSEDYLSYQIWDTVQAFCSSITGTLATHAVLKGSGVGDETANVASAT 108

Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
           ++W+LKDG   +G++  +   G+ +D   K+WR+ AD + D    +++++P+ P+ +  +
Sbjct: 109 ITWLLKDGTGMLGRIAFAYFKGSSLDCNAKQWRLFADVMNDCAIFIQLVAPVLPKAYFTL 168

Query: 183 A-GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
              L   A  +  VA   TR  +    AK  N++D+ AK  +  TL N+  L  G+ +  
Sbjct: 169 VMCLAGLAFSLVGVAGGCTRAALTMHQAKCNNMADVSAKDGSQETLVNLAALLTGLVIMP 228

Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
            V +++    ++  + + +HV++  + + +  +   N  R + ++  F    ++ +P + 
Sbjct: 229 IVATNIPLMWLLFLIFTFLHVFANYKAVASVVMELFNRNRLSKVIKVFCDMNKVLTPKEA 288

Query: 302 RYREDLLFP--------GRLIEDAG 318
             +E + FP        G  +ED G
Sbjct: 289 NEKEAIFFPPSYSWLNLGCKLEDLG 313


>gi|357127303|ref|XP_003565322.1| PREDICTED: UPF0420 protein C16orf58-like [Brachypodium distachyon]
          Length = 486

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 103/416 (24%), Positives = 181/416 (43%), Gaps = 40/416 (9%)

Query: 33  DSVTRHYQFQPDGHLSVKV---------VDDSRP--VYHRVVDSFLNKFFPSGYPFSVNE 81
           D V + Y    D  L V++          +D +P     R +  F+    P+G+P SV++
Sbjct: 54  DGVAKRYMLDGDSKLQVRLEKHEHLLNNAEDGKPDSSIPRAITDFV---LPAGFPGSVSD 110

Query: 82  GYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVSWILKDGMQHVG 136
            YL+Y  ++   + T      L T SLL A G+         A A  + W+ KDG+   G
Sbjct: 111 DYLQYMLWQFPTNVTGWICHTLVTSSLLKAVGVGSFTGTSAAASAAAIRWVSKDGIGAFG 170

Query: 137 KL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVV 195
           +L I    G   D++PK+WR+ AD +  +G+  E+ +PL P  FL +A LGN AK +   
Sbjct: 171 RLLIGGRFGTLFDNDPKKWRMYADFIGSVGSIFELTTPLYPGYFLPLASLGNLAKAVGRG 230

Query: 196 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST--VCSSMQGKMIV 253
               +   I + FAK GNL ++ AK E       ++GL  G+ +  T  V SS     + 
Sbjct: 231 FRDPSNRVIQNHFAKSGNLGEIAAKEEVWGVGAQLLGLSIGVLILDTSGVQSSYPTLTLT 290

Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRL 313
              + ++H++   + +      T+N +R  ++V   V    +         E++L   R 
Sbjct: 291 WLGVRLLHLWFRYQSLIVLKFRTVNLKRARILVRSHVAHHTVPGYVACNEEENILTWERF 350

Query: 314 IEDAGNVKV-------GRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTD-----MVLE 361
           ++   +  V       G +P  +V       L +L+  E +IL +    +      +  +
Sbjct: 351 LQPQISFGVPMERMLGGGEPSDMVS-----RLLKLYKNEMYILFVDQSESKEPAFVVTFK 405

Query: 362 QNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGW 417
           + A+    LR  L  A+     K   D    +L+++   +   FT  + +++  GW
Sbjct: 406 EAATSMSVLRS-LWQAHWLHKNKPKQDDVFALLEESLAALEGGFTDFIEQMERAGW 460


>gi|212274801|ref|NP_001130544.1| uncharacterized protein LOC100191643 [Zea mays]
 gi|194689438|gb|ACF78803.1| unknown [Zea mays]
 gi|223974155|gb|ACN31265.1| unknown [Zea mays]
 gi|413956565|gb|AFW89214.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
          Length = 442

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 92/389 (23%), Positives = 179/389 (46%), Gaps = 38/389 (9%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
           + R+    L  F P G+P SV   Y+ +  +  LQ  ++   ++LSTQ+LL A G+    
Sbjct: 50  WERIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQS 109

Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           A        W L+D    +G ++ +   G+ +DS  K WR++AD + DLG  +++LSPL 
Sbjct: 110 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 169

Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK---GEAISTLFNVVG 232
           P   + +  LG+ ++    VA+ ATR  +   FA   N +D+ AK    E ++T+  +  
Sbjct: 170 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGL 229

Query: 233 LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
                Q+      S+    +    L++ H+Y+  + +++  + TLN +R ++++  F + 
Sbjct: 230 GMLLAQITRGHALSVWASFLS---LTMFHMYANYKAVQSLSLTTLNYERASILLQYFKEC 286

Query: 293 GRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHK-----------------VVKPSKF 335
           G +  P  +  +E +L           +K+   PH+                 V+  +++
Sbjct: 287 GEVLVPRKVSQQEHILPSWSNWRKLNRIKL---PHERVHLGAKASMLTHSDMLVIAKTRY 343

Query: 336 HELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQ 395
           H     +    + L        + + + A+  D LR ++     AS  ++     L    
Sbjct: 344 H-----YENANYFLLDKQGIVYVFIHKEATPADVLRSFVHGLVLASSTQNSKPQHL---- 394

Query: 396 DAYDKMNDVFTPLLSELQAKGWHTDRFLD 424
           +A   M++++T  +S+LQ +G+ T+R L 
Sbjct: 395 EARRWMDEMYTSFISKLQTEGYSTERLLS 423


>gi|403362828|gb|EJY81150.1| DUF647 family protein [Oxytricha trifallax]
          Length = 507

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 75/248 (30%), Positives = 121/248 (48%), Gaps = 16/248 (6%)

Query: 55  SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
           ++ +Y+ + ++FL    P+GYP +V+  YL++T +  +  F   A+  LS QSL  A G 
Sbjct: 77  NKNIYNGLFETFL----PAGYPHTVSSSYLKFTIYSNISAFMYTAMGFLSAQSLFVAIGS 132

Query: 115 RPTPAQ--ATVVSWILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLEVL 171
             T A   A   +W+LKDG+  +G  L  S  G   D + K+WR +A    ++   +E+L
Sbjct: 133 TMTQANLAAAAYTWVLKDGLGQLGGILFASRYGRNFDVDIKKWRFMAIFALNISMYIEIL 192

Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
           +   PQ FL +A L N  K I  + A A+R  I    AK  N+ D+  K  +  T   ++
Sbjct: 193 TLSFPQYFLALASLANVGKNICFLLASASRASINLQLAKNNNIGDISGKSVSQFTASTLI 252

Query: 232 G--LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNT---LNPQRTAMIV 286
           G   G GI     + S  Q    + P+  V+ V ++     +A V     LN QR  ++ 
Sbjct: 253 GGACGLGISKIIDITSIYQ----LYPVFFVLTVMNIAASYYSAKVIDELYLNNQRAFLLF 308

Query: 287 ADFVKTGR 294
            +F K+  
Sbjct: 309 NEFFKSNE 316


>gi|356566391|ref|XP_003551415.1| PREDICTED: uncharacterized protein LOC100783455 [Glycine max]
          Length = 525

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 71/260 (27%), Positives = 132/260 (50%), Gaps = 10/260 (3%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFA--AGLRP 116
           + R   S +  FF    P  V   Y+ Y +++ L    S+AL VL+TQ++  A   G   
Sbjct: 115 WLRACGSVVWDFF---IPRGVTGNYVEYVKWKLLHRVFSSALQVLATQAMFTAMGVGFSC 171

Query: 117 TPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           +   A  ++W+LKDG+  + + I  ++L +  D+  KR R     L+    GLE+L+P  
Sbjct: 172 SLPSAAALNWVLKDGLGRLSRCIYTASLASAFDTNLKRVRFTTSVLFVASIGLELLTPAF 231

Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG--L 233
           P+ FL +A + N +K I++    ATR  ++ SFA   NL ++ AK +  +  F+++G  L
Sbjct: 232 PRCFLLLATIANISKQISLACYLATRSAVHQSFAIGDNLGEISAKAQIQTVCFDILGLML 291

Query: 234 GAGIQLASTVCSSMQGKM--IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
            A + L        Q      + P  + + ++ + + ++   + TL   R  +I++ +++
Sbjct: 292 AALVNLWIENHRRQQAGFHYFIYPFFAAMDLFGIYQGLKTVHLQTLTKDRLEIILSTWIE 351

Query: 292 TGRISSPADLRYREDLLFPG 311
            G + SPA++  +E + F G
Sbjct: 352 CGYVPSPAEVSEKEGINFLG 371


>gi|219127360|ref|XP_002183905.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217404628|gb|EEC44574.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 891

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 73/245 (29%), Positives = 128/245 (52%), Gaps = 27/245 (11%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV--VSWI 127
           F P GYP+SV EGYL Y  + ++Q  +S   +VL +  +L AAG+    A A +  ++W 
Sbjct: 438 FMPVGYPYSVREGYLSYQIYDSIQGLSSYLRAVLCSAQVLQAAGVGDANATAIMAALTWA 497

Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL-FLEMAGL 185
           LKDG+  +G LI S  + + +D+  K +R+ AD + D+G  L++L+P  P+   + ++  
Sbjct: 498 LKDGLGMIGGLIFSYGVASHLDAYVKEFRLFADVINDVGMLLDMLAPYVPRADLIWISSA 557

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
               K +  +AA AT+  I   FA +GN++DL AK     TL N++G+  G+ LA  +  
Sbjct: 558 ATLCKVMCGIAAGATKSSITQHFALQGNMADLNAKEATQETLVNLIGMVCGVTLARYLSQ 617

Query: 246 SMQG-----------------------KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRT 282
              G                       + +V  +L+++H++S  + +    + TLN +RT
Sbjct: 618 LEAGNEEGDTDRIGYRSWIPCISHDSIQWMVFTILTLLHIWSNWKGVSLLRLETLNRERT 677

Query: 283 AMIVA 287
            +++ 
Sbjct: 678 KVVLC 682


>gi|298708275|emb|CBJ48338.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 284

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 66/203 (32%), Positives = 109/203 (53%), Gaps = 3/203 (1%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS--WI 127
           F P+GYP SV + Y+ Y  +  +Q   S    VL   ++L A+G+    A A   +  W+
Sbjct: 71  FLPTGYPSSVRKEYMEYQTWDTVQALCSYLRGVLCLTAMLHASGVGDENASAIAAAMVWV 130

Query: 128 LKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           L+DG   +G L+ S   +  MDS  K WR+ AD   D+G  L++++P+    F  ++ L 
Sbjct: 131 LRDGFGMIGSLLFSYAASSHMDSNIKEWRLFADIANDVGLTLDLVAPVFRGNFAVVSSLA 190

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
              K +  VAA  T+  I S FA + N++D+ AK  A  T  NV G+  G+++AS +  +
Sbjct: 191 TVFKALCGVAAGCTKTSITSHFALKQNMADISAKENAQETAVNVCGMLLGVKVASYLSDA 250

Query: 247 MQGKMIVGPLLSVIHVYSVIEEM 269
             G++++    S++HVY+    M
Sbjct: 251 PTGRVLIFVFFSLLHVYANYRYM 273


>gi|218192328|gb|EEC74755.1| hypothetical protein OsI_10514 [Oryza sativa Indica Group]
          Length = 479

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 104/409 (25%), Positives = 187/409 (45%), Gaps = 50/409 (12%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
           + RV    L  F P G+P SV   Y+ +  +  LQ  ++   ++LSTQ+LL A G+    
Sbjct: 59  WGRVGGRLLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKS 118

Query: 119 AQ--ATVVSWILKD--GM---------QHVGKLICSN------------------LGARM 147
           A        W L+D  GM         Q    L  SN                  LG+ +
Sbjct: 119 ATVIGATFQWFLRDLTGMLGGILFTFYQVSNSLANSNRQCHRIAIDSLFTDEITCLGSNL 178

Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
           DS  K WR++AD + DLG  +++LSPL P   + +  LG+ ++    VA+ ATR  +   
Sbjct: 179 DSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQH 238

Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIE 267
           FA   N +D+ AK  +  TL  ++G+G G+ LA        G  +    L++ H+Y+  +
Sbjct: 239 FALANNAADISAKEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYK 298

Query: 268 EMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL-FPG------RLIEDAGNV 320
            +++  + TLN +R+++++  F+  G + +P  +  +E +L F        R+     +V
Sbjct: 299 AVQSLSLTTLNYERSSIMLQYFMDNGEVLTPQQVSKQEHILPFWSSWRKFLRIKLPHEHV 358

Query: 321 KVGRDPHKVVKPS--KFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGW---LV 375
            +G    ++  P   +  + +  +  E + L        + + + A+  D L  +   LV
Sbjct: 359 HLGAKASRLTHPDLLQIAKTRPYYRNENYFLLDKEGSVHIFIHKQAAATDILMSFIHGLV 418

Query: 376 AAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLD 424
            A+    +KS H        +A   +++ +   +S+LQ +G+ T+R L 
Sbjct: 419 LAHLMQKSKSGH-------AEARQWIDEKYNTFISKLQVEGYSTERLLS 460


>gi|222624448|gb|EEE58580.1| hypothetical protein OsJ_09902 [Oryza sativa Japonica Group]
          Length = 479

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 104/408 (25%), Positives = 186/408 (45%), Gaps = 52/408 (12%)

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
           RV    L  F P G+P SV   Y+ +  +  LQ  ++   ++LSTQ+LL A G+    A 
Sbjct: 61  RVGGRLLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKSAT 120

Query: 121 --ATVVSWILKD--GM---------QHVGKLICSN------------------LGARMDS 149
                  W L+D  GM         Q    L  SN                  LG+ +DS
Sbjct: 121 VIGATFQWFLRDLTGMLGGILFTFYQVSNSLANSNRQCHLIAIDSLFTDEITCLGSNLDS 180

Query: 150 EPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFA 209
             K WR++AD + DLG  +++LSPL P   + +  LG+ ++    VA+ ATR  +   FA
Sbjct: 181 NAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQHFA 240

Query: 210 KEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEM 269
              N +D+ AK  +  TL  ++G+G G+ LA        G  +    L++ H+Y+  + +
Sbjct: 241 LANNAADISAKEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYKAV 300

Query: 270 RAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPG--------RLIEDAGNVK 321
           ++  + TLN +R+++++  F+  G + +P  +  +E +L P         R+     +V 
Sbjct: 301 QSLSLTTLNYERSSILLQYFMDNGEVLTPQQVSKQEHIL-PYWSSWRKFLRIKLPHEHVH 359

Query: 322 VGRDPHKVVKPS--KFHELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGW---LVA 376
           +G    ++  P   +  + +  +  E + L        + + + A+  D L  +   LV 
Sbjct: 360 LGAKASRLTHPDLLQIAKTRPYYRNENYFLLDKEGSVHIFIHKQAAATDILMSFIHGLVL 419

Query: 377 AYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLD 424
           A+    +KS H        +A   +++ +   +S+LQ +G+ T+R L 
Sbjct: 420 AHLMQKSKSGH-------AEARQWIDEKYNTFISKLQVEGYSTERLLS 460


>gi|400593658|gb|EJP61584.1| DUF647 domain protein [Beauveria bassiana ARSEF 2860]
          Length = 467

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 89/315 (28%), Positives = 154/315 (48%), Gaps = 14/315 (4%)

Query: 14  EPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPS 73
           E DKS  VQ+    W        R    +P GH +       R     + DSFL    P 
Sbjct: 28  ELDKSGHVQK---RWLHEPSG--RIIHIRPQGHTNDLFSRSLREWKKILSDSFL----PV 78

Query: 74  GYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDG 131
           G+P SV+  YL Y  F +LQ F S   S+L+ ++LL   G+    + AT  ++  IL+D 
Sbjct: 79  GFPHSVSSDYLAYQTFDSLQAFFSTITSLLANRALLQGLGVGDANSSATFALLLTILRDA 138

Query: 132 MQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFA 189
              V  ++ ++  G R++ + KR+R LAD   D    LE+ SP L P   + +  +G   
Sbjct: 139 TSRVATIVFAHRFGLRIEPDAKRYRFLADLFNDTAFFLELYSPYLSPLAKVVVLSVGEAL 198

Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
           + +  V+A A++  +   FAK  NL++L AK  +  T   ++GL  G  +   V    + 
Sbjct: 199 RALCGVSAGASKAALSMHFAKHDNLAELNAKEASQETAVGLIGLLVG-TIVVKVVEDHRS 257

Query: 250 KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF 309
            + +  LL   H++     +R+  + TLN QR  ++  ++++TG++ SPA++  RE ++ 
Sbjct: 258 VVSLMILLVFGHLWMNYLGVRSVQMATLNRQRATILFQEYLRTGKVLSPAEVAKREYIIV 317

Query: 310 PGRLIEDAGNVKVGR 324
              ++ +    +V +
Sbjct: 318 WSSVVSNRHGQEVAK 332


>gi|170118219|ref|XP_001890291.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164634725|gb|EDQ99048.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 471

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 109/406 (26%), Positives = 194/406 (47%), Gaps = 52/406 (12%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
           + RV+D     F P+GYP SV+  YLRY    ALQ F ++ + +LS++++L   G+    
Sbjct: 42  HRRVLDLISKVFLPAGYPDSVSPDYLRYQILNALQAFCNSLVGLLSSRAILEGFGVGDPS 101

Query: 119 AQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           A AT  ++  +L+D    +  ++ ++ LG+ +  + K++R+LAD L D    L+ +SP  
Sbjct: 102 ATATNALLLTVLQDVFSRLTTIVAAHILGSSLAPDVKKYRLLADMLNDAAVILDTISPRL 161

Query: 176 PQLF---LEMAGLGNFA--KGIAVVAARATRLPIYSSFAK----EGNLSDLFAKGEAIST 226
             LF   L +A L   A  + +  ++A  ++  I   FA      G++ DL AK  +  T
Sbjct: 162 DTLFFPGLRVAALCLSALFRSLCAISAGGSKASISLHFATPLKGTGDVGDLNAKDASKET 221

Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEM--RAAPVNTLNPQRTAM 284
           +  ++G+  G  +   + +      I   LL ++ ++  I  +  R   + TLN QRT +
Sbjct: 222 VLALLGMLLGTLIVPHLTTPWT---IYTTLLVLVGLHLTINYIGVRGLVLRTLNGQRTWL 278

Query: 285 IVADFVKT--GRISSPADLRYREDLL-FPGRLIEDAGNVKVGRDPHKVVKPSKFHE---- 337
             + +++T  GR  +P ++   E +   PG   +    V +GR        S F E    
Sbjct: 279 AWSAYMRTQPGRAPTPDEVASLERIFERPGAFRDPRSGVLMGR----CTIGSSFSEILSG 334

Query: 338 -----LKELFAEEKFIL-----CL---------GSRWTDMVLEQNASGEDALRGWLVAAY 378
                L ELF EE+++L     CL         G     + L++  + ED L+ W+ AA 
Sbjct: 335 PIPPQLLELFKEERYLLWFDRRCLLRPESPSVEGFLRLHICLKEGYNTEDQLKAWIHAAE 394

Query: 379 AASMAKSFHDPS-----LTVLQDAYDKMNDVFTPLLSELQAKGWHT 419
                 S  D S     L VL++ + K+ ++    +++++A GW+T
Sbjct: 395 LCRAISSQRDASGVLDALGVLEETHQKVVELIPDFVNKMRALGWNT 440


>gi|195643336|gb|ACG41136.1| hypothetical protein [Zea mays]
          Length = 442

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 92/389 (23%), Positives = 178/389 (45%), Gaps = 38/389 (9%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
           + R+    L  F P G+P SV   Y+ +  +  LQ  ++   ++LSTQ+LL A G+    
Sbjct: 50  WERIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQS 109

Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           A        W L+D    +G ++ +   G+ +DS  K WR++AD + DLG  +++LSPL 
Sbjct: 110 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 169

Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK---GEAISTLFNVVG 232
           P   + +  LG+ ++    VA+ ATR  +   FA   N +D+ AK    E ++T+  +  
Sbjct: 170 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNATDISAKEGSQETLATMLGMGL 229

Query: 233 LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
                Q+      S+    +    L++ H+Y+  + +++  + TLN +R ++++  F + 
Sbjct: 230 GMLLAQITRGHALSVWASFLS---LTMFHMYANYKAVQSLSLTTLNYERASILLQYFKEC 286

Query: 293 GRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHK-----------------VVKPSKF 335
           G +  P  +  +E +L           +K+   PH+                 V+  ++ 
Sbjct: 287 GEVLVPRKVSQQEHILPSWSNWRKLNRIKL---PHERVHLGAKASMLTHSDMLVIAKTRC 343

Query: 336 HELKELFAEEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQ 395
           H     +    + L        + + + A+  D LR ++     AS  ++     L    
Sbjct: 344 H-----YENANYFLLDKQGIVYVFIHKEATPADVLRSFVHGLVLASSTQNSKPQHL---- 394

Query: 396 DAYDKMNDVFTPLLSELQAKGWHTDRFLD 424
           +A   M++++T  +S+LQ +G+ T+R L 
Sbjct: 395 EARRWMDEMYTSFISKLQTEGYSTERLLS 423


>gi|356523306|ref|XP_003530281.1| PREDICTED: UPF0420 protein-like [Glycine max]
          Length = 480

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 95/350 (27%), Positives = 157/350 (44%), Gaps = 23/350 (6%)

Query: 32  SDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVD--------SFLNKF-----FPSGYPFS 78
           S+   + Y    D  L   +V++ R   +R  D        S+L +       P+G+P S
Sbjct: 47  SNGTAKRYVLGDDSQLQAFLVEEDRSTPNRFQDLHSSDESLSWLPEIIKDFVLPAGFPGS 106

Query: 79  VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVSWILKDGMQ 133
           V++ YL Y   +   + T      L T SLL A G+         A A  + W+ KDG+ 
Sbjct: 107 VSDDYLDYMLLQFPTNVTGWICHTLVTSSLLKAVGIGSFTGTTAAASAAAIRWVSKDGIG 166

Query: 134 HVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
            VG+L I    G+  D +PK+WR+ AD +   G+  ++ + L P  FL +A LGN  K +
Sbjct: 167 AVGRLFIGGRFGSLFDDDPKQWRMYADFIGSAGSIFDLTTQLYPAYFLPLASLGNLTKAV 226

Query: 193 AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMI 252
           A      +   I + FA  GNL ++ AK E    +  +VGL  GI +  T        +I
Sbjct: 227 ARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVVAQLVGLSLGILILDTPGLVKSYGVI 286

Query: 253 VGPLLS--VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
               LS  ++H++   E +     NT+N +R  ++V   V    +    D    E++L  
Sbjct: 287 SLTWLSMRLLHLWLRYESLSVLQFNTINIKRARILVKSHVLHSTVPGCTDCNREENILAW 346

Query: 311 GRLIEDAGNVKVGRDPHKVVKPSKF--HELKELFAEEKFILCLGSRWTDM 358
            + ++      +  +    V+ S F    L +L+A EK+IL +  +  D+
Sbjct: 347 SQFMKPKIIFGLPLEKMDGVERSYFMVEALIKLYASEKYILMVNQQTEDL 396


>gi|321459639|gb|EFX70690.1| hypothetical protein DAPPUDRAFT_60988 [Daphnia pulex]
          Length = 264

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 74/253 (29%), Positives = 130/253 (51%), Gaps = 16/253 (6%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
           F P GYP SV++ YL Y  +  +Q F S+    LST+++L   G+     TP  AT++ W
Sbjct: 5   FLPQGYPSSVSDDYLEYQIWDTIQAFASSLSGSLSTRAILEGVGVGSSTSTPLAATLM-W 63

Query: 127 ILKDGMQHVGKLI---CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ------ 177
           ++KDG   VG+++   C+  G ++D++ K+WR  AD L D    LE+ +P          
Sbjct: 64  LIKDGTGMVGRILFAWCN--GTKLDADSKKWRFFADLLNDCALSLELCAPYAVAAVGGSN 121

Query: 178 -LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
                +  +   AK I  VA  ATR  +    A++GNL+D+ AK  +  TL N+  L   
Sbjct: 122 GTMTSILCVAGVAKSIVGVAGGATRAALTQHQARQGNLADVSAKDGSQETLVNLAALITS 181

Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
           + L   +  +     ++  L +++HV++ ++ ++A  + TLN  R  +I+  F  T  ++
Sbjct: 182 LWLLPILDGATGLTWLMFFLFTLLHVFANLKAVKAVTMETLNRARYMIILKQFASTKYVA 241

Query: 297 SPADLRYREDLLF 309
           S  ++   E ++ 
Sbjct: 242 SVKEVNRLEPVIM 254


>gi|320164887|gb|EFW41786.1| GTP-binding protein [Capsaspora owczarzaki ATCC 30864]
          Length = 436

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 74/247 (29%), Positives = 125/247 (50%), Gaps = 4/247 (1%)

Query: 49  VKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSL 108
           +++VD+  P            F P GYP SV+  Y  Y ++  LQ F S+    L+T ++
Sbjct: 31  LRLVDEKHPEASSFAQLLKAVFLPDGYPDSVSADYASYQRWDTLQAFCSSINGSLATLAV 90

Query: 109 LFAAGL--RPTPAQATVVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLG 165
           L   G+      A A  VSW+L+DG+  +G++  +   G  +D+  KRWR++AD   DL 
Sbjct: 91  LKGVGVGDETATATAAAVSWMLRDGVGMLGRIFFAWQKGPALDANAKRWRLVADIFNDLA 150

Query: 166 TGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIS 225
             +E+LSPL  + FL  A LG+  + I  VA  ATR  I    A+  N  D+ AK  +  
Sbjct: 151 MFVELLSPLVGEWFLAFACLGSVLRSIVGVAGGATRAAITQHQARRNNHGDVSAKDGSQE 210

Query: 226 TLFNVVGLGAGIQLASTVCSSMQGKM-IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAM 284
           TL N+  L   + L   + S     + ++    + +H+++    +RA   +T N +R  +
Sbjct: 211 TLVNLAALLVSLWLLPALGSDHPTLVWLLFCSFTSLHLFANYRGVRATVFDTFNRERLFI 270

Query: 285 IVADFVK 291
           + +D+++
Sbjct: 271 VASDYMQ 277


>gi|358335880|dbj|GAA28290.2| UPF0420 protein C16orf58 [Clonorchis sinensis]
          Length = 428

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 83/310 (26%), Positives = 138/310 (44%), Gaps = 19/310 (6%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILK 129
           PSGYP SV+  YL Y  +  +Q   S+    L+ QS+L   G+    A      +SW+LK
Sbjct: 21  PSGYPNSVSPDYLEYQIWDTVQALASSVTGALAAQSVLIGVGVGDVKATILGASLSWMLK 80

Query: 130 DGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
           DG   +GK++     G+ +D + KRWR+LAD + D    LE+ S   P +F  +  L N 
Sbjct: 81  DGSGMIGKIVFVGWQGSNLDCDCKRWRLLADIINDCALFLELASSFFPSVFTFIICLANL 140

Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
            K +  VA  ATR  +    A + N++D+ AK  +  TL N++ L   + +   V  +  
Sbjct: 141 CKSVVSVAGGATRAAVTLHQALDNNVADVAAKDGSQETLSNLLALFFNLSVVYFVTGNWS 200

Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD-------- 300
                  +L+ IHVY+    ++   ++  N  R  + V  + +      P+         
Sbjct: 201 IIWTGFVILTFIHVYANYRAVQCLRLSNFNRNRFHIAVQQWFRQRLFHDPSVSDPTATPF 260

Query: 301 -----LRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRW 355
                +  RE +L P           +   P+K  +     +L  +F++EK++L   S W
Sbjct: 261 PSIEWVNSREPILLPASDARIHLGCSLQTLPNK--EQYNLRDLINVFSQEKYLL-YCSNW 317

Query: 356 TDMVLEQNAS 365
                  N+S
Sbjct: 318 RATTHSPNSS 327


>gi|440638734|gb|ELR08653.1| hypothetical protein GMDG_03339 [Geomyces destructans 20631-21]
          Length = 714

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 96/352 (27%), Positives = 162/352 (46%), Gaps = 32/352 (9%)

Query: 60  HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RP 116
           H    SF+  F P+GYP SV E Y+ Y  + +LQ F+S+   +++++++L   G+     
Sbjct: 82  HSWAKSFMEIFLPAGYPNSVTEDYMEYQIYDSLQAFSSSIAGLIASRAVLEGVGVGDATA 141

Query: 117 TPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           +P  A ++S +L+D    +  +   + LG  ++ E KRWR++AD   D    L+ LSP  
Sbjct: 142 SPTTALLLS-VLQDSTGRIATIFFAARLGLSLEPECKRWRLVADIFNDAAMILDCLSPAF 200

Query: 176 PQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
           P+ L + +       + +  VAA + +  + + FAK GNL +L AK  +  T+ +++G+ 
Sbjct: 201 PKPLRVSLLSASAALRALCGVAAGSAKASLSAHFAKSGNLGELNAKDSSQETIISLLGML 260

Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRT------------ 282
            G  L   + + +Q    +  LLS IH+ +    +RA  + TLN QR             
Sbjct: 261 VGTALIPYLTTPLQTWTALLLLLS-IHLTTNYFAVRAVSMRTLNRQRASLLLRPLLSPPI 319

Query: 283 AMIVADFVKTGRISSPADLRYREDLLF--PGRLIEDAGNVKVGRDPHKVV--------KP 332
           +      +    I+SP  +   E ++   P  L  D         P   V         P
Sbjct: 320 SPAPLPALPPNPIASPLQISQLETIIHLDPSSL-HDPNPAHASFVPLATVLSHASRGPNP 378

Query: 333 SKFHELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAAYAASM 382
            + H L  +FA E +IL     +R   + L+ +A+   ALR W  A   A M
Sbjct: 379 IRPHILLSIFAAESYILYFDRYTRHAFIALKSSATASTALRSWFCAMECAHM 430


>gi|358380832|gb|EHK18509.1| hypothetical protein TRIVIDRAFT_14438, partial [Trichoderma virens
           Gv29-8]
          Length = 451

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 96/386 (24%), Positives = 176/386 (45%), Gaps = 33/386 (8%)

Query: 22  QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSF----LNKFFPSGYPF 77
            ++ I   + S  V + +    DGH+ +++ + S   + R    +     + F P G+P 
Sbjct: 23  NDVEIVELDKSGHVQQQWLHAHDGHV-LRLKNRSNGFFSRSAREWRKVLFDSFLPVGFPH 81

Query: 78  SVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHV 135
           SV++ YL Y  F +LQ F S   S+L+ ++LL   G+    + AT  ++  ILKD M  +
Sbjct: 82  SVSKDYLAYQMFDSLQAFFSTISSLLANRALLQGLGVGDASSSATFALLLTILKDAMSRI 141

Query: 136 GKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA-GLGNFAKGIA 193
             ++ ++  G R++ + KR+R LAD   D    LE+ SP        +A   G   + + 
Sbjct: 142 ATIVFAHRFGLRIEPDAKRYRFLADIFNDSAFFLELYSPYLGSWGKVIALSTGEALRALC 201

Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIV 253
            VAA A++  +   FA++ NL++L AK  +  T   ++GL  G  +  T+       +I+
Sbjct: 202 GVAAGASKAALSVHFARQDNLAELNAKEASQETAVGLIGLLVGTLVVKTI-EDHSSVVIL 260

Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRL 313
             LL   H+      +R+  + TLN QR  ++ ++++K   + +P  +  RE ++F   +
Sbjct: 261 MILLVFAHLGMNYLGVRSVCMTTLNRQRATILFSEYLKNRVVLTPEQVAERESIIFWSPI 320

Query: 314 I--------------EDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRW---T 356
           +              E  G+     DPH  +       + +     +FI     ++    
Sbjct: 321 MRNIRGKKVVYLEFAESYGHAMSCSDPHDQLI------IVDGIKHTRFICSRVPKYLTPI 374

Query: 357 DMVLEQNASGEDALRGWLVAAYAASM 382
            M+L + A   DA+R W  +   A M
Sbjct: 375 KMLLWEGAQPIDAIRAWFASMEIAWM 400


>gi|330805147|ref|XP_003290548.1| hypothetical protein DICPUDRAFT_155062 [Dictyostelium purpureum]
 gi|325079335|gb|EGC32940.1| hypothetical protein DICPUDRAFT_155062 [Dictyostelium purpureum]
          Length = 493

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 72/271 (26%), Positives = 135/271 (49%), Gaps = 17/271 (6%)

Query: 65  SFLNK----FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA-GLRPTPA 119
           S +NK    F P GYP SV+  Y+ Y ++   Q+   +   VLST +LL    G+    A
Sbjct: 91  SIINKAKSYFLPKGYPNSVSPDYIHYYKWLFCQNVVGSVTYVLSTHALLTTVVGMSVASA 150

Query: 120 Q--ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
              A  +SW+LKDG+     +  +N     +D + K+++   D L++LG  LE+ +P   
Sbjct: 151 LPFAAAISWVLKDGLGAFALVFFANKFSTLLDFDLKKYKFRGDILHNLGVLLEMCTPFIQ 210

Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
             FL +A + N +KG+A +   +TR  +  SF+ + N+ D+ AK ++ S    ++G+G G
Sbjct: 211 GYFLPLASISNLSKGLAGLIYGSTRASLNKSFSIKDNIGDITAKYQSQSMAAYLMGMGIG 270

Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
                 +  S    + +   LS +H++   + +++  + +LN QR ++I+ +++    I 
Sbjct: 271 STFGLLLSGSTFSGISIVFSLSFVHMFLGYKAVKSINLKSLNKQRLSIIIDNWLLNSEIL 330

Query: 297 SPADLRYREDLLFPGRLIEDAGNVKVGRDPH 327
           +   +   E+ +F           K  ++PH
Sbjct: 331 NSKIVNNNEEFIFDS---------KYNKEPH 352


>gi|293336742|ref|NP_001169416.1| hypothetical protein [Zea mays]
 gi|224029221|gb|ACN33686.1| unknown [Zea mays]
 gi|414876408|tpg|DAA53539.1| TPA: hypothetical protein ZEAMMB73_477539 [Zea mays]
          Length = 481

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 111/432 (25%), Positives = 184/432 (42%), Gaps = 33/432 (7%)

Query: 14  EPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVY--------HRVVDS 65
           EP+K+      P+      D V + Y       L +++     PV           +  +
Sbjct: 36  EPEKAR-----PLLAERYRDGVVKRYISDGSSKLQLRLEKHESPVNGVEDENADSLIPQA 90

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQ 120
            L+   P+G+P SV+  YL Y   +   + T     VL T SLL A G+         A 
Sbjct: 91  ILDFVLPAGFPGSVSADYLDYMLLQFPTNVTGWICHVLVTSSLLKAVGVGSFTGTSAAAS 150

Query: 121 ATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
           A  + W+ KDG+   G+L I    GA  D +PK+WR+ AD +   G+  E+ + L P  F
Sbjct: 151 AAAIRWVSKDGIGAFGRLLIGGRFGALFDDDPKKWRMYADFIGSAGSIFELSTTLYPGYF 210

Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
           L +A LGN AK IA      +   I + FA+ GNL ++ AK E       ++GL  G+ +
Sbjct: 211 LPLASLGNLAKAIARGFKDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSIGVLI 270

Query: 240 ASTVCSSMQGKMIVGPLLSV--IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
                       +V   LSV  +H++   + +      T+N +R  ++V   V    +  
Sbjct: 271 MDKTGVKSSYLTLVLTWLSVRLLHLWLRYQSISVLKFRTINLKRGRILVKSHVAQHTVPG 330

Query: 298 PADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSK----FHELKELFAEEKFILC--- 350
                  E++L   R +     V  G    +++   K     + L +L+  EK+IL    
Sbjct: 331 YVACNEEENILTWERYLRP--QVSFGVPMERMLGGDKSSDMVNRLLKLYKNEKYILFFEQ 388

Query: 351 LGSRWTDMVL--EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPL 408
            GSR    ++  +++A+    LR  L  A+     +   D     L+++   + + FT  
Sbjct: 389 FGSREPTFLVTFKESATSMSVLRS-LWQAHWLHKNQLRQDEVFPWLEESIVALENGFTDF 447

Query: 409 LSELQAKGWHTD 420
           L +L + GW  D
Sbjct: 448 LEQLGSAGWDQD 459


>gi|256077542|ref|XP_002575062.1| hypothetical protein [Schistosoma mansoni]
          Length = 410

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 71/229 (31%), Positives = 112/229 (48%), Gaps = 4/229 (1%)

Query: 67  LNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
           L KFF P GYP SV++ YL Y  +  +Q F S+    L++Q++L   G+  + A   +  
Sbjct: 41  LKKFFLPVGYPSSVSDDYLEYQIWDTIQAFASSITGALASQAVLIGVGVGDSSATILSAS 100

Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
           ++W+ KDG   +G++I +   G ++D + K WR +AD L D    LE++SPL   LF  +
Sbjct: 101 LTWMFKDGSGMIGRIIFAGYHGIKLDCDCKFWRFVADILNDCALFLEIISPLFNYLFTPL 160

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
             L N  K +  VA  ATR  I    A   NL+D+ AK  +  TL N++       L   
Sbjct: 161 LCLANVLKSLVGVAGSATRAAIVQHQAINNNLADVSAKDGSQETLSNLMAWLLNFILLYM 220

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
           V  +           + IH+YS    ++   + T N  R  + +  + K
Sbjct: 221 VTGNQFLIWFCFICCTSIHLYSNYRAVKCLKLRTFNRTRFHLAIQQWFK 269


>gi|312082381|ref|XP_003143421.1| hypothetical protein LOAG_07840 [Loa loa]
 gi|307761414|gb|EFO20648.1| hypothetical protein LOAG_07840 [Loa loa]
          Length = 393

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 70/249 (28%), Positives = 130/249 (52%), Gaps = 10/249 (4%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
           F + F P GYP SV+  Y+ Y  +  +Q F S+  S L+T+++L   G+        A  
Sbjct: 43  FRDIFMPRGYPQSVSPDYMNYQIWDTIQAFASSMSSALATEAILRGVGVGNETASTMAAA 102

Query: 124 VSWILKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
           V+W+L+DG+  + +++ +   +  +D++ K+WR++AD   DL   L++++P+ P LF+ +
Sbjct: 103 VAWLLRDGIGMLTRILFAWFYSPYLDADCKQWRLIADCFNDLAFCLDLITPIFPNLFMPI 162

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
             L +  +G+  VA  ATR  + +  A   N+ D+ AK  +  TL N+  L   + L   
Sbjct: 163 VCLSSMVRGVVGVAGGATRTAVVNHQAILDNVGDVAAKDGSQETLINIFALFCSLLLLPI 222

Query: 243 VCSSMQGKMIVGPLLSV---IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
           V     G + V  L  +   IH+Y     +++    TLN     +IV ++ +T +I + +
Sbjct: 223 V----NGNICVWLLFCIFTFIHLYGNYRAVKSLQFRTLNQSLLQIIVKNYAETKKIDTVS 278

Query: 300 DLRYREDLL 308
            +  +E +L
Sbjct: 279 KVNNKESML 287


>gi|449017626|dbj|BAM81028.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 518

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 85/285 (29%), Positives = 139/285 (48%), Gaps = 15/285 (5%)

Query: 33  DSVTRHYQFQPDGHLSVKVVDDSRPVY--HRVVDSFLNKFFPSGYPFSVNEGYLRYTQFR 90
           D   RH++  P+      V   +RP     R+V + L    P+ YP +V+  Y R+ ++ 
Sbjct: 66  DEEQRHWRVVPEPPAGTAV---TRPENWPQRLVQTLL---LPADYPSAVSSDYDRFWRWN 119

Query: 91  ALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKL-ICSNLGARM 147
            L+H    A  VL TQS+L A G+    A   A    WILKDG+ +  K+ + + L  R+
Sbjct: 120 ILRHTLLEAAEVLGTQSMLLALGVGGASALPLAAAWKWILKDGLGYFAKVALATQLAPRV 179

Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVA-ARATRLPIYS 206
           D++PKR+R   D +  LGT  E+L+   P  FL +A  GN  +  A VA   A R+ +Y 
Sbjct: 180 DNDPKRFRFYGDFVMALGTLFEMLTLAFPPWFLALASTGNLLRKAADVATGPAYRVFLY- 238

Query: 207 SFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVI 266
            F+   N  D+ +K E+   +  + G+GAGI +++         ++   LLS  H+Y+  
Sbjct: 239 HFSIRNNSGDVSSKSESQVVVGRLSGIGAGIAVSAVTSHDASALVMAYGLLSAGHLYATY 298

Query: 267 EEMRAAPVNTLNPQRTAMIVADFVKTG--RISSPADLRYREDLLF 309
           + +    + TLN  R   ++  +  +   R+ S A+    E   F
Sbjct: 299 QSVIRLELRTLNRTRLEYVLNVYTASSGKRVPSIAEANAAEQKSF 343


>gi|323455525|gb|EGB11393.1| hypothetical protein AURANDRAFT_11851, partial [Aureococcus
           anophagefferens]
          Length = 248

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 77/248 (31%), Positives = 126/248 (50%), Gaps = 9/248 (3%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR--PTPAQATVVSWI 127
           F P+G+P +V  GY  Y   +   +  S+A  VLST+ LL A G+        A   +W 
Sbjct: 1   FLPAGFPTTVTPGYGTYAALQFAAYACSSACGVLSTRVLLAAVGVGDGAAAPLAAAANWA 60

Query: 128 LKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLS-PLCPQLFLEMAGL 185
           +KDG+  +G +   +     +D+ P++WR+ + A  DL + +E+ + P  PQ F+ +AGL
Sbjct: 61  IKDGLGMLGGVAFAATWSNSLDARPRQWRLRSSAALDLASLVELAALPAFPQYFVPIAGL 120

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEG---NLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
            N AK ++ +AA A+R  I+ S +      NL+DL AK  + + + ++ GL  G+  AS 
Sbjct: 121 ANVAKNVSYLAASASRAAIHQSLSARSDASNLADLTAKTGSQTIVASLCGLALGVA-ASN 179

Query: 243 VCSSMQGKMIVG-PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
            C +    +      LS  H+      ++     TL+  R A +V  F +TG   +PAD+
Sbjct: 180 ACGAEASDVWPAWAALSACHLGCTYASLKYVNTTTLDDARLAALVDAFRETGACPTPADV 239

Query: 302 RYREDLLF 309
             RE L  
Sbjct: 240 SERESLFL 247


>gi|219129806|ref|XP_002185071.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403566|gb|EEC43518.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 500

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 80/256 (31%), Positives = 128/256 (50%), Gaps = 5/256 (1%)

Query: 56  RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
           R +   V     + F PSGYP     GYLRY+ +  +Q  ++   SVL+TQ +L   G+ 
Sbjct: 71  RVLLRNVQSGLRSTFLPSGYPARTPPGYLRYSVWSWVQDLSTQLRSVLATQRVLEGVGVG 130

Query: 116 PTPAQA--TVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLS 172
              A A   +++++++DG      L   S   +R  S+ KRWRI AD + D+G  LEV +
Sbjct: 131 REGATALSALLNFLVRDGCGMAASLAFTSAAASRFQSDVKRWRIFADVMVDIGITLEVAA 190

Query: 173 PLC-PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
             C P LFL M  LGN  K +  VAA A    I   +AK  ++SD+ AK  A  T+   +
Sbjct: 191 SACPPTLFLPMISLGNACKAMCGVAAGACGGAINLHWAKGSDISDIQAKFGAQHTVTGSL 250

Query: 232 GLGAGIQLASTVCSSMQGKM-IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
           GL      A +V +   G++ I+   L+ +HV + +  MR    +  N  R  +++    
Sbjct: 251 GLVFAAFFAKSVSTVPGGRLWILYAALTALHVMANLRCMRIIHFDVFNTTRLRLVLRALF 310

Query: 291 KTGRISSPADLRYRED 306
           ++G +S+   +   E+
Sbjct: 311 ESGTLSNKTAVTTAEN 326


>gi|242056489|ref|XP_002457390.1| hypothetical protein SORBIDRAFT_03g006560 [Sorghum bicolor]
 gi|241929365|gb|EES02510.1| hypothetical protein SORBIDRAFT_03g006560 [Sorghum bicolor]
          Length = 624

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 98/362 (27%), Positives = 165/362 (45%), Gaps = 16/362 (4%)

Query: 71  FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVS 125
            P+G+P SV+  YL Y   +   + T     VL T SLL A G+         A A  + 
Sbjct: 240 LPAGFPGSVSVDYLDYMLLQFPTNVTGWICHVLVTSSLLKAVGVGSFTGTSAAASAAAIR 299

Query: 126 WILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
           W+ KDG+   G+L I    GA  D +PK+WR+ AD +   G+  E+++PL P  FL +A 
Sbjct: 300 WVSKDGIGAFGRLLIGGRFGALFDDDPKKWRMYADIIGSAGSIFELMTPLYPGYFLPLAS 359

Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST-- 242
           LGN AK +A      +   I + FA+ GNL ++ AK E       ++GL AG+ +  T  
Sbjct: 360 LGNLAKAVARGFKDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSAGVLIMDTAG 419

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
           V SS     +    + ++H++   + +      T+N +R  ++V   V    +       
Sbjct: 420 VKSSYLTLALTWLSVRLLHLWLRYQSLSVLKFRTINLKRGRILVRSHVAQHTVPGYVACN 479

Query: 303 YREDLLFPGRLI--EDAGNVKVGRDPHKVVKPSKFHELKELFAEEKFILC---LGSRWTD 357
             E++L   R +  + +  V + R             L +L+  EK+IL     GSR   
Sbjct: 480 EEENILTWERFLRPQISFGVPMERMLGGDESSDMVDRLLKLYKNEKYILFFEQFGSRDPT 539

Query: 358 MVL--EQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAK 415
            ++  +++A+    LR  L  A+     +   D     L+++   + + FT  L ++ + 
Sbjct: 540 FLVTFKESATSMSVLRS-LWQAHWLHKNQLRQDGVFPWLEESLVALENGFTDFLGQMGSA 598

Query: 416 GW 417
           GW
Sbjct: 599 GW 600


>gi|398388772|ref|XP_003847847.1| hypothetical protein MYCGRDRAFT_63982 [Zymoseptoria tritici IPO323]
 gi|339467721|gb|EGP82823.1| hypothetical protein MYCGRDRAFT_63982 [Zymoseptoria tritici IPO323]
          Length = 457

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 92/390 (23%), Positives = 180/390 (46%), Gaps = 39/390 (10%)

Query: 67  LNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSW 126
           ++ F P+G+P SV E Y  Y  + +LQ F+S+  ++LS++++L + G+  T A  T  + 
Sbjct: 54  MDVFLPAGFPHSVTEDYTEYQIYDSLQAFSSSIAAMLSSRAVLSSIGVGDTKASPT-AAL 112

Query: 127 ILKDGMQHVGK----LICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLE 181
           +L    + VG+    L     G  ++ E K +R++AD L D    L+VLSP+ P+ + + 
Sbjct: 113 LLSVLQESVGRLATILFAHRFGTSLEPECKMYRLMADVLNDSAFILDVLSPIFPKPIRIV 172

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
           +    +  + +  VAA +++  + + FA+ GNL +L AK  +  T+ +++G+ AG  + S
Sbjct: 173 ILSFSSILRSLCGVAAGSSKASLSAHFARWGNLGELNAKDSSQETVISLMGMLAGSLVIS 232

Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
            + + +     +  LLS IH+ +    +RA  + TLN QR  ++     +   + S  ++
Sbjct: 233 WITTPLATWTALIGLLS-IHLETNRRAVRAVKMRTLNRQRATLVFHHLQRQQTVPSIKEI 291

Query: 302 RYREDLL-FPG--------------------RLIEDAGNVKVGRDPHKVVKPSKFHELKE 340
              E +  + G                    RL+E     +        ++     ++  
Sbjct: 292 SSVEHIFEWDGVLRTSTRDIMGYCDIGTPFLRLLEAVSESQTSTKASH-IQQQTLSQILS 350

Query: 341 LFAEEKFILCLGSRWT------DMVLEQNASGEDALRGWLVAAYAA----SMAKSFHDPS 390
           L+   ++IL    R T      +++L++ A  +D L  W  A + A    +  +   +  
Sbjct: 351 LYNSSRYILWHDRRSTKDVPRFNIILKKGAEPKDLLIAWWQALFHAQDDSAAGQDGFEEK 410

Query: 391 LTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
           L  L+ +  +  + F      L+ KGW  D
Sbjct: 411 LAALERSLSRAKEFFERYEKSLREKGWDVD 440


>gi|393227224|gb|EJD34913.1| DUF647-domain-containing protein [Auricularia delicata TFB-10046
           SS5]
          Length = 377

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 93/360 (25%), Positives = 165/360 (45%), Gaps = 29/360 (8%)

Query: 89  FRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLICS-NLGA 145
           F +LQ F S    +L+ +++L   G+    A AT  V+  I++D    V  ++ +  LG 
Sbjct: 2   FDSLQAFFSEIAGLLAARAVLTGLGVGDAGATATGAVLLKIIQDTCGRVATILSAWKLGP 61

Query: 146 RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF-LEMAGLGNFAKGIAVVAARATRLPI 204
            ++ E K+WR+ AD   D G  LE LSP  P    + +  +    + +  V+A   +  +
Sbjct: 62  MLEPECKQWRLAADLFNDAGIILECLSPTLPHTMKISLLCVARVVRSLCGVSAGGAKAAL 121

Query: 205 YSSFAKEG-NLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVY 263
              F   G +++D+ AK  +  T+  ++G+  G  +   + +S +G       L  +H+Y
Sbjct: 122 SQHFVGPGGSIADINAKESSQETVVALIGMLVGSMIVPRI-TSFEGTWTALLFLIGVHLY 180

Query: 264 SVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF-PGRLIEDA---GN 319
           +    +RA  + TLN QR  +++ + ++ GR+ SP  +R RE +     RL+ D    G 
Sbjct: 181 ANWRGVRAVAMLTLNRQRATIVLGELIQRGRVLSPEQVRQRESVFCRSSRLVLDGVTLGQ 240

Query: 320 VKVGRD-PHKVVKPSKFH----------ELKELFAEEKFIL-CLGSRWTD----MVLEQN 363
             +G    H + + ++ H           L + FA E++IL C   R       + L++ 
Sbjct: 241 CHIGVSLQHLLAQIARVHGPDRSGSALAGLMDRFASERYILHCCCRRKGKVSIIICLKRG 300

Query: 364 ASGEDALRGW---LVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
               D LR W   LV A   S   +  +P   VL++   +  ++F   +S L   GW  D
Sbjct: 301 VESRDMLRAWTHALVLAQRYSAIPASGEPDELVLEETTARARELFLGSISRLSNAGWKLD 360


>gi|307191039|gb|EFN74793.1| UPF0420 protein C16orf58-like protein [Camponotus floridanus]
          Length = 825

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 82/339 (24%), Positives = 160/339 (47%), Gaps = 19/339 (5%)

Query: 54  DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
           + + +  R+V      F P GYP SV+  Y  Y  +  +Q   S  ++ L+T S++   G
Sbjct: 392 EKQSICSRIVSIVKEIFLPQGYPDSVHSDYTAYQIWDTIQASASTIMNTLATHSIMIGIG 451

Query: 114 LRPTPAQ--ATVVSWILKDGMQHVGKLICS--NLGARMDSEPKRWRILADALYDLGTGLE 169
           +  + A   A  ++WILK+G   VG ++ +  N G ++D + K+WR+ AD L D+  G+E
Sbjct: 452 VGESNATPLAAAITWILKNGAGMVGSIMFAWWN-GTKLDGQCKKWRLFADILNDMAAGME 510

Query: 170 VLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
           +L        + +  +    K +  VA  ATR+ +    A   NL+D+ AK  +     N
Sbjct: 511 LLVSYFSSYSVVILCISTIMKSMVGVAGGATRVALTQHQAIRNNLADVAAKDGSQEMCVN 570

Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
           ++    GI + S +    Q  + +   L  +H+Y+    ++   ++ LN  R A+I+  +
Sbjct: 571 LIASFIGIFILS-LFHDRQYLIELYLFLVAVHLYANYSAVKVLCLDILNEDRLALIIKYY 629

Query: 290 VKTGRISSPADLRYREDLLF---PGRLIEDAG-NVKVGRDPHKVVKPSKFHELK-----E 340
           +   RI  P  L   E +     P + I   G N+K+G     V++ +    ++     +
Sbjct: 630 MINERIPEPKRLNKEESVSLLDNPTKSI--CGFNIKIGVSFASVLRKNVISSMEMEFLLK 687

Query: 341 LFAEEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAA 377
            F + K+++ +   +R   ++L+++    + L  +  A+
Sbjct: 688 FFEDRKYLIAIDVKNRNIFVILKKDIQATEILEAYFYAS 726


>gi|452005321|gb|EMD97777.1| hypothetical protein COCHEDRAFT_1165025 [Cochliobolus
           heterostrophus C5]
          Length = 421

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 96/405 (23%), Positives = 183/405 (45%), Gaps = 57/405 (14%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQAT 122
            ++ F PSG+P SV E Y+ Y  F +LQ F+S+   +L+++++L   G+     TP  A 
Sbjct: 1   MIDVFLPSGFPQSVTEDYVEYQIFDSLQAFSSSIAGMLASRAVLQGVGVGDSNATPTAAL 60

Query: 123 VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFL 180
           ++S +L++ +  +  +  ++  G  ++ E K +R+ AD L D G  L+ LSP  P+ L +
Sbjct: 61  LLS-VLQESVGRMATIAFADRFGTALEPECKMYRLTADILNDSGMILDCLSPGFPKPLRV 119

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
            +    +  + +  V A +++  + + FA  GNL ++ AK  +  T+ +++G+ AG  + 
Sbjct: 120 AVLSFASCLRSLCGVCAGSSKASLSAHFATAGNLGEVNAKDSSQETVISLLGMLAGSFVV 179

Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
           S + S +     +  LL VIH+ +    +RA  + +LN QR  ++ +  ++ G + SP +
Sbjct: 180 SHITSDIATWCTL-ILLLVIHLATNYAAVRAVSMQSLNRQRANILFSHILQHGIVLSPKE 238

Query: 301 LRYREDLLFPGRLI-----EDAGNVKVGRDPHKVV-----------------------KP 332
           +  RE +   G ++     E  G  ++G    +++                         
Sbjct: 239 VSARERIFERGGVLRWADDEVLGKCRIGTSLSELLGRIGRRDKLSGSLAFCSSGADGESE 298

Query: 333 SKFHELKELFAEEKFILCLGSRWTD---------MVLEQNASGEDALRGWLVAAYAAS-- 381
               +L  +F  E ++L     W           +VL+      D LR W  A   A   
Sbjct: 299 VDVFDLLSIFEAEDYVL-----WASGGPAAYEAVIVLKAGCGPIDQLRAWAHALLVAERR 353

Query: 382 ------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
                       +  +  L+D   ++  +F     E++AKGWH D
Sbjct: 354 QRGGDGGGTRARNGLMGELRDTLYEVRRMFDKYGDEMRAKGWHLD 398


>gi|242019564|ref|XP_002430230.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212515330|gb|EEB17492.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 527

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 78/291 (26%), Positives = 136/291 (46%), Gaps = 16/291 (5%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
           F P GYP+SV+  YL Y  +   Q F S+    L+ ++++   G+     TP  A + +W
Sbjct: 44  FLPKGYPYSVSSDYLEYQIWDFCQAFCSSICGTLTVRAVMKGIGVGSNVATPMSAAL-TW 102

Query: 127 ILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
           I+KDG   +G ++ + L G+++DS+ K+WR+ AD L D+   +E+  P+     + M   
Sbjct: 103 IIKDGTGMIGSIMFAWLKGSKLDSDCKKWRLFADILNDIAMFIELSIPIIMDYSMYMLCA 162

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
               K +   A  +TR  I +  A +GN +D+ AK  +     N++G   GI L   V  
Sbjct: 163 ATIFKALVGTAGSSTRAAIITHQAVKGNTADVSAKDGSQELFTNLLGFIFGIFLLKNVNE 222

Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
           S Q   IV    ++ H+    + +R   +N  N +R  + +  ++ T  + +   +  RE
Sbjct: 223 SNQ-VWIVMCTATIFHLIFNYKAVRCLKINVFNGERLRLTLLSYLPTSCVPTIEQVNRRE 281

Query: 306 DLLFPGRLIED----AGNVKVGRDPHKVVKPSK-----FHELKELFAEEKF 347
            ++  G    D       +K G   +K VK  K        L  +F  +K+
Sbjct: 282 PVIL-GCGTSDYQLCGFKIKFGVSVNKAVKSEKMTNQQLSTLANVFQSKKY 331


>gi|417401281|gb|JAA47531.1| Hypothetical protein [Desmodus rotundus]
          Length = 457

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 95/390 (24%), Positives = 168/390 (43%), Gaps = 54/390 (13%)

Query: 63  VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
           +   L  F P G+P SV+  YL Y  + ++Q F S+    L+T ++L   G+    A   
Sbjct: 70  LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129

Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWR-ILADALYDLGTGLEVLSPLCPQLF 179
           A   +WI+KD    +G++I +            W+ + AD L D+   LE+++P+ P  F
Sbjct: 130 AATATWIVKDSTGMLGRIIFA-----------WWKGLFADILNDVAMFLEIMAPIYPMCF 178

Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
                + N AK +  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +
Sbjct: 179 TVTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 238

Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
              V       +    LL+ +HVY+    ++A  + TLN  R  +++  F++ G +  P 
Sbjct: 239 LPLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPT 298

Query: 300 DLRYREDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSR 354
                E L    +P        ++ +G   H+++  S   EL++L     E ++L     
Sbjct: 299 SANQMEPLWTGFWPSL------SLSLGAPLHRLI--SSVFELQQLVEGHREPYLL----H 346

Query: 355 W------TDMVLEQNASGEDALRGWLVAAYAASMAKSFHDP-----------------SL 391
           W        +VL Q A  E  LR         ++ +    P                 S 
Sbjct: 347 WDQSQNRVQVVLSQMAGPETILRAATHGLVLGALREDGPLPRELEELRNQIRAGPKKDSW 406

Query: 392 TVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
            ++++ +  ++ +F   L  LQ  GW T++
Sbjct: 407 VIVKETHQVLDKLFPKFLKGLQDVGWKTEK 436


>gi|395332415|gb|EJF64794.1| hypothetical protein DICSQDRAFT_80493 [Dichomitus squalens LYAD-421
           SS1]
          Length = 506

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 110/439 (25%), Positives = 180/439 (41%), Gaps = 32/439 (7%)

Query: 22  QEIPIYWFETSDSVTRHYQ--FQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSV 79
           +E  I W +    + + ++    P+       V     V  R+       F P+ YP SV
Sbjct: 70  RECHISWSDGRSGIRKEWRDTASPEATEGNVAVSGRNSVTGRISSWLRQMFLPTNYPQSV 129

Query: 80  NEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGK 137
           +  Y  +   +AL+   +   SVL  Q+LL + G+    +   A  V WI+KDG   V K
Sbjct: 130 HSSYASFHTLQALETTMATITSVLCNQALLTSVGMSAEGSVFGAVAVQWIIKDGAGEVAK 189

Query: 138 L-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL---CPQLFLEMAGLGNFAKGIA 193
           L    +     DS PK + +  + L  LG+GL++ + L    P  FL  A  GN  K + 
Sbjct: 190 LFFIRHFSTYFDSHPKTFTLFGEVLGCLGSGLQIATVLIAPSPLNFLLCAAGGNIFKLVG 249

Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIV 253
                 T +     F+ +GN  D+ AK E+ +++  + G  AGI L   +  S     + 
Sbjct: 250 NAIWSTTHIKFIRYFSMQGNDGDVAAKDESQASVAQLAGYAAGISL---LTFSHAPAYLY 306

Query: 254 GPLLSVI--HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPG 311
               + +  H+      MR A    L   R + +   +V  G + S  DL         G
Sbjct: 307 AIFFAAVPLHLTMTAYMMRVATFELLTLPRISHLAQTYVNDGVVGSQKDLDREHATGLFG 366

Query: 312 RLIEDAGNV------KVGR--DPHKVVKPSKFHELKELFAEEKFIL--CLGSR--WTDMV 359
              ++ G+       +VG   +    V  S +    ++F +++++L  C   R     ++
Sbjct: 367 EFYKNNGDRWLTLAPRVGDALNTSSDVDRSTWQVCSQVFQDDRYLLLPCDSPRGPLISVL 426

Query: 360 LEQNASGEDALRGWLVAAYAASMAK---SFHDPSL-TVLQDAYDKMNDVFTPLLSELQAK 415
              +A+ +D L+  L AA   S+     S   PSL TVL +   +    F+     L  +
Sbjct: 427 FHPDATTDDMLQSILHAASVRSLLSRDASCPTPSLRTVLSETRVQAQQDFSAFKQALHER 486

Query: 416 GWHTDR--FLDGTGTRFAW 432
           GW TD   F D  G R  W
Sbjct: 487 GWRTDELCFAD-LGHRVIW 504


>gi|322707810|gb|EFY99388.1| DUF647 domain-containing protein [Metarhizium anisopliae ARSEF 23]
          Length = 462

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 89/353 (25%), Positives = 163/353 (46%), Gaps = 19/353 (5%)

Query: 2   NLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHR 61
           +L+++    +  E DKS +VQ     W   SD+     + + + +    ++  S   + R
Sbjct: 20  DLIKQGDDVEISELDKSGNVQR---RWLHRSDNTIAILESKTEPN---SIIHSSVREWKR 73

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
            +    + F P G+P SV+  YL Y  + +LQ F S   S+L+ ++LL   G+    + A
Sbjct: 74  FLS---DAFLPIGFPHSVSNDYLAYQTYDSLQAFFSTITSLLANRALLQGLGVGDADSSA 130

Query: 122 T--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
           T  ++  IL D    +  ++ ++  G R++ + KR+R LAD   D    LE+ SP     
Sbjct: 131 TFAMLITILTDATSRIATIVFAHRFGLRIEPDAKRFRFLADIFNDSAFFLELYSPYFGDY 190

Query: 179 FLEMA-GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
              +A  +G   + +  VAA A++  +   FAK  NL++L AK  +  T   ++GL  G 
Sbjct: 191 GKILALSVGQALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLVVGT 250

Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
            L   +    Q  + +   L   H++     +R+  + TLN QR  ++  +++ TGR+ +
Sbjct: 251 -LVVKLVQDHQSVVYLMVALVFAHLWMNYLGVRSVCMTTLNRQRATILFREYMTTGRVLT 309

Query: 298 PADLRYREDLLF-----PGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFAEE 345
           P  +  RE+++       GRL E    ++      +         +K +  EE
Sbjct: 310 PEAVAARENIVLWLPVVGGRLAERVCKIEFAESYQRAAASCNGRRVKVVDGEE 362


>gi|302847291|ref|XP_002955180.1| hypothetical protein VOLCADRAFT_121395 [Volvox carteri f.
           nagariensis]
 gi|300259472|gb|EFJ43699.1| hypothetical protein VOLCADRAFT_121395 [Volvox carteri f.
           nagariensis]
          Length = 1055

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 86/375 (22%), Positives = 175/375 (46%), Gaps = 37/375 (9%)

Query: 5   EKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQP---DGH----LSVKVVDDSRP 57
           E  ++     P    S++ +P+       +V +  Q  P   DG     +S   + DS  
Sbjct: 218 EASERHSNGHPSAPASLERLPV-------AVKKGDQLLPYVWDGRRITAMSEAALQDSDV 270

Query: 58  VYHRVVDSFL-------NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLF 110
              R+ +  L         F P   P  V   Y  + ++R  Q F S+ +   S  +LL 
Sbjct: 271 TTGRLRNWLLGVQRQLKEAFLPD--PRDVTPDYWEWLRWRLTQRFFSSTMQNFSFSALLM 328

Query: 111 AAGL--RPTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTG 167
           A GL  +   A +  ++W+LKDG+  + ++ + ++ G   D++ KR R +   ++     
Sbjct: 329 ATGLGAKKAFAASAAINWLLKDGVSRIVRMSVSTSFGQTFDADLKRMRFITSLIFTACMA 388

Query: 168 LEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTL 227
            E  +P  PQ F+ +A + +  + + + A  AT+    ++ A  GNL+DL +K +A   +
Sbjct: 389 GEFATPFYPQHFVALASISSVGRAVGLSAFVATQPAFQAALATGGNLADLTSKNQAQHMV 448

Query: 228 FNVVGLGAGIQLASTVCSSM-QGKMIVG----PLLSVIHVYSVIEEMRAAPVNTLNPQRT 282
            +++ LG    L + +C  M +G +++     PL +   +  +  E++A  + TLN +R 
Sbjct: 449 MDMLALGVSAGL-TWLCRGMPRGGLLLPAVMYPLCAAGDLTCIWHELKAVQLRTLNRERA 507

Query: 283 AMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELF 342
            M++  +++ G +   A++   E+L+ P  +      +++G  P   + P+   +L++L 
Sbjct: 508 EMLIERWMRRGAVPDAAEISAAENLILPSDVWRGLLPLRIG--PLDRIAPANPEQLRKLL 565

Query: 343 AE---EKFILCLGSR 354
            E   E+++L +  R
Sbjct: 566 REYDGEEYVLHVADR 580


>gi|298706870|emb|CBJ25834.1| Unknown protein [Ectocarpus siliculosus]
          Length = 673

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 76/242 (31%), Positives = 126/242 (52%), Gaps = 15/242 (6%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTP 118
           V    +N F P+GYP SV   Y+RY  +  LQ  +S    VL+TQ++L   G+     TP
Sbjct: 196 VRGGLMNTFLPAGYPGSVRPEYVRYRCWDILQDLSSHLRGVLATQAVLEGMGVGRAGSTP 255

Query: 119 AQATVVSWILKDGMQHVGKLICSNLG-ARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
             AT + W+ +DG   +G L+ +++  A      K WR+ AD   D+G  LE+L+PL P 
Sbjct: 256 LAAT-LQWMARDGASMIGGLLFTSVASANFGVNIKTWRLFADTSNDIGLALEMLAPL-PA 313

Query: 178 L-----FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVG 232
                 FL +  + +  K    VA  AT   I   +A + N++D+ AK  A  T+ +++G
Sbjct: 314 FREGGRFLRLICIASVFKAACGVAGGATGAAITEHWAIDNNIADIGAKNGAQHTVASLLG 373

Query: 233 LGAGIQLAS--TVCSSMQGKMIVG--PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
           LG  +  A   +V  S  G  + G   +L+++H+ +    MR   + +LNP R +++V +
Sbjct: 374 LGLSVWFARAVSVGGSASGSRVWGWYAVLTLVHLLANYAAMRTLALRSLNPTRASLLVTE 433

Query: 289 FV 290
           ++
Sbjct: 434 YL 435


>gi|7327810|emb|CAB82267.1| putative protein [Arabidopsis thaliana]
          Length = 447

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 85/311 (27%), Positives = 145/311 (46%), Gaps = 20/311 (6%)

Query: 124 VSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
           + W+ KDG+  +G+L I    G+  D +PK+WR+ AD +   G+  ++ + L P  FL +
Sbjct: 120 IRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFLLL 179

Query: 183 AGLGNFAKGIAVVAARATRLP----IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
           A  GN AK +    AR  R P    I + FA  GNL ++ AK E       ++GLG GI 
Sbjct: 180 ASTGNLAKAV----ARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFGIL 235

Query: 239 LAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
           +  T  +  S    ++    + ++H++   + +     NT+N +R  +IV   V    + 
Sbjct: 236 IIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHSVVP 295

Query: 297 SPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKP--SKFHELKELFAEEKFILCLGSR 354
              D   RE++L   R ++      V  +    ++   SK   L +++ +EK+IL L   
Sbjct: 296 GYVDCNKRENILLWQRFMKPRIIFGVSLEELSGLEKSVSKVKALLKMYTKEKYILTLNKL 355

Query: 355 WTD----MVLEQNASGEDALRG-WLVAAYAASMAKSFHDPSLTV--LQDAYDKMNDVFTP 407
             D    +  + NA+  D LR  W       +M +SF D       L+ +  +M++ F  
Sbjct: 356 NKDTEFSVSFKVNATSRDVLRCLWQAYWLEENMEESFKDKDSVFHWLKQSLSEMDNKFDD 415

Query: 408 LLSELQAKGWH 418
            L +L   GW+
Sbjct: 416 FLFKLDTAGWN 426


>gi|326427989|gb|EGD73559.1| hypothetical protein PTSG_05267 [Salpingoeca sp. ATCC 50818]
          Length = 404

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 72/231 (31%), Positives = 118/231 (51%), Gaps = 3/231 (1%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-ATVVSWIL 128
           F P  Y  SV+  +L Y +++ +   T  A  VLS QSLL+A G+       A  ++WIL
Sbjct: 120 FLPRDYRASVSPKFLDYCKWQFVHMTTMTASGVLSMQSLLYAVGVGAGSVPLAAAINWIL 179

Query: 129 KDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGN 187
           KDG+  +G ++  ++ G R D +PKR R  A     +   +++++PL P  FL +A + N
Sbjct: 180 KDGLGQLGGMLYGSIFGTRFDDDPKRQRFNAVLSLQVSGIVDIITPLFPHHFLLLASVSN 239

Query: 188 FAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSM 247
           F K ++ +A+ ATR  +  SF +  NL D+ AK  + S   +V G G GI ++    +  
Sbjct: 240 FGKNVSYLASSATRAQMNLSFTRTSNLGDVTAKMTSQSIAASVFGTGLGILVSKVTGTEA 299

Query: 248 QGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
              M     LS++ +       +   + T N QR A I+A    T ++ +P
Sbjct: 300 ALLMAAYVPLSLVSIGGNYYSSKFVSLKTFNVQR-AEILARHYFTTKVCTP 349


>gi|225440682|ref|XP_002274737.1| PREDICTED: UPF0420 protein-like [Vitis vinifera]
          Length = 503

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 82/304 (26%), Positives = 144/304 (47%), Gaps = 18/304 (5%)

Query: 71  FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---------RPTPAQA 121
            P+G+P SV++ YL Y   +   + T+     L T SLL A G+             A A
Sbjct: 119 LPAGFPGSVSDDYLEYMLLQFPTNVTAWICHTLVTSSLLKAVGVGSFSATTAAASAAASA 178

Query: 122 TVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL 180
             + W+ KDG+  VG+L I    G   D +PK+WR+ AD +   G+  ++ + L P  FL
Sbjct: 179 AAIRWVSKDGIGAVGRLFIGGQFGNLFDDDPKQWRMYADLIGSAGSIFDLSTQLYPAYFL 238

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
           ++A LGN AK +A      +   I + FA  GNL ++ AK E       ++GL  GI + 
Sbjct: 239 QLASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLALGIMVL 298

Query: 241 ST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
           ++  V +S          + ++H++   + +      ++N +R  ++V   V    +   
Sbjct: 299 TSPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHSIVPGY 358

Query: 299 ADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV---KP-SKFHELKELFAEEKFILCLGSR 354
            D   +E +L   R ++    +  G    +++   +P SK   L +L+++EK+IL +  +
Sbjct: 359 VDCNRKEKILSWQRFLKP--RIIFGVSLEELIGGERPVSKVRTLLKLYSQEKYILVVKQQ 416

Query: 355 WTDM 358
            TD 
Sbjct: 417 GTDF 420


>gi|449301217|gb|EMC97228.1| hypothetical protein BAUCODRAFT_147370 [Baudoinia compniacensis
           UAMH 10762]
          Length = 454

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 85/310 (27%), Positives = 150/310 (48%), Gaps = 16/310 (5%)

Query: 30  ETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQF 89
           + + +VT  Y   P     V  VD   P         L+ F P+G+P SV   Y+ Y  +
Sbjct: 12  DEAGNVTATYVASPTSSDGVCRVDVIHPATKAPWQRLLDIFLPAGFPHSVTVDYVEYQIY 71

Query: 90  RALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLICSNL-GAR 146
            +LQ F+S  + +LS++++L + G+    A  T  ++  I ++ +  +G +    L G  
Sbjct: 72  DSLQAFSSNIVGLLSSRAVLLSVGVGDADASPTYALLLHIAQESLGRIGTIAFVALYGTA 131

Query: 147 MDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL--GNFAKGIAVVAARATRLPI 204
           ++ E K +R+LAD   D    L+ LSP  P+ +L +  L   +  K +  VAA + +  +
Sbjct: 132 LEPECKMYRLLADVFNDFAFVLDCLSPAIPK-YLRVVTLSFSSLLKALCGVAAGSAKASL 190

Query: 205 YSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYS 264
            + FA+ GNL +L AK  +  T+ +++G+  G  + S V  + Q       LL  IH+ +
Sbjct: 191 SAHFAQWGNLGELNAKDSSQETVISLLGMWVGSLVLSWV-RTPQATWTTLILLLAIHLET 249

Query: 265 VIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG- 323
               +RA  + TLN QR A +V   ++ G + +P ++  +E +   G ++  A    +G 
Sbjct: 250 NRRAVRAVVMRTLNRQR-ATLVFHHLQRGHVPTPEEVSKQERIFERGGVLRSADGQTLGF 308

Query: 324 -------RDP 326
                  RDP
Sbjct: 309 CAVGMPLRDP 318


>gi|345316269|ref|XP_001510147.2| PREDICTED: UPF0420 protein C16orf58 homolog, partial
           [Ornithorhynchus anatinus]
          Length = 353

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 69/241 (28%), Positives = 120/241 (49%), Gaps = 3/241 (1%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR--PTPAQATVVSWI 127
           F P G+P SV+  YL Y  +  +Q F S+    L+TQ++L   G+        +   + +
Sbjct: 3   FLPRGFPDSVSPDYLSYQLWDTVQAFASSLTGSLATQAVLRGVGVGDGTMTVDSPTGALL 62

Query: 128 LKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +K G   +G++      G+++D + K+WR+ AD L D    LE+++P  P  F     + 
Sbjct: 63  IKHGTGMLGRIGFAWKKGSKLDCDAKKWRLFADVLNDAAMFLEIVAPAFPAYFTLTVCVS 122

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           + AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + L   V   
Sbjct: 123 SLAKCIVGVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLVSLMLLPLVSDR 182

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
               + +  L + +H+Y+    +RA  + TLN  R  +++  F++ G++ S A    +E 
Sbjct: 183 PSLCLPLFLLFTSLHLYANFRAVRAVVMETLNESRLRLVLEHFLREGQVLSLASANEKEP 242

Query: 307 L 307
           L
Sbjct: 243 L 243


>gi|453080329|gb|EMF08380.1| DUF647-domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 464

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 102/405 (25%), Positives = 190/405 (46%), Gaps = 50/405 (12%)

Query: 58  VYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT 117
           ++H+ VD FL    P G+P SV + YL Y  + +LQ F+S+  ++LS++++L   G+   
Sbjct: 50  LWHKTVDVFL----PVGFPQSVTDDYLEYQIYDSLQAFSSSIAAMLSSRAVLEGVGVGDA 105

Query: 118 PAQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
            A AT  ++  IL+D +  +  ++ ++  G  ++ E K +R+LAD L D G  L+ LSP+
Sbjct: 106 TANATTALLLSILQDSVGRITTIMFAHRFGMSLEPECKMYRLLADILNDTGFVLDCLSPI 165

Query: 175 CPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
            P+   + +    +  + +  VAA + +  + + FAK+GNL +L AK  +  T+ ++ G+
Sbjct: 166 FPKPTRVVILSFSSILRSLCGVAAGSAKASLSAHFAKQGNLGELNAKDSSQETVISLAGM 225

Query: 234 GAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
             G  + + V + +     +  LLS IH+ +    +RA  + TLN QR A ++   ++ G
Sbjct: 226 LFGSWVITWVKTPVATWTALILLLS-IHLETNRRAVRAVKMQTLNRQR-ATLLYHHLRKG 283

Query: 294 RISSP-----ADLRYREDLLFPGRLIEDAGNVKVGR----------DPHKVVKPSK---- 334
            + +P      +L +  D +   R     G+  VG             HK  K  +    
Sbjct: 284 HLPTPIEVADQELIFERDGILRSREGRAIGHCSVGAPMSRLLAAVGQQHKTTKSIRVDDQ 343

Query: 335 --FHELKELFAEEKFILCLGSRWTD-----------MVLEQNASGEDALRGWLVAAYAAS 381
              + + EL+++ + I+     W D           +VL++ A  +     W  A   A 
Sbjct: 344 QHLNRILELYSDAQCIV-----WCDPSSRPGKASVFIVLKKAADSKALTLAWWQALEFAE 398

Query: 382 MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGW--HTDRFLD 424
                  P   +  D   ++ +VF  +  ++ A  W  H DR ++
Sbjct: 399 CLGVAPTPVADMNPDELSRLPEVFGKVRDQVNAL-WTTHYDRLVE 442


>gi|417401214|gb|JAA47499.1| Hypothetical protein [Desmodus rotundus]
          Length = 453

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 86/323 (26%), Positives = 146/323 (45%), Gaps = 37/323 (11%)

Query: 63  VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ-- 120
           +   L  F P G+P SV+  YL Y  + ++Q F S+    L+T ++L   G+    A   
Sbjct: 70  LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129

Query: 121 ATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWR-ILADALYDLGTGLEVLSPLCPQLF 179
           A   +WI+KD    +G++I +            W+ + AD L D+   LE+++P+ P  F
Sbjct: 130 AATATWIVKDSTGMLGRIIFA-----------WWKGLFADILNDVAMFLEIMAPIYPMCF 178

Query: 180 LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQL 239
                + N AK +  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL   + +
Sbjct: 179 TVTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 238

Query: 240 ASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
              V       +    LL+ +HVY+    ++A  + TLN  R  +++  F++ G +  P 
Sbjct: 239 LPLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPT 298

Query: 300 DLRYREDL---LFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCLGSR 354
                E L    +P        ++ +G   H+++  S   EL++L     E ++L     
Sbjct: 299 SANQMEPLWTGFWPSL------SLSLGAPLHRLI--SSVFELQQLVEGHREPYLL----H 346

Query: 355 W------TDMVLEQNASGEDALR 371
           W        +VL Q A  E  LR
Sbjct: 347 WDQSQNRVQVVLSQMAGPETILR 369


>gi|413956564|gb|AFW89213.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
          Length = 485

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 69/256 (26%), Positives = 129/256 (50%), Gaps = 9/256 (3%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
           + R+    L  F P G+P SV   Y+ +  +  LQ  ++   ++LSTQ+LL A G+    
Sbjct: 50  WERIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQS 109

Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           A        W L+D    +G ++ +   G+ +DS  K WR++AD + DLG  +++LSPL 
Sbjct: 110 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 169

Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK---GEAISTLFNVVG 232
           P   + +  LG+ ++    VA+ ATR  +   FA   N +D+ AK    E ++T+  +  
Sbjct: 170 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGL 229

Query: 233 LGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
                Q+      S+    +    L++ H+Y+  + +++  + TLN +R ++++  F + 
Sbjct: 230 GMLLAQITRGHALSVWASFLS---LTMFHMYANYKAVQSLSLTTLNYERASILLQYFKEC 286

Query: 293 GRISSPADLRYREDLL 308
           G +  P  +  +E +L
Sbjct: 287 GEVLVPRKVSQQEHIL 302


>gi|346318934|gb|EGX88536.1| DUF647 domain-containing protein [Cordyceps militaris CM01]
          Length = 467

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 86/296 (29%), Positives = 142/296 (47%), Gaps = 14/296 (4%)

Query: 14  EPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPS 73
           E DKS  VQ+  +Y     +   R    +P GH +       R     + DSFL    P 
Sbjct: 28  ELDKSGHVQQKWLY-----EPSGRIVHIRPKGHTNDLFSRSLREWKKILSDSFL----PV 78

Query: 74  GYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDG 131
           G+P SV+  YL Y  F +LQ F S   S+L+ ++LL   G+    + AT  ++  IL+D 
Sbjct: 79  GFPHSVSSDYLAYQTFDSLQAFFSTITSLLANRALLQGLGVGDANSSATFALLLTILRDA 138

Query: 132 MQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFA 189
              V  +      G R++ + KR+R LAD   D    LE+ SP L P   +    +G   
Sbjct: 139 TSRVATIAFAHRFGLRIEPDAKRYRFLADLFNDSAFFLELYSPYLSPWGKVAALSIGEAL 198

Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
           + +  V+A A++  +   FAK  NL++L AK  +  T   ++GL     +   V    + 
Sbjct: 199 RALCGVSAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGL-LVGTVVVKVVEDHRS 257

Query: 250 KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
            + +  LL   H++     + +  + TLN QR  ++  ++++TG++ SPA++  RE
Sbjct: 258 VVFLMILLVFGHLWMNYLGVCSVEMTTLNRQRATILFQEYLQTGKVLSPAEVSKRE 313


>gi|302849057|ref|XP_002956059.1| hypothetical protein VOLCADRAFT_119273 [Volvox carteri f.
           nagariensis]
 gi|300258564|gb|EFJ42799.1| hypothetical protein VOLCADRAFT_119273 [Volvox carteri f.
           nagariensis]
          Length = 208

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 64/190 (33%), Positives = 95/190 (50%), Gaps = 5/190 (2%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
           + DS  + F P G+P SV   YL Y  +    H T      L+T SLL A G+  +PA  
Sbjct: 17  IPDSIRDLFLPPGFPDSVTPDYLPYQLWSLPTHVTGHLSHSLATSSLLTAVGVSTSPAAT 76

Query: 122 TVVS----WILKDGMQHVGKLIC-SNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
             +S    WI+KDG+  +G+ I  S   A  D +P+RWR++A+ L   G GLEV + L P
Sbjct: 77  VALSASIKWIIKDGVGALGRFIVGSRFSAEFDEDPRRWRLVAELLSTAGLGLEVATSLYP 136

Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
           Q F+ +A  G F + +     +     I + FA+  N+  + AK E       +VGL A 
Sbjct: 137 QSFMLLASAGKFGQALGKGMGKPVFRVIQTHFARSQNVGAVAAKEEVWEVAAQMVGLMAS 196

Query: 237 IQLASTVCSS 246
           + +  T+  S
Sbjct: 197 VAVLRTLEGS 206


>gi|356566668|ref|XP_003551552.1| PREDICTED: uncharacterized protein LOC100787355 [Glycine max]
          Length = 1041

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 80/305 (26%), Positives = 139/305 (45%), Gaps = 18/305 (5%)

Query: 64  DSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTP 118
           D+  +   P+G+P SV++ YL Y   +   + T      L T SLL A G+         
Sbjct: 649 DTIKDFILPAGFPGSVSDDYLDYMLLQFPTNVTGWICHTLVTSSLLKAVGIGSFSGTSAT 708

Query: 119 AQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
           A A+ + W+ KDG+  VG+L +    G+  D +PK+WR+ AD +   G+   + + + P 
Sbjct: 709 ASASAIRWVSKDGIGAVGRLCLGGRFGSLFDDDPKQWRMYADFIGSAGSIFYLTTQVYPD 768

Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
            FL +A LGN  K +A      +   I + FA  GNL ++ AK E    +  ++GL  GI
Sbjct: 769 YFLPLASLGNLTKAVARGLKDPSFCVIQNHFAISGNLGEVAAKEEIWEVVAQLIGLALGI 828

Query: 238 QLASTVCSSMQGKMIVGPLLSV--IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
            +  T        ++  P L +  +H++   + +     NT+N +R        V    +
Sbjct: 829 LILDTPSLVKSYGVLSLPWLGMQFLHLWLRYKSLSVLQFNTINLKR--------VLHSTV 880

Query: 296 SSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKF--HELKELFAEEKFILCLGS 353
                    E++L   + ++   N  +  +    V+ S F    L +L+A EK+IL +  
Sbjct: 881 PGCTYCNREENILTRSQFMKPKINFGLPLEKIDGVERSHFMVEALLKLYASEKYILMVNQ 940

Query: 354 RWTDM 358
           +  D+
Sbjct: 941 QLEDL 945


>gi|432112420|gb|ELK35212.1| hypothetical protein MDA_GLEAN10003470, partial [Myotis davidii]
          Length = 638

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 93/358 (25%), Positives = 160/358 (44%), Gaps = 43/358 (12%)

Query: 95  FTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLICSNL-GARMDSEP 151
           F S+    L+T ++L   G+        A   +W+LKD    +G++I +   G+++D   
Sbjct: 272 FASSLSGSLATHAVLLGIGVGNASASVSAATATWLLKDSTGMLGRIIFAWWKGSKLDCNA 331

Query: 152 KRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE 211
           K+WR+ AD L D    LE+++PL P  F     + N AK I  VA  ATR  +    A+ 
Sbjct: 332 KQWRLFADILNDTAMFLEIMAPLLPICFTMTICISNLAKCIVSVAGGATRAALTMHQARR 391

Query: 212 GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
            N++D+ AK  +  TL N+ GL   + +   V       +    LL+ +HVY+    +RA
Sbjct: 392 NNMADVSAKDGSQETLVNLAGLLVSLLMLPLVSDCPSFSLGCFFLLTALHVYANYRAVRA 451

Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL---LFPGRLIEDAGNVKVGRDPHK 328
             + TLN  R  +++  F++ G +  P      E L    +P        ++ +G   H 
Sbjct: 452 LVIETLNEGRLWLVLKHFLQRGEVLDPTSANQMEPLWTGFWPSL------SLSLGVPLHH 505

Query: 329 VVKPSKFHELKELFA--EEKFILCLGSRW------TDMVLEQNASGEDALR----GWLVA 376
           +V  S   EL++L    +E ++L    RW        +VL Q A  E  LR    G ++ 
Sbjct: 506 LV--SSVAELQQLAEGHQEPYLL----RWDQSQNQVQVVLSQEAGPETVLRAATHGLVLG 559

Query: 377 A-------------YAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
           A               + +    +  S  ++++ +  ++ +F   L  LQ  GW T++
Sbjct: 560 ALREDGPLPRELGELRSQVRAGPNTESWVIVKETHQVLDKLFPQFLRGLQDAGWETEK 617


>gi|297825247|ref|XP_002880506.1| hypothetical protein ARALYDRAFT_481217 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326345|gb|EFH56765.1| hypothetical protein ARALYDRAFT_481217 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 519

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 94/377 (24%), Positives = 176/377 (46%), Gaps = 38/377 (10%)

Query: 76  PFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQ 133
           P  V++ Y+ Y +++ L    S+AL VL+TQ +  A G+    + A +   +WILKDG+ 
Sbjct: 124 PKQVSDNYISYVKWKFLHRVFSSALQVLATQVMFRAIGIGQSRSLASSAAFNWILKDGLG 183

Query: 134 HVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGI 192
            + + I  ++L +  D+  KR R     L+ L  G+E+++P+ PQ FL +A + N AK I
Sbjct: 184 RLSRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGVELMTPVFPQYFLLLASIANIAKQI 243

Query: 193 AVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS--MQGK 250
           ++    AT   ++ SFA   NL ++ AK +  +  F+ +GL   + L     ++  +Q  
Sbjct: 244 SLSCYLATGSAVHRSFAVADNLGEVSAKAQIQTVCFDNLGLLLAVLLNMLFQNNQRLQAS 303

Query: 251 M--IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL- 307
           +  ++ P+ S   +  + + ++   + TL   R  +I+  +++  ++ SPA++   E + 
Sbjct: 304 LPFVLYPIFSTFDLLGIYQGLKHINLQTLTKDRLEIILERWIEFRQVPSPAEVSEEEGID 363

Query: 308 LFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCL-----GSRWTD---- 357
           L   R  +    +++G  DP   +       ++ L  ++ F + +     G R T     
Sbjct: 364 LLGSRGSKRVWPIRIGCLDPKAQIPTLSMMAMQSLCNDDSFFITIELSSKGFRRTPKPGI 423

Query: 358 -MVLEQNASGEDALRGWLVAAYAASMAKSF-------HDPSLT--VLQD-------AYDK 400
            + L + A+  D +   L   Y   + K+F        D SL+   LQD       +   
Sbjct: 424 VICLREGANSVDIITSLLQTCY---IRKAFGTNRIKRRDVSLSDLTLQDWSLLTGESKRA 480

Query: 401 MNDVFTPLLSELQAKGW 417
             D    L  +LQ  GW
Sbjct: 481 ARDDIITLNRQLQEHGW 497


>gi|224013726|ref|XP_002296527.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968879|gb|EED87223.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 719

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 110/462 (23%), Positives = 183/462 (39%), Gaps = 111/462 (24%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATV 123
            ++ F P+ YP SV   Y  Y  +  L     ++  VLSTQ+LL A G+    A   A  
Sbjct: 239 LIHHFLPANYPQSVCPSYATYASYCFLGSIAGSSAMVLSTQALLVAVGVGTQSAAPMAAA 298

Query: 124 VSWILKDGMQHVGKLI-CSNLG-----------------------------------ARM 147
           ++W++KDG+  +G ++  S LG                                      
Sbjct: 299 LNWVMKDGVGQLGGVVFASQLGKGGVDVDYWKVKLNKLMGSSSSKSTETSARGNFQRGTA 358

Query: 148 DSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAKGIAVVAARATRLPIYS 206
           DS PKRWR++A    DL T LE+ +P + PQ FL  A + N  K +  +AA A+R  ++ 
Sbjct: 359 DSNPKRWRMVAALALDLSTLLEICTPWMGPQWFLPCASIANIGKNVGFLAASASRAAVHQ 418

Query: 207 SFAKEG-------------------------------NLSDLFAKGEAISTLFNVVGLGA 235
           S    G                               NL D+ AK  + + + +++G   
Sbjct: 419 SLCTGGSFLPPVPDNTTEKDESYDNPKKGKNVTSSSNNLGDVTAKSGSQAIVASLLGTAL 478

Query: 236 GIQLASTVCSSMQGKMIVGP--LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT- 292
           GI L+ T CS      I+    +LS +H       ++A P+ +++  R  +I+  ++   
Sbjct: 479 GIWLSRTFCSDYGTAGILAGFVILSAVHQVCTYRALKAVPLRSVDRHRLHIILNAYITAD 538

Query: 293 GR-----ISSPADLRYREDLLFPGRLIEDAGN-VKVGRDPHKVVKPSKFHELKELFAE-- 344
           GR     + +P+ +   E  L P    +++ + + VG D    + PS   EL+ L     
Sbjct: 539 GRSVQSMVLTPSQVAEEESFL-PMMAPDNSVHWLSVG-DSLIDICPSGLDELEALLIRRD 596

Query: 345 -----EKFILCLGSRWTD-----------MVLEQNASGEDALRGWLVA--AYAASMAKSF 386
                EK+I+ +    T+           +     A+  D L+G   A  A A  ++   
Sbjct: 597 DISQYEKYIMKVHLADTNATTAAVDGIVQLTFLDGATDNDLLQGMFHAYIARALMISHQC 656

Query: 387 HDPS----------LTVLQDAYDKMNDVFTPLLSELQAKGWH 418
           HD S            +++D++   N      L  LQ  GW 
Sbjct: 657 HDDSNDNHNHGKRGNQIIEDSHTITNHRMPEFLEHLQQGGWQ 698


>gi|171678219|ref|XP_001904059.1| hypothetical protein [Podospora anserina S mat+]
 gi|170937179|emb|CAP61836.1| unnamed protein product [Podospora anserina S mat+]
          Length = 448

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 71/254 (27%), Positives = 134/254 (52%), Gaps = 9/254 (3%)

Query: 65  SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT-- 122
           + L  F P+GYP +V   YL Y  + +LQ F S+  S+L+++++L   G+  + A  T  
Sbjct: 41  ALLYAFLPAGYPHTVTTDYLPYQTYDSLQAFASSITSLLASRAVLEGLGVGSSEASPTGA 100

Query: 123 VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL-FL 180
           ++  I  D +  +  ++ ++ +G  ++ E K +R LAD   D    L++L+P  P L  L
Sbjct: 101 LILKITGDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDAAQFLDLLTPALPYLPKL 160

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
            +       + +  VAA A++  + + FA  GNL++L AK  +  T+ +++G+  G  + 
Sbjct: 161 GVIVSAGVLRSLCGVAANASKASLSAHFAVTGNLAELNAKEASQETVVSLLGMLVGSLVV 220

Query: 241 STVCSSMQ--GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
             V    +  G MI+   L+  H+      +RA  + +LN QR  ++  +++++G + +P
Sbjct: 221 RLVEDKHKVWGLMII---LAGCHLAMNYRAVRAVRMTSLNRQRATIVFREWLESGTVLNP 277

Query: 299 ADLRYREDLLFPGR 312
           A +  RE +L  GR
Sbjct: 278 AQVSQRESILMNGR 291


>gi|380471002|emb|CCF47486.1| hypothetical protein CH063_04176 [Colletotrichum higginsianum]
          Length = 478

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 76/258 (29%), Positives = 127/258 (49%), Gaps = 11/258 (4%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--V 123
             + F P GYP SV   Y+ Y  + +LQ F S    +LS +++L   G+    + AT  +
Sbjct: 64  LFDAFLPIGYPDSVTPDYIGYQAYDSLQAFFSTITGLLSNRAILQGLGVGDPNSSATYAL 123

Query: 124 VSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLE 181
           +  ILKDG+  V  +      G  ++ E KR+R LAD   D    L++ SP   P   + 
Sbjct: 124 LLTILKDGISRVATIAFAYRFGLVIEPECKRYRFLADIFNDSAFFLDLFSPYFDPWTKVA 183

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
              L    + +  VAA A++  +   FA+  NLS+L AK  +  T   +VGL  G    S
Sbjct: 184 AIVLAEALRAMCGVAAGASKAALSKHFARRNNLSELNAKEASQETAIGLVGLLVG----S 239

Query: 242 TVCSSMQGKMIVGPL---LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
            V   ++G+  V  L   L  +H+      +R   ++TLN QR  ++   + +T R+ +P
Sbjct: 240 IVVRFVEGREAVFVLMVFLVFVHLAMNYFGVRCVQLDTLNQQRATILFDHYAQTKRVLTP 299

Query: 299 ADLRYREDLLFPGRLIED 316
             + +RE+++F   +I++
Sbjct: 300 KQVAWRENIVFWSPVIKN 317


>gi|159467675|ref|XP_001692017.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158278744|gb|EDP04507.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 216

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 68/211 (32%), Positives = 101/211 (47%), Gaps = 10/211 (4%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS---- 125
           F P GYP +V   YL Y  +    H T      L T SLL A G+   PA    +S    
Sbjct: 5   FLPPGYPQTVTPDYLSYQLWSVPTHITGHLSHALVTSSLLSAVGVSAGPAATVALSASIK 64

Query: 126 WILKDGMQHVGKLIC-SNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
           WI+KDG+  +G+LI  S   A  D +P+RWR++A+ L   G GLEV + L P  FL +A 
Sbjct: 65  WIVKDGVGALGRLIVGSRFSAEFDEDPRRWRMVAELLSTCGMGLEVATVLYPHYFLLLAC 124

Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
            G FA+ +     +     I + FA+  N+  + AK E       + GL A + +   + 
Sbjct: 125 SGKFAQALGKGMGKPVFRVIQTHFARAQNVGAVAAKEEVWEVTAQMAGLAASVAVLRALE 184

Query: 245 SSMQGKMIVGPLLSV--IHV---YSVIEEMR 270
           +S     +VG   ++  +HV   Y  +  +R
Sbjct: 185 ASGAADGVVGTWAAIQAVHVALRYRALATLR 215


>gi|328703155|ref|XP_001949028.2| PREDICTED: UPF0420 protein C16orf58 homolog [Acyrthosiphon pisum]
          Length = 457

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 78/325 (24%), Positives = 156/325 (48%), Gaps = 21/325 (6%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV--VSWI 127
           F P G+P SV++ Y  Y  +  +Q F S   ++L+T ++L   G+    A A    +++I
Sbjct: 56  FLPRGFPNSVSKDYTEYQIWDTIQAFCSTISNILATNAILKGVGVGDAQATALAASITYI 115

Query: 128 LKDGMQHVGKLI---CSNLGARMDSEPKRWRILADALYDLGTGLEVL--SPLCPQLFLEM 182
           LKDG   +G++    C   G+ +D++ K+WR+ ADAL DL   +E+L   P   Q  L +
Sbjct: 116 LKDGAGMIGRITFAWCK--GSELDTQCKKWRLRADALNDLAIFIELLLSIPWIRQFSLII 173

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
               + AK I  VA  ATR  +    A   N+ D+ AK  +  T  N++    G+ +   
Sbjct: 174 LSFSSCAKSIVSVAGGATRAALTQHQAIRDNMGDVSAKDGSQETCINLIAFLIGLIMLPI 233

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV----KTGRISSP 298
           V + +    ++  +++ +H+++  + +++  +N  N  R  + +  ++    +   +  P
Sbjct: 234 VENRILLIWMIYLVVTSLHLFANYKAVKSLNINVFNSARYDLTLKYYLSNDTQNHDVQKP 293

Query: 299 ADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELK-----ELFAEEKFILCLGS 353
             +  RE   F G     +  +++G   H+++  +           EL+ +  +IL + +
Sbjct: 294 DYINKRE-ACFLGDEKLCSFKIQLGTSIHELLYTNTLTTWDIIDHIELYKDYLYILIVDT 352

Query: 354 R--WTDMVLEQNASGEDALRGWLVA 376
                 +VL++N + E+ L+ +  A
Sbjct: 353 HRDIIRVVLDKNINTENILKAYFHA 377


>gi|355736030|gb|AES11869.1| hypothetical protein [Mustela putorius furo]
          Length = 218

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 56/167 (33%), Positives = 90/167 (53%), Gaps = 3/167 (1%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P G+P SV+  YL Y  + ++Q F S+    L+T ++L   G+    A   A   +W+
Sbjct: 52  FLPQGFPDSVSPDYLPYQLWDSVQAFASSLAGSLATHAVLLGIGVGNAEASVSAATATWL 111

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G++I +   G ++D   K+WR+ AD L D+   LE+++P+ P  F  +    
Sbjct: 112 VKDSAGMLGRIIFAWWKGTKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMFFTMIVCTS 171

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
           N AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+ GL
Sbjct: 172 NLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGL 218


>gi|79561141|ref|NP_179928.2| uncharacterized protein [Arabidopsis thaliana]
 gi|51970902|dbj|BAD44143.1| unnamed protein product [Arabidopsis thaliana]
 gi|330252365|gb|AEC07459.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 520

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 74/300 (24%), Positives = 150/300 (50%), Gaps = 12/300 (4%)

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP-- 118
           R+  S    FF    P  V++ Y+ Y +++ L    S+AL VL+TQ++  A G+  +   
Sbjct: 113 RLTSSAAKDFF---LPKQVSDNYISYVKWKFLHRVFSSALQVLATQAMFRAIGIGQSRSL 169

Query: 119 AQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
           A +   +WILKDG+  + + I  ++L +  D+  KR R     L+ L  G+E+++P+ PQ
Sbjct: 170 ASSAAFNWILKDGLGRLSRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGVELMTPVFPQ 229

Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
            FL +A + N AK I++    AT   ++ SFA   NL ++ AK +  +  F+ +GL   +
Sbjct: 230 YFLLLASIANIAKQISLSCYLATGSAVHRSFAVADNLGEVSAKAQIQTVCFDNLGLLLAV 289

Query: 238 QLASTVCSSMQGK----MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
            L      + + +     ++ P+ S   +  + + ++   + TL   R  +I+  +++  
Sbjct: 290 LLNMLFQHNQRLQACLPFVLYPIFSTFDLLGIYQGLKHINLQTLTKDRLEIILERWIEFR 349

Query: 294 RISSPADLRYREDL-LFPGRLIEDAGNVKVG-RDPHKVVKPSKFHELKELFAEEKFILCL 351
           ++ SPA++   E + L   R  +    +++G  DP   +       ++ L +++ + + +
Sbjct: 350 QVPSPAEVSEEEGIGLLGSRGSKRVWPIRIGCLDPKAQIPTLSMMAMQSLCSDDGYFITM 409


>gi|429848918|gb|ELA24353.1| duf647 domain-containing protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 440

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 87/309 (28%), Positives = 153/309 (49%), Gaps = 20/309 (6%)

Query: 13  KEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVV-----DSFL 67
           KEP K++ + E      ++S  + R +    DG L+    ++S     R++       F 
Sbjct: 5   KEP-KAIVIDER-----DSSGLIRRRWLHDHDGRLTEMPTENSPTTISRLLTMNIRQVFS 58

Query: 68  NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVS 125
           + FFP GYP SV++ YL Y  + +LQ F S   S+L+ +++L   G+    + AT  ++ 
Sbjct: 59  DAFFPIGYPDSVSKDYLGYQLYDSLQAFFSTITSLLANRAILQGLGVGDADSSATYALLL 118

Query: 126 WILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
            ILKDG+  +  +      G  ++ E K++R LAD   D    L++ SPL    + ++A 
Sbjct: 119 TILKDGISRLATIGFAYRFGLVIEPECKKYRFLADIFNDSAFFLDLFSPLFGS-WTKVAA 177

Query: 185 L--GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
           L      + +  VAA A++  +   FA  GNLS+L AK  +  T   ++GL  G  +   
Sbjct: 178 LVVAEALRAMCGVAAGASKAALSKHFALRGNLSELNAKESSQETAVGLIGLIVGSFVVRH 237

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRIS-SPAD 300
           V  S +    +  +L  +H+      +R   +N  N QR  +   +++KT G +  +PA+
Sbjct: 238 V-ESREAVFALMIILVFVHLGMNYLGVRCVQLNNFNQQRATIFFEEYMKTMGNVRLTPAE 296

Query: 301 LRYREDLLF 309
           +  RE+++F
Sbjct: 297 VAKRENIIF 305


>gi|367049005|ref|XP_003654882.1| hypothetical protein THITE_131994 [Thielavia terrestris NRRL 8126]
 gi|347002145|gb|AEO68546.1| hypothetical protein THITE_131994 [Thielavia terrestris NRRL 8126]
          Length = 451

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 71/259 (27%), Positives = 138/259 (53%), Gaps = 13/259 (5%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
           ++ + L  F P+GYP +V + YL Y  + +LQ F S+  S+L+ +++L   G+  + +  
Sbjct: 37  LLKAILYAFLPAGYPHTVTDDYLAYQTYDSLQAFASSITSLLANRAVLEGLGVGDSSSSP 96

Query: 122 T--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
           T  ++  I  D +  +  ++ ++ +G  ++ E K +R LAD   D    L++L+P  P  
Sbjct: 97  TGALILKITGDTISRIATILFAHRMGQAIEPECKFYRFLADLFNDSAQFLDLLTPALP-Y 155

Query: 179 FLEMAGL--GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
           F ++A +      + +  VAA A++  + + FA  GNL++L AK  +  T+ +++G+ AG
Sbjct: 156 FPKLAVIVSAGVLRSLCGVAAGASKASLSAHFAVTGNLAELNAKEASQETVVSLLGMLAG 215

Query: 237 IQLASTVCSSMQGKMIVGPL---LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
               S V   ++ K +V  L   L+ +H+      +R   + +LN QR  ++  +++  G
Sbjct: 216 ----SLVVRLVEDKQLVWLLMVALAAVHLIMNYRAVRCVRMRSLNRQRATIVFREWLDHG 271

Query: 294 RISSPADLRYREDLLFPGR 312
            + +PA +  RE +L  GR
Sbjct: 272 AVLTPAQVARRESILRNGR 290


>gi|116205700|ref|XP_001228659.1| hypothetical protein CHGG_02143 [Chaetomium globosum CBS 148.51]
 gi|88182740|gb|EAQ90208.1| hypothetical protein CHGG_02143 [Chaetomium globosum CBS 148.51]
          Length = 448

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 70/255 (27%), Positives = 132/255 (51%), Gaps = 11/255 (4%)

Query: 65  SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT-- 122
           + L  F P+GYP SV + YL Y  + +LQ F S+  S+L+ +++L   G+  + +  T  
Sbjct: 39  AILYAFLPAGYPHSVTDDYLPYQTYDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGA 98

Query: 123 VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL-FL 180
           ++  I  D +  +  ++ ++ +G  ++ E K +R LAD   D    L++L+P  P    L
Sbjct: 99  LILKITGDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDSAQFLDLLTPALPYFPKL 158

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
            +       + +  VAA A++  + + FA  GNL++L AK  +  T+ +++G+  G    
Sbjct: 159 GVIVSAGVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLVG---- 214

Query: 241 STVCSSMQGKMIVGPLLSV---IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
           S V   ++ K +V  L+ V   +H+      +RA  + +LN QR  ++  +++  G + +
Sbjct: 215 SLVVKMVEDKQVVWMLMVVLAGVHLAMNYHAVRAVKMRSLNRQRATLVFREWLDRGTVLT 274

Query: 298 PADLRYREDLLFPGR 312
           P  +  RE +L  GR
Sbjct: 275 PEQVSQRESILRKGR 289


>gi|322700393|gb|EFY92148.1| DUF647 domain-containing protein [Metarhizium acridum CQMa 102]
          Length = 466

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 81/312 (25%), Positives = 149/312 (47%), Gaps = 14/312 (4%)

Query: 2   NLLEKIKKQKKKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHR 61
           +L++K    +  E DKS +VQ     W   SD+     + + +      ++  S   + R
Sbjct: 20  DLIKKGDDVEISELDKSGNVQR---RWLHRSDNTIAVLESETE---PSSIIHSSVREWKR 73

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
            +    + F P G+P SV+  YL Y  + +LQ F S   S+L+ ++LL   G+    + A
Sbjct: 74  FLS---DAFLPIGFPHSVSNDYLAYQTYDSLQAFFSTITSLLANRALLQGLGVGDADSSA 130

Query: 122 T--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
           T  ++  IL D    +  ++ ++  G R++ + KR+R LAD   D    LE+ SP     
Sbjct: 131 TFAMLITILTDATSRIATIVFAHRFGLRIEPDAKRFRFLADIFNDTAFFLELYSPYFGDY 190

Query: 179 FLEMA-GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
              +A  +G   + +  VAA A++  +   FAK  NL++L AK  +  T   ++GL  G 
Sbjct: 191 GKILALSVGQALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLLVGT 250

Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
            L   +    Q  + +   L + H++     +R+  + T N QR  ++  +++ TG + +
Sbjct: 251 -LVVKLVQDHQSVVYLMVTLVLAHLWMNYLGVRSVCMTTFNRQRATILFREYMTTGTVLT 309

Query: 298 PADLRYREDLLF 309
           P  +  RE+++ 
Sbjct: 310 PEAVAARENIVL 321


>gi|350646547|emb|CCD58759.1| hypothetical protein Smp_142110 [Schistosoma mansoni]
          Length = 423

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 71/242 (29%), Positives = 111/242 (45%), Gaps = 17/242 (7%)

Query: 67  LNKFF-PSGYPFSVNEGYLRY-------------TQFRALQHFTSAALSVLSTQSLLFAA 112
           L KFF P GYP SV++ YL Y             T+      F S+    L++Q++L   
Sbjct: 41  LKKFFLPVGYPSSVSDDYLEYQIWDTIQAGIFFITKISLFSAFASSITGALASQAVLIGV 100

Query: 113 GLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLE 169
           G+  + A   +  ++W+ KDG   +G++I +   G ++D + K WR +AD L D    LE
Sbjct: 101 GVGDSSATILSASLTWMFKDGSGMIGRIIFAGYHGIKLDCDCKFWRFVADILNDCALFLE 160

Query: 170 VLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
           ++SPL   LF  +  L N  K +  VA  ATR  I    A   NL+D+ AK  +  TL N
Sbjct: 161 IISPLFNYLFTPLLCLANVLKSLVGVAGSATRAAIVQHQAINNNLADVSAKDGSQETLSN 220

Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
           ++       L   V  +           + IH+YS    ++   + T N  R  + +  +
Sbjct: 221 LMAWLLNFILLYMVTGNQFLIWFCFICCTSIHLYSNYRAVKCLKLRTFNRTRFHLAIQQW 280

Query: 290 VK 291
            K
Sbjct: 281 FK 282


>gi|320593995|gb|EFX06398.1| duf647 domain containing protein [Grosmannia clavigera kw1407]
          Length = 458

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 73/253 (28%), Positives = 141/253 (55%), Gaps = 17/253 (6%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
           F P+GYP SV + YL Y  + +LQ F+S+  S+L+ +++L   G+     +P  A V+  
Sbjct: 47  FLPAGYPQSVTDDYLAYQTYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPTAALVLK- 105

Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLS---PLCPQLFLEM 182
           I++D    +  ++ ++ +G  ++ E K +R +AD   D    L++L+   PL P++ + +
Sbjct: 106 IIQDTFSRMATILFAHRMGQAIEPECKTYRFMADLFNDSALMLDLLTPVLPLYPKVCIMV 165

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
           A   +  + +  +AA A++  + + FA+ GNL++L AK  +  T+ +++G+ AG    + 
Sbjct: 166 A--SSILRSLCGIAASASKASLSAHFARTGNLAELNAKEASQETVISLLGMLAG----TA 219

Query: 243 VCSSMQGKMIV---GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPA 299
           V   +Q K  V     +L ++H+      +R   + TLN QR  ++  ++++TG+I SP 
Sbjct: 220 VVHVVQDKTAVLYWMVMLLMVHLVMNYRGVRCVKMLTLNRQRATIVFREYLETGKILSPG 279

Query: 300 DLRYREDLLFPGR 312
           ++  RE +L  GR
Sbjct: 280 EVAERESVLLGGR 292


>gi|239610850|gb|EEQ87837.1| DUF647 domain-containing protein [Ajellomyces dermatitidis ER-3]
          Length = 498

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 97/390 (24%), Positives = 183/390 (46%), Gaps = 35/390 (8%)

Query: 23  EIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEG 82
           ++P      +DS+ R  +        V +V  SR      + S LN F P GYP SV++ 
Sbjct: 34  QLPTPGSSENDSIRRERR-------RVDIVPPSRKESLSSLTSLLNVFLPVGYPHSVSDD 86

Query: 83  YLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV--VSWILKDGMQHVGKLIC 140
           YL Y  + +LQ F+S+   +L+++++L   G+    A  TV  +  +L D M  +  ++ 
Sbjct: 87  YLEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVLHDSMGRIATILF 146

Query: 141 SN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAAR 198
           ++ LG  ++ E K +R+ AD L D     + LSP  P+ L + +    +  + +  VAA 
Sbjct: 147 AHRLGTSLEPECKMYRLAADVLNDSAMVFDCLSPAFPKHLRIVVLACSSVLRALCGVAAG 206

Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLS 258
           +++  + S FA+ GNL +L AK  +  T+ +++G+  G  + S V +           L 
Sbjct: 207 SSKASLSSHFARWGNLGELNAKDSSQETVISLLGMLCGSLVVSRVSTPF-ATWATLLFLL 265

Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD------LRYREDLLFPGR 312
           ++H+ +    +R+  + TLN QR  ++ +   + GR+ +P        +  R+ +L    
Sbjct: 266 LVHLSTNYAAVRSVNMTTLNRQRANIVFSTLFEDGRVLTPKQASKCERIFERDGVLRWKA 325

Query: 313 LIEDAGNVKVGRDPHKVVKPSK------------FHELKELFAEEKFILCLGS--RWTDM 358
                G+ ++G    +++  S+               L  LF EE++IL   S  +   +
Sbjct: 326 SAATLGSCRIGVSFQELLCGSRGRVNSIRDINIDIRRLLHLFREEEYILWFNSSDKGGTI 385

Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHD 388
           VL+ + +    L+ W   ++A  +AK   D
Sbjct: 386 VLKNSVTPLSQLKAW---SHALMVAKRMAD 412


>gi|224094527|ref|XP_002310175.1| predicted protein [Populus trichocarpa]
 gi|222853078|gb|EEE90625.1| predicted protein [Populus trichocarpa]
          Length = 333

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 65/232 (28%), Positives = 118/232 (50%), Gaps = 18/232 (7%)

Query: 86  YTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLI-CS 141
           Y +++ L    S+AL VL+TQ++  A G+   R  P+ A  ++W+LKDG+  + + I  +
Sbjct: 3   YVKWKFLHRVFSSALQVLATQAMFRAIGIGYSRSLPS-AAALNWVLKDGLGRLSRCIYTA 61

Query: 142 NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATR 201
           +L +  D+  KR R     L+ L  G+E+L+P  PQ FL +A L N AK I++    AT 
Sbjct: 62  SLASAFDTNLKRVRFTTSVLFSLSIGIELLTPTFPQYFLLLATLANIAKQISLACYLATG 121

Query: 202 LPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM---------- 251
             ++ SFA   N+ ++ AK +  +  F+ +GL     L     ++ +  +          
Sbjct: 122 SAVHRSFAIADNIGEVSAKAQIQTVSFDNLGLMLAALLNMLFKNNQRFSLSFPTMSFCFC 181

Query: 252 ---IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
               V P+ S I ++ + + ++   + TL   R  +I+  ++  G + SPA+
Sbjct: 182 LPFFVYPIFSAIDLFGIYQGLQHVHLQTLTKDRLEIILNSWIDFGHVPSPAE 233


>gi|261206350|ref|XP_002627912.1| DUF647 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239592971|gb|EEQ75552.1| DUF647 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
          Length = 497

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 97/390 (24%), Positives = 183/390 (46%), Gaps = 35/390 (8%)

Query: 23  EIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEG 82
           ++P      +DS+ R  +        V +V  SR      + S LN F P GYP SV++ 
Sbjct: 33  QLPTPGSSENDSIRRERR-------RVDIVPPSRKESLSSLTSLLNVFLPVGYPHSVSDD 85

Query: 83  YLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV--VSWILKDGMQHVGKLIC 140
           YL Y  + +LQ F+S+   +L+++++L   G+    A  TV  +  +L D M  +  ++ 
Sbjct: 86  YLEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVLHDSMGRIATILF 145

Query: 141 SN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAAR 198
           ++ LG  ++ E K +R+ AD L D     + LSP  P+ L + +    +  + +  VAA 
Sbjct: 146 AHRLGTSLEPECKMYRLAADVLNDSAMVFDCLSPAFPKHLRIVVLACSSVLRALCGVAAG 205

Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLS 258
           +++  + S FA+ GNL +L AK  +  T+ +++G+  G  + S V +           L 
Sbjct: 206 SSKASLSSHFARWGNLGELNAKDSSQETVISLLGMLCGSLVVSRVSTPF-ATWATLLFLL 264

Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD------LRYREDLLFPGR 312
           ++H+ +    +R+  + TLN QR  ++ +   + GR+ +P        +  R+ +L    
Sbjct: 265 LVHLSTNYAAVRSVNMTTLNRQRANIVFSTLFEDGRVLTPKQASKCERIFERDGVLRWKA 324

Query: 313 LIEDAGNVKVGRDPHKVVKPSK------------FHELKELFAEEKFILCLGS--RWTDM 358
                G+ ++G    +++  S+               L  LF EE++IL   S  +   +
Sbjct: 325 SAATLGSCRIGVSFQELLCGSRGRVNSIRDINIDIRRLLHLFREEEYILWFNSSDKGGTI 384

Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHD 388
           VL+ + +    L+ W   ++A  +AK   D
Sbjct: 385 VLKNSVTPLSQLKAW---SHALMVAKRIAD 411


>gi|427789495|gb|JAA60199.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 464

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 70/248 (28%), Positives = 121/248 (48%), Gaps = 8/248 (3%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
           F   F P GYP SV+  Y  Y  + ++Q F S+    L+TQS+L   G+    A   A  
Sbjct: 45  FSEIFLPVGYPDSVSRDYTEYQIWDSIQAFASSITGTLATQSVLSGVGVGDQGASVLAAT 104

Query: 124 VSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
            +WIL+DG    G+++ +   G+ +D + K+WR+ AD L D+   LE++          +
Sbjct: 105 TTWILRDGTGMTGRILFAWFQGSNLDYDSKKWRLFADILNDVAIFLELMCQYMKGYVTAV 164

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
             + + AK I  VA  ATR  +    A+  N++D+ AK  +  TL N+      + L   
Sbjct: 165 LCVSSVAKSIVGVAGGATRAALTQHQARNNNMADVSAKDGSQETLVNLAAFLFSLLLLRI 224

Query: 243 VCSSMQGKMIVGPLLS--VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPA 299
           V  +    ++ G  +S  ++H+++    +    + T N  R A++V  F++T G I+   
Sbjct: 225 VAGNPW--LLYGIFISFTLLHIFANYRAVSCVVMETFNRSRYAIVVRRFLETAGDIAPVT 282

Query: 300 DLRYREDL 307
            +  RE +
Sbjct: 283 WVNARESV 290


>gi|327350378|gb|EGE79235.1| DUF647 domain-containing protein [Ajellomyces dermatitidis ATCC
           18188]
          Length = 497

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 93/364 (25%), Positives = 174/364 (47%), Gaps = 28/364 (7%)

Query: 49  VKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSL 108
           V +V  SR      + S LN F P GYP SV++ YL Y  + +LQ F+S+   +L+++++
Sbjct: 52  VDIVPPSRKESLSSLTSLLNVFLPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAV 111

Query: 109 LFAAGLRPTPAQATV--VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLG 165
           L   G+    A  TV  +  +L D M  +  ++ ++ LG  ++ E K +R+ AD L D  
Sbjct: 112 LQGVGVGDASASPTVALLHSVLHDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSA 171

Query: 166 TGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAI 224
              + LSP  P+ L + +    +  + +  VAA +++  + S FA+ GNL +L AK  + 
Sbjct: 172 MVFDCLSPAFPKHLRIVVLACSSVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQ 231

Query: 225 STLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAM 284
            T+ +++G+  G  + S V S+          L ++H+ +    +R+  + TLN QR  +
Sbjct: 232 ETVISLLGMLCGSLVVSRV-STPFATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANI 290

Query: 285 IVADFVKTGRISSPADLRYREDLLFPGRLIE------DAGNVKVGRDPHKVVKPSK---- 334
           + +   + GR+ +P      E +     ++         G+ ++G    +++  S+    
Sbjct: 291 VFSTLFEDGRVLTPKQASKCERIFERDGVLRWKASAATLGSCRIGVSFQELLCGSRGRVN 350

Query: 335 --------FHELKELFAEEKFILCLGS--RWTDMVLEQNASGEDALRGWLVAAYAASMAK 384
                      L  LF EE++IL   S  +   +VL+ + +    L+ W   ++A  +AK
Sbjct: 351 SIRDINIDIRRLLHLFREEEYILWFNSSDKGGTIVLKNSVTPLSQLKAW---SHALMVAK 407

Query: 385 SFHD 388
              D
Sbjct: 408 RIAD 411


>gi|281211285|gb|EFA85450.1| DUF647 family protein [Polysphondylium pallidum PN500]
          Length = 427

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 56/166 (33%), Positives = 91/166 (54%), Gaps = 3/166 (1%)

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTP 118
           ++ ++F   F PSGYP SV++ Y  Y ++  +Q+   +   +LST +LL     GL  + 
Sbjct: 101 KLYETFKTLFLPSGYPESVSQDYASYQRWIFVQNTLGSVTYMLSTHALLTSVGVGLSASL 160

Query: 119 AQATVVSWILKDGMQHVG-KLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ 177
             +  +SW+LKDG+      L  S     +D + KR++   D L++ G  LE  +P  P 
Sbjct: 161 PFSAAISWVLKDGLGASALVLFASKYSTSLDFDLKRFKFRGDFLHNFGVFLETCTPFLPG 220

Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
            FL MA + N AKGIA +   +TR  ++ SFA + N+ D+ AK ++
Sbjct: 221 YFLLMASISNLAKGIAGLIYGSTRASLHKSFALKENIGDITAKYQS 266


>gi|145347876|ref|XP_001418386.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578615|gb|ABO96679.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 362

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 70/240 (29%), Positives = 111/240 (46%), Gaps = 20/240 (8%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ---ATVVSW 126
           F PSG+P + +  Y+ + +++ L       L ++S QSLL A G+  TP          W
Sbjct: 8   FLPSGFPATTSADYVEWLRWQLLSLLFRDVLEIMSAQSLLVALGMGNTPGALPLTAAAKW 67

Query: 127 ILKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
           +LKDG+     LI  + G  R D +PKRW  L++ L D+   +E+++P  P LFL +A  
Sbjct: 68  VLKDGVGSFATLIAGSFGGQRYDEDPKRWWGLSNTLEDVARAIELVTPAAPGLFLPLAAS 127

Query: 186 GNFAKGIAVVAARATRL--PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
             F +  A +  R + L       F +  NL D+ AK E       ++ L AGIQ+   V
Sbjct: 128 ATFVRA-AALTGRGSLLNGTFMQHFGRNNNLGDIRAKLEVQGRWLALIALPAGIQVFQLV 186

Query: 244 CSSMQ---------GKMIV--GPLLSVI--HVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
            ++           G  +   G    VI  HV++  +  ++   +TLN  R   +   FV
Sbjct: 187 SAAAADLAADGDEFGAFVAAFGAYGGVIGVHVFACWQAAKSLKFDTLNRFRLLKLAEAFV 246


>gi|449542984|gb|EMD33961.1| hypothetical protein CERSUDRAFT_117486 [Ceriporiopsis subvermispora
           B]
          Length = 505

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 106/433 (24%), Positives = 176/433 (40%), Gaps = 27/433 (6%)

Query: 22  QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNE 81
           +E  + W E S       +         + +  S     R+   F   F P+ YP SV+ 
Sbjct: 70  RECHVSWSEDSGVEKEWRELSQPTEAEERAISSSGTFVGRLSAWFRQMFLPTNYPQSVHR 129

Query: 82  GYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKL- 138
            Y+ +   + ++      +SVL  Q+LL + G+    +   A  V WI+KDG   V KL 
Sbjct: 130 SYMPFHVLQFVEGTLGTLVSVLCNQALLTSVGVSAEGSIFGAVAVQWIIKDGAGEVAKLF 189

Query: 139 ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC---PQLFLEMAGLGNFAKGIAVV 195
                    DS PK + +  + L  LG+GL++ + L    P  FL  A  GN  K I   
Sbjct: 190 FIRKFSPYFDSHPKTFTLSGEVLVALGSGLQMATLLVNPTPLSFLICAAGGNAFKLIGYA 249

Query: 196 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGP 255
               T +     F+++GN  D+ AK E+ +++  + G  AGI L  T+         +  
Sbjct: 250 IWFTTHIKFIRYFSQQGNTGDVAAKDESQASVAQLAGYAAGIGLL-TLSHDPSYLYTIFA 308

Query: 256 LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIE 315
           L    H+ +    MR A    L   R +++   + + G +   + L  +      G    
Sbjct: 309 LAVPTHLVATTLMMRCATFELLTIPRLSLLAETYSRKGDVMPLSRLEAKRATGMFGEFYG 368

Query: 316 DAGNVKVGRDPH--KVVKP-----SKFHELKELFAEEKFIL----CLGSRWTDMVLEQNA 364
                 V   P   +VVK      +++    E+  +E+++L       +    + L  +A
Sbjct: 369 KGQGKLVSLAPRVAEVVKDEEGERARWALCTEVCRDERYLLYPPAAPQANPASVFLRPDA 428

Query: 365 SGEDALRGWLVAAYAASMAKSFHDPSLTV---LQDAYDKMNDVFTPLLSELQAKGWHTDR 421
           S +D LR  L   +AA +     D  +T+   L+D+++   + F      L  KGW TD 
Sbjct: 429 SSDDMLRAVL---HAARLRSLLADDIVTLREALKDSHEWTQEHFMTFKGALDDKGWRTDE 485

Query: 422 --FLDGTGTRFAW 432
             F D  G R  W
Sbjct: 486 VAFAD-QGHRLLW 497


>gi|443714847|gb|ELU07084.1| hypothetical protein CAPTEDRAFT_141059, partial [Capitella teleta]
          Length = 266

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 66/251 (26%), Positives = 123/251 (49%), Gaps = 5/251 (1%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P G+P SV++ Y++Y  +  +Q F S+  + L+TQ++L   G+    A   A  ++W+
Sbjct: 6   FLPKGFPASVSDDYVQYQIWDTVQAFASSITNTLATQAVLKGVGVGDEQATVLAATLTWL 65

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KDG   +G+++ + + G+ +D + KRWR+ AD L D+   +++ +P     F  +  + 
Sbjct: 66  MKDGTGMLGRILFTWIQGSYLDCDCKRWRLFADILNDVSILMDICAPFFRVYFTLIVCVA 125

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
              + I  VA  ATR  +    A   N++D+ AK  +  TL N+  L   + L   V   
Sbjct: 126 GVCRSIVGVAGGATRAAVTQHQACRSNMADVSAKDGSQETLVNLAALLCNLALVPMVSDR 185

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV--KTGRISSPADLRYR 304
                I+    + +H+Y+  + +R+  + T N  R  +   +F+   TG       +  R
Sbjct: 186 QWLVWILFTCFTTLHIYANFKAVRSLDMPTFNQVRLHLAAQEFISDNTGNFPGVKTVNQR 245

Query: 305 EDLLFPGRLIE 315
           E +L    L+ 
Sbjct: 246 EPVLCSESLLS 256


>gi|367028787|ref|XP_003663677.1| hypothetical protein MYCTH_103025 [Myceliophthora thermophila ATCC
           42464]
 gi|347010947|gb|AEO58432.1| hypothetical protein MYCTH_103025 [Myceliophthora thermophila ATCC
           42464]
          Length = 448

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 69/258 (26%), Positives = 134/258 (51%), Gaps = 11/258 (4%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
           ++ + L  F P+GYP +V   YL Y  F +LQ F S+  S+L+ +++L   G+  + +  
Sbjct: 36  LLKAILYAFLPAGYPHTVTGDYLAYQTFDSLQAFASSITSLLANRAVLEGLGVGDSSSSP 95

Query: 122 T--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL 178
           T  ++  I  D +  +  ++ ++ +G  ++ E K +R LAD   D    L++L+P  P L
Sbjct: 96  TGALILKITGDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDSAQFLDLLTPALPYL 155

Query: 179 -FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
             L +       + +  VAA A++  + + FA  GNL++L AK  +  T+ +++G+  G 
Sbjct: 156 PKLGVIVSAGVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLVG- 214

Query: 238 QLASTVCSSMQGKMIVGPLLSV---IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
              S V   ++ K +V  L+ +   +H+      +R+  + +LN QR  ++  +++  G 
Sbjct: 215 ---SLVVRMVEDKQVVWMLMVILAGVHLTMNYRAVRSVKMRSLNRQRATIVFREWLDHGV 271

Query: 295 ISSPADLRYREDLLFPGR 312
           + +P  +  RE +L  GR
Sbjct: 272 VLTPEQVAQRESILGNGR 289


>gi|147807901|emb|CAN62157.1| hypothetical protein VITISV_025492 [Vitis vinifera]
          Length = 582

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 50/131 (38%), Positives = 72/131 (54%), Gaps = 1/131 (0%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-RPTPAQATVV 124
           FL    P G+P SV   YL YT +R +Q   S    VL+TQ+LL+A GL +     A  V
Sbjct: 216 FLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAV 275

Query: 125 SWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
           +W+LKDG+ ++ K++ S  G   D  PK WR+ AD L +   GLE+L+P  P  FL +  
Sbjct: 276 NWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGA 335

Query: 185 LGNFAKGIAVV 195
           +    +  A +
Sbjct: 336 VAGAGRSAAAL 346


>gi|307111734|gb|EFN59968.1| hypothetical protein CHLNCDRAFT_133075 [Chlorella variabilis]
          Length = 434

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 97/417 (23%), Positives = 164/417 (39%), Gaps = 85/417 (20%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ----ATVVS 125
           F P+G+P SV   YLRY       H T      L+T S++ A G+   PA     +  V 
Sbjct: 39  FLPTGWPHSVTPDYLRYQLASVPAHITGWMSHSLATSSMITALGVDAGPATIAATSAAVK 98

Query: 126 WILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCP-------- 176
           W+ KDG+   G+LI    L +  D +PKRWR++A+ +  +G  +E+ + + P        
Sbjct: 99  WLTKDGLGAAGRLIVGGRLSSVFDEDPKRWRMVAEGVTTVGLAMEIATQIYPGGLRSKFR 158

Query: 177 ------QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
                   F+ +AG G   K +A          I + F+   N+ D+ AK E       +
Sbjct: 159 VVSVVWSYFIALAGTGTLCKSMAKGMGTPCFRIIQTHFSATNNVGDVAAKEEVWEVAAQL 218

Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
           +GL A                      SV+ + ++  E   +P +T+ P+R A++ +  V
Sbjct: 219 LGLAA----------------------SVVALQAL--EAAGSP-DTVVPKRAALMASSHV 253

Query: 291 KTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDP-------------HKVVKPSKFHE 337
           +TG + S   +   E +L P  + + + +     D                  +  +  E
Sbjct: 254 RTGLVPSVEQVNLSEQVLLPPTMCQPSVHYGCTLDQALGSGGGGFGGGAASGGRGQQLAE 313

Query: 338 LKELFAEEKFILCL--GSRWTDMVLEQNASGEDALRGWLVAAY----------------- 378
           L  L+  E+++L    G+ W  +VLE  A   D LR    AA+                 
Sbjct: 314 LVRLYDGEQYMLTWHRGAAWV-LVLE-GAQRADLLRAMWQAAWLEEHHQQQGEGLEQQQG 371

Query: 379 -------AASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGT 428
                   +  +     P+  VL  +   M   +   L + Q +GW  D+ +   GT
Sbjct: 372 EGPAPSDGSGGSVGSVVPAAEVLAASLAAMQQQWPDFLQQAQEQGWLLDKTVLPQGT 428


>gi|384249834|gb|EIE23315.1| hypothetical protein COCSUDRAFT_15701 [Coccomyxa subellipsoidea
           C-169]
          Length = 265

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 65/228 (28%), Positives = 112/228 (49%), Gaps = 7/228 (3%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV----VS 125
           F P GYP +V   YLR+  +    H T    + L+T SLL A G+  +   AT     + 
Sbjct: 9   FLPRGYPETVTPDYLRFQLWAIPAHITGWMSTSLATSSLLKAVGVGNSAVGATAAAAAIK 68

Query: 126 WILKDGMQHVGK-LICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
           WI KDG+   G+ L+   LG   D +PKRWR++A+A   +G  LE+ +   P  F+ +AG
Sbjct: 69  WITKDGIGAAGRVLVGGRLGNVFDEDPKRWRMVAEAFLTVGLALEIATAFSPANFIVLAG 128

Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
            GN ++ +           I + FA   N+ D+ AK E       ++GL   + L   + 
Sbjct: 129 AGNLSRAVGRGMTNPCFRIIQTHFAAASNVGDVAAKEEVWEVSAQLLGLAGSVGLLKAIE 188

Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVN--TLNPQRTAMIVADFV 290
           ++ + + ++G   +V  ++S++     A +   ++N +R  ++ A  V
Sbjct: 189 ATGKPENVLGVWAAVQVLHSLLRYKSLAVLQFPSVNQKRACLLAAAHV 236


>gi|384252100|gb|EIE25577.1| DUF647-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 286

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 61/221 (27%), Positives = 111/221 (50%), Gaps = 5/221 (2%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
           F P GYP +V   Y+ +  + ++Q   S    ++ + ++L   G+  + T     V  + 
Sbjct: 49  FLPDGYPDAVTSDYVGFQIWDSIQALCSYVRGMICSNAILTGIGVGKQATSPLVAVFQFF 108

Query: 128 LKD--GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
           ++D  GM   G L     G+++D+  K+WR+ AD   ++G  L++ SPL P  FL +A L
Sbjct: 109 VRDLAGMCG-GVLFAFTQGSQLDAYAKQWRLFADIFNNVGYALDLASPLFPDQFLLLACL 167

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
           G+ A+ +  VA  ATR  +   F++  N  D+ AK  +  T   ++G+  G+        
Sbjct: 168 GSLARAVTGVAGGATRAALTQHFSRADNAGDISAKEGSQETATTLIGMILGMAFLQAASD 227

Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
           S      V  LL+ +H+Y+ +  +RA  + +LN  R  +++
Sbjct: 228 SPVAVWAVFLLLTFLHIYANVRAVRALQLTSLNCSRLDILL 268


>gi|108706828|gb|ABF94623.1| expressed protein [Oryza sativa Japonica Group]
          Length = 497

 Score = 94.7 bits (234), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 77/266 (28%), Positives = 128/266 (48%), Gaps = 31/266 (11%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
           + RV    L  F P G+P SV   Y+ +  +  LQ  ++   ++LSTQ+LL A G+    
Sbjct: 136 WGRVGGRLLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKS 195

Query: 119 AQ--ATVVSWILKD--GM---------QHVGKLICSN------------------LGARM 147
           A        W L+D  GM         Q    L  SN                  LG+ +
Sbjct: 196 ATVIGATFQWFLRDLTGMLGGILFTFYQVSNSLANSNRQCHLIAIDSLFTDEITCLGSNL 255

Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
           DS  K WR++AD + DLG  +++LSPL P   + +  LG+ ++    VA+ ATR  +   
Sbjct: 256 DSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQH 315

Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIE 267
           FA   N +D+ AK  +  TL  ++G+G G+ LA        G  +    L++ H+Y+  +
Sbjct: 316 FALANNAADISAKEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYK 375

Query: 268 EMRAAPVNTLNPQRTAMIVADFVKTG 293
            +++  + TLN +R+++++  F+  G
Sbjct: 376 AVQSLSLTTLNYERSSILLQYFMDNG 401


>gi|255082448|ref|XP_002504210.1| predicted protein [Micromonas sp. RCC299]
 gi|226519478|gb|ACO65468.1| predicted protein [Micromonas sp. RCC299]
          Length = 226

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 65/225 (28%), Positives = 114/225 (50%), Gaps = 5/225 (2%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVV--SWI 127
           F P  YP SV++ Y  +  +   Q  +S     L+T++LL   G+  T A A      +I
Sbjct: 2   FLPEDYPNSVSDDYAAFQAWDTAQGLSSYVRGSLTTKALLEGVGVGATGATAASAAAQFI 61

Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ--LFLEMAG 184
            +D    VG L+ +   G  +D+E K+WR+ AD   ++G  +E+ +PL      FL +A 
Sbjct: 62  ARDMCGMVGGLVFTLAKGRHLDAEAKQWRLFADLANNVGMAMELAAPLANDRWWFLALAC 121

Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
            G+ A+ +   AA ATR  +   FA+  N +D+ AK  +  T   +VG+  G+ + ++  
Sbjct: 122 AGSVARSLCGCAAGATRAALTQHFARARNAADIAAKEGSQETAVTLVGMSLGVAVTTSTD 181

Query: 245 SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
                +  V   L+V HVY  +  +R+  +++LN +R   ++A +
Sbjct: 182 GKTAWQWAVFVALTVAHVYCNVRAVRSLVIDSLNRERVVRLLARY 226


>gi|340967040|gb|EGS22547.1| diphosphomevalonate decarboxylase-like protein [Chaetomium
           thermophilum var. thermophilum DSM 1495]
          Length = 425

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 68/255 (26%), Positives = 132/255 (51%), Gaps = 11/255 (4%)

Query: 65  SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT-- 122
           + L  F P+G+P +V E YL Y  + +LQ F S+  S+L+ +++L   G+  + +  T  
Sbjct: 39  AILYAFLPAGFPHTVTEDYLSYQLYDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGA 98

Query: 123 VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL-FL 180
           ++  I  D +  +  ++ ++ +G  ++ E K +R LAD   D    L++L+P  P    L
Sbjct: 99  LILKITGDTISRIATILFAHRMGRAIEPECKFYRFLADIFNDSAQFLDLLTPALPYFPKL 158

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
            +       + +  VAA A++  + + FA  GNL++L AK  +  T+ +++G+ AG    
Sbjct: 159 SVIVSAGVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLAG---- 214

Query: 241 STVCSSMQGKMIVG---PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
           S V   ++ K +V     LL+ +H+      +R+  + +LN QR  ++  +++  G + +
Sbjct: 215 SLVVRLVEDKQVVWFLMILLAGVHLTMNYRAVRSVKMRSLNRQRATIVFREWLDHGTVLT 274

Query: 298 PADLRYREDLLFPGR 312
           P  +  RE +L   R
Sbjct: 275 PDQVAKRESILCKSR 289


>gi|326428268|gb|EGD73838.1| hypothetical protein PTSG_05532 [Salpingoeca sp. ATCC 50818]
          Length = 376

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 74/259 (28%), Positives = 122/259 (47%), Gaps = 9/259 (3%)

Query: 71  FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWI 127
            P GYP SV+  Y  Y  +   Q F+S+ +  +ST ++L   G+   + T A AT  + +
Sbjct: 51  LPEGYPASVSSDYASYQIWDTAQAFSSSVMGTISTYAVLKGVGVGDAQATVAAATYTN-L 109

Query: 128 LKDGMQHVGKLICSNLGAR-MDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           L+D    +  +  +   +  MD   K+WR++AD + D+   +E++SPL P  FL +  + 
Sbjct: 110 LRDATGMLAGIWFAWYSSTDMDRNAKQWRLVADVVNDMAMLVELVSPLIPAAFLPLMCMA 169

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           +F + I  VA  ATR  +    A+  N++D+ AK  +  TL N+  L   + +      S
Sbjct: 170 SFLRAIVGVAGGATRAALVRHQARLNNMADVSAKDGSQETLVNLAALVVSLAITPLAAES 229

Query: 247 --MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYR 304
             M   M + PL+  +H+Y     +RA  +   N  R     A F++T R+ S       
Sbjct: 230 RVMAWAMFL-PLV-FMHLYCNYRAVRALEMPAFNAHRLHHACAVFMRTNRVPSVCAANQA 287

Query: 305 EDLLFPGRLIEDAGNVKVG 323
           E L      IE    V +G
Sbjct: 288 EPLFPRAPAIEGHAKVCLG 306


>gi|156059886|ref|XP_001595866.1| hypothetical protein SS1G_03956 [Sclerotinia sclerotiorum 1980]
 gi|154701742|gb|EDO01481.1| hypothetical protein SS1G_03956 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 477

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 103/414 (24%), Positives = 187/414 (45%), Gaps = 60/414 (14%)

Query: 63  VDSFLNKFFPSGYPFSVNEGY---------LRYTQFRALQHFTSAALSVLSTQSLLFAAG 113
           V S    F P GYP SV E Y         L Y     L  F S    +L+++++L   G
Sbjct: 49  VSSLTGAFLPVGYPDSVTEDYTACVPCPIHLLYLLCEKLA-FASTIAGLLASRAVLQGLG 107

Query: 114 LRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEV 170
           +  + A AT  V+  +L++    +  ++ ++ LG+ ++ E K++R++AD   D    L+ 
Sbjct: 108 VGDSTASATGAVLLNVLQESAGRIATILFAHRLGSALEPECKKYRLMADIFNDGAMILDC 167

Query: 171 LSPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTL 227
           +SP     P++FL  A   +  K +  VAA +++  + + FAK+GNL++L AK  +  TL
Sbjct: 168 ISPAFPKVPRVFLLSAS--SVCKSLCGVAAGSSKASLSAHFAKQGNLAELNAKDASQETL 225

Query: 228 FNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVA 287
            +++G+  G  + S + S     + +  LLS IH+ +    +R+  + TLN QR  ++++
Sbjct: 226 ISLLGMLVGTFVVSRISSQTATWIALISLLS-IHLGTNYLAVRSVTMRTLNRQRANLVIS 284

Query: 288 DFV------KTG-RISSPADLRYREDLLFPGRLIEDA-----GNVKVGRDPHKVV----- 330
           DFV      KT   + +P D+  +E +      I        G  KVG    +++     
Sbjct: 285 DFVSNINAEKTRFNLPTPKDISRKERIFERDGAIRHIQGVVLGYCKVGVSLREILSSISS 344

Query: 331 -----------KPSKFHELKELFAEEKFILCLGSRWTD--MVLEQNASGEDALRGWLVAA 377
                        S    L +++  + +++    R     ++L++N +    L  W  A 
Sbjct: 345 SPTVSGSYKEETSSIITSLFKIYKNQGYVMWYSRRQNTFLVILKENTAPSVQLDAWFHAV 404

Query: 378 YAASMA------KSFHDPSLTVLQDAYDKMNDVF-----TPLLSELQAKGWHTD 420
           +A S++       S +     +L+   D + D         L  EL+ KGW  D
Sbjct: 405 WALSVSSSPAVGNSTNGSDQDILKWIEDSLRDSSLCKEKANLYEELRNKGWDLD 458


>gi|296411034|ref|XP_002835240.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295628015|emb|CAZ79361.1| unnamed protein product [Tuber melanosporum]
          Length = 413

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 96/372 (25%), Positives = 173/372 (46%), Gaps = 46/372 (12%)

Query: 91  ALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARM 147
           +LQ F+S+   +LS++++L   G+    A AT  ++  I +D +  +  ++ ++  G  +
Sbjct: 6   SLQAFSSSIAGLLSSRAVLEGFGVGDANASATGALLLTIAQDCVSRLATILFAHRFGPAL 65

Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQ----LFLEMAGLGNFAKGIAVVAARATRLP 203
           ++E KR+R+ AD   D    L+ LSP  P+    L L  +G    A G+A   ++A+ L 
Sbjct: 66  EAETKRYRLAADVFNDSAMILDCLSPAFPRSVRILMLCCSGSLRAACGVAGGGSKAS-LS 124

Query: 204 IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVY 263
           ++  FA+ GN+ +L AK  +  T+  ++G+ AG  + S + SS     I   LL  +H+ 
Sbjct: 125 VH--FARSGNVGELNAKDSSQETVIGLMGMLAGSFVVSHI-SSTWATWISLTLLQAVHLG 181

Query: 264 SVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDA-----G 318
           +  + + A  + TLN QR  ++ + F  TG I +P      E +     ++        G
Sbjct: 182 TNFKAVTAVALRTLNRQRANIVCSAFFDTGEILTPRQAARHERIFEKDGVLRWGGSGVLG 241

Query: 319 NVKVGRDPHKVVK-----PSKFH----------ELKELFAEEKFILC--LGSRWTDMVLE 361
           N ++G     +++     P +            EL  +F  EK++L   +  R   + L+
Sbjct: 242 NAEIGVKLTALIRSLRQSPERKKEGEGEGVSPVELASVFQNEKYLLWYDVQRRLVTICLK 301

Query: 362 QNASGEDALRGWLVAAYAA---------SMAKSFHDPSLTVLQD----AYDKMNDVFTPL 408
           +  S E+ L+ W+ A   A           A + + PS   L D    + D +   FTP+
Sbjct: 302 EKVSPEEQLKSWVHALVLAREIRHHWPKEGADNSNRPSQQALLDTVRRSLDDIGMSFTPM 361

Query: 409 LSELQAKGWHTD 420
           L  L+  GW  D
Sbjct: 362 LGRLKGAGWDLD 373


>gi|336273098|ref|XP_003351304.1| hypothetical protein SMAC_03608 [Sordaria macrospora k-hell]
 gi|380092824|emb|CCC09577.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 498

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 71/253 (28%), Positives = 131/253 (51%), Gaps = 11/253 (4%)

Query: 63  VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
           + + L+ F P+GYP SV   YL Y  + +LQ F S+  S+L+ +++L   G+  + +  T
Sbjct: 38  LQALLHAFLPAGYPHSVTPDYLPYQTYDSLQAFFSSITSLLANRAVLEGLGVGDSSSSPT 97

Query: 123 --VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL- 178
             ++  I  D +  +  ++ ++ +G  ++ E K +R LAD L D    L++L+P  P   
Sbjct: 98  GALILKIAGDTISRIATILFAHRMGTAIEPECKFYRFLADLLNDSAQLLDLLTPALPYFP 157

Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
            L +       + +  VAA A++  + + FA  GNL++L AK  +  T+ +++    GI 
Sbjct: 158 KLGVIVSAGVLRSLCGVAANASKASLSAHFALRGNLAELNAKEASQETVVSLL----GIL 213

Query: 239 LASTVCSSMQGKMIVGPL---LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
           + S V   +Q + +V  L   L  +H+      +RA    TLN QR  ++V  +++ GR+
Sbjct: 214 VGSLVVKVVQDRGVVWALMVGLLGVHLLMNYWAVRAVRCRTLNRQRATVVVRGWLEWGRV 273

Query: 296 SSPADLRYREDLL 308
             P ++   E +L
Sbjct: 274 LGPEEVARMESIL 286


>gi|297740212|emb|CBI30394.3| unnamed protein product [Vitis vinifera]
          Length = 462

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 79/311 (25%), Positives = 141/311 (45%), Gaps = 35/311 (11%)

Query: 71  FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWIL-- 128
            P+G+P SV++ YL Y   +   + T+     L T SLL          +A+  SW    
Sbjct: 81  LPAGFPGSVSDDYLEYMLLQFPTNVTAWICHTLVTSSLL----------KASCWSWFFFS 130

Query: 129 --------------KDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP 173
                         +DG+  VG+L I    G   D +PK+WR+ AD +   G+  ++ + 
Sbjct: 131 NHCCCFCCCFCCCHQDGIGAVGRLFIGGQFGNLFDDDPKQWRMYADLIGSAGSIFDLSTQ 190

Query: 174 LCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
           L P  FL++A LGN AK +A      +   I + FA  GNL ++ AK E       ++GL
Sbjct: 191 LYPAYFLQLASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGL 250

Query: 234 GAGIQLAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK 291
             GI + ++  V +S          + ++H++   + +      ++N +R  ++V   V 
Sbjct: 251 ALGIMVLTSPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVL 310

Query: 292 TGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV---KP-SKFHELKELFAEEKF 347
              +    D   +E +L   R ++    +  G    +++   +P SK   L +L+++EK+
Sbjct: 311 HSIVPGYVDCNRKEKILSWQRFLKP--RIIFGVSLEELIGGERPVSKVRTLLKLYSQEKY 368

Query: 348 ILCLGSRWTDM 358
           IL +  + TD 
Sbjct: 369 ILVVKQQGTDF 379


>gi|224090990|ref|XP_002309136.1| predicted protein [Populus trichocarpa]
 gi|222855112|gb|EEE92659.1| predicted protein [Populus trichocarpa]
          Length = 500

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 98/386 (25%), Positives = 164/386 (42%), Gaps = 44/386 (11%)

Query: 71  FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----RPTPAQATVVS 125
            P+G+P SV++ YL+Y   +   + T      L T SLL A G          A A  + 
Sbjct: 107 LPAGFPGSVSDDYLQYMVLQFPTNITGWICHTLVTSSLLKAVGAGSFTGTDAAASAAAIR 166

Query: 126 WILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
           W+ KDG+  +G+L I    G   D +PK+WR+ AD +   G+  ++ + + P  FL +A 
Sbjct: 167 WVSKDGIGALGRLFIGGRFGDLFDDDPKQWRMYADFIGSAGSIFDLTTQVYPAYFLPLAS 226

Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC 244
           LGN  K +A      +   I + FA  GNL ++ AK E       V  +GA +   +   
Sbjct: 227 LGNLTKAVARGLKDPSFRVIQNHFAVSGNLGEVAAKEE-------VWEVGAQLLGLALGI 279

Query: 245 SSMQGKMIVG--PLLS-------VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
             +    +V   PLL+       ++H++   + +     +T+N +R  +++   +   R+
Sbjct: 280 LILDTPDLVKSFPLLTSTWMSMRLLHLWLRYQSLSVLRFDTINLKRARILIKSHILQSRV 339

Query: 296 SSPADLRYREDLL----------FPGRLIED--AGNVKVGRDPHKVVKPSKFHELKELFA 343
               D    E++L            G  +E+   G   V +  H VV   K   L +L+A
Sbjct: 340 LGCVDCNQDENILSWETFTRPRIIFGVPLEEMIGGERSVAKIKHGVV--IKIKMLLKLYA 397

Query: 344 EEKFILCLGSRWTD----MVLEQNASGEDALRGWLVAAYAASMAKSF----HDPSLTVLQ 395
           +EK  L +  + TD    +  +  A+    LR      +     KSF     D     L 
Sbjct: 398 KEKHFLVVNQQKTDFEVLVSFKVGATSMSVLRSVWQTYWLHENWKSFGNRSSDSDYDQLA 457

Query: 396 DAYDKMNDVFTPLLSELQAKGWHTDR 421
            +  +M   F   + +L A GW T +
Sbjct: 458 QSLLEMEVKFEDFMQQLDAVGWDTSQ 483


>gi|167517110|ref|XP_001742896.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163779520|gb|EDQ93134.1| predicted protein [Monosiga brevicollis MX1]
          Length = 272

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 98/185 (52%), Gaps = 2/185 (1%)

Query: 99  ALSVLSTQSLLFAAGLRPTPAQ-ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRI 156
           A  VLS Q+LL+AAGL       A  ++W+LKDG+  +G ++   L G R D +PKR R 
Sbjct: 3   ASGVLSMQALLYAAGLGAGSVPLAAAINWVLKDGLGQLGGVLYGTLFGPRFDRDPKRQRF 62

Query: 157 LADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSD 216
            +          E+L PL P LFL +A   N AK I+ +A+ ATR  +++SF +  NL D
Sbjct: 63  WSLVALQGANLTELLIPLVPHLFLPLASAANIAKNISFLASSATRAQMHASFVRRANLGD 122

Query: 217 LFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNT 276
           + AK  A S   +VVG   GI +++   S +   +     L++  +Y      +   ++T
Sbjct: 123 ITAKIGAQSIGASVVGTALGIFVSTHTGSDVTNIIPAFVPLALACLYFNHLSSKHVVLST 182

Query: 277 LNPQR 281
           LN QR
Sbjct: 183 LNEQR 187


>gi|451846811|gb|EMD60120.1| hypothetical protein COCSADRAFT_151456 [Cochliobolus sativus
           ND90Pr]
          Length = 482

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 108/460 (23%), Positives = 200/460 (43%), Gaps = 66/460 (14%)

Query: 14  EPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRP--VYHRVVDSFLNKFF 71
           E D++  V  +  Y   +S S    +    +G    K VD   P   +  V+   ++ F 
Sbjct: 13  EQDEAGKV--VRTYVLSSSSSAVGEHGKHGEG----KRVDVIVPKQAHKSVLQYIIDVFL 66

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWIL 128
           PSG+P SV E Y+ Y  F +LQ F+S+   +L+++++L   G+     TP  A ++S +L
Sbjct: 67  PSGFPQSVTEDYIEYQVFDSLQAFSSSIAGMLASRAVLQGVGVGDSNATPTAALLLS-VL 125

Query: 129 KDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLG 186
           ++ +  +  +      G  ++ E K +R+ AD L D    L+ LSP  P+ L + +    
Sbjct: 126 QESVGRMATIAFADRFGTALEPECKMYRLTADILNDSAMILDCLSPGFPKPLRVAVLSFA 185

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
           +  + +  V A +++  + + FA  GNL ++ AK  +  T+ +++G+ AG  + S + S 
Sbjct: 186 SCLRSLCGVCAGSSKASLSAHFATAGNLGEVNAKDSSQETVISLLGMLAGSFVVSHITSD 245

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
           +        LL  IH+ +    +RA  + +LN QR  ++ +  ++ G + +P ++  RE 
Sbjct: 246 L-ATWCTLILLLAIHLATNYAAVRAVSMQSLNRQRANILFSHILQHGIVLNPKEVSARER 304

Query: 307 LLFPGRLI-----EDAGNVKVGRDPHKVV-----------------------KPSKFHEL 338
           +   G ++     E  G  ++G    +++                             +L
Sbjct: 305 IFERGGVLRWADDEVLGKCRIGTSLSELLGRIGRRDKLSGSLAFCSSGGDDGSEVDVFDL 364

Query: 339 KELFAEEKFILCLGSRWTD---------MVLEQNASGEDALRGWLVAAYAASMAKSFHDP 389
             +F  E ++L     W           +VL+      D LR W  A   A   +   D 
Sbjct: 365 LSIFEAEDYVL-----WATGGPAAYEAVIVLKAGCGPIDQLRAWAHALLVAEKRQRGGDS 419

Query: 390 SLT------VLQDAYDKMNDV---FTPLLSELQAKGWHTD 420
             T      V+ D  D + +V   F     E++AKGW+ D
Sbjct: 420 GGTRAKHGLVVGDLKDTLYEVRRMFDKYGDEMRAKGWNLD 459


>gi|402074762|gb|EJT70271.1| hypothetical protein GGTG_12444 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 438

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 72/252 (28%), Positives = 133/252 (52%), Gaps = 15/252 (5%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
           F P+G+P SV + YL Y  + +LQ F+S+  S+L+ +++L   G+     +P  A V+  
Sbjct: 48  FLPAGFPHSVTDDYLAYQTYDSLQAFSSSIASLLANRAVLEGLGVGNASQSPTAALVLQ- 106

Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLG---TGLEVLSPLCPQLFLEM 182
           IL+D    +  ++ ++ +G  ++ E K +R +AD   D       L  + P  P++ + +
Sbjct: 107 ILQDTFSRLATILFAHRMGTAIEPECKTYRFMADLFNDASQLLDLLVPVLPFLPKVSVMV 166

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
           +      + +  VAA A++  + + FAK GNL++L AK  +  T+ ++VG+  G  +   
Sbjct: 167 SAA--ILRSLCGVAAGASKASLSAHFAKMGNLAELNAKEASQETVVSLVGMLVGTAVVHM 224

Query: 243 V--CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
           V    ++   MIV   L  IH+      +R   + TLN QR  ++  +F++TG++ +P  
Sbjct: 225 VEDKRAVWCWMIV---LIAIHLMMNYRAVRCVKMLTLNRQRATIVFREFLETGKVLTPGQ 281

Query: 301 LRYREDLLFPGR 312
           +  RE +L  GR
Sbjct: 282 VAARESILRNGR 293


>gi|403418386|emb|CCM05086.1| predicted protein [Fibroporia radiculosa]
          Length = 497

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 100/418 (23%), Positives = 179/418 (42%), Gaps = 67/418 (16%)

Query: 68  NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVS 125
           N F P+GYP SV+  YL+Y  + ALQ F S+  S+++++++L   G+    A AT  +  
Sbjct: 49  NVFLPAGYPASVSPDYLQYQIYNALQAFCSSLASLIASRAVLEGHGVGNADASATHAIFL 108

Query: 126 WILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE--- 181
            +L+D    +  ++    LG  +  E K +R+LAD   D    L+ LSP    + L    
Sbjct: 109 TVLQDIFSRLTTIVSGYYLGTSLFPEAKTYRLLADIFNDAAIVLDTLSPHLDGVVLSLRY 168

Query: 182 ---MAGLGN-----------FAKGIAVVAARATRLPIYSSFAK----EGNLSDLFAKGEA 223
              ++G G+            ++ +  + A  ++  +   FA      G++ DL AK  +
Sbjct: 169 PFVLSGSGSSLRVSALCLSGASRALCGMVAGGSKAALTVHFASPGQGTGDVGDLNAKDGS 228

Query: 224 ISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
             T+  ++G+ +G  +   V S+ +   +V  +L+ +H++     +R   + +LN QR  
Sbjct: 229 KETVLALLGMLSGTAVMHYVHSA-RATYLVLFVLTFLHLFGNYLAVRVVVLRSLNRQRAN 287

Query: 284 MIVADFV---------KTGRISSPADLRYREDLLFPGRLIEDA--------GNVKVG--- 323
           ++   +          K   + +P  +  RE +      + D         G  KVG   
Sbjct: 288 IVWMAYRAHIKGPMGGKRPLVLTPEAVAARERIFANPSALHDTSSPAHPFLGICKVGSSF 347

Query: 324 -----------RDPHKVVKPSKFHELKELFAEEKFILCLG-----SRWTDMVLEQNASGE 367
                      R       P + H + E+F+EE ++L  G     S    + L+      
Sbjct: 348 SSLASRQQSAPRGALAAPSPPQIHTILEIFSEEPYVLWFGPTSGPSCRLHVFLKDGHKPR 407

Query: 368 DALRGWLVAAYAASMAK-----SFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
           D L+ W  A  AA M       SF + +L  +Q +Y  + D+F P L  ++  GW  D
Sbjct: 408 DHLKAWTHAQEAAIMLNGRQPGSF-EVALAAVQTSYALVEDLFPPFLDAVRGVGWRID 464


>gi|295659580|ref|XP_002790348.1| DUF647 domain-containing protein [Paracoccidioides sp. 'lutzii'
           Pb01]
 gi|226281800|gb|EEH37366.1| DUF647 domain-containing protein [Paracoccidioides sp. 'lutzii'
           Pb01]
          Length = 485

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 83/337 (24%), Positives = 154/337 (45%), Gaps = 42/337 (12%)

Query: 63  VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
           + S L  F P+GYP SV++ Y+ Y  + +LQ F+S+   +L+++++L   G+    A  T
Sbjct: 65  LTSLLIVFLPAGYPHSVSDDYMEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPT 124

Query: 123 V--VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-L 178
           V  +  +L++ M  +  ++ ++ LG  ++ E K +R+ AD L D    +E LSP  P+ L
Sbjct: 125 VALLHSVLQESMGRIATILFAHRLGTSLEPECKLYRLAADVLNDSAMVVECLSPAFPKHL 184

Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQ 238
            + +    +  + +  VAA +++  + S FAK GNL +L A G  + +  +   L  G+ 
Sbjct: 185 RVVLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNACGSLVVSHIS-TPLTTGLT 243

Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
           L                 L +IH+ +    +RA  + TLN QR  ++ +   + GR  +P
Sbjct: 244 LI---------------FLLLIHLSTNYAAVRAVNMTTLNRQRANIVFSTLFEEGRALTP 288

Query: 299 ADLRYREDLLFPGRLIE------DAGNVKVGRDPHKVVK--------------PSKFHEL 338
                 E +     ++         G  ++G    ++++              P     L
Sbjct: 289 KQTSKCERIFERDGILRWKASSTTLGFCRIGIAFQELLRSSNVSCRANSIRDIPIDIPRL 348

Query: 339 KELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGW 373
             LF +E++IL     S+   +VL+ N      L+ W
Sbjct: 349 LRLFEKEEYILWFNPVSKKGTIVLKNNVKPVSQLKAW 385


>gi|392564112|gb|EIW57290.1| DUF647-domain-containing protein [Trametes versicolor FP-101664
           SS1]
          Length = 516

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 98/419 (23%), Positives = 168/419 (40%), Gaps = 37/419 (8%)

Query: 50  KVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL 109
           K+   +  +  R+   F     P+ YP SV+  YL +   +  +   +   SVL  Q+LL
Sbjct: 95  KIAAPTNSLTGRISSWFGQMVLPTNYPHSVHRSYLPFHILQFFETTVATITSVLCNQALL 154

Query: 110 FAAGLRPTPA--QATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGT 166
            + G+    +   A  V WI+KDG   V KL          DS PK +    + +  LG+
Sbjct: 155 TSVGVSAEGSVFGAVAVQWIIKDGAGEVAKLFFIRRFSPYFDSHPKTFTFFGELMGCLGS 214

Query: 167 GLEVLSPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
           GL++ + L    P  FL  A  GN  K +       T +     F+++GN  D+ AK E+
Sbjct: 215 GLQIATLLIAPSPGNFLLCAAGGNIFKIVGNAIWFTTHIKFMRYFSEQGNTGDVAAKDES 274

Query: 224 ISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
            S++  +VG  +GI L + V  +      +      +H+      MR A    L   R +
Sbjct: 275 QSSIAQLVGYASGISLLA-VSHTAPYLYAIFAFAVPVHLAMTAYMMRVATFELLTLPRVS 333

Query: 284 MIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNV------KVGR--DPHKVVKPSKF 335
            +  ++V  G++SS  +L         G    D  +       +VG   D    ++  ++
Sbjct: 334 CLAQEYVTRGQVSSLGELDAAHRTGLFGEFYRDKSDRWLTLAPRVGDVLDSGTELERVRW 393

Query: 336 HELKELFAEEKFILCLGSRWTDMVLEQNASGEDAL---------RGWLVAAYAASM---- 382
                +F +++++L    R   +    +A+ +D L         R  L+   A+S     
Sbjct: 394 QICAGVFEDDRYLLFPRGRNISVFFHPDAANDDMLCAIYHAALVRDALINREASSTDVPP 453

Query: 383 AKSFHDPSLT--------VLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGT-GTRFAW 432
           A+   +P+ T        +L D   +    F      L+A+ W TD       G R  W
Sbjct: 454 AEGLLEPTDTDSQAALKLILADTGARAERDFEVFREALEAREWRTDELCYADHGHRVTW 512


>gi|315056527|ref|XP_003177638.1| DUF647 domain-containing protein [Arthroderma gypseum CBS 118893]
 gi|311339484|gb|EFQ98686.1| DUF647 domain-containing protein [Arthroderma gypseum CBS 118893]
          Length = 495

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 99/434 (22%), Positives = 184/434 (42%), Gaps = 71/434 (16%)

Query: 56  RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQ-FR---------ALQHFTSAALSVLS- 104
           +P     + S LN F P GYP SV + Y+  +  FR          +   TS+ L +   
Sbjct: 48  KPSSSSYLTSILNVFLPVGYPHSVTDDYIESSSLFRDYEKKELTSLISGLTSSILKLNCW 107

Query: 105 ---TQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADA 160
               QS +       +P  A +++ +L++ M  +  ++ ++ LG  ++ E K +R+ AD 
Sbjct: 108 SPVIQSSVGVGDATASPTAALLLN-VLQESMGRIATILFAHRLGTSLEPECKMYRLAADL 166

Query: 161 LYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFA 219
           L D    L+ +SP  P+ + + +  L +  + +  VAA +++  + + FA+ GNL +L A
Sbjct: 167 LNDSAIVLDCMSPAFPKPVRVGLLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNA 226

Query: 220 KGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNP 279
           K  +  T+ ++ G+  G  + S + S +    I   LL ++H+ +    +RA  + TLN 
Sbjct: 227 KDSSQETVISLFGMLVGSVVVSYISSPL-ATWIALILLLIVHLGTNHAAVRAVKMTTLNR 285

Query: 280 QRTAMIVADFVKTGRISSPADLRYREDLL-------FPGRLIEDAGNVKVG--------- 323
           QR  ++ +   +  R+ +PA     E +        +  +     G  ++G         
Sbjct: 286 QRANIVFSYLFEDDRVLTPAQASEEERIFETDGVLRWKAKATGTLGICQIGVSLEQLLLL 345

Query: 324 ---------------RDPHKVVKPSKFHELKELFAEEKFILCL--GSRWTDMVLEQNASG 366
                           D   V   +    L ELF EE++IL     +R   +VL+  A+ 
Sbjct: 346 LPEQVNTNAQGSSQKPDTTMVGASTGLSALLELFKEEEYILYFSPAARRGAIVLKAGATP 405

Query: 367 EDALRGWLVAAYAAS-MAKSFHDPS-------------------LTVLQDAYDKMNDVFT 406
           +  L+ W  A   +  +A+  HD +                   L++L+D   +    F 
Sbjct: 406 QAQLKAWSHALLVSRHLAREEHDTAACINEKQGGQGELSPPVSILSILRDTLREHTRTFV 465

Query: 407 PLLSELQAKGWHTD 420
             +  L+  GW TD
Sbjct: 466 DRVKRLEHAGWQTD 479


>gi|401421841|ref|XP_003875409.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322491646|emb|CBZ26919.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 675

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 64/234 (27%), Positives = 111/234 (47%), Gaps = 7/234 (2%)

Query: 55  SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
           +R    RVV +    F P GYP SV E  L +  +  +Q       + LST+++L   G+
Sbjct: 57  ARQALRRVVTALERIFLPDGYPSSVTEDLLPFAFWAHVQVLAKNVTATLSTRAVLLGVGV 116

Query: 115 RPTPAQAT--VVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVL 171
             + A  T   +SW+++DG + +G ++  S +   ++   K W+++ D ++D    LE+ 
Sbjct: 117 GESKADLTSSTLSWMMQDGARMIGSVVFASVIPQGLECRAKTWQLVTDFIHDSAHLLELC 176

Query: 172 SPLCP---QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
           SP  P    +F  +    +  K +A V    TR P    FA   N +D+ AK      + 
Sbjct: 177 SPWLPGGQAIFRVVLVAASVIKALASVCGNGTRAPFTQHFALRNNGADIAAKAATRGHVG 236

Query: 229 NVVGLGAGIQLASTV-CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
            ++GL  GI +A  V  +S +  + V  L S +H+++    +R   +  LN  R
Sbjct: 237 GLIGLALGIVIAYIVPATSRRLNLAVFALCSAVHIFANYRGVRGVQLRHLNAPR 290


>gi|389639940|ref|XP_003717603.1| hypothetical protein MGG_09749 [Magnaporthe oryzae 70-15]
 gi|351643422|gb|EHA51284.1| hypothetical protein MGG_09749 [Magnaporthe oryzae 70-15]
          Length = 479

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 67/258 (25%), Positives = 134/258 (51%), Gaps = 11/258 (4%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTP 118
           ++ +    F P+G+P +V E YL Y  + +LQ F+S+  S+L+ +++L   G+     +P
Sbjct: 41  LIKAIFYAFLPAGFPHTVTEDYLHYQIYDSLQAFSSSIASLLANRAVLEGLGVGDASQSP 100

Query: 119 AQATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLG-TGLEVLSPLCP 176
             A V+  I++D +  +  ++ ++ +G  ++ E K +R LAD   D       ++  L  
Sbjct: 101 TAALVLK-IIQDTISRLATIMFAHRMGTAIEPECKSYRFLADIFNDAAQILDLLVPVLPL 159

Query: 177 QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
              + +       + +  VAA A++  + + FAK GNL++L AK  +  T+ ++VG+  G
Sbjct: 160 LPKVSVLVTSGCLRSLCGVAAGASKASLSAHFAKTGNLAELNAKEASQETVVSLVGMLTG 219

Query: 237 IQLASTV--CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
             +   V   +++   M+V   + ++  Y  +  +R   + TLN QR  ++  +F++TG+
Sbjct: 220 TAVVHMVEDKTAVWCWMVVLICMHLMMNYRAVRNVR---MLTLNRQRATIVFREFLETGK 276

Query: 295 ISSPADLRYREDLLFPGR 312
           + +PA    RE +L   R
Sbjct: 277 VLTPAQCSARESILLNSR 294


>gi|303278700|ref|XP_003058643.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459803|gb|EEH57098.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 679

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 56/186 (30%), Positives = 95/186 (51%), Gaps = 16/186 (8%)

Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
           DS+ KR+R  +  +YD  + +E+++P  P  FL +A + N  K + +  A   R PI  S
Sbjct: 314 DSDVKRFRFTSSVVYDCASLIEMITPFYPSKFLLLATIANVGKSVGITTANVVRAPIQMS 373

Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM------IVGPLLSV-- 259
           FA E NL+++ AK  A   L + +GL   +     V +   GK+      +  PLL+   
Sbjct: 374 FALEENLAEIAAKTSAQQVLADNIGLALAV-----VATGWTGKIANPRLRLAVPLLAFAP 428

Query: 260 ---IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
              + +Y +  E+++  + T+N +R  +I A FV+TG I S   +   E L  P R+ E 
Sbjct: 429 LASVDLYCIYRELKSVRLRTINKERGEIIAAGFVETGTIPSAVSVAAAERLFIPARMDES 488

Query: 317 AGNVKV 322
           +  +K+
Sbjct: 489 SLPLKI 494



 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 35/87 (40%), Positives = 49/87 (56%), Gaps = 11/87 (12%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTP 118
           VVD+FL    P     +V   Y  Y +FR  Q   S+ ++V +TQ +L A GL   R  P
Sbjct: 170 VVDAFL----PCAN--AVTPDYWAYAKFRFAQRVASSCITVFATQQMLQAVGLGATRRLP 223

Query: 119 AQATVVSWILKDGMQHVGKL-ICSNLG 144
           A A  V+W+LKDG+  +GKL + +N G
Sbjct: 224 A-AAAVNWVLKDGLGRLGKLSVATNFG 249


>gi|308804794|ref|XP_003079709.1| unnamed protein product [Ostreococcus tauri]
 gi|116058166|emb|CAL53355.1| unnamed protein product [Ostreococcus tauri]
          Length = 499

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 67/249 (26%), Positives = 109/249 (43%), Gaps = 21/249 (8%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ---ATVVSW 126
           F PSG+P +V+  Y  + ++          + ++S QSLL A G+  TP          W
Sbjct: 120 FLPSGFPQTVSADYADWLRWHLTSLLFRDVIEIMSAQSLLVALGVGSTPGALPLTAAAKW 179

Query: 127 ILKDGMQHVGKLICSNLGAR-MDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
           +LKDG+     L+    G R  D +PKRW  +++AL D+   +E+++P  P LFL +A  
Sbjct: 180 VLKDGVGSFATLLAGAFGGRWYDEDPKRWWAVSNALEDVARAIELVTPAAPGLFLPLAAS 239

Query: 186 GNFAKGIAVVAARATRL--PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
             F +  A +  R + +       F +  NL D+ AK E       +V L  GIQ+   V
Sbjct: 240 ATFVRA-AALTGRGSLMNGTFMQHFGRNNNLGDIRAKLEVQGRWLALVALPVGIQVFQLV 298

Query: 244 CSSMQGKMIVGPLLSVI-------------HVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
            +        G  L                H++S  +  R+   ++LN  R   +   +V
Sbjct: 299 NAQASALHAEGDELGAAAAAFGGYGLVIGAHIFSCWQAARSLKFDSLNGFRLLRLAEAYV 358

Query: 291 KT-GRISSP 298
            + G+I  P
Sbjct: 359 ASDGKIELP 367


>gi|115434452|ref|NP_001041984.1| Os01g0141600 [Oryza sativa Japonica Group]
 gi|54290177|dbj|BAD61065.1| unknown protein [Oryza sativa Japonica Group]
 gi|113531515|dbj|BAF03898.1| Os01g0141600 [Oryza sativa Japonica Group]
          Length = 323

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 61/192 (31%), Positives = 95/192 (49%), Gaps = 17/192 (8%)

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
           R +  F+    P+G+P SV++ YL Y  ++   + T      L T +LL A G+      
Sbjct: 101 RAIKDFV---LPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGT 157

Query: 121 ATV-----VSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
           +       + W+ KDG+   G+L I    G   D +PK+WR+ AD +   G+  E+++PL
Sbjct: 158 SAAASAAAIRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPL 217

Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLP----IYSSFAKEGNLSDLFAKGEAISTLFNV 230
            P  FL +A LGN AK +     R  R P    I + FA+ GNL ++ AK E       +
Sbjct: 218 YPGYFLPLASLGNLAKAV----GRGFRDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQL 273

Query: 231 VGLGAGIQLAST 242
           +GL  G+ +  T
Sbjct: 274 LGLSIGVFIMDT 285


>gi|393217163|gb|EJD02652.1| DUF647-domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 455

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 92/381 (24%), Positives = 164/381 (43%), Gaps = 36/381 (9%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
           F PSG+P SV   Y+ Y  F +LQ F+S+   +LS++++L  F  G     +   V+  +
Sbjct: 50  FLPSGFPDSVTPDYIGYQIFDSLQAFSSSIAGLLSSRAVLEGFGVGNENASSTHAVLLTV 109

Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP-QLFLEMAGL 185
           L+D    +  ++ +   G  +  E K +R+ AD   D    L+ LSP  P QL +    L
Sbjct: 110 LQDATGRLATILFAWKFGPALAPEAKMYRLAADIFNDSAFVLDCLSPALPTQLRIAALCL 169

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
               + I  V     +  +   FA+ GN+ +L AK  +  T+  + G+  G  + + + +
Sbjct: 170 AGALRAICGVCGGGAKAALSVHFARTGNVGELNAKDASQETVIGLFGMLCGSFVMAHI-T 228

Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV-KTGRISSPADLRYR 304
           S +   I    L   H+ +    +RA  + TL+ QRT +    +  ++  + +P+++  R
Sbjct: 229 SRRTTWIALIFLLTAHLVTNYLAVRAVTLTTLSRQRTNIAYGVYCEQSNNVLTPSEVARR 288

Query: 305 EDLL-FPGRLIEDA------GNVKVGRDPHKVVKPSKFHELKE------LFAEEKFILCL 351
           E +   PG  I D+      G   +   P ++++ S+   +         F  E ++L L
Sbjct: 289 ERIFEVPGA-IRDSLQGFLLGTCTICTSPEELLRRSRQGSMASWKNALVAFEHEAYVL-L 346

Query: 352 GSRW-----TDMVLEQNASGEDALRGWLVAAYAASMAKS----------FHDPSLTVLQD 396
           G R        +  +  A  ED L+ W+ A     +A S              S + LQ 
Sbjct: 347 GIRADFPCKILICFKHGAKPEDYLKAWVNAVELCRIAGSRLQVGQESNNVTPGSFSALQK 406

Query: 397 AYDKMNDVFTPLLSELQAKGW 417
           + +  N  F   L +    GW
Sbjct: 407 SLECTNTKFLDFLDKANKAGW 427


>gi|413949258|gb|AFW81907.1| hypothetical protein ZEAMMB73_025152 [Zea mays]
          Length = 253

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 48/126 (38%), Positives = 73/126 (57%), Gaps = 3/126 (2%)

Query: 65  SFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--A 121
           SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+ V +T++LL + G+  + A   A
Sbjct: 120 SFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVSQSRAASGA 179

Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
             ++WILKDG   VGK+  +  G + D + K+ R   D L +LG G+E+ +   PQ FL 
Sbjct: 180 VAINWILKDGAGRVGKMFFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQFFLP 239

Query: 182 MAGLGN 187
           MA + N
Sbjct: 240 MACIAN 245


>gi|440465939|gb|ELQ35234.1| hypothetical protein OOU_Y34scaffold00720g28 [Magnaporthe oryzae
           Y34]
 gi|440485195|gb|ELQ65173.1| hypothetical protein OOW_P131scaffold00517g6 [Magnaporthe oryzae
           P131]
          Length = 431

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 67/250 (26%), Positives = 131/250 (52%), Gaps = 11/250 (4%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
           F P+G+P +V E YL Y  + +LQ F+S+  S+L+ +++L   G+     +P  A V+  
Sbjct: 40  FLPAGFPHTVTEDYLHYQIYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPTAALVLK- 98

Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLG-TGLEVLSPLCPQLFLEMAG 184
           I++D +  +  ++ ++ +G  ++ E K +R LAD   D       ++  L     + +  
Sbjct: 99  IIQDTISRLATIMFAHRMGTAIEPECKSYRFLADIFNDAAQILDLLVPVLPLLPKVSVLV 158

Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV- 243
                + +  VAA A++  + + FAK GNL++L AK  +  T+ ++VG+  G  +   V 
Sbjct: 159 TSGCLRSLCGVAAGASKASLSAHFAKTGNLAELNAKEASQETVVSLVGMLTGTAVVHMVE 218

Query: 244 -CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
             +++   M+V   + ++  Y  +  +R   + TLN QR  ++  +F++TG++ +PA   
Sbjct: 219 DKTAVWCWMVVLICMHLMMNYRAVRNVR---MLTLNRQRATIVFREFLETGKVLTPAQCS 275

Query: 303 YREDLLFPGR 312
            RE +L   R
Sbjct: 276 ARESILLNSR 285


>gi|125568971|gb|EAZ10486.1| hypothetical protein OsJ_00318 [Oryza sativa Japonica Group]
          Length = 459

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 64/198 (32%), Positives = 96/198 (48%), Gaps = 19/198 (9%)

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-----R 115
           R +  F+    P+G+P SV++ YL Y  ++   + T      L T +LL A G+      
Sbjct: 118 RAIKDFV---LPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGT 174

Query: 116 PTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL 174
              A A  + W+ KDG+   G+L I    G   D +PK+WR+ AD +   G+  E+++PL
Sbjct: 175 SAAASAAAIRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPL 234

Query: 175 CPQLFLEMAGLGNFAKGI------AVVAARATRLP----IYSSFAKEGNLSDLFAKGEAI 224
            P  FL +A LGN AK I           R  R P    I + FA+ GNL ++ AK E  
Sbjct: 235 YPGYFLPLASLGNLAKKIIDSGLLKQAVGRGFRDPSFRVIQNHFAESGNLGEVAAKEEVW 294

Query: 225 STLFNVVGLGAGIQLAST 242
                ++GL  G+ +  T
Sbjct: 295 EVGAQLLGLSIGVFIMDT 312


>gi|67972176|dbj|BAE02430.1| unnamed protein product [Macaca fascicularis]
          Length = 326

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 74/299 (24%), Positives = 135/299 (45%), Gaps = 26/299 (8%)

Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLP 203
           G+++D   K+WR+ AD L D+   LE+++P+ P  F       N AK I  VA  ATR  
Sbjct: 12  GSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAA 71

Query: 204 IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVY 263
           +    A+  N++D+ AK  +  TL N+ GL   + +   V       +     L+ +H+Y
Sbjct: 72  LTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIY 131

Query: 264 SVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG 323
           +    +R+  + TLN  R  +++  +++ G +  P     R + L+ G     A ++ +G
Sbjct: 132 ANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RMEPLWTG--FWPAPSLSLG 188

Query: 324 RDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR----GWLV 375
              H++V  S   EL++L    +E ++LC         +VL Q A  +  LR    G ++
Sbjct: 189 VPLHRLV--SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTVLRAATHGLML 246

Query: 376 AAYAA-------------SMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR 421
            A                 +       S  ++++ +  ++ +F   L  LQ  GW T++
Sbjct: 247 GALQGDGPLPAELEELRNQVRAGPKKESWVIVKETHQVLDMLFPKFLKGLQDAGWKTEK 305


>gi|225683629|gb|EEH21913.1| DUF647 domain-containing protein [Paracoccidioides brasiliensis
           Pb03]
          Length = 509

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 83/346 (23%), Positives = 161/346 (46%), Gaps = 40/346 (11%)

Query: 65  SFLNKFFPSGYPFSVNEGYL-----------RYTQFRALQHFTSAALSVLSTQSLLFAAG 113
           S L  F P+GYP SV++ Y+           +Y    +LQ F+S+   +L+++++L   G
Sbjct: 67  SLLIVFLPAGYPHSVSDDYMDLLFLALENDRKYKD--SLQAFSSSIAGLLASRAVLQGVG 124

Query: 114 LRPTPAQATV--VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEV 170
           +    A  TV  +  +L++ +  +  ++ ++ LG  ++ E K +R  AD L D    ++ 
Sbjct: 125 VGDASASPTVALLHSVLQESIGRIATILFAHRLGTSLEPECKLYRFAADVLNDSAMVVDC 184

Query: 171 LSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
           LSP  P+ L + +    +  + +  VAA +++  + S FAK GNL +L AK  +  T+ +
Sbjct: 185 LSPAFPKHLRVVLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNAKDSSQETVIS 244

Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
           ++G+  G  + S + + +   + +  LL +IH+ +    +RA  + TLN QR  ++ +  
Sbjct: 245 LMGMLCGSLVVSHISTPLTTGLALIFLL-LIHLSTNYAAVRAVNMTTLNRQRANIVFSTL 303

Query: 290 VKTGRISSPADLRYREDLLFPGRLIE------DAGNVKVGRDPHKVV------------- 330
            + GR  +P      E +     ++         G  ++G    +++             
Sbjct: 304 FEEGRALTPKQTSKCERIFERDGILRWKASSTTLGFCRIGNSFQELLCGSNGSYRANSIR 363

Query: 331 -KPSKFHELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGW 373
             P     L  LF +E++IL     SR   +VL+ +      L+ W
Sbjct: 364 DAPIDIPRLLRLFEKEEYILWFNPVSRRGTIVLKNDVRPVSQLKAW 409


>gi|452842814|gb|EME44750.1| hypothetical protein DOTSEDRAFT_130464 [Dothistroma septosporum
           NZE10]
          Length = 447

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 95/392 (24%), Positives = 181/392 (46%), Gaps = 48/392 (12%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSW 126
           F P G+P SV   YL Y  + +LQ F+S+  S+L+++++L + G+     +P  A ++S 
Sbjct: 47  FMPVGFPNSVTHDYLEYQIYDSLQAFSSSIASLLASRAVLSSVGVGDADASPTAAILLSI 106

Query: 127 ILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGL 185
                 +    L    LG  +  E K +R+LAD L D G  L+++S   P+ + + +   
Sbjct: 107 FQTSAGRLATILFAHRLGTSLGPECKMYRLLADILNDTGFLLDMISVAFPKPISVIILSF 166

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
            +  + +  VAA + +  + + FA+ GNL++L AK  +  T+ +++G+ AG  + S +  
Sbjct: 167 SSILRSLCGVAAGSAKASLSAHFARWGNLAELNAKDSSQETVISLLGMWAGSVVISYITE 226

Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
                  +  LL+ IH+ +    +RA  + TLN QR A ++   ++ G + +P ++  +E
Sbjct: 227 PTAAWTTLIVLLA-IHLETNRRAVRAVTMQTLNRQR-ATLMFHHLQRGHVPTPVEIASQE 284

Query: 306 -----DLLFPGRLIEDAGNVKVGRDPHKVVK------------PSK---FHELKELFAEE 345
                D +  G      G+  VG    +++K            PS+      +  ++ + 
Sbjct: 285 RIFERDGVLRGNDDRILGHCAVGASTERLLKAVSGAQTSTKAFPSQGPAMQSILTVYEKS 344

Query: 346 KFILCLGSRWTD-----------MVLEQNASGEDALRGW---LVAAYAASMAKSFHD--- 388
            +IL     W D           +VL++ A+  + L  W   L  A A ++ KS      
Sbjct: 345 PYIL-----WQDRRSLPGRSSLFIVLKKGATPPEMLTAWWQALAVASANNIKKSTQPDQY 399

Query: 389 PSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
             L +++   D+  ++     + L+ KGW  D
Sbjct: 400 AELELMRSTMDQAANLLKQHENALKEKGWDLD 431


>gi|167526040|ref|XP_001747354.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774189|gb|EDQ87821.1| predicted protein [Monosiga brevicollis MX1]
          Length = 283

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 56/184 (30%), Positives = 94/184 (51%), Gaps = 1/184 (0%)

Query: 127 ILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
           IL+D    +G +  + + G+ +D+  K+WR++AD L D    LE++SPL P  FL +  +
Sbjct: 4   ILRDATGMIGSISFAWIQGSDLDNNAKQWRLVADVLNDTAMLLELISPLLPAYFLFLVCV 63

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
            + A+ I  VA  ATR  +    A+  N+ D+ AK  +  TL N+V L   + L   V  
Sbjct: 64  ASLARAIVGVAGGATRAALIQHQARNNNMGDVAAKDGSQETLVNLVALLVNLILVPLVSQ 123

Query: 246 SMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
           + Q    +  L +  H++   + +R+  +  LN +R  +IV  + +  +I SP     RE
Sbjct: 124 NAQVVWSLFGLSTCCHIFFNYKAVRSLCMTRLNGRRAHLIVEHYTRHQQILSPKIANQRE 183

Query: 306 DLLF 309
            L F
Sbjct: 184 PLFF 187


>gi|413956563|gb|AFW89212.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
          Length = 263

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 55/172 (31%), Positives = 90/172 (52%), Gaps = 3/172 (1%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
           + R+    L  F P G+P SV   Y+ +  +  LQ  ++   ++LSTQ+LL A G+    
Sbjct: 50  WERIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQS 109

Query: 119 AQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           A        W L+D    +G ++ +   G+ +DS  K WR++AD + DLG  +++LSPL 
Sbjct: 110 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 169

Query: 176 PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTL 227
           P   + +  LG+ ++    VA+ ATR  +   FA   N +D+ AK  +  TL
Sbjct: 170 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETL 221


>gi|226292974|gb|EEH48394.1| DUF647 domain-containing protein [Paracoccidioides brasiliensis
           Pb18]
          Length = 495

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 81/335 (24%), Positives = 151/335 (45%), Gaps = 42/335 (12%)

Query: 65  SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV- 123
           S L  F P+GYP SV++ Y+ Y  + +LQ F+S+   +L+++++L   G+    A  TV 
Sbjct: 67  SLLIVFLPAGYPHSVSDDYMEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVA 126

Query: 124 -VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFL 180
            +  +L++ +  +  ++ ++ LG  ++ E K +R  AD L D    ++ LSP  P+ L +
Sbjct: 127 LLHSVLQESIGRIATILFAHRLGTSLEPECKLYRFAADVLNDSAMVVDCLSPAFPKHLRV 186

Query: 181 EMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
            +    +  + +  VAA +++  + S FAK GNL +L A G  + +  +   L  G+ L 
Sbjct: 187 VLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNACGSLVVSHIS-TPLTTGLALI 245

Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
                           L +IH+ +    +RA  + TLN QR  ++ +   + GR  +P  
Sbjct: 246 ---------------FLLLIHLSTNYAAVRAVNMTTLNRQRANIVFSTLFEEGRALTPKQ 290

Query: 301 LRYREDLLFPGRLIE------DAGNVKVGRDPHKVV--------------KPSKFHELKE 340
               E +     ++         G  ++G    +++               P     L  
Sbjct: 291 TSKCERIFERDGILRWKASSTTLGFCRIGNSFQELLCGSNGSYRANSIRDAPIDIPRLLR 350

Query: 341 LFAEEKFILCLG--SRWTDMVLEQNASGEDALRGW 373
           LF +E++IL     SR   +VL+ +      L+ W
Sbjct: 351 LFEKEEYILWFNPVSRRGTIVLKNDVRPVSQLKAW 385


>gi|428168470|gb|EKX37414.1| hypothetical protein GUITHDRAFT_89666 [Guillardia theta CCMP2712]
          Length = 331

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 55/165 (33%), Positives = 85/165 (51%), Gaps = 3/165 (1%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
           F PSGYP S+ + Y +Y  F  LQ  +S    +LSTQ +L     G     + A  + W+
Sbjct: 130 FLPSGYPASLPDEYTKYQAFNLLQDLSSNLRGILSTQKILEGMGVGKAGVTSLAATMQWM 189

Query: 128 LKDGMQHVGKLICSNLGAR-MDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
            +DG   +G L+ +   +     + KRWR+ AD + D+   L++L+P  P LFL +  L 
Sbjct: 190 ARDGSSMLGGLLFTAFASSSFGIDIKRWRLFADLINDVALLLDMLAPAYPDLFLPIICLS 249

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
           +  K +  VAA A    I   +A  GN++D+ AK  A  T  +V+
Sbjct: 250 SVFKAMCGVAAGACNTAIVQHWAARGNIADVGAKTGAQHTFVSVL 294


>gi|452823886|gb|EME30893.1| hypothetical protein Gasu_19060 [Galdieria sulphuraria]
          Length = 346

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 65/233 (27%), Positives = 110/233 (47%), Gaps = 17/233 (7%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWIL- 128
           F PSGYP SV + Y  + ++ +LQ   S    V+ T+S L   G+    A  T  + +L 
Sbjct: 68  FLPSGYPESVTDDYANFEKWDSLQGLCSYLRQVMVTRSTLSGLGVGNNFAHPTAAALMLL 127

Query: 129 -KDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGL 185
            ++G   VG L+ +    +R D + K WR+ AD + ++   +E  +PL P  +F  +   
Sbjct: 128 WREGFGVVGGLLFTWGKSSRFDKDSKSWRLFADVINNVALTMEWTAPLFPTPVFFALHSS 187

Query: 186 GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVC- 244
            N    +  +A  AT     + FAK  N+ D+ +K   I    ++VGL     L ST+C 
Sbjct: 188 ANIFHSLCGIAGGATCTAFQAHFAKRNNIGDICSKHGNIGRAVSLVGL-----LLSTLCF 242

Query: 245 -------SSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
                  +S+     V   L+V H+Y  I  +R+  +  LN +R  ++   ++
Sbjct: 243 WSSSLLENSLPFSFGVFCFLTVFHIYFNILALRSLHLAVLNKERGYIVAEHYI 295


>gi|259484769|tpe|CBF81274.1| TPA: DUF647 domain protein (AFU_orthologue; AFUA_6G06850)
           [Aspergillus nidulans FGSC A4]
          Length = 564

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 79/301 (26%), Positives = 149/301 (49%), Gaps = 20/301 (6%)

Query: 64  DSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQ 120
           D  +  F P+GYP SV++ Y  Y  F +LQ F+S+   +L+++++L   G+     +P  
Sbjct: 67  DVLIEVFLPAGYPHSVSDDYTAYQIFDSLQAFSSSIAGLLASRAVLQGVGVGNANASPTS 126

Query: 121 ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
           A ++  IL+D    +  ++ ++ +G  ++ E K +R  AD   DL   L+  SP+ P  F
Sbjct: 127 ALLLH-ILQDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCFSPMVPAGF 185

Query: 180 LEMAGL--GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
             +  L      + +  VA  +++  + + FA+ GNL+++ AK  +  T+ +++G+  G 
Sbjct: 186 NRVTVLSAAGVLRALCGVAGGSSKASLSAHFARWGNLAEVNAKDSSQETVISLIGMLVGS 245

Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
            + S + S     + +  LLS +H+      +R+  + TLN QR     A+ V +  +SS
Sbjct: 246 FVVSHITSFTATWLTLVFLLS-MHLSLNYAAVRSVQMTTLNRQR-----ANIVFSTLLSS 299

Query: 298 PADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELK-ELFAEEKFILCLGS--R 354
             DL   + +  P R  E   ++K     H+   P+    +  E  A+E+ I   G+  +
Sbjct: 300 DPDL--ADFVSSPSRKDERRASLKQQEHTHQ--NPNHIRPITPEQVAKEERIFAAGAALK 355

Query: 355 W 355
           W
Sbjct: 356 W 356


>gi|255074355|ref|XP_002500852.1| predicted protein [Micromonas sp. RCC299]
 gi|226516115|gb|ACO62110.1| predicted protein [Micromonas sp. RCC299]
          Length = 561

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 55/184 (29%), Positives = 91/184 (49%), Gaps = 7/184 (3%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ---ATVVSW 126
           F P+GYP +V+  Y  + ++          L V++ QSLL A G+  TP          W
Sbjct: 126 FLPAGYPDTVSPDYAPFIRWHLGSLMFRNVLEVITAQSLLVALGMGSTPGALPLTAATKW 185

Query: 127 ILKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
           +LKDG+  +  L   +LG  + D +PKRW ++++A  D+   +E+++P  P LFL +A  
Sbjct: 186 VLKDGIGSLATLAAGSLGGQKCDEDPKRWWMVSNAFEDVARVIELVTPAAPALFLPLAAT 245

Query: 186 GNFAKGIAVVAARATRL--PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
             F +  A +  R + +   +   FA+  N +D+ A+ E       +V L  GI +  TV
Sbjct: 246 ATFVR-TAALTGRGSLVNGSLMQHFARNENTADVRARLEVQGRWLALVALPVGIGIFRTV 304

Query: 244 CSSM 247
             S 
Sbjct: 305 SQSF 308


>gi|403412621|emb|CCL99321.1| predicted protein [Fibroporia radiculosa]
          Length = 4586

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 114/447 (25%), Positives = 189/447 (42%), Gaps = 60/447 (13%)

Query: 22  QEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLN----KFFPSGYPF 77
           +E  + W E+   +T+ +  +  G L  +  + +    + +V  F +     F P+ YP 
Sbjct: 68  RECHVSWTESG--ITKEWH-EMSGALQERGEEQTTSTTNSIVGKFTSWLRQMFLPTNYPQ 124

Query: 78  SVNEGYLRYTQFRALQHFTS---AALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGM 132
           SV+  YLR   F ALQ F +     +SVL  Q+LL + G+    +   A  V WI+KDG 
Sbjct: 125 SVHPSYLR---FHALQFFETTFGTVVSVLCNQALLTSVGVSAEGSIFGAVAVQWIIKDGA 181

Query: 133 QHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPL---CPQLFLEMAGLGNF 188
             V KL          DS PK + +  + +  +G+ L++ + L    P  FL  A  GN 
Sbjct: 182 GEVAKLFFIRRFSPYFDSHPKSFTLFGEGIVAIGSCLQMATLLITPTPGNFLLCAAGGNI 241

Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
            K +       T +     F+ +GN+ D+ AK E+ +++  + G  AGI L  TV  S  
Sbjct: 242 FKLVGYAVWFTTHIKWVRYFSLQGNVGDVAAKDESQTSIAQLCGYAAGIGLL-TVSHSPA 300

Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVK------TGRISSPADLR 302
               +   L+ +H+ + +  MR A    L   R +++  ++ +      TG++++   LR
Sbjct: 301 YLYSLFFALTPLHLTATMFMMRDAKFEALTLPRLSLLAREYAREDGEAATGKVNT---LR 357

Query: 303 YREDLLFPG-----------RLIEDAGNVK--VGRDPHKVVKPSKFHELKELFAEEKFIL 349
             E+    G           R +E A  V   +G D  +  +P  +      F  EK++L
Sbjct: 358 EIEESRQTGAFGEFFKRKNDRFVELAPRVTEVIGGDVDR--EPVLWEMCTSEFQTEKYLL 415

Query: 350 CLGSRWTD----MVLEQNASGEDALRGWLVAA-----YAASMAKSFHDP------SLTV- 393
              +  ++    +    +A+ +D LR  L AA        S   S   P       L V 
Sbjct: 416 YPSASPSEKPITVFYHPDATSDDTLRSILHAARFRHDLLESRLSSLDSPWSSRTGELQVA 475

Query: 394 LQDAYDKMNDVFTPLLSELQAKGWHTD 420
           L  ++      F     EL  KGW TD
Sbjct: 476 LASSHAWTRQHFGAFKQELDDKGWRTD 502


>gi|157869108|ref|XP_001683106.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68223989|emb|CAJ04889.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 678

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 62/234 (26%), Positives = 110/234 (47%), Gaps = 7/234 (2%)

Query: 55  SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
           +R    RVV +      P GYP SV E  L +  +   Q       + LST+++L   G+
Sbjct: 57  ARQARRRVVTTLERILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGV 116

Query: 115 RPTPAQAT--VVSWILKDGMQHVGKLICSNLGAR-MDSEPKRWRILADALYDLGTGLEVL 171
             + A  T   +SW+++DG + +G ++ + L  + ++   K W+++AD   D+   LE+ 
Sbjct: 117 GESKADLTSATLSWMMQDGTRMIGSVVFAPLIPQGLECRAKTWQLVADFTTDIANLLELC 176

Query: 172 SPLCP---QLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
           +P  P     F  +    +  K +  V    TR      FA   N +D+ AK    S++ 
Sbjct: 177 APWLPGGQTTFRVVLVAASVVKALGSVCRNCTRASFTRHFALRNNAADIAAKASTRSSVG 236

Query: 229 NVVGLGAGIQLASTV-CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
           + +GL  G+ +A  V  +S    ++V  L SV+H+++    +R   +  LN  R
Sbjct: 237 SFIGLVLGVAVAYIVPATSRSLNLVVFALGSVLHIFASYRSVRGVQLKHLNAPR 290


>gi|444725757|gb|ELW66311.1| hypothetical protein TREES_T100000484 [Tupaia chinensis]
          Length = 406

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 87/365 (23%), Positives = 156/365 (42%), Gaps = 46/365 (12%)

Query: 83  YLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLIC 140
           + R+T  R    F S+    L+TQ++L   G+    A   A   +W++KD    +G+++ 
Sbjct: 41  WWRFT--RPFAAFASSLSGSLATQAVLLGIGVGNARASVSAATATWLVKDSTGMLGRIVF 98

Query: 141 SNLGARMDSEPKRWR-ILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARA 199
           +            W+ + AD L D+   LE+++P+ P  F       N AK +  VA  A
Sbjct: 99  A-----------WWKGLFADILNDVAMFLEIMAPIYPIFFTMTICTSNLAKCVVSVAGGA 147

Query: 200 TRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSV 259
           TR  +    A+  N++D+ AK  +  TL N+ GL   + +   V       +    LL+ 
Sbjct: 148 TRAALTMHQARRNNMADVAAKDGSQETLVNLAGLLVSLLMLPLVSDFPSVSLGCFFLLTA 207

Query: 260 IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL--FPGRLIEDA 317
           +H+Y+    +RA  + TLN  R  +++  F++TG +  P      E L   F   L    
Sbjct: 208 LHIYANYRAVRALVMETLNEGRLRLVLKHFLRTGNVLDPTSANQMEPLWTGFCPSL---- 263

Query: 318 GNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALRGW 373
            ++ +G   H+++  S   +L++L    +E ++LC         +VL Q A  E  LR  
Sbjct: 264 -SLSLGVPLHRLI--SSVSDLQQLVEGHQEPYLLCWDQSRNQVQVVLSQKAGPETVLRAA 320

Query: 374 LVAAYAASMAKSFHDP-----------------SLTVLQDAYDKMNDVFTPLLSELQAKG 416
                  ++      P                 S  +++  +  ++ +F   L  LQ  G
Sbjct: 321 THGLVLGTLQGDGPLPGELEELRNWVRAGPEKDSRVIVRKMHQVLDTLFPKFLKGLQDAG 380

Query: 417 WHTDR 421
           W T++
Sbjct: 381 WKTEK 385


>gi|449690512|ref|XP_002160836.2| PREDICTED: UPF0420 protein-like, partial [Hydra magnipapillata]
          Length = 172

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/128 (36%), Positives = 78/128 (60%), Gaps = 3/128 (2%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATV 123
           F N F P G+P SV+E YLRY  +  LQ F S+   +L+TQ++L A G+        A  
Sbjct: 45  FQNVFLPQGFPESVSEDYLRYQLWDTLQAFCSSITGMLATQAMLKAYGVGDNTATVTAAT 104

Query: 124 VSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
           ++W+LKDG   +G+++ +   G ++D + KRWR++AD L +    L++++PL  Q F+ +
Sbjct: 105 ITWMLKDGAGMIGRILFAWKKGTQLDCDAKRWRLVADVLNNCAIMLDLVAPLFKQYFVLI 164

Query: 183 AGLGNFAK 190
           A L + A+
Sbjct: 165 ACLSSVAR 172


>gi|302755442|ref|XP_002961145.1| hypothetical protein SELMODRAFT_402795 [Selaginella moellendorffii]
 gi|300172084|gb|EFJ38684.1| hypothetical protein SELMODRAFT_402795 [Selaginella moellendorffii]
          Length = 343

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 77/277 (27%), Positives = 125/277 (45%), Gaps = 38/277 (13%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
           F P GYP SV+  YL +  +  LQ  ++   S+LSTQ+LL   G+      A      W 
Sbjct: 54  FLPEGYPSSVSSDYLAFQTWDTLQGLSTYIRSMLSTQALLSGIGVGAATATAIGATFQWF 113

Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           L+D    VG ++ +   G+ +D+  K+WR+ AD + D                       
Sbjct: 114 LRDLTGMVGGIVFTLYQGSNLDNRAKQWRLAADFMND----------------------- 150

Query: 187 NFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSS 246
                IA VA  ATR  +   FA++ N +D+ AK  +  T   +VG+  G+ LA    ++
Sbjct: 151 -----IAGVAGGATRAALTQHFARKQNAADVSAKEGSQETAATLVGMIVGMFLARLTANN 205

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRED 306
           +         L+  H+Y+  + + A  + ++N +R A+++  F+K G+  SP +   +E 
Sbjct: 206 IFLMWTSFMSLTAFHMYANYKAVCALCLTSINAERMAIVLQSFLKDGKAPSPEEASAQES 265

Query: 307 LL---FPGR--LIEDAGNVKVGRDPHKVVK--PSKFH 336
            L   F  R  +I  A    +  D  KV    P KFH
Sbjct: 266 ALPRPFTSRSGIIFGAKISSLCGDGRKVSTSLPKKFH 302


>gi|317037751|ref|XP_001399065.2| hypothetical protein ANI_1_1452164 [Aspergillus niger CBS 513.88]
          Length = 525

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 63/239 (26%), Positives = 123/239 (51%), Gaps = 12/239 (5%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ- 120
           V D  +  F P+GYP SV + Y  Y  F +LQ F+S+   +LS++++L   G+    A  
Sbjct: 68  VQDFLVEVFLPAGYPHSVTDDYAPYQLFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASP 127

Query: 121 -ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ- 177
            A ++  IL+D    +  ++ ++ +G  ++ E K +R+ AD   DL   L+ LSP+ P  
Sbjct: 128 TAALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMILDCLSPMIPAG 187

Query: 178 ----LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
                 L  AG+    + +  VA  +++  + + F++ GNL+++ AK  +  T+ +++G+
Sbjct: 188 APRVTVLSTAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGM 244

Query: 234 GAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
             G  + S V ++     I   +L  +H+      +R+  + +LN QR  ++ +  + T
Sbjct: 245 LVGSFVVSRV-TNFSTTWICLLMLQALHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNT 302


>gi|310800465|gb|EFQ35358.1| hypothetical protein GLRG_10502 [Glomerella graminicola M1.001]
          Length = 416

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 77/256 (30%), Positives = 126/256 (49%), Gaps = 11/256 (4%)

Query: 68  NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVS 125
           + F P GYP SV   YL Y  F +LQ F S    +LS +++L   G+    + AT  ++ 
Sbjct: 7   DAFLPIGYPDSVTPDYLGYQTFDSLQAFFSTITGLLSNRAILQGLGVGDPNSSATYALLL 66

Query: 126 WILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMA 183
            ILKDG+  V  ++ +   G  ++ E KR+R LAD   D    L + SP   P   +   
Sbjct: 67  TILKDGISRVATIVFAYRFGLVIEPECKRYRFLADIFNDSAFFLNLFSPYFDPWTKVGAL 126

Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
            L      +  VAA A++  +   FA+  NLS+L AK  +  T   +VGL  G  +   V
Sbjct: 127 VLAEALGAMCGVAAGASKAALSKHFARRNNLSELNAKEASQETAVGLVGLLVGSVVVRFV 186

Query: 244 CSSMQGKMIVGPLLS---VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
               +G+  V  L++    +H+      +R   + TLN QR  ++   + +T ++ +PA 
Sbjct: 187 ----EGREAVFALMTFLVFVHLGMNYLGVRCVQLATLNQQRATILFEHYARTKQVLTPAQ 242

Query: 301 LRYREDLLFPGRLIED 316
           +  RE++LF   +I++
Sbjct: 243 VASRENILFWSPVIKN 258


>gi|328773288|gb|EGF83325.1| hypothetical protein BATDEDRAFT_85860 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 434

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 104/433 (24%), Positives = 181/433 (41%), Gaps = 53/433 (12%)

Query: 41  FQPDGHLSVKVVDDSRPVYHRVVDS-FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAA 99
           +  D H S    D S   +   V+  F N F P GYP SV+  Y +   +   ++   +A
Sbjct: 10  YASDRHPSTTRSDQSHIRFLGHVNQWFRNMFLPVGYPSSVHACYKKVHMWLFAENVAGSA 69

Query: 100 LSVLSTQSLLFAAG----LRPTPAQATVVSWILKDGMQHVGK-LICSNLGARMDSEPKRW 154
           +SVL+ Q++L + G       T A A  V W+LKDG+  +GK L+      + D+ PKRW
Sbjct: 70  ISVLTAQAMLTSIGASTAAHETAALAIAVDWVLKDGIGELGKMLMIQRFAHQFDTHPKRW 129

Query: 155 RILADALYDLGTGLEVLSPLC-PQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG- 212
           ++  +A    G  +++ + +  P  FL  A +G   + +      AT      + A    
Sbjct: 130 KLYGEACSISGALMQLATCITNPHHFLFFASIGVGLRSMHYSIWAATHTTFTRNMATHNG 189

Query: 213 -NLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVG--PLLSVIHVYSVIEEM 269
            N+ D+ AK ++  +L +++G+ +GI + +    S Q   +     +LS I +      +
Sbjct: 190 VNVGDIVAKADSQLSLAHLLGMVSGIGMLAV---SFQPAALFAWFSVLSAIQIVCTGLLL 246

Query: 270 RAAPVNTLNPQRTAMIVADFVKTG---RISSPADLRYREDL----LFPGRLIEDAGNVKV 322
           R A    L+  R  +I  +F+      RI +   +   E+     L  G+++ D   + +
Sbjct: 247 REARFEVLDQYRLVLIPREFMSEKNALRIPTLDQVVRFENWLNEGLSKGQVVTD---LSI 303

Query: 323 GRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTD---------MVLEQNASGEDALRGW 373
           G       K S+     ++F   K++  LG +  D         +VL Q+      ++  
Sbjct: 304 GVTVDDAFKHSRLSHALQIFKNGKYL--LGFKLADSTHSQIKVYVVLHQSIQPVHVIQA- 360

Query: 374 LVAAYAASMAK-------------SFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
             A +A    K             S HD    +L+ +     D F   L  L+ + W TD
Sbjct: 361 --AFHAVRFEKEIIRQTQSTGAPISLHDAE-NILESSLAWTLDNFPAYLDGLEEQKWKTD 417

Query: 421 RFLDG-TGTRFAW 432
             + G  G R  W
Sbjct: 418 LIIWGDRGIRVVW 430


>gi|223974077|gb|ACN31226.1| unknown [Zea mays]
          Length = 207

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 54/164 (32%), Positives = 87/164 (53%), Gaps = 3/164 (1%)

Query: 67  LNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVV 124
           L  F P G+P SV   Y+ +  +  LQ  ++   ++LSTQ+LL A G+    A       
Sbjct: 2   LGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGATF 61

Query: 125 SWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
            W L+D    +G ++ +   G+ +DS  K WR++AD + DLG  +++LSPL P   + + 
Sbjct: 62  QWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIM 121

Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTL 227
            LG+ ++    VA+ ATR  +   FA   N +D+ AK  +  TL
Sbjct: 122 CLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETL 165


>gi|302837684|ref|XP_002950401.1| hypothetical protein VOLCADRAFT_90737 [Volvox carteri f.
           nagariensis]
 gi|300264406|gb|EFJ48602.1| hypothetical protein VOLCADRAFT_90737 [Volvox carteri f.
           nagariensis]
          Length = 500

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 67/235 (28%), Positives = 123/235 (52%), Gaps = 16/235 (6%)

Query: 92  LQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLICSNL-GARMD 148
           +Q  +S    VL++Q++L   G+    +   A V ++ ++D    +G ++ S L G+  D
Sbjct: 10  IQALSSYVRGVLTSQAILRGVGVGQEASTPLAAVFTFFVRDLAGMLGGILFSYLEGSSFD 69

Query: 149 SEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSS 207
           +  K+WR+ AD   DLG  +E+ SPL P+ +FL +A LG+ A+ +  VA  ATR  +   
Sbjct: 70  ACAKQWRLFADITNDLGMTVELASPLFPRVMFLPIACLGSVARSLTGVAGGATRAALTQH 129

Query: 208 FAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS--------SMQGKMIVGPL--- 256
           FA+ GN +D+ AK ++  T   ++G+  G+ +   + +        +     ++  L   
Sbjct: 130 FARRGNAADVSAKEQSQETATTILGMVTGMAVTRLLAAGEGPAGGPTGAAGFVIAWLVFL 189

Query: 257 -LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
            L+ +HV + +  MR   + +LN  R  ++V  +++ GR+ SP  +   EDL  P
Sbjct: 190 GLTAVHVVANVAAMRVLLLTSLNTPRLELLVDRYLQDGRVLSPRAVSQLEDLTPP 244


>gi|218187487|gb|EEC69914.1| hypothetical protein OsI_00335 [Oryza sativa Indica Group]
          Length = 386

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 60/184 (32%), Positives = 90/184 (48%), Gaps = 19/184 (10%)

Query: 53  DDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAA 112
           +D+     R +  F+    P+G+P SV++ YL Y  ++   + T      L T +LL A 
Sbjct: 57  EDANSSIPRAIKDFV---LPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAV 113

Query: 113 GL-----RPTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGT 166
           G+         A A  + W+ KDG+   G+L I    G   D +PK+WR+ AD +   G+
Sbjct: 114 GVGSFTGTSAAASAAAIRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGS 173

Query: 167 GLEVLSPLCPQLFLEMAGLGNFAKGI------AVVAARATRLP----IYSSFAKEGNLSD 216
             E+++PL P  FL +A LGN AK I           R  R P    I + FA+ GNL +
Sbjct: 174 IFELITPLYPGYFLPLASLGNLAKKIIDSGLLKQAVGRGFRDPSFRVIQNHFAESGNLGE 233

Query: 217 LFAK 220
           + AK
Sbjct: 234 VAAK 237


>gi|428170130|gb|EKX39058.1| hypothetical protein GUITHDRAFT_114933 [Guillardia theta CCMP2712]
          Length = 405

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 76/265 (28%), Positives = 109/265 (41%), Gaps = 59/265 (22%)

Query: 50  KVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL 109
           K V  +  V  +V  SFL        P  V+E Y  +T +R LQ   S+ +SV  TQSLL
Sbjct: 82  KFVRKASNVAFKVKQSFL--------PRGVSEDYYSFTIWRILQRIVSSTVSVFGTQSLL 133

Query: 110 FAAGLRPTPAQATVVS-WILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTG 167
            A G++         S W+  +     GK+I  +  G   DS+ KRWR            
Sbjct: 134 LALGVKTNKVGVAAASGWVFANAFGKFGKMIFAARWGRDFDSDAKRWR------------ 181

Query: 168 LEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFA-KEGNLSDLFAKGEAIST 226
                                          ATR   Y  FA +  NL D+ AKG+A   
Sbjct: 182 ------------------------------SATRGTFYKQFAGRNENLGDITAKGDAQVA 211

Query: 227 LFNVVGLGAGIQLA---STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
           + ++VGL  GI+LA    T    + G  +V   LS +    +++ +       LN +R  
Sbjct: 212 VADLVGLVVGIELARLAGTNALCVWGSYVV---LSALDFIFILKSLDTIVFRFLNLERAT 268

Query: 284 MIVADFVKTGRISSPADLRYREDLL 308
           M+ + FV  G I SPAD+   E +L
Sbjct: 269 MLASAFVSEGIIKSPADISRSEPVL 293


>gi|317143185|ref|XP_001819302.2| hypothetical protein AOR_1_114154 [Aspergillus oryzae RIB40]
          Length = 510

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 61/231 (26%), Positives = 122/231 (52%), Gaps = 12/231 (5%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P+GYP SV++ Y+ Y  F +LQ F+S+   +LS++++L   G+    A   A ++  I
Sbjct: 75  FLPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHI 134

Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF-----LE 181
           L+D    +  ++ ++ +G  ++ E K +R+ AD   D    L+ LSP+ P  F     L 
Sbjct: 135 LQDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLS 194

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
            AG+    + +  VA  +++  + + F++ GNL+++ AK  +  T+ +++G+  G  + S
Sbjct: 195 TAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLVGSFVVS 251

Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
            V +S     I   +L  +H+      +R+  + +LN QR  ++ +  + +
Sbjct: 252 RV-TSYTATWISLVMLLTMHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNS 301


>gi|119187997|ref|XP_001244605.1| hypothetical protein CIMG_04046 [Coccidioides immitis RS]
          Length = 472

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 106/450 (23%), Positives = 188/450 (41%), Gaps = 94/450 (20%)

Query: 43  PDGHLSVKVVDDSRPVYHRVVDSF----LNKFFPSGYPFSVNEGYLRYTQF--------- 89
           P   +  K V  S P   R   S+    L+ F P GYP SV+  YL Y  +         
Sbjct: 26  PAASIGTKPVTISAPTSRRTWSSYFTAILDAFLPVGYPHSVSGDYLEYQVYVRPIFLKLH 85

Query: 90  --------------RALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGM 132
                          +LQ F+S+   +LS++++L   G+     +P  A ++S IL+D M
Sbjct: 86  WSFTDVANNRLFVQDSLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLS-ILQDSM 144

Query: 133 QHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAK 190
             +  ++ ++ LG  ++ E K +R+ AD   D    ++ LSP+ P+ + + +  L +  +
Sbjct: 145 GRIATILFAHQLGTSLEPECKMYRLAADIFNDSSMIMDCLSPMFPKSIRVGLLSLSSVLR 204

Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
            +  VAA +++  + + FAK GNL++L A G  +                S + S +   
Sbjct: 205 ALCGVAAGSSKASLSAHFAKWGNLAELNAVGSLV---------------VSYITSPLATW 249

Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF-----VKTGRISSPADLRYRE 305
           + +  LL ++H+ +    +RA  + +LN QR  ++ +       V T + +S  +  +  
Sbjct: 250 VALISLL-IVHITTNYLAVRAVSMTSLNRQRANIVFSTLFDENRVLTAKQASKQERVFER 308

Query: 306 DLLFPGRLIEDA-GNVKVG---RDPHKVVKPSKFHELKE--------LFAEEKFILCLGS 353
           D +   +   D  G  ++G   +D    V  S   E  E        +F +E++IL    
Sbjct: 309 DGVLRWKASSDTLGFCQIGVPFKDVIDHVSSSSPAERTEATIVAILRVFEQEEYIL---- 364

Query: 354 RWTD-------MVLEQNASGEDALRGWLVAAYAASMAKSFHDPS---------------- 390
            W D       + L+ NAS    L+ W  A  AA+  K     +                
Sbjct: 365 -WFDPARKRGVIGLKTNASPTSQLKAWSHALLAANSLKIKESNTRGPLVREQVGTDSVLM 423

Query: 391 LTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
           L +LQD   + +  F   +  L+  GW  D
Sbjct: 424 LGILQDTLAEHSKDFDARIESLRDAGWDID 453


>gi|358373469|dbj|GAA90067.1| DUF647 domain protein [Aspergillus kawachii IFO 4308]
          Length = 527

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 64/239 (26%), Positives = 124/239 (51%), Gaps = 12/239 (5%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ- 120
           V+D  +  F P+GYP SV + Y  Y  F +LQ F+S+   +LS++++L   G+    A  
Sbjct: 68  VLDFLVEVFLPAGYPHSVTDDYAPYQLFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASP 127

Query: 121 -ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ- 177
            A ++  IL+D    +  ++ ++ +G  ++ E K +R+ AD   DL   L+ LSP+ P  
Sbjct: 128 TAALLLHILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMILDCLSPMIPAG 187

Query: 178 ----LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL 233
                 L  AG+    + +  VA  +++  + + F++ GNL+++ AK  +  T+ +++G+
Sbjct: 188 APRVTVLSTAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGM 244

Query: 234 GAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
             G  + S V +S     I   +L  +H+      +R+  + +LN QR  ++ +  + T
Sbjct: 245 LVGSFVVSRV-TSFSTTWICLLMLLALHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNT 302


>gi|397627073|gb|EJK68340.1| hypothetical protein THAOC_10484 [Thalassiosira oceanica]
          Length = 478

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 78/275 (28%), Positives = 124/275 (45%), Gaps = 27/275 (9%)

Query: 82  GYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLI 139
           GYL Y     LQ  T+A  SVL+TQ +L   G+      A A  + ++++DG   +  L+
Sbjct: 92  GYLEYVCMDNLQDLTTALRSVLATQRVLEGVGVGREGATALAATLDFLIRDGCGMIASLV 151

Query: 140 -CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP-QLFLEMAGLGNFAKGIAVVAA 197
             S   +    + KRW++ ADA  D G  LEV++P  P + FL +    N  K +  VAA
Sbjct: 152 LTSGFSSTFRRDIKRWKLFADAAVDAGITLEVVAPSLPSRWFLPVLCGANVCKALCGVAA 211

Query: 198 RATRLPIYSSFAK-----EGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMI 252
            A   P+   +A      +  LS++ AK  A  T+    GL     L+      M G+  
Sbjct: 212 GAVCGPVQLHWAARLGLGDEGLSEVAAKAGAQRTIVGATGL----VLSGLFARWMGGRGA 267

Query: 253 VGP-------LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV-------KTGRISSP 298
             P       +L+ +H+ +    MR   +N LN +R  +I+ DF+        T  +  P
Sbjct: 268 NRPVWVGMYAVLTALHLLANYRSMRIVRLNWLNRERMDLILEDFLGRDGGSGDTLGLIDP 327

Query: 299 ADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPS 333
           A     E L+F  R+      V++G   + +V+ S
Sbjct: 328 AGAARVESLVFRPRVRRHQCGVRLGVSFNDLVRCS 362


>gi|345563243|gb|EGX46246.1| hypothetical protein AOL_s00110g70 [Arthrobotrys oligospora ATCC
           24927]
          Length = 445

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 87/394 (22%), Positives = 166/394 (42%), Gaps = 62/394 (15%)

Query: 63  VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
           + + +N F P+GYP SV + YL                          A G   + A  T
Sbjct: 54  IQNIINIFLPAGYPKSVTDDYLDGD----------------------LAVGFEDSNASTT 91

Query: 123 --VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-- 177
             V+  I++D +  V  ++ ++ LG+ ++ E KR+R  AD   D    ++ LSP+ P+  
Sbjct: 92  AAVLLKIVQDSIGRVATILFAHRLGSSLEPECKRFRFAADIFNDSALIVDCLSPVFPKPL 151

Query: 178 --LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL-- 233
             + L  AG     + +  VA   ++  +   F+K GN+++L AK  +  T+ +++G+  
Sbjct: 152 RVVLLSTAGT---LRALCGVAGGGSKAQLSVHFSKWGNVAELNAKDSSQETVISLMGMLM 208

Query: 234 ----GAGIQLASTVCSSMQGKMIVGP---LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
                  ++  S V SS+  K         L  +H+++    ++A  + TLN QR   ++
Sbjct: 209 RADQTLRVKAGSLVISSVTSKTATWASMLFLIAVHLWTNYRAVKAVTMVTLNRQRANSVM 268

Query: 287 ADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVG------RDPHKVVKPS-----KF 335
           + F +TG + +P D+  +E +     ++       VG      +    V + S       
Sbjct: 269 STFAETGVMLTPKDVARQERIFERDGVLRWCDGAVVGWGVIGTKLSSIVARKSASGSLSL 328

Query: 336 HELKELFAEEKFILCLGSRWTD----MVLEQNASGEDALRGWLVAAYAASMAKSFHDPSL 391
            E+ +    E+++L  G +  D    + LE   +     R W+    +A +       ++
Sbjct: 329 AEVAKACPTEQYMLWFGDKLRDIRVCICLENGCTPAGQFRAWMHGLLSAKLLSDRRSQAV 388

Query: 392 T------VLQDAYDKMNDVFTPLLSELQAKGWHT 419
           T       L DA  +++ +F   L +    GW T
Sbjct: 389 TESDALRALVDAKRQLDGIFEEFLEKAAEAGWDT 422


>gi|392871321|gb|EAS33216.2| hypothetical protein CIMG_04046 [Coccidioides immitis RS]
          Length = 407

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 91/375 (24%), Positives = 174/375 (46%), Gaps = 52/375 (13%)

Query: 91  ALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSN-LGAR 146
           +LQ F+S+   +LS++++L   G+     +P  A ++S IL+D M  +  ++ ++ LG  
Sbjct: 21  SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLS-ILQDSMGRIATILFAHQLGTS 79

Query: 147 MDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIY 205
           ++ E K +R+ AD   D    ++ LSP+ P+ + + +  L +  + +  VAA +++  + 
Sbjct: 80  LEPECKMYRLAADIFNDSSMIMDCLSPMFPKSIRVGLLSLSSVLRALCGVAAGSSKASLS 139

Query: 206 SSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSV 265
           + FAK GNL++L AK  +  T+ +++G+  G  + S + S +   + +  LL ++H+ + 
Sbjct: 140 AHFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLL-IVHITTN 198

Query: 266 IEEMRAAPVNTLNPQRTAMIVADF-----VKTGRISSPADLRYREDLLFPGRLIEDA-GN 319
              +RA  + +LN QR  ++ +       V T + +S  +  +  D +   +   D  G 
Sbjct: 199 YLAVRAVSMTSLNRQRANIVFSTLFDENRVLTAKQASKQERVFERDGVLRWKASSDTLGF 258

Query: 320 VKVG---RDPHKVVKPSKFHELKE--------LFAEEKFILCLGSRWTD-------MVLE 361
            ++G   +D    V  S   E  E        +F +E++IL     W D       + L+
Sbjct: 259 CQIGVPFKDVIDHVSSSSPAERTEATIVAILRVFEQEEYIL-----WFDPARKRGVIGLK 313

Query: 362 QNASGEDALRGWLVAAYAASMAKSFHDPS----------------LTVLQDAYDKMNDVF 405
            NAS    L+ W  A  AA+  K     +                L +LQD   + +  F
Sbjct: 314 TNASPTSQLKAWSHALLAANSLKIKESNTRGPLVREQVGTDSVLMLGILQDTLAEHSKDF 373

Query: 406 TPLLSELQAKGWHTD 420
              +  L+  GW  D
Sbjct: 374 DARIESLRDAGWDID 388


>gi|409048331|gb|EKM57809.1| hypothetical protein PHACADRAFT_251666 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 532

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 66/235 (28%), Positives = 105/235 (44%), Gaps = 9/235 (3%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
           F P+ YP SV+  YL++   + ++      + VL  Q+LL + G+    +   A  V W+
Sbjct: 114 FLPTNYPQSVHPSYLKFHLLQFMECIVGTVVGVLCNQALLVSVGVSAEGSILGAVAVQWV 173

Query: 128 LKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQL---FLEMA 183
           +KDG   V KL +        DS PK +      +  LG+GL++ + + P     FL  A
Sbjct: 174 IKDGAGEVAKLWVIRRFSPYFDSHPKTFTFTGATMGLLGSGLQIAALVVPPTTLNFLLCA 233

Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV 243
             GN  K +       T +     FA++GN+ D+ AK E+  ++  + G  AG+ L  TV
Sbjct: 234 AGGNIFKCVGGAVWLTTHIKFVRFFARQGNMGDVAAKSESQGSVGILSGYAAGVGLL-TV 292

Query: 244 CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSP 298
             S      +  L   IH+      +RAA    L   R + +  +F    R S P
Sbjct: 293 SHSAPYLYTIFALAIPIHLLITQWMLRAATFELLTLPRLSFLAREF--ASRASRP 345


>gi|402582701|gb|EJW76646.1| hypothetical protein WUBG_12446, partial [Wuchereria bancrofti]
          Length = 184

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 45/135 (33%), Positives = 79/135 (58%), Gaps = 3/135 (2%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWI 127
           F P GYP SV+  Y+ Y  +  +Q F S+  S LST+++L   G+  +     A  V+W+
Sbjct: 43  FMPRGYPQSVSPDYMNYQIWDTIQAFASSMSSALSTEAILRGVGVGNKAASTMAAAVAWL 102

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           LKDG+  + +++ + L    +D++ K+WR++AD   DL   L++++P+ P LF+ +  L 
Sbjct: 103 LKDGIGMLARILFAWLYSPYLDADCKQWRLIADCFNDLAYCLDLITPIFPNLFMPIICLS 162

Query: 187 NFAKGIAVVAARATR 201
           +  + +  VA  ATR
Sbjct: 163 SMIRAVVGVAGSATR 177


>gi|378729487|gb|EHY55946.1| hypothetical protein HMPREF1120_04055 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 485

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 62/230 (26%), Positives = 116/230 (50%), Gaps = 11/230 (4%)

Query: 61  RVVDSFLNK----FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRP 116
           R   S LNK    F P+GYP +V   Y  Y  + +LQ F S    +L+++++    G+  
Sbjct: 45  RTGTSILNKCMSPFLPAGYPSTVTPDYTPYQIYDSLQAFASTIAGLLASRAVFVGMGVGS 104

Query: 117 TPAQATVVSWILKDGMQHVGK----LICSNLGARMDSEPKRWRILADALYDLGTGLEVLS 172
             A + + + +L    + VG+    L       R+++E K +R  AD + D+   L+ +S
Sbjct: 105 EEA-SVLTTMLLYIAQETVGRVATILFAHQFSQRIEAEVKFYRFFADIVNDVAFVLDCMS 163

Query: 173 PLCPQL-FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVV 231
           P  P L  +    L N  + +  VA  +++  + S FAK GN+ +L AK  +  T+ +++
Sbjct: 164 PAMPILGRVVTLCLSNACRAVCGVAGGSSKAILSSHFAKAGNIGELNAKDGSQETVISLL 223

Query: 232 GLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
           G+  G  + S V  ++     + PLL+ +H+++  + +R+  +N+ N  R
Sbjct: 224 GMWVGGVVVSKVHGTLATWSWLVPLLA-LHLWANWKAVRSVRLNSFNTNR 272


>gi|380480425|emb|CCF42443.1| hypothetical protein CH063_12442 [Colletotrichum higginsianum]
          Length = 484

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 71/289 (24%), Positives = 131/289 (45%), Gaps = 25/289 (8%)

Query: 40  QFQPDGHLSVKVVDDSRPVYHRVVD------------SFLNKFFPSGYPFSVNEGYLRYT 87
           ++   G+L  K V+ +  +    VD              L  F P GYP SV E Y  Y 
Sbjct: 12  EYDKAGNLKAKYVESASRIGPSRVDVLLPFKRKKPWLRLLEIFLPDGYPHSVTEDYAAYQ 71

Query: 88  QFRALQHFTSAALSVLSTQSLLFAAGLR---PTPAQATVVSWILKDGMQHVGKLICSNL- 143
            + ++Q F  +   ++S++++    G+     +P  A +V  ++++       +  ++L 
Sbjct: 72  IYDSVQAFAGSIAGMISSRAVWEGLGVGDSLASPTGAMLVQ-VIRESTGRFATIAFAHLF 130

Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ----LFLEMAGLGNFAKGIAVVAARA 199
           G  +++E K WR+ AD L D+   L+ LSP  P     L L  + L   A G   VA  A
Sbjct: 131 GTSIEAECKAWRLAADVLCDVAMVLDCLSPFFPHGVRFLVLCFSSLLYSASG---VAGNA 187

Query: 200 TRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSV 259
           ++  +   FAK  NL +L AK  +  T  +++G+ +G  + S + +  +        L  
Sbjct: 188 SKSSLSGHFAKWNNLGELNAKDASQETAISLIGMISGTFVVSLL-TDQRATWATLIFLLA 246

Query: 260 IHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
           +H++   + +RA    +LN QR  +  +     G++ +P  +  RE + 
Sbjct: 247 LHLFLNWKGVRAVKSRSLNRQRANIAFSALFSKGQVLTPHHVAERERIF 295


>gi|390599626|gb|EIN09022.1| DUF647-domain-containing protein [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 461

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 105/411 (25%), Positives = 173/411 (42%), Gaps = 52/411 (12%)

Query: 55  SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
           SR + H + D FL    P+GYP SV+  YLRY  + A Q F S+  ++++++++L   G+
Sbjct: 26  SRGLKHLLTDVFL----PAGYPSSVSPDYLRYQTYNAFQAFCSSLANLIASRAVLEGFGV 81

Query: 115 RPTPAQAT--VVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVL 171
               A AT  ++  +L+D    +  +     LG  +  E K +R+LAD   D    L+V 
Sbjct: 82  GNASASATHAMLLTVLQDVFGRLTTVAGGYYLGTALSPEAKTYRLLADVFNDASIVLDVF 141

Query: 172 SPLCPQL------FLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIS 225
            P    +       L +   G   + +  V A  ++  +   FA  G+L DL AK  +  
Sbjct: 142 QPFYASVPVAGARVLALCSSGCL-RALCGVCAGGSKAALAVHFASAGDLGDLNAKDGSRE 200

Query: 226 TLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
           T+  +VGL +G  L S + +       +  LL   H+      +RA  + TL  QR ++ 
Sbjct: 201 TVIGLVGLLSGTILLSYIGTPGATYAALCALLGA-HLALNYLAVRAVAMRTLTRQRASIA 259

Query: 286 VADFVK--TGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPH----------KVVKPS 333
              +++   G   +P ++  RE LL    LI D    ++GR                K S
Sbjct: 260 WRVWMERGAGAAPTPEEVAEREGLLADPTLIADGTGRRLGRCSFGGARSQGFIATTGKDS 319

Query: 334 KFHELKELFAEEKFILCL----GSRWTDMVLEQNASG-EDALRGWLVAAYA--------- 379
                  L    ++ L +    GS  T  +L +   G ED L+ W+ A  A         
Sbjct: 320 AMVSTSRLIEAARYWLLIDSKPGSIPTATILLKRVYGPEDLLKAWVHALRAVQDRRAESS 379

Query: 380 ----ASMAKSF---HDPSLTVLQDAYDKMND---VFTPLLSELQAKGWHTD 420
               A   ++F   H P      D +D+ ND   +F    + ++  GW  D
Sbjct: 380 LQSGARGPQTFNHGHGPDSAGHTD-HDEGNDSERLFERFAARMRDAGWRVD 429


>gi|164423593|ref|XP_001728067.1| hypothetical protein NCU11380 [Neurospora crassa OR74A]
 gi|157070159|gb|EDO64976.1| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 357

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 78/288 (27%), Positives = 133/288 (46%), Gaps = 37/288 (12%)

Query: 39  YQFQPDGHLSVKVVDDSRPVY--HRV----VDSFLNKFFPSGYPFSVNEGYLRYTQFRAL 92
           Y   P G++      +SR     H+     + + LN F P+GYP SV   YL Y  + +L
Sbjct: 6   YDIDPAGNVLCTYNGESRESLKAHQTKVAPLHALLNAFLPAGYPHSVTPDYLPYQTYDSL 65

Query: 93  QHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGK----LICSNLGARMD 148
           Q F S+  S+L+ +++L   G+    +  T  + ILK     + +    L    LG  ++
Sbjct: 66  QAFFSSISSLLANRAVLEGLGVGDASSSPT-AALILKITSDTISRFATILFAHRLGTAIE 124

Query: 149 SEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAV------------VA 196
            E K +R LAD L D    L++L P+       M G   +   + V            VA
Sbjct: 125 PECKFFRFLADVLNDAAQLLDLLMPVM------MMGETRYGLKLGVVVGSGVLRSWCGVA 178

Query: 197 ARATRLPIYSSF-AKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGP 255
           A A++  + + F  +  +L +L AK  +  T+ +++G+  G    S V   +Q +++V  
Sbjct: 179 AGASKASLSAHFVTQRSSLGELNAKEASQETVVSLLGMLVG----SLVVKVVQDRLVVWV 234

Query: 256 L---LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
           L   L  +H+      +R   + TLN QR  M+V  +++TGR+  P +
Sbjct: 235 LMVGLLGVHLLMNYRGVRCVRLRTLNRQRATMVVRSWLETGRVLGPEE 282


>gi|452984686|gb|EME84443.1| hypothetical protein MYCFIDRAFT_187449 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 435

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 104/391 (26%), Positives = 180/391 (46%), Gaps = 67/391 (17%)

Query: 19  MSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFS 78
           M+ Q + I  ++ +D+VT  Y  Q +G        D    + R VD FL    P G+P S
Sbjct: 1   MADQPLRITEYDEADTVTATY-VQSEGKSQGLSRVDVITAWQRAVDVFL----PVGFPHS 55

Query: 79  VNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHV 135
           V+E YL Y  + +LQ F+S+  S+LS++++L + G+     +P  A ++S IL+D +  +
Sbjct: 56  VSEDYLEYQIYDSLQAFSSSIASLLSSRAVLSSVGVGDSTASPTAAIILS-ILQDSVGRL 114

Query: 136 GK-LICSNLGARMDSEPKRWRILADAL-YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIA 193
              L   + G  ++ E K +R+LAD +   + +   VL  LC                  
Sbjct: 115 ATILFAHSFGTSLEPECKMYRLLADPIRVVVLSLSSVLRSLCG----------------- 157

Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIV 253
            VAA +++  + + FAK+GNL +L AK  +  T+ ++VG+ AG  + S + + +     +
Sbjct: 158 -VAAGSSKASLSAHFAKQGNLGELNAKDSSQETVISLVGMMAGSLVISWITTPIATWTAL 216

Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRL 313
             LLS IH+ +    +RA  + TL  QR A+I    V+  +  +P  +  +E +     +
Sbjct: 217 ILLLS-IHLETNRRAVRAVKMRTLTRQRAALIYYQ-VQQNKTPTPDFIASQERIFERDGV 274

Query: 314 IEDA-----GNVKVG----------RDPHKVVKPS-----KFHELKELFAEEKFILCLGS 353
           + +A     G   VG           + H   K S     KF  +  L+    FIL    
Sbjct: 275 LRNAKAEVLGFCSVGVPLARLLNSIGEQHSQTKSSTMASEKFTRILHLYQTSAFIL---- 330

Query: 354 RWTD-----------MVLEQNASGEDALRGW 373
            W+D           +V++++A   D +  W
Sbjct: 331 -WSDRHSRSRRNSLFIVVKKSAVSRDLILAW 360


>gi|255574377|ref|XP_002528102.1| conserved hypothetical protein [Ricinus communis]
 gi|223532491|gb|EEF34281.1| conserved hypothetical protein [Ricinus communis]
          Length = 485

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 84/379 (22%), Positives = 156/379 (41%), Gaps = 41/379 (10%)

Query: 71  FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV------- 123
            P+G+P SV++ Y +Y   +   + T      L T SLL A G+                
Sbjct: 103 LPAGFPGSVSDDYFQYMLLQFPTNVTGWICHTLVTSSLLKAVGVGSFTGSTAAAAASAAA 162

Query: 124 VSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
           + W+ KDG+  +G+L I    G+  D +PK+WR+ AD +   G+  ++++ + P  FL +
Sbjct: 163 IRWVSKDGIGALGRLFIGGRFGSLFDDDPKQWRMYADFIGSAGSIFDLITQVYPAYFLPL 222

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST 242
           A LGN  K +A      +   I + FA  GNL ++ AK E        VG          
Sbjct: 223 ASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWE-----VGAQLLGLALGI 277

Query: 243 VCSSMQGKMIVGPLLS-------VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
           +     G +   P L+       ++H++     +     +T+N +R  ++V   +    +
Sbjct: 278 LILDTPGLVKSFPALASTWLSMRLLHLWLRYLSLSVLQFDTINLKRARLLVKSHILHSVV 337

Query: 296 SSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV----KPSKFHELKELFAEEKFILCL 351
               D    E++L      +    +  G    ++V      ++   L +L+ +E ++L +
Sbjct: 338 LGCVDCNREENILLWETFTKP--QITFGVPLQEMVGGERSAAQVKTLLKLYTKENYLLVV 395

Query: 352 GSRWTD----MVLEQNASGEDALRG-----WLVAAYAASMAKSFHDPSLTVLQDAYDKMN 402
             +  D    +  +  A+   ALR      WL   + +S         L  L  +  +M 
Sbjct: 396 NQQKGDFEVFVSFKVGATSMSALRSVWQTYWLHENWKSSTN------VLDKLAQSLLEME 449

Query: 403 DVFTPLLSELQAKGWHTDR 421
             F   + +L+  GW+T +
Sbjct: 450 VKFEDFVQQLEVAGWNTHQ 468


>gi|67539168|ref|XP_663358.1| hypothetical protein AN5754.2 [Aspergillus nidulans FGSC A4]
 gi|40743657|gb|EAA62847.1| hypothetical protein AN5754.2 [Aspergillus nidulans FGSC A4]
          Length = 559

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 78/301 (25%), Positives = 148/301 (49%), Gaps = 25/301 (8%)

Query: 64  DSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQ 120
           D  +  F P+GYP SV++ Y  Y  F +LQ F+S+   +L+++++L   G+     +P  
Sbjct: 67  DVLIEVFLPAGYPHSVSDDYTAYQIFDSLQAFSSSIAGLLASRAVLQGVGVGNANASPTS 126

Query: 121 ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF 179
           A ++  IL+D    +  ++ ++ +G  ++ E K +R  AD   DL   L+  SP+ P  F
Sbjct: 127 ALLLH-ILQDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCFSPMVPAGF 185

Query: 180 LEMAGL--GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGI 237
             +  L      + +  VA  +++  + + FA+ GNL+++ A+     T+ +++G+  G 
Sbjct: 186 NRVTVLSAAGVLRALCGVAGGSSKASLSAHFARWGNLAEVNAE-----TVISLIGMLVGS 240

Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
            + S + S     + +  LLS +H+      +R+  + TLN QR     A+ V +  +SS
Sbjct: 241 FVVSHITSFTATWLTLVFLLS-MHLSLNYAAVRSVQMTTLNRQR-----ANIVFSTLLSS 294

Query: 298 PADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELK-ELFAEEKFILCLGS--R 354
             DL   + +  P R  E   ++K     H+   P+    +  E  A+E+ I   G+  +
Sbjct: 295 DPDL--ADFVSSPSRKDERRASLKQQEHTHQ--NPNHIRPITPEQVAKEERIFAAGAALK 350

Query: 355 W 355
           W
Sbjct: 351 W 351


>gi|170102118|ref|XP_001882275.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164642647|gb|EDR06902.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 285

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 79/288 (27%), Positives = 141/288 (48%), Gaps = 28/288 (9%)

Query: 59  YHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
           + RV+D     F P+GYP SV+  YLRY    ALQ F ++   +LS++++L   G+    
Sbjct: 1   HRRVLDLISKVFLPAGYPDSVSPDYLRYQILNALQAFCNSLAGLLSSRAILEGFGVGDPS 60

Query: 119 AQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           A AT  ++  +L+D    +  ++ ++ LG+ +  + K++R+LAD L D    L+ +SP  
Sbjct: 61  ATATNALLLTVLQDVFSRLTTIVAAHILGSSLAPDAKKYRLLADMLNDAAVILDTISPRL 120

Query: 176 PQLF---LEMAGLGNFA--KGIAVVAARATRLPIYSSFAK----EGNLSDLFAKGEAIST 226
             LF   L +A L   A  + +  ++A  ++  I   FA      G++ DL AK  +  T
Sbjct: 121 DTLFFPGLRVAALCLSALFRSLCAISAGGSKASISLHFATPLKGTGDVGDLNAKDASKET 180

Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEM--RAAPVNTLNPQRTAM 284
           +  ++G+  G  +   + +      I   LL ++ ++  I  +  R   + TLN QRT +
Sbjct: 181 VLALLGMLLGTLIVPHLTTPWT---IYATLLVLVGLHLTINYIGVRGLVLRTLNGQRTWI 237

Query: 285 IVADFVKTG--RISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV 330
             + +++T   R  +P ++   E      R+ E  G     RDP + V
Sbjct: 238 AWSAYMRTQPRRAPTPVEVASLE------RIFERPGGF---RDPRRGV 276


>gi|414586422|tpg|DAA36993.1| TPA: hypothetical protein ZEAMMB73_408601, partial [Zea mays]
          Length = 72

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 39/70 (55%), Positives = 53/70 (75%), Gaps = 1/70 (1%)

Query: 358 MVLEQNASGEDALRGWLVAAYAASMAKS-FHDPSLTVLQDAYDKMNDVFTPLLSELQAKG 416
           MVLE++ASGEDALRGWLVAA+A+ M +S        VL +AY+KM  VF   +SE++++G
Sbjct: 1   MVLERSASGEDALRGWLVAAFASEMERSGAGSRDDAVLSEAYEKMERVFPTFVSEVRSRG 60

Query: 417 WHTDRFLDGT 426
           W+TD+FLDG 
Sbjct: 61  WYTDQFLDGN 70


>gi|296826902|ref|XP_002851048.1| DUF647 domain-containing protein [Arthroderma otae CBS 113480]
 gi|238838602|gb|EEQ28264.1| DUF647 domain-containing protein [Arthroderma otae CBS 113480]
          Length = 479

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 104/439 (23%), Positives = 188/439 (42%), Gaps = 96/439 (21%)

Query: 56  RPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQF---------------------RALQH 94
           +P     + S LN F PSGYP SV + YL Y  +                      +LQ 
Sbjct: 47  QPSSTSYLTSILNVFLPSGYPHSVTDDYLEYQIYVSQQPFPSSDRKEKKLTVSYQDSLQA 106

Query: 95  FTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNLGARMDSEPK 152
           F+S+   +LS++++L   G+    A   A ++  +L++ M  +  ++ +           
Sbjct: 107 FSSSIAGLLSSRAVLQGVGVGDATASPTAALLLNVLQESMGRIATILFA----------- 155

Query: 153 RWRILADALYDLGTGLE---VLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSF 208
                    + LGT LE    LSP+ P+ + +    L +  + +  VAA +++  + + F
Sbjct: 156 ---------HRLGTSLEPEYCLSPVFPKPVRVGFLSLSSVLRALCGVAAGSSKASLSAHF 206

Query: 209 AKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEE 268
           A+ GNL +L AK  +  T+ ++VG+  G  + + V S +   + +  LL+ IH+ +    
Sbjct: 207 ARWGNLGELNAKDSSQETVISLVGMLVGSVVVAYVSSPLATWVALLFLLA-IHLATNHAA 265

Query: 269 MRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP--GRLIEDA------GNV 320
           +RA  + TLN QR  ++ +      R+ +PA +  +E+ +F   G L   A      G  
Sbjct: 266 VRAVNMTTLNRQRANIVFSSLFDDDRVLTPAQVS-KEERIFERDGVLRWKAEATGVLGTC 324

Query: 321 KVG--------RDPHK----VVKPSKFHE-----------LKELFAEEKFILCLGSR--W 355
           +VG          PH       K S+  E           L +LF EE+++L    R   
Sbjct: 325 RVGVSLEQLLLLLPHSQTNDASKTSRRSEAGADFGVSLPVLVDLFKEEEYLLYFSPRDHA 384

Query: 356 TDMVLEQNASGEDALRGW----LVAAYAA--SMAKSFHDPS--------LTVLQDAYDKM 401
             ++L+   + +  L+ W    LV+   A  S +   ++          L++L+D   + 
Sbjct: 385 AVIILKAGVTAQAQLKAWSHALLVSRRLAGRSTSACINEKQGGQNELSVLSILRDTLREH 444

Query: 402 NDVFTPLLSELQAKGWHTD 420
           +  F   +  L+  GW TD
Sbjct: 445 SKAFDDRVERLKQAGWETD 463


>gi|303316636|ref|XP_003068320.1| hypothetical protein CPC735_003440 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240108001|gb|EER26175.1| hypothetical protein CPC735_003440 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 478

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 89/375 (23%), Positives = 174/375 (46%), Gaps = 52/375 (13%)

Query: 91  ALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSN-LGAR 146
           +LQ F+S+   +LS++++L   G+     +P  A ++S IL+D M  +  ++ ++ LG  
Sbjct: 92  SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLS-ILQDSMGRIATILFAHQLGTS 150

Query: 147 MDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIY 205
           ++ E K +R+ AD   D    ++ LSP  P+ + + +  L +  + +  VAA +++  + 
Sbjct: 151 LEPECKMYRLAADIFNDSSMIMDCLSPTFPKSIRVGLLSLSSVLRALCGVAAGSSKASLS 210

Query: 206 SSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSV 265
           + FAK GNL++L AK  +  T+ +++G+  G  + S + S +   + +  LL ++H+ + 
Sbjct: 211 AHFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLL-IVHITTN 269

Query: 266 IEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE-----DLLFPGRLIEDA-GN 319
              +RA  + +LN QR  ++ +      R+ +P     +E     D +   +   D  G 
Sbjct: 270 YLAVRAVSMTSLNRQRANIVFSTLFDENRVLTPKQASKQERVFERDGVLRWKASSDTLGF 329

Query: 320 VKVGRDPHKVV------KPSKFHE-----LKELFAEEKFILCLGSRWTD-------MVLE 361
            ++G     V+       P++  E     +  +F +E++IL     W D       + L+
Sbjct: 330 CQIGVPFKDVIDHISSSSPAERTEATVVAILRVFEQEEYIL-----WFDPARKRGVIGLK 384

Query: 362 QNASGEDALRGWLVAAYAASMAK----SFHDP------------SLTVLQDAYDKMNDVF 405
            NAS    L+ W  A  AA+  K    +  +P             L +LQ    + +  F
Sbjct: 385 TNASPTSQLKAWSHALLAANSLKIKESNTREPLVREQVGTDSVLMLGILQGTLAEHSKDF 444

Query: 406 TPLLSELQAKGWHTD 420
              +  L+  GW  D
Sbjct: 445 DARIESLRDAGWDID 459


>gi|320038116|gb|EFW20052.1| hypothetical protein CPSG_03227 [Coccidioides posadasii str.
           Silveira]
          Length = 478

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 89/375 (23%), Positives = 174/375 (46%), Gaps = 52/375 (13%)

Query: 91  ALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSN-LGAR 146
           +LQ F+S+   +LS++++L   G+     +P  A ++S IL+D M  +  ++ ++ LG  
Sbjct: 92  SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLS-ILQDSMGRIATILFAHQLGTS 150

Query: 147 MDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIY 205
           ++ E K +R+ AD   D    ++ LSP  P+ + + +  L +  + +  VAA +++  + 
Sbjct: 151 LEPECKMYRLAADIFNDSSMIMDCLSPTFPKSIRVGLLSLSSVLRALCGVAAGSSKASLS 210

Query: 206 SSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSV 265
           + FAK GNL++L AK  +  T+ +++G+  G  + S + S +   + +  LL ++H+ + 
Sbjct: 211 AHFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLL-IVHITTN 269

Query: 266 IEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE-----DLLFPGRLIEDA-GN 319
              +RA  + +LN QR  ++ +      R+ +P     +E     D +   +   D  G 
Sbjct: 270 YLAVRAVSMTSLNRQRANIVFSTLFDENRVLTPKQASKQERVFERDGVLRWKASSDTLGF 329

Query: 320 VKVGRDPHKVV------KPSKFHE-----LKELFAEEKFILCLGSRWTD-------MVLE 361
            ++G     V+       P++  E     +  +F +E++IL     W D       + L+
Sbjct: 330 CQIGVPFKDVIDHISSSSPAERTEATVVAILRVFEQEEYIL-----WFDPARKRGVIGLK 384

Query: 362 QNASGEDALRGWLVAAYAASMAK----SFHDP------------SLTVLQDAYDKMNDVF 405
            NAS    L+ W  A  AA+  K    +  +P             L +LQ    + +  F
Sbjct: 385 TNASPTSQLKAWSHALLAANSFKIKESNTREPLVREQVGTDSVLMLGILQGTLAEHSKDF 444

Query: 406 TPLLSELQAKGWHTD 420
              +  L+  GW  D
Sbjct: 445 DARIESLRDAGWDID 459


>gi|326479045|gb|EGE03055.1| DUF647 domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 494

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 92/411 (22%), Positives = 162/411 (39%), Gaps = 108/411 (26%)

Query: 63  VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
           + S LN F P+GYP SV + Y+ Y  + ++    +          +LFA           
Sbjct: 123 LTSILNVFLPAGYPHSVTDDYIEYQIYESMGRIAT----------ILFA----------- 161

Query: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLE 181
                               LG  ++ E K +R+ AD L D    L+ LSP+ P+ + + 
Sbjct: 162 ------------------HRLGTSLEPECKMYRLAADILNDSAMVLDCLSPIFPKPVRVG 203

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
           +  L +  + +  VAA +++  + + FA+ GNL +L A G  +                S
Sbjct: 204 LLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAVGSVV---------------VS 248

Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
            + S ++  + +  LL ++H+ +    +RA  + TLN QR  ++ +   +   I +PA+ 
Sbjct: 249 YITSPLETWIALIVLL-IVHLGTNHAAVRAVKMTTLNRQRANIVFSHLFEDNLILTPAET 307

Query: 302 RYREDLLFPGRLIE-----------------------------DAGNVKVGRDPHKVVKP 332
              E +     ++                              D G ++  R     +  
Sbjct: 308 SKEERIFERDGVLRWKAESSGILGTCQIGTSLEQLLLLLPEQADRGALETSRTTSTTMDA 367

Query: 333 S-KFHELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGW----LVAAYAA----S 381
           S     L ELF EE++IL  G  SR   +VL+  A+ +  L+ W    LV+ + A    +
Sbjct: 368 STNLTALLELFREEEYILYFGPLSRRGAIVLKAGATAQAQLKAWSHALLVSRHLARKNGT 427

Query: 382 MAKSFHD---------PS---LTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
                H+         PS   L++L+D   +    F   +  L+  GW TD
Sbjct: 428 ATSCIHEKQGGQGELSPSVSVLSILRDTLQEHTRTFKDRVKRLEHAGWQTD 478


>gi|303280291|ref|XP_003059438.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459274|gb|EEH56570.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 568

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 79/161 (49%), Gaps = 7/161 (4%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ---ATVVSW 126
           F P+GYP +V+  Y  + ++          L VL++QSLL A G+  TP     A    W
Sbjct: 150 FLPAGYPDTVSSDYATFIRWHLASLMFRNVLEVLTSQSLLVALGMGSTPGALPLAAATKW 209

Query: 127 ILKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
           +LKDG+     L+    G  + D +PKRW  + ++L D+   LE+++P  P LFL +A  
Sbjct: 210 VLKDGVGSFATLLAGVFGGQKYDEDPKRWWAVTNSLEDVARVLELVTPAYPALFLPLAAS 269

Query: 186 GNFAKGIAVVAARATRL--PIYSSFAKEGNLSDLFAKGEAI 224
             F +  A+   R + +         +  N  D+ AK EA+
Sbjct: 270 ATFVRAGALT-GRGSLINGTFMQHLGRNQNFGDVRAKMEAL 309


>gi|346978906|gb|EGY22358.1| hypothetical protein VDAG_03796 [Verticillium dahliae VdLs.17]
          Length = 455

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 60/262 (22%), Positives = 125/262 (47%), Gaps = 9/262 (3%)

Query: 68  NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVS 125
           + F P  YP SV+  Y++Y  + + Q F S    +L+ +++L   G+          ++ 
Sbjct: 79  DAFLPVDYPDSVSPDYIKYQMYDSFQAFFSTITGMLANRAILEGLGVGDASGSVNHALLL 138

Query: 126 WILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLC---PQLFLE 181
             LKDG+  +  +  +   G+ +++  K++R LAD   D    LE+ +P      ++ + 
Sbjct: 139 TCLKDGVSRLATITFAYRFGSVIEANAKKYRFLADLFNDTAFFLELANPFFAGWAKILIL 198

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
           ++      + +  +AA A++  + + FA+  NL++L  K  +  T   ++GL AG  +  
Sbjct: 199 VSAES--LRALCGIAAGASKAALSTHFARRNNLAELNTKEASQETAIGLIGLLAGTAIIH 256

Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
            V  +      +   L  IH+      + A  ++T+N QR  +++  +++T  + +PA+ 
Sbjct: 257 YV-QNRGAVFFLTIFLVFIHLGMNYLGVCALQLDTVNRQRATIVMQHYLRTQEVLTPAEA 315

Query: 302 RYREDLLFPGRLIEDAGNVKVG 323
             +E+++F    I D  N  V 
Sbjct: 316 SKKENIIFWRPHIADEHNKTVA 337


>gi|238488016|ref|XP_002375246.1| DUF647 domain protein [Aspergillus flavus NRRL3357]
 gi|220700125|gb|EED56464.1| DUF647 domain protein [Aspergillus flavus NRRL3357]
          Length = 513

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 95/425 (22%), Positives = 182/425 (42%), Gaps = 81/425 (19%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P+GYP SV++ Y+ Y  F +LQ F+S+   +LS++++L   G+    A   A ++  I
Sbjct: 75  FLPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHI 134

Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF-----LE 181
           L+D    +  ++ ++ +G  ++ E K +R+ AD   D    L+ LSP+ P  F     L 
Sbjct: 135 LQDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLS 194

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGL---GAGIQ 238
            AG+    + +  VA  +++  + + F++ GNL+++ AK  +  T+ +++G+     G  
Sbjct: 195 TAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLVRFVGSF 251

Query: 239 LASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI------------V 286
           + S V +S     I   +L  +H+      +R+  + +LN QR  ++            +
Sbjct: 252 VVSRV-TSYTATWISLVMLLTMHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNSDTDLDI 310

Query: 287 ADFVKTG------------------RISSPADLRYREDLLFPGRLI---------EDAGN 319
           A+F  T                   +I +PA +  +E +     ++            G 
Sbjct: 311 ANFNPTHETHPTPKHSKATQSQKQWQIPTPAQVSKQEKIFETDGILRWVSAPSTQHKLGT 370

Query: 320 VKVGRDPHKVVKPSK-------------FHELKELFAEEKFILCLG---SRW-TDMVLEQ 362
            ++G    + + PS                +L  LF  E ++L L      W   ++L+ 
Sbjct: 371 CRIGVSLEQFLAPSSTRTGSGSLKTSTPMSDLSSLFKSEDYLLFLHRNRQSWDARILLKT 430

Query: 363 NASGEDALRGWLVAAYAA----SMAKSFHDPSLTVLQDAYDK----MND--VFTPLLSEL 412
           +++ +  L+ W+    AA    S AK      +  + D   K    +ND       +S L
Sbjct: 431 SSTTQTQLKAWMHVLLAARVLCSSAKEMRTQEIEYIMDTISKTLTFLNDGSRTDQYMSAL 490

Query: 413 QAKGW 417
              GW
Sbjct: 491 TEAGW 495


>gi|241595406|ref|XP_002404468.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215502351|gb|EEC11845.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 472

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 66/271 (24%), Positives = 119/271 (43%), Gaps = 24/271 (8%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATV 123
           F   F PSG+P +V+  Y  Y  + ++Q F S+    L+TQ++L   G+    A   A  
Sbjct: 45  FSEIFLPSGFPDTVSRDYKEYQIWDSIQAFASSITGTLATQAVLTGVGVGDQTASVLAAT 104

Query: 124 VSWILKDGMQHVGKLI------------------C--SNLGARMDSEPKRWRILADALYD 163
            +WILKDG   +G+++                  C  SN    ++       + AD L D
Sbjct: 105 TTWILKDGTGMIGRILFALVSRYLGMCCATHNKYCAHSNDSTTLNHADNYCSLFADILND 164

Query: 164 LGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
               LE++S         +  + + AK +  VA  ATR  +    A+  N++D+ AK  +
Sbjct: 165 AAILLELMSQHLKGYITLVLCISSVAKSVVGVAGGATRAALTQHQARSNNMADVSAKDGS 224

Query: 224 ISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
             TL N+      + L   +  S      +  + +++H+++    +    + T N  R +
Sbjct: 225 QETLVNLAAFLCSLLLLRLLAGSQILVYGIFTIFTLLHIFANFRAVSCVVMETFNRSRYS 284

Query: 284 MIVADFV-KTGRISSPADLRYREDL-LFPGR 312
           ++V  F+ + G I+S   +  +E + L  GR
Sbjct: 285 LLVKHFLEEAGDIASVEKVNKQESVWLRCGR 315


>gi|396461797|ref|XP_003835510.1| hypothetical protein LEMA_P048510.1 [Leptosphaeria maculans JN3]
 gi|312212061|emb|CBX92145.1| hypothetical protein LEMA_P048510.1 [Leptosphaeria maculans JN3]
          Length = 543

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 76/316 (24%), Positives = 151/316 (47%), Gaps = 33/316 (10%)

Query: 117 TPAQATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           +P  A ++S +L+D M     ++ ++ LG   + E K +R  AD L D G  LE LSP  
Sbjct: 187 SPTAALLLS-VLQDSMGRTATILFADRLGTAFEPECKMYRFTADILNDTGMVLECLSPAF 245

Query: 176 PQ-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
           P+ + + +    +  + +  V A + +  + + FA++GNL ++ AK  +  T+ +++G+ 
Sbjct: 246 PKPIRVCVLSCSSVLRSLCGVCAGSAKASLSAHFARKGNLGEVNAKDSSQETVISLLGML 305

Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGR 294
           AG  + S + S M     +  LL+ IH+ +    +RA  +  LN QR  ++ +  ++ G+
Sbjct: 306 AGSVVVSWISSPMATWSTLIGLLA-IHLATNYAAVRAVSMRCLNRQRANILFSRLLEHGQ 364

Query: 295 ISSPADLRYREDLLFPGRLIEDAGNVKVG--------------------RDPHKVVKPSK 334
           + SP ++  RE +     ++  + ++ +G                    R     ++ ++
Sbjct: 365 LLSPLEVSRRERVFEHDGVLRWSDDMAIGYCRIGVSLQTLLSHMGLRNARTGSLHLRGAE 424

Query: 335 FHELKELFAEEKFILCLGSRWTD--MVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLT 392
             EL  +F +E +IL   S      +VL++  +  D L+ W   A+A  +AK +  P   
Sbjct: 425 ISELLHVFVDEAYILWPASSAGHFLIVLKEGCTPMDQLKAW---AHALLLAKGYSLPGAK 481

Query: 393 VLQD----AYDKMNDV 404
            + D    A +++ DV
Sbjct: 482 SVVDQTSLAGNRLADV 497



 Score = 45.8 bits (107), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 38/56 (67%), Gaps = 4/56 (7%)

Query: 54  DSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL 109
           D++    RV+D FL    P+GYP SV E Y++Y  + +LQ F+S+  S+L+++++L
Sbjct: 45  DAKGYAQRVLDVFL----PAGYPHSVTEDYIQYQIYDSLQAFSSSIASMLASRAVL 96


>gi|440794597|gb|ELR15757.1| hypothetical protein ACA1_379390, partial [Acanthamoeba castellanii
           str. Neff]
          Length = 449

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 65/218 (29%), Positives = 93/218 (42%), Gaps = 30/218 (13%)

Query: 63  VDSFLNKFF-PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
           VD  L   F P+GYP SV + Y+ + ++   +   S+A  V      L   G        
Sbjct: 69  VDLMLKSLFLPTGYPASVRQDYIHFQKWEVAKGIVSSAGFV----GHLVGLGCL------ 118

Query: 122 TVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLE 181
                    GM     L+    G   D+E KR R  AD ++  G  LE+ +PL P  FL 
Sbjct: 119 ---------GMM----LVAGAFGKFFDTETKRIRWAADCIHVAGVALELATPLFPAYFLP 165

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
           +A L N AKGIA +   AT+  I   FA   NL D+ AKG +   +  ++G+G GI    
Sbjct: 166 LASLANSAKGIAGLTTGATKAAINQGFALRDNLGDITAKGHSQGIVAYLMGMGIGIGTTY 225

Query: 242 TVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNP 279
                     ++G L +V  V +      + P  T  P
Sbjct: 226 VTGG------VLGALFAVYGVLAGGSTSSSGPFTTAAP 257


>gi|194382298|dbj|BAG58904.1| unnamed protein product [Homo sapiens]
          Length = 194

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 47/164 (28%), Positives = 80/164 (48%)

Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLP 203
           G+++D   K+WR+ AD L D+   LE+++P+ P  F       N AK I  VA  ATR  
Sbjct: 12  GSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAA 71

Query: 204 IYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVY 263
           +    A+  N++D+ AK  +  TL N+ GL   + +   V       +     L+ +H+Y
Sbjct: 72  LTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIY 131

Query: 264 SVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDL 307
           +    +RA  + TLN  R  +++  +++ G +  P      E L
Sbjct: 132 ANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 175


>gi|412993086|emb|CCO16619.1| predicted protein [Bathycoccus prasinos]
          Length = 454

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 55/187 (29%), Positives = 87/187 (46%), Gaps = 9/187 (4%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP-----AQATVV 124
           F P G+P +V+  YL + +++ L  F    L VLS QSLL A G+           A   
Sbjct: 40  FLPDGFPNTVSRDYLPWLKWQMLSLFFRDVLEVLSAQSLLVAVGMDVNQANAAAPVAGAA 99

Query: 125 SWILKDGMQHVGKLICSNLGA-RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
            W+LKDG   +  L    L   + D  PK +   A++L D+   +E+L+PL P  FL +A
Sbjct: 100 KWVLKDGTGGLATLALGALNTKKFDENPKFFWSTANSLEDVSRAMELLTPLFPSHFLLIA 159

Query: 184 GLGNFAKGIAVVAARATRL--PIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAS 241
           G   F +  A +  R++ +     + F++  N  D+  K E       +  L  GI L  
Sbjct: 160 GTATFVRS-AALTGRSSLINGTFMTHFSRNENAGDIRTKLEVQGRWLQIAALPVGIALFR 218

Query: 242 TVCSSMQ 248
            V + ++
Sbjct: 219 AVTNGLE 225


>gi|422292654|gb|EKU19956.1| hypothetical protein NGA_2057300, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 274

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 5/134 (3%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR---PTPAQATVVSW 126
           F P G+P SV   YLRY Q+  +Q  ++    +L+TQ++L   G+     TP  AT+  W
Sbjct: 142 FLPVGFPSSVAPEYLRYQQWNVVQDLSTYLRGILATQAILEGVGVGREGATPLAATL-QW 200

Query: 127 ILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
           I +DG   +  L+  S L        K WR+ AD + D+G  L++L+PL P+ FL    L
Sbjct: 201 ITRDGASMMSGLVFTSFLSTNFGVNAKSWRLFADFMVDIGITLDMLAPLFPKNFLLCICL 260

Query: 186 GNFAKGIAVVAARA 199
            +  K +  +AA A
Sbjct: 261 ASVCKSLCGIAAGA 274


>gi|387198702|gb|AFJ68865.1| hypothetical protein NGATSA_2057300, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 277

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 5/134 (3%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR---PTPAQATVVSW 126
           F P G+P SV   YLRY Q+  +Q  ++    +L+TQ++L   G+     TP  AT+  W
Sbjct: 145 FLPVGFPSSVAPEYLRYQQWNVVQDLSTYLRGILATQAILEGVGVGREGATPLAATL-QW 203

Query: 127 ILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGL 185
           I +DG   +  L+  S L        K WR+ AD + D+G  L++L+PL P+ FL    L
Sbjct: 204 ITRDGASMMSGLVFTSFLSTNFGVNAKSWRLFADFMVDIGITLDMLAPLFPKNFLLCICL 263

Query: 186 GNFAKGIAVVAARA 199
            +  K +  +AA A
Sbjct: 264 ASVCKSLCGIAAGA 277


>gi|409076043|gb|EKM76417.1| hypothetical protein AGABI1DRAFT_131245 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 466

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 94/411 (22%), Positives = 170/411 (41%), Gaps = 45/411 (10%)

Query: 60  HRVVDSFLNK-FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
           H+     L K F P+GYP +V   YL Y    A Q F ++  S+L++++ L   G+    
Sbjct: 44  HQDTKELLTKIFLPAGYPNTVTPDYLLYQILNAAQAFCNSLASLLASRAALEGIGVGDPS 103

Query: 119 AQAT--VVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           A +T  ++  +L+D    +  +I +  LG R+  E K++R LAD L D+   ++V +P+ 
Sbjct: 104 ATSTGAMLISVLQDIFGRIATIIGAYYLGTRLVPEAKKYRFLADVLNDIAVVVDVFNPVF 163

Query: 176 PQLFLEMA---GLGNFA--KGIAVVAARATRLPIYSSFAKE----GNLSDLFAKGEAIST 226
                  A   GL   A  + +  V A   +  I   FA      G++ DL AK  +  T
Sbjct: 164 SAFMFPGARVIGLCTSAALRAVCGVLAGGAKAAITLHFATPRGGVGDVGDLNAKDSSKET 223

Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGP-LLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
           +  ++G+     L S +   +    +  P LL +I V+ VI  +    +   +P R    
Sbjct: 224 VLALLGM----LLGSVMIPHITSPWVTYPVLLLLISVHLVINYIAVRGLAIRSPNRHRAT 279

Query: 286 VADF---VKTGRISSPADLRYREDLLFPGRLIEDA------GNVKVGRDPHKVVK-PSKF 335
           +A      K     SP  +  RE +      I D       G   VG  P  V++ P+  
Sbjct: 280 IAWLWYRSKDECAPSPQAVASRELIFERPENIRDPTTSLVLGKCHVGSAPLDVMRGPALS 339

Query: 336 HELKELFAEEKFILCLGSRW-------TDMVLEQNASGEDALRGWL--------VAAYAA 380
            E  ++F  +K+I+C              +  ++  +  D L+ W+        +A    
Sbjct: 340 QEYLDIFHADKYIVCFNPSHETGHLPIIHICFKEGFTRADELKSWIHAVEICRSIALNQV 399

Query: 381 SMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD--RFLDGTGTR 429
            + K      L  ++  +  + + +   +  ++  GW+ D      G+G +
Sbjct: 400 PIQKGSKAVELEHIRSTFQVVKEKYDHFVDHMKKAGWNFDESHLASGSGPK 450


>gi|154292218|ref|XP_001546685.1| hypothetical protein BC1G_14192 [Botryotinia fuckeliana B05.10]
          Length = 449

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 67/249 (26%), Positives = 131/249 (52%), Gaps = 25/249 (10%)

Query: 95  FTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEP 151
           F S    +L+++++L   G+  + A AT  V+  IL++    +  ++ ++ LG+ ++ E 
Sbjct: 59  FASTIAGLLASRAVLQGLGVGDSTASATHAVLLSILQESAGRISTILFAHRLGSALEPEC 118

Query: 152 KRWRILADALYDLGTGLEVLSPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSF 208
           K++R++AD   D    L+ +SP     P++FL  A   +  K +  VAA +++  + + F
Sbjct: 119 KKYRLMADIFNDAAMILDCISPAFPKIPRVFLLSAS--SVCKSLCGVAAGSSKASLSAHF 176

Query: 209 AKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEE 268
           AK GNL++L AK  +  TL +++G+  G  + S + S +   + +  LLS IH+ +    
Sbjct: 177 AKMGNLAELNAKDASQETLISLLGMLVGTFVVSKISSQVATWIALLALLS-IHLGTNYMA 235

Query: 269 MRAAPVNTLNPQRTAMIVADFVKT-----GRIS--SPADLRYREDLLFPGRLIEDAGNVK 321
           +R+  + TLN QR  ++++  + +      +IS  SP D+  +E      R+ E  G + 
Sbjct: 236 VRSVTMRTLNRQRANLVISSLLSSPEHEKNKISLPSPRDISLQE------RIFERDGAI- 288

Query: 322 VGRDPHKVV 330
             R+ H  +
Sbjct: 289 --RNTHGTI 295


>gi|336467636|gb|EGO55800.1| hypothetical protein NEUTE1DRAFT_103211 [Neurospora tetrasperma
           FGSC 2508]
 gi|350287710|gb|EGZ68946.1| hypothetical protein NEUTE2DRAFT_72230 [Neurospora tetrasperma FGSC
           2509]
          Length = 361

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 72/254 (28%), Positives = 127/254 (50%), Gaps = 23/254 (9%)

Query: 65  SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQA 121
           + L+ F P+GYP SV   YL Y  + +LQ F S+  S+L+ +++L   G+     +P  A
Sbjct: 38  ALLHAFLPAGYPHSVTPDYLPYQTYDSLQAFFSSISSLLANRAVLEGLGVGDASSSPTGA 97

Query: 122 TVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF- 179
            ++  I  D +  V  ++ ++ LG  ++ E K +R LAD L D    L++L P   + + 
Sbjct: 98  LILK-IASDTISRVATILFAHRLGTAIEPECKFFRFLADCLNDAAQLLDLLVPAMMERYG 156

Query: 180 LEMA---GLGNFAKGIAV-----VAARATRLPIYSSFAKEG--NLSDLFAKGEAISTLFN 229
           L++    G G       V      A+ +    + +S  K G  NL++L AK  +  T+ +
Sbjct: 157 LKLGVIVGAGVLRSWCGVAASASKASLSAHFIVVASQQKGGGNNLAELNAKEASQETVVS 216

Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPL---LSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
           ++G+  G    S V   +Q +++V  L   L  +H+      +R     TLN QR  M+V
Sbjct: 217 LLGMLVG----SLVVQVVQDRLVVWVLMVGLLGVHLLMNYWGVRCVRCRTLNRQRATMVV 272

Query: 287 ADFVKTGRISSPAD 300
            ++++TGR+  P +
Sbjct: 273 REWLETGRVLGPEE 286


>gi|116004169|ref|NP_001070441.1| UPF0420 protein C16orf58 homolog [Bos taurus]
 gi|61554841|gb|AAX46623.1| hypothetical protein FLJ13868 [Bos taurus]
          Length = 232

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 52/184 (28%), Positives = 91/184 (49%), Gaps = 13/184 (7%)

Query: 12  KKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVV--DDSRPVYHRVVDSFLNK 69
           +  PD S+  +     W+  S + T     +P G      V    + P + R++  FL  
Sbjct: 19  RAAPDGSLQWEAWGRRWWGFSGAFTA----KPGGRDGGGGVAPGTASPPFSRLLAVFL-- 72

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
             P G+P SV+  YL Y  + ++Q F S+    L+T ++L   G+    A   A   +W+
Sbjct: 73  --PQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWL 130

Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G+++ +  +G++MD   K+WR+ AD L D+   LE+++P+ P  F     + 
Sbjct: 131 VKDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTITVCIS 190

Query: 187 NFAK 190
           N AK
Sbjct: 191 NLAK 194


>gi|189196610|ref|XP_001934643.1| hypothetical protein PTRG_04310 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187980522|gb|EDU47148.1| hypothetical protein PTRG_04310 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 333

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 71/296 (23%), Positives = 139/296 (46%), Gaps = 29/296 (9%)

Query: 85  RYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICS 141
           R     +LQ F+S+   +L+++++L   G+     TP  A ++S I     Q    L   
Sbjct: 36  RVDVIDSLQAFSSSIAGMLASRAILEGVGVGDSTATPTTALLLSVIQTSVGQIATILFAH 95

Query: 142 NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARAT 200
            LG  ++ E K +R+ AD   D    L+ LSP  P+ + + +    +  + +  V A ++
Sbjct: 96  RLGTALEPECKMYRLAADVFNDTAMVLDCLSPAFPKSIRVAVLSFSSCLRALCGVCAGSS 155

Query: 201 RLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ--GKMIVGPLLS 258
           +  + + FA +GNL ++ AK  +  T+ +++G+ AG  + S + S +   G +I+   + 
Sbjct: 156 KASLSAHFAHKGNLGEVNAKDSSQETVISLLGMLAGSLVVSQITSPLATWGTLILLLTIH 215

Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAG 318
           +   Y+ +   RA  ++ LN QR  ++ ++  + G + SP D+  RE +   G ++  + 
Sbjct: 216 LATNYAAV---RAVNMHCLNRQRANILFSNMFQHGLVLSPRDVSQRERVFERGGVLRWSD 272

Query: 319 NVKVGR---------------DPHK-----VVKPSKFHELKELFAEEKFILCLGSR 354
           +  +GR                PH+      +K  +  +L E+FA E   +  G R
Sbjct: 273 DKVLGRCSIGVPLQRLLNRLGTPHQQTGSLTLKSMQISDLLEVFANESLHIIAGPR 328


>gi|296473307|tpg|DAA15422.1| TPA: hypothetical protein LOC767862 [Bos taurus]
          Length = 232

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/184 (28%), Positives = 91/184 (49%), Gaps = 13/184 (7%)

Query: 12  KKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVV--DDSRPVYHRVVDSFLNK 69
           +  PD S+  +     W+  S + T     +P G      V    + P + R++  FL  
Sbjct: 19  RAAPDGSLQREAWGRRWWGFSGAFTA----KPGGRDGGGGVAPGTASPPFSRLLAVFL-- 72

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
             P G+P SV+  YL Y  + ++Q F S+    L+T ++L   G+    A   A   +W+
Sbjct: 73  --PQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWL 130

Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G+++ +  +G++MD   K+WR+ AD L D+   LE+++P+ P  F     + 
Sbjct: 131 VKDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTITVCIS 190

Query: 187 NFAK 190
           N AK
Sbjct: 191 NLAK 194


>gi|426194386|gb|EKV44317.1| hypothetical protein AGABI2DRAFT_120451 [Agaricus bisporus var.
           bisporus H97]
          Length = 462

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 95/410 (23%), Positives = 171/410 (41%), Gaps = 47/410 (11%)

Query: 60  HRVVDSFLNK-FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP 118
           H+     L K F P+GYP +V   YL Y    A Q F ++  S+L++++ L   G+    
Sbjct: 44  HQDTKELLTKIFLPAGYPNTVTPDYLLYQILNAAQAFCNSLASLLASRAALEGIGVGDPS 103

Query: 119 AQAT--VVSWILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLC 175
           A +T  ++  +L+D    +  +I +  LG R+  E K++R LAD L D+   ++V +P+ 
Sbjct: 104 ATSTGAMLISVLQDIFGRIATIIGAYYLGTRLVPEAKKYRFLADVLNDIAVVVDVFNPVF 163

Query: 176 PQLFLEMA---GLGNFA--KGIAVVAARATRLPIYSSFAKE----GNLSDLFAKGEAIST 226
                  A   GL   A  + +  V A   +  I   FA      G++ DL AK  +  T
Sbjct: 164 SAFMFPGARVIGLCTSAALRAVCGVLAGGAKAAITLHFATPRGGVGDVGDLNAKDSSKET 223

Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGP-LLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
           +  ++G+     L S +   +    +  P LL +I V+ VI  +    +   +P R    
Sbjct: 224 VLALLGM----LLGSVLIPHITSPWVTYPVLLLLISVHLVINYIAVRGLAIRSPNRHRAT 279

Query: 286 VADF---VKTGRISSPADLRYREDLLFPGRLIEDA------GNVKVGRDPHKVVK-PSKF 335
           +A      K     SP  +   E +      I D       G   VG  P  V++ P+  
Sbjct: 280 IAWLWYRSKDECAPSPQAVASHELIFERPENIRDPTTSLVLGKCHVGSAPLDVMRGPALS 339

Query: 336 HELKELFAEEKFILCLGSRW-------TDMVLEQNASGEDALRGWLVAAYAASMAKSFHD 388
            E  ++F  +K+I+C              +  ++  +  D L+ W+   +A  + +S   
Sbjct: 340 KEYLDIFHADKYIVCFNPSHETGHLPIIHICFKEGFTRADELKSWI---HAVEICRSIAL 396

Query: 389 PSLTVLQDAYDKMNDVFTPL-------LSELQAKGWHTD--RFLDGTGTR 429
             + V + A + +   F  +       +  ++  GW+ D      G+G +
Sbjct: 397 NQVPVQKGALEHIRSTFQVVKEKYDHFVDHMKKAGWNFDESHLASGSGPK 446


>gi|299740308|ref|XP_001838896.2| aldo-keto reductase [Coprinopsis cinerea okayama7#130]
 gi|298404184|gb|EAU82827.2| aldo-keto reductase [Coprinopsis cinerea okayama7#130]
          Length = 768

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 63/224 (28%), Positives = 108/224 (48%), Gaps = 13/224 (5%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQATVVSWI 127
           F P+GYP SV   YL Y    ALQ F S+  S++S+++ L  F  G     A   ++  I
Sbjct: 331 FLPAGYPNSVTPDYLTYQVLNALQAFCSSLASLISSRATLQGFGVGSASASANDALLLTI 390

Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPL-----CPQLFLE 181
           L D    +  ++ ++ LG+ + +E K++R +AD L D+   L+ LSP+      PQL + 
Sbjct: 391 LLDIYSRLTTIVSAHLLGSSLATEAKKYRFVADILNDVAVVLDTLSPVFKAQGYPQLRVV 450

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAK----EGNLSDLFAKGEAISTLFNVVGLGAGI 237
              L    + +  +AA  ++  I   FA     +G+L DL AK  +  T+  + G+  G 
Sbjct: 451 ALCLSASCRALCGIAAGGSKAAISVHFATPVKGKGDLGDLNAKDSSKETVLALFGMLLGS 510

Query: 238 QLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
            +   + ++      +  LLS +H+      +    + T+N QR
Sbjct: 511 LIVPRLQTAWSTYTTLFILLS-LHIAINYVAVTGLVLRTMNWQR 553


>gi|62320358|dbj|BAD94740.1| putative protein [Arabidopsis thaliana]
          Length = 295

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 72/276 (26%), Positives = 123/276 (44%), Gaps = 19/276 (6%)

Query: 158 ADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLP----IYSSFAKEGN 213
           AD +   G+  ++ + L P  FL +A  GN AK +A    R  R P    I + FA  GN
Sbjct: 3   ADFIGSAGSFFDLATQLYPSQFLLLASTGNLAKAVA----RGLRDPSFRVIQNHFAISGN 58

Query: 214 LSDLFAKGEAISTLFNVVGLGAGIQLAST--VCSSMQGKMIVGPLLSVIHVYSVIEEMRA 271
           L ++ AK E       ++GLG GI +  T  +  S    ++    + ++H++   + +  
Sbjct: 59  LGEVAAKEEVWEVAAQLIGLGFGILIIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAV 118

Query: 272 APVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
              NT+N +R  +IV   V    +    D   RE++L   R ++      V  +    ++
Sbjct: 119 LQFNTVNLKRARIIVESHVVHSVVPGYVDCNKRENILLWQRFMKPRIIFGVSLEELSGLE 178

Query: 332 P--SKFHELKELFAEEKFILCLGSRWTD----MVLEQNASGEDALRG-WLVAAYAASMAK 384
              SK   L +++ +EK+IL L     D    +  + NA+  D LR  W       +M +
Sbjct: 179 KSVSKVKALLKMYTKEKYILTLNKLNKDTEFSVSFKVNATSRDVLRCLWQAYWLEENMEE 238

Query: 385 SFHDPSLTV--LQDAYDKMNDVFTPLLSELQAKGWH 418
           SF D       L+ +  +M++ F   L +L   GW+
Sbjct: 239 SFKDKDSVFHWLKQSLSEMDNKFDDFLFKLDTAGWN 274


>gi|429856893|gb|ELA31783.1| duf647 domain-containing protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 500

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 67/273 (24%), Positives = 135/273 (49%), Gaps = 18/273 (6%)

Query: 48  SVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQS 107
           SV++    +  + +V+D FL    P GYP SV + Y  Y  + + Q F      ++S+++
Sbjct: 63  SVQLPTIRKKPWRKVLDIFL----PDGYPHSVTDDYAAYQIYDSFQAFAGTIAGMISSRA 118

Query: 108 LLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYD 163
           +    G+     +P  A ++  ++++ M  +G +  ++L G  +++E K +R+ +D L  
Sbjct: 119 VWEGMGVGDSMASPIGAMMIQ-VIRESMGRLGTIAFAHLMGTSIEAECKAYRMASDVLCS 177

Query: 164 LGTGLEVLSPLCPQ----LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFA 219
           +   L+ LSPL P+    L L  + +   A G   VA  A++  +   FAK  NL +L A
Sbjct: 178 IAFILDCLSPLFPRSIRFLILCCSSILFAASG---VAGNASKSSLSGHFAKWNNLGELNA 234

Query: 220 KGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNP 279
           K  +  T  +++G+  G  + S + +       +  L++ I+++   + +RA    +LN 
Sbjct: 235 KDASQETAISLMGMLTGTLVVSCLTTPTTTWTTLFILIA-IYLFLNWKGVRAVKSRSLNR 293

Query: 280 QRTAMIVADFVKTGRISSPADLRYREDLLFPGR 312
           QR  +  +  +   ++ +P  +  +E L+F  R
Sbjct: 294 QRANIAFSALLSKDQVLTPGKISEKE-LIFERR 325


>gi|310789554|gb|EFQ25087.1| hypothetical protein GLRG_00231 [Glomerella graminicola M1.001]
          Length = 484

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 68/285 (23%), Positives = 131/285 (45%), Gaps = 23/285 (8%)

Query: 40  QFQPDGHLSVKVVDDSRPVYHRVVD------------SFLNKFFPSGYPFSVNEGYLRYT 87
           ++   G L  K V+    + H  VD              L  F P GYP SV++ Y  Y 
Sbjct: 27  EYDKAGSLKAKYVESIPIIGHPRVDVLLPVRRKKIWLQLLEIFLPDGYPHSVSDDYAAYQ 86

Query: 88  QFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLICSNL-G 144
            + ++Q F  +   ++S++++    G+  + A  T  ++  ++++       +  ++L G
Sbjct: 87  IYDSIQAFAGSIAGMISSRAVWEGLGVGDSLASPTGAMLIQVIRESTGRFATITFAHLFG 146

Query: 145 ARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ----LFLEMAGLGNFAKGIAVVAARAT 200
             +++E K +R+ +D L D+   L+ LSP  P+    L L  + L   A G   VA  A+
Sbjct: 147 TSIEAECKAYRLASDILCDVAMVLDCLSPFFPRSIRFLVLCFSSLLYSASG---VAGNAS 203

Query: 201 RLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVI 260
           +  +   FAK  NL +L AK  +  T  +++G+  G  + S +       + +  LL+ +
Sbjct: 204 KSSLSGHFAKWNNLGELNAKDASQETAISLMGMITGTFMVSLLTGPKATWVALISLLA-L 262

Query: 261 HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYRE 305
           H++   + +RA    +LN QR  +  +      ++ +P  +  RE
Sbjct: 263 HLFLNWKGVRAVKSRSLNRQRANIAFSALFSKDQVLTPVIVSERE 307


>gi|330932017|ref|XP_003303615.1| hypothetical protein PTT_15911 [Pyrenophora teres f. teres 0-1]
 gi|311320239|gb|EFQ88270.1| hypothetical protein PTT_15911 [Pyrenophora teres f. teres 0-1]
          Length = 391

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 79/346 (22%), Positives = 157/346 (45%), Gaps = 51/346 (14%)

Query: 117 TPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
           TP  A ++S I     +    L    LG  ++ E K +R  AD   D    L+ LSP  P
Sbjct: 34  TPTTALLLSVIHSSTGRIATILFAHRLGTALEPECKMYRFAADIFNDTAMVLDCLSPAFP 93

Query: 177 Q-LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
           + + + +    +    +  V A +++  + + FA++GNL ++ AK  +  T+ +++G+ A
Sbjct: 94  KPIRVAVLSFSSCLSALCGVCAGSSKASLSAHFARKGNLGEVNAKDSSQETVISLLGMLA 153

Query: 236 GIQLASTVCSSMQ--GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG 293
           G  + S + S +   G +I+   + +   Y+ +   RA  ++ LN QR  ++ ++  + G
Sbjct: 154 GSVVISYITSPLATWGTLILLLTIHLATNYAAV---RAVIMHCLNRQRANILFSNMFQHG 210

Query: 294 RISSPADLRYREDLLFPGRLIEDAGNVKVGR---------------DPHK-----VVKPS 333
            + SP D+  RE +   G ++  + +  +GR                PHK      +K  
Sbjct: 211 LVLSPRDVSQRERVFERGGVLRWSDDKVLGRCSIGVPLQRLLNRLGTPHKQTGSLTLKSM 270

Query: 334 KFHELKELFAEEKFILCLGSRWTD--MVLEQNASGEDALRGWLVAAYAASMAK------- 384
           +  +L ++FA E +IL   S   +  +VL++     D ++ W   A+A  +AK       
Sbjct: 271 EISDLLDVFANEAYILLPASTVEEGLIVLKRTCEPIDQIKAW---AHALLLAKRREGLEA 327

Query: 385 ---------SFHDPSL----TVLQDAYDKMNDVFTPLLSELQAKGW 417
                    + H P++    + L+++  ++  +F     E++ KGW
Sbjct: 328 AGLGRKRDDASHGPAMEGLISELRESLKEVQAMFARYGDEMRDKGW 373


>gi|326469821|gb|EGD93830.1| hypothetical protein TESG_01362 [Trichophyton tonsurans CBS 112818]
          Length = 413

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 84/350 (24%), Positives = 152/350 (43%), Gaps = 65/350 (18%)

Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAG 184
           +L++ M  +  ++ ++ LG  ++ E K +R+ AD L D    L+ LSP+ P+ + + +  
Sbjct: 57  VLQESMGRIATILFAHRLGTSLEPECKMYRLAADILNDSAMVLDCLSPIFPKPVRVGLLS 116

Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAI--STLFNVVGLGAGIQLAST 242
           L +  + +  VAA +++  + + FA+ GNL +L A G  I  S L  VV         S 
Sbjct: 117 LSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAVGLNIPPSQLTGVV--------LSY 168

Query: 243 VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
           + S ++  + +  LL ++H+ +    +RA  + TLN QR  ++ +   +   I +PA+  
Sbjct: 169 ITSPLETWIALIVLL-IVHLGTNHAAVRAVKMTTLNRQRANIVFSHLFEDNLILTPAETS 227

Query: 303 YREDLLFPGRLIE-----------------------------DAGNVKVGRDPHKVVKPS 333
             E +     ++                              D G ++  R     +  S
Sbjct: 228 KEERIFERDGVLRWKAESSGILGTCQIGTSLEQLLLLLPEQADRGALETSRTTSTTMDAS 287

Query: 334 -KFHELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGW----LVAAYAA----SM 382
                L ELF EE++IL  G  SR   +VL+  A+ +  L+ W    LV+ + A    + 
Sbjct: 288 TNLTALLELFREEEYILYFGPLSRRGAIVLKAGATAQAQLKAWSHALLVSRHLARKNGTA 347

Query: 383 AKSFHD---------PS---LTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
               H+         PS   L++L+D   +    F   +  L+  GW TD
Sbjct: 348 TSCIHEKQGGQGELSPSVSVLSILRDTLQEHTRTFKDRVKRLEHAGWQTD 397


>gi|406860777|gb|EKD13834.1| DUF647 domain-containing protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 501

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 66/248 (26%), Positives = 120/248 (48%), Gaps = 26/248 (10%)

Query: 65  SFLNKFFPSGYPFSVNEGYLRY-----------TQFRALQHFTSAALSVLST-----QSL 108
           S L+ F P+GYP SV E YL Y           TQ   +    S ++  L +     +S 
Sbjct: 52  SVLDAFLPAGYPNSVTEDYLEYQIYVRAANKPWTQSELISEGLSTSVLKLHSWDAVLESR 111

Query: 109 LFAAGLRPTPAQATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTG 167
           +       +P  A ++S +L+  M  +  ++ ++ LG  ++ E K +R+ AD   D    
Sbjct: 112 IGVGDSHASPTAALLLS-VLQQSMGRIATIVFAHRLGTALEPECKMYRLAADIFNDAAMI 170

Query: 168 LEVLSPLCPQLF-LEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIST 226
           L+ LSP  P+   + +  L +  + +  V A +++  + + FA +GNL +L AK  +  T
Sbjct: 171 LDCLSPALPKASRVALLSLSSVLRSMCGVTAGSSKASLSAHFATQGNLGELNAKDSSQET 230

Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVG---PLLSVIHVYSVIEEMRAAPVNTLNPQRTA 283
           + +++G+ AG    S V S +  K I      LL   H+ +    +RA  + TLN QR  
Sbjct: 231 VISLLGMLAG----SVVVSYITSKTITWIALILLLATHLGTNYLAVRAVCMQTLNRQRAN 286

Query: 284 MIVADFVK 291
           +++++ ++
Sbjct: 287 LVLSNILQ 294


>gi|347842141|emb|CCD56713.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 376

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 63/231 (27%), Positives = 121/231 (52%), Gaps = 25/231 (10%)

Query: 113 GLRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLE 169
           G+  + A AT  V+  IL++    +  ++ ++ LG+ ++ E K++R++AD   D    L+
Sbjct: 4   GVGDSTASATHAVLLSILQESAGRISTILFAHRLGSALEPECKKYRLMADIFNDAAMILD 63

Query: 170 VLSPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIST 226
            +SP     P++FL  A   +  K +  VAA +++  + + FAK GNL++L AK  +  T
Sbjct: 64  CISPAFPKIPRVFLLSAS--SVCKSLCGVAAGSSKASLSAHFAKMGNLAELNAKDASQET 121

Query: 227 LFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
           L +++G+  G  + S + S +   + +  LLS IH+ +    +R+  + TLN QR  +++
Sbjct: 122 LISLLGMLVGTFVVSKISSQVATWIALLALLS-IHLGTNYMAVRSVTMRTLNRQRANLVI 180

Query: 287 ADFVKT-----GRIS--SPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVV 330
           +  + +      +IS  SP D+  +E      R+ E  G +   R+ H  +
Sbjct: 181 SSLLSSPEHEKNKISLPSPRDISLQE------RIFERDGAI---RNTHGTI 222


>gi|339898153|ref|XP_001465441.2| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|321399425|emb|CAM67862.2| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 675

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 59/234 (25%), Positives = 107/234 (45%), Gaps = 7/234 (2%)

Query: 55  SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
           +R    RVV +      P GYP SV E  L +  +   Q       + LST+++L   G+
Sbjct: 57  ARQARRRVVTALERILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGV 116

Query: 115 RPTPAQAT--VVSWILKDGMQHVGKLICSNLGAR-MDSEPKRWRILADALYDLGTGLEVL 171
             + A  T   +SW+++DG + +G ++ ++L  + ++   K W+++AD   D+   LE+ 
Sbjct: 117 GESKADLTSSTLSWMMQDGTRMIGSIVFASLIPQGLECRAKTWQLVADFTTDIANLLELC 176

Query: 172 SPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
           +P    C  +F  +    +  K +  V    TR      FA   N +D+ AK    S++ 
Sbjct: 177 APWLPGCQTIFRVVLVAASAIKALGGVCDSGTRASFAQHFALRNNAADIAAKAATRSSVG 236

Query: 229 NVVGLGAGIQLASTV-CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
           + +GL  G  +A +V  +S    +    L S   +++    +R   +  LN  R
Sbjct: 237 SFIGLALGTAVAYSVPATSRSLNLAAFALCSAFRIFAHYRGVRGVQLRHLNAPR 290


>gi|398015008|ref|XP_003860694.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322498916|emb|CBZ33989.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 675

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 59/234 (25%), Positives = 107/234 (45%), Gaps = 7/234 (2%)

Query: 55  SRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL 114
           +R    RVV +      P GYP SV E  L +  +   Q       + LST+++L   G+
Sbjct: 57  ARQARRRVVTALERILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGV 116

Query: 115 RPTPAQAT--VVSWILKDGMQHVGKLICSNLGAR-MDSEPKRWRILADALYDLGTGLEVL 171
             + A  T   +SW+++DG + +G ++ ++L  + ++   K W+++AD   D+   LE+ 
Sbjct: 117 GESKADLTSSTLSWMMQDGTRMIGSIVFASLIPQGLECRAKTWQLVADFTTDIANLLELC 176

Query: 172 SPL---CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 228
           +P    C  +F  +    +  K +  V    TR      FA   N +D+ AK    S++ 
Sbjct: 177 APWLPGCQTIFRVVLVAASAIKALGGVCDSGTRASFTQHFALRNNAADIAAKAATRSSVG 236

Query: 229 NVVGLGAGIQLASTV-CSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQR 281
           + +GL  G  +A +V  +S    +    L S   +++    +R   +  LN  R
Sbjct: 237 SFIGLALGTAVAYSVPATSRSLNLAAFALCSAFRIFAHYRGVRGVQLRHLNAPR 290


>gi|392585079|gb|EIW74420.1| DUF647-domain-containing protein [Coniophora puteana RWD-64-598
           SS2]
          Length = 515

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 104/438 (23%), Positives = 181/438 (41%), Gaps = 82/438 (18%)

Query: 57  PVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRP 116
           P   R+    ++ F P+GYP SV   YL Y  + ALQ F S+   +++++  L   G+  
Sbjct: 50  PSLDRIRRVLVDLFLPAGYPASVTPDYLPYHAYNALQAFFSSLAGLIASRGALQGVGV-- 107

Query: 117 TPAQATVVSWILKDGMQHV-GKLICSN----LGARMDSEPKRWRILADALYDLGTGLEVL 171
             A AT  S +L   +Q V G++   +     G  +  E K +R++AD + D      VL
Sbjct: 108 GDASATATSALLLTVLQDVFGRITTISAAYVFGTSLYPETKTFRLVADLVNDASI---VL 164

Query: 172 SPLCPQLFLEMAGLGNFAKGIAVVAARATR---------------------LPIYSSFAK 210
             LCP+L L    L  F   I++ A+ A R                     +P  ++   
Sbjct: 165 DTLCPRLSLSF--LHTFLLAISLCASAALRALCGLVAGGSKAALTNHFACTVPASTAAGG 222

Query: 211 E----GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVI 266
           E    G++ +L AK  +  T+  + G+  G  L   + +S++   +    L   H+ +  
Sbjct: 223 EAVVAGDVGELNAKDASRETVVGLFGMLLGTLLVPRL-TSVRATWLTLVFLIAGHLLANY 281

Query: 267 EEMRAAPVNTLNPQRTAMI--------------------VADFVKTGR--ISSPADLRYR 304
             +R+  + TLN QR  ++                    ++   K G     +PA++  R
Sbjct: 282 LAVRSVSLRTLNRQRAGILWTAWAARMREREDADDVSPSISSLAKAGGQLALTPAEVSRR 341

Query: 305 EDLL-FPGRLIEDAGN----VKVGRDPHKVVKPSKFHELKEL--------FAEEKFIL-C 350
           E +L  PG L +  G+     ++G    +VV       ++          FA+E+++L  
Sbjct: 342 ERILALPGTLRDARGHSIGYCELGASVSRVVARGGAENVRRALQSSSDAKFAKERYMLFA 401

Query: 351 LGSRWTDMVLE------QNASGEDALR--GWLVAAYAASMAKSFHDPSLTVLQDAYDKMN 402
           L  R  +  +E      + A+  D L+     + A A   A S    + TV+++AY  + 
Sbjct: 402 LPRRRVESKVELTVCFKEGATPRDQLKAWAHALEAAARLRAASVSPDTNTVVEEAYAAVE 461

Query: 403 DVFTPLLSELQAKGWHTD 420
             F P +  L   GW  D
Sbjct: 462 AWFAPFMEALSGAGWRVD 479


>gi|224010239|ref|XP_002294077.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220970094|gb|EED88432.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 354

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 52/180 (28%), Positives = 91/180 (50%), Gaps = 21/180 (11%)

Query: 75  YPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGM 132
           YP +V  GYL Y  + +LQ  +S    V+ST ++L AAG+      A +  ++W ++DG+
Sbjct: 19  YPHTVAPGYLEYQFYDSLQGLSSYLRGVVSTTAVLSAAGVGDAAATAMSAAMTWAIRDGL 78

Query: 133 QHVGKLICSNLG-ARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKG 191
             VG L+ S +  +  D+  K +R++AD L D+G  L++  P                  
Sbjct: 79  GMVGGLLFSYVASSHFDAHVKEFRLVADVLNDVGLTLDMALP------------------ 120

Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
           + +   RAT+  I   F+  GN +D+ AK     TL +++G+  G+ L++    S  G++
Sbjct: 121 VVLSWTRATKGNITDHFSISGNRADVSAKESTQETLVSLIGMALGVWLSNLDSHSRVGEL 180


>gi|346467075|gb|AEO33382.1| hypothetical protein [Amblyomma maculatum]
          Length = 388

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 94/199 (47%), Gaps = 3/199 (1%)

Query: 103 LSTQSLLFAAGLRPTPAQ--ATVVSWILKDGMQHVGKLICSNL-GARMDSEPKRWRILAD 159
           L+ QS+L   G+    A   A   +WIL+DG    G+++ +   G+ +D + K+WR+ AD
Sbjct: 4   LAIQSVLTGVGVGDQGASVLAATTTWILRDGAGMTGRILFAWFQGSNLDYDSKKWRLFAD 63

Query: 160 ALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFA 219
            L D    LE++          +  + + AK +  VA  ATR  +    A+  N++D+ A
Sbjct: 64  ILNDAAIFLELMCQYFKGYVTPILCISSVAKSVVGVAGGATRAALTQHQARSNNMADVSA 123

Query: 220 KGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNP 279
           K  +  TL N+      + L   V  +      +  L +++H+++    +    + T N 
Sbjct: 124 KDGSQETLVNLAAFLFSLLLLQLVAGNTWLVSTIFILFTLLHIFANYRAVSCVVMETFNR 183

Query: 280 QRTAMIVADFVKTGRISSP 298
            R +++V  F++ G   +P
Sbjct: 184 SRYSVVVRHFLQKGGGIAP 202


>gi|258576451|ref|XP_002542407.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237902673|gb|EEP77074.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 442

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 83/391 (21%), Positives = 170/391 (43%), Gaps = 67/391 (17%)

Query: 91  ALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICSNLGARM 147
           +LQ F+S+   +LS++++L   G+     +P  A +++ +L++ M  +  ++ ++   R+
Sbjct: 40  SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTAALLLN-VLQESMGRIATILFAH---RL 95

Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRLPIYS 206
             E K +R+ AD   D    ++ LSP+ P+ + + +  L +  + +  VAA +++  + +
Sbjct: 96  --ECKMYRLAADIFNDSAIVMDCLSPMFPKPMRVGVLSLSSVLRAMCGVAAGSSKASLSA 153

Query: 207 SFAKEGNLSDLFAKGEAISTLFNVVG------LGAGIQLASTVCSSMQGKMIVGP----- 255
            FA+ GNL++L AK  +  T+ +++G      LG    ++  + +   G ++V       
Sbjct: 154 HFARWGNLAELNAKDSSQETVISLMGMLVRPILGFLESISGLILTQQVGSLVVSRVTSTL 213

Query: 256 -------LLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
                  LL  IH+ +    +RA  + TLN QR  ++ +      R  +P+   ++E + 
Sbjct: 214 ATWAALILLLTIHITTNYFAVRAVNMTTLNRQRANIVFSTIFDENRTLTPSQAAHQERIF 273

Query: 309 FPGRLIE------DAGNVKVGRDPHKVVK------------PSKFHELKELFAEEKFILC 350
               ++         G+ ++G     +V+             S    L  LF  E++IL 
Sbjct: 274 ERDGVLRWKASPASLGSCRIGVSFQYMVQLIASSESCTSDGQSIVAALLHLFEREEYILW 333

Query: 351 LG--SRWTDMVLEQNASGEDALRGWLVAAYAASM-------------------AKSFHDP 389
                +   +VL+ NA+    L+ W  A   A +                   A   H  
Sbjct: 334 FNPTRKRGAIVLKTNATAASQLKAWSHALIVAQILRLGEWKSKTGIDQFILQPATRAHGL 393

Query: 390 SLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
            L +L++   + ++ F   L  L+  GW  D
Sbjct: 394 VLLILRETLTEHSENFDARLERLRNAGWDVD 424


>gi|149067629|gb|EDM17181.1| similar to cDNA sequence BC017158, isoform CRA_a [Rattus
           norvegicus]
          Length = 246

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 67/124 (54%), Gaps = 3/124 (2%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
             P G+P SV+  YL+Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 75  LLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G++I +   G+++D   K+WR+ AD L D    LE+++P+ P  F       
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTVSTS 194

Query: 187 NFAK 190
           N AK
Sbjct: 195 NLAK 198


>gi|300175334|emb|CBK20645.2| unnamed protein product [Blastocystis hominis]
          Length = 290

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 64/287 (22%), Positives = 123/287 (42%), Gaps = 28/287 (9%)

Query: 170 VLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFN 229
           +++PL P  FL +A + N  K I+ +A  AT   I   FA   N+ D+ AK  + ++   
Sbjct: 1   MITPLFPNCFLALATISNIGKSISFLANGATGAGIRYGFANAMNIGDVTAKEGSQTSAVY 60

Query: 230 VVGLGAGIQLASTVCSSMQGKMIVGPL-LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVAD 288
           ++G+  G+ ++S +       +    + +S + +Y++ + +R   + TLN QR+ +   D
Sbjct: 61  LLGMLFGVSVSSLLGDQSMSTIFASVICMSCLSLYAMNKSLRCVTLPTLNTQRSELACVD 120

Query: 289 FVKTGRISSPADLRYREDLLFPGRLIEDAG-NVKVGRDPHKVVKPSKFHELKE------L 341
           +   G I  P  +   E    P   + D G ++++G   H V+  S +  L+E      L
Sbjct: 121 YCSQGIIYQPEAVAKLERYYLPYLHVADCGTDIRLGMSLHSVL--SAYRSLEEADDVIQL 178

Query: 342 FAEEKFILCLGSRWTDMV-----------------LEQNASGEDALRGWLVAAYAASMAK 384
           F +  ++L        M                  L  +AS  D L+ +L A      + 
Sbjct: 179 FGDCNYMLSYCDLKQGMYRSIINRTEKTKCTIAISLNSSASTRDQLQAYLHALLLRHASG 238

Query: 385 SFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDR-FLDGTGTRF 430
                 L +L++  ++   +    + +L+  GW     FL+ T  R+
Sbjct: 239 GRFTTDLQLLKETKERSKLLLPFFVRDLEQNGWDCIHLFLEETKKRY 285


>gi|148685696|gb|EDL17643.1| cDNA sequence BC017158, isoform CRA_a [Mus musculus]
          Length = 226

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 67/124 (54%), Gaps = 3/124 (2%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
             P G+P SV+  YL Y  + ++Q F S+    L+TQ++L   G+    A   A   +W+
Sbjct: 75  LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 128 LKDGMQHVGKLICSNL-GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           +KD    +G++I +   G+++D   K+WR+ AD L D+   LE+++P+ P  F       
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194

Query: 187 NFAK 190
           N AK
Sbjct: 195 NLAK 198


>gi|240280131|gb|EER43635.1| DUF647 domain-containing protein [Ajellomyces capsulatus H143]
          Length = 521

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 95/416 (22%), Positives = 168/416 (40%), Gaps = 71/416 (17%)

Query: 29  FETSDSV-TRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYT 87
           F T DS   RH + + D      +V  SR      + S LN F P GYP SV++ YL   
Sbjct: 34  FRTPDSSGVRHERRRVD------IVQPSRKDSSSYLASLLNVFLPVGYPHSVSDDYLD-- 85

Query: 88  QFRALQHFTSAALSV-LSTQSLLFAAGLRPTPAQATV--VSWILKDGMQHVGKLICSN-L 143
                  F S  L     T  +    G+    A  TV  +  +L+D M  +  ++ ++ L
Sbjct: 86  ----CLGFASGVLKFNCRTAGVPGWVGVGDASASPTVALLHSVLQDSMGRIATILFAHRL 141

Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVAARATRL 202
           G  ++ E K +R+ AD L D     + LSP  P+ L + +    +  + +  VAA +++ 
Sbjct: 142 GTSLEPECKMYRLAADVLNDSALIFDCLSPALPKRLRVVVLAFSSVLRALCGVAAGSSKA 201

Query: 203 PIYSSFAKEGNLSDL----------------------FAKGEAISTLFNVVGLGAGIQLA 240
            + S FAK GNL +L                      F   +A   L         I L 
Sbjct: 202 SLSSHFAKCGNLGELNAVRFPNVLVIAISGAAQRGFPFLFSDADQKLQKDSSQETVISLL 261

Query: 241 STVCSSMQGKMIVGPLLS--------VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
             +C S+    I  P  +        ++H+ +    +R+  + TLN QRT ++ +   + 
Sbjct: 262 GILCGSVVVSHISTPFATWTTLLLLLLVHLSTNYAAVRSVNMTTLNRQRTNIVFSTLFEK 321

Query: 293 GRISSPADLRYREDLLFPGRLIE------DAGNVKVGRDPHKVVKPS------------K 334
           G + +P +    E +     ++         G  ++G    ++++ S             
Sbjct: 322 GSVLTPTEASKCERIFERDGILRWKASSATLGYCQIGGSFQELLRGSIHGANSIRDVGID 381

Query: 335 FHELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHD 388
            ++L  LF +E+++L      +   +VL+ N +    L+ W   ++A  +AK   D
Sbjct: 382 IYKLLRLFEKEEYVLWFNPIHKKGTIVLKNNVTPISQLQAW---SHALRVAKRLTD 434


>gi|391863517|gb|EIT72825.1| hypothetical protein Ao3042_00909 [Aspergillus oryzae 3.042]
          Length = 236

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 46/158 (29%), Positives = 87/158 (55%), Gaps = 11/158 (6%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P+GYP SV++ Y+ Y  F +LQ F+S+   +LS++++L   G+    A   A ++  I
Sbjct: 75  FLPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHI 134

Query: 128 LKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLF-----LE 181
           L+D    +  ++ ++ +G  ++ E K +R+ AD   D    L+ LSP+ P  F     L 
Sbjct: 135 LQDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLS 194

Query: 182 MAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFA 219
            AG+    + +  VA  +++  + + F++ GNL+++ A
Sbjct: 195 TAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNA 229


>gi|392558453|gb|EIW51641.1| DUF647-domain-containing protein [Trametes versicolor FP-101664
           SS1]
          Length = 450

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 97/408 (23%), Positives = 168/408 (41%), Gaps = 68/408 (16%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWI 127
           F P GYPF+V+   +RY  + A+Q F S+   + +++++L   G+    A AT  +   I
Sbjct: 28  FLPEGYPFTVSPQCVRYQIWNAVQAFCSSLAGLFASRAVLQGHGVGNASASATDAIFLTI 87

Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSP------------- 173
           L+D    +  ++    LG  +  E K +R+LAD L D     + LSP             
Sbjct: 88  LQDVFSRLTTIVSGYYLGTSLYPEAKTYRLLADILNDAAMVTDTLSPHLAHFSFSLSYPF 147

Query: 174 LCPQ-------LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKE--GNLSDLFAKGEAI 224
           + P        + L M+G      G AV       L ++ + A E  G++ DL AK  + 
Sbjct: 148 VVPASNSSLRVVALCMSGAFRALCG-AVAGGSKAALTVHFATAGERPGDVGDLSAKDGSK 206

Query: 225 STLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAM 284
            T+  ++G+ +G  +   V  + Q   +V   L   H+ +    +R   +   N QR  +
Sbjct: 207 ETVLALLGMLSGSVVLRYV-HTAQATYVVLFGLIFCHLAANFIAVRVIAMRAFNRQRAGI 265

Query: 285 IVADFVKTGRISS--PADLRY-REDLLFPGRLIEDAGNVKV------------------- 322
              +F ++  + +  P  L   R++     R+  D+  + +                   
Sbjct: 266 AWCEFRRSFDVVTVVPDYLHVARQE-----RIFTDSARIPISQHARGTYAHCALGVSFLS 320

Query: 323 --GRDPHKVVKPSKFHELKELFAEEKFILCLGSR--WTDMVLEQNASGEDALRGWLVAAY 378
             GR+P       +   L E+FA EK+IL   S      +VL+      D L+ W  A  
Sbjct: 321 LSGREP---FSDEQIATLLEVFAHEKYILWPASNPPQIAVVLKDGHGPHDHLKAWAHAHE 377

Query: 379 AA------SMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
            A      S A  F++  L  ++ A + ++ +F   L   Q  GW T+
Sbjct: 378 IANICDRESHAHGFNE-CLVAIRYALENVSRMFPLFLRAAQEVGWKTE 424


>gi|449543750|gb|EMD34725.1| hypothetical protein CERSUDRAFT_116920 [Ceriporiopsis subvermispora
           B]
          Length = 507

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 101/433 (23%), Positives = 176/433 (40%), Gaps = 84/433 (19%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--V 123
           F   F P+GYP +V+  YL Y  F  LQ   S+  S+L+++++L   G+    A AT  +
Sbjct: 53  FSGIFLPAGYPNTVSPDYLHYQIFNGLQAGCSSIASLLASRAVLEGHGVGEASASATNAM 112

Query: 124 VSWILKDGMQHVGKLICSN----LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-- 177
           +  I++D    V +LI       LG  +  E K++R+LAD   D    L+V+SPL     
Sbjct: 113 LVNIIQDV---VSRLITIGSGYYLGTSLFPEAKKYRLLADVYNDAAVVLDVISPLFTHWS 169

Query: 178 -------------LFLEMAGL---GNFAKGIAVVAARATRLPIYSSFAKE----GNLSDL 217
                         +L +  L   G+F + +  + A  ++  +   FA      G++ DL
Sbjct: 170 ISLVYPFIRREQGFYLRVLALCLSGSF-RALCGMVAGGSKAALTMHFATAGPVPGDIGDL 228

Query: 218 FAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTL 277
            AK  +  T+  ++G+  G  +   V S      ++  +L + H+      +R   + T 
Sbjct: 229 NAKDGSKETVLALIGMLLGTFVIKHVDSGFATYAVLS-MLILTHLAINFVAVRGVVMRTF 287

Query: 278 NPQRTAMIVADFVK--------TGRISSPADLRYRED-------LLFPGRLIEDAGNVKV 322
           N  R A   A + +         G I +P  +  +E        L+ P R  +  G+  +
Sbjct: 288 NRHRAAAAWASYRRYEQGIAQYEGAILTPELMAVKERIFADPTLLMHPLRPEKRLGHCHL 347

Query: 323 GR-------DPHKVVKPSKFHELKEL-----------FAEEKFILCL--GSRWTDMVL-- 360
           G        + H+   PS+   +  L           F  E++IL L   SR   + L  
Sbjct: 348 GASLSTIFLETHRPCPPSRTSTVPNLTSAQLNVVLQTFENERYILWLRHDSRSESLQLVV 407

Query: 361 --EQNASGEDALRGWLVAAYAA--------SMAKSFHDPSLTVLQDAYDKMNDVFTPLLS 410
             +      D L+GW+ A   A         M +  H+    V+Q +  ++  +F   L 
Sbjct: 408 CFKTGHQPIDRLKGWIHAQELAIALEDGVHPMFEKLHE----VVQTSLARVQKLFPLFLD 463

Query: 411 ELQAKGWHTDRFL 423
             + +GW  +  L
Sbjct: 464 AARGRGWMVEEGL 476


>gi|407259493|gb|AFT91377.1| hypothetical protein [Emericella rugulosa]
          Length = 549

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 60/241 (24%), Positives = 114/241 (47%), Gaps = 23/241 (9%)

Query: 64  DSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATV 123
           D  ++ F P+GYP SV++ Y             + AL+  +  + +       +P  A +
Sbjct: 67  DVLIDVFLPAGYPHSVSDDY-------------TPALAQPAIHAGVGVGNANASPTSALL 113

Query: 124 VSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
           +  IL+D    +  ++ ++ +G  ++ E K +R  AD   DL   L+ LSP+ P  F  +
Sbjct: 114 LH-ILQDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCLSPMIPAGFNRV 172

Query: 183 AGL--GNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
             L      + +  VA  +++  + + FA+ GNL+D+ AK  +  T+ +++G+  G  + 
Sbjct: 173 TVLSAAGVLRALCGVAGGSSKASLSAHFARWGNLADVNAKDSSQETVISLIGMLVGSFVV 232

Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPAD 300
           S + +S     +    L  +H+      +R+  + TLN QR     A+ V +  +SS  D
Sbjct: 233 SHI-TSFTATWLTLVFLVSMHLSLNYAAVRSVQMTTLNRQR-----ANIVFSTLLSSDPD 286

Query: 301 L 301
           L
Sbjct: 287 L 287


>gi|452981822|gb|EME81581.1| hypothetical protein MYCFIDRAFT_100197, partial [Pseudocercospora
           fijiensis CIRAD86]
          Length = 272

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 64/243 (26%), Positives = 115/243 (47%), Gaps = 6/243 (2%)

Query: 85  RYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDGMQHVGKLICS 141
           R     ++Q F S+   +LS++++L + G+     +P  A ++S + +   +        
Sbjct: 28  RVDAIDSIQAFVSSIAGLLSSRAVLLSVGVGDSSASPTSALLLSMLQQSLSRIAKVAFAY 87

Query: 142 NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFL-EMAGLGNFAKGIAVVAARAT 200
            LG  ++ E K +R  AD L+D    L  LSP+ P+ F   +  L +    +  V A +T
Sbjct: 88  RLGKAIEPECKMYRFCADLLHDSSFVLNCLSPMLPKAFRPPILALSSICFAVCDVVASST 147

Query: 201 RLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVI 260
           +  + + FAK  NL +L AK  +  T+ +++G+  G  L + +         +  +LSV 
Sbjct: 148 KASLSAHFAKWENLGELNAKDSSQETIISLLGMLTGSLLVNWISGHWATWTALILMLSV- 206

Query: 261 HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNV 320
           H+ +    +RA  + TLN QR A +V   +  G I  PA++   E +      + DA N 
Sbjct: 207 HLEANRRAVRAIVMQTLNRQR-ATVVYHRLCVGHIPRPAEVSRVERIFERNGAVRDAENR 265

Query: 321 KVG 323
            +G
Sbjct: 266 LIG 268


>gi|388520739|gb|AFK48431.1| unknown [Medicago truncatula]
          Length = 236

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/98 (37%), Positives = 55/98 (56%), Gaps = 4/98 (4%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPT----PAQATVVSWI 127
           P G+P SV   Y+ Y  +RAL+HF   A+ V +TQ+LL + G+        A A  ++WI
Sbjct: 118 PEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLTSVGVSRNRATPAAAAVAINWI 177

Query: 128 LKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLG 165
           L DG   VGK++ +  G + D + K+ R   D L +LG
Sbjct: 178 LNDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELG 215


>gi|167378255|ref|XP_001734737.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165903647|gb|EDR29104.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 400

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 62/267 (23%), Positives = 117/267 (43%), Gaps = 12/267 (4%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL-FAAGLRPTPAQATVVSWILKD 130
           P GYP SV  GY R+  F  +    +  + V+S  ++L F +G     A   VV   +  
Sbjct: 51  PVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSVSAILRFNSG---HSAAFFVVFREVMT 107

Query: 131 GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAK 190
           GM H+  +I    G  +    K+WR+  D + +L   +E+     P +F  M+   +  +
Sbjct: 108 GMMHL--IITERWGTSIVFFAKQWRMRIDLISELLRIIEIAFISTP-IFYSMSVTCSVIE 164

Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
           G+   + +  +  I +++AK  N+++L  K + + TLF VV L  G  +   +    +  
Sbjct: 165 GVLQASRQVIKTRILNNYAKGNNVAELTEKVQNLETLFRVVSLLIGWLILHVLGDFTESG 224

Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPADLRYREDLLF 309
           +IV     + H              TLN +R  ++++ F+     I SP  +   E  + 
Sbjct: 225 IIVFICALISHCVCNFLLSNVIVFKTLNYERLTILMSYFMNIKNEILSPVVVSRMESKIK 284

Query: 310 PGRLIEDAGNVKVGRDPHKVVKPSKFH 336
              ++    N+++G   + + K S  +
Sbjct: 285 VNEMV----NIRMGISLNSIPKSSNLY 307


>gi|409046959|gb|EKM56438.1| hypothetical protein PHACADRAFT_253558 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 464

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 71/259 (27%), Positives = 116/259 (44%), Gaps = 40/259 (15%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWI 127
           F P+GYP SV+  Y RY    ALQ F S+   +++ +++L   G+    A AT  ++  I
Sbjct: 2   FLPAGYPASVSGDYTRYQVLNALQGFFSSLGGLVAARAVLEGHGVGSADASATNALLLTI 61

Query: 128 LKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLG 186
           L+D    V  ++    LG  +  E K +R LAD   D+   L+ LSPL   + L+   L 
Sbjct: 62  LQDIFSRVTAILSGYYLGTALFPEAKTYRFLADIWNDIPLVLDTLSPLLSHVSLDFVWLA 121

Query: 187 -------------NFAKGIAVVAARATR-------------LPIYSS----FAKEGNLSD 216
                           +GIA+  +   R             L ++ S     A  G++SD
Sbjct: 122 FPPFLLPTLVHRPGPWRGIALCFSGIFRSMCAVAAAGSKAALTLHFSQPDHGAASGDISD 181

Query: 217 LFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVI---HVYSVIEEMRAAP 273
           L AK  +  T+  +VG+  G    S +   +Q +     LL+VI   H+Y     +R+  
Sbjct: 182 LSAKDGSKETVLALVGMLCG----SIIMPLIQTQTQTCGLLAVIIVFHIYINYYAVRSVV 237

Query: 274 VNTLNPQRTAMIVADFVKT 292
             + N QRT ++ + F ++
Sbjct: 238 FKSYNRQRTCILWSAFRRS 256


>gi|147803314|emb|CAN66551.1| hypothetical protein VITISV_039658 [Vitis vinifera]
          Length = 470

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 72/313 (23%), Positives = 138/313 (44%), Gaps = 36/313 (11%)

Query: 124 VSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA 183
           + W+ KDG+  VG+L    +G  +    K ++    +++DL T       L P  FL++A
Sbjct: 162 IRWVSKDGIGAVGRLF---IGLSVIKRAKGYK----SIFDLST------QLYPAYFLQLA 208

Query: 184 GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLAST- 242
            LGN AK +A      +   I + FA  GNL ++ AK E       ++GL  GI + ++ 
Sbjct: 209 SLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLALGIMVLTSP 268

Query: 243 -VCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
            V +S          + ++H++   + +      ++N +R  ++V   V    +    D 
Sbjct: 269 LVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHSIVPGYVDC 328

Query: 302 RYREDLLFPGRLIEDAGNVKVGRDPHKVV---KP-SKFHELKELFAEEKFILCLGSRWTD 357
             +E +L   R ++    +  G    +++   +P SK   L +L+++EK+IL +  + TD
Sbjct: 329 NRKEKILSWQRFLKP--RIIFGVSLEELIGGERPVSKVRTLLKLYSQEKYILVVKQQGTD 386

Query: 358 ----MVLEQNASGEDALRG-----WLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPL 408
               +  +  A+    LR      WL   +  S      D  L  L+ +  ++ + F   
Sbjct: 387 FEVFVSFKVGATSLSVLRSIWQSYWLQENWCGS------DNILVQLEQSLLELEESFGDF 440

Query: 409 LSELQAKGWHTDR 421
             +L++ GW   +
Sbjct: 441 TQQLKSSGWDLQK 453


>gi|121709968|ref|XP_001272600.1| DUF647 domain protein [Aspergillus clavatus NRRL 1]
 gi|119400750|gb|EAW11174.1| DUF647 domain protein [Aspergillus clavatus NRRL 1]
          Length = 512

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 54/189 (28%), Positives = 92/189 (48%), Gaps = 18/189 (9%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFR-ALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
           +VD  ++ F PSGYP SV + YL    F  A    T+  L V              TP  
Sbjct: 65  LVDLLVDVFLPSGYPHSVTDDYLPRVYFHLATPQPTNNPLGV-------GVGNADATPTS 117

Query: 121 ATVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-- 177
           A ++  IL+D    +  ++ ++ +G  ++ E K +R+ AD   DL   L+ LSP+ P   
Sbjct: 118 ALLLH-ILQDTSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMILDCLSPMIPAGV 176

Query: 178 ---LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
                L  AG+    + +  VA  +++  + + FA+ GNL+++ AK  +  T+ ++VG+ 
Sbjct: 177 SRITVLSTAGV---LRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLVGML 233

Query: 235 AGIQLASTV 243
            G  + S V
Sbjct: 234 VGSLVVSHV 242


>gi|169604674|ref|XP_001795758.1| hypothetical protein SNOG_05352 [Phaeosphaeria nodorum SN15]
 gi|111066623|gb|EAT87743.1| hypothetical protein SNOG_05352 [Phaeosphaeria nodorum SN15]
          Length = 473

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 84/391 (21%), Positives = 162/391 (41%), Gaps = 63/391 (16%)

Query: 52  VDDSRPVYHR-VVDSFLNKFFPSGYPFS------------------VNEGYLRYTQFRAL 92
           VD  +P   +  V    + F P+GYP S                  V EG+L       L
Sbjct: 28  VDVVKPAERKSYVQQVFDVFLPAGYPQSDSLQAFSSSIAGMLASRAVLEGHLDAEDTLRL 87

Query: 93  QHFTSAALS--------VLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSN-L 143
           Q+   +  +        + S QS     G       A ++  +L++ M  +  ++ ++ L
Sbjct: 88  QNLVPSVYAASIGDQRHMSSHQSRCVGVGDSSASPTAALLLSVLQESMGRIATILFAHRL 147

Query: 144 GARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMA-GLGNFAKGIAVVAARATRL 202
           G  ++ E K +R+ AD   D    L+ LSP  P+ F  +     +  + +  V A + + 
Sbjct: 148 GTALEPECKMYRLAADVFNDTAMILDCLSPAFPKPFRVLVLSFSSVLRSLCGVCAGSAKA 207

Query: 203 PIYSSFAKEGNLSDLFAKGEAI--STLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVI 260
            + + FA++GNL ++ A    +  +  F++ GL  G    +T         I+  LLS I
Sbjct: 208 SLSAHFARKGNLGEVNAHITHMNPNVDFDIEGLEPGNGHIAT------WNAILIALLS-I 260

Query: 261 HVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPG---RLIEDA 317
           H+ +    ++A  +  LN QR  ++  + +  G + +P ++  RE +       R ++D 
Sbjct: 261 HLATNYAAVKAVSMRCLNRQRANIVFGNLMHHGSVLTPTEVSQRERIFERDGVLRWVDDC 320

Query: 318 --GNVKVG---------------RDPHKVVKPSKFHELKELFAEEKFILCLGSRWTD--M 358
             G+  +G               R     +      +L ++FA++K++L       +  +
Sbjct: 321 VIGHCTIGVPLGVLLASMSQRYRRTGSLELSDINLPDLMDIFADQKYVLWFTDSEYEAIV 380

Query: 359 VLEQNASGEDALRGWLVAAYAASMAKSFHDP 389
           VL++  +  D L+ W   A+A   A+   +P
Sbjct: 381 VLKEGCTPIDQLKAW---AHALLFAQDLRNP 408


>gi|159474944|ref|XP_001695583.1| hypothetical protein CHLREDRAFT_191958 [Chlamydomonas reinhardtii]
 gi|158275594|gb|EDP01370.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 487

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/138 (36%), Positives = 74/138 (53%), Gaps = 8/138 (5%)

Query: 91  ALQHFTSAALSVLSTQSLLFAAGLR----PTPAQATVVSWILKDGMQHVGKLI-CSNLGA 145
            L +    A SVL++  +L+A GL     PT   A  ++W+LKDGM  +G L+    +  
Sbjct: 142 GLNNIAVTANSVLASTFMLYAVGLGAGAIPT---AGALNWVLKDGMGQLGTLVFGKTIAH 198

Query: 146 RMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIY 205
             D   K W  L+  L      LE+L+ L P  FL M  L N  KG+A +AA +TR   +
Sbjct: 199 NFDVHSKTWFFLSAVLLQAAAALEMLTVLVPGHFLLMGSLANMLKGLAWMAAGSTRSVFH 258

Query: 206 SSFAKEGNLSDLFAKGEA 223
            SFA++ N++D+ AKG +
Sbjct: 259 LSFARDNNIADVTAKGTS 276


>gi|67467345|ref|XP_649789.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56466294|gb|EAL44403.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449707066|gb|EMD46790.1| Hypothetical protein EHI5A_044670 [Entamoeba histolytica KU27]
          Length = 400

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 58/269 (21%), Positives = 116/269 (43%), Gaps = 16/269 (5%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKD- 130
           P GYP SV  GY R+  F  +    +  + V+S      +A LR  P  +     + ++ 
Sbjct: 51  PVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSV-----SAILRINPGHSAAFFVVFREV 105

Query: 131 --GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
             GM H+  +I    G  +    K+WR+  + + +L   +E+     P +F  ++   + 
Sbjct: 106 MTGMMHL--IITERWGTSIVFFAKQWRMRIELISELLRIIEIAFISTP-IFYSISVTCSV 162

Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
            +G+   + +  +  I +++AK  N+++L  K + + TLF VV L  G  +   +    +
Sbjct: 163 IEGVLQASRQVIKTRILNNYAKGNNVAELTEKVQNLETLFRVVSLLIGWLILHVLGDFTE 222

Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPADLRYREDL 307
             +IV     + H              TLN +R  ++++ F+     I SP  +   E  
Sbjct: 223 SGIIVFICALISHCICNFLLSNVIVFKTLNYERLTILMSYFMNIKNEILSPVVVSRIESK 282

Query: 308 LFPGRLIEDAGNVKVGRDPHKVVKPSKFH 336
           +    ++    N+++G   + + K S  +
Sbjct: 283 IKVNEMV----NIRMGISLNSIPKSSDLY 307


>gi|342186523|emb|CCC96010.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 596

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 62/249 (24%), Positives = 102/249 (40%), Gaps = 7/249 (2%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
           P+GYP S  EG+ RY        F S+  S ++ QS+L    L  +P       W+LKD 
Sbjct: 143 PTGYPDSCAEGFRRYFLLSLCSTFVSSFASSIAYQSILNGFLLVSSPQL-----WMLKDL 197

Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
              +     +N     ++ PK W +++  L++    LE++ P + P   L  A L +  +
Sbjct: 198 APALAAAYLANRVVSYENRPKFWFVVSVVLHNTSIVLEMIIPSVVPHNLLTAAVLTSCVR 257

Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
             A +    TR      FA  GNL++L  K  +   +   V    GI     V +S+  +
Sbjct: 258 QSASLMFLVTRASALQHFAVSGNLAELTKKFNSFGIVIYTVSTALGIAYTYYV-ASLTAQ 316

Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFP 310
           ++        ++      M       LN    ++I+  +VK G +     L  RE   F 
Sbjct: 317 LVTVLFCCGANLVLSYMSMCNIAFRVLNGTTISVILRAYVKEGCLGPGHVLSPREVSDFI 376

Query: 311 GRLIEDAGN 319
           G  + D G 
Sbjct: 377 GLRMLDVGT 385


>gi|407034154|gb|EKE37096.1| hypothetical protein ENU1_209440 [Entamoeba nuttalli P19]
          Length = 400

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 58/269 (21%), Positives = 116/269 (43%), Gaps = 16/269 (5%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKD- 130
           P GYP SV  GY R+  F  +    +  + V+S      +A LR  P  +     + ++ 
Sbjct: 51  PVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSV-----SAILRINPGHSAAFFVVFREV 105

Query: 131 --GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNF 188
             GM H+  +I    G  +    K+WR+  + + +L   +E+     P +F  ++   + 
Sbjct: 106 MTGMMHL--IITERWGTSIVFFAKQWRMRIELISELLRIIEIAFISTP-IFYSISVTCSV 162

Query: 189 AKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQ 248
            +G+   + +  +  I +++AK  N+++L  K + + TLF VV L  G  +   +    +
Sbjct: 163 IEGVLQASRQVIKTRILNNYAKGNNVAELTEKVQNLETLFRVVSLLIGWVILHVLGDFTE 222

Query: 249 GKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-GRISSPADLRYREDL 307
             +IV     + H              TLN +R  ++++ F+     I SP  +   E  
Sbjct: 223 SGIIVFICALISHCICNFLLSNVIVFKTLNYERLTILMSYFMNIKNEILSPVVVSRIESK 282

Query: 308 LFPGRLIEDAGNVKVGRDPHKVVKPSKFH 336
           +    ++    N+++G   + + K S  +
Sbjct: 283 IKVNEMV----NIRMGISLNSIPKSSDLY 307


>gi|302678497|ref|XP_003028931.1| hypothetical protein SCHCODRAFT_40050 [Schizophyllum commune H4-8]
 gi|300102620|gb|EFI94028.1| hypothetical protein SCHCODRAFT_40050 [Schizophyllum commune H4-8]
          Length = 278

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 74/273 (27%), Positives = 121/273 (44%), Gaps = 16/273 (5%)

Query: 63  VDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT 122
           VD     F P+G+P SV+  YLRY  + A+Q F S+   +LS+++LL   G+    A AT
Sbjct: 1   VDVLAKVFLPAGFPNSVSPDYLRYQIYNAIQAFCSSLAGLLSSRALLEGFGVGDANATAT 60

Query: 123 --VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-- 177
             V+  + +D    +  +I +  LG  +  E K +R LAD   D     +  +P+     
Sbjct: 61  NAVLLTVAQDVFSRLTTIIAAYLLGPSLFPEAKTFRFLADVFNDTAIIFDASTPILRASG 120

Query: 178 ------LFLEMAGLGNFAKGIAVVAARAT-RLPIYSSFAKEGNLSDLFAKGEAISTLFNV 230
                   L +AG      G     A+A   L   +     G++ D+ AK  +  TL  +
Sbjct: 121 HANLRVAALCLAGASRAICGTICGGAKAALTLHFATPLRGTGDIGDVNAKDASKETLLAL 180

Query: 231 VGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV 290
           +G+ AG  +   + SS Q    +   L   H+      +R   + TLN QR ++    F 
Sbjct: 181 MGMLAGSFVVHHL-SSTQATHALLLALLAAHLTVNYLGVRGVCLRTLNRQRASLAWTSFR 239

Query: 291 KTGRISSPADLRYREDLLF-PGRLIE--DAGNV 320
           + G   SP  +   E +L  PG ++   D G++
Sbjct: 240 EYGSAPSPTIVAGMEGILARPGAIVHAGDGGHI 272


>gi|303288586|ref|XP_003063581.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454649|gb|EEH51954.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 290

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 49/88 (55%), Gaps = 4/88 (4%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ---ATVVSW 126
           F P+GYP +V+  Y R+ ++    +F    L VL++QSLL A G+  TP     A    W
Sbjct: 2   FLPAGYPDTVSSDYARWLRWHLFSNFFRNVLEVLTSQSLLVALGMGSTPGALPLAAATKW 61

Query: 127 ILKDGMQHVGKLICSNLGA-RMDSEPKR 153
           +LKDG+     L+   +GA R D +PKR
Sbjct: 62  VLKDGVGSFATLLSGVVGAQRYDEDPKR 89


>gi|159124134|gb|EDP49252.1| DUF647 domain protein [Aspergillus fumigatus A1163]
          Length = 542

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 61/251 (24%), Positives = 111/251 (44%), Gaps = 36/251 (14%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS 125
            ++ F PSGYP SV + YL       LQ       +   T +LL                
Sbjct: 70  LVDVFLPSGYPNSVTDDYLPAV----LQGVGVGNANASPTSALLL--------------- 110

Query: 126 WILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
            IL+D    +  ++ ++ +G  ++ E K +R+ AD   DL   L+ LSP+ P      AG
Sbjct: 111 HILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMVLDCLSPMIP------AG 164

Query: 185 LGNFA--------KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
           +G           + +  VA  +++  + + FA+ GNL+++ AK  +  T+ +++G+  G
Sbjct: 165 VGRVGVLSAAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLLGMLVG 224

Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
             + S V +S          L  +H+      +R+  + +LN QR  ++ +  +++    
Sbjct: 225 SFIVSHV-TSFAATWTSLVFLLTVHLAMNYAAVRSVQMTSLNRQRANIVFSTLLESDPAL 283

Query: 297 SPADL-RYRED 306
            P  L R R D
Sbjct: 284 DPHRLSRSRVD 294


>gi|70991459|ref|XP_750578.1| DUF647 domain protein [Aspergillus fumigatus Af293]
 gi|66848211|gb|EAL88540.1| DUF647 domain protein [Aspergillus fumigatus Af293]
          Length = 542

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 61/251 (24%), Positives = 111/251 (44%), Gaps = 36/251 (14%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVS 125
            ++ F PSGYP SV + YL       LQ       +   T +LL                
Sbjct: 70  LVDVFLPSGYPNSVTDDYLPAV----LQGVGVGNANASPTSALLL--------------- 110

Query: 126 WILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
            IL+D    +  ++ ++ +G  ++ E K +R+ AD   DL   L+ LSP+ P      AG
Sbjct: 111 HILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMVLDCLSPMIP------AG 164

Query: 185 LGNFA--------KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAG 236
           +G           + +  VA  +++  + + FA+ GNL+++ AK  +  T+ +++G+  G
Sbjct: 165 VGRVGVLSAAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLLGMLVG 224

Query: 237 IQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRIS 296
             + S V +S          L  +H+      +R+  + +LN QR  ++ +  +++    
Sbjct: 225 SFIVSHV-TSFAATWTSLVFLLTVHLAMNYAAVRSVQMTSLNRQRANIVFSTLLESDPAL 283

Query: 297 SPADL-RYRED 306
            P  L R R D
Sbjct: 284 DPHRLSRSRVD 294


>gi|303281718|ref|XP_003060151.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458806|gb|EEH56103.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 814

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 60/120 (50%), Gaps = 3/120 (2%)

Query: 57  PVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL-- 114
           P+    + +    F P GYP SV+  YL +    A Q   S A  +L + +LL  AG+  
Sbjct: 224 PLLRACLRALRRAFLPEGYPRSVSSDYLAFQLCDAAQGVCSYARGILCSTALLRGAGVGS 283

Query: 115 RPTPAQATVVSWILKDGMQHVGKL-ICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP 173
               A +    ++ +D    +G +   +  G  MD++ KRWR+ ADA+ DLG  +E+ SP
Sbjct: 284 STATAASATAQFVARDLTSMLGGVAFAATRGRAMDADAKRWRLFADAMNDLGMAIELASP 343



 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 31/117 (26%), Positives = 51/117 (43%), Gaps = 17/117 (14%)

Query: 179 FLEMAGLGNFAKGIAVVAARATRLPIYSSFAK-----------------EGNLSDLFAKG 221
           FL  A LG  A+ +  V +RATR  +   FA                   G+++D+ AK 
Sbjct: 410 FLVAACLGGLARSLCGVTSRATRAALTQHFAAGDDAGRSFTRGRTGGSTTGSVADVDAKE 469

Query: 222 EAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLN 278
               T   ++G+GAG+ +                LL+ +HV+  +  MR+  +NT+N
Sbjct: 470 GTQETAATLIGMGAGVLVTRLAADDAAAIWFWFALLTALHVFFNVRAMRSLRLNTMN 526


>gi|350630832|gb|EHA19204.1| hypothetical protein ASPNIDRAFT_186948 [Aspergillus niger ATCC
           1015]
          Length = 504

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 58/237 (24%), Positives = 111/237 (46%), Gaps = 29/237 (12%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA 121
           V D  +  F P+GYP SV + Y               A +VL    +   A   PT   A
Sbjct: 68  VQDFLVEVFLPAGYPHSVTDDY---------------APAVLQGVGV-GNADASPT---A 108

Query: 122 TVVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ--- 177
            ++  IL+D    +  ++ ++ +G  ++ E K +R+ AD   DL   L+ LSP+ P    
Sbjct: 109 ALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMTLDCLSPMIPAGAP 168

Query: 178 --LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGA 235
               L  AG+    + +  VA  +++  + + F++ GNL+++ AK  +  T+ +++G+  
Sbjct: 169 RVTVLSTAGV---LRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLV 225

Query: 236 GIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT 292
           G  + S V ++     I   +L  +H+      +R+  + +LN QR  ++ +  + T
Sbjct: 226 GSFVVSRV-TNFSTTWICLLMLQALHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNT 281


>gi|390341857|ref|XP_003725545.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1
           [Strongylocentrotus purpuratus]
          Length = 311

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 53/233 (22%), Positives = 107/233 (45%), Gaps = 8/233 (3%)

Query: 161 LYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK 220
           + D    +E++S   PQ F+ +A + +  K +  VA  ATR  +    A+  N++D+ AK
Sbjct: 1   MNDAAMCIELISVFFPQYFVIIACVSSLFKSMVGVAGGATRAALTMHQARRNNMADVSAK 60

Query: 221 GEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQ 280
             +  TL N+  L  G+ +      ++     +  L + +H+Y+    +    + TLN  
Sbjct: 61  DGSQETLVNLAALFVGLIITPIASKNILLTWSLFFLCTCLHLYANYRAVTCVVMETLNQN 120

Query: 281 RTAMIVADFVKTGR--ISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHEL 338
           R  ++V D++ + +  +S P  +  RE +++    +     + +G     V K  +  E+
Sbjct: 121 RFHILVQDYLHSSQFSMSGPDAVNSREPVIWR---LSRPLKLNLGVPFRTVAKSMRDIEM 177

Query: 339 K-ELFAEEKFILCLGSR--WTDMVLEQNASGEDALRGWLVAAYAASMAKSFHD 388
             +   +EKF+L L  +    D+V+  +++ ED ++  L A          HD
Sbjct: 178 CLQSNGQEKFLLSLDVKKGCIDIVIHHDSTSEDFIKACLQAEIIMYTTFHLHD 230


>gi|384494915|gb|EIE85406.1| hypothetical protein RO3G_10116 [Rhizopus delemar RA 99-880]
          Length = 188

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 51/92 (55%), Gaps = 3/92 (3%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P GYP SV+  Y ++  +  L+ +  +A+ VL +Q++L + GL    A   A  + W+
Sbjct: 58  FLPVGYPESVHSCYKKFHSWLFLETYVGSAVGVLCSQAMLASLGLGTVEATGGAVAIQWV 117

Query: 128 LKDGMQHVGKL-ICSNLGARMDSEPKRWRILA 158
           LKDG+  +GKL       +  DS PK W+ ++
Sbjct: 118 LKDGIGEIGKLFFIKKYASTFDSHPKTWKFVS 149


>gi|340059715|emb|CCC54109.1| conserved hypothetical protein, fragment, partial [Trypanosoma
           vivax Y486]
          Length = 618

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 54/227 (23%), Positives = 96/227 (42%), Gaps = 7/227 (3%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
           PSGYP S   G+ RY          S+  S +  QS+L    L  +P       W+LKD 
Sbjct: 183 PSGYPHSCGVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLASSPQL-----WMLKDL 237

Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
           +  +     +N     ++ PK W +++  +++L    E++ P L PQ  L  A   +  +
Sbjct: 238 VPALAAAYLANRIVSYENRPKFWFVVSVGMHNLSVIAEMIVPSLVPQHLLFAAVTTSCIR 297

Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
             A +    TR      FA   NL++L  K  +   +   V    GI   S V +S+  +
Sbjct: 298 QSASLMFLVTRASALQHFAISNNLAELTKKFNSFGIVIYTVATALGIAFTSVV-TSLTAQ 356

Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISS 297
           +I   +   +++    + M       LN    ++I+ ++++ G   S
Sbjct: 357 LITVLMCCAVNLIISHQSMCKIAFRMLNGTTISVILHEYIQNGSTCS 403


>gi|119467992|ref|XP_001257802.1| hypothetical protein NFIA_052500 [Neosartorya fischeri NRRL 181]
 gi|119405954|gb|EAW15905.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 515

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 52/223 (23%), Positives = 111/223 (49%), Gaps = 20/223 (8%)

Query: 91  ALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARM 147
           +LQ F S+   +LS++++L   G+    A  T  ++  IL+D    +  ++ ++ +G  +
Sbjct: 86  SLQAFCSSIAGLLSSRAVLQGVGVGNANASPTSALLLHILQDSSGRIATILFAHRVGTAL 145

Query: 148 DSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFA--------KGIAVVAARA 199
           + E K +R+ AD   D+   L+ LSP+ P      AG+G           + +  VA  +
Sbjct: 146 EPECKMYRLAADVFNDVAMVLDCLSPMIP------AGVGRVGVLSAAGVLRALCGVAGGS 199

Query: 200 TRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTV-CSSMQGKMIVGPLLS 258
           ++  + + FA+ GNL+++ AK  +  T+ +++G+  G  + S V C +     ++   L 
Sbjct: 200 SKASLSAHFARGGNLAEVNAKDSSQETIISLIGMLVGSFIVSHVTCFAATWASLL--FLL 257

Query: 259 VIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADL 301
            +H+      +R+  + +LN QR  ++++  +++     P  L
Sbjct: 258 TVHLGMNYAAVRSVQMTSLNRQRANIVLSTLLESDLALDPRSL 300


>gi|414876406|tpg|DAA53537.1| TPA: hypothetical protein ZEAMMB73_477539 [Zea mays]
          Length = 295

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 67/274 (24%), Positives = 117/274 (42%), Gaps = 14/274 (5%)

Query: 158 ADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDL 217
           AD +   G+  E+ + L P  FL +A LGN AK IA      +   I + FA+ GNL ++
Sbjct: 3   ADFIGSAGSIFELSTTLYPGYFLPLASLGNLAKAIARGFKDPSFRVIQNHFAESGNLGEV 62

Query: 218 FAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSV--IHVYSVIEEMRAAPVN 275
            AK E       ++GL  G+ +            +V   LSV  +H++   + +      
Sbjct: 63  AAKEEVWEVGAQLLGLSIGVLIMDKTGVKSSYLTLVLTWLSVRLLHLWLRYQSISVLKFR 122

Query: 276 TLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSK- 334
           T+N +R  ++V   V    +         E++L   R +     V  G    +++   K 
Sbjct: 123 TINLKRGRILVKSHVAQHTVPGYVACNEEENILTWERYLRP--QVSFGVPMERMLGGDKS 180

Query: 335 ---FHELKELFAEEKFILC---LGSRWTDMVL--EQNASGEDALRGWLVAAYAASMAKSF 386
               + L +L+  EK+IL     GSR    ++  +++A+    LR  L  A+     +  
Sbjct: 181 SDMVNRLLKLYKNEKYILFFEQFGSREPTFLVTFKESATSMSVLRS-LWQAHWLHKNQLR 239

Query: 387 HDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTD 420
            D     L+++   + + FT  L +L + GW  D
Sbjct: 240 QDEVFPWLEESIVALENGFTDFLEQLGSAGWDQD 273


>gi|17945727|gb|AAL48912.1| RE31721p [Drosophila melanogaster]
          Length = 266

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 57/257 (22%), Positives = 111/257 (43%), Gaps = 16/257 (6%)

Query: 167 GLEV-LSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIS 225
           G+E+ + P  P    ++       K I  VA  ATR  +    A  GNL+D+ +K  +  
Sbjct: 2   GIEIYVLPKYPHFSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQE 61

Query: 226 TLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
           T  N+V    G+ L S + S      I   ++S +H+Y+ ++ +RA  + + N  R  + 
Sbjct: 62  TCVNLVASFVGLYLLSLIKSQAVLYTIFYVVVS-LHLYANLKAVRAVCLRSFNESRYLIA 120

Query: 286 VADFVKTGRISSPADLRYREDLLFPGRLIEDAGNVKVGRDPHKVVKPSKFHELKELFA-- 343
           + +F ++ R+ SP  +   E +   G+ +  + N+K+G     ++   K   + E     
Sbjct: 121 LEEFFRSSRMLSPQQVNAMERVTV-GQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSS 179

Query: 344 ---EEKFILCLGSRWTDMVLEQNASGEDALRGWLVAAYAASMAKSFHDPSLTVLQDAYDK 400
               E FI+    +   + L      +D L+ +    +A S  +  +      L++ Y  
Sbjct: 180 FDPHEHFIIAQTKKCLGVYLHFETRPQDVLKAYF---FAVSYLQDRNQ-----LKEKYWD 231

Query: 401 MNDVFTPLLSELQAKGW 417
           +   +   L+  Q +GW
Sbjct: 232 IQTKWQEFLALAQQEGW 248


>gi|242208267|ref|XP_002469985.1| predicted protein [Postia placenta Mad-698-R]
 gi|242226469|ref|XP_002477622.1| predicted protein [Postia placenta Mad-698-R]
 gi|220722555|gb|EED77179.1| predicted protein [Postia placenta Mad-698-R]
 gi|220731047|gb|EED84896.1| predicted protein [Postia placenta Mad-698-R]
          Length = 226

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 54/225 (24%), Positives = 109/225 (48%), Gaps = 12/225 (5%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQAT--VVS 125
           F P+G+P +V+ G  +Y  + ALQ F S+   +++++++L  F  G+    A AT  +  
Sbjct: 5   FLPAGFPATVSPG--KYQIYNALQAFCSSLAGLIASRAVLQGFRHGVGKAGASATHAIFL 62

Query: 126 WILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
            +L+D    +  ++    LG  +  E K +R++AD   D    L+ LSP      +    
Sbjct: 63  TVLQDIFSRLTTIVAGYYLGTSLFPEAKAYRLMADIFNDAAIILDTLSPHLGSFRVASLC 122

Query: 185 LGNFAKGIAVVAARATRLPIYSSFAK----EGNLSDLFAKGEAISTLFNVVGLGAGIQLA 240
           L    + +  V A  ++  +   FA      G++ DL AK  +  T+  ++G+ +G  + 
Sbjct: 123 LSGAFRALCGVVAGGSKAALTVHFATTAKGAGDVGDLSAKDGSKETVLALLGMLSGTAVM 182

Query: 241 STVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMI 285
             V S+    +++  L++ +H++     +R     +LN QR +++
Sbjct: 183 HYVHSTRATYVVLFALIA-LHLFCNYLAVRVVVFRSLNRQRASIV 226


>gi|390341859|ref|XP_795645.3| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2
           [Strongylocentrotus purpuratus]
          Length = 311

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 55/239 (23%), Positives = 108/239 (45%), Gaps = 20/239 (8%)

Query: 161 LYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK 220
           + D    +E++S   PQ F+ +A + +  K +  VA  ATR  +    A+  N++D+ AK
Sbjct: 1   MNDAAMCIELISVFFPQYFVIIACVSSLFKSMVGVAGGATRAALTMHQARRNNMADVSAK 60

Query: 221 GEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQ 280
             +  TL N+  L  G+ +      ++     +  L + +H+Y+    +    + TLN  
Sbjct: 61  DGSQETLVNLAALFVGLIITPIASKNILLTWSLFFLCTCLHLYANYRAVTCVVMETLNQN 120

Query: 281 RTAMIVADFVKTGR--ISSPADLRYREDLLF----PGRLIEDAGNVKVGRDPHKVVKPSK 334
           R  ++V D++ + +  +S P  +  RE +++    P +L     N+ V   P + V  S 
Sbjct: 121 RFHILVQDYLHSSQFSMSGPDAVNSREPVIWRLSRPLKL-----NLGV---PFRTVAQS- 171

Query: 335 FHELKELF---AEEKFILCLGSR--WTDMVLEQNASGEDALRGWLVAAYAASMAKSFHD 388
             +L+       +EKF+L    +    D+ +  +++ ED ++  L A          HD
Sbjct: 172 MRDLEMCLQSNGQEKFLLSFDVKKGCIDIAIHHDSTSEDFIKACLQAEIIMYTTFHLHD 230


>gi|115389162|ref|XP_001212086.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114194482|gb|EAU36182.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 411

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 62/115 (53%), Gaps = 5/115 (4%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQAT 122
            +  F P+GYP SV + Y  Y  F +LQ F+S+   +LS++++L   G+     +P  A 
Sbjct: 57  LIEVFLPAGYPHSVTDDYTAYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNPNASPTSAL 116

Query: 123 VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCP 176
           ++  IL+D    +  ++ ++ +G  +  E K +R+ AD   DL   L+ LSP  P
Sbjct: 117 LLH-ILQDTSGRLATILFAHRVGTALQPECKSYRLAADLFNDLAMILDCLSPAAP 170


>gi|429329557|gb|AFZ81316.1| hypothetical protein BEWA_007250 [Babesia equi]
          Length = 743

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 57/299 (19%), Positives = 125/299 (41%), Gaps = 48/299 (16%)

Query: 32  SDSVTRHYQFQPDGHLSVKVVDDS-------RPVYHRVVDSFLNKFFPSG-----YPFSV 79
           S S +R+   +PD  +  + ++ +       R  ++R++ +F   F         +P   
Sbjct: 122 SSSDSRNNSAKPDPTIVSRGINQNYGRKLSVRSFFNRIIGAF--NFLKRCINIIIHPSVA 179

Query: 80  NEGYLRYTQFRALQHFTSAALSVLSTQS------------------LLFAAGLRPTPAQA 121
            + Y +Y  +R  + F+ + L  +++                    L+    +  +  ++
Sbjct: 180 TKQYYKYAHWRMFERFSFSVLQTMASNIKSAPTNLQQNKVENTNIILMLKNKISESKMKS 239

Query: 122 TVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLC---PQ 177
            + + I KD    +   +  S +G   D+ PK +R+L   L    T ++ +  +    P+
Sbjct: 240 ILATMIFKDAFSRIFHFLWFSEIGVGFDNNPKAFRLLGSILCSTATLIDFVCNVFTFGPK 299

Query: 178 LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG---NLSDLFAKGEAISTLFNVVGLG 234
           +   M    N  + I ++   A++ P Y+SF  +    N+ ++ AK EA + + +  G+ 
Sbjct: 300 MI--MGACTNAVRQIGLLTMSASQGPFYNSFHVKNTATNIGEITAKLEAQNPICDFSGIA 357

Query: 235 AGIQLASTVCSS----MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
            GI L + +           +I+  L+S +  Y  +   ++     LNPQR  +++ DF
Sbjct: 358 LGIYLTNLLNEQPFVIQATTVILTSLISNVATYMCV---KSVSFKNLNPQRCFVVLEDF 413


>gi|255945167|ref|XP_002563351.1| Pc20g08280 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211588086|emb|CAP86157.1| Pc20g08280 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 504

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 52/199 (26%), Positives = 88/199 (44%), Gaps = 16/199 (8%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYL-----------RYTQFRALQHFTSAALSVLSTQSLLF 110
           ++ S  + F PSGYP SV++ YL             +    L   T  A  +  TQ    
Sbjct: 61  IISSLSDVFLPSGYPQSVSDDYLPPPLLPSGSSRSISHPAQLHRPTQRAKQLTPTQHTGV 120

Query: 111 AAGLRPTPAQATVVSWILKDGMQHVGKLIC--SNLGARMDSEPKRWRILADALYDLGTGL 168
             G       + ++  IL+D    +   IC    +G  ++ E K +R+ AD   D+   L
Sbjct: 121 GVGNANASPTSALLLHILQDISGRIAT-ICFAHRIGTALEPECKTYRLAADVFNDIAMIL 179

Query: 169 EVLSPLCPQLFLEMAGLGNFA--KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIST 226
           + LSP  P     +A L      + +  VA  +++  + + FAK GNL++L AK  +  T
Sbjct: 180 DCLSPGVPAGPARVAVLSTAGVLRALCGVAGGSSKASLSAHFAKWGNLAELNAKDSSQET 239

Query: 227 LFNVVGLGAGIQLASTVCS 245
           + ++ G+  G  + S + S
Sbjct: 240 VISLFGMLVGSVVISHITS 258


>gi|159490370|ref|XP_001703152.1| hypothetical protein CHLREDRAFT_195109 [Chlamydomonas reinhardtii]
 gi|158270782|gb|EDO96617.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 280

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 48/77 (62%), Gaps = 2/77 (2%)

Query: 83  YLRYTQFRALQHFTSAALSVLSTQSLLFAAGL--RPTPAQATVVSWILKDGMQHVGKLIC 140
           Y  Y  +R +Q+F   A+SV +TQSLL A G+  R +   A  ++W++KDG   +G+L+ 
Sbjct: 11  YAPYMAWRGVQYFFGGAISVFTTQSLLGALGVAGRYSGEAAAAINWVIKDGAGRLGRLLF 70

Query: 141 SNLGARMDSEPKRWRIL 157
           +  G  +D E K++R++
Sbjct: 71  ARWGRELDCELKQFRLM 87


>gi|301123209|ref|XP_002909331.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262100093|gb|EEY58145.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 122

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/119 (35%), Positives = 60/119 (50%), Gaps = 26/119 (21%)

Query: 106 QSLLFAAGLR----PTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADAL 161
           QSLL+ AGL     PT   A  V+ +L+DG+                    +WRI +   
Sbjct: 2   QSLLYDAGLVAGAIPT---AAAVNRVLRDGL-------------------GQWRITSSVA 39

Query: 162 YDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAK 220
            D+    E+L+PL P  FL +A L N AK +A ++A ATR   ++SF    NL+D+ AK
Sbjct: 40  LDVAVLGEILTPLAPGSFLVIASLTNVAKNVAWLSASATRAGFHNSFVIRENLADVTAK 98


>gi|440299036|gb|ELP91648.1| hypothetical protein EIN_206450 [Entamoeba invadens IP1]
          Length = 360

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 59/271 (21%), Positives = 111/271 (40%), Gaps = 17/271 (6%)

Query: 30  ETSDSVTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQF 89
           E+  S +R Y F  D +         +P +    +   +   P GYP +V  GY R+  F
Sbjct: 18  ESKGSWSRRYIFDEDYN---------QPPFLTTRNHLQDIIVPVGYPTTVERGYFRFVAF 68

Query: 90  RALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDGMQHVGKLICSNL-GARMD 148
                  +  L V+S      +A LR   + +     I ++ M     LI +   G+ + 
Sbjct: 69  SVAMESCNMMLLVMS-----ISASLRIDSSHSAAFFVIFREVMAGTMHLIITERWGSSIV 123

Query: 149 SEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSF 208
              K+WR+  + + +L   +++     P +F  +    +  + +   + +  +  I S++
Sbjct: 124 FYAKQWRMRIEIISELLRLIQIAFIASP-IFYSINVTCSTIESVLNASRQVIKTKILSNY 182

Query: 209 AKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEE 268
           AKE N+++L  K + I TL  V+ L  G  +   +    +  +IV  L  + HV      
Sbjct: 183 AKENNVAELTEKAQNIETLARVISLLIGFIVLHLLGDFTEHGIIVFMLALIAHVVCNFLM 242

Query: 269 MRAAPVNTLNPQRTAMIVADFVKT-GRISSP 298
            R      +N +R  +++  F      I SP
Sbjct: 243 GRVIVFKAINYERLMILMKYFTTVKNEILSP 273


>gi|215259607|gb|ACJ64295.1| conserved hypothetical protein [Culex tarsalis]
          Length = 234

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 55/242 (22%), Positives = 110/242 (45%), Gaps = 20/242 (8%)

Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLL- 257
           ATR  +    A  GNL+D+ +K  A  T  N++    G+ L + V      +++ G  L 
Sbjct: 3   ATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLGLLTHV---QNQRVLYGLFLF 59

Query: 258 -SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
            +++H+Y+ I+ ++A  + T N  R  + + ++ K+G + SP  +   E +   G+ +  
Sbjct: 60  VTLLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGSMLSPQQVNKLERVTI-GQTVSL 118

Query: 317 AGNVKVGRDPHKVVKPSK-FHELKELFA----EEKFILCLGSRWTDMVLEQNASGEDALR 371
              VK+G    ++ +  +  ++L+ + A     EKF++     +  + L       D ++
Sbjct: 119 TARVKMGCSARELAQYYRNCYDLENVIACFDSREKFLIAETRNYVGVYLHFTVKPLDIIK 178

Query: 372 GWLVAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDGTGT-RF 430
            +    Y AS      D S   L+D + ++ + +   L+  Q +GW+    L  T   R 
Sbjct: 179 SYF---YVASY---LQDKS--QLRDRFWEIQNKWNEFLNLAQREGWNVQAHLLKTDEYRL 230

Query: 431 AW 432
            W
Sbjct: 231 DW 232


>gi|215259609|gb|ACJ64296.1| conserved hypothetical protein [Culex tarsalis]
          Length = 234

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 56/248 (22%), Positives = 108/248 (43%), Gaps = 32/248 (12%)

Query: 199 ATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLL- 257
           ATR  +    A  GNL+D+ +K  A  T  N++    G+ L + V      +++ G  L 
Sbjct: 3   ATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLGLLTHV---QNHRVLYGLFLF 59

Query: 258 -SVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
            +++H+Y  I+ ++A  + T N  R  + + ++ K+G + SP  +   E +   G+ +  
Sbjct: 60  LTLLHIYPNIKAVQAVCLRTFNEARYLIALEEYFKSGSMLSPQQVNKLERVTI-GQTVSL 118

Query: 317 AGNVKVGRDPHKVVKPSKFHELKELFAEEKFILCLGSRWTDMVLE-QNASGE-------- 367
              VK+G    ++      H  +  +  E  I C  SR   ++ E +N  G         
Sbjct: 119 TARVKMGCSARELA-----HYYRNCYDLENVIACFDSREKFLIAETRNYIGVYLHFTVKP 173

Query: 368 -DALRGWL-VAAYAASMAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAKGWHTDRFLDG 425
            D ++ +  VA+Y    ++         L+D + ++ + +   L+  Q +GW+    L  
Sbjct: 174 LDIIKSYFYVASYLQDKSQ---------LRDRFWEIQNKWNEFLNLAQREGWNVQAHLLK 224

Query: 426 TGT-RFAW 432
           T   R  W
Sbjct: 225 TDEYRLDW 232


>gi|422292653|gb|EKU19955.1| hypothetical protein NGA_2040500, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 181

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 58/114 (50%)

Query: 195 VAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVG 254
           +AA A+   I    A+  NL+++ AKG A  T  ++ GLG G+  A     S +      
Sbjct: 2   IAAGASNGNIVEHMARTNNLAEVLAKGGAQHTAVSLFGLGFGMWFARVANQSPRRVWTAY 61

Query: 255 PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
            LL+VIH+ +    MR     ++N +R  +++  F +   + SPA++  RE ++
Sbjct: 62  TLLTVIHLTANYAAMRVLAFTSINRRRLNVLLTAFRENASVLSPAEVARRETII 115


>gi|325088852|gb|EGC42162.1| DUF647 domain-containing protein [Ajellomyces capsulatus H88]
          Length = 507

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 65/314 (20%), Positives = 129/314 (41%), Gaps = 55/314 (17%)

Query: 127 ILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAG 184
           +L+D M  +  ++ ++ LG  ++ E K +R+ AD L D     + LSP  P+ L + +  
Sbjct: 48  VLQDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSALIFDCLSPALPKRLRVVVLA 107

Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDL----------------------FAKGE 222
             +  + +  VAA +++  + S FAK GNL +L                      F   +
Sbjct: 108 FSSVLRALCGVAAGSSKASLSSHFAKCGNLGELNAVRFPNVLVIAISGAAQRGFPFLFSD 167

Query: 223 AISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLS--------VIHVYSVIEEMRAAPV 274
           A   L         I L   +C S+    I  P  +        ++H+ +    +R+  +
Sbjct: 168 ADQKLQKDSSQETVISLLGILCGSVVVSHISTPFATWTTLLLLLLVHLSTNYAAVRSVNM 227

Query: 275 NTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIE------DAGNVKVGRDPHK 328
            TLN QRT ++ +   + G + +P +    E +     ++         G  ++G    +
Sbjct: 228 TTLNRQRTNIVFSTLFEKGSVLTPTEASKCERIFERDGILRWKASSATLGYCQIGGSFQE 287

Query: 329 VVKPS------------KFHELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGWL 374
           +++ S              ++L  LF +E+++L      +   +VL+ N +    L+ W 
Sbjct: 288 LLRGSIHGANSIRDVGIDIYKLLRLFEKEEYVLWFNPIHKKGTIVLKNNVTPISQLQAW- 346

Query: 375 VAAYAASMAKSFHD 388
             ++A  +AK   D
Sbjct: 347 --SHALRVAKRLTD 358


>gi|407850424|gb|EKG04824.1| hypothetical protein TCSYLVIO_004113 [Trypanosoma cruzi]
          Length = 631

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 62/270 (22%), Positives = 112/270 (41%), Gaps = 29/270 (10%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
           P GYP S   G+ RY          S+  S +  QS+L    L  +P       W++KD 
Sbjct: 164 PRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLGSSPQL-----WMMKDL 218

Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
           +  +     +N     ++ PK W  ++  L +L    E+L P L PQ  L  A + + A+
Sbjct: 219 LPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLLLGAIMTSCAR 278

Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAK----GEAISTLFNVVGLGAGIQLASTVCSS 246
             A +    +R      FA   NL++L  K    G  I T+F   G+     + ++V +S
Sbjct: 279 QSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMMIYTIFTAFGI-----MYTSVVTS 333

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTG-----RISSPADL 301
           +  ++        +++    + M       LN    ++++ D+V  G      I +P ++
Sbjct: 334 LVAQLATVLFCCALNLVLSYQSMSKIAFRILNATTLSVVLRDYVGRGGIHRRHIPTPQEV 393

Query: 302 RYREDLLFPGRLIEDAGNVKVGRDPHKVVK 331
             R  L    R++++       RDP+   +
Sbjct: 394 SDRIGL----RMLDEK-----ARDPNDCTR 414


>gi|74026006|ref|XP_829569.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70834955|gb|EAN80457.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 595

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 67/263 (25%), Positives = 105/263 (39%), Gaps = 21/263 (7%)

Query: 71  FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKD 130
            P+GYP S  EG+ RY        F S+  S +  QS+L    L  +P       W+LKD
Sbjct: 166 MPAGYPDSCGEGFRRYFLLSLCSSFVSSFASSIGYQSILNGFLLASSPQL-----WMLKD 220

Query: 131 GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFA 189
               +     +N     ++ PK W + + AL++L    E++ P   P   L  A L +  
Sbjct: 221 LAPALAAAYLANRVISYENRPKFWFVFSVALHNLSVIAEMIIPTTVPNHLLAAAILTSCV 280

Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
           +  A +    TR      FA   NL++L  K  +   +   V    GI   S V S    
Sbjct: 281 RQSASLMFLVTRASALQHFAISNNLAELTKKFNSFGMVIYTVSTALGIAYTSLVPSVTAQ 340

Query: 250 ----KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-----GRISSPAD 300
                M  G  L++ H+      M       LN    ++I+  +++      GR+ SP +
Sbjct: 341 LATVLMCCGANLAISHL-----SMCNIAFRILNETTLSVILRFYMREEGHQRGRVLSPRE 395

Query: 301 LRYREDLLFPGRLIEDAGNVKVG 323
           +     L    R   D GN ++G
Sbjct: 396 VSDAIGLRMIDRGRSD-GNDRIG 417


>gi|261335583|emb|CBH18577.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 595

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 67/263 (25%), Positives = 105/263 (39%), Gaps = 21/263 (7%)

Query: 71  FPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKD 130
            P+GYP S  EG+ RY        F S+  S +  QS+L    L  +P       W+LKD
Sbjct: 166 MPAGYPDSCGEGFRRYFLLSLCSSFVSSFASSIGYQSILNGFLLASSPQL-----WMLKD 220

Query: 131 GMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFA 189
               +     +N     ++ PK W + + AL++L    E++ P   P   L  A L +  
Sbjct: 221 LAPALAAAYLANRVISYENRPKFWFVFSVALHNLSVIAEMIIPTTVPNHLLAAAILTSCV 280

Query: 190 KGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQG 249
           +  A +    TR      FA   NL++L  K  +   +   V    GI   S V S    
Sbjct: 281 RQSASLMFLVTRASALQHFAISNNLAELTKKFNSFGMVIYTVSTALGIAYTSLVPSVTAQ 340

Query: 250 ----KMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKT-----GRISSPAD 300
                M  G  L++ H+      M       LN    ++I+  +++      GR+ SP +
Sbjct: 341 LATVLMCCGANLAISHL-----SMCNIAFRILNETTLSVILRFYMREEGHQRGRVLSPRE 395

Query: 301 LRYREDLLFPGRLIEDAGNVKVG 323
           +     L    R   D GN ++G
Sbjct: 396 VSDAIGLRMIDRGRSD-GNDRIG 417


>gi|397621626|gb|EJK66381.1| hypothetical protein THAOC_12704 [Thalassiosira oceanica]
          Length = 480

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 72/161 (44%), Gaps = 16/161 (9%)

Query: 66  FLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQA--TV 123
           F + F P+G+P +   GYL Y     +Q   S    V+ST + L AAG+  + A A    
Sbjct: 57  FGDLFLPAGHPSTTAGGYLHYQVCDGVQGLCSYLRGVVSTGAALAAAGVGDSDATAMGAA 116

Query: 124 VSWILKDGMQHVGKLICSNLG-ARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEM 182
           V+W ++DG+  +G L  S    ++ DS  K +R+LAD   D G  L++  P         
Sbjct: 117 VTWAVRDGLGMLGGLAFSYYASSKFDSRVKEYRLLADVANDAGMILDMARP--------- 167

Query: 183 AGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEA 223
               + A+G  V A      P     A +G L D+    E 
Sbjct: 168 ----SDARGGLVAATPGVPRPRLGQRALQGRLRDVRGSNEG 204


>gi|407411065|gb|EKF33281.1| hypothetical protein MOQ_002854, partial [Trypanosoma cruzi
           marinkellei]
          Length = 636

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 55/229 (24%), Positives = 96/229 (41%), Gaps = 15/229 (6%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
           P GYP S   G+ RY          S+  S +  QS+L    L  +P       W++KD 
Sbjct: 170 PRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLGSSPQL-----WMMKDL 224

Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
           +  +     +N     ++ PK W  ++  L +L    E+L P L PQ  L  A + + A+
Sbjct: 225 LPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLLLGAIMTSCAR 284

Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAK----GEAISTLFNVVGLGAGIQLASTVCSS 246
             A +    +R      FA   NL++L  K    G  I T+F   G+     + ++V +S
Sbjct: 285 QSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMVIYTIFTAFGI-----MYTSVVTS 339

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
           +  ++        +++    + M       LN    ++++ D+V  G I
Sbjct: 340 LVAQLATVLFCCALNLVLSYQSMSKIAFRILNVTTLSVVLRDYVGRGGI 388


>gi|71661222|ref|XP_817635.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70882839|gb|EAN95784.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 630

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 55/229 (24%), Positives = 96/229 (41%), Gaps = 15/229 (6%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
           P GYP S   G+ RY          S+  S +  QS+L    L  +P       W++KD 
Sbjct: 164 PRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLGSSPQL-----WMMKDL 218

Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
           +  +     +N     ++ PK W  ++  L +L    E+L P L PQ  L  A + + A+
Sbjct: 219 LPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLLLGAIMTSCAR 278

Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAK----GEAISTLFNVVGLGAGIQLASTVCSS 246
             A +    +R      FA   NL++L  K    G  I T+F   G+     + ++V +S
Sbjct: 279 QSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMVIYTIFTAFGI-----MYTSVVTS 333

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
           +  ++        +++    + M       LN    ++++ D+V  G I
Sbjct: 334 LVAQLATVLFCCALNLVLSYQSMSKIAFRILNVTTLSVVLRDYVGRGGI 382


>gi|71414333|ref|XP_809272.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70873629|gb|EAN87421.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 630

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 55/229 (24%), Positives = 96/229 (41%), Gaps = 15/229 (6%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
           P GYP S   G+ RY          S+  S +  QS+L    L  +P       W++KD 
Sbjct: 164 PRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLGSSPQL-----WMMKDL 218

Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
           +  +     +N     ++ PK W  ++  L +L    E+L P L PQ  L  A + + A+
Sbjct: 219 LPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLLLGAIMTSCAR 278

Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAK----GEAISTLFNVVGLGAGIQLASTVCSS 246
             A +    +R      FA   NL++L  K    G  I T+F   G+     + ++V +S
Sbjct: 279 QSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMVIYTIFTAFGI-----MYTSVVTS 333

Query: 247 MQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRI 295
           +  ++        +++    + M       LN    ++++ D+V  G I
Sbjct: 334 LVAQLATVLFCCALNLVLSYQSMSKIAFRILNVTTLSVVLRDYVGRGGI 382


>gi|387203446|gb|AFJ68984.1| hypothetical protein NGATSA_2040500, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 181

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/114 (28%), Positives = 57/114 (50%)

Query: 195 VAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVG 254
           +AA A+   I    A+  NL+++ AKG A  T  ++ GLG  +  A     S +      
Sbjct: 2   IAAGASNGNIVEHMARTNNLAEVLAKGGAQHTAVSLFGLGFRMWFARVANQSPRRVWTAY 61

Query: 255 PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL 308
            LL+VIH+ +    MR     ++N +R  +++  F +   + SPA++  RE ++
Sbjct: 62  TLLTVIHLTANYAAMRVLAFTSINRRRLNVLLTAFRENASVLSPAEVARRETII 115


>gi|56752623|gb|AAW24525.1| SJCHGC08554 protein [Schistosoma japonicum]
          Length = 127

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 41/72 (56%), Gaps = 2/72 (2%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
           F P GYP SV++ YL Y  +  +Q F S+    L++Q++L   G+  + A      ++W+
Sbjct: 45  FLPVGYPSSVSDDYLEYQIWDTIQAFASSITGALASQAVLIGVGVGNSSATILGASLTWM 104

Query: 128 LKDGMQHVGKLI 139
            KDG   +G+++
Sbjct: 105 FKDGSGMIGRIL 116


>gi|242208074|ref|XP_002469889.1| predicted protein [Postia placenta Mad-698-R]
 gi|220731120|gb|EED84968.1| predicted protein [Postia placenta Mad-698-R]
          Length = 214

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 50/215 (23%), Positives = 101/215 (46%), Gaps = 10/215 (4%)

Query: 84  LRYTQFRALQHFTSAALSVLSTQSLL--FAAGLRPTPAQAT--VVSWILKDGMQHVGKLI 139
           +RY  + ALQ F S+   +++++++L  F  G+    A AT  +   +L+D    +  ++
Sbjct: 1   VRYQIYNALQAFCSSLAGLIASRAVLQGFRHGVGKAGASATHAIFLTVLQDIFSRLTTIV 60

Query: 140 CS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAAR 198
               LG  +  E K +R+ AD   D    L+ LSP      +    L    + +  V A 
Sbjct: 61  AGYYLGTSLFPEAKAYRLTADIFNDAAIILDTLSPHLGSFRVASLCLSGAFRALCGVVAG 120

Query: 199 ATRLPIYSSFAKE----GNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVG 254
            ++  +   FA      G++ DL AK  +  T+  ++G+ +G  +   V S+    +++ 
Sbjct: 121 GSKAALTVHFATTAKGAGDVGDLSAKDGSKETVLALLGMLSGTAVMHYVHSTRATYVVLF 180

Query: 255 PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADF 289
            L++ +H++     +R     +LN QR +++ A +
Sbjct: 181 ALIA-LHLFCNYLAVRVVVFRSLNRQRASIVWAAY 214


>gi|134084659|emb|CAK43337.1| unnamed protein product [Aspergillus niger]
          Length = 484

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 54/206 (26%), Positives = 90/206 (43%), Gaps = 26/206 (12%)

Query: 62  VVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH-FTSAALSVL---------STQSLLFA 111
           V D  +  F P+GYP SV + Y    Q R L H   S +L  L         S   L   
Sbjct: 68  VQDFLVEVFLPAGYPHSVTDDYAPPYQTRRLIHEGLSPSLQQLDCWPPILESSAARLPLG 127

Query: 112 AGLRPTPAQAT--VVSWILKDGMQHVGKLICSN-LGARMDSEPKRWRILADALYDLGTGL 168
            G+    A  T  ++  IL+D    +  ++ ++ +G  ++ E K +R+ AD   DL   L
Sbjct: 128 VGVGNADASPTAALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMIL 187

Query: 169 EVLSPLCPQ-----LFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNL-----SDLF 218
           + LSP+ P        L  AG+     G+A  +++A+   + + F++ GNL     +D  
Sbjct: 188 DCLSPMIPAGAPRVTVLSTAGVLRALCGVAGGSSKAS---LSAHFSRWGNLAEAAITDFL 244

Query: 219 AKGEAISTLFNVVGLGAGIQLASTVC 244
           +    +S   ++    AG    S +C
Sbjct: 245 STERFVSRDNHITHRNAGPAPQSQLC 270


>gi|332372572|gb|AEE61428.1| unknown [Dendroctonus ponderosae]
          Length = 328

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 43/191 (22%), Positives = 89/191 (46%), Gaps = 9/191 (4%)

Query: 194 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIV 253
           V    ATR  I    A   N++++ AK     T+ N++G    I L +   SS    +++
Sbjct: 37  VCYCGATRASITHHQAIRDNMAEISAKDGTQETIVNLIGSFGSIFLLNYFSSSRAEWILI 96

Query: 254 GPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLL--FPG 311
             L+ V+H+Y+    ++A   NT N +R A+++  +   G + +PA +   E +L  F  
Sbjct: 97  ISLM-VLHLYTNFLAVKALIFNTFNRERLALVLKSYFTIGSVLNPAKVNVNESVLLGFGL 155

Query: 312 RLIEDAG-NVKVGRDPHKVVK---PSKFHELKELFAEEKFILCLG--SRWTDMVLEQNAS 365
            +    G ++ +G    K +     +    L+ ++ ++K++L +    +   + LE+  +
Sbjct: 156 NVKRMCGFDIVLGESIKKSLNDYAAADIKLLQSVYKDKKYMLLVNGTKKKIYVCLEKGET 215

Query: 366 GEDALRGWLVA 376
             D + G+  A
Sbjct: 216 AADVIAGYFHA 226


>gi|225560567|gb|EEH08848.1| DUF647 domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 472

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 64/316 (20%), Positives = 126/316 (39%), Gaps = 58/316 (18%)

Query: 138 LICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAVVA 196
           L    LG  ++ E K +R+ AD L D     + LSP  P+ L + +    +  + +  VA
Sbjct: 8   LFAHRLGTSLEPECKMYRLAADVLNDSALIFDCLSPALPKRLRVVVLAFSSVLRALCGVA 67

Query: 197 ARATRLPIYSSFAKEGNLSDL----------------------FAKGEAISTLFNVVGLG 234
           A +++  + S FAK GNL +L                      F   +A   L       
Sbjct: 68  AGSSKASLSSHFAKWGNLGELNAVRFPNVLVIAISGAAQRGFPFLFSDADQKLQKDSSQE 127

Query: 235 AGIQLASTVCSSMQGKMIVGPLLS--------VIHVYSVIEEMRAAPVNTLNPQRTAMIV 286
             I L   +C S+    I  P  +        ++H+ +    +R+  + TLN QR  ++ 
Sbjct: 128 TVISLLGMLCGSVVVSHISTPFATWTTLLLLLLVHLSTNYAAVRSVNMTTLNRQRANIVF 187

Query: 287 ADFVKTGRISSPADLRYREDLLFPGRLIE------DAGNVKVGRDPHKVVKPSK------ 334
           +   + G + +P +    E +     ++         G  ++G    ++++ S       
Sbjct: 188 STLFEKGSVLTPTEASKCERIFERDGILRWKASPAALGYCQIGGSFQELLRGSTHGANSI 247

Query: 335 ------FHELKELFAEEKFILCLG--SRWTDMVLEQNASGEDALRGWLVAAYAASMAKSF 386
                  ++L  LF +E+++L      +   +VL+ N +    L+ W   ++A  +AK  
Sbjct: 248 RDVGIDIYKLLRLFEKEEYVLWFNPIHKKGTIVLKNNVTPISQLKAW---SHALIVAKRL 304

Query: 387 HDPSLTVLQDAYDKMN 402
            D    + ++  D++N
Sbjct: 305 TD----LRREEADRIN 316


>gi|194383144|dbj|BAG59128.1| unnamed protein product [Homo sapiens]
          Length = 156

 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 4/107 (3%)

Query: 35  VTRHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQH 94
           ++R +  +P+G  + +V     P     +      F P G+P SV+  YL Y  + ++Q 
Sbjct: 44  LSRAFTVKPEGRDAGEVGASGAP--SPPLSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQA 101

Query: 95  FTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWILKDGMQHVGKLI 139
           F S+    L+TQ++L   G+    A   A   +W++KD    +G+++
Sbjct: 102 FASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIV 148


>gi|302417830|ref|XP_003006746.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
 gi|261354348|gb|EEY16776.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
          Length = 228

 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 36/153 (23%), Positives = 71/153 (46%), Gaps = 13/153 (8%)

Query: 68  NKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQAT--VVS 125
           + F P  YP SV+  Y++Y  + + Q F S    +L+ +++L   G+          ++ 
Sbjct: 79  DAFLPVDYPDSVSPDYIKYQMYDSFQAFFSTITGMLANRAILEGLGVGDASGSVNHALLL 138

Query: 126 WILKDGMQHVGKLICS-NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAG 184
             LKDG+  +  +  +   G+ +++  K++R LAD   D    LE+ +P           
Sbjct: 139 TCLKDGVSRLAMITFAYRFGSVIEANAKKYRFLADLFNDTAFFLELANPF---------- 188

Query: 185 LGNFAKGIAVVAARATRLPIYSSFAKEGNLSDL 217
              +AK + +V+A + R     + A++  LS +
Sbjct: 189 FSGWAKILILVSAESLRALCGIALAQQAALSTI 221


>gi|146096657|ref|XP_001467882.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|398020910|ref|XP_003863618.1| hypothetical protein, conserved [Leishmania donovani]
 gi|134072248|emb|CAM70952.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|322501851|emb|CBZ36933.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 468

 Score = 47.8 bits (112), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 57/248 (22%), Positives = 100/248 (40%), Gaps = 10/248 (4%)

Query: 58  VYHRVVDSFLNKFF--PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
           V HR   S   + F  P G+P +   G+ R+        F S   S +  QSLL    L 
Sbjct: 20  VQHRETVSNRIRLFGMPKGFPDTTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLG 79

Query: 116 PTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-L 174
            +P       W+LKD +  +     +N     +S PK W +++  + +L    ++L P +
Sbjct: 80  SSPQL-----WMLKDLVPALFAAYMANKVVSYESRPKFWFVVSVFINNLTVISDMLIPSV 134

Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
            P   L  A + +  K  + +    TR      +A   NL++L  K  +   +   V   
Sbjct: 135 LPNHLLAAAIITSTVKQSSALMYFVTRASALQHYAINNNLAELTKKFNSFGMVSYTVATA 194

Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV-KTG 293
            GI L  T  +S   ++I      V +++     M       LN     +++  F+ +  
Sbjct: 195 LGI-LYCTYIASFTVQLITVAACCVTNMFLSSMSMMPITFRLLNFDTMKLLLRAFILEKR 253

Query: 294 RISSPADL 301
           RI +P ++
Sbjct: 254 RILTPQEI 261


>gi|401427109|ref|XP_003878038.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322494285|emb|CBZ29584.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 468

 Score = 47.4 bits (111), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 55/248 (22%), Positives = 101/248 (40%), Gaps = 10/248 (4%)

Query: 58  VYHRVVDSFLNKFF--PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLR 115
           V HR   S   + F  P G+P +   G+ R+        F S   S +  QSLL    L 
Sbjct: 20  VQHRETVSNRIRLFGMPKGFPETTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLG 79

Query: 116 PTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-L 174
            +P       W+LKD +  +     +N     ++ PK W +++  + +L    ++L P +
Sbjct: 80  SSPQL-----WMLKDLVPALFAAYMANKVVSYENRPKFWFVVSVFMSNLMVISDMLIPSV 134

Query: 175 CPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLG 234
            P   L  A + +  K  + +    TR      +A   NL++L  K  +   +   V   
Sbjct: 135 LPNHLLAAAIITSTVKQSSALMYFVTRASALQHYAINNNLAELTKKFNSFGMVSYTVATA 194

Query: 235 AGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV-KTG 293
            GI L  T  +S   ++I      V++++     M       LN     +++  F+ +  
Sbjct: 195 LGI-LYCTYIASFTVQLITVAACCVMNMFLSSMSMMPITFRLLNFDTMKLLLRAFILEKR 253

Query: 294 RISSPADL 301
           R+ +P ++
Sbjct: 254 RVMTPQEI 261


>gi|157874033|ref|XP_001685512.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68128584|emb|CAJ08716.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 468

 Score = 47.4 bits (111), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 54/250 (21%), Positives = 102/250 (40%), Gaps = 9/250 (3%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
           P G+P +   G+ R+        F S   S +  QSLL    L  +P       W+LKD 
Sbjct: 36  PKGFPDTTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLGSSPQL-----WMLKDL 90

Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
           +  +     +N     +S PK W +++  + +L    ++L P + P   L  A + +  K
Sbjct: 91  VPALFAAYMANKVVSYESRPKFWFVVSVFVNNLTVISDMLIPSVLPNHLLAAAIITSTVK 150

Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGK 250
             + +    TR      +A   NL++L  K  +   +   V    GI L  T  +S   +
Sbjct: 151 QSSALMYFVTRASALQHYAINNNLAELTKKFNSFGMVSYTVATALGI-LYCTYIASFTVQ 209

Query: 251 MIVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFV-KTGRISSPADLRYREDL-L 308
           +I      V +++     M       LN     +++  F+ +  R+ +P ++     + +
Sbjct: 210 LITVAACCVTNMFLSSMSMMPITFRLLNVDTMKLLLGAFILEERRVMTPQEISTLLGVRM 269

Query: 309 FPGRLIEDAG 318
            P + ++ AG
Sbjct: 270 LPTQALKKAG 279


>gi|156088815|ref|XP_001611814.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154799068|gb|EDO08246.1| hypothetical protein BBOV_III006850 [Babesia bovis]
          Length = 540

 Score = 45.4 bits (106), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 50/193 (25%), Positives = 88/193 (45%), Gaps = 19/193 (9%)

Query: 108 LLFAAGLRPTPAQATVVSWILKDGMQHVGKLI-CSNLGARMDSEPKRWRILADALYD--- 163
           L +A   + + A  T+V   LKD    +   +    +GA  D  PK +R+L   L     
Sbjct: 42  LTWATSGKVSSALTTIV---LKDIFSRILNFVWFGQIGAGFDDNPKAFRLLGGLLCSAAG 98

Query: 164 LGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATRLPIYSSFAKEG---NLSDLFAK 220
           +   +  +  L P++ L +  L N  K +  V   A++   +S+F  +    N+ ++ AK
Sbjct: 99  VACFIGNMMQLEPKVVLTV--LVNVIKHVGAVTMVASQATFHSTFCVKNAATNVGEITAK 156

Query: 221 GEAISTLFNVVGLGAGIQLA---STVCSSMQGKMIVGPLL-SVIHVYSVIEEMRAAPVNT 276
            E  S + + +G+ AG  L    + + +SMQ  +  G  L S I  Y      ++    T
Sbjct: 157 LETQSPICDFLGMAAGTSLGALTANMAASMQTVLFTGMCLTSNIATYMC---AKSVCFRT 213

Query: 277 LNPQRTAMIVADF 289
           LNPQR  +++ +F
Sbjct: 214 LNPQRCFVLLENF 226


>gi|399218598|emb|CCF75485.1| unnamed protein product [Babesia microti strain RI]
          Length = 635

 Score = 45.4 bits (106), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 44/197 (22%), Positives = 81/197 (41%), Gaps = 25/197 (12%)

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ 120
           ++   FLN  +PSG    V   Y +Y  +R L+ F+ +    +S   L     L  +   
Sbjct: 106 KIKKRFLNIVYPSG----VTSNYYKYANWRLLERFSHSMSQSISNNPLAAKILLNQSTYS 161

Query: 121 --------ATVVSWILKDGMQHVGKLICS---------NLGARMDSEPKRWRILADALYD 163
                   A  + +I  + M  + K I S         N+G   D  P+ +      +  
Sbjct: 162 TNDGVINIAYKIDFIATNLMTQIFKEIVSRIVKTMWVGNIGIGFDMNPRAFCFWGSLICT 221

Query: 164 LGTGLEVLSPLCPQLFLEMA-GLGNFAKGIAVVAARATRLPIYSSFAKEGNLSD---LFA 219
           +    + LS L    ++ +   +    + I ++   A+  PIY SF  + N+S+   + A
Sbjct: 222 ITNTADFLSTLFNFKYISITRHITTLFRQIGMLTTSASIGPIYGSFKSQNNVSNIGEITA 281

Query: 220 KGEAISTLFNVVGLGAG 236
           K EA++ L +++G   G
Sbjct: 282 KMEAVTPLCDLLGSATG 298


>gi|425769885|gb|EKV08364.1| hypothetical protein PDIP_68690 [Penicillium digitatum Pd1]
 gi|425771531|gb|EKV09972.1| hypothetical protein PDIG_59240 [Penicillium digitatum PHI26]
          Length = 406

 Score = 44.7 bits (104), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 58/114 (50%), Gaps = 10/114 (8%)

Query: 139 IC--SNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-----LFLEMAGLGNFAKG 191
           IC    +G  ++ E K +R+ AD   D+   L+ LSP  P      + L  AG+    + 
Sbjct: 34  ICFAHRVGTALEPECKTYRLAADVFNDIAMILDCLSPGVPAGPARVIVLSTAGV---LRA 90

Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS 245
           +  VA  +++  + + FAK GNL++L AK  +  T+ ++ G+  G  + S + S
Sbjct: 91  LCGVAGGSSKASLSAHFAKWGNLAELNAKDSSQETVISLFGMLVGSVVISHITS 144


>gi|397575094|gb|EJK49531.1| hypothetical protein THAOC_31586, partial [Thalassiosira oceanica]
          Length = 194

 Score = 44.7 bits (104), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 65  SFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QAT 122
           S ++ F P+ YP SV   Y  +  +    +   +A  VLSTQ+LL A G+    A   A 
Sbjct: 113 SIVHHFLPAKYPESVCSSYSTFASYCMCANIAGSAAMVLSTQALLVAVGVGSQSAAPMAA 172

Query: 123 VVSWILKDGMQHVGKLI 139
            ++W+LKD    +G ++
Sbjct: 173 ALNWLLKDFAGQLGGVL 189


>gi|151555803|gb|AAI49327.1| FLJ13868 protein [Bos taurus]
          Length = 158

 Score = 44.7 bits (104), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 34/132 (25%), Positives = 60/132 (45%), Gaps = 12/132 (9%)

Query: 12  KKEPDKSMSVQEIPIYWFETSDSVTRHYQFQPDGHLSVKVV--DDSRPVYHRVVDSFLNK 69
           +  PD S+  +     W+  S + T     +P G      V    + P + R+    L  
Sbjct: 29  RAAPDGSLQWEAWGRRWWGFSGAFTA----KPGGRDGGGGVAPGTASPPFSRL----LAV 80

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPA--QATVVSWI 127
           F P G+P SV+  YL Y  + ++Q F S+    L+T ++L   G+    A   A   +W+
Sbjct: 81  FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWL 140

Query: 128 LKDGMQHVGKLI 139
           +KD    +G+++
Sbjct: 141 VKDSTGMLGRIV 152


>gi|159467581|ref|XP_001691970.1| hypothetical protein CHLREDRAFT_170864 [Chlamydomonas reinhardtii]
 gi|158278697|gb|EDP04460.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 131

 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 35/62 (56%), Gaps = 1/62 (1%)

Query: 136 GKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQ-LFLEMAGLGNFAKGIAV 194
           G L     G+  D+  K+WR+ AD   DLG  +E+ SPL P+ LFL  A  G+ A+ + V
Sbjct: 7   GILFAYAEGSSFDACAKQWRLFADIANDLGMTVELASPLLPRALFLPCACFGSIARSVTV 66

Query: 195 VA 196
            A
Sbjct: 67  FA 68


>gi|321459665|gb|EFX70716.1| hypothetical protein DAPPUDRAFT_327911 [Daphnia pulex]
          Length = 269

 Score = 44.3 bits (103), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 44/142 (30%), Positives = 72/142 (50%), Gaps = 29/142 (20%)

Query: 89  FRALQHFTSAALSVLSTQSLLFAAGL---RPTPAQATVVSWILKDG-MQHVGKLI---CS 141
             A+Q F  +    LST+++L   G+     TP  AT++ W++KDG  + VG+++   C+
Sbjct: 9   IHAIQAFVGSLSGSLSTRAILEGVGVGSSTSTPLAATLM-WLIKDGSTRMVGRILFAWCN 67

Query: 142 NLGARMDSEPKRWRILADALYDLGTGLEVLSPLCPQLFLEMAGLGNFAKGIAVVAARATR 201
             G ++D++              G+   + S LC      +AG+   AK I  VA  ATR
Sbjct: 68  --GTKLDADTAVG----------GSNGTMTSILC------VAGV---AKSIVGVAGGATR 106

Query: 202 LPIYSSFAKEGNLSDLFAKGEA 223
             +    A++GNL+D+ AK EA
Sbjct: 107 AALTQHQARQGNLADVSAKVEA 128


>gi|440799746|gb|ELR20790.1| protein C16orf58 family protein [Acanthamoeba castellanii str.
           Neff]
          Length = 101

 Score = 42.7 bits (99), Expect = 0.33,   Method: Composition-based stats.
 Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 2/71 (2%)

Query: 70  FFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQ--ATVVSWI 127
             P GYP SV+  Y  Y  +  +Q   S+   +L T++LL   G+    A   A  V W 
Sbjct: 21  LLPEGYPESVSGDYATYQTWDTVQALCSSITGLLCTRALLQGVGVGDVGATPGAAAVQWA 80

Query: 128 LKDGMQHVGKL 138
           L+DG   VG++
Sbjct: 81  LRDGAGMVGRI 91


>gi|3242710|gb|AAC23762.1| hypothetical protein [Arabidopsis thaliana]
          Length = 405

 Score = 42.7 bits (99), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 5/74 (6%)

Query: 61  RVVDSFLNKFFPSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTP-- 118
           R+  S    FF    P  V++ Y+ Y +++ L    S+AL VL+TQ++  A G+  +   
Sbjct: 113 RLTSSAAKDFF---LPKQVSDNYISYVKWKFLHRVFSSALQVLATQAMFRAIGIGQSRSL 169

Query: 119 AQATVVSWILKDGM 132
           A +   +WILKDG+
Sbjct: 170 ASSAAFNWILKDGL 183


>gi|12654037|gb|AAH00822.1| C16orf58 protein [Homo sapiens]
          Length = 231

 Score = 42.0 bits (97), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 42/186 (22%), Positives = 82/186 (44%), Gaps = 26/186 (13%)

Query: 257 LSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
           L+ +H+Y+    +RA  + TLN  R  +++  +++ G +  P     R + L+ G     
Sbjct: 30  LTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAAN-RMEPLWTG--FWP 86

Query: 317 AGNVKVGRDPHKVVKPSKFHELKELFA--EEKFILCL--GSRWTDMVLEQNASGEDALR- 371
           A ++ +G   H++V  S   EL++L    +E ++LC         +VL Q A  +  LR 
Sbjct: 87  APSLSLGVPLHRLV--SSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQKAGPKTILRA 144

Query: 372 ---GWLVAAYAAS-------------MAKSFHDPSLTVLQDAYDKMNDVFTPLLSELQAK 415
              G ++ A                 +       S  V+++ ++ ++ +F   L  LQ  
Sbjct: 145 ATHGLMLGALQGDGPLPAELEELRNRVRAGPKKESWVVVKETHEVLDMLFPKFLKGLQDA 204

Query: 416 GWHTDR 421
           GW T++
Sbjct: 205 GWKTEK 210


>gi|224044670|ref|XP_002188933.1| PREDICTED: condensin-2 complex subunit G2 [Taeniopygia guttata]
          Length = 1150

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 24/43 (55%)

Query: 37  RHYQFQPDGHLSVKVVDDSRPVYHRVVDSFLNKFFPSGYPFSV 79
           + +   P  HL  ++  DSRPV  R+V+  LN FFPS  P  V
Sbjct: 494 KFWNISPMEHLLSRLESDSRPVSRRIVNLLLNSFFPSAQPEDV 536


>gi|268318089|ref|YP_003291808.1| polynucleotide adenylyltransferase/metal dependent phosphohydrolase
           [Rhodothermus marinus DSM 4252]
 gi|262335623|gb|ACY49420.1| polynucleotide adenylyltransferase/metal dependent phosphohydrolase
           [Rhodothermus marinus DSM 4252]
          Length = 561

 Score = 40.8 bits (94), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 50/115 (43%), Gaps = 22/115 (19%)

Query: 202 LPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIH 261
           +P+Y   A  G + DLF         F  VG G GI+LA  V  ++ G+         +H
Sbjct: 46  IPVY---AVGGVVRDLFLDRPTTDIDFVTVGPGTGIRLARLVARALGGR--------TVH 94

Query: 262 VYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
           VY   E    A +    P R+ + V +FV   R S      YR+D   P  ++ED
Sbjct: 95  VY---ENFGTAAIRVPTPDRSGVFVLEFVAARRES------YRKDSRKP--IVED 138


>gi|345304379|ref|YP_004826281.1| polynucleotide adenylyltransferase/metal dependent phosphohydrolase
           [Rhodothermus marinus SG0.5JP17-172]
 gi|345113612|gb|AEN74444.1| polynucleotide adenylyltransferase/metal dependent phosphohydrolase
           [Rhodothermus marinus SG0.5JP17-172]
          Length = 552

 Score = 40.8 bits (94), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 50/115 (43%), Gaps = 22/115 (19%)

Query: 202 LPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIH 261
           +P+Y   A  G + DLF         F  VG G GI+LA  V  ++ G+         +H
Sbjct: 37  IPVY---AVGGVVRDLFLDRPTTDIDFVTVGPGTGIRLARLVARALGGR--------TVH 85

Query: 262 VYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLFPGRLIED 316
           VY   E    A +    P R+ + V +FV   R S      YR+D   P  ++ED
Sbjct: 86  VY---ENFGTAAIRVPTPDRSGVFVLEFVAARRES------YRKDSRKP--IVED 129


>gi|449682081|ref|XP_002168725.2| PREDICTED: UPF0420 protein C16orf58 homolog [Hydra magnipapillata]
          Length = 237

 Score = 39.7 bits (91), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 25/111 (22%), Positives = 53/111 (47%)

Query: 192 IAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKM 251
           I  VA  ATR  +    A+  N++D+ AK  +  TL N++ L  G  +   V  ++    
Sbjct: 7   IVGVAGGATRAALTLHQARRNNMADVSAKDGSQETLVNLMALFTGFLITPLVVDNLFLTW 66

Query: 252 IVGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLR 302
               + + +H+Y+  + +    + T+N  R  ++V  +   G++ +P ++ 
Sbjct: 67  YFFLVFTFLHLYANYQAVSTVVMETINLPRLHILVKHYFIYGKVLTPKEVN 117


>gi|402580829|gb|EJW74778.1| hypothetical protein WUBG_14315 [Wuchereria bancrofti]
          Length = 191

 Score = 39.7 bits (91), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 30/115 (26%), Positives = 54/115 (46%)

Query: 195 VAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVG 254
           VA  ATR  + +  A   N+ D+ AK  +  TL NV  L   + L   V  +     ++ 
Sbjct: 8   VAGSATRTTVINHQAILDNVGDVAAKDGSQETLINVFALLCSLLLLPVVSKNAVFVWLLF 67

Query: 255 PLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFVKTGRISSPADLRYREDLLF 309
            L + IH+Y     ++     TLN     ++V D+++T +I +  ++  +E +L 
Sbjct: 68  CLFTFIHLYGNYRAVKMLQFRTLNQSLLRIVVKDYIQTRKIGTVNEMNNKEPILL 122


>gi|297602390|ref|NP_001052390.2| Os04g0290800 [Oryza sativa Japonica Group]
 gi|255675293|dbj|BAF14304.2| Os04g0290800, partial [Oryza sativa Japonica Group]
          Length = 107

 Score = 39.3 bits (90), Expect = 4.1,   Method: Composition-based stats.
 Identities = 20/71 (28%), Positives = 41/71 (57%)

Query: 217 LFAKGEAISTLFNVVGLGAGIQLASTVCSSMQGKMIVGPLLSVIHVYSVIEEMRAAPVNT 276
           + AKGEA   +   +G+  GI LA+ + SS+   +I    ++ +H+Y  ++  ++  + T
Sbjct: 30  VIAKGEAQGMVSKFLGIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRT 89

Query: 277 LNPQRTAMIVA 287
           LNP R + +++
Sbjct: 90  LNPYRASNVLS 100


>gi|154343241|ref|XP_001567566.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134064898|emb|CAM43006.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 439

 Score = 38.9 bits (89), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 39/179 (21%), Positives = 72/179 (40%), Gaps = 7/179 (3%)

Query: 72  PSGYPFSVNEGYLRYTQFRALQHFTSAALSVLSTQSLLFAAGLRPTPAQATVVSWILKDG 131
           P G+P +   G+ R+        F S   S +  QSLL    L  +P       W+LKD 
Sbjct: 7   PKGFPDTTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLGSSPQL-----WMLKDL 61

Query: 132 MQHVGKLICSNLGARMDSEPKRWRILADALYDLGTGLEVLSP-LCPQLFLEMAGLGNFAK 190
              +     +N     ++ PK W +++  + ++    ++L P + P   L  A + +  K
Sbjct: 62  APALLAAYMANQVVSYENRPKFWFVVSVLMNNIMVMSDMLIPSVVPNHLLAAAIVTSTVK 121

Query: 191 GIAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGAGIQLASTVCS-SMQ 248
             + +     R      +A   NL++L  K  +   +   +    GI   + + S +MQ
Sbjct: 122 QSSALMYFVARASTLQHYAINNNLAELTKKFNSFGMVSYTIATALGILYCTYIASFTMQ 180


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.135    0.403 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,715,635,325
Number of Sequences: 23463169
Number of extensions: 280363585
Number of successful extensions: 661667
Number of sequences better than 100.0: 503
Number of HSP's better than 100.0 without gapping: 387
Number of HSP's successfully gapped in prelim test: 116
Number of HSP's that attempted gapping in prelim test: 660305
Number of HSP's gapped (non-prelim): 582
length of query: 432
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 287
effective length of database: 8,957,035,862
effective search space: 2570669292394
effective search space used: 2570669292394
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)